H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference

Jan 1, 2025·
Cong Li
,
Yihan Yin
,
Xintong Wu
,
Jingchen Zhu
,
Zhutianya Gao
,
Dimin Niu
,
Qiang Wu
,
Xin Si
Prof. Yuan Xie
Prof. Yuan Xie
,
Chen Zhang
,
Others
· 0 min read
Type
Publication
Proceedings of the 52nd Annual International Symposium on Computer Architecture