Chen Zhang
H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference
Jan 1, 2025
DSTC: Dual-Side Sparsity Tensor Core for DNNs Acceleration on Modern GPU Architectures
Jan 1, 2024
RM-STC: Row-Merge Dataflow Inspired GPU Sparse Tensor Core for Energy-Efficient Sparse Acceleration
Jan 1, 2023