Yufei Ding
TRACI: Network Acceleration of Input-Dynamic Communication for Large-Scale Deep Learning Recommendation Model
Jan 1, 2025
Push Multicast: A Speculative and Coherent Interconnect for Mitigating Manycore CPU Communication Bottleneck
Jan 1, 2025
Large-scale self-normalizing neural networks
Jan 1, 2024
EVT: Accelerating deep learning training with epilogue visitor tree
Jan 1, 2024
SPG: Structure-private graph database via SqueezePIR
Jan 1, 2023
RM-STC: Row-merge dataflow inspired GPU sparse tensor core for energy-efficient sparse acceleration
Jan 1, 2023
MPU: Memory-centric SIMT processor via in-DRAM near-bank computing
Jan 1, 2023
ECSSD: Hardware/data layout co-designed in-storage-computing architecture for extreme classification
Jan 1, 2023
Dynamic N:M fine-grained structured sparse attention mechanism
Jan 1, 2023
ALCOP: Automatic load-compute pipelining in deep learning compiler for AI-GPUs
Jan 1, 2023