CSpace

浏览/检索结果: 共2条,第1-2条 帮助

已选(0)清除 条数/页:   排序方式:
Accelerating Parallel Structures in DNNs via Parallel Fusion and Operator Co-Optimization 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 3, 页码: 26
作者:  Di, Zhanyuan;  Wang, Leping;  Ma, Zhaojia;  Shao, En;  Zhao, Jie;  Ren, Ziyi;  Feng, Siyuan;  Tao, Dingwen;  Tan, Guangming;  Sun, Ninghui
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Deep learning  tensor compiler  inference optimization  code generation  GPU  
VastPipe: A High-Throughput Inference System via Adaptive Space-Division Multiplexing for Diverse Accelerators 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2025, 卷号: 40, 期号: 2, 页码: 444-463
作者:  Ma, Li-Xian;  Wang, Le-Ping;  Shao, En;  Cao, Rong-Yu;  Tan, Guang-Ming
收藏  |  浏览/下载:20/0  |  提交时间:2025/06/25
cluster scheduling  resource management  reinforcement learning  DNN accelerator