CSpace

浏览/检索结果: 共2条,第1-2条 帮助

已选(0)清除 条数/页:   排序方式:
Accelerating Parallel Structures in DNNs via Parallel Fusion and Operator Co-Optimization 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 3, 页码: 26
作者:  Di, Zhanyuan;  Wang, Leping;  Ma, Zhaojia;  Shao, En;  Zhao, Jie;  Ren, Ziyi;  Feng, Siyuan;  Tao, Dingwen;  Tan, Guangming;  Sun, Ninghui
收藏  |  浏览/下载:1/0  |  提交时间:2025/12/03
Deep learning  tensor compiler  inference optimization  code generation  GPU  
Efficient and Fast High-Performance Library Generation for Deep Learning Accelerators 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2025, 卷号: 74, 期号: 1, 页码: 155-169
作者:  Bi, Jun;  Wen, Yuanbo;  Li, Xiaqing;  Zhao, Yongwei;  Guo, Yuxuan;  Zhou, Enshuai;  Hu, Xing;  Du, Zidong;  Li, Ling;  Chen, Huaping;  Chen, Tianshi;  Guo, Qi
收藏  |  浏览/下载:11/0  |  提交时间:2025/06/25
Optimization  Space exploration  Schedules  Libraries  Biological cells  Deep learning  Costs  Computers  Search problems  Tensors  Code generation  compiler optimization  tensor computation