CSpace

浏览/检索结果: 共1条,第1-1条 帮助

已选(0)清除 条数/页:   排序方式:
Accelerating Parallel Structures in DNNs via Parallel Fusion and Operator Co-Optimization 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 3, 页码: 26
作者:  Di, Zhanyuan;  Wang, Leping;  Ma, Zhaojia;  Shao, En;  Zhao, Jie;  Ren, Ziyi;  Feng, Siyuan;  Tao, Dingwen;  Tan, Guangming;  Sun, Ninghui
收藏  |  浏览/下载:2/0  |  提交时间:2025/12/03
Deep learning  tensor compiler  inference optimization  code generation  GPU