CSpace

Browse/Search Results: 3 items in total, showing items 1-3

OptiFX: Automatic Optimization for Convolutional Neural Networks with Aggressive Operator Fusion on GPUs (Journal article)
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, Volume: 22, Issue: 2, Pages: 27
Authors:  Wang, Xueying;  Li, Shigang;  Qian, Hao;  Luo, Fan;  Hao, Zhaoyang;  Wu, Tong;  Xu, Ruiyuan;  Cui, Huimin;  Feng, Xiaobing;  Li, Guangli
Views/Downloads: 2/0  |  Submitted: 2025/12/03
Deep learning systems  convolutional neural networks  operator fusion  
SRSparse: Generating Codes for High-Performance Sparse Matrix-Vector Semiring Computations (Journal article)
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, Volume: 22, Issue: 2, Pages: 26
Authors:  Du, Zhen;  Li, Ying;  Sun, Ninghui;  Cui, Huimin;  Feng, Xiaobing;  Li, Jiajia
Views/Downloads: 2/0  |  Submitted: 2025/12/03
High performance computing  sparse matrix computation  auto-tuning  code generator  semiring computation  
Fast Convolution Meets Low Precision: Exploring Efficient Quantized Winograd Convolution on Modern CPUs (Journal article)
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, Volume: 21, Issue: 1, Pages: 26
Authors:  Wang, Xueying;  Li, Guangli;  Jia, Zhen;  Feng, Xiaobing;  Wang, Yida
Views/Downloads: 53/0  |  Submitted: 2024/05/20
Deep learning  winograd convolution  low-precision computation