CSpace

浏览/检索结果: 共4条,第1-4条 帮助

已选(0)清除 条数/页:   排序方式:
OptiFX: Automatic Optimization for Convolutional Neural Networks with Aggressive Operator Fusion on GPUs 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 2, 页码: 27
作者:  Wang, Xueying;  Li, Shigang;  Qian, Hao;  Luo, Fan;  Hao, Zhaoyang;  Wu, Tong;  Xu, Ruiyuan;  Cui, Huimin;  Feng, Xiaobing;  Li, Guangli
收藏  |  浏览/下载:2/0  |  提交时间:2025/12/03
Deep learning systems  convolutional neural networks  operator fusion  
Accelerate Point Cloud Structuring for Deep Neural Networks via Fast Spatial-Searching Tree 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 卷号: 35, 期号: 3, 页码: 2570-2585
作者:  Zhan, Jinyu;  Zou, Shiyu;  Jiang, Wei;  Zhang, Youyuan;  Peng, Suidi;  Wang, Ying
收藏  |  浏览/下载:12/0  |  提交时间:2025/06/25
Deep neural networks  point cloud structuring  fast spatial-searching tree  sampling  neighbor query  acceleration  Deep neural networks  point cloud structuring  fast spatial-searching tree  sampling  neighbor query  acceleration  
Redesigning OpenKMC for Multi-Component Trillion-Atom Simulations on the New Sunway Supercomputer 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 卷号: 34, 期号: 7, 页码: 1997-2010
作者:  Xu, Lei;  Shang, Honghui;  Chen, Xin;  Zhang, Yunquan;  Wang, Lifang;  Gao, Xingyu;  Song, Haifeng
收藏  |  浏览/下载:40/0  |  提交时间:2023/12/04
Metals  Computational modeling  Monte Carlo methods  Kinetic theory  Aging  Steel  Silicon  Atomic kinetic Monte Carlo  many-core processor  scalability  
Network Pruning for Bit-Serial Accelerators 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 卷号: 42, 期号: 5, 页码: 1597-1609
作者:  Zhao, Xiandong;  Wang, Ying;  Liu, Cheng;  Shi, Cong;  Tu, Kaijie;  Zhang, Lei
收藏  |  浏览/下载:41/0  |  提交时间:2023/12/04
AI accelerators  neural networks (NNs)  NN compression