CSpace

浏览/检索结果: 共3条,第1-3条 帮助

已选(0)清除 条数/页:   排序方式:
Accelerating Parallel Structures in DNNs via Parallel Fusion and Operator Co-Optimization 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 3, 页码: 26
作者:  Di, Zhanyuan;  Wang, Leping;  Ma, Zhaojia;  Shao, En;  Zhao, Jie;  Ren, Ziyi;  Feng, Siyuan;  Tao, Dingwen;  Tan, Guangming;  Sun, Ninghui
收藏  |  浏览/下载:2/0  |  提交时间:2025/12/03
Deep learning  tensor compiler  inference optimization  code generation  GPU  
Advancements in Accelerating Deep Neural Network Inference on AIoT Devices: A Survey 期刊论文
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2024, 卷号: 9, 期号: 6, 页码: 830-847
作者:  Cheng, Long;  Gu, Yan;  Liu, Qingzhi;  Yang, Lei;  Liu, Cheng;  Wang, Ying
收藏  |  浏览/下载:33/0  |  提交时间:2025/06/25
Computational modeling  Hardware  Artificial neural networks  Optimization  Internet of Things  Adaptation models  Data models  AIoT devices  DNN inference  model compression  parallel computing  performance optimization  survey  
An Automatic Neural Network Architecture-and-Quantization Joint Optimization Framework for Efficient Model Inference 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 卷号: 43, 期号: 5, 页码: 1497-1510
作者:  Liu, Lian;  Wang, Ying;  Zhao, Xiandong;  Chen, Weiwei;  Li, Huawei;  Li, Xiaowei;  Han, Yinhe
收藏  |  浏览/下载:61/0  |  提交时间:2024/12/06
Optimization  Quantization (signal)  Computer architecture  Training  Computational modeling  Integrated circuit modeling  Convergence  Automatic joint optimization  efficient model inference  network quantization  neural architecture search (NAS)