CSpace

浏览/检索结果: 共281条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Mortar-FP8: Morphing the Existing FP32 Infrastructure for High-Performance Deep Learning Acceleration 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 卷号: 43, 期号: 3, 页码: 878-891
作者:  Li, Hongyan;  Lu, Hang;  Li, Xiaowei
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/20
Deep learning accelerator  deep neural network (DNN)  fp8 format  
Improving Utilization of Dataflow Unit for Multi-Batch Processing 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 卷号: 21, 期号: 1, 页码: 26
作者:  Fan, Zhihua;  Li, Wenming;  Wang, Zhen;  Yang, Yu;  Ye, Xiaochun;  Fan, Dongrui;  Sun, Ninghui;  An, Xuejun
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/20
Utilization  network-on-chip  decoupled architecture  batch processing  
Rescue to the Curse of universality 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2023, 卷号: 66, 期号: 9, 页码: 17
作者:  Zhao, Yongwei;  Du, Zidong;  Guo, Qi;  Xu, Zhiwei;  Chen, Yunji
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
universality  general-purpose architecture  specialized architecture  deep learning processor  universal circuit  
AKGF: Automatic Kernel Generation for DNN on CPU-FPGA 期刊论文
COMPUTER JOURNAL, 2023, 页码: 9
作者:  Dong, Dong;  Jiang, Hongxu;  Diao, Boyu
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
DNN accelerated compilers  polyhedral model  heterogeneous computing  CPU-FPGA  
TCADer: A Tightly Coupled Accelerator Design framework for heterogeneous system with hardware/software co-design 期刊论文
JOURNAL OF SYSTEMS ARCHITECTURE, 2023, 卷号: 136, 页码: 12
作者:  Li, Wenqing;  Liu, Tianyi;  Xiao, Ziyuan;  Qi, Han;  Zhu, Weipu;  Wang, Jian
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
Heterogeneous system  Tightly coupled  Accelerator design framework  Interaction latency  Low-latency task  
CAP: Communication-Aware Automated Parallelization for Deep Learning Inference on CMP Architectures 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2022, 卷号: 71, 期号: 7, 页码: 1626-1639
作者:  Zou, Kaiwei;  Wang, Ying;  Cheng, Long;  Qu, Songyun;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:34/0  |  提交时间:2022/12/07
Kernel  Computer architecture  Multicore processing  Deep learning  System-on-chip  Parallel processing  Real-time systems  Neural networks  parallel processing  real-time and embedded systems  single-chip multiprocessors  reinforcement learning  structured sparsity  
ParaML: A Polyvalent Multicore Accelerator for Machine Learning 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 卷号: 39, 期号: 9, 页码: 1764-1777
作者:  Zhou, Shengyuan;  Guo, Qi;  Du, Zidong;  Liu, Daofu;  Chen, Tianshi;  Li, Ling;  Liu, Shaoli;  Zhou, Jinhong;  Temam, Olivier;  Feng, Xiaobing;  Zhou, Xuehai;  Chen, Yunji
收藏  |  浏览/下载:56/0  |  提交时间:2020/12/10
Neural networks  Machine learning  Testing  Support vector machines  Linear regression  Computers  Computer architecture  Accelerator  machine learning (ML) techniques  multicore accelerator  
GPGPU-Based ATPG System: Myth or Reality? 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 卷号: 39, 期号: 1, 页码: 239-247
作者:  Lai, Liyang;  Tsai, Hans;  Li, Huawei
收藏  |  浏览/下载:53/0  |  提交时间:2020/12/10
ATPG  fault simulation  general-purpose computing on graphics processing units (GPGPUs)  
Accelerating DNN-based 3D point cloud processing for mobile computing 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2019, 卷号: 62, 期号: 11, 页码: 11
作者:  Liu, Bosheng;  Chen, Xiaoming;  Han, Yinhe;  Li, Jiajun;  Xu, Haobo;  Li, Xiaowei
收藏  |  浏览/下载:230/0  |  提交时间:2019/12/10
deep neural network acceleration  point cloud data  neighbor point search  mobile robotics  hardware architecture  
An Instruction Set Architecture for Machine Learning 期刊论文
ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2019, 卷号: 36, 期号: 3, 页码: 35
作者:  Chen, Yunji;  Lan, Huiying;  Du, Zidong;  Liu, Shaoli;  Tao, Jinhua;  Han, Dong;  Luo, Tao;  Guo, Qi;  Li, Ling;  Xie, Yuan;  Chen, Tianshi
收藏  |  浏览/下载:56/0  |  提交时间:2020/12/10