CSpace

浏览/检索结果: 共8条,第1-8条 帮助

已选(0)清除 条数/页:   排序方式:
Characterizing and Understanding HGNN Training on GPUs 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 1, 页码: 25
作者:  Han, Dengke;  Yan, Mingyu;  Ye, Xiaochun;  Fan, Dongrui
收藏  |  浏览/下载:7/0  |  提交时间:2025/06/25
Heterogeneous graph neural networks  graph neural networks training  characterization  quantitative analysis  optimization guidelines  
Accelerating tensor multiplication by exploring hybrid product with hardware and software co-design 期刊论文
JOURNAL OF SYSTEMS ARCHITECTURE, 2025, 卷号: 159, 页码: 16
作者:  Zhang, Zhiyuan;  Fan, Zhihua;  Li, Wenming;  Qiu, Yuhang;  Wang, Zhen;  Ye, Xiaochun;  Fan, Dongrui;  An, Xuejun
收藏  |  浏览/下载:3/0  |  提交时间:2025/06/25
Tensor multiplication  Hybrid product  Dataflow  Accelerator  
AI Computing Systems for Large Language Models Training 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2025, 卷号: 40, 期号: 1, 页码: 6-41
作者:  Zhang, Zhen-Xing;  Wen, Yuan-Bo;  Lyu, Han-Qi;  Liu, Chang;  Zhang, Rui;  Li, Xia-Qing;  Wang, Chao;  Du, Zi-Dong;  Guo, Qi;  Li, Ling;  Zhou, Xue-Hai;  Chen, Yun-Ji
收藏  |  浏览/下载:15/0  |  提交时间:2025/06/25
artificial intelligence (AI) chip  large language model (LLM)  AI computing system  accelerator  
Hardware Acceleration for SLAM in Mobile Systems 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 6, 页码: 1300-1322
作者:  Fan, Zhe;  Hao, Yi-Fan;  Zhi, Tian;  Guo, Qi;  Du, Zi-Dong
收藏  |  浏览/下载:27/0  |  提交时间:2024/05/20
hardware accelerator  instruction set  mobile system  simultaneous localization and mapping (SLAM) algorithm  
Chip design with machine learning: a survey from algorithm perspective 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2023, 卷号: 66, 期号: 11, 页码: 31
作者:  He, Wenkai;  Li, Xiaqing;  Song, Xinkai;  Hao, Yifan;  Zhang, Rui;  Du, Zidong;  Chen, Yunji
收藏  |  浏览/下载:34/0  |  提交时间:2023/12/04
chip design  machine learning  chip design automation  design result estimation  design optimization and correction  design construction  
Rescue to the Curse of universality 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2023, 卷号: 66, 期号: 9, 页码: 17
作者:  Zhao, Yongwei;  Du, Zidong;  Guo, Qi;  Xu, Zhiwei;  Chen, Yunji
收藏  |  浏览/下载:28/0  |  提交时间:2023/12/04
universality  general-purpose architecture  specialized architecture  deep learning processor  universal circuit  
Characterizing and Understanding Defense Methods for GNNs on GPUs 期刊论文
IEEE COMPUTER ARCHITECTURE LETTERS, 2023, 卷号: 22, 期号: 2, 页码: 137-140
作者:  Wu, Meng;  Yan, Mingyu;  Yang, Xiaocheng;  Li, Wenming;  Zhang, Zhimin;  Ye, Xiaochun;  Fan, Dongrui
收藏  |  浏览/下载:40/0  |  提交时间:2023/12/04
Kernel  Purification  Estimation  Graphics processing units  Perturbation methods  Electric breakdown  Training  Graph neural networks  defense  execution semantic  execution pattern  overhead  
Multi-Node Acceleration for Large-Scale GCNs 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2022, 卷号: 71, 期号: 12, 页码: 3140-3152
作者:  Sun, Gongjian;  Yan, Mingyu;  Wang, Duo;  Li, Han;  Li, Wenming;  Ye, Xiaochun;  Fan, Dongrui;  Xie, Yuan
收藏  |  浏览/下载:63/0  |  提交时间:2023/07/12
Deep learning  graph neural network  hardware accelerator  multi-node system  communication optimization