CSpace

浏览/检索结果: 共2条,第1-2条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
CUTE: A scalable CPU-centric and Ultra-utilized Tensor Engine for convolutions 期刊论文
JOURNAL OF SYSTEMS ARCHITECTURE, 2024, 卷号: 149, 页码: 15
作者:  Li, Wenqing;  Ye, Jinpeng;  Zhang, Fuxin;  Liu, Tianyi;  Zhang, Tingting;  Wang, Jian
收藏  |  浏览/下载:3/0  |  提交时间:2024/05/20
Tensor engine  Convolution  Scalable architecture  CPU-centric  Utilization  
Mortar-FP8: Morphing the Existing FP32 Infrastructure for High-Performance Deep Learning Acceleration 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 卷号: 43, 期号: 3, 页码: 878-891
作者:  Li, Hongyan;  Lu, Hang;  Li, Xiaowei
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Deep learning accelerator  deep neural network (DNN)  fp8 format