CSpace

浏览/检索结果: 共3条,第1-3条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Fast and accurate variable batch size convolution neural network training on large scale distributed systems 期刊论文
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 页码: 26
作者:  Hu, Zhongzhe;  Xiao, Junmin;  Sun, Ninghui;  Tan, Guangming
收藏  |  浏览/下载:20/0  |  提交时间:2022/12/07
deep learning  distributed computing  ImageNet-1K  large-batch training  synchronous SGD  
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 1, 页码: 159-175
作者:  Xie, Zhen;  Tan, Guangming;  Liu, Weifeng;  Sun, Ninghui
收藏  |  浏览/下载:41/0  |  提交时间:2021/12/01
Libraries  Sparse matrices  Prediction algorithms  Neural networks  Predictive models  Memory management  Tuners  SpGEMM  spare BLAS  sparse format  auto-tuning  neural network  
Breaking the Interaction Wall: A DLPU-Centric Deep Learning Computing System 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2022, 卷号: 71, 期号: 1, 页码: 209-222
作者:  Du, Zidong;  Guo, Qi;  Zhao, Yongwei;  Zeng, Xi;  Li, Ling;  Cheng, Limin;  Xu, Zhiwei;  Sun, Ninghui;  Chen, Yunji
收藏  |  浏览/下载:32/0  |  提交时间:2022/06/21
Deep learning  Central Processing Unit  Process control  Task analysis  Computational modeling  Pipelines  Runtime  Neural net accelerators  system architectures  interaction wall