CSpace

浏览/检索结果: 共3条,第1-3条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
AGCM-3DLF: Accelerating Atmospheric General Circulation Model via 3-D Parallelization and Leap-Format 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 卷号: 34, 期号: 3, 页码: 766-780
作者:  Cao, Hang;  Yuan, Liang;  Zhang, He;  Zhang, Yunquan;  Wu, Baodong;  Li, Kun;  Li, Shigang;  Zhang, Minghua;  Lu, Pengqi;  Xiao, Junmin
收藏  |  浏览/下载:15/0  |  提交时间:2023/07/12
Atmospheric general circulation model  3-D decomposition  leap-format finite-difference  heterogeneous acceleration  
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 卷号: 32, 期号: 7, 页码: 1702-1712
作者:  Cheng, Daning;  Li, Shigang;  Zhang, Hanping;  Xia, Fen;  Zhang, Yunquan
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Training  Scalability  Machine learning  Machine learning algorithms  Stochastic processes  Task analysis  Upper bound  Parallel training algorithms  training dataset  scalability  stochastic optimization methods  
Cache-Oblivious MPI All-to-All Communications Based on Morton Order 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 卷号: 29, 期号: 3, 页码: 542-555
作者:  Li, Shigang;  Zhang, Yunquan;  Hoefler, Torsten
收藏  |  浏览/下载:50/0  |  提交时间:2019/12/10
cache-oblivious algorithms  collective communication  NUMA  MPI_Alltoall  MPI_Allgather  neighborhood collectives