CSpace

浏览/检索结果: 共10条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
AGCM-3DLF: Accelerating Atmospheric General Circulation Model via 3-D Parallelization and Leap-Format 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 卷号: 34, 期号: 3, 页码: 766-780
作者:  Cao, Hang;  Yuan, Liang;  Zhang, He;  Zhang, Yunquan;  Wu, Baodong;  Li, Kun;  Li, Shigang;  Zhang, Minghua;  Lu, Pengqi;  Xiao, Junmin
收藏  |  浏览/下载:15/0  |  提交时间:2023/07/12
Atmospheric general circulation model  3-D decomposition  leap-format finite-difference  heterogeneous acceleration  
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 卷号: 32, 期号: 7, 页码: 1702-1712
作者:  Cheng, Daning;  Li, Shigang;  Zhang, Hanping;  Xia, Fen;  Zhang, Yunquan
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Training  Scalability  Machine learning  Machine learning algorithms  Stochastic processes  Task analysis  Upper bound  Parallel training algorithms  training dataset  scalability  stochastic optimization methods  
FastNBL: fast neighbor lists establishment for molecular dynamics simulation based on bitwise operations 期刊论文
JOURNAL OF SUPERCOMPUTING, 2020, 卷号: 76, 期号: 7, 页码: 5501-5520
作者:  Li, Kun;  Li, Shigang;  Huang, Shan;  Chen, Yifeng;  Zhang, Yunquan
收藏  |  浏览/下载:50/0  |  提交时间:2020/12/10
Neighbor list  Bitwise operations  SIMD  Molecular dynamics  
The static parallel distribution algorithms for hybrid density-functional calculations in HONPAS package 期刊论文
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2020, 卷号: 34, 期号: 2, 页码: 159-168
作者:  Qin, Xinming;  Shang, Honghui;  Xu, Lei;  Hu, Wei;  Yang, Jinlong;  Li, Shigang;  Zhang, Yunquan
收藏  |  浏览/下载:45/0  |  提交时间:2020/12/10
Distributed algorithms  hybrid density-functional calculations  HONPAS package  electron repulsion integrals  parallel implementation  
FastNBL: fast neighbor lists establishment for molecular dynamics simulation based on bitwise operations (vol 457, pg 235, 2020) 期刊论文
JOURNAL OF SUPERCOMPUTING, 2019, 卷号: 75, 期号: 12, 页码: 8339-8340
作者:  Li, Kun;  Li, Shigang;  Huang, Shan;  Chen, Yifeng;  Zhang, Yunquan
收藏  |  浏览/下载:48/0  |  提交时间:2020/12/10
Efficient parallel optimizations of a high-performance SIFT on GPUs 期刊论文
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 卷号: 124, 页码: 78-91
作者:  Li, Zhihao;  Jia, Haipeng;  Zhang, Yunquan;  Liu, Shice;  Li, Shigang;  Wang, Xiao;  Zhang, Hao
收藏  |  浏览/下载:73/0  |  提交时间:2019/04/03
HartSift  SIFT  CPU  High performance  Feature extraction  
Cache-Oblivious MPI All-to-All Communications Based on Morton Order 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 卷号: 29, 期号: 3, 页码: 542-555
作者:  Li, Shigang;  Zhang, Yunquan;  Hoefler, Torsten
收藏  |  浏览/下载:50/0  |  提交时间:2019/12/10
cache-oblivious algorithms  collective communication  NUMA  MPI_Alltoall  MPI_Allgather  neighborhood collectives  
Hybrid-optimization strategy for the communication of large-scale Kinetic Monte Carlo simulation 期刊论文
COMPUTER PHYSICS COMMUNICATIONS, 2017, 卷号: 211, 页码: 113-123
作者:  Wu, Baodong;  Li, Shigang;  Zhang, Yunquan;  Nie, Ningming
收藏  |  浏览/下载:43/0  |  提交时间:2019/12/12
Kinetic Monte Carlo  Communication aggregation  Shared memory  Neighborhood collectives  
A Cross-Platform SpMV Framework on Many-Core Architectures 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2016, 卷号: 13, 期号: 4, 页码: 25
作者:  Zhang, Yunquan;  Li, Shigang;  Yan, Shengen;  Zhou, Huiyang
收藏  |  浏览/下载:38/0  |  提交时间:2019/12/12
SpMV  segmented scan  BCCOO  OpenCL  CUDA  GPU  Intel MIC  parallel algorithms  
Parallel Processing Systems for Big Data: A Survey 期刊论文
PROCEEDINGS OF THE IEEE, 2016, 卷号: 104, 期号: 11, 页码: 2114-2136
作者:  Zhang, Yunquan;  Cao, Ting;  Li, Shigang;  Tian, Xinhui;  Yuan, Liang;  Jia, Haipeng;  Vasilakos, Athanasios V.
收藏  |  浏览/下载:55/0  |  提交时间:2019/12/13
Big data  machine learning  MapReduce  parallel processing  SQL  survey