CSpace

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 1, 页码: 159-175
作者:  Xie, Zhen;  Tan, Guangming;  Liu, Weifeng;  Sun, Ninghui
收藏  |  浏览/下载:41/0  |  提交时间:2021/12/01
Libraries  Sparse matrices  Prediction algorithms  Neural networks  Predictive models  Memory management  Tuners  SpGEMM  spare BLAS  sparse format  auto-tuning  neural network  
WP-SGD: Weighted parallel SGD for distributed unbalanced-workload training system 期刊论文
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 卷号: 145, 页码: 202-216
作者:  Cheng Daning;  Li Shigang;  Zhang Yunquan
收藏  |  浏览/下载:48/0  |  提交时间:2020/12/10
SGD  Unbalanced workload  SimuParallel SGD  Distributed system  
Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 卷号: 31, 期号: 8, 页码: 1925-1941
作者:  Li, Zhihao;  Jia, Haipeng;  Zhang, Yunquan;  Chen, Tun;  Yuan, Liang;  Vuduc, Richard
收藏  |  浏览/下载:57/0  |  提交时间:2020/12/10
AutoFFT  FFT  code generation  template  DFT  
Register-Aware Optimizations for Parallel Sparse Matrix-Matrix Multiplication 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 403-417
作者:  Liu, Junhong;  He, Xin;  Liu, Weifeng;  Tan, Guangming
收藏  |  浏览/下载:78/0  |  提交时间:2019/08/16
Sparse matrix  Sparse matrix-matrix multiplication  GPU  Register  
ElasticActor: An Actor System with Automatic Granularity Adjustment 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 520-534
作者:  Zhao, Peng;  Liu, Lei;  Cao, Wei;  Dong, Xiao;  Li, Jiansong;  Feng, Xiaobing
收藏  |  浏览/下载:260/0  |  提交时间:2019/08/16
Actor model  Concurrency granularity  Cloud computing  Performance optimization  
Cacheap: Portable and Collaborative I/O Optimization for Graph Processing 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2019, 卷号: 34, 期号: 3, 页码: 690-706
作者:  Zhao, Peng;  Ding, Chen;  Liu, Lei;  Yu, Jiping;  Han, Wentao;  Feng, Xiao-Bing
收藏  |  浏览/下载:74/0  |  提交时间:2019/08/16
out-of-core graph processing system  I/O optimization  memory cache  graph analytics  locality  
Efficient parallel optimizations of a high-performance SIFT on GPUs 期刊论文
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 卷号: 124, 页码: 78-91
作者:  Li, Zhihao;  Jia, Haipeng;  Zhang, Yunquan;  Liu, Shice;  Li, Shigang;  Wang, Xiao;  Zhang, Hao
收藏  |  浏览/下载:73/0  |  提交时间:2019/04/03
HartSift  SIFT  CPU  High performance  Feature extraction  
Design and Implementation of Adaptive SpMV Library for Multicore and Many-Core Architecture 期刊论文
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2018, 卷号: 44, 期号: 4, 页码: 25
作者:  Tan, Guangming;  Liu, Junhong;  Li, Jiajia
收藏  |  浏览/下载:56/0  |  提交时间:2019/12/10
Sparse matrix vector multiplication  auto-tuning  multicore  machine learning  
Quadboost: A Scalable Concurrent Quadtree 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 卷号: 29, 期号: 3, 页码: 673-686
作者:  Zhou, Keren;  Tan, Guangming;  Zhou, Wei
收藏  |  浏览/下载:40/0  |  提交时间:2019/12/10
Concurrent data structures  quadtree  continuous find  decoupling  LCA  
Cache-Oblivious MPI All-to-All Communications Based on Morton Order 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 卷号: 29, 期号: 3, 页码: 542-555
作者:  Li, Shigang;  Zhang, Yunquan;  Hoefler, Torsten
收藏  |  浏览/下载:50/0  |  提交时间:2019/12/10
cache-oblivious algorithms  collective communication  NUMA  MPI_Alltoall  MPI_Allgather  neighborhood collectives