CSpace

浏览/检索结果: 共9条,第1-9条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 1, 页码: 159-175
作者:  Xie, Zhen;  Tan, Guangming;  Liu, Weifeng;  Sun, Ninghui
收藏  |  浏览/下载:41/0  |  提交时间:2021/12/01
Libraries  Sparse matrices  Prediction algorithms  Neural networks  Predictive models  Memory management  Tuners  SpGEMM  spare BLAS  sparse format  auto-tuning  neural network  
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 卷号: 32, 期号: 7, 页码: 1702-1712
作者:  Cheng, Daning;  Li, Shigang;  Zhang, Hanping;  Xia, Fen;  Zhang, Yunquan
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Training  Scalability  Machine learning  Machine learning algorithms  Stochastic processes  Task analysis  Upper bound  Parallel training algorithms  training dataset  scalability  stochastic optimization methods  
WP-SGD: Weighted parallel SGD for distributed unbalanced-workload training system 期刊论文
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 卷号: 145, 页码: 202-216
作者:  Cheng Daning;  Li Shigang;  Zhang Yunquan
收藏  |  浏览/下载:48/0  |  提交时间:2020/12/10
SGD  Unbalanced workload  SimuParallel SGD  Distributed system  
Register-Aware Optimizations for Parallel Sparse Matrix-Matrix Multiplication 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 403-417
作者:  Liu, Junhong;  He, Xin;  Liu, Weifeng;  Tan, Guangming
收藏  |  浏览/下载:78/0  |  提交时间:2019/08/16
Sparse matrix  Sparse matrix-matrix multiplication  GPU  Register  
PIM-WEAVER: A High Energy-efficient, General-purpose Acceleration Architecture for String Operations in Big Data Processing 期刊论文
SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2019, 卷号: 21, 页码: 129-142
作者:  Li, Wenming;  Ye, Xiaochun;  Wang, Da;  Zhang, Hao;  Tang, Zhimin;  Fan, Dongrui;  Sun, Ninghui
收藏  |  浏览/下载:133/0  |  提交时间:2019/08/16
PIM  String operations  Acceleration architecture  Big data  HMC  
Cache-Oblivious MPI All-to-All Communications Based on Morton Order 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 卷号: 29, 期号: 3, 页码: 542-555
作者:  Li, Shigang;  Zhang, Yunquan;  Hoefler, Torsten
收藏  |  浏览/下载:50/0  |  提交时间:2019/12/10
cache-oblivious algorithms  collective communication  NUMA  MPI_Alltoall  MPI_Allgather  neighborhood collectives  
Parallel Incremental Frequent Itemset Mining for Large Data 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 卷号: 32, 期号: 2, 页码: 368-385
作者:  Song, Yu-Geng;  Cui, Hui-Min;  Feng, Xiao-Bing
收藏  |  浏览/下载:40/0  |  提交时间:2019/12/12
incremental parallel FPGrowth  data mining  frequent itemset mining  MapReduce  
DLPlib: A Library for Deep Learning Processor 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 卷号: 32, 期号: 2, 页码: 286-296
作者:  Lan, Hui-Ying;  Wu, Lin-Yang;  Zhang, Xiao;  Tao, Jin-Hua;  Chen, Xun-Yu;  Wang, Bing-Rui;  Wang, Yu-Qing;  Guo, Qi;  Chen, Yun-Ji
收藏  |  浏览/下载:69/0  |  提交时间:2019/12/12
deep learning processor  API  library  DLPlib  
A Cross-Platform SpMV Framework on Many-Core Architectures 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2016, 卷号: 13, 期号: 4, 页码: 25
作者:  Zhang, Yunquan;  Li, Shigang;  Yan, Shengen;  Zhou, Huiyang
收藏  |  浏览/下载:38/0  |  提交时间:2019/12/12
SpMV  segmented scan  BCCOO  OpenCL  CUDA  GPU  Intel MIC  parallel algorithms