CSpace

Browse/search results: 3 items in total, showing items 1-3

Register-Aware Optimizations for Parallel Sparse Matrix-Matrix Multiplication (journal article)
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, Volume: 47, Issue: 3, Pages: 403-417
Authors:  Liu, Junhong;  He, Xin;  Liu, Weifeng;  Tan, Guangming
Views/Downloads: 78/0  |  Submitted: 2019/08/16
Sparse matrix  Sparse matrix-matrix multiplication  GPU  Register  
A Low Overhead In-Network Data Compressor for the Memory Hierarchy of Chip Multiprocessors (journal article)
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, Volume: 37, Issue: 6, Pages: 1265-1277
Authors:  Wang, Ying;  Li, Huawei;  Han, Yinhe;  Li, Xiaowei
Views/Downloads: 67/0  |  Submitted: 2019/12/10
Cache  chip multiprocessor (CMP)  compression  memory hierarchy  network-on-chip (NoC)  
Cache-Oblivious MPI All-to-All Communications Based on Morton Order (journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, Volume: 29, Issue: 3, Pages: 542-555
Authors:  Li, Shigang;  Zhang, Yunquan;  Hoefler, Torsten
Views/Downloads: 50/0  |  Submitted: 2019/12/10
cache-oblivious algorithms  collective communication  NUMA  MPI_Alltoall  MPI_Allgather  neighborhood collectives