CSpace

浏览/检索结果: 共3条,第1-3条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 卷号: 31, 期号: 8, 页码: 1925-1941
作者:  Li, Zhihao;  Jia, Haipeng;  Zhang, Yunquan;  Chen, Tun;  Yuan, Liang;  Vuduc, Richard
收藏  |  浏览/下载:58/0  |  提交时间:2020/12/10
AutoFFT  FFT  code generation  template  DFT  
Register-Aware Optimizations for Parallel Sparse Matrix-Matrix Multiplication 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 403-417
作者:  Liu, Junhong;  He, Xin;  Liu, Weifeng;  Tan, Guangming
收藏  |  浏览/下载:78/0  |  提交时间:2019/08/16
Sparse matrix  Sparse matrix-matrix multiplication  GPU  Register  
A Cross-Platform SpMV Framework on Many-Core Architectures 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2016, 卷号: 13, 期号: 4, 页码: 25
作者:  Zhang, Yunquan;  Li, Shigang;  Yan, Shengen;  Zhou, Huiyang
收藏  |  浏览/下载:38/0  |  提交时间:2019/12/12
SpMV  segmented scan  BCCOO  OpenCL  CUDA  GPU  Intel MIC  parallel algorithms