CSpace

浏览/检索结果: 共7条,第1-7条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 卷号: 31, 期号: 8, 页码: 1925-1941
作者:  Li, Zhihao;  Jia, Haipeng;  Zhang, Yunquan;  Chen, Tun;  Yuan, Liang;  Vuduc, Richard
收藏  |  浏览/下载:57/0  |  提交时间:2020/12/10
AutoFFT  FFT  code generation  template  DFT  
Register-Aware Optimizations for Parallel Sparse Matrix-Matrix Multiplication 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 403-417
作者:  Liu, Junhong;  He, Xin;  Liu, Weifeng;  Tan, Guangming
收藏  |  浏览/下载:78/0  |  提交时间:2019/08/16
Sparse matrix  Sparse matrix-matrix multiplication  GPU  Register  
ElasticActor: An Actor System with Automatic Granularity Adjustment 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 520-534
作者:  Zhao, Peng;  Liu, Lei;  Cao, Wei;  Dong, Xiao;  Li, Jiansong;  Feng, Xiaobing
收藏  |  浏览/下载:260/0  |  提交时间:2019/08/16
Actor model  Concurrency granularity  Cloud computing  Performance optimization  
Efficient parallel optimizations of a high-performance SIFT on GPUs 期刊论文
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 卷号: 124, 页码: 78-91
作者:  Li, Zhihao;  Jia, Haipeng;  Zhang, Yunquan;  Liu, Shice;  Li, Shigang;  Wang, Xiao;  Zhang, Hao
收藏  |  浏览/下载:73/0  |  提交时间:2019/04/03
HartSift  SIFT  CPU  High performance  Feature extraction  
Two-Level Task Scheduling for Irregular Applications on GPU Platform 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 卷号: 45, 期号: 1, 页码: 79-93
作者:  Li, Jing;  Liu, Lei;  Wu, Yuan;  Feng, Xiaobing;  Wu, Chengyong
收藏  |  浏览/下载:50/0  |  提交时间:2019/12/12
Hierarchical schedule  Resource-aware  Irregular application  GPU  
Performance Evaluation and Enhancement of Process-Based Parallel Loop Execution 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 卷号: 45, 期号: 1, 页码: 185-198
作者:  Lu, Xingjing;  Chen, Long;  Li, Zhiyuan
收藏  |  浏览/下载:40/0  |  提交时间:2019/12/12
Parallel loop  Process-based execution  Thread-based execution  DOACROSS  
A Cross-Platform SpMV Framework on Many-Core Architectures 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2016, 卷号: 13, 期号: 4, 页码: 25
作者:  Zhang, Yunquan;  Li, Shigang;  Yan, Shengen;  Zhou, Huiyang
收藏  |  浏览/下载:38/0  |  提交时间:2019/12/12
SpMV  segmented scan  BCCOO  OpenCL  CUDA  GPU  Intel MIC  parallel algorithms