CSpace

浏览/检索结果: 共11条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Compiler-assisted Operator Template Library for DNN Accelerators 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2021, 页码: 18
作者:  Li, Jiansong;  Cao, Wei;  Dong, Xiao;  Li, Guangli;  Wang, Xueying;  Zhao, Peng;  Liu, Lei;  Feng, Xiaobing
收藏  |  浏览/下载:38/0  |  提交时间:2021/12/01
DNN Accelerators  Template Library  Address Space Management  
Fast Data-Obtaining Algorithm for Data Assimilation with Large Data Set 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 页码: 21
作者:  Xiao, Junmin;  Zhang, Guizhao;  Gao, Yanan;  Ho, Xuehai;  Tan, Guangming
收藏  |  浏览/下载:47/0  |  提交时间:2020/12/10
Data assimilation  I  O optimization  Communication optimization  Parallel implementation  Domain localization  
BSHIFT: A Low Cost Deep Neural Networks Accelerator 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 360-372
作者:  Yu, Yong;  Zhi, Tian;  Zhou, Xuda;  Liu, Shaoli;  Chen, Yunji;  Cheng, Shuyao
收藏  |  浏览/下载:87/0  |  提交时间:2019/08/16
Deep neural networks  Low power  Lossless  Accelerator  
Float-Fix: An Efficient and Hardware-Friendly Data Type for Deep Neural Network 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 345-359
作者:  Han, Dong;  Zhou, Shengyuan;  Zhi, Tian;  Wang, Yibo;  Liu, Shaoli
收藏  |  浏览/下载:75/0  |  提交时间:2019/08/16
Float-Fix  Neural network  Hardware accelerator  Data type  
Register-Aware Optimizations for Parallel Sparse Matrix-Matrix Multiplication 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 403-417
作者:  Liu, Junhong;  He, Xin;  Liu, Weifeng;  Tan, Guangming
收藏  |  浏览/下载:80/0  |  提交时间:2019/08/16
Sparse matrix  Sparse matrix-matrix multiplication  GPU  Register  
ElasticActor: An Actor System with Automatic Granularity Adjustment 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 520-534
作者:  Zhao, Peng;  Liu, Lei;  Cao, Wei;  Dong, Xiao;  Li, Jiansong;  Feng, Xiaobing
收藏  |  浏览/下载:263/0  |  提交时间:2019/08/16
Actor model  Concurrency granularity  Cloud computing  Performance optimization  
HyperFatTree: A Large-Scale Tree-Based Network with Low-Radix Switches 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 卷号: 45, 期号: 1, 页码: 172-184
作者:  Su, Yong;  Wang, Zhan;  Fan, Zhiguo;  Cao, Zheng;  Liu, Xiaoli;  Shao, En;  An, Xuejun;  Sun, Ninghui
收藏  |  浏览/下载:71/0  |  提交时间:2019/12/12
High energy efficiency  Hierarchical topology  Low-radix switch  Large scale interconnecting network  
Two-Level Task Scheduling for Irregular Applications on GPU Platform 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 卷号: 45, 期号: 1, 页码: 79-93
作者:  Li, Jing;  Liu, Lei;  Wu, Yuan;  Feng, Xiaobing;  Wu, Chengyong
收藏  |  浏览/下载:52/0  |  提交时间:2019/12/12
Hierarchical schedule  Resource-aware  Irregular application  GPU  
Performance Evaluation and Enhancement of Process-Based Parallel Loop Execution 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 卷号: 45, 期号: 1, 页码: 185-198
作者:  Lu, Xingjing;  Chen, Long;  Li, Zhiyuan
收藏  |  浏览/下载:42/0  |  提交时间:2019/12/12
Parallel loop  Process-based execution  Thread-based execution  DOACROSS  
SARP: Synopsis-Based Approximate Request Processing for Low Latency and Small Correctness Loss in Cloud Online Services 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2016, 卷号: 44, 期号: 5, 页码: 1054-1077
作者:  Han, Rui;  Zhan, Jianfeng;  Vazquez-Poletti Luis, Jose
收藏  |  浏览/下载:40/0  |  提交时间:2019/12/13
Cloud online service  Approximate request processing  Result correctness  Synopsis