CSpace

浏览/检索结果: 共7条,第1-7条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems 期刊论文
SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2022, 卷号: 35, 页码: 8
作者:  Wu, Jingya;  Lu, Wenyan;  Yan, Guihai;  Li, Xiaowei
收藏  |  浏览/下载:27/0  |  提交时间:2022/12/07
Accelerators  Heterogeneous systems  Bandwidth contention  Hardware hazard  PCIe  
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 卷号: 32, 期号: 7, 页码: 1702-1712
作者:  Cheng, Daning;  Li, Shigang;  Zhang, Hanping;  Xia, Fen;  Zhang, Yunquan
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Training  Scalability  Machine learning  Machine learning algorithms  Stochastic processes  Task analysis  Upper bound  Parallel training algorithms  training dataset  scalability  stochastic optimization methods  
WP-SGD: Weighted parallel SGD for distributed unbalanced-workload training system 期刊论文
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 卷号: 145, 页码: 202-216
作者:  Cheng Daning;  Li Shigang;  Zhang Yunquan
收藏  |  浏览/下载:48/0  |  提交时间:2020/12/10
SGD  Unbalanced workload  SimuParallel SGD  Distributed system  
ElasticActor: An Actor System with Automatic Granularity Adjustment 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 520-534
作者:  Zhao, Peng;  Liu, Lei;  Cao, Wei;  Dong, Xiao;  Li, Jiansong;  Feng, Xiaobing
收藏  |  浏览/下载:260/0  |  提交时间:2019/08/16
Actor model  Concurrency granularity  Cloud computing  Performance optimization  
A Non-Stop Double Buffering Mechanism for Dataflow Architecture 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2018, 卷号: 33, 期号: 1, 页码: 145-157
作者:  Tan, Xu;  Shen, Xiao-Wei;  Ye, Xiao-Chun;  Wang, Da;  Fan, Dong-Rui;  Zhang, Lunkai;  Li, Wen-Ming;  Zhang, Zhi-Min;  Tang, Zhi-Min
收藏  |  浏览/下载:67/0  |  提交时间:2019/12/10
non-stop  double buffering  dataflow architecture  high-performance computing  
Going Cooler With Timing-Constrained TeSHoP: A Temperature Sensing-Based Hotspot-Driven Placement Technique for FPGAs 期刊论文
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2017, 卷号: 25, 期号: 9, 页码: 2525-2537
作者:  Lu, Weina;  Hu, Yu;  Ye, Jing;  Li, Xiaowei
收藏  |  浏览/下载:39/0  |  提交时间:2019/12/12
Computer-aided design flow  field-programmable gate arrays (FPGAs)  hotspot optimization  performance  
A Cross-Platform SpMV Framework on Many-Core Architectures 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2016, 卷号: 13, 期号: 4, 页码: 25
作者:  Zhang, Yunquan;  Li, Shigang;  Yan, Shengen;  Zhou, Huiyang
收藏  |  浏览/下载:38/0  |  提交时间:2019/12/12
SpMV  segmented scan  BCCOO  OpenCL  CUDA  GPU  Intel MIC  parallel algorithms