CSpace

浏览/检索结果: 共9条,第1-9条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems 期刊论文
SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2022, 卷号: 35, 页码: 8
作者:  Wu, Jingya;  Lu, Wenyan;  Yan, Guihai;  Li, Xiaowei
收藏  |  浏览/下载:27/0  |  提交时间:2022/12/07
Accelerators  Heterogeneous systems  Bandwidth contention  Hardware hazard  PCIe  
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 1, 页码: 159-175
作者:  Xie, Zhen;  Tan, Guangming;  Liu, Weifeng;  Sun, Ninghui
收藏  |  浏览/下载:41/0  |  提交时间:2021/12/01
Libraries  Sparse matrices  Prediction algorithms  Neural networks  Predictive models  Memory management  Tuners  SpGEMM  spare BLAS  sparse format  auto-tuning  neural network  
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 卷号: 32, 期号: 7, 页码: 1702-1712
作者:  Cheng, Daning;  Li, Shigang;  Zhang, Hanping;  Xia, Fen;  Zhang, Yunquan
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Training  Scalability  Machine learning  Machine learning algorithms  Stochastic processes  Task analysis  Upper bound  Parallel training algorithms  training dataset  scalability  stochastic optimization methods  
WP-SGD: Weighted parallel SGD for distributed unbalanced-workload training system 期刊论文
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 卷号: 145, 页码: 202-216
作者:  Cheng Daning;  Li Shigang;  Zhang Yunquan
收藏  |  浏览/下载:48/0  |  提交时间:2020/12/10
SGD  Unbalanced workload  SimuParallel SGD  Distributed system  
BZIP: A compact data memory system for UTXO-based blockchains 期刊论文
JOURNAL OF SYSTEMS ARCHITECTURE, 2020, 卷号: 109, 页码: 8
作者:  Jiang, Shuhao;  Li, Jiajun;  Gong, Shijun;  Yan, Junchao;  Yan, Guihai;  Sun, Yi;  Li, Xiaowei
收藏  |  浏览/下载:48/0  |  提交时间:2020/12/10
UTXO  Blockchain  Data Compression  IoT  
ElasticActor: An Actor System with Automatic Granularity Adjustment 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 520-534
作者:  Zhao, Peng;  Liu, Lei;  Cao, Wei;  Dong, Xiao;  Li, Jiansong;  Feng, Xiaobing
收藏  |  浏览/下载:260/0  |  提交时间:2019/08/16
Actor model  Concurrency granularity  Cloud computing  Performance optimization  
SynergyFlow: An Elastic Accelerator Architecture Supporting Batch Processing of Large-Scale Deep Neural Networks 期刊论文
ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2019, 卷号: 24, 期号: 1, 页码: 27
作者:  Li, Jiajun;  Yan, Guihai;  Lu, Wenyan;  Gong, Shijun;  Jiang, Shuhao;  Wu, Jingya;  Li, Xiaowei
收藏  |  浏览/下载:70/0  |  提交时间:2019/04/03
Deep neural networks  convolutional neural networks  accelerator  architecture  resource utilization  complementary effect  
Design and Implementation of Adaptive SpMV Library for Multicore and Many-Core Architecture 期刊论文
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2018, 卷号: 44, 期号: 4, 页码: 25
作者:  Tan, Guangming;  Liu, Junhong;  Li, Jiajia
收藏  |  浏览/下载:56/0  |  提交时间:2019/12/10
Sparse matrix vector multiplication  auto-tuning  multicore  machine learning  
DLPlib: A Library for Deep Learning Processor 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 卷号: 32, 期号: 2, 页码: 286-296
作者:  Lan, Hui-Ying;  Wu, Lin-Yang;  Zhang, Xiao;  Tao, Jin-Hua;  Chen, Xun-Yu;  Wang, Bing-Rui;  Wang, Yu-Qing;  Guo, Qi;  Chen, Yun-Ji
收藏  |  浏览/下载:69/0  |  提交时间:2019/12/12
deep learning processor  API  library  DLPlib