CSpace

浏览/检索结果: 共7条,第1-7条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems 期刊论文
SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2022, 卷号: 35, 页码: 8
作者:  Wu, Jingya;  Lu, Wenyan;  Yan, Guihai;  Li, Xiaowei
收藏  |  浏览/下载:27/0  |  提交时间:2022/12/07
Accelerators  Heterogeneous systems  Bandwidth contention  Hardware hazard  PCIe  
ShuntFlowPlus: An Efficient and Scalable Dataflow Accelerator Architecture for Stream Applications 期刊论文
ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2021, 卷号: 17, 期号: 4, 页码: 24
作者:  Gong, Shijun;  Li, Jiajun;  Lu, Wenyan;  Yan, Guihai;  Li, Xiaowei
收藏  |  浏览/下载:21/0  |  提交时间:2022/12/07
Streaming processing  sliding-window aggregations  dataflow  buffer sharing  
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 卷号: 32, 期号: 7, 页码: 1702-1712
作者:  Cheng, Daning;  Li, Shigang;  Zhang, Hanping;  Xia, Fen;  Zhang, Yunquan
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Training  Scalability  Machine learning  Machine learning algorithms  Stochastic processes  Task analysis  Upper bound  Parallel training algorithms  training dataset  scalability  stochastic optimization methods  
Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 卷号: 31, 期号: 8, 页码: 1925-1941
作者:  Li, Zhihao;  Jia, Haipeng;  Zhang, Yunquan;  Chen, Tun;  Yuan, Liang;  Vuduc, Richard
收藏  |  浏览/下载:57/0  |  提交时间:2020/12/10
AutoFFT  FFT  code generation  template  DFT  
Energy-Aware Fault-Tolerant Dynamic Task Scheduling Scheme for Virtualized Cloud Data Centers 期刊论文
MOBILE NETWORKS & APPLICATIONS, 2019, 卷号: 24, 期号: 3, 页码: 1063-1077
作者:  Marahatta, Avinab;  Wang, Youshi;  Zhang, Fa;  Sangaiah, Arun Kumar;  Tyagi, Sumarga Kumar Sah;  Liu, Zhiyong
收藏  |  浏览/下载:82/0  |  提交时间:2019/08/16
Cloud computing  Cloud data center  Fault-tolerant  Dynamic task scheduling  Virtual machine  Migration  Energy-efficiency  
Efficient parallel optimizations of a high-performance SIFT on GPUs 期刊论文
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 卷号: 124, 页码: 78-91
作者:  Li, Zhihao;  Jia, Haipeng;  Zhang, Yunquan;  Liu, Shice;  Li, Shigang;  Wang, Xiao;  Zhang, Hao
收藏  |  浏览/下载:73/0  |  提交时间:2019/04/03
HartSift  SIFT  CPU  High performance  Feature extraction  
A Case of On-Chip Memory Subsystem Design for Low-Power CNN Accelerators 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 卷号: 37, 期号: 10, 页码: 1971-1984
作者:  Wang, Ying;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:68/0  |  提交时间:2019/12/10
Convolutional neural network (CNN)  deep learning  low power  memory subsystem