CSpace

Browse/Search results: 15 items in total, showing items 1-10

Frequency-Domain Inference Acceleration for Convolutional Neural Networks Using ReRAMs (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, Volume: 34, Issue: 12, Pages: 3133-3146
Authors: Liu, Bosheng; Jiang, Zhuoshen; Wu, Yalan; Wu, Jigang; Chen, Xiaoming; Liu, Peng; Zhou, Qingguo; Han, Yinhe
Views/Downloads: 8/0  |  Submitted: 2023/12/04
Frequency-domain accelerator  energy efficiency  resistive random access memory  frequency-domain convolutions  
Accelerating k-Shape Time Series Clustering Algorithm Using GPU (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, Volume: 34, Issue: 10, Pages: 2718-2734
Authors: Wang, Xun; Song, Ruibao; Xiao, Junmin; Li, Tong; Li, Xueqi
Views/Downloads: 7/0  |  Submitted: 2023/12/04
Data space  time series analysis  time series clustering  GPU architecture  k-shape algorithm  
DRONE: An Efficient Distributed Subgraph-Centric Framework for Processing Large-Scale Power-law Graphs (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, Volume: 34, Issue: 2, Pages: 463-474
Authors: Zhang, Shuai; Jiang, Zite; Hou, Xingzhong; Li, Mingyu; Yuan, Mengting; You, Haihang
Views/Downloads: 13/0  |  Submitted: 2023/07/12
Fault tolerance  graph partition  large-scale power-law graph  parallel graph computation  subgraph-centric model  
An Accurate and Efficient Large-Scale Regression Method Through Best Friend Clustering (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, Volume: 33, Issue: 11, Pages: 3129-3140
Authors: Li, Kun; Yuan, Liang; Zhang, Yunquan; Chen, Gongwei
Views/Downloads: 30/0  |  Submitted: 2022/12/07
Clustering algorithms  Training  Mathematical models  Computational modeling  Libraries  Kernel  Support vector machines  Distributed machine learning  scalable algorithm  large-scale clustering  parallel regression  
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, Volume: 33, Issue: 1, Pages: 159-175
Authors: Xie, Zhen; Tan, Guangming; Liu, Weifeng; Sun, Ninghui
Views/Downloads: 41/0  |  Submitted: 2021/12/01
Libraries  Sparse matrices  Prediction algorithms  Neural networks  Predictive models  Memory management  Tuners  SpGEMM  sparse BLAS  sparse format  auto-tuning  neural network
Optimizing the LINPACK Algorithm for Large-Scale PCIe-Based CPU-GPU Heterogeneous Systems (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, Volume: 32, Issue: 9, Pages: 2367-2380
Authors: Tan, Guangming; Shui, Chaoyang; Wang, Yinshan; Yu, Xianzhi; Yan, Yujin
Views/Downloads: 38/0  |  Submitted: 2021/12/01
Pipeline processing  Graphics processing units  Computer architecture  Supercomputers  Clustering algorithms  Programming  Optimization  LINPACK algorithm  software pipeline  performance model  heterogeneous computing  cluster  
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, Volume: 32, Issue: 7, Pages: 1702-1712
Authors: Cheng, Daning; Li, Shigang; Zhang, Hanping; Xia, Fen; Zhang, Yunquan
Views/Downloads: 35/0  |  Submitted: 2021/12/01
Training  Scalability  Machine learning  Machine learning algorithms  Stochastic processes  Task analysis  Upper bound  Parallel training algorithms  training dataset  scalability  stochastic optimization methods  
LPM: A Systematic Methodology for Concurrent Data Access Pattern Optimization from a Matching Perspective (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, Volume: 30, Issue: 11, Pages: 2478-2493
Authors: Liu, Yuhang; Sun, Xian-He
Views/Downloads: 40/0  |  Submitted: 2020/12/10
Concurrent computing  Optimization  Delays  Program processors  Hardware  Systematics  Analytical models  Memory wall  memory stall time  efficiency  performance optimization  layered performance matching (LPM)  memory concurrency  
Cache-Oblivious MPI All-to-All Communications Based on Morton Order (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, Volume: 29, Issue: 3, Pages: 542-555
Authors: Li, Shigang; Zhang, Yunquan; Hoefler, Torsten
Views/Downloads: 50/0  |  Submitted: 2019/12/10
cache-oblivious algorithms  collective communication  NUMA  MPI_Alltoall  MPI_Allgather  neighborhood collectives  
Parallel and Streaming Truth Discovery in Large-Scale Quantitative Crowdsourcing (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, Volume: 27, Issue: 10, Pages: 2984-2997
Authors: Ouyang, Robin Wentao; Kaplan, Lance M.; Toniolo, Alice; Srivastava, Mani; Norman, Timothy J.
Views/Downloads: 51/0  |  Submitted: 2019/12/13
Crowdsourcing  truth discovery  quantitative task  big data  parallel algorithm  streaming algorithm