CSpace

Browse/Search results: 9 records in total, showing 1-9

Towards connection-scalable RNIC architecture (journal article)
JOURNAL OF SUPERCOMPUTING, 2024, Pages: 25
Authors:  Kang, Ning;  Wang, Zhan;  Yang, Fan;  Ma, Xiaoxiao;  Ma, Zhenlong;  Yuan, Guojun;  Tan, Guangming
Views/Downloads: 2/0  |  Submitted: 2024/05/20
Architecture design  Network Interface Card (NIC)  Remote Direct Memory Access (RDMA)  Scalability problem  
Fast and accurate variable batch size convolution neural network training on large scale distributed systems (journal article)
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, Pages: 26
Authors:  Hu, Zhongzhe;  Xiao, Junmin;  Sun, Ninghui;  Tan, Guangming
Views/Downloads: 20/0  |  Submitted: 2022/12/07
deep learning  distributed computing  ImageNet-1K  large-batch training  synchronous SGD  
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures (journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, Volume: 33, Issue: 1, Pages: 159-175
Authors:  Xie, Zhen;  Tan, Guangming;  Liu, Weifeng;  Sun, Ninghui
Views/Downloads: 41/0  |  Submitted: 2021/12/01
Libraries  Sparse matrices  Prediction algorithms  Neural networks  Predictive models  Memory management  Tuners  SpGEMM  sparse BLAS  sparse format  auto-tuning  neural network  
Optimizing the LINPACK Algorithm for Large-Scale PCIe-Based CPU-GPU Heterogeneous Systems (journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, Volume: 32, Issue: 9, Pages: 2367-2380
Authors:  Tan, Guangming;  Shui, Chaoyang;  Wang, Yinshan;  Yu, Xianzhi;  Yan, Yujin
Views/Downloads: 39/0  |  Submitted: 2021/12/01
Pipeline processing  Graphics processing units  Computer architecture  Supercomputers  Clustering algorithms  Programming  Optimization  LINPACK algorithm  software pipeline  performance model  heterogeneous computing  cluster  
Fast Data-Obtaining Algorithm for Data Assimilation with Large Data Set (journal article)
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, Pages: 21
Authors:  Xiao, Junmin;  Zhang, Guizhao;  Gao, Yanan;  Ho, Xuehai;  Tan, Guangming
Views/Downloads: 45/0  |  Submitted: 2020/12/10
Data assimilation  I/O optimization  Communication optimization  Parallel implementation  Domain localization  
Automated and precise event detection method for big data in biomedical imaging with support vector machine (journal article)
COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2018, Volume: 33, Issue: 2, Pages: 105-114
Authors:  Yuan, Lufeng;  Yao, Erlin;  Tan, Guangming
Views/Downloads: 48/0  |  Submitted: 2019/04/03
Biomedical imaging  event detection  machine learning  support vector machine  big data  
Graphine: Programming Graph-Parallel Computation of Large Natural Graphs for Multicore Clusters (journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, Volume: 27, Issue: 6, Pages: 1647-1659
Authors:  Yan, Jie;  Tan, Guangming;  Mo, Zeyao;  Sun, Ninghui
Views/Downloads: 42/0  |  Submitted: 2019/12/13
Graph-parallel  parallel framework  computational model  
Improving Performance of Dynamic Programming via Parallelism and Locality on Multicore Architectures (journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2009, Volume: 20, Issue: 2, Pages: 261-274
Authors:  Tan, Guangming;  Sun, Ninghui;  Gao, Guang R.
Views/Downloads: 37/0  |  Submitted: 2019/12/16
Dynamic programming  memory hierarchy  latency tolerant  percolation  multicore  
Cache oblivious algorithms for nonserial polyadic programming (journal article)
JOURNAL OF SUPERCOMPUTING, 2007, Volume: 39, Issue: 2, Pages: 227-249
Authors:  Tan, Guangming;  Feng, Shengzhong;  Sun, Ninghui
Views/Downloads: 43/0  |  Submitted: 2019/12/16
dynamic programming  nonserial polyadic  cache oblivious  algorithmic transformation  data dependencies