CSpace

浏览/检索结果: 共151条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
MRFI: An Open-Source Multiresolution Fault Injection Framework for Neural Network Processing 期刊论文
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024, 页码: 11
作者:  Huang, Haitong;  Liu, Cheng;  Xue, Xinghua;  Liu, Bo;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Biological neural networks  Hardware  Reliability  Computational modeling  Neural networks  Fault tolerant systems  Fault tolerance  Fault evaluation  fault injection  fault simulation  multiresolution  neural network reliability  
Sketch-fusion: A gradient compression method with multi-layer fusion for communication-efficient distributed training 期刊论文
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 卷号: 185, 页码: 10
作者:  Dai, Lingfei;  Gong, Luqi;  An, Zhulin;  Xu, Yongjun;  Diao, Boyu
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Gradient compression  Multi-layer fusion  Distributed stochastic gradient descent  Deep learning training  
Distributed Multi-GPU Ab Initio Density Matrix Renormalization Group Algorithm with Applications to the P-Cluster of Nitrogenase 期刊论文
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2024, 卷号: 20, 期号: 2, 页码: 775-786
作者:  Xiang, Chunyang;  Jia, Weile;  Fang, Wei-Hai;  Li, Zhendong
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Poseidon-NDP: Practical Fully Homomorphic Encryption Accelerator Based on Near Data Processing Architecture 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 卷号: 42, 期号: 12, 页码: 4749-4762
作者:  Yang, Yinghao;  Lu, Hang;  Li, Xiaowei
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
FPGA accelerator  fully homomorphic encryption (FHE)  near data processing (NDP)  privacy computing  
Characterizing and Understanding Defense Methods for GNNs on GPUs 期刊论文
IEEE COMPUTER ARCHITECTURE LETTERS, 2023, 卷号: 22, 期号: 2, 页码: 137-140
作者:  Wu, Meng;  Yan, Mingyu;  Yang, Xiaocheng;  Li, Wenming;  Zhang, Zhimin;  Ye, Xiaochun;  Fan, Dongrui
收藏  |  浏览/下载:6/0  |  提交时间:2023/12/04
Kernel  Purification  Estimation  Graphics processing units  Perturbation methods  Electric breakdown  Training  Graph neural networks  defense  execution semantic  execution pattern  overhead  
EALI: Energy-aware layer-level scheduling for convolutional neural network inference services on GPUs 期刊论文
NEUROCOMPUTING, 2022, 卷号: 507, 页码: 265-281
作者:  Yao, Chunrong;  Liu, Wantao;  Liu, Zhibing;  Yan, Longchuan;  Hu, Songlin;  Tang, Weiqing
收藏  |  浏览/下载:23/0  |  提交时间:2022/12/07
Scheduling  Convolutional neural networks (CNNs)  GPUs  Service-level-objective (SLO)  Energy minimization  Inference services  
Characterizing and Understanding HGNNs on GPUs 期刊论文
IEEE COMPUTER ARCHITECTURE LETTERS, 2022, 卷号: 21, 期号: 2, 页码: 69-72
作者:  Yan, Mingyu;  Zou, Mo;  Yang, Xiaocheng;  Li, Wenming;  Ye, Xiaochun;  Fan, Dongrui;  Xie, Yuan
收藏  |  浏览/下载:22/0  |  提交时间:2022/12/07
Kernel  Semantics  Aggregates  Mercury (metals)  Motion pictures  Graphics processing units  Electric breakdown  Heterogeneous graph neural networks  GNNs  characterization  execution semantic  execution pattern  
Fast and accurate variable batch size convolution neural network training on large scale distributed systems 期刊论文
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 页码: 26
作者:  Hu, Zhongzhe;  Xiao, Junmin;  Sun, Ninghui;  Tan, Guangming
收藏  |  浏览/下载:18/0  |  提交时间:2022/12/07
deep learning  distributed computing  ImageNet-1K  large-batch training  synchronous SGD  
EAIS: Energy-aware adaptive scheduling for CNN inference on high-performance GPUs 期刊论文
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 卷号: 130, 页码: 253-268
作者:  Yao, Chunrong;  Liu, Wantao;  Tang, Weiqing;  Hu, Songlin
收藏  |  浏览/下载:17/0  |  提交时间:2022/12/07
Energy-aware  Convolutional neural network (CNN) inference  High-performance GPUs  Workload scheduling  Service-Level-Objective (SLO)  
Breaking the Interaction Wall: A DLPU-Centric Deep Learning Computing System 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2022, 卷号: 71, 期号: 1, 页码: 209-222
作者:  Du, Zidong;  Guo, Qi;  Zhao, Yongwei;  Zeng, Xi;  Li, Ling;  Cheng, Limin;  Xu, Zhiwei;  Sun, Ninghui;  Chen, Yunji
收藏  |  浏览/下载:30/0  |  提交时间:2022/06/21
Deep learning  Central Processing Unit  Process control  Task analysis  Computational modeling  Pipelines  Runtime  Neural net accelerators  system architectures  interaction wall