CSpace

浏览/检索结果: 共1228条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
无权访问的条目 学位论文
作者:  蒲宇宁
Adobe PDF(4620Kb)  |  收藏  |  浏览/下载:2/1  |  提交时间:2016/07/01
MRFI: An Open-Source Multiresolution Fault Injection Framework for Neural Network Processing 期刊论文
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024, 页码: 11
作者:  Huang, Haitong;  Liu, Cheng;  Xue, Xinghua;  Liu, Bo;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:3/0  |  提交时间:2024/05/20
Biological neural networks  Hardware  Reliability  Computational modeling  Neural networks  Fault tolerant systems  Fault tolerance  Fault evaluation  fault injection  fault simulation  multiresolution  neural network reliability  
PDG: A Prefetcher for Dynamic Graph Updating 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 卷号: 43, 期号: 4, 页码: 1246-1259
作者:  Zhang, Xinmiao;  Liu, Cheng;  Ni, Jiacheng;  Cheng, Yuanqing;  Zhang, Lei;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:3/0  |  提交时间:2024/05/20
Prefetching  Arrays  Optimization  Runtime  Heuristic algorithms  Computers  Monitoring  Computer architecture  data prefetching  memory system  
CUTE: A scalable CPU-centric and Ultra-utilized Tensor Engine for convolutions 期刊论文
JOURNAL OF SYSTEMS ARCHITECTURE, 2024, 卷号: 149, 页码: 15
作者:  Li, Wenqing;  Ye, Jinpeng;  Zhang, Fuxin;  Liu, Tianyi;  Zhang, Tingting;  Wang, Jian
收藏  |  浏览/下载:3/0  |  提交时间:2024/05/20
Tensor engine  Convolution  Scalable architecture  CPU-centric  Utilization  
ApproxDup: Developing an Approximate Instruction Duplication Mechanism for Efficient SDC Detection in GPGPUs 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 卷号: 43, 期号: 4, 页码: 1051-1064
作者:  Wei, Xiaohui;  Jiang, Nan;  Yue, Hengshan;  Wang, Xiaonan;  Zhao, Jianpeng;  Li, Guangli;  Qiu, Meikang
收藏  |  浏览/下载:3/0  |  提交时间:2024/05/20
Instruction sets  Reliability  Resilience  Circuit faults  Registers  Kernel  Graphics processing units  Approximate computing  GPGPUs  instruction duplication  silent data corruptions (SDCs)  soft error  
Towards connection-scalable RNIC architecture 期刊论文
JOURNAL OF SUPERCOMPUTING, 2024, 页码: 25
作者:  Kang, Ning;  Wang, Zhan;  Yang, Fan;  Ma, Xiaoxiao;  Ma, Zhenlong;  Yuan, Guojun;  Tan, Guangming
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Architecture design  Network Interface Card (NIC)  Remote Direct Memory Access (RDMA)  Scalability problem  
RTIFed: A Reputation based Triple-step Incentive mechanism for energy-aware Federated learning over battery-constricted devices 期刊论文
COMPUTER NETWORKS, 2024, 卷号: 241, 页码: 14
作者:  Wen, Tian;  Zhang, Hanqing;  Zhang, Han;  Wu, Huixin;  Wang, Danxin;  Liu, Xiuwen;  Zhang, Weishan;  Wang, Yuwei;  Cao, Shaohua
收藏  |  浏览/下载:4/0  |  提交时间:2024/05/20
Federated learning  Incentive mechanism  Energy efficiency  Client activation  Stackelberg game  
Mortar-FP8: Morphing the Existing FP32 Infrastructure for High-Performance Deep Learning Acceleration 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 卷号: 43, 期号: 3, 页码: 878-891
作者:  Li, Hongyan;  Lu, Hang;  Li, Xiaowei
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Deep learning accelerator  deep neural network (DNN)  fp8 format  
Improving Utilization of Dataflow Unit for Multi-Batch Processing 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 卷号: 21, 期号: 1, 页码: 26
作者:  Fan, Zhihua;  Li, Wenming;  Wang, Zhen;  Yang, Yu;  Ye, Xiaochun;  Fan, Dongrui;  Sun, Ninghui;  An, Xuejun
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Utilization  network-on-chip  decoupled architecture  batch processing  
Fast Convolution Meets Low Precision: Exploring Efficient Quantized Winograd Convolution on Modern CPUs 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 卷号: 21, 期号: 1, 页码: 26
作者:  Wang, Xueying;  Li, Guangli;  Jia, Zhen;  Feng, Xiaobing;  Wang, Yida
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Deep learning  winograd convolution  low-precision computation