CSpace

浏览/检索结果: 共566条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
无权访问的条目 学位论文
作者:  蒲宇宁
Adobe PDF(4620Kb)  |  收藏  |  浏览/下载:2/1  |  提交时间:2016/07/01
ApproxDup: Developing an Approximate Instruction Duplication Mechanism for Efficient SDC Detection in GPGPUs 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 卷号: 43, 期号: 4, 页码: 1051-1064
作者:  Wei, Xiaohui;  Jiang, Nan;  Yue, Hengshan;  Wang, Xiaonan;  Zhao, Jianpeng;  Li, Guangli;  Qiu, Meikang
收藏  |  浏览/下载:8/0  |  提交时间:2024/05/20
Instruction sets  Reliability  Resilience  Circuit faults  Registers  Kernel  Graphics processing units  Approximate computing  GPGPUs  instruction duplication  silent data corruptions (SDCs)  soft error  
Accelerating Deformable Convolution Networks with Dynamic and Irregular Memory Accesses 期刊论文
ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2023, 卷号: 28, 期号: 4, 页码: 23
作者:  Chu, Cheng;  Liu, Cheng;  Xu, Dawen;  Wang, Ying;  Luo, Tao;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:12/0  |  提交时间:2023/12/04
Deformable convolution network  neural network accelerator  irregular memory access  runtime tile scheduling  
TCADer: A Tightly Coupled Accelerator Design framework for heterogeneous system with hardware/software co-design 期刊论文
JOURNAL OF SYSTEMS ARCHITECTURE, 2023, 卷号: 136, 页码: 12
作者:  Li, Wenqing;  Liu, Tianyi;  Xiao, Ziyuan;  Qi, Han;  Zhu, Weipu;  Wang, Jian
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
Heterogeneous system  Tightly coupled  Accelerator design framework  Interaction latency  Low-latency task  
Reconfiguration algorithms for synchronous communication on switch based degradable arrays 期刊论文
PARALLEL COMPUTING, 2022, 卷号: 111, 页码: 10
作者:  Wu, Yalan;  Wu, Jigang;  Liu, Peng;  Han, Yinhe;  Srikanthan, Thambipillai
收藏  |  浏览/下载:26/0  |  提交时间:2022/12/07
Mesh-connected processor array  Reconfiguration algorithm  Fault-tolerance  Synchronous communication  
CAP: Communication-Aware Automated Parallelization for Deep Learning Inference on CMP Architectures 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2022, 卷号: 71, 期号: 7, 页码: 1626-1639
作者:  Zou, Kaiwei;  Wang, Ying;  Cheng, Long;  Qu, Songyun;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:34/0  |  提交时间:2022/12/07
Kernel  Computer architecture  Multicore processing  Deep learning  System-on-chip  Parallel processing  Real-time systems  Neural networks  parallel processing  real-time and embedded systems  single-chip multiprocessors  reinforcement learning  structured sparsity  
DeepCS: Training a deep learning model for cervical spondylosis recognition on small-labeled sensor data 期刊论文
NEUROCOMPUTING, 2022, 卷号: 472, 页码: 24-34
作者:  Wang, Nana;  Luo, Chunjie;  Huang, Xi;  Huang, Yunyou;  Zhan, Jianfeng
收藏  |  浏览/下载:22/0  |  提交时间:2022/12/07
Cervical spondylosis recognition  High-dimensional time series sensor data  Convolutional neural network  Network architecture search  Feature extraction  
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 1, 页码: 159-175
作者:  Xie, Zhen;  Tan, Guangming;  Liu, Weifeng;  Sun, Ninghui
收藏  |  浏览/下载:47/0  |  提交时间:2021/12/01
Libraries  Sparse matrices  Prediction algorithms  Neural networks  Predictive models  Memory management  Tuners  SpGEMM  spare BLAS  sparse format  auto-tuning  neural network  
Breaking the Interaction Wall: A DLPU-Centric Deep Learning Computing System 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2022, 卷号: 71, 期号: 1, 页码: 209-222
作者:  Du, Zidong;  Guo, Qi;  Zhao, Yongwei;  Zeng, Xi;  Li, Ling;  Cheng, Limin;  Xu, Zhiwei;  Sun, Ninghui;  Chen, Yunji
收藏  |  浏览/下载:37/0  |  提交时间:2022/06/21
Deep learning  Central Processing Unit  Process control  Task analysis  Computational modeling  Pipelines  Runtime  Neural net accelerators  system architectures  interaction wall  
Many-core acceleration of the first-principles all-electron quantum perturbation calculations 期刊论文
COMPUTER PHYSICS COMMUNICATIONS, 2021, 卷号: 267, 页码: 8
作者:  Shang, Honghui;  Duan, Xiaohui;  Li, Fang;  Zhang, Libo;  Xu, Zhiqian;  Liu, Kan;  Luo, Haiwen;  Ji, Yingrui;  Zhao, Wenxuan;  Xue, Wei;  Chen, Li;  Zhang, Yunquan
收藏  |  浏览/下载:40/0  |  提交时间:2021/12/01
Density-functional perturbation theory  Many-core architecture  Linear scaling  MPI  Numeric atomic orbitals