CSpace

浏览/检索结果: 共12条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Fast Convolution Meets Low Precision: Exploring Efficient Quantized Winograd Convolution on Modern CPUs 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 卷号: 21, 期号: 1, 页码: 26
作者:  Wang, Xueying;  Li, Guangli;  Jia, Zhen;  Feng, Xiaobing;  Wang, Yida
收藏  |  浏览/下载:1/0  |  提交时间:2024/05/20
Deep learning  winograd convolution  low-precision computation  
VTensor: Using Virtual Tensors to Build a Layout-Oblivious AI Programming Framework 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 5, 页码: 1074-1097
作者:  Yu, Feng;  Zhao, Jia-Cheng;  Cui, Hui-Min;  Feng, Xiao-Bing;  Xue, Jingling
收藏  |  浏览/下载:1/0  |  提交时间:2024/05/20
artificial intelligence (AI) programming  layout-oblivious  tensor processing  
A Coordinated Model Pruning and Mapping Framework for RRAM-Based DNN Accelerators 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 卷号: 42, 期号: 7, 页码: 2364-2376
作者:  Qu, Songyun;  Li, Bing;  Zhao, Shixin;  Zhang, Lei;  Wang, Ying
收藏  |  浏览/下载:6/0  |  提交时间:2023/12/04
AutoML  bit-pruning  deep neural networks (DNNs)  resistive random access memory (RRAM)  
Scalable and Conflict-Free NTT Hardware Accelerator Design: Methodology, Proof, and Implementation 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 卷号: 42, 期号: 5, 页码: 1504-1517
作者:  Mu, Jianan;  Ren, Yi;  Wang, Wen;  Hu, Yizhong;  Chen, Shuai;  Chang, Chip-Hong;  Fan, Junfeng;  Ye, Jing;  Cao, Yuan;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Memory access pattern  number theoretic transform (NTT)  post-quantum cryptography (PQC)  scalable hardware design  
Parallel Software-Based Self-Testing with Bounded Model Checking for Kilo-Core Networks-on-Chip 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 2, 页码: 405-421
作者:  Zhang, Ying;  Ji, Peng-Fei;  Zhu, Pan-Wei;  Peng, Zebo;  Li, Hua-Wei;  Jiang, Jian-Hui
收藏  |  浏览/下载:6/0  |  提交时间:2023/12/04
software-based self-testing (SBST)  parallel test  kilo-core networks-on-chip (NoCs)  online testing  
On-Line Fault Protection for ReRAM-Based Neural Networks 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2023, 卷号: 72, 期号: 2, 页码: 423-437
作者:  Li, Wen;  Wang, Ying;  Liu, Cheng;  He, Yintao;  Liu, Lian;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:13/0  |  提交时间:2023/07/12
Training  Fault detection  Computational modeling  Image edge detection  Memristors  Neural networks  Kernel  Deep neural network  hard fault  ReRAM  reliability  soft fault  
IVP: An Intelligent Video Processing Architecture for Video Streaming 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2023, 卷号: 72, 期号: 1, 页码: 264-277
作者:  Gao, Chengsi;  Wang, Ying;  Han, Yinhe;  Chen, Weiwei;  Zhang, Lei
收藏  |  浏览/下载:12/0  |  提交时间:2023/07/12
Video enhancement  compressed video  DNN  approximate computing  optical flow  accelerator  
Amphis: Managing Reconfigurable Processor Architectures With Generative Adversarial Learning 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 卷号: 41, 期号: 11, 页码: 3993-4003
作者:  Chen, Weiwei;  Wang, Ying;  Xu, Ying;  Gao, Chengsi;  Han, Yinhe;  Zhang, Lei
收藏  |  浏览/下载:11/0  |  提交时间:2023/07/12
Resource management  Predictive models  Runtime  Generators  Generative adversarial networks  Computational modeling  Training  Design space exploration  generative adversarial network (GAN)  reconfigurable processor  
DHSA: efficient doubly homomorphic secure aggregation for cross-silo federated learning 期刊论文
JOURNAL OF SUPERCOMPUTING, 2022, 页码: 31
作者:  Liu, Zizhen;  Chen, Si;  Ye, Jing;  Fan, Junfeng;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Federated learning  Security  Efficient  Homomorphic  
Scaling Poisson Solvers on Many Cores via MMEwald 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 8, 页码: 1888-1901
作者:  Wu, Mingchuan;  Wu, Yangjun;  Shang, Honghui;  Liu, Ying;  Cui, Huimin;  Li, Fang;  Duan, Xiaohui;  Zhang, Yunquan;  Feng, Xiaobing
收藏  |  浏览/下载:34/0  |  提交时间:2022/06/21
Optimization  Bandwidth  Supercomputers  Electric potential  Boundary conditions  Electrostatics  Silicon  Poisson solver  architecture-specific optimizations  many-core processor