CSpace

浏览/检索结果: 共27条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
CoAxNN: Optimizing on-device deep learning with conditional approximate neural networks 期刊论文
JOURNAL OF SYSTEMS ARCHITECTURE, 2023, 卷号: 143, 页码: 14
作者:  Li, Guangli;  Ma, Xiu;  Yu, Qiuchu;  Liu, Lei;  Liu, Huaxiao;  Wang, Xueying
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
On-device deep learning  Efficient neural networks  Model approximation and optimization  
Predicting for I/O stack optimizations on cyber-physical systems 期刊论文
MICROPROCESSORS AND MICROSYSTEMS, 2023, 卷号: 101, 页码: 10
作者:  Zhang, Yangmei;  Shen, Fanfan;  Li, Mengquan;  Wu, Chao
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
I  O stack  Cyber-physical system  O idle time  Prediction  
ESA: An efficient sequence alignment algorithm for biological database search on Sunway TaihuLight 期刊论文
PARALLEL COMPUTING, 2023, 卷号: 117, 页码: 11
作者:  Zhang, Hao;  Huang, Zhiyi;  Chen, Yawen;  Liang, Jianguo;  Gao, Xiran
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Hybrid sequence alignment  Biological database search  Sunway TaihuLight  SW26010  Heterogeneous architecture  
Characterizing and Understanding Defense Methods for GNNs on GPUs 期刊论文
IEEE COMPUTER ARCHITECTURE LETTERS, 2023, 卷号: 22, 期号: 2, 页码: 137-140
作者:  Wu, Meng;  Yan, Mingyu;  Yang, Xiaocheng;  Li, Wenming;  Zhang, Zhimin;  Ye, Xiaochun;  Fan, Dongrui
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Kernel  Purification  Estimation  Graphics processing units  Perturbation methods  Electric breakdown  Training  Graph neural networks  defense  execution semantic  execution pattern  overhead  
Enabling In-Network Floating-Point Arithmetic for Efficient Computation Offloading 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 12, 页码: 4918-4934
作者:  Cui, Penglai;  Pan, Heng;  Li, Zhenyu;  Zhang, Penghao;  Miao, Tianhao;  Zhou, Jianer;  Guan, Hongtao;  Xie, Gaogang
收藏  |  浏览/下载:14/0  |  提交时间:2023/07/12
Open area test sites  Arithmetic  Memory management  Task analysis  Training  Standards  Servers  In-network computation  computation offloading  floating-point operation  
An Efficient Deep Learning Accelerator Architecture for Compressed Video Analysis 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 卷号: 41, 期号: 9, 页码: 2808-2820
作者:  Wang, Yongchen;  Wang, Ying;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:31/0  |  提交时间:2022/12/07
Streaming media  Neural networks  Image coding  Decoding  Metadata  Deep learning  Hardware  Neural network acceleration  specialized accelerator  video analysis  
Scaling Poisson Solvers on Many Cores via MMEwald 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 8, 页码: 1888-1901
作者:  Wu, Mingchuan;  Wu, Yangjun;  Shang, Honghui;  Liu, Ying;  Cui, Huimin;  Li, Fang;  Duan, Xiaohui;  Zhang, Yunquan;  Feng, Xiaobing
收藏  |  浏览/下载:36/0  |  提交时间:2022/06/21
Optimization  Bandwidth  Supercomputers  Electric potential  Boundary conditions  Electrostatics  Silicon  Poisson solver  architecture-specific optimizations  many-core processor  
Taming Process Variations in CNFET for Efficient Last-Level Cache Design 期刊论文
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 卷号: 30, 期号: 4, 页码: 418-431
作者:  Xu, Dawen;  Feng, Zhuangyu;  Liu, Cheng;  Li, Li;  Wang, Ying;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:25/0  |  提交时间:2022/12/07
CNTFETs  Delays  Transistors  Layout  Very large scale integration  Radio frequency  Energy consumption  nanotube field-effect transistor (CNFET)  last-level cache (LLC)  process variation (PV)  variation-aware cache  
Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 卷号: 31, 期号: 8, 页码: 1925-1941
作者:  Li, Zhihao;  Jia, Haipeng;  Zhang, Yunquan;  Chen, Tun;  Yuan, Liang;  Vuduc, Richard
收藏  |  浏览/下载:56/0  |  提交时间:2020/12/10
AutoFFT  FFT  code generation  template  DFT  
Register-Aware Optimizations for Parallel Sparse Matrix-Matrix Multiplication 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 403-417
作者:  Liu, Junhong;  He, Xin;  Liu, Weifeng;  Tan, Guangming
收藏  |  浏览/下载:78/0  |  提交时间:2019/08/16
Sparse matrix  Sparse matrix-matrix multiplication  GPU  Register