CSpace

浏览/检索结果: 共37条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
TensorFHE plus : Fully Homomorphic Encryption Acceleration Based on Linear Algebra 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2026, 卷号: 75, 期号: 2, 页码: 612-627
作者:  Sun, Yintai;  Fan, Shengyu;  Yin, Zhenhua;  Song, Xinkai;  Hu, Xing;  Du, Zidong;  Guo, Qi;  Xu, Weizhi;  Hou, Rui;  Meng, Dan;  Bian, Song;  Zhang, Mingzhe
收藏  |  浏览/下载:1/0  |  提交时间:2026/05/25
Polynomials  Kernel  Optimization  Vectors  Transforms  System-on-chip  Symbols  Servers  Pipelines  Linear accelerators  FHE  GPGPU  HPC  linear algebra  modulo  data layout  
Swift: High Parallelism Program Generation of Tensor Operators for Accelerating Deep Learning Inference 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 4, 页码: 25
作者:  Yu, Xiyue;  Bi, Jun;  Wen, Yuanbo;  Xu, Jianxing;  Huang, Di;  Guo, Jiaming;  Li, Wei;  Du, Zidong;  Li, Jing;  Chen, Tianshi;  Guo, Qi
收藏  |  浏览/下载:1/0  |  提交时间:2026/05/25
Code generation  compiler optimization  tensor computation  
SaaP: Rearchitect SoC-as-a-Processor to Orchestrate Hardware Heterogeneity 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, 卷号: 44, 期号: 10, 页码: 3962-3975
作者:  Jin, Pengwei;  Fan, Zhe;  Zhao, Yongwei;  Du, Zidong;  Guo, Hongrui;  Nan, Ziyuan;  Hao, Yifan;  Li, Chongxiao;  Ma, Tianyun;  Zhang, Zhenxing;  Li, Xiaqing;  Li, Wei;  Hu, Xing;  Guo, Qi;  Xu, Zhiwei;  Chen, Tianshi
收藏  |  浏览/下载:25/0  |  提交时间:2025/12/03
IP networks  Hardware  Graphics processing units  Central Processing Unit  Programming  Software  Pipelines  System-on-chip  Process control  Neural networks  Hardware heterogeneity  system architectures  System-on-Chip (SoC)  
SnsBooster: Enhancing Sampling-based μArch Evaluation Efficiency through Online Performance Sensitivity Analysis 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 2, 页码: 27
作者:  Han, Chenji;  Zhang, Zifei;  Xue, Feng;  Li, Xinyu;  Wu, Yuxuan;  Zhang, Tingting;  Liu, Tianyi;  Guo, Qi;  Zhang, Fuxin
收藏  |  浏览/下载:22/0  |  提交时间:2025/12/03
Representative sampling  microarchitecture-independent characteristic analysis  
Harmonia: A Unified Architecture for Efficient Deep Symbolic Regression 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, 卷号: 44, 期号: 2, 页码: 737-750
作者:  Ma, Tianyun;  Wen, Yuanbo;  Song, Xinkai;  Jin, Pengwei;  Huang, Di;  Han, Husheng;  Nan, Ziyuan;  Yu, Zhongkai;  Peng, Shaohui;  Zhao, Yongwei;  Chen, Huaping;  Du, Zidong;  Hu, Xing;  Guo, Qi
收藏  |  浏览/下载:58/0  |  提交时间:2025/06/25
Skeleton  Optimization  Graphics processing units  Vectors  Hardware  Artificial neural networks  Accuracy  Deep symbolic regression (DSR)  radial basis function network (RBFN)  transcendental functions  unified array  
Efficient and Fast High-Performance Library Generation for Deep Learning Accelerators 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2025, 卷号: 74, 期号: 1, 页码: 155-169
作者:  Bi, Jun;  Wen, Yuanbo;  Li, Xiaqing;  Zhao, Yongwei;  Guo, Yuxuan;  Zhou, Enshuai;  Hu, Xing;  Du, Zidong;  Li, Ling;  Chen, Huaping;  Chen, Tianshi;  Guo, Qi
收藏  |  浏览/下载:36/0  |  提交时间:2025/06/25
Optimization  Space exploration  Schedules  Libraries  Biological cells  Deep learning  Costs  Computers  Search problems  Tensors  Code generation  compiler optimization  tensor computation  
AI Computing Systems for Large Language Models Training 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2025, 卷号: 40, 期号: 1, 页码: 6-41
作者:  Zhang, Zhen-Xing;  Wen, Yuan-Bo;  Lyu, Han-Qi;  Liu, Chang;  Zhang, Rui;  Li, Xia-Qing;  Wang, Chao;  Du, Zi-Dong;  Guo, Qi;  Li, Ling;  Zhou, Xue-Hai;  Chen, Yun-Ji
收藏  |  浏览/下载:66/0  |  提交时间:2025/06/25
artificial intelligence (AI) chip  large language model (LLM)  AI computing system  accelerator  
CAN: Cascade Augmentations Against Noise for Image Restoration 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 卷号: 34, 页码: 5131-5146
作者:  Yan, Yanyang;  Yao, Siyuan;  Ren, Wenqi;  Zhang, Rui;  Guo, Qi;  Cao, Xiaochun
收藏  |  浏览/下载:23/0  |  提交时间:2025/12/03
Image restoration  cascade augmentations  cascade augmentations  noise corruptions  noise corruptions  
FastTuning: Enabling Fast and Efficient Hyper-Parameter Tuning With Partitioning and Parallelism of Search Space 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 卷号: 35, 期号: 7, 页码: 1174-1188
作者:  Li, Xiaqing;  Guo, Qi;  Zhang, Guangyan;  Ye, Siwei;  He, Guanhua;  Yao, Yiheng;  Zhang, Rui;  Hao, Yifan;  Du, Zidong;  Zheng, Weimin
收藏  |  浏览/下载:80/0  |  提交时间:2024/12/06
Deep learning  distributed hyper-parameter tuning (HPT) system  parallel computing  
Tyche: An Efficient and General Prefetcher for Indirect Memory Accesses 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 卷号: 21, 期号: 2, 页码: 26
作者:  Xue, Feng;  Han, Chenji;  Li, Xinyu;  Wu, Junliang;  Zhang, Tingting;  Liu, Tianyi;  Hao, Yifan;  Du, Zidong;  Guo, Qi;  Zhang, Fuxin
收藏  |  浏览/下载:48/0  |  提交时间:2024/12/06
Data prefetching  hardware prefetching  indirect memory accesses  microarchitecture