CSpace

Browse/search results: 60 records in total, showing 1-10

PIMCOMP: An End-to-End DNN Compiler for Processing-In-Memory Accelerators (journal article)
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, Volume: 44, Issue: 5, Pages: 1745-1759
Authors:  Sun, Xiaotian;  Wang, Xinyu;  Li, Wanqian;  Han, Yinhe;  Chen, Xiaoming
Views/Downloads: 5/0  |  Submitted: 2025/06/25
Hardware  Optimization  Artificial neural networks  Pipelines  Parallel processing  Biological system modeling  Resource management  Adaptation models  Scheduling  Memory management  Deep neural network (DNN)  end-to-end compiler  processing-in-memory (PIM) accelerator  system-level optimization  
HEAT: Efficient Vision Transformer Accelerator With Hybrid-Precision Quantization (journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2025, Volume: 72, Issue: 5, Pages: 758-762
Authors:  Zhao, Pan;  Xue, Donghui;  Wu, Licheng;  Chang, Liang;  Tan, Haining;  Han, Yinhe;  Zhou, Jun
Views/Downloads: 2/0  |  Submitted: 2025/06/25
Vision transformer  accelerator  hybrid-precision quantization  FPGA  
Trident: The Acceleration Architecture for High-Performance Private Set Intersection (journal article)
IEEE TRANSACTIONS ON COMPUTERS, 2025, Volume: 74, Issue: 4, Pages: 1152-1167
Authors:  Zhang, Jinkai;  Yang, Yinghao;  Zhou, Zhe;  Hu, Zhicheng;  Zhao, Xin;  Chang, Liang;  Lu, Hang;  Li, Xiaowei
Views/Downloads: 4/0  |  Submitted: 2025/06/25
Protocols  Receivers  Cryptography  Hardware  Central Processing Unit  Random access memory  Data privacy  Polynomials  Field programmable gate arrays  Computer architecture  Private set intersection (PSI)  fully homomorphic encryption (FHE)  FPGA accelerator  privacy computing  
A Data-Centric Software-Hardware Co-Designed Architecture for Large-Scale Graph Processing (journal article)
IEEE TRANSACTIONS ON COMPUTERS, 2025, Volume: 74, Issue: 4, Pages: 1109-1122
Authors:  Li, Zerun;  Chen, Xiaoming;  Yang, Yuxin;  Min, Feng;  Zhang, Xiaoyu;  Han, Yinhe
Views/Downloads: 6/0  |  Submitted: 2025/06/25
Bandwidth  Memory management  Computational modeling  System-on-chip  Software  Hardware  Computer architecture  Three-dimensional displays  Performance evaluation  Data communication  Large-scale graph processing  near memory computing  memory system  accelerator  
VastPipe: A High-Throughput Inference System via Adaptive Space-Division Multiplexing for Diverse Accelerators (journal article)
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2025, Volume: 40, Issue: 2, Pages: 444-463
Authors:  Ma, Li-Xian;  Wang, Le-Ping;  Shao, En;  Cao, Rong-Yu;  Tan, Guang-Ming
Views/Downloads: 3/0  |  Submitted: 2025/06/25
cluster scheduling  resource management  reinforcement learning  DNN accelerator  
FuHsi: Shifting Base-Calling Closer to Sequencer via In-Cache Acceleration (journal article)
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2025, Volume: 40, Issue: 2, Pages: 482-499
Authors:  Li, Ye-Wen;  Tan, Guang-Ming;  Li, Xue-Qi
Views/Downloads: 3/0  |  Submitted: 2025/06/25
genome base-calling  in-cache accelerator  domain-specific architecture  genome analysis  Nanopore sequencing  
Accelerating tensor multiplication by exploring hybrid product with hardware and software co-design (journal article)
JOURNAL OF SYSTEMS ARCHITECTURE, 2025, Volume: 159, Pages: 16
Authors:  Zhang, Zhiyuan;  Fan, Zhihua;  Li, Wenming;  Qiu, Yuhang;  Wang, Zhen;  Ye, Xiaochun;  Fan, Dongrui;  An, Xuejun
Views/Downloads: 1/0  |  Submitted: 2025/06/25
Tensor multiplication  Hybrid product  Dataflow  Accelerator  
AI Computing Systems for Large Language Models Training (journal article)
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2025, Volume: 40, Issue: 1, Pages: 6-41
Authors:  Zhang, Zhen-Xing;  Wen, Yuan-Bo;  Lyu, Han-Qi;  Liu, Chang;  Zhang, Rui;  Li, Xia-Qing;  Wang, Chao;  Du, Zi-Dong;  Guo, Qi;  Li, Ling;  Zhou, Xue-Hai;  Chen, Yun-Ji
Views/Downloads: 6/0  |  Submitted: 2025/06/25
artificial intelligence (AI) chip  large language model (LLM)  AI computing system  accelerator  
ADS-CNN: Adaptive Dataflow Scheduling for lightweight CNN accelerator on FPGAs (journal article)
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, Volume: 158, Pages: 138-149
Authors:  Wan, Yi;  Xie, Xianzhong;  Chen, Junfan;  Xie, Kunpeng;  Yi, Dezhi;  Lu, Ye;  Gai, Keke
Views/Downloads: 16/0  |  Submitted: 2024/12/06
Lightweight convolutional neural networks  FPGA  Accelerator  Adaptive dataflow  Unified computing engine  Tiling strategy  
DPU-Direct: Unleashing Remote Accelerators via Enhanced RDMA for Disaggregated Datacenters (journal article)
IEEE TRANSACTIONS ON COMPUTERS, 2024, Volume: 73, Issue: 8, Pages: 2081-2095
Authors:  Liao, Yunkun;  Wu, Jingya;  Lu, Wenyan;  Li, Xiaowei;  Yan, Guihai
Views/Downloads: 18/0  |  Submitted: 2024/12/06
Central Processing Unit  Engines  Jitter  Computers  Pipelines  Programming  Encryption  Disaggregated datacenter  SmartNIC  RDMA  hardware accelerator