CSpace

Browse/Search Results: 78 records in total, showing 1-10

PIMCOMP: An End-to-End DNN Compiler for Processing-In-Memory Accelerators (Journal article)
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, Volume: 44, Issue: 5, Pages: 1745-1759
Authors:  Sun, Xiaotian;  Wang, Xinyu;  Li, Wanqian;  Han, Yinhe;  Chen, Xiaoming
Views/Downloads: 5/0  |  Submitted: 2025/06/25
Hardware  Optimization  Artificial neural networks  Pipelines  Parallel processing  Biological system modeling  Resource management  Adaptation models  Scheduling  Memory management  Deep neural network (DNN)  end-to-end compiler  processing-in-memory (PIM) accelerator  system-level optimization  
CKTSO: High-Performance Parallel Sparse Linear Solver for General Circuit Simulations (Journal article)
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, Volume: 44, Issue: 5, Pages: 1887-1900
Authors:  Chen, Xiaoming
Views/Downloads: 3/0  |  Submitted: 2025/06/25
SPICE  Sparse matrices  Parallel processing  Design automation  Vectors  Scalability  Linear systems  Upper bound  Performance evaluation  Numerical stability  Circuit simulation  parallel linear solver  sparse linear solver  
29-Billion Atoms Molecular Dynamics Simulation With Ab Initio Accuracy on 35 Million Cores of New Sunway Supercomputer (Journal article)
IEEE TRANSACTIONS ON COMPUTERS, 2025, Volume: 74, Issue: 5, Pages: 1634-1648
Authors:  Wang, Xun;  Meng, Xiangyu;  Guo, Zhuoqiang;  Li, Mingzhen;  Liu, Lijun;  Li, Mingfan;  Xiao, Qian;  Zhao, Tong;  Sun, Ninghui;  Tan, Guangming;  Jia, Weile
Views/Downloads: 5/0  |  Submitted: 2025/06/25
Atoms  Accuracy  Supercomputers  Optimization  Artificial neural networks  Force  Training  Fitting  Predictive models  Nuclear power generation  High Performance Computing  Molecular Dynamics  DeePMD  Parallel Optimization  New Sunway Supercomputer  
Pyramid: Accelerating LLM Inference With Cross-Level Processing-in-Memory (Journal article)
IEEE COMPUTER ARCHITECTURE LETTERS, 2025, Volume: 24, Issue: 1, Pages: 121-124
Authors:  Yan, Liang;  Lu, Xiaoyang;  Chen, Xiaoming;  Han, Yinhe;  Sun, Xian-He
Views/Downloads: 3/0  |  Submitted: 2025/06/25
Graphics processing units  Decoding  Computational modeling  Parallel processing  Systolic arrays  Computer architecture  Table lookup  Random access memory  Interpolation  Transformers  Large language models  Processing-in-memory  
Advancements in Accelerating Deep Neural Network Inference on AIoT Devices: A Survey (Journal article)
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2024, Volume: 9, Issue: 6, Pages: 830-847
Authors:  Cheng, Long;  Gu, Yan;  Liu, Qingzhi;  Yang, Lei;  Liu, Cheng;  Wang, Ying
Views/Downloads: 6/0  |  Submitted: 2025/06/25
Computational modeling  Hardware  Artificial neural networks  Optimization  Internet of Things  Adaptation models  Data models  AIoT devices  DNN inference  model compression  parallel computing  performance optimization  survey  
Asynchronous Memory Access Unit: Exploiting Massive Parallelism for Far Memory Access (Journal article)
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, Volume: 21, Issue: 3, Pages: 28
Authors:  Wang, Luming;  Zhang, Xu;  Wang, Songyue;  Jiang, Zhuolun;  Lu, Tianyue;  Chen, Mingyu;  Luo, Siwei;  Hijang, Keji
Views/Downloads: 13/0  |  Submitted: 2024/12/06
CCS Concepts:  Computer systems organization  Parallel architectures  Hardware  Memory  
FastTuning: Enabling Fast and Efficient Hyper-Parameter Tuning With Partitioning and Parallelism of Search Space (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, Volume: 35, Issue: 7, Pages: 1174-1188
Authors:  Li, Xiaqing;  Guo, Qi;  Zhang, Guangyan;  Ye, Siwei;  He, Guanhua;  Yao, Yiheng;  Zhang, Rui;  Hao, Yifan;  Du, Zidong;  Zheng, Weimin
Views/Downloads: 50/0  |  Submitted: 2024/12/06
Deep learning  distributed hyper-parameter tuning (HPT) system  parallel computing  
HiHGNN: Accelerating HGNNs Through Parallelism and Data Reusability Exploitation (Journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, Volume: 35, Issue: 7, Pages: 1122-1138
Authors:  Xue, Runzhen;  Han, Dengke;  Yan, Mingyu;  Zou, Mo;  Yang, Xiaocheng;  Wang, Duo;  Li, Wenming;  Tang, Zhimin;  Kim, John;  Ye, Xiaochun;  Fan, Dongrui
Views/Downloads: 22/0  |  Submitted: 2024/12/06
Semantics  Parallel processing  Graph neural networks  Vectors  Graphics processing units  Fuses  Hardware  GNN  GNN accelerator  graph neural network  HGNN  HGNN accelerator  heterogeneous graph neural network  
An Energy-Efficient In-Memory Accelerator for Graph Construction and Updating (Journal article)
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, Volume: 43, Issue: 6, Pages: 1781-1793
Authors:  Chen, Mingkai;  Liu, Cheng;  Liang, Shengwen;  He, Lei;  Wang, Ying;  Zhang, Lei;  Li, Huawei;  Li, Xiaowei
Views/Downloads: 23/0  |  Submitted: 2024/12/06
Memory management  Buildings  Bandwidth  Hardware  Social networking (online)  Energy efficiency  Parallel processing  Graph construction  graph updating  near-data processing  nearest neighbors  power gating  stacked memory  
General Purpose Deep Learning Accelerator Based on Bit Interleaving (Journal article)
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, Volume: 43, Issue: 5, Pages: 1470-1483
Authors:  Chang, Liang;  Lu, Hang;  Li, Chenglong;  Zhao, Xin;  Hu, Zhicheng;  Zhou, Jun;  Li, Xiaowei
Views/Downloads: 27/0  |  Submitted: 2024/12/06
Synchronization  Parallel processing  Computational modeling  Training  Pragmatics  Power demand  Hardware acceleration  Accelerator  bit-level sparsity  deep neural network (DNN)