CSpace

浏览/检索结果: 共29条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
GenCNN: A Partition-Aware Multi-Objective Mapping Framework for CNN Accelerators Based on Genetic Algorithm 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 3, 页码: 26
作者:  Mu, Yudong;  Fan, Zhihua;  Li, Wenming;  Zhang, Zhiyuan;  An, Xuejun;  Fan, Dongrui;  Ye, Xiaochun
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
CNN Accelerator  Dataflow Graph Mapping  Genetic Algorithm  Multi-objective Optimization  
Augur: Semantics-Aware Temporal Prefetching for Linked Data Structure 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 3, 页码: 27
作者:  Xue, Feng;  Wu, Junliang;  Jihan, Chen;  Li, Xin Yu;  Zhang, Tingting;  Li, Tianyi;  Zhang, Fuxin
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Accelerating Parallel Structures in DNNs via Parallel Fusion and Operator Co-Optimization 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 3, 页码: 26
作者:  Di, Zhanyuan;  Wang, Leping;  Ma, Zhaojia;  Shao, En;  Zhao, Jie;  Ren, Ziyi;  Feng, Siyuan;  Tao, Dingwen;  Tan, Guangming;  Sun, Ninghui
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Deep learning  tensor compiler  inference optimization  code generation  GPU  
CGCGraph: Efficient CPU-GPU Co-execution for Concurrent Dynamic Graph Processing 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 3, 页码: 26
作者:  Sun, Yiming;  Zhang, Jie;  Cao, Huawei;  Zhang, Yuan;  An, Xuejun;  Huang, Junying;  Ye, Xiaochun
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
CPU-GPU co-execution  concurrent graph processing  dynamic graph snapshot processing  high throughput  
OptiFX: Automatic Optimization for Convolutional Neural Networks with Aggressive Operator Fusion on GPUs 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 2, 页码: 27
作者:  Wang, Xueying;  Li, Shigang;  Qian, Hao;  Luo, Fan;  Hao, Zhaoyang;  Wu, Tong;  Xu, Ruiyuan;  Cui, Huimin;  Feng, Xiaobing;  Li, Guangli
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Deep learning systems  convolutional neural networks  operator fusion  
LitTLS: Lightweight Thread-Level Speculation on Little Cores 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 2, 页码: 27
作者:  Cheng, Xin;  Ye, Jinpeng;  Deng, Haoyu;  Zhang, Tingting;  Liu, Tianyi;  Wang, Jian
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Thread-level speculation  parallel computing  heterogeneous multicore  
SRSparse: Generating Codes for High-Performance Sparse Matrix-Vector Semiring Computations 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 2, 页码: 26
作者:  Du, Zhen;  Li, Ying;  Sun, Ninghui;  Cui, Huimin;  Feng, Xiaobing;  Li, Jiajia
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
High performance computing  sparse matrix computation  auto-tuning  code generator  semiring computation  
PANDA: Adaptive Prefetching and Decentralized Scheduling for Dataflow Architectures 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 2, 页码: 27
作者:  Qin, Shantian;  Fan, Zhihua;  Li, Wenming;  Wang, Zhen;  An, Xuejun;  Ye, Xiaochun;  Fan, Dongrui
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Prefetching  decentralized dynamic scheduling  reconfigurable on-chip memory architecture  
ShuffleInfer: Disaggregate LLM Inference for Mixed Downstream Workloads 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 2, 页码: 24
作者:  Hu, Cunchen;  Huang, Heyang;  Xu, Liangliang;  Chen, Xusheng;  Wang, Chenxi;  Xu, Jiang;  Chen, Shuang;  Feng, Hao;  Wang, Sa;  Bao, Yungang;  Sun, Ninghui;  Shan, Yizhou
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
LLM serving  disaggregated  interference  schedule  
SnsBooster: Enhancing Sampling-based μArch Evaluation Efficiency through Online Performance Sensitivity Analysis 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, 卷号: 22, 期号: 2, 页码: 27
作者:  Han, Chenji;  Zhang, Zifei;  Xue, Feng;  Li, Xinyu;  Wu, Yuxuan;  Zhang, Tingting;  Liu, Tianyi;  Guo, Qi;  Zhang, Fuxin
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Representative sampling  microarchitecture-independent characteristic analysis