CSpace  > 中国科学院计算技术研究所期刊论文  > 英文
Tyche: An Efficient and General Prefetcher for Indirect Memory Accesses
Xue, Feng1,2; Han, Chenji1,2; Li, Xinyu1,2; Wu, Junliang1,2; Zhang, Tingting3,4; Liu, Tianyi5; Hao, Yifan1; Du, Zidong1; Guo, Qi1; Zhang, Fuxin1
2024-06-01
发表期刊ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION
ISSN1544-3566
卷号21期号:2页码:26
摘要Indirect memory accesses (IMAs, i.e., A[f(B[i])]) are typical memory access patterns in applications such as graph analysis, machine learning, and database. IMAs are composed of producer-consumer pairs, where the consumers' memory addresses are derived from the producers' memory data. Due to the built-in value-dependent feature, IMAs exhibit poor locality, making prefetching ineffective. Hindered by the challenges of recording the potentially complex graphs of instruction dependencies among IMA producers and consumers, current state-of-the-art hardware prefetchers either (a) exhibit inadequate IMA identification abilities or ( b) rely on the run-ahead mechanism to prefetch IMAs intermittently and insufficiently. To solve this problem, we propose Tyche,1 an efficient and general hardware prefetcher to enhance IMA performance. Tyche adopts a bilateral propagation mechanism to precisely excavate the instruction dependencies in simple chains with moderate length (rather than complex graphs). Based on the exact instruction dependencies, Tyche can accurately identify various IMA patterns, including nonlinear ones, and generate accurate prefetching requests continuously. Evaluated on broad benchmarks, Tyche achieves an average performance speedup of 16.2% over the state-of-the-art spatial prefetcher Berti. More importantly, Tyche outperforms the state-of-the-art IMA prefetchers IMP, Gretch, and Vector Runahead, by 15.9%, 12.8%, and 10.7%, respectively, with a lower storage overhead of only 0.57 KB.
关键词Data prefetching hardware prefetching indirect memory accesses microarchitecture
DOI10.1145/3641853
收录类别SCI
语种英语
资助项目National Key Research and Development Program of China[2022YFB3105103]
WOS研究方向Computer Science
WOS类目Computer Science, Hardware & Architecture ; Computer Science, Theory & Methods
WOS记录号WOS:001242588100010
出版者ASSOC COMPUTING MACHINERY
引用统计
文献类型期刊论文
条目标识符http://119.78.100.204/handle/2XEOYT63/39897
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Xue, Feng
作者单位1.Chinese Acad Sci, Inst Comp Technol, SKLP, 6 Kexueyuan Nanlu, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, 19A Yuquan Rd, Beijing 100049, Peoples R China
3.Loongson Technol Co Ltd, Loongson Ind Pk, Beijing 100095, Peoples R China
4.Chinese Acad Sci, Inst Comp Technol, 6 Kexueyuan Nanlu, Beijing 100190, Peoples R China
5.Univ Texas San Antonio, 1 UTSA Circle, San Antonio, TX 78249 USA
推荐引用方式
GB/T 7714
Xue, Feng,Han, Chenji,Li, Xinyu,et al. Tyche: An Efficient and General Prefetcher for Indirect Memory Accesses[J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION,2024,21(2):26.
APA Xue, Feng.,Han, Chenji.,Li, Xinyu.,Wu, Junliang.,Zhang, Tingting.,...&Zhang, Fuxin.(2024).Tyche: An Efficient and General Prefetcher for Indirect Memory Accesses.ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION,21(2),26.
MLA Xue, Feng,et al."Tyche: An Efficient and General Prefetcher for Indirect Memory Accesses".ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION 21.2(2024):26.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Xue, Feng]的文章
[Han, Chenji]的文章
[Li, Xinyu]的文章
百度学术
百度学术中相似的文章
[Xue, Feng]的文章
[Han, Chenji]的文章
[Li, Xinyu]的文章
必应学术
必应学术中相似的文章
[Xue, Feng]的文章
[Han, Chenji]的文章
[Li, Xinyu]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。