Institute of Computing Technology, Chinese Academy IR
| Balancing Graph Processing Workloads in Heterogeneous CPU-PIM Systems | |
| Xu, Sheng1,2; Li, Chun1; Luo, Le1; Zheng, Ming1; Yan, Liang3; Zou, Xingqi3; Chen, Xiaoming3 | |
| 2025-07-01 | |
| 发表期刊 | IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING
![]() |
| ISSN | 2168-6750 |
| 卷号 | 13期号:3页码:1068-1082 |
| 摘要 | Processing-in-Memory (PIM) offers a promising architecture to alleviate the memory wall challenge in graph processing applications. The key aspect of PIM is to incorporate logic within the memory, thereby leveraging the near-data advantages. State-of-the-art PIM-based graph processing accelerators tend to offload more to the memory in order to maximize near-data benefits, causing significant load imbalance in PIM systems. In this paper, we demonstrate that this intention is not true and that host processors still play a vital role in heterogeneous CPU-PIM systems. For this purpose, we propose CAPLBS, an online contention-aware Processing-in-Memory load-balance scheduler for graph processing applications in CPU-PIM systems. The core concept of CAPLBS is to steal workload candidates back to host processors with minimal off-chip data synchronization overhead when some host processors are idle. To model data contentions among workloads and determine the stealing decision, a measurement structure called Locality Cohesive Subgraph is proposed by deeply exploring the connectivity of the input graph and the memory access patterns of deployed graph applications. Experimental results show that CAPLBS achieved an average speed-up of 4.8x and 1.3x (up to 9.1x and 1.9x) compared with CPU-only and the upper bound of locality-aware fine-grained in-memory atomics. Moreover, CAPLBS adds no hardware overhead and works well with existing CPU-PIM graph processing accelerators. |
| 关键词 | Program processors Logic Codes Annotations Synchronization Resource management Parallel processing Programming Hardware Energy consumption Processing-in-Memory workload balance graph processing heterogeneous system |
| DOI | 10.1109/TETC.2025.3563249 |
| 收录类别 | SCI |
| 语种 | 英语 |
| 资助项目 | National Natural Science Foundation of China[62102005] ; National Natural Science Foundation of China[62306009] ; National Natural Science Foundation of China[62122076] ; National Natural Science Foundation of China[62104230] ; National Natural Science Foundation of China[62488101] ; National Natural Science Foundation of China[62495104] ; University Synergy Innovation Program of Anhui Province[GXXT-2021-011] ; Anhui Natural Science Foundation[2008085QF330] ; Anhui Natural Science Foundation[2108085QF265] ; Key Research Program of Frontier Sciences, CAS[ZDBS-LY-JSC012] ; Youth Innovation Promotion Association CAS |
| WOS研究方向 | Computer Science ; Telecommunications |
| WOS类目 | Computer Science, Information Systems ; Telecommunications |
| WOS记录号 | WOS:001571491400050 |
| 出版者 | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC |
| 引用统计 | |
| 文献类型 | 期刊论文 |
| 条目标识符 | http://119.78.100.204/handle/2XEOYT63/41701 |
| 专题 | 中国科学院计算技术研究所期刊论文_英文 |
| 通讯作者 | Chen, Xiaoming |
| 作者单位 | 1.Anhui Normal Univ, Sch Comp & Informat, Wuhu 241000, Peoples R China 2.Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230088, Peoples R China 3.Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China |
| 推荐引用方式 GB/T 7714 | Xu, Sheng,Li, Chun,Luo, Le,et al. Balancing Graph Processing Workloads in Heterogeneous CPU-PIM Systems[J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING,2025,13(3):1068-1082. |
| APA | Xu, Sheng.,Li, Chun.,Luo, Le.,Zheng, Ming.,Yan, Liang.,...&Chen, Xiaoming.(2025).Balancing Graph Processing Workloads in Heterogeneous CPU-PIM Systems.IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING,13(3),1068-1082. |
| MLA | Xu, Sheng,et al."Balancing Graph Processing Workloads in Heterogeneous CPU-PIM Systems".IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING 13.3(2025):1068-1082. |
| 条目包含的文件 | 条目无相关文件。 | |||||
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论