CSpace
Online recommendation in non-stationary environments based on knowledge graph enhancement and time-varying reward mechanism
Li, Jiaxin1,2; Fang, Jinyun1,2
2026-02-01
发表期刊APPLIED INTELLIGENCE
ISSN0924-669X
卷号56期号:3页码:22
摘要Online recommendation systems quickly develop personalized recommendations based on users' historical feedback, thereby improving user experience and increasing platform revenue. Contextual Multi-Armed Bandits (CMAB) model based on reinforcement learning can achieve an effective balance between exploration and utilization, thereby maximizing long-term returns. In this work, we propose a novel CMAB model for online recommendation, which introduces two key innovations: (1) Knowledge Graph-driven Thompson Sampling (KG-TS) that enriches context by constructing a dynamic knowledge graph from user-item interactions to alleviate data sparsity, and (2) Time-Varying Reward Mechanism (TV-RM) that dynamically updates graph edges based on real-time feedback to adapt to non-stationary environments. The integrated algorithm, named KG-TV-TS, is designed to handle sparse and evolving recommendation scenarios. Experiments on three public datasets demonstrate that KG-TV-TS consistently outperforms state-of-the-art bandit algorithms in both recommendation accuracy and cumulative regret, especially under sparse and non-stationary conditions.
关键词Online recommendation system Contextual Multi-Armed bandits (CMAB) Time-Varying reward mechanism (TV-RM) Knowledge graph (KG) Thompson sampling (TS)
DOI10.1007/s10489-025-07083-z
收录类别SCI
语种英语
WOS研究方向Computer Science
WOS类目Computer Science, Artificial Intelligence
WOS记录号WOS:001679533100002
出版者SPRINGER
引用统计
文献类型期刊论文
条目标识符http://119.78.100.204/handle/2XEOYT63/42828
专题中国科学院计算技术研究所
通讯作者Fang, Jinyun
作者单位1.Chinese Acad Sci, Inst Comp Technol, 6,Zhongguancun Sci Acad South Rd, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
推荐引用方式
GB/T 7714
Li, Jiaxin,Fang, Jinyun. Online recommendation in non-stationary environments based on knowledge graph enhancement and time-varying reward mechanism[J]. APPLIED INTELLIGENCE,2026,56(3):22.
APA Li, Jiaxin,&Fang, Jinyun.(2026).Online recommendation in non-stationary environments based on knowledge graph enhancement and time-varying reward mechanism.APPLIED INTELLIGENCE,56(3),22.
MLA Li, Jiaxin,et al."Online recommendation in non-stationary environments based on knowledge graph enhancement and time-varying reward mechanism".APPLIED INTELLIGENCE 56.3(2026):22.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Li, Jiaxin]的文章
[Fang, Jinyun]的文章
百度学术
百度学术中相似的文章
[Li, Jiaxin]的文章
[Fang, Jinyun]的文章
必应学术
必应学术中相似的文章
[Li, Jiaxin]的文章
[Fang, Jinyun]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。