TY - GEN
T1 - Learn to Walk with Continuous-action for Knowledge-enhanced Recommendation System
AU - Sun, Jiahao
AU - Liu, Yu
AU - Zhang, Xianjie
AU - Xu, Xiujuan
AU - Hong, Li
AU - Wang, Kai
N1 - Publisher Copyright: © 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Knowledge graphs are increasingly used to improve recommendation quality and explainability. Reinforcement learning agents that walk over the knowledge graph have been applied successfully to recommendation systems in the form of multi-hop relation reasoning. However, some previous multi-hop methods rely on reinforcement learning with discrete actions, which makes the agent's action space hard to design and leaves the meaning of individual actions unclear because the actions are inconsistent. To address these issues, we propose Continuous-action Walking-tendency Interest-oriented Path Reasoning (CWIPR), a novel method that uses continuous actions produced by reinforcement learning agents to predict the inference relation and the next entity. To better interact with the knowledge graph through continuous actions, we also propose a graph search algorithm called the walking tendency algorithm. Moreover, we introduce an interest-oriented reward as an intrinsic reward that encourages the agent to balance exploring the most similar entities against exploring the correct recommendation type, yielding more precise recommendations. We evaluate our method extensively on three real-world datasets from Amazon and obtain favorable performance compared with state-of-the-art methods.
AB - Knowledge graphs are increasingly used to improve recommendation quality and explainability. Reinforcement learning agents that walk over the knowledge graph have been applied successfully to recommendation systems in the form of multi-hop relation reasoning. However, some previous multi-hop methods rely on reinforcement learning with discrete actions, which makes the agent's action space hard to design and leaves the meaning of individual actions unclear because the actions are inconsistent. To address these issues, we propose Continuous-action Walking-tendency Interest-oriented Path Reasoning (CWIPR), a novel method that uses continuous actions produced by reinforcement learning agents to predict the inference relation and the next entity. To better interact with the knowledge graph through continuous actions, we also propose a graph search algorithm called the walking tendency algorithm. Moreover, we introduce an interest-oriented reward as an intrinsic reward that encourages the agent to balance exploring the most similar entities against exploring the correct recommendation type, yielding more precise recommendations. We evaluate our method extensively on three real-world datasets from Amazon and obtain favorable performance compared with state-of-the-art methods.
KW - Continuous Action
KW - Knowledge Graph Reasoning
KW - Recommendation System
KW - Reinforcement Learning
UR - https://www.scopus.com/pages/publications/85205031118
U2 - 10.1109/IJCNN60899.2024.10650282
DO - 10.1109/IJCNN60899.2024.10650282
M3 - Conference contribution
AN - SCOPUS:85205031118
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2024 International Joint Conference on Neural Networks, IJCNN 2024 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 International Joint Conference on Neural Networks, IJCNN 2024
Y2 - 30 June 2024 through 5 July 2024
ER -