Large Language Models are Learnable Planners for Long-Term Recommendation

Shi, Wentao; He, Xiangnan; Zhang, Yang; Gao, Chongming; Li, Xinyue; Zhang, Jizhi; Wang, Qifan; Feng, Fuli

doi:10.1145/3626772.3657683

Full-text links:

Download:

Current browse context:

cs.IR

< prev | next >

new | recent | 2403

Computer Science > Information Retrieval

Title: Large Language Models are Learnable Planners for Long-Term Recommendation

Authors: Wentao Shi, Xiangnan He, Yang Zhang, Chongming Gao, Xinyue Li, Jizhi Zhang, Qifan Wang, Fuli Feng

(Submitted on 29 Feb 2024 (v1), last revised 26 Apr 2024 (this version, v2))

Abstract: Planning for both immediate and long-term benefits becomes increasingly important in recommendation. Existing methods apply Reinforcement Learning (RL) to learn planning capacity by maximizing cumulative reward for long-term recommendation. However, the scarcity of recommendation data presents challenges such as instability and susceptibility to overfitting when training RL models from scratch, resulting in sub-optimal performance. In this light, we propose to leverage the remarkable planning capabilities over sparse data of Large Language Models (LLMs) for long-term recommendation. The key to achieving the target lies in formulating a guidance plan following principles of enhancing long-term engagement and grounding the plan to effective and executable actions in a personalized manner. To this end, we propose a Bi-level Learnable LLM Planner framework, which consists of a set of LLM instances and breaks down the learning process into macro-learning and micro-learning to learn macro-level guidance and micro-level personalized recommendation policies, respectively. Extensive experiments validate that the framework facilitates the planning ability of LLMs for long-term recommendation. Our code and data can be found at this https URL

Comments:	11 pages, 5 figures
Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
DOI:	10.1145/3626772.3657683
Cite as:	arXiv:2403.00843 [cs.IR]
	(or arXiv:2403.00843v2 [cs.IR] for this version)

Submission history

From: Wentao Shi [view email]
[v1] Thu, 29 Feb 2024 13:49:56 GMT (2358kb,D)
[v2] Fri, 26 Apr 2024 07:41:07 GMT (3454kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2403.00843

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Information Retrieval

Title: Large Language Models are Learnable Planners for Long-Term Recommendation

Submission history