Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems

Chen, Xiaoshuang; Zhang, Gengrui; Wang, Yao; Wu, Yulin; Su, Shuo; Zhan, Kaiqiao; Wang, Ben

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2404

Change to browse by:

Computer Science > Machine Learning

Title: Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems

Authors: Xiaoshuang Chen, Gengrui Zhang, Yao Wang, Yulin Wu, Shuo Su, Kaiqiao Zhan, Ben Wang

(Submitted on 23 Apr 2024)

Abstract: Modern large-scale recommender systems are built upon computation-intensive infrastructure and usually suffer from a huge difference in traffic between peak and off-peak periods. In peak periods, it is challenging to perform real-time computation for each request due to the limited budget of computational resources. The recommendation with a cache is a solution to this problem, where a user-wise result cache is used to provide recommendations when the recommender system cannot afford a real-time computation. However, the cached recommendations are usually suboptimal compared to real-time computation, and it is challenging to determine the items in the cache for each user. In this paper, we provide a cache-aware reinforcement learning (CARL) method to jointly optimize the recommendation by real-time computation and by the cache. We formulate the problem as a Markov decision process with user states and a cache state, where the cache state represents whether the recommender system performs recommendations by real-time computation or by the cache. The computational load of the recommender system determines the cache state. We perform reinforcement learning based on such a model to improve user engagement over multiple requests. Moreover, we show that the cache will introduce a challenge called critic dependency, which deteriorates the performance of reinforcement learning. To tackle this challenge, we propose an eigenfunction learning (EL) method to learn independent critics for CARL. Experiments show that CARL can significantly improve the users' engagement when considering the result cache. CARL has been fully launched in Kwai app, serving over 100 million users.

Comments:	8 pages, 8 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2404.14961 [cs.LG]
	(or arXiv:2404.14961v1 [cs.LG] for this version)

Submission history

From: Xiaoshuang Chen [view email]
[v1] Tue, 23 Apr 2024 12:06:40 GMT (1369kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2404.14961

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems

Submission history