Structuring and Tokenizing Distributed User Interest Context for Generative Recommendation

2026-06-18 • Information Retrieval

Information RetrievalArtificial Intelligence

AI summaryⓘ

The authors address challenges in recommendation systems that try to predict what users will interact with next using their past behavior. They note that current methods either have trouble handling complex user and item information together or rely on guesswork for item meanings. To improve this, the authors created G2Rec, a system that combines a big-picture view of user connections with clear, meaning-based item representations. This helps recommendation models better understand user interests without needing exact user preferences, and tests show G2Rec works better than existing approaches.

generative recommendationitem tokenizationuser behavior modelinggraph neural networkssemantic tokenizationuser interest prototypessequential recommendationrecommendation scalabilityuser co-engagementindustrial recommendation systems

Authors

Ruizhong Qiu, Yinglong Xia, Dongqi Fu, Hanqing Zeng, Ren Chen, Xiangjun Fan, Hong Li, Hong Yan, Hanghang Tong

Abstract

Generative recommendation is an emerging paradigm that has shown promise in industrial recommendation systems, aiming to predict users' next interactions from their historical behaviors. At the core of generative recommendation lies item tokenization, which bridges item semantics and recommendation models. However, existing methods often struggle to effectively organize and inject complex user-behavioral and item-semantic contexts into recommendation models simultaneously. On the one hand, existing graph-based integration methods, such as graph serialization and graph neural networks, either suffer from scalability issues or exploit only local graph information. On the other hand, existing semantic tokenization methods typically rely on heuristics and lack explicit supervision signals, which may lead to inaccurate or suboptimal semantic representations. To address these limitations in user interest context modeling, we propose G2Rec, a scalable framework that unifies holistic graph-based user co-engagement modeling with semantic tokenization for industrial-scale generative recommendation. Overall, G2Rec enables recommendation models to capture holistic and semantically grounded user interest prototypes without requiring ground-truth user interests, thereby providing more comprehensive and accurate modeling of user behavior contexts in industrial sequential recommendation. Online deployment across product surfaces and extensive experiments on public datasets demonstrate the superiority of G2Rec over existing methods.

View PDFOpen arXiv