Abstract
Conversational recommender systems (CRSs) aim to capture the preferences of users with minimal interaction and then recommend appropriate items. Previous work formalizes a CRS as a Markov decision process that explores strategies for query timing and recommendation. In this paper, we propose a CRS that incorporates negative feedback from recommendations based on maximum entropy to clarify user preferences (MECRS). We formalize how MECRS explores and exploits new preferences. We also implement MECRS based on a classical CRS. On a real-world collected dataset, we find that MECRS achieves better performance when the user simulator is unable to accurately answer preferences.
| Original language | English |
|---|---|
| Pages (from-to) | 527-548 |
| Number of pages | 22 |
| Journal | Data Intelligence |
| Volume | 7 |
| Issue number | 2 |
| DOIs | |
| State | Published - 1 Jun 2025 |
Keywords
- Conversational recommender system
- Reinforcement learning
- User simulator
Fingerprint
Dive into the research topics of 'Clarifying user Preference with Maximum Entropy Based Recommendation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver