|
One Planner To Guide Them All ! Learning Adaptive Conversational Planners for Goal-oriented Dialogues Speaker (s):
 DAO Quang Huy PhD Candidate, School of Computing and Information Systems Singapore Management University
| Date: Time: Venue: | | 22 October 2025, Wednesday 4:30pm – 5:00pm Meeting room 4.4, Level 4. School of Computing and Information Systems 1, Singapore Management University, 80 Stamford Road, Singapore 178902 We look forward to seeing you at this research seminar. Please register by 20 October 2025. 
|
|
About the Talk Goal-oriented dialogues, such as recommendation and negotiation, often require balancing multiple conflicting objectives. Conventional approaches typically train separate policies for each predefined objective trade-off, which is computationally costly and scales poorly. In this work, we pursue a single dialogue policy that can dynamically adapt to varying objective preferences at inference time without retraining. This raises several challenges in terms of both (1) optimization strategy and (2) knowledge utilization. To address these, we propose a novel policy learning framework, Preference Adaptive Dialogue Policy Planner (PADPP), for multi-objective goal-oriented dialogues. Specifically, to tackle the former, we introduce a novel optimization scheme, which leverages information gained from training the model on previously updated objective weights, accelerating the learning capability on new weight settings. To address the latter, we utilize Generalized Policy Improvement (GPI) to ensure the effectiveness of leveraged knowledge. Experimental results demonstrate that PADPP achieves superior adaptability and performance compared to state-of-the-art approaches, offering a scalable and flexible solution for multi-objective, goal-oriented dialogues.
This is a Pre-Conference talk for The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025). About the Speaker DAO Quang Huy is currently a second-year PhD candidate at the SMU School of Computing and Information Systems, supervised by Prof. LIAO Lizi. His research focuses on efficient methods for dialogue policy planning and the intersection between conversational agents and recommender systems.
|