Pre-Conference Talk by DAO Quang Huy | One Planner To Guide Them All ! Learning Adaptive Conversational Planners for Goal-oriented Dialogues

Please click here if you are unable to view this page.

One Planner To Guide Them All ! Learning Adaptive Conversational Planners for Goal-oriented Dialogues

Speaker (s):

DAO Quang Huy
PhD Candidate,
School of Computing and Information Systems
Singapore Management University

Date:

Time:

Venue:

22 October 2025, Wednesday

4:30pm – 5:00pm

Meeting room 4.4, Level 4.
School of Computing and
Information Systems 1,
Singapore Management University,
80 Stamford Road, Singapore 178902

We look forward to seeing you at this research seminar.

Please register by 20 October 2025.

About the Talk

Goal-oriented dialogues, such as recommendation and negotiation, often require balancing multiple conflicting objectives. Conventional approaches typically train separate policies for each predefined objective trade-off, which is computationally costly and scales poorly. In this work, we pursue a single dialogue policy that can dynamically adapt to varying objective preferences at inference time without retraining. This raises several challenges in terms of both (1) optimization strategy and (2) knowledge utilization. To address these, we propose a novel policy learning framework, Preference Adaptive Dialogue Policy Planner (PADPP), for multi-objective goal-oriented dialogues. Specifically, to tackle the former, we introduce a novel optimization scheme, which leverages information gained from training the model on previously updated objective weights, accelerating the learning capability on new weight settings. To address the latter, we utilize Generalized Policy Improvement (GPI) to ensure the effectiveness of leveraged knowledge. Experimental results demonstrate that PADPP achieves superior adaptability and performance compared to state-of-the-art approaches, offering a scalable and flexible solution for multi-objective, goal-oriented dialogues.

This is a Pre-Conference talk for The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025).

About the Speaker

DAO Quang Huy is currently a second-year PhD candidate at the SMU School of Computing and Information Systems, supervised by Prof. LIAO Lizi. His research focuses on efficient methods for dialogue policy planning and the intersection between conversational agents and recommender systems.

Where to find us

Get in touch