PhD Dissertation Proposal by JIANG Hao | Reward Penalty and Value Decomposition for Reinforcement Learning with Constraint

Please click here if you are unable to view this page.

Reward Penalty and Value Decomposition for Reinforcement Learning with Constraint

JIANG Hao

PhD Candidate
School of Computing and Information Systems
Singapore Management University

FULL PROFILE

Research Area

Artificial Intelligence & Data Science
- Decision Making & Optimization

Dissertation Committee

Research Advisor

Prof Pradeep Reddy VARAKANTHAM

Dissertation Committee Members

Date

25 November 2024 (Monday)

Time

1:00pm – 2:00pm

Venue

Meeting Room 5.1,
Level 5
School of Computing and Information Systems 1, Singapore Management University, 80 Stamford Road Singapore 178902

Please register by 24 November 2024.

We look forward to seeing you at this research seminar.

ABOUT THE TALK

Reinforcement learning with constraints has become essential for solving decision-making tasks subject to various constraints, especially in complex multi-agent environments. This talk introduces two CRL methods: Reward Penalty and Value Decomposition. The Reward Penalty approach ensures safety by applying penalties to unsafe policies, guiding actions to remain within specified constraints. Additionally, Value Decomposition enables agents to work together efficiently by decomposing joint actions, making it effective in large-scale multi-agent settings. Applications include multi-agent ride-pooling, where efficient resource allocation and flexible pickup/drop-off points are critical for optimal performance.

ABOUT THE SPEAKER

Jiang Hao is a PhD candidate in Computer Science at School of Computing and Information Systems, supervised by Prof. Pradeep VARAKANTHAM. His research interests lie in the areas of reinforcement learning with constraints, with a particular focus on developing solutions using reward penalty and improving value decomposition to address complex multi-agent and multi-constraint problems.

Where to find us

Get in touch