showSidebars ==
showTitleBreadcrumbs == 1
node.field_disable_title_breadcrumbs.value ==

Pre-Conference Talk by Jiajing LING | Constrained Multiagent Reinforcement Learning for Large Agent Population

Please click here if you are unable to view this page.

 
Constrained Multiagent Reinforcement Learning for Large Agent Population

Speaker (s):

LING Jiajing
PhD Candidate
School of Computing and Information Systems
Singapore Management University

Date:

Time:

Venue:

 

13 September 2022, Tuesday

9:00am - 10:00am

Meeting Room 5.1, Level 5,
School of Computing and Information Systems 1,
Singapore Management University,
80 Stamford Road
Singapore 178902

Please register by 12 Sep 2022.

About the Talk

Learning control policies for a large number of agents in a decentralized setting is challenging due to partial observability, uncertainty in the environment, and scalability challenges. While several scalable multiagent RL (MARL) methods have been proposed, relatively few approaches exist for large scale \textit{constrained} MARL settings. To address this, we first formulate the constrained MARL problem in a collective multiagent setting where interactions among agents are governed by the aggregate count and types of agents, and do not depend on agents' specific identities. Second, we show that standard Lagrangian relaxation methods, which are popular for single agent RL, do not perform well in constrained MARL settings due to the problem of credit assignment---how to identify and modify behavior of agents that contribute most to constraint violations (and also optimize primary objective alongside)? We develop a fictitious MARL method that addresses this key challenge. Finally, we evaluate our approach on two large-scale real-world applications:  maritime traffic management and vehicular network routing. Empirical results show that our approach is highly scalable, can optimize the cumulative global reward and effectively minimize constraint violations, while also being significantly more sample efficient than previous best methods.

This is a Pre-Conference talk for European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2022).

About the Speaker

Jiajing LING is a Ph.D. candidate in Computer Science and is supervised by Prof. Akshat KUMAR. His research fields include reinforcement learning, large-scale multi-agent decision making, urban system optimization, and propositional logic.