Pre-Conference Talk by GENG Minghong | L2M2: A Hierarchical Framework Integrating Large Language Model and Multi-agent Reinforcement Learning

Please click here if you are unable to view this page.

L2M2: A Hierarchical Framework Integrating Large Language Model and Multi-agent Reinforcement Learning

Speaker (s):

GENG Minghong
PhD Candidate,
School of Computing and Information Systems
Singapore Management University

Date:

Time:

Venue:

12 August 2025, Tuesday

10:00am – 10:30am

Meeting room 4.4, Level 4.
School of Computing and
Information Systems 1,
Singapore Management University,
80 Stamford Road, Singapore 178902

We look forward to seeing you at this research seminar.

Please register by 10 August 2025.

About the Talk

Multi-agent reinforcement learning (MARL) has demonstrated remarkable success in collaborative tasks, yet faces significant challenges in scaling to complex scenarios requiring sustained planning and coordination across long horizons. While hierarchical approaches help decompose these tasks, they typically rely on hand-crafted subtasks and domain-specific knowledge, limiting their generalizability. We present L2M2, a novel hierarchical framework that leverages large language models (LLMs) for high-level strategic planning and MARL for low-level execution, enabling zero-shot planning that supports both end-to-end training and direct integration with pre-trained MARL models. Through extensive experiments on diverse multi-agent navigation scenarios, we demonstrate that L2M2’s end-to-end MARL achieves superior performance while requiring less than 20% of the training samples compared to baseline methods on the VMAS tasks. On complex scenarios in the MOSMAC environment, L2M2 achieves a 98.75% win rate with pre-defined subgoals and maintains a 68.13% win rate without subgoals — where baseline methods fail to perform. Analysis through kernel density estimation reveals L2M2’s ability to automatically generate appropriate navigation plans, demonstrating its potential for addressing complex multi-agent coordination tasks.

This is a Pre-Conference talk for The 34th International Joint Conference on Artificial Intelligence (IJCAI 2025).

About the Speaker

GENG Minghong is a Ph.D. candidate at SCIS, supervised by Professor TAN Ah Hwee. Minghong's research focuses on multi-agent system and multi-agent reinforcement learning. During his PhD candidature, he developed hierarchical frameworks and benchmarking tools that enable effective cooperation among large agent teams in long-horizon tasks. His work has been published in venues including AAMAS, IJCAI, and ESwA. Please visit his homepage https://gengminghong.github.io/ for more information.

Where to find us

Get in touch