Research Seminar by Petr Novotný | Safe Planning and Reinforcement Learning with Monte Carlo Tree Search

Please click here if you are unable to view this page.

Safe Planning and Reinforcement Learning with Monte Carlo Tree Search

Speaker (s):

Petr Novotný
Associate Professor
Masaryk University, Brno,
Czech Republic

Date:

Time:

Venue:

12 June 2025, Thursday

2:00pm – 3:00pm

School of Computing &
Information Systems 2 (SCIS 2)
Level 4, Seminar Room 4-3
Singapore Management University
90 Stamford Road,
Singapore 178903

Please register by 11 June 2025.

About the Talk

In this talk, we will study the problem of controlling an agent so as to maximize the agent’s expected utility while ensuring that the agent’s policy does not violate a given safety constraint. Problems of this form are typically modelled as constrained Markov decision processes (CMDPs). A number of both exact and approximate algorithms for CMDP optimization were developed over the last decade. In this talk, we will focus on algorithms employing the Monte Carlo tree search (MCTS) paradigm (of Alpha- and MuZero fame), due to their straightforward extensibility to partially observable and multi-agent settings. We will consider MCTS-based CMDP solvers in several contexts, ranging from algorithms that can provide strict formal guarantees on the quality of agent's behaviour to heuristic techniques that can safely control agents in environments with an astronomical number of states.

About the Speaker

Petr Novotný is an associate professor of computer science at Masaryk University, Brno, Czech Republic. HIs research lies at the interface of formal methods and reinforcement learning. In particular, he is interested in safe reinforcement learning, formal analysis of probabilistic programs, and in computational complexity of problems pertaining to decision making under uncertainty.

Where to find us

Get in touch