Pre-Conference Talk by Brahmanage Janaka Chathuranga THILAKARATHNA | Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning

Please click here if you are unable to view this page.

Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning

Speaker:

Brahmanage Janaka Chathuranga THILAKARATHNA
Ph.D. Candidate
School of Computing and Information Systems
Singapore Management University

Date:

Time:

Venue:

16 June 2026, Tuesday

11:30am – 11:50am

Meeting room 5.1, Level 5. School of Computing and Information Systems 1, Singapore Management University, 80 Stamford Road Singapore 178902

Please register by 14 June 2026.

About the Talk

Sequential decision-making using Markov Decision Process underpins many real-world applications. Both model-based and model-free methods have achieved strong results in these settings. However, real-world tasks must balance reward maximization with safety constraints, often conflicting objectives, that can lead to unstable min–max, adversarial optimization. A promising alternative is safety reachability analysis, which precomputes a forward-invariant safe state–action set, ensuring that an agent starting inside this set remains safe indefinitely. Yet, most reachability-based methods address only hard safety constraints, and little work extends reachability to cumulative cost constraints. To address this, first, we define a safety-conditioned reachability set that decouples reward maximization from cumulative safety cost constraints. Second, we show how this set enforces safety constraints without unstable min–max or Lagrangian optimization, yielding a novel offline safe RL algorithm that learns a safe policy from a fixed dataset without environment interaction. Finally, experiments on standard offline safe-RL benchmarks, and a real-world maritime navigation task demonstrate that our method matches or outperforms state-of-the-art baselines while maintaining safety.

This is a Pre-Conference talk for The 36th International Conference on Automated Planning and Scheduling (ICAPS 2026).

About the Speaker

Janaka Brahmanage is a fourth-year PhD candidate in Computer Science, conducting research under the guidance of Associate Prof. Akshat Kumar at the SMU School of Computing and Information Systems. His research focuses on safe reinforcement learning and multi-agent systems.

Where to find us

Get in touch