| |
Robots, Reinforcement Learning, and Data Mining
Speaker (s):

Prashant DOSHI
Professor of Computer Science
Institute for Artificial Intelligence
University of Georgia, USA
|
|
Date:
Time:
Venue:
|
|
May 29, 2017, Monday
11:00 am - 12:00 pm
Meeting Room 4.4, Level 4
School of Information Systems
Singapore Management University
80 Stamford Road
Singapore 178902
We look forward to seeing you at this research seminar.

|
|
ABSTRACT
Inverse reinforcement learning (IRL) investigates ways by which a learner may approximate the preferences of an expert by observing the experts’ actions over time. Usually, the expert is modeled as optimizing its actions using a Markov decision process (MDP), whose parameters except for the reward function are known to the learner. IRL is enjoying much success and attention in applications such as robots learning tasks from demonstrations by human experts and in imitation learning. Robots gather large amounts of multi-modal data as they observe others and must sift it to infer the trajectories.
In this talk, I will first discuss the data processing pipeline that enables IRL in robotic applications. Then, I will present methods for generalizing IRL to contexts that involve multiple observed agents, which may interact and whose observed trajectories are partially occluded from the learner. I will also explore generalizing IRL to settings where additional parameters of others' MDPs are unknown. These are all challenges encountered in real-world contexts. We evaluate these methods in the context of an application setting involving two mobile robots executing simple cyclic trajectories for perimeter patrolling. Both robots’ patrolling motions are disturbed when they approach each other in narrow corridors leading to an interaction. A subject robot observes them from a hidden vantage point that affords partial observability of their trajectories only. It’s task is to penetrate the patrols and reach a goal location without being spotted. Thus, its eventual actions do not impact the other robots. Videos of demonstration runs will be shown.
About the Speaker
Dr. Prashant Doshi is a tenured Professor of Computer Science and faculty member of the AI Institute at The University of Georgia, USA. His research interests lie broadly in artificial intelligence and robotics. Specifically, he studies automated decision-making under uncertainty in multiagent settings, non-cooperative game theory, and robot learning. He was a visiting professor at the University of Waterloo in 2015, and he has also had short stints at the IBM T. J. Watson Research Center. He has published 100+ articles and papers in journals, conferences, and other forums in the fields of agents and AI. His research has led to publications in the Journal of AI Research, Journal of AAMAS, AAAI, IJCAI and AAMAS conferences. In 2011, Prof. Doshi was awarded UGA’s Creative Research Medal for his contributions to automated decision making. Prof. Doshi teaches introductory courses on AI to undergraduate and graduate students, and a course on decision making under uncertainty to graduate students, all of which are well received among the students.
|