showSidebars ==
showTitleBreadcrumbs == 1
node.field_disable_title_breadcrumbs.value ==

Research Seminar by TRUONG Quoc Tuan | Variational Learning from Implicit Bandit Feedback

Please click here if you are unable to view this page.

 
Variational Learning from Implicit Bandit Feedback

Speaker (s):

TRUONG Quoc Tuan
PhD Student
School of Computing and Information Systems
Singapore Management University

Date:

Time:

Venue:

 

13 September 2021, Monday

2:30pm - 2:50pm

This is a virtual seminar. Please register by 09 Sep, the zoom link will be sent out on the following day to those who registered.

We look forward to seeing you at this research seminar.

About the Talk

Recommendations are prevalent in Web applications (e.g., search ranking, item recommendation, advertisement placement). Learning from bandit feedback is challenging due to the sparsity of feedback limited to system-provided actions. In this work, we focus on batch learning from logs of recommender systems involving both bandit and organic feedbacks. We develop a probabilistic framework with a likelihood function for estimating not only explicit positive observations but also implicit negative observations inferred from the data. Moreover, we introduce a latent variable model for organic-bandit feedbacks to robustly capture user preference distributions. Next, we analyze the behavior of the new likelihood under two scenarios, i.e., with and without counterfactual re-weighting. For speedier item ranking, we further investigate the possibility of using Maximum-a-Posteriori (MAP) estimate instead of Monte Carlo (MC)-based approximation for prediction. Experiments on both real datasets as well as data from a simulation environment show substantial performance improvements over comparable baselines.
 

About the Speaker

TRUONG Quoc Tuan is a PhD candidate advised by Associate Professor Hady W. Lauw in the School of Computing and Information Systems, Singapore Management University. His research focuses on multimodal representation learning and preference modeling for recommender systems.