PhD Dissertation Defense by XIAO Hanhua | Graph Perturbations for Robust Knowledge Discovery and Retrieval

Please click here if you are unable to view this page.

Graph Perturbations for Robust Knowledge Discovery and Retrieval

XIAO Hanhua

PhD Candidate
School of Computing and Information Systems
Singapore Management University

FULL PROFILE

Research Area

Artificial Intelligence & Data Science
- Data Management & Mining

Dissertation Committee

Research Advisor

Associate Prof LI Yuchen

Committee Members

External Member

Roy Ka-Wei LEE, Cheng Tsang Man Early Career Chair Professor, Information Systems Technology and Design Pillar, Singapore University of Technology and Design

Date

21 November 2025 (Friday)

Time

3:30pm - 4:30pm

Venue

Meeting room 5.1,
Level 5
School of Computing and Information Systems 1,
Singapore Management University,
80 Stamford Road
Singapore 178902

Please register by 19 November 2025.

We look forward to seeing you at this research seminar.

ABOUT THE TALK

Graph perturbation studies how small structural edits, e.g., adding or deleting edges, affect graph properties and downstream tasks. While prior work largely targets global statistics or model outputs, robustness for knowledge discovery and information retrieval on attributed graphs is less explored. This dissertation introduces formulations and algorithms that generate and leverage perturbations to make these tasks more reliable.

First, we assess the robustness of outstanding facts on knowledge graphs. We formalize two perturbation types: (i) context‑entity swaps and (ii) plausible data insertions and estimate robustness via a perturbation‑relevance distribution, with powerful relevance‑based pruning and importance‑sampling estimators.

Second, we study subgraph counting under adversarial edge additions. We prove the kSub problem is NP‑hard and inapproximable, derive the topkSub relaxation, and design connected/disconnected processing with pruning and unbiased samplers, enabling practical stress tests that expose weaknesses in GNN‑based counters.

Finally, for robust information retrieval via retrieval‑augmented generation (RAG), we present GBRAG, a training‑free graph‑bandit framework that treats chunks as bandit arms and updates a query‑conditioned priority map using two LLM‑scored rewards: immediate relevance and marginal information gain, yielding consistent gains over strong graph‑RAG baselines.

Across extensive evaluations, the methods are efficient, theoretically grounded, and effective for robust knowledge discovery and information retrieval.

SPEAKER BIOGRAPHY

XIAO Hanhua is a Ph.D. candidate in Computer Science at Singapore Management University, under the supervision of Associate Professor Li Yuchen. His research focuses on graph mining, with particular interests in knowledge graph mining, graph perturbation, and large language model (LLM)-based graph analytics. Before joining SMU, Hanhua received his B.Sc. and M.Sc. degrees in Electronic Engineering from the Beijing University of Posts and Telecommunications and the University of Southern California in 2017 and 2019, respectively. In his leisure time, he enjoys cycling and cooking.

Where to find us

Get in touch