showSidebars ==
showTitleBreadcrumbs == 1
node.field_disable_title_breadcrumbs.value ==

PhD Dissertation Defense by XIE Wei | Real-time Bursty Topic Detection and Virality Forecasting in Microblogs

Please click here if you are unable to view this page.

 

Real-time Bursty Topic Detection and Virality Forecasting in Microblogs


Speaker (s):

XIE Wei

PhD Candidate

School of Information Systems

Singapore Management University


Date:


Time:


Venue:

 

July 6, 2017, Thursday


10:00am - 11:00am


Meeting Room 4.4, Level 4

School of Information Systems

Singapore Management University

80 Stamford Road

Singapore 178902

We look forward to seeing you at this research seminar.

About the Talk

Microblogs such as Twitter have become the largest social platforms for users around the world to share anything happening around them with friends and beyond. A bursty topic in microblogs is one that triggers a surge of relevant tweets within a short period of time, which often reflects important events of mass interest. How to leverage microblogs for early detection and further impact analysis of bursty topics has, therefore, become an important research problem with immense practical value.

To address the above problem, we propose in this thesis a framework which contains the following three parts. The first part is to select a budgeted set of users as information sources for early detection of bursty topics. An efficient algorithm is proposed to find such set of users from a large social network. The second part is that of detecting bursty topics from a tweet stream. We propose a two-stage integrated solution TopicSketch. In the first stage, we design a small data sketch which efficiently maintains at a low computational cost the acceleration of two quantities: the occurrence of each word pair and the occurrence of each word triple. In the second stage, we propose a sketch-based topic model to infer both the bursty topics and their acceleration based on the statistics maintained in the data sketch. A dimension reduction technique based on hashing is also proposed to achieve scalability. Once a bursty topic is detected, the third part will predict the further cascade size of this bursty topic. Based on a general time-aware cascade model, in which an appropriate hazard function is designed specifically for microblog networks, a simulation-based approach is proposed to forecast the virality of the cascade. Although each part can work separately, when putting them together, we get a comprehensive solution which is able to benefit many real world applications such as social network monitoring and viral marketing.

 

About the Speaker

XIE Wei is a PhD candidate in the School of Information Systems, Singapore Management University, under the supervision of Assistant Professor ZHU Feida. His primary research interests include social network mining, as well as machine learning. His current research focuses on detecting bursty topics from a large volume of tweet stream in real time.