showSidebars ==
showTitleBreadcrumbs == 1
node.field_disable_title_breadcrumbs.value ==

Pre-Conference Talk by XU Chenshu | Trajectory as Creative Intent: Rethinking Drag-Based Interaction in Generative Visual Editing

Please click here if you are unable to view this page.

 

Trajectory as Creative Intent: Rethinking Drag-Based Interaction in Generative Visual Editing

Speaker:


XU Chenshu
Ph.D. Candidate
School of Computing and Information Systems
Singapore Management University

 

Date:

Time:

Venue:

 

29 June 2026, Monday

10:00am – 10:15am

Meeting room 5.1, Level 5. 
School of Computing and Information Systems 1, 
Singapore Management University, 
80 Stamford Road
Singapore 178902

Please register by 28 June 2026.

About the Talk

Trajectory-based interaction, where users draw spatial paths to guide generative editing, has become a popular paradigm across 2D image editing, video generation, 3D scene editing, and human motion synthesis. We present a unified taxonomy of these methods along four dimensions: output modality, trajectory representation, trajectory grounding, and human-AI agency. A striking pattern emerges: across all methods and modalities, the AI operates strictly as a spatial command executor, with collaborative and model-initiated categories entirely vacant. We identify three systematic gaps: ambiguity is resolved by increasing user burden rather than developing interpretive capacity; pipelines are open-loop; and evaluation rewards geometric precision over creative quality. We discuss directions toward systems where trajectories serve as the starting point of creative dialogue rather than commands to be executed.

This is a Pre-Conference talk for Forty-Third International Conference on Machine Learning (ICML 2026).

About the Speaker

Chenshu XU is a PhD candidate in the School of Computing and Information Systems (SCIS) at Singapore Management University (SMU). She is a member of Visual Understanding and Generation Lab (VUG) @ SMU supervised by Prof. HE Shengfeng. Her research interests lie at the intersection of Computer Vision and Computer Graphics.