Meeting room 5.1, Level 5 School of Computing and Information Systems 1, Singapore Management University, 80 Stamford Road Singapore 178902
Please register by 6 June 2024.
We look forward to seeing you at this research seminar.
ABOUT THE TALK
This thesis delves into the acceleration and optimization of Transformer inference, a subject of increasing importance with the emergence of Large Language Models (LLMs). The study primarily addresses the challenges posed by the inherent property of Transformers during inference: autoregressive nature.
ABOUT THE SPEAKER
DU Cunxiao is a PhD Candidate supervised by Prof. Jing JIANG. His research interests are LLM and MLsys. His life creed is LESS IS MORE.