showSidebars ==
showTitleBreadcrumbs == 1
node.field_disable_title_breadcrumbs.value ==

Pre-Conference Talk by SHI Jieke | Green Large Language Models of Code

Please click here if you are unable to view this page.

 

Green Large Language Models of Code

Speaker (s): 

 

SHI Jieke
PhD Candidate
School of Computing and Information Systems
Singapore Management University

 

 

 

 

Date:


Time:


Venue:

 

 

 

11 April 2024, Thursday


5:30pm - 6:00pm


Meeting room 5.1 Level 5
School of Computing and Information Systems 1, 
Singapore Management University, 
80 Stamford Road Singapore 178902

Please register by 10 April 2024.
 

 

About the Talk

Large language models of code have shown remarkable effectiveness in various software engineering tasks. Despite the availability of many cloud services built upon these powerful models, there remain several scenarios where developers cannot take full advantage of them, stemming from factors such as restricted or unreliable internet access, institutional privacy policies that prohibit external transmission of code to third-party vendors, and more. Therefore, developing a compact, efficient, and yet energy-saving model for deployment on developers' devices becomes essential. To this aim, we propose Avatar, a novel approach that crafts a deployable model from a large language model of code by optimizing it in terms of model size, inference latency, energy consumption, and carbon footprint while maintaining a comparable level of effectiveness. The key idea of Avatar is to formulate the optimization of language models as a multi-objective configuration tuning problem and solve it using a Satisfiability Modulo Theories (SMT) solver and a tailored optimization algorithm. We use Avatar to produce optimized language models of code with a small size (3 MB, 160× smaller). The optimized models significantly reduce energy consumption (up to 184× less), carbon footprint (up to 157× less), and inference latency (up to 76× faster), with only a negligible loss in effectiveness (1.67% on average). 

This is a Pre-Conference talk for 46th International Conference on Software Engineering (ICSE 2024).
 

About the Speaker

SHI Jieke is a PhD candidate and a Research Engineer at the School of Computing and Information Systems (SCIS), Singapore Management University (SMU). His research interests lie primarily in the intersection of Software Engineering (SE) and Artificial Intelligence (AI). Particularly, he focuses on (1) quality assurance of AI-enabled systems from an SE perspective, and (2) efficiency improvement of code models for real-world deployment. His work has been published in high-quality SE conferences such as ICSE, ASE, MSR, etc. He has won/been nominated for several research paper awards. More info: https://jiekeshi.github.io.