Skip to yearly menu bar Skip to main content


Spotlight
in
Workshop: Machine Learning for Systems

Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs (Chris Cummins, Meta)

Chris Cummins · Volker Seeker · Hugh Leather · Jordi Armengol-EstapĂ© · Aram Markosyan · Gabriel Synnaeve

[ ] [ Project Page ]
Sun 15 Dec 2:20 p.m. PST — 2:30 p.m. PST
 
presentation: Machine Learning for Systems
Sun 15 Dec 8:15 a.m. PST — 4:30 p.m. PST

Abstract:

Tools for rewriting, refactoring and optimizing code should be fast and correct. Large Language Models (LLMs), by their nature, possess neither of these qualities. Yet, there remains tremendous opportunity in using LLMs to improve code.We explore the use of LLMs not to transform code, but to code transforms. We propose a chain-of-thought approach to synthesizing code transformations from a small number of input/output code examples. Unlike the direct rewrite approach, LLM-generated transformations are easy to inspect, debug, and validate. The logic of the rewrite is explicitly coded and easy to adapt. The compute required to run code transformations is minute compared to that of LLM rewriting.We test our approach on 16 Python code transformations and find that LLM-generated transforms are perfectly precise for 7 of them and less imprecise than direct LLM rewriting on the others. We hope to encourage further research to improving the precision of LLM code rewriting.

Chat is not available.