Machine Learning for Systems

Workshop

Machine Learning for Systems

Xinlei XU · Dan Zhang · Mangpo Phothilimthana · Divya Mahajan · Haoran Qiu · Patrick Musau

West Meeting Room 201

Sun 15 Dec, 8:15 a.m. PST

[ Abstract ] Workshop Website

[ OpenReview]

Machine Learning (ML) for Systems describes the application of machine learning techniques to problems related to computer systems. By leveraging supervised learning and reinforcement learning (RL) approaches, machine learning can replace longstanding heuristics that currently drive many of these systems. This includes a wide range of topics, including multi-objective tasks such as designing new data structures, integrated circuits, or design verification, as well as implementing control algorithms for applications such as compilers, databases, memory management, or ML frameworks. While the systems community increasingly recognizes the importance of ML in solving a variety of different systems problems, ML for Systems remains an emerging area without widely established best practices, methods and strategies for the application of state-of-the-art machine learning techniques. The goal of this workshop is to provide an interdisciplinary venue for ML and Systems experts to push this boundary and start new directions within the ML for Systems area. This year, we will encourage work in key emerging areas such as Large Language Model (LLM) training and serving.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Sun 8:15 a.m. - 8:20 a.m.	Mimee Xu (NYU) ( Opening Remarks ) > SlidesLive Video	🔗
Sun 8:20 a.m. - 9:15 a.m.	Jeff Dean: Advances in Machine Learning for Systems ( Keynote ) > SlidesLive Video	Jeff Dean 🔗
Sun 9:15 a.m. - 9:55 a.m.	Natasha Jaques: Multi-Agent Reinforcement Learning for Systems ( Invited Talk ) > SlidesLive Video	Natasha Jaques 🔗
Sun 9:55 a.m. - 10:10 a.m.	Coffee Break!	🔗
Sun 10:10 a.m. - 10:40 a.m.	Richard Ho: Navigating Scaling and Efficiency Challenges of ML Systems ( Special Talk ) > SlidesLive Video	Richard Ho 🔗
Sun 10:40 a.m. - 11:10 a.m.	Tim Kraska: ML and Generative AI for Data Systems ( Special Talk ) > SlidesLive Video	Tim Kraska 🔗
Sun 11:10 a.m. - 12:00 p.m.	AM Posters ( Poster Session ) >	🔗
Sun 12:00 p.m. - 1:00 p.m.	Lunch Break!	🔗
Sun 1:00 p.m. - 1:45 p.m.	Panel: Jeff Dean, Natasha Jaques, Tim Kraska, Lidong Zhou ( Panel Discussion ) > SlidesLive Video	Jeff Dean · Natasha Jaques · Tim Kraska · Lidong Zhou 🔗
Sun 1:45 p.m. - 2:00 p.m.	Coffee Break!	🔗
Sun 2:00 p.m. - 2:20 p.m.	OpenAI's o1 Competing on IOI (Amhed, OpenAI) ( CodeGen Talk ) > SlidesLive Video	Ahmed El-Kishky 🔗
Sun 2:20 p.m. - 2:30 p.m.	Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs (Chris Cummins, Meta) ( Spotlight ) > link SlidesLive Video Link	Chris Cummins · Volker Seeker · Hugh Leather · Jordi Armengol-Estapé · Aram Markosyan · Gabriel Synnaeve 🔗
Sun 2:30 p.m. - 2:40 p.m.	The Unreasonable Effectiveness of LLMs for Query Optimization (Peter Akioyamen, UPenn) ( Spotlight ) > link SlidesLive Video Link	Peter Akioyamen · Zixuan Yi · Ryan Marcus 🔗
Sun 2:40 p.m. - 2:50 p.m.	CodeGen ( Q & A ) >	🔗
Sun 2:50 p.m. - 3:10 p.m.	Tea Break!	🔗
Sun 3:10 p.m. - 3:20 p.m.	Scalable RL for Systems via Offline Imitation from Multiple Baselines: A Case Study in Compiler Optimization (Teodor V. Marinov, Google) ( Spotlight ) > link SlidesLive Video Link	Teodor Vanislavov Marinov · Alekh Agarwal · Mircea Trofin 🔗
Sun 3:20 p.m. - 3:30 p.m.	WarpDrive: An Agentic Workflow for Ninja GPU Transformations (Siva Hari, NVIDIA) ( Spotlight ) > link SlidesLive Video Link	Sana Damani · Siva Kumar Sastry Hari · Mark Stephenson · Christos Kozyrakis 🔗
Sun 3:30 p.m. - 4:30 p.m.	PM Posters ( Poster Session ) >	🔗
-	$\text{ML$^2$Tuner}$ : Efficient Code Tuning via Multi-Level Machine Learning Models ( Poster ) > link Link	JooHyoung Cha · Munyoung Lee · Jinse Kwon · Jubin Lee · Jemin Lee · Yongin Kwon 🔗
-	BladeDISC++: Memory Optimizations Based On Symbolic Shape ( Poster ) > link Link	Xiulong Yuan · Xu Yan · Wenting Shen · Xiafei Qiu · Ang Wang · Jie Zhang · Yong Li · Wei Lin 🔗
-	V“Mean”ba: Visual State Space Models only need 1 hidden dimension ( Poster ) > link Link	TienYu Chi · Hung-Yueh Chiang · Chi-Chih Chang · Ning-Chi Huang · Kai-Chiang Wu 🔗
-	$\texttt{Mycroft}$: Towards Effective and Efficient External Data Augmentation ( Poster ) > link Link	Zain Sarwar · Van Tran · Arjun Bhagoji · Nicholas Feamster · Ben Zhao · Supriyo Chakraborty 🔗
-	The Unreasonable Effectiveness of LLMs for Query Optimization ( Poster ) > link Link	Peter Akioyamen · Zixuan Yi · Ryan Marcus 🔗
-	Predicting LLM Inference Latency: A Roofline-Driven ML Method ( Poster ) > link Link	Saki Imai · Rina Nakazawa · Marcelo Amaral · Sunyanan Choochotkaew · Tatsuhiro Chiba 🔗
-	Eagle: Efficient Training-Free Router for Multi-LLM Inference ( Poster ) > link Link	Zesen Zhao · Shuowei Jin · Zhuoqing Morley Mao 🔗
-	FlexFlood: Efficiently Updatable Learned Multi-dimensional Index ( Poster ) > link Link	FUMA HIDAKA · Yusuke Matsui 🔗
-	On the Role of Context Granularity in LLM-Driven Program Repair ( Poster ) > link Link	Tyler Holloway · Ethan Elenberg 🔗
-	FALCON: Long Short Term Memory Feedback-Driven Adaptive Code Generation for Enhanced Automated Programming Systems ( Poster ) > link Link	Zeyuan Li · Yangfan He · Yuchen Li · TIANYU SHI · Bin Lei · Jianhui Wang · Lewei He · qiu wu chen 🔗
-	Subnormal Number Attacks on Binarized Neural Networks ( Poster ) > link Link	Nicolás Berrios 🔗
-	Reward Copilot for RL-driven Systems Optimization ( Poster ) > link Link	Karan Tandon · Manav Mishra · Gagan Somashekar · Mayukh Das · Nagarajan Natarajan 🔗
-	LLMSteer: Improving Long-Context LLM Inference by Steering Attention on Reused Contexts ( Poster ) > link Link	Zhuohan Gu · Jiayi Yao · Kuntai Du · Junchen Jiang 🔗
-	WarpDrive: An Agentic Workflow for Ninja GPU Transformations ( Poster ) > link Link	Sana Damani · Siva Kumar Sastry Hari · Mark Stephenson · Christos Kozyrakis 🔗
-	Scalable RL for Systems via Offline Imitation from Multiple Baselines: A Case Study in Compiler Optimization ( Poster ) > link Link	Teodor Vanislavov Marinov · Alekh Agarwal · Mircea Trofin 🔗
-	Chiplet Placement and Routing Optimization: A Novel Benchmark and Neural Solver ( Poster ) > link Link	HAEYEON KIM · Federico Berto · Chuanbo Hua · Minsu Kim · joungho kim · Jinkyoo Park 🔗
-	Exploring CXL-based KV Cache Storage for LLM Serving ( Poster ) > link Link	Yupeng Tang · Runxiang Cheng · Ping Zhou · Tongping Liu · Fei Liu · Wei Tang · Kyoungryun Bae · Jianjun Chen · Wu Xiang · Rui Shi 🔗
-	IFMoE: An Inference Framework Design for Fine-grained MoE ( Poster ) > link Link	Yuwei An · Zhuoming Chen · Beidi Chen 🔗
-	Understanding and Alleviating Memory Issue in RLHF for LLMs ( Poster ) > link Link	Jin Zhou · Hanmei Yang · Steven Jiaxun Tang · Mingcan Xiang · Hui Guan · Tongping Liu 🔗
-	TurboMoE: Enhancing MoE Model Training with Smart Kernel-Fusion and Data Transformation ( Poster ) > link Link	Reza Yazdani Aminabadi · Connor Holmes · Samyam Rajbhandari · Zhewei Yao · Yuxiong He 🔗
-	Fixrleak: GenAI-based Resource Leak Fix for Real-World Java Programs ( Poster ) > link Link	Zhizhou Zhang · Akshay Utture · Manu Sridharan · Jens Palsberg 🔗
-	CubicML: Automated ML for Large ML Systems Co-design with ML Prediction of Performance ( Poster ) > link Link	WEI WEN · Quanyu Zhu · Weiwei Chu · Wen-Yen Chen · Jiyan Yang 🔗
-	OMPar: Automatic Parallelization with AI-Driven Source-to-Source Compilation ( Poster ) > link Link	Tal Kadosh · Niranjan Hasabnis · Prema Soundararajan · Vy Vo · Mihai Capotă · Nesreen K. Ahmed · Yuval Pinter · Gal Oren 🔗
-	Accelerating Malware Classification: A Vision Transformer Solution ( Poster ) > link Link	Shrey Bavishi · Shrey Modi 🔗
-	Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs ( Poster ) > link Link	Chris Cummins · Volker Seeker · Hugh Leather · Jordi Armengol-Estapé · Aram Markosyan · Gabriel Synnaeve 🔗
-	Debug-HD: Debugging TinyML models on-device using Hyper-Dimensional computing ( Poster ) > link Link	Nikhil Pratap Ghanathe · Steve Wilton 🔗