Sun 8:15 a.m. - 8:20 a.m.
|
Mimee Xu (NYU)
(
Opening Remarks
)
>
SlidesLive Video
|
🔗
|
Sun 8:20 a.m. - 9:15 a.m.
|
Jeff Dean: Advances in Machine Learning for Systems
(
Keynote
)
>
SlidesLive Video
|
Jeff Dean
🔗
|
Sun 9:15 a.m. - 9:55 a.m.
|
Natasha Jaques: Multi-Agent Reinforcement Learning for Systems
(
Invited Talk
)
>
SlidesLive Video
|
Natasha Jaques
🔗
|
Sun 9:55 a.m. - 10:10 a.m.
|
Coffee Break!
|
🔗
|
Sun 10:10 a.m. - 10:40 a.m.
|
Richard Ho: Navigating Scaling and Efficiency Challenges of ML Systems
(
Special Talk
)
>
SlidesLive Video
|
Richard Ho
🔗
|
Sun 10:40 a.m. - 11:10 a.m.
|
Tim Kraska: ML and Generative AI for Data Systems
(
Special Talk
)
>
SlidesLive Video
|
Tim Kraska
🔗
|
Sun 11:10 a.m. - 12:00 p.m.
|
AM Posters
(
Poster Session
)
>
|
🔗
|
Sun 12:00 p.m. - 1:00 p.m.
|
Lunch Break!
|
🔗
|
Sun 1:00 p.m. - 1:45 p.m.
|
Panel: Jeff Dean, Natasha Jaques, Tim Kraska, Lidong Zhou
(
Panel Discussion
)
>
SlidesLive Video
|
Jeff Dean · Natasha Jaques · Tim Kraska · Lidong Zhou
🔗
|
Sun 1:45 p.m. - 2:00 p.m.
|
Coffee Break!
|
🔗
|
Sun 2:00 p.m. - 2:20 p.m.
|
OpenAI's o1 Competing on IOI (Amhed, OpenAI)
(
CodeGen Talk
)
>
SlidesLive Video
|
Ahmed El-Kishky
🔗
|
Sun 2:20 p.m. - 2:30 p.m.
|
Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs (Chris Cummins, Meta)
(
Spotlight
)
>
link
SlidesLive Video
|
Chris Cummins · Volker Seeker · Hugh Leather · Jordi Armengol-Estapé · Aram Markosyan · Gabriel Synnaeve
🔗
|
Sun 2:30 p.m. - 2:40 p.m.
|
The Unreasonable Effectiveness of LLMs for Query Optimization (Peter Akioyamen, UPenn)
(
Spotlight
)
>
link
SlidesLive Video
|
Peter Akioyamen · Zixuan Yi · Ryan Marcus
🔗
|
Sun 2:40 p.m. - 2:50 p.m.
|
CodeGen
(
Q & A
)
>
|
🔗
|
Sun 2:50 p.m. - 3:10 p.m.
|
Tea Break!
|
🔗
|
Sun 3:10 p.m. - 3:20 p.m.
|
Scalable RL for Systems via Offline Imitation from Multiple Baselines: A Case Study in Compiler Optimization (Teodor V. Marinov, Google)
(
Spotlight
)
>
link
SlidesLive Video
|
Teodor Vanislavov Marinov · Alekh Agarwal · Mircea Trofin
🔗
|
Sun 3:20 p.m. - 3:30 p.m.
|
WarpDrive: An Agentic Workflow for Ninja GPU Transformations (Siva Hari, NVIDIA)
(
Spotlight
)
>
link
SlidesLive Video
|
Sana Damani · Siva Kumar Sastry Hari · Mark Stephenson · Christos Kozyrakis
🔗
|
Sun 3:30 p.m. - 4:30 p.m.
|
PM Posters
(
Poster Session
)
>
|
🔗
|
-
|
$\text{ML$^2$Tuner}$ : Efficient Code Tuning via Multi-Level Machine Learning Models
(
Poster
)
>
link
|
JooHyoung Cha · Munyoung Lee · Jinse Kwon · Jubin Lee · Jemin Lee · Yongin Kwon
🔗
|
-
|
BladeDISC++: Memory Optimizations Based On Symbolic Shape
(
Poster
)
>
link
|
Xiulong Yuan · Xu Yan · Wenting Shen · Xiafei Qiu · Ang Wang · Jie Zhang · Yong Li · Wei Lin
🔗
|
-
|
V“Mean”ba: Visual State Space Models only need 1 hidden dimension
(
Poster
)
>
link
|
TienYu Chi · Hung-Yueh Chiang · Chi-Chih Chang · Ning-Chi Huang · Kai-Chiang Wu
🔗
|
-
|
$\texttt{Mycroft}$: Towards Effective and Efficient External Data Augmentation
(
Poster
)
>
link
|
Zain Sarwar · Van Tran · Arjun Bhagoji · Nicholas Feamster · Ben Zhao · Supriyo Chakraborty
🔗
|
-
|
The Unreasonable Effectiveness of LLMs for Query Optimization
(
Poster
)
>
link
|
Peter Akioyamen · Zixuan Yi · Ryan Marcus
🔗
|
-
|
Predicting LLM Inference Latency: A Roofline-Driven ML Method
(
Poster
)
>
link
|
Saki Imai · Rina Nakazawa · Marcelo Amaral · Sunyanan Choochotkaew · Tatsuhiro Chiba
🔗
|
-
|
Eagle: Efficient Training-Free Router for Multi-LLM Inference
(
Poster
)
>
link
|
Zesen Zhao · Shuowei Jin · Zhuoqing Morley Mao
🔗
|
-
|
FlexFlood: Efficiently Updatable Learned Multi-dimensional Index
(
Poster
)
>
link
|
FUMA HIDAKA · Yusuke Matsui
🔗
|
-
|
On the Role of Context Granularity in LLM-Driven Program Repair
(
Poster
)
>
link
|
Tyler Holloway · Ethan Elenberg
🔗
|
-
|
FALCON: Long Short Term Memory Feedback-Driven Adaptive Code Generation for Enhanced Automated Programming Systems
(
Poster
)
>
link
|
Zeyuan Li · Yangfan He · Yuchen Li · TIANYU SHI · Bin Lei · Jianhui Wang · Lewei He · qiu wu chen
🔗
|
-
|
Subnormal Number Attacks on Binarized Neural Networks
(
Poster
)
>
link
|
Nicolás Berrios
🔗
|
-
|
Reward Copilot for RL-driven Systems Optimization
(
Poster
)
>
link
|
Karan Tandon · Manav Mishra · Gagan Somashekar · Mayukh Das · Nagarajan Natarajan
🔗
|
-
|
LLMSteer: Improving Long-Context LLM Inference by Steering Attention on Reused Contexts
(
Poster
)
>
link
|
Zhuohan Gu · Jiayi Yao · Kuntai Du · Junchen Jiang
🔗
|
-
|
WarpDrive: An Agentic Workflow for Ninja GPU Transformations
(
Poster
)
>
link
|
Sana Damani · Siva Kumar Sastry Hari · Mark Stephenson · Christos Kozyrakis
🔗
|
-
|
Scalable RL for Systems via Offline Imitation from Multiple Baselines: A Case Study in Compiler Optimization
(
Poster
)
>
link
|
Teodor Vanislavov Marinov · Alekh Agarwal · Mircea Trofin
🔗
|
-
|
Chiplet Placement and Routing Optimization: A Novel Benchmark and Neural Solver
(
Poster
)
>
link
|
HAEYEON KIM · Federico Berto · Chuanbo Hua · Minsu Kim · joungho kim · Jinkyoo Park
🔗
|
-
|
Exploring CXL-based KV Cache Storage for LLM Serving
(
Poster
)
>
link
|
Yupeng Tang · Runxiang Cheng · Ping Zhou · Tongping Liu · Fei Liu · Wei Tang · Kyoungryun Bae · Jianjun Chen · Wu Xiang · Rui Shi
🔗
|
-
|
IFMoE: An Inference Framework Design for Fine-grained MoE
(
Poster
)
>
link
|
Yuwei An · Zhuoming Chen · Beidi Chen
🔗
|
-
|
Understanding and Alleviating Memory Issue in RLHF for LLMs
(
Poster
)
>
link
|
Jin Zhou · Hanmei Yang · Steven Jiaxun Tang · Mingcan Xiang · Hui Guan · Tongping Liu
🔗
|
-
|
TurboMoE: Enhancing MoE Model Training with Smart Kernel-Fusion and Data Transformation
(
Poster
)
>
link
|
Reza Yazdani Aminabadi · Connor Holmes · Samyam Rajbhandari · Zhewei Yao · Yuxiong He
🔗
|
-
|
Fixrleak: GenAI-based Resource Leak Fix for Real-World Java Programs
(
Poster
)
>
link
|
Zhizhou Zhang · Akshay Utture · Manu Sridharan · Jens Palsberg
🔗
|
-
|
CubicML: Automated ML for Large ML Systems Co-design with ML Prediction of Performance
(
Poster
)
>
link
|
WEI WEN · Quanyu Zhu · Weiwei Chu · Wen-Yen Chen · Jiyan Yang
🔗
|
-
|
OMPar: Automatic Parallelization with AI-Driven Source-to-Source Compilation
(
Poster
)
>
link
|
Tal Kadosh · Niranjan Hasabnis · Prema Soundararajan · Vy Vo · Mihai Capotă · Nesreen K. Ahmed · Yuval Pinter · Gal Oren
🔗
|
-
|
Accelerating Malware Classification: A Vision Transformer Solution
(
Poster
)
>
link
|
Shrey Bavishi · Shrey Modi
🔗
|
-
|
Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs
(
Poster
)
>
link
|
Chris Cummins · Volker Seeker · Hugh Leather · Jordi Armengol-Estapé · Aram Markosyan · Gabriel Synnaeve
🔗
|
-
|
Debug-HD: Debugging TinyML models on-device using Hyper-Dimensional computing
(
Poster
)
>
link
|
Nikhil Pratap Ghanathe · Steve Wilton
🔗
|