Workshop
Adaptive Foundation Models: Evolving AI for Personalized and Efficient Learning
Mengye Ren · Paul Vicol · Naila Murray · Renjie Liao · Beidi Chen · Wei-Chiu Ma
West Exhibition Hall A
Sat 14 Dec, 8:30 a.m. PST
In the rapidly evolving landscape of AI, the development of adaptive foundation models represents a ground-breaking shift towards AI systems that can continually learn, adapt, and evolve in response to new information, changing environments, and user preferences. This workshop aims to explore cutting-edge advancements in adaptive foundation models, focusing on methodologies that enable continual weight updates, memory-efficient fine-tuning, and personalized adaptation to diverse tasks and domains. We feature invited talks by experts in LLMs, diffusion models, multimodal learning, continual learning, and efficient ML to explore this interdisciplinary topic. We host workshop paper submissions and invite oral papers for contributed talks. In addition, there is a panel discussion with the invited speakers.
Schedule
Sat 8:30 a.m. - 9:00 a.m.
|
Keynote: Tree Search for Language Model Agents
(
Talk
)
>
SlidesLive Video |
Ruslan Salakhutdinov 🔗 |
Sat 9:00 a.m. - 9:30 a.m.
|
Multimodal Iterative Refinement
(
Invited Talk
)
>
SlidesLive Video |
Sander Dieleman 🔗 |
Sat 9:30 a.m. - 9:45 a.m.
|
Coffee Break
|
🔗 |
Sat 9:45 a.m. - 10:15 a.m.
|
Is pre-training the key to successful domain generalization?
(
Invited Talk
)
>
SlidesLive Video |
Kate Saenko 🔗 |
Sat 10:15 a.m. - 10:45 a.m.
|
Workshop Oral Papers 1
(
Contributed Talks
)
>
|
🔗 |
Sat 10:15 a.m. - 10:25 a.m.
|
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
(
Oral
)
>
SlidesLive Video |
Fabian Paischer · Lukas Hauzenberger · Thomas Schmied · Benedikt Alkin · Marc Deisenroth · Sepp Hochreiter 🔗 |
Sat 10:25 a.m. - 10:35 a.m.
|
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
(
Oral
)
>
SlidesLive Video |
Joel Jang · Seungone Kim · Bill Yuchen Lin · Yizhong Wang · Jack Hessel · Luke Zettlemoyer · Hannaneh Hajishirzi · Yejin Choi · Prithviraj Ammanabrolu 🔗 |
Sat 10:35 a.m. - 10:45 a.m.
|
RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems
(
Oral
)
>
SlidesLive Video |
Jennifer Hsia · Afreen Shaikh · Zhiruo Wang · Graham Neubig 🔗 |
Sat 10:45 a.m. - 11:45 a.m.
|
Morning Poster Session, Paper IDs 1-75
(
Poster Session
)
>
|
🔗 |
Sat 11:45 a.m. - 1:00 p.m.
|
Lunch Break
|
🔗 |
Sat 1:00 p.m. - 1:30 p.m.
|
Continual Foundation Model Learning
(
Invited Talk
)
>
SlidesLive Video |
Matthias Bethge · Matthias Bethge · Vishaal Udandarao 🔗 |
Sat 1:30 p.m. - 2:00 p.m.
|
On the Knowledge Adaptability of Language Models
(
Invited Talk
)
>
SlidesLive Video |
Minjoon Seo 🔗 |
Sat 2:00 p.m. - 3:00 p.m.
|
Workshop Oral Papers 2
(
Contributed Talks
)
>
|
🔗 |
Sat 2:00 p.m. - 2:10 p.m.
|
Self-Play Preference Optimization for Language Model Alignment
(
Oral
)
>
SlidesLive Video |
Yue Wu · Zhiqing Sun · Huizhuo Yuan · Kaixuan Ji · Yiming Yang · Quanquan Gu 🔗 |
Sat 2:10 p.m. - 2:20 p.m.
|
Fast and Accurate Language Model Decoding via Parallel Token Processing
(
Oral
)
>
SlidesLive Video |
Zhepei Wei · Wei-Lin Chen · Xinyu Zhu · Yu Meng 🔗 |
Sat 2:20 p.m. - 2:30 p.m.
|
Fine-tuning LLM Agents with Retrospective In-Context Online Learning
(
Oral
)
>
SlidesLive Video |
Wen-Tse Chen · Jiayu Chen · Fahim Tajwar · Hao Zhu · Xintong Duan · Ruslan Salakhutdinov · Jeff Schneider 🔗 |
Sat 2:30 p.m. - 2:40 p.m.
|
Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle
(
Oral
)
>
SlidesLive Video |
Amelia Hui Dai · Ryan Teehan · Mengye Ren 🔗 |
Sat 2:40 p.m. - 2:50 p.m.
|
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
(
Oral
)
>
SlidesLive Video |
Tong Chen · Hao Fang · Patrick Xia · Xiaodong Liu · Ben Van Durme · Luke Zettlemoyer · Jianfeng Gao · Hao Cheng 🔗 |
Sat 2:50 p.m. - 3:00 p.m.
|
Coffee Break
|
🔗 |
Sat 3:00 p.m. - 3:30 p.m.
|
Continual Learning with Large Language Models
(
Invited Talk
)
>
SlidesLive Video |
Bing Liu 🔗 |
Sat 3:30 p.m. - 4:00 p.m.
|
Enable Large Language Model Deployment Across Cloud and Edge with ML Compilation
(
Invited Talk
)
>
SlidesLive Video |
Tianqi Chen 🔗 |
Sat 4:00 p.m. - 4:30 p.m.
|
Panel Discussion
(
Panel
)
>
SlidesLive Video |
Ruslan Salakhutdinov · Sander Dieleman · Kate Saenko · Matthias Bethge · Minjoon Seo · Bing Liu 🔗 |
Sat 4:30 p.m. - 5:30 p.m.
|
Afternoon Poster Session, Paper IDs 76-157
(
Poster Session
)
>
|
🔗 |
-
|
Towards Personalized Language Models via Inference-time Human Preference Optimization ( Poster ) > link | Nikki Lijing Kuang · Wei Sun · Scott McFaddin · Yian Ma · Markus Ettl 🔗 |
-
|
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning ( Poster ) > link | Alexander Nikulin · Ilya Zisman · Alexey Zemtsov · Vladislav Kurenkov 🔗 |
-
|
SeCom: On Memory Construction and Retrieval for Personalized Conversational Agents ( Poster ) > link |
11 presentersZhuoshi Pan · Qianhui Wu · Huiqiang Jiang · Xufang Luo · Hao Cheng · Dongsheng Li · Yuqing Yang · Chin-Yew Lin · H. Vicky Zhao · Lili Qiu · Jianfeng Gao |
-
|
metaTextGrad: Learning to learn with language models as optimizers ( Poster ) > link | Guowei Xu · Mert Yuksekgonul · Carlos Guestrin · James Zou 🔗 |
-
|
Adapting Foundation Models via Training-free Dynamic Weight Interpolation ( Poster ) > link | Changdae Oh · Sharon Li · Kyungwoo Song · Sangdoo Yun · Dongyoon Han 🔗 |
-
|
Personalized Language Modeling from Personalized Human Feedback ( Poster ) > link | Xinyu Li · Ruiyang Zhou · Zachary Lipton · Liu Leqi 🔗 |
-
|
AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations ( Poster ) > link | Gaurav Verma · Rachneet Kaur · Nishan Srishankar · Zhen Zeng · Tucker Balch · Manuela Veloso 🔗 |
-
|
Automatically Generating Custom Context-Driven SFT Data for LLMs with Multi-Granularity ( Poster ) > link | Shanghaoran Quan 🔗 |
-
|
ZO-Offloading: Fine-Tuning LLMs with 100 Billion Parameters on a Single GPU ( Poster ) > link | Liangyu Wang · Jie Ren · Hang Xu · Junxiao Wang · David Keyes · Di Wang 🔗 |
-
|
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory ( Poster ) > link | Di Wu · Hongwei Wang · Wenhao Yu · Yuwei Zhang · Kai-Wei Chang · Dong Yu 🔗 |
-
|
InvestAlign: Align LLMs with Investor Decision-Making under Herd Behavior ( Poster ) > link | Huisheng Wang · Zhuoshi Pan · Hangjing Zhang · Mingxiao Liu · Yiqing Lin · H. Vicky Zhao 🔗 |
-
|
Improving Model Merging with Natural Niches ( Poster ) > link | João Abrantes · Robert Lange · Yujin Tang 🔗 |
-
|
Improving In-Context Learning with Small Language Model Ensembles ( Poster ) > link | Mehdi Mojarradi · Lingyi Yang · Robert McCraith · Adam Mahdi 🔗 |
-
|
Adaptive LoRA Merging for Efficient Domain Incremental Learning ( Poster ) > link | Eric Nuertey Coleman · Luigi Quarantiello · Julio Hurtado · Vincenzo Lomonaco 🔗 |
-
|
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ( Oral ) > link | Fabian Paischer · Lukas Hauzenberger · Thomas Schmied · Benedikt Alkin · Marc Deisenroth · Sepp Hochreiter 🔗 |
-
|
Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation ( Poster ) > link | Quanting Xie · So Yeon Min · Tianyi Zhang · Kedi Xu · Aarav Bajaj · Ruslan Salakhutdinov · Matthew Johnson-Roberson · Yonatan Bisk 🔗 |
-
|
Slaying the HyDRA: Parameter-Efficient Hyper Networks with Low-Displacement Rank Adaptation ( Poster ) > link | Xiangyu Chen · Ye Wang · Matthew Brand · Perry Wang · Jing Liu · Toshiaki Koike-Akino 🔗 |
-
|
Generating Diverse Negations from Affirmative Sentences ( Poster ) > link | Darian Vasquez · Afroditi Papadaki 🔗 |
-
|
P3O: Pessimistic Preference-based Policy Optimization for Robust Alignment from Preferences ( Poster ) > link | Dhawal Gupta · Christoph Dann · Alekh Agarwal 🔗 |
-
|
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models ( Poster ) > link | Peng Xia · Kangyu Zhu · Haoran Li · Tianze Wang · Weijia Shi · Linjun Zhang · James Zou · Huaxiu Yao 🔗 |
-
|
IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning ( Poster ) > link | Soeun Lee · Si-Woo Kim · taewhan Kim · Dong-Jin Kim 🔗 |
-
|
Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks ( Poster ) > link | Minju Seo · Jinheon Baek · James Thorne · Sung Ju Hwang 🔗 |
-
|
$\text{Transformer}^2$: Self-adaptive LLMs ( Poster ) > link | Qi Sun · Edoardo Cetin · Yujin Tang 🔗 |
-
|
Data-Efficient Training by Evolved Sampling ( Poster ) > link | Ziheng Cheng · Zhong Li · Jiang Bian 🔗 |
-
|
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion ( Oral ) > link | Yongyuan Liang · Tingqiang Xu · Kaizhe Hu · Guangqi Jiang · Furong Huang · Huazhe Xu 🔗 |
-
|
Agent Skill Acquisition for LLMs via CycleQD ( Poster ) > link | So Kuroki · Taishi Nakamura · Takuya Akiba · Yujin Tang 🔗 |
-
|
Enhancing Low-Light Imagery: A Fusion of Deep Learning and Diffusion Models for Superior Visibility ( Poster ) > link | Yangfan He · Jianhui Wang · Sida Li · Haoyuan Li · TIANYU SHI 🔗 |
-
|
AgentMerge: Enhancing Generalization in Fine-Tuned LLM Agents ( Poster ) > link | Megh Thakkar · Léo Boisvert · Thibault de Chezelles · Alexandre Piche · Maxime Gasse · Massimo Caccia · Alexandre Lacoste 🔗 |
-
|
Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models ( Poster ) > link | Ognjen Rudovic · Pranay Dighe · Yi Su · Vineet Garg · Sameer Dharur · Xiaochuan Niu · Ahmed Abdelaziz · Saurabh Adya · Ahmed Tewfik 🔗 |
-
|
AoP-SAM: Automation of Prompts for Efficient Segmentation ( Poster ) > link | Yi Chen · Muyoung Son · Chuanbo Hua · Joo-Young Kim 🔗 |
-
|
Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models ( Poster ) > link | Gang Li · Wendi Yu · Yao Yao · Wei Tong · Yingbin Liang · Qihang Lin · Tianbao Yang 🔗 |
-
|
Enhancing Fine-Tuning Efficiency of LLMs Through Gradient Subspace Tracking ( Poster ) > link | Sahar Rajabi · Sirisha Rambhatla 🔗 |
-
|
Memory Efficient Continual Learning with CLIP Models ( Poster ) > link | Ryan King · Gang Li · Bobak J Mortazavi · Tianbao Yang 🔗 |
-
|
Empowering LLM Agents with Zero-Shot Optimal Decision-Making through Q-learning ( Poster ) > link | Jiajun Chai · Sicheng Li · Yuqian Fu · Dongbin Zhao · Yuanheng Zhu 🔗 |
-
|
Leveraging Self Weak-supervision for Improved VLM Performance ( Poster ) > link | Shuvendu Roy · Ali Etemad 🔗 |
-
|
Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention ( Poster ) > link | Tianyun Yang · Ziniu Li · Juan Cao · Chang Xu 🔗 |
-
|
Deliberate Practice with Synthetic Data ( Poster ) > link | Reyhane Askari Hemmat · Mohammad Pezeshki · Pietro Astolfi · Melissa Hall · Florian Bordes · Jakob Verbeek · Michal Drozdzal · Adriana Romero 🔗 |
-
|
LangDA: Adapting Visual Features with Instruction Tuning for Semantic Segmentation ( Poster ) > link | Chang Liu · Saad Hossain · C Thomas · Kwei-Herng Lai · Raviteja Vemulapalli · Sirisha Rambhatla · Alexander Wong 🔗 |
-
|
Fine-Grained Visual Recognition in the Age of Multimodal LLMs ( Poster ) > link | Hari Chandana Kuchibhotla · Abbavaram Gowtham Reddy · Sai Srinivas Kancheti · Vineeth N Balasubramanian 🔗 |
-
|
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding ( Poster ) > link | Xinyu Yang · Tianqi Chen · Beidi Chen 🔗 |
-
|
Imbalance-Regularized LoRA: A Plug-and-Play Method for Improving Fine-Tuning of Foundation Models ( Poster ) > link | Zhenyu Zhu · Yongtao Wu · Quanquan Gu · Volkan Cevher 🔗 |
-
|
Transfer Learning for Finetuning Large Language Models ( Poster ) > link | Tobias Strangmann · Lennart Purucker · Jörg Franke · Ivo Rapant · Fabio Ferreira · Frank Hutter 🔗 |
-
|
Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass ( Oral ) > link | Tong Chen · Hao Fang · Patrick Xia · Xiaodong Liu · Ben Van Durme · Luke Zettlemoyer · Jianfeng Gao · Hao Cheng 🔗 |
-
|
Enhancing Long Context Performance in LLMs Through Inner Loop Query Mechanism ( Poster ) > link | Yimin Tang · Yurong Xu · Ning Yan · Masood Seyed Mortazavi 🔗 |
-
|
Synergistic Weak-Strong Collaboration by Aligning Preferences ( Poster ) > link | Yizhu Jiao · Xuchao Zhang · Zhaoyang Wang · Yubo Ma · Zhun Deng · Rujia Wang · Chetan Bansal · Saravan Rajmohan · Jiawei Han · Huaxiu Yao 🔗 |
-
|
ViPCap: Retrieval Text-based Visual Prompts for Lightweight Image Captioning ( Poster ) > link | taewhan Kim · Soeun Lee · Si-Woo Kim · Dong-Jin Kim 🔗 |
-
|
Enhancing Cross-Language Code Translation via Task-Specific Embedding Alignment in Retrieval-Augmented Generation ( Poster ) > link | Manish Bhattarai · Minh Vu · Javier E. Santos · Ismael Boureima · Daniel O'Malley 🔗 |
-
|
Fine-tuning LLM Agents with Retrospective In-Context Online Learning ( Oral ) > link | Wen-Tse Chen · Jiayu Chen · Fahim Tajwar · Hao Zhu · Xintong Duan · Ruslan Salakhutdinov · Jeff Schneider 🔗 |
-
|
REGENT: A Retrieval-Augmented Generalist Agent That Can Act in-Context In New Environments ( Poster ) > link | Kaustubh Sridhar · Souradeep Dutta · Dinesh Jayaraman · Insup Lee 🔗 |
-
|
Pixelated Instructions: Can Multimodal Large Language Models Follow Printed Instructions in Images? ( Poster ) > link | Xiujun Li · Yujie Lu · William Yang Wang · Yejin Choi 🔗 |
-
|
Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning? ( Poster ) > link | Bowen Zhao · Leo Dirac · Paulina Varshavskaya 🔗 |
-
|
Fast and Accurate Language Model Decoding via Parallel Token Processing ( Oral ) > link | Zhepei Wei · Wei-Lin Chen · Xinyu Zhu · Yu Meng 🔗 |
-
|
Are LLMs Prescient? A Continuous Evaluation using Daily News as the Oracle ( Oral ) > link | Amelia Hui Dai · Ryan S Teehan · Mengye Ren 🔗 |
-
|
CAT Pruning: Cluster-Aware Token Pruning For Text-to-Image Diffusion Models ( Poster ) > link | Xinle Cheng · Zhuoming Chen · Zhihao Jia 🔗 |
-
|
Personalized Adaptation via In-Context Preference Learning ( Poster ) > link | Allison Lau · Younwoo Choi · Vahid Balazadeh · Keertana Chidambaram · Vasilis Syrgkanis · Rahul Krishnan 🔗 |
-
|
OmniPredict: GPT-4o Enhanced Multi-modal Pedestrian Crossing Intention Prediction ( Poster ) > link | Je-Seok Ham · Jia Huang · Peng Jiang · Jinyoung Moon · Yongjin Kwon · Srikanth Saripalli · Changick Kim 🔗 |
-
|
Common Pitfalls of Margin-based Preference Optimization in Language Model Alignment ( Poster ) > link | Hui Yuan · Yifan Zeng · Yue Wu · Huazheng Wang · Mengdi Wang · Liu Leqi 🔗 |
-
|
Self-Play Preference Optimization for Language Model Alignment ( Oral ) > link | Yue Wu · Zhiqing Sun · Huizhuo Yuan · Kaixuan Ji · Yiming Yang · Quanquan Gu 🔗 |
-
|
Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning ( Poster ) > link | Song Jiang · Da JU · Andrew Cohen · Sasha Mitts · Aaron Foss · Justine Kao · Xian Li · Yuandong Tian 🔗 |
-
|
Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding ( Poster ) > link | Yangfan He · Jianhui Wang · Haoyuan Li · Sida Li · Li Sun · TIANYU SHI 🔗 |
-
|
Towards Federated Low-Rank Adaptation with Rank Heterogeneity ( Poster ) > link | Yuji Byun · Jaeho Lee 🔗 |
-
|
UniTMGE: Uniform Text-Motion Generation and Editing via Diffusion Model ( Poster ) > link | Ruoyu Wang · Xiang Li · Tengjiao Sun · Yangfan He · TIANYU SHI · yitingxie 🔗 |
-
|
Assisted Few-Shot Learning for Vision-Language Models in Agricultural Stress Phenotype Identification ( Poster ) > link | Muhammad Arbab Arshad · Talukder "Zaki" Jubery · Asheesh Singh · ARTI SINGH · Chinmay Hegde · Baskar Ganapathysubramanian · Aditya Balu · Adarsh Krishnamurthy · Soumik Sarkar 🔗 |
-
|
On Pre-training of Multimodal Language Models Customized for Chart Understanding ( Poster ) > link | Wan-Cyuan Fan · Yen-Chun Chen · Mengchen Liu · Lu Yuan · Leonid Sigal 🔗 |
-
|
Personas within Parameters: Fine-Tuning Small Language Models with Low-Rank Adapters to Mimic User Behaviors ( Poster ) > link | Himanshu Thakur · Eshani Agrawal · Smruthi Mukund 🔗 |
-
|
Controlling Multimodal LLMs via Reward-guided Decoding ( Poster ) > link | Oscar Mañas · Pierluca D'Oro · Koustuv Sinha · Adriana Romero · Michal Drozdzal · Aishwarya Agrawal 🔗 |
-
|
Dream To Adapt: Learning Behaviors by Latent Imagination Under Non-Stationarity ( Poster ) > link | Emiliyan Gospodinov · Vaisakh Shaj Kumar · Philipp Becker · Stefan Geyer · Gerhard Neumann 🔗 |
-
|
Sirius: Contextual Sparsity with Correction for Efficient LLM ( Poster ) > link | Yang Zhou · Zhuoming Chen · Zhaozhuo Xu · Victoria Lin · Beidi Chen 🔗 |
-
|
Approximate Top-k for Increased Parallelism ( Poster ) > link | Oscar Key · Luka Ribar · Alberto Cattaneo · Luke Hudlass-Galley · Douglas Orr 🔗 |
-
|
PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences ( Poster ) > link | Daiwei Chen · Yi Chen · Aniket Rege · Ramya Korlakai Vinayak 🔗 |
-
|
Instant Transformer Adaption via HyperLoRA ( Poster ) > link | Rujikorn Charakorn · Edoardo Cetin · Yujin Tang · Robert Lange 🔗 |
-
|
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models ( Poster ) > link |
12 presentersPeng Xia · Siwei Han · Shi Qiu · Yiyang Zhou · Zhaoyang Wang · Wenhao Zheng · Zhaorun Chen · Chenhang Cui · Mingyu Ding · Linjie Li · Lijuan Wang · Huaxiu Yao |
-
|
RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems ( Oral ) > link | Jennifer Hsia · Afreen Shaikh · Zhiruo Wang · Graham Neubig 🔗 |
-
|
MagicPIG: LSH Sampling for Efficient LLM Generation ( Poster ) > link |
11 presentersZhuoming Chen · Ranajoy Sadhukhan · Zihao Ye · Yang Zhou · Jianyu Zhang · Niklas S Nolte · Yuandong Tian · Matthijs Douze · Leon Bottou · Zhihao Jia · Beidi Chen |
-
|
Efficient Domain Adaptation of Robotic Foundation Models via Hypernetwork-Generated LoRA ( Poster ) > link | Zheng Xiong · Siddhant Sharma · Kang Li · Risto Vuorio · Shimon Whiteson 🔗 |
-
|
NegMerge: Consensual Weight Negation for Strong Machine Unlearning ( Poster ) > link | Hyoseo Kim · Dongyoon Han · Junsuk Choe 🔗 |
-
|
Controlling Forgetting with Test-Time Data in Continual Learning ( Poster ) > link | Vaibhav Singh · Rahaf Aljundi · Eugene Belilovsky 🔗 |
-
|
Efficient Transfer Learning driven by Layer-wise Features Aggregation ( Poster ) > link | Chanwoo Kim · Jeyoon Yeom · JOOWANG KIM · Suho Kang · Kyungwoo Song 🔗 |
-
|
Ensemble-based Offline Reinforcement Learning with Adaptive Behavior Cloning ( Poster ) > link | Danyang Wang · Lingsong Zhang 🔗 |
-
|
Dynamic Subset Tuning: Expanding the Operational Range of Parameter-Efficient Training for Large Language Models ( Poster ) > link | Felix Stahlberg · Jared Lichtarge · Shankar Kumar 🔗 |
-
|
MESS+: Energy-Optimal Inferencing in Language Model Zoos with Service Level Guarantees ( Poster ) > link | Ryan Zhang · Herbert Woisetschläger · Shiqiang Wang · Hans Arno Jacobsen 🔗 |
-
|
Uncertainty-Penalized Direct Preference Optimization ( Poster ) > link | Sam Houliston · Alizée Pace · Alexander Immer · Gunnar Rätsch 🔗 |
-
|
Continuous Language Model Interpolation for Dynamic and Controllable Text Generation ( Poster ) > link | Sara Kangaslahti · David Alvarez-Melis 🔗 |
-
|
Long Context RAG Performance of Large Language Models ( Poster ) > link | Quinn Leng · Jacob Portes · Samuel Havens · Matei A Zaharia · Michael Carbin 🔗 |
-
|
Adapting Language Models via Token Alignment ( Poster ) > link | Zhili Feng · Tanya Marwah · Lester Mackey · David Alvarez-Melis · Nicolo Fusi 🔗 |
-
|
Exploring Visual Prompt Tuning for Demographic Adaptation in Foundation Models for Medical Imaging ( Poster ) > link | Artur Parkhimchyk · Amirreza Naziri · Laleh Seyyed-Kalantari 🔗 |
-
|
Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts ( Poster ) > link | Nikolas Gritsch · Qizhen (Irene) Zhang · Acyr Locatelli · Sara Hooker · Ahmet Ãœstün 🔗 |
-
|
Narrow Transformer: Mono-lingual Code SLM for Desktop ( Poster ) > link | Kamalkumar Rathinasamy · Balaji J · Ankush Kumar · Gagan Gayari · Harshini K · Rajab Mondal · Sreenivasa S · Swayam Singh · Mohammed Rafee Tarafdar 🔗 |
-
|
Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMs ( Poster ) > link | Megh Thakkar · Yash More · Quentin Fournier · Matthew Riemer · Pin-Yu Chen · Amal Zouaq · Payel Das · Sarath Chandar 🔗 |
-
|
N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs ( Poster ) > link | Ilya Zisman · Alexander Nikulin · Andrei Polubarov · Nikita Lyubaykin · Vladislav Kurenkov 🔗 |
-
|
In-Context Learning behaves as a greedy layer-wise gradient descent algorithm ( Poster ) > link | Brian Chen · Tianyang Hu · Hui Jin · Hwee Lee · Kenji Kawaguchi 🔗 |
-
|
Dynamically Managing a Prompt Pool via Self-Enhancement in Continual Learning ( Poster ) > link | Hayun Lee · Kiseong Hong · Hwanhee Lee · Sungho Suh · Eunwoo Kim 🔗 |
-
|
DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach ( Poster ) > link | Daniel Gallo Fernández · Răzvan-Andrei MatiÈ™an · Alejandro Monroy · Ana Vasilcoiu · Janusz Partyka · Tin Hadži Veljković · Metod Jazbec 🔗 |
-
|
Is In-Context Learning Sufficient for Instruction Following in LLMs? ( Poster ) > link | Hao Zhao · Maksym Andriushchenko · Francesco Croce · Nicolas Flammarion 🔗 |
-
|
Fully-inductive Node Classification on Arbitrary Graphs ( Poster ) > link | Jianan Zhao · Michael Galkin · Hesham Mostafa · Michael Bronstein · Zhaocheng Zhu · Jian Tang 🔗 |
-
|
Visual Language Alignment Tuning ( Poster ) > link | LE ZHANG · Qian Yang · Aishwarya Agrawal 🔗 |
-
|
Enhancing Multi-Agent Multi-Modal Collaboration with Fine-Grained Reward Modeling ( Poster ) > link | Qian Yang · Weixiang Yan · Aishwarya Agrawal 🔗 |
-
|
Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications ( Poster ) > link | Bo Wen · Xin Zhang 🔗 |
-
|
InstructRAG: Instructing Retrieval Augmented Generation via Self-Synthesized Rationales ( Poster ) > link | Zhepei Wei · Wei-Lin Chen · Yu Meng 🔗 |
-
|
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging ( Oral ) > link | Joel Jang · Seungone Kim · Bill Yuchen Lin · Yizhong Wang · Jack Hessel · Luke Zettlemoyer · Hannaneh Hajishirzi · Yejin Choi · Prithviraj Ammanabrolu 🔗 |
-
|
Towards Conversational AI for Spina Bifida Care ( Poster ) > link | Asfandyar Azhar · Shaurjya Mandal · Nidhish Shah 🔗 |
-
|
Informed Tree of Thought: Cost-efficient Problem Solving with Large Language Models ( Poster ) > link | Sajad Mousavi · Desik Rengarajan · Ashwin Ramesh Babu · Sahand Ghorbanpour · Vineet Gundecha · Avisek Naug · Soumyendu Sarkar 🔗 |
-
|
Efficient Fine-Tuning of Image-Conditional Diffusion Models for Depth and Surface Normal Estimation ( Poster ) > link | Gonzalo Martin Garcia · Karim Abou Zeid · Christian Schmidt · Daan de Geus · Alexander Hermans · Bastian Leibe 🔗 |
-
|
LinkGPT: Teaching Large Language Models To Predict Missing Links ( Poster ) > link | Zhongmou He · Jing Zhu · Shengyi Qian · Joyce Chai · Danai Koutra 🔗 |
-
|
Extracting Parallelism from Large Language Model Queries ( Poster ) > link | Steven Kolawole · Keshav Santhanam · Pratiksha Thaker · Virginia Smith 🔗 |
-
|
PM-Jewelry: Personalized Multimodal Adaptation for Virtual Jewelry Try-On with Latent Diffusion ( Poster ) > link | Yangfan He · Yinghui Xia · Jinfeng Wei · TIANYU SHI · Yang Jingsong 🔗 |
-
|
CTRL-O: Language-Controllable Object-Centric Visual Representation Learning ( Poster ) > link | Aniket Didolkar · Andrii Zadaianchuk · Rabiul Awal · Maximilian Seitzer · Efstratios Gavves · Aishwarya Agrawal 🔗 |
-
|
COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement ( Poster ) > link | Yuxi Xie · Anirudh Goyal · Xiaobao Wu · Xunjian Yin · Xiao Xu · Min-Yen Kan · Liangming Pan · William Yang Wang 🔗 |
-
|
Do Think Tags Really Help LLMs Plan? A Critical Evaluation of ReAct-Style Prompting ( Poster ) > link | Mudit Verma · Siddhant Bhambri · Subbarao Kambhampati 🔗 |
-
|
SpikingVTG: Saliency Feedback Gating Enabled Spiking Video Temporal Grounding ( Poster ) > link | Malyaban Bal · Brian Matejek · Susmit Jha · Adam Cobb 🔗 |
-
|
Domain Adaptation for Robust Model Routing ( Poster ) > link | Christoph Dann · Yishay Mansour · Teodor Vanislavov Marinov · Mehryar Mohri 🔗 |
-
|
Automated Design of Agentic Systems ( Poster ) > link | Shengran Hu · Cong Lu · Jeff Clune 🔗 |
-
|
Can the Spectrum of the Neural Tangent Kernel Anticipate Fine-Tuning Performance? ( Poster ) > link | Zahra Rahimi Afzal · Tara Esmaeilbeig · Mojtaba Soltanalian · Mesrob I Ohannessian 🔗 |
-
|
Situated Instruction Following Under Ambiguous Human Intent ( Poster ) > link | So Yeon Min · Xavier Puig · Devendra Singh Chaplot · Tsung-Yen Yang · Akshara Rai · Priyam Parashar · Ruslan Salakhutdinov · Yonatan Bisk · Roozbeh Mottaghi 🔗 |
-
|
Better Prompt Compression Without Multi-Layer Perceptrons ( Poster ) > link | Edouardo Honig · Andrew Lizarraga · Zijun Frank Zhang · Ying Nian Wu 🔗 |
-
|
Effective Text-to-Image Alignment with Quality Aware Pair Ranking ( Poster ) > link | Kunal Singh · Mukund Khanna · Pradeep Moturi 🔗 |
-
|
FlashDP: Memory-Efficient and High-Throughput DP-SGD Training for Large Language Models ( Poster ) > link | Liangyu Wang · Junxiao Wang · Jie Ren · Zihang Xiang · David Keyes · Di Wang 🔗 |
-
|
GraphText: Graph Reasoning in Text Space ( Poster ) > link | Jianan Zhao · Le Zhuo · Yikang Shen · Meng Qu · Kai Liu · Michael Bronstein · Zhaocheng Zhu · Jian Tang 🔗 |
-
|
Prompt Learning Based Adaptor for Enhanced Video Editing with Pretrained Text-to-Image Diffusion Models ( Poster ) > link | Yangfan He · Sida Li · Jianhui Wang 🔗 |
-
|
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs ( Poster ) > link | Jonas Hübotter · Sascha Bongni · Ido Hakimi · Andreas Krause 🔗 |
-
|
Understanding Visual Concepts Across Models ( Poster ) > link | Brandon Trabucco · Max Gurinas · Kyle Doherty · Ruslan Salakhutdinov 🔗 |
-
|
From One to Zero: RAG-IM Adapts Language Models for Interpretable Zero-Shot Clinical Predictions ( Poster ) > link | Sazan Mahbub · Caleb Ellington · Sina Alinejad · Kevin Wen · Yingtao Luo · Ben Lengerich · Eric Xing 🔗 |
-
|
Pick Your Influencer: Being Selective is Good for Personalization ( Poster ) > link | Ashutosh Ranjan · Vivek Srivastava · Shirish Karande 🔗 |
-
|
Warmstarting for scaling language models ( Poster ) > link | Neeratyoy Mallik · Maciej Janowski · Johannes Hog · Herilalaina Rakotoarison · Aaron Klein · Josif Grabocka · Frank Hutter 🔗 |
-
|
Evaluating RAG System Performance: The Impact of Knowledge Cut-off and Fine-Tuning ( Poster ) > link | Omkar Dige · John Willes · David B. Emerson 🔗 |
-
|
Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers ( Poster ) > link | Yingyu Liang · Zhenmei Shi · Zhao Song · Yufa Zhou 🔗 |
-
|
MD-DiT: Step-aware Mixture-of-Depths for Efficient Diffusion Transformers ( Poster ) > link | Mingzhu Shen · pengtao chen · Peng Ye · Guoxuan Xia · Tao Chen · Christos Bouganis · Yiren Zhao 🔗 |
-
|
Accelerated Preference Optimization for Large Language Model Alignment ( Poster ) > link | Jiafan He · Huizhuo Yuan · Quanquan Gu 🔗 |
-
|
Towards Robust and Cost-Efficient Knowledge Unlearning for Large Language Models ( Poster ) > link | Sungmin Cha · Sungjun Cho · Dasol Hwang · Moontae Lee 🔗 |
-
|
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing ( Poster ) > link | Wenhao Zheng · Yixiao Chen · Weitong Zhang · Souvik Kundu · Yun Li · Zhengzhong Liu · Eric Xing · Hongyi Wang · Huaxiu Yao 🔗 |