Workshop
Instruction Tuning and Instruction Following
Qinyuan Ye · Yizhong Wang · Shayne Longpre · Yao Fu · Daniel Khashabi
Room 220 - 222
Fri 15 Dec, 6:30 a.m. PST
Recent advancements in training large language models (LLMs) to follow “instructions” have significantly increased their ability to comprehend open-ended language commands, encompassing a wide range of needs, preferences, and values.
This transformation has produced prominent industrial models such as GPT-4 and Bard, and has drawn increased attention from the open-source and research communities, which are creating new benchmarks and resources, developing new training methods, and studying the limitations of these methods. Furthermore, instruction following powered by LLMs has proven to be effective in multi-modal settings, with applications in image editing and robotic command execution.
We organize this workshop to facilitate discussions on advancing instruction tuning methodologies and constructing general-purpose instruction-following models. The prevalence of proprietary models with restricted access creates the need for an open platform to encourage such discussions. We also aim to foster interdisciplinary collaboration by bringing together researchers from fields such as natural language processing, computer vision, robotics, human-computer interaction, and AI safety to share their latest findings and explore potential avenues for future research.
Schedule
Fri 6:40 a.m. - 7:00 a.m. | Opening Remarks
Fri 7:00 a.m. - 7:30 a.m. | Invited Talk 1 - Tatsunori Hashimoto (Talk)
Fri 7:30 a.m. - 8:00 a.m. | Invited Talk 2 - Nazneen Rajani (Talk)
Fri 8:00 a.m. - 8:15 a.m. | Break
Fri 8:15 a.m. - 8:45 a.m. | Invited Talk 3 - Fei Xia (Talk)
Fri 8:45 a.m. - 9:30 a.m. | Panel 1: Key Techniques, Insights, and Challenges in Building Instruction-following Models (Discussion Panel)
Fri 11:00 a.m. - 12:00 p.m. | Poster Session
Fri 12:00 p.m. - 12:30 p.m. | Invited Talk 4 - Sara Hooker (Talk)
Fri 12:30 p.m. - 1:00 p.m. | Invited Talk 5 - Alex Tamkin (Talk)
Fri 1:00 p.m. - 1:15 p.m. | Break
Fri 1:15 p.m. - 2:00 p.m. | Panel 2: Open and Collaborative Strategies for the Large Language Models Adaptation (Discussion Panel)
Fri 2:00 p.m. - 3:20 p.m. | Oral Presentations (Spotlight)
Fri 3:20 p.m. - 3:30 p.m. | Closing Remarks
- Improved Baselines with Visual Instruction Tuning (Poster) | Haotian Liu · Chunyuan Li · Yuheng Li · Yong Jae Lee
- Can LLM-Generated Misinformation Be Detected? (Poster) | Canyu Chen · Kai Shu
- Prometheus: Inducing Evaluation Capability in Language Models (Poster) | Seungone Kim · Jamin Shin · Yejin Cho · Joel Jang · Shayne Longpre · Hwaran Lee · Sangdoo Yun · Seongjin Shin · Sungdong Kim · James Thorne · Minjoon Seo
- Instruction-tuned LLMs with World Knowledge are More Aligned to the Human Brain (Poster) | Khai Loong Aw · Syrielle Montariol · Badr AlKhamissi · Martin Schrimpf · Antoine Bosselut
- Ring Attention with Blockwise Transformers for Near-Infinite Context (Poster) | Hao Liu · Matei A Zaharia · Pieter Abbeel
- Reflection-Tuning: Recycling Data for Better Instruction-Tuning (Poster) | Ming Li · Lichang Chen · Jiuhai Chen · Shwai He · Tianyi Zhou
- Supervised Fine-Tuning of Large Language Models on Human Demonstrations Through the Lens of Memorization (Poster) | Yubin Ge · Devamanyu Hazarika · Yang Liu · Mahdi Namazifar
- Grounding Code Generation with Input-Output Specifications (Poster) | Yeming Wen · Pengcheng Yin · Kensen Shi · Henryk Michalewski · Swarat Chaudhuri · Oleksandr Polozov
- #InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models (Poster) | Keming Lu · Hongyi Yuan · Zheng Yuan · Runji Lin · Junyang Lin · Chuanqi Tan · Chang Zhou · Jingren Zhou
- Training Speech Recognition Models to Follow Instructions (Poster) | Cheng-I Jeff Lai · Zhiyun Lu · Liangliang Cao · Ruoming Pang
- Enhanced Visual Instruction Tuning for Text-Rich Image Understanding (Poster) | Yanzhe Zhang · Ruiyi Zhang · Jiuxiang Gu · Yufan Zhou · Nedim Lipka · Diyi Yang · Tong Sun
- Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning (Poster) | Sagar Srinivas Sakhinana · Venkataramana Runkana
- Use Your INSTINCT: INSTruction optimization usIng Neural bandits Coupled with Transformers (Poster) | Xiaoqiang Lin · Zhaoxuan Wu · Zhongxiang Dai · Wenyang Hu · Yao Shu · See-Kiong Ng · Patrick Jaillet · Bryan Kian Hsiang Low
- Learning to Generate Instructions to Adapt Language Models to New Tasks (Poster) | Nihal Nayak · Yiyang Nan · Avi Trost · Stephen Bach
- An Emulator for Fine-tuning Large Language Models using Small Language Models (Poster) | Eric Mitchell · Rafael Rafailov · Archit Sharma · Chelsea Finn · Christopher D Manning
- Evaluating Large Language Models at Evaluating Instruction Following (Poster) | Zhiyuan Zeng · Jiatong Yu · Tianyu Gao · Yu Meng · Tanya Goyal · Danqi Chen
- Instruction-following Evaluation through Verbalizer Manipulation (Poster) | Shiyang Li · Jun Yan · Hai Wang · Zheng Tang · Xiang Ren · Vijay Srinivasan · Hongxia Jin
- Delve into PPO: Implementation Matters for Stable RLHF (Poster) | Rui Zheng · Shihan Dou · Songyang Gao · Yuan Hua · Wei Shen · Binghai Wang · Yan Liu · Senjie Jin · Yuhao Zhou · Limao Xiong · Lu Chen · Zhiheng Xi · Nuo Xu · Wenbin Lai · Minghao Zhu · Haoran Huang · Tao Gui · Qi Zhang · Xuanjing Huang
- NLPBench: Evaluating Large Language Models on Solving NLP Problems (Poster) | Linxin Song · Jieyu Zhang · Lechao Cheng · Pengyuan Zhou · Tianyi Zhou · Zihui Li
- Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing (Poster) | Xinyu Hu · Pengfei Tang · Simiao Zuo · Zihan Wang · Bowen Song · Qiang Lou · Jian Jiao · Denis Charles
- URIAL: Tuning-Free Instruction Learning and Alignment for Untuned LLMs (Poster) | Bill Yuchen Lin · Abhilasha Ravichander · Ximing Lu · Nouha Dziri · Melanie Sclar · Khyathi Chandu · Chandra Bhagavatula · Yejin Choi
- Verbosity Bias in Preference Labeling by Large Language Models (Poster) | Keita Saito · Akifumi Wachi · Koki Wataoka · Youhei Akimoto
- Automatic Construction of a Korean Toxic Instruction Dataset for Ethical Tuning of Large Language Models (Poster) | SungJoo Byun · Dongjun Jang · Hyemi Jo · Hyopil Shin
- Fine-tuning Language Models for Factuality (Poster) | Katherine Tian · Eric Mitchell · Huaxiu Yao · Christopher D Manning · Chelsea Finn
- Self-RAG: Self-reflective Retrieval Augmented Generation (Poster) | Akari Asai · Zeqiu Wu · Yizhong Wang · Avi Sil · Hannaneh Hajishirzi
- FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation (Poster) | Sewon Min · Kalpesh Krishna · Xinxi Lyu · Mike Lewis · Scott Yih · Pang Wei Koh · Mohit Iyyer · Luke Zettlemoyer · Hannaneh Hajishirzi
- Hierarchical Network Fusion for Multi-Modal Electron Micrograph Representation Learning with Foundational Large Language Models (Poster) | Sagar Srinivas Sakhinana · Sannidhi G N K Geethan · Venkataramana Runkana
- Exploring and Improving the Spatial Reasoning Abilities of Large Language Models (Poster) | Manasi Sharma
- Investigating the Effects of Zero-Shot Chain-of-Thought on Empathetic Dialogue Generation (Poster) | Young-Jun Lee · Dokyong Lee · Jihui Im · Joo Won Sung · Ho-Jin Choi
- Analyzing and Mitigating Object Hallucination in Large Vision-Language Models (Poster) | Yiyang Zhou · Chenhang Cui · Jaehong Yoon · Linjun Zhang · Zhun Deng · Chelsea Finn · Mohit Bansal · Huaxiu Yao
- Chain of Natural Language Inference for Reducing Large Language Model Hallucinations (Poster) | Deren Lei · Yaxi Li · Mengya Hu · Mingyu Wang · Xi Yun
- Chain-of-Thought Reasoning is a Policy Improvement Operator (Poster) | Hugh Zhang · David Parkes
- Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints (Poster) | Chaoqi Wang · Yibo Jiang · Chenghao Yang · Han Liu · Yuxin Chen
- Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks (Poster) | Lingfeng Sun · Devesh Jha · Chiori Hori · Siddarth Jain · Radu Corcodel · Xinghao Zhu · Masayoshi Tomizuka · Diego Romeres
- Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs (Poster) | Qingru Zhang · Chandan Singh · Liyuan Liu · Xiaodong Liu · Bin Yu · Jianfeng Gao · Tuo Zhao
- Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications (Poster) | Fengqing Jiang · Zhangchen Xu · Luyao Niu · Boxin Wang · Jinyuan Jia · Bo Li · Radha Poovendran
- Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game (Poster) | Sam Toyer · Olivia Watkins · Ethan Mendes · Justin Svegliato · Luke Bailey · Tiffany Wang · Isaac Ong · Karim Elmaaroufi · Pieter Abbeel · Trevor Darrell · Alan Ritter · Stuart J Russell
- A Case Study of Instruction Tuning with Mixture of Parameter-Efficient Experts (Poster) | Oleksiy Ostapenko · Lucas Page-Caccia · Zhan Su · Nicolas Le Roux · Laurent Charlin · Alessandro Sordoni
- Investigating the Catastrophic Forgetting in Multimodal Large Language Models (Poster) | Simon Zhai · Shengbang Tong · Xiao Li · Mu Cai · Qing Qu · Yong Jae Lee · Yi Ma
- LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms (Poster) | Aditi Jha · Sam Havens · Jeremy Dohmann · Alexander Trott · Jacob Portes
- Let's Reinforce Step by Step (Poster) | Sarah Pan · Vladislav Lialin · Sherin Muckatira · Anna Rumshisky
- DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datasets (Poster) | Young-Jun Lee · Byung Soo Ko · Han-Gyu Kim · Jonghwan Hyeon · Ho-Jin Choi
- Knowledge Augmented Instruction Tuning for Zero-shot Animal Species Recognition (Poster) | Zalan Fabian · Zhongqi Miao · Chunyuan Li · Yuanhan Zhang · Ziwei Liu · Andres Hernandez · Pablo Arbelaez · Andrés Link · Andrés Montes-Rojas · Rafael Escucha · Laura Siabatto · Rahul Dodhia · Juan Lavista Ferres
- Reward Model Ensembles Help Mitigate Overoptimization (Poster) | Thomas Coste · Usman Anwar · Robert Kirk · David Krueger
- NexusRaven: a commercially-permissive Language Model for function calling (Poster) | Venkat Krishna Srinivasan · Zhen Dong · Banghua Zhu · Brian Yu · Hanzi Mao · Damon Mosk-Aoyama · Kurt Keutzer · Jiantao Jiao · Jian Zhang
- How Long Can Context Length of Open-Source LLMs truly Promise? (Poster) | Dacheng Li · Rulin Shao · Anze Xie · Ying Sheng · Lianmin Zheng · Joseph Gonzalez · Ion Stoica · Xuezhe Ma · Hao Zhang
- Learning to Generate Better Than Your LLM (Poster) | Jonathan Chang · Kianté Brantley · Rajkumar Ramamurthy · Dipendra Misra · Wen Sun
- From Classification to Generation: Insights into Crosslingual Retrieval Augmented ICL (Poster) | Xiaoqian Li · Ercong Nie · Sheng Liang
- Reward Model Aggregation (Poster) | Zihao Wang · Chirag Nagpal · Alexander D'Amour · Victor Veitch · Sanmi Koyejo
- Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions (Poster) | Taehyeon Kim · Joonkee Kim · Gihun Lee · Se-Young Yun
- Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language (Poster) | Di Jin · Shikib Mehri · Devamanyu Hazarika · Aishwarya Padmakumar · Sungjin Lee · Yang Liu · Mahdi Namazifar
- Releasing the CRaQAn (Coreference Resolution in Question-Answering): An open-source dataset and dataset creation methodology using instruction-following models (Poster) | Rob Grzywinski · Joshua DArcy · Robert Naidoff · Ashish Shukla · Alex Browne · Ren Gibbons · Brinnae Bent
- CIEM: Contrastive Instruction Evaluation Method for Better Instruction Tuning (Poster) | Hongyu Hu · Jiyuan Zhang · Minyi Zhao · Zhenbang Sun
- Understanding Hidden Context in Preference Learning: Consequences for RLHF (Poster) | Anand Siththaranjan · Cassidy Laidlaw · Dylan Hadfield-Menell
- Past as a Guide: Leveraging Retrospective Learning for Python Code Completion (Poster) | Seungyoun Shin · Seunggyu Chang · Sungjoon Choi
- FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets (Poster) | Neng Wang · Hongyang Yang · Christina Wang
- Large Language Models are Zero Shot Hypothesis Proposers (Poster) | Biqing Qi · Kaiyan Zhang · Haoxiang Li · Kai Tian · Sihang Zeng · Zhang-Ren Chen · Bowen Zhou
- OctoPack: Instruction Tuning Code Large Language Models (Poster) | Niklas Muennighoff · Qian Liu · Armel Zebaze · Qinkai Zheng · Binyuan Hui · Terry Yue Zhuo · Swayam Singh · Xiangru Tang · Leandro Von Werra · Shayne Longpre
- Approximate Clustering for Extracting Task Relationships in Multi-Instruction Tuning (Poster) | Dongyue Li · Jinhong Yu · Hongyang Zhang
- Understanding the Effects of RLHF on LLM Generalisation and Diversity (Poster) | Robert Kirk · Ishita Mediratta · Christoforos Nalmpantis · Jelena Luketina · Eric Hambro · Edward Grefenstette · Roberta Raileanu
- Group Preference Optimization: Few-Shot Alignment of Large Language Models (Poster) | Siyan Zhao · John Dang · Aditya Grover
- Platypus: Quick, Cheap, and Powerful Refinement of LLMs (Poster) | Ariel Lee · Cole Hunter · Nataniel Ruiz
- FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets (Poster) | Seonghyeon Ye · Doyoung Kim · Sungdong Kim · Hyeonbin Hwang · Seungone Kim · Yongrae Jo · James Thorne · Juho Kim · Minjoon Seo
- Learning Interactive Real-World Simulators (Poster) | Sherry Yang · Yilun Du · Kamyar Ghasemipour · Jonathan Tompson · Dale Schuurmans · Pieter Abbeel
- FinGPT: Democratizing Internet-scale Data for Financial Large Language Models (Poster) | Xiao-Yang Liu · Guoxuan Wang · Hongyang Yang · Daochen Zha
- Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics (Poster) | Haoqin Tu · Bingchen Zhao · Chen Wei · Cihang Xie
- A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction (Poster) | Erica Cai · Brendan O'Connor
- An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models (Poster) | Yadong Lu · Chunyuan Li · Haotian Liu · Jianwei Yang · Jianfeng Gao · Yelong Shen
- For Distillation, Tokens Are Not All You Need (Poster) | Mrigank Raman · Pranav Mani · Davis Liang · Zachary Lipton
- Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following (Poster) | Seonghyeon Ye · Hyeonbin Hwang · Sohee Yang · HyeonGu Yun · Yireun Kim · Minjoon Seo
- Simulating Iterative Human-AI Interaction in Programming with LLMs (Poster) | Hussein Mozannar · Valerie Chen · Dennis Wei · Prasanna Sattigeri · Manish Nagireddy · Subhro Das · Ameet Talwalkar · David Sontag
- Balancing Multiple Objectives for Efficient Metaprompts for Data Labeling Tasks with Extensive Guidelines (Poster) | Tobias Schnabel · Jennifer Neville