Instruction Tuning and Instruction Following

Workshop

Instruction Tuning and Instruction Following

Qinyuan Ye · Yizhong Wang · Shayne Longpre · Yao Fu · Daniel Khashabi

Room 220 - 222

Fri 15 Dec, 6:30 a.m. PST

[ Abstract ] Workshop Website

Recent advancements in training large language models (LLMs) to follow “instructions” have significantly increased their ability to comprehend open-ended language commands, encompassing a wide range of needs, preferences, and values.

This remarkable transformation has led to the creation of remarkable industrial models such as GPT-4 and Bard, as well as an increased focus within the open-source and research communities: creating new benchmark and resources, developing new training methods, and understanding the limitations of these methods. Furthermore, instruction following powered by LLMs has proven to be effective in multi-modal settings, with applications in image editing and robotic command execution.

We organize this workshop to facilitate discussions on advancing instruction tuning methodologies and constructing general-purpose instruction-following models. We believe it is crucial to organize this workshop due to the prevalence of proprietary models with restricted access, thereby creating the need for an open platform to encourage discussions. Moreover, we aim to foster interdisciplinary collaboration by bringing together researchers from diverse fields such as natural language processing, computer vision, robotics, human-computer interaction, AI safety, among others, to share their latest findings and explore potential avenues for future research.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Fri 6:40 a.m. - 7:00 a.m.	Opening Remarks ( Opening Remarks ) > SlidesLive Video	🔗
Fri 7:00 a.m. - 7:30 a.m.	Invited Talk 1 - Tatsunori Hashimoto ( Talk ) > SlidesLive Video	🔗
Fri 7:30 a.m. - 8:00 a.m.	Invited Talk 2 - Nazneen Rajani ( Talk ) > SlidesLive Video	🔗
Fri 8:00 a.m. - 8:15 a.m.	Break	🔗
Fri 8:15 a.m. - 8:45 a.m.	Invited Talk 3 – Fei Xia ( Talk ) > SlidesLive Video	🔗
Fri 8:45 a.m. - 9:30 a.m.	Panel 1: Key Techniques, Insights, and Challenges in Building Instruction-following Models ( Discussion Panel ) > SlidesLive Video	🔗
Fri 11:00 a.m. - 12:00 p.m.	Poster Session ( Poster Session ) >	🔗
Fri 12:00 p.m. - 12:30 p.m.	Invited Talk 4 - Sara Hooker ( Talk ) > SlidesLive Video	🔗
Fri 12:30 p.m. - 1:00 p.m.	Invited Talk 5 - Alex Tamkin ( Talk ) > SlidesLive Video	🔗
Fri 1:00 p.m. - 1:15 p.m.	Break	🔗
Fri 1:15 p.m. - 2:00 p.m.	Panel 2: Open and Collaborative Strategies for the Large Language Models Adaptation ( Discussion Panel ) > SlidesLive Video	🔗
Fri 2:00 p.m. - 3:20 p.m.	Oral Presentations ( Spotlight ) > SlidesLive Video	🔗
Fri 3:20 p.m. - 3:30 p.m.	Closing Remarks ( Closing Remarks ) > SlidesLive Video	🔗
-	Improved Baselines with Visual Instruction Tuning ( Poster ) > link Link	Haotian Liu · Chunyuan Li · Yuheng Li · Yong Jae Lee 🔗
-	Can LLM-Generated Misinformation Be Detected? ( Poster ) > link Link	Canyu Chen · Kai Shu 🔗
-	Prometheus: Inducing Evaluation Capability in Language Models ( Poster ) > link Link	11 presenters Seungone Kim · Jamin Shin · Yejin Cho · Joel Jang · Shayne Longpre · Hwaran Lee · Sangdoo Yun · Seongjin Shin · Sungdong Kim · James Thorne · Minjoon Seo 🔗
-	Instruction-tuned LLMs with World Knowledge are More Aligned to the Human Brain ( Poster ) > link Link	Khai Loong Aw · Syrielle Montariol · Badr AlKhamissi · Martin Schrimpf · Antoine Bosselut 🔗
-	Ring Attention with Blockwise Transformers for Near-Infinite Context ( Poster ) > link Link	Hao Liu · Matei A Zaharia · Pieter Abbeel 🔗
-	Reflection-Tuning: Recycling Data for Better Instruction-Tuning ( Poster ) > link Link	Ming Li · Lichang Chen · Jiuhai Chen · Shwai He · Tianyi Zhou 🔗
-	Supervised Fine-Tuning of Large Language Models on Human Demonstrations Through the Lens of Memorization ( Poster ) > link Link	Yubin Ge · Devamanyu Hazarika · Yang Liu · Mahdi Namazifar 🔗
-	Grounding Code Generation with Input-Output Specifications ( Poster ) > link Link	Yeming Wen · Pengcheng Yin · Kensen Shi · Henryk Michalewski · Swarat Chaudhuri · Oleksandr Polozov 🔗
-	#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models ( Poster ) > link Link	Keming Lu · Hongyi Yuan · Zheng Yuan · Runji Lin · Junyang Lin · Chuanqi Tan · Chang Zhou · Jingren Zhou 🔗
-	Training Speech Recognition Models to Follow Instructions ( Poster ) > link Link	Cheng-I Jeff Lai · Zhiyun Lu · Liangliang Cao · Ruoming Pang 🔗
-	Enhanced Visual Instruction Tuning for Text-Rich Image Understanding ( Poster ) > link Link	Yanzhe Zhang · Ruiyi Zhang · Jiuxiang Gu · Yufan Zhou · Nedim Lipka · Diyi Yang · Tong Sun 🔗
-	Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning ( Poster ) > link Link	Sagar Srinivas Sakhinana · Venkataramana Runkana 🔗
-	Use Your INSTINCT: INSTruction optimization usIng Neural bandits Coupled with Transformers ( Poster ) > link Link	Xiaoqiang Lin · Zhaoxuan Wu · Zhongxiang Dai · Wenyang Hu · YAO SHU · See-Kiong Ng · Patrick Jaillet · Bryan Kian Hsiang Low 🔗
-	Learning to Generate Instructions to Adapt Language Models to New Tasks ( Poster ) > link Link	Nihal Nayak · Yiyang Nan · Avi Trost · Stephen Bach 🔗
-	An Emulator for Fine-tuning Large Language Models using Small Language Models ( Poster ) > link Link	Eric Mitchell · Rafael Rafailov · Archit Sharma · Chelsea Finn · Christopher D Manning 🔗
-	Evaluating Large Language Models at Evaluating Instruction Following ( Poster ) > link Link	Zhiyuan Zeng · Jiatong Yu · Tianyu Gao · Yu Meng · Tanya Goyal · Danqi Chen 🔗
-	Instruction-following Evaluation through Verbalizer Manipulation ( Poster ) > link Link	Shiyang Li · Jun Yan · Hai Wang · Zheng Tang · Xiang Ren · Vijay Srinivasan · Hongxia Jin 🔗
-	Delve into PPO: Implementation Matters for Stable RLHF ( Poster ) > link Link	19 presenters Rui Zheng · Shihan Dou · Songyang Gao · Yuan Hua · Wei Shen · Binghai Wang · Yan Liu · Senjie Jin · Yuhao Zhou · Limao Xiong · Lu Chen · Zhiheng Xi · Nuo Xu · Wenbin Lai · Minghao Zhu · Haoran Huang · Tao Gui · Qi Zhang · Xuanjing Huang 🔗
-	NLPBench: Evaluating Large Language Models on Solving NLP Problems ( Poster ) > link Link	Linxin Song · Jieyu Zhang · Lechao Cheng · Pengyuan Zhou · Tianyi Zhou · Zihui Li 🔗
-	Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing ( Poster ) > link Link	Xinyu Hu · Pengfei Tang · Simiao Zuo · Zihan Wang · Bowen Song · Qiang Lou · Jian Jiao · Denis Charles 🔗
-	URIAL: Tuning-Free Instruction Learning and Alignment for Untuned LLMs ( Poster ) > link Link	Bill Yuchen Lin · Abhilasha Ravichander · Ximing Lu · Nouha Dziri · Melanie Sclar · Khyathi Chandu · Chandra Bhagavatula · Yejin Choi 🔗
-	Verbosity Bias in Preference Labeling by Large Language Models ( Poster ) > link Link	Keita Saito · Akifumi Wachi · Koki Wataoka · Youhei Akimoto 🔗
-	Automatic Construction of a Korean Toxic Instruction Dataset for Ethical Tuning of Large Language Models ( Poster ) > link Link	SungJoo Byun · Dongjun Jang · Hyemi Jo · HYOPIL SHIN 🔗
-	Fine-tuning Language Models for Factuality ( Poster ) > link Link	Katherine Tian · Eric Mitchell · Huaxiu Yao · Christopher D Manning · Chelsea Finn 🔗
-	Self-RAG: Self-reflective Retrieval Augmented Generation ( Poster ) > link Link	Akari Asai · Zeqiu Wu · Yizhong Wang · Avi Sil · Hannaneh Hajishirzi 🔗
-	FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation ( Poster ) > link Link	Sewon Min · Kalpesh Krishna · Xinxi Lyu · Mike Lewis · Scott Yih · Pang Wei Koh · Mohit Iyyer · Luke Zettlemoyer · Hannaneh Hajishirzi 🔗
-	Hierarchical Network Fusion for Multi-Modal Electron Micrograph Representation Learning with Foundational Large Language Models ( Poster ) > link Link	Sagar Srinivas Sakhinana · Sannidhi G N K Geethan · Venkataramana Runkana 🔗
-	Exploring and Improving the Spatial Reasoning Abilities of Large Language Models ( Poster ) > link Link	Manasi Sharma 🔗
-	Investigating the Effects of Zero-Shot Chain-of-Thought on Empathetic Dialogue Generation ( Poster ) > link Link	Young-Jun Lee · Dokyong Lee · Jihui Im · Joo Won Sung · Ho-Jin Choi 🔗
-	Analyzing and Mitigating Object Hallucination in Large Vision-Language Models ( Poster ) > link Link	Yiyang Zhou · Chenhang Cui · Jaehong Yoon · Linjun Zhang · Zhun Deng · Chelsea Finn · Mohit Bansal · Huaxiu Yao 🔗
-	Chain of Natural Language Inference for Reducing Large Language Model Hallucinations ( Poster ) > link Link	Deren Lei · Yaxi Li · Mengya Hu · Mingyu Wang · Xi Yun 🔗
-	Chain-of-Thought Reasoning is a Policy Improvement Operator ( Poster ) > link Link	Hugh Zhang · David Parkes 🔗
-	Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints ( Poster ) > link Link	Chaoqi Wang · Yibo Jiang · Chenghao Yang · Han Liu · Yuxin Chen 🔗
-	Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks ( Poster ) > link Link	Lingfeng Sun · Devesh Jha · Chiori HORI · Siddarth Jain · Radu Corcodel · Xinghao Zhu · Masayoshi TOMIZUKA · Diego Romeres 🔗
-	Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs ( Poster ) > link Link	Qingru Zhang · Chandan Singh · Liyuan Liu · Xiaodong Liu · Bin Yu · Jianfeng Gao · Tuo Zhao 🔗
-	Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications ( Poster ) > link Link	Fengqing Jiang · Zhangchen Xu · Luyao Niu · Boxin Wang · Jinyuan Jia · Bo Li · Radha Poovendran 🔗
-	Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game ( Poster ) > link Link	12 presenters Sam Toyer · Olivia Watkins · Ethan Mendes · Justin Svegliato · Luke Bailey · Tiffany Wang · Isaac Ong · Karim Elmaaroufi · Pieter Abbeel · Trevor Darrell · Alan Ritter · Stuart J Russell 🔗
-	A Case Study of Instruction Tuning with Mixture of Parameter-Efficient Experts ( Poster ) > link Link	Oleksiy Ostapenko · Lucas Page-Caccia · Zhan Su · Nicolas Le Roux · Laurent Charlin · Alessandro Sordoni 🔗
-	Investigating the Catastrophic Forgetting in Multimodal Large Language Models ( Poster ) > link Link	Simon Zhai · Shengbang Tong · Xiao Li · Mu Cai · Qing Qu · Yong Jae Lee · Yi Ma 🔗
-	LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms ( Poster ) > link Link	Aditi Jha · Sam Havens · Jeremy Dohmann · Alexander Trott · Jacob Portes 🔗
-	Let's Reinforce Step by Step ( Poster ) > link Link	Sarah Pan · Vladislav Lialin · Sherin Muckatira · Anna Rumshisky 🔗
-	DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datasets ( Poster ) > link Link	Young-Jun Lee · Byung Soo Ko · Han-Gyu Kim · Jonghwan Hyeon · Ho-Jin Choi 🔗
-	Knowledge Augmented Instruction Tuning for Zero-shot Animal Species Recognition ( Poster ) > link Link	13 presenters Zalan Fabian · Zhongqi Miao · Chunyuan Li · Yuanhan Zhang · Ziwei Liu · Andres Hernandez · Pablo Arbelaez · Andrés Link · Andrés Montes-Rojas · Rafael Escucha · Laura Siabatto · Rahul Dodhia · Juan Lavista Ferres 🔗
-	Reward Model Ensembles Help Mitigate Overoptimization ( Poster ) > link Link	Thomas Coste · Usman Anwar · Robert Kirk · David Krueger 🔗
-	NexusRaven: a commercially-permissive Language Model for function calling ( Poster ) > link Link	Venkat Krishna Srinivasan · Zhen Dong · Banghua Zhu · Brian Yu · Hanzi Mao · Damon Mosk-Aoyama · Kurt Keutzer · Jiantao Jiao · Jian Zhang 🔗
-	How Long Can Context Length of Open-Source LLMs truly Promise? ( Poster ) > link Link	Dacheng Li · Rulin Shao · Anze Xie · Ying Sheng · Lianmin Zheng · Joseph Gonzalez · Ion Stoica · Xuezhe Ma · Hao Zhang 🔗
-	Learning to Generate Better Than Your LLM ( Poster ) > link Link	Jonathan Chang · Kianté Brantley · Rajkumar Ramamurthy · Dipendra Misra · Wen Sun 🔗
-	From Classification to Generation: Insights into Crosslingual Retrieval Augmented ICL ( Poster ) > link Link	Xiaoqian Li · Ercong Nie · Sheng Liang 🔗
-	Reward Model Aggregation ( Poster ) > link Link	Zihao Wang · Chirag Nagpal · Alexander D'Amour · Victor Veitch · Sanmi Koyejo 🔗
-	Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions ( Poster ) > link Link	Taehyeon Kim · Joonkee Kim · Gihun Lee · Se-Young Yun 🔗
-	Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language ( Poster ) > link Link	Di Jin · Shikib Mehri · Devamanyu Hazarika · Aishwarya Padmakumar · SUNGJIN LEE · Yang Liu · Mahdi Namazifar 🔗
-	Releasing the CRaQAn (Coreference Resolution in Question-Answering): An open-source dataset and dataset creation methodology using instruction-following models ( Poster ) > link Link	Rob Grzywinski · Joshua DArcy · Robert Naidoff · Ashish Shukla · Alex Browne · Ren Gibbons · Brinnae Bent 🔗
-	CIEM: Contrastive Instruction Evaluation Method for Better Instruction Tuning ( Poster ) > link Link	Hongyu Hu · Jiyuan Zhang · Minyi Zhao · Zhenbang Sun 🔗
-	Understanding Hidden Context in Preference Learning: Consequences for RLHF ( Poster ) > link Link	Anand Siththaranajn · Cassidy Laidlaw · Dylan Hadfield-Menell 🔗
-	Past as a Guide: Leveraging Retrospective Learning for Python Code Completion ( Poster ) > link Link	Seungyoun Shin · Seunggyu Chang · Sungjoon Choi 🔗
-	FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets ( Poster ) > link Link	Neng Wang · Hongyang Yang · Christina Wang 🔗
-	Large Language Models are Zero Shot Hypothesis Proposers ( Poster ) > link Link	Biqing Qi · kaiyan zhang · Haoxiang Li · Kai Tian · Sihang Zeng · Zhang-Ren Chen · Bowen Zhou 🔗
-	OctoPack: Instruction Tuning Code Large Language Models ( Poster ) > link Link	Niklas Muennighoff · Qian Liu · Armel Zebaze · Qinkai Zheng · Binyuan Hui · Terry Yue Zhuo · Swayam Singh · Xiangru Tang · Leandro Von Werra · Shayne Longpre 🔗
-	Approximate Clustering for Extracting Task Relationships in Multi-Instruction Tuning ( Poster ) > link Link	Dongyue Li · Jinhong Yu · Hongyang Zhang 🔗
-	Understanding the Effects of RLHF on LLM Generalisation and Diversity ( Poster ) > link Link	Robert Kirk · Ishita Mediratta · Christoforos Nalmpantis · Jelena Luketina · Eric Hambro · Edward Grefenstette · Roberta Raileanu 🔗
-	Group Preference Optimization: Few-Shot Alignment of Large Language Models ( Poster ) > link Link	Siyan Zhao · John Dang · Aditya Grover 🔗
-	Platypus: Quick, Cheap, and Powerful Refinement of LLMs ( Poster ) > link Link	Ariel Lee · Cole Hunter · Nataniel Ruiz 🔗
-	FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets ( Poster ) > link Link	Seonghyeon Ye · Doyoung Kim · Sungdong Kim · Hyeonbin Hwang · Seungone Kim · Yongrae Jo · James Thorne · Juho Kim · Minjoon Seo 🔗
-	Learning Interactive Real-World Simulators ( Poster ) > link Link	Sherry Yang · Yilun Du · Kamyar Ghasemipour · Jonathan Tompson · Dale Schuurmans · Pieter Abbeel 🔗
-	FinGPT: Democratizing Internet-scale Data for Financial Large Language Models ( Poster ) > link Link	Xiao-Yang Liu · Guoxuan Wang · Hongyang Yang · Daochen Zha 🔗
-	Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics ( Poster ) > link Link	Haoqin Tu · Bingchen Zhao · Chen Wei · Cihang Xie 🔗
-	A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction ( Poster ) > link Link	Erica Cai · Brendan O'Connor 🔗
-	An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models ( Poster ) > link Link	Yadong Lu · Chunyuan Li · Haotian Liu · Jianwei Yang · Jianfeng Gao · yelong shen 🔗
-	For Distillation, Tokens Are Not All You Need ( Poster ) > link Link	Mrigank Raman · Pranav Mani · Davis Liang · Zachary Lipton 🔗
-	Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following ( Poster ) > link Link	Seonghyeon Ye · Hyeonbin Hwang · Sohee Yang · HyeonGu Yun · Yireun Kim · Minjoon Seo 🔗
-	Simulating Iterative Human-AI Interaction in Programming with LLMs ( Poster ) > link Link	Hussein Mozannar · Valerie Chen · Dennis Wei · Prasanna Sattigeri · Manish Nagireddy · Subhro Das · Ameet Talwalkar · David Sontag 🔗
-	Balancing Multiple Objectives for Efficient Metaprompts for Data Labeling Tasks with Extensive Guidelines ( Poster ) > link Link	Tobias Schnabel · Jennifer Neville 🔗