Fri 5:30 a.m. - 5:50 a.m.
|
Breakfast
(
Breakfast
)
>
|
🔗
|
Fri 5:50 a.m. - 6:00 a.m.
|
Opening Remarks
(
Opening
)
>
|
🔗
|
Fri 6:00 a.m. - 6:30 a.m.
|
Fine-grained Interactive Vision Language Pre-training
(
KeyNote Talk
)
>
SlidesLive Video
|
Lu Hou · Lu Hou
🔗
|
Fri 6:30 a.m. - 7:05 a.m.
|
​Efficiency Tradeoffs in the Design of Neural Search Systems
(
KeyNote Talk
)
>
SlidesLive Video
|
Jimmy Lin
🔗
|
Fri 7:05 a.m. - 7:35 a.m.
|
Last Advances in End-to-End Speech Recognition
(
KeyNote Talk
)
>
|
Tara Sainath
🔗
|
Fri 7:35 a.m. - 7:45 a.m.
|
Collective Knowledge Graph Completion with Mutual Knowledge Distillation
(
Spotlight
)
>
SlidesLive Video
|
Weihang Zhang · Ovidiu Serban · Jiahao Sun · Yike Guo
🔗
|
Fri 7:45 a.m. - 7:56 a.m.
|
Attribute Controlled Dialogue Prompting
(
Spotlight
)
>
SlidesLive Video
|
Runcheng Liu · Ahmad Rashid · Ivan Kobyzev · Mehdi Rezaghoizadeh · Pascal Poupart
🔗
|
Fri 7:56 a.m. - 8:05 a.m.
|
Fast DistilBERT on CPUs
(
Spotlight
)
>
SlidesLive Video
|
Haihao Shen · Ofir Zafrir · Bo Dong · Hengyu Meng · Xinyu Ye · Zhe Wang · Yi Ding · Hanwen Chang · Guy Boudoukh · Moshe Wasserblat
🔗
|
Fri 8:00 a.m. - 8:30 a.m.
|
Morning Break and Poster Session 1
(
Break and Poster Session
)
>
|
🔗
|
Fri 8:30 a.m. - 9:05 a.m.
|
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
(
KeyNote Talk
)
>
SlidesLive Video
|
Song Han
🔗
|
Fri 9:05 a.m. - 9:35 a.m.
|
Building Language Models Based on Retrieval
(
KeyNote Talk
)
>
SlidesLive Video
|
Danqi Chen
🔗
|
Fri 9:35 a.m. - 10:05 a.m.
|
Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training
(
KeyNote Talk
)
>
SlidesLive Video
|
Yang You
🔗
|
Fri 10:05 a.m. - 10:15 a.m.
|
Efficient Few-Shot Learning Without Prompts
(
Spotlight
)
>
SlidesLive Video
|
Oren Pereg · Daniel Korat · Moshe Wasserblat · Lewis Tunstall · Unso Eun Seo Jo · Luke Bates · Nils Reimers
🔗
|
Fri 10:15 a.m. - 10:25 a.m.
|
PCFG-based Natural Language Interface Improves Generalization for Controlled Text Generation
(
Spotlight
)
>
SlidesLive Video
|
Jingyu Zhang · Jim Glass · Tianxing He
🔗
|
Fri 10:25 a.m. - 10:35 a.m.
|
PromptDA: Label-guided Data Augmentation for Prompt-based Few Shot Learners
(
Spotlight
)
>
SlidesLive Video
|
Canyu Chen · Kai Shu
🔗
|
Fri 10:30 a.m. - 11:30 a.m.
|
Lunch Break and Virtual Poster Session
link
|
🔗
|
Fri 11:30 a.m. - 12:00 p.m.
|
Efficient Identify Event Causality with Knowledge and Analogy
(
KeyNote Talk
)
>
SlidesLive Video
|
Bang Liu
🔗
|
Fri 12:00 p.m. - 12:50 p.m.
|
Interactive Industrial Panel
(
Discussion Panel
)
>
SlidesLive Video
|
Jiahao Sun · Ahmed Ibrahim · Marjan Ghazvininejad · Yu Cheng · Boxing Chen · Mohammad Norouzi · Rahul Gupta
🔗
|
Fri 12:50 p.m. - 12:59 p.m.
|
Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement
(
Spotlight
)
>
SlidesLive Video
|
Heitor Guimarães · Arthur Pimentel · Anderson R. Avila · Mehdi Rezaghoizadeh · Tiago H Falk
🔗
|
Fri 12:59 p.m. - 1:05 p.m.
|
Gradient Knowledge Distillation for Pre-trained Language Models
(
Spotlight
)
>
SlidesLive Video
|
Lean Wang · Lei Li · Xu Sun
🔗
|
Fri 1:00 p.m. - 1:30 p.m.
|
Break and Poster Session II
(
Break and Poster Session
)
>
|
🔗
|
Fri 1:30 p.m. - 2:05 p.m.
|
Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval
(
KeyNote Talk
)
>
SlidesLive Video
|
Graham Neubig
🔗
|
Fri 2:05 p.m. - 2:35 p.m.
|
Do we still need inductive biases after Transformer language models?
(
KeyNote Talk
)
>
SlidesLive Video
|
Siva Reddy
🔗
|
Fri 2:35 p.m. - 3:05 p.m.
|
8-bit Methods for Efficient Deep Learning
(
KeyNote Talk
)
>
SlidesLive Video
|
Tim Dettmers
🔗
|
Fri 3:05 p.m. - 3:35 p.m.
|
Efficient Controllable Generative Models for Music and Performance Synthesis
(
KeyNote Talk
)
>
SlidesLive Video
|
Cheng-Zhi Anna Huang
🔗
|
Fri 3:35 p.m. - 3:45 p.m.
|
Best Paper and Poster Awards
(
Closing remark
)
>
SlidesLive Video
|
🔗
|
-
|
Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning
(
Poster
)
>
SlidesLive Video
|
Mingyu Derek Ma · Jiun-Yu Kao · Shuyang Gao · arpit gupta · Di Jin · Tagyoung Chung · Nanyun Peng
🔗
|
-
|
BERT on a Data Diet: Finding Important Examples by Gradient-Based Pruning
(
Poster
)
>
SlidesLive Video
|
Mohsen Fayyaz · Ehsan Aghazadeh · Seyed MohammadAli Modarressi · Mohammad Taher Pilehvar · Yadollah Yaghoobzadeh · Samira Ebrahimi Kahou
🔗
|
-
|
Pre-Training a Graph Recurrent Network for Language Representation
(
Poster
)
>
SlidesLive Video
|
Yile Wang · Linyi Yang · Zhiyang Teng · Ming Zhou · Yue Zhang
🔗
|
-
|
An Efficient Memory-Augmented Transformer for Knowledge-Intensive NLP Tasks
(
Poster
)
>
SlidesLive Video
|
Yuxiang Wu · Yu Zhao · Baotian Hu · Pasquale Minervini · Pontus Lars Erik Saito Stenetorp · Sebastian Riedel
🔗
|
-
|
QuaLA-MiniLM: a Quantized Length Adaptive MiniLM
(
Poster
)
>
SlidesLive Video
|
Shira Guskin · Moshe Wasserblat · Haihao Shen · Chang Wang
🔗
|
-
|
Towards Data Efficient And Robust Speech Representation Model Distillation
(
Poster
)
>
SlidesLive Video
|
Pheobe Sun · Ruibo Shi · Ahmad Emami · Sean Moran
🔗
|
-
|
On Spectral and Temporal Feature Encoding Behaviour in Stacked Architectures
(
Poster
)
>
SlidesLive Video
|
Vaibhav Singh · Vinayak Abrol · Karan Nathwani
🔗
|
-
|
Few-Shot Aspect Extraction using Prompt Training
(
Poster
)
>
SlidesLive Video
|
Oren Pereg · Daniel Korat · Moshe Wasserblat · Kfir Bar
🔗
|
-
|
Can we get smarter than majority vote? Efficient use of individual rater’s labels for content moderation
(
Poster
)
>
|
Changho Shin · Alice Schoenauer-Sebag
🔗
|
-
|
BudgetLongformer: Can we Cheaply Pretrain a SOTA Legal Language Model From Scratch?
(
Poster
)
>
SlidesLive Video
|
Joel Niklaus · Daniele Giofrè
🔗
|
-
|
Parameter-Efficient Finetuning of Transformers for Source Code
(
Poster
)
>
SlidesLive Video
|
Shamil Ayupov · Nadezhda Chirkova
🔗
|
-
|
Graph Masking Pre-training for Graph-to-Text Generation
(
Poster
)
>
SlidesLive Video
|
Jiuzhou Han · Ehsan Shareghi
🔗
|
-
|
The Ineffectiveness of TKGE Models in Encoding Real-World Knowledge Graphs
(
Poster
)
>
SlidesLive Video
|
Chuan Ming Ong · Jiahao Sun · Ovidiu Serban · Yike Guo
🔗
|
-
|
PEST: Combining Parameter-Efficient Fine-Tuning with Self-Training and Co-Training
(
Poster
)
>
SlidesLive Video
|
Hunter Lang · Monica Agrawal · Yoon Kim · David Sontag
🔗
|
-
|
ContextNER: Contextual Phrase Generation at Scale
(
Poster
)
>
SlidesLive Video
|
Himanshu Gupta · Shreyas Verma · Tarun Kumar · Swaroop Mishra · Tamanna Agrawal · Amogh Badugu · Himanshu Bhatt
🔗
|
-
|
Efficient Speech Translation with Pre-trained models
(
Poster
)
>
SlidesLive Video
|
Zhaolin Li · Jan Niehues
🔗
|
-
|
Dynamic Query Representation for Extractive Question Answering
(
Poster
)
>
SlidesLive Video
|
Urchade Zaratiana · Niama El Khbir · Dennis Núñez-Fernández · Pierre Holat · Nadi Tomeh · Thierry Charnois
🔗
|
-
|
Strategies for Applying Low Rank Decomposition to Transformer-Based Models
(
Poster
)
>
SlidesLive Video
|
Habib Hajimolahoseini · Walid Ahmed · Mehdi Rezaghoizadeh · Vahid Partovi Nia · Yang Liu
🔗
|
-
|
DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low Rank Adaptation
(
Poster
)
>
SlidesLive Video
|
Mojtaba Valipour · Mehdi Rezaghoizadeh · Ivan Kobyzev · Ali Ghodsi
🔗
|
-
|
Pyramid Dynamic Inference: Encouraging Faster Inference via Early Exit Boosting
(
Poster
)
>
SlidesLive Video
|
Ershad Banijamali · Pegah Kharazmi · Samridhi Choudhary · Sepehr Eghbali · Clement Chung
🔗
|
-
|
An efficient RNN Language Model using activity sparsity and sparse back-propagation through time
(
Poster
)
>
SlidesLive Video
|
Mark Schoene · Khaleelulla Khan Nazeer · David Kappel · Christian Mayr · Anand Subramoney
🔗
|
-
|
An Exploration of Methods for Zero-shot Transfer in Small Language Models
(
Poster
)
>
SlidesLive Video
|
Alon Albalak · Akshat Shrivastava · Chinnadhurai Sankar · Adithya Sagar · Mike Ross
🔗
|
-
|
On the impact of the quality of pseudo-labels on the self-supervised speaker verification task
(
Poster
)
>
SlidesLive Video
|
Abderrahim Fathan · JAHANGIR ALAM · Woo Hyun Kang
🔗
|
-
|
INT8 Transformers for Inference Acceleration
(
Poster
)
>
SlidesLive Video
|
Andy Rock · Omar Khalil · Ofer Shai · Paul Grouchy
🔗
|
-
|
Parameter and Data Efficient Continual Pre-training for Robustness to Dialectal Variance in Arabic
(
Poster
)
>
SlidesLive Video
|
Soumajyoti Sarkar · Saab Mansour · Sailik Sengupta · Sheng Zha · Kaixiang Lin
🔗
|
-
|
Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification
(
Poster
)
>
|
Muhammad ElNokrashy · Badr AlKhamissi · Mona Diab
🔗
|
-
|
SymbolicGPT: A Generative Transformer Model for Symbolic Regression
(
Poster
)
>
SlidesLive Video
|
Mojtaba Valipour · Bowen You · Maysum H Panju · Ali Ghodsi
🔗
|
-
|
Using Informative Data Subsets for Efficient Training of Large Language Models: An Initial Study
(
Poster
)
>
SlidesLive Video
|
H S V N S Kowndinya Renduchintala · Krishnateja Killamsetty · Sumit Bhatia · Milan Aggarwal · Ganesh Ramakrishnan · Rishabh Iyer
🔗
|
-
|
Using Selective Masking as a Bridge between Pre-training and Fine-tuning
(
Poster
)
>
SlidesLive Video
|
Tanish Lad · Himanshu Maheshwari · Shreyas Kottukkal · Radhika Mamidi
🔗
|
-
|
Improved Knowledge Distillation by Utilizing Backward Pass Knowledge in Neural Networks
(
Poster
)
>
SlidesLive Video
|
Aref Jafari · Mehdi Rezaghoizadeh · Ali Ghodsi
🔗
|
-
|
Topic Segmentation in the Wild: Towards Segmentation of Semi-structured & Unstructured Chats
(
Poster
)
>
SlidesLive Video
|
Reshmi Ghosh · Sharanya Kamath · Soundararajan Srinivasan · Dhuri Shrivastava · Samyadeep Basu · Harjeet Kajal
🔗
|
-
|
A Theory of Unsupervised Translation for Understanding Animal Communication
(
Poster
)
>
SlidesLive Video
|
Shafi Goldwasser · David Gruber · Adam Tauman Kalai · Orr Paradise
🔗
|
-
|
Collective Knowledge Graph Completion with Mutual Knowledge Distillation
(
Poster
)
>
SlidesLive Video
|
Weihang Zhang · Ovidiu Serban · Jiahao Sun · Yike Guo
🔗
|
-
|
Gradient Knowledge Distillation for Pre-trained Language Models
(
Poster
)
>
SlidesLive Video
|
Lean Wang · Lei Li · Xu Sun
🔗
|
-
|
Efficient Few-Shot Learning Without Prompts
(
Poster
)
>
SlidesLive Video
|
Oren Pereg · Daniel Korat · Moshe Wasserblat · Lewis Tunstall · Unso Eun Seo Jo · Luke Bates · Nils Reimers
🔗
|
-
|
Fast DistilBERT on CPUs
(
Spotlight
)
>
|
Haihao Shen · Ofir Zafrir · Bo Dong · Hengyu Meng · Xinyu Ye · Zhe Wang · Yi Ding · Hanwen Chang · Guy Boudoukh · Moshe Wasserblat
🔗
|
-
|
PCFG-based Natural Language Interface Improves Generalization for Controlled Text Generation
(
Spotlight
)
>
SlidesLive Video
|
Jingyu Zhang · Jim Glass · Tianxing He
🔗
|
-
|
Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement
(
Poster
)
>
|
Heitor Guimarães · Arthur Pimentel · Anderson R. Avila · Mehdi Rezaghoizadeh · Tiago H Falk
🔗
|
-
|
Attribute Controlled Dialogue Prompting
(
Spotlight
)
>
|
Runcheng Liu · Ahmad Rashid · Ivan Kobyzev · Mehdi Rezaghoizadeh · Pascal Poupart
🔗
|
-
|
PromptDA: Label-guided Data Augmentation for Prompt-based Few Shot Learners
(
Spotlight
)
>
SlidesLive Video
|
Canyu Chen · Kai Shu
🔗
|
-
|
TBD7
(
KeyNote Talk
)
>
|
Kenneth Heafield
🔗
|