Workshop
Scientific Methods for Understanding Neural Networks
Zahra Kadkhodaie · Florentin Guth · Sanae Lotfi · Davis Brown · Micah Goldblum · Valentin De Bortoli · Andrew Saxe
West Meeting Room 205-207
Sun 15 Dec, 8:50 a.m. PST
While deep learning continues to achieve impressive results on an ever-growing range of tasks, our understanding of the principles underlying these successes remains largely limited. This problem is usually tackled from a mathematical point of view, aiming to prove rigorous theorems about optimization or generalization errors of standard algorithms, but so far they have been limited to overly-simplified settings. The main goal of this workshop is to promote a complementary approach that is centered on the use of the scientific method, which forms hypotheses and designs controlled experiments to test them. More specifically, it focuses on empirical analyses of deep networks that can validate or falsify existing theories and assumptions, or answer questions about the success or failure of these models. This approach has been largely underexplored, but has great potential to further our understanding of deep learning and to lead to significant progress in both theory and practice. The secondary goal of this workshop is to build a community of researchers, currently scattered in several subfields, around the common goal of understanding deep learning through a scientific lens.
Schedule
Sun 8:50 a.m. - 9:00 a.m.
|
Opening Remarks
(
Opening remarks by organizers
)
>
SlidesLive Video |
🔗 |
Sun 9:00 a.m. - 9:30 a.m.
|
Tom Goldstein: Can transformers solve harder problems than they were trained on? Scaling up test-time computation via recurrence
(
Keynote Talk
)
>
SlidesLive Video |
Tom Goldstein 🔗 |
Sun 9:30 a.m. - 10:00 a.m.
|
Surya Ganguli: An analytic theory of creativity in convolutional diffusion models
(
Keynote Talk
)
>
SlidesLive Video |
Surya Ganguli 🔗 |
Sun 10:00 a.m. - 10:30 a.m.
|
Hanie Sedghi: Exploring and Improving Planning Capabilities of LLMs
(
Keynote Talk
)
>
SlidesLive Video |
Hanie Sedghi 🔗 |
Sun 10:30 a.m. - 10:50 a.m.
|
Coffee Break
(
Coffee Break
)
>
|
🔗 |
Sun 10:50 a.m. - 11:05 a.m.
|
Christos Perivolaropoulos: Softmax is not enough (for sharp out-of-distribution)
(
Contributed Talk
)
>
SlidesLive Video |
Petar Veličković · Christos Perivolaropoulos · Federico Barbero · Razvan Pascanu 🔗 |
Sun 11:05 a.m. - 11:20 a.m.
|
David Krueger: Input Space Mode Connectivity in Deep Neural Networks
(
Contributed Talk
)
>
SlidesLive Video |
Jakub Vrabel · Ori Shem Ur · Yaron Oz · David Krueger 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Structured Identity Mapping Learning As a Model for Compositional Generalization in Generative Models ( Poster Session ) > link | Yongyi Yang · Core Francisco Park · Ekdeep S Lubana · Maya Okawa · Wei Hu · Hidenori Tanaka 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Exploiting Interpretable Capabilities with Concept-Enhanced Diffusion and Prototype Networks ( Poster Session ) > link | Alba Carballo Castro · Sonia Laguna · Moritz Vandenhirtz · Julia Vogt 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Distributional Scaling Laws for Emergent Capabilities ( Poster Session ) > link | Rosie Zhao · Naomi Saphra · Sham Kakade 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Input Space Mode Connectivity in Deep Neural Networks ( Poster Session ) > link | Jakub Vrabel · Ori Shem Ur · Yaron Oz · David Krueger 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Learned Random Label Predictions as a Neural Network Complexity Metric ( Poster Session ) > link | Marlon Becker · Benjamin Risse 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Investigating Sensitive Directions in GPT-2: An Improved Baseline and Comparative Analysis of SAEs ( Poster Session ) > link | Daniel Lee · Stefan Heimersheim 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
BatchTopK Sparse Autoencoders ( Poster Session ) > link | Bart Bussmann · Patrick Leask · Neel Nanda 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models ( Poster Session ) > link | Zhang · Difan Zou 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
softmax is not enough (for sharp out-of-distribution) ( Poster Session ) > link | Petar Veličković · Christos Perivolaropoulos · Federico Barbero · Razvan Pascanu 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Understanding Visual Concepts Across Models ( Poster Session ) > link | Brandon Trabucco · Max Gurinas · Kyle Doherty · Ruslan Salakhutdinov 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Eliminating Position Bias of Language Models: A Mechanistic Approach ( Poster Session ) > link | Ziqi Wang · Hanlin Zhang · Xiner Li · Kuan-Hao Huang · Chi Han · Shuiwang Ji · Sham Kakade · Hao Peng · Heng Ji 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations ( Poster Session ) > link | Kola Ayonrinde · Michael Pearce · Lee Sharkey 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Sparse autoencoders for dense text embeddings reveal hierarchical feature sub-structure ( Poster Session ) > link | Christine Ye · Charles O'Neill · John Wu · Kartheik Iyer 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Learnability in the Context of Neural Tangent Kernels ( Poster Session ) > link | Progyan Das · Dwip Dalal 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
The Master Key Filters Hypothesis: Deep Filters Are General ( Poster Session ) > link | Zahra Babaiee · Peyman M. Kiasari · Daniela Rus · Radu Grosu 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
The Unreasonable Ineffectiveness of the Deeper Layers ( Poster Session ) > link | Andrey Gromov · Kushal Tirumala · Hassan Shapourian · Paolo Glorioso · Dan Roberts 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Probing the Decision Boundaries of In-context Learning in Large Language Models Download PDF ( Poster Session ) > link | Siyan Zhao · Tung Nguyen · Aditya Grover 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
The Pitfalls of Memorization: When Memorization Hinders Generalization ( Poster Session ) > link | Reza Bayat · Mohammad Pezeshki · Elvis Dohmatob · David Lopez-Paz · Pascal Vincent 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
"Twin Studies" of Factors in OOD Generalization ( Poster Session ) > link | Victoria R. Li · Jenny Kaufmann · David Alvarez-Melis · Naomi Saphra 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Hiding in a Plain Sight: Out-of-Distribution Data in the Logit Space Embeddings ( Poster Session ) > link | Vangjush Komini · Sarunas Girdzijauskas 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Is Saliency Really Captured By Gradient? ( Poster Session ) > link | Nehal Yasin · Jonathon Hare · Antonia Marcu 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
A Continuous-Time Analysis of Adaptive Optimization and Normalization ( Poster Session ) > link | Rhys Gould · Hidenori Tanaka 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference ( Poster Session ) > link | Anton Xue · Avishree Khare · Rajeev Alur · Surbhi Goel · Eric Wong 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics ( Poster Session ) > link | Charlotte Beylier · Simon M. Hofmann · Nico Scherf 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Training Neural Networks for Modularity aids Interpretability ( Poster Session ) > link | Satvik Golechha · Dylan Cope · Nandi Schoots 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Denoising for Manifold Extrapolation ( Poster Session ) > link | Zeyu Yun · Galen Chuang · Derek Dong · Yubei Chen 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
A Method on Searching Better Activation Functions ( Poster Session ) > link | Haoyuan Sun · Zihao Wu · Bo Xia · Pu Chang · Zibin Dong · Yifu Yuan · Yongzhe Chang · Xueqian Wang 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Pre-processing and Compression: Understanding Hidden Representation Refinement Across Imaging Domains via Intrinsic Dimension ( Poster Session ) > link | Nicholas Konz · Maciej Mazurowski 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Model Recycling: Model component reuse to promote in-context learning ( Poster Session ) > link | Lindsay Smith · Chase Goddard · Vudtiwat Ngampruetikorn · David Schwab 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Emergence of Hierarchical Emotion Representations in Large Language Models ( Poster Session ) > link | Bo Zhao · Maya Okawa · Eric Bigelow · Rose Yu · Tomer Ullman · Hidenori Tanaka 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps ( Poster Session ) > link | Fuxiao Liu 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Amplified Early Stopping Bias: Overestimated Performance with Deep Learning ( Poster Session ) > link | Nona Rajabi · Antonio Ribeiro · Miguel Vasco · Danica Kragic 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains ( Poster Session ) > link | Ezra Edelman · Nikolaos Tsilivis · Surbhi Goel · Benjamin Edelman · Eran Malach 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Transformers can reinforcement learn to approximate Gittins Index ( Poster Session ) > link | Vladimir Petrov · Nikhil Vyas · Lucas Janson 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Understanding the Limitations of B-Spline KANs: Convergence Dynamics and Computational Efficiency ( Poster Session ) > link | Avik Pal · Dipankar Das 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Structure Development in List Sorting Transformers ( Poster Session ) > link | Einar Urdshals · Jasmina Urdshals 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Robust Learning in Bayesian Parallel Branching Graph Neural Networks: The Narrow Width Limit ( Poster Session ) > link | Zechen Zhang · Haim Sompolinsky 🔗 |
Sun 11:20 a.m. - 12:20 p.m.
|
Testing knowledge distillation theories with dataset size ( Poster Session ) > link | Giulia Lanzillotta · Felix Sarnthein · Gil Kur · Thomas Hofmann · Bobby He 🔗 |
Sun 12:20 p.m. - 1:20 p.m.
|
Lunch break
(
Lunch break
)
>
|
🔗 |
Sun 1:20 p.m. - 1:35 p.m.
|
Antonio Sclocchi: Unraveling the Latent Hierarchical Structure of Language and Images via Diffusion Models
(
Contributed Talk
)
>
SlidesLive Video |
Antonio Sclocchi · Noam Levi · Alessandro Favero · Matthieu Wyart 🔗 |
Sun 1:35 p.m. - 1:50 p.m.
|
Bao Pham: Memorization to Generalization: The Emergence of Diffusion Models from Associative Memory
(
Contributed Talk
)
>
SlidesLive Video |
Bao Pham · Gabriel Raya · Matteo Negri · Mohammed Zaki · Luca Ambrogioni · Dmitry Krotov 🔗 |
Sun 1:50 p.m. - 2:20 p.m.
|
Zico Kolter: Is this really science? A lukewarm defense of alchemy
(
Keynote Talk
)
>
SlidesLive Video |
J. Zico Kolter 🔗 |
Sun 2:20 p.m. - 2:50 p.m.
|
Misha Belkin: Building on observations: some personal experience
(
Keynote Talk
)
>
SlidesLive Video |
Misha Belkin 🔗 |
Sun 2:50 p.m. - 3:10 p.m.
|
Coffee Break
(
Coffee Break
)
>
|
🔗 |
Sun 3:10 p.m. - 4:10 p.m.
|
Yasaman Bahri, Andrew Gordon Wilson, Misha Belkin, Tom Goldstein, Eero Simoncelli
(
Panel discussion
)
>
SlidesLive Video |
Yasaman Bahri · Andrew Wilson · Misha Belkin · Eero Simoncelli · Tom Goldstein · Surya Ganguli 🔗 |
Sun 4:10 p.m. - 4:30 p.m.
|
Winners Announcement + Closing Remarks
(
Winners Announcement + Closing Remarks
)
>
SlidesLive Video |
🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Memorization to Generalization: The Emergence of Diffusion Models from Associative Memory ( Poster Session ) > link | Bao Pham · Gabriel Raya · Matteo Negri · Mohammed Zaki · Luca Ambrogioni · Dmitry Krotov 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Language model scaling laws and zero-sum learning ( Poster Session ) > link | Andrei Mircea · Ekaterina Lobacheva · Supriyo Chakraborty · Nima Chitsazan · Irina Rish 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Evaluating Loss Landscapes from a Topology Perspective ( Poster Session ) > link | Tiankai Xie · Caleb Geniesse · Jiaqing Chen · Yaoqing Yang · Dmitriy Morozov · Michael Mahoney · Ross Maciejewski · Gunther Weber 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Training Dynamics of Convolutional Neural Networks for Learning the Derivative Operator ( Poster Session ) > link | Erik Wang · Yongji Wang · Ching-Yao Lai 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Understanding the Transient Nature of In-Context Learning: The Window of Generalization ( Poster Session ) > link | Core Francisco Park · Ekdeep S Lubana · Hidenori Tanaka 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Learning Stochastic Rainbow Networks ( Poster Session ) > link | Vivian White · Muawiz Chaudhary · Guy Wolf · Guillaume Lajoie · Kameron Decker Harris 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
SolidMark: How to Evaluate Memorization in Image Generative Models ( Poster Session ) > link | Nicky Kriplani · Minh Pham · Malikka Rajshahi · Chinmay Hegde · Niv Cohen 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Token-token correlations predict the scaling of the test loss with the number of input tokens ( Poster Session ) > link | Francesco Cagnetta · Matthieu Wyart 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Explicit Regularisation, Sharpness and Calibration ( Poster Session ) > link | Israel Mason-Williams · Fredrik Ekholm · Ferenc Huszar 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Knowledge Distillation for Teaching Symmetry Invariances ( Poster Session ) > link | Patrick Odagiu · Nicole Nobili · Fabian Dionys Schrag · Yves Bicker · Yuhui Ding 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Standard adversarial attacks only fool the final layer ( Poster Session ) > link | Stanislav Fort 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Generalization vs Specialization under Concept Shift ( Poster Session ) > link | Alex Nguyen · David Schwab · Vudtiwat Ngampruetikorn 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Unraveling the Latent Hierarchical Structure of Language and Images via Diffusion Models ( Poster Session ) > link | Antonio Sclocchi · Noam Levi · Alessandro Favero · Matthieu Wyart 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Stitching Sparse Autoencoders of Different Sizes ( Poster Session ) > link | Patrick Leask · Bart Bussmann · Joseph Bloom · Curt Tigges · Noura Al Moubayed · Neel Nanda 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Illusions as features: the generative side of recognition ( Poster Session ) > link | Tahereh Toosi · Kenneth Miller 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Exploring model depth and data complexity through the lens of cellular automata ( Poster Session ) > link | Tianyu He · Darshil Doshi · Aritra Das · Andrey Gromov 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
We Need Far Fewer Unique Filters Than We Thought ( Poster Session ) > link | Zahra Babaiee · Peyman M. Kiasari · Daniela Rus · Radu Grosu 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
How Learning Rates Shape Neural Network Focus: Insights from Example Ranking ( Poster Session ) > link | Ekaterina Lobacheva · Keller Jordan · Aristide Baratin · Nicolas Le Roux 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Alice in Wonderland: Simple Tasks Reveal Severe Generalization and Basic Reasoning Deficits in State-Of-the-Art Large Language Models ( Poster Session ) > link | Marianna Nezhurina · Lucia Cipolina Kun · Mehdi Cherti · Jenia Jitsev 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Causation Does Not Imply Correlation: A Study of Circuit Mechanisms and Model Behaviors ( Poster Session ) > link | Jenny Kaufmann · Victoria R. Li · Martin Wattenberg · David Alvarez-Melis · Naomi Saphra 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Effectiveness of Sparse Autoencoder for understanding and removing gender bias in LLMs ( Poster Session ) > link | Praveen Hegde 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Boundaries of stable regions in activation space of LLMs become sharper with more compute ( Poster Session ) > link | Jett Janiak · Jacek Karwowski · Chatrik Mangat · Giorgi Giglemiani · Nora Petrova · Stefan Heimersheim 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Measuring the Reliability of Causal Probing Methods: Tradeoffs, Limitations, and the Plight of Nullifying Interventions ( Poster Session ) > link | Marc Canby · Adam Davies · Chirag Rastogi · Julia C Hockenmaier 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Impact of Label Noise on Learning Complex Features ( Poster Session ) > link | Rahul Vashisht · P Kumar · Harsha Vardhan Govind · Harish Guruprasad Ramaswamy 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
How rare events shape the learning curves of hierarchical data ( Poster Session ) > link | Hyunmo Kang · Francesco Cagnetta · Matthieu Wyart 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Specialization-generalization transition in in-context learning of linear functions ( Poster Session ) > link | Chase Goddard · Lindsay Smith · Vudtiwat Ngampruetikorn · David Schwab 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Knowledge Distillation: The Functional Perspective ( Poster Session ) > link | Gabryel Mason-Williams · Israel Mason-Williams · Mark Sandler 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Are Capsule Networks Texture or Shape Biased? ( Poster Session ) > link | Riccardo Renzulli · Dominik Vranay · Marco Grangetto 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Sometimes I am a Tree: Data Drives Fragile Hierarchical Generalization ( Poster Session ) > link | Tian Qin · Naomi Saphra · David Alvarez-Melis 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition ( Poster Session ) > link | Youssef Doulfoukar · Laurent Mertens · Joost Vennekens 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Improving Deep Learning Speed and Performance through Synaptic Neural Balance ( Poster Session ) > link | Antonios Alexos · ian domingo · Pierre Baldi 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Is Expressivity Essential for the Predictive Performance of Graph Neural Networks? ( Poster Session ) > link | Fabian Jogl · Pascal Welke · Thomas Gärtner 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Is network fragmentation a useful complexity measure? ( Poster Session ) > link | Coenraad Mouton · Randle Rabe · Daniël Haasbroek · Marthinus Theunissen · Hermanus Potgieter · Marelie Davel 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Comparing Apples and Oranges: is Stitching Similarity a Load of Spheres? ( Poster Session ) > link | Damian Smith · Antonia Marcu 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Emergent properties with repeated examples ( Poster Session ) > link | Francois Charton · Julia Kempe 🔗 |
Sun 4:30 p.m. - 5:30 p.m.
|
Rethinking Knowledge Transfer in Learning Using Privileged Information ( Poster Session ) > link | Danil Provodin · Bram van den Akker · Christina Katsimerou · Maurits Clemens Kaptein · Mykola Pechenizkiy 🔗 |