Scientific Methods for Understanding Neural Networks

Workshop

Scientific Methods for Understanding Neural Networks

Zahra Kadkhodaie · Florentin Guth · Sanae Lotfi · Davis Brown · Micah Goldblum · Valentin De Bortoli · Andrew Saxe

West Meeting Room 205-207

Sun 15 Dec, 8:50 a.m. PST

[ Abstract ] Workshop Website

[ OpenReview]

While deep learning continues to achieve impressive results on an ever-growing range of tasks, our understanding of the principles underlying these successes remains largely limited. This problem is usually tackled from a mathematical point of view, aiming to prove rigorous theorems about optimization or generalization errors of standard algorithms, but so far they have been limited to overly-simplified settings. The main goal of this workshop is to promote a complementary approach that is centered on the use of the scientific method, which forms hypotheses and designs controlled experiments to test them. More specifically, it focuses on empirical analyses of deep networks that can validate or falsify existing theories and assumptions, or answer questions about the success or failure of these models. This approach has been largely underexplored, but has great potential to further our understanding of deep learning and to lead to significant progress in both theory and practice. The secondary goal of this workshop is to build a community of researchers, currently scattered in several subfields, around the common goal of understanding deep learning through a scientific lens.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Sun 8:50 a.m. - 9:00 a.m.	Opening Remarks ( Opening remarks by organizers ) > SlidesLive Video	🔗
Sun 9:00 a.m. - 9:30 a.m.	Tom Goldstein: Can transformers solve harder problems than they were trained on? Scaling up test-time computation via recurrence ( Keynote Talk ) > SlidesLive Video	Tom Goldstein 🔗
Sun 9:30 a.m. - 10:00 a.m.	Surya Ganguli: An analytic theory of creativity in convolutional diffusion models ( Keynote Talk ) > SlidesLive Video	Surya Ganguli 🔗
Sun 10:00 a.m. - 10:30 a.m.	Hanie Sedghi: Exploring and Improving Planning Capabilities of LLMs ( Keynote Talk ) > SlidesLive Video	Hanie Sedghi 🔗
Sun 10:30 a.m. - 10:50 a.m.	Coffee Break ( Coffee Break ) >	🔗
Sun 10:50 a.m. - 11:05 a.m.	Christos Perivolaropoulos: Softmax is not enough (for sharp out-of-distribution) ( Contributed Talk ) > SlidesLive Video	Petar Veličković · Christos Perivolaropoulos · Federico Barbero · Razvan Pascanu 🔗
Sun 11:05 a.m. - 11:20 a.m.	David Krueger: Input Space Mode Connectivity in Deep Neural Networks ( Contributed Talk ) > SlidesLive Video	Jakub Vrabel · Ori Shem Ur · Yaron Oz · David Krueger 🔗
Sun 11:20 a.m. - 12:20 p.m.	Structured Identity Mapping Learning As a Model for Compositional Generalization in Generative Models ( Poster Session ) > link Link	Yongyi Yang · Core Francisco Park · Ekdeep S Lubana · Maya Okawa · Wei Hu · Hidenori Tanaka 🔗
Sun 11:20 a.m. - 12:20 p.m.	Exploiting Interpretable Capabilities with Concept-Enhanced Diffusion and Prototype Networks ( Poster Session ) > link Link	Alba Carballo Castro · Sonia Laguna · Moritz Vandenhirtz · Julia Vogt 🔗
Sun 11:20 a.m. - 12:20 p.m.	Distributional Scaling Laws for Emergent Capabilities ( Poster Session ) > link Link	Rosie Zhao · Naomi Saphra · Sham Kakade 🔗
Sun 11:20 a.m. - 12:20 p.m.	Input Space Mode Connectivity in Deep Neural Networks ( Poster Session ) > link Link	Jakub Vrabel · Ori Shem Ur · Yaron Oz · David Krueger 🔗
Sun 11:20 a.m. - 12:20 p.m.	Learned Random Label Predictions as a Neural Network Complexity Metric ( Poster Session ) > link Link	Marlon Becker · Benjamin Risse 🔗
Sun 11:20 a.m. - 12:20 p.m.	Investigating Sensitive Directions in GPT-2: An Improved Baseline and Comparative Analysis of SAEs ( Poster Session ) > link Link	Daniel Lee · Stefan Heimersheim 🔗
Sun 11:20 a.m. - 12:20 p.m.	BatchTopK Sparse Autoencoders ( Poster Session ) > link Link	Bart Bussmann · Patrick Leask · Neel Nanda 🔗
Sun 11:20 a.m. - 12:20 p.m.	On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models ( Poster Session ) > link Link	Zhang · Difan Zou 🔗
Sun 11:20 a.m. - 12:20 p.m.	softmax is not enough (for sharp out-of-distribution) ( Poster Session ) > link Link	Petar Veličković · Christos Perivolaropoulos · Federico Barbero · Razvan Pascanu 🔗
Sun 11:20 a.m. - 12:20 p.m.	Understanding Visual Concepts Across Models ( Poster Session ) > link Link	Brandon Trabucco · Max Gurinas · Kyle Doherty · Ruslan Salakhutdinov 🔗
Sun 11:20 a.m. - 12:20 p.m.	Eliminating Position Bias of Language Models: A Mechanistic Approach ( Poster Session ) > link Link	Ziqi Wang · Hanlin Zhang · Xiner Li · Kuan-Hao Huang · Chi Han · Shuiwang Ji · Sham Kakade · Hao Peng · Heng Ji 🔗
Sun 11:20 a.m. - 12:20 p.m.	Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations ( Poster Session ) > link Link	Kola Ayonrinde · Michael Pearce · Lee Sharkey 🔗
Sun 11:20 a.m. - 12:20 p.m.	Sparse autoencoders for dense text embeddings reveal hierarchical feature sub-structure ( Poster Session ) > link Link	Christine Ye · Charles O'Neill · John Wu · Kartheik Iyer 🔗
Sun 11:20 a.m. - 12:20 p.m.	Learnability in the Context of Neural Tangent Kernels ( Poster Session ) > link Link	Progyan Das · Dwip Dalal 🔗
Sun 11:20 a.m. - 12:20 p.m.	The Master Key Filters Hypothesis: Deep Filters Are General ( Poster Session ) > link Link	Zahra Babaiee · Peyman M. Kiasari · Daniela Rus · Radu Grosu 🔗
Sun 11:20 a.m. - 12:20 p.m.	The Unreasonable Ineffectiveness of the Deeper Layers ( Poster Session ) > link Link	Andrey Gromov · Kushal Tirumala · Hassan Shapourian · Paolo Glorioso · Dan Roberts 🔗
Sun 11:20 a.m. - 12:20 p.m.	Probing the Decision Boundaries of In-context Learning in Large Language Models Download PDF ( Poster Session ) > link Link	Siyan Zhao · Tung Nguyen · Aditya Grover 🔗
Sun 11:20 a.m. - 12:20 p.m.	The Pitfalls of Memorization: When Memorization Hinders Generalization ( Poster Session ) > link Link	Reza Bayat · Mohammad Pezeshki · Elvis Dohmatob · David Lopez-Paz · Pascal Vincent 🔗
Sun 11:20 a.m. - 12:20 p.m.	"Twin Studies" of Factors in OOD Generalization ( Poster Session ) > link Link	Victoria R. Li · Jenny Kaufmann · David Alvarez-Melis · Naomi Saphra 🔗
Sun 11:20 a.m. - 12:20 p.m.	Hiding in a Plain Sight: Out-of-Distribution Data in the Logit Space Embeddings ( Poster Session ) > link Link	Vangjush Komini · Sarunas Girdzijauskas 🔗
Sun 11:20 a.m. - 12:20 p.m.	Is Saliency Really Captured By Gradient? ( Poster Session ) > link Link	Nehal Yasin · Jonathon Hare · Antonia Marcu 🔗
Sun 11:20 a.m. - 12:20 p.m.	A Continuous-Time Analysis of Adaptive Optimization and Normalization ( Poster Session ) > link Link	Rhys Gould · Hidenori Tanaka 🔗
Sun 11:20 a.m. - 12:20 p.m.	Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference ( Poster Session ) > link Link	Anton Xue · Avishree Khare · Rajeev Alur · Surbhi Goel · Eric Wong 🔗
Sun 11:20 a.m. - 12:20 p.m.	Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics ( Poster Session ) > link Link	Charlotte Beylier · Simon M. Hofmann · Nico Scherf 🔗
Sun 11:20 a.m. - 12:20 p.m.	Training Neural Networks for Modularity aids Interpretability ( Poster Session ) > link Link	Satvik Golechha · Dylan Cope · Nandi Schoots 🔗
Sun 11:20 a.m. - 12:20 p.m.	Denoising for Manifold Extrapolation ( Poster Session ) > link Link	Zeyu Yun · Galen Chuang · Derek Dong · Yubei Chen 🔗
Sun 11:20 a.m. - 12:20 p.m.	A Method on Searching Better Activation Functions ( Poster Session ) > link Link	Haoyuan Sun · Zihao Wu · Bo Xia · Pu Chang · Zibin Dong · Yifu Yuan · Yongzhe Chang · Xueqian Wang 🔗
Sun 11:20 a.m. - 12:20 p.m.	Pre-processing and Compression: Understanding Hidden Representation Refinement Across Imaging Domains via Intrinsic Dimension ( Poster Session ) > link Link	Nicholas Konz · Maciej Mazurowski 🔗
Sun 11:20 a.m. - 12:20 p.m.	Model Recycling: Model component reuse to promote in-context learning ( Poster Session ) > link Link	Lindsay Smith · Chase Goddard · Vudtiwat Ngampruetikorn · David Schwab 🔗
Sun 11:20 a.m. - 12:20 p.m.	Emergence of Hierarchical Emotion Representations in Large Language Models ( Poster Session ) > link Link	Bo Zhao · Maya Okawa · Eric Bigelow · Rose Yu · Tomer Ullman · Hidenori Tanaka 🔗
Sun 11:20 a.m. - 12:20 p.m.	Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps ( Poster Session ) > link Link	Fuxiao Liu 🔗
Sun 11:20 a.m. - 12:20 p.m.	Amplified Early Stopping Bias: Overestimated Performance with Deep Learning ( Poster Session ) > link Link	Nona Rajabi · Antonio Ribeiro · Miguel Vasco · Danica Kragic 🔗
Sun 11:20 a.m. - 12:20 p.m.	The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains ( Poster Session ) > link Link	Ezra Edelman · Nikolaos Tsilivis · Surbhi Goel · Benjamin Edelman · Eran Malach 🔗
Sun 11:20 a.m. - 12:20 p.m.	Transformers can reinforcement learn to approximate Gittins Index ( Poster Session ) > link Link	Vladimir Petrov · Nikhil Vyas · Lucas Janson 🔗
Sun 11:20 a.m. - 12:20 p.m.	Understanding the Limitations of B-Spline KANs: Convergence Dynamics and Computational Efficiency ( Poster Session ) > link Link	Avik Pal · Dipankar Das 🔗
Sun 11:20 a.m. - 12:20 p.m.	Structure Development in List Sorting Transformers ( Poster Session ) > link Link	Einar Urdshals · Jasmina Urdshals 🔗
Sun 11:20 a.m. - 12:20 p.m.	Robust Learning in Bayesian Parallel Branching Graph Neural Networks: The Narrow Width Limit ( Poster Session ) > link Link	Zechen Zhang · Haim Sompolinsky 🔗
Sun 11:20 a.m. - 12:20 p.m.	Testing knowledge distillation theories with dataset size ( Poster Session ) > link Link	Giulia Lanzillotta · Felix Sarnthein · Gil Kur · Thomas Hofmann · Bobby He 🔗
Sun 12:20 p.m. - 1:20 p.m.	Lunch break ( Lunch break ) >	🔗
Sun 1:20 p.m. - 1:35 p.m.	Antonio Sclocchi: Unraveling the Latent Hierarchical Structure of Language and Images via Diffusion Models ( Contributed Talk ) > SlidesLive Video	Antonio Sclocchi · Noam Levi · Alessandro Favero · Matthieu Wyart 🔗
Sun 1:35 p.m. - 1:50 p.m.	Bao Pham: Memorization to Generalization: The Emergence of Diffusion Models from Associative Memory ( Contributed Talk ) > SlidesLive Video	Bao Pham · Gabriel Raya · Matteo Negri · Mohammed Zaki · Luca Ambrogioni · Dmitry Krotov 🔗
Sun 1:50 p.m. - 2:20 p.m.	Zico Kolter: Is this really science? A lukewarm defense of alchemy ( Keynote Talk ) > SlidesLive Video	J. Zico Kolter 🔗
Sun 2:20 p.m. - 2:50 p.m.	Misha Belkin: Building on observations: some personal experience ( Keynote Talk ) > SlidesLive Video	Misha Belkin 🔗
Sun 2:50 p.m. - 3:10 p.m.	Coffee Break ( Coffee Break ) >	🔗
Sun 3:10 p.m. - 4:10 p.m.	Yasaman Bahri, Andrew Gordon Wilson, Misha Belkin, Tom Goldstein, Eero Simoncelli ( Panel discussion ) > SlidesLive Video	Yasaman Bahri · Andrew Wilson · Misha Belkin · Eero Simoncelli · Tom Goldstein · Surya Ganguli 🔗
Sun 4:10 p.m. - 4:30 p.m.	Winners Announcement + Closing Remarks ( Winners Announcement + Closing Remarks ) > SlidesLive Video	🔗
Sun 4:30 p.m. - 5:30 p.m.	Memorization to Generalization: The Emergence of Diffusion Models from Associative Memory ( Poster Session ) > link Link	Bao Pham · Gabriel Raya · Matteo Negri · Mohammed Zaki · Luca Ambrogioni · Dmitry Krotov 🔗
Sun 4:30 p.m. - 5:30 p.m.	Language model scaling laws and zero-sum learning ( Poster Session ) > link Link	Andrei Mircea · Ekaterina Lobacheva · Supriyo Chakraborty · Nima Chitsazan · Irina Rish 🔗
Sun 4:30 p.m. - 5:30 p.m.	Evaluating Loss Landscapes from a Topology Perspective ( Poster Session ) > link Link	Tiankai Xie · Caleb Geniesse · Jiaqing Chen · Yaoqing Yang · Dmitriy Morozov · Michael Mahoney · Ross Maciejewski · Gunther Weber 🔗
Sun 4:30 p.m. - 5:30 p.m.	Training Dynamics of Convolutional Neural Networks for Learning the Derivative Operator ( Poster Session ) > link Link	Erik Wang · Yongji Wang · Ching-Yao Lai 🔗
Sun 4:30 p.m. - 5:30 p.m.	Understanding the Transient Nature of In-Context Learning: The Window of Generalization ( Poster Session ) > link Link	Core Francisco Park · Ekdeep S Lubana · Hidenori Tanaka 🔗
Sun 4:30 p.m. - 5:30 p.m.	Learning Stochastic Rainbow Networks ( Poster Session ) > link Link	Vivian White · Muawiz Chaudhary · Guy Wolf · Guillaume Lajoie · Kameron Decker Harris 🔗
Sun 4:30 p.m. - 5:30 p.m.	SolidMark: How to Evaluate Memorization in Image Generative Models ( Poster Session ) > link Link	Nicky Kriplani · Minh Pham · Malikka Rajshahi · Chinmay Hegde · Niv Cohen 🔗
Sun 4:30 p.m. - 5:30 p.m.	Token-token correlations predict the scaling of the test loss with the number of input tokens ( Poster Session ) > link Link	Francesco Cagnetta · Matthieu Wyart 🔗
Sun 4:30 p.m. - 5:30 p.m.	Explicit Regularisation, Sharpness and Calibration ( Poster Session ) > link Link	Israel Mason-Williams · Fredrik Ekholm · Ferenc Huszar 🔗
Sun 4:30 p.m. - 5:30 p.m.	Knowledge Distillation for Teaching Symmetry Invariances ( Poster Session ) > link Link	Patrick Odagiu · Nicole Nobili · Fabian Dionys Schrag · Yves Bicker · Yuhui Ding 🔗
Sun 4:30 p.m. - 5:30 p.m.	Standard adversarial attacks only fool the final layer ( Poster Session ) > link Link	Stanislav Fort 🔗
Sun 4:30 p.m. - 5:30 p.m.	Generalization vs Specialization under Concept Shift ( Poster Session ) > link Link	Alex Nguyen · David Schwab · Vudtiwat Ngampruetikorn 🔗
Sun 4:30 p.m. - 5:30 p.m.	Unraveling the Latent Hierarchical Structure of Language and Images via Diffusion Models ( Poster Session ) > link Link	Antonio Sclocchi · Noam Levi · Alessandro Favero · Matthieu Wyart 🔗
Sun 4:30 p.m. - 5:30 p.m.	Stitching Sparse Autoencoders of Different Sizes ( Poster Session ) > link Link	Patrick Leask · Bart Bussmann · Joseph Bloom · Curt Tigges · Noura Al Moubayed · Neel Nanda 🔗
Sun 4:30 p.m. - 5:30 p.m.	Illusions as features: the generative side of recognition ( Poster Session ) > link Link	Tahereh Toosi · Kenneth Miller 🔗
Sun 4:30 p.m. - 5:30 p.m.	Exploring model depth and data complexity through the lens of cellular automata ( Poster Session ) > link Link	Tianyu He · Darshil Doshi · Aritra Das · Andrey Gromov 🔗
Sun 4:30 p.m. - 5:30 p.m.	We Need Far Fewer Unique Filters Than We Thought ( Poster Session ) > link Link	Zahra Babaiee · Peyman M. Kiasari · Daniela Rus · Radu Grosu 🔗
Sun 4:30 p.m. - 5:30 p.m.	How Learning Rates Shape Neural Network Focus: Insights from Example Ranking ( Poster Session ) > link Link	Ekaterina Lobacheva · Keller Jordan · Aristide Baratin · Nicolas Le Roux 🔗
Sun 4:30 p.m. - 5:30 p.m.	Alice in Wonderland: Simple Tasks Reveal Severe Generalization and Basic Reasoning Deficits in State-Of-the-Art Large Language Models ( Poster Session ) > link Link	Marianna Nezhurina · Lucia Cipolina Kun · Mehdi Cherti · Jenia Jitsev 🔗
Sun 4:30 p.m. - 5:30 p.m.	Causation Does Not Imply Correlation: A Study of Circuit Mechanisms and Model Behaviors ( Poster Session ) > link Link	Jenny Kaufmann · Victoria R. Li · Martin Wattenberg · David Alvarez-Melis · Naomi Saphra 🔗
Sun 4:30 p.m. - 5:30 p.m.	Effectiveness of Sparse Autoencoder for understanding and removing gender bias in LLMs ( Poster Session ) > link Link	Praveen Hegde 🔗
Sun 4:30 p.m. - 5:30 p.m.	Boundaries of stable regions in activation space of LLMs become sharper with more compute ( Poster Session ) > link Link	Jett Janiak · Jacek Karwowski · Chatrik Mangat · Giorgi Giglemiani · Nora Petrova · Stefan Heimersheim 🔗
Sun 4:30 p.m. - 5:30 p.m.	Measuring the Reliability of Causal Probing Methods: Tradeoffs, Limitations, and the Plight of Nullifying Interventions ( Poster Session ) > link Link	Marc Canby · Adam Davies · Chirag Rastogi · Julia C Hockenmaier 🔗
Sun 4:30 p.m. - 5:30 p.m.	Impact of Label Noise on Learning Complex Features ( Poster Session ) > link Link	Rahul Vashisht · P Kumar · Harsha Vardhan Govind · Harish Guruprasad Ramaswamy 🔗
Sun 4:30 p.m. - 5:30 p.m.	How rare events shape the learning curves of hierarchical data ( Poster Session ) > link Link	Hyunmo Kang · Francesco Cagnetta · Matthieu Wyart 🔗
Sun 4:30 p.m. - 5:30 p.m.	Specialization-generalization transition in in-context learning of linear functions ( Poster Session ) > link Link	Chase Goddard · Lindsay Smith · Vudtiwat Ngampruetikorn · David Schwab 🔗
Sun 4:30 p.m. - 5:30 p.m.	Knowledge Distillation: The Functional Perspective ( Poster Session ) > link Link	Gabryel Mason-Williams · Israel Mason-Williams · Mark Sandler 🔗
Sun 4:30 p.m. - 5:30 p.m.	Are Capsule Networks Texture or Shape Biased? ( Poster Session ) > link Link	Riccardo Renzulli · Dominik Vranay · Marco Grangetto 🔗
Sun 4:30 p.m. - 5:30 p.m.	Sometimes I am a Tree: Data Drives Fragile Hierarchical Generalization ( Poster Session ) > link Link	Tian Qin · Naomi Saphra · David Alvarez-Melis 🔗
Sun 4:30 p.m. - 5:30 p.m.	EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition ( Poster Session ) > link Link	Youssef Doulfoukar · Laurent Mertens · Joost Vennekens 🔗
Sun 4:30 p.m. - 5:30 p.m.	Improving Deep Learning Speed and Performance through Synaptic Neural Balance ( Poster Session ) > link Link	Antonios Alexos · ian domingo · Pierre Baldi 🔗
Sun 4:30 p.m. - 5:30 p.m.	Is Expressivity Essential for the Predictive Performance of Graph Neural Networks? ( Poster Session ) > link Link	Fabian Jogl · Pascal Welke · Thomas Gärtner 🔗
Sun 4:30 p.m. - 5:30 p.m.	Is network fragmentation a useful complexity measure? ( Poster Session ) > link Link	Coenraad Mouton · Randle Rabe · Daniël Haasbroek · Marthinus Theunissen · Hermanus Potgieter · Marelie Davel 🔗
Sun 4:30 p.m. - 5:30 p.m.	Comparing Apples and Oranges: is Stitching Similarity a Load of Spheres? ( Poster Session ) > link Link	Damian Smith · Antonia Marcu 🔗
Sun 4:30 p.m. - 5:30 p.m.	Emergent properties with repeated examples ( Poster Session ) > link Link	Francois Charton · Julia Kempe 🔗
Sun 4:30 p.m. - 5:30 p.m.	Rethinking Knowledge Transfer in Learning Using Privileged Information ( Poster Session ) > link Link	Danil Provodin · Bram van den Akker · Christina Katsimerou · Maurits Clemens Kaptein · Mykola Pechenizkiy 🔗