Workshop
System-2 Reasoning at Scale
Shikhar Murty · Federico Bianchi · Róbert Csordás · Nouha Dziri · Alex Gu · Shunyu Yao · Christopher D Manning · Yejin Choi
West Ballroom B
Sun 15 Dec, 8:55 a.m. PST
Our workshop focuses on improving reasoning in neural networks, particularly the challenges and strategies for achieving System-2 reasoning in transformer-like models. It addresses issues such as distinguishing memorization from rule-based learning, understanding, syntactic generalization, and compositionality. It also covers the importance, for AI safety, of understanding how systematic models are in their decisions, as well as integrating neural networks with symbolic reasoning and developing new architectures for enhanced reasoning capabilities. We have (tentatively) confirmed a distinguished group of speakers and panelists who are some of the most influential figures in recent literature on reasoning. Considering how important these topics are today and our distinguished lineup of speakers, we expect more than 500 participants at the workshop.
Schedule
Sun 9:00 a.m. - 9:15 a.m. | Poster setup
Sun 9:15 a.m. - 9:20 a.m. | Opening Remarks | Nouha Dziri · Alex Gu · Róbert Csordás
Sun 9:20 a.m. - 9:30 a.m. | Lightning Talk: softmax is not enough (for sharp out-of-distribution)
Sun 9:30 a.m. - 9:40 a.m. | Lightning Talk: Compositional Generalization Across Distributional Shifts with Sparse Tree Operations
Sun 9:40 a.m. - 9:50 a.m. | Lightning Talk: System 1.5: Designing Metacognition in Artificial Intelligence
Sun 9:50 a.m. - 10:00 a.m. | Lightning Talk: Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning
Sun 10:00 a.m. - 10:35 a.m. | Keynote: Josh Tenenbaum | Josh Tenenbaum
Sun 10:35 a.m. - 10:55 a.m. | Coffee Break and Poster Session
Sun 10:55 a.m. - 11:30 a.m. | Keynote: Melanie Mitchell: On Understanding and Abstraction in Humans and AI Systems | Melanie Mitchell
Sun 11:30 a.m. - 1:00 p.m. | Poster Session
Sun 1:00 p.m. - 2:00 p.m. | Lunch Break
Sun 2:00 p.m. - 2:35 p.m. | Keynote: Jason Weston: Self-Training Methods for System 2 Reasoning | Jason Weston
Sun 2:35 p.m. - 2:45 p.m. | Invited Talk: Basis | Zenna Tavares · Kevin Ellis
Sun 2:45 p.m. - 3:00 p.m. | Break and Poster Session
Sun 3:00 p.m. - 3:35 p.m. | Keynote: François Chollet: ARC Prize 2024: What we learned | Francois Chollet
Sun 3:35 p.m. - 5:00 p.m. | Panel Discussion | Dzmitry Bahdanau · Jason Weston · Josh Tenenbaum · Francois Chollet · Melanie Mitchell
Sun 5:00 p.m. - 5:30 p.m. | Poster Session and Social
Compositional Generalization Across Distributional Shifts with Sparse Tree Operations (Poster) | Paul Soulos · Henry Conklin · Mattia Opper · Paul Smolensky · Jianfeng Gao · Roland Fernandez
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents (Poster) | Quentin Delfosse · Sebastian Sztwiertnia · Mark Rothermel · Wolfgang Stammer · Kristian Kersting
From Isolated Conversations to Hierarchical Schemas: Dynamic Tree Memory Representation for LLMs (Poster) | Alireza Rezazadeh · Zichao Li · Wei Wei · Yujia Bao
Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus (Poster) | Terufumi Morishita · Gaku Morio · Atsuki Yamaguchi · Yasuhiro Sogawa
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning (Poster) | Yuxi Xie · Anirudh Goyal · Wenyue Zheng · Min-Yen Kan · Timothy Lillicrap · Kenji Kawaguchi · Michael Qizhe Shieh
ALTA: Compiler-Based Analysis of Transformers (Poster) | Peter Shaw · James Cohan · Jacob Eisenstein · Kenton Lee · Jonathan Berant · Kristina N Toutanova
Improving LLM Generation with Inverse and Forward Alignment: Reward Modeling, Prompting, Fine-Tuning, and Inference-Time Optimization (Poster) | Hao Sun · Thomas Pouplin · Nicolás Astorga · Tennison Liu · Mihaela van der Schaar
MovieCORE: COgnitive REasoning in Movies (Poster) | Gueter Josmy Faure · Min-Hung Chen · Jia-Fong Yeh · Ying Cheng · Hung-Ting Su · Shang-Hong Lai · Winston Hsu
Thinking Fast and Laterally: Multi-Agentic Approach for Reasoning about Uncertain Emerging Events (Poster) | Stefan Dernbach · Alejandro Michel · Khushbu Agarwal · Christopher Brissette · geetika gupta · Sutanay Choudhury
Equitable Access to Justice: Logical LLMs Show Promise (Poster) | Manuj Kant · Marzieh Nabi · Manav Kant · Preston Carlson · Megan Ma
CryptoFormalEval: Integrating Large Language Models and Formal Verification for Automated Cryptographic Protocol Vulnerability Detection (Poster) | Cristian Curaba · D'Ambrosi Denis · Alessandro Minisini
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning (Poster) | Mingde Zhao · Safa Alver · Harm Seijen · Romain Laroche · Doina Precup · Yoshua Bengio
Reasoning Abilities of Large Language Models through the Lens of Abstraction and Reasoning (Poster) | Seungpil Lee · Woochang Sim · Donghyeon Shin · Sejin Kim · Sundong Kim
Proof Flow: Preliminary Study on Generative Flow Network Language Model Tuning for Formal Reasoning (Poster) | Matthew Ho · Vincent Zhu · Xiaoyin Chen · Moksh Jain · Nikolay Malkin · Edwin Zhang
System-2 Reasoning via Generality and Adaptation (Poster) | Sejin Kim · Sundong Kim
Can Language Models Perform Implicit Bayesian Inference Over User Preference States? (Poster) | Linlu Qiu · Fei Sha · Kelsey Allen · Yoon Kim · Tal Linzen · Sjoerd van Steenkiste
Not All LLM Reasoners Are Created Equal (Poster) | Arian Hosseini · Alessandro Sordoni · Daniel Toyama · Aaron Courville · Rishabh Agarwal
Generative Verifiers: Reward Modeling as Next-Token Prediction (Poster) | Lunjun Zhang · Arian Hosseini · Hritik Bansal · Mehran Kazemi · Aviral Kumar · Rishabh Agarwal
System 2 Reasoning Capabilities Are Nigh (Poster) | Scott C. Lowe
The Turing Game (Poster) | Michal Lewandowski · Simon Schmid · Patrick Mederitsch · Alexander Aufreiter · Gregor Aichinger · Felix Nessler · Severin Bergsmann · Viktor Szolga · Tobias Halmdienst · Bernhard Nessler
Distilling System 2 into System 1 (Poster) | Ping Yu · Jing Xu · Jason Weston · Ilia Kulikov
Algorithmic Language Models with Neurally Compiled Libraries (Poster) | Lucas Saldyt · Subbarao Kambhampati
CausalBench: A Comprehensive Benchmark for Evaluating Causal Reasoning Capabilities of Large Language Models (Poster) | ZEYU WANG
System 1.5: Designing Metacognition in Artificial Intelligence (Poster) | Nick Oh · Fernand Gobet
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints (Poster) | Thomas Palmeira Ferraz · Kartik Mehta · Yu-Hsiang Lin · Haw-Shiuan Chang · Shereen Oraby · Sijia Liu · Vivek Subramanian · Tagyoung Chung · Mohit Bansal · Nanyun Peng
PROOF OF THOUGHT: Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning (Poster) | Debargha Ganguly · Srinivasan Iyengar · Vipin Chaudhary · Shivkumar Kalyanaraman
Rational Metareasoning for Large Language Models (Poster) | Camillo Nicolò De Sabbata · Ted Sumers · Tom Griffiths
Sampling Language from Latent System 2 Reasoning (Poster) | Celine Lee · Md Arafat Sultan · Tahira Naseem · Alexander Rush · Ramón Astudillo
MemReasoner: A Memory-augmented LLM Architecture for Multi-hop Reasoning (Poster) | Ching-Yun Ko · Sihui Dai · Payel Das · Georgios Kollias · Subhajit Chaudhury · Aurelie Lozano
World Models for Web Agents (Poster) | Hyungjoo Chae · Namyoung Kim · Minju Gwak · Gwanwoo Song · Jihoon Kim · Kai Ong · Seonghwan Kim · Dongha Lee · Jinyoung Yeo
Recursive Decomposition with Dependencies for Generic Divide-and-Conquer Reasoning (Poster) | Sergio Hernández-Gutiérrez · Minttu Alakuijala · Alexander Nikitin · Pekka Marttinen
STaR: Benchmarking Spatio-Temporal Reasoning for Systematic Generalization (Poster) | Muhammad Irtaza Khalid · Steven Schockaert
Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning (Poster) | Yifeng Ding · Hantian Ding · Shiqi Wang · Qing Sun · Varun Kumar · Zijian Wang
Automated Design of Agentic Systems (Poster) | Shengran Hu · Cong Lu · Jeff Clune
Logically Consistent Language Models via Neuro-Symbolic Integration (Poster) | Diego Calanzone · Stefano Teso · Antonio Vergari
LLMs on interactive feature collections with implicit look-ahead strategies (Poster) | Juyeon Heo · Vihari Piratla · Kyunghyun Lee · Hyonkeun Joh · Adrian Weller
softmax is not enough (for sharp out-of-distribution) (Poster) | Petar Veličković · Christos Perivolaropoulos · Federico Barbero · Razvan Pascanu
Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules (Poster) | Binxu Wang · Jiaqi Shang · Haim Sompolinsky
Can Stories Help LLMs Reason? Curating Information Space Through Narrative (Poster) | Vahid Sadiri Javadi · Johanne Trippas · Lucie Flek
Your Context Is Not an Array: Unveiling Random Access Limitations in Transformers (Poster) | Reza Ebrahimi · Sunny Panchal · Roland Memisevic
A Llama Sunk My Battleship! Asking Rational Questions with LLMs via Bayesian Inference (Poster) | Gabriel Grand · Valerio Pepe · Jacob Andreas · Josh Tenenbaum
Implicit Reasoning in Deep Time Series Forecasting (Poster) | Willa Potosnak · Cristian Challu · Mononito Goswami · Michal Wilinski · Nina Żukowska · Artur Dubrawski
Thought of Search: Planning with Language Models Through The Lens of Efficiency (Poster) | Michael Katz · Harsha Kokel · Kavitha Srinivas · Shirin Sohrabi Araghi
Planning in Natural Language Improves LLM Search for Code Generation (Poster) | Evan Wang · Federico Cassano · Catherine Wu · Yunfeng Bai · William Song · Vaskar Nath · Ziwen Han · Sean Hendryx · Summer Yue · Hugh Zhang
Doing Experiments and Revising Rules with Natural Language and Probabilistic Reasoning (Poster) | Top Piriyakulkij · Cassidy Langenfeld · Tuan Anh Le · Kevin Ellis
Recurrent Transformers Trade-off Parallelism for Length Generalization on Regular Languages (Poster) | Paul Soulos · Aleksandar Terzic · Michael Hersche · Abbas Rahimi
VCR: Visual Caption Restoration (Poster) | Tianyu Zhang · Suyuchen Wang · Lu Li · Ge Zhang · Perouz Taslakian · Sai Rajeswar Mudumba · Jie Fu · Bang Liu · Yoshua Bengio
Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs (Poster) | Siyuan Wang · zhongyu wei · Yejin Choi · Xiang Ren
Bongard in Wonderland: Visual Puzzles that Still Make AI Go Mad? (Poster) | Antonia Wüst · Tim Nelson Tobiasch · Lukas Helff · Devendra S Dhami · Constantin Rothkopf · Kristian Kersting
Diffusion On Syntax Trees For Program Synthesis (Poster) | Shreyas Kapur · Erik Jenner · Stuart J Russell