Skip to yearly menu bar Skip to main content


Poster
in
Workshop: System-2 Reasoning at Scale

ALTA: Compiler-Based Analysis of Transformers

Peter Shaw · James Cohan · Jacob Eisenstein · Kenton Lee · Jonathan Berant · Kristina N Toutanova


Abstract:

We propose a new programming language called ALTA and a compiler that can map ALTA programs to Transformer weights. ALTA is inspired by RASP, a language proposed by Weiss et al. (2021), and Tracr (Lindner et al., 2023), a compiler from RASP programs to Transformer weights. ALTA complements and extends this prior work, offering the ability to express loops and to compile programs to Universal Transformers, among other advantages. ALTA allows us to constructively show how Transformers can represent length-invariant algorithms for computing parity and addition, as well as a solution to the SCAN benchmark of compositional generalization tasks, without requiring intermediate scratchpad decoding steps. We make the ALTA framework --- language specification, symbolic interpreter, and weight compiler --- available to the community to enable further applications and insights.

Chat is not available.