Skip to yearly menu bar Skip to main content


Spotlight Poster

Algebraic Positional Encodings

Konstantinos Kogkalidis · Jean-Philippe Bernardy · Vikas Garg

East Exhibit Hall A-C #2108
[ ]
Wed 11 Dec 11 a.m. PST — 2 p.m. PST

Abstract:

We introduce a novel positional encoding strategy for Transformer-style models, addressing the shortcomings of existing, often ad hoc, approaches. Our framework provides a flexible mapping from the algebraic specification of a domain to an interpretation as orthogonal operators. This design preserves the algebraic characteristics of the source domain, ensuring that the model upholds the desired structural properties. Our scheme can accommodate various structures, including sequences, grids and trees, as well as their compositions. We conduct a series of experiments to demonstrate the practical applicability of our approach. Results suggest performance on par with or surpassing the current state-of-the-art, without hyperparameter optimizations or ``task search'' of any kind. Code is available through https://aalto-quml.github.io/ape/.

Live content is unavailable. Log in and register to view live content