NeurIPS Can language model plan in extrapolated environments?: Casestudy in textualized Gridworld

Poster
in
Workshop: Compositional Learning: Perspectives, Methods, and Paths Forward

Can language model plan in extrapolated environments?: Casestudy in textualized Gridworld

Doyoung Kim · Jongwon Lee · Jinho Park · Minjoon Seo

Keywords: [ language agent ] [ planning ] [ cognitive map ]

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

While language models have demonstrated impressive capabilities across generalized language tasks, their ability to extrapolate in a certain task is highly unknown. We first introduce the optimal path planning task in a textualized Gridworld environment as a valid probe for estimating the extrapolability of language models. We show that the mere next token prediction inherently fails to extrapolate in solving the task. Inspired by human cognition, we claim that language models should construct an internal simulation that explores the environment, i.e. cognitive map before actually interacting with the given environment. We demonstrate that auto-regressive generation of cognitive map and planning sequence can significantly enhance the performance of the planning power even in extrapolated environments, suggesting the necessity of cognitive map for language models as a path forward.

Chat is not available.

Poster in Workshop: Compositional Learning: Perspectives, Methods, and Paths Forward

Can language model plan in extrapolated environments?: Casestudy in textualized Gridworld

Doyoung Kim · Jongwon Lee · Jinho Park · Minjoon Seo

Poster
in
Workshop: Compositional Learning: Perspectives, Methods, and Paths Forward