NeurIPS MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation

Oral
in
Workshop: Table Representation Learning Workshop (TRL)

MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation

Satya Krishna Gorti · Ilan Gofman · Zhaoyan Liu · Jiapeng Wu · Noël Vouitsis · Guangwei Yu · Jesse Cresswell · Rasa Hosseinzadeh

[ Abstract ] [ Project Page ]

[ OpenReview]

Sat 14 Dec 2:10 p.m. PST — 2:20 p.m. PST

Abstract:

Text-to-SQL generation enables non-experts to interact with databases via natural language. Recent advances rely on large closed-source models like GPT-4 that present challenges in accessibility, privacy, and latency. To address these issues, we focus on developing small, efficient, and open-source text-to-SQL models. We demonstrate the benefits of sampling multiple candidate SQL generations and propose our method, MSc-SQL, to critique them using associated metadata. Our sample critiquing model evaluates multiple outputs simultaneously, achieving state-of-the-art performance compared to other open-source models while remaining competitive with larger models at a much lower cost. Full code of our method will be released for the camera-ready version.

Chat is not available.

Oral in Workshop: Table Representation Learning Workshop (TRL)

MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation

Satya Krishna Gorti · Ilan Gofman · Zhaoyan Liu · Jiapeng Wu · Noël Vouitsis · Guangwei Yu · Jesse Cresswell · Rasa Hosseinzadeh

Oral
in
Workshop: Table Representation Learning Workshop (TRL)