Skip to yearly menu bar Skip to main content


In-person presentation
in
Competition: NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day

Invited Speaker: Keming Lu (Alibaba Research) - Qwen: Towards a Generalist Model

Keming Lu


Abstract:

We introduce the large language and multimodal model series Qwen, published and opensourced by Alibaba Group. The Qwen model have achieved competitive performance against both opensource and proprietary LLMs and LMMs in both benchmark and human evaluation. This talk provides a brief overview of the model series and delves into details about building the LLMs, including pretraining, alignment, as well as the opensource. Additionally, it points out the limitations, and discusses the future work for both research community and industry in this field.

Chat is not available.