Workshop
Foundation Models for Decision Making
Sherry Yang · Ofir Nachum · Yilun Du · Stephen McAleer · Igor Mordatch · Linxi Fan · Jeannette Bohg · Dale Schuurmans
Hall E2 (level 1)
Fri 15 Dec, 6:15 a.m. PST
Foundation models pretrained on diverse vision and language datasets have demonstrated exceptional capabilities in performing a wide range of downstream vision and language tasks. As foundation models are deployed in real-world applications such as dialogue, autonomous driving, healthcare, and robotics, they inevitably face new challenges such as learning from external feedback, adapting to different task modalities, and performing long-term reasoning and planning. Such challenges have traditionally been at the core of sequential decision making, encompassing areas such as reinforcement learning, imitation learning, planning, search, and optimal control. These research fields have traditionally focused on task-specific settings with limited prior knowledge, and yet there has been significant research progress in surpassing human performance in tasks like playing board games and Atari video games, as well as operating robots to complete navigation and manipulation tasks. However, since these methods generally learn to solve a specific task from scratch without broad knowledge from vision and language, they can struggle with generalization and sample efficiency. The goal of this workshop is to bring together the sequential decision making community (including planning, search, RL, and optimal control) and the foundation models community in vision and language to confront the challenges in decision making at scale. The workshop will span high-level discussions on how foundation models and decision making can benefit each other when jointly considered, as well as low-level algorithmic details of various decision making algorithms and vision-language architectures, which may present both opportunities and challenges. Specific topics will include, for example, foundation model agents interacting with humans, computers, tools, simulators, the physical world, and each other.
Schedule
Fri 6:15 a.m. - 7:20 a.m. | Poster Session I (Poster Session)
Fri 7:20 a.m. - 7:30 a.m. | Opening Remark (Opening Remark)
Fri 7:30 a.m. - 8:00 a.m. | Percy Liang (Invited Talk)
Fri 8:00 a.m. - 8:30 a.m. | Ruslan Salakhutdinov (Invited Talk)
Fri 8:30 a.m. - 9:00 a.m. | Jürgen Schmidhuber (Invited Talk)
Fri 9:00 a.m. - 9:10 a.m. | Universal Visual Decomposer: Long-Horizon Manipulation Made Easy (Oral Presentation)
Fri 9:10 a.m. - 9:20 a.m. | Goal Masked Diffusion Policies for Unified Navigation and Exploration (Oral Presentation)
Fri 9:20 a.m. - 9:30 a.m. | Motif: Intrinsic Motivation from Artificial Intelligence Feedback (Oral Presentation)
Fri 9:30 a.m. - 9:40 a.m. | WebArena: A Realistic Web Environment for Building Autonomous Agents (Oral Presentation)
Fri 9:40 a.m. - 9:50 a.m. | Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning (Oral Presentation)
Fri 9:50 a.m. - 10:00 a.m. | Benchmarking Large Language Models as AI Research Agents (Oral Presentation)
Fri 10:00 a.m. - 11:00 a.m. | Lunch & Poster Session II (Poster Session)
Fri 11:00 a.m. - 11:30 a.m. | Phillip Isola (Invited Talk)
Fri 11:30 a.m. - 12:00 p.m. | Xinyun Chen (Invited Talk)
Fri 12:00 p.m. - 12:30 p.m. | Chelsea Finn (Invited Talk)
Fri 12:30 p.m. - 1:00 p.m. | Russ Tedrake (Invited Talk)
Fri 1:00 p.m. - 1:45 p.m. | Panel Discussion (Panel)
Fri 1:30 p.m. - 2:00 p.m. | Kristen Grauman (Invited Talk)
Fri 2:00 p.m. - 3:30 p.m. | Poster Session III (Poster Session)
Fri 2:00 p.m. - 2:15 p.m. | Industry Demos (Industry Demos)
-
|
WebArena: A Realistic Web Environment for Building Autonomous Agents ( Poster ) > link |
12 presenters: Shuyan Zhou · Frank F. Xu · Hao Zhu · Xuhui Zhou · Robert Lo · Abishek Sridhar · Xianyi Cheng · Tianyue Ou · Yonatan Bisk · Daniel Fried · Uri Alon · Graham Neubig |
-
|
WebArena: A Realistic Web Environment for Building Autonomous Agents ( Oral ) > link |
12 presenters: Shuyan Zhou · Frank F. Xu · Hao Zhu · Xuhui Zhou · Robert Lo · Abishek Sridhar · Xianyi Cheng · Tianyue Ou · Yonatan Bisk · Daniel Fried · Uri Alon · Graham Neubig |
-
|
Towards General-Purpose In-Context Learning Agents ( Poster ) > link | Louis Kirsch · James Harrison · Daniel Freeman · Jascha Sohl-Dickstein · Jürgen Schmidhuber 🔗 |
-
|
Towards General-Purpose In-Context Learning Agents ( Oral ) > link | Louis Kirsch · James Harrison · Daniel Freeman · Jascha Sohl-Dickstein · Jürgen Schmidhuber 🔗 |
-
|
Agnostic Architecture for Heterogeneous Multi-Environment Reinforcement Learning ( Poster ) > link | Kukjin Kim · Changhee Joo 🔗 |
-
|
Agnostic Architecture for Heterogeneous Multi-Environment Reinforcement Learning ( Oral ) > link | Kukjin Kim · Changhee Joo 🔗 |
-
|
Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment ( Poster ) > link | Tianhao Wu · Banghua Zhu · Ruoyu Zhang · Zhaojin Wen · Kannan Ramchandran · Jiantao Jiao 🔗 |
-
|
Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment ( Oral ) > link | Tianhao Wu · Banghua Zhu · Ruoyu Zhang · Zhaojin Wen · Kannan Ramchandran · Jiantao Jiao 🔗 |
-
|
Target Rate Optimization: Avoiding Iterative Error Exploitation ( Poster ) > link | Braham Snyder · Amy Zhang · Yuke Zhu 🔗 |
-
|
Target Rate Optimization: Avoiding Iterative Error Exploitation ( Oral ) > link | Braham Snyder · Amy Zhang · Yuke Zhu 🔗 |
-
|
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning ( Poster ) > link | Ruizhe Shi · Yuyao Liu · Yanjie Ze · Simon Du · Huazhe Xu 🔗 |
-
|
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning ( Oral ) > link | Ruizhe Shi · Yuyao Liu · Yanjie Ze · Simon Du · Huazhe Xu 🔗 |
-
|
H-GAP: Humanoid Control with a Generalist Planner ( Poster ) > link | zhengyao Jiang · Yingchen Xu · Nolan Wagener · Yicheng Luo · Michael Janner · Edward Grefenstette · Tim Rocktäschel · Yuandong Tian 🔗 |
-
|
H-GAP: Humanoid Control with a Generalist Planner ( Oral ) > link | zhengyao Jiang · Yingchen Xu · Nolan Wagener · Yicheng Luo · Michael Janner · Edward Grefenstette · Tim Rocktäschel · Yuandong Tian 🔗 |
-
|
Reasoning about Action Preconditions with Programs ( Poster ) > link | Lajanugen Logeswaran · Sungryull Sohn · Yiwei Lyu · Anthony Liu · Dong-Ki Kim · Dongsub Shim · Moontae Lee · Honglak Lee 🔗 |
-
|
Reasoning about Action Preconditions with Programs ( Oral ) > link | Lajanugen Logeswaran · Sungryull Sohn · Yiwei Lyu · Anthony Liu · Dong-Ki Kim · Dongsub Shim · Moontae Lee · Honglak Lee 🔗 |
-
|
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View ( Poster ) > link | Raphael Schumann · Wanrong Zhu · Weixi Feng · Tsu-Jui Fu · Stefan Riezler · William Yang Wang 🔗 |
-
|
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View ( Oral ) > link | Raphael Schumann · Wanrong Zhu · Weixi Feng · Tsu-Jui Fu · Stefan Riezler · William Yang Wang 🔗 |
-
|
Chain of Code: Reasoning with a Language Model-Augmented Code Interpreter ( Poster ) > link | Chengshu Li · Jacky Liang · Fei Xia · Andy Zeng · Sergey Levine · Dorsa Sadigh · Karol Hausman · Xinyun Chen · Fei-Fei Li · brian ichter 🔗 |
-
|
Chain of Code: Reasoning with a Language Model-Augmented Code Interpreter ( Oral ) > link | Chengshu Li · Jacky Liang · Fei Xia · Andy Zeng · Sergey Levine · Dorsa Sadigh · Karol Hausman · Xinyun Chen · Fei-Fei Li · brian ichter 🔗 |
-
|
Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks ( Poster ) > link | 昊琦 袁 · Chi Zhang · Hongcheng Wang · Feiyang Xie · Penglin Cai · Hao Dong · Zongqing Lu 🔗 |
-
|
Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks ( Oral ) > link | 昊琦 袁 · Chi Zhang · Hongcheng Wang · Feiyang Xie · Penglin Cai · Hao Dong · Zongqing Lu 🔗 |
-
|
Language Agents as Digital Representatives in Collective Decision-Making ( Poster ) > link | Daniel Jarrett · Miruna Pislar · Michael Tessler · Michiel Bakker · Raphael Koster · Jan Balaguer · Romuald Elie · Christopher Summerfield · Andrea Tacchetti 🔗 |
-
|
Language Agents as Digital Representatives in Collective Decision-Making ( Oral ) > link | Daniel Jarrett · Miruna Pislar · Michael Tessler · Michiel Bakker · Raphael Koster · Jan Balaguer · Romuald Elie · Christopher Summerfield · Andrea Tacchetti 🔗 |
-
|
Selective Perception: Learning Concise State Descriptions for Language Model Actors ( Poster ) > link | Kolby T Nottingham · Yasaman Razeghi · Kyungmin Kim · JB Lanier · Pierre Baldi · Roy Fox · Sameer Singh 🔗 |
-
|
Selective Perception: Learning Concise State Descriptions for Language Model Actors ( Oral ) > link | Kolby T Nottingham · Yasaman Razeghi · Kyungmin Kim · JB Lanier · Pierre Baldi · Roy Fox · Sameer Singh 🔗 |
-
|
LASER: LLM Agent with State-Space Exploration for Web Navigation ( Poster ) > link | Kaixin Ma · Hongming Zhang · Hongwei Wang · Xiaoman Pan · Dong Yu 🔗 |
-
|
LASER: LLM Agent with State-Space Exploration for Web Navigation ( Oral ) > link | Kaixin Ma · Hongming Zhang · Hongwei Wang · Xiaoman Pan · Dong Yu 🔗 |
-
|
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning ( Poster ) > link | Zhaoyi Zhou · Chuning Zhu · Runlong Zhou · Qiwen Cui · Abhishek Gupta · Simon Du 🔗 |
-
|
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning ( Oral ) > link | Zhaoyi Zhou · Chuning Zhu · Runlong Zhou · Qiwen Cui · Abhishek Gupta · Simon Du 🔗 |
-
|
Towards End-to-End Embodied Decision Making with Multi-modal Large Language Model ( Poster ) > link | Liang Chen · Yichi Zhang · Shuhuai Ren · Haozhe Zhao · Zefan Cai · Yuchi Wang · Tianyu Liu · Baobao Chang 🔗 |
-
|
Towards End-to-End Embodied Decision Making with Multi-modal Large Language Model ( Oral ) > link | Liang Chen · Yichi Zhang · Shuhuai Ren · Haozhe Zhao · Zefan Cai · Yuchi Wang · Tianyu Liu · Baobao Chang 🔗 |
-
|
Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations ( Poster ) > link | Joey Hong · Sergey Levine · Anca Dragan 🔗 |
-
|
Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations ( Oral ) > link | Joey Hong · Sergey Levine · Anca Dragan 🔗 |
-
|
PASTA: Pretrained Action-State Transformer Agents ( Poster ) > link | Raphael Boige · Yannis Flet-Berliac · Arthur Flajolet · Guillaume Richard · Thomas PIERROT 🔗 |
-
|
PASTA: Pretrained Action-State Transformer Agents ( Oral ) > link | Raphael Boige · Yannis Flet-Berliac · Arthur Flajolet · Guillaume Richard · Thomas PIERROT 🔗 |
-
|
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control ( Poster ) > link | Longtao Zheng · Rundong Wang · Xinrun Wang · Bo An 🔗 |
-
|
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control ( Oral ) > link | Longtao Zheng · Rundong Wang · Xinrun Wang · Bo An 🔗 |
-
|
Building Cooperative Embodied Agents Modularly with Large Language Models ( Poster ) > link | Hongxin Zhang · Weihua Du · Jiaming Shan · Qinhong Zhou · Yilun Du · Josh Tenenbaum · Tianmin Shu · Chuang Gan 🔗 |
-
|
Building Cooperative Embodied Agents Modularly with Large Language Models ( Oral ) > link | Hongxin Zhang · Weihua Du · Jiaming Shan · Qinhong Zhou · Yilun Du · Josh Tenenbaum · Tianmin Shu · Chuang Gan 🔗 |
-
|
FoMo rewards: Casting foundation models as generic reward functions ( Poster ) > link | Ekdeep S Lubana · Pim de Haan · Taco Cohen · Johann Brehmer 🔗 |
-
|
FoMo rewards: Casting foundation models as generic reward functions ( Oral ) > link | Ekdeep S Lubana · Pim de Haan · Taco Cohen · Johann Brehmer 🔗 |
-
|
A Universal World Model Learned from Large Scale and Diverse Videos ( Poster ) > link | Hanchen Cui · Yang Gao 🔗 |
-
|
A Universal World Model Learned from Large Scale and Diverse Videos ( Oral ) > link | Hanchen Cui · Yang Gao 🔗 |
-
|
From Text to Tactic: Evaluating LLMs Playing the Game of Avalon ( Poster ) > link | Jonathan Light · Min Cai · Sheng Shen · Ziniu Hu 🔗 |
-
|
From Text to Tactic: Evaluating LLMs Playing the Game of Avalon ( Oral ) > link | Jonathan Light · Min Cai · Sheng Shen · Ziniu Hu 🔗 |
-
|
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment ( Poster ) > link | Tianwei Ni · Michel Ma · Benjamin Eysenbach · Pierre-Luc Bacon 🔗 |
-
|
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment ( Oral ) > link | Tianwei Ni · Michel Ma · Benjamin Eysenbach · Pierre-Luc Bacon 🔗 |
-
|
Self-Select: Optimizing Instruction Selection for Large Language Models ( Poster ) > link | Keshav Ramji · Alexander Kyimpopkin 🔗 |
-
|
Self-Select: Optimizing Instruction Selection for Large Language Models ( Oral ) > link | Keshav Ramji · Alexander Kyimpopkin 🔗 |
-
|
Benchmarking Large Language Models as AI Research Agents ( Poster ) > link | Qian Huang · Jian Vora · Percy Liang · Jure Leskovec 🔗 |
-
|
Benchmarking Large Language Models as AI Research Agents ( Oral ) > link | Qian Huang · Jian Vora · Percy Liang · Jure Leskovec 🔗 |
-
|
The Unsolved Challenges of LLMs in Open-Ended Web Tasks: A Case Study ( Poster ) > link |
12 presenters: Rim Assouel · Tom Marty · Massimo Caccia · Issam Hadj Laradji · Alexandre Drouin · Sai Rajeswar Mudumba · Hector Palacios · Quentin Cappart · David Vazquez · Nicolas Chapados · Maxime Gasse · Alexandre Lacoste |
-
|
The Unsolved Challenges of LLMs in Open-Ended Web Tasks: A Case Study ( Oral ) > link |
12 presenters: Rim Assouel · Tom Marty · Massimo Caccia · Issam Hadj Laradji · Alexandre Drouin · Sai Rajeswar Mudumba · Hector Palacios · Quentin Cappart · David Vazquez · Nicolas Chapados · Maxime Gasse · Alexandre Lacoste |
-
|
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models ( Poster ) > link | ZUXIN LIU · Jesse Zhang · Kavosh Asadi · Yao Liu · DING ZHAO · Shoham Sabach · Rasool Fakoor 🔗 |
-
|
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models ( Oral ) > link | ZUXIN LIU · Jesse Zhang · Kavosh Asadi · Yao Liu · DING ZHAO · Shoham Sabach · Rasool Fakoor 🔗 |
-
|
MMToM-QA: Multimodal Theory of Mind Question Answering ( Poster ) > link | Chuanyang Jin · Yutong Wu · Jing Cao · Jiannan Xiang · Yen-Ling Kuo · Zhiting Hu · Tomer Ullman · Antonio Torralba · Josh Tenenbaum · Tianmin Shu 🔗 |
-
|
MMToM-QA: Multimodal Theory of Mind Question Answering ( Oral ) > link | Chuanyang Jin · Yutong Wu · Jing Cao · Jiannan Xiang · Yen-Ling Kuo · Zhiting Hu · Tomer Ullman · Antonio Torralba · Josh Tenenbaum · Tianmin Shu 🔗 |
-
|
Compositional Foundation Models for Hierarchical Planning ( Poster ) > link | Anurag Ajay · Seungwook Han · Yilun Du · Shuang Li · Abhi Gupta · Tommi Jaakkola · Josh Tenenbaum · Leslie Kaelbling · Akash Srivastava · Pulkit Agrawal 🔗 |
-
|
Compositional Foundation Models for Hierarchical Planning ( Oral ) > link | Anurag Ajay · Seungwook Han · Yilun Du · Shuang Li · Abhi Gupta · Tommi Jaakkola · Josh Tenenbaum · Leslie Kaelbling · Akash Srivastava · Pulkit Agrawal 🔗 |
-
|
Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception ( Poster ) > link | Yunhao Yang · Cyrus Neary · Ufuk Topcu 🔗 |
-
|
Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception ( Oral ) > link | Yunhao Yang · Cyrus Neary · Ufuk Topcu 🔗 |
-
|
RF-POLICY: Rectified Flows are Computation-Adaptive Decision Makers ( Poster ) > link | Xixi Hu · Bo Liu · Xingchao Liu · Qiang Liu 🔗 |
-
|
RF-POLICY: Rectified Flows are Computation-Adaptive Decision Makers ( Oral ) > link | Xixi Hu · Bo Liu · Xingchao Liu · Qiang Liu 🔗 |
-
|
Creative Robot Tool Use with Large Language Models ( Poster ) > link | Mengdi Xu · Wenhao Yu · Peide Huang · Shiqi Liu · Xilun Zhang · Yaru Niu · Tingnan Zhang · Fei Xia · Jie Tan · DING ZHAO 🔗 |
-
|
Creative Robot Tool Use with Large Language Models ( Oral ) > link | Mengdi Xu · Wenhao Yu · Peide Huang · Shiqi Liu · Xilun Zhang · Yaru Niu · Tingnan Zhang · Fei Xia · Jie Tan · DING ZHAO 🔗 |
-
|
Investigating the Effectiveness of Self-critiquing in LLMs solving Planning Tasks ( Poster ) > link | Karthik Valmeekam · Matthew Marquez · Subbarao Kambhampati 🔗 |
-
|
Investigating the Effectiveness of Self-critiquing in LLMs solving Planning Tasks ( Oral ) > link | Karthik Valmeekam · Matthew Marquez · Subbarao Kambhampati 🔗 |
-
|
Capture the Flag: Uncovering Data Insights with Large Language Models ( Poster ) > link | Issam Hadj Laradji · Perouz Taslakian · Sai Rajeswar Mudumba · Valentina Zantedeschi · Alexandre Lacoste · Nicolas Chapados · David Vazquez · Chris Pal · Alexandre Drouin 🔗 |
-
|
Capture the Flag: Uncovering Data Insights with Large Language Models ( Oral ) > link | Issam Hadj Laradji · Perouz Taslakian · Sai Rajeswar Mudumba · Valentina Zantedeschi · Alexandre Lacoste · Nicolas Chapados · David Vazquez · Chris Pal · Alexandre Drouin 🔗 |
-
|
$S^2AC$: Energy-Based Reinforcement Learning with Stein Soft Actor Critic ( Poster ) > link | Safa Messaoud · Billel Mokeddem · Zhenghai Xue · Bo An · Haipeng Chen · Sanjay Chawla 🔗 |
-
|
$S^2AC$: Energy-Based Reinforcement Learning with Stein Soft Actor Critic ( Oral ) > link | Safa Messaoud · Billel Mokeddem · Zhenghai Xue · Bo An · Haipeng Chen · Sanjay Chawla 🔗 |
-
|
Ring Attention with Blockwise Transformers for Near-Infinite Context ( Poster ) > link | Hao Liu · Matei A Zaharia · Pieter Abbeel 🔗 |
-
|
Ring Attention with Blockwise Transformers for Near-Infinite Context ( Oral ) > link | Hao Liu · Matei A Zaharia · Pieter Abbeel 🔗 |
-
|
GPT4GEO: How a Language Model Sees the World’s Geography ( Poster ) > link | Jonathan Roberts · Timo Lüddecke · Sowmen Das · Kai Han · Samuel Albanie 🔗 |
-
|
GPT4GEO: How a Language Model Sees the World’s Geography ( Oral ) > link | Jonathan Roberts · Timo Lüddecke · Sowmen Das · Kai Han · Samuel Albanie 🔗 |
-
|
Vision-and-Language Navigation in Real World using Foundation Models ( Poster ) > link | Chengguang Xu · Hieu T. Nguyen · Christopher Amato · Lawson Wong 🔗 |
-
|
Vision-and-Language Navigation in Real World using Foundation Models ( Oral ) > link | Chengguang Xu · Hieu T. Nguyen · Christopher Amato · Lawson Wong 🔗 |
-
|
D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Robotic Manipulation ( Poster ) > link | Yixuan Wang · Zhuoran Li · Mingtong Zhang · Katherine Driggs-Campbell · Jiajun Wu · Fei-Fei Li · Yunzhu Li 🔗 |
-
|
D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Robotic Manipulation ( Oral ) > link | Yixuan Wang · Zhuoran Li · Mingtong Zhang · Katherine Driggs-Campbell · Jiajun Wu · Fei-Fei Li · Yunzhu Li 🔗 |
-
|
Zero-Shot Robotic Manipulation with Pre-Trained Image-Editing Diffusion Models ( Poster ) > link | Kevin Black · Mitsuhiko Nakamoto · Pranav Atreya · Homer Walke · Chelsea Finn · Aviral Kumar · Sergey Levine 🔗 |
-
|
Zero-Shot Robotic Manipulation with Pre-Trained Image-Editing Diffusion Models ( Oral ) > link | Kevin Black · Mitsuhiko Nakamoto · Pranav Atreya · Homer Walke · Chelsea Finn · Aviral Kumar · Sergey Levine 🔗 |
-
|
On the Tool Manipulation Capability of Open-sourced Large Language Models ( Poster ) > link | Qiantong Xu · Fenglu Hong · Bo Li · Changran Hu · Zhengyu Chen · Jian Zhang 🔗 |
-
|
On the Tool Manipulation Capability of Open-sourced Large Language Models ( Oral ) > link | Qiantong Xu · Fenglu Hong · Bo Li · Changran Hu · Zhengyu Chen · Jian Zhang 🔗 |
-
|
CodePlan: Repository-level Coding using LLMs and Planning ( Poster ) > link | Ramakrishna Bairi · Atharv Sonwane · Aditya Kanade · Vageesh C · Arun Iyer · Suresh Parthasarathy · Sriram Rajamani · B. Ashok · Shashank Shet 🔗 |
-
|
CodePlan: Repository-level Coding using LLMs and Planning ( Oral ) > link | Ramakrishna Bairi · Atharv Sonwane · Aditya Kanade · Vageesh C · Arun Iyer · Suresh Parthasarathy · Sriram Rajamani · B. Ashok · Shashank Shet 🔗 |
-
|
Exploration with Principles for Diverse AI Supervision ( Poster ) > link | Hao Liu · Matei A Zaharia · Pieter Abbeel 🔗 |
-
|
Exploration with Principles for Diverse AI Supervision ( Oral ) > link | Hao Liu · Matei A Zaharia · Pieter Abbeel 🔗 |
-
|
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents ( Poster ) > link | Zhihan Liu · Hao Hu · Shenao Zhang · Hongyi Guo · Shuqi Ke · Boyi Liu · Zhaoran Wang 🔗 |
-
|
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents ( Oral ) > link | Zhihan Liu · Hao Hu · Shenao Zhang · Hongyi Guo · Shuqi Ke · Boyi Liu · Zhaoran Wang 🔗 |
-
|
AdaPlanner: Adaptive Planning from Feedback with Language Models ( Poster ) > link | Haotian Sun · Yuchen Zhuang · Lingkai Kong · Bo Dai · Chao Zhang 🔗 |
-
|
AdaPlanner: Adaptive Planning from Feedback with Language Models ( Oral ) > link | Haotian Sun · Yuchen Zhuang · Lingkai Kong · Bo Dai · Chao Zhang 🔗 |
-
|
Language Model Agents Suffer from Compositional Decision Making ( Poster ) > link | Hiroki Furuta · Yutaka Matsuo · Aleksandra Faust · Izzeddin Gur 🔗 |
-
|
Language Model Agents Suffer from Compositional Decision Making ( Oral ) > link | Hiroki Furuta · Yutaka Matsuo · Aleksandra Faust · Izzeddin Gur 🔗 |
-
|
Semantically-Driven Object Search Using Partially Observed 3D Scene Graphs ( Poster ) > link | Isaac Remy · Abhishek Gupta · Karen Leung 🔗 |
-
|
Semantically-Driven Object Search Using Partially Observed 3D Scene Graphs ( Oral ) > link | Isaac Remy · Abhishek Gupta · Karen Leung 🔗 |
-
|
Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking ( Poster ) > link | Byoungjip Kim · Youngsoo Jang · Lajanugen Logeswaran · Geon-Hyeong Kim · Yu Jin Kim · Honglak Lee · Moontae Lee 🔗 |
-
|
Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking ( Oral ) > link | Byoungjip Kim · Youngsoo Jang · Lajanugen Logeswaran · Geon-Hyeong Kim · Yu Jin Kim · Honglak Lee · Moontae Lee 🔗 |
-
|
$\texttt{PREMIER-TACO}$ is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss ( Poster ) > link | Ruijie Zheng · Yongyuan Liang · Xiyao Wang · Shuang Ma · Hal Daumé III · Huazhe Xu · John Langford · Praveen Palanisamy · Kalyan Basu · Furong Huang 🔗 |
-
|
$\texttt{PREMIER-TACO}$ is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss ( Oral ) > link | Ruijie Zheng · Yongyuan Liang · Xiyao Wang · Shuang Ma · Hal Daumé III · Huazhe Xu · John Langford · Praveen Palanisamy · Kalyan Basu · Furong Huang 🔗 |
-
|
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks ( Poster ) > link | Murtaza Dalal · Tarun Chiruvolu · Devendra Singh Chaplot · Russ Salakhutdinov 🔗 |
-
|
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks ( Oral ) > link | Murtaza Dalal · Tarun Chiruvolu · Devendra Singh Chaplot · Russ Salakhutdinov 🔗 |
-
|
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction ( Poster ) > link | Seohong Park · Oleh Rybkin · Sergey Levine 🔗 |
-
|
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction ( Oral ) > link | Seohong Park · Oleh Rybkin · Sergey Levine 🔗 |
-
|
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making ( Poster ) > link | Jeonghye Kim · Suyoung Lee · Woojun Kim · Youngchul Sung 🔗 |
-
|
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making ( Oral ) > link | Jeonghye Kim · Suyoung Lee · Woojun Kim · Youngchul Sung 🔗 |
-
|
Reward Model Ensembles Help Mitigate Overoptimization ( Poster ) > link | Thomas Coste · Usman Anwar · Robert Kirk · David Krueger 🔗 |
-
|
Reward Model Ensembles Help Mitigate Overoptimization ( Oral ) > link | Thomas Coste · Usman Anwar · Robert Kirk · David Krueger 🔗 |
-
|
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy ( Poster ) > link | Zichen "Charles" Zhang · Yunshuang Li · Osbert Bastani · Abhishek Gupta · Dinesh Jayaraman · Jason Ma · Luca Weihs 🔗 |
-
|
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy ( Oral ) > link | Zichen "Charles" Zhang · Yunshuang Li · Osbert Bastani · Abhishek Gupta · Dinesh Jayaraman · Jason Ma · Luca Weihs 🔗 |
-
|
Fast Imitation via Behavior Foundation Models ( Poster ) > link | Matteo Pirotta · Andrea Tirinzoni · Ahmed Touati · Alessandro Lazaric · Yann Ollivier 🔗 |
-
|
Fast Imitation via Behavior Foundation Models ( Oral ) > link | Matteo Pirotta · Andrea Tirinzoni · Ahmed Touati · Alessandro Lazaric · Yann Ollivier 🔗 |
-
|
O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models ( Poster ) > link | Yuchen Xiao · Yanchao Sun · Mengda Xu · Udari Madhushani · Jared Vann · Deepeka Garg · Sumitra Ganesh 🔗 |
-
|
O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models ( Oral ) > link | Yuchen Xiao · Yanchao Sun · Mengda Xu · Udari Madhushani · Jared Vann · Deepeka Garg · Sumitra Ganesh 🔗 |
-
|
Robotic Offline RL from Internet Videos via Value-Function Pre-Training ( Poster ) > link | Chethan Bhateja · Derek Guo · Dibya Ghosh · Anikait Singh · Manan Tomar · Quan Vuong · Yevgen Chebotar · Sergey Levine · Aviral Kumar 🔗 |
-
|
Robotic Offline RL from Internet Videos via Value-Function Pre-Training ( Oral ) > link | Chethan Bhateja · Derek Guo · Dibya Ghosh · Anikait Singh · Manan Tomar · Quan Vuong · Yevgen Chebotar · Sergey Levine · Aviral Kumar 🔗 |
-
|
RoboVQA: Multimodal Long-Horizon Reasoning for Robotics ( Poster ) > link |
20 presenters: Pierre Sermanet · Tianli Ding · Jeffrey Zhao · Fei Xia · Debidatta Dwibedi · Keerthana Gopalakrishnan · Gabriel Dulac-Arnold · sharath maddineni · Nikhil Joshi · Pete Florence · Wei Han · Robert Baruch · Yao Lu · Suvir Mirchandani · Peng Xu · Pannag Sanketi · Karol Hausman · Izhak Shafran · brian ichter · Yuan Cao |
-
|
RoboVQA: Multimodal Long-Horizon Reasoning for Robotics ( Oral ) > link |
20 presenters: Pierre Sermanet · Tianli Ding · Jeffrey Zhao · Fei Xia · Debidatta Dwibedi · Keerthana Gopalakrishnan · Gabriel Dulac-Arnold · sharath maddineni · Nikhil Joshi · Pete Florence · Wei Han · Robert Baruch · Yao Lu · Suvir Mirchandani · Peng Xu · Pannag Sanketi · Karol Hausman · Izhak Shafran · brian ichter · Yuan Cao |
-
|
Importance of Directional Feedback for LLM-based Optimizers ( Poster ) > link | Allen Nie · Ching-An Cheng · Andrey Kolobov · Adith Swaminathan 🔗 |
-
|
Importance of Directional Feedback for LLM-based Optimizers ( Oral ) > link | Allen Nie · Ching-An Cheng · Andrey Kolobov · Adith Swaminathan 🔗 |
-
|
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft ( Poster ) > link | Shalev Lifshitz · Keiran Paster · Harris Chan · Jimmy Ba · Sheila McIlraith 🔗 |
-
|
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft ( Oral ) > link | Shalev Lifshitz · Keiran Paster · Harris Chan · Jimmy Ba · Sheila McIlraith 🔗 |
-
|
GPT-Driver: Learning to Drive with GPT ( Poster ) > link | Jiageng Mao · Yuxi Qian · Hang Zhao · Yue Wang 🔗 |
-
|
GPT-Driver: Learning to Drive with GPT ( Oral ) > link | Jiageng Mao · Yuxi Qian · Hang Zhao · Yue Wang 🔗 |
-
|
GPT-4 Doesn’t Know It’s Wrong: An Analysis of Iterative Prompting for Reasoning Problems ( Poster ) > link | Kaya Stechly · Matthew Marquez · Subbarao Kambhampati 🔗 |
-
|
GPT-4 Doesn’t Know It’s Wrong: An Analysis of Iterative Prompting for Reasoning Problems ( Oral ) > link | Kaya Stechly · Matthew Marquez · Subbarao Kambhampati 🔗 |
-
|
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training ( Poster ) > link | Xidong Feng · Ziyu Wan · Muning Wen · Ying Wen · Weinan Zhang · Jun Wang 🔗 |
-
|
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training ( Oral ) > link | Xidong Feng · Ziyu Wan · Muning Wen · Ying Wen · Weinan Zhang · Jun Wang 🔗 |
-
|
Voyager: An Open-Ended Embodied Agent with Large Language Models ( Poster ) > link | Guanzhi Wang · Yuqi Xie · Yunfan Jiang · Ajay Mandlekar · Chaowei Xiao · Yuke Zhu · Linxi Fan · Animashree Anandkumar 🔗 |
-
|
Voyager: An Open-Ended Embodied Agent with Large Language Models ( Oral ) > link | Guanzhi Wang · Yuqi Xie · Yunfan Jiang · Ajay Mandlekar · Chaowei Xiao · Yuke Zhu · Linxi Fan · Animashree Anandkumar 🔗 |
-
|
Double Policy Estimation for Importance Sampling in Sequence Modeling-Based Reinforcement Learning ( Poster ) > link | Hanhan Zhou · Tian Lan · Vaneet Aggarwal 🔗 |
-
|
Double Policy Estimation for Importance Sampling in Sequence Modeling-Based Reinforcement Learning ( Oral ) > link | Hanhan Zhou · Tian Lan · Vaneet Aggarwal 🔗 |
-
|
Strategic Reasoning with Language Models ( Poster ) > link | Kanishk Gandhi · Dorsa Sadigh · Noah Goodman 🔗 |
-
|
Strategic Reasoning with Language Models ( Oral ) > link | Kanishk Gandhi · Dorsa Sadigh · Noah Goodman 🔗 |
-
|
LLMs-augmented Contextual Bandit ( Poster ) > link | Ali Baheri · Cecilia Alm 🔗 |
-
|
LLMs-augmented Contextual Bandit ( Oral ) > link | Ali Baheri · Cecilia Alm 🔗 |
-
|
N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics ( Poster ) > link | Sajad Mousavi · Ricardo Luna Gutierrez · Desik Rengarajan · Vineet Gundecha · Ashwin Ramesh Babu · Avisek Naug · Antonio Guillen-Perez · Soumyendu Sarkar 🔗 |
-
|
N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics ( Oral ) > link | Sajad Mousavi · Ricardo Luna Gutierrez · Desik Rengarajan · Vineet Gundecha · Ashwin Ramesh Babu · Avisek Naug · Antonio Guillen-Perez · Soumyendu Sarkar 🔗 |
-
|
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models ( Poster ) > link | Wenlong Huang · Chen Wang · Ruohan Zhang · Yunzhu Li · Jiajun Wu · Fei-Fei Li 🔗 |
-
|
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models ( Oral ) > link | Wenlong Huang · Chen Wang · Ruohan Zhang · Yunzhu Li · Jiajun Wu · Fei-Fei Li 🔗 |
-
|
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning ( Poster ) > link | Juan Rocamonde · Victoriano Montesinos · Elvis Nava · Ethan Perez · David Lindner 🔗 |
-
|
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning ( Oral ) > link | Juan Rocamonde · Victoriano Montesinos · Elvis Nava · Ethan Perez · David Lindner 🔗 |
-
|
Goal Masked Diffusion Policies for Unified Navigation and Exploration ( Poster ) > link | Ajay Sridhar · Dhruv Shah · Catherine Glossop · Sergey Levine 🔗 |
-
|
Goal Masked Diffusion Policies for Unified Navigation and Exploration ( Oral ) > link | Ajay Sridhar · Dhruv Shah · Catherine Glossop · Sergey Levine 🔗 |
-
|
Learning to Solve New Sequential Decision-Making Tasks with In-Context Learning ( Poster ) > link | Sharath Chandra Raparthy · Eric Hambro · Robert Kirk · Mikael Henaff · Roberta Raileanu 🔗 |
-
|
Learning to Solve New Sequential Decision-Making Tasks with In-Context Learning ( Oral ) > link | Sharath Chandra Raparthy · Eric Hambro · Robert Kirk · Mikael Henaff · Roberta Raileanu 🔗 |
-
|
ExPT: Scaling Foundation Models for Experimental Design via Synthetic Pretraining ( Poster ) > link | Tung Nguyen · Sudhanshu Agrawal · Aditya Grover 🔗 |
-
|
ExPT: Scaling Foundation Models for Experimental Design via Synthetic Pretraining ( Oral ) > link | Tung Nguyen · Sudhanshu Agrawal · Aditya Grover 🔗 |
-
|
Natural Language-based State Representation in Deep Reinforcement Learning (Poster) | Masudur Rahman · Yexiang Xue
Natural Language-based State Representation in Deep Reinforcement Learning (Oral) | Masudur Rahman · Yexiang Xue
AVIS: Autonomous Visual Information Seeking with Large Language Model Agent (Poster) | Ziniu Hu
AVIS: Autonomous Visual Information Seeking with Large Language Model Agent (Oral) | Ziniu Hu
Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting (Poster) | Xinlu Zhang · Shiyang Li · Xianjun Yang · Chenxin Tian · Yao Qin · Linda Petzold
Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting (Oral) | Xinlu Zhang · Shiyang Li · Xianjun Yang · Chenxin Tian · Yao Qin · Linda Petzold
LLM Augmented Hierarchical Agents (Poster) | Bharat Prakash · Tim Oates · Tinoosh Mohsenin
LLM Augmented Hierarchical Agents (Oral) | Bharat Prakash · Tim Oates · Tinoosh Mohsenin
TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents (Poster) | Jingqing Ruan · YiHong Chen · Bin Zhang · Zhiwei Xu · Tianpeng Bao · du qing · shi shiwei · Hangyu Mao · Xingyu Zeng · Rui Zhao
TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents (Oral) | Jingqing Ruan · YiHong Chen · Bin Zhang · Zhiwei Xu · Tianpeng Bao · du qing · shi shiwei · Hangyu Mao · Xingyu Zeng · Rui Zhao
$\mathcal{B}$-Coder: On Value-Based Deep Reinforcement Learning for Program Synthesis (Poster) | Zishun Yu · Yunzhe Tao · Liyu Chen · Tao Sun · Hongxia Yang
$\mathcal{B}$-Coder: On Value-Based Deep Reinforcement Learning for Program Synthesis (Oral) | Zishun Yu · Yunzhe Tao · Liyu Chen · Tao Sun · Hongxia Yang
TD-MPC2: Scalable, Robust World Models for Continuous Control (Poster) | Nicklas Hansen · Hao Su · Xiaolong Wang
TD-MPC2: Scalable, Robust World Models for Continuous Control (Oral) | Nicklas Hansen · Hao Su · Xiaolong Wang
Scaling Offline Q-Learning with Vision Transformers (Poster) | Yingjie Miao · Jordi Orbay · Rishabh Agarwal · Aviral Kumar · George Tucker · Aleksandra Faust
Scaling Offline Q-Learning with Vision Transformers (Oral) | Yingjie Miao · Jordi Orbay · Rishabh Agarwal · Aviral Kumar · George Tucker · Aleksandra Faust
Identifying the Risks of LM Agents with an LM-Emulated Sandbox (Poster) | Yangjun Ruan · Honghua Dong · Andrew Wang · Silviu Pitis · Yongchao Zhou · Jimmy Ba · Yann Dubois · Chris Maddison · Tatsunori Hashimoto
Identifying the Risks of LM Agents with an LM-Emulated Sandbox (Oral) | Yangjun Ruan · Honghua Dong · Andrew Wang · Silviu Pitis · Yongchao Zhou · Jimmy Ba · Yann Dubois · Chris Maddison · Tatsunori Hashimoto
Using Large Language Models for Hyperparameter Optimization (Poster) | Michael Zhang · Nishkrit Desai · Juhan Bae · Jonathan Lorraine · Jimmy Ba
Using Large Language Models for Hyperparameter Optimization (Oral) | Michael Zhang · Nishkrit Desai · Juhan Bae · Jonathan Lorraine · Jimmy Ba
Large Language Models as Commonsense Knowledge for Large-Scale Task Planning (Poster) | Zirui Zhao · Wee Sun Lee · David Hsu
Large Language Models as Commonsense Knowledge for Large-Scale Task Planning (Oral) | Zirui Zhao · Wee Sun Lee · David Hsu
Linear diffusion models meet contextual bandits with large action spaces (Poster) | Imad Aouali
Linear diffusion models meet contextual bandits with large action spaces (Oral) | Imad Aouali
Policy-Gradient Training of Language Models for Ranking (Poster) | Ge Gao · Jonathan Chang · Claire Cardie · Kianté Brantley · Thorsten Joachims
Policy-Gradient Training of Language Models for Ranking (Oral) | Ge Gao · Jonathan Chang · Claire Cardie · Kianté Brantley · Thorsten Joachims
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining (Poster) | Licong Lin · Yu Bai · Song Mei
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining (Oral) | Licong Lin · Yu Bai · Song Mei
Vision-Language Models Provide Promptable Representations for Reinforcement Learning (Poster) | William Chen · Oier Mees · Aviral Kumar · Sergey Levine
Vision-Language Models Provide Promptable Representations for Reinforcement Learning (Oral) | William Chen · Oier Mees · Aviral Kumar · Sergey Levine
Motif: Intrinsic Motivation from Artificial Intelligence Feedback (Poster) | Martin Klissarov · Pierluca D'Oro · Shagun Sodhani · Roberta Raileanu · Pierre-Luc Bacon · Pascal Vincent · Amy Zhang · Mikael Henaff
Motif: Intrinsic Motivation from Artificial Intelligence Feedback (Oral) | Martin Klissarov · Pierluca D'Oro · Shagun Sodhani · Roberta Raileanu · Pierre-Luc Bacon · Pascal Vincent · Amy Zhang · Mikael Henaff
Eureka: Human-Level Reward Design via Coding Large Language Models (Poster) | Jason Ma · William Liang · Guanzhi Wang · De-An Huang · Osbert Bastani · Dinesh Jayaraman · Yuke Zhu · Linxi Fan · Animashree Anandkumar
Eureka: Human-Level Reward Design via Coding Large Language Models (Oral) | Jason Ma · William Liang · Guanzhi Wang · De-An Huang · Osbert Bastani · Dinesh Jayaraman · Yuke Zhu · Linxi Fan · Animashree Anandkumar
Language Conditioned Semantic Search Based Policy for Robotic Manipulation Tasks (Poster) | Jannik Sheikh · Andrew Melnik · Gora Chand Nandi · Robert Haschke
Language Conditioned Semantic Search Based Policy for Robotic Manipulation Tasks (Oral) | Jannik Sheikh · Andrew Melnik · Gora Chand Nandi · Robert Haschke
NexusRaven: A Commercially-Permissive Language Model for Function Calling (Poster) | Venkat Krishna Srinivasan · Zhen Dong · Banghua Zhu · Brian Yu · Hanzi Mao · Damon Mosk-Aoyama · Kurt Keutzer · Jiantao Jiao · Jian Zhang
NexusRaven: A Commercially-Permissive Language Model for Function Calling (Oral) | Venkat Krishna Srinivasan · Zhen Dong · Banghua Zhu · Brian Yu · Hanzi Mao · Damon Mosk-Aoyama · Kurt Keutzer · Jiantao Jiao · Jian Zhang
Mitigating Generative Agent Social Dilemmas (Poster) | Julian Yocum · Phillip Christoffersen · Mehul Damani · Justin Svegliato · Dylan Hadfield-Menell · Stuart J Russell
Mitigating Generative Agent Social Dilemmas (Oral) | Julian Yocum · Phillip Christoffersen · Mehul Damani · Justin Svegliato · Dylan Hadfield-Menell · Stuart J Russell
In-Context Multi-Armed Bandits via Supervised Pretraining (Poster) | Fred Zhang · Jiaxin Ye · Zhuoran Yang
In-Context Multi-Armed Bandits via Supervised Pretraining (Oral) | Fred Zhang · Jiaxin Ye · Zhuoran Yang
GROOT: Learning to Follow Instructions by Watching Gameplay Videos (Poster) | Shaofei Cai · Bowei Zhang · Zihao Wang · Xiaojian (Shawn) Ma · Anji Liu · Yitao Liang
GROOT: Learning to Follow Instructions by Watching Gameplay Videos (Oral) | Shaofei Cai · Bowei Zhang · Zihao Wang · Xiaojian (Shawn) Ma · Anji Liu · Yitao Liang
Asking Clarifying Questions using Language Models and Probabilistic Reasoning (Poster) | Top Piriyakulkij · Volodymyr Kuleshov · Kevin Ellis
Asking Clarifying Questions using Language Models and Probabilistic Reasoning (Oral) | Top Piriyakulkij · Volodymyr Kuleshov · Kevin Ellis
Pre-Training and Fine-Tuning Generative Flow Networks (Poster) | Ling Pan · Moksh Jain · Kanika Madan · Yoshua Bengio
Pre-Training and Fine-Tuning Generative Flow Networks (Oral) | Ling Pan · Moksh Jain · Kanika Madan · Yoshua Bengio
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View. (Poster) | Raj Ghugare · Matthieu Geist · Glen Berseth · Benjamin Eysenbach
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View. (Oral) | Raj Ghugare · Matthieu Geist · Glen Berseth · Benjamin Eysenbach
Confronting Reward Model Overoptimization with Constrained RLHF (Poster) | Ted Moskovitz · Aaditya Singh · DJ Strouse · Tuomas Sandholm · Russ Salakhutdinov · Anca Dragan · Stephen McAleer
Confronting Reward Model Overoptimization with Constrained RLHF (Oral) | Ted Moskovitz · Aaditya Singh · DJ Strouse · Tuomas Sandholm · Russ Salakhutdinov · Anca Dragan · Stephen McAleer