Workshop
Workshop on Open-World Agents: Synergizing Reasoning and Decision-Making in Open-World Environments (OWA-2024)
Xiaojian (Shawn) Ma · Siyuan Qi · Hangxin Liu · Zihao Wang · Xue Feng · Shaofei Cai · Zhi Gao · Anji Liu · Yitao Liang · Yaodong Yang · Zilong Zheng · Qing Li · Siyuan Huang · Shuang Li · Ruiqi Gao · Dave Chen · Angel Chang · Song-Chun Zhu
East Meeting Room 1-3, Foyer
Sun 15 Dec, 9 a.m. PST
In recent years, AI has made significant strides across various domains, demonstrating capabilities that often surpass human performance on specific tasks. However, the real world presents challenges that go beyond single tasks, objectives, or predefined, static environments. We propose to consider open-world environments as the new habitat for AI agents: highly diverse and dynamic, fully interactive, teeming with infinite and creative tasks, and requiring continual learning and growth. Open-world agents are therefore expected to possess remarkable problem-solving capabilities across all cognitive functions, notably reasoning and decision-making, compared to specialized AI agents.
This workshop aims to bring together researchers from various fields to discuss emerging topics in reasoning and decision-making in open-world environments. This topic can be overly broad, but we are particularly interested in synergizing reasoning and decision-making, i.e., open-world agents that can simultaneously perform reasoning (e.g., QA, dialogue) and decision-making (e.g., planning and control), and in how such unification helps tackle the challenges the open world poses to both. To this end, the related fields include, but are not limited to, interleaved reasoning with decision-making, reasoning in embodied learning agents, LLM tool usage, reinforcement learning in open-world environments, open-vocabulary learning, continual learning, multi-agent learning, and emerging ethical considerations in open-world environments. Our objective is to foster collaboration and insights into addressing the scientific questions about developing open-world reasoning and decision-making agents.
Schedule
Sun 9:00 a.m. - 9:10 a.m. | Opening remarks ( Intro ) > SlidesLive Video | 🔗 |
Sun 9:10 a.m. - 9:40 a.m. | Invited talk: What's Missing for Robot Foundation Models ( Invited Talk ) > SlidesLive Video | Ted Xiao 🔗 |
Sun 9:40 a.m. - 10:10 a.m. | Invited talk: Structured Representations for Human-Centered Embodied AI ( Invited Talk ) > SlidesLive Video | Jiajun Wu 🔗 |
Sun 10:10 a.m. - 10:40 a.m. | OWA-2024 Panel: The Past, Present, and Future of Open-World Agents ( Panel Discussion ) > SlidesLive Video | Natasha Jaques · Tao Yu · John Langford · Ted Xiao 🔗 |
Sun 10:45 a.m. - 11:45 a.m. | Poster session 1 ( Poster Session ) > | 🔗 |
Sun 1:00 p.m. - 1:30 p.m. | Invited talk: Building AI Society with Foundation-Model Agents ( Invited Talk ) > SlidesLive Video | Zhenfei (Jeremy) Yin 🔗 |
Sun 1:30 p.m. - 2:00 p.m. | Invited talk: Generative World Modeling for Embodied Agents ( Invited Talk ) > SlidesLive Video | Sherry Yang 🔗 |
Sun 2:00 p.m. - 2:30 p.m. | Oral session 1 ( Oral Session ) > SlidesLive Video | Shaofei Cai · Logan Cross · Yongjun Cho 🔗 |
Sun 2:30 p.m. - 3:30 p.m. | Poster session 2 ( Poster Session ) > | 🔗 |
Sun 3:30 p.m. - 4:00 p.m. | Invited talk: Scaling Multimodal Computer Agents ( Invited Talk ) > SlidesLive Video | Tao Yu 🔗 |
Sun 4:00 p.m. - 4:30 p.m. | Invited talk: Social Reinforcement Learning for Coordination, Social Reasoning, and Online Adaptation ( Invited Talk ) > SlidesLive Video | Natasha Jaques 🔗 |
Sun 4:30 p.m. - 5:00 p.m. | Oral session 2 ( Oral Session ) > SlidesLive Video | Chen Wu · Kevin Qinghong Lin · Shengran Hu 🔗 |
Sun 5:00 p.m. - 5:15 p.m. | Closing remarks & Awards ( Closing Remarks ) > SlidesLive Video | 🔗 |
-
|
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting ( Oral ) > link | Shaofei Cai · Zihao Wang · Kewei Lian · Zhancun Mu · Xiaojian (Shawn) Ma · Anji Liu · Yitao Liang 🔗 |
-
|
ShowUI: One Vision-Language-Action Model for Generalist GUI Agent ( Oral ) > link | Kevin Qinghong Lin · Linjie Li · Difei Gao · Zhengyuan Yang · Zechen Bai · Weixian Lei · Lijuan Wang · Mike Zheng Shou 🔗 |
-
|
Integrating Visual and Linguistic Instructions for Context-Aware Navigation Agents ( Oral ) > link | SlidesLive Video | Suhwan Choi · Yongjun Cho · Minchan Kim · Jaeyoon Jung · Myunchul Joe · Park Yu Been · Minseo Kim · Sungwoong Kim · Sungjae Lee · WHISEONG PARK · Jiwan Chung · Youngjae Yu |
-
|
Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models ( Oral ) > link | SlidesLive Video | Logan Cross · Violet Xiang · Agam Bhatia · Dan Yamins · Nick Haber 🔗 |
-
|
Dissecting Adversarial Robustness of Multimodal LM Agents ( Oral ) > link | Chen Wu · Rishi Shah · Jing Yu Koh · Ruslan Salakhutdinov · Daniel Fried · Aditi Raghunathan 🔗 |
-
|
Automated Design of Agentic Systems ( Oral ) > link | SlidesLive Video | Shengran Hu · Cong Lu · Jeff Clune 🔗 |
-
|
A Simplified A Priori Theory Of Meaning, –Nature based AI ‘first principles’– ( Poster ) > link | Marcus Abundis 🔗 |
-
|
Variational Inequality Perspective and Optimizers for Multi-Agent Reinforcement Learning ( Poster ) > link | Baraah Adil Mohammed Sidahmed · Tatjana Chavdarova 🔗 |
-
|
REGENT: A Retrieval-Augmented Generalist Agent That Can Act In-Context in New Environments ( Poster ) > link | Kaustubh Sridhar · Souradeep Dutta · Dinesh Jayaraman · Insup Lee 🔗 |
-
|
MobileFlow: A Multimodal LLM For Mobile GUI Agent ( Poster ) > link | Songqin Nong · Jiali Zhu · Rui Wu · Jiongchao Jin · Shuo Shan · Xiutian Huang · Wenhao Xu 🔗 |
-
|
Simulating User Agents for Embodied Conversational AI ( Poster ) > link | Daniel Philipov · Vardhan Dongre · Gokhan Tur · Dilek Tur 🔗 |
-
|
Automating Thought of Search: A Journey Towards Soundness and Completeness ( Poster ) > link | Daniel Cao · Michael Katz · Harsha Kokel · Kavitha Srinivas · Shirin Sohrabi Araghi 🔗 |
-
|
Agents Thinking Fast and Slow: A Talker-Reasoner Architecture ( Poster ) > link | Konstantina Christakopoulou · Shibl Mourad · Maja Matarić 🔗 |
-
|
Improving Decision-Making in Open-World Agents with Conformal Prediction and Monty Hall ( Poster ) > link | Harit Vishwakarma · Alan Mishler · Thomas Cook · Niccolo Dalmasso · Natraj Raman · Sumitra Ganesh 🔗 |
-
|
AgentStudio: A Toolkit for Building General Virtual Agents ( Poster ) > link | Longtao Zheng · Zhiyuan Huang · Zhenghai Xue · Xinrun Wang · Bo An · Shuicheng Yan 🔗 |
-
|
SELFGOAL: Your Language Agents Already Know How to Achieve High-level Goals ( Poster ) > link | 睿涵 杨 · Jiangjie Chen · yikai zhang · Siyu Yuan · Chen · Kyle Richardson · Yanghua Xiao · Deqing Yang 🔗 |
-
|
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning ( Poster ) > link | Alicia Li · Nishanth Kumar · Tomás Lozano-Pérez · Leslie Kaelbling 🔗 |
-
|
Infer Human’s Intentions Before Following Natural Language Instructions ( Poster ) > link | Yanming Wan · Yue Wu · Yiping Wang · Jiayuan Mao · Natasha Jaques 🔗 |
-
|
HSCL-RL: Mitigating Hallucinations in Multimodal Large Language Models ( Poster ) > link | Zichen Song · 思潭 黄 🔗 |
-
|
Agent S: An Open Agentic Framework that Uses Computers Like a Human ( Poster ) > link | Saaket Agashe · Jiuzhou Han · Shuyu Gan · Jiachen Yang · Ang Li · Xin Eric Wang 🔗 |
-
|
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ( Poster ) > link | Thomas Schmied · Thomas Adler · Vihang Patil · Maximilian Beck · Korbinian Pöppel · Johannes Brandstetter · Günter Klambauer · Razvan Pascanu · Sepp Hochreiter 🔗 |
-
|
xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing ( Poster ) > link | Haoyi Niu · Qimao Chen · Tenglong Liu · Jianxiong Li · Guyue Zhou · Yi ZHANG · Jianming HU · Xianyuan Zhan 🔗 |
-
|
VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks ( Poster ) > link | Lawrence Jang · Yinheng Li · Charles Ding · Justin Lin · Paul Pu Liang · Dan Zhao · Rogerio Bonatti · Kazuhito Koishida 🔗 |
-
|
Inverse Attention Agent in Multi-Agent System ( Poster ) > link | Qian Long · Ruoyan Li · Minglu Zhao · Tao Gao · Demetri Terzopoulos 🔗 |
-
|
Language Models and Symbolic Planners can Infer Action Semantics through Environment Feedback ( Poster ) > link | Wang Zhu · Ishika Singh · Robin Jia · Jesse Thomason 🔗 |
-
|
FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL ( Poster ) > link | Woosung Koh · Wonbeen Oh · 시열 김 · Suhin Shin · Hyeongjin Kim · Jaein Jang · Junghyun Lee · Se-Young Yun 🔗 |
-
|
Thermal and Energy Management with Fan Control Through Offline Meta-Reinforcement Learning ( Poster ) > link | Shao-Yu Yen · Yen Lai · Fu-Chieh Chang · Pei-Yuan Wu 🔗 |
-
|
One-shot World Models Using a Transformer Trained on a Synthetic Prior ( Poster ) > link | Fabio Ferreira · Moreno Schlageter · Raghu Rajan · André Biedenkapp · Frank Hutter 🔗 |
-
|
Policy optimization to align the validity, coherence and efficiency of reasoning agents in multi-turn dialogues ( Poster ) > link | Jeremy Curuksu 🔗 |
-
|
Agentic Anomaly Detection for Shipping ( Poster ) > link | Alexander Timms · Abigail Langbridge · Fearghal O'Donncha 🔗 |
-
|
Words as Beacons: Guiding RL Agents with High-Level Language Prompts ( Poster ) > link | Unai Ruiz-Gonzalez · Alain Andres · Pedro Bascoy · Javier Del Ser 🔗 |
-
|
CRAB: Cross-platform agent benchmark for multi-modal embodied language model agents ( Poster ) > link | Tianqi Xu · Linyao Chen · Dai-Jie Wu · Yanjun Chen · Zecheng Zhang · Xiang Yao · Zhiqiang Xie · Yongchao Chen · Shilong Liu · Bochen Qian · Philip Torr · Bernard Ghanem · Guohao Li |
-
|
Towards Robust Estimation of Human Intention Hierarchy in Robot Teleoperation ( Poster ) > link | Nikki Lijing Kuang · Songpo Li · Soshi Iba 🔗 |
-
|
Advancing Agentic Systems: Dynamic Task Decomposition, Tool Integration and Evaluation using Novel Metrics and Dataset ( Poster ) > link | Shankar Kumar Jeyakumar · Alaa Ahmad · Adrian Gabriel 🔗 |
-
|
MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning ( Poster ) > link | Somnath Sendhil Kumar · Yash Gadhia · Tanuja Ganu · Akshay Nambi 🔗 |
-
|
ENHANCING DATA EFFICIENCY IN REINFORCEMENT LEARNING: A NOVEL IMAGINATION MECHANISM BASED ON MESH INFORMATION PROPAGATION ( Poster ) > link | Zihang Wang · Maowei Jiang · Pengyu Zeng · ruiqi li · Quangao Liu · Peter Búš 🔗 |
-
|
Towards Humanoid: Value-Driven Agent Modeling Based on Large Language Models ( Poster ) > link | Xuzheng Chen · Zhangshiyin · Guojie Song 🔗 |
-
|
SPA-BENCH: A COMPREHENSIVE BENCHMARK FOR SMARTPHONE AGENT EVALUATION ( Poster ) > link | Jingxuan Chen · Derek Yuen · Bin Xie · Yuhao Yang · Gongwei Chen · Zhihao Wu · Li Yixing · Xurui Zhou · Weiwen Liu · Shuai Wang · Rui Shao · Liqiang Nie · Yasheng Wang · Jianye Hao · Jun Wang · Kun Shao |
-
|
IDS-Agent: An LLM Agent for Explainable Intrusion Detection in IoT Networks ( Poster ) > link | Yanjie Li · Zhen Xiang · Nathaniel Bastian · Dawn Song · Bo Li 🔗 |
-
|
Learning Region-Word Alignment with Attentive Masking for Open-Vocabulary Object Detection ( Poster ) > link | Masoumeh Zareapoor · Pourya Shamsolmoali · Yue Lu 🔗 |
-
|
Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena ( Poster ) > link | Jiangjie Chen · Siyu Yuan · Rong Ye · Bodhisattwa Prasad Majumder · Kyle Richardson 🔗 |
-
|
OASIS: Open Agents Social Interaction Simulations on One Million Agents ( Poster ) > link | Ziyi Yang · Zaibin Zhang · Zirui Zheng · Yuxian Jiang · Ziyue Gan · Zhiyu Wang · Zijian Ling · Konisberg · Martz Ma · Bowen Dong · Prateek Gupta · Shuyue Hu · Zhenfei (Jeremy) Yin · Guohao Li · Xu Jia · Lijun Wang · Bernard Ghanem · Huchuan Lu · Wanli Ouyang · Yu Qiao · Philip Torr · Jing Shao |
-
|
Multimodal Auto Validation For Self-Refinement in Web Agents ( Poster ) > link | Ruhana Azam · Tamer Abuelsaad · Aditya Vempaty · Ashish Jagmohan 🔗 |
-
|
Cognitive Planning for Object Goal Navigation using Generative AI Models ( Poster ) > link | Arjun P S · Andrew Melnik · Gora Chand Nandi 🔗 |
-
|
LLM4Drive: A Survey of Large Language Models for Autonomous Driving ( Poster ) > link | Zhenjie Yang · Xiaosong Jia · Hongyang Li · Junchi Yan 🔗 |
-
|
Robust Offline Learning via Adversarial World Models ( Poster ) > link | Uljad Berdica · Kelvin Li · Michael Beukman · Alexander D. Goldie · Perla Maiolino · Jakob Foerster 🔗 |
-
|
First-Explore, then Exploit: Meta-Learning to Solve Hard Exploration-Exploitation Trade-Offs ( Poster ) > link | Ben Norman · Jeff Clune 🔗 |
-
|
CARD: Cross-modal Agent Framework for Generative and Editable Residential Design ( Poster ) > link | Pengyu Zeng · Maowei Jiang · Zihang Wang · Jizhizi Li · Jun Yin · Shuai Lu 🔗 |
-
|
Quality-Diversity Self-Play: Open-Ended Strategy Innovation via Foundation Models ( Poster ) > link | Aaron Dharna · Cong Lu · Jeff Clune 🔗 |
-
|
Do LLM Personas Dream of Bull Markets? Comparing Human and AI Investment Strategies Through the Lens of the Five-Factor Model ( Poster ) > link | Harris Borman · Anna Leontjeva · Luiz Pizzato · Max Kun Jiang · Dan Jermyn 🔗 |
-
|
RAR-Agent: Retrieval Augmented Reflection Learning from Scratch for Reasoning ( Poster ) > link | Shipeng Xie · Haichao Zhu · Da Chen 🔗 |
-
|
IDEA: Enhancing the Rule Learning Ability of Language Agent through Induction, Deduction, and Abduction ( Poster ) > link | Kaiyu He · Mian Zhang · Shuo yan · Peilin Wu · Zhiyu Chen 🔗 |
-
|
RefactorBench: Evaluating Stateful Reasoning In Language Agents Through Code ( Poster ) > link | Dhruv Gautam · Spandan Garg · Jinu Jang · Neel Sundaresan · Roshanak Zilouchian Moghaddam 🔗 |
-
|
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems ( Poster ) > link | Tamer Abuelsaad · Deepak Akkil · Prasenjit Dey · Ashish Jagmohan · Aditya Vempaty · Ravi Kokku 🔗 |
-
|
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting ( Poster ) > link | Shaofei Cai · Zihao Wang · Kewei Lian · Zhancun Mu · Xiaojian (Shawn) Ma · Anji Liu · Yitao Liang 🔗 |
-
|
RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents ( Poster ) > link | Tomoyuki Kagaya · Thong Yuan · Yuxuan Lou · Panasonic Karlekar Jayashree · Panasonic Sugiri Pranata · Akira Kinose · Koki Oguri · Felix Wick · Yang You 🔗 |
-
|
Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation ( Poster ) > link | Asad Ali Shahid 🔗 |
-
|
Cradle: Empowering Foundation Agents towards General Computer Control ( Poster ) > link | Weihao Tan · Wentao Zhang · Xinrun Xu · Haochong Xia · Gang Ding · Boyu Li · Bohan Zhou · Junpeng Yue · Jiechuan Jiang · Yewen Li · Ruyi An · Molei Qin · Chuqiao Zong · Longtao Zheng · YuJie Wu · Xiaoqiang Chai · Yifei Bi · Tianbao Xie · Pengjie Gu · Xiyun Li · Ceyao Zhang · Long Tian · Chaojie Wang · Xinrun Wang · Börje F. Karlsson · Bo An · Shuicheng Yan · Zongqing Lu |
-
|
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning ( Poster ) > link | Bryan Lincoln Marques de Oliveira · Bruno Brandão · Murilo da Luz · Luana Guedes Barros Martins · Telma de Lima Soares · Luckeciano Carvalho Melo 🔗 |
-
|
ShowUI: One Vision-Language-Action Model for Generalist GUI Agent ( Poster ) > link | Kevin Qinghong Lin · Linjie Li · Difei Gao · Zhengyuan Yang · Zechen Bai · Weixian Lei · Lijuan Wang · Mike Zheng Shou 🔗 |
-
|
Integrating Visual and Linguistic Instructions for Context-Aware Navigation Agents ( Poster ) > link | Suhwan Choi · Yongjun Cho · Minchan Kim · Jaeyoon Jung · Myunchul Joe · Park Yu Been · Minseo Kim · Sungwoong Kim · Sungjae Lee · WHISEONG PARK · Jiwan Chung · Youngjae Yu |
-
|
Infogent: An Agent-based Framework for Web Information Aggregation ( Poster ) > link | Revanth Gangi Reddy · Sagnik Mukherjee · Jeonghwan Kim · Zhenhailong Wang · Dilek Tur · Heng Ji 🔗 |
-
|
Towards Principled Representation Learning from Videos for Reinforcement Learning ( Poster ) > link | Dipendra Misra · Akanksha Saran · Tengyang Xie · Alex Lamb · John Langford 🔗 |
-
|
Fine-Tuning Web Agents: It Works, But It's Trickier Than You Think ( Poster ) > link | Massimo Caccia · Megh Thakkar · Léo Boisvert · Thibault de Chezelles · Alexandre Piche · Nicolas Chapados · Alexandre Drouin · Maxime Gasse · Alexandre Lacoste 🔗 |
-
|
Interactive Navigation of Quadruped Robots in Challenging Environments using Large Language Models ( Poster ) > link | Kangjie Zhou · Yao Mu · Pengying Wu · Han Gao · Chang Liu 🔗 |
-
|
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination ( Poster ) > link | Jianing Yang · Xuweiyi Chen · Nikhil Madaan · Madhavan Iyengar · Shengyi Qian · David Fouhey · Joyce Chai 🔗 |
-
|
DARD: A Multi-Agent Approach for Task-Oriented Dialog Systems ( Poster ) > link | Aman Gupta · Anirudh Ravichandran · Ziji Zhang · Swair Shah · Anurag Beniwal · Narayanan Sadagopan 🔗 |
-
|
RH20T-P: A Primitive-Level Robotic Manipulation Dataset Towards Composable Generalization Agents in Real-world Scenarios ( Poster ) > link | Zeren Chen · Zhelun Shi · Xiaoya Lu · Lehan He · Sucheng Qian · Zhenfei (Jeremy) Yin · Wanli Ouyang · Jing Shao · Yu Qiao · Cewu Lu · Lu Sheng |
-
|
Zero-shot Whole-Body Humanoid Control via Behavioral Foundation Models ( Poster ) > link | Andrea Tirinzoni · Ahmed Touati · Jesse Farebrother · Mateusz Guzek · Anssi Kanervisto · Yingchen Xu · Alessandro Lazaric · Matteo Pirotta 🔗 |
-
|
Lightweight Neural App Control ( Poster ) > link | Filippos Christianos · Georgios Papoudakis · Thomas Coste · Jianye Hao · Jun Wang · Kun Shao 🔗 |
-
|
Articulated Animal AI: An Environment for Animal-like Cognition in a Limbed Agent ( Poster ) > link | Jeremy Lucas · Isabeau Prémont-Schwarz 🔗 |
-
|
Planning as Inpainting: A Generative Framework for Realistic Embodied Path Planning ( Poster ) > link | Cheng-Fu Yang · Haoyang Xu · Te-Lin Wu · Xiaofeng Gao · Kai-Wei Chang · Feng Gao 🔗 |
-
|
An Efficient Open World Benchmark for Multi-Agent Reinforcement Learning ( Poster ) > link | Eric Ye · Natasha Jaques 🔗 |
-
|
MASAI: Modular Architecture for Software-engineering AI Agents ( Poster ) > link | Nalin Wadhwa · Atharv Sonwane · Daman Arora · Abhav Mehrotra · Saiteja Utpala · Ramakrishna Bairi · Aditya Kanade · Nagarajan Natarajan 🔗 |
-
|
Collective Wisdom in Language Models: Harnessing LLM-Swarm for Agile Project Management ( Poster ) > link | Tahmid Hussain · Tashin Ahmed · Shahedul Haque · Mohammad rifat ahmmad Rashid 🔗 |
-
|
Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models ( Poster ) > link | Logan Cross · Violet Xiang · Agam Bhatia · Dan Yamins · Nick Haber 🔗 |
-
|
Efficient Reinforcement Learning via Large Language Model-based Search ( Poster ) > link | Siddhant Bhambri · Amrita Bhattacharjee · huan liu · Subbarao Kambhampati 🔗 |
-
|
Dissecting Adversarial Robustness of Multimodal LM Agents ( Poster ) > link | Chen Wu · Rishi Shah · Jing Yu Koh · Ruslan Salakhutdinov · Daniel Fried · Aditi Raghunathan 🔗 |
-
|
Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy ( Poster ) > link | Zhenyu Guan · Xiangyu Kong · Fangwei Zhong · Yizhou Wang 🔗 |
-
|
FPGA-Gym: An FPGA-Accelerated Reinforcement Learning Environment Simulation Framework ( Poster ) > link | Jiayi Li · Hongxiao Zhao · Wenshuo Yue · Yihan Fu · Daijing Shi · Anjunyi Fan · Qinghao Wang · Yaodong Yang · Bonan Yan 🔗 |
-
|
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning ( Poster ) > link | Jianxiong Li · Zhihao Wang · Jinliang Zheng · Xiaoai Zhou · Guanming Wang · Guanglu Song · Yu Liu · Jingjing Liu · Ya-Qin Zhang · Junzhi Yu · Xianyuan Zhan |
-
|
DepsRAG: Towards Agentic Reasoning and Planning for Software Dependency Management ( Poster ) > link | Mohannad Alhanahnah · Yazan Boshmaf 🔗 |
-
|
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale ( Poster ) > link | Rogerio Bonatti · Dan Zhao · Dillon Dupont · Sara Abdali · Yinheng Li · Yadong Lu · Justin Wagle · Kazuhito Koishida · Arthur Bucker · Lawrence Jang · Zheng Hui |
-
|
Towards Autonomous Agents: Adaptive-planning, Reasoning, and Acting in Language Models ( Poster ) > link | Abhishek Dutta · Yen-Che Hsiao 🔗 |
-
|
LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench ( Poster ) > link | Karthik Valmeekam · Kaya Stechly · Subbarao Kambhampati 🔗 |
-
|
FEABench: Evaluating Language Models on Real World Physics Reasoning Ability ( Poster ) > link | Nayantara Mudur · Hao Cui · Subhashini Venugopalan · Paul Raccuglia · Michael Brenner · Peter Norgaard 🔗 |
-
|
SEAL: Suite for Evaluating API-use of LLMs ( Poster ) > link | Woojeong Kim · Ashish Jagmohan · Aditya Vempaty 🔗 |
-
|
Automated Design of Agentic Systems ( Poster ) > link | Shengran Hu · Cong Lu · Jeff Clune 🔗 |
-
|
StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows ( Poster ) > link | Yiran Wu · Tianwei Yue · Shaokun Zhang · Chi Wang · Qingyun Wu 🔗 |
-
|
Can VLMs Play Action Role-Playing Games? Take Black Myth Wukong as a Study Case ( Poster ) > link | Peng Chen · Pi Bu · Jun Song · Yuan Gao · Bo Zheng 🔗 |
-
|
From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents ( Poster ) > link | Nalin Tiwary · Vardhan Dongre · Sanil Chawla · Ashwin Lamani · Dilek Tur 🔗 |
-
|
LLM2Swarm: Robot Swarms that Responsively Reason, Plan, and Collaborate through LLMs ( Poster ) > link | Volker Strobel · Marco Dorigo · Mario Fritz 🔗 |
-
|
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning and Verification in Long-Horizon Generation ( Poster ) > link | Zihao Wang · Anji Liu · Haowei Lin · Jiaqi Li · Xiaojian (Shawn) Ma · Yitao Liang 🔗 |
-
|
Semantically Safe Robot Manipulation: From Semantic Scene Understanding to Motion Safeguards ( Poster ) > link | Lukas Brunke · Yanni Zhang · Ralf Römer · Jack Naimer · Nikola Staykov · SiQi Zhou · Angela Schoellig 🔗 |
-
|
What Do You Mean by "Open World"? ( Poster ) > link | Bowen Xu 🔗 |
-
|
Towards Automated Patent Workflows: AI-Orchestrated Multi-Agent Framework for Intellectual Property Management and Analysis ( Poster ) > link | Sagar Srinivas Sakhinana · Vijay sri vaikunth · Venkataramana Runkana 🔗 |
-
|
The Impact of Element Ordering on LM Agent Performance ( Poster ) > link | Wayne Chi · Ameet Talwalkar · Chris Donahue 🔗 |
-
|
In-Context Imitation Learning via Next-Token Prediction ( Poster ) > link | Max Fu · Huang Huang · Gaurav Datta · Lawrence Yunliang Chen · William Panitch · Fangchen Liu · Hui Li · Ken Goldberg 🔗 |
-
|
Generalized Open-World Semi-Supervised Object Detection ( Poster ) > link | Garvita Allabadi · Ana Lucic · Siddarth Aananth · Tiffany Yang · Yu-Xiong Wang · Vikram Adve 🔗 |
-
|
Are Expressive Models Truly Necessary for Offline RL? ( Poster ) > link | Guan Wang · Haoyi Niu · Jianxiong Li · Li Jiang · Jianming HU · Xianyuan Zhan 🔗 |
-
|
Chain-of-Imagination for Reliable Instruction Following in Decision Making ( Poster ) > link | Enshen Zhou · Yiran Qin · Zhenfei (Jeremy) Yin · Yuzhou Huang · Ruimao Zhang · Lu Sheng · Yu Qiao · Jing Shao 🔗 |
-
|
EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms ( Poster ) > link | Siyu Yuan · Kaitao Song · Jiangjie Chen · Xu Tan · Dongsheng Li · Deqing Yang 🔗 |
-
|
GTA: A Benchmark for General Tool Agents ( Poster ) > link | Jize Wang · Ma Zerun · Yining Li · Songyang Zhang · Cailian Chen · Kai Chen · Xinyi Le 🔗 |