Fri 8:25 a.m. - 8:30 a.m.
|
Opening Remarks
(
Opening Remarks
)
>
SlidesLive Video
|
🔗
|
Fri 8:30 a.m. - 9:00 a.m.
|
Tobias Gerstenberg
(
Invited Talk
)
>
SlidesLive Video
|
Tobias Gerstenberg
🔗
|
Fri 9:00 a.m. - 9:15 a.m.
|
ESCHER: ESCHEWING IMPORTANCE SAMPLING IN GAMES BY COMPUTING A HISTORY VALUE FUNCTION TO ESTIMATE REGRET
(
Poster
)
>
link
|
Stephen McAleer · Gabriele Farina · Marc Lanctot · Tuomas Sandholm
🔗
|
Fri 9:15 a.m. - 9:30 a.m.
|
Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
(
Poster
)
>
link
SlidesLive Video
|
Jason Yecheng Ma · Shagun Sodhani · Dinesh Jayaraman · Osbert Bastani · Vikash Kumar · Amy Zhang
🔗
|
Fri 9:30 a.m. - 9:45 a.m.
|
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
(
Poster
)
>
link
SlidesLive Video
|
Ruijie Zheng · Xiyao Wang · Huazhe Xu · Furong Huang
🔗
|
Fri 9:45 a.m. - 10:00 a.m.
|
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes
(
Poster
)
>
link
SlidesLive Video
|
Aviral Kumar · Rishabh Agarwal · XINYANG GENG · George Tucker · Sergey Levine
🔗
|
Fri 10:00 a.m. - 10:30 a.m.
|
Jakob Foerster
(
Invited Talk
)
>
SlidesLive Video
|
Jakob Foerster
🔗
|
Fri 11:00 a.m. - 11:30 a.m.
|
Scientific Experiments in Reinforcement Learning
(
Opinion Talk
)
>
SlidesLive Video
|
Scott Jordan
🔗
|
Fri 11:30 a.m. - 11:45 a.m.
|
Transformers are Sample-Efficient World Models
(
Poster
)
>
link
|
Vincent Micheli · Eloi Alonso · François Fleuret
🔗
|
Fri 11:45 a.m. - 12:00 p.m.
|
Scaling Laws for a Multi-Agent Reinforcement Learning Model
(
Poster
)
>
link
SlidesLive Video
|
Oren Neumann · Claudius Gros
🔗
|
Fri 12:00 p.m. - 12:30 p.m.
|
Natasha Jaques
(
Opinion Talk
)
>
SlidesLive Video
|
Natasha Jaques
🔗
|
Fri 1:30 p.m. - 2:00 p.m.
|
The World is not Uniformly Distributed; Important Implications for Deep RL
(
Opinion Talk
)
>
|
Stephanie Chan
🔗
|
Fri 2:00 p.m. - 2:30 p.m.
|
Amy Zhang
(
Invited Talk
)
>
|
Amy Zhang
🔗
|
Fri 3:00 p.m. - 3:30 p.m.
|
Igor Mordatch
(
Invited Talk
)
>
SlidesLive Video
|
Igor Mordatch
🔗
|
Fri 3:30 p.m. - 3:45 p.m.
|
John Schulman
(
Implementation Talk
)
>
SlidesLive Video
|
John Schulman
🔗
|
Fri 3:45 p.m. - 4:00 p.m.
|
Danijar Hafner
(
Implementation Talk
)
>
SlidesLive Video
|
Danijar Hafner
🔗
|
Fri 4:00 p.m. - 4:15 p.m.
|
Kristian Hartikainen
(
Implementation Talk
)
>
|
Kristian Hartikainen
🔗
|
Fri 4:15 p.m. - 4:30 p.m.
|
Ilya Kostrikov, Aviral Kumar
(
Implementation Talk
)
>
SlidesLive Video
|
Ilya Kostrikov · Aviral Kumar
🔗
|
Fri 4:30 p.m. - 5:30 p.m.
|
Panel Discussion
(
Panel Discussion
)
>
SlidesLive Video
|
🔗
|
Fri 5:30 p.m. - 5:35 p.m.
|
Closing Remarks
(
Closing Remarks
)
>
|
🔗
|
-
|
Compositional Task Generalization with Modular Successor Feature Approximators
(
Poster
)
>
link
|
Wilka Carvalho Carvalho
🔗
|
-
|
Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps
(
Poster
)
>
link
|
Sudeep Dasari · Vikash Kumar
🔗
|
-
|
Neural All-Pairs Shortest Path for Reinforcement Learning
(
Poster
)
>
link
|
Cristina Pinneri · Georg Martius · Andreas Krause
🔗
|
-
|
VI2N: A Network for Planning Under Uncertainty based on Value of Information
(
Poster
)
>
link
SlidesLive Video
|
Samantha Johnson · Michael Buice · Koosha Khalvati
🔗
|
-
|
Efficient Multi-Horizon Learning for Off-Policy Reinforcement Learning
(
Poster
)
>
link
|
Raja Farrukh Ali · Nasik Muhammad Nafi · Kevin Duong · William Hsu
🔗
|
-
|
Analyzing the Sensitivity to Policy-Value Decoupling in Deep Reinforcement Learning Generalization
(
Poster
)
>
link
SlidesLive Video
|
Nasik Muhammad Nafi · Raja Farrukh Ali · William Hsu
🔗
|
-
|
Lagrangian Model Based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Adithya Ramesh · Balaraman Ravindran
🔗
|
-
|
Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines
(
Poster
)
>
link
SlidesLive Video
|
Andrew Li · Zizhao Chen · Pashootan Vaezipoor · Toryn Klassen · Rodrigo Toro Icarte · Sheila McIlraith
🔗
|
-
|
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes
(
Poster
)
>
link
SlidesLive Video
|
Min Zhang · Hongyao Tang · Jianye Hao · YAN ZHENG
🔗
|
-
|
Informative rewards and generalization in curriculum learning
(
Poster
)
>
link
SlidesLive Video
|
Rahul Siripurapu · Vihang Patil · Kajetan Schweighofer · Marius-Constantin Dinu · Markus Holzleitner · Hamid Eghbalzadeh · Luis Ferro · Thomas Schmied · Michael Kopp · Sepp Hochreiter
🔗
|
-
|
Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation
(
Poster
)
>
link
|
Yuzhe Qin · Binghao Huang · Zhao-Heng Yin · Hao Su · Xiaolong Wang
🔗
|
-
|
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
(
Poster
)
>
link
SlidesLive Video
|
Abdus Salam Azad · Izzeddin Gur · Aleksandra Faust · Pieter Abbeel · Ion Stoica
🔗
|
-
|
The Emphatic Approach to Average-Reward Policy Evaluation
(
Poster
)
>
link
SlidesLive Video
|
Jiamin He · Yi Wan · Rupam Mahmood
🔗
|
-
|
Learning Exploration Policies with View-based Intrinsic Rewards
(
Poster
)
>
link
SlidesLive Video
|
Yijie Guo · Yao Fu · Run Peng · Honglak Lee
🔗
|
-
|
Scaling Covariance Matrix Adaptation MAP-Annealing to High-Dimensional Controllers
(
Poster
)
>
link
SlidesLive Video
|
Bryon Tjanaka · Matthew Fontaine · Aniruddha Kalkar · Stefanos Nikolaidis
🔗
|
-
|
Policy Aware Model Learning via Transition Occupancy Matching
(
Poster
)
>
link
SlidesLive Video
|
Jason Yecheng Ma · Kausik Sivakumar · Osbert Bastani · Dinesh Jayaraman
🔗
|
-
|
On The Fragility of Learned Reward Functions
(
Poster
)
>
link
SlidesLive Video
|
Lev McKinney · Yawen Duan · Adam Gleave · David Krueger
🔗
|
-
|
Temporary Goals for Exploration
(
Poster
)
>
link
SlidesLive Video
|
Haoyang Xu · Jimmy Ba · Silviu Pitis · Harris Chan
🔗
|
-
|
Revisiting Bellman Errors for Offline Model Selection
(
Poster
)
>
link
|
Joshua Zitovsky · Daniel de Marchi · Rishabh Agarwal · Michael Kosorok
🔗
|
-
|
Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Zhixuan Lin · Pierluca D'Oro · Evgenii Nikishin · Aaron Courville
🔗
|
-
|
What Makes Certain Pre-Trained Visual Representations Better for Robotic Learning?
(
Poster
)
>
link
|
Kyle Hsu · Tyler Lum · Ruohan Gao · Shixiang (Shane) Gu · Jiajun Wu · Chelsea Finn
🔗
|
-
|
Curiosity in Hindsight
(
Poster
)
>
link
SlidesLive Video
|
Daniel Jarrett · Corentin Tallec · Florent Altché · Thomas Mesnard · Remi Munos · Michal Valko
🔗
|
-
|
Train Offline, Test Online: A Real Robot Learning Benchmark
(
Poster
)
>
link
SlidesLive Video
|
12 presenters
Gaoyue Zhou · Victoria Dean · Mohan Kumar Srirama · Aravind Rajeswaran · Jyothish Pari · Kyle Hatch · Aryan Jain · Tianhe Yu · Pieter Abbeel · Lerrel Pinto · Chelsea Finn · Abhinav Gupta
🔗
|
-
|
A Framework for Predictable Actor-Critic Control
(
Poster
)
>
link
SlidesLive Video
|
Josiah Coad · James Ault · Jeff Hykin · Guni Sharon
🔗
|
-
|
Ensemble based uncertainty estimation with overlapping alternative predictions
(
Poster
)
>
link
SlidesLive Video
|
Dirk Eilers · Felippe Schmoeller Roza · Karsten Roscher
🔗
|
-
|
Offline Reinforcement Learning on Real Robot with Realistic Data Sources
(
Poster
)
>
link
SlidesLive Video
|
Gaoyue Zhou · Liyiming Ke · Siddhartha Srinivasa · Abhinav Gupta · Aravind Rajeswaran · Vikash Kumar
🔗
|
-
|
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
(
Poster
)
>
link
SlidesLive Video
|
JB Lanier · Stephen McAleer · Pierre Baldi · Roy Fox
🔗
|
-
|
Training Equilibria in Reinforcement Learning
(
Poster
)
>
link
|
Lauro Langosco · David Krueger · Adam Gleave
🔗
|
-
|
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games
(
Poster
)
>
link
|
Samuel Sokota · Ryan D'Orazio · J. Zico Kolter · Nicolas Loizou · Marc Lanctot · Ioannis Mitliagkas · Noam Brown · Christian Kroer
🔗
|
-
|
Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Ali Rahimi-Kalahroudi · Janarthanan Rajendran · Ida Momennejad · Harm Van Seijen · Sarath Chandar
🔗
|
-
|
Confidence-Conditioned Value Functions for Offline Reinforcement Learning
(
Poster
)
>
link
|
Joey Hong · Aviral Kumar · Sergey Levine
🔗
|
-
|
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
(
Poster
)
>
link
SlidesLive Video
|
Yanqiu Wu · Xinyue Chen · Che Wang · Yiming Zhang · Keith Ross
🔗
|
-
|
Integrating Episodic and Global Bonuses for Efficient Exploration
(
Poster
)
>
link
|
Mikael Henaff · Minqi Jiang · Roberta Raileanu
🔗
|
-
|
Deconfounded Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Risto Vuorio · Pim de Haan · Johann Brehmer · Hanno Ackermann · Daniel Dijkman · Taco Cohen
🔗
|
-
|
ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Eddy Hudson · Ishan Durugkar · Garrett Warnell · Peter Stone
🔗
|
-
|
Human-AI Coordination via Human-Regularized Search and Learning
(
Poster
)
>
link
SlidesLive Video
|
Hengyuan Hu · David Wu · Adam Lerer · Jakob Foerster · Noam Brown
🔗
|
-
|
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
(
Poster
)
>
link
|
Jesse Farebrother · Joshua Greaves · Rishabh Agarwal · Charline Le Lan · Ross Goroshin · Pablo Samuel Castro · Marc Bellemare
🔗
|
-
|
Return Augmentation gives Supervised RL Temporal Compositionality
(
Poster
)
>
link
SlidesLive Video
|
Keiran Paster · Silviu Pitis · Sheila McIlraith · Jimmy Ba
🔗
|
-
|
Design Process is a Reinforcement Learning Problem
(
Poster
)
>
link
SlidesLive Video
|
Reza Kakooee · Benjamin Dillenburger
🔗
|
-
|
Bayesian Q-learning With Imperfect Expert Demonstrations
(
Poster
)
>
link
SlidesLive Video
|
Fengdi Che · Xiru Zhu · Doina Precup · David Meger · Gregory Dudek
🔗
|
-
|
Efficient Deep Reinforcement Learning Requires Regulating Statistical Overfitting
(
Poster
)
>
link
|
Qiyang Li · Aviral Kumar · Ilya Kostrikov · Sergey Levine
🔗
|
-
|
Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning
(
Poster
)
>
link
|
Anikait Singh · Aviral Kumar · Frederik Ebert · Yanlai Yang · Chelsea Finn · Sergey Levine
🔗
|
-
|
Offline Reinforcement Learning from Heteroskedastic Data Via Support Constraints
(
Poster
)
>
link
|
Anikait Singh · Aviral Kumar · Quan Vuong · Yevgen Chebotar · Sergey Levine
🔗
|
-
|
Variance Double-Down: The Small Batch Size Anomaly in Multistep Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Johan Obando Ceron · Marc Bellemare · Pablo Samuel Castro
🔗
|
-
|
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems
(
Poster
)
>
link
|
Yihao Feng · Shentao Yang · Shujian Zhang · Jianguo Zhang · Caiming Xiong · Mingyuan Zhou · Huan Wang
🔗
|
-
|
In the ZONE: Measuring difficulty and progression in curriculum generation
(
Poster
)
>
link
SlidesLive Video
|
Rose Wang · Jesse Mu · Dilip Arumugam · Natasha Jaques · Noah Goodman
🔗
|
-
|
Better state exploration using action sequence equivalence
(
Poster
)
>
link
SlidesLive Video
|
Nathan Grinsztajn · Toby Johnstone · Johan Ferret · philippe preux
🔗
|
-
|
Deep Learning of Intrinsically Motivated Options in the Arcade Learning Environment
(
Poster
)
>
link
SlidesLive Video
|
Louis Bagot · Kevin Mets · Tom De Schepper · Steven Latre
🔗
|
-
|
Guiding Exploration Towards Impactful Actions
(
Poster
)
>
link
SlidesLive Video
|
Vaibhav Saxena · Jimmy Ba · Danijar Hafner
🔗
|
-
|
Domain Invariant Q-Learning for model-free robust continuous control under visual distractions
(
Poster
)
>
link
SlidesLive Video
|
Tom Dupuis · Jaonary Rabarisoa · Quoc Cuong PHAM · David Filliat
🔗
|
-
|
Multi-Agent Policy Transfer via Task Relationship Modeling
(
Poster
)
>
link
SlidesLive Video
|
Rong-Jun Qin · Feng Chen · Tonghan Wang · Lei Yuan · Xiaoran Wu · Yipeng Kang · Zongzhang Zhang · Chongjie Zhang · Yang Yu
🔗
|
-
|
Foundation Models for History Compression in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Fabian Paischer · Thomas Adler · Andreas Radler · Markus Hofmarcher · Sepp Hochreiter
🔗
|
-
|
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Chang Yang · RUIYU WANG · Xinrun Wang · Zhen Wang
🔗
|
-
|
Imitating Human Behaviour with Diffusion Models
(
Poster
)
>
link
SlidesLive Video
|
11 presenters
Tim Pearce · Tabish Rashid · Anssi Kanervisto · David Bignell · Mingfei Sun · Raluca Georgescu · Sergio Valcarcel Macua · Shan Zheng Tan · Ida Momennejad · Katja Hofmann · Sam Devlin
🔗
|
-
|
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
(
Poster
)
>
link
SlidesLive Video
|
Yifu Yuan · Jianye Hao · Fei Ni · Yao Mu · YAN ZHENG · Yujing Hu · Jinyi Liu · Yingfeng Chen · Changjie Fan
🔗
|
-
|
ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
(
Poster
)
>
link
SlidesLive Video
|
Pengyi Li · Hongyao Tang · Jianye Hao · YAN ZHENG · Xian Fu · Zhaopeng Meng
🔗
|
-
|
Quantization-aware Policy Distillation (QPD)
(
Poster
)
>
link
SlidesLive Video
|
Thomas Avé · Kevin Mets · Tom De Schepper · Steven Latre
🔗
|
-
|
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
(
Poster
)
>
link
SlidesLive Video
|
Michał Zawalski · Michał Tyrolski · Konrad Czechowski · Damian Stachura · Piotr Piękos · Tomasz Odrzygóźdź · Yuhuai Wu · Łukasz Kuciński · Piotr Miłoś
🔗
|
-
|
Cyclophobic Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Stefan Wagner · Peter Arndt · Jan Robine · Stefan Harmeling
🔗
|
-
|
AsymQ: Asymmetric Q-loss to mitigate overestimation bias in off-policy reinforcement learning
(
Poster
)
>
link
|
Qinsheng Zhang · Arjun Krishna · Sehoon Ha · Yongxin Chen
🔗
|
-
|
Fine-tuning Offline Policies with Optimistic Action Selection
(
Poster
)
>
link
SlidesLive Video
|
Max Sobol Mark · Ali Ghadirzadeh · Xi Chen · Chelsea Finn
🔗
|
-
|
SEM2: Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model
(
Poster
)
>
link
SlidesLive Video
|
Zeyu Gao · Yao Mu · Ruoyan Shen · Chen Chen · Yangang Ren · Jianyu Chen · Shengbo Li · Ping Luo · Yanfeng Lu
🔗
|
-
|
Policy Architectures for Compositional Generalization in Control
(
Poster
)
>
link
SlidesLive Video
|
Allan Zhou · Vikash Kumar · Chelsea Finn · Aravind Rajeswaran
🔗
|
-
|
Rethinking Learning Dynamics in RL using Adversarial Networks
(
Poster
)
>
link
SlidesLive Video
|
Ramnath Kumar · Tristan Deleu · Yoshua Bengio
🔗
|
-
|
Look Back When Surprised: Stabilizing Reverse Experience Replay for Neural Approximation
(
Poster
)
>
link
SlidesLive Video
|
Ramnath Kumar · Dheeraj Nagaraj
🔗
|
-
|
Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction
(
Poster
)
>
link
SlidesLive Video
|
Jiachen Li · Shuo Cheng · Zhenyu Liao · Huayan Wang · William Yang Wang · Qinxun Bai
🔗
|
-
|
Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization
(
Poster
)
>
link
SlidesLive Video
|
Stone Tao · Xiaochen Li · Tongzhou Mu · Zhiao Huang · Yuzhe Qin · Hao Su
🔗
|
-
|
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
(
Poster
)
>
link
|
Pierluca D'Oro · Max Schwarzer · Evgenii Nikishin · Pierre-Luc Bacon · Marc Bellemare · Aaron Courville
🔗
|
-
|
Adversarial Policies Beat Professional-Level Go AIs
(
Poster
)
>
link
SlidesLive Video
|
Tony Wang · Adam Gleave · Nora Belrose · Tom Tseng · Michael Dennis · Yawen Duan · Viktor Pogrebniak · Joseph Miller · Sergey Levine · Stuart J Russell
🔗
|
-
|
VARIATIONAL REPARAMETRIZED POLICY LEARNING WITH DIFFERENTIABLE PHYSICS
(
Poster
)
>
link
SlidesLive Video
|
Zhiao Huang · Litian Liang · Zhan Ling · Xuanlin Li · Chuang Gan · Hao Su
🔗
|
-
|
Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing
(
Poster
)
>
link
SlidesLive Video
|
Grace Zhang · Ayush Jain · Injune Hwang · Shao-Hua Sun · Joseph Lim
🔗
|
-
|
Contrastive Example-Based Control
(
Poster
)
>
link
|
Kyle Hatch · Sarthak J Shetty · Benjamin Eysenbach · Tianhe Yu · Rafael Rafailov · Russ Salakhutdinov · Sergey Levine · Chelsea Finn
🔗
|
-
|
A study of natural robustness of deep reinforcement learning algorithms towards adversarial perturbations
(
Poster
)
>
link
SlidesLive Video
|
Qisai Liu · Xian Yeow Lee · Soumik Sarkar
🔗
|
-
|
Multi-skill Mobile Manipulation for Object Rearrangement
(
Poster
)
>
link
SlidesLive Video
|
Jiayuan Gu · Devendra Singh Chaplot · Hao Su · Jitendra Malik
🔗
|
-
|
Visual Reinforcement Learning with Self-Supervised 3D Representations
(
Poster
)
>
link
SlidesLive Video
|
Yanjie Ze · Nicklas Hansen · Yinbo Chen · Mohit Jain · Xiaolong Wang
🔗
|
-
|
One-shot Visual Imitation via Attributed Waypoints and Demonstration Augmentation
(
Poster
)
>
link
SlidesLive Video
|
Matthew Chang · Saurabh Gupta
🔗
|
-
|
Building a Subspace of Policies for Scalable Continual Learning
(
Poster
)
>
link
SlidesLive Video
|
Jean-Baptiste Gaya · Thang Long Doan · Lucas Page-Caccia · Laure Soulier · Ludovic Denoyer · Roberta Raileanu
🔗
|
-
|
Skill Machines: Temporal Logic Composition in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Geraud Nangue Tasse · Devon Jarvis · Steven James · Benjamin Rosman
🔗
|
-
|
Learning Representations for Reinforcement Learning with Hierarchical Forward Models
(
Poster
)
>
link
SlidesLive Video
|
Trevor McInroe · Lukas Schäfer · Stefano Albrecht
🔗
|
-
|
In-context Reinforcement Learning with Algorithm Distillation
(
Poster
)
>
link
SlidesLive Video
|
14 presenters
Michael Laskin · Luyu Wang · Junhyuk Oh · Emilio Parisotto · Stephen Spencer · Richie Steigerwald · DJ Strouse · Steven Hansen · Angelos Filos · Ethan Brooks · Maxime Gazeau · Himanshu Sahni · Satinder Singh · Volodymyr Mnih
🔗
|
-
|
Time-Myopic Go-Explore: Learning A State Representation for the Go-Explore Paradigm
(
Poster
)
>
link
SlidesLive Video
|
Marc Höftmann · Jan Robine · Stefan Harmeling
🔗
|
-
|
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
(
Poster
)
>
link
SlidesLive Video
|
Nicklas Hansen · Yixin Lin · Hao Su · Xiaolong Wang · Vikash Kumar · Aravind Rajeswaran
🔗
|
-
|
Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation
(
Poster
)
>
link
SlidesLive Video
|
Linfeng Zhao · Huazhe Xu · Lawson Wong
🔗
|
-
|
Graph Inverse Reinforcement Learning from Diverse Videos
(
Poster
)
>
link
SlidesLive Video
|
Sateesh Kumar · Jonathan Zamora · Nicklas Hansen · Rishabh Jangir · Xiaolong Wang
🔗
|
-
|
Simple Emergent Action Representations from Multi-Task Policy Training
(
Poster
)
>
link
SlidesLive Video
|
Pu Hua · Yubei Chen · Huazhe Xu
🔗
|
-
|
Adversarial Cheap Talk
(
Poster
)
>
link
|
Chris Lu · Timon Willi · Alistair Letcher · Jakob Foerster
🔗
|
-
|
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
yifan xu · Nicklas Hansen · Zirui Wang · Yung-Chieh Chan · Hao Su · Zhuowen Tu
🔗
|
-
|
SPRINT: Scalable Semantic Policy Pre-training via Language Instruction Relabeling
(
Poster
)
>
link
SlidesLive Video
|
Jesse Zhang · Karl Pertsch · Jiahui Zhang · Taewook Nam · Sung Ju Hwang · Xiang Ren · Joseph Lim
🔗
|
-
|
Towards True Lossless Sparse Communication in Multi-Agent Systems
(
Poster
)
>
link
SlidesLive Video
|
Seth Karten · Mycal Tucker · Siva Kailas · Katia Sycara
🔗
|
-
|
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
(
Poster
)
>
link
SlidesLive Video
|
Anton Bakhtin · David Wu · Adam Lerer · Jonathan Gray · Athul Jacob · Gabriele Farina · Alexander Miller · Noam Brown
🔗
|
-
|
PnP-Nav: Plug-and-Play Policies for Generalizable Visual Navigation Across Robots
(
Poster
)
>
link
SlidesLive Video
|
Dhruv Shah · Ajay Sridhar · Arjun Bhorkar · Noriaki Hirose · Sergey Levine
🔗
|
-
|
Offline Reinforcement Learning for Customizable Visual Navigation
(
Poster
)
>
link
|
Dhruv Shah · Arjun Bhorkar · Hrishit Leen · Ilya Kostrikov · Nicholas Rhinehart · Sergey Levine
🔗
|
-
|
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning
(
Poster
)
>
link
|
Remo Sasso · Matthia Sabatelli · Marco Wiering
🔗
|
-
|
Hyperbolic Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Edoardo Cetin · Benjamin Chamberlain · Michael Bronstein · jonathan j hunt
🔗
|
-
|
Investigating Multi-task Pretraining and Generalization in Reinforcement Learning
(
Poster
)
>
link
|
Adrien Ali Taiga · Rishabh Agarwal · Jesse Farebrother · Aaron Courville · Marc Bellemare
🔗
|
-
|
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Zhendong Wang · jonathan j hunt · Mingyuan Zhou
🔗
|
-
|
Efficient Exploration using Model-Based Quality-Diversity with Gradients
(
Poster
)
>
link
SlidesLive Video
|
Bryan Lim · Manon Flageat · Antoine Cully
🔗
|
-
|
Choreographer: Learning and Adapting Skills in Imagination
(
Poster
)
>
link
SlidesLive Video
|
Pietro Mazzaglia · Tim Verbelen · Bart Dhoedt · Alexandre Lacoste · Sai Rajeswar Mudumba
🔗
|
-
|
Giving Robots a Hand: Broadening Generalization via Hand-Centric Human Video Demonstrations
(
Poster
)
>
link
|
Moo J Kim · Jiajun Wu · Chelsea Finn
🔗
|
-
|
Efficient Offline Policy Optimization with a Learned Model
(
Poster
)
>
link
SlidesLive Video
|
Zichen Liu · Siyi Li · Wee Sun Lee · Shuicheng Yan · Zhongwen Xu
🔗
|
-
|
Emergent collective intelligence from massive-agent cooperation and competition
(
Poster
)
>
link
SlidesLive Video
|
Hanmo Chen · Stone Tao · JIAXIN CHEN · Weihan Shen · Xihui Li · Chenghui Yu · Sikai Cheng · Xiaolong Zhu · Xiu Li
🔗
|
-
|
Distance-Sensitive Offline Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Li Jianxiong · Xianyuan Zhan · Haoran Xu · Xiangyu Zhu · Jingjing Liu · Ya-Qin Zhang
🔗
|
-
|
Uncertainty-Driven Exploration for Generalization in Reinforcement Learning
(
Poster
)
>
link
|
Yiding Jiang · J. Zico Kolter · Roberta Raileanu
🔗
|
-
|
Language Models Can Teach Themselves to Program Better
(
Poster
)
>
link
SlidesLive Video
|
Patrick Haluptzok · Matthew Bowers · Adam Kalai
🔗
|
-
|
Graph Q-Learning for Combinatorial Optimization
(
Poster
)
>
link
SlidesLive Video
|
Victoria Magdalena Dax · Jiachen Li · Kevin Leahy · Mykel J Kochenderfer
🔗
|
-
|
Transformer-based World Models Are Happy With 100k Interactions
(
Poster
)
>
link
SlidesLive Video
|
Jan Robine · Marc Höftmann · Tobias Uelwer · Stefan Harmeling
🔗
|
-
|
Contrastive Value Learning: Implicit Models for Simple Offline RL
(
Poster
)
>
link
SlidesLive Video
|
Bogdan Mazoure · Benjamin Eysenbach · Ofir Nachum · Jonathan Tompson
🔗
|
-
|
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration
(
Poster
)
>
link
SlidesLive Video
|
Changnan Xiao · Haosen Shi · Jiajun Fan · Shihong Deng · Haiyan Yin
🔗
|
-
|
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Mikayel Samvelyan · Akbir Khan · Michael Dennis · Minqi Jiang · Jack Parker-Holder · Jakob Foerster · Roberta Raileanu · Tim Rocktäschel
🔗
|
-
|
Pink Noise Is All You Need: Colored Noise Exploration in Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Onno Eberhard · Jakob Hollenstein · Cristina Pinneri · Georg Martius
🔗
|
-
|
Evaluating Long-Term Memory in 3D Mazes
(
Poster
)
>
link
|
Jurgis Pašukonis · Timothy Lillicrap · Danijar Hafner
🔗
|
-
|
Visual Imitation Learning with Patch Rewards
(
Poster
)
>
link
|
Minghuan Liu · Tairan He · Weinan Zhang · Shuicheng Yan · Zhongwen Xu
🔗
|
-
|
Memory-Efficient Reinforcement Learning with Priority based on Surprise and On-policyness
(
Poster
)
>
link
SlidesLive Video
|
Ryosuke Unno · Yoshimasa Tsuruoka
🔗
|
-
|
Learning a Domain-Agnostic Policy through Adversarial Representation Matching for Cross-Domain Policy Transfer
(
Poster
)
>
link
SlidesLive Video
|
Hayato Watahiki · Ryo Iwase · Ryosuke Unno · Yoshimasa Tsuruoka
🔗
|
-
|
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Mhairi Dunion · Trevor McInroe · Kevin Sebastian Luck · Josiah Hanna · Stefano Albrecht
🔗
|
-
|
Toward Effective Deep Reinforcement Learning for 3D Robotic Manipulation: End-to-End Learning from Multimodal Raw Sensory Data
(
Poster
)
>
link
SlidesLive Video
|
Samyeul Noh · Hyun Myung
🔗
|
-
|
Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments
(
Poster
)
>
link
SlidesLive Video
|
Dolton Fernandes · Pramod Kaushik · Harsh Shukla · Raju Bapi
🔗
|
-
|
A Ranking Game for Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Harshit Sushil Sikchi · Akanksha Saran · Wonjoon Goo · Scott Niekum
🔗
|
-
|
Implicit Offline Reinforcement Learning via Supervised Learning
(
Poster
)
>
link
|
Alexandre Piche · Rafael Pardinas · David Vazquez · Igor Mordatch · Igor Mordatch · Chris Pal
🔗
|
-
|
Distributional deep Q-learning with CVaR regression
(
Poster
)
>
link
SlidesLive Video
|
Mastane Achab · REDA ALAMI · YASSER ABDELAZIZ DAHOU DJILALI · Kirill Fedyanin · Eric Moulines · Maxim Panov
🔗
|
-
|
The Surprising Effectiveness of Latent World Models for Continual Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Samuel Kessler · Piotr Miłoś · Jack Parker-Holder · S Roberts
🔗
|
-
|
Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization
(
Poster
)
>
link
SlidesLive Video
|
Lunjun Zhang · Bradly Stadie
🔗
|
-
|
Perturbed Quantile Regression for Distributional Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Taehyun Cho · Seungyub Han · Heesoo Lee · Kyungjae Lee · Jungwoo Lee
🔗
|
-
|
Concept-based Understanding of Emergent Multi-Agent Behavior
(
Poster
)
>
link
SlidesLive Video
|
Niko Grupen · Shayegan Omidshafiei · Natasha Jaques · Been Kim
🔗
|
-
|
Constrained Imitation Q-learning with Earth Mover’s Distance reward
(
Poster
)
>
link
SlidesLive Video
|
WENYAN Yang · Nataliya Strokina · Joni Pajarinen · Joni-kristian Kamarainen
🔗
|
-
|
Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement
(
Poster
)
>
link
SlidesLive Video
|
Michael Chang · Alyssa L Dayan · Franziska Meier · Tom Griffiths · Sergey Levine · Amy Zhang
🔗
|
-
|
SoftTreeMax: Policy Gradient with Tree Search
(
Poster
)
>
link
SlidesLive Video
|
Gal Dalal · Assaf Hallak · Shie Mannor · Gal Chechik
🔗
|
-
|
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication for Autonomous Drone Reforestation
(
Poster
)
>
link
SlidesLive Video
|
Philipp Siedler
🔗
|
-
|
Hypernetwork-PPO for Continual Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Philemon Schöpf · Sayantan Auddy · Jakob Hollenstein · Antonio Rodriguez-sanchez
🔗
|
-
|
DRL-EPANET: Deep reinforcement learning for optimal control at scale in Water Distribution Systems
(
Poster
)
>
link
SlidesLive Video
|
Anas Belfadil · David Modesto · Jose Martin H.
🔗
|
-
|
Actor Prioritized Experience Replay
(
Poster
)
>
link
SlidesLive Video
|
Baturay Saglam · Furkan Burak Mutlu · Doğan Can Çiçek · Suleyman Kozat
🔗
|
-
|
Model and Method: Training-Time Attack for Cooperative Multi-Agent Reinforcement Learning
(
Poster
)
>
link
|
Siyang Wu · Tonghan Wang · Xiaoran Wu · Jingfeng ZHANG · Yujing Hu · Changjie Fan · Chongjie Zhang
🔗
|
-
|
Converging to Unexploitable Policies in Continuous Control Adversarial Games
(
Poster
)
>
link
SlidesLive Video
|
Maxwell Goldstein · Noam Brown
🔗
|
-
|
Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Chaitanya Kharyal · Tanmay Sinha · Vijaya Sai Krishna Gottipati · Srijita Das · Matthew Taylor
🔗
|
-
|
On All-Action Policy Gradients
(
Poster
)
>
link
SlidesLive Video
|
Michal Nauman · Marek Cygan
🔗
|
-
|
A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Benjamin Eysenbach · Matthieu Geist · Russ Salakhutdinov · Sergey Levine
🔗
|
-
|
The Benefits of Model-Based Generalization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Kenny Young · Aditya Ramesh · Louis Kirsch · Jürgen Schmidhuber
🔗
|
-
|
Training graph neural networks with policy gradients to perform tree search
(
Poster
)
>
link
SlidesLive Video
|
Matthew Macfarlane · Diederik Roijers · Herke van Hoof
🔗
|
-
|
Co-Imitation: Learning Design and Behaviour by Imitation
(
Poster
)
>
link
SlidesLive Video
|
Chang Rajani · Karol Arndt · David Blanco-Mulero · Kevin Sebastian Luck · Ville Kyrki
🔗
|
-
|
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Mingqi Yuan · Bo Li · Xin Jin · Wenjun Zeng
🔗
|
-
|
BLaDE: Robust Exploration via Diffusion Models
(
Poster
)
>
link
SlidesLive Video
|
Bilal Piot · Zhaohan Guo · Shantanu Thakoor · Mohammad Gheshlaghi Azar
🔗
|
-
|
Learning Semantics-Aware Locomotion Skills from Human Demonstrations
(
Poster
)
>
link
SlidesLive Video
|
Yuxiang Yang · Xiangyun Meng · Wenhao Yu · Tingnan Zhang · Jie Tan · Byron Boots
🔗
|
-
|
Imitation from Observation With Bootstrapped Contrastive Learning
(
Poster
)
>
link
|
Medric Sonwa · Johanna Hansen · Eugene Belilovsky
🔗
|
-
|
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
(
Poster
)
>
link
SlidesLive Video
|
Toygun Basaklar · Suat Gumussoy · Umit Ogras
🔗
|
-
|
Improving Assistive Robotics with Deep Reinforcement Learning
(
Poster
)
>
link
|
Yash Jakhotiya · Iman Haque
🔗
|
-
|
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Matthias Gerstgrasser · Tom Danino · Sarah Keren
🔗
|
-
|
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Manuel Goulão · Arlindo L Oliveira
🔗
|
-
|
Variance Reduction in Off-Policy Deep Reinforcement Learning using Spectral Normalization
(
Poster
)
>
link
SlidesLive Video
|
Payal Bawa · Rafael Oliveira · Fabio Ramos
🔗
|
-
|
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents
(
Poster
)
>
link
|
Minghuan Liu · Zhengbang Zhu · Menghui Zhu · Yuzheng Zhuang · Weinan Zhang · Jianye Hao
🔗
|
-
|
Guided Skill Learning and Abstraction for Long-Horizon Manipulation
(
Poster
)
>
link
SlidesLive Video
|
Shuo Cheng · Danfei Xu
🔗
|
-
|
Locally Constrained Representations in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Somjit Nath · Samira Ebrahimi Kahou
🔗
|
-
|
Sample-efficient Adversarial Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Dahuin Jung · Hyungyu Lee · Sungroh Yoon
🔗
|
-
|
Prioritizing Samples in Reinforcement Learning with Reducible Loss
(
Poster
)
>
link
SlidesLive Video
|
Shivakanth Sujit · Somjit Nath · Pedro Braga · Samira Ebrahimi Kahou
🔗
|
-
|
PCRL: Priority Convention Reinforcement Learning for Microscopically Sequencable Multi-agent Problems
(
Poster
)
>
link
SlidesLive Video
|
Xing Zhou · Hao Gao · Xin Xu · Xinglong Zhang · Hongda Jia · Dongzi Wang
🔗
|
-
|
A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Zixiang Chen · Chris Junchi Li · Angela Yuan · Quanquan Gu · Michael Jordan
🔗
|
-
|
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
(
Poster
)
>
link
SlidesLive Video
|
Raj Ghugare · Homanga Bharadhwaj · Benjamin Eysenbach · Sergey Levine · Ruslan Salakhutdinov
🔗
|
-
|
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
(
Poster
)
>
link
SlidesLive Video
|
Pascal Leroy · Jonathan Pisane · Damien Ernst
🔗
|
-
|
Reinforcement Learning in System Identification
(
Poster
)
>
link
SlidesLive Video
|
Jose Martin H. · Óscar Fernandez Vicente · Sergio Perez · Anas Belfadil · Cristina Ibanez-Llano · Freddy Perozo Rondón · Jose Valle · Javier Arechalde Pelaz
🔗
|
-
|
Robust Option Learning for Adversarial Generalization
(
Poster
)
>
link
SlidesLive Video
|
Kishor Jothimurugan · Steve Hsu · Osbert Bastani · Rajeev Alur
🔗
|
-
|
Biological Neurons vs Deep Reinforcement Learning: Sample efficiency in a simulated game-world
(
Poster
)
>
link
SlidesLive Video
|
Forough Habibollahi · Moein Khajehnejad · Amitesh Gaurav · Brett J. Kagan
🔗
|
-
|
Inducing Functions through Reinforcement Learning without Task Specification
(
Poster
)
>
link
SlidesLive Video
|
Junmo Cho · Donghwan Lee · Young-Gyu Yoon
🔗
|
-
|
Learning Successor Feature Representations to Train Robust Policies for Multi-task Learning
(
Poster
)
>
link
|
Melissa Mozifian · Dieter Fox · David Meger · Fabio Ramos · Animesh Garg
🔗
|
-
|
Automated Dynamics Curriculums for Deep Reinforcement Learning
(
Poster
)
>
link
|
Sean Metzger
🔗
|
-
|
Supervised Q-Learning for Continuous Control
(
Poster
)
>
link
SlidesLive Video
|
Hao Sun · Ziping Xu · Taiyi Wang · Meng Fang · Bolei Zhou
🔗
|
-
|
MOPA: a Minimalist Off-Policy Approach to Safe-RL
(
Poster
)
>
link
SlidesLive Video
|
Hao Sun · Ziping Xu · Zhenghao Peng · Meng Fang · Bo Dai · Bolei Zhou
🔗
|
-
|
Novel Policy Seeking with Constrained Optimization
(
Poster
)
>
link
SlidesLive Video
|
Hao Sun · Zhenghao Peng · Bolei Zhou
🔗
|
-
|
Toward Causal-Aware RL: State-Wise Action-Refined Temporal Difference
(
Poster
)
>
link
SlidesLive Video
|
Hao Sun · Taiyi Wang
🔗
|