Deep Reinforcement Learning

Fri 8:30 a.m. - 9:00 a.m.

Invited talk: PierreYves Oudeyer "Machines that invent their own problems: Towards open-ended learning of skills" ( Talk ) >
SlidesLive Video

Pierre-Yves Oudeyer 🔗

Fri 9:00 a.m. - 9:15 a.m.

Contributed Talk: Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning ( Talk ) >
SlidesLive Video

Sammy Christen · Lukas Jendele · Emre Aksan · Otmar Hilliges 🔗

Fri 9:15 a.m. - 9:30 a.m.

Contributed Talk: Maximum Reward Formulation In Reinforcement Learning ( Talk ) >
SlidesLive Video

Vijaya Sai Krishna Gottipati · Yashaswi Pathak · Rohan Nuttall · Sahir . · Raviteja Chunduru · Ahmed Touati · Sriram Ganapathi · Matthew Taylor · Sarath Chandar 🔗

Fri 9:30 a.m. - 9:45 a.m.

Contributed Talk: Accelerating Reinforcement Learning with Learned Skill Priors ( Talk ) >
SlidesLive Video

Karl Pertsch · Youngwoon Lee · Joseph Lim 🔗

Fri 9:45 a.m. - 10:00 a.m.

Contributed Talk: Asymmetric self-play for automatic goal discovery in robotic manipulation ( Talk ) >
SlidesLive Video

16 presenters

OpenAI Robotics · Matthias Plappert · Raul Sampedro · Tao Xu · Ilge Akkaya · Vineet Kosaraju · Peter Welinder · Ruben D'Sa · Arthur Petron · Henrique Ponde · Alex Paino · Hyeonwoo Noh Noh · Lilian Weng · Qiming Yuan · Casey Chu · Wojciech Zaremba

🔗

Fri 10:00 a.m. - 10:30 a.m.

Invited talk: Marc Bellemare "Autonomous navigation of stratospheric balloons using reinforcement learning" ( Talk ) >

Marc Bellemare 🔗

Fri 10:30 a.m. - 11:00 a.m.

Break

🔗

Fri 11:00 a.m. - 11:30 a.m.

Invited talk: Peter Stone "Grounded Simulation Learning for Sim2Real with Connections to Off-Policy Reinforcement Learning" ( Talk ) >
SlidesLive Video

Peter Stone 🔗

Fri 11:30 a.m. - 11:45 a.m.

Contributed Talk: Mirror Descent Policy Optimization ( Talk ) >
SlidesLive Video

Manan Tomar · Lior Shani · Yonathan Efroni · Mohammad Ghavamzadeh 🔗

Fri 11:45 a.m. - 12:00 p.m.

Contributed Talk: Planning from Pixels using Inverse Dynamics Models ( Talk ) >
SlidesLive Video

Keiran Paster · Sheila McIlraith · Jimmy Ba 🔗

Fri 12:00 p.m. - 12:30 p.m.

Invited talk: Matt Botvinick "Alchemy: A Benchmark Task Distribution for Meta-Reinforcement Learning Research" ( Talk ) >
SlidesLive Video

Matt Botvinick 🔗

Fri 12:30 p.m. - 1:30 p.m.

Poster session 1 ( Poster session ) > link

Link

🔗

Fri 1:30 p.m. - 2:00 p.m.

Invited talk: Susan Murphy "We used RL but…. Did it work?!" ( Talk ) >
SlidesLive Video

Susan Murphy 🔗

Fri 2:00 p.m. - 2:15 p.m.

Contributed Talk: MaxEnt RL and Robust Control ( Talk ) >
SlidesLive Video

Benjamin Eysenbach · Sergey Levine 🔗

Fri 2:15 p.m. - 2:30 p.m.

Contributed Talk: Reset-Free Lifelong Learning with Skill-Space Planning ( Talk ) >
SlidesLive Video

Kevin Lu · Aditya Grover · Pieter Abbeel · Igor Mordatch 🔗

Fri 2:30 p.m. - 3:00 p.m.

Invited talk: Anusha Nagabandi "Model-based Deep Reinforcement Learning for Robotic Systems" ( Talk ) >
SlidesLive Video

Anusha Nagabandi 🔗

Fri 3:00 p.m. - 3:30 p.m.

Break

🔗

Fri 3:30 p.m. - 4:00 p.m.

Invited talk: Ashley Edwards "Learning Offline from Observation" ( Talk ) >
SlidesLive Video

Ashley Edwards 🔗

Fri 4:00 p.m. - 4:07 p.m.

NeurIPS RL Competitions: Flatland challenge ( Talk ) >
SlidesLive Video

Sharada Mohanty 🔗

Fri 4:07 p.m. - 4:15 p.m.

NeurIPS RL Competitions: Learning to run a power network ( Talk ) >
SlidesLive Video

Antoine Marot 🔗

Fri 4:15 p.m. - 4:22 p.m.

NeurIPS RL Competitions: Procgen challenge ( Talk ) >

Sharada Mohanty 🔗

Fri 4:22 p.m. - 4:30 p.m.

NeurIPS RL Competitions: MineRL ( Talk ) >
SlidesLive Video

William Guss · Stephanie Milani 🔗

Fri 4:30 p.m. - 5:00 p.m.

Invited talk: Karen Liu "Deep Reinforcement Learning for Physical Human-Robot Interaction" ( Talk ) >
SlidesLive Video

Karen Liu 🔗

Fri 5:00 p.m. - 6:00 p.m.

Panel discussion ( Panel discussion ) >

Pierre-Yves Oudeyer · Marc Bellemare · Peter Stone · Matt Botvinick · Susan Murphy · Anusha Nagabandi · Ashley Edwards · Karen Liu · Pieter Abbeel 🔗

Fri 6:00 p.m. - 7:00 p.m.

Poster session 2 ( Poster session ) > link

Link

🔗

-

Poster: Planning from Pixels using Inverse Dynamics Models ( Poster ) >
SlidesLive Video

🔗

-

Poster: OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Maximum Reward Formulation In Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Reset-Free Lifelong Learning with Skill-Space Planning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Mirror Descent Policy Optimization ( Poster ) >
SlidesLive Video

🔗

-

Poster: MaxEnt RL and Robust Control ( Poster ) >
SlidesLive Video

🔗

-

Poster: Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Provably Efficient Policy Optimization via Thompson Sampling ( Poster ) >
SlidesLive Video

🔗

-

Poster: Weighted Bellman Backups for Improved Signal-to-Noise in Q-Updates ( Poster ) >
SlidesLive Video

🔗

-

Poster: Efficient Competitive Self-Play Policy Optimization ( Poster ) >
SlidesLive Video

🔗

-

Poster: Asymmetric self-play for automatic goal discovery in robotic manipulation ( Poster ) >
SlidesLive Video

🔗

-

Poster: Correcting Momentum in Temporal Difference Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Decoupling Exploration and Exploitation in Meta-Reinforcement Learning without Sacrifices ( Poster ) >
SlidesLive Video

🔗

-

Poster: Diverse Exploration via InfoMax Options ( Poster ) >
SlidesLive Video

🔗

-

Poster: Model-Based Meta-Reinforcement Learning for Flight with Suspended Payloads ( Poster ) >

🔗

-

Poster: Parrot: Data-driven Behavioral Priors for Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: C-Learning: Horizon-Aware Cumulative Accessibility Estimation ( Poster ) >
SlidesLive Video

🔗

-

Poster: Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning ( Poster ) >

🔗

-

Poster: Data-Efficient Reinforcement Learning with Self-Predictive Representations ( Poster ) >
SlidesLive Video

🔗

-

Poster: Accelerating Reinforcement Learning with Learned Skill Priors ( Poster ) >
SlidesLive Video

🔗

-

Poster: C-Learning: Learning to Achieve Goals via Recursive Classification ( Poster ) >
SlidesLive Video

🔗

-

Poster: Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers ( Poster ) >
SlidesLive Video

🔗

-

Poster: Learning to Reach Goals via Iterated Supervised Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Unified View of Inference-based Off-policy RL: Decoupling Algorithmic and Implemental Source of Performance Gaps ( Poster ) >
SlidesLive Video

🔗

-

Poster: Learning to Sample with Local and Global Contexts in Experience Replay Buffer ( Poster ) >
SlidesLive Video

🔗

-

Poster: Adversarial Environment Generation for Learning to Navigate the Web ( Poster ) >
SlidesLive Video

🔗

-

Poster: Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in First-person Simulated 3D Environments ( Poster ) >
SlidesLive Video

🔗

-

Poster: DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies ( Poster ) >
SlidesLive Video

🔗

-

Poster: Discovery of Options via Meta-Gradients ( Poster ) >
SlidesLive Video

🔗

-

Poster: GRAC: Self-Guided and Self-Regularized Actor-Critic ( Poster ) >
SlidesLive Video

🔗

-

Poster: Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity ( Poster ) >
SlidesLive Video

🔗

-

Poster: Deep Bayesian Quadrature Policy Gradient ( Poster ) >
SlidesLive Video

🔗

-

Poster: PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards ( Poster ) >
SlidesLive Video

🔗

-

Poster: A Policy Gradient Method for Task-Agnostic Exploration ( Poster ) >
SlidesLive Video

🔗

-

Poster: Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Skill Transfer via Partially Amortized Hierarchical Planning ( Poster ) >
SlidesLive Video

🔗

-

Poster: On Effective Parallelization of Monte Carlo Tree Search ( Poster ) >
SlidesLive Video

🔗

-

Poster: Mastering Atari with Discrete World Models ( Poster ) >

🔗

-

Poster: Average Reward Reinforcement Learning with Monotonic Policy Improvement ( Poster ) >
SlidesLive Video

🔗

-

Poster: Combating False Negatives in Adversarial Imitation Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Evaluating Agents Without Rewards ( Poster ) >
SlidesLive Video

🔗

-

Poster: Learning Latent Landmarks for Generalizable Planning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Conservative Safety Critics for Exploration ( Poster ) >
SlidesLive Video

🔗

-

Poster: Solving Compositional Reinforcement Learning Problems via Task Reduction ( Poster ) >
SlidesLive Video

🔗

-

Poster: Deep Q-Learning with Low Switching Cost ( Poster ) >
SlidesLive Video

🔗

-

Poster: Learning to Represent Action Values as a Hypergraph on the Action Vertices ( Poster ) >
SlidesLive Video

🔗

-

Poster: Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets ( Poster ) >
SlidesLive Video

🔗

-

Poster: TACTO: A Simulator for Learning Control from Touch Sensing ( Poster ) >
SlidesLive Video

🔗

-

Poster: Safe Reinforcement Learning with Natural Language Constraints ( Poster ) >
SlidesLive Video

🔗

-

Poster: Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks ( Poster ) >
SlidesLive Video

🔗

-

Poster: An Examination of Preference-based Reinforcement Learning for Treatment Recommendation ( Poster ) >
SlidesLive Video

🔗

-

Poster: Model-based Navigation in Environments with Novel Layouts Using Abstract $n$-D Maps ( Poster ) >
SlidesLive Video

🔗

-

Poster: Online Safety Assurance for Deep Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Lyapunov Barrier Policy Optimization ( Poster ) >
SlidesLive Video

🔗

-

Poster: Evolving Reinforcement Learning Algorithms ( Poster ) >
SlidesLive Video

🔗

-

Poster: Chaining Behaviors from Data with Model-Free Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Pairwise Weights for Temporal Credit Assignment ( Poster ) >
SlidesLive Video

🔗

-

Poster: Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Understanding Learned Reward Functions ( Poster ) >
SlidesLive Video

🔗

-

Poster: Addressing reward bias in Adversarial Imitation Learning with neutral reward functions ( Poster ) >
SlidesLive Video

🔗

-

Poster: Reinforcement Learning with Bayesian Classifiers: Efficient Skill Learning from Outcome Examples ( Poster ) >
SlidesLive Video

🔗

-

Poster: Decoupling Representation Learning from Reinforcement Learning ( Poster ) >

🔗

-

Poster: Model-Based Reinforcement Learning via Latent-Space Collocation ( Poster ) >
SlidesLive Video

🔗

-

Poster: A Variational Inference Perspective on Goal-Directed Behavior in Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II ( Poster ) >
SlidesLive Video

🔗

-

Poster: Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Latent State Models for Meta-Reinforcement Learning from Images ( Poster ) >
SlidesLive Video

🔗

-

Poster: Dream and Search to Control: Latent Space Planning for Continuous Control ( Poster ) >
SlidesLive Video

🔗

-

Poster: Explanation Augmented Feedback in Human-in-the-Loop Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Goal-Conditioned Reinforcement Learning in the Presence of an Adversary ( Poster ) >
SlidesLive Video

🔗

-

Poster: Regularized Inverse Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Domain Adversarial Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Safety Aware Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Sample Efficient Training in Multi-Agent AdversarialGames with Limited Teammate Communication ( Poster ) >
SlidesLive Video

🔗

-

Poster: Amortized Variational Deep Q Network ( Poster ) >
SlidesLive Video

🔗

-

Poster: Disentangled Planning and Control in Vision Based Robotics via Reward Machines ( Poster ) >
SlidesLive Video

🔗

-

Poster: Maximum Mutation Reinforcement Learning for Scalable Control ( Poster ) >
SlidesLive Video

🔗

-

Poster: Unsupervised Task Clustering for Multi-Task Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Learning Intrinsic Symbolic Rewards in Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Preventing Value Function Collapse in Ensemble Q-Learning by Maximizing Representation Diversity ( Poster ) >
SlidesLive Video

🔗

-

Poster: Action and Perception as Divergence Minimization ( Poster ) >
SlidesLive Video

🔗

-

Poster: Randomized Ensembled Double Q-Learning: Learning Fast Without a Model ( Poster ) >
SlidesLive Video

🔗

-

Poster: D2RL: Deep Dense Architectures in Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms ( Poster ) >
SlidesLive Video

🔗

-

Poster: Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization ( Poster ) >
SlidesLive Video

🔗

-

Poster: What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study ( Poster ) >
SlidesLive Video

🔗

-

Poster: Semantic State Representation for Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Hyperparameter Auto-tuning in Self-Supervised Robotic Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Targeted Query-based Action-Space Adversarial Policies on Deep Reinforcement Learning Agents ( Poster ) >
SlidesLive Video

🔗

-

Poster: Abstract Value Iteration for Hierarchical Deep Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Compute- and Memory-Efficient Reinforcement Learning with Latent Experience Replay ( Poster ) >
SlidesLive Video

🔗

-

Poster: Emergent Road Rules In Multi-Agent Driving Environments ( Poster ) >
SlidesLive Video

🔗

-

Poster: An Algorithmic Causal Model of Credit Assignment in Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Learning to Weight Imperfect Demonstrations ( Poster ) >
SlidesLive Video

🔗

-

Poster: Structure and randomness in planning and reinforcement learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Parameter-based Value Functions ( Poster ) >
SlidesLive Video

🔗

-

Poster: Influence-aware Memory for Deep Reinforcement Learning in POMDPs ( Poster ) >
SlidesLive Video

🔗

-

Poster: Modular Training, Integrated Planning Deep Reinforcement Learning for Mobile Robot Navigation ( Poster ) >
SlidesLive Video

🔗

-

Poster: How to make Deep RL work in Practice ( Poster ) >
SlidesLive Video

🔗

-

Poster: Super-Human Performance in Gran Turismo Sport Using Deep Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Which Mutual-Information Representation Learning Objectives are Sufficient for Control? ( Poster ) >
SlidesLive Video

🔗

-

Poster: Curriculum Learning through Distilled Discriminators ( Poster ) >
SlidesLive Video

🔗

-

Poster: Self-Supervised Policy Adaptation during Deployment ( Poster ) >
SlidesLive Video

🔗

-

Poster: Trust, but verify: model-based exploration in sparse reward environments ( Poster ) >
SlidesLive Video

🔗

-

Poster: Optimizing Traffic Bottleneck Throughput using Cooperative, Decentralized Autonomous Vehicles ( Poster ) >
SlidesLive Video

🔗

-

Poster: Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking ( Poster ) >
SlidesLive Video

🔗

-

Poster: Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research ( Poster ) >
SlidesLive Video

🔗

-

Poster: Reinforcement Learning with Latent Flow ( Poster ) >
SlidesLive Video

🔗

-

Poster: Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization ( Poster ) >
SlidesLive Video

🔗

-

Poster: AWAC: Accelerating Online Reinforcement Learning With Offline Datasets ( Poster ) >
SlidesLive Video

🔗

-

Poster: Inter-Level Cooperation in Hierarchical Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Multi-Agent Option Critic Architecture ( Poster ) >

🔗

-

Poster: Measuring Visual Generalization in Continuous Control from Pixels ( Poster ) >
SlidesLive Video

🔗

-

Poster: Policy Learning Using Weak Supervision ( Poster ) >
SlidesLive Video

🔗

-

Poster: Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments ( Poster ) >

🔗

-

Poster: Unsupervised Domain Adaptation for Visual Navigation ( Poster ) >
SlidesLive Video

🔗

-

Poster: Learning Markov State Abstractions for Deep Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Value Generalization among Policies: Improving Value Function with Policy Representation ( Poster ) >
SlidesLive Video

🔗

-

Poster: Energy-based Surprise Minimization for Multi-Agent Value Factorization ( Poster ) >
SlidesLive Video

🔗

-

Poster: Backtesting Optimal Trade Execution Policies in Agent-Based Market Simulator ( Poster ) >
SlidesLive Video

🔗

-

Poster: Successor Landmarks for Efficient Exploration and Long-Horizon Navigation ( Poster ) >
SlidesLive Video

🔗

-

Poster: Multi-task Reinforcement Learning with a Planning Quasi-Metric ( Poster ) >
SlidesLive Video

🔗

-

Poster: R-LAtte: Visual Control via Deep Reinforcement Learning with Attention Network ( Poster ) >
SlidesLive Video

🔗

-

Poster: Quantifying Differences in Reward Functions ( Poster ) >
SlidesLive Video

🔗

-

Poster: DERAIL: Diagnostic Environments for Reward And Imitation Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations ( Poster ) >
SlidesLive Video

🔗

-

Poster: Unlocking the Potential of Deep Counterfactual Value Networks ( Poster ) >
SlidesLive Video

🔗

-

Poster: FactoredRL: Leveraging Factored Graphs for Deep Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Reusability and Transferability of Macro Actions for Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Interactive Visualization for Debugging RL ( Poster ) >
SlidesLive Video

🔗

-

Poster: A Deep Value-based Policy Search Approach for Real-world Vehicle Repositioning on Mobility-on-Demand Platforms ( Poster ) >
SlidesLive Video

🔗

-

Poster: FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance ( Poster ) >
SlidesLive Video

🔗

-

Poster: Visual Imitation with Reinforcement Learning using Recurrent Siamese Networks ( Poster ) >
SlidesLive Video

🔗

-

Poster: Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: XLVIN: eXecuted Latent Value Iteration Nets ( Poster ) >
SlidesLive Video

🔗

-

Poster: Beyond Exponentially Discounted Sum: Automatic Learning of Return Function ( Poster ) >
SlidesLive Video

🔗

-

Poster: XT2: Training an X-to-Text Typing Interface with Online Learning from Implicit Feedback ( Poster ) >
SlidesLive Video

🔗

-

Poster: Greedy Multi-Step Off-Policy Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation ( Poster ) >
SlidesLive Video

🔗

-

Poster: ReaPER: Improving Sample Efficiency in Model-Based Latent Imagination ( Poster ) >
SlidesLive Video

🔗

-

Poster: Model-Based Reinforcement Learning: A Compressed Survey ( Poster ) >
SlidesLive Video

🔗

-

Poster: BeBold: Exploration Beyond the Boundary of Explored Regions ( Poster ) >
SlidesLive Video

🔗

-

Poster: Model-Based Visual Planning with Self-Supervised Functional Distances ( Poster ) >
SlidesLive Video

🔗

-

Poster: Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: Utilizing Skipped Frames in Action Repeats via Pseudo-Actions ( Poster ) >
SlidesLive Video

🔗

-

Poster: Bringing order into Actor-Critic Algorithms usingStackelberg Games ( Poster ) >

🔗

-

Poster: Continual Model-Based Reinforcement Learning withHypernetworks ( Poster ) >
SlidesLive Video

🔗

-

Poster: Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies ( Poster ) >
SlidesLive Video

🔗

-

Poster: Policy Guided Planning in Learned Latent Space ( Poster ) >
SlidesLive Video

🔗

-

Poster: PettingZoo: Gym for Multi-Agent Reinforcement Learning ( Poster ) >
SlidesLive Video

🔗

-

Poster: DREAM: Deep Regret minimization with Advantage baselines and Model-free learning ( Poster ) >
SlidesLive Video

🔗

Main Navigation

Workshop

Deep Reinforcement Learning

Pieter Abbeel · Chelsea Finn · Joelle Pineau · David Silver · Satinder Singh · Coline Devin · Misha Laskin · Kimin Lee · Janarthanan Rajendran · Vivek Veeriah

Schedule