Toggle Poster Visibility
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #200
A Geometric Perspective on Optimal Representations for Reinforcement Learning
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #201
A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #202
Constrained Reinforcement Learning Has Zero Duality Gap
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #203
Distributional Reward Decomposition for Reinforcement Learning
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #204
Divergence-Augmented Policy Optimization
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #205
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #206
Fast Efficient Hyperparameter Tuning for Policy Gradient Methods
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #207
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #208
Fully Parameterized Quantile Function for Distributional Reinforcement Learning
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #209
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #210
Learning Reward Machines for Partially Observable Reinforcement Learning
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #211
Off-Policy Evaluation via Off-Policy Classification
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #212
SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #213
Variance Reduced Policy Evaluation with Smooth Function Approximation
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #214
VIREL: A Variational Inference Framework for Reinforcement Learning
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #202
Budgeted Reinforcement Learning in Continuous State Space
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #203
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #204
From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #205
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #206
Learning from Trajectories via Subgoal Discovery
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #207
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #208
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #209
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #210
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #211
Neural Temporal-Difference Learning Converges to Global Optima
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #212
Provably Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #213
Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #214
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #215
Surrogate Objectives for Batch Policy Optimization in One-step Decision Making
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #216
Discovery of Useful Questions as Auxiliary Tasks