Workshop
Reinforcement Learning for Real Life (RL4RealLife) Workshop
Yuxi Li · Emma Brunskill · MINMIN CHEN · Omer Gottesman · Lihong Li · Yao Liu · Zhiwei Tony Qin · Matthew Taylor
Theater A
Sat 3 Dec, 5:30 a.m. PST
Discover how to improve the adoption of RL in practice, by discussing key research problems, SOTA, and success stories / insights / lessons w.r.t. practical RL algorithms, practical issues, and applications with leading experts from both academia and industry @ NeurIPS 2022 RL4RealLife workshop.
Chat is not available.
Timezone: America/Los_Angeles
Schedule
Sat 5:30 a.m. - 6:25 a.m.
|
posters (for early birds, optional)
(
posters
)
>
|
🔗 |
Sat 6:25 a.m. - 6:30 a.m.
|
opening remarks
(
opening remarks
)
>
SlidesLive Video |
🔗 |
Sat 6:31 a.m. - 7:00 a.m.
|
Invited talk: Outracing Champion Gran Turismo Drivers with Deep Reinforcement Learning
(
talk
)
>
link
SlidesLive Video |
Peter Stone 🔗 |
Sat 7:01 a.m. - 7:30 a.m.
|
Invited talk: Scaling reinforcement learning in the real world, from gaming to finance to manufacturing
(
talk
)
>
SlidesLive Video |
Robert Nishihara 🔗 |
Sat 7:30 a.m. - 7:31 a.m.
|
Intro speaker
(
In-person Intro
)
>
|
🔗 |
Sat 7:31 a.m. - 8:00 a.m.
|
Invited talk: Deep Reinforcement Learning for Real-World Inventory Management
(
talk
)
>
SlidesLive Video |
Dhruv Madeka 🔗 |
Sat 8:00 a.m. - 8:20 a.m.
|
Coffee break
(
Coffee break
)
>
|
🔗 |
Sat 8:20 a.m. - 9:10 a.m.
|
Panel RL Implementation
(
Panel
)
>
SlidesLive Video |
Xiaolin Ge · Alborz Geramifard · Kence Anderson · Craig Buhr · Robert Nishihara · Yuandong Tian 🔗 |
Sat 9:10 a.m. - 10:00 a.m.
|
Panel RL Benchmarks
(
Panel
)
>
SlidesLive Video |
Minmin Chen · Pablo Samuel Castro · Caglar Gulcehre · Tony Jebara · Peter Stone 🔗 |
Sat 10:00 a.m. - 11:30 a.m.
|
Lunch Break / Posters
(
Poster/Break
)
>
|
🔗 |
Sat 11:31 a.m. - 12:00 p.m.
|
Invited talk AlphaTensor: Discovering faster matrix multiplication algorithms with RL
(
talk
)
>
SlidesLive Video |
Matej Balog 🔗 |
Sat 12:00 p.m. - 12:55 p.m.
|
Panel RL Theory-Practice Gap
(
Panel
)
>
SlidesLive Video |
Peter Stone · Matej Balog · Jonas Buchli · Jason Gauci · Dhruv Madeka 🔗 |
Sat 12:55 p.m. - 1:00 p.m.
|
closing remarks
(
closing remarks
)
>
|
🔗 |
Sat 1:00 p.m. - 1:30 p.m.
|
Coffee break / Posters
(
Coffee break / Posters
)
>
|
🔗 |
Sat 1:30 p.m. - 3:00 p.m.
|
Posters
(
Posters
)
>
|
🔗 |
-
|
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning
(
Poster
)
>
|
Danil Provodin · Pratik Gajane · Mykola Pechenizkiy · Maurits Kaptein 🔗 |
-
|
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning
(
Spotlight
)
>
SlidesLive Video |
Danil Provodin · Pratik Gajane · Mykola Pechenizkiy · Maurits Kaptein 🔗 |
-
|
MARLIM: Multi-Agent Reinforcement Learning for Inventory Management
(
Poster
)
>
SlidesLive Video |
Rémi Leluc · Elie Kadoche · Antoine Bertoncello · Sébastien Gourvénec 🔗 |
-
|
MARLIM: Multi-Agent Reinforcement Learning for Inventory Management
(
Spotlight
)
>
|
Rémi Leluc · Elie Kadoche · Antoine Bertoncello · Sébastien Gourvénec 🔗 |
-
|
A Versatile and Efficient Reinforcement Learning Approach for Autonomous Driving
(
Poster
)
>
|
Guan Wang · Haoyi Niu · desheng zhu · Jianming HU · Xianyuan Zhan · Guyue Zhou 🔗 |
-
|
A Versatile and Efficient Reinforcement Learning Approach for Autonomous Driving
(
Spotlight
)
>
SlidesLive Video |
Guan Wang · Haoyi Niu · desheng zhu · Jianming HU · Xianyuan Zhan · Guyue Zhou 🔗 |
-
|
Semi-analytical Industrial Cooling System Model for Reinforcement Learning
(
Poster
)
>
SlidesLive Video |
Yuri Chervonyi · Praneet Dutta 🔗 |
-
|
Semi-analytical Industrial Cooling System Model for Reinforcement Learning
(
Spotlight
)
>
|
Yuri Chervonyi · Praneet Dutta 🔗 |
-
|
Structured Q-learning For Antibody Design
(
Poster
)
>
SlidesLive Video |
Alexander Cowen-Rivers · Philip John Gorinski · aivar sootla · Asif Khan · Jun WANG · Jan Peters · Haitham Bou Ammar 🔗 |
-
|
Structured Q-learning For Antibody Design
(
Spotlight
)
>
|
Alexander Cowen-Rivers · Philip John Gorinski · aivar sootla · Asif Khan · Jun WANG · Jan Peters · Haitham Bou Ammar 🔗 |
-
|
Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes
(
Poster
)
>
SlidesLive Video |
Xinhan Di · Pengqian Yu 🔗 |
-
|
Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes
(
Spotlight
)
>
|
Xinhan Di · Pengqian Yu 🔗 |
-
|
Learning an Adaptive Forwarding Strategy for Mobile Wireless Networks: Resource Usage vs. Latency
(
Poster
)
>
SlidesLive Video |
Victoria Manfredi · Alicia Wolfe · Xiaolan Zhang · Bing Wang 🔗 |
-
|
Learning an Adaptive Forwarding Strategy for Mobile Wireless Networks: Resource Usage vs. Latency
(
Spotlight
)
>
SlidesLive Video |
Victoria Manfredi · Alicia Wolfe · Xiaolan Zhang · Bing Wang 🔗 |
-
|
Safe Reinforcement Learning for Automatic Insulin Delivery in Type I Diabetes
(
Poster
)
>
SlidesLive Video |
Maxime Louis · Hector Romero Ugalde · Pierre Gauthier · Alice Adenis · Yousra Tourki · Erik Huneker 🔗 |
-
|
Safe Reinforcement Learning for Automatic Insulin Delivery in Type I Diabetes
(
Spotlight
)
>
|
Maxime Louis · Hector Romero Ugalde · Pierre Gauthier · Alice Adenis · Yousra Tourki · Erik Huneker 🔗 |
-
|
Power Grid Congestion Management via Topology Optimization with AlphaZero
(
Poster
)
>
|
Matthias Dorfer · Anton R. Fuxjaeger · Kristián Kozák · Patrick Blies · Marcel Wasserer 🔗 |
-
|
Power Grid Congestion Management via Topology Optimization with AlphaZero
(
Spotlight
)
>
SlidesLive Video |
Matthias Dorfer · Anton R. Fuxjaeger · Kristián Kozák · Patrick Blies · Marcel Wasserer 🔗 |
-
|
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
(
Poster
)
>
SlidesLive Video |
Yuandong Ding · Mingxiao Feng · Guozi Liu · Wei Jiang · Chuheng Zhang · Li Zhao · Lei Song · Houqiang Li · Yan Jin · Jiang Bian 🔗 |
-
|
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
(
Spotlight
)
>
SlidesLive Video |
Yuandong Ding · Mingxiao Feng · Guozi Liu · Wei Jiang · Chuheng Zhang · Li Zhao · Lei Song · Houqiang Li · Yan Jin · Jiang Bian 🔗 |
-
|
LibSignal: An Open Library for Traffic Signal Control
(
Poster
)
>
SlidesLive Video |
Hao Mei · Xiaoliang Lei · Longchao Da · Bin Shi · Hua Wei 🔗 |
-
|
LibSignal: An Open Library for Traffic Signal Control
(
Spotlight
)
>
|
Hao Mei · Xiaoliang Lei · Longchao Da · Bin Shi · Hua Wei 🔗 |
-
|
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
(
Poster
)
>
SlidesLive Video |
Benjamin Fuhrer · Yuval Shpigelman · Chen Tessler · Shie Mannor · Gal Chechik · Eitan Zahavi · Gal Dalal 🔗 |
-
|
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
(
Spotlight
)
>
|
Benjamin Fuhrer · Yuval Shpigelman · Chen Tessler · Shie Mannor · Gal Chechik · Eitan Zahavi · Gal Dalal 🔗 |
-
|
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
(
Poster
)
>
|
Kaixuan Huang · Yu Wu · Xuezhou Zhang · Shenyinying Tu · Qingyun Wu · Mengdi Wang · Huazheng Wang 🔗 |
-
|
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
(
Spotlight
)
>
SlidesLive Video |
Kaixuan Huang · Yu Wu · Xuezhou Zhang · Shenyinying Tu · Qingyun Wu · Mengdi Wang · Huazheng Wang 🔗 |
-
|
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning
(
Poster
)
>
SlidesLive Video |
zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang 🔗 |
-
|
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning
(
Spotlight
)
>
|
zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang 🔗 |
-
|
tinyMAN: Lightweight Energy Manager using Reinforcement Learning for Energy Harvesting Wearable IoT Devices
(
Poster
)
>
SlidesLive Video |
Toygun Basaklar · Yigit Tuncel · Umit Ogras 🔗 |
-
|
tinyMAN: Lightweight Energy Manager using Reinforcement Learning for Energy Harvesting Wearable IoT Devices
(
Spotlight
)
>
|
Toygun Basaklar · Yigit Tuncel · Umit Ogras 🔗 |
-
|
Optimizing Audio Recommendations for the Long-Term
(
Poster
)
>
SlidesLive Video |
Lucas Maystre · Daniel Russo · Yu Zhao 🔗 |
-
|
Optimizing Audio Recommendations for the Long-Term
(
Spotlight
)
>
|
Lucas Maystre · Daniel Russo · Yu Zhao 🔗 |
-
|
Controlling Commercial Cooling Systems Using Reinforcement Learning
(
Poster
)
>
|
27 presentersJerry Luo · Cosmin Paduraru · Octavian Voicu · Yuri Chervonyi · Scott Munns · Jerry Li · Crystal Qian · Praneet Dutta · Daniel Mankowitz · Jared Quincy Davis · Ningjia Wu · Xingwei Yang · Chu-Ming Chang · Ted Li · Rob Rose · Mingyan Fan · Hootan Nakhost · Tinglin Liu · Deeni Fatiha · Neil Satra · Juliet Rothenberg · Molly Carlin · Satish Tallapaka · Sims Witherspoon · David Parish · Peter Dolan · Chenyu Zhao |
-
|
Controlling Commercial Cooling Systems Using Reinforcement Learning
(
Spotlight
)
>
SlidesLive Video |
27 presentersJerry Luo · Cosmin Paduraru · Octavian Voicu · Yuri Chervonyi · Scott Munns · Jerry Li · Crystal Qian · Praneet Dutta · Daniel Mankowitz · Jared Quincy Davis · Ningjia Wu · Xingwei Yang · Chu-Ming Chang · Ted Li · Rob Rose · Mingyan Fan · Hootan Nakhost · Tinglin Liu · Deeni Fatiha · Neil Satra · Juliet Rothenberg · Molly Carlin · Satish Tallapaka · Sims Witherspoon · David Parish · Peter Dolan · Chenyu Zhao |
-
|
Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response
(
Poster
)
>
SlidesLive Video |
Vincent Mai · Philippe Maisonneuve · Tianyu Zhang · Jorge Montalvo Arvizu · Liam Paull · Antoine Lesage-Landry 🔗 |
-
|
Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response
(
Spotlight
)
>
|
Vincent Mai · Philippe Maisonneuve · Tianyu Zhang · Jorge Montalvo Arvizu · Liam Paull · Antoine Lesage-Landry 🔗 |
-
|
Identifying Disparities in Sepsis Treatment by Learning the Expert Policy
(
Poster
)
>
SlidesLive Video |
Hyewon Jeong · Siddharth Nayak · Taylor Killian · Sanjat Kanjilal · Marzyeh Ghassemi 🔗 |
-
|
Identifying Disparities in Sepsis Treatment by Learning the Expert Policy
(
Spotlight
)
>
|
Hyewon Jeong · Siddharth Nayak · Taylor Killian · Sanjat Kanjilal · Marzyeh Ghassemi 🔗 |
-
|
Bandits for Online Calibration: An Application to Content Moderation on Social Media Platforms
(
Poster
)
>
|
20 presentersVashist Avadhanula · Omar Abdul Baki · Hamsa Bastani · Osbert Bastani · Caner Gocmen · Daniel Haimovich · Darren Hwang · Dmytro Karamshuk · Thomas Leeper · Jiayuan Ma · Gregory macnamara · Jake Mullet · Christopher Palow · Sung Park · Varun S Rajagopal · Kevin Schaeffer · Parikshit Shah · Deeksha Sinha · Nicolas Stier-Moses · Ben Xu |
-
|
Bandits for Online Calibration: An Application to Content Moderation on Social Media Platforms
(
Spotlight
)
>
SlidesLive Video |
20 presentersVashist Avadhanula · Omar Abdul Baki · Hamsa Bastani · Osbert Bastani · Caner Gocmen · Daniel Haimovich · Darren Hwang · Dmytro Karamshuk · Thomas Leeper · Jiayuan Ma · Gregory macnamara · Jake Mullet · Christopher Palow · Sung Park · Varun S Rajagopal · Kevin Schaeffer · Parikshit Shah · Deeksha Sinha · Nicolas Stier-Moses · Ben Xu |
-
|
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
(
Poster
)
>
|
11 presentersMelody Wolk · Andy Applebaum · Camron Dennler · Patrick Dwyer · Marina Moskowitz · Harold Nguyen · Nicole Nichols · Nicole Park · Paul Rachwalski · Frank Rau · Adrian Webster |
-
|
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
(
Spotlight
)
>
SlidesLive Video |
11 presentersMelody Wolk · Andy Applebaum · Camron Dennler · Patrick Dwyer · Marina Moskowitz · Harold Nguyen · Nicole Nichols · Nicole Park · Paul Rachwalski · Frank Rau · Adrian Webster |
-
|
Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
(
Poster
)
>
SlidesLive Video |
William Wong · Praneet Dutta · Octavian Voicu · Yuri Chervonyi · Cosmin Paduraru · Jerry Luo 🔗 |
-
|
Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
(
Spotlight
)
>
|
William Wong · Praneet Dutta · Octavian Voicu · Yuri Chervonyi · Cosmin Paduraru · Jerry Luo 🔗 |
-
|
Reinforcement Learning Approaches for Traffic Signal Control under Missing Data
(
Poster
)
>
SlidesLive Video |
Hao Mei · Junxian Li · Bin Shi · Hua Wei 🔗 |
-
|
Reinforcement Learning Approaches for Traffic Signal Control under Missing Data
(
Spotlight
)
>
SlidesLive Video |
Hao Mei · Junxian Li · Bin Shi · Hua Wei 🔗 |
-
|
Reinforcement Learning-Based Air Traffic Deconfliction
(
Poster
)
>
|
Denis Osipychev · Dragos Margineantu 🔗 |
-
|
Reinforcement Learning-Based Air Traffic Deconfliction
(
Spotlight
)
>
SlidesLive Video |
Denis Osipychev · Dragos Margineantu 🔗 |
-
|
Automatic Evaluation of Excavator Operators using Learned Reward Functions
(
Poster
)
>
SlidesLive Video |
Pranav Agarwal · Marek Teichmann · Sheldon Andrews · Samira Ebrahimi Kahou 🔗 |
-
|
Automatic Evaluation of Excavator Operators using Learned Reward Functions
(
Spotlight
)
>
|
Pranav Agarwal · Marek Teichmann · Sheldon Andrews · Samira Ebrahimi Kahou 🔗 |
-
|
Function Approximations for Reinforcement Learning Controller for Wave Energy Converters
(
Poster
)
>
SlidesLive Video |
Soumyendu Sarkar · Vineet Gundecha · Alexander Shmakov · Sahand Ghorbanpour · Ashwin Ramesh Babu · Alexandre Pichard · Mathieu Cocho 🔗 |
-
|
Function Approximations for Reinforcement Learning Controller for Wave Energy Converters
(
Spotlight
)
>
SlidesLive Video |
Soumyendu Sarkar · Vineet Gundecha · Alexander Shmakov · Sahand Ghorbanpour · Ashwin Ramesh Babu · Alexandre Pichard · Mathieu Cocho 🔗 |