A list of papers regarding generalization in (deep) reinforcement learning. Please feel free to open an issue to add papers.
- [arXiv 2021] Recurrent Model-Free RL is a Strong Baseline for Many POMDPs
- [arXiv 2021] A Survey of Generalisation in Deep Reinforcement Learning
- [arXiv 2021] Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
- [arXiv 2021] Generalization of Reinforcement Learning with Policy-Aware Adversarial Data Augmentation
- [arXiv 2021] Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
- [arXiv 2021] Robust Deep Reinforcement Learning via Multi-View Information Bottleneck
- [arXiv 2021] Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning
- [NeurIPS 2021] Automatic Data Augmentation for Generalization in Reinforcement Learning
- [ICML 2021] Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment
- [ICML 2021] Decoupling Value and Policy for Generalization in Reinforcement Learning
- [ICML 2021] Prioritized Level Replay
- [ICRA 2021] Generalization in Reinforcement Learning by Soft Data Augmentation
- [ICLR 2021] Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
- [ICLR 2021] Transient Non-stationarity and Generalisation in Deep Reinforcement Learning
- [ICLR 2021] Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
- [Procgen Challenge 2020] Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks
- [NeurIPS 2020] Instance based Generalization in Reinforcement Learning
- [NeurIPS 2020] Improving Generalization in Reinforcement Learning with Mixture Regularization
- [NeurIPS 2020] Reinforcement Learning with Augmented Data
- [ICML 2020] Fast Adaptation to New Environments via Policy-Dynamics Value Functions
- [ICML 2020] Leveraging Procedural Generation to Benchmark Reinforcement Learning
- [ICLR 2020] Observational Overfitting in Reinforcement Learning
- [ICLR 2020] Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning
- [NeurIPS 2019] Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck
- [ICML 2019] On the Generalization Gap in Reparameterizable Reinforcement Learning
- [ICML 2019] Quantifying Generalization in Reinforcement Learning
- [ICML 2019] Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
- [ICMLW 2019] The Principle of Unchanged Optimality in Reinforcement Learning Generalization
- [arXiv 2018] Natural Environment Benchmarks for Reinforcement Learning
- [arXiv 2018] Assessing Generalization in Deep Reinforcement Learning
- [arXiv 2018] A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
- [arXiv 2018] A Study on Overfitting in Deep Reinforcement Learning
- [arXiv 2018] Gotta Learn Fast: A New Benchmark for Generalization in RL
- [NeurIPSW 2018] Generalization and Regularization in DQN
- [ICRA 2018] Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
- [NeurIPS 2017] Towards Generalization and Simplicity in Continuous Control