Balancing Constraints and Rewards with Meta-Gradient D4PG

Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity

Discovering Reinforcement Learning Algorithms

Meta-Gradient Reinforcement Learning with an Objective Discovered Online

A Self-Tuning Actor-Critic Algorithm

What Can Learned Intrinsic Rewards Capture?

Grandmaster level in StarCraft II using multi-agent reinforcement learning

Discovery of Useful Questions as Auxiliary Tasks

Unicorn: Continual Learning with a Universal, Off-policy Agent

Contingency-Aware Exploration in Reinforcement Learning

On Learning Intrinsic Rewards for Policy Gradient Methods

Many-Goals Reinforcement Learning

Generative Adversarial Self-Imitation Learning

Self-Imitation Learning

Value Prediction Network

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Action-Conditional Video Prediction using Deep Networks in Atari Games