1

Balancing Constraints and Rewards with Meta-Gradient D4PG

Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity

Discovering Reinforcement Learning Algorithms

Meta-Gradient Reinforcement Learning with an Objective Discovered Online

A Self-Tuning Actor-Critic Algorithm

What Can Learned Intrinsic Rewards Capture?

Grandmaster level in StarCraft II using multi-agent reinforcement learning

Discovery of Useful Questions as Auxiliary Tasks

Unicorn: Continual Learning with a Universal, Off-policy Agent

Contingency-Aware Exploration in Reinforcement Learning