1

What Can Learned Intrinsic Rewards Capture?

Grandmaster level in StarCraft II using multi-agent reinforcement learning

Discovery of Useful Questions as Auxiliary Tasks

Unicorn: Continual Learning with a Universal, Off-policy Agent

Contingency-Aware Exploration in Reinforcement Learning

On Learning Intrinsic Rewards for Policy Gradient Methods

Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies

Self-Imitation Learning

Value Prediction Network

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning