Publications
Balancing Constraints and Rewards with Meta-Gradient D4PG
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity
Discovering Reinforcement Learning Algorithms
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
A Self-Tuning Actor-Critic Algorithm
What Can Learned Intrinsic Rewards Capture?
Grandmaster level in StarCraft II using multi-agent reinforcement learning
Discovery of Useful Questions as Auxiliary Tasks
Unicorn: Continual Learning with a Universal, Off-policy Agent
Contingency-Aware Exploration in Reinforcement Learning