Reward learning from human preferences and demonstrations in Atari
Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, and Dario Amodei
arXiv e-Print archive - 2018 via Local arXiv
Keywords: cs.LG, cs.AI, cs.NE, stat.ML


Summary by wassname 2 years ago