Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Kumar, Aviral and Fu, Justin and Soh, Matthew and Tucker, George and Levine, Sergey
Neural Information Processing Systems Conference - 2019

Summary by Robert Müller 3 years ago