papers.nips.cc
scholar.google.com
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Kumar, Aviral and Fu, Justin and Soh, Matthew and Tucker, George and Levine, Sergey
Neural Information Processing Systems Conference - 2019 via Local Bibsonomy
Keywords: dblp


[link]
Summary by Robert Müller 4 years ago
Loading...
Your comment:


ShortScience.org allows researchers to publish paper summaries that are voted on and ranked!
About

Sponsored by: