Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Kumar, Aviral and
Fu, Justin and
Soh, Matthew and
Tucker, George and
Neural Information Processing Systems Conference - 2019
via Local Bibsonomy
Write your summary here (You can use $\LaTeX$ and
You must log in before you can submit this summary! Your draft will not be saved!
Summary by guest just now
ShortScience.org allows researchers to publish paper summaries that are voted on and ranked!