Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution
Chou, Po-Wei
and
Maturana, Daniel
and
Scherer, Sebastian
International Conference on Machine Learning - 2017 via Local Bibsonomy
Keywords:
dblp