PGQ: Combining policy gradient and Q-learning
O'Donoghue, Brendan and Munos, Rémi and Kavukcuoglu, Koray and Mnih, Volodymyr
arXiv e-Print archive - 2016 via Local Bibsonomy
Keywords: dblp

Summary by abhishm 7 years ago
Your comment: allows researchers to publish paper summaries that are voted on and ranked!

Sponsored by: