Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning
2017
Conference Paper
Author(s): | Gu, S. and Lillicrap, T. and Turner, R. E. and Ghahramani, Z. and Schölkopf, B. and Levine, S. |
Book Title: | Advances in Neural Information Processing Systems 30 (NIPS 2017) |
Pages: | 3849--3858 |
Year: | 2017 |
Month: | December |
Day: | 4-9 |
Editors: | Guyon I. and Luxburg U.v. and Bengio S. and Wallach H. and Fergus R. and Vishwanathan S. and Garnett R. |
Publisher: | Curran Associates, Inc. |
Department(s): | Empirical Inference |
Bibtex Type: | Conference Paper (conference) |
Event Name: | 31st Annual Conference on Neural Information Processing Systems |
Event Place: | Long Beach, CA, USA |
State: | Published |
URL: | http://papers.nips.cc/paper/6974-interpolated-policy-gradient-merging-on-policy-and-off-policy-gradient-estimation-for-deep-reinforcement-learning.pdf |
BibTeX:
@conference{Guetal17,
  title = {Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning},
  author = {Gu, S. and Lillicrap, T. and Turner, R. E. and Ghahramani, Z. and Sch{\"o}lkopf, B. and Levine, S.},
  booktitle = {Advances in Neural Information Processing Systems 30 (NIPS 2017)},
  pages = {3849--3858},
  editors = {Guyon I. and Luxburg U.v. and Bengio S. and Wallach H. and Fergus R. and Vishwanathan S. and Garnett R.},
  publisher = {Curran Associates, Inc.},
  month = dec,
  year = {2017},
  doi = {},
  url = {http://papers.nips.cc/paper/6974-interpolated-policy-gradient-merging-on-policy-and-off-policy-gradient-estimation-for-deep-reinforcement-learning.pdf},
  month_numeric = {12}
}