Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
2017
Conference Paper
ei
Author(s): | Gu, Shixiang and Lillicrap, Timothy and Ghahramani, Zoubin and Turner, Richard E. and Levine, Sergey |
Book Title: | Proceedings International Conference on Learning Representations (ICLR) |
Year: | 2017 |
Month: | April |
Day: | 24-26 |
Department(s): | Empirical Inference |
Research Project(s): | |
Bibtex Type: | Conference Paper (conference) |
Event Place: | Toulon, France |
State: | Published |
URL: | https://openreview.net/pdf?id=rkE3y85ee |
Links: |
PDF
|
BibTex @conference{GuLilGhaTurLev17, title = {Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic}, author = {Gu, Shixiang and Lillicrap, Timothy and Ghahramani, Zoubin and Turner, Richard E. and Levine, Sergey}, booktitle = {Proceedings International Conference on Learning Representations (ICLR)}, month = apr, year = {2017}, doi = {}, url = {https://openreview.net/pdf?id=rkE3y85ee}, month_numeric = {4} } |