Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic

Author(s):	Gu, Shixiang and Lillicrap, Timothy and Ghahramani, Zoubin and Turner, Richard E. and Levine, Sergey
Book Title:	Proceedings International Conference on Learning Representations (ICLR)
Year:	2017
Month:	April
Day:	24-26

Department(s):	Empirical Inference
Research Project(s):
Bibtex Type:	Conference Paper (conference)

Event Place:	Toulon, France

State:	Published
URL:	https://openreview.net/pdf?id=rkE3y85ee

Links:	PDF

BibTex @conference{GuLilGhaTurLev17, title = {Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic}, author = {Gu, Shixiang and Lillicrap, Timothy and Ghahramani, Zoubin and Turner, Richard E. and Levine, Sergey}, booktitle = {Proceedings International Conference on Learning Representations (ICLR)}, month = apr, year = {2017}, doi = {}, url = {https://openreview.net/pdf?id=rkE3y85ee}, month_numeric = {4} }

Shane Gu

Alumni

Latest News