Shangtong Zhang

Cited by

	All	Since 2019
Citations	1230	1191
h-index	16	16
i10-index	25	24

300

150

225

201720182019202020212022202320246 27 62 159 225 282 298 161

Public access

View all

11 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoVerified email at cs.ox.ac.uk
Richard S. SuttonKeen, Amii, and University of AlbertaVerified email at richsutton.com
Bo LiuPhD, AAAI SM, IEEE SMVerified email at cs.umass.edu
Linglong KongProfessor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, AmiiVerified email at ualberta.ca
Remi Tachet des CombesVerified email at alpacaml.com
Romain LarocheMicrosoft ResearchVerified email at polytechnique.org
Wendelin BöhmerSequential Decision Making Group, Delft University of TechnologyVerified email at tudelft.nl
Ray JiangResearch Scientist, DeepMindVerified email at google.com
Marcus EdelComputer Science, Free University of BerlinVerified email at fu-berlin.de
Ryan R. CurtinFree agentVerified email at ratml.org
Nando de FreitasCIFAR & DeepMindVerified email at google.com
Tom Le PaineStaff Research Scientist at Google DeepMindVerified email at google.com
Julian SchrittwieserDeepMindVerified email at furidamu.org
Roman RingGoogle DeepMindVerified email at deepmind.com
Petko GeorgievGoogle DeepMind, University of CambridgeVerified email at cam.ac.uk
Michael MathieuDeepMindVerified email at google.com
Aäron van den OordGoogle DeepMindVerified email at google.com
Caglar GulcehreAI Researcher, Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMindVerified email at google.com
Aja HuangDeepMindVerified email at google.com
Sherjil OzairTesla AIVerified email at tesla.com

Shangtong Zhang

University of Virginia

Verified email at virginia.edu - Homepage

reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A Deeper Look at Experience Replay S Zhang, RS Sutton Deep Reinforcement Learning Symposium, NIPS 2017, 2017	345	2017
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values S Zhang, B Liu, S Whiteson ICML 2020, 2020	101	2020
Distributional Reinforcement Learning for Efficient Exploration B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu ICML 2019, 2019	94	2019
mlpack 3: a fast, flexible machine learning library R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang Journal of Open Source Software 3 (26), 726, 2018	88	2018
DAC: The Double Actor-Critic Architecture for Learning Options S Zhang, S Whiteson NeurIPS 2019, 2019	81	2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation S Zhang, B Liu, H Yao, S Whiteson ICML 2020, 2019	57	2019
Generalized Off-Policy Actor-Critic S Zhang, W Boehmer, S Whiteson NeurIPS 2019, 2019	54	2019
Breaking the Deadly Triad with a Target Network S Zhang, H Yao, S Whiteson ICML 2021, 2021	46	2021
Mean-variance policy iteration for risk-averse reinforcement learning S Zhang, B Liu, S Whiteson Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10905 …, 2021	38	2021
Average-Reward Off-Policy Policy Evaluation with Function Approximation S Zhang, Y Wan, RS Sutton, S Whiteson ICML 2021, 2021	35	2021
QUOTA: The Quantile Option Architecture for Reinforcement Learning S Zhang, B Mavrin, L Kong, B Liu, H Yao AAAI 2019, 2018	34	2018
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search S Zhang, H Chen, H Yao AAAI 2019, 2018	31	2018
Modularized Implementation of Deep RL Algorithms in PyTorch S Zhang	31*	2018
Deep Residual Reinforcement Learning S Zhang, W Boehmer, S Whiteson AAMAS 2020, 2019	27	2019
A deep neural network for modeling music P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang Proceedings of the 5th ACM on International Conference on Multimedia …, 2015	27	2015
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... arXiv preprint arXiv:2308.03526, 2023	25*	2023
Learning expected emphatic traces for deep RL R Jiang, S Zhang, V Chelu, A White, H van Hasselt Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 7015-7023, 2022	15	2022
Learning Retrospective Knowledge with Reverse Reinforcement Learning S Zhang, V Veeriah, S Whiteson NeurIPS 2020, 2020	13	2020
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu AAAI 2020, 2019	13	2019
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control S Zhang, OR Zaiane Deep Reinforcement Learning Symposium, NIPS 2017, 2017	13	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors