Ziyu Wang

Cited by

	All	Since 2019
Citations	25332	22821
h-index	41	40
i10-index	58	55

5000

2500

1250

3750

201520162017201820192020202120222023202475 271 536 1267 1995 3163 4212 4580 4905 3933

Public access

View all

9 articles

1 article

available

not available

Based on funding mandates

Co-authors

Nando de FreitasCIFAR & DeepMindVerified email at google.com
Nicolas HeessDeepMindVerified email at google.com
Scott ReedResearch Scientist, NVIDIA ResearchVerified email at google.com
Bobak ShahriariDeepMindVerified email at google.com
Josh MerelVerified email at google.com
Tom Le PaineStaff Research Scientist at Google DeepMindVerified email at google.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLVerified email at google.com
Tom SchaulSenior Staff Scientist, DeepMindVerified email at nyu.edu
Misha DenilDeepMindVerified email at google.com
David BuddenGoogle DeepMindVerified email at csail.mit.edu
Yusuf AytarResearch Scientist, DeepMindVerified email at google.com
Matteo HesselResearch Engineer, Google DeepMindVerified email at google.com
Tom ErezResearcher, DeepMindVerified email at google.com
Yuval TassaSenior Research Scientist, Google DeepMindVerified email at google.com
Kevin SwerskyGoogle BrainVerified email at cs.toronto.edu
Ryan P. AdamsPrinceton UniversityVerified email at princeton.edu
Marc LanctotResearch Scientist, Google DeepMindVerified email at google.com
Victor BapstGoethe Universität, FrankfurtVerified email at math.uni-frankfurt.de
Matthew W. HoffmanGoogle DeepMindVerified email at google.com
Yutian ChenResearch Scientist, DeepMindVerified email at google.com

Ziyu Wang

Deepmind

Verified email at google.com - Homepage

machine learning Statistics Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Taking the human out of the loop: A review of Bayesian optimization B Shahriari, K Swersky, Z Wang, RP Adams, N De Freitas Proceedings of the IEEE 104 (1), 148-175, 2015	5660	2015
Dueling network architectures for deep reinforcement learning Z Wang, T Schaul, M Hessel, H Hasselt, M Lanctot, N Freitas International conference on machine learning, 1995-2003, 2016	5254	2016
Grandmaster level in StarCraft II using multi-agent reinforcement learning O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ... nature 575 (7782), 350-354, 2019	4778	2019
Emergence of locomotion behaviours in rich environments N Heess, D Tb, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ... arXiv preprint arXiv:1707.02286, 2017	1146	2017
Sample efficient actor-critic with experience replay Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ... arXiv preprint arXiv:1611.01224, 2016	1026	2016
Bayesian optimization in a billion dimensions via random embeddings Z Wang, F Hutter, M Zoghi, D Matheson, N De Feitas Journal of Artificial Intelligence Research 55, 361-387, 2016	861	2016
Alphastar: Mastering the real-time strategy game starcraft ii O Vinyals, I Babuschkin, J Chung, M Mathieu, M Jaderberg, ... DeepMind blog 2, 20, 2019	565	2019
Autonomous navigation of stratospheric balloons using reinforcement learning MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ... Nature 588 (7836), 77-82, 2020	407	2020
Reinforcement and imitation learning for diverse visuomotor skills Y Zhu, Z Wang, J Merel, A Rusu, T Erez, S Cabi, S Tunyasuvunakool, ... arXiv preprint arXiv:1802.09564, 2018	376	2018
Deep fried convnets Z Yang, M Moczulski, M Denil, N De Freitas, A Smola, L Song, Z Wang Proceedings of the IEEE international conference on computer vision, 1476-1483, 2015	353	2015
Learning an embedding space for transferable robot skills K Hausman, JT Springenberg, Z Wang, N Heess, M Riedmiller International Conference on Learning Representations, 2018	347	2018
Critic regularized regression Z Wang, A Novikov, K Zolna, JS Merel, JT Springenberg, SE Reed, ... Advances in Neural Information Processing Systems 33, 7768-7778, 2020	324	2020
Playing hard exploration games by watching youtube Y Aytar, T Pfaff, D Budden, T Paine, Z Wang, N De Freitas Advances in neural information processing systems 31, 2018	317	2018
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	258	2020
Robust imitation of diverse behaviors Z Wang, JS Merel, SE Reed, N de Freitas, G Wayne, N Heess Advances in Neural Information Processing Systems 30, 2017	249	2017
Learning human behaviors from motion capture by adversarial imitation J Merel, Y Tassa, D TB, S Srinivasan, J Lemmon, Z Wang, G Wayne, ... arXiv preprint arXiv:1707.02201, 2017	244	2017
Parallel multiscale autoregressive density estimation S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ... International conference on machine learning, 2912-2921, 2017	240	2017
Rl unplugged: A suite of benchmarks for offline reinforcement learning C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ... Advances in Neural Information Processing Systems 33, 7248-7259, 2020	181	2020
Hyperparameter selection for offline reinforcement learning TL Paine, C Paduraru, A Michi, C Gulcehre, K Zolna, A Novikov, Z Wang, ... arXiv preprint arXiv:2007.09055, 2020	161	2020
Bayesian optimization in alphago Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ... arXiv preprint arXiv:1812.06855, 2018	160	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors