Shengyi Huang

Cited by

	All	Since 2019
Citations	751	751
h-index	10	10
i10-index	10	10

380

190

285

202020212022202320243 23 97 367 260

Co-authors

Santiago OntañónResearch Scientist, Google ResearchVerified email at google.com
Rousslan Fernand Julien DossaKobe UniversityVerified email at ai.cs.kobe-u.ac.jp
Chang YeGoogleVerified email at google.com
Christopher BamfordQueen Mary University of LondonVerified email at qmul.ac.uk
Anitha KannanCuraiVerified email at curai.com
Xavier AmatriainVP of Product, Core ML/AI. GoogleVerified email at amatriain.net
Ilya ValmianskiResearch scientist at CuraiVerified email at curai.com
David GrethleinComputer Science PhD Candidate, Drexel UniversityVerified email at drexel.edu
Namit KatariyaTech Lead Manager, ML Platform at Faire

Shengyi Huang

Drexel University

Verified email at drexel.edu - Homepage

Artificial Intelligence Reinforcement Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A closer look at invalid action masking in policy gradient algorithms S Huang, S Ontañón The International FLAIRS Conference 2022 35, 2022	250	2022
Zephyr: Direct distillation of lm alignment L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... arXiv preprint arXiv:2310.16944, 2023	139	2023
Cleanrl: High-quality single-file implementations of deep reinforcement learning algorithms S Huang, RFJ Dossa, C Ye, J Braga, D Chakraborty, K Mehta, ... Journal of Machine Learning Research 23 (274), 1-18, 2022	124	2022
The 37 Implementation Details of Proximal Policy Optimization S Huang, RFJ Dossa, A Raffin, A Kanervisto, W Wang International Conference on Learning Representations Blog Track, 2022	72	2022
Trl: Transformer reinforcement learning L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert, ... GitHub. Available online at: https://github. com/lvwerra/trl, 2020	55	2020
Envpool: A highly parallel reinforcement learning environment execution engine J Weng, M Lin, S Huang, B Liu, D Makoviichuk, V Makoviychuk, Z Liu, ... Advances in Neural Information Processing Systems 35, 22409-22421, 2022	36	2022
Gym-RTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning S Huang, S Ontañón, C Bamford, L Grela Proceedings of the 3rd IEEE Conference on Games, 2021	28	2021
A2C is a special case of PPO S Huang, A Kanervisto, A Raffin, W Wang, S Ontañón, RFJ Dossa arXiv preprint arXiv:2205.09123, 2022	15	2022
Action guidance: Getting the best of sparse rewards and shaped rewards for real-time strategy games S Huang, S Ontañón AIIDE-20 Workshop on Artificial Intelligence for Strategy Games, 2020	11	2020
An empirical investigation of early stopping optimizations in proximal policy optimization RFJ Dossa, S Huang, S Ontañón, T Matsubara IEEE Access 9, 117981-117992, 2021	10	2021
Comparing Observation and Action Representations for Deep Reinforcement Learning in RTS S Huang, S Ontañón AIIDE-19 Workshop on Artificial Intelligence for Strategy Games, 2019	7*	2019
Medcod: A medically-accurate, emotive, diverse, and controllable dialog system R Compton, I Valmianski, L Deng, C Huang, N Katariya, X Amatriain, ... Machine Learning for Health, 110-129, 2021	3	2021
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning S Huang, Q Gallouédec, F Felten, A Raffin, RFJ Dossa, Y Zhao, ... arXiv preprint arXiv:2402.03046, 2024	1	2024
The N+ Implementation Details of RLHF with PPO: A Case Study on TL; DR Summarization S Huang, M Noukhovitch, A Hosseini, K Rasul, W Wang, L Tunstall arXiv preprint arXiv:2403.17031, 2024		2024
Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks R Sullivan, A Kumar, S Huang, J Dickerson, J Suarez Advances in Neural Information Processing Systems 36, 2024		2024
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform S Huang, J Weng, R Charakorn, M Lin, Z Xu, S Ontañón The Twelfth International Conference on Learning Representations, 2023		2023
Measuring Generalization of Deep Reinforcement Learning with Real-time Strategy Games SO Shengyi Huang AAAI-21 Workshop on Reinforcement Learning in Games, 2021		2021
StreetTraffic: A Library for Traffic Flow Data Collection and Analysis S Huang, C Healy Proceedings of the ACMSE 2018 Conference, 1-3, 2018		2018
The N Implementation Details of RLHF with PPO S Huang, T Liu, L Von Werra The Third Blogpost Track at ICLR 2024, 0

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors