Shun Zhang

Cited by

	All	Since 2019
Citations	507	410
h-index	10	10
i10-index	10	10

Citations per year
Year	Citations
2014	3
2015	5
2016	16
2017	32
2018	39
2019	23
2020	41
2021	41
2022	49
2023	121
2024	135

Public access

9 articles available
0 articles not available

Based on funding mandates

Co-authors

Peter Stone	Professor of Computer Science, The University of Texas at Austin	Verified email at cs.utexas.edu
Chuang Gan	UMass Amherst | MIT-IBM Watson AI Lab	Verified email at csail.mit.edu
Tsz-Chiu Au	Ulsan National Institute of Science and Technology	Verified email at cs.utexas.edu
Edmund Durfee	Professor Emeritus of Computer Science and Engineering, University of Michigan	Verified email at umich.edu
Satinder Singh	Google DeepMind / U. of Michigan	Verified email at umich.edu
Dana Ballard	Professor of Computer Science, University of Texas at Austin	Verified email at cs.utexas.edu
Mary Hayhoe	Professor of Psychology, University of Texas Austin	Verified email at utexas.edu
Matthew Tong	IBM Research	Verified email at alumni.ucsd.edu
Xin Zhang	IBM Thomas J. Watson Research Center / Columbia University	Verified email at us.ibm.com
Ruohan Zhang	Stanford University	Verified email at stanford.edu

Shun Zhang

MIT-IBM Watson AI Lab

Verified email at ibm.com

reinforcement learning, human-agent interaction, value alignment, AI safety


Title	Authors	Venue	Cited by	Year
Autonomous intersection management for semi-autonomous vehicles	TC Au, S Zhang, P Stone	Routledge Handbook of Transportation, 88-104, 2015	147	2015
Prompting Decision Transformer for Few-Shot Policy Generalization	M Xu, Y Shen, S Zhang, Y Lu, D Zhao, JB Tenenbaum, C Gan	International Conference on Machine Learning, 2022	99	2022
Planning with large language models for code generation	S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan	arXiv preprint arXiv:2303.05510, 2023	71	2023
Minimax-Regret Querying on Side Effects for Safe Optimality in Factored Markov Decision Processes	S Zhang, EH Durfee, S Singh	IJCAI, 4867-4873, 2018	45	2018
Determining placements of influencing agents in a flock	K Genter, S Zhang, P Stone	Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	30	2015
Hyper-decision transformer for efficient online policy adaptation	M Xu, Y Lu, Y Shen, S Zhang, D Zhao, C Gan	arXiv preprint arXiv:2304.08487, 2023	29	2023
Semi-autonomous intersection management	TC Au, S Zhang, P Stone	AAMAS, 1451-1452, 2014	28	2014
Modeling sensory-motor decisions in natural behavior	R Zhang, S Zhang, MH Tong, Y Cui, CA Rothkopf, DH Ballard, ...	PLoS computational biology 14 (10), e1006518, 2018	13	2018
Querying to find a safe policy under uncertain safety constraints in Markov decision processes	S Zhang, E Durfee, S Singh	Proceedings of the AAAI Conference on Artificial Intelligence 34 (03), 2552-2559, 2020	11	2020
From specification to topology: Automatic power converter design via reinforcement learning	S Fan, N Cao, S Zhang, J Li, X Guo, X Zhang	2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021	10	2021
Approximately-optimal queries for planning in reward-uncertain Markov decision processes	S Zhang, E Durfee, S Singh	Proceedings of the International Conference on Automated Planning and …, 2017	9	2017
Modeling Task Control of Gaze	M Tong, S Zhang, L Johnson, D Ballard, M Hayhoe	Journal of Vision 15 (12), 784-784, 2015	4	2015
Improving reinforcement learning from human feedback with efficient reward model ensemble	S Zhang, Z Chen, S Chen, Y Shen, Z Sun, C Gan	arXiv preprint arXiv:2401.16635, 2024	3	2024
Adaptive Online Replanning with Diffusion Models	S Zhou, Y Du, S Zhang, M Xu, Y Shen, W Xiao, DY Yeung, C Gan	Advances in Neural Information Processing Systems 36, 2023	3	2023
Power Converter Circuit Design Automation using Parallel Monte Carlo Tree Search	S Fan, S Zhang, J Liu, N Cao, X Guo, J Li, X Zhang	ACM Transactions on Design Automation of Electronic Systems (TODAES), 2022	2	2022
Modeling Sensorimotor Behavior through Modular Inverse Reinforcement Learning with Discount Factors	R Zhang, S Zhang, MH Tong, MM Hayhoe, DH Ballard	Journal of Vision 17 (10), 1267-1267, 2017	1	2017
Parameterized modular inverse reinforcement learning	S Zhang		1	2015
Intersection Management With Constraint-Based Reservation Systems	TC Au, S Zhang, P Stone	Autonomous Robots and Multirobot Systems (ARMS), 2014	1	2014
LaMAGIC: Language-Model-based Topology Generation for Analog Integrated Circuits	CC Chang, Y Shen, S Fan, J Li, S Zhang, N Cao, Y Chen, X Zhang	Forty-first International Conference on Machine Learning, 2024		2024
Efficiently Finding Approximately-Optimal Queries for Improving Policies and Guaranteeing Safety	S Zhang			2020
