Beining Han

120

202020212022202320247 44 66 106 55

Chongjie zhangWashington University in St. LouisVerified email at wustl.edu
Yihan WangPh.D. Student, Princeton UniversityVerified email at princeton.edu
Zhizhou RenUniversity of Illinois at Urbana-ChampaignVerified email at illinois.edu
Heng DongPhD Student, Tsinghua UniversityVerified email at mails.tsinghua.edu.cn
Tonghan WangEcon CS group, Harvard UniversityVerified email at g.harvard.edu
Jianhao WangPhd of Computer Science, Tsinghua UniversityVerified email at mails.tsinghua.edu.cn
Yiming ZuoPrinceton UniversityVerified email at princeton.edu
Zeyu MaPhD student of Computer Science, Princeton UniversityVerified email at princeton.edu
Kaiyu YangCalifornia Institute of TechnologyVerified email at caltech.edu
Ankit GoyalNVIDIAVerified email at nvidia.com
Hei LawPrinceton UniversityVerified email at cs.princeton.edu
Jia DengPrinceton UniversityVerified email at cs.princeton.edu
Alejandro NewellAppleVerified email at apple.com
Jian PengHelixon; Previously at UIUC & MITVerified email at helixon.com
Yuan ZhouDepartment of ISE, University of Illinois Urbana-ChampaignVerified email at illinois.edu
Harris ChanUniversity of Toronto, Vector InstituteVerified email at cs.toronto.edu
Jimmy BaUniversity of TorontoVerified email at cs.toronto.edu
Keiran PasterUniversity of Toronto, Vector InstituteVerified email at cs.toronto.edu
Michael ZhangUniversity of TorontoVerified email at cs.toronto.edu
Guangxiang ZhuTsinghua UniversityVerified email at mails.tsinghua.edu.cn

Beining Han

Verified email at princeton.edu


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Dop: Off-policy multi-agent decomposed policy gradients Y Wang, B Han, T Wang, H Dong, C Zhang International conference on learning representations, 2020	165	2020
Towards understanding cooperative multi-agent q-learning with value factorization J Wang, Z Ren, B Han, J Ye, C Zhang Advances in Neural Information Processing Systems 34, 29142-29155, 2021	44*	2021
Infinite photorealistic worlds using procedural generation A Raistrick, L Lipson, Z Ma, L Mei, M Wang, Y Zuo, K Kayan, H Wen, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	22	2023
Off-policy reinforcement learning with delayed rewards B Han, Z Ren, Z Wu, Y Zhou, J Peng International Conference on Machine Learning, 8280-8303, 2022	21	2022
Learning domain invariant representations in goal-conditioned block mdps B Han, C Zheng, H Chan, K Paster, M Zhang, J Ba Advances in Neural Information Processing Systems 34, 764-776, 2021	14	2021
On the estimation bias in double q-learning Z Ren, G Zhu, H Hu, B Han, J Chen, C Zhang Advances in Neural Information Processing Systems 34, 10246-10259, 2021	12	2021

The system can't perform the operation now. Try again later.

Articles 1–6

Citations per year