Beining Han
Computer Science, Princeton University
Verified email at princeton.edu
Title
Cited by
Year
DOP: Off-policy multi-agent decomposed policy gradients
Y Wang, B Han, T Wang, H Dong, C Zhang
International Conference on Learning Representations, 2020
Cited by: 165; Year: 2020
Towards understanding cooperative multi-agent Q-learning with value factorization
J Wang, Z Ren, B Han, J Ye, C Zhang
Advances in Neural Information Processing Systems 34, 29142-29155, 2021
Cited by: 44*; Year: 2021
Infinite photorealistic worlds using procedural generation
A Raistrick, L Lipson, Z Ma, L Mei, M Wang, Y Zuo, K Kayan, H Wen, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 22; Year: 2023
Off-policy reinforcement learning with delayed rewards
B Han, Z Ren, Z Wu, Y Zhou, J Peng
International Conference on Machine Learning, 8280-8303, 2022
Cited by: 21; Year: 2022
Learning domain invariant representations in goal-conditioned block MDPs
B Han, C Zheng, H Chan, K Paster, M Zhang, J Ba
Advances in Neural Information Processing Systems 34, 764-776, 2021
Cited by: 14; Year: 2021
On the estimation bias in double Q-learning
Z Ren, G Zhu, H Hu, B Han, J Chen, C Zhang
Advances in Neural Information Processing Systems 34, 10246-10259, 2021
Cited by: 12; Year: 2021