Fengshuo Bai


Yali Du, Turing Fellow, Assistant Professor, King's College London. Verified email at kcl.ac.uk
Yaodong Yang, BOYA (博雅) Assistant Professor at Peking University. Verified email at pku.edu.cn
Runze Liu, Tsinghua University. Verified email at mails.tsinghua.edu.cn
Zhaowei Zhang, Peking University. Verified email at stu.pku.edu.cn
Hongming Zhang, University of Alberta. Verified email at ualberta.ca
Ying Wen, Associate Professor, Shanghai Jiao Tong University. Verified email at sjtu.edu.cn

Fengshuo Bai

Verified email at sjtu.edu.cn


Title	Cited by	Year
Meta-reward-net: Implicitly differentiable reward learning for preference-based reinforcement learning. R Liu, F Bai, Y Du, Y Yang. Advances in Neural Information Processing Systems 35, 22270-22284, 2022	30	2022
Picor: Multi-task deep reinforcement learning with policy correction. F Bai, H Zhang, T Tao, Z Wu, Y Wang, B Xu. Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 6728-6736, 2023	4	2023
Measuring Value Understanding in Language Models through Discriminator-Critique Gap. Z Zhang, F Bai, J Gao, Y Yang. arXiv preprint arXiv:2310.00378, 2023	3	2023
Zero-shot Preference Learning for Offline RL via Optimal Transport. R Liu, Y Du, F Bai, J Lyu, X Li. arXiv preprint arXiv:2306.03615, 2023	3	2023
Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects. Z Zhang, F Bai, M Wang, H Ye, C Ma, Y Yang. arXiv preprint arXiv:2402.12907, 2024		2024
β-DQN: Diverse Exploration via Learning a Behavior Function. H Zhang, F Bai, C Xiao, C Gao, M Müller		2023
BATTLE: Towards Behavior-oriented Adversarial Attacks against Deep Reinforcement Learning. F Bai, R Liu, Y Du, Y Wen, Y Yang		2023
Zero-shot Cross-task Preference Alignment for Offline RL via Optimal Transport. R Liu, Y Du, F Bai, J Lyu, X Li		2023


