Follow
Jiayi Huang
Jiayi Huang
Verified email at stu.pku.edu.cn - Homepage
Title
Cited by
Cited by
Year
Tackling heavy-tailed rewards in reinforcement learning with function approximation: Minimax optimal and instance-dependent regret bounds
J Huang, H Zhong, L Wang, L Yang
Advances in Neural Information Processing Systems 36, 2024
62024
Breaking the moments condition barrier: No-regret algorithm for bandits with super heavy-tailed payoffs
H Zhong, J Huang, L Yang, L Wang
Advances in Neural Information Processing Systems 34, 15710-15720, 2021
52021
Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation
J Huang, H Zhong, L Wang, L Yang
International Conference on Artificial Intelligence and Statistics, 3673-3681, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–3