Jiayi Huang
Verified email at stu.pku.edu.cn
Title
Cited by
Year
Tackling heavy-tailed rewards in reinforcement learning with function approximation: Minimax optimal and instance-dependent regret bounds
J Huang, H Zhong, L Wang, L Yang
Advances in Neural Information Processing Systems 36, 2024
Cited by: 6, Year: 2024
Breaking the moments condition barrier: No-regret algorithm for bandits with super heavy-tailed payoffs
H Zhong, J Huang, L Yang, L Wang
Advances in Neural Information Processing Systems 34, 15710-15720, 2021
Cited by: 5, Year: 2021
Horizon-free and instance-dependent regret bounds for reinforcement learning with general function approximation
J Huang, H Zhong, L Wang, L Yang
International Conference on Artificial Intelligence and Statistics, 3673-3681, 2024
Cited by: 1, Year: 2024
Articles 1–3