Heyang Zhao

2022202320247 38 70

Public access

3 articles

0 articles

available

not available

Based on funding mandates

Quanquan GuAssociate Professor of Computer Science, UCLAVerified email at cs.ucla.edu
Jiafan HePhD student, Department of Computer Science, UCLAVerified email at ucla.edu
Dongruo ZhouIndiana University BloomingtonVerified email at iu.edu
QIWEI DIPhd student, Department of Computer Science , University of California, Los AngelesVerified email at cs.ucla.edu
Tong ZhangUIUCVerified email at tongzhang-ml.org
Farzad FarnoudUniversity of VirginiaVerified email at virginia.edu
Tao JinPhD Student, University of VirginiaVerified email at virginia.edu
Yue WuPhD student, Department of Computer Science, UCLAVerified email at ucla.edu
XUHENG LIDepartment of Computer Science, University of California, Los AngelesVerified email at ucla.edu

Heyang Zhao

Verified email at cs.ucla.edu - Homepage


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Nearly minimax optimal reinforcement learning for linear markov decision processes J He, H Zhao, D Zhou, Q Gu International Conference on Machine Learning, 12790-12822, 2023	49	2023
Variance-dependent regret bounds for linear bandits and reinforcement learning: Adaptivity and computational efficiency H Zhao, J He, D Zhou, T Zhang, Q Gu The Thirty Sixth Annual Conference on Learning Theory, 2023	21	2023
Linear contextual bandits with adversarial corruptions H Zhao, D Zhou, Q Gu arXiv preprint arXiv:2110.12615, 2021	21	2021
Optimal online generalized linear regression with stochastic noise and its application to heteroscedastic bandits H Zhao, D Zhou, J He, Q Gu International Conference on Machine Learning, 42259-42279, 2023	11*	2023
Variance-aware regret bounds for stochastic contextual dueling bandits Q Di, T Jin, Y Wu, H Zhao, F Farnoud, Q Gu arXiv preprint arXiv:2310.00968, 2023	5	2023
Pessimistic nonlinear least-squares value iteration for offline reinforcement learning Q Di, H Zhao, J He, Q Gu arXiv preprint arXiv:2310.01380, 2023	4	2023
A nearly optimal and low-switching algorithm for reinforcement learning with general function approximation H Zhao, J He, Q Gu arXiv preprint arXiv:2311.15238, 2023	3	2023
Feel-Good Thompson Sampling for Contextual Dueling Bandits X Li, H Zhao, Q Gu arXiv preprint arXiv:2404.06013, 2024	1	2024

The system can't perform the operation now. Try again later.

Articles 1–8

Citations per year