Zhihan Xiong
Title
Cited by
Year
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
Z Xiong*, R Camilleri*, M Fazel, L Jain, K Jamieson
arXiv preprint arXiv:2307.15154, 2023
Cited by: 1, Year: 2023
A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
H Jiang, Q Cui, Z Xiong, M Fazel, SS Du
arXiv preprint arXiv:2306.07465, 2023
Year: 2023
Learning in congestion games with bandit feedback
Q Cui*, Z Xiong*, M Fazel, SS Du
Advances in Neural Information Processing Systems 35, 11009-11022, 2022
Cited by: 14, Year: 2022
Near-Optimal Randomized Exploration for Tabular Markov Decision Processes
Z Xiong*, R Shen*, Q Cui*, M Fazel, SS Du
Advances in Neural Information Processing Systems 35, 6358-6371, 2022
Cited by: 20*, Year: 2022
Offline congestion games: How feedback type affects data coverage requirement
H Jiang*, Q Cui*, Z Xiong, M Fazel, SS Du
arXiv preprint arXiv:2210.13396, 2022
Cited by: 1, Year: 2022
Fourier Learning with Cyclical Data
Y Yang*, Z Xiong*, T Liu*, T Wang, C Wang
International Conference on Machine Learning, 25280-25301, 2022
Cited by: 1, Year: 2022
Selective sampling for online best-arm identification
R Camilleri*, Z Xiong*, M Fazel, L Jain, KG Jamieson
Advances in Neural Information Processing Systems 34, 11071-11082, 2021
Cited by: 6, Year: 2021
Parameterized indexed value function for efficient exploration in reinforcement learning
T Tan*, Z Xiong*, VR Dwaracherla
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5948-5955, 2020
Cited by: 7, Year: 2020