Zixuan Dong
Title
Cited by
Year
On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Z Dong, C Wang, K Ross
arXiv preprint arXiv:2209.02864, 2022
Cited by 2
2022
Cross Entropy versus Label Smoothing: A Neural Collapse Perspective
L Guo, K Ross, Z Zhao, A George, S Ling, Y Xu, Z Dong
arXiv preprint arXiv:2402.03979, 2024
2024
Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Z Wang, C Wang, Z Dong, K Ross
arXiv preprint arXiv:2310.00771, 2023
2023