关注
Miao Lu
Miao Lu
其他姓名Lu, Miao
在 stanford.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
M Lu, Y Min, Z Wang, Z Yang
International Conference on Learning Representations, 2023
222023
Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining
M Lu, X Luo, T Chen, W Chen, D Liu, Z Wang
International Conference on Learning Representations 𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭, 2022
202022
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
Z Liu, M Lu, W Xiong, H Zhong, H Hu, S Zhang, S Zheng, Z Yang, Z Wang
Neural Information Processing Systems 𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭, 2023
17*2023
Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy
Z Liu, M Lu, Z Wang, M Jordan, Z Yang
International Conference on Machine Learning, 2022
172022
Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization
Y Kuang, M Lu, J Wang, Q Zhou, B Li, H Li
Association for the Advancement of Artificial Intelligence, 2022
172022
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage
J Blanchet, M Lu, T Zhang, H Zhong
Neural Information Processing Systems, 2023
152023
Benign Oscillation of Stochastic Gradient Descent with Large Learning Rates
M Lu, B Wu, X Yang, D Zou
International Conference on Learning Representations, 2024
12024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
M Lu, H Zhong, T Zhang, J Blanchet
arXiv preprint arXiv:2404.03578, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–8