关注
Nuoya Xiong
Nuoya Xiong
在 mails.tsinghua.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Combinatorial pure exploration of causal bandits
N Xiong, W Chen
arXiv preprint arXiv:2206.07883, 2022
72022
Provably safe reinforcement learning with step-wise violation constraints
N Xiong, Y Du, L Huang
Advances in Neural Information Processing Systems 36, 2024
32024
A General Framework for Sequential Decision-Making under Adaptivity Constraints
N Xiong, Z Wang, Z Yang
arXiv preprint arXiv:2306.14468, 2023
32023
Combinatorial Causal Bandits without Graph Skeleton
S Feng, N Xiong, W Chen
arXiv preprint arXiv:2301.13392, 2023
32023
Sample-Efficient Multi-Agent RL: An Optimization Perspective
N Xiong, Z Liu, Z Wang, Z Yang
arXiv preprint arXiv:2310.06243, 2023
12023
How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization
N Xiong, L Ding, SS Du
arXiv preprint arXiv:2310.01769, 2023
12023
A Correction of Pseudo Log-Likelihood Method
S Feng, N Xiong, Z Zhang, W Chen
arXiv preprint arXiv:2403.18127, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–7