Jianxiong Li
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Year
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
H Xu, L Jiang, J Li, Z Yang, Z Wang, VWK Chan, X Zhan
ICLR 2023, 2023
Cited by: 39; Year: 2023
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
H Xu, L Jiang, J Li, X Zhan
NeurIPS 2022, 2022
Cited by: 26; Year: 2022
When data geometry meets deep function: Generalizing offline reinforcement learning
J Li, X Zhan, H Xu, X Zhu, J Liu, YQ Zhang
ICLR 2023, 2023
Cited by: 24*; Year: 2023
Offline Reinforcement Learning with Soft Behavior Regularization
H Xu, X Zhan, J Li, H Yin
NeurIPS 2021 Offline Reinforcement Learning Workshop, 2021
Cited by: 23; Year: 2021
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
J Li*, X Hu*, H Xu, J Liu, X Zhan, QS Jia, YQ Zhang
ICLR 2023, 2023
Cited by: 15; Year: 2023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
J Li, X Hu, H Xu, J Liu, X Zhan, YQ Zhang
arXiv preprint arXiv:2305.15669, 2023
Cited by: 8; Year: 2023
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
Y Zheng*, J Li*, D Yu, Y Yang, SE Li, X Zhan, J Liu
ICLR 2024, 2024
Cited by: 3; Year: 2024
Query-Policy Misalignment in Preference-Based Reinforcement Learning
X Hu*, J Li*, X Zhan, QS Jia, YQ Zhang
ICLR 2024, 2023
Cited by: 3; Year: 2023
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
J Li, J Zheng, Y Zheng, L Mao, X Hu, S Cheng, H Niu, J Liu, Y Liu, J Liu, ...
arXiv preprint arXiv:2402.18137, 2024
Year: 2024
A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning
J Li*, S Lin*, T Shi, C Tian, Y Mei, J Song, X Zhan, R Li
arXiv preprint arXiv:2311.15920, 2023
Year: 2023
Vehicle Extreme Control based on Offline Reinforcement Leaning
S Zhao, J Li, X Hu, J Zhang, C He
CAC 2022, 4539-4543, 2022
Year: 2022
Articles 1–11