Loading...
The system can't perform the operation now. Try again later.
Citations per year
Duplicate citations
The following articles are merged in Scholar. Their
combined citations
are counted only for the first article.
Merged citations
This "Cited by" count includes citations to the following articles in Scholar. The ones marked
*
may be different from the article in the profile.
Add co-authors
Co-authors
Follow
New articles by this author
New citations to this author
New articles related to this author's research
Email address for updates
Done
My profile
My library
Metrics
Alerts
Settings
Sign in
Sign in
Get my own profile
Cited by
All
Since 2019
Citations
35
34
h-index
2
2
i10-index
1
1
0
18
9
2023
2024
16
18
Co-authors
Yali Du
Turing Fellow, Assistant professor, King's College London
Verified email at kcl.ac.uk
Fengshuo Bai
Shanghai Jiao Tong University
Verified email at sjtu.edu.cn
Yaodong Yang
BOYA (博雅) Assistant Professor at Peking University
Verified email at pku.edu.cn
Jiafei Lyu
PhD of Control Science and Engineering, Tsinghua Shenzhen International Graduate School
Verified email at mails.tsinghua.edu.cn
Xiu LI
Tsinghua University
Verified email at sz.tsinghua.edu.cn
Chenjia Bai
Shanghai AI Laboratory
Verified email at pjlab.org.cn
Follow
Runze Liu
Tsinghua University
Verified email at mails.tsinghua.edu.cn
Reinforcement Learning
RLHF
AI Alignment
Large Language Model
Articles
Cited by
Co-authors
Title
Sort
Sort by citations
Sort by year
Sort by title
Cited by
Cited by
Year
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
R Liu, F Bai, Y Du, Y Yang
NeurIPS 2022
, 2022
32
2022
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation
R Liu, Y Du, F Bai, J Lyu, X Li
ICML 2024; OTML Workshop@NeurIPS 2023
, 2023
3
*
2023
SEABO: A Simple Search-Based Method for Offline Imitation Learning
J Lyu, X Ma, L Wan, R Liu, X Li, Z Lu
ICLR 2024
, 2024
2024
BATTLE: Towards Behavior-oriented Adversarial Attacks against Deep Reinforcement Learning
F Bai, R Liu, Y Du, Y Wen, Y Yang
2023
The system can't perform the operation now. Try again later.
Articles 1–4
Show more
Privacy
Terms
Help
About Scholar
Search help