EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model Y Yuan, J Hao, F Ni, Y Mu, Y Zheng, Y Hu, J Liu, Y Chen, C Fan The 11th International Conference on Learning Representations (ICLR), 2022 | 10 | 2022 |
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL F Ni, J Hao, Y Mu, Y Yuan, Y Zheng, B Wang, Z Liang The 40th International Conference on Machine Learning (ICML), 2023 | 9 | 2023 |
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Z Dong, Y Yuan, J Hao, F Ni, Y Mu, Y Zheng, Y Hu, T Lv, C Fan, Z Hu The 12th International Conference on Learning Representations (ICLR), 2023 | 3 | 2023 |
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models Y Chen, Y Yuan, Z Zhang, Y Zheng, J Liu, F Ni, J Hao arXiv preprint arXiv:2403.03636, 2024 | | 2024 |
MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint X Zhou, Y Yuan, S Yang, J Hao arXiv preprint arXiv:2402.14244, 2024 | | 2024 |
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models J Liu, Y Yuan, J Hao, F Ni, L Fu, Y Chen, Y Zheng arXiv preprint arXiv:2402.14245, 2024 | | 2024 |
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback Y Yuan, J Hao, Y Ma, Z Dong, H Liang, J Liu, Z Feng, K Zhao, Y Zheng The 12th International Conference on Learning Representations (ICLR), 2024 | | 2024 |
DiffuserLite: Towards Real-time Diffusion Planning Z Dong, J Hao, Y Yuan, F Ni, Y Wang, P Li, Y Zheng arXiv preprint arXiv:2401.15443, 2024 | | 2024 |
ED2: Environment Dynamics Decomposition World Models for Continuous Control J Hao, Y Yuan, C Wang, Z Wang arXiv preprint arXiv:2112.02817, 2023 | | 2023 |