Exploration in deep reinforcement learning: From single-agent to multiagent domain J Hao, T Yang, H Tang, C Bai, J Liu, Z Meng, P Liu, Z Wang IEEE Transactions on Neural Networks and Learning Systems, 2023 | 155* | 2023 |
Euclid: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model Y Yuan, J Hao, F Ni, Y Mu, Y Zheng, Y Hu, J Liu, Y Chen, C Fan arXiv preprint arXiv:2210.00498, 2022 | 10 | 2022 |
FIGCPS: Effective failure-inducing input generation for cyber-physical systems with deep reinforcement learning S Zhang, S Liu, J Sun, Y Chen, W Huang, J Liu, J Liu, J Hao 2021 36th IEEE/ACM International Conference on Automated Software …, 2021 | 9 | 2021 |
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles K Zhao, Y Ma, J Liu, J Hao, Y Zheng, Z Meng ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, 2023 | 5* | 2023 |
ED2: an environment dynamics decomposition framework for world model construction C Wang, T Yang, J Hao, Y Zheng, H Tang, F Barez, J Liu, J Peng, H Piao, ... arXiv preprint arXiv:2112.02817, 2021 | 1 | 2021 |
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments J Liu, Z Wang, Y Zheng, J Hao, C Bai, J Ye, Z Wang, H Piao, Y Sun Proceedings of the AAAI Conference on Artificial Intelligence 38 (12), 13954 …, 2024 | | 2024 |
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models Y Chen, Y Yuan, Z Zhang, Y Zheng, J Liu, F Ni, J Hao arXiv preprint arXiv:2403.03636, 2024 | | 2024 |
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models J Liu, Y Yuan, J Hao, F Ni, L Fu, Y Chen, Y Zheng arXiv preprint arXiv:2402.14245, 2024 | | 2024 |
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback Y Yuan, J Hao, Y Ma, Z Dong, H Liang, J Liu, Z Feng, K Zhao, Y Zheng arXiv preprint arXiv:2402.02423, 2024 | | 2024 |
OSCAR: OOD State-Conservative Offline Reinforcement Learning for Sequential Decision Making Y Ma, C Wang, C Chen, J Liu, Z Meng, Y Zheng, J Hao CAAI Artificial Intelligence Research 2, 2023 | | 2023 |
A Policy-Decoupled Method for High-Quality Data Augmentation in Offline Reinforcement Learning S Lian, Y Ma, J Liu, J Hao, Y Zheng, Z Meng ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, 2023 | | 2023 |
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning J Liu, Y Ma, J Hao, Y Hu, Y Zheng, T Lv, C Fan Data-centric Machine Learning Research (DMLR) Workshop at ICML 2023, 2023 | | 2023 |