Rethinking the implementation tricks and monotonicity constraint in cooperative multi-agent reinforcement learning J Hu, S Wang, S Jiang, W Wang ICLR Blog Track 2023, 2023 | 80* | 2023 |
Noise-Regularized Advantage Value for Multi-Agent Reinforcement Learning S Wang, W Chen, J Hu*, S Hu, L Huang Mathematics, 2022, 2022 | 15* | 2022 |
Aligning language models with offline learning from human feedback J Hu, L Tao, J Yang, C Zhou arXiv preprint arXiv:2308.12050, 2023 | 12 | 2023 |
QR-MIX: Distributional value function factorisation for cooperative multi-agent reinforcement learning J Hu, SA Harding, H Wu, S Hu, S Liao arXiv preprint arXiv:2009.04197, 2020 | 10 | 2020 |
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework J Hu, X Wu, W Wang, D Zhang, Y Cao arXiv preprint arXiv:2405.11143, 2024 | 2 | 2024 |