Learning attentional communication for multi-agent cooperation J Jiang, Z Lu Advances in neural information processing systems 31, 2018 | 504 | 2018 |
Graph convolutional reinforcement learning J Jiang, C Dun, T Huang, Z Lu arXiv preprint arXiv:1810.09202, 2018 | 404 | 2018 |
Learning fairness in multi-agent systems J Jiang, Z Lu Advances in Neural Information Processing Systems 32, 2019 | 60 | 2019 |
Towards human-level bimanual dexterous manipulation with reinforcement learning Y Chen, T Wu, S Wang, X Feng, J Jiang, Z Lu, S McAleer, H Dong, ... Advances in Neural Information Processing Systems 35, 5150-5163, 2022 | 57 | 2022 |
The emergence of individuality J Jiang, Z Lu International Conference on Machine Learning, 4992-5001, 2021 | 37* | 2021 |
Offline decentralized multi-agent reinforcement learning J Jiang, Z Lu arXiv preprint arXiv:2108.01832, 2021 | 32 | 2021 |
Model-based opponent modeling X Yu, J Jiang, W Zhang, H Jiang, Z Lu Advances in Neural Information Processing Systems 35, 28208-28221, 2022 | 19 | 2022 |
I2q: A fully decentralized q-learning algorithm J Jiang, Z Lu Advances in Neural Information Processing Systems 35, 20469-20481, 2022 | 13 | 2022 |
MA2QL: A minimalist approach to fully decentralized multi-agent reinforcement learning K Su, S Zhou, J Jiang, C Gan, X Wang, Z Lu arXiv preprint arXiv:2209.08244, 2022 | 7 | 2022 |
Generative exploration and exploitation J Jiang, Z Lu Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4337-4344, 2020 | 6 | 2020 |
Towards general computer control: A multimodal agent for red dead redemption ii as a case study W Tan, Z Ding, W Zhang, B Li, B Zhou, J Yue, H Xia, J Jiang, L Zheng, ... arXiv preprint arXiv:2403.03186, 2024 | 4 | 2024 |
Online tuning for offline decentralized multi-agent reinforcement learning J Jiang, Z Lu Proceedings of the AAAI Conference on Artificial Intelligence 37 (7), 8050-8059, 2023 | 4 | 2023 |
A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges X Xu, Y Wang, C Xu, Z Ding, J Jiang, Z Ding, BF Karlsson arXiv preprint arXiv:2403.10249, 2024 | 1 | 2024 |
Learning from Visual Observation via Offline Pretrained State-to-Go Transformer B Zhou, K Li, J Jiang, Z Lu Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |
Best possible q-learning J Jiang, Z Lu arXiv preprint arXiv:2302.01188, 2023 | 1 | 2023 |
Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey J Jiang, K Su, Z Lu arXiv preprint arXiv:2401.04934, 2024 | | 2024 |
Opponent Modeling based on Sub-Goal Inference XP Yu, J Jiang, Z Lu | | 2023 |
Model-Based Decentralized Policy Optimization H Luo, J Jiang, Z Lu arXiv preprint arXiv:2302.08139, 2023 | | 2023 |
Adaptive Learning Rates for Multi-Agent Reinforcement Learning J Jiang, Z Lu | | 2020 |