Rmix: Learning risk-sensitive policies for cooperative reinforcement learning agents W Qiu, X Wang, R Yu, R Wang, X He, B An, S Obraztsova, Z Rabinovich Advances in Neural Information Processing Systems 34, 23049-23062, 2021 | 45 | 2021 |
Learning to collaborate in multi-module recommendation via multi-agent reinforcement learning without communication X He, B An, Y Li, H Chen, R Wang, X Wang, R Yu, X Li, Z Wang Proceedings of the 14th ACM Conference on Recommender Systems, 210-219, 2020 | 31 | 2020 |
Catching Captain Jack: Efficient time and space dependent patrols to combat oil-siphoning in international waters X Wang, B An, M Strobel, F Kong Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 16 | 2018 |
Learning expensive coordination: An event-based deep RL approach Z Shi, R Yu, X Wang, R Wang, Y Zhang, H Lai, B An International Conference on Learning Representations, 2019 | 14 | 2019 |
Solving large-scale extensive-form network security games via neural fictitious self-play W Xue, Y Zhang, S Li, X Wang, B An, CK Yeo arXiv preprint arXiv:2106.00897, 2021 | 13 | 2021 |
CFR-MIX: Solving imperfect information extensive-form games with combinatorial action space S Li, Y Zhang, X Wang, W Xue, B An arXiv preprint arXiv:2105.08440, 2021 | 8 | 2021 |
Enhancing meta learning via multi-objective soft improvement functions R Yu, W Chen, X Wang, J Kwok The Eleventh International Conference on Learning Representations, 2022 | 7 | 2022 |
DO-GAN: A Double Oracle Framework for Generative Adversarial Networks APP Aung, X Wang, R Yu, B An, S Jayavelu, X Li Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 7 | 2022 |
Synapse: Trajectory-as-exemplar prompting with memory for computer control L Zheng, R Wang, X Wang, B An The Twelfth International Conference on Learning Representations, 2023 | 6 | 2023 |
Mastering stock markets with efficient mixture of diversified trading experts S Sun, X Wang, W Xue, X Lou, B An Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 6 | 2023 |
PRUDEX-compass: Towards systematic evaluation of reinforcement learning in financial markets S Sun, M Qin, X Wang, B An arXiv preprint arXiv:2302.00586, 2023 | 6 | 2023 |
Neural Regret-Matching for Distributed Constraint Optimization Problems. Y Deng, R Yu, X Wang, B An IJCAI, 146-153, 2021 | 6 | 2021 |
Choosing protection: User investments in security measures for cyber risk management YB Yaakov, X Wang, J Meyer, B An Decision and Game Theory for Security: 10th International Conference …, 2019 | 6 | 2019 |
Solving large-scale pursuit-evasion games using pre-trained strategies S Li, X Wang, Y Zhang, W Xue, J Černý, B An Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11586 …, 2023 | 5 | 2023 |
Stop nuclear smuggling through efficient container inspection X Wang, Q Guo, B An Proceedings of the 16th Conference on Autonomous Agents and MultiAgent …, 2017 | 5 | 2017 |
Towards general computer control: A multimodal agent for red dead redemption ii as a case study W Tan, Z Ding, W Zhang, B Li, B Zhou, J Yue, H Xia, J Jiang, L Zheng, ... arXiv preprint arXiv:2403.03186, 2024 | 4 | 2024 |
Offline equilibrium finding S Li, X Wang, Y Zhang, J Cerny, P Li, H Chan, B An arXiv preprint arXiv:2207.05285, 2022 | 4 | 2022 |
A unified perspective on deep equilibrium finding X Wang, J Cerny, S Li, C Yang, Z Yin, H Chan, B An arXiv preprint arXiv:2204.04930, 2022 | 4 | 2022 |
Inducing cooperation via team regret minimization based multi-agent deep reinforcement learning R Yu, Z Shi, X Wang, R Wang, B Liu, X Hou, H Lai, B An arXiv preprint arXiv:1911.07712, 2019 | 4 | 2019 |
Who Should Pay the Cost: A Game-theoretic Model for Government Subsidized Investments to Improve National Cybersecurity. X Wang, B An, H Chan IJCAI, 6020-6027, 2019 | 4 | 2019 |