Follow
Hangyu Mao(毛航宇)
Hangyu Mao(毛航宇)
SenseTime SCG/Research
Verified email at pku.edu.cn - Homepage
Title
Cited by
Cited by
Year
Modelling the dynamic joint policy of teammates with attention multi-agent DDPG
H Mao, Z Zhang, Z Xiao, Z Gong
Proceedings of the 18th International Conference on Autonomous Agents and …, 2019
1022019
Learning agent communication under limited bandwidth by message pruning
H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5142-5149, 2020
742020
Neighborhood cognition consistent multi-agent reinforcement learning
H Mao, W Liu, J Hao, J Luo, D Li, Z Zhang, J Wang, Z Xiao
Proceedings of the AAAI conference on artificial intelligence 34 (05), 7219-7226, 2020
662020
Tptu: Task planning and tool usage of large language model-based ai agents
J Ruan, Y Chen, B Zhang, Z Xu, T Bao, G Du, S Shi, H Mao, X Zeng, ...
arXiv preprint arXiv:2308.03427, 2023
552023
ACCNet: Actor-coordinator-critic net for" Learning-to-communicate" with deep multi-agent reinforcement learning
H Mao, Z Gong, Y Ni, Z Xiao
arXiv preprint arXiv:1706.03235, 2017
462017
Learning multi-agent communication with double attentional deep reinforcement learning
H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni
Autonomous Agents and Multi-Agent Systems 34, 1-34, 2020
402020
Learning multi-agent communication under limited-bandwidth restriction for internet packet routing
H Mao, Z Gong, Z Zhang, Z Xiao, Y Ni
arXiv preprint arXiv:1903.05561, 2019
282019
Reward design in cooperative multi-agent reinforcement learning for packet routing
H Mao, Z Gong, Z Xiao
arXiv preprint arXiv:2003.03433, 2020
222020
Seihai: A sample-efficient hierarchical ai for the minerl competition
H Mao, C Wang, X Hao, Y Mao, Y Lu, C Wu, J Hao, D Li, P Tang
Distributed Artificial Intelligence: Third International Conference, DAI …, 2022
202022
An efficient transfer learning framework for multiagent reinforcement learning
T Yang, W Wang, H Tang, J Hao, Z Meng, H Mao, D Li, W Liu, Y Chen, ...
Advances in Neural Information Processing Systems 34, 17037-17048, 2021
202021
Cooperative multi-agent transfer learning with level-adaptive credit assignment
T Zhou, F Zhang, K Shao, K Li, W Huang, J Luo, W Wang, Y Yang, H Mao, ...
arXiv preprint arXiv:2106.00517, 2021
192021
What about inputting policy in value function: Policy representation and policy-extended value function approximator
H Tang, Z Meng, J Hao, C Chen, D Graves, D Li, C Yu, H Mao, W Liu, ...
Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8441-8449, 2022
172022
Structural relational inference actor-critic for multi-agent reinforcement learning
X Zhang, Y Liu, X Xu, Q Huang, H Mao, A Carie
Neurocomputing 459, 383-394, 2021
162021
Tptu-v2: Boosting task planning and tool usage of large language model-based agents in real-world systems
Y Kong, J Ruan, Y Chen, B Zhang, T Bao, S Shi, G Du, X Hu, H Mao, Z Li, ...
arXiv preprint arXiv:2311.11315, 2023
102023
Towards robust and domain agnostic reinforcement learning competitions: MineRL 2020
WH Guss, S Milani, N Topin, B Houghton, S Mohanty, A Melnik, A Harter, ...
NeurIPS 2020 Competition and Demonstration Track, 233-252, 2021
102021
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach
B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ...
arXiv preprint arXiv:2311.13884, 2023
82023
Boosting multiagent reinforcement learning via permutation invariant and permutation equivariant networks
HAO Jianye, X Hao, H Mao, W Wang, Y Yang, D Li, Y Zheng, Z Wang
The Eleventh International Conference on Learning Representations, 2022
82022
API: Boosting multi-agent reinforcement learning via agent-permutation-invariant networks
X Hao, W Wang, H Mao, Y Yang, D Li, Y Zheng, Z Wang, J Hao
arXiv preprint arXiv:2203.05285, 2022
82022
Transformer in transformer as backbone for deep reinforcement learning
H Mao, R Zhao, H Chen, J Hao, Y Chen, D Li, J Zhang, Z Xiao
arXiv preprint arXiv:2212.14538, 2022
72022
Multiagent q-learning with sub-team coordination
W Huang, K Li, K Shao, T Zhou, M Taylor, J Luo, D Wang, H Mao, J Hao, ...
Advances in Neural Information Processing Systems 35, 29427-29439, 2022
72022
The system can't perform the operation now. Try again later.
Articles 1–20