Hangyu Mao（毛航宇）

Cited by

	All	Since 2019
Citations	743	733
h-index	14	14
i10-index	17	17

Citations per year
Year	2017	2018	2019	2020	2021	2022	2023	2024
Citations	2	4	9	48	73	146	235	222

Public access (based on funding mandates)
13 articles available
1 article not available

Hangyu Mao（毛航宇）

Peking University

Verified email at pku.edu.cn

AI Agent, Reinforcement Learning, Multi-Agent Reinforcement Learning, Large Language Model


Title	Authors	Venue	Cited by	Year
Modelling the dynamic joint policy of teammates with attention multi-agent DDPG	H Mao, Z Zhang, Z Xiao, Z Gong	Proceedings of the 18th International Conference on Autonomous Agents and …, 2019	107	2019
Learning agent communication under limited bandwidth by message pruning	H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni	Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5142-5149, 2020	79	2020
Neighborhood cognition consistent multi-agent reinforcement learning	H Mao, W Liu, J Hao, J Luo, D Li, Z Zhang, J Wang, Z Xiao	Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7219-7226, 2020	70	2020
TPTU: Task planning and tool usage of large language model-based AI agents	J Ruan, Y Chen, B Zhang, Z Xu, T Bao, G Du, S Shi, H Mao, X Zeng, ...	arXiv preprint arXiv:2308.03427, 2023	68	2023
ACCNet: Actor-coordinator-critic net for "learning-to-communicate" with deep multi-agent reinforcement learning	H Mao, Z Gong, Y Ni, Z Xiao	arXiv preprint arXiv:1706.03235, 2017	48	2017
Learning multi-agent communication with double attentional deep reinforcement learning	H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni	Autonomous Agents and Multi-Agent Systems 34, 1-34, 2020	42	2020
Learning multi-agent communication under limited-bandwidth restriction for internet packet routing	H Mao, Z Gong, Z Zhang, Z Xiao, Y Ni	arXiv preprint arXiv:1903.05561, 2019	29	2019
Reward design in cooperative multi-agent reinforcement learning for packet routing	H Mao, Z Gong, Z Xiao	arXiv preprint arXiv:2003.03433, 2020	25	2020
An efficient transfer learning framework for multiagent reinforcement learning	T Yang, W Wang, H Tang, J Hao, Z Meng, H Mao, D Li, W Liu, Y Chen, ...	Advances in Neural Information Processing Systems 34, 17037-17048, 2021	24	2021
SEIHAI: A sample-efficient hierarchical AI for the MineRL competition	H Mao, C Wang, X Hao, Y Mao, Y Lu, C Wu, J Hao, D Li, P Tang	Distributed Artificial Intelligence: Third International Conference, DAI …, 2022	21	2022
What about inputting policy in value function: Policy representation and policy-extended value function approximator	H Tang, Z Meng, J Hao, C Chen, D Graves, D Li, C Yu, H Mao, W Liu, ...	Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8441-8449, 2022	20	2022
Cooperative multi-agent transfer learning with level-adaptive credit assignment	T Zhou, F Zhang, K Shao, K Li, W Huang, J Luo, W Wang, Y Yang, H Mao, ...	arXiv preprint arXiv:2106.00517, 2021	20	2021
Structural relational inference actor-critic for multi-agent reinforcement learning	X Zhang, Y Liu, X Xu, Q Huang, H Mao, A Carie	Neurocomputing 459, 383-394, 2021	19	2021
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach	B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ...	arXiv preprint arXiv:2311.13884, 2023	14	2023
Boosting multiagent reinforcement learning via permutation invariant and permutation equivariant networks	J Hao, X Hao, H Mao, W Wang, Y Yang, D Li, Y Zheng, Z Wang	The Eleventh International Conference on Learning Representations, 2022	13	2022
TPTU-v2: Boosting task planning and tool usage of large language model-based agents in real-world systems	Y Kong, J Ruan, Y Chen, B Zhang, T Bao, S Shi, G Du, X Hu, H Mao, Z Li, ...	arXiv preprint arXiv:2311.11315, 2023	10	2023
Towards robust and domain agnostic reinforcement learning competitions: MineRL 2020	WH Guss, S Milani, N Topin, B Houghton, S Mohanty, A Melnik, A Harter, ...	NeurIPS 2020 Competition and Demonstration Track, 233-252, 2021	10	2021
Transformer in transformer as backbone for deep reinforcement learning	H Mao, R Zhao, H Chen, J Hao, Y Chen, D Li, J Zhang, Z Xiao	arXiv preprint arXiv:2212.14538, 2022	9	2022
API: Boosting multi-agent reinforcement learning via agent-permutation-invariant networks	X Hao, W Wang, H Mao, Y Yang, D Li, Y Zheng, Z Wang, J Hao	arXiv preprint arXiv:2203.05285, 2022	8	2022
Benchmarking the text-to-SQL capability of large language models: A comprehensive evaluation	B Zhang, Y Ye, G Du, X Hu, Z Li, S Yang, CH Liu, R Zhao, Z Li, H Mao	arXiv preprint arXiv:2403.02951, 2024	7	2024
