Bin Zhang
Title
Cited by
Year
TPTU: Task planning and tool usage of large language model-based AI agents
J Ruan, Y Chen, B Zhang, Z Xu, T Bao, G Du, S Shi, H Mao, X Zeng, ...
arXiv preprint arXiv:2308.03427, 2023
56 2023
TPTU-v2: Boosting task planning and tool usage of large language model-based agents in real-world systems
Y Kong, J Ruan, Y Chen, B Zhang, T Bao, S Shi, G Du, X Hu, H Mao, Z Li, ...
arXiv preprint arXiv:2311.11315, 2023
10 2023
HAVEN: hierarchical cooperative multi-agent reinforcement learning with dual coordination mechanism
Z Xu, Y Bai, B Zhang, D Li, G Fan
Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11735 …, 2023
10 2023
Cooperative multi-agent reinforcement learning with hypergraph convolution
Y Bai, C Gong, B Zhang, G Fan, X Hou, Y Lu
2022 International Joint Conference on Neural Networks (IJCNN), 1-8, 2022
10* 2022
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach
B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ...
LLM Agents Workshop@ICLR2024, 2023
9 2023
Efficient Policy Generation in Multi-agent Systems via Hypergraph Neural Network
B Zhang, Y Bai, Z Xu, D Li, G Fan
International Conference on Neural Information Processing (ICONIP), 219-230, 2022
9* 2022
Learning to coordinate via multiple graph neural networks
Z Xu, B Zhang, Y Bai, D Li, G Fan
Neural Information Processing: 28th International Conference, ICONIP 2021 …, 2021
9 2021
Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning
B Zhang, L Li, Z Xu, D Li, G Fan
IJCAI 2023, 2023
8 2023
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning
Z Xu, Y Bai, D Li, B Zhang, G Fan
The 19nd International Conference on Autonomous Agents and Multiagent …, 2021
6 2021
PTDE: Personalized training with distillated execution for multi-agent reinforcement learning
Y Chen, H Mao, T Zhang, S Wu, B Zhang, J Hao, D Li, B Wang, H Chang
arXiv preprint arXiv:2210.08872, 2022
5 2022
Consensus learning for cooperative multi-agent reinforcement learning
Z Xu, B Zhang, D Li, Z Zhang, G Zhou, H Chen, G Fan
Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11726 …, 2023
4 2023
Stackelberg decision transformer for asynchronous action coordination in multi-agent systems
B Zhang, H Mao, L Li, Z Xu, D Li, R Zhao, G Fan
arXiv preprint arXiv:2305.07856, 2023
4 2023
From explicit communication to tacit cooperation: A novel paradigm for cooperative MARL
D Li, Z Xu, B Zhang, G Fan
arXiv preprint arXiv:2304.14656, 2023
4 2023
Dual self-awareness value decomposition framework without individual global max for cooperative MARL
Z Xu, B Zhang, G Zhou, Z Zhang, G Fan
Advances in Neural Information Processing Systems 36, 2024
3* 2024
Mingling foresight with imagination: Model-based cooperative multi-agent reinforcement learning
Z Xu, B Zhang, Y Zhan, Y Bai, G Fan
Advances in Neural Information Processing Systems 35, 11327-11340, 2022
3 2022
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning
H Mao, R Zhao, Z Li, Z Xu, H Chen, Y Chen, B Zhang, Z Xiao, J Zhang, ...
arXiv preprint arXiv:2312.15863, 2023
2 2023
SEA: A spatially explicit architecture for multi-agent reinforcement learning
D Li, Z Xu, B Zhang, G Fan
2023 International Joint Conference on Neural Networks (IJCNN), 1-8, 2023
2 2023
Adaptive parameter sharing for multi-agent reinforcement learning
D Li, N Lou, B Zhang, Z Xu, G Fan
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
1 2024
Benchmarking the text-to-SQL capability of large language models: A comprehensive evaluation
B Zhang, Y Ye, G Du, X Hu, Z Li, S Yang, CH Liu, R Zhao, Z Li, H Mao
arXiv preprint arXiv:2403.02951, 2024
1 2024
Multi-agent hyper-attention policy optimization
B Zhang, Z Xu, Y Chen, D Li, Y Bai, G Fan, L Li
International Conference on Neural Information Processing, 76-87, 2022
1 2022