Shiyu Huang
Other names: 黄 世宇
Researcher at Zhipu AI; Tsinghua University
Verified email at aminer.cn
Title
Cited by
Year
Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters
S Huang, D Ramanan
Cited by: 62, Year: 2017
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
B Yuchen Lin, Y Fu, K Yang, P Ammanabrolu, F Brahman, S Huang, ...
arXiv e-prints, arXiv: 2305.17390, 2023
Cited by: 25*, Year: 2023
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations
S Huang, W Chen, L Zhang, Z Li, F Zhu, D Ye, T Chen, J Zhu
arXiv preprint arXiv:2110.04507, 2021
Cited by: 23, Year: 2021
Deep reinforcement learning with credit assignment for combinatorial optimization
D Yan, J Weng, S Huang, C Li, Y Zhou, H Su, J Zhu
Pattern Recognition 124, 108466, 2022
Cited by: 19, Year: 2022
Combo-action: Training agent for fps game with auxiliary tasks
S Huang, H Su, J Zhu, T Chen
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 954-961, 2019
Cited by: 19, Year: 2019
SVQN: Sequential variational soft Q-learning networks
S Huang, H Su, J Zhu, T Chen
Cited by: 12, Year: 2020
Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning
Y Lin, Q Zhang, B Gao, J Tang, P Yao, C Li, S Huang, Z Liu, Y Zhou, Y Liu, ...
Nature Machine Intelligence 5 (7), 714-723, 2023
Cited by: 7, Year: 2023
TiZero: Mastering multi-agent football with curriculum learning and self-play
F Lin, S Huang, T Pearce, W Chen, WW Tu
arXiv preprint arXiv:2302.07515, 2023
Cited by: 7, Year: 2023
Learning graph-enhanced commander-executor for multi-agent navigation
X Yang, S Huang, Y Sun, Y Yang, C Yu, WW Tu, H Yang, Y Wang
arXiv preprint arXiv:2302.04094, 2023
Cited by: 4, Year: 2023
DGPO: discovering multiple strategies with diversity-guided policy optimization
W Chen, S Huang, Y Chiang, T Pearce, WW Tu, T Chen, J Zhu
Proceedings of the AAAI Conference on Artificial Intelligence 38 (10), 11390 …, 2024
Cited by: 3, Year: 2024
Robustness and generalizability of deepfake detection: A study with diffusion models
H Song, S Huang, Y Dong, WW Tu
arXiv preprint arXiv:2309.02218, 2023
Cited by: 3, Year: 2023
VMAPD: generate diverse solutions for multi-agent games with recurrent trajectory discriminators
S Huang, C Yu, B Wang, D Li, Y Wang, T Chen, J Zhu
2022 IEEE Conference on Games (CoG), 9-16, 2022
Cited by: 2, Year: 2022
Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization
S Huang, B Wang, D Li, J Hao, T Chen, J Zhu
arXiv preprint arXiv:2110.03939, 2021
Cited by: 2, Year: 2021
Learning to assign credit in reinforcement learning by incorporating abstract relations
D Yan, S Huang, H Su, J Zhu
AAAI Workshop on Reinforcement Learning in Games, 2019
Cited by: 2, Year: 2019
LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments
J Chen, X Hu, S Liu, S Huang, WW Tu, Z He, L Wen
arXiv preprint arXiv:2402.16499, 2024
Cited by: 1, Year: 2024
OpenRL: A Unified Reinforcement Learning Framework
S Huang, W Chen, Y Sun, F Bie, WW Tu
arXiv preprint arXiv:2312.16189, 2023
Cited by: 1, Year: 2023
MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment
Z Xiong, B Chen, S Huang, WW Tu, Z He, Y Gao
arXiv preprint arXiv:2403.16015, 2024
2024
AutoSAT: Automatically Optimize SAT Solvers via Large Language Models
Y Sun, X Zhang, S Huang, S Cai, BZ Zhang, K Wei
arXiv preprint arXiv:2402.10705, 2024
2024
Diverse Policies Converge in Reward-Free Markov Decision Processes
F Lin, S Huang, WW Tu
Pacific Rim International Conference on Artificial Intelligence, 125-136, 2023
2023
Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic
S Huang, B Wang, H Su, D Li, J Hao, J Zhu, T Chen
Pacific Rim International Conference on Artificial Intelligence, 46-59, 2021
2021