Distributional reward estimation for effective multi-agent deep reinforcement learning J Hu, Y Sun, H Chen, S Huang, Y Chang, L Sun Advances in Neural Information Processing Systems 35, 12619-12632, 2022 | 5 | 2022 |
A simple unified uncertainty-guided framework for offline-to-online reinforcement learning S Guo, Y Sun, J Hu, S Huang, H Chen, H Piao, L Sun, Y Chang arXiv preprint arXiv:2306.07541, 2023 | 4 | 2023 |
Instructed diffuser with temporal condition guidance for offline reinforcement learning J Hu, Y Sun, S Huang, SY Guo, H Chen, L Shen, L Sun, Y Chang, D Tao arXiv preprint arXiv:2306.04875, 2023 | 4 | 2023 |
MA-TREX: Mutli-agent Trajectory-Ranked Reward Extrapolation via Inverse Reinforcement Learning S Huang, B Yang, H Chen, H Piao, Z Sun, Y Chang International Conference on Knowledge Science, Engineering and Management, 3-14, 2020 | 3 | 2020 |
Learning Generalizable Agents via Saliency-Guided Features Decorrelation S Huang, Y Sun, J Hu, S Guo, H Chen, Y Chang, L Sun, B Yang Advances in Neural Information Processing Systems 36, 2024 | | 2024 |
Causal RL Agents for Out-of-distribution Generalization S Huang, B Yang, H Chen, P Cui, J Hu, L Sun | | 2022 |