A survey on explainable reinforcement learning: Concepts, algorithms, challenges Y Qing, S Liu, J Song, H Wang, M Song arXiv preprint arXiv:2211.06665, 2022 | 16 | 2022 |
Is centralized training with decentralized execution framework centralized enough for marl? Y Zhou, S Liu, Y Qing, K Chen, T Zheng, Y Huang, J Song, M Song arXiv preprint arXiv:2305.17352, 2023 | 8 | 2023 |
Curricular Subgoals for Inverse Reinforcement Learning S Liu, Y Qing, S Xu, H Wu, J Zhang, J Cong, T Chen, Y Liu, M Song arXiv preprint arXiv:2306.08232, 2023 | 1 | 2023 |
Advantage-Aware Policy Optimization for Offline Reinforcement Learning Y Qing, J Cong, K Chen, Y Zhou, M Song arXiv preprint arXiv:2403.07262, 2024 | | 2024 |
Powerformer: A Section-adaptive Transformer for Power Flow Adjustment K Chen, W Luo, S Liu, Y Wei, Y Zhou, Y Qing, Q Zhang, J Song, M Song arXiv preprint arXiv:2401.02771, 2024 | | 2024 |