Drm: Mastering visual reinforcement learning through dormant ratio minimization G Xu, R Zheng, Y Liang, X Wang, Z Yuan, T Ji, Y Luo, X Liu, J Yuan, ... arXiv preprint arXiv:2310.19668, 2023 | 9 | 2023 |
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization T Ji, Y Liang, Y Zeng, Y Luo, G Xu, J Guo, R Zheng, F Huang, F Sun, H Xu arXiv preprint arXiv:2402.14528, 2024 | | 2024 |