Actor-critic with familiarity-based trajectory experience replay X Gong, J Yu, S Lü, H Lu Information Sciences 582, 633-647, 2022 | 12 | 2022 |
Evolutionary generative adversarial networks with crossover based knowledge distillation J Li, J Zhang, X Gong, S Lü 2021 International Joint Conference on Neural Networks (IJCNN), 1-8, 2021 | 9 | 2021 |
Entropy regularization methods for parameter space exploration S Han, W Zhou, S Lü, S Zhu, X Gong Information Sciences 622, 476-489, 2023 | 2 | 2023 |
Adaptive estimation Q-learning with uncertainty and familiarity X Gong, S Lü, J Yu, S Zhu, Z Li Proceedings of the Thirty-Second International Joint Conference on …, 2023 | 1 | 2023 |
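Survey on Deep Reinforcement Learning Methods Based on Sample Efficiency Optimization (in Chinese) 张峻伟, 吕帅, 张正昊, 于佳玉, 龚晓宇 软件学报 (Journal of Software) 33 (11), 4217-4238, 2021 | 1 | 2021 |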
Guided deterministic policy optimization with gradient-free policy parameters information C Shen, S Zhu, S Han, X Gong, S Lü Expert Systems with Applications 231, 120693, 2023 | | 2023 |
Survey on Deep Reinforcement Learning Methods Based on Sample Efficiency Optimization 张峻伟, 吕帅, 张正昊, 于佳玉, 龚晓宇 Journal of Software 33 (11), 4217-4238, 2021 | | 2021 |