Wd3: Taming the estimation bias in deep reinforcement learning Q He, X Hou arXiv preprint arXiv:2006.12622, 2020 | 30* | 2020 |
Mepg: A minimalist ensemble policy gradient framework for deep reinforcement learning Q He, C Gong, Y Qu, X Chen, X Hou, Y Liu ICML'23, DA in RL Workshop, 39th International Conference on Machine …, 2021 | 13 | 2021 |
Popo: Pessimistic offline policy optimization Q He, X Hou, Y Liu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 12 | 2022 |
Wide-sense stationary policy optimization with bellman residual on video games C Gong, Q He, Y Bai, X Hou, G Fan, Y Liu ICME'2021, 2021 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2021 | 9 | 2021 |
Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning Q He, H Su, J Zhang, X Hou CVPR'2023, Proceedings of the IEEE/CVF Conference on Computer Vision and …, 2023 | 8* | 2023 |
Eigensubspace of temporal-difference dynamics and how it improves value approximation in reinforcement learning Q He, T Zhou, M Fang, S Maghsudi ECML/PKDD'2023, Joint European Conference on Machine Learning and Knowledge …, 2023 | 2 | 2023 |
Centralized Cooperative Exploration Policy for Continuous Control Tasks C Li, C Gong, Q He, X Hou, Y Liu AAMAS'2023, The 22nd International Conference on Autonomous Agents and …, 2023 | 1 | 2023 |
The f-Divergence Reinforcement Learning Framework C Gong*, Q He*, Y Bai*, X Chen, X Hou, Y Liu, G Fan arXiv preprint arXiv:2109.11867, 2021 | 1 | 2021 |
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control C Li, C Gong, Q He, X Hou NeurIPS'2023, Thirty-seventh Conference on Neural Information Processing Systems, 2023 | | 2023 |
Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning Y Yang, T Zhou, Q He, L Han, M Pechenizkiy, M Fang ICLR'2024 Spotlight; The Twelfth International Conference on Learning …, 2023 | | 2023 |
Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation Q He, T Zhou, M Fang, S Maghsudi ICLR'2024; The Twelfth International Conference on Learning Representations, 2023 | | 2023 |