Follow
Qiang He
Title
Cited by
Cited by
Year
Wd3: Taming the estimation bias in deep reinforcement learning
Q He, X Hou
arXiv preprint arXiv:2006.12622, 2020
30*2020
Mepg: A minimalist ensemble policy gradient framework for deep reinforcement learning
Q He, C Gong, Y Qu, X Chen, X Hou, Y Liu
ICML'23, DA in RL Workshop, 39th International Conference on Machine …, 2021
132021
Popo: Pessimistic offline policy optimization
Q He, X Hou, Y Liu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
122022
Wide-sense stationary policy optimization with bellman residual on video games
C Gong, Q He, Y Bai, X Hou, G Fan, Y Liu
ICME'2021, 2021 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2021
92021
Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning
Q He, H Su, J Zhang, X Hou
CVPR'2023, Proceedings of the IEEE/CVF Conference on Computer Vision and …, 2023
8*2023
Eigensubspace of temporal-difference dynamics and how it improves value approximation in reinforcement learning
Q He, T Zhou, M Fang, S Maghsudi
ECML/PKDD'2023, Joint European Conference on Machine Learning and Knowledge …, 2023
22023
Centralized Cooperative Exploration Policy for Continuous Control Tasks
C Li, C Gong, Q He, X Hou, Y Liu
AAMAS'2023, The 22nd International Conference on Autonomous Agents and …, 2023
12023
The f-Divergence Reinforcement Learning Framework
C Gong*, Q He*, Y Bai*, X Chen, X Hou, Y Liu, G Fan
arXiv preprint arXiv:2109.11867, 2021
12021
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
C Li, C Gong, Q He, X Hou
NeurIPS'2023, Thirty-seventh Conference on Neural Information Processing Systems, 2023
2023
Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning
Y Yang, T Zhou, Q He, L Han, M Pechenizkiy, M Fang
ICLR'2024 Spotlight; The Twelfth International Conference on Learning …, 2023
2023
Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
Q He, T Zhou, M Fang, S Maghsudi
ICLR'2024; The Twelfth International Conference on Learning Representations, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–11