Follow
Qi Yi
Qi Yi
Verified email at mail.ustc.edu.cn
Title
Cited by
Cited by
Year
Causality-driven hierarchical structure discovery for reinforcement learning
X Hu, R Zhang, K Tang, J Guo, Q Yi, R Chen, Z Du, L Li, Q Guo, Y Chen
Advances in Neural Information Processing Systems 35, 20064-20076, 2022
92022
Hindsight value function for variance reduction in stochastic dynamic environment
J Guo, R Zhang, X Zhang, S Peng, Q Yi, Z Du, X Hu, Q Guo, Y Chen
arXiv preprint arXiv:2107.12216, 2021
92021
Object-category aware reinforcement learning
Q Yi, R Zhang, J Guo, X Hu, Z Du, Q Guo, Y Chen
Advances in Neural Information Processing Systems 35, 36453-36465, 2022
72022
Conceptual reinforcement learning for language-conditioned tasks
S Peng, X Hu, R Zhang, J Guo, Q Yi, R Chen, Z Du, L Li, Q Guo, Y Chen
Proceedings of the AAAI Conference on Artificial Intelligence 37 (8), 9426-9434, 2023
62023
Learning controllable elements oriented representations for reinforcement learning
Q Yi, R Zhang, S Peng, J Guo, X Hu, Z Du, Q Guo, R Chen, L Li, Y Chen
Neurocomputing 549, 126455, 2023
42023
Context shift reduction for offline meta-reinforcement learning
Y Gao, R Zhang, J Guo, F Wu, Q Yi, S Peng, S Lan, R Chen, Z Du, X Hu, ...
Advances in Neural Information Processing Systems 36, 2024
32024
Online prototype alignment for few-shot policy transfer
Q Yi, R Zhang, S Peng, J Guo, Y Gao, K Yuan, R Chen, S Lan, X Hu, Z Du, ...
International Conference on Machine Learning, 39968-39983, 2023
32023
Efficient symbolic policy learning with differentiable symbolic expression
J Guo, R Zhang, S Peng, Q Yi, X Hu, R Chen, Z Du, L Li, Q Guo, Y Chen
Advances in Neural Information Processing Systems 36, 2024
22024
Contrastive modules with temporal attention for multi-task reinforcement learning
S Lan, R Zhang, Q Yi, J Guo, S Peng, Y Gao, F Wu, R Chen, Z Du, X Hu, ...
Advances in Neural Information Processing Systems 36, 2024
12024
Prompt-based Visual Alignment for Zero-shot Policy Transfer
H Gao, R Zhang, Q Yi, H Yao, H Li, J Guo, S Peng, Y Gao, QC Wang, ...
arXiv preprint arXiv:2406.03250, 2024
2024
OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning
F Wu, R Zhang, Q Yi, Y Gao, J Guo, S Peng, S Lan, H Han, Y Pan, K Yuan, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 15897 …, 2024
2024
Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning
S Peng, X Hu, Q Yi, R Zhang, J Guo, D Huang, Z Tian, R Chen, Z Du, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (13), 14599 …, 2024
2024
Contextual Symbolic Policy For Meta-Reinforcement Learning
J Guo, R Zhang, S Peng, Q Yi, X Hu, R Chen, K Long, Z Du, X Zhang, L Li, ...
Causality-driven Hierarchical Structure Discovery for Reinforcement Learning–Appendix
S Peng, X Hu, R Zhang, K Tang, J Guo, Q Yi, R Chen, X Zhang, Z Du, L Li, ...
survival 200, 400, 0
The system can't perform the operation now. Try again later.
Articles 1–14