Efficient knowledge distillation from model checkpoints C Wang, Q Yang, R Huang, S Song, G Huang Advances in Neural Information Processing Systems 35, 607-619, 2022 | 37 | 2022 |
Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling S Wang, C Liu, Z Zheng, S Qi, S Chen, Q Yang, A Zhao, C Wang, S Song, ... Findings of the Association for Computational Linguistics ACL 2024, 9909-9953, 2024 | 35* | 2024 |
Towards learning spatially discriminative feature representations C Wang, J Xiao, Y Han, Q Yang, S Song, G Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 25 | 2021 |
Fine-grained few shot learning with foreground object transformation C Wang, S Song, Q Yang, X Li, G Huang Neurocomputing 466, 16-26, 2021 | 18 | 2021 |
Path planning and real-time obstacle avoidance methods of intelligent ships in complex open water environment Y Qisen, W Shenzhi, S Jinnan, W Chaofei, H Gao, WU Cheng, S Shiji Computer Integrated Manufacturing System 28 (7), 2030, 2022 | 9 | 2022 |
Boosting Offline Reinforcement Learning with Action Preference Query Q Yang, S Wang, MG Lin, S Song, G Huang International Conference on Machine Learning 202, 39509--39523, 2023 | 8 | 2023 |
复杂开放水域下智能船舶路径规划与避障方法 杨琪森, 王慎执, 桑金楠, 王朝飞, 黄高, 吴澄, 宋士吉 计算机集成制造系统 28 (7), 2030, 2022 | 8 | 2022 |
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents Q Yang, Z Wang, H Chen, S Wang, Y Pu, X Gao, W Huang, S Song, ... Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024 | 7* | 2024 |
Train once, get a family: State-adaptive balances for offline-to-online reinforcement learning S Wang, Q Yang, J Gao, M Lin, H Chen, L Wu, N Jia, S Song, G Huang Advances in Neural Information Processing Systems 36, 2024 | 7 | 2024 |
Hundreds guide millions: Adaptive offline reinforcement learning with expert guidance Q Yang, S Wang, Q Zhang, G Huang, S Song IEEE Transactions on Neural Networks and Learning Systems, 2023 | 5 | 2023 |
Leveraging reward consistency for interpretable feature discovery in reinforcement learning Q Yang, H Wang, M Tong, W Shi, G Huang, S Song IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2023 | 4 | 2023 |
Decoupled Prioritized Resampling for Offline RL Y Yue, B Kang, X Ma, Q Yang, G Huang, S Song, S Yan arXiv preprint arXiv:2306.05412, 2023 | | 2023 |