Follow
Qisen Yang (杨琪森)
Title
Cited by
Cited by
Year
Efficient knowledge distillation from model checkpoints
C Wang, Q Yang, R Huang, S Song, G Huang
Advances in Neural Information Processing Systems 35, 607-619, 2022
372022
Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling
S Wang, C Liu, Z Zheng, S Qi, S Chen, Q Yang, A Zhao, C Wang, S Song, ...
Findings of the Association for Computational Linguistics ACL 2024, 9909-9953, 2024
35*2024
Towards learning spatially discriminative feature representations
C Wang, J Xiao, Y Han, Q Yang, S Song, G Huang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
252021
Fine-grained few shot learning with foreground object transformation
C Wang, S Song, Q Yang, X Li, G Huang
Neurocomputing 466, 16-26, 2021
182021
Path planning and real-time obstacle avoidance methods of intelligent ships in complex open water environment
Y Qisen, W Shenzhi, S Jinnan, W Chaofei, H Gao, WU Cheng, S Shiji
Computer Integrated Manufacturing System 28 (7), 2030, 2022
92022
Boosting Offline Reinforcement Learning with Action Preference Query
Q Yang, S Wang, MG Lin, S Song, G Huang
International Conference on Machine Learning 202, 39509--39523, 2023
82023
复杂开放水域下智能船舶路径规划与避障方法
杨琪森, 王慎执, 桑金楠, 王朝飞, 黄高, 吴澄, 宋士吉
计算机集成制造系统 28 (7), 2030, 2022
82022
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents
Q Yang, Z Wang, H Chen, S Wang, Y Pu, X Gao, W Huang, S Song, ...
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
7*2024
Train once, get a family: State-adaptive balances for offline-to-online reinforcement learning
S Wang, Q Yang, J Gao, M Lin, H Chen, L Wu, N Jia, S Song, G Huang
Advances in Neural Information Processing Systems 36, 2024
72024
Hundreds guide millions: Adaptive offline reinforcement learning with expert guidance
Q Yang, S Wang, Q Zhang, G Huang, S Song
IEEE Transactions on Neural Networks and Learning Systems, 2023
52023
Leveraging reward consistency for interpretable feature discovery in reinforcement learning
Q Yang, H Wang, M Tong, W Shi, G Huang, S Song
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2023
42023
Decoupled Prioritized Resampling for Offline RL
Y Yue, B Kang, X Ma, Q Yang, G Huang, S Song, S Yan
arXiv preprint arXiv:2306.05412, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–12