Follow
Situo Zhang
Title
Cited by
Cited by
Year
Large Language Models Are Semi-Parametric Reinforcement Learning Agents
D Zhang, L Chen, S Zhang, H Xu, Z Zhao, K Yu
Advances in Neural Information Processing Systems 36, 2024
202024
Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback
H Xu, Z Zhu, D Ma, S Zhang, S Fan, L Chen, K Yu
arXiv preprint arXiv:2403.18349, 2024
12024
Multi: Multimodal Understanding Leaderboard with Text and Images
Z Zhu, Y Xu, L Chen, J Yang, Y Ma, Y Sun, H Wen, J Liu, J Cai, Y Ma, ...
arXiv preprint arXiv:2402.03173, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–3