Chatbridge: Bridging modalities with large language model as a language catalyst Z Zhao, L Guo, T Yue, S Chen, S Shao, X Zhu, Z Yuan, J Liu arXiv preprint arXiv:2305.16103, 2023 | 29 | 2023 |
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models T Yue, J Cheng, L Guo, X Dai, Z Zhao, X He, G Xiong, Y Lv, J Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 1 | 2024 |
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation W Wang, T Yue, Y Zhang, L Guo, X He, X Wang, J Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 1 | 2024 |
Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs Z Zhao, H Lu, Y Huo, Y Du, T Yue, L Guo, B Wang, W Chen, J Liu arXiv preprint arXiv:2406.09367, 2024 | | 2024 |
ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval Z Zhao, L Guo, T Yue, E Hu, S Shao, Z Yuan, J Liu | | |