Xiaoqian Shen
CS PhD @ KAUST
Verified email at kaust.edu.sa
Title
Cited by
Year
MiniGPT-4: Enhancing vision-language understanding with advanced large language models
D Zhu, J Chen, X Shen, X Li, M Elhoseiny
ICLR 2024, 2024
Cited by: 959; Year: 2024
MiniGPT-v2: Large language model as a unified interface for vision-language multi-task learning
J Chen, D Zhu, X Shen, X Li, Z Liu, P Zhang, R Krishnamoorthi, ...
arXiv preprint arXiv:2310.09478, 2023
Cited by: 143; Year: 2023
ChatGPT asks, BLIP-2 answers: Automatic questioning towards enriched visual descriptions
D Zhu, J Chen, K Haydarov, X Shen, W Zhang, M Elhoseiny
TMLR, 2024
Cited by: 59; Year: 2024
HRS-Bench: Holistic, reliable and scalable benchmark for text-to-image models
EM Bakr, X Shen, P Sun, FF Khan, LE Li, M Elhoseiny
ICCV 2023, 2023
Cited by: 23; Year: 2023
KeMRE: knowledge-enhanced medical relation extraction for Chinese medicine instructions
T Qi, S Qiu, X Shen, H Chen, S Yang, H Wen, Y Zhang, Y Wu, Y Huang
Journal of Biomedical Informatics 120, 103834, 2021
Cited by: 17; Year: 2021
Exploring hierarchical graph representation for large-scale zero-shot image classification
K Yi, X Shen, Y Gou, M Elhoseiny
ECCV 2022, 2022
Cited by: 15; Year: 2022
Multi-ConDoS: Multimodal contrastive domain sharing generative adversarial networks for self-supervised medical image segmentation
J Zhang, S Zhang, X Shen, T Lukasiewicz, Z Xu
IEEE Transactions on Medical Imaging, 2023
Cited by: 13; Year: 2023
MoStGAN-V: Video generation with temporal motion styles
X Shen, X Li, M Elhoseiny
CVPR 2023, 2023
Cited by: 13; Year: 2023
Affective visual dialog: A large-scale benchmark for emotional reasoning based on visually grounded conversations
K Haydarov, X Shen, A Madasu, M Salem, J Li, G Elsayed, M Elhoseiny
arXiv preprint arXiv:2308.16349, 2023
Cited by: 2; Year: 2023
Adversarial Text to Continuous Image Generation
K Haydarov, A Muhamed, J Lazarevic, I Skorokhodov, X Shen, ...
Cited by: 2*; Year: 2022
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens
K Ataallah, X Shen, E Abdelrahman, E Sleiman, D Zhu, J Ding, ...
arXiv preprint arXiv:2404.03413, 2024
Year: 2024
StoryGPT-V: Large Language Models as Consistent Story Visualizers
X Shen, M Elhoseiny
arXiv preprint arXiv:2312.02252, 2023
Year: 2023