Follow
Wenbo Hu
Wenbo Hu
Verified email at cs.ucla.edu - Homepage
Title
Cited by
Cited by
Year
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
W Hu, Y Xu, Y Li, W Li, Z Chen, Z Tu
AAAI 2024, 2023
942023
VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models
H Qiu*, W Hu*, ZY Dou, N Peng
ACL 2024 Findings, 2024
52024
Matryoshka Query Transformer for Large Vision-Language Models
W Hu, ZY Dou, LH Li, A Kamath, N Peng, KW Chang
NeurIPS 2024, 2024
22024
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
W Hu, JC Gu, ZY Dou, M Fayyaz, P Lu, KW Chang, N Peng
arXiv preprint arXiv:2410.08182, 2024
2024
MQT-LLaVA: Matryoshka Query Transformer for Large Vision-Language Models
W Hu, ZY Dou, LH Li, A Kamath, N Peng, KW Chang
NeurIPS 2024, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–5