Wanqing Cui
Title
Cited by
Year
WenLan: Bridging vision and language by large-scale multi-modal pre-training
Y Huo, M Zhang, G Liu, H Lu, Y Gao, G Yang, J Wen, H Zhang, B Xu, ...
arXiv preprint arXiv:2103.06561, 2021
Cited by: 117 · Year: 2021
Beyond language: Learning commonsense from images for reasoning
W Cui, Y Lan, L Pang, J Guo, X Cheng
arXiv preprint arXiv:2010.05001, 2020
Cited by: 5 · Year: 2020
MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning
W Cui, K Bi, J Guo, X Cheng
arXiv preprint arXiv:2402.13625, 2024
Cited by: 1 · Year: 2024
Image-Text Matching with Multi-View Attention
R Cheng, W Cui
arXiv preprint arXiv:2402.17237, 2024
Year: 2024