Follow
Cong Wei
Cong Wei
Verified email at uwaterloo.ca - Homepage
Title
Cited by
Cited by
Year
Mmmu: A massive multi-discipline multimodal understanding and reasoning benchmark for expert agi
X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
1272024
Dreamedit: Subject-driven image editing
T Li, M Ku, C Wei, W Chen
arXiv preprint arXiv:2306.12624, 2023
152023
Consisti2v: Enhancing visual consistency for image-to-video generation
W Ren, H Yang, G Zhang, C Wei, X Du, S Huang, W Chen
arXiv preprint arXiv:2402.04324, 2024
102024
Uniir: Training and benchmarking universal multimodal information retrievers
C Wei, Y Chen, H Chen, H Hu, G Zhang, J Fu, A Ritter, W Chen
arXiv preprint arXiv:2311.17136, 2023
92023
Anyv2v: A plug-and-play framework for any video-to-video editing tasks
M Ku, C Wei, W Ren, H Yang, W Chen
arXiv preprint arXiv:2403.14468, 2024
72024
Viescore: Towards explainable metrics for conditional image synthesis evaluation
M Ku, D Jiang, C Wei, X Yue, W Chen
arXiv preprint arXiv:2312.14867, 2023
72023
Sparsifiner: Learning sparse instance-dependent attention for efficient vision transformers
C Wei, B Duke, R Jiang, P Aarabi, GW Taylor, F Shkurti
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
62023
Mantis: Interleaved multi-image instruction tuning
D Jiang, X He, H Zeng, C Wei, M Ku, Q Liu, W Chen
arXiv preprint arXiv:2405.01483, 2024
52024
The system can't perform the operation now. Try again later.
Articles 1–8