Mu Cai
Verified email at cs.wisc.edu
Title
Cited by
Year
VOS: Learning What You Don't Know by Virtual Outlier Synthesis
X Du, Z Wang, M Cai, Y Li
Proceedings of the International Conference on Learning Representations 1 (4), 8, 2022
Cited by: 235; Year: 2022
Masked Discrimination for Self-Supervised Learning on Point Clouds
H Liu, M Cai, YJ Lee
Proceedings of the European Conference on Computer Vision (ECCV), 2022
Cited by: 115; Year: 2022
Frequency domain image translation: More photo-realistic, better identity-preserving
M Cai, H Zhang, H Huang, Q Geng, Y Li, G Huang
IEEE International Conference on Computer Vision (ICCV), 2021, 13930-13940, 2021
Cited by: 69; Year: 2021
Investigating the catastrophic forgetting in multimodal large language models
Y Zhai, S Tong, X Li, M Cai, Q Qu, YJ Lee, Y Ma
Conference on Parsimony and Learning (CPAL), 2023
Cited by: 60*; Year: 2023
Out-of-distribution Detection via Frequency-regularized Generative Models
M Cai, Y Li
WACV (Spotlight), 2023, 2022
Cited by: 26; Year: 2022
Making large multimodal models understand arbitrary visual prompts
M Cai, H Liu, SK Mustikovela, GP Meyer, Y Chai, D Park, YJ Lee
CVPR, 2024
Cited by: 21; Year: 2024
A Game-Theoretic Strategy-Aware Interaction Algorithm with Validation on Real Traffic Data
L Sun*, M Cai*, W Zhan, M Tomizuka
The 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2020
Cited by: 14; Year: 2020
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance
Z Huang, A Zhou, Z Lin, M Cai, H Wang, YJ Lee
ICCV, 2023
Cited by: 6; Year: 2023
LLaVA-PruMerge: Adaptive token reduction for efficient large multimodal models
Y Shang, M Cai, B Xu, YJ Lee, Y Yan
arXiv preprint arXiv:2403.15388, 2024
Cited by: 5; Year: 2024
Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding
M Cai, Z Huang, Y Li, H Wang, YJ Lee
arXiv preprint arXiv:2306.06094, 2023
Cited by: 5; Year: 2023
Matryoshka Multimodal Models
M Cai, J Yang, J Gao, YJ Lee
arXiv preprint arXiv:2405.17430, 2024
Cited by: 2; Year: 2024
Yo'LLaVA: Your Personalized Language and Vision Assistant
T Nguyen, H Liu, Y Li, M Cai, U Ojha, YJ Lee
arXiv preprint arXiv:2406.09400, 2024
2024
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples
J Zhang*, M Cai*, T Xie, YJ Lee
Findings of the Association for Computational Linguistics: ACL 2024, 2024
2024
Cross-Modal Self-Supervised Learning with Effective Contrastive Units for Point Clouds
M Cai, C Luo, YJ Lee, X Yang
Causal inference can prevent computer vision from falling into black-box deep learning
M Cai