Exploiting BERT for multimodal target sentiment classification through input space translation Z Khan, Y Fu Proceedings of the 29th ACM international conference on multimedia, 3034-3042, 2021 | 84 | 2021 |
One label, one billion faces: Usage and consistency of racial categories in computer vision Z Khan, Y Fu Proceedings of the 2021 acm conference on fairness, accountability, and …, 2021 | 42 | 2021 |
Recognizing families in the wild (RFIW): the 4th edition JP Robinson, Y Yin, Z Khan, M Shao, S Xia, M Stopa, S Timoner, MA Turk, ... 2020 15th IEEE International Conference on Automatic Face and Gesture …, 2020 | 25 | 2020 |
Single-stream multi-level alignment for vision-language pretraining Z Khan, BG Vijay Kumar, X Yu, S Schulter, M Chandraker, Y Fu European Conference on Computer Vision, 735-751, 2022 | 14 | 2022 |
Families in wild multimedia: A multimodal database for recognizing kinship JP Robinson, Z Khan, Y Yin, M Shao, Y Fu IEEE Transactions on Multimedia 24, 3582-3594, 2021 | 14 | 2021 |
Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images! Z Khan, VK BG, S Schulter, X Yu, Y Fu, M Chandraker Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 7 | 2023 |
Contrastive alignment of vision to language through parameter-efficient transfer learning Z Khan, Y Fu Proceedings of the International Conference on Learning Representations …, 2023 | 5 | 2023 |
Exploring Question Decomposition for Zero-Shot VQA Z Khan, VK BG, S Schulter, M Chandraker, Y Fu Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |
Selective Prediction For Open-Ended Question Answering in Black-Box Vision-Language Models Z Khan, Y Fu R0-FoMo: Robustness of Few-shot and Zero-shot Learning in Large Foundation …, 2023 | 1 | 2023 |
Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering Z Khan, Y Fu arXiv preprint arXiv:2404.10193, 2024 | | 2024 |
Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement Z Khan, VK BG, S Schulter, Y Fu, M Chandraker arXiv preprint arXiv:2404.04627, 2024 | | 2024 |
Where is the bottleneck in long-tailed classification? Z Khan, Y Fu | | 2021 |