Follow
Zhuowan Li
Title
Cited by
Cited by
Year
Fd-gan: Pose-guided feature distilling gan for robust person re-identification
Y Ge, Z Li, H Zhao, G Yin, S Yi, X Wang, H Li
Proceedings of 32nd Conference on Neural Information Processing Systems …, 2018
3992018
Swapmix: Diagnosing and regularizing the over-reliance on visual context in visual question answering
V Gupta, Z Li, A Kortylewski, C Zhang, Y Li, A Yuille
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
472022
Context-aware group captioning via self-attention and contrastive features
Z Li, Q Tran, L Mai, Z Lin, AL Yuille
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
422020
Visual commonsense in pretrained unimodal and multimodal models
C Zhang, B Van Durme, Z Li, E Stengel-Eskin
Proceedings of the 2022 Conference of the North American Chapter of the …, 2022
312022
Super-CLEVR: A virtual benchmark to diagnose domain robustness in visual reasoning
Z Li, X Wang, E Stengel-Eskin, A Kortylewski, W Ma, B Van Durme, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
172023
Calibrating concepts and operations: Towards symbolic reasoning on real images
Z Li, E Stengel-Eskin, Y Zhang, C Xie, QH Tran, B Van Durme, A Yuille
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
162021
Swapmix: Diagnosing and regularizing the over-reliance on visual context in visual question answering. 2022 IEEE
V Gupta, Z Li, A Kortylewski, C Zhang, Y Li, AL Yuille
CVF Conference on Computer Vision and Pattern Recognition (CVPR), 5068-5078, 2022
62022
Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models
Z Li, C Xie, B Van Durme, A Yuille
Proceedings of the 18th Conference of the European Chapter of the …, 2024
2*2024
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models
S Zhao, Z Li, Y Lu, A Yuille, Y Wang
arXiv preprint arXiv:2312.06685, 2023
12023
Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA
Z Li, B Jasani, P Tang, S Ghadar
arXiv preprint arXiv:2403.16385, 2024
2024
Contrastive captioning for image groups
T Quan, M Long, L Zhe, L Zhuowan
US Patent US20240037939A1, 2024
2024
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
X Wang, W Ma, Z Li, A Kortylewski, A Yuille
Thirty-seventh Conference on Neural Information Processing Systems, 2023
2023
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Y Wang, A Yuille, Z Li, Z Zheng
2023
The system can't perform the operation now. Try again later.
Articles 1–13