Language-augmented pixel embedding for generalized zero-shot learning Z Wang, Y Gou, J Li, L Zhu, HT Shen IEEE Transactions on Circuits and Systems for Video Technology 33 (3), 1019-1030, 2022 | 12 | 2022 |
Region semantically aligned network for zero-shot learning Z Wang, Y Gou, J Li, Y Zhang, Y Yang Proceedings of the 30th ACM International Conference on Information …, 2021 | 9 | 2021 |
A simple llm framework for long-range video question-answering C Zhang, T Lu, MM Islam, Z Wang, S Yu, M Bansal, G Bertasius arXiv preprint arXiv:2312.17235, 2023 | 8 | 2023 |
Unified coarse-to-fine alignment for video-text retrieval Z Wang, YL Sung, F Cheng, G Bertasius, M Bansal Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 8 | 2023 |
DAM: Dynamic Adapter Merging for Continual Video QA Learning F Cheng, Z Wang, YL Sung, YB Lin, M Bansal, G Bertasius arXiv preprint arXiv:2403.08755, 2024 | | 2024 |
Unified Embeddings for Multimodal Retrieval via Frozen LLMs Z Wang, H Elfardy, M Dreyer, K Small, M Bansal Findings of the Association for Computational Linguistics: EACL 2024, 1537-1547, 2024 | | 2024 |