SMASH: Improving SMAll Language Models’ Few-SHot Ability with Prompt-Based Distillation Y Wang, C Liu, K Chen, X Wang, D Zhao Findings of the Association for Computational Linguistics: EMNLP 2022, 6608-6619, 2022 | 3 | 2022 |
Hawkeye: Training Video-Text LLMs for Grounding Text in Videos Y Wang, X Meng, J Liang, Y Wang, Q Liu, D Zhao arXiv preprint arXiv:2403.10228, 2024 | 2 | 2024 |
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions Y Wang, Z Zheng, X Zhao, J Li, Y Wang, D Zhao arXiv preprint arXiv:2305.18756, 2023 | 2 | 2023 |
LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding Y Wang, Y Wang, P Wu, J Liang, D Zhao, Z Zheng arXiv preprint arXiv:2402.16050, 2024 | 1 | 2024 |
Overview of the NLPCC 2023 Shared Task 10: Learn to Watch TV: Multimodal Dialogue Understanding and Response Generation Y Wang, Y Wang, D Zhao CCF International Conference on Natural Language Processing and Chinese …, 2023 | 1 | 2023 |
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering Y Wang, Y Wang, K Chen, D Zhao Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 19215 …, 2024 | | 2024 |