Env-qa: A video question answering benchmark for comprehensive understanding of dynamic environments D Gao, R Wang, Z Bai, X Chen Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 25 | 2021 |
Glance and focus: Memory prompting for multi-event video question answering Z Bai, R Wang, X Chen Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |
Local feature enhancement network for set-based face recognition Z Bai, R Wang, S Shan, X Chen 2021 16th IEEE International Conference on Automatic Face and Gesture …, 2021 | 3 | 2021 |
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering Z Bai, R Wang, D Gao, X Chen IEEE Transactions on Image Processing, 2024 | | 2024 |
Glance and Focus: Memory Prompting for Multi-Event Video Question Answering Supplementary Material Z Bai, R Wang, X Chen | | |