End-to-end dense video grounding via parallel regression F Shi, L Wang, W Huang Computer Vision and Image Understanding (CVIU), 2024 | 5 | 2024 |
Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding F Shi, R Gao, W Huang, L Wang IEEE transactions on pattern analysis and machine intelligence (TPAMI), 2023 | 5 | 2023 |
Progressive visual prompt learning with contrastive feature re-formation C Xu, H Shen, F Shi, B Chen, Y Liao, X Chen, L Wang arXiv preprint arXiv:2304.08386, 2023 | 5 | 2023 |
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models F Shi, J Gu, H Xu, S Xu, W Zhang, L Wang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | | 2024 |
Bridging The Gaps Between Token Pruning and Full Pre-training via Masked Fine-tuning F Shi, L Wang arXiv preprint arXiv:2310.17177, 2023 | | 2023 |