Title | Authors | Venue | Cited by | Year |
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction | H Feng, Y Wang, W Zhou, J Deng, H Li | ACM International Conference on Multimedia (ACM MM), 2021 | 47 | 2021 |
Geometric Representation Learning for Document Image Rectification | H Feng, W Zhou, J Deng, Y Wang, H Li | European Conference on Computer Vision (ECCV), 2022, 475-492, 2022 | 22 | 2022 |
DocScanner: Robust Document Image Rectification with Progressive Learning | H Feng, W Zhou, J Deng, Q Tian, H Li | arXiv preprint arXiv:2110.14968, 2021 | 16 | 2021 |
UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding | H Feng, Z Wang, J Tang, J Lu, W Zhou, H Li, C Huang | arXiv preprint arXiv:2308.11592, 2023 | 14 | 2023 |
Deep Unrestricted Document Image Rectification | H Feng, S Liu, J Deng, W Zhou, H Li | IEEE Transactions on Multimedia (TMM), 2023 | 10 | 2023 |
DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding | H Feng, Q Liu, H Liu, W Zhou, H Li, C Huang | arXiv preprint arXiv:2311.11810, 2023 | 9 | 2023 |
Recurrent Generic Contour-based Instance Segmentation with Progressive Learning | H Feng, K Zhou, W Zhou, Y Yin, J Deng, Q Sun, H Li | IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024, 2023 | 7* | 2023 |
Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs | Y Wang, W Zhou, H Feng, K Zhou, H Li | arXiv preprint arXiv:2311.13194, 2023 | 6 | 2023 |
Sign Language Translation with Iterative Prototype | H Yao, W Zhou, H Feng, H Hu, H Zhou, H Li | International Conference on Computer Vision (ICCV), 2023 | 4 | 2023 |
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning | H Feng, W Wang, J Deng, W Zhou, L Li, H Li | International Conference on Computer Vision (ICCV), 2023 | 3 | 2023 |
Model-aware Pre-training for Radial Distortion Rectification | W Wang, H Feng, W Zhou, Z Liao, H Li | IEEE Transactions on Image Processing (TIP), 2023 | 2 | 2023 |
DocMAE: Document Image Rectification via Self-supervised Representation Learning | S Liu, H Feng, W Zhou, H Li, C Liu, F Wu | International Conference on Multimedia and Expo (ICME), 2023 | 2 | 2023 |
Progressive Recurrent Network for Shadow Removal | Y Wang, W Zhou, H Feng, L Li, H Li | Computer Vision and Image Understanding (CVIU), 103861, 2023 | 1 | 2023 |
TextSquare: Scaling up Text-Centric Visual Instruction Tuning | J Tang, C Lin, Z Zhao, S Wei, B Wu, Q Liu, H Feng, Y Li, S Wang, L Liao, ... | arXiv preprint arXiv:2404.12803, 2024 | | 2024 |
Progressive Multi-modal Conditional Prompt Tuning | X Qiu, H Feng, Y Wang, W Zhou, H Li | arXiv preprint arXiv:2404.11864, 2024 | | 2024 |
TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding | B Luan, H Feng, H Chen, Y Wang, W Zhou, H Li | arXiv preprint arXiv:2404.09797, 2024 | | 2024 |
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser | H Feng, W Wang, S Liu, J Deng, W Zhou, H Li | arXiv preprint arXiv:2402.19108, 2024 | | 2024 |
Rethinking Supervision in Document Unwarping: A Self-consistent Flow-free Approach | S Liu, H Feng, W Zhou | IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023 | | 2023 |
PolyTracker: Progressive Contour Regression for Multiple Object Tracking and Segmentation | S Shen, H Feng, W Zhou, H Li | Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2022 …, 2022 | | 2022 |