MST: Masked self-supervised transformer for visual representation Z Li, Z Chen, F Yang, W Li, Y Zhu, C Zhao, R Deng, L Wu, R Zhao, M Tang, ... Advances in Neural Information Processing Systems 34, 13165-13176, 2021 | 127 | 2021 |
DPT: Deformable patch-based transformer for visual recognition Z Chen, Y Zhu, C Zhao, G Hu, W Zeng, J Wang, M Tang Proceedings of the 29th ACM International Conference on Multimedia, 2899-2907, 2021 | 80 | 2021 |
UniVIP: A unified framework for self-supervised visual pre-training Z Li, Y Zhu, F Yang, W Li, C Zhao, Y Chen, Z Chen, J Xie, L Wu, R Zhao, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 31 | 2022 |
Obj2Seq: Formatting objects as sequences with class prompt for visual tasks Z Chen, Y Zhu, Z Li, F Yang, W Li, H Wang, C Zhao, L Wu, R Zhao, ... Advances in Neural Information Processing Systems 35, 2494-2506, 2022 | 15 | 2022 |
Efficient masked autoencoders with self-consistency Z Li, Y Zhu, Z Chen, W Li, C Zhao, L Wu, R Zhao, M Tang, J Wang arXiv preprint arXiv:2302.14431, 2023 | 3 | 2023 |
The Devil is in Details: Delving Into Lite FFN Design for Vision Transformers Z Chen, Y Zhu, Z Li, F Yang, C Zhao, J Wang, M Tang ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
EFCPose: End-to-End Multi-Person Pose Estimation with Fully Convolutional Heads H Wang, L Zhou, Y Chen, Z Chen, M Tang, J Wang IEEE Transactions on Circuits and Systems for Video Technology, 2023 | | 2023 |