More grounded image captioning by distilling image-text matching model Y Zhou, M Wang, D Liu, Z Hu, H Zhang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 145 | 2020 |
Semi-autoregressive transformer for image captioning Y Zhou, Y Zhang, Z Hu, M Wang Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 29 | 2021 |
Compact bidirectional transformer for image captioning Y Zhou, Z Hu, D Liu, H Ben, M Wang arXiv preprint arXiv:2201.01984, 2022 | 16 | 2022 |
A text-guided generation and refinement model for image captioning D Wang, Z Hu, Y Zhou, R Hong, M Wang IEEE Transactions on Multimedia, 2022 | 11 | 2022 |
Enhanced text-guided attention model for image captioning Y Zhou, Z Hu, Y Zhac, X Liu, R Hong 2018 IEEE fourth international conference on multimedia big data (BigMM), 1-5, 2018 | 5 | 2018 |
Revisiting knowledge distillation for image captioning J Dong, Z Hu, Y Zhou Artificial Intelligence: First CAAI International Conference, CICAI 2021 …, 2021 | 4 | 2021 |
A text-guided graph structure for image captioning D Wang, Z Hu, Y Zhou, X Liu, L Wu, R Hong 2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 1-6, 2020 | 2 | 2020 |
Sequential image encoding for vision-to-language problems J Wang, Y Zhou, Z Hu, X Zhang, M Wang Multimedia Tools and Applications 80 (11), 16141-16152, 2021 | 1 | 2021 |
Video Captioning Based on the Spatial-Temporal Saliency Tracing Y Zhou, Z Hu, X Liu, M Wang Advances in Multimedia Information Processing–PCM 2018: 19th Pacific-Rim …, 2018 | | 2018 |