Hierarchical global-local temporal modeling for video captioning Y Hu, Z Chen, ZJ Zha, F Wu Proceedings of the 27th ACM international conference on multimedia, 774-783, 2019 | 52 | 2019 |
Subjective Evaluation of Visual Quality and Simulator Sickness of Short 360 Videos: ITU-T Rec. P.919 J Gutierrez, P Perez, M Orduna, A Singla, C Cortes, P Mazumdar, I Viola, ... IEEE transactions on multimedia 24, 3087-3100, 2021 | 51 | 2021 |
Make it move: controllable image-to-video generation with text descriptions Y Hu, C Luo, Z Chen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 37 | 2022 |
A multimodal variational encoder-decoder framework for micro-video popularity prediction J Xie, Y Zhu, Z Zhang, J Peng, J Yi, Y Hu, H Liu, Z Chen Proceedings of the web conference 2020, 2542-2548, 2020 | 37 | 2020 |
RGB-D semantic segmentation: a review Y Hu, Z Chen, W Lin 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 1-6, 2018 | 27 | 2018 |
Exploiting the local temporal information for video captioning R Wei, L Mi, Y Hu, Z Chen Journal of Visual Communication and Image Representation 67, 102751, 2020 | 17 | 2020 |
Predicate correlation learning for scene graph generation L Tao, L Mi, N Li, X Cheng, Y Hu, Z Chen IEEE Transactions on Image Processing 31, 4173-4185, 2022 | 16 | 2022 |
Two-stream refinement network for RGB-D saliency detection D Liu, Y Hu, K Zhang, Z Chen 2019 IEEE International Conference on Image Processing (ICIP), 3925-3929, 2019 | 12 | 2019 |
Lamd: Latent motion diffusion for video generation Y Hu, Z Chen, C Luo arXiv preprint arXiv:2304.11603, 2023 | 7 | 2023 |
Maps: Joint multimodal attention and pos sequence generation for video captioning C Zou, X Wang, Y Hu, Z Chen, S Liu 2021 International Conference on Visual Communications and Image Processing …, 2021 | 2 | 2021 |
Subjective study of perceptual quality for micro-video applications Y Hu, Y Zhang, Z Liu, Z Chen, S Liu 2020 IEEE Conference on Multimedia Information Processing and Retrieval …, 2020 | 2 | 2020 |
A benchmark for controllable text-image-to-video generation Y Hu, C Luo, Z Chen IEEE Transactions on Multimedia, 2023 | 1 | 2023 |
Multiple visual relationship forecasting and arrangement in videos W Ouyang, Y Hu, Y Ou, Z Chen Neurocomputing 541, 126274, 2023 | | 2023 |
Decomposing style, content, and motion for videos Y Hu, D Yin, Y Wang, Z Chen, C Luo Journal of Visual Communication and Image Representation 89, 103686, 2022 | | 2022 |
Learn to Look Around: Deep Reinforcement Learning Agent for Video Saliency Prediction Y Tao, Y Hu, Z Chen 2021 International Conference on Visual Communications and Image Processing …, 2021 | | 2021 |
Supplementary Material Make It Move: Controllable Image-to-Video Generation with Text Descriptions Y Hu, C Luo, Z Chen | | |