Two-stream transformer for multi-label image classification X Zhu, J Cao, J Ge, W Liu, B Liu Proceedings of the 30th ACM International Conference on Multimedia, 3598-3607, 2022 | 23 | 2022 |
Effective fine-grained location prediction based on user check-in pattern in LBSNs J Cao, S Xu, X Zhu, R Lv, B Liu Journal of Network and Computer Applications 108, 64-75, 2018 | 21 | 2018 |
Joint visual-textual sentiment analysis based on cross-modality attention mechanism X Zhu, B Cao, S Xu, B Liu, J Cao MultiMedia Modeling: 25th International Conference, MMM 2019, Thessaloniki …, 2019 | 20 | 2019 |
Balanced symmetric cross entropy for large scale imbalanced and noisy data F Huang, J Li, X Zhu arXiv preprint arXiv:2007.01618, 2020 | 14 | 2020 |
Efficient fine-grained location prediction based on user mobility pattern in LBSNs J Cao, S Xu, X Zhu, R Lv, B Liu 2017 Fifth International Conference on Advanced Cloud and Big Data (CBD …, 2017 | 14 | 2017 |
Exploring visual pre-training for robot manipulation: Datasets, models and methods Y Jing, X Zhu, X Liu, Q Sima, T Yang, Y Feng, T Kong 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023 | 13 | 2023 |
Visual-textual sentiment analysis enhanced by hierarchical cross-modality interaction T Zhou, J Cao, X Zhu, B Liu, S Li IEEE Systems Journal 15 (3), 4303-4314, 2020 | 13 | 2020 |
Scene-aware label graph learning for multi-label image classification X Zhu, J Liu, W Liu, J Ge, B Liu, J Cao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 11 | 2023 |
Real-time anomaly detection on surveillance video with two-stream spatio-temporal generative model W Liu, J Cao, Y Zhu, B Liu, X Zhu Multimedia Systems 29 (1), 59-71, 2023 | 10 | 2023 |
Community discovery based on social relations and temporal-spatial topics in LBSNs S Xu, J Cao, X Zhu, Y Dong, B Liu Advances in Knowledge Discovery and Data Mining: 22nd Pacific-Asia …, 2018 | 7 | 2018 |
Text as image: Learning transferable adapter for multi-label classification X Zhu, J Cao, D Tang, F Xu, W Liu, J Ge, B Liu, Q Guo, T Zhang arXiv preprint arXiv:2312.04160, 2023 | 3 | 2023 |
Enhancing Micro-Video Venue Recognition via Multi-Modal and Multi-Granularity Object Relations W Liu, J Cao, R Wei, X Zhu, B Liu IEEE Transactions on Circuits and Systems for Video Technology, 2024 | 2 | 2024 |
Beyond Visual Cues: Synchronously Exploring Target-Centric Semantics for Vision-Language Tracking J Ge, X Chen, J Cao, X Zhu, W Liu, B Liu arXiv preprint arXiv:2311.17085, 2023 | 2 | 2023 |
Context-Enhanced Video Moment Retrieval with Large Language Models W Liu, B Miao, J Cao, X Zhu, B Liu, M Nasim, A Mian arXiv preprint arXiv:2405.12540, 2024 | 1 | 2024 |
Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification X Zhu, J Liu, D Tang, J Ge, W Liu, B Liu, J Cao arXiv preprint arXiv:2401.01181, 2024 | | 2024 |
Supplementary Materials Scene-Aware Label Graph Learning for Multi-Label Image Classification X Zhu, J Liu, W Liu, J Ge, B Liu, J Cao | | |
An Effective Ensemble Method for AliProducts Challenge: Large-scale Product Recognition X Zhu, W Liu | | |