Yolox: Exceeding yolo series in 2021 Z Ge, S Liu, F Wang, Z Li, J Sun arXiv preprint arXiv:2107.08430, 2021 | 3656 | 2021 |
Ota: Optimal transport assignment for object detection Z Ge, S Liu, Z Li, O Yoshie, J Sun Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 411 | 2021 |
Bevdepth: Acquisition of reliable depth for multi-view 3d object detection Y Li, Z Ge, G Yu, J Yang, Z Wang, Y Shi, J Sun, Z Li Proceedings of the AAAI conference on artificial intelligence, 2022 | 327 | 2022 |
Nms by representative region: Towards crowded pedestrian detection by proposal pairing X Huang, Z Ge, Z Jie, O Yoshie Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 161 | 2020 |
Bevstereo: Enhancing depth estimation in multi-view 3d object detection with dynamic temporal stereo Y Li, H Bao, Z Ge, J Yang, J Sun, Z Li Proceedings of the AAAI conference on artificial intelligence, 2022 | 116* | 2022 |
Dense teacher: Dense pseudo-labels for semi-supervised object detection H Zhou, Z Ge, S Liu, W Mao, Z Li, H Yu, J Sun Proceedings of the European conference on computer vision (ECCV), 2022 | 55 | 2022 |
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning? R Dong, Z Qi, L Zhang, J Zhang, J Sun, Z Ge, L Yi, K Ma International Conference on Learning Representations (ICLR), 2023, 2022 | 53 | 2022 |
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining Z Qi, R Dong, G Fan, Z Ge, X Zhang, K Ma, L Yi International Conference on Machine Learning (ICML), 2023, 2023 | 48 | 2023 |
Dreamllm: Synergistic multimodal comprehension and creation R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang, L Zhao, J Sun, H Zhou, H Wei, ... arXiv preprint arXiv:2309.11499, 2023 | 45 | 2023 |
Sts: Surround-view temporal stereo for multi-view 3d detection Z Wang, C Min, Z Ge, Y Li, Z Li, H Yang, D Huang arXiv preprint arXiv:2208.10145, 2022 | 42 | 2022 |
Lla: Loss-aware label assignment for dense pedestrian detection Z Ge, J Wang, X Huang, S Liu, O Yoshie Neurocomputing 462, 272-281, 2021 | 38 | 2021 |
Ps-rcnn: Detecting secondary human instances in a crowd via primary object suppression Z Ge, Z Jie, X Huang, R Xu, O Yoshie 2020 IEEE international conference on multimedia and expo (ICME), 1-6, 2020 | 34 | 2020 |
Exploring recurrent long-term temporal fusion for multi-view 3d perception C Han, J Sun, Z Ge, J Yang, R Dong, H Zhou, W Mao, Y Peng, X Zhang arXiv preprint arXiv:2303.05970, 2023 | 32 | 2023 |
Implicit identity leakage: The stumbling block to improving deepfake detection generalization S Dong, J Wang, R Ji, J Liang, H Fan, Z Ge Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 31 | 2023 |
Chatspot: Bootstrapping multimodal llms via precise referring instruction tuning L Zhao, E Yu, Z Ge, J Yang, H Wei, H Zhou, J Sun, Y Peng, R Dong, ... arXiv preprint arXiv:2307.09474, 2023 | 22 | 2023 |
Matrixvt: Efficient multi-camera to bev transformation for 3d perception H Zhou, Z Ge, Z Li, X Zhang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 19 | 2023 |
Delving deep into the imbalance of positive proposals in two-stage object detection Z Ge, Z Jie, X Huang, C Li, O Yoshie Neurocomputing 425, 107-116, 2021 | 19 | 2021 |
Vary: Scaling up the vision vocabulary for large vision-language models H Wei, L Kong, J Chen, L Zhao, Z Ge, J Yang, J Sun, C Han, X Zhang arXiv preprint arXiv:2312.06109, 2023 | 15 | 2023 |
Align-DETR: Improving DETR with simple IoU-aware BCE loss Z Cai, S Liu, G Wang, Z Ge, X Zhang, D Huang arXiv preprint arXiv:2304.07527, 2023 | 8 | 2023 |
Workshop on autonomous driving at cvpr 2021: Technical report for streaming perception challenge S Zhang, L Song, S Liu, Z Ge, Z Li, X He, J Sun arXiv preprint arXiv:2108.04230, 2021 | 8 | 2021 |