Masked Motion Encoding for Self-Supervised Video Representation Learning X Sun, P Chen, L Chen, C Li, TH Li, M Tan, C Gan CVPR 2023, 2235-2245, 2023 | 23 | 2023 |
Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models P Chen, X Sun, H Zhi, R Zeng, TH Li, G Liu, M Tan, C Gan NeurIPS 2023 Robot Learning Workshop, 2023 | 11 | 2023 |
Prioritized Semantic Learning for Zero-shot Instance Navigation X Sun, L Lau, H Zhi, R Qiu, J Liang ECCV 2024, 2024 | 1 | 2024 |
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation X Sun, P Chen, J Fan, TH Li, J Chen, M Tan NeurIPS 2023, 2023 | 1 | 2023 |
CoNav: A Benchmark for Human-Centered Collaborative Navigation C Li*, X Sun*, P Chen*, J Fan, Z Wang, Y Liu, J Zhu, C Gan, M Tan arXiv preprint arXiv:2406.02425, 2024 | | 2024 |
A Simple Knowledge Distillation Framework for Open-world Object Detection S Ma, Y Wang, Y Wei, J Fan, X Sun, P Chen, E Zhang arXiv preprint arXiv:2312.08653, 2023 | | 2023 |
Contrastive Vision-Language Alignment Makes Efficient Instruction Learner L Liu*, X Sun*, T Xiang, Z Zhuang, L Yin, M Tan arXiv preprint arXiv:2311.17945, 2023 | | 2023 |
MVideo: Masked Motion Modeling for Self-Supervised Video Representation Learning X Sun, P Chen, L Chen, TH Li, M Tan, C Gan arXiv preprint arXiv:2210.06096, 2022 | | 2022 |