Llama-adapter: Efficient fine-tuning of language models with zero-init attention. R Zhang, J Han, C Liu, P Gao, A Zhou, X Hu, S Yan, P Lu, H Li, Y Qiao. arXiv preprint arXiv:2303.16199, 2023 | 357 | 2023 |
Personalize segment anything model with one shot. R Zhang, Z Jiang, Z Guo, S Yan, J Pan, H Dong, P Gao, H Li. ICLR 2024, 2023 | 81 | 2023 |
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation. R Fang, S Yan, Z Huang, J Zhou, H Tian, J Dai, H Li. arXiv preprint arXiv:2311.18835, 2023 | 3 | 2023 |
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. S Yan, R Zhang, Z Guo, W Chen, W Zhang, H Li, Y Qiao, Z He, P Gao. AAAI 2024, 2023 | 3 | 2023 |
PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation. S Yan, X Xu, L Hong, W Chen, W Zhang, W Zhang. arXiv preprint arXiv:2309.12303, 2023 | 2 | 2023 |
OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning. L Hong, S Yan, R Zhang, W Li, X Zhou, P Guo, K Jiang, Y Chen, J Li, ... CVPR 2024 Highlight, 2024 | 1 | 2024 |
Three-stage training pipeline with patch random drop for few-shot object detection. S Lin, X Zeng, S Yan, R Zhao. Proceedings of the Asian Conference on Computer Vision, 1027-1043, 2022 | 1 | 2022 |