Follow
Yuying Ge
Yuying Ge
Tencent AI Lab
Verified email at tencent.com - Homepage
Title
Cited by
Cited by
Year
Deepfashion2: A versatile benchmark for detection, pose estimation, segmentation and re-identification of clothing images
Y Ge, R Zhang, X Wang, X Tang, P Luo
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
4082019
Parser-Free Virtual Try-on via Distilling Appearance Flows
Y Ge, Y Song, R Zhang, C Ge, W Liu, P Luo
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
1452021
All in one: Exploring unified video-language pre-training
J Wang, Y Ge, R Yan, Y Ge, KQ Lin, S Tsutsui, X Lin, G Cai, J Wu, Y Shan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
1392023
Bridging Video-Text Retrieval With Multiple Choice Questions
Y Ge, Y Ge, X Liu, D Li, Y Shan, X Qie, P Luo
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
1242022
SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension
B Li, R Wang, G Wang, Y Ge, Y Ge, Y Shan
arXiv preprint arXiv:2307.16125, 2023
1052023
Scan: Self-and-collaborative attention network for video person re-identification
R Zhang, J Li, H Sun, Y Ge, P Luo, X Wang, L Lin
IEEE Transactions on Image Processing 28 (10), 4870-4882, 2019
912019
Disentangled Cycle Consistency for Highly-realistic Virtual Try-On
C Ge, Y Song, Y Ge, H Yang, W Liu, P Luo
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
892021
Miles: Visual bert pre-training with injected language semantics for video-text retrieval
Y Ge, Y Ge, X Liu, J Wang, J Wu, Y Shan, X Qie, P Luo
European Conference on Computer Vision, 691-708, 2022
342022
Planting a SEED of Vision in Large Language Model
Y Ge, Y Ge, Z Zeng, X Wang, Y Shan
arXiv preprint arXiv:2307.08041, 2023
282023
Gnfactor: Multi-task real robot learning with generalizable neural feature fields
Y Ze, G Yan, YH Wu, A Macaluso, Y Ge, J Ye, N Hansen, LE Li, X Wang
Conference on Robot Learning, 284-301, 2023
212023
Making LLaMA SEE and Draw with SEED Tokenizer
Y Ge, S Zhao, Z Zeng, Y Ge, C Li, X Wang, Y Shan
arXiv preprint arXiv:2310.01218, 2023
172023
Journeydb: A benchmark for generative image understanding
K Sun, J Pan, Y Ge, H Li, H Duan, X Wu, R Zhang, A Zhou, Z Qin, Y Wang, ...
Advances in Neural Information Processing Systems 36, 2024
112024
VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
J Zhu, X Ding, Y Ge, Y Ge, S Zhao, H Zhao, X Wang, Y Shan
arXiv preprint arXiv:2312.09251, 2023
62023
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge
Z Zeng, Y Ge, X Liu, B Chen, P Luo, ST Xia, Y Ge
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
62023
Retrieving-to-answer: Zero-shot video question answering with frozen large language models
J Pan, Z Lin, Y Ge, X Zhu, R Zhang, Y Wang, Y Qiao, H Li
Proceedings of the IEEE/CVF International Conference on Computer Vision, 272-283, 2023
52023
Policy Adaptation From Foundation Model Feedback
Y Ge, A Macaluso, LE Li, P Luo, X Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
5*2023
Align, Adapt and Inject: Sound-guided Unified Image Generation
Y Yang, K Zhang, Y Ge, W Shao, Z Xue, Y Qiao, P Luo
arXiv preprint arXiv:2306.11504, 2023
42023
MetaCloth: Learning Unseen Tasks of Dense Fashion Landmark Detection from a Few Samples
Y Ge, R Zhang, P Luo
IEEE Transactions on Image Processing, 2021
42021
SEED-Bench-2: Benchmarking Multimodal Large Language Models
B Li, Y Ge, Y Ge, G Wang, R Wang, R Zhang, Y Shan
arXiv preprint arXiv:2311.17092, 2023
32023
EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models
Y Chen, Y Ge, Y Ge, M Ding, B Li, R Wang, R Xu, Y Shan, X Liu
arXiv preprint arXiv:2312.06722, 2023
22023
The system can't perform the operation now. Try again later.
Articles 1–20