Yunlong Tang

Cited by

	All	Since 2019
Citations	88	88
h-index	4	4
i10-index	2	2

2023202431 57

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Feng ZhengSouthern University of Science and TechnologyVerified email at sustech.edu.cn
Teng WangDepartment of Computer Science, The University of Hong KongVerified email at connect.hku.hk
Chenliang XuAssociate Professor, University of RochesterVerified email at rochester.edu
Siting XuSouthern University of Science and TechnologyVerified email at mail.sustech.edu.cn
Jiebo LuoAlbert Arendt Hopeman Professor of Engineering, University of RochesterVerified email at cs.rochester.edu
Ping Luo (羅平)Associate Professor, The University of Hong KongVerified email at hku.hk

Yunlong Tang

University of Rochester

Verified email at rochester.edu - Homepage

Multimodal Learning Video Understanding


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Caption Anything: Interactive Image Description with Diverse Multimodal Controls T Wang, J Zhang, J Fei, H Zheng, Y Tang, Z Li, M Gao, S Zhao arXiv preprint arXiv:2305.02677, 2023	52	2023
Video Understanding with Large Language Models: A Survey Y Tang, J Bi, S Xu*, L Song, S Liang, T Wang, D Zhang, J An, J Lin, ... arXiv preprint arXiv:2312.17432, 2023	16	2023
LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning Y Tang, J Zhang, X Wang, T Wang, F Zheng CVPR Workshops, 2023	6	2023
LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad S Xu, Y Tang, F Zheng Proceedings of the International Computer Music Conference (ICMC), 213-217, 2023	4	2023
Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward Y Tang, S Xu, T Wang, Q Lin, Q Lu, F Zheng Proceedings of the Asian Conference on Computer Vision, 3519-3535, 2022	4	2022
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning H Hua, Y Tang, C Xu, J Luo arXiv preprint arXiv:2404.12353, 2024	2	2024
AVicuna: Audio-Visual LLM with Interleaver and Context-Boundary Alignment for Temporal Referential Dialogue Y Tang, D Shimada, J Bi, C Xu arXiv preprint arXiv:2403.16276, 2024	2	2024
Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering P Liu, L Song, D Zhang, H Hua, Y Tang, H Tu, J Luo, C Xu arXiv preprint arXiv:2402.00827, 2024	2	2024
Do More Details Always Introduce More Hallucinations in LVLM-based Image Captioning? M Feng, Y Tang, Z Zhang, C Xu arXiv preprint arXiv:2406.12663, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–9

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors