Follow
Yunlong Tang
Yunlong Tang
Verified email at rochester.edu - Homepage
Title
Cited by
Cited by
Year
Caption Anything: Interactive Image Description with Diverse Multimodal Controls
T Wang, J Zhang, J Fei, H Zheng, Y Tang, Z Li, M Gao, S Zhao
arXiv preprint arXiv:2305.02677, 2023
402023
Video Understanding with Large Language Models: A Survey
Y Tang*, J Bi*, S Xu*, L Song, S Liang, T Wang, D Zhang, J An, J Lin, ...
arXiv preprint arXiv:2312.17432, 2023
82023
LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad
S Xu*, Y Tang*, F Zheng
arXiv preprint arXiv:2307.04827, 2023
42023
LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning
Y Tang, J Zhang, X Wang, T Wang, F Zheng
arXiv preprint arXiv:2306.10354, 2023
42023
Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward
Y Tang, S Xu, T Wang, Q Lin, Q Lu, F Zheng
Proceedings of the Asian Conference on Computer Vision, 3519-3535, 2022
42022
AVicuna: Audio-Visual LLM with Interleaver and Context-Boundary Alignment for Temporal Referential Dialogue
Y Tang, D Shimada, J Bi, C Xu
arXiv preprint arXiv:2403.16276, 2024
12024
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
H Hua*, Y Tang*, C Xu, J Luo
arXiv preprint arXiv:2404.12353, 2024
2024
Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering
P Liu, L Song, D Zhang, H Hua, Y Tang, H Tu, J Luo, C Xu
arXiv preprint arXiv:2402.00827, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–8