Follow
Hai Huang
Hai Huang
Verified email at zju.edu.cn - Homepage
Title
Cited by
Cited by
Year
Achieving cross modal generalization with multimodal unified representation
Y Xia*, H Huang*, J Zhu, Z Zhao
NeurIPS 2023, 2024
52024
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec
S Ji, J Zuo, M Fang, S Zheng, Q Chen, W Wang, Z Jiang, H Huang, ...
arXiv preprint arXiv:2406.01205, 2024
22024
Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment
H Huang, Y Xia, S Ji, S Wang, H Wang, J Zhu, Z Dong, Z Zhao
arXiv preprint arXiv:2403.05168, 2024
12024
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling
M Fang, S Ji, J Zuo, H Huang, Y Xia, J Zhu, X Cheng, X Yang, W Liu, ...
arXiv preprint arXiv:2406.17507, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–4