Ziyue Jiang

Cited by

	All	Since 2019
Citations	245	245
h-index	9	9
i10-index	9	9

Citations per year: 2021: 3, 2022: 9, 2023: 81, 2024: 152

Public access (based on funding mandates)

3 articles available
0 articles not available

Co-authors

Zhou Zhao	Zhejiang University	Verified email at zju.edu.cn
Yi Ren (任意)	Research Scientist, TikTok	Verified email at bytedance.com
Zhenhui Ye (叶振辉)	Zhejiang University	Verified email at zju.edu.cn
Jinglin Liu (刘静林)	Research Scientist, ByteDance	Verified email at bytedance.com
Rongjie Huang	Facebook AI Research (FAIR), Zhejiang University	Verified email at meta.com
Jinzheng He	Zhejiang University	Verified email at zju.edu.cn
Xiang Yin	ByteDance AI Lab	Verified email at bytedance.com
Shengpeng Ji	Zhejiang University	Verified email at zju.edu.cn
Qian Yang	Zhejiang University	Verified email at zju.edu.cn
Chen Zhang	Research Scientist, ByteDance	Verified email at zju.edu.cn
Chunfeng Wang	ByteDance Inc.	Verified email at bytedance.com
Jialong Zuo	Zhejiang University	Verified email at zju.edu.cn

Ziyue Jiang

Zhejiang University

Verified email at zju.edu.cn

Speech Synthesis


Title	Cited by	Year
Geneface: Generalized and high-fidelity audio-driven 3d talking face synthesis. Z Ye, Z Jiang, Y Ren, J Liu, J He, Z Zhao. arXiv preprint arXiv:2301.13430, 2023	69	2023
Mega-tts: Zero-shot text-to-speech at scale with intrinsic inductive bias. Z Jiang, Y Ren, Z Ye, J Liu, C Zhang, Q Yang, S Ji, R Huang, C Wang, ... arXiv preprint arXiv:2306.03509, 2023	32	2023
Make-a-voice: Unified voice synthesis with discrete representation. R Huang, C Zhang, Y Wang, D Yang, L Liu, Z Ye, Z Jiang, C Weng, ... arXiv preprint arXiv:2305.19269, 2023	21	2023
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis. Z Jiang, J Liu, Y Ren, J He, Z Ye, S Ji, Q Yang, C Zhang, P Wei, C Wang, ... The Twelfth International Conference on Learning Representations, 2024	19*	2024
Self-Supervised Spoofing Audio Detection Scheme. J Ziyue, Z Hongcheng, P Li, D Wenbing, R Yanzhen. INTERSPEECH 2020, 4223-4227, 2020	19	2020
FedSpeech: Federated Text-to-Speech with Continual Learning. Z Jiang, Y Ren, M Lei, Z Zhao. IJCAI 2021, 3829-3835, 2021	17	2021
Textrolspeech: A text style control speech corpus with codec language text-to-speech models. S Ji, J Zuo, M Fang, Z Jiang, F Chen, X Duan, B Huai, Z Zhao. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	12	2024
Clapspeech: Learning prosody from text context with contrastive language-audio pre-training. Z Ye, R Huang, Y Ren, Z Jiang, J Liu, J He, X Yin, Z Zhao. arXiv preprint arXiv:2305.10763, 2023	12	2023
Geneface++: Generalized and stable real-time audio-driven 3d talking face generation. Z Ye, J He, Z Jiang, R Huang, J Huang, J Liu, Y Ren, X Yin, Z Ma, Z Zhao. arXiv preprint arXiv:2305.00787, 2023	12	2023
FastDiff 2: Revisiting and incorporating GANs and diffusion models in high-fidelity speech synthesis. R Huang, Y Ren, Z Jiang, C Cui, J Liu, Z Zhao. Findings of the Association for Computational Linguistics: ACL 2023, 6994-7009, 2023	6	2023
Real3d-portrait: One-shot realistic 3d talking portrait synthesis. Z Ye, T Zhong, Y Ren, J Yang, W Li, J Huang, Z Jiang, J He, R Huang, ... arXiv preprint arXiv:2401.08503, 2024	5	2024
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech. S Ji, Z Jiang, H Wang, J Zuo, Z Zhao. arXiv preprint arXiv:2402.09378, 2024	4	2024
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. Z Jiang, Q Yang, J Zuo, Z Ye, R Huang, Y Ren, Z Zhao. arXiv preprint arXiv:2305.13612, 2023	4	2023
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. Z Jiang, Z Su, Z Zhao, Q Yang, Y Ren, J Liu, Z Ye. NeurIPS 2022, 2022	4	2022
Language-codec: Reducing the gaps between discrete codec representation and speech language models. S Ji, M Fang, Z Jiang, R Huang, J Zuo, S Wang, Z Zhao. arXiv preprint arXiv:2402.12208, 2024	3	2024
Ada-TTA: Towards adaptive high-quality text-to-talking avatar synthesis. Z Ye, Z Jiang, Y Ren, J Liu, C Zhang, X Yin, Z Ma, Z Zhao. arXiv preprint arXiv:2306.03504, 2023	3	2023
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. Q Yang, J Xu, W Liu, Y Chu, Z Jiang, X Zhou, Y Leng, Y Lv, Z Zhao, ... arXiv preprint arXiv:2402.07729, 2024	2	2024
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec. S Ji, J Zuo, M Fang, S Zheng, Q Chen, W Wang, Z Jiang, H Huang, ... arXiv preprint arXiv:2406.01205, 2024	1	2024
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency. R Liu, J Xi, Z Jiang, H Li. arXiv preprint arXiv:2309.11725, 2023		2023
InstructSpeech: Following Speech Editing Instructions via Large Language Models. R Huang, R Hu, Y Wang, Z Wang, X Cheng, Z Jiang, Z Ye, D Yang, L Liu, ... Forty-first International Conference on Machine Learning		
