Improving prosody with linguistic and bert derived features in multi-speaker based mandarin chinese neural tts Y Xiao, L He, H Ming, FK Soong ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 59 | 2020 |
Paired phone-posteriors approach to ESL pronunciation quality assessment Y Xiao, FK Soong, W Hu bdl 1 (782d), 3, 2018 | 19 | 2018 |
Proficiency Assessment of ESL Learner's Sentence Prosody with TTS Synthesized Voice as Reference. Y Xiao, FK Soong INTERSPEECH, 1755-1759, 2017 | 11 | 2017 |
Prosodyspeech: Towards advanced prosody model for neural text-to-speech Y Yi, L He, S Pan, X Wang, Y Xiao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
Improving fastspeech tts with efficient self-attention and compact feed-forward network Y Xiao, X Wang, L He, FK Soong ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 2 | 2022 |
QS-TTS: towards semi-supervised text-to-speech synthesis via vector-quantized self-supervised speech representation learning H Guo, F Xie, J Kang, Y Xiao, X Wu, H Meng arXiv preprint arXiv:2309.00126, 2023 | 1 | 2023 |
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading Y Xiao, S Zhang, X Wang, X Tan, L He, S Zhao, FK Soong, T Lee arXiv preprint arXiv:2307.00782, 2023 | 1 | 2023 |