Controllable emotion transfer for end-to-end speech synthesis T Li, S Yang, L Xue, L Xie 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 70 | 2021 |
Pre-alignment guided attention for improving training efficiency and model stability in end-to-end speech synthesis X Zhu, Y Zhang, S Yang, L Xue, L Xie IEEE Access 7, 65955-65964, 2019 | 38 | 2019 |
Building a mixed-lingual neural TTS system with only monolingual data L Xue, W Song, G Xu, L Xie, Z Wu arXiv preprint arXiv:1904.06063, 2019 | 35 | 2019 |
On the localness modeling for the self-attention based end-to-end speech synthesis S Yang, H Lu, S Kang, L Xue, J Xiao, D Su, L Xie, D Yu Neural Networks 125, 121-130, 2020 | 32 | 2020 |
Cycle consistent network for end-to-end style transfer TTS training L Xue, S Pan, L He, L Xie, FK Soong Neural Networks 140, 223-236, 2021 | 21 | 2021 |
Building a controllable expressive speech synthesis system with multiple emotion strengths X Zhu, L Xue Cognitive Systems Research 59, 151-159, 2020 | 19 | 2020 |
ParaTTS: Learning linguistic and prosodic cross-sentence information in paragraph-based TTS L Xue, FK Soong, S Zhang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2854-2864, 2022 | 14 | 2022 |
A comparison of expressive speech synthesis approaches based on neural network L Xue, X Zhu, X An, L Xie Proceedings of the Joint Workshop of the 4th Workshop on Affective Social …, 2018 | 7 | 2018 |
Expressive-VC: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |
Leveraging content-based features from multiple acoustic models for singing voice conversion X Zhang, Y Gu, H Chen, Z Fang, L Zou, L Xue, Z Wu arXiv preprint arXiv:2310.11160, 2023 | 4 | 2023 |
A Kullback-Leibler divergence based recurrent mixture density network for acoustic modeling in emotional statistical parametric speech synthesis X An, Y Zhang, B Liu, L Xue, L Xie Proceedings of the Joint Workshop of the 4th Workshop on Affective Social …, 2018 | 3 | 2018 |
Multi-scale sub-band constant-Q transform discriminator for high-fidelity vocoder Y Gu, X Zhang, L Xue, Z Wu ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit X Zhang, L Xue, Y Wang, Y Gu, X Chen, Z Fang, H Chen, L Zou, C Wang, ... arXiv preprint arXiv:2312.09911, 2023 | 2 | 2023 |
An initial investigation of neural replay simulator for over-the-air adversarial perturbations to automatic speaker verification J Li, L Wang, L Xue, L Wang, Z Wu ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
ChatMusician: Understanding and Generating Music Intrinsically with LLM R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ... arXiv preprint arXiv:2402.16153, 2024 | 1 | 2024 |
Multi-level temporal-channel speaker retrieval for robust zero-shot voice conversion Z Wang, L Xue, Q Kong, L Xie, Y Chen, Q Tian, Y Wang arXiv preprint arXiv:2305.07204, 2023 | 1 | 2023 |
Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers L Xue, S Yang, N Hu, D Su, L Xie arXiv preprint arXiv:2207.00756, 2022 | 1 | 2022 |
SponTTS: modeling and transferring spontaneous style for TTS H Li, X Zhu, L Xue, Y Song, Y Chen, L Xie ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion L Xue, C Wang, M Wang, X Zhang, J Han, Z Wu arXiv preprint arXiv:2402.12660, 2024 | | 2024 |
HIGNN-TTS: Hierarchical Prosody Modeling With Graph Neural Networks for Expressive Long-Form TTS D Guo, X Zhu, L Xue, T Li, Y Lv, Y Jiang, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | | 2023 |