Tao Li - Google Scholar

Citations

	All	Since 2019
Citations	169	169
h-index	7	7
i10-index	6	6

Citations per year: 2020: 1, 2021: 13, 2022: 50, 2023: 77, 2024: 26

Open-access publications

2 articles

1 article

Available

Not available

Based on funding mandates

Co-authors

Lei Xie, Northwestern Polytechnical University (verified email at nwpu.edu.cn)
Xinsheng Wang, Xi'an Jiaotong University (verified email at stu.xjtu.edu.cn)
Shan Yang, Tencent AI Lab (verified email at nwpu-aslp.org)

Tao Li

Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science

Verified email at npu-aslp.org

Research interests: speech synthesis, prosody transfer, diffusion model, representation learning


Title	Authors	Venue	Cited by	Year
Controllable emotion transfer for end-to-end speech synthesis	T Li, S Yang, L Xue, L Xie	2021 12th International Symposium on Chinese Spoken Language Processing …, 2021	71	2021
Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis	T Li, X Wang, Q Xie, Z Wang, L Xie	IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1448-1460, 2022	31*	2022
Enriching source style transfer in recognition-synthesis based non-parallel voice conversion	Z Wang, X Zhou, F Yang, T Li, H Du, L Xie, W Gan, H Chen, H Li	arXiv preprint arXiv:2106.08741, 2021	18	2021
One-shot voice conversion for style transfer based on speaker adaptation	Z Wang, Q Xie, T Li, H Du, L Xie, P Zhu, M Bi	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	13	2022
Multi-speaker multi-style text-to-speech synthesis with single-speaker single-style training data scenarios	Q Xie, T Li, X Wang, Z Wang, L Xie, G Yu, G Wan	2022 13th International Symposium on Chinese Spoken Language Processing …, 2022	11	2022
Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis	T Li, X Wang, Q Xie, Z Wang, M Jiang, L Xie	arXiv preprint arXiv:2207.01198, 2022	10	2022
Multi-speaker expressive speech synthesis via multiple factors decoupling	X Zhu, Y Lei, K Song, Y Zhang, T Li, L Xie	ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	8	2023
Vec-Tok Speech: speech vectorization and tokenization for neural speech generation	X Zhu, Y Lv, Y Lei, T Li, W He, H Zhou, H Lu, L Xie	arXiv preprint arXiv:2310.07246, 2023	3	2023
DiCLET-TTS: Diffusion model based cross-lingual emotion transfer for text-to-speech - A study between English and Mandarin	T Li, C Hu, J Cong, X Zhu, J Li, Q Tian, Y Wang, L Xie	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023	3	2023
MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style Modeling	Z Wang, X Wang, Q Xie, T Li, L Xie, Q Tian, Y Wang	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023	1	2023
Improving Multi-Speaker ASR With Overlap-Aware Encoding And Monotonic Attention	T Li, F Wang, W Guan, L Huang, Q Hong, L Li	ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
MM-TTS: Multi-Modal Prompt Based Style Transfer for Expressive Text-to-Speech Synthesis	W Guan, Y Li, T Li, H Huang, F Wang, J Lin, L Huang, L Li, Q Hong	Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 18117 …, 2024		2024
METTS: Multilingual Emotional Text-to-Speech by Cross-Speaker and Cross-Lingual Emotion Transfer	X Zhu, Y Lei, T Li, Y Zhang, H Zhou, H Lu, L Xie	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024		2024
HIGNN-TTS: Hierarchical Prosody Modeling With Graph Neural Networks for Expressive Long-Form TTS	D Guo, X Zhu, L Xue, T Li, Y Lv, Y Jiang, L Xie	2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023		2023
CASA-Net: Cross-attention and Self-attention for End-to-End Audio-visual Speaker Diarization	H Zhou, T Li, J Wang, L Li, Q Hong	2023 Asia Pacific Signal and Information Processing Association Annual …, 2023		2023
U-Style: Cascading U-nets with Multi-level Speaker and Style Modeling for Zero-Shot Voice Cloning	T Li, Z Wang, X Zhu, J Cong, Q Tian, Y Wang, L Xie	arXiv preprint arXiv:2310.04004, 2023		2023
A Pipelined Framework with Serialized Output Training for Overlapping Speech Recognition	T Li, L Huang, F Wang, S Li, Q Hong, L Li	National Conference on Man-Machine Speech Communication, 114-123, 2022		2022

Articles 1–17