Joint time-frequency and time domain learning for speech enhancement C Tang, C Luo, Z Zhao, W Xie, W Zeng Proceedings of the twenty-ninth international conference on international …, 2021 | 67 | 2021 |
RetrieverTTS: Modeling decomposed factors for text-based speech insertion D Yin, C Tang, Y Liu, X Wang, Z Zhao, Y Zhao, Z Xiong, S Zhao, C Luo arXiv preprint arXiv:2206.13865, 2022 | 10 | 2022 |
Zero-shot text-to-speech for text-based insertion in audio narration C Tang, C Luo, Z Zhao, D Yin, Y Zhao, W Zeng arXiv preprint arXiv:2109.05426, 2021 | 8 | 2021 |
TridentSE: Guiding speech enhancement with 32 global tokens D Yin, Z Zhao, C Tang, Z Xiong, C Luo arXiv preprint arXiv:2210.12995, 2022 | 7 | 2022 |
General-purpose speech representation learning through a self-supervised multi-granularity framework Y Zhao, D Yin, C Luo, Z Zhao, C Tang, W Zeng, ZJ Zha arXiv preprint arXiv:2102.01930, 2021 | 7 | 2021 |
An anchor-free detector for continuous speech keyword spotting Z Zhao, C Tang, C Yao, C Luo arXiv preprint arXiv:2208.04622, 2022 | 2 | 2022 |
ARTV: Auto-Regressive Text-to-Video Generation with Diffusion Models W Weng, R Feng, Y Wang, Q Dai, C Wang, D Yin, Z Zhao, K Qiu, J Bao, ... arXiv preprint arXiv:2311.18834, 2023 | | 2023 |
Speech enhancement T Chuanxin, Z Zhao, C Luo, W Zeng US Patent App. 17/927,861, 2023 | | 2023 |
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss Z Zhao, L Wu, C Tang, D Yin, Y Zhao, C Luo ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |