Discriminative and Correlative Partial Multi-Label Learning. H Wang, W Liu, Y Zhao, C Zhang, T Hu, G Chen IJCAI, 3691-3697, 2019 | 84 | 2019 |
SimulSpeech: End-to-end simultaneous speech to text translation Y Ren, J Liu, X Tan, C Zhang, T Qin, Z Zhao, TY Liu Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020 | 62 | 2020 |
Uwspeech: Speech to speech translation for unwritten languages C Zhang, X Tan, Y Ren, T Qin, K Zhang, TY Liu AAAI 2021, 2020 | 43 | 2020 |
Denoispeech: Denoising text to speech with frame-level noise modeling C Zhang, Y Ren, X Tan, J Liu, K Zhang, T Qin, S Zhao, TY Liu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 42 | 2021 |
Task-level curriculum learning for non-autoregressive neural machine translation J Liu, Y Ren, X Tan, C Zhang, T Qin, Z Zhao, TY Liu IJCAI 2020, 2020 | 34 | 2020 |
S3T: Self-Supervised Pre-training with Swin Transformer for Music Classification H Zhao, C Zhang, B Zhu, Z Ma, K Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 32 | 2022 |
TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method Z Ju, P Lu, X Tan, R Wang, C Zhang, S Wu, K Zhang, X Li, T Qin, TY Liu EMNLP 2022, 2022 | 30 | 2022 |
Mega-tts: Zero-shot text-to-speech at scale with intrinsic inductive bias Z Jiang, Y Ren, Z Ye, J Liu, C Zhang, Q Yang, S Ji, R Huang, C Wang, ... arXiv preprint arXiv:2306.03509, 2023 | 22 | 2023 |
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription C Zhang, J Yu, LC Chang, X Tan, J Chen, T Qin, K Zhang ISMIR 2022, 2021 | 16 | 2021 |
FastLR: Non-autoregressive lipreading model with integrate-and-fire J Liu, Y Ren, Z Zhao, C Zhang, B Huai, J Yuan Proceedings of the 28th ACM International Conference on Multimedia, 4328-4336, 2020 | 13 | 2020 |
Make-an-audio 2: Temporal-enhanced text-to-audio generation J Huang, Y Ren, R Huang, D Yang, Z Ye, C Zhang, J Liu, X Yin, Z Ma, ... arXiv preprint arXiv:2305.18474, 2023 | 11 | 2023 |
Automatic Song Translation for Tonal Languages F Guo, C Zhang, Z Zhang, Q He, K Zhang, J Xie, J Boyd-Graber Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022 | 10 | 2022 |
Mega-tts 2: Zero-shot text-to-speech with arbitrary length speech prompts Z Jiang, J Liu, Y Ren, J He, C Zhang, Z Ye, P Wei, C Wang, X Yin, Z Ma, ... arXiv preprint arXiv:2307.07218, 2023 | 9 | 2023 |
Relyme: Improving lyric-to-melody generation by incorporating lyric-melody relationships C Zhang, L Chang, S Wu, X Tan, T Qin, TY Liu, K Zhang Proceedings of the 30th ACM International Conference on Multimedia, 1047-1056, 2022 | 8 | 2022 |
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation C Zhang, Y Ren, K Zhang, S Yan IEEE Transactions on MultiMedia, 2023 | 6 | 2023 |
Songdriver: Real-time music accompaniment generation without logical latency nor exposure bias Z Wang, K Zhang, Y Wang, C Zhang, Q Liang, P Yu, Y Feng, W Liu, ... Proceedings of the 30th ACM International Conference on Multimedia, 1057-1067, 2022 | 5 | 2022 |
Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization Y Zhao, C Zhang, H Huang, H Li, Z Zhao Advances in Neural Information Processing Systems, 2022 | 5 | 2022 |
Real3d-portrait: One-shot realistic 3d talking portrait synthesis Z Ye, T Zhong, Y Ren, J Yang, W Li, J Huang, Z Jiang, J He, R Huang, ... arXiv preprint arXiv:2401.08503, 2024 | 3 | 2024 |
Bag of tricks for unsupervised text-to-speech Y Ren, C Zhang, YAN Shuicheng The Eleventh International Conference on Learning Representations, 2022 | 2 | 2022 |
C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model L Ji, P Wei, Y Ren, J Liu, C Zhang, X Yin arXiv preprint arXiv:2308.15016, 2023 | 1 | 2023 |