HierSpeech: Bridging the gap between text and speech by hierarchical variational inference using self-supervised representations for speech synthesis SH Lee, SB Kim, JH Lee, E Song, MJ Hwang, SW Lee Advances in Neural Information Processing Systems 35, 16624-16636, 2022 | 27 | 2022 |
Emoq-tts: Emotion intensity quantization for fine-grained controllable emotional text-to-speech CB Im, SH Lee, SB Kim, SW Lee ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 25 | 2022 |
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis SH Lee, HY Choi, SB Kim, SW Lee arXiv preprint arXiv:2311.12454, 2023 | 1 | 2023 |
TranSentence: Speech-to-speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel Data SB Kim, SH Lee, SW Lee arXiv preprint arXiv:2401.12992, 2024 | | 2024 |
Audio Super-Resolution With Robust Speech Representation Learning of Masked Autoencoder SB Kim, SH Lee, HY Choi, SW Lee IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | | 2024 |