| Title | Authors | Venue | Cited by | Year |
| --- | --- | --- | --- | --- |
| Emotional speech synthesis with rich and granularized control | SY Um, S Oh, K Byun, I Jang, CH Ahn, HG Kang | ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 92 | 2020 |
| LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks | HK Nguyen, K Jeong, SY Um, MJ Hwang, E Song, HG Kang | Interspeech, 3595-3599, 2021 | 7 | 2021 |
| SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems | H Yoon, C Kim, S Um, HW Yoon, HG Kang | IEEE Signal Processing Letters, 2023 | 4 | 2023 |
| Light-weight speaker verification with global context information | M Kim, Z Piao, S Um, R Lee, J Joh, S Lee, HG Kang | Proceedings of the Annual Conference of the International Speech …, 2022 | 4 | 2022 |
| Facetron: A Multi-Speaker Face-to-Speech Model Based on Cross-Modal Latent Representations | S Um, J Kim, J Lee, HG Kang | 2023 31st European Signal Processing Conference (EUSIPCO), 281-285, 2023 | 2 | 2023 |
| FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS | C Kim, S Um, H Yoon, HG Kang | INTERSPEECH, 4561-4565, 2022 | 2 | 2022 |
| Recab-vae: Gumbel-softmax variational inference based on analytic divergence | S Oh, S Um, HG Kang | arXiv preprint arXiv:2205.04104, 2022 | 1 | 2022 |
| Determination of representative emotional style of speech based on k-means algorithm | S Oh, SY Um, I Jang, CH Ahn, HG Kang | The Journal of the Acoustical Society of Korea 38 (5), 614-620, 2019 | 1 | 2019 |
| SC-ERM: Speaker-Centric Learning for Speech Emotion Recognition | J Yoon, S Um, WJ Chung, HG Kang | 2024 International Conference on Electronics, Information, and Communication …, 2024 | | 2024 |
| Consideration of Varying Training Lengths for Short-Duration Speaker Verification | WS Ko, S Um, Z Piao, HG Kang | 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | | 2023 |
| Length-Normalized Representation Learning for Speech Signals | K Byun, S Um, HG Kang | IEEE Access 10, 60362-60372, 2022 | | 2022 |
| AILTTS: Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech | H Yoon, S Um, C Kim, HG Kang | arXiv preprint arXiv:2204.02172, 2022 | | 2022 |
| A study on the improvement of generation speed and speech quality for a granularized emotional speech synthesis system | SY Um, S Oh, I Jang, C Ahn, HG Kang | Proceedings of the Korean Society of Broadcast Engineers Conference, 453-455, 2020 | | 2020 |
| Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech | H Yoon, S Um, C Kim, HG Kang | | | |