Diff-tts: A denoising diffusion model for text-to-speech M Jeong, H Kim, SJ Cheon, BJ Choi, NS Kim arXiv preprint arXiv:2104.01409, 2021 | 156 | 2021 |
Softflow: Probabilistic framework for normalizing flow on manifolds H Kim, H Lee, WH Kang, JY Lee, NS Kim Advances in Neural Information Processing Systems 33, 16388-16397, 2020 | 104 | 2020 |
EdiTTS: Score-based editing for controllable text-to-speech J Tae, H Kim, T Kim arXiv preprint arXiv:2110.02584, 2021 | 24 | 2021 |
NANSY++: Unified voice synthesis with neural analysis and synthesis HS Choi, J Yang, J Lee, H Kim arXiv preprint arXiv:2211.09407, 2022 | 22 | 2022 |
MLP singer: Towards rapid parallel Korean singing voice synthesis J Tae, H Kim, Y Lee 2021 IEEE 31st International Workshop on Machine Learning for Signal …, 2021 | 19 | 2021 |
WaveNODE: A continuous normalizing flow for speech synthesis H Kim, H Lee, WH Kang, SJ Cheon, BJ Choi, NS Kim arXiv preprint arXiv:2006.04598, 2020 | 13 | 2020 |
Gated recurrent context: Softmax-free attention for online encoder-decoder speech recognition H Lee, WH Kang, SJ Cheon, H Kim, NS Kim IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 710-719, 2021 | 7 | 2021 |
Robust Front-End for Multi-Channel ASR using Flow-Based Density Estimation H Kim, H Lee, WH Kang, HY Kim, NS Kim Twenty-Ninth International Joint Conference on Artificial Intelligence, 3744 …, 2020 | 3 | 2020 |
Towards trustworthy phoneme boundary detection with autoregressive model and improved evaluation metric H Kim, HS Choi ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Continuous monitoring of blood pressure with evidential regression H Kim, WH Kang, H Lee, NS Kim arXiv preprint arXiv:2102.03542, 2021 | 2 | 2021 |