Follow
Sang-Hoon Lee
Title
Cited by
Cited by
Year
Fre-GAN: Adversarial frequency-consistent audio synthesis
JH Kim, SH Lee, JH Lee, SW Lee
arXiv preprint arXiv:2106.02297, 2021
552021
Multi-spectrogan: High-diversity and high-fidelity spectrogram generation with adversarial style combination for speech synthesis
SH Lee, HW Yoon, HR Noh, JH Kim, SW Lee
Proceedings of the AAAI Conference on Artificial Intelligence 35 (14), 13198 …, 2021
502021
Voicemixer: Adversarial voice style mixup
SH Lee, JH Kim, H Chung, SW Lee
Advances in Neural Information Processing Systems 34, 294-308, 2021
302021
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis
SH Lee, SB Kim, JH Lee, E Song, MJ Hwang, SW Lee
Advances in Neural Information Processing Systems, 2022
272022
Emoq-tts: Emotion intensity quantization for fine-grained controllable emotional text-to-speech
CB Im, SH Lee, SB Kim, SW Lee
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
252022
Duration controllable voice conversion via phoneme-based information bottleneck
SH Lee, HR Noh, WJ Nam, SW Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1173-1183, 2022
192022
Reinforce-aligner: Reinforcement alignment search for robust end-to-end text-to-speech
H Chung, SH Lee, SW Lee
arXiv preprint arXiv:2106.02830, 2021
152021
PVAE-TTS: adaptive text-to-speech via progressive style adaptation
JH Lee, SH Lee, JH Kim, SW Lee
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
112022
Audio dequantization for high fidelity audio generation in flow-based neural vocoder
HW Yoon, SH Lee, HR Noh, SW Lee
arXiv preprint arXiv:2008.06867, 2020
112020
Fre-gan 2: Fast and efficient frequency-consistent audio synthesis
SH Lee, JH Kim, KE Lee, SW Lee
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
102022
GC-TTS: Few-shot speaker adaptation with geometric constraints
JH Kim, SH Lee, JH Lee, HG Jung, SW Lee
2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC …, 2021
72021
Dddm-vc: Decoupled denoising diffusion models with disentangled representation and prior mixup for verified robust voice conversion
HY Choi, SH Lee, SW Lee
Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17862 …, 2024
62024
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training
HS Oh, SH Lee, SW Lee
arXiv preprint arXiv:2307.16549, 2023
42023
Hiddensinger: High-quality singing voice synthesis via neural audio codec and latent diffusion models
JS Hwang, SH Lee, SW Lee
arXiv preprint arXiv:2306.06814, 2023
42023
HierVST: Hierarchical Adaptive Zero-shot Voice Style Transfer
SH Lee, HY Choi, HS Oh, SW Lee
arXiv preprint arXiv:2307.16171, 2023
32023
Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation
HY Choi, SH Lee, SW Lee
Interspeech 2023, 0
2*
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis
SH Lee, HY Choi, SB Kim, SW Lee
arXiv preprint arXiv:2311.12454, 2023
12023
PauseSpeech: Natural Speech Synthesis via Pre-trained Language Model and Pause-Based Prosody Modeling
JS Hwang, SH Lee, SW Lee
Asian Conference on Pattern Recognition, 415-427, 2023
12023
StyleVC: Non-Parallel Voice Conversion with Adversarial Style Generalization
IS Hwang, SH Lee, SW Lee
2022 26th International Conference on Pattern Recognition (ICPR), 23-30, 2022
12022
Midi-Voice: Expressive Zero-Shot Singing Voice Synthesis via Midi-Driven Priors
DM Byun, SH Lee, JS Hwang, SW Lee
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20