Hierarchical multi-grained generative model for expressive speech synthesis Y Hono, K Tsuboi, K Sawada, K Hashimoto, K Oura, Y Nankaku, ... arXiv preprint arXiv:2009.08474, 2020 | 25 | 2020 |
The nitech text-to-speech system for the blizzard challenge 2016 K Sawada, C Asai, K Hashimoto, K Oura, K Tokuda Blizzard Challenge 2016 Workshop, 2016 | 14 | 2016 |
End-to-end text-to-speech based on latent representation of speaking styles using spontaneous dialogue K Mitsui, T Zhao, K Sawada, Y Hono, Y Nankaku, K Tokuda arXiv preprint arXiv:2206.12040, 2022 | 13 | 2022 |
Overview of NITECH HMM-based text-to-speech system for Blizzard Challenge 2014 K Sawada, S Takaki, K Hashimoto, K Oura, K Tokuda Blizzard Challenge 2014 Workshop, 2014 | 10 | 2014 |
Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2012 S Takaki, K Sawada, K Hashimoto, K Oura, K Tokuda Blizzard Challenge 2012 Workshop, 2012 | 10 | 2012 |
日本語自然言語処理における事前学習モデルの公開 趙天雨, 沢田慶 人工知能学会研究会資料 言語・音声理解と対話処理研究会 第 93 回 (2021/11), 169-170, 2021 | 9* | 2021 |
A Bayesian approach to image recognition based on separable lattice hidden Markov models K Sawada, A Tamamori, K Hashimoto, Y Nankaku, K Tokuda IEICE TRANSACTIONS on Information and Systems 99 (12), 3119-3131, 2016 | 8 | 2016 |
Face recognition based on separable lattice 2-D HMMS using variational bayesian method K Sawada, A Tamamori, K Hashimoto, Y Nankaku, K Tokuda 2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012 | 6 | 2012 |
A bayesian framework for image recognition based on hidden markov eigen‐image models K Sawada, K Hashimoto, Y Nankaku, K Tokuda IEEJ Transactions on Electrical and Electronic Engineering 13 (9), 1335-1347, 2018 | 5 | 2018 |
An integration of pre-trained speech and language models for end-to-end speech recognition Y Hono, K Mitsuda, T Zhao, K Mitsui, T Wakatsuki, K Sawada arXiv preprint arXiv:2312.03668, 2023 | 4 | 2023 |
UniFLG: Unified Facial Landmark Generator from Text or Speech K Mitsui, Y Hono, K Sawada arXiv preprint arXiv:2302.14337, 2023 | 4 | 2023 |
Image recognition based on discriminative models using features generated from separable lattice HMMS Y Tsuzuki, K Sawada, K Hashimoto, Y Nankaku, K Tokuda 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 4 | 2017 |
The blizzard machine learning challenge 2017 K Sawada, K Tokuda, S King, AW Black 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 3 | 2017 |
The NITECH HMM-based text-to-speech system for the Blizzard Challenge 2015 K Sawada, K Hashimoto, K Oura, K Tokuda Proc. Blizzard Challenge 2015 Workshop, 2015 | 3 | 2015 |
Image recognition based on hidden Markov eigen-image models using variational Bayesian method K Sawada, K Hashimoto, Y Nankaku, K Tokuda 2013 Asia-Pacific Signal and Information Processing Association Annual …, 2013 | 3 | 2013 |
Focused prefix tuning for controllable text generation C Ma, T Zhao, M Shing, K Sawada, M Okumura Journal of Natural Language Processing 31 (1), 250-265, 2024 | 2 | 2024 |
Towards human-like spoken dialogue generation between AI agents from written dialogue K Mitsui, Y Hono, K Sawada arXiv preprint arXiv:2310.01088, 2023 | 2 | 2023 |
Backchannel Generation Model for a Third Party Listener Agent D Lala, K Inoue, T Kawahara, K Sawada Proceedings of the 10th International Conference on Human-Agent Interaction …, 2022 | 2 | 2022 |
Singing Voice Conversion Using Posted Waveform Data on Music social media K Senda, Y Hono, K Sawada, K Hashimoto, K Oura, Y Nankaku, K Tokuda 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 2 | 2018 |
Image recognition based on convolutional neural networks using features generated from separable lattice hidden Markov models T Kasugai, Y Tsuzuki, K Sawada, K Hashimoto, K Oura, Y Nankaku, ... 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 2 | 2018 |