Improving massively multilingual ASR with auxiliary CTC objectives W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 23 | 2023 |
Generating multilingual voices using speaker space translation based on bilingual speaker data S Maiti, E Marchi, A Conkie ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 23 | 2020 |
Parametric resynthesis with neural vocoders S Maiti, MI Mandel 2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019 | 22 | 2019 |
End-to-end diarization for variable number of speakers with local-global networks and discriminative speaker embeddings S Maiti, H Erdogan, K Wilson, S Wisdom, S Watanabe, JR Hershey ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 21 | 2021 |
Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement S Maiti, MI Mandel ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 21 | 2020 |
Reducing barriers to self-supervised learning: HuBERT pre-training with academic compute W Chen, X Chang, Y Peng, Z Ni, S Maiti, S Watanabe arXiv preprint arXiv:2306.06672, 2023 | 16 | 2023 |
SpeechLMScore: Evaluating speech generation using speech language model S Maiti, Y Peng, T Saeki, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 16 | 2023 |
EEND-SS: Joint end-to-end neural speaker diarization and speech separation for flexible number of speakers S Maiti, Y Ueda, S Watanabe, C Zhang, M Yu, SX Zhang, Y Xu 2022 IEEE Spoken Language Technology Workshop (SLT), 480-487, 2023 | 16 | 2023 |
Speech denoising by parametric resynthesis S Maiti, MI Mandel ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 12 | 2019 |
Reproducing Whisper-style training using an open-source toolkit and publicly available data Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 11 | 2023 |
VoxtLM: Unified Decoder-Only Models for Consolidating Speech Recognition, Synthesis and Speech, Text Continuation Tasks S Maiti, Y Peng, S Choi, J Jung, X Chang, S Watanabe ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 10 | 2024 |
Predicting interaction quality in customer service dialogs S Stoyanchev, S Maiti, S Bangalore Advanced Social Interaction with Agents: 8th International Workshop on …, 2018 | 8 | 2018 |
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 7 | 2024 |
Unsupervised data selection for TTS: using Arabic Broadcast News as a case study M Baali, T Hayashi, H Mubarak, S Maiti, S Watanabe, W El-Hajj, A Ali arXiv preprint arXiv:2301.09099, 2023 | 7 | 2023 |
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner. Y Ju, I Kim, H Yang, JH Kim, B Kim, S Maiti, S Watanabe INTERSPEECH, 16-20, 2022 | 7 | 2022 |
Learning to speak from text: Zero-shot multilingual text-to-speech with unsupervised text pretraining T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari arXiv preprint arXiv:2301.12596, 2023 | 6 | 2023 |
Large Vocabulary Concatenative Resynthesis. S Maiti, J Ching, MI Mandel INTERSPEECH, 1190-1194, 2018 | 6 | 2018 |
Concatenative Resynthesis Using Twin Networks. S Maiti, MI Mandel INTERSPEECH, 3647-3651, 2017 | 6 | 2017 |
CMU’s IWSLT 2023 simultaneous speech translation system B Yan, J Shi, S Maiti, W Chen, X Li, Y Peng, S Arora, S Watanabe Proceedings of the 20th International Conference on Spoken Language …, 2023 | 5 | 2023 |
FindAdaptNet: Find and insert adapters by learned layer importance J Huang, K Ganesan, S Maiti, YM Kim, X Chang, P Liang, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |