Target-speaker voice activity detection: a novel approach for multi-speaker diarization in a dinner party scenario I Medennikov, M Korenevsky, T Prisyach, Y Khokhlov, M Korenevskaya, ... arXiv preprint arXiv:2005.07272, 2020 | 178 | 2020 |
You do not need more data: Improving end-to-end speech recognition by text-to-speech data augmentation A Laptev, R Korostik, A Svischev, A Andrusenko, I Medennikov, S Rybin 2020 13th International Congress on Image and Signal Processing, BioMedical …, 2020 | 66 | 2020 |
The STC system for the CHiME-6 challenge I Medennikov, M Korenevsky, T Prisyach, Y Khokhlov, M Korenevskaya, ... CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020 | 60 | 2020 |
Towards a competitive end-to-end speech recognition for CHiME-6 dinner party transcription A Andrusenko, A Laptev, I Medennikov arXiv preprint arXiv:2004.10799, 2020 | 20 | 2020 |
Dynamic acoustic unit augmentation with BPE-dropout for low-resource end-to-end speech recognition A Laptev, A Andrusenko, I Podluzhny, A Mitrofanov, I Medennikov, ... Sensors 21 (9), 3063, 2021 | 16 | 2021 |
R-Vectors: New Technique for Adaptation to Room Acoustics. YY Khokhlov, A Zatvornitskiy, I Medennikov, I Sorokin, T Prisyach, ... INTERSPEECH, 1243-1247, 2019 | 13 | 2019 |
Uconv-Conformer: High reduction of input sequence length for end-to-end speech recognition A Andrusenko, R Nasretdinov, A Romanenko ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
The STC ASR System for the VOiCES from a Distance Challenge 2019. I Medennikov, YY Khokhlov, A Romanenko, I Sorokin, A Mitrofanov, ... INTERSPEECH, 2453-2457, 2019 | 10 | 2019 |
Exploration of end-to-end ASR for OpenSTT – Russian open speech-to-text dataset A Andrusenko, A Laptev, I Medennikov Speech and Computer: 22nd International Conference, SPECOM 2020, St …, 2020 | 9 | 2020 |
SALM: Speech-augmented language model with in-context learning for speech recognition and translation Z Chen, H Huang, A Andrusenko, O Hrinchuk, KC Puvvada, J Li, S Ghosh, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Improving out-of-vocabulary word recognition accuracy for an end-to-end Russian speech recognition system AY Andrusenko, AN Romanenko Scientific and Technical Journal of Information Technologies, Mechanics and Optics 22 …, 2022 | 2 | 2022 |
LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring A Mitrofanov, M Korenevskaya, I Podluzhny, Y Khokhlov, A Laptev, ... arXiv preprint arXiv:2104.02526, 2021 | 2 | 2021 |