Recent developments on espnet toolkit boosted by conformer P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 265 | 2021 |
Espnet-slu: Advancing spoken language understanding through espnet S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 66 | 2022 |
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification. J Villalba, Y Zhang, N Dehak Interspeech, 4233-4237, 2020 | 49 | 2020 |
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples. Y Zhang, Z Jiang, J Villalba, N Dehak Interspeech, 4238-4242, 2020 | 45 | 2020 |
Spgispeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition PK O'Neill, V Lavrukhin, S Majumdar, V Noroozi, Y Zhang, O Kuchaiev, ... arXiv preprint arXiv:2104.02014, 2021 | 36 | 2021 |
Tiny transducer: A highly-efficient speech recognition model on edge devices Y Zhang, S Sun, L Ma ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 29 | 2021 |
Sequence-to-sequence singing voice synthesis with perceptual entropy loss J Shi, S Guo, N Huo, Y Zhang, Q Jin ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 19 | 2021 |
Trimtail: Low-latency streaming asr with simple but effective spectrogram-level length penalty X Song, D Wu, Z Wu, B Zhang, Y Zhang, Z Peng, W Li, F Pan, C Zhu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch J Hwang, M Hira, C Chen, X Zhang, Z Ni, G Sun, P Ma, R Huang, V Pratap, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-9, 2023 | 3 | 2023 |