Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator J Pinto, S Garimella, M Magimai-Doss, H Hermansky, H Bourlard Audio, Speech, and Language Processing, IEEE Transactions on 19 (2), 225-241, 2011 | 108 | 2011 |
Sparse multilayer perceptron for phoneme recognition GSVS Sivaram, H Hermansky IEEE Transactions on Audio, Speech, and Language Processing 20 (1), 23-29, 2011 | 96 | 2011 |
Sparse coding for speech recognition GSVS Sivaram, SK Nemala, M Elhilali, TD Tran, H Hermansky 2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010 | 93 | 2010 |
A design methodology for selection and placement of sensors in multimedia surveillance systems G Sivaram, KR Ramakrishnan, PK Atrey, VK Singh, MS Kankanhalli Proceedings of the 4th ACM international workshop on Video surveillance and …, 2006 | 65 | 2006 |
Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop G Zweig, P Nguyen, D Van Compernolle, K Demuynck, L Atlas, P Clark, ... 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 64 | 2011 |
Robust i-vector based adaptation of DNN acoustic model for speech recognition S Garimella, A Mandal, N Ström, B Hoffmeister, S Matsoukas, ... | 62 | 2015 |
fMLLR based feature-space speaker adaptation of DNN acoustic models SHK Parthasarathi, B Hoffmeister, S Matsoukas, A Mandal, N Ström, ... | 45 | 2015 |
Improving ASR confidence scores for Alexa using acoustic and hypothesis embeddings P Swarup, R Maas, S Garimella, SH Mallidi, B Hoffmeister | 44 | 2019 |
Streaming end-to-end bilingual ASR systems with joint language identification S Punjabi, H Arsikere, Z Raeesy, C Chandak, N Bhave, A Bansal, ... arXiv preprint arXiv:2007.03900, 2020 | 24 | 2020 |
Multilayer perceptron with sparse hidden outputs for phoneme recognition GSVS Sivaram, H Hermansky 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 24 | 2011 |
Data-driven and feedback based spectro-temporal features for speech recognition GSVS Sivaram, SK Nemala, N Mesgarani, H Hermansky IEEE Signal Processing Letters 17 (11), 957-960, 2010 | 21 | 2010 |
Joint ASR and language identification using RNN-T: An efficient approach to dynamic language switching S Punjabi, H Arsikere, Z Raeesy, C Chandak, N Bhave, A Bansal, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 20 | 2021 |
Design of multimedia surveillance systems GSVS Sivaram, MS Kankanhalli, KR Ramakrishnan ACM Transactions on Multimedia Computing, Communications, and Applications …, 2009 | 19 | 2009 |
Mixture of Auto-Associative Neural Networks for Speaker Verification. GSVS Sivaram, S Thomas, H Hermansky INTERSPEECH, 2381-2384, 2011 | 18 | 2011 |
Multi-dialect acoustic modeling using phone mapping and online i-vectors H Arsikere, A Sapru, S Garimella | 17 | 2019 |
Factor analysis of auto-associative neural networks with application in speaker verification S Garimella, H Hermansky IEEE Transactions on Neural Networks and Learning Systems 24 (4), 522-528, 2013 | 16 | 2013 |
Generative modeling of speech using neural networks S Matsoukas, N Ström, A Rastrow, SVSSR Krishna US Patent 9,653,093, 2017 | 15 | 2017 |
Markov-based sequence tagging using neural networks A Rastrow, S Matsoukas, SVSSR Krishna, N Ström, B Hoffmeister US Patent 9,600,764, 2017 | 12 | 2017 |
Regularized auto-associative neural networks for speaker verification S Garimella, SH Mallidi, H Hermansky IEEE Signal Processing Letters 19 (12), 841-844, 2012 | 12 | 2012 |
Discriminant spectrotemporal features for phoneme recognition. N Mesgarani, GSVS Sivaram, SK Nemala, M Elhilali, H Hermansky INTERSPEECH, 2983-2986, 2009 | 12 | 2009 |