Follow
Ian Lane
Ian Lane
Verified email at andrew.cmu.edu
Title
Cited by
Year
Lexicon development via shared translation database
A Waibel, IR Lane
US Patent 11,972,227, 2024
12024
Don't Believe Everything You Read: Enhancing Summarization Interpretability through Automatic Identification of Hallucinations in Large Language Models
P Vakharia, D Joshi, M Chavan, D Sonawane, B Garg, P Mazaheri, I Lane
arXiv preprint arXiv:2312.14346, 2023
2023
Online continual learning of end-to-end speech recognition models
M Yang, I Lane, S Watanabe
arXiv preprint arXiv:2207.05071, 2022
202022
Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding
Y Peng, S Dalmia, I Lane, S Watanabe
International Conference on Machine Learning, 17627-17643, 2022
922022
Lexicon development via shared translation database
A Waibel, IR Lane
US Patent 11,222,185, 2022
142022
Human-Agent Collaboration Strategies for Vision-Grounded Instruction Following
GL Chao, I Lane
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
2021
Identifying actions for sound event classification
B Elizalde, R Revutchi, S Das, B Raj, I Lane, LM Heller
2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021
62021
System and method for audio-visual speech recognition
IR Lane
US Patent 10,964,326, 2021
42021
System and Method for Face Detection and Landmark Localization
IR Lane, B Yu
US Patent App. 17/063,601, 2021
2021
Never-ending learning of sounds
BM Elizalde
Carnegie Mellon University, 2020
192020
System and method for multi-user GPU-accelerated speech recognition engine for client-server architectures
IR Lane, J Kim
US Patent 10,453,445, 2019
52019
Audio-visual TED corpus: enhancing the TED-LIUM corpus with facial information, contextual text and object recognition
GL Chao, CC Hu, B Liu, JP Shen, I Lane
Adjunct Proceedings of the 2019 ACM International Joint Conference on …, 2019
42019
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
GL Chao, A Rastogi, S Yavuz, D Hakkani-Tür, J Chen, I Lane
SigDial 2019, 2019
42019
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer
GL Chao, I Lane
Interspeech 2019, 2019
1222019
Deep speaker embedding for speaker-targeted automatic speech recognition
GL Chao, JP Shen, I Lane
Proceedings of the 2019 3rd International Conference on Natural Language …, 2019
22019
Low-distraction Interaction
I Lane, T Selker, R Rajan
Technologies for Safe and Efficient Transportation. University …, 2019
2019
AudioPairBank: towards a large-scale tag-pair-based audio content analysis
S Säger, B Elizalde, D Borth, C Schulze, B Raj, I Lane
EURASIP Journal on Audio, Speech, and Music Processing 2018, 1-12, 2018
62018
Understanding and improving recurrent networks for human activity recognition by continuous attention
M Zeng, H Gao, T Yu, OJ Mengshoel, H Langseth, I Lane, X Liu
Proceedings of the 2018 ACM international symposium on wearable computers, 56-63, 2018
1702018
End-to-end learning of task-oriented dialogs
B Liu, I Lane
Proceedings of the 2018 Conference of the North American Chapter of the …, 2018
462018
Adversarial learning of task-oriented neural dialog models
B Liu, I Lane
arXiv preprint arXiv:1805.11762, 2018
392018
The system can't perform the operation now. Try again later.
Articles 1–20