Alexander H. Liu
Verified email at mit.edu
Title
Cited by
Year
A unified feature disentangler for multi-domain image translation and manipulation
AH Liu, YC Liu, YY Yeh, YCF Wang
Advances in neural information processing systems 31, 2018
Cited by: 387, Year: 2018
Towards scene understanding: Unsupervised monocular depth estimation with semantic-aware representation
PY Chen, AH Liu, YC Liu, YCF Wang
Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2019
Cited by: 255, Year: 2019
Contrastive audio-visual masked autoencoder
Y Gong, A Rouditchenko, AH Liu, D Harwath, L Karlinsky, H Kuehne, ...
arXiv preprint arXiv:2210.07839, 2022
Cited by: 92, Year: 2022
Non-autoregressive predictive coding for learning speech representations from local dependencies
AH Liu, YA Chung, J Glass
arXiv preprint arXiv:2011.00406, 2020
Cited by: 89, Year: 2020
Listen, think, and understand
Y Gong, H Luo, AH Liu, L Karlinsky, J Glass
arXiv preprint arXiv:2305.10790, 2023
Cited by: 68, Year: 2023
Towards end-to-end unsupervised speech recognition
AH Liu, WN Hsu, M Auli, A Baevski
2022 IEEE Spoken Language Technology Workshop (SLT), 221-228, 2023
Cited by: 65, Year: 2023
Spoken moments: Learning joint audio-visual representations from video descriptions
M Monfort, SY Jin, A Liu, D Harwath, R Feris, J Glass, A Oliva
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 64, Year: 2021
PARP: Prune, adjust and re-prune for self-supervised speech recognition
CIJ Lai, Y Zhang, AH Liu, S Chang, YL Liao, YS Chuang, K Qian, ...
Advances in Neural Information Processing Systems 34, 21256-21272, 2021
Cited by: 60, Year: 2021
Towards unsupervised speech recognition and synthesis with quantized speech representation learning
AH Liu, T Tu, H Lee, L Lee
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 53, Year: 2020
Adversarial training of end-to-end speech recognition using a criticizing language model
AH Liu, H Lee, L Lee
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 51, Year: 2019
Cross-modal discrete representation learning
AH Liu, SY Jin, CIJ Lai, A Rouditchenko, A Oliva, J Glass
arXiv preprint arXiv:2106.05438, 2021
Cited by: 41, Year: 2021
Simple and effective unsupervised speech synthesis
AH Liu, CIJ Lai, WN Hsu, M Auli, A Baevski, J Glass
arXiv preprint arXiv:2204.02524, 2022
Cited by: 17, Year: 2022
Joint audio and speech understanding
Y Gong, AH Liu, H Luo, L Karlinsky, J Glass
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by: 16, Year: 2023
UAVM: Towards unifying audio and visual models
Y Gong, AH Liu, A Rouditchenko, J Glass
IEEE Signal Processing Letters 29, 2437-2441, 2022
Cited by: 15, Year: 2022
Improving automatic speech recognition and speech translation via word embedding prediction
SP Chuang, AH Liu, TW Sung, H Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 93-105, 2020
Cited by: 15, Year: 2020
Worse WER, but better BLEU? Leveraging word embedding as intermediate in multitask end-to-end speech translation
SP Chuang, TW Sung, AH Liu, H Lee
arXiv preprint arXiv:2005.10678, 2020
Cited by: 15, Year: 2020
Generative pre-training for speech with flow matching
AH Liu, M Le, A Vyas, B Shi, A Tjandra, WN Hsu
arXiv preprint arXiv:2310.16338, 2023
Cited by: 11, Year: 2023
Sequence-to-sequence automatic speech recognition with word embedding regularization and fused decoding
AH Liu, TW Sung, SP Chuang, H Lee, L Lee
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 11, Year: 2020
DinoSR: Self-distillation and online clustering for self-supervised speech representation learning
AH Liu, HJ Chang, M Auli, WN Hsu, J Glass
Advances in Neural Information Processing Systems 36, 2024
Cited by: 10, Year: 2024
End-to-end whispered speech recognition with frequency-weighted approaches and pseudo whisper pre-training
HJ Chang, AH Liu, H Lee, L Lee
2021 IEEE Spoken Language Technology Workshop (SLT), 186-193, 2021
Cited by: 10, Year: 2021