Follow
Linhao Dong
Linhao Dong
Bytedance AI-Lab
Verified email at bytedance.com
Title
Cited by
Cited by
Year
Speech-transformer: a no-recurrence sequence-to-sequence model for speech recognition
L Dong, S Xu, B Xu
2018 IEEE international conference on acoustics, speech and signal …, 2018
10932018
Syllable-based sequence-to-sequence speech recognition with the transformer in mandarin chinese
S Zhou, L Dong, S Xu, B Xu
arXiv preprint arXiv:1804.10752, 2018
1322018
Cif: Continuous integrate-and-fire for end-to-end speech recognition
L Dong, B Xu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1102020
Self-attention aligner: A latency-control end-to-end model for asr using self-attention network and chunk-hopping
L Dong, F Wang, B Xu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
892019
A comparison of modeling units in sequence-to-sequence speech recognition with the transformer on mandarin chinese
S Zhou, L Dong, S Xu, B Xu
International Conference on Neural Information Processing, 210-220, 2018
642018
Extending recurrent neural aligner for streaming end-to-end speech recognition in mandarin
L Dong, S Zhou, W Chen, B Xu
arXiv preprint arXiv:1806.06342, 2018
352018
Improving end-to-end contextual speech recognition with fine-grained contextual knowledge selection
M Han, L Dong, Z Liang, M Cai, S Zhou, Z Ma, B Xu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
312022
Cif-based collaborative decoding for end-to-end contextual speech recognition
M Han, L Dong, S Zhou, B Xu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
202021
A comparison of label-synchronous and frame-synchronous end-to-end models for speech recognition
L Dong, C Yi, J Wang, S Zhou, S Xu, X Jia, B Xu
arXiv preprint arXiv:2005.10113, 2020
152020
Boosting Character-Based Chinese Speech Synthesis via Multi-Task Learning and Dictionary Tutoring.
Y Zou, L Dong, B Xu
INTERSPEECH, 2055-2059, 2019
52019
Sequence-level speaker change detection with difference-based continuous integrate-and-fire
Z Fan, L Dong, M Cai, Z Ma, B Xu
IEEE Signal Processing Letters 29, 1551-1554, 2022
42022
Syllable-based acoustic modeling with CTC for multi-scenarios Mandarin speech recognition
Y Zhao, L Dong, S Xu, B Xu
2018 International Joint Conference on Neural Networks (IJCNN), 1-8, 2018
42018
Language-specific acoustic boundary learning for mandarin-english code-switching speech recognition
Z Fan, L Dong, C Shen, Z Liang, J Zhang, L Lu, Z Ma
arXiv preprint arXiv:2306.05279, 2023
32023
Token-level speaker change detection using speaker difference and speech content via continuous integrate-and-fire
Z Fan, Z Liang, L Dong, Y Liu, S Zhou, M Cai, J Zhang, Z Ma, B Xu
arXiv preprint arXiv:2211.09381, 2022
22022
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training
L Dong, Z An, P Wu, J Zhang, L Lu, Z Ma
arXiv preprint arXiv:2305.17499, 2023
12023
Method, apparatus, device, and storage medium for speaker change point detection
D Linhao, Z Fan, Z Ma
US Patent App. 18/394,143, 2024
2024
Model training method, speech recognition method, device, medium, and apparatus
D Linhao, Z Ma
US Patent App. 18/276,769, 2024
2024
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR
Z Fan, L Dong, J Zhang, L Lu, Z Ma
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–18