Follow
Jun Zhang
Jun Zhang
ByteDance
Verified email at bytedance.com
Title
Cited by
Cited by
Year
Dynamic frequency feature selection based approach for classification of motor imageries
J Luo, Z Feng, J Zhang, N Lu
Computers in biology and medicine 75, 45-53, 2016
782016
A target guided subband filter for acoustic event detection in noisy environments using wavelet packets
ZR Feng, Q Zhou, J Zhang, P Jiang, XW Yang
IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (2), 361-372, 2014
332014
Deep LSTM for large vocabulary continuous speech recognition
X Tian, J Zhang, Z Ma, Y He, J Wei, P Wu, W Situ, S Li, Y Zhang
arXiv preprint arXiv:1703.07090, 2017
272017
Improving rnn transducer with normalized jointer network
M Huang, J Zhang, M Cai, Y Zhang, J Yao, Y You, Y He, Z Ma
arXiv preprint arXiv:2011.01576, 2020
92020
Bring dialogue-context into RNN-T for streaming ASR.
J Hou, J Chen, W Li, Y Tang, J Zhang, Z Ma
INTERSPEECH, 2048-2052, 2022
52022
Frame stacking and retaining for recurrent neural network acoustic model
X Tian, J Zhang, Z Ma, Y He, J Wei
arXiv preprint arXiv:1705.05992, 2017
52017
Language-specific acoustic boundary learning for mandarin-english code-switching speech recognition
Z Fan, L Dong, C Shen, Z Liang, J Zhang, L Lu, Z Ma
arXiv preprint arXiv:2306.05279, 2023
32023
The volcspeech system for the icassp 2022 multi-channel multi-party meeting transcription challenge
C Shen, Y Liu, W Fan, B Wang, S Wen, Y Tian, J Zhang, J Yang, Z Ma
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
32022
HMM-Free Encoder Pre-Training for Streaming RNN Transducer
L Huang, J Sun, Y Tang, J Hou, J Chen, J Zhang, Z Ma
Proc. Interspeech 2021, 2021, 2021
32021
Dynamic latency speech recognition with asynchronous revision
M Huang, M Cai, J Zhang, Y Zhang, Y You, Y He, Z Ma
arXiv preprint arXiv:2011.01570, 2020
32020
Asynchronous motor imagery detection based on a target guided sub-band filter using wavelet packets
Y Sun, Z Feng, J Zhang, Q Zhou, J Luo
2017 29th Chinese Control And Decision Conference (CCDC), 4850-4855, 2017
32017
Exponential moving average model in parallel speech recognition training
X Tian, J Zhang, Z Ma, Y He, J Wei
arXiv preprint arXiv:1703.01024, 2017
32017
Token-level speaker change detection using speaker difference and speech content via continuous integrate-and-fire
Z Fan, Z Liang, L Dong, Y Liu, S Zhou, M Cai, J Zhang, Z Ma, B Xu
arXiv preprint arXiv:2211.09381, 2022
22022
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training
L Dong, Z An, P Wu, J Zhang, L Lu, Z Ma
arXiv preprint arXiv:2305.17499, 2023
12023
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR
Z Fan, L Dong, J Zhang, L Lu, Z Ma
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Improving Large-Scale Deep Biasing With Phoneme Features and Text-Only Data in Streaming Transducer
J Qiu, L Huang, B Li, J Zhang, L Lu, Z Ma
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–16