Hao Shi
Title
Cited by
Year
Language-specific Characteristic Assistance for Code-switching Speech Recognition
T Song, Q Xu, M Ge, L Wang, H Shi, Y Lv, Y Lin, J Dang
INTERSPEECH, 3924-3928, 2022
Cited by 17*, 2022
Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement
M Ge, L Wang, N Li, H Shi, J Dang, X Li
INTERSPEECH, 3153-3157, 2019
Cited by 16, 2019
Spectrograms Fusion with Minimum Difference Masks Estimation for Monaural Speech Dereverberation
H Shi, L Wang, M Ge, S Li, J Dang
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 15, 2020
Time-Domain Speech Enhancement Assisted by Multi-Resolution Frequency Encoder and Decoder
H Shi, M Mimura, L Wang, J Dang, T Kawahara
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 6, 2023
Spectrograms Fusion-based End-to-end Robust Automatic Speech Recognition
H Shi, L Wang, S Li, C Fan, J Dang, T Kawahara
2021 Asia-Pacific Signal and Information Processing Association Annual …, 2021
Cited by 6, 2021
Singing Voice Extraction with Attention-Based Spectrograms Fusion
H Shi, L Wang, S Li, C Ding, M Ge, N Li, J Dang, H Seki
INTERSPEECH, 2412-2416, 2020
Cited by 6, 2020
Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model
Q Xu, T Song, L Wang, H Shi, Y Lin, Y Lv, M Ge, Q Yu, J Dang
INTERSPEECH, 1716-1720, 2022
Cited by 5, 2022
Extending audio masked autoencoders toward audio restoration
Z Zhong, H Shi, M Hirano, K Shimada, K Tateishi, T Shibuya, S Takahashi, ...
2023 IEEE Workshop on Applications of Signal Processing to Audio and …, 2023
Cited by 3, 2023
Diffusion-based speech enhancement with joint generative and predictive decoders
H Shi, K Shimada, M Hirano, T Shibuya, Y Koyama, Z Zhong, S Takahashi, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 2, 2024
Fusing Multiple Bandwidth Spectrograms for Improving Speech Enhancement
H Shi, Y Shu, L Wang, J Dang, T Kawahara
2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022
Cited by 2, 2022
Monolingual Recognizers Fusion for Code-switching Speech Recognition
T Song, Q Xu, H Lu, L Wang, H Shi, Y Lin, Y Yang, J Dang
arXiv preprint arXiv:2211.01046, 2022
Cited by 2, 2022
Monaural Speech Enhancement Based on Spectrogram Decomposition for Convolutional Neural Network-sensitive Feature Extraction
H Shi, L Wang, S Li, J Dang, T Kawahara
INTERSPEECH, 221-225, 2022
Cited by 2, 2022
Tendency-and-attention-informed deep learning for ENSO forecasts
S Qiao, C Zhang, X Zhang, K Zhang, H Shi, S Li, H Wei
Climate Dynamics 61 (11), 5271-5286, 2023
Cited by 1, 2023
Investigation of Adapter for Automatic Speech Recognition in Noisy Environment
Hao Shi, Tatsuya Kawahara
研究報告自然言語処理(NL) 258 (19), 1-6, 2023
Cited by 1*, 2023
Adaptive Attention Network with Domain Adversarial Training for Multi-Accent Speech Recognition
Y Yang, H Shi, Y Lin, M Ge, L Wang, Q Hou, J Dang
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by 1, 2022
Subband-based Spectrogram Fusion for Speech Enhancement by Combining Mapping and Masking Approaches
H Shi, L Wang, S Li, J Dang, T Kawahara
2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022
Cited by 1, 2022
Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition
Yuchun Shu, Hao Shi, Longbiao Wang, Jianwu Dang
INTERSPEECH 2024, 2024
2024
Speech Emotion Recognition with Multi-level Acoustic and Semantic Information Extraction and Interaction
Yuan Gao, Hao Shi, Chenhui Chu, Tatsuya Kawahara
INTERSPEECH 2024, 2024
2024
Dual-path Adaptation of Pretrained Feature Extraction Module for Robust Automatic Speech Recognition
Hao Shi, Tatsuya Kawahara
INTERSPEECH 2024, 2024
2024
Waveform-domain Speech Enhancement Using Spectrogram Encoding for Robust Speech Recognition
H Shi, M Mimura, T Kawahara
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
2024