Audio-visual speech recognition in misp2021 challenge: Dataset release and deep analysis H Chen, J Du, Y Dai, CH Lee, SM Siniscalchi, S Watanabe, ... Proceedings of the Annual Conference of the International Speech …, 2022 | 24 | 2022 |
Blind source separation‐based IVA‐Xception model for bird sound recognition in complex acoustic environments Y Dai, J Yang, Y Dong, H Zou, M Hu, B Wang Electronics Letters 57 (11), 454-456, 2021 | 14 | 2021 |
Meta-adaptive stock movement prediction with two-stage representation learning D Zhan, Y Dai, Y Dong, J He, Z Wang, J Anderson Proceedings of the 2024 SIAM International Conference on Data Mining (SDM …, 2024 | 6 | 2024 |
Improving audio-visual speech recognition by lip-subword correlation based visual pre-training and cross-modal fusion encoder Y Dai, H Chen, J Du, X Ding, N Ding, F Jiang, CH Lee 2023 IEEE International Conference on Multimedia and Expo (ICME), 2627-2632, 2023 | 4 | 2023 |
The multimodal information based speech processing (misp) 2023 challenge: Audio-visual target speaker extraction S Wu, C Wang, H Chen, Y Dai, C Zhang, R Wang, H Lan, J Du, CH Lee, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge H Chen, S Wu, Y Dai, Z Wang, J Du, CH Lee, J Chen, S Watanabe, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Improving Multi-Modal Emotion Recognition Using Entropy-Based Fusion and Pruning-Based Network Architecture Optimization H Wang, J Du, Y Dai, CH Lee, Y Ren, Y Liu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition Y Dai, H Chen, J Du, R Wang, S Chen, J Ma, H Wang, CH Lee arXiv preprint arXiv:2403.04245, 2024 | | 2024 |