关注
Dacheng Yin
标题
引用次数
引用次数
年份
Phasen: A phase-and-harmonics-aware speech enhancement network
D Yin, C Luo, Z Xiong, W Zeng
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 9458-9465, 2020
2852020
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
D Yin, X Ren, C Luo, Y Wang, Z Xiong, W Zeng
International Conference on Learning Representations (ICLR), 2022
112022
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
D Yin, C Tang, Y Liu, X Wang, Z Zhao, Y Zhao, Z Xiong, S Zhao, C Luo
Interspeech 2022, 2022
102022
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
C Tang, C Luo, Z Zhao, D Yin, Y Zhao, W Zeng
Interspeech 2021, 2021
82021
TridentSE: Guiding speech enhancement with 32 global tokens
D Yin, Z Zhao, C Tang, Z Xiong, C Luo
arXiv preprint arXiv:2210.12995, 2022
72022
General-purpose speech representation learning through a self-supervised multi-granularity framework
Y Zhao, D Yin, C Luo, Z Zhao, C Tang, W Zeng, ZJ Zha
arXiv preprint arXiv:2102.01930, 2021
72021
Learning trajectories are generalization indicators
J Fu, Z Zhang, D Yin, Y Lu, N Zheng
Advances in Neural Information Processing Systems 36, 2024
12024
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
Y Wang, J Bao, W Weng, R Feng, D Yin, T Yang, J Zhang, QDZ Zhao, ...
arXiv preprint arXiv:2311.18829, 2023
2023
ARTV: Auto-Regressive Text-to-Video Generation with Diffusion Models
W Weng, R Feng, Y Wang, Q Dai, C Wang, D Yin, Z Zhao, K Qiu, J Bao, ...
arXiv preprint arXiv:2311.18834, 2023
2023
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss
Z Zhao, L Wu, C Tang, D Yin, Y Zhao, C Luo
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
Decomposing style, content, and motion for videos
Y Hu, D Yin, Y Wang, Z Chen, C Luo
Journal of Visual Communication and Image Representation 89, 103686, 2022
2022
系统目前无法执行此操作,请稍后再试。
文章 1–11