关注
Jinglin Liu (刘静林)
Jinglin Liu (刘静林)
Research Scientist, ByteDance
在 bytedance.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Diffsinger: Singing voice synthesis via shallow diffusion mechanism
J Liu, C Li, Y Ren, F Chen, Z Zhao
AAAI, 2021
1952021
Make-an-audio: Text-to-audio generation with prompt-enhanced diffusion models
R Huang, J Huang, D Yang, Y Ren, L Liu, M Li, Z Ye, J Liu, X Yin, Z Zhao
ICML, 2023
1252023
Prodiff: Progressive fast diffusion model for high-quality text-to-speech
R Huang, Z Zhao, H Liu, J Liu, C Cui, Y Ren
Proceedings of the 30th ACM International Conference on Multimedia, 2595-2605, 2022
1052022
Audiogpt: Understanding and generating speech, music, sound, and talking head
R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (21), 23802 …, 2024
872024
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus
R Huang, F Chen, Y Ren, J Liu, C Cui, Z Zhao
ACM MM, 2021
682021
PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Y Ren*, J Liu*, Z Zhao
NeurIPS, 2021
662021
A study of non-autoregressive model for sequence generation
Y Ren*, J Liu*, X Tan, S Zhao, Z Zhao, TY Liu
ACL, 2020
662020
SimulSpeech: End-to-end simultaneous speech to text translation
Y Ren*, J Liu*, X Tan, C Zhang, T Qin, Z Zhao, TY Liu
ACL, 2020
622020
Generspeech: Towards style transfer for generalizable out-of-domain text-to-speech
R Huang, Y Ren, J Liu, C Cui, Z Zhao
NeurIPS, 2022
592022
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis
Z Ye, Z Jiang, Y Ren, J Liu, JZ He, Z Zhao
ICLR 2023, 2023
562023
Singgan: Generative adversarial network for high-fidelity singing voice generation
R Huang, C Cui, F Chen, Y Ren, J Liu, Z Zhao, B Huai, Z Wang
Proceedings of the 30th ACM International Conference on Multimedia, 2525-2535, 2022
472022
Denoispeech: Denoising text to speech with frame-level noise modeling
C Zhang, Y Ren, X Tan, J Liu, K Zhang, T Qin, S Zhao, TY Liu
ICASSP, 2021
422021
M4singer: A multi-style, multi-singer and musical score provided mandarin singing corpus
L Zhang, R Li, S Wang, L Deng, J Liu, Y Ren, J He, R Huang, J Zhu, ...
Advances in Neural Information Processing Systems 35, 6914-6926, 2022
372022
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation
J Liu, Y Ren, X Tan, C Zhang, T Qin, Z Zhao, TY Liu
IJCAI, 2020
342020
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
R Huang*, J Liu*, H Liu*, Y Ren, L Zhang, J He, Z Zhao
ICLR, 2022
282022
SimulSLT: End-to-End Simultaneous Sign Language Translation
A Yin, Z Zhao, J Liu, W Jin, M Zhang, X Zeng, X He
ACM MM 2021, 2021
222021
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
C Cui, Y Ren, J Liu, F Chen, R Huang, M Lei, Z Zhao
INTERSPEECH 2021, 2021
222021
Mega-tts: Zero-shot text-to-speech at scale with intrinsic inductive bias
Z Jiang, Y Ren, Z Ye, J Liu, C Zhang, Q Yang, S Ji, R Huang, C Wang, ...
arXiv preprint arXiv:2306.03509, 2023
202023
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory
Z Lin, Z Zhao, H Li, J Liu, M Zhang, X Zeng, X He
ACM MM 2021, 2021
152021
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire
J Liu, Y Ren, Z Zhao, C Zhang, B Huai, J Yuan
ACM MM 2020, 2020
132020
系统目前无法执行此操作,请稍后再试。
文章 1–20