Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 1037 | 2023 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 963 | 2023 |
Minimum risk training for neural machine translation S Shen, Y Cheng, Z He, W He, H Wu, M Sun, Y Liu ACL, 2015 | 498 | 2015 |
Semi-supervised learning for neural machine translation Y Cheng, W Xu, Z He, W He, H Wu, M Sun, Y Liu ACL, 2016 | 315 | 2016 |
Robust Neural Machine Translation with Doubly Adversarial Inputs Y Cheng, L Jiang, W Macherey ACL, 2019 | 264 | 2019 |
Towards robust neural machine translation Y Cheng, Z Tu, F Meng, J Zhai, Y Liu ACL, 2018 | 177 | 2018 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 172 | 2024 |
A teacher-student framework for zero-resource neural machine translation Y Chen, Y Liu, Y Cheng, VOK Li ACL, 2017 | 149 | 2017 |
Advaug: Robust adversarial augmentation for neural machine translation Y Cheng, L Jiang, W Macherey, J Eisenstein ACL, 2020 | 108 | 2020 |
Joint Training for Pivot-based Neural Machine Translation Y Cheng, Q Yang, Y Liu, M Sun, W Xu IJCAI, 3974-3980, 2017 | 100 | 2017 |
mslam: Massively multilingual joint pre-training for speech and text A Bapna, C Cherry, Y Zhang, Y Jia, M Johnson, Y Cheng, S Khanuja, ... arXiv preprint arXiv:2202.01374, 2022 | 97 | 2022 |
Magvit: Masked generative video transformer L Yu, Y Cheng, K Sohn, J Lezama, H Zhang, H Chang, AG Hauptmann, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 96 | 2023 |
Agreement-based joint training for bidirectional attention-based neural machine translation Y Cheng, Q Yang, Y Liu, M Sun, W Xu IJCAI, 2016 | 91 | 2016 |
Thumt: An open source toolkit for neural machine translation J Zhang, Y Ding, S Shen, Y Cheng, M Sun, H Luan, Y Liu arXiv preprint arXiv:1706.06415, 2017 | 74 | 2017 |
Reducing Word Omission Errors in Neural Machine Translation: A Contrastive Learning Approach Z Yang, Y Cheng, Y Liu, M Sun ACL, 2019 | 66 | 2019 |
Towards conversational diagnostic ai T Tu, A Palepu, M Schaekermann, K Saab, J Freyberg, R Tanno, A Wang, ... arXiv preprint arXiv:2401.05654, 2024 | 62 | 2024 |
Sunipa Dev R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... Jacob Devlin, Mark Díaz, Nan Du, Ethan Dyer, Vladimir Feinberg, Fangxiaoyu …, 2023 | 60 | 2023 |
Videopoet: A large language model for zero-shot video generation D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, R Hornung, H Adam, ... arXiv preprint arXiv:2312.14125, 2023 | 59 | 2023 |
Language Model Beats Diffusion--Tokenizer is Key to Visual Generation L Yu, J Lezama, NB Gundavarapu, L Versari, K Sohn, D Minnen, Y Cheng, ... arXiv preprint arXiv:2310.05737, 2023 | 52 | 2023 |
An End-to-End Generative Architecture for Paraphrase Generation Q Yang, Z Huo, D Shen, Y Cheng, W Wang, G Wang, L Carin EMNLP, 2019 | 44 | 2019 |