Sora: A review on background, technology, limitations, and opportunities of large vision models Y Liu, K Zhang, Y Li, Z Yan, C Gao, R Chen, Z Yuan, Y Huang, H Sun, ... arXiv preprint arXiv:2402.17177, 2024 | 55 | 2024 |
TinyGPT-V: Efficient multimodal large language model via small backbones Z Yuan, Z Li, L Sun arXiv preprint arXiv:2312.16862, 2023 | 18 | 2023 |
Mora: Enabling generalist video generation via a multi-agent framework Z Yuan, R Chen, Z Li, H Jia, L He, C Wang, L Sun arXiv preprint arXiv:2403.13248, 2024 | 3 | 2024 |
ArtGPT-4: Artistic vision-language understanding with adapter-enhanced MiniGPT-4 Z Yuan, H Xue, X Wang, Y Liu, Z Zhao, K Wang arXiv preprint arXiv:2305.07490, 2023 | 3 | 2023 |
RPN: a word vector level data augmentation algorithm in deep learning for language understanding Z Yuan, X Zhang, Y Wang, X Hou, H Xue, Z Zhao, Y Liu 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC …, 2023 | 1 | 2023 |
Hulk: Graph Neural Networks for Optimizing Regionally Distributed Computing Systems Z Yuan, H Xue, C Zhang, Y Liu Proceedings of SAI Intelligent Systems Conference, 561-576, 2023 | 1 | 2023 |