WenLan: Bridging vision and language by large-scale multi-modal pre-training | Y Huo, M Zhang, G Liu, H Lu, Y Gao, G Yang, J Wen, H Zhang, B Xu, ... | arXiv preprint arXiv:2103.06561, 2021 | 117 | 2021 |
Transferring Foundation Models for Generalizable Robotic Manipulation | J Yang, W Tan, C Jin, K Yao, B Liu, J Fu, R Song, G Wu, L Wang | arXiv e-prints, arXiv:2306.05716, 2023 | 22* | 2023 |
AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation | C Jin*, W Tan*, J Yang*, B Liu, R Song, L Wang, J Fu | arXiv preprint arXiv:2305.18898, 2023 | 16 | 2023 |
Text2poster: Laying out stylized texts on retrieved images | C Jin, H Xu, R Song, Z Lu | ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 6 | 2022 |
Few-shot learning in realistic settings for text CAPTCHA recognition | Y Wang, Y Wei, Y Zhang, C Jin, G Xin, B Wang | Neural Computing and Applications 35 (15), 10751-10764, 2023 | 2 | 2023 |
Joint Semantic and Strategy Matching for Persuasive Dialogue | C Jin, Y Zhu, L Kong, S Li, X Zhang, R Song, X Chen, H Chen, Y Sun, ... | Findings of the Association for Computational Linguistics: EMNLP 2023, 4187-4197, 2023 | | 2023 |