关注
Shuhuai Ren
Shuhuai Ren
其他姓名任抒怀
在 stu.pku.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Generating natural language adversarial examples through probability weighted word saliency
S Ren, Y Deng, K He, W Che
ACL 2019, 2019
6452019
MIT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning
L Li, Y Yin, S Li, L Chen, P Wang, S Ren, M Li, Y Yang, J Xu, X Sun, ...
arXiv preprint arXiv:2306.04387, 2023
59*2023
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
L Li, Y Lin, D Chen, S Ren, P Li, J Zhou, X Sun
Findings of EMNLP 2021, 2021
40*2021
Dynamic Knowledge Distillation for Pre-trained Language Models
L Li, Y Lin, S Ren, P Li, J Zhou, X Sun
EMNLP 2021, 2021
322021
Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
S Ren, J Zhang, L Li, X Sun, J Zhou
EMNLP 2021, 2021
272021
Learning Relation Alignment for Calibrated Cross-modal Retrieval
S Ren, J Lin, G Zhao, R Men, A Yang, J Zhou, X Sun, H Yang
ACL 2021, 2021
252021
Delving into the Openness of CLIP
S Ren, L Li, X Ren, G Zhao, X Sun
Findings of ACL 2023, 2022
15*2022
DCA: Diversified Co-Attention towards Informative Live Video Commenting
Z Zhang, Z Yin, S Ren, X Li, S Li
NLPCC 2020, 2020
142020
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
S Ren, A Zhang, Y Zhu, S Zhang, S Zheng, M Li, A Smola, X Sun
NeurIPS 2023, 2023
132023
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
L Chen, Y Zhang, S Ren, H Zhao, Z Cai, Y Wang, P Wang, X Meng, T Liu, ...
arXiv preprint arXiv:2402.15527, 2024
11*2024
Cuge: A chinese language understanding and generation evaluation benchmark
Y Yao, Q Dong, J Guan, B Cao, Z Zhang, C Xiao, X Wang, F Qi, J Bao, ...
arXiv preprint arXiv:2112.13610, 2021
112021
FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Y Liu, L Li, S Ren, R Gao, S Li, S Chen, X Sun, L Hou
NeurIPS 2023 (Datasets and Benchmarks Track), 2023
62023
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
S Ren, L Yao, S Li, X Sun, L Hou
CVPR 2024, 2023
42023
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
S Li, L Li, S Ren, Y Liu, Y Liu, R Gao, X Sun, L Hou
arXiv preprint arXiv:2311.17404, 2023
12023
TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
S Ren, S Chen, S Li, X Sun, L Hou
Findings of EMNLP 2023, 2023
12023
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
Y Wang, S Ren, R Gao, L Yao, Q Guo, K An, J Bai, X Sun
arXiv preprint arXiv:2404.10763, 2024
2024
Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality
S Chen, L Li, S Ren, R Gao, Y Liu, X Bi, X Sun, L Hou
arXiv preprint arXiv:2403.19221, 2024
2024
TempCompass: Do Video LLMs Really Understand Videos?
Y Liu, S Li, Y Liu, Y Wang, S Ren, L Li, S Chen, X Sun, L Hou
arXiv preprint arXiv:2403.00476, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–18