FOLIO: Natural language reasoning with first-order logic S Han, H Schoelkopf, Y Zhao, Z Qi, M Riddell, L Benson, L Sun, E Zubova, ... arXiv preprint arXiv:2209.00840, 2022 | 74* | 2022 |
Revisiting the gold standard: Grounding summarization evaluation with robust human evaluation Y Liu, AR Fabbri, P Liu, Y Zhao, L Nan, R Han, S Han, S Joty, CS Wu, ... ACL 2023, 2022 | 54 | 2022 |
MultiHiertt: Numerical reasoning over multi hierarchical tabular and textual data Y Zhao, Y Li, C Li, R Zhang ACL 2022, 2022 | 52 | 2022 |
Apparel-invariant feature learning for person re-identification Z Yu, Y Zhao, B Hong, Z Jin, J Huang, D Cai, XS Hua IEEE Transactions on Multimedia 24, 4482-4492, 2021 | 32 | 2021 |
Enhancing Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies L Nan, Y Zhao, W Zou, N Ri, J Tae, E Zhang, A Cohan, D Radev EMNLP 2023 Findings, 2023 | 22* | 2023 |
ReasTAP: Injecting table reasoning skills during pre-training via synthetic reasoning examples Y Zhao, L Nan, Z Qi, R Zhang, D Radev EMNLP 2022, 2022 | 19 | 2022 |
Investigating data contamination in modern benchmarks for large language models C Deng, Y Zhao, X Tang, M Gerstein, A Cohan NAACL 2024, 2023 | 18* | 2023 |
MedAgents: Large language models as collaborators for zero-shot medical reasoning X Tang, A Zou, Z Zhang, Y Zhao, X Zhang, A Cohan, M Gerstein arXiv preprint arXiv:2311.10537, 2023 | 15 | 2023 |
FinMath: Injecting a tree-structured solver for question answering over financial reports C Li, W Ye, Y Zhao LREC 2022, 2022 | 12 | 2022 |
RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations Y Zhao, C Zhao, L Nan, Z Qi, W Zhang, X Tang, B Mi, D Radev ACL 2023, 2023 | 9 | 2023 |
Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data? X Tang, Y Zong, Y Zhao, A Cohan, M Gerstein NAACL 2024, 2023 | 8 | 2023 |
R2D2: Robust data-to-text with replacement detection L Nan, LJY Flores, Y Zhao, Y Liu, L Benson, W Zou, D Radev EMNLP 2022, 2022 | 8 | 2022 |
LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control Y Zhao, Z Qi, L Nan, LJY Flores, D Radev EACL 2023, 2023 | 7 | 2023 |
Benchmarking generation and evaluation capabilities of large language models for instruction controllable summarization Y Liu, AR Fabbri, J Chen, Y Zhao, S Han, S Joty, P Liu, D Radev, CS Wu, ... NAACL 2024 Findings, 2023 | 6 | 2023 |
LAMP: label augmented multimodal pretraining J Guo, C Zhu, Y Zhao, H Wang, Y Hu, X He, D Cai arXiv preprint arXiv:2012.04446, 2020 | 6 | 2020 |
Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-World Information Seeking Scenarios Y Zhao, H Zhang, S Si, L Nan, X Tang, A Cohan EMNLP 2023 Industry Track, 2023 | 5* | 2023 |
L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models A Ni, P Yin, Y Zhao, M Riddell, T Feng, R Shen, S Yin, Y Liu, S Yavuz, ... arXiv preprint arXiv:2309.17446, 2023 | 5 | 2023 |
Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science X Tang, Q Jin, K Zhu, T Yuan, Y Zhang, W Zhou, M Qu, Y Zhao, J Tang, ... arXiv preprint arXiv:2402.04247, 2024 | 4 | 2024 |
OpenRT: An Open-source Framework for Reasoning Over Tabular Data Y Zhao, B Mi, Z Qi, L Nan, M Guo, A Cohan, D Radev ACL 2023 Demo, 2023 | 4 | 2023 |
QTSumm: Query-Focused Summarization over Tabular Data Y Zhao, Z Qi, L Nan, B Mi, Y Liu, W Zou, S Han, R Chen, X Tang, Y Xu, ... EMNLP 2023, 2023 | 3* | 2023 |