Human-like summarization evaluation with ChatGPT M Gao, J Ruan, R Sun, X Yin, S Yang, X Wan arXiv preprint arXiv:2304.02554, 2023 | 62 | 2023 |
Summarization is (almost) dead X Pu, M Gao, X Wan arXiv preprint arXiv:2309.09558, 2023 | 34 | 2023 |
Missing information, unresponsive authors, experimental flaws: The impossibility of assessing the reproducibility of previous human evaluations in NLP A Belz, C Thomson, E Reiter, G Abercrombie, JM Alonso-Moral, M Arvan, ... arXiv preprint arXiv:2305.01633, 2023 | 28 | 2023 |
DialSummEval: Revisiting summarization evaluation for dialogues M Gao, X Wan Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 18 | 2022 |
LLM-based NLG evaluation: Current status and challenges M Gao, X Hu, J Ruan, X Pu, X Wan arXiv preprint arXiv:2402.01383, 2024 | 10 | 2024 |
Reference matters: Benchmarking factual error correction for dialogue summarization with fine-grained evaluation framework M Gao, X Wan, J Su, Z Wang, B Huai arXiv preprint arXiv:2306.05119, 2023 | 4 | 2023 |
A Reproduction Study of the Human Evaluation of Role-Oriented Dialogue Summarization Models M Gao, J Ruan, X Wan Proceedings of the 3rd Workshop on Human Evaluation of NLP Systems, 124-129, 2023 | 2 | 2023 |
Evaluating factuality in cross-lingual summarization M Gao, W Wang, X Wan, Y Xu Findings of the Association for Computational Linguistics: ACL 2023, 12415-12431, 2023 | 2 | 2023 |
Are LLM-based Evaluators Confusing NLG Quality Criteria? X Hu, M Gao, S Hu, Y Zhang, Y Chen, T Xu, X Wan arXiv preprint arXiv:2402.12055, 2024 | 1 | 2024 |
Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks X Pu, M Gao, X Wan arXiv preprint arXiv:2305.15044, 2023 | 1 | 2023 |
Social Biases in Automatic Evaluation Metrics for NLG M Gao, X Wan arXiv preprint arXiv:2210.08859, 2022 | 1 | 2022 |
Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling J Ruan, X Pu, M Gao, X Wan, Y Zhu Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18915 …, 2024 | | 2024 |