Follow
Mingqi Gao
Mingqi Gao
Master's student, Peking University
Verified email at pku.edu.cn - Homepage
Title
Cited by
Cited by
Year
Human-like summarization evaluation with chatgpt
M Gao, J Ruan, R Sun, X Yin, S Yang, X Wan
arXiv preprint arXiv:2304.02554, 2023
622023
Summarization is (almost) dead
X Pu, M Gao, X Wan
arXiv preprint arXiv:2309.09558, 2023
342023
Missing information, unresponsive authors, experimental flaws: The impossibility of assessing the reproducibility of previous human evaluations in NLP
A Belz, C Thomson, E Reiter, G Abercrombie, JM Alonso-Moral, M Arvan, ...
arXiv preprint arXiv:2305.01633, 2023
282023
DialSummEval: Revisiting summarization evaluation for dialogues
M Gao, X Wan
Proceedings of the 2022 Conference of the North American Chapter of the …, 2022
182022
Llm-based nlg evaluation: Current status and challenges
M Gao, X Hu, J Ruan, X Pu, X Wan
arXiv preprint arXiv:2402.01383, 2024
102024
Reference matters: Benchmarking factual error correction for dialogue summarization with fine-grained evaluation framework
M Gao, X Wan, J Su, Z Wang, B Huai
arXiv preprint arXiv:2306.05119, 2023
42023
A Reproduction Study of the Human Evaluation of Role-Oriented Dialogue Summarization Models
M Gao, J Ruan, X Wan
Proceedings of the 3rd Workshop on Human Evaluation of NLP Systems, 124-129, 2023
22023
Evaluating factuality in cross-lingual summarization
M Gao, W Wang, X Wan, Y Xu
Findings of the Association for Computational Linguistics: ACL 2023, 12415-12431, 2023
22023
Are LLM-based Evaluators Confusing NLG Quality Criteria?
X Hu, M Gao, S Hu, Y Zhang, Y Chen, T Xu, X Wan
arXiv preprint arXiv:2402.12055, 2024
12024
Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks
X Pu, M Gao, X Wan
arXiv preprint arXiv:2305.15044, 2023
12023
Social Biases in Automatic Evaluation Metrics for NLG
M Gao, X Wan
arXiv preprint arXiv:2210.08859, 2022
12022
Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling
J Ruan, X Pu, M Gao, X Wan, Y Zhu
Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18915 …, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–12