Human-like summarization evaluation with chatgpt M Gao, J Ruan, R Sun, X Yin, S Yang, X Wan arXiv preprint arXiv:2304.02554, 2023 | 78 | 2023 |
How do seq2seq models perform on end-to-end data-to-text generation? X Yin, X Wan ACL 2022 (Volume 1: Long Papers), 7701-7710, 2022 | 16 | 2022 |
ALCUNA: large language models meet new knowledge X Yin, B Huang, X Wan EMNLP 2023, 2023 | 12 | 2023 |
History matters: Temporal knowledge editing in large language model X Yin, J Jiang, L Yang, X Wan AAAI 2024 38 (17), 19413-19421, 2024 | 5 | 2024 |
Benchmarking knowledge boundary for large language model: A different perspective on model evaluation X Yin, X Zhang, J Ruan, X Wan ACL 2024, 2024 | 3 | 2024 |
A Comprehensive Evaluation and Analysis Study for Chinese Spelling Check X Yin, X Wan arXiv preprint arXiv:2307.13655, 2023 | 1 | 2023 |
Error-Robust Retrieval for Chinese Spelling Check X Yin, X Hu, J Jiang, X Wan COLING 2024, 6257-6267, 2022 | 1* | 2022 |
Themis: Towards Flexible and Interpretable NLG Evaluation X Hu, L Lin, M Gao, X Yin, X Wan arXiv preprint arXiv:2406.18365, 2024 | | 2024 |
MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency J Zhang, H Zhang, X Yin, B Huang, X Zhang, X Hu, X Wan arXiv preprint arXiv:2406.13219, 2024 | | 2024 |
ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions X Zhang, X Yin, X Wan arXiv preprint arXiv:2406.08842, 2024 | | 2024 |
Exploring Context-Aware Evaluation Metrics for Machine Translation X Hu, X Yin, X Wan Findings of EMNLP 2023, 15291-15298, 2023 | | 2023 |
Contextual Modeling for Document-level ASR Error Correction J Jiang, X Yin, X Wan, W Peng, R Li, J Yang, Y Zhou COLING 2024, 3855-3867, 2022 | | 2022 |