Reasoning or reciting? exploring the capabilities and limitations of language models through counterfactual tasks Z Wu, L Qiu, A Ross, E Akyürek, B Chen, B Wang, N Kim, J Andreas, ... arXiv preprint arXiv:2307.02477, 2023 | 49 | 2023 |
Dynamic sparsity neural networks for automatic speech recognition Z Wu, D Zhao, Q Liang, J Yu, A Gulati, R Pang ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 41 | 2021 |
Infusing finetuning with semantic dependencies Z Wu, H Peng, NA Smith Transactions of the Association for Computational Linguistics 9, 226-242, 2021 | 38 | 2021 |
We're Afraid Language Models Aren't Modeling Ambiguity A Liu, Z Wu, J Michael, A Suhr, P West, A Koller, S Swayamdipta, ... EMNLP 2023, 2023 | 35 | 2023 |
ABC: Attention with bounded-memory control H Peng, J Kasai, N Pappas, D Yogatama, Z Wu, L Kong, R Schwartz, ... ACL 2022, 2021 | 15 | 2021 |
WTMED at MEDIQA 2019: A hybrid approach to biomedical natural language inference Z Wu, Y Song, S Huang, Y Tian, F Xia Proceedings of the 18th BioNLP workshop and shared task, 415-426, 2019 | 15 | 2019 |
Understanding Mention Detector-Linker Interaction for Neural Coreference Resolution Z Wu, M Gardner Proceedings of the Fourth Workshop on Computational Models of Reference …, 2021 | 12 | 2021 |
Modeling Context With Linear Attention for Scalable Document-Level Translation Z Wu, H Peng, N Pappas, NA Smith Findings of EMNLP 2022, 2022 | 4 | 2022 |
Transparency helps reveal when language models learn meaning Z Wu, W Merrill, H Peng, I Beltagy, NA Smith Transactions of the Association for Computational Linguistics 11, 617-634, 2023 | 3 | 2023 |
Continued Pretraining for Better Zero-and Few-Shot Promptability Z Wu, RL Logan IV, P Walsh, A Bhagia, D Groeneveld, S Singh, I Beltagy EMNLP 2022, 2022 | 3 | 2022 |
Learning with latent structures in natural language processing: A survey Z Wu arXiv preprint arXiv:2201.00490, 2022 | 2 | 2022 |
A Taxonomy of Ambiguity Types for NLP MY Li, A Liu, Z Wu, NA Smith arXiv preprint arXiv:2403.14072, 2024 | | 2024 |
Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment W Merrill, Z Wu, N Naka, Y Kim, T Linzen arXiv preprint arXiv:2402.13956, 2024 | | 2024 |