关注
Kaiyue Wen
Kaiyue Wen
Undergraduate, Tsinghua University
在 mails.tsinghua.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
On transferability of prompt tuning for natural language processing
Y Su, X Wang, Y Qin, CM Chan, Y Lin, H Wang, K Wen, Z Liu, P Li, J Li, ...
arXiv preprint arXiv:2111.06719, 2021
932021
How Sharpness-Aware Minimization Minimizes Sharpness?
K Wen, T Ma, Z Li
International Conference on Learning Representations, 0
54*
Finding Skill Neurons in Pre-trained Transformer-based Language Models
X Wang, K Wen, Z Zhang, L Hou, Z Liu, J Li
arXiv preprint arXiv:2211.07349, 2022
452022
Transformers are uninterpretable with myopic methods: a case study with bounded Dyck grammars
K Wen, Y Li, B Liu, A Risteski
Advances in Neural Information Processing Systems 36, 2024
11*2024
Sharpness minimization algorithms do not only minimize sharpness to achieve better generalization
K Wen, Z Li, T Ma
Advances in Neural Information Processing Systems 36, 2024
102024
Benign overfitting in classification: Provably counter label noise with larger models
K Wen, J Teng, J Zhang
arXiv preprint arXiv:2206.00501, 2022
5*2022
Residual permutation test for high-dimensional regression coefficient testing
K Wen, T Wang, Y Wang
arXiv preprint arXiv:2211.16182, 2022
42022
RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval
K Wen, X Dang, K Lyu
arXiv preprint arXiv:2402.18510, 2024
12024
Practically Solving LPN in High Noise Regimes Faster Using Neural Networks
H Jiang, K Wen, Y Chen
arXiv preprint arXiv:2303.07987, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–9