Follow
John Kirchenbauer
Title
Cited by
Cited by
Year
A watermark for large language models
J Kirchenbauer, J Geiping, Y Wen, J Katz, I Miers, T Goldstein
International Conference on Machine Learning, 17061-17084, 2023
3792023
Hard prompts made easy: Gradient-based discrete optimization for prompt tuning and discovery
Y Wen, N Jain, J Kirchenbauer, M Goldblum, J Geiping, T Goldstein
Advances in Neural Information Processing Systems 36, 2024
1312024
On the reliability of watermarks for large language models
J Kirchenbauer, J Geiping, Y Wen, M Shu, K Saifullah, K Kong, ...
arXiv preprint arXiv:2306.04634, 2023
622023
Tree-ring watermarks: Fingerprints for diffusion images that are invisible and robust
Y Wen, J Kirchenbauer, J Geiping, T Goldstein
arXiv preprint arXiv:2305.20030, 2023
59*2023
Baseline defenses for adversarial attacks against aligned language models
N Jain, A Schwarzschild, Y Wen, G Somepalli, J Kirchenbauer, P Chiang, ...
arXiv preprint arXiv:2309.00614, 2023
382023
Neftune: Noisy embeddings improve instruction finetuning
N Jain, P Chiang, Y Wen, J Kirchenbauer, HM Chu, G Somepalli, ...
arXiv preprint arXiv:2310.05914, 2023
31*2023
Kezhi Kong, Kasun Fernando, Aniruddha Saha, Micah Goldblum, and Tom Goldstein. 2023b. On the reliability of watermarks for large language models
J Kirchenbauer, J Geiping, Y Wen, M Shu, K Saifullah
arXiv preprint arXiv:2306.04634, 2023
302023
GOAT: A global transformer on large-scale graphs
K Kong, J Chen, J Kirchenbauer, R Ni, CB Bruss, T Goldstein
International Conference on Machine Learning, 17375-17390, 2023
252023
A closer look at distribution shifts and out-of-distribution generalization on graphs
M Ding, K Kong, J Chen, J Kirchenbauer, M Goldblum, D Wipf, F Huang, ...
172021
Bring your own data! self-supervised evaluation for large language models
N Jain, K Saifullah, Y Wen, J Kirchenbauer, M Shu, A Saha, M Goldblum, ...
arXiv preprint arXiv:2306.13651, 2023
162023
A watermark for large language models (2023)
J Kirchenbauer, J Geiping, Y Wen, J Katz, I Miers, T Goldstein
arXiv preprint arXiv:2301.10226, 2023
62023
Transformers Can Do Arithmetic with the Right Embeddings
S McLeish, A Bansal, A Stein, N Jain, J Kirchenbauer, BR Bartoldson, ...
arXiv preprint arXiv:2405.17399, 2024
32024
yeh Chiang
N Jain, A Schwarzschild, Y Wen, G Somepalli, J Kirchenbauer
P., Goldblum, M., Saha, A., Geiping, J., and Goldstein, T. Baseline defenses …, 2023
32023
Knowing When You Don’t Know: Quantifying and Reasoning about Uncertainty in Machine Learning Models
E Heim, J Kirchenbauer, J Helland, J Oaks, A Singh, Z Lipton
12022
What is Your Metric Telling You? Evaluating Classifier Calibration under Context-Specific Definitions of Reliability
J Kirchenbauer, J Oaks, E Heim
12022
GenQA: Generating Millions of Instructions from a Handful of Prompts
J Chen, R Qadri, Y Wen, N Jain, J Kirchenbauer, T Zhou, T Goldstein
arXiv preprint arXiv:2406.10323, 2024
2024
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
A Hans, Y Wen, N Jain, J Kirchenbauer, H Kazemi, P Singhania, S Singh, ...
arXiv preprint arXiv:2406.10209, 2024
2024
OPTune: Efficient Online Preference Tuning
L Chen, J Chen, C Liu, J Kirchenbauer, D Soselia, C Zhu, T Goldstein, ...
arXiv preprint arXiv:2406.07657, 2024
2024
LMD3: Language Model Data Density Dependence
J Kirchenbauer, G Honke, G Somepalli, J Geiping, D Ippolito, K Lee, ...
arXiv preprint arXiv:2405.06331, 2024
2024
How to Do a Vocab Swap? A Study of Embedding Replacement for Pre-trained Transformers
N Jain, J Kirchenbauer, J Geiping, T Goldstein
The system can't perform the operation now. Try again later.
Articles 1–20