Follow
Karina Nguyen
Karina Nguyen
OpenAI, prev. Anthropic, UC Berkeley
Verified email at berkeley.edu - Homepage
Title
Cited by
Cited by
Year
Discovering Language Model Behaviors with Model-Written Evaluations
E Perez, S Ringer, K Lukošiūtė, K Nguyen, E Chen, S Heiner, C Pettit, ...
arXiv preprint arXiv:2212.09251, 2022
1302022
The Capacity for Moral Self-Correction in Large Language Models
D Ganguli, A Askell, N Schiefer, T Liao, K Lukošiūtė, A Chen, A Goldie, ...
arXiv preprint arXiv:2302.07459, 2023
962023
Towards measuring the representation of subjective global opinions in language models
E Durmus, K Nyugen, TI Liao, N Schiefer, A Askell, A Bakhtin, C Chen, ...
arXiv preprint arXiv:2306.16388, 2023
652023
Towards monosemanticity: Decomposing language models with dictionary learning. Transformer Circuits Thread
T Bricken, A Templeton, J Batson, B Chen, A Jermyn, T Conerly, N Turner, ...
61*2023
Studying large language model generalization with influence functions
R Grosse, J Bae, C Anil, N Elhage, A Tamkin, A Tajdini, B Steiner, D Li, ...
arXiv preprint arXiv:2308.03296, 2023
472023
Measuring faithfulness in chain-of-thought reasoning
T Lanham, A Chen, A Radhakrishnan, B Steiner, C Denison, ...
arXiv preprint arXiv:2307.13702, 2023
372023
Question decomposition improves the faithfulness of model-generated reasoning
A Radhakrishnan, K Nguyen, A Chen, C Chen, C Denison, D Hernandez, ...
arXiv preprint arXiv:2307.11768, 2023
262023
Specific versus General Principles for Constitutional AI
S Kundu, Y Bai, S Kadavath, A Askell, A Callahan, A Chen, A Goldie, ...
arXiv preprint arXiv:2310.13798, 2023
92023
FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling
WY Ko, D D'souza, K Nguyen, R Balestriero, S Hooker
arXiv preprint arXiv:2303.00586, 2023
42023
Vision Transformers for Mobile Applications: A Short Survey
N Alam, S Kolawole, S Sethi, N Bansali, K Nguyen
arXiv preprint arXiv:2305.19365, 2023
32023
Towards Semantically-Aware UI Design Tools: Design, Implementation, and Evaluation of Semantic Grouping Guidelines
P Duan, B Hartmann, K Nguyen, Y Li, M Hearst, MR Morris
12023
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
ARK Nguyen, JKJBSRB EthanPerez
Vision Transformers for Edge Devices-An Overview
N Alam, S Sethi, S Kolawole, ML Collective, N Bansali, K Nguyen
The system can't perform the operation now. Try again later.
Articles 1–13