Karina Nguyen

Cited by

	All	Since 2019
Citations	479	479
h-index	8	8
i10-index	7	7

Citations per year
Year	2022	2023	2024
Citations	5	228	245

Co-authors

Esin Durmus - Anthropic - Verified email at stanford.edu
Amanda Askell - Anthropic - Verified email at askell.io
Ethan Perez - Anthropic; New York University - Verified email at anthropic.com
Deep Ganguli - Anthropic - Verified email at cns.nyu.edu
Anton Bakhtin - FAIR - Verified email at fb.com
Roger Grosse - Associate Professor, University of Toronto - Verified email at cs.toronto.edu
Sara Hooker - Head of Cohere For AI - Verified email at cohere.com
Randall Balestriero - AI Researcher - Verified email at citadel.com
Yang Li - Senior Staff Research Scientist, Google - Verified email at acm.org
Sam Ringer - Speechmatics - Verified email at anthropic.com

Karina Nguyen

OpenAI, prev. Anthropic, UC Berkeley

Verified email at berkeley.edu


Title	Cited by	Year
Discovering Language Model Behaviors with Model-Written Evaluations. E Perez, S Ringer, K Lukošiūtė, K Nguyen, E Chen, S Heiner, C Pettit, ... arXiv preprint arXiv:2212.09251, 2022	130	2022
The Capacity for Moral Self-Correction in Large Language Models. D Ganguli, A Askell, N Schiefer, T Liao, K Lukošiūtė, A Chen, A Goldie, ... arXiv preprint arXiv:2302.07459, 2023	96	2023
Towards measuring the representation of subjective global opinions in language models. E Durmus, K Nguyen, TI Liao, N Schiefer, A Askell, A Bakhtin, C Chen, ... arXiv preprint arXiv:2306.16388, 2023	65	2023
Towards monosemanticity: Decomposing language models with dictionary learning. T Bricken, A Templeton, J Batson, B Chen, A Jermyn, T Conerly, N Turner, ... Transformer Circuits Thread	61*	2023
Studying large language model generalization with influence functions. R Grosse, J Bae, C Anil, N Elhage, A Tamkin, A Tajdini, B Steiner, D Li, ... arXiv preprint arXiv:2308.03296, 2023	47	2023
Measuring faithfulness in chain-of-thought reasoning. T Lanham, A Chen, A Radhakrishnan, B Steiner, C Denison, ... arXiv preprint arXiv:2307.13702, 2023	37	2023
Question decomposition improves the faithfulness of model-generated reasoning. A Radhakrishnan, K Nguyen, A Chen, C Chen, C Denison, D Hernandez, ... arXiv preprint arXiv:2307.11768, 2023	26	2023
Specific versus General Principles for Constitutional AI. S Kundu, Y Bai, S Kadavath, A Askell, A Callahan, A Chen, A Goldie, ... arXiv preprint arXiv:2310.13798, 2023	9	2023
FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling. WY Ko, D D'souza, K Nguyen, R Balestriero, S Hooker. arXiv preprint arXiv:2303.00586, 2023	4	2023
Vision Transformers for Mobile Applications: A Short Survey. N Alam, S Kolawole, S Sethi, N Bansali, K Nguyen. arXiv preprint arXiv:2305.19365, 2023	3	2023
Towards Semantically-Aware UI Design Tools: Design, Implementation, and Evaluation of Semantic Grouping Guidelines. P Duan, B Hartmann, K Nguyen, Y Li, M Hearst, MR Morris	1	2023
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning. ARK Nguyen, JKJBSRB EthanPerez
Vision Transformers for Edge Devices - An Overview. N Alam, S Sethi, S Kolawole, ML Collective, N Bansali, K Nguyen
