Joe Kwon

Cited by

	All	Since 2019
Citations	389	389
h-index	5	5
i10-index	4	4

220

110

165

202020212022202320241 17 52 213 105

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Dawn SongProfessor of Computer Science, UC BerkeleyVerified email at cs.berkeley.edu
Jacob SteinhardtStanford UniversityVerified email at cs.stanford.edu
Andy ZouPhD Student, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Mantas MazeikaUniversity of Illinois Urbana-ChampaignVerified email at illinois.edu
Steven BasartPhD, University of ChicagoVerified email at ttic.edu
Dan HendrycksDirector of the Center for AI SafetyVerified email at berkeley.edu
Mohammadreza MostajabiPhD Candidate, TTI-ChicagoVerified email at ttic.edu
Joshua B. TenenbaumMITVerified email at mit.edu
Sydney LevineResearch Scientist, Allen Institute for AI (Research Affiliate: Harvard & MIT)Verified email at mit.edu
Michael Lopez-BrauYale UniversityVerified email at yale.edu
Julian Jara-EttingerYale UniversityVerified email at yale.edu
Dylan Hadfield-MenellMassachusetts Institute of TechnologyVerified email at csail.mit.edu
Stephen CasperPhD student, MITVerified email at mit.edu
Jason LinGoogle / StanfordVerified email at stanford.edu
Gatlen CulpMassachusetts Institute of TechnologyVerified email at mit.edu
Owain EvansResearch Associate, University of OxfordVerified email at philosophy.ox.ac.uk
Tan Zhi-XuanMITVerified email at mit.edu
Ilker YildirimYale UniversityVerified email at yale.edu
Mengmi ZhangAssistant professor and PI of Deep NeuroCognition Lab, NTU and A*STARVerified email at ntu.edu.sg
Gabriel KreimanProfessor, Harvard Medical School and Children's HospitalVerified email at tch.harvard.edu

Joe Kwon

MIT

Verified email at csail.mit.edu

artificial intelligence cognitive science AI Safety


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Scaling out-of-distribution detection for real-world settings D Hendrycks, S Basart, M Mazeika, A Zou, J Kwon, M Mostajabi, ... arXiv preprint arXiv:1911.11132, 2019	309	2019
Explore, establish, exploit: Red teaming language models from scratch S Casper, J Lin, J Kwon, G Culp, D Hadfield-Menell arXiv preprint arXiv:2306.09442, 2023	41	2023
Forecasting future world events with neural networks A Zou, T Xiao, R Jia, J Kwon, M Mazeika, R Li, D Song, J Steinhardt, ... Advances in Neural Information Processing Systems 35, 27293-27305, 2022	13	2022
Social inferences from physical evidence via bayesian event reconstruction. M Lopez-Brau, J Kwon, J Jara-Ettinger Journal of Experimental Psychology: General 151 (9), 2029, 2022	10	2022
Mental state inference from indirect evidence through Bayesian event reconstruction. M Lopez-Brau, J Kwon, J Jara-Ettinger CogSci, 2020	6	2020
Flexibility in Moral Cognition: When is it okay to break the rules? J Kwon, J Tenenbaum, S Levine Proceedings of the annual meeting of the cognitive science society 44 (44), 2022	5	2022
Neuro-Symbolic Models of Human Moral Judgment: LLMs as Automatic Feature Extractors J Kwon, S Levine, JB Tenenbaum	3	2023
When it is not out of line to get out of line: The role of universalization and outcome-based reasoning in rule-breaking judgments J Kwon, T Zhi-Xuan, J Tenenbaum, S Levine PsyArXiv, 2023	2	2023
When it's not out of line to get out of line: Principles of universalizability, welfare, and harm J Kwon, T Zhi-Xuan, J Tenenbaum, S Levine Proceedings of the Annual Meeting of the Cognitive Science Society 45 (45), 2023		2023
Detecting the involvement of agents through physical reasoning M Lopez-Brau, J Kwon, B McBean, I Yildirim, J Jara-Ettinger Proceedings of the Annual Meeting of the Cognitive Science Society 43 (43), 2021		2021
Lift-the-flap: what, where and when for context reasoning M Zhang, C Tseng, K Montejo, J Kwon, G Kreiman arXiv preprint arXiv:1902.00163, 2019		2019
Does It Know?: Probing and Benchmarking Uncertainty in Language Model Latent Beliefs BRY Huang, J Kwon

The system can't perform the operation now. Try again later.

Articles 1–12

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors