Xander Davies

100

2023202493 97

David Scott KruegerUniversity Assistant Professor, University of CambridgeVerified email at cam.ac.uk
Max NadeauHarvard CollegeVerified email at college.harvard.edu
Lauro LangoscoUniversity of CambridgeVerified email at cam.ac.uk
Dmitry KrotovMIT-IBM Watson AI Lab & IBM ResearchVerified email at ibm.com
Gabriel KreimanProfessor, Harvard Medical School and Children's HospitalVerified email at tch.harvard.edu
Trenton BrickenPhD in Systems, Synthetic and Quantitative Biology, Harvard UniversityVerified email at g.harvard.edu
David BauAssistant Professor at Northeastern UniversityVerified email at northeastern.edu
Nikhil PrakashNortheastern UniversityVerified email at northeastern.edu
Tamar Rott ShahamPostdoctoral fellow, MITVerified email at mit.edu

Xander Davies

Verified email at college.harvard.edu - Homepage


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Open problems and fundamental limitations of reinforcement learning from human feedback S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, ... arXiv preprint arXiv:2307.15217, 2023	156	2023
Unifying Grokking and Double Descent X Davies, L Langosco, D Krueger arXiv preprint arXiv:2303.06173, 2023	14	2023
Sparse distributed memory is a continual learner T Bricken, X Davies, D Singh, D Krotov, G Kreiman arXiv preprint arXiv:2303.11934, 2023	9	2023
Circuit Breaking: Removing Model Behaviors with Targeted Ablation M Li, X Davies, M Nadeau*	7	2023
Discovering Variable Binding Circuitry with Desiderata X Davies, M Nadeau, N Prakash*, TR Shaham, D Bau arXiv preprint arXiv:2307.03637, 2023	4	2023
Delayed Generalization: Bridging Double Descent and Grokking X Davies, J Hoogland, L Langosco, D Krueger		2023

The system can't perform the operation now. Try again later.

Articles 1–6

Citations per year