Abstracting imperfect information away from two-player zero-sum games S Sokota, R D’Orazio, CK Ling, DJ Wu, JZ Kolter, N Brown International Conference on Machine Learning, 32169-32193, 2023 | 1 | 2023 |
The Update Equivalence Framework for Decision-Time Planning S Sokota, G Farina, DJ Wu, H Hu, KA Wang, JZ Kolter, N Brown arXiv preprint arXiv:2304.13138, 2023 | 2 | 2023 |
Converging to Unexploitable Policies in Continuous Control Adversarial Games M Goldstein, N Brown Deep Reinforcement Learning Workshop NeurIPS 2022, 2022 | | 2022 |
Human-level play in the game of Diplomacy by combining language models with strategic reasoning Meta Fundamental AI Research Diplomacy Team (FAIR)†, A Bakhtin, ... Science 378 (6624), 1067-1074, 2022 | 155 | 2022 |
Mastering the game of no-press Diplomacy via human-regularized reinforcement learning and planning A Bakhtin, DJ Wu, A Lerer, J Gray, AP Jacob, G Farina, AH Miller, ... arXiv preprint arXiv:2210.05492, 2022 | 25 | 2022 |
Human-ai coordination via human-regularized search and learning H Hu, DJ Wu, A Lerer, J Foerster, N Brown arXiv preprint arXiv:2210.05125, 2022 | 6 | 2022 |
AdaptFSP: Adaptive Fictitious Self Play M Goldstein, N Brown | | 2022 |
Modeling strong and human-like gameplay with KL-regularized search AP Jacob, DJ Wu, G Farina, A Lerer, H Hu, A Bakhtin, J Andreas, N Brown International Conference on Machine Learning, 9695-9728, 2022 | 39 | 2022 |
A unified approach to reinforcement learning, quantal response equilibria, and two-player zero-sum games S Sokota, R D'Orazio, JZ Kolter, N Loizou, M Lanctot, I Mitliagkas, ... arXiv preprint arXiv:2206.05825, 2022 | 38 | 2022 |
Equilibrium Finding in Normal-Form Games Via Greedy Regret Minimization H Zhang, A Lerer, N Brown arXiv preprint arXiv:2204.04826, 2022 | 9 | 2022 |
No-press diplomacy from scratch A Bakhtin, D Wu, A Lerer, N Brown Advances in Neural Information Processing Systems 34, 18063-18074, 2021 | 35 | 2021 |
Scalable online planning via reinforcement learning fine-tuning A Fickinger, H Hu, B Amos, S Russell, N Brown Advances in Neural Information Processing Systems 34, 16951-16963, 2021 | 14 | 2021 |
A fine-tuning approach to belief state modeling S Sokota, H Hu, DJ Wu, JZ Kolter, JN Foerster, N Brown International Conference on Learning Representations, 2021 | 6 | 2021 |
Off-belief learning H Hu, A Lerer, B Cui, L Pineda, N Brown, J Foerster International Conference on Machine Learning, 4369-4379, 2021 | 60 | 2021 |
Learned belief search: Efficiently improving policies in partially observable settings H Hu, A Lerer, N Brown, J Foerster arXiv preprint arXiv:2106.09086, 2021 | 6 | 2021 |
Safe search for Stackelberg equilibria in extensive-form games CK Ling, N Brown Proceedings of the AAAI conference on artificial intelligence 35 (6), 5541-5548, 2021 | 7 | 2021 |
Human-level performance in no-press diplomacy via equilibrium search J Gray, A Lerer, A Bakhtin, N Brown arXiv preprint arXiv:2010.02923, 2020 | 43 | 2020 |
Equilibrium Finding for Large Adversarial Imperfect-Information Games N Brown Carnegie Mellon University, 2020 | 22 | 2020 |
Unlocking the potential of deep counterfactual value networks R Zarick, B Pellegrino, N Brown, C Banister arXiv preprint arXiv:2007.10442, 2020 | 16 | 2020 |
Dream: Deep regret minimization with advantage baselines and model-free learning E Steinberger, A Lerer, N Brown arXiv preprint arXiv:2006.10410, 2020 | 53 | 2020 |