Pedro A. Ortega
Artificial Intelligence & Machine Learning
Verified email at adaptiveagents.org
Title
Cited by
Year
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ...
International conference on machine learning, 3040-3049, 2019
Cited by: 447; Year: 2019
AI safety gridworlds
J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ...
arXiv preprint arXiv:1711.09883, 2017
Cited by: 318; Year: 2017
Thermodynamics as a theory of decision-making with information-processing costs
PA Ortega, DA Braun
Proceedings of the Royal Society A: Mathematical, Physical and Engineering …, 2013
Cited by: 275; Year: 2013
A Medical Claim Fraud/Abuse Detection System based on Data Mining: A Case Study in Chile.
PA Ortega, CJ Figueroa, GA Ruz
DMIN 6, 26-29, 2006
Cited by: 156; Year: 2006
Nash equilibria in multi-agent motor interactions
DA Braun, PA Ortega, DM Wolpert
PLoS computational biology 5 (8), e1000468, 2009
Cited by: 130; Year: 2009
Meta reinforcement learning as task inference
J Humplik, A Galashov, L Hasenclever, PA Ortega, YW Teh, N Heess
arXiv preprint arXiv:1905.06424, 2019
Cited by: 127; Year: 2019
Causal reasoning from meta-reinforcement learning
I Dasgupta, J Wang, S Chiappa, J Mitrovic, P Ortega, D Raposo, ...
arXiv preprint arXiv:1901.08162, 2019
Cited by: 120; Year: 2019
Meta-learning of sequential strategies
PA Ortega, JX Wang, M Rowland, T Genewein, Z Kurth-Nelson, ...
arXiv preprint arXiv:1905.03030, 2019
Cited by: 80; Year: 2019
Information, utility and bounded rationality
PA Ortega, DA Braun
Artificial General Intelligence: 4th International Conference, AGI 2011 …, 2011
Cited by: 79; Year: 2011
A minimum relative entropy principle for learning and acting
PA Ortega, DA Braun
Journal of Artificial Intelligence Research 38, 475-511, 2010
Cited by: 78; Year: 2010
From Poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization
J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ...
International Conference on Machine Learning, 8525-8535, 2021
Cited by: 73; Year: 2021
Neural networks and the Chomsky hierarchy
G Delétang, A Ruoss, J Grau-Moya, T Genewein, LK Wenliang, E Catt, ...
arXiv preprint arXiv:2207.02098, 2022
Cited by: 69; Year: 2022
Path integral control and bounded rationality
DA Braun, PA Ortega, E Theodorou, S Schaal
2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011
Cited by: 65; Year: 2011
Action and perception as divergence minimization
D Hafner, PA Ortega, J Ba, T Parr, K Friston, N Heess
arXiv preprint arXiv:2009.01791, 2020
Cited by: 54; Year: 2020
Intrinsic social motivation via causal influence in multi-agent RL
N Jaques, A Lazaridou, E Hughes, C Gulcehre, PA Ortega, DJ Strouse, ...
Cited by: 53; Year: 2018
Generalized Thompson sampling for sequential decision-making and causal inference
PA Ortega, DA Braun
Complex Adaptive Systems Modeling 2 (2), 2014
Cited by: 48; Year: 2014
Laser processing of Al2O3/a‐SiCx:H stacks: a feasible solution for the rear surface of high‐efficiency p‐type c‐Si solar cells
I Martín, P Ortega, M Colina, A Orpella, G López, R Alcubilla
Progress in Photovoltaics: Research and Applications 21 (5), 1171-1175, 2013
Cited by: 47; Year: 2013
Shaking the foundations: delusions in sequence models for interaction and control
PA Ortega, M Kunesch, G Delétang, T Genewein, J Grau-Moya, J Veness, ...
arXiv preprint arXiv:2110.10819, 2021
Cited by: 41; Year: 2021
Human decision-making under limited time
PA Ortega, AA Stocker
Advances in Neural Information Processing Systems 29, 2016
Cited by: 40; Year: 2016
Information-Theoretic Bounded Rationality
PA Ortega, DA Braun, JS Dyer, KE Kim, N Tishby
arXiv preprint arXiv:1512.06789, 2015
Cited by: 39; Year: 2015