| Multiagent systems: A survey from a machine learning perspective P Stone, M Veloso Autonomous Robots 8 (3), 345-383, 2000 | 1514 | 2000 |
| Transfer learning for reinforcement learning domains: A survey ME Taylor, P Stone Journal of Machine Learning Research 10 (Jul), 1633-1685, 2009 | 1022 | 2009 |
| A multiagent approach to autonomous intersection management K Dresner, P Stone Journal of artificial intelligence research 31, 591-656, 2008 | 735 | 2008 |
| Layered learning P Stone, M Veloso European Conference on Machine Learning, 369-381, 2000 | 642 | 2000 |
| Task decomposition, dynamic role assignment, and low-bandwidth communication for real-time strategic teamwork P Stone, M Veloso Artificial Intelligence 110 (2), 241-273, 1999 | 580 | 1999 |
| Policy gradient reinforcement learning for fast quadrupedal locomotion N Kohl, P Stone IEEE International Conference on Robotics and Automation, 2004. Proceedings …, 2004 | 538 | 2004 |
| Multiagent traffic management: A reservation-based intersection control mechanism K Dresner, P Stone Proceedings of the Third International Joint Conference on Autonomous Agents …, 2004 | 496 | 2004 |
| Deep recurrent q-learning for partially observable mdps M Hausknecht, P Stone 2015 AAAI Fall Symposium Series, 2015 | 493 | 2015 |
| Reinforcement learning for robocup soccer keepaway P Stone, RS Sutton, G Kuhlmann Adaptive Behavior 13 (3), 165-188, 2005 | 488 | 2005 |
| The RoboCup synthetic agent challenge 97 H Kitano, M Tambe, P Stone, M Veloso, S Coradeschi, E Osawa, ... Robot Soccer World Cup, 62-73, 1997 | 470 | 1997 |
| Layered learning in multi-agent systems PH Stone CARNEGIE-MELLON UNIV PITTSBURGH PA SCHOOL OF COMPUTER SCIENCE, 1998 | 357 | 1998 |
| Evolutionary function approximation for reinforcement learning S Whiteson, P Stone Journal of Machine Learning Research 7 (May), 877-917, 2006 | 294 | 2006 |
| Scaling reinforcement learning toward RoboCup soccer P Stone, RS Sutton Icml 1, 537-544, 2001 | 262 | 2001 |
| Multiagent traffic management: An improved intersection control mechanism K Dresner, P Stone Proceedings of the fourth international joint conference on Autonomous …, 2005 | 258 | 2005 |
| Interactively shaping agents via human reinforcement: The TAMER framework WB Knox, P Stone Proceedings of the fifth international conference on Knowledge capture, 9-16, 2009 | 248 | 2009 |
| Layered approach to learning client behaviors in the robocup soccer server P Stone, M Veloso Applied Artificial Intelligence 12 (2-3), 165-188, 1998 | 218 | 1998 |
| Ad hoc autonomous agent teams: Collaboration without pre-coordination P Stone, GA Kaminka, S Kraus, JS Rosenschein Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010 | 202 | 2010 |
| Machine learning for fast quadrupedal locomotion N Kohl, P Stone AAAI 4, 611-616, 2004 | 193 | 2004 |
| Transfer learning via inter-task mappings for temporal difference learning ME Taylor, P Stone, Y Liu Journal of Machine Learning Research 8 (Sep), 2125-2167, 2007 | 183 | 2007 |
| The 2001 trading agent competition MP Wellman, A Greenwald, P Stone, PR Wurman Electronic Markets 13 (1), 4-12, 2003 | 182 | 2003 |