Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Sign In to gain access to subscriptions and/or personal tools.
Adaptive Behavior
This Article
Right arrow Free Full Text (Free PDF) Free
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Web of Science (7)
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Stone, P.
Right arrow Articles by Kuhlmann, G.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Reinforcement Learning for RoboCup Soccer Keepaway

Peter Stone

Department of Computer Sciences, The University of Texas at Austin, pstone{at}cs.utexas.edu

Richard S. Sutton

Department of Computing Science, University of Alberta, sutton{at}cs.ualberta.ca

Gregory Kuhlmann

Department of Computer Sciences, The University of Texas at Austin, kuhlmann{at}cs.utexas.edu

RoboCup simulated soccer presents many challenges to reinforcement learning methods, including a large state space, hidden and uncertain state, multiple independent agents learning simultaneously, and long and variable delays in the effects of actions. We describe our application of episodic SMDP Sarsa({lambda}) with linear tile-coding function approximation and variable {lambda} to learning higher-level decisions in a keepaway subtask of RoboCup soccer. In keepaway, one team, "the keepers," tries to keep control of the ball for as long as possible despite the efforts of "the takers." The keepers learn individually when to hold the ball and when to pass to a teammate. Our agents learned policies that significantly outperform a range of benchmark policies. We demonstrate the generality of our approach by applying it to a number of task variations including different field sizes and different numbers of players on each team.

Key Words: multiagent systems • machine learning • multiagent learning • reinforcement learning • robot soccer

Adaptive Behavior, Vol. 13, No. 3, 165-188 (2005)
DOI: 10.1177/105971230501300301


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Adaptive BehaviorHome page
J. L. Krichmar
The Neuromodulatory System: A Framework for Survival and Adaptive Behavior in a Challenging World
Adaptive Behavior, December 1, 2008; 16(6): 385 - 399.
[Abstract] [PDF]


Home page
Adaptive BehaviorHome page
S. Whiteson, M. E. Taylor, and P. Stone
Empirical Studies in Action Selection with Reinforcement Learning
Adaptive Behavior, March 1, 2007; 15(1): 33 - 50.
[Abstract] [PDF]