Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

CiteULike is a free service for managing and discovering scholarly references - click here to get started.

Sign In to gain access to subscriptions and/or personal tools.
Adaptive Behavior
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Panangadan, A.
Right arrow Articles by Dyer, M. G.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Construction in a Simulated Environment Using Temporal Goal Sequencing and Reinforcement Learning

Anand Panangadan

Saban Research Institute, Childrens Hospital of Los Angeles, anandvp{at}usc.edu

Michael G. Dyer

Computer Science Department, University of California Los Angeles, dyer{at}cs.ucla.edu

A behavior-based architecture (ConAg) with a connectionist action selection mechanism is introduced that enables a society of autonomous agents to construct arbitrary structures in their simulated two-dimensional world. Construction in this environment involves the agents picking up colored discs and dropping them at incomplete parts of the structure being built.

The ConAg architecture provides both reactive behaviors which are used to maintain the viability of the agent and navigational planning behaviors that are used for construction. The action selection mechanism enables learning the sequence of behaviors required for construction by reinforcement learning. The navigational planning behaviors use a grid-based representation of the world. The shape of the structure to be built is also encoded on an internal spatial map. Path planning is implemented by spreading activations on sets of grid-based maps so that the agents perform the construction task efficiently. Construction of arbitrary structures is supported by temporal sequencing of goals. We present simulation results that demonstrate the performance of the architecture and algorithms.

Key Words: construction • reinforcement learning • spatial map • spreading activation

Adaptive Behavior, Vol. 17, No. 1, 81-104 (2009)
DOI: 10.1177/1059712308101787


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?