| Sign In to gain access to subscriptions and/or personal tools. |
ActorCritic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial RatsAnimatLab, LIP6, Paris, France; LPPA, CNRSCollège de France, Paris, France
AnimatLab, LIP6, Paris, France
AnimatLab, LIP6, Paris, France; LPPA, CNRSCollège de France, Paris, France
LPPA, CNRSCollège de France, Paris, France
AnimatLab, LIP6, Paris, France Since 1995, numerous ActorCritic architectures for reinforcement learning have been proposed as models of dopamine-like reinforcement learning mechanisms in the rats basal ganglia. However, these models were usually tested in different tasks, and it is then difficult to compare their efficiency for an autonomous animat. We present here the comparison of four architectures in an animat as it per forms the same reward-seeking task. This will illustrate the consequences of different hypotheses about the management of different Actor sub-modules and Critic units, and their more or less autono mously determined coordination. We show that the classical method of coordination of modules by mixture of experts, depending on each modules performance, did not allow solving our task. Then we address the question of which principle should be applied efficiently to combine these units. Improve ments for Critic modeling and accuracy of ActorCritic models for a natural task are finally discussed in the perspective of our Psikharpax projectan artificial rat having to survive autonomously in unpre dictable environments.
Key Words: animat approach TD learning ActorCritic model SR task taxon navigation
Adaptive Behavior, Vol. 13, No. 2,
131-148 (2005) This article has been cited by other articles:
|
|||||||||||||||
