RT Journal Article
JF Intelligent Agent Technology, IEEE / WIC / ACM International Conference on
YR 2007
VO 00
IS
SP 277
TI Reinforcement Learning with Inertial Exploration
A1 Julien Laumonier,
A1 Dany Bergeron,
A1 Charles Desjardins,
A1 Brahim Chaib-draa,
K1 null
AB In the Q-Learning framework, the exploration of large environment is influenced by the time credit assignment problem. In this context, abstraction techniques may be used. Thus, multi-step actions (MSA) Q-Learning has been proposed to take advantage of the fact that few action switches are usually required in optimal policies. In this article, we propose the concept of inertial exploration, we apply a log-selection of the scales to MSA Q-Learning and we go further by proposing a dynamic time scale approach. We demonstrate that the same improvement in learning speed can be achieved without the full scales set. This improvement is shown on the mountain car problem and on a more realistic application of vehicle control.
PB IEEE Computer Society, [URL:http://www.computer.org]
SN
LA English
DO 10.1109/IAT.2007.74
LK http://doi.ieeecomputersociety.org/10.1109/IAT.2007.74