Abstract
In this paper we present a new method for real-time robot policy adaptation that combines learning and evolution, allowing the robot to adapt its policy as environmental conditions change. In our method, evolutionary computation is applied to find the optimal relation between the reinforcement learning parameters and robot performance. The proposed algorithm is evaluated in a simulated environment of the Cyber Rodent (CR) robot, where the robot must increase its energy level by capturing active battery packs. The CR robot lives in two environments with different settings that replace each other four times. Results show that evolution can generate an optimal relation between robot performance and the exploration-exploitation balance of reinforcement learning, enabling the robot to adapt its strategy online as environmental conditions change.
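The paper itself contains no code, but the core idea of the abstract — evolving a mapping from the robot's recent performance to the exploration rate of its reinforcement learner, so that exploration rises when the environment changes — can be illustrated with a toy sketch. Everything below is an assumption for illustration only: the two-armed bandit whose better arm switches mid-episode stands in for the CR robot's changing environments, and the function names (`run_episode`, `evolve`), the linear performance-to-epsilon mapping, and all parameter values are hypothetical, not taken from the paper.

```python
import random

def run_episode(genes, steps=400, seed=0):
    """Evaluate one individual: a pair (a, b) mapping a running
    performance estimate to the exploration rate
    eps = clip(a - b * perf, 0.01, 1.0).
    Environment: two-armed bandit whose better arm switches mid-episode,
    standing in for an environment change the learner must adapt to."""
    rng = random.Random(seed)
    a, b = genes
    q = [0.0, 0.0]        # Q-values for the two arms
    alpha = 0.1           # learning rate
    perf, total = 0.0, 0.0
    for t in range(steps):
        best = 0 if t < steps // 2 else 1          # environment change
        eps = min(1.0, max(0.01, a - b * perf))    # evolved exploration schedule
        action = rng.randrange(2) if rng.random() < eps else q.index(max(q))
        reward = 1.0 if action == best else 0.0
        q[action] += alpha * (reward - q[action])  # tabular Q-learning update
        perf = 0.95 * perf + 0.05 * reward         # running performance estimate
        total += reward
    return total

def evolve(pop_size=20, generations=30, seed=1):
    """Simple elitist GA over (a, b): keep the better half,
    refill with Gaussian-mutated copies."""
    rng = random.Random(seed)
    pop = [(rng.uniform(0, 1), rng.uniform(0, 1)) for _ in range(pop_size)]
    for _ in range(generations):
        elite = sorted(pop, key=run_episode, reverse=True)[: pop_size // 2]
        pop = elite + [(max(0.0, a + rng.gauss(0, 0.1)),
                        max(0.0, b + rng.gauss(0, 0.1))) for a, b in elite]
    return max(pop, key=run_episode)
```

The evolved pair tends to encode "explore more when performance drops": a sets the baseline exploration and b suppresses it while the running reward estimate is high, which is the qualitative relation between performance and exploration-exploitation that the paper reports evolution discovering.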
© 2011 IFIP International Federation for Information Processing
Capi, G., Toda, H., Kaneko, S.I. (2011). Real Time Robot Policy Adaptation Based on Intelligent Algorithms. In: Iliadis, L., Maglogiannis, I., Papadopoulos, H. (eds) Artificial Intelligence Applications and Innovations. EANN/AIAI 2011. IFIP Advances in Information and Communication Technology, vol 364. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23960-1_1
Print ISBN: 978-3-642-23959-5
Online ISBN: 978-3-642-23960-1