Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Peters, James F.a; * | Henry, Christophera | Gunderson, David S.b
Affiliations: [a] Department of Electrical and Computer Engineering, University of Manitoba, Winnipeg, Manitoba R3T 5V6, Canada | [b] Department of Mathematics, University of Manitoba, Winnipeg, Manitoba R3T 5V6, Canada
Correspondence: [*] Corresponding author. E-mail: [email protected]
Abstract: The problem considered in this paper is how to mimic the adaptive learning control strategy called run-and-twiddle (RT) that can be observed in the behaviour of certain biological organisms. In 1984, Oliver Selfridge observed RT behaviour in E. coli bacteria, ants and male silk moths, where an organism continues its movement in a particular direction as long as the reward signal it receives from the environment has sufficient strength. The proposed approach to mimicking RT movement control behaviour has two basic forms in this paper. First, we follow a suggestion made by Chris J.C.H. Watkins in 1989 that a stopping time (strategy for determining when to twiddle) results from a a comparison of the value of the current state s with the value of the state s' in the next time step. Second, we consider various forms of actor-critic learning within the context of approximation spaces introduced by Zdzisław Pawlak during the early 1980s. Results of experiments with a digital camera-based target tracking system using reinforcement comparison introduced by Richard Sutton and Andrew Barto during the late 1990s and various new forms of actor-critic learning are reported in this paper. The contribution of this paper is a presentation of a number of biologically-inspired adaptive learning control strategies that are useful in target tracking.
Keywords: Actor-critic method, biologically-inspired adaptive learning, approximation space, control strategy, rough sets, run-and-twiddle, stopping time
DOI: 10.3233/HIS-2007-4401
Journal: International Journal of Hybrid Intelligent Systems, vol. 4, no. 4, pp. 203-216, 2007
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]