Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Wu, Chenga | Song, Huichuna | Yan, Changshengb | Wang, Yiminga; *
Affiliations: [a] School of Urban Rail Transportation, Soochow University, Suzhou, China | [b] Hewlett-Packard Laboratories, Suzhou, China
Correspondence: [*] Corresponding author. Yiming Wang, School of Urban Rail Transportation, Soochow University, Suzhou 215011, China. Tel.: +86 18001555858; Fax: +86 512 67601052; E-mail: [email protected].
Note: [1] This work was supported in part by the National Natural Science Foundation of China (Grant No. 61471252).
Abstract: Reinforcement learning is hard to solve optimization problems in multi-agent system because of the inefficiency of function approximation. Sparse distributed memories, which is implemented using Radial Basis Functions or Kanerva Coding, can be used to improve the efficiency. But this approach still often gives poor performance when applied to large-scale multi-agent systems. In this paper, we attempt to solve four-rooms problem in the predator-prey pursuit domain and argue that the poor performance that we observe is caused by frequent prototype collisions. We show that dynamic prototype allocation and adaptation can give better results by reducing these collisions. By using our novel approach about fuzzy Kanerva-based function approximation, that uses a fine-grained fuzzy membership grade to describe a state-action pair’s adjacency with respect to each prototype, we give some results that prototype collisions are completely eliminated and learning performance is greatly improved. We further show that prototype density varies widely across the state-action space and that this variation causes prototypes’ receptive fields to be unevenly distributed. This distribution limits the ability of fuzzy Kanerva Coding to achieve better results. It can be observed that fuzzy Kanerva Coding allows prototypes to adaptively tune their receptive fields for a target application. We conclude that fuzzy Kanerva Coding with prototype tuning and adaptation can significantly improve a reinforcement learner’s ability to solve large-scale multi-agent problems.
Keywords: Reinforcement learning, function approximation, fuzzy system, pursuit problem
DOI: 10.3233/IFS-162212
Journal: Journal of Intelligent & Fuzzy Systems, vol. 32, no. 6, pp. 3909-3920, 2017
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]