Affiliations: [a] IBISC-CNRS, FRE 3190 CNRS, Université d'Evry Val d'Essonne, Tour Evry 2, 523 place des terrasses de l'Agora, F-91000 Evry, France | [b] LIPN-CNRS, UMR 7030, Université Paris 13, 99, Av Jean Baptiste Clément, 93430 Villetaneuse, France
Abstract: In this paper we present Casep2: a hybrid neuro-symbolic system combining case-based reasoning (CBR) and artificial neural networks that aims at clustering and classifying users' behavior in an e-commerce site. A user behavior is represented by a sequence of visited web pages, in a session. Each registered behavior is associated to one of the following classes: buyer or non-buyer. Our goal is to provide a system that mines the web site access log in order to predict the class of an on-going user navigation. One major challenge to face is to provide scalable algorithms that can handle efficiently the large amount of data to learn from. Predictions should be made in real-time, during the current navigation. In addition, raw data has a sequential nature and are very noisy. In the proposed system, two original neural networks, named M-SOM-ART networks, are applied: one to implement the retrieval phase of a CBR cycle, and the second to implement the reuse phase. This hybrid scheme allows to ensure incremental learning as well as efficient treatment of large-scale sequential data. Experiments on real log data of an e-commerce site show the relevancy of the proposed approach.
Keywords: Sequence processing, case based reasoning, self-organizing map, hybrid neuro-CBR systems