Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Subtitle:
Article type: Research Article
Authors: Lin, Jerry Chun-Weia; b; * | Hong, Tzung-Peic; d | Gan, Wenshenga | Chen, Hsin-Yie | Li, Sheng-Tunf
Affiliations: [a] Innovative Information Industry Research Center, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, China | [b] Shenzhen Key Laboratory of Internet Information Collaboration, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, China | [c] Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan | [d] Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan | [e] Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan | [f] Department of Industrial and Information Management, National Cheng Kung University, Tainan, Taiwan
Correspondence: [*] Corresponding author: Chun-Wei Lin, Innovative Information Industry Research Center/Shenzhen Key Laboratory of Internet Information Collaboration, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, China. E-mail:[email protected]
Abstract: Mining useful information from large databases has become an important research area in recent years. Among the classes of knowledge derived, sequential pattern can be applied in many domains, such as market analysis, web click streams, and biological data. The fast updated sequential pattern tree (FUSP-tree) algorithm was proposed to update discovered sequential patterns in incremental mining. However, it must rescan the original database for maintaining discovered sequential patterns. This study proposes the PreFUSP-TREE-INS algorithm based on the pre-large concept for maintaining discovered sequential patterns without rescanning the original database until the cumulative number of newly added customer sequences exceeds a safety bound. The execution time for reconstructing the tree when old or new customer sequences are added into the original database is reduced by using pre-large sequences. The pre-large sequences are defined by lower and upper support thresholds that prevent the movement of sequences directly from large to small and vice versa. Experiments are conducted to show the performance of the proposed algorithm for various minimum support thresholds and ratios of inserted sequences.
Keywords: Data mining, sequential pattern, FUSP-tree, large sequence, incremental mining, dynamic databases
DOI: 10.3233/IDA-150759
Journal: Intelligent Data Analysis, vol. 19, no. 5, pp. 1071-1089, 2015
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]