Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Cheng, Penga; d | Lee, Ivanb | Lin, Chun-Weia | Pan, Jeng-Shyanga; c; *
Affiliations: [a] Shenzhen Graduate School, Harbin Institute of Technology, Shenzhen, Guangdong, China | [b] School of IT and Mathematical Sciences, University of South Australia, South Australia, Australia | [c] College of Information Science and Engineering, Fujian University of Technology, Fuzhou, Fujian, China | [d] School of Computer and Information Science, Southwest University, Chongqing, China
Correspondence: [*] Corresponding author: Jeng-Shyang Pan, Shenzhen Graduate School, Harbin Institute of Technology, Shenzhen, Guangdong, China. E-mail:[email protected]
Abstract: When data mining techniques are applied to discover useful knowledge behind a large data collection, they are often required to preserve some confidential information, such as sensitive frequent itemsets, rules and so on. A feasible way to ensure the confidentiality is to sanitize the database and conceal sensitive information. However, the sanitization process often produces side effects, thus minimizing these side effects is an important task. An important but ignored fact is that a tradeoff exists within different side effects. When attempting to improve the performance on one dimension, the performance on other dimensions often will be degraded. In this paper, we focus on privacy preserving in association rule mining. Since there is a tradeoff within different side effects, we tried to minimize them from the view of multi-objective optimization. A rule hiding approach based on evolutionary multi-objective optimization (EMO) is proposed. It hides sensitive rules through removing identified items. The side effects on missing non-sensitive rules, ghost rules and data loss are formulated as optimization objectives. EMO is utilized to find a suitable subset of transactions for modification so that side effects can be minimized. Experimental results on real datasets illustrate that the proposed approach can achieve satisfactory results with fewer side effects. In addition, the EMO-based approach can produce multiple hiding solutions in a single run. It provides the opportunity for a user to choose freely the preferred one by preference or experience.
Keywords: Privacy preserving data mining, association rule hiding, evolutionary multi-objective optimization, EMO
DOI: 10.3233/IDA-160817
Journal: Intelligent Data Analysis, vol. 20, no. 3, pp. 495-514, 2016
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]