Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Issue title: Knowledge Discovery from Data Streams
Guest editors: João Gamax, Jesus Aguilar-Ruizy and Ralf Klinkenbergz
Article type: Research Article
Authors: Jaroszewicz, Szymona | Ivantysynova, Lenkab | Scheffer, Tobiasc
Affiliations: [a] National Institute of Telecommunications, Warsaw, Poland. E-mail: [email protected] | [b] Humboldt-Universität zu Berlin, Berlin, Germany. E-mail: [email protected] | [c] Max Planck Institute for Computer Science, Saarbrücken, Germany. E-mail: [email protected] | [x] LIAAD-University of Porto, Porto, Portugal | [y] Polytechnic Pablo de Olavide University, Seville, Spain | [z] University of Dortmund, Dortmund, Germany
Abstract: We address the problem of matching imperfectly documented schemas of data streams and large databases. Instance-level schema matching algorithms identify likely correspondences between attributes by quantifying the similarity of their corresponding values. However, exact calculation of these similarities requires processing of all database records – which is infeasible for data streams. We devise a fast matching algorithm that uses only a small sample of records, and is yet guaranteed to find a matching that is a close approximation of the matching that would be obtained if the entire stream were processed. The method can be applied to any given (combination of) similarity metrics that can be estimated from a sample with bounded error; we apply the algorithm to several metrics. We give a rigorous proof of the method's correctness and report on experiments using large databases.
DOI: 10.3233/IDA-2008-12302
Journal: Intelligent Data Analysis, vol. 12, no. 3, pp. 253-270, 2008
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]