An empirical evaluation of cost-based federated SPARQL query processing engines

Qudus, Umair; Saleem, Muhammad; Ngonga Ngomo, Axel-Cyrille; Lee, Young-Koo

doi:10.3233/SW-200420

An empirical evaluation of cost-based federated SPARQL query processing engines

Issue title: Storing, Querying, and Benchmarking the Web of Data

Guest editors: Muhammad Saleem, Ruben Verborgh, Muhammad Intizar Ali and Olaf Hartig

Article type: Research Article

Authors: Qudus, Umair^a | Saleem, Muhammad^b | Ngonga Ngomo, Axel-Cyrille^c | Lee, Young-Koo^{a; *}

Affiliations: [a] DKE, Kyung Hee University, South Korea. E-mails: [email protected], [email protected] | [b] AKSW, Leipzig, Germany. E-mail: [email protected] | [c] University of Paderborn, Germany. E-mail: [email protected]

Correspondence: [*] Corresponding author. E-mail: [email protected].

Abstract: Finding a good query plan is key to the optimization of query runtime. This holds in particular for cost-based federation engines, which make use of cardinality estimations to achieve this goal. A number of studies compare SPARQL federation engines across different performance metrics, including query runtime, result set completeness and correctness, number of sources selected and number of requests sent. Albeit informative, these metrics are generic and unable to quantify and evaluate the accuracy of the cardinality estimators of cost-based federation engines. To thoroughly evaluate cost-based federation engines, the effect of estimated cardinality errors on the overall query runtime performance must be measured. In this paper, we address this challenge by presenting novel evaluation metrics targeted at a fine-grained benchmarking of cost-based federated SPARQL query engines. We evaluate five cost-based federated SPARQL query engines using existing as well as novel evaluation metrics by using LargeRDFBench queries. Our results provide a detailed analysis of the experimental outcomes that reveal novel insights, useful for the development of future cost-based federated SPARQL query processing engines.

Keywords: SPARQL, benchmarking, cost-based, cost-free, federated, querying

DOI: 10.3233/SW-200420

Journal: Semantic Web, vol. 12, no. 6, pp. 843-868, 2021

Published: 4 October 2021

Price: EUR 27.50

North America

IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA

Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

Europe

IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands

Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]

For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]

Asia

Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China

Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

如果您在出版方面需要帮助或有任何建, 件至: [email protected]

Share this:

North America

Europe

Asia