Extracting entity-specific substructures for RDF graph embeddings

Saeed, Muhammad Rizwan; Chelmis, Charalampos; Prasanna, Viktor K.

doi:10.3233/SW-190359

Issue title: Knowledge Graphs: Construction, Management and Querying

Guest editors: Mayank Kejriwal, Vanessa Lopez and Juan F. Sequeda

Article type: Research Article

Authors: Saeed, Muhammad Rizwan^{a; *} | Chelmis, Charalampos^b | Prasanna, Viktor K.^a

Affiliations: [a] Ming Hsieh Department of Electrical Engineering, University of Southern California, Los Angeles, CA, USA. E-mails: [email protected], [email protected] | [b] Department of Computer Science, University at Albany – SUNY, Albany, NY, USA. E-mail: [email protected]

Correspondence: [*] Corresponding author. E-mail: [email protected].

Abstract: Knowledge Graphs (KGs) have become useful sources of structured data for information retrieval and data analytics tasks. Enabling complex analytics, however, requires entities in KGs to be represented in a way that is suitable for Machine Learning tasks. Several approaches have recently been proposed for obtaining vector representations of KGs by identifying and extracting relevant graph substructures using both uniform and biased random walks. However, such approaches lead to representations comprising mostly popular, rather than relevant, entities in the KG. In KGs that contain many different types of entities (such as Linked Open Data), a given target entity may have its own distinct set of most relevant nodes and edges. We propose specificity as an accurate measure for identifying the most relevant, entity-specific nodes and edges. We develop a scalable method based on bidirectional random walks to compute specificity. Our experimental evaluation shows that specificity-based biased random walks extract more meaningful (in terms of size and relevance) substructures than the state of the art, and that the graph embeddings learned from the extracted substructures perform well against existing methods in common data mining tasks.

Keywords: Relevance metrics, graph embedding, Linked Open Data, data mining, recommender systems, RDF, SPARQL, Semantic Web, DBpedia

Journal: Semantic Web, vol. 10, no. 6, pp. 1087-1108, 2019

Published: 28 October 2019
