Abstract: The proliferation of knowledge graphs and recent advances in Artificial Intelligence have raised great expectations for the combination of symbolic and distributional semantics in cognitive tasks. This is particularly the case for knowledge-based approaches to natural language processing, as near-human symbolic understanding relies on expressive, structured knowledge representations. Engineered by humans, such knowledge graphs are frequently well curated and of high quality, but at the same time they can be labor-intensive to build, brittle, or biased. The work reported in this paper aims to address such limitations by bringing together bottom-up, corpus-based knowledge and top-down, structured knowledge graphs, capturing the semantics of both words and concepts from large document corpora as embeddings in a joint space. To evaluate our results, we perform the largest and most comprehensive empirical study on this topic that we are aware of, analyzing and comparing the quality of the resulting embeddings against competing approaches. We include a detailed ablation study of the different strategies and components of our approach and show that our method outperforms the previous state of the art on standard benchmarks.
Keywords: Joint word and concept embeddings, corpus-based embeddings, knowledge graphs, analysis and benchmarking