Affiliations: Department of Computer Science, University of Bari, Bari, Italy
Correspondence: Fabrizio Giuseppe Ventola, Department of Computer Science, University of Bari, Via E. Orabona 4, 70125 Bari, Italy. E-mail: [email protected].
Abstract: Sum-Product Networks (SPNs) are recently introduced deep probabilistic models providing exact and tractable inference. SPNs have been successfully employed as accurate density estimators in several application domains, from computer vision to natural language processing. However, learning their structure and parameters from high-dimensional data poses a challenge in terms of time complexity. Classical SPN structure learning algorithms work by repeatedly performing two high-cost operations: determining independencies among random variables (RVs), which introduces product nodes, and finding sub-populations among samples, which introduces sum nodes. Even one of the simplest greedy structure learners, LearnSPN, scales quadratically in the number of variables when determining RV independencies. In this work, we investigate the trade-off between accuracy and efficiency when employing approximate but fast procedures to determine independencies among RVs. We introduce and evaluate sub-quadratic splitting procedures based on a random subspace approach and on entropy as a proxy criterion for separating independent RVs. Experimental results on many benchmark datasets for density estimation show that LearnSPN-like structure learners, when equipped with our splitting procedures, achieve reduced learning and/or inference times while generally containing the degradation of inference accuracy. Ultimately, we provide an empirical confirmation of a “no free lunch” behavior when learning the structure of SPNs.
Keywords: Machine learning, deep learning, structure learning, probabilistic models, density estimation, Sum-Product Networks
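To make the flavor of such a sub-quadratic splitting procedure concrete, the following minimal Python sketch (all names, parameters, and thresholds here are illustrative assumptions, not the paper's actual algorithm) scores only a randomly sampled subset of variable pairs, using empirical mutual information, an entropy-based quantity, as a cheap proxy for a full pairwise independence test, and then groups RVs into connected components:

```python
import numpy as np

def random_pairwise_split(data, n_pairs=50, mi_threshold=0.01, seed=0):
    """Hypothetical sketch: approximate RV independence splitting.

    Rather than testing all O(n^2) variable pairs, score a random sample
    of pairs with empirical mutual information (an entropy-based proxy)
    and merge dependent variables into the same group via union-find.
    """
    rng = np.random.default_rng(seed)
    n_vars = data.shape[1]

    def entropy(x):
        # Empirical Shannon entropy of a discrete sample, in bits.
        _, counts = np.unique(x, return_counts=True)
        p = counts / counts.sum()
        return -np.sum(p * np.log2(p))

    def joint_entropy(x, y):
        _, counts = np.unique(np.stack([x, y], axis=1), axis=0,
                              return_counts=True)
        p = counts / counts.sum()
        return -np.sum(p * np.log2(p))

    # Union-find: variables judged dependent end up in one component.
    parent = list(range(n_vars))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for _ in range(n_pairs):
        i, j = rng.choice(n_vars, size=2, replace=False)
        # I(X;Y) = H(X) + H(Y) - H(X,Y); MI above threshold => dependent.
        mi = (entropy(data[:, i]) + entropy(data[:, j])
              - joint_entropy(data[:, i], data[:, j]))
        if mi > mi_threshold:
            parent[find(i)] = find(j)

    groups = {}
    for v in range(n_vars):
        groups.setdefault(find(v), []).append(v)
    return list(groups.values())
```

Sampling a fixed budget of n_pairs pairs instead of all n(n-1)/2 keeps the splitting step sub-quadratic in the number of variables, at the cost of possibly missing weak dependencies. This mirrors the accuracy/efficiency trade-off investigated in the paper.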