
"Re-make/Re-model": Should big data change the modelling paradigm in official statistics?


Big data offers many opportunities for official statistics: for example, increased resolution, better timeliness, and new statistical outputs. But there are also many challenges: uncontrolled changes in sources that threaten continuity, a lack of identifiers that impedes linking to population frames, and data that refer only indirectly to the phenomena of statistical interest. We discuss two approaches to dealing with these challenges and opportunities.

First, we may accept big data for what they are: an imperfect yet timely indicator of phenomena in society. These data exist, and that alone makes them interesting. Second, we may extend this approach with explicit modelling, where new methods such as machine-learning techniques can be considered alongside more traditional methods such as Bayesian techniques.

National statistical institutes (NSIs) have always been reluctant to use models, apart from specific cases such as small-area estimation. Based on the experience at Statistics Netherlands, we argue that NSIs need not be afraid to use models, provided that their use is documented and made transparent to users. Moreover, since the primary purpose of an NSI is to describe society, we should refrain from making forecasts. The models used should therefore rely on actually observed data, and they should be validated extensively.