22Statistical Data Production in a Digitized Age: The Need to Establish Successful Workflows for Micro Data Access

Stefan Bender1, 2, Jannick Blaschke1, and Christian Hirsch1

1Data Service Center, Deutsche Bundesbank, Frankfurt, Germany

2School of Social Science, University of Mannheim, Mannheim, Germany

22.1 Introduction

Nowadays, empirical researchers and statisticians find themselves in a curious situation. On the one hand, data are everywhere and come from an ever‐increasing number of different sources. Researchers use more data they no longer directly collect themselves (e.g. via surveys). Instead, they often analyze organic data (Groves 2011) collected for other purposes that are now being reused, e.g. via ETL1 or an adapted version of the Total Survey Error Approach (Biemer et al. 2017; Amaya et al. 2020). Many chapters in this book are evidence of how fundamentally the emergence of these new data sources has transformed the practice of social science research.

On the other hand, a surprisingly large amount of relevant data remains hidden in tightly regulated silos, which means they are underexploited by empirical research and statistics (e.g. SVR‐Gutachten 2021). One reason lies in the nature of the data themselves, which oftentimes allow the disclosure of information about an individual person's health or a company's business model. Initiatives like the FAIR data principles (Wilkinson et al. 2016; European Commission 2018) or the reproducibility standards of the AEA ...

Get Advances in Business Statistics, Methods and Data Collection now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.