Experiences and lessons learned in publishing a large dataset: Detailed tables from the Population and Occupational Censuses 1947

Research output: Contribution to journal/periodical, Article, Academic, Peer-reviewed

Abstract

Since the late 1990s, Dutch census publications have been digitized and made available for digital processing. New analyses of the data have been presented at several fruitful conferences. Besides the census publications, a mass of detailed census data is held in dossiers and sets of worksheets in the archive of Statistics Netherlands. Most of that material has been scanned into digital images. The detailed data of the 1947 Population Census are the first such set to be made available in digitally processable form. This article describes the extensive steps taken to prepare the resulting dataset. Special attention is paid to preparing a dataset with a very large number of files, to the organization of the dataset, and to documenting the process. The work yielded a systematic and reproducible method for preparing such a large dataset. Presenting the data in the preferred format of CSV text files appears to offer ample opportunities for further analysis.
Original language: English
Journal: Research Data Journal for the Humanities and Social Sciences
Status: Submitted - 30 Nov 2020
