Experiences and lessons learned in publishing a large dataset: Detailed tables from the Population and Occupational Censuses 1947

Research output: Contribution to journal/periodicalArticleScientificpeer-review

1 Downloads (Pure)

Abstract

Since the end of the nineties Dutch census publications have been digitized and
made available for digital processing. New analyses of the data were presented
in some fruitful conferences in the first decade of this century. In addition to the
census publications, a mass of detailed census data was found in dossiers and
so-called 'transparencies' in the archive of Statistics Netherlands. Most of that
material was scanned into digital images, awaiting further content conversion
into numeric data. In the present article we describe the process of digitizing the
detailed tables of the Dutch Population and Occupational Censuses held in 1947,
which is the first set of detailed data from this source that is made available in
digitally processible form. We give an example of historical analyses made
possible by this dataset. Moreover, we take these census data as an example of
preparing and publishing a large dataset. Experiences and lessons learned in the
process lead to ample opportunities for further analysis of the data and for
efficient ways to accomplish the content conversion of the many remaining
images of census data.
Original languageEnglish
JournalResearch Data Journal for the Humanities and Social Sciences
Publication statusSubmitted - 14 Apr 2021

Keywords

  • large dataset
  • census data
  • Netherlands
  • 1947
  • data-entry
  • versioning
  • documentation method
  • preferred format
  • CSV-text files

Fingerprint

Dive into the research topics of 'Experiences and lessons learned in publishing a large dataset: Detailed tables from the Population and Occupational Censuses 1947'. Together they form a unique fingerprint.

Cite this