CEDAR: The Dutch Historical Censuses as Linked Open Data

A. Meroño-Peñuela, A. Ashkpour, Christophe Dominique Marie Guéret, S. Schlobach

Research output: Contribution to journal/periodicalArticleScientificpeer-review

226 Downloads (Pure)

Abstract

In this document we describe the CEDAR dataset, a five-star Linked Open Data representation of the Dutch historical censuses, conducted in the Netherlands once every 10 years from 1795 to 1971. We produce a linked dataset from a digitized sample of 2,288 tables. The dataset contains more than 6.8 million statistical observations about the demography, labour and housing of the Dutch society in the 18th, 19th and 20th centuries. The dataset is modeled using the RDF Data Cube vocabulary for multidimensional data, uses Open Annotation to express rules of data harmonization, and keeps track of the provenance of every single data point and its transformations using PROV. We link these observations to well known standard classification systems in social history, such as the Historical International Standard Classification of Occupations (HISCO) and the Amsterdamse Code (AC), which in turn link to DBpedia and GeoNames. The two main contributions of the dataset are the improvement of data integration and access for historical research, and the emergence of new historical data hubs, like classifications of historical religions and historical house types, in the Linked Open Data cloud.
Original languageEnglish
Number of pages13
JournalSemantic Web Journal
Publication statusPublished - 2015

Keywords

  • census data
  • social history
  • linked open data
  • rdf data cube

Fingerprint

Dive into the research topics of 'CEDAR: The Dutch Historical Censuses as Linked Open Data'. Together they form a unique fingerprint.

Cite this