From Napoleon Conquests to the Big Brother Sabotage : Harmonization of the Dutch Historical Censuses in the Semantic Web

Albert Meroño-Peñuela, Ashkan Ashkpour

Research output: Contribution to conferenceAbstractScientific

16 Downloads (Pure)

Abstract

Around the turn of the 18th century, the first integral pop- ulation enumeration was held in the Netherlands during the Batavian Republic. It took over 30 years before the first official census was, by royal decree, organized and conducted in 1829, and was meant to be held from then onwards every ten years. The Dutch historical censuses are the only large scale, reliable statistical datasets available about the (demo- graphic, social and economic) history of the Netherlands, covering an all-encompassing geographical area for over two centuries (1795–1971). Not surprisingly, the currently preserved and digitized historical censuses are the most consulted historical statistics by researchers. However, the 2 288 census tables are highly disconnected and scarcely integrated in their current form. Meaningful information is still hidden in these miss- ing table-links, meaning that this wealth of information is not reaped to its full potential. In this paper we describe the lessons learnt in CEDAR5, a project of the Computational Humanities Programme6, to provide so- lutions to these integration problems. Our system leverages semantic technologies and Linked Data practices, which allow us to convert the census tables into a graph of fine-grained Linked Census Data. Using the distributed architecture of the Web, we interlink this graph with other online historical socioeconomic and demographic Linked Datasets. We use the information provided by these external links to guide the harmonization process in our dataset. At the same time, we investigate which historical classifications are not online yet following Web stan- dards, and we use our census tables (on demographic structures, housing types, occupational classes and statuses, and religious denominations) to urge the need of publishing these historical classifications on the Web. Such historical hubs could increase enormously the interoperability of other datasets. Finally, we propose a querying pipeline on the resulting harmonized census dataset to enhance the data exploration work by his- torians and social scientists and help answering their research questions.
Original languageEnglish
Publication statusPublished - 2014

Keywords

  • dutch history
  • census data
  • linked data

Fingerprint Dive into the research topics of 'From Napoleon Conquests to the Big Brother Sabotage : Harmonization of the Dutch Historical Censuses in the Semantic Web'. Together they form a unique fingerprint.

  • Cite this