From Napoleon Conquests to the Big Brother Sabotage : Harmonization of the Dutch Historical Censuses in the Semantic Web

Albert Meroño-Peñuela, Ashkan Ashkpour

Onderzoeksoutput: Bijdrage aan conferentieAbstractWetenschappelijk

40 Downloads (Pure)

Samenvatting

Around the turn of the 18th century, the first integral pop- ulation enumeration was held in the Netherlands during the Batavian Republic. It took over 30 years before the first official census was, by royal decree, organized and conducted in 1829, and was meant to be held from then onwards every ten years. The Dutch historical censuses are the only large scale, reliable statistical datasets available about the (demo- graphic, social and economic) history of the Netherlands, covering an all-encompassing geographical area for over two centuries (1795–1971). Not surprisingly, the currently preserved and digitized historical censuses are the most consulted historical statistics by researchers. However, the 2 288 census tables are highly disconnected and scarcely integrated in their current form. Meaningful information is still hidden in these miss- ing table-links, meaning that this wealth of information is not reaped to its full potential. In this paper we describe the lessons learnt in CEDAR5, a project of the Computational Humanities Programme6, to provide so- lutions to these integration problems. Our system leverages semantic technologies and Linked Data practices, which allow us to convert the census tables into a graph of fine-grained Linked Census Data. Using the distributed architecture of the Web, we interlink this graph with other online historical socioeconomic and demographic Linked Datasets. We use the information provided by these external links to guide the harmonization process in our dataset. At the same time, we investigate which historical classifications are not online yet following Web stan- dards, and we use our census tables (on demographic structures, housing types, occupational classes and statuses, and religious denominations) to urge the need of publishing these historical classifications on the Web. Such historical hubs could increase enormously the interoperability of other datasets. Finally, we propose a querying pipeline on the resulting harmonized census dataset to enhance the data exploration work by his- torians and social scientists and help answering their research questions.
Originele taal-2Engels
StatusGepubliceerd - 2014
EvenementDigital Humanities Congress 2014, The University of Sheffield - The University of Sheffield, Sheffield, Verenigd Koninkrijk
Duur: 04 sep. 201406 sep. 2014

Conferentie

ConferentieDigital Humanities Congress 2014, The University of Sheffield
Land/RegioVerenigd Koninkrijk
StadSheffield
Periode04/09/201406/09/2014

Vingerafdruk

Duik in de onderzoeksthema's van 'From Napoleon Conquests to the Big Brother Sabotage : Harmonization of the Dutch Historical Censuses in the Semantic Web'. Samen vormen ze een unieke vingerafdruk.

Citeer dit