Good Applications for Crummy Entity Linkers? The Case of Corpus Selection in Digital Humanities

Alex Olieman, Kaspar Beelen, Milan Lange, van, Jaap Kamps, Maarten Marx

Onderzoeksoutput: Bijdrage aan conferentiePaperWetenschappelijkpeer review

Samenvatting

Over the last decade we have made great progress in entity linking (EL) systems, but performance may vary depending on the context and, arguably, there are even principled limitations prevent a "perfect" EL system. This also suggests that there may be applications for which current "imperfect" EL is already very useful, and makes finding the "right" application as important as building the "right" EL system.

We investigate the Digital Humanities use case, where scholars spend a considerable amount of time selecting relevant source texts. We developed WideNet; a semantically-enhanced search tool which leverages the strengths of (imperfect) EL without getting in the way of its expert users. We evaluate this tool in two historical case-studies aiming to collect a set of references to historical periods in parliamentary debates from the last two decades; the first targeted the Dutch Golden Age, and the second World War II.

The case-studies conclude with a critical reflection on the utility of WideNet for this kind of research, after which we outline how such a real-world application can help to improve EL technology in general.
Originele taal-2Engels
Aantal pagina's8
StatusGepubliceerd - 2017
EvenementSEMANTiCS 2017 - De Meervaart, Amsterdam, Nederland
Duur: 11 sep. 201714 sep. 2017
https://2017.semantics.cc/

Conferentie

ConferentieSEMANTiCS 2017
Land/RegioNederland
StadAmsterdam
Periode11/09/201714/09/2017
AnderSEMANTiCS 2017 is an international event on Linked Data and the Semantic Web where business users, vendors and academia meet. Widely recognized to be of pivotal importance, it is the thirteenth edition of a well-attended yearly conference that started back in 2005. It offers keynotes by world-class practitioners, presentations and field reports in diverse tracks, talks addressing a variety of topics, and panel discussions. And, of course, ample opportunities for networking and meeting like-minded professionals in an informal setting.
Internet adres

Vingerafdruk

Duik in de onderzoeksthema's van 'Good Applications for Crummy Entity Linkers? The Case of Corpus Selection in Digital Humanities'. Samen vormen ze een unieke vingerafdruk.

Citeer dit