Towards entity spaces

Marieke van Erp, Paul Groth

Research output: Chapter in book/volume, Contribution to conference proceedings, Scientific, peer-review

Abstract

Entities are a central element of knowledge bases and an important input to many knowledge-centric tasks, including text analysis. For example, they allow us to find documents relevant to a specific entity irrespective of the underlying syntactic expression within a document. However, the entities that are commonly represented in knowledge bases are often a simplification of what is truly being referred to in text. For example, a knowledge base may have an entity for Germany as a country but not for the fuzzier concept of Germany that covers notions of German Population, German Drivers, and the German Government. Inspired by recent advances in contextual word embeddings, we introduce the concept of entity spaces: specific representations of a set of associated entities with near-identity. These entity spaces thus provide a handle to an amorphous grouping of entities. We developed a proof-of-concept for English showing how the introduction of entity spaces, in the form of disambiguation pages, can improve the recall of entity linking.
Original language: English
Title of host publication: LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
Publisher: European Language Resources Association (ELRA)
Pages: 2129-2137
Number of pages: 9
ISBN (Print): 9791095546344
Publication status: Published - Jun 2020

Publication series

Name: LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings

Keywords

  • Entity
  • Entity linking
  • Identity
  • Knowledge representation
