LAF-Fabric: a data analysis tool for Linguistic Annotation Framework with an application to the Hebrew Bible

Dirk Roorda, Gino Kalkman, Martijn Naaijer, Andreas van Cranenburgh

Research output: Contribution to journal/periodicalArticleScientificpeer-review

34 Downloads (Pure)

Abstract

The Linguistic Annotation Framework (LAF) provides a general, extensible stand-off markup system for corpora. This paper discusses LAF-Fabric, a new tool to analyse LAF resources in general with an extension to process the Hebrew Bible in particular. We first walk through the history of the Hebrew Bible as text database in decennium-wide steps. Then we describe how LAF-Fabric may serve as an analysis tool for this corpus. Finally, we describe three analytic projects/workflows that benefit from the new LAF representation: 1) the study of linguistic variation: extract cooccurrence data of common nouns between the books of the Bible (Martijn Naaijer); 2) the study of the grammar of Hebrew poetry in the Psalms: extract clause typology (Gino Kalkman); 3) construction of a parser of classical Hebrew by Data Oriented Parsing: generate tree structures from the database (Andreas van Cranenburgh).
Original languageEnglish
Pages (from-to)105-120
Number of pages16
JournalComputational Linguistics in the Netherlands Journal
Volume4
Publication statusPublished - 20 Dec 2014

Keywords

  • cs.CL

Fingerprint Dive into the research topics of 'LAF-Fabric: a data analysis tool for Linguistic Annotation Framework with an application to the Hebrew Bible'. Together they form a unique fingerprint.

  • SHEBANQ: System for HEBrew Text: ANnotations for Queries and Markup

    Roorda, D., Glanz, O., van den Berg, H. & Van de Schraaf, H., 01 Aug 2014

    Research output: Non-textual formWebsiteScientific

    Open Access
  • Cite this