Northeastern Neo-Aramaic Text-Fabric Corpus

Dataset

Description

Northeastern Neo-Aramaic is a highly diverse group of Aramaic dialects that were spoken until modern times by Christian and Jewish communities in northern Iraq, northwestern Iran and southeastern Turkey. These dialects are among the last living vestiges of Aramaic, which was one of the major languages of the region in antiquity.

This text corpus consists of transcribed and recorded texts gathered by Prof. Geoffrey Khan and his team in their efforts to preserve these increasingly endangered languages. The corpus is encoded in Text-Fabric format to aid ongoing linguistic research. Linguistic annotations will be added over the course of the project.
Date made available: 03 Apr 2020
Publisher: Zenodo
Date of data production: 20 Jun 2019 - 03 Apr 2020

Keywords

  • aramaic
  • neo-aramaic
  • text processing

Dataset type

  • Processed data
