D4.2 : Interrogation and annotation of plurilingual corpora for discourse analysis (V1.0 )

Rocco Tripodi, Eleanora Marzi, Arianna Graciotti, Valeria Zotti, Antonella Luporini, Monica Turci, Ana Pano Alaman, Peter Van Kranenburg, René van Horik, Enrico Daga, Marilena Daquino

Research output: Book/ReportReportScientific

Abstract

The deliverable reports on the annotation and interrogation of the Polifonia Textual Corpus, the plurilingual diachronic corpus focused on Musical Heritage (MH) covering Italian, English, French, Spanish and Dutch. Natural Language Processing (NLP) techniques were used to process the corpus and produce automatic morphosyntactic, semantic and MH-specific annotations. Custom APIs have been developed and released to enable domain experts, scholars and music professionals to leverage the annotations produced to perform advanced structured queries on the corpus. The available interrogation capabilities overcome the basic keyword-based search, offering the possibility of querying the corpus by taking advantage of the advanced semantic and MH-specific information encoded in the annotation.
Original languageEnglish
PublisherZenodo
Number of pages47
DOIs
Publication statusPublished - 01 Sept 2022

Fingerprint

Dive into the research topics of 'D4.2 : Interrogation and annotation of plurilingual corpora for discourse analysis (V1.0 )'. Together they form a unique fingerprint.

Cite this