Flexible Metadata Schemes for Research Data Repositories. The Common Framework in Dataverse and the CMDI Use Case

Research output: Chapter in book/volumeContribution to conference proceedingsScientificpeer-review

298 Downloads (Pure)


In this paper we present an approach called Common Framework, which addresses issues of interoperability and flexibility of metadata schemes as developed by specific scientific communities, and as later supported by domain and cross-domain data repositories. The approach was triggered by a very concrete use case, namely the question how to expose Component Metadata Infrastructure (CMDI) metadata, stored in computational linguistics datasets in the DANS EASY archive, for discovery services. The work in CLARIN to push further for the development of CMDI into a standard (ISO 24622-1:2015, ISO 24622-2:2019) forms part of the background of the use case. We used the Dataverse platform to deliver proof of concepts for various elements of the Common Framework, including the recommendation of standardised elements for
Dataverse instances in CLARIN. At the core of the Common Framework is a design which envisions an interaction between different microservices, possibly also hosted by various service providers. Mechanisms of semantic mapping are used throughout a pipeline which starts at a set of existing metadata standards and values at a digital research data repository (Extraction) and their analysis. This leads to an alignment of these metadata standards with others standards (Transformation) and proposes enrichments to be used by other service providers but also to be imported back to the original source (Load). Some modules applied along this pipeline are discussed in detail, together with the challenges this specific use case entails. At the same time, we
also stress generic aspects, as we are convinced that this approach can also be applied in other settings, other archival platforms and other domain specific metadata schemes. The high-level goal of this exploration is to explore ways to make research data collections FAIR (Findable, Accessible, Interoperable and Re-usable), and in particular interoperable and re-usable, while preserving the rigour of domain specific indexing practices.
Original languageEnglish
Title of host publicationSelected Papers from the CLARIN Annual Conference 2021
Subtitle of host publicationVirtual Event, 2021, 27–29 September
EditorsMonica Monachini, Maria Eskevich
PublisherLinköping University Electronic Press, Linköpings universitet
Number of pages13
ISBN (Print)978-91-7929-444-1
Publication statusPublished - 08 Jul 2022
EventCLARIN Annual Conference 2021: (Virtual event) - Virtual
Duration: 27 Sept 202129 Sept 2021

Publication series

NameLinköping Electronic Conference Proceedings
ISSN (Print)1650-3686
ISSN (Electronic)1650-3740


ConferenceCLARIN Annual Conference 2021
Internet address


Dive into the research topics of 'Flexible Metadata Schemes for Research Data Repositories. The Common Framework in Dataverse and the CMDI Use Case'. Together they form a unique fingerprint.

Cite this