Abstract
Stand-off markup is widely regarded as a possible way to overcome the limitations of inline XML markup, chiefly its difficulty in handling multiple overlapping hierarchies. Building on previous contributions on the subject and on existing implementations of stand-off markup, we propose a new TEI-based encoding model that still uses the regular TEI elements, but in a stand-off manner. Our lightweight notation moves the bulk of the markup into a separate <standoff> element, grouping layers of related textual features encoded via existing TEI elements (e.g. <name> or <corr>) into individual <stf> elements; it also provides a schema for referencing the transcription through the xml:id attribute. The approach is illustrated through a variety of examples. Our proof-of-concept transformation package works directly on the stand-off markup, with no need to reduce it back to inline TEI for parsing, querying, and visualizing.
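The layering idea described above can be sketched roughly as follows. This is only an illustrative sketch, not the authors' transformation package or their exact notation: the element names <standoff>, <stf>, <name>, <corr> and the use of xml:id come from the abstract, whereas the <doc> wrapper, the <w> token elements, the `type` and `target` attributes, and the function `resolve_standoff` are assumptions made here for illustration.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:id under the reserved XML namespace.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Hypothetical sample: the <w> tokens and the `type`/`target` attributes
# are assumptions, not the notation defined in the article.
SAMPLE = """
<doc>
  <text>
    <w xml:id="w1">Geoffrey</w>
    <w xml:id="w2">Chaucer</w>
    <w xml:id="w3">wrot</w>
    <w xml:id="w4">tales</w>
  </text>
  <standoff>
    <stf type="names">
      <name target="#w1 #w2"/>
    </stf>
    <stf type="corrections">
      <corr target="#w3">wrote</corr>
    </stf>
  </standoff>
</doc>
"""


def resolve_standoff(xml_source):
    """Return (layer, element, covered text) for each stand-off annotation."""
    root = ET.fromstring(xml_source)

    # Index the transcription by xml:id so annotations can point into it.
    tokens = {
        el.get(XML_ID): el.text
        for el in root.find("text").iter()
        if el.get(XML_ID) is not None
    }

    results = []
    # Each <stf> groups one layer of related annotations.
    for layer in root.iter("stf"):
        for annotation in layer:
            ids = [ref.lstrip("#") for ref in annotation.get("target", "").split()]
            covered = " ".join(tokens[i] for i in ids)
            results.append((layer.get("type"), annotation.tag, covered))
    return results


if __name__ == "__main__":
    for layer, tag, covered in resolve_standoff(SAMPLE):
        print(layer, tag, covered)
```

The annotations are resolved by looking up identifiers in the transcription, so the stand-off layers are queried in place and never merged back into inline markup.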
Original language | English |
---|---|
Pages (from-to) | 225-239 |
Journal | Digital Philology: A Journal of Medieval Cultures |
Volume | 8 |
Issue number | 2 |
Publication status | Published - 2019 |
Keywords
- XML
- standoff
- markup
- overlap