Laypa: A Novel Framework for Applying Segmentation Networks to Historical Documents

Research output: Contribution to conference › Paper › Scientific › peer-review

Abstract

We present novel software to process scans of historical documents to extract their layout information. We do this using a ResNet backbone with a feature pyramid head. We extract region information directly into PageXML. For baseline extraction, we use a two-stage processing approach. The software has been applied successfully to several projects. The results show the feasibility of automatically labeling text lines and regions in historical documents.
Original language: English
Pages: 67-72
Number of pages: 6
Publication status: Published - 25 Aug 2023
Event: HIP '23: Proceedings of the 7th International Workshop on Historical Document Imaging and Processing - San José, California, United States
Duration: 25 Aug 2023 → 26 Aug 2023
https://dl.acm.org/doi/proceedings/10.1145/3604951

Conference

Conference: HIP '23
Abbreviated title: HIP '23
Country/Territory: United States
City: San José
Period: 25/08/2023 → 26/08/2023

Keywords

  • Datasets
  • historical documents
  • Neural networks
  • layout analysis
