TY - CHAP
T1 - Towards style-based dating of historical documents
AU - Burgers, J.W.J.
AU - Samara, Petros
AU - Schomaker, L.R.B.
AU - He, S.
PY - 2014
Y1 - 2014
N2 - Estimating the date of undated medieval manuscripts from the script they contain, using document image analysis, is helpful for scholars of various disciplines studying the Middle Ages. However, there are as yet no systems that automatically and effectively infer the age of historical scripts using machine learning methods. Building a system to date medieval documents is challenging in several respects: 1) as yet, no suitable reference dataset of medieval handwriting exists; 2) relatively little is known about the evolution of writing styles in the Middle Ages, especially the later Middle Ages. Our Medieval Paleographic Scale (MPS) project aims to solve these problems. We have collected a corpus of charters from the medieval Dutch language area, dating from the period 1300 to 1550. A global and local regression method is proposed for learning and estimating the year in which these documents were written, using several features that have been used successfully in writer identification. The proposed system can serve as a basic tool for the medievalist or paleographer. Experimental results demonstrate the effectiveness of the proposed method.
KW - global and local regression
KW - Medieval Paleography
KW - historical document dating
M3 - Contribution to conference proceedings
T3 - 14th International Conference on Frontiers in Handwriting Recognition (ICFHR), Crete, Greece, 2014
SP - 265
EP - 270
BT - 2014 14th International Conference on Frontiers in Handwriting Recognition
PB - Institute of Electrical and Electronics Engineers (IEEE)
ER -