Writer identification for historical handwritten documents using a single feature extraction method

With the growth of artificial intelligence techniques the problem of writer identification from historical documents has gained increased interest. It consists on knowing the identity of writers of these documents. This paper introduces our baseline system for writer identification, tested on a larg...

Full description

Saved in:
Bibliographic Details
Published in2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA) pp. 1 - 6
Main Authors Chammas, Michel, Makhoul, Abdallah, Demerjian, Jacques
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2020
Subjects
Online AccessGet full text
DOI10.1109/ICMLA51294.2020.00010

Cover

More Information
Summary:With the growth of artificial intelligence techniques the problem of writer identification from historical documents has gained increased interest. It consists on knowing the identity of writers of these documents. This paper introduces our baseline system for writer identification, tested on a large dataset of latin historical manuscripts used in the ICDAR 2019 competition. The proposed system yielded the best results using Scale Invariant Feature Transform (SIFT) as a single feature extraction method, without any preprocessing stage. The system was compared against four teams who participated in the competition with different feature extraction methods: SRS-LBP, SIFT, Pathlet, Hinge, Co-Hinge, QuadHinge, Quill, TCC and oBIFs. An unsupervised learning system was implemented, where a deep Convolutional Neural Network (CNN) was trained using patches extracted from SIFT descriptors. Then the results were encoded using a multi - Vector of Locally Aggregated Descriptors (VLAD) and applied an Exemplar Support Vector Machine (E-SVM) at the end to compare the results. Our system achieved best performance using a single feature extraction method with 91.2% mean Average Precision (mAP) and 97.0% accuracy.
DOI:10.1109/ICMLA51294.2020.00010