Document Liveness Challenge Dataset (DLC-2021)

Various government and commercial services, including, but not limited to, e-government, fintech, banking, and sharing economy services, widely use smartphones to simplify service access and user authorization. Many organizations involved in these areas use identity document analysis systems in orde...

Full description

Saved in:
Bibliographic Details
Published inJournal of imaging Vol. 8; no. 7; p. 181
Main Authors Polevoy, Dmitry V., Sigareva, Irina V., Ershova, Daria M., Arlazarov, Vladimir V., Nikolaev, Dmitry P., Ming, Zuheng, Luqman, Muhammad Muzzamil, Burie, Jean-Christophe
Format Journal Article
LanguageEnglish
Published Basel MDPI AG 28.06.2022
MDPI
Subjects
Online AccessGet full text
ISSN2313-433X
2313-433X
DOI10.3390/jimaging8070181

Cover

More Information
Summary:Various government and commercial services, including, but not limited to, e-government, fintech, banking, and sharing economy services, widely use smartphones to simplify service access and user authorization. Many organizations involved in these areas use identity document analysis systems in order to improve user personal-data-input processes. The tasks of such systems are not only ID document data recognition and extraction but also fraud prevention by detecting document forgery or by checking whether the document is genuine. Modern systems of this kind are often expected to operate in unconstrained environments. A significant amount of research has been published on the topic of mobile ID document analysis, but the main difficulty for such research is the lack of public datasets due to the fact that the subject is protected by security requirements. In this paper, we present the DLC-2021 dataset, which consists of 1424 video clips captured in a wide range of real-world conditions, focused on tasks relating to ID document forensics. The novelty of the dataset is that it contains shots from video with color laminated mock ID documents, color unlaminated copies, grayscale unlaminated copies, and screen recaptures of the documents. The proposed dataset complies with the GDPR because it contains images of synthetic IDs with generated owner photos and artificial personal information. For the presented dataset, benchmark baselines are provided for tasks such as screen recapture detection and glare detection. The data presented are openly available in Zenodo.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2313-433X
2313-433X
DOI:10.3390/jimaging8070181