A Deep Learning Framework to Reconstruct Face under Mask

Bibliographic Details
Published in 2022 7th International Conference on Data Science and Machine Learning Applications (CDMA), pp. 200-205
Main Authors Modak, Gourango; Das, Shuvra Smaran; Islam Miraj, Md. Ajharul; Morol, Md. Kishor
Format Conference Proceeding
Language English
Published IEEE 01.03.2022
DOI 10.1109/CDMA54072.2022.00038

Abstract While deep learning-based image reconstruction methods have shown significant success in removing objects from pictures, they have yet to achieve acceptable results in preserving consistency of gender, ethnicity, expression, and other characteristics such as the topological structure of the face. The purpose of this work is to extract the mask region from a masked face image and reconstruct the detected area. This problem is complex because (i) it is difficult to determine the gender of a face hidden behind a mask, which can confuse the network into reconstructing a male face as female or vice versa; (ii) images may be captured from multiple angles, making it extremely difficult to maintain the actual shape and topological structure of the face and a natural-looking image; and (iii) masks come in various forms, so in some cases the mask area cannot be predicted precisely and parts of the mask remain on the face after completion. To solve this task, we split the problem into three phases: landmark detection, object detection for the targeted mask area, and inpainting of the detected mask region. First, to address problem (i), we use gender classification to predict the actual gender behind the mask, and then detect the landmarks of the masked facial image. Second, we identify the non-face item, i.e., the mask, and use a Mask R-CNN network to produce a binary segmentation map of the detected mask area. Third, we develop a GAN-based inpainting network that uses the predicted landmarks as structural guidance to generate realistic images. The experiments presented in this paper use the FFHQ and CelebA datasets. The study reports outperforming prior studies on real-world pictures gathered from the web.
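
The role of the binary mask in the final phase can be illustrated with a minimal sketch. It is an assumption about how inpainting pipelines commonly use a segmentation map, not the authors' implementation: the function names `erase_region` and `composite` are hypothetical, images are tiny grayscale nested lists rather than real tensors, and the "generated" image is a stand-in for the output of a GAN-based network.

```python
from typing import List

Image = List[List[float]]  # grayscale pixels, row-major
Mask = List[List[int]]     # 1 = pixel covered by the face mask, 0 = visible face


def erase_region(image: Image, mask: Mask, fill: float = 0.0) -> Image:
    """Blank out the pixels flagged by the binary mask (a typical inpainting input)."""
    return [
        [fill if m == 1 else px for px, m in zip(img_row, mask_row)]
        for img_row, mask_row in zip(image, mask)
    ]


def composite(original: Image, generated: Image, mask: Mask) -> Image:
    """Keep original pixels outside the mask and generated pixels inside it."""
    return [
        [g if m == 1 else o for o, g, m in zip(o_row, g_row, m_row)]
        for o_row, g_row, m_row in zip(original, generated, mask)
    ]


if __name__ == "__main__":
    original = [[0.9, 0.8], [0.2, 0.1]]   # bottom row is hidden by the mask
    mask = [[0, 0], [1, 1]]               # hypothetical segmentation output
    generated = [[0.5, 0.5], [0.6, 0.7]]  # stand-in for a generator's output
    print(erase_region(original, mask))
    print(composite(original, generated, mask))
```

Compositing this way leaves the visible part of the face untouched, so only the region flagged by the segmentation step is ever replaced by generated content.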
Author Das, Shuvra Smaran
Morol, Md. Kishor
Islam Miraj, Md. Ajharul
Modak, Gourango
Author_xml – sequence: 1
  givenname: Gourango
  surname: Modak
  fullname: Modak, Gourango
  email: 18-37102-1@student.aiub.edu
  organization: American International University,Department of Computer Science,Bangladesh
– sequence: 2
  givenname: Shuvra Smaran
  surname: Das
  fullname: Das, Shuvra Smaran
  email: shuvradas59@gmail.com
  organization: American International University,Department of Computer Science,Bangladesh
– sequence: 3
  givenname: Md. Ajharul
  surname: Islam Miraj
  fullname: Islam Miraj, Md. Ajharul
  email: miraj.cs18@gmail.com
  organization: American International University,Department of Computer Science,Bangladesh
– sequence: 4
  givenname: Md. Kishor
  surname: Morol
  fullname: Morol, Md. Kishor
  email: kishor@aiub.edu
  organization: American International University,Department of Computer Science,Bangladesh
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/CDMA54072.2022.00038
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library (IEL) (UW System Shared)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1665410140
9781665410144
EndPage 205
ExternalDocumentID 9736350
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i203t-94a66b3dcec1e9646c6edb48c4ab04d0e62c56cc54a3764106e10dc249e9969a3
IEDL.DBID RIE
IngestDate Thu Jun 29 18:36:59 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i203t-94a66b3dcec1e9646c6edb48c4ab04d0e62c56cc54a3764106e10dc249e9969a3
PageCount 6
ParticipantIDs ieee_primary_9736350
PublicationCentury 2000
PublicationDate 2022-March
PublicationDateYYYYMMDD 2022-03-01
PublicationDate_xml – month: 03
  year: 2022
  text: 2022-March
PublicationDecade 2020
PublicationTitle 2022 7th International Conference on Data Science and Machine Learning Applications (CDMA)
PublicationTitleAbbrev CDMA
PublicationYear 2022
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8784174
SourceID ieee
SourceType Publisher
StartPage 200
SubjectTerms Gender Classification
Generative adversarial networks
Image edge detection
Image segmentation
Inpainting
Landmark
Mask Segmentation
Multiaccess communication
Object detection
Shape
Task analysis
Title A Deep Learning Framework to Reconstruct Face under Mask
URI https://ieeexplore.ieee.org/document/9736350
hasFullText 1
inHoldings 1
isFullTextHit
isPrint