A Deep Learning Framework to Reconstruct Face under Mask

Bibliographic Details
Published in	2022 7th International Conference on Data Science and Machine Learning Applications (CDMA), pp. 200-205
Main Authors	Modak, Gourango; Das, Shuvra Smaran; Islam Miraj, Md. Ajharul; Morol, Md. Kishor
Format	Conference Proceeding
Language	English
Published	IEEE 01.03.2022
Subjects	Gender Classification; Generative adversarial networks; Image edge detection; Image segmentation; Inpainting; Landmark; Mask Segmentation; Multiaccess communication; Object detection; Shape; Task analysis
DOI	10.1109/CDMA54072.2022.00038

Abstract	While deep learning-based image reconstruction methods have shown significant success in removing objects from pictures, they have yet to achieve acceptable results in preserving consistency of gender, ethnicity, expression, and other characteristics such as the topological structure of the face. The purpose of this work is to extract the mask region from an image of a masked face and reconstruct the detected area. The problem is complex because (i) it is difficult to determine the gender of a face hidden behind a mask, which can confuse the network into reconstructing a male face as female or vice versa; (ii) images may be taken from multiple angles, making it extremely difficult to preserve the actual shape and topological structure of the face and to produce a natural-looking image; and (iii) masks come in various forms, so in some cases the mask area cannot be predicted precisely and parts of the mask remain on the face after completion. To solve this task, we split the problem into three phases: landmark detection, object detection for the targeted mask area, and inpainting of the detected mask region. First, to address problem (i), we use gender classification to predict the actual gender behind the mask, and then detect the landmarks of the masked facial image. Second, we identify the non-face item, i.e., the mask, and use a Mask R-CNN network to produce a binary segmentation map of the detected mask area. Third, we develop a GAN-based inpainting network that uses the predicted landmarks as structural guidance to generate realistic images. The experiments presented in this paper use the FFHQ and CelebA datasets. In our evaluation on real-world pictures gathered from the web, this study outperformed prior studies.
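The three phases named in the abstract can be sketched as a small pipeline. This is only an illustrative outline, not the authors' implementation: the `Stages` container, the function names, the use of nested lists as images, and the final compositing step (keeping original pixels outside the mask) are all assumptions made for this sketch, as is passing the predicted gender to the inpainting stage. The trained networks (gender classifier, landmark detector, Mask R-CNN, GAN generator) are represented by caller-supplied callables.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Image = List[List[float]]          # grayscale H x W, values in [0, 1] (assumption)
BinaryMask = List[List[int]]       # 1 = pixel covered by the face mask
Landmarks = List[Tuple[int, int]]  # (row, col) facial key points


@dataclass
class Stages:
    """Hypothetical stand-ins for the trained networks named in the abstract."""
    classify_gender: Callable[[Image], str]
    detect_landmarks: Callable[[Image], Landmarks]
    segment_mask: Callable[[Image], BinaryMask]   # e.g. a thresholded Mask R-CNN output
    inpaint: Callable[[Image, BinaryMask, Landmarks, str], Image]  # e.g. a GAN generator


def erase_region(image: Image, mask: BinaryMask) -> Image:
    """Zero out masked pixels so the inpainting stage never sees the face mask."""
    return [[0.0 if m else px for px, m in zip(img_row, mask_row)]
            for img_row, mask_row in zip(image, mask)]


def composite(original: Image, generated: Image, mask: BinaryMask) -> Image:
    """Keep original pixels outside the mask; use generated pixels inside it."""
    return [[g if m else o for o, g, m in zip(o_row, g_row, m_row)]
            for o_row, g_row, m_row in zip(original, generated, mask)]


def reconstruct_face(image: Image, stages: Stages) -> Image:
    gender = stages.classify_gender(image)        # phase 1: gender classification
    landmarks = stages.detect_landmarks(image)    # phase 1: landmark detection
    mask = stages.segment_mask(image)             # phase 2: binary mask of the mask area
    erased = erase_region(image, mask)
    generated = stages.inpaint(erased, mask, landmarks, gender)  # phase 3: inpainting
    return composite(image, generated, mask)


# Toy usage with dummy stages on a 2 x 2 image whose bottom row is "masked".
toy_stages = Stages(
    classify_gender=lambda img: "unknown",
    detect_landmarks=lambda img: [(1, 1)],
    segment_mask=lambda img: [[0, 0], [1, 1]],
    inpaint=lambda img, mask, lms, gender: [[0.5, 0.5], [0.5, 0.5]],
)
result = reconstruct_face([[0.1, 0.2], [0.9, 0.9]], toy_stages)
```

In the toy run, the top row of `result` comes from the input and the bottom row from the dummy generator. A real implementation would use array types and trained models in place of the lambdas.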
Author details	1. Modak, Gourango (18-37102-1@student.aiub.edu); 2. Das, Shuvra Smaran (shuvradas59@gmail.com); 3. Islam Miraj, Md. Ajharul (miraj.cs18@gmail.com); 4. Morol, Md. Kishor (kishor@aiub.edu). Affiliation (all authors): American International University, Department of Computer Science, Bangladesh
CODEN	IEEPAD
EISBN	1665410140 (ISBN-10); 9781665410144 (ISBN-13)
PageCount	6
URI	https://ieeexplore.ieee.org/document/9736350