A Deep Learning Framework to Reconstruct Face under Mask
While deep learning-based image reconstruction methods have shown significant success in removing objects from pictures, they have yet to achieve acceptable results for attributing consistency to gender, ethnicity, expression, and other characteristics like the topological structure of the face. The...
Saved in:
Published in | 2022 7th International Conference on Data Science and Machine Learning Applications (CDMA) pp. 200 - 205 |
---|---|
Main Authors | , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.03.2022
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/CDMA54072.2022.00038 |
Cover
Abstract | While deep learning-based image reconstruction methods have shown significant success in removing objects from pictures, they have yet to achieve acceptable results for attributing consistency to gender, ethnicity, expression, and other characteristics like the topological structure of the face. The purpose of this work is to extract the mask region from a masked image and rebuild the area that has been detected. This problem is complex because (i) it is difficult to determine the gender of an image hidden behind a mask, which causes the network to become confused and reconstruct the male face as a female or vice versa; (ii) we may receive images from multiple angles, making it extremely difficult to maintain the actual shape, topological structure of the face and a natural image; and (iii) there are problems with various mask forms because, in some cases, the area of the mask cannot be anticipated precisely; certain parts of the mask remain on the face after completion. To solve this complex task, we split the problem into three phases: landmark detection, object detection for the targeted mask area, and inpainting the addressed mask region. To begin, to solve the first problem, we have used gender classification, which detects the actual gender behind a mask, then we detect the landmark of the masked facial image. Second, we identified the non-face item, i.e., the mask, and used the Mask R-CNN network to create the binary mask of the observed mask area. Thirdly, we developed an inpainting network that uses anticipated landmarks to create realistic images. To segment the mask, this article uses a mask R-CNN and offers a binary segmentation map for identifying the mask area. Additionally, we generated the image utilizing landmarks as structural guidance through a GAN-based network. The studies presented in this paper use the FFHQ and CelebA datasets. This study outperformed all prior studies in terms of generating cutting-edge results for real-world pictures gathered from the web. |
---|---|
AbstractList | While deep learning-based image reconstruction methods have shown significant success in removing objects from pictures, they have yet to achieve acceptable results for attributing consistency to gender, ethnicity, expression, and other characteristics like the topological structure of the face. The purpose of this work is to extract the mask region from a masked image and rebuild the area that has been detected. This problem is complex because (i) it is difficult to determine the gender of an image hidden behind a mask, which causes the network to become confused and reconstruct the male face as a female or vice versa; (ii) we may receive images from multiple angles, making it extremely difficult to maintain the actual shape, topological structure of the face and a natural image; and (iii) there are problems with various mask forms because, in some cases, the area of the mask cannot be anticipated precisely; certain parts of the mask remain on the face after completion. To solve this complex task, we split the problem into three phases: landmark detection, object detection for the targeted mask area, and inpainting the addressed mask region. To begin, to solve the first problem, we have used gender classification, which detects the actual gender behind a mask, then we detect the landmark of the masked facial image. Second, we identified the non-face item, i.e., the mask, and used the Mask R-CNN network to create the binary mask of the observed mask area. Thirdly, we developed an inpainting network that uses anticipated landmarks to create realistic images. To segment the mask, this article uses a mask R-CNN and offers a binary segmentation map for identifying the mask area. Additionally, we generated the image utilizing landmarks as structural guidance through a GAN-based network. The studies presented in this paper use the FFHQ and CelebA datasets. This study outperformed all prior studies in terms of generating cutting-edge results for real-world pictures gathered from the web. |
Author | Das, Shuvra Smaran Morol, Md. Kishor Islam Miraj, Md. Ajharul Modak, Gourango |
Author_xml | – sequence: 1 givenname: Gourango surname: Modak fullname: Modak, Gourango email: 18-37102-1@student.aiub.edu organization: American International University,Department of Computer Science,Bangladesh – sequence: 2 givenname: Shuvra Smaran surname: Das fullname: Das, Shuvra Smaran email: shuvradas59@gmail.com organization: American International University,Department of Computer Science,Bangladesh – sequence: 3 givenname: Md. Ajharul surname: Islam Miraj fullname: Islam Miraj, Md. Ajharul email: miraj.cs18@gmail.com organization: American International University,Department of Computer Science,Bangladesh – sequence: 4 givenname: Md. Kishor surname: Morol fullname: Morol, Md. Kishor email: kishor@aiub.edu organization: American International University,Department of Computer Science,Bangladesh |
BookMark | eNotzs1Kw0AUQOERdGGrT6CLeYHEO383mWVIjQopgui6TO5cJdROyiRFfHsFXZ3dx1mJ8zQlFuJWQakU-Lt2s22chUqXGrQuAcDUZ2KlEJ1VoCxcirqRG-aj7DnkNKYP2eVw4K8p7-UyyRemKc1LPtEiu0AsTylyltsw76_ExXv4nPn6v2vx1t2_to9F__zw1DZ9MWowS-FtQBxMJCbFHi0SchxsTTYMYCMwanJI5GwwFf5eISuIpK1n79EHsxY3f-7IzLtjHg8hf-98ZdA4MD-sxkKy |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/CDMA54072.2022.00038 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library (IEL) (UW System Shared) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 1665410140 9781665410144 |
EndPage | 205 |
ExternalDocumentID | 9736350 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i203t-94a66b3dcec1e9646c6edb48c4ab04d0e62c56cc54a3764106e10dc249e9969a3 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:36:59 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i203t-94a66b3dcec1e9646c6edb48c4ab04d0e62c56cc54a3764106e10dc249e9969a3 |
PageCount | 6 |
ParticipantIDs | ieee_primary_9736350 |
PublicationCentury | 2000 |
PublicationDate | 2022-March |
PublicationDateYYYYMMDD | 2022-03-01 |
PublicationDate_xml | – month: 03 year: 2022 text: 2022-March |
PublicationDecade | 2020 |
PublicationTitle | 2022 7th International Conference on Data Science and Machine Learning Applications (CDMA) |
PublicationTitleAbbrev | CDMA |
PublicationYear | 2022 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.8784174 |
Snippet | While deep learning-based image reconstruction methods have shown significant success in removing objects from pictures, they have yet to achieve acceptable... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 200 |
SubjectTerms | Gender Classification Generative adversarial networks Image edge detection Image segmentation Inpainting Landmark Mask Segmentation Multiaccess communication Object detection Shape Task analysis |
Title | A Deep Learning Framework to Reconstruct Face under Mask |
URI | https://ieeexplore.ieee.org/document/9736350 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NSwMxEB1qT55UWvGbHDyaNrvJZptjaV2KsOLBQm8lH7MihW3R7aW_3mR3W0U8eAuBkEwCmbzMvDcA9_4CFIynjPrXcUoFOkaNxISyFAuVKBPbIgDF_FnO5uJpkSw68HDgwiBinXyGg9CsY_lubbfhq2yoUu79owfoR2mqGq5Wy4aLmBpOpvk4yMkFelVcy3AG0smPmim1y8hOIN9P1mSKrAbbygzs7pcO439Xcwr9b3IeeTm4nTPoYNmD0ZhMETeklUt9I9k-6YpUaxIwZqsUSzLtxwfm2AfJ9eeqD_Ps8XUyo21RBPoeM15RJbSUhjuLNkIlhbQSnREjK7RhwjGUsU2ktYnQ_u4QHvFhxJz1KAs9tFGan0O3XJd4AYSPTAiTOUxC-DMutHQem4YSU6JQHMUl9ILVy02je7FsDb76u_sajsO-N_lZN9D1RuGtd9iVuatP6gs_GJWr |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LTwIxEG4IHvSkBoxve_BooWwfS48E3KCyxAMk3Egfs8aQAMHl4q-33V3QGA_emiZNO2ky068z3zcI3XsHyCmLKfGv45hwcJQYCYLQGDIllIlsFoBiOpbDKX-eiVkNPey5MABQFJ9BKwyLXL5b2W34KmurmPn46AH6gfCoIi7ZWhUfrkNVuz9Ie0FQLhCsokKIM9BOfnRNKYJGcozS3XZlrciitc1Ny37-UmL873lOUPObnodf94HnFNVg2UDdHh4ArHElmPqGk13ZFc5XOKDMSisWJ9qvD9yxDU71x6KJpsnjpD8kVVsE8h5RlhPFtZSGOQu2A0pyaSU4w7uWa0O5oyAjK6S1gmvvPbjHfNChznqcBR7cKM3OUH25WsI5wqxrQqLMgQgJ0CjT0nl0GppM8Uwx4BeoEayer0vli3ll8OXf03focDhJR_PR0_jlCh2FOyirta5R3RsINz585-a2uLUvUAyY_A |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2022+7th+International+Conference+on+Data+Science+and+Machine+Learning+Applications+%28CDMA%29&rft.atitle=A+Deep+Learning+Framework+to+Reconstruct+Face+under+Mask&rft.au=Modak%2C+Gourango&rft.au=Das%2C+Shuvra+Smaran&rft.au=Islam+Miraj%2C+Md.+Ajharul&rft.au=Morol%2C+Md.+Kishor&rft.date=2022-03-01&rft.pub=IEEE&rft.spage=200&rft.epage=205&rft_id=info:doi/10.1109%2FCDMA54072.2022.00038&rft.externalDocID=9736350 |