Eff-YNet: A Dual Task Network for DeepFake Detection and Segmentation

Advances in generative models and manipulation techniques have given rise to digitally altered videos known as deepfakes. These videos are difficult to identify for both humans and machines. Modern detection methods exploit various weaknesses in deepfake videos, such as visual artifacts and inconsis...

Full description

Saved in:
Bibliographic Details
Published in2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM) pp. 1 - 8
Main Authors Tjon, Eric, Moh, Melody, Moh, Teng-Sheng
Format Conference Proceeding
LanguageEnglish
Published IEEE 04.01.2021
Subjects
Online AccessGet full text
DOI10.1109/IMCOM51814.2021.9377373

Cover

More Information
Summary:Advances in generative models and manipulation techniques have given rise to digitally altered videos known as deepfakes. These videos are difficult to identify for both humans and machines. Modern detection methods exploit various weaknesses in deepfake videos, such as visual artifacts and inconsistent posing. In this paper, we describe a novel architecture called Eff-YNet designed to detect visual differences between altered and unaltered areas. The architecture combines an EfficientNet encoder and a U-Net with a classification branch into a model capable of both classifying and segmenting deepfake videos. The task of segmentation helps train the classifier and also produces useful segmentation masks. We also implement ResNet 3D to detect spatiotemporal inconsistencies. To test these models, we run experiments against the Deepfake Detection Challenge dataset and show improvements over baseline classification models. Furthermore, we find that an ensemble of these two approaches improves performance over a single approach alone.
DOI:10.1109/IMCOM51814.2021.9377373