| Literature DB >> 35875649 |
Saravana Balaji Balasubramanian1, Jagadeesh Kannan R2, Prabu P3, Venkatachalam K4, Pavel Trojovský5.
Abstract
In the recent research era, artificial intelligence techniques have been used for computer vision, big data analysis, and detection systems. The development of these advanced technologies has also increased security and privacy issues. One kind of this issue is Deepfakes which is the combined word of deep learning and fake. DeepFake refers to the formation of a fake image or video using artificial intelligence approaches which are created for political abuse, fake data transfer, and pornography. This paper has developed a Deepfake detection method by examining the computer vision features of the digital content. The computer vision features based on the frame change are extracted using a proposed deep learning model called the Cascaded Deep Sparse Auto Encoder (CDSAE) trained by temporal CNN. The detection process is performed using a Deep Neural Network (DNN) to classify the deep fake image/video from the real image/video. The proposed model is implemented using Face2Face, FaceSwap, and DFDC datasets which have secured an improved detection rate when compared to the traditional deep fake detection approaches. ©2022 Balasubramanian et al.Entities:
Keywords: DNN; Deep fake detection; Deep learning; Deep sparse Auto encoder; Face2Face; FaceSwap; Faceforensics++; Temporal Convolutional neural network
Year: 2022 PMID: 35875649 PMCID: PMC9299276 DOI: 10.7717/peerj-cs.1040
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
Presents survey in detail of Deepfake detection.
| Article | Dataset | Detection method | Research focus | Algorithm used |
|---|---|---|---|---|
|
| Multiple website video | FaceSwap detection model | Frames and inconsistency region | LSTM, CNN |
|
| celebA-HQ | Generalization | Deep preprocessing | DCGAN, WCGAN, PG-GAN |
|
| FF++ | DeepFake, face2face, faceswap | Temporal discrepancies | RNN with CNN |
|
| FF, deepFake online, REPLAY-ATTACK DB. | Capsule in forensic application | Replay attacks, computer generated pictures | VGG, capsules |
|
| CelebA, GAN-deepfake | Deepfake images | Pairewise learning | CNN concatenated with CFFA |
|
| TPGAN,Style GAN | Deepfake images | Self training | Resnet, efficientNet |
|
| CelebA, PGGAN, FF, DF | Deepfake images | Fine tune transformer | ResNetr, Xception, Squeeze Net. |
|
| FF++ dataset | Deepfake images | Steganalysis feature | LSTM,XceptionNet |
|
| CelebA, GAN-deepfake | Deepfake tool | Adversarial perturbations | ResNet, VGG |
|
| FFHQ, CelebAHQ | Deepfake images | Texture analysis | Resnet |
|
| FF++ | Deepfake images | Variational autoencoder | Fake detection model |
|
| FF++, DFD | Videos deepfake | Spatial images and temporal images | Convolutional LSTM model |
|
| FF++,UADFV, CelebA-DF | Deepfake detection | Unsupervised learning | Xception, SVM, Bayes classification. |
Figure 1Proposed deepfake detection using computer vision features.
Figure 2DSAE structure.
Figure 3Proposed cascaded DSAE with TCNN-architecture.
Figure 4Temporal-CNN based optimization.
Extracted computer vision features using proposed CDSAE-TCNN.
| No. of feature | Feature name | Description |
|---|---|---|
| 1 | MSE | Mean square error is the average variance between actual and estimated values. |
| 2 | PSNR | Peak signal to noise ratio is the ratio between maximum signal power and corrupted noise. |
| 3 | SSIM | Structural similarity index measure is the quality of cinematic and television pictures. |
| 4 | RGB | The percentage of image red, green and blue color value. |
| 5 | HSV | The percentage of image hue, saturation and value. |
| 6 | Histogram | Based on image brightness, it plots the no. of pixels in the image or frames. |
| 7 | Luminance | Total image brightness mean value. |
| 8 | Variance | Variance of image. |
| 9 | Edge-Density | Ration between edge pixel and total pixel of image. |
| 10 | DCT | Discrete Cosine transform: Image DCT bias value. |
Deepfake video image datasets used for proposed system.
| Database | Total count of videos | Real video | No. of subjects | Fake video | Manipulation tool |
|---|---|---|---|---|---|
| FaceForensics++ | 3,000 | 1,000 | – | 5,000 | FaceSwap |
| DFDC | 128,124 | 23,654 | 3,426 | 10,4500 | DeepFake |
Proposed model performance to choose the hyperparameters.
| Optimizer | No. of hidden layers | Loss | Accuracy |
|---|---|---|---|
| Stochastic Gradient Descent (SGD) | 3 | 0.4561 | 73.2 |
| 5 | 0.4281 | 76.9 | |
| 8 | 0.3971 | 79.8 | |
| AdaGrad | 3 | 0.5612 | 71.8 |
| 5 | 0.5109 | 78.4 | |
| 8 | 0.4713 | 81.2 | |
| Adam | 3 | 0.1329 | 95.3 |
| 5 | 0.0139 | 98.7 | |
| 8 | 0.1022 | 96.3 |
Deepfake detection using the proposed model–performance comparison.
| Methods | Datasets | ||
|---|---|---|---|
| Face2Face | FaceSwap | DFDC | |
| Proposed CDSAE-DNN | 98.7 | 98.5 | 97.63 |
| ResNet | 93.6 | 92.4 | 81.8 |
| MobileNet | 95.2 | 94.8 | 78.5 |
| SVM | 86.5 | 83.4 | 71.02 |
Performance of proposed vs similar deepfake detection systems.
| Methods | Decentralized structure | Data integrity | Security | Separated storage | Transparency | Data sustainability |
|---|---|---|---|---|---|---|
| Proposed CDSAE-DNN | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| ResNet | ✓ | ✓ | ✓ | × | ✓ | ✓ |
| MobileNet | ✓ | ✓ | ✓ | ✓ | ✓ | × |
| SVM | ✓ | × | ✓ | ✓ | ✓ | × |
Figure 5Sensitivity comparison of deepfake detection methods.
Figure 6Specificity comparison of deepfake detection methods.
Figure 7Computation time comparison.
Figure 8AUC comparison of proposed vs similar deepfake detection systems.
Figure 9Error detection rate comparison of proposed vs similar deepfake detection systems.