DeiTFake: Deepfake Detection Model using DeiT Multi-Stage Training
Authors: Saksham Kumar, Ashish Singh, Srinivasarao Thota, Sunil Kumar Singh, Chandan Kumar
Published: 2025-11-15 05:55:09+00:00
AI Summary
The paper introduces DeiTFake, a deepfake detection model leveraging a DeiT-based transformer and a novel two-stage progressive training strategy. This curriculum learning approach applies initial transfer learning with standard augmentations, followed by fine-tuning using advanced affine and deepfake-specific augmentations to boost robustness. DeiTFake achieved 99.22% accuracy and 0.9997 AUROC on the OpenForensics dataset, setting a new state-of-the-art benchmark.
Abstract
Deepfakes are major threats to the integrity of digital media. We propose DeiTFake, a DeiT-based transformer and a novel two-stage progressive training strategy with increasing augmentation complexity. The approach applies an initial transfer-learning phase with standard augmentations followed by a fine-tuning phase using advanced affine and deepfake-specific augmentations. DeiT's knowledge distillation model captures subtle manipulation artifacts, increasing robustness of the detection model. Trained on the OpenForensics dataset (190,335 images), DeiTFake achieves 98.71\\% accuracy after stage one and 99.22\\% accuracy with an AUROC of 0.9997, after stage two, outperforming the latest OpenForensics baselines. We analyze augmentation impact and training schedules, and provide practical benchmarks for facial deepfake detection.