Submitted:
10 June 2026
Posted:
11 June 2026
You are already at the latest version
Abstract
Keywords:
1. Definition, Classification and Why Deepfake is Important
-
Modality deepfakes
- Visual: face swaps, reenactment, lip-sync, attribute editing (age, expression), full-body synthesis, scene relighting.
- Audio: voice cloning (speaker identity conversion), speech-to-speech conversion, text-to-speech impersonation.
- Multimodal: synchronized audio–video generation, avatar systems, “talking head” models with cloned voice.
-
Manipulation intent deepfakes
- Identity substitution (impersonation, fraud, non-consensual sexual content).
- Event fabrication (false evidence, fake statements, fake presence).
- Contextual distortion (true footage reframed via synthetic overlays, selective edits, or deceptive narration).
-
Generation regime deepfakes
- Closed-world generation (trained on a specific target identity).
- Open-world generation (foundation models enabling broad, low-friction synthesis).
1.1. Why Are Deepfakes Important?
- Evidentiary erosion: over time, authentic recordings become easier to dismiss as fake (e.g. someone might state: “this is not genuine, it could be AI”).
1.2. Why Detection Is Necessary But Not Always Sufficient?
-
Policy and compliance controls, including transparency duties for certain AI outputs and platform obligations for risk mitigation and accountability. In the EU context, deepfakes intersect directly with:
2. Creating Deepfakes: Types of Fakes, Standard Pipelines, and Generative Models
2.1. Common Deepfake Types
- Face swapping: the target’s face is replaced with the source’s face, with the goal of preserving the pose, lighting, and background. Historically, this was based on GAN families and evolved into high-fidelity models (e.g., the StyleGAN line) [26,27,28]. Such techniques are especially problematic for military briefings or crisis communication videos.
- Text-to-video / full scene synthesis: video sequences (and not just “doctored” faces) are generated using diffusion-based video generation and text-to-video approaches that expand the threat from “evidence tampering” to “event fabrication” [37,38]. This is crucial as it creates a risk of fabricated military events, scenes, or operational incidents.
2.2. Typical Production Pipeline
- Data collection/selection: sufficient variety of poses, expressions, and lighting (especially for identity-specific models).
- Localization/normalization: face detection, landmarks, alignment, cropping, and photometric normalization.
- Model training or adaptation: either general-purpose (foundation-style) or tailored to a specific individual/target.
- Inference and temporal consistency: especially in video, temporal consistency is crucial for perceptual plausibility.
- Post-processing: blending, color matching, denoising/oversampling, and final re-encoding, which often “hides” obvious traces and shifts detection to more subtle statistical cues.
2.3. Major Families of Generative Models
3. Deepfake Properties and Characteristics
3.1. Deepfake Categories for Audio and Visual Objects
- Entire face synthesis: the creation of an entirely new face (or even an entire image/scene) that does not correspond to a real, recorded person.
- Identity swap / face swap: replacing one subject’s face with another’s, typically while preserving the “host’s” pose, lighting, and motion.
- Attribute manipulation: changes to specific characteristics (age, gender/expression, morphological features), without necessarily changing the identity.
- Expression swap / reenactment: we “transfer” the expressions/movements of a target from a source so that the mouth, eyebrows, and micro-expressions match another video or a live source. A classic (predating modern deep models but fundamental) family of reenactment techniques is exemplified in projects such as [29].
3.2. Characteristics That "Give away" Deepfakes: Spatial, Temporal, Frequency, and Physiological Traces
- Spatial/morphological artifacts: Examples include: imperfect blending at the face–skin/hair boundaries, unrealistic geometry in teeth/lips, inconsistencies in shading/lighting, or a “mismatched” background in high-frequency details. Such indicators are systematized in approaches that analyze visual artifacts as production “signatures” [59,60].
- Frequency-domain traces: Many generative models leave traces in the frequency spectrum due to resampling and up sampling (e.g., “regularities” not systematically found in natural images). Frequency-domain analysis has been proposed as a complementary “channel” of evidence, especially when spatial cues are attenuated by compression [63,64].
- Physiological cues: A more recent line of thinking capitalizes on the fact that real-time facial video incorporates subtle physiological changes (e.g., remote photoplethysmography based on micro-changes in skin color). Deviations in such signals can serve as an indication of synthetic origin [65,66].
4. Detection and Evaluation of Deepfakes: Methodologies, Data, and Benchmarks
- authenticity classification (real vs. fake),
- spatial/temporal localization of alterations, and
- classification of forgery types (e.g., swap, reenactment, synthesis).
4.1. Cues and “Signatures” of Forgery
- Spatial artifacts: Cues resulting from blending, facial distortion, texture/lighting inconsistencies, or imperfect rendering of details. Works such as [59] make an effort to systematize such visual anomalies as practical detection cues, while the detection of warping artifacts has been proposed as a more specific category of cues in face manipulation [81,82].
- Temporal consistency: In videos, deepfakes may exhibit inconsistencies in the progression of facial expressions, micro-motions, or the stability of facial features from frame to frame. A classic line of research leverages “natural” temporal regularities, such as blink rate, to reveal synthetic content [61,62].
- Frequency-domain/spectral anomalies (frequency-domain cues): Generative pipelines often introduce patterns that are more prominent in the frequency domain, particularly due to up sampling and resampling. Frequency-domain analysis has been proposed as a complementary approach when purely visual artifacts are attenuated by compression [63].
- Biological/physiological signals (physiological cues): The use of signals such as rPPG (micro-changes in skin color related to pulsatile blood flow) aims to provide a more “generalizable” criterion, as such signals are not always reproduced with natural consistency in synthetic videos. A prime example of this approach is [65].
4.2. Detector Families: From Frame-Level to Multimodal
- Frame-level detectors (image-based detection): They focus on individual frames and offer advantages in terms of computational cost and are used in large-scale screening, but they are vulnerable to the selection of “good” frames or to deepfakes that optimize spatial artifacts [84].
- Video-level detectors (spatio-temporal models): They incorporate temporal information (e.g., 3D CNN, temporal aggregation, transformers) and tend to improve detection in cases where the forgery “escapes” spatially but remains temporally inconsistent [85].
- Compact / mesoscopic architectures: for example, [86] presented a compact architecture for face forgery detection, targeting meso-level features that remain useful under standard compression.
- Multimodal detection (audio–video): As deepfake production shifts toward full audiovisual synthesis, the integration of audio and video becomes particularly important. Datasets such as the ones used in [79] were designed specifically for evaluating multimodal scenarios (face + voice), while anti-spoofing benchmarks for speech support systematic evaluation of synthetic/transformed speech were also developed at times [69].
4.3. Datasets and Benchmarks: What We Measure and Why It Matters
- Celeb-DF: designed as a more challenging dataset to reduce the “ease” of detection via simplistic artifacts and push for generalization [73].
- DeeperForensics-1.0: focuses on conditions that closely resemble real-world pipelines and highlights the implications of cross-dataset evaluation [74].
- WildDeepfake: compiles an “in-the-wild” collection, highlighting the drop in performance when detectors are applied to real-world internet conditions [75].
- FaceForensics++: serves as a classic evaluation dataset for manipulated facial images and is widely used in experimental comparisons [80].
4.4. Evaluation Protocols and Metrics: From AUROC to Operational Reliability
- Classification metrics and class imbalance: In addition to accuracy, AUROC and AUPRC are used (especially in cases of class imbalance), while in anti-spoofing scenarios, metrics such as EER are standard [69].
- Calibration and decision thresholds: In operational scenarios, the output is not merely a “label,” but a risk score that leads to action (e.g., human review, takedown, posting ban). Therefore, calibration and threshold selection are part of the evaluation.
5. Mitigation, Governance, and Regulatory Compliance for Synthetic Content
- a detector’s performance on controlled data does not guarantee operational reliability under varying dissemination channels, and
5.1. Threat Identification, Risk Modeling, and Transparency Disclosure Controls
5.2. Risk Governance, Management Systems, and Technical Content Transparency
- TEVV (testing–evaluation–verification–validation) for detection/labeling/provenance tools,
- assurance case (documented argumentation that the system is “sufficiently secure/reliable” for a specific use),
- change management (what changes when the codec, model provider, or platform changes),
- monitoring & drift management (monitoring of performance degradation/increase in errors).
- detection signals (probabilistic evidence from ML detectors) and
- verifiable evidence that can be verified cryptographically or through structured manifests.
5.3. Watermarking Resilience, Operational Integration, and Forensics-Grade Documentation
6. Implications and Scenarios of Abuse: A Case-Based Analysis in Key Application Areas
6.1. Case A - News, Journalism, and Fact-Checking (Newsroom Verification)
6.2. Case B - Financial Fraud, Corporate Security, and Remote Identity Verification (KYC/Remote Onboarding)
6.3. Case C - Public Sector, Law Enforcement, and Forensic Use (Forensics, Chain of Custody)
6.4. Case D - Education, Academic Integrity, and the Protection of Students
6.5. Synthetic Assessment: Effectiveness as a System Property
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Verdoliva, L. Media Forensics and DeepFakes: An Overview. IEEE J. Sel. Top. Signal Process. 2020, 14, 910–932. [Google Scholar] [CrossRef]
- Akhtar, Z. Deepfakes generation and detection: A short survey. J. Imaging 2023, 9(1), 18. [Google Scholar] [CrossRef]
- Tolosana, R.; Vera-Rodriguez, R.; Fierrez, J.; Morales, A.; Ortega-Garcia, J. Deepfakes and Beyond: A Survey of Face Manipulation and Fake Detection. Inf. Fusion 2020, 64, 131–148. [Google Scholar] [CrossRef]
- Maxmudjanov, S.; Naimov, A. DEEPFAKE CONTENT TYPES AND THEIR GENERATION METHODS. Пoтoмки Аль-Фаргани 2026, 1, 26–31. [Google Scholar] [CrossRef]
- Rainie, L.; Anderson, J.; Vogels, E.A. Experts doubt ethical AI design will be broadly adopted as the norm within the next decade. Pew Res. Cent. 2021, 16, 121–154. Available online: https://www.pewresearch.org/internet/2021/06/16/experts-doubt-ethical-ai-design-will-be-broadly-adopted-as-the-norm-within-the-next-decade/ (accessed on 1 June 2026).
- Mahr, D. Sexualized deepfakes as a socio-technical continuation of gendered power. AI Soc. 2026, 1–13. [Google Scholar] [CrossRef]
- Twomey, J. Socially (de) constructing deepfakes: understanding socio-technical interests and concerns in deepfake media. Ph.D. Thesis, University College Cork, Cork, Ireland, 2025. Available online: https://hdl.handle.net/10468/18856 (accessed on 1 June 2026).
- Perriello, L.E. Blurred realities: Legal strategies for the deepfake era. Maastricht J. Eur. Comp. Law. 2026, 1023263X261433380. [Google Scholar] [CrossRef]
- Zhang, B. Governing deepfake realities: The truth immunity framework for telecommunications policy. Telecommun. Policy 2026, 103168. [Google Scholar] [CrossRef]
- NIST. Artificial Intelligence Risk Management Framework (AI RMF 1.0); NIST AI 100-1; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2023. [Google Scholar] [CrossRef]
- Soto-Sanfiel, M.T.; Wu, Q. How audiences make sense of deepfake resurrections: A multilevel analysis of realism, ethics, and cultural meaning. Comput. Hum. Behav. 2025, 108822. [Google Scholar] [CrossRef]
- Putra, A.B. The Legal Standing of Deepfake Digital Evidence in Criminal Proceedings: Challenges of Evidentiary Integrity in the AI Era. Fox Justi J. Ilmu Huk. 2026, 16, 283–290. Available online: https://ejournal.seaninstitute.or.id/index.php/Justi/article/view/8328 (accessed on 1 June 2026).
- Cheng, E.K. Deepfakes, Photographs, and Trust in Evidence. Virginia Law Review Online. 2025. [CrossRef]
- Peebles, W.; Xie, S. Scalable Diffusion Models with Transformers. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2023; pp. 4172–4182. [Google Scholar] [CrossRef]
- Barari, S.; Lucas, C.; Munger, K. Political deepfakes are as credible as other fake media and (sometimes) real media. J. Polit. 2025, 87, 510–526. [Google Scholar] [CrossRef]
- Soundarya, B.C.; Gururaj, H.L. Deepfake detection: critical review of state-of-the-art approaches and future perspectives. Discov. Appl. Sci. 2026. [CrossRef]
- Sharma, S.; Selwal, A. Potential of artificial intelligence in deepfake media: From generation to detection mechanisms, state-of-the-art, and challenges. Comput. Sci. Rev. 2026, 60, 100866. [Google Scholar] [CrossRef]
- Erokhin, D.; Komendantova, N. A Review of Tools and Technologies to Combat Deepfakes. Information 2026, 17, 347. [Google Scholar] [CrossRef]
- Dhanapal, R.; Ps, A. Defending Against Adaptive Deepfake Generation and Detection Evasion Attacks. In Proceedings of the 2026 9th International Conference on Inventive Computation Technologies (ICICT), April 2026; pp. 1619–1624. [Google Scholar] [CrossRef]
- Modi, K. The Deepfake Conundrum: Assessing Generative AI's Threat to Digital Reality and Proposing a Multi-Layered Defense Framework. Int. J. Comput. Trends Technol. 2025. [CrossRef]
- Coalition for Content Provenance and Authenticity (C2PA). Content Credentials: C2PA Technical Specification, Version 2.2; C2PA, 2025. Available online: https://spec.c2pa.org/specifications/specifications/2.2/specs/_attachments/C2PA_Specification.pdf (accessed on 1 June 2026).
- European Parliament; Council of the European Union. Regulation (EU) 2024/1689 (Artificial Intelligence Act). Off. J. Eur. Union. 2024. Available online: https://eur-lex.europa.eu/eli/reg/2024/1689/oj/eng (accessed on 1 June 2026).
- European Parliament; Council of the European Union. Regulation (EU) 2022/2065 (Digital Services Act). Off. J. Eur. Union 2022, L 277, 1–102. Available online: https://eur-lex.europa.eu/eli/reg/2022/2065/oj/eng (accessed on 1 June 2026).
- European Parliament; Council of the European Union. Regulation (EU) 2016/679 (General Data Protection Regulation). Off. J. Eur. Union 2016, L 119, 1–88. Available online: https://eur-lex.europa.eu/eli/reg/2016/679/oj/eng (accessed on 1 June 2026).
- Romero Moreno, F. Generative AI and deepfakes: a human rights approach to tackling harmful content. Int. Rev. Law. Comput. Technol. 2024, 38, 297–326. [Google Scholar] [CrossRef]
- Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. arXiv 2014, arXiv:1406.2661. [Google Scholar] [CrossRef]
- Karras, T.; Laine, S.; Aittala, M.; Hellsten, J.; Lehtinen, J.; Aila, T. Analyzing and Improving the Image Quality of StyleGAN. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020), Seattle, WA, USA, 13–19 June 2020; pp. 8110–8119. [Google Scholar] [CrossRef]
- Waseem, S.; Bakar, S. A. R. S. A.; Ahmed, B. A.; Omar, Z.; Eisa, T. A. E.; Dalam, M. E. E. DeepFake on face and expression swap: A review. IEEE Access. 2023, 11, 117865–117906. [Google Scholar] [CrossRef]
- Thies, J.; Zollhöfer, M.; Stamminger, M.; Theobalt, C.; Nießner, M. Face2Face: Real-Time Face Capture and Reenactment of RGB Videos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016), Las Vegas, NV, USA, 27–30 June 2016; pp. 2387–2395. [Google Scholar] [CrossRef]
- Dhanyalakshmi, R.; Popirlan, C.I.; Hemanth, D.J. A survey on deep learning based reenactment methods for deepfake applications. IET Image Process. 2024, 18, 4433–4460. [Google Scholar] [CrossRef]
- Prajwal, K.R.; Mukhopadhyay, R.; Namboodiri, V.; Jawahar, C.V. A Lip Sync Expert Is All You Need for Speech to Lip Generation in the Wild. In Proceedings of the 28th ACM International Conference on Multimedia (ACM MM 2020), Virtual Event, 12–16 October 2020; pp. 484–492. [Google Scholar] [CrossRef]
- Xiong, X.; Patel, P.; Fan, Q.; Wadhwa, A.; Selvam, S.; Guo, X.; et al. Talkingheadbench: A multi-modal benchmark & analysis of talking-head deepfake detection. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026; pp. 4139–4149. (accessed on 1 June 2026). [Google Scholar] [CrossRef]
- Qian, K.; Zhang, Y.; Chang, S.; Yang, X.; Hasegawa-Johnson, M. AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss. arXiv 2019, arXiv:1905.05879. [Google Scholar] [CrossRef]
- Kong, J.; Kim, J.; Bae, J. HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis. arXiv 2020, arXiv:2010.05646. [Google Scholar] [CrossRef]
- van den Oord, A.; Dieleman, S.; Zen, H.; Simonyan, K.; Vinyals, O.; Graves, A.; Kalchbrenner, N.; Senior, A.; Kavukcuoglu, K. WaveNet: A Generative Model for Raw Audio. arXiv 2016, arXiv:1609.03499. [Google Scholar] [CrossRef]
- Alnaqbi, M.; Ikuesan, R.A. A systematic review of audio deepfake detection techniques for digital investigation. Discov. Comput. 2026, 29, 202. [Google Scholar] [CrossRef]
- Ho, J.; Salimans, T.; Gritsenko, A.; Chan, W.; Norouzi, M.; Fleet, D.J. Video Diffusion Models. arXiv 2022, arXiv:2204.03458. [Google Scholar] [CrossRef]
- Singer, U.; Polyak, A.; Hayes, T.; Yin, X.; An, J.; Zhang, S.; Hu, Q.; Yang, H.; Ashual, O.; Gafni, O.; Parikh, D.; Gupta, S.; Taigman, Y. Make-A-Video: Text-to-Video Generation without Text-Video Data. arXiv 2022, arXiv:2209.14792. [Google Scholar] [CrossRef]
- Vu, P. T. A Fusion Model for Precipitation Nowcasting from Radar and Satellite. In Multi-disciplinary Trends in Artificial Intelligence: 18th International Conference, MIWAI 2025, Ho Chi Minh City, Vietnam, December 3–5, 2025, Proceedings, Part I, 16353, 430. [CrossRef]
- Li, Q.; Wang, W.; Du, S.; Peng, B.; Dong, J.; Wang, K.; et al. Towards High Fidelity Face Swapping: A Comprehensive Survey and New Benchmark. arXiv 2026, arXiv:2605.00883. [Google Scholar] [CrossRef]
- Zhu, J.-Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV 2017), Venice, Italy, 22–29 October 2017; pp. 2242–2251. [Google Scholar] [CrossRef]
- Choi, Y.; Choi, M.; Kim, M.; Ha, J.-W.; Kim, S.; Choo, J. StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018), Salt Lake City, UT, USA, 18–22 June 2018; pp. 8789–8797. [Google Scholar] [CrossRef]
- Liu, X.; Xu, R.; Chen, Y. Securing Digital Media Integrity: A Survey of Watermarking and Manipulation Detection for Image Authentication. Authorea Preprints. 2025. [CrossRef]
- Thies, J.; Zollhöfer, M.; Nießner, M. Deferred Neural Rendering: Image Synthesis using Neural Textures. ACM Trans. Graph. 2019, 38, 66. [Google Scholar] [CrossRef]
- He, Z.; Henderson, P.; Pugeault, N. Beyond Reconstruction: A Physics Based Neural Deferred Shader for Photo-Realistic Rendering. In Proceedings of the International Conference on Artificial Neural Networks; Springer Nature Switzerland: Cham, Switzerland, September 2025; pp. 378–389. [Google Scholar] [CrossRef]
- Svitov, D.; Dahaghin, M. NBAvatar: Neural Billboards Avatars with Realistic Hand-Face Interaction. arXiv 2026, arXiv:2603.12063. [Google Scholar] [CrossRef]
- Ho, J.; Jain, A.; Abbeel, P. Denoising Diffusion Probabilistic Models. arXiv 2020, arXiv:2006.11239. [Google Scholar] [CrossRef]
- Zhang, X.; Chen, C. Parameter-efficient quantum denoising diffusion probabilistic models with temporal encoding. Future Gener. Comput. Syst. 2026, 174, 107981. [Google Scholar] [CrossRef]
- Song, J.; Meng, C.; Ermon, S. Denoising Diffusion Implicit Models. arXiv 2020, arXiv:2010.02502. [Google Scholar] [CrossRef]
- Zhang, Q.; Tao, M.; Chen, Y. gddim: Generalized denoising diffusion implicit models. arXiv. 2022. [CrossRef]
- Rombach, R.; Blattmann, A.; Lorenz, D.; Esser, P.; Ommer, B. High-Resolution Image Synthesis with Latent Diffusion Models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), New Orleans, LA, USA, 19–24 June 2022; pp. 10684–10695. [Google Scholar] [CrossRef]
- Lin, Z.; Wang, X.; Zhang, S.; Wang, T.; Guo, Z.; Liu, T. TSCM: Efficient Image Synthesis Using Latent Diffusion. Pattern Recognit. Lett. 2026. [CrossRef]
- Mildenhall, B.; Srinivasan, P.P.; Tancik, M.; Barron, J.T.; Ramamoorthi, R.; Ng, R. NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. arXiv 2020, arXiv:2003.08934. [Google Scholar] [CrossRef]
- Wang, K.; Wei, K.; Li, S.Y. Dynamic view synthesis with topologically-varying neural radiance fields from sparse input views. Neurocomputing 2026, 132942. [Google Scholar] [CrossRef]
- Karypidis, Efstathios; Mouslech, Stylianos G.; Skoulariki, Kassiani; Gazis, Alexandros. Comparison Analysis of Traditional Machine Learning and Deep Learning Techniques for Data and Image Classification. WSEAS Trans. Math. 2022, vol. 21, 122–130. [Google Scholar] [CrossRef]
- Sun, X.; Xu, Z.; Li, X.; et al. Deepfake: definitions, performance metrics and standards, datasets and future directions. Front. Big Data 2024, 7, 1400024. [Google Scholar] [CrossRef]
- Pei, G.; Zhang, J.; Hu, M.; Zhang, Z.; Wang, C.; Wu, Y.; et al. Deepfake generation and detection: A benchmark and survey. ACM Comput. Surv. 2026, 58, 1–41. [Google Scholar] [CrossRef]
- Hossain, S.; Sudarsan, D.; Zaffar, H.; Ahmad, N. Social and psychological impact of deepfakes: a comprehensive bibliometric review. Glob. Knowl. Mem. Commun. 2026, 1–20. [Google Scholar] [CrossRef]
- Matern, F.; Riess, C.; Stamminger, M. Exploiting Visual Artifacts to Expose Deepfakes and Face Manipulations. In Proceedings of the IEEE Winter Applications of Computer Vision Workshops (WACVW), 2019; pp. 83–92. [Google Scholar] [CrossRef]
- Syed Abu Bakar, S.A.R.; Waseem, S.; Omar, Z.; Bilalashfaqahmed. Exploring the Advancements and Challenges of Deepfake Face-swap: A Survey. Multimed. Tools Appl. 2026, 85, 14. [Google Scholar] [CrossRef]
- Li, Y.; Chang, M.-C.; Lyu, S. Ictu Oculi: Exposing AI Generated Fake Face Videos by Detecting Eye Blinking. In Proceedings of the IEEE International Workshop on Information Forensics and Security (WIFS), 2018. [Google Scholar] [CrossRef]
- Sar, A.; Roy, S.; Choudhury, T.; Abraham, A. Zero-shot visual deepfake detection: can ai predict and prevent fake content before it is created? Found. Trends Signal Process. 2025, 19(3), 212–361. [Google Scholar] [CrossRef]
- Frank, J.; Eisenhofer, T.; Schönherr, L.; Fischer, A.; Kolossa, D.; Holz, T. Leveraging Frequency Analysis for Deep Fake Image Recognition. arXiv 2020, arXiv:2003.08685. [Google Scholar] [CrossRef]
- Hamadene, A.; Allili, M.S. Cross-Model Deepfake Detection Through Contourlet-Based Inter-Channel Spectral Analysis. IEEE Trans. Biom. Behav. Identity Sci. 2026. [CrossRef]
- Ciftci, U.A.; Demir, I.; Yin, L. FakeCatcher: Detection of Synthetic Portrait Videos Using Biological Signals. IEEE Trans. Pattern Anal. Mach. Intell. 2020. [CrossRef]
- Sharma, D.; Garg, G.; Chawla, J. Fusion of EfficientNetB0 and ResNet50 for Enhanced Deepfake Video Detection. In Proceedings of the 2026 IEEE International Conference for Convergence in Computing Technology (I3CTCON), March 2026; pp. 1–6. [Google Scholar] [CrossRef]
- Cozzolino, D.; Verdoliva, L. Noiseprint: A CNN-Based Camera Model Fingerprint. IEEE Trans. Inf. Forensics Secur. 2019, 15, 144–159. [Google Scholar] [CrossRef]
- Siddiqui, N.; Islam, S. An Efficient Feature-Based Framework for Camera Model Identification. IEEE Access. 2026. [CrossRef]
- Todisco, M.; Wang, X.; Vestman, V.; Sahidullah, M.; Delgado, H.; Nautsch, A.; Yamagishi, J.; Evans, N. ASVspoof 2019: A Large-Scale Public Database of Synthesized, Converted and Replayed Speech. Comput. Speech Lang. 2021, 64, 101114. [Google Scholar] [CrossRef]
- Langmia, K. Black Communication in the Age of Disinformation: DeepFakes and Synthetic Media; Springer Nature: Cham, Switzerland, 2023. [Google Scholar] [CrossRef]
- Dolhansky, B.; Bitton, J.; Pflaum, B.; Lu, J.; Howes, R.; Wang, M.; Canton Ferrer, C. The DeepFake Detection Challenge (DFDC) Dataset. arXiv 2020, arXiv:2006.07397. [Google Scholar] [CrossRef]
- Meta, A.I. Deepfake Detection Challenge Dataset (DFDC). Available online: https://ai.meta.com/datasets/dfdc/ (accessed on 1 June 2026).
- Li, Y.; Yang, X.; Sun, P.; Qi, H.; Lyu, S. Celeb-DF: A Large-Scale Challenging Dataset for DeepFake Forensics. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020), Seattle, WA, USA, 13–19 June 2020; pp. 3207–3216. [Google Scholar] [CrossRef]
- Jiang, L.; Li, R.; Wu, W.; Qian, C.; Loy, C.C. DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020), Seattle, WA, USA, 13–19 June 2020; pp. 2889–2898. [Google Scholar] [CrossRef]
- Zi, B.; Chang, M.; Chen, J.; Ma, X.; Jiang, Y.-G. WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection. In Proceedings of the 28th ACM International Conference on Multimedia (MM ’20), Virtual Event, 12–16 October 2020; pp. 2382–2390. [Google Scholar] [CrossRef]
- He, Y.; Gan, B.; Chen, S.; Zhou, Y.; Yin, G.; Song, L.; Sheng, L.; Shao, J.; Liu, Z. ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021), Nashville, TN, USA, 20–25 June 2021; pp. 4360–4369. [Google Scholar] [CrossRef]
- He, Y.; Sheng, L.; Shao, J.; Liu, Z.; et al. ForgeryNet—Face Forgery Analysis Challenge 2021: Methods and Results. arXiv 2021, arXiv:2112.08325. [Google Scholar] [CrossRef]
- CodaLab. ForgeryNet—Face Forgery Analysis Challenge 2021 (Competition Page). Available online: https://competitions.codalab.org/competitions/33386 (accessed on 1 June 2026).
- Khalid, H.; Tariq, S.; Kim, M.; Woo, S.S. FakeAVCeleb: A Novel Audio-Video Multimodal Deepfake Dataset. arXiv. 2021. [CrossRef]
- Rössler, A.; Cozzolino, D.; Verdoliva, L.; Riess, C.; Thies, J.; Nießner, M. FaceForensics++: Learning to Detect Manipulated Facial Images. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019. [Google Scholar] [CrossRef]
- Li, Y.; Lyu, S. Exposing DeepFake Videos by Detecting Face Warping Artifacts. arXiv. 2018. [CrossRef]
- Bai, W.; Liu, Y.; Zhang, A.; Wang, Y.; Li, B.; Hu, W.; Zhang, Z. Deepfake Detection via Exploring Degradation Inconsistency. IEEE Trans. Inf. Forensics Secur. 2026. [CrossRef]
- Yan, Z.; Luo, Y.; Lyu, S.; Liu, Q.; Wu, B. Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 2024; pp. 8984–8994. [Google Scholar] [CrossRef]
- Tariq, R.; Heo, M.; Tariq, S.; Woo, S. Through the Lens: Benchmarking Deepfake Detectors Against Moiré-Induced Distortions. Adv. Neural Inf. Process. Syst. 2025, 38. Available online: https://proceedings.neurips.cc/paper_files/paper/2025/hash/75c4f24bf8a51f0b0870f0f9bceea4ea-Abstract-Datasets_and_Benchmarks_Track.html (accessed on 1 June 2026).
- Alanazi, S.; Asif, S. VIDS-Guard: A Novel Forensics-Aware Multi-Stream Transformer Framework for Robust Deepfake Video Detection. Int. Syst. Appl. 2026, 200664. [Google Scholar] [CrossRef]
- Afchar, D.; Nozick, V.; Yamagishi, J.; Echizen, I. MesoNet: A Compact Facial Video Forgery Detection Network. In Proceedings of the IEEE International Workshop on Information Forensics and Security (WIFS), Hong Kong, China, 11–13 December 2018. [Google Scholar] [CrossRef]
- Mesa-Simón, M.; Escobar-Molero, A.; Sáez-Mingorance, B.; Morales, D. P.; Álvarez-Bermejo, J. A.; Romero, F. J. Enabling live video provenance and authenticity: A C2PA-based system with TPM-based security for livestreaming platforms. IEEE Trans. Multimed. 2026, 1–12. [Google Scholar] [CrossRef]
- Content Authenticity Initiative (CAI). Video Provenance and the Ethics of Deepfakes (Blog post). 2021. Available online: https://contentauthenticity.org/ (accessed on 1 June 2026).
- Nastoska, A.; Jancheska, B.; Rizinski, M.; Trajanov, D. Evaluating trustworthiness in AI: Risks, metrics, and applications across industries. Electronics 2025, 14(13), 2717. [Google Scholar] [CrossRef]
- Guan, H.; Horan, J.; Zhang, A. Guardians of Forensic Evidence: Evaluating Analytic Systems Against AI-Generated Deepfakes; Forensics@NIST 2025; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2025; Available online: https://www.nist.gov/publications/guardians-forensic-evidence-evaluating-analytic-systems-against-ai-generated-deepfakes (accessed on 1 June 2026).
- Chesney, R.; Citron, D. Deep Fakes: A Looming Challenge for Privacy, Democracy, and National Security. Calif. Law. Rev. 2019, 107, 1753–1819. [Google Scholar] [CrossRef]
- European Commission. AI Act Service Desk: Article 50—Transparency Obligations for Providers and Deployers of Certain AI Systems; European Commission: Brussels, Belgium, 2025; Available online: https://artificialintelligenceact.eu/article/50/ (accessed on 1 June 2026).
- International Organization for Standardization (ISO); International Electrotechnical Commission (IEC). ISO/IEC 42001:2023—Information Technology—Artificial Intelligence—Management System; ISO: Geneva, Switzerland, 2023; Available online: https://www.iso.org/obp/ui/en/#iso:std:iso-iec:42001:ed-1:v1:en (accessed on 1 June 2026).
- International Organization for Standardization (ISO); International Electrotechnical Commission (IEC). ISO/IEC 23894:2023—Information Technology—Artificial Intelligence—Guidance on Risk Management; ISO: Geneva, Switzerland, 2023; Available online: https://www.iso.org/standard/77304.html (accessed on 1 June 2026).
- Advanced Television Systems Committee (ATSC). A/334—Audio Watermark Emission; ATSC: Washington, DC, USA, 2025; Available online: https://www.atsc.org/atsc-documents/a3342016-audio-watermark-emission/ (accessed on 1 June 2026).
- Advanced Television Systems Committee (ATSC). A/335—Video Watermark Emission; ATSC: Washington, DC, USA, 2025; Available online: https://www.atsc.org/atsc-documents/a3352016-video-watermark-emission/ (accessed on 1 June 2026).
- Advanced Television Systems Committee (ATSC). A/336—Content Recovery in Redistribution Scenarios; ATSC: Washington, DC, USA, 2024; Available online: https://www.atsc.org/atsc-documents/a3362017-content-recovery-redistribution-scenarios/ (accessed on 1 June 2026).
- Ganbaatar, U. Do Ethics in AI Still Matter? A Review of the 2021 UNESCO Recommendation on the Ethics of AI. Rev. Faith Int. Aff. 2025, 23(3), 26–33. [Google Scholar] [CrossRef]
- OECD. Recommendation of the Council on Artificial Intelligence (OECD/LEGAL/0449); OECD: Paris, France, 2019; Available online: https://legalinstruments.oecd.org/en/instruments/oecd-legal-0449 (accessed on 1 June 2026).
- Gaur, L. (Ed.) DeepFakes: Creation, Detection, and Impact; CRC Press: Boca Raton, FL, USA, 2022. [Google Scholar] [CrossRef]
- Schick, N. Deepfakes: The Coming Infocalypse; Twelve (Hachette Book Group): New York, NY, USA, 2020; ISBN 978-1538754313. Available online: https://www.hachettebookgroup.com/titles/nina-schick/deepfakes/9781538754313/?lens=twelve (accessed on 1 June 2026).
- Mooney, C. AI and Deception: Plagiarism, Deep Fakes, and More; ReferencePoint Press: San Diego, CA, USA, 2025; Available online: https://catalog.pcpls.org/Record/23319106 (accessed on 1 June 2026)ISBN 978-1-6782-1066-3.
- Kietzmann, J.; Lee, L.W.; McCarthy, I.P.; Kietzmann, T.C. Deepfakes: Trick or treat? Bus. Horiz. 2020, 63, 135–146. [Google Scholar] [CrossRef]
- Vaccari, C.; Chadwick, A. Deepfakes and Disinformation: Exploring the Impact of Synthetic Political Video on Deception, Uncertainty, and Trust in News. Soc. Media Soc. 2020, 6, 2056305120903408. [Google Scholar] [CrossRef]
- Gregory, J. The Trouble with Deepfakes ISBN 978-1668946992; Cherry Lake Publishing: Ann Arbor, MI, USA, 2024; Available online: https://cherrylakepublishing.com/shop/show/53868 (accessed on 1 June 2026).
- Chandra, B.; Dunietz, J.; Roberts, K. Reducing Risks Posed by Synthetic Content: An Overview of Technical Approaches to Digital Content Transparency (NIST AI 100-4); National Institute of Standards and Technology: Gaithersburg, MD, USA, 2024. [Google Scholar] [CrossRef]
- Coalition for Content Provenance and Authenticity (C2PA). C2PA Technical Specification, Version 1.3, C2PA, 2023. Available online: https://spec.c2pa.org/specifications/specifications/1.3/specs/_attachments/C2PA_Specification.pdf (accessed on 1 June 2026).
- European Union. Regulation (EU) 2024/1689 (Artificial Intelligence Act)—Transparency obligations (Art. 50). Available online: https://ai-act-service-desk.ec.europa.eu/en/ai-act/article-50 (accessed on 1 June 2026).
- de Araujo Meirelles Magalhães, F.; Calle, I. M. Consumer Protection in the Digital Age: A Reflection on Regulation 2022/2065 (Digital Services Act—DSA) and Agenda 2030. In International Conference a Digital Europe for Citizens, Data governance, Digital Markets, Digital Services; Springer Nature Switzerland: Cham, 2026; pp. 171–188. [Google Scholar] [CrossRef]
- ENISA. Remote ID Proofing Good Practices; European Union Agency for Cybersecurity: Athens/Heraklion, Greece, 2024; Available online: https://www.enisa.europa.eu/sites/default/files/2024-11/Remote%20ID%20Proofing%20Good%20Practices_en_0.pdf (accessed on 1 June 2026).
- NIST. SP 800-63A: Digital Identity Guidelines—Identity Proofing and Enrollment; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2017; Available online: https://nvlpubs.nist.gov/nistpubs/SpecialPublications/NIST.SP.800-63a.pdf (accessed on 1 June 2026).
- European Union. Regulation (EU) 2016/679 (General Data Protection Regulation)—Art. 9 (special categories of personal data). Available online: https://eur-lex.europa.eu/eli/reg/2016/679/oj/eng (accessed on 1 June 2026).
- Innovation Lab, Europol. Facing Reality? Law Enforcement and the Challenge of Deepfakes; Europol: The Hague, The Netherlands, 2022; Available online: https://www.europol.europa.eu/cms/sites/default/files/documents/Europol_Innovation_Lab_Facing_Reality_Law_Enforcement_And_The_Challenge_Of_Deepfakes.pdf (accessed on 1 June 2026).
- UNESCO. Recommendation on the Ethics of Artificial Intelligence; UNESCO: Paris, France, 2021; Available online: https://unesdoc.unesco.org/ark:/48223/pf0000380455 (accessed on 1 June 2026).
- Yan, Z.; Zhang, Y.; Yuan, X.; Lyu, S.; Wu, B. DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection. arXiv. 2023. [CrossRef]
- Mirsky, Y.; Lee, W. The Creation and Detection of Deepfakes: A Survey. ACM Comput. Surv. 2021, 54, 1–41. [Google Scholar] [CrossRef]
- Fernandez, P.; Couairon, G.; Jégou, H.; Douze, M.; Furon, T. The Stable Signature: Rooting Watermarks in Latent Diffusion Models. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2–6 October 2023. [Google Scholar] [CrossRef]
- Wen, Y.; Kirchenbauer, J.; Geiping, J.; Goldstein, T. Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust. arXiv. 2023. [CrossRef]
- Hu, Y.; Jiang, Z.; Guo, M.; Gong, N. Stable Signature is Unstable: Removing Image Watermark from Diffusion Models. arXiv. 2024. [CrossRef]
- Ci, H.; et al. RingID: Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification. arXiv 2024, arXiv:2404.14055. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).