Spatio-Temporal Deepfake Detection: A CNN-RNN Hybrid Approach for Image and Video Forgery Identification

Vidhina Bansod; Rudra Lagde; Devesh Khachane

doi:10.65521/ijacte.v15i2S.2984

Authors

Vidhina Bansod Dept. of Artificial Intelligence & Data Science (GSMCOE) Pune, India
Rudra Lagde Dept. of Artificial Intelligence & Data Science (GSMCOE) Pune, India
Devesh Khachane Dept. of Artificial Intelligence & Data Science (GSMCOE) Pune, India

DOI:

https://doi.org/10.65521/ijacte.v15i2S.2984

Keywords:

Deepfake Detection Generative Adversarial Networks (GANs) Convolutional Neural Networks Spatio-Temporal Analysis MTCNN VGG16 LSTM DFDC Dataset Video Forensics Explainable AI

Abstract

The proliferation of AI-generated synthetic media, commonly known as deepfakes, poses a grave threat to digital security, public trust, and information integrity. Despite the existence of numerous detection frameworks, many suffer from limited generalizability, lack of interpretability, and poor performance against high-quality forgeries. This paper presents a spatio-temporal deepfake detection system that integrates Convolutional Neural Networks (CNNs) for spatial artifact extraction with Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) units for temporal inconsistency modeling. The system employs MTCNN-based face extraction followed by VGG16/VGG19 backbone networks for per- frame analysis, and leverages the Deepfake Detection Challenge (DFDC) dataset with careful class balancing to overcome dataset bias and fake-accuracy pitfalls. Experimental results demonstrate that the hybrid spatio- temporal approach significantly outperforms single-modality baselines, achieving robust detection across varying lighting conditions, compression levels, and face resolutions. The proposed framework lays the foundation for a real-time multimodal deepfake authentication system integrating audio-visual synchronization analysis.

Spatio-Temporal Deepfake Detection: A CNN-RNN Hybrid Approach for Image and Video Forgery Identification

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Similar Articles

Quick Links

For Authors

For Reviewers

Contact Us

Similar Articles

A Comprehensive Review of Strategy Design for Energy-Efficient Data Offloading in 6G-Enabled Vehicular Edge Computing Networks Using Double Deep Q-Network

An Integrated Deep Learning Framework for Multi-Hazard Risk Assessment: Flood Prediction and Landslide Detection

A Systematic Review of Cryptographic Indexing Schemes for Encrypted Scientific Databases: Methods, Architectures, and Future Research Directions

Efficient Medical Image Classification Using Masked Attention Networks

Adaptive MEMS Gyroscope Regulation Using Steerable Graph Learning Architectures

POS Tagging: A Review of Recent Techniques

Analysis of SVM-based Machine Learning Techniques to Improve the Accuracy of Disease Prediction in the Healthcare Sector

Comprehensive Survey on Healthcare Virtual Assistants: Leveraging Natural Language Processing for Predictive Insights

A Systematic Review of PDE-based models for advanced rendering in computer graphics: Methods, Architectures, and Future Research Directions

PharmaX-Net: A Unified Hybrid Architecture for Polypharmacy Side-Effect Prediction Using Molecular Fingerprints