AI-Based Detection of Cloned Voices in Deepfake Videos

Rajeshwari Kodulkar; Shreya Bhasme; Shruti Rajput; Nujhat Shaikh; Rajashri Yarakadavar

doi:10.65521/ijacte.v15i1.2936

Authors

Rajeshwari Kodulkar, Dept. of AI & Data Science, Dr. J. J. Magdum College of Engineering, Jaysingpur, India
Shreya Bhasme, Dept. of AI & Data Science, Dr. J. J. Magdum College of Engineering, Jaysingpur, India
Shruti Rajput, Dept. of AI & Data Science, Dr. J. J. Magdum College of Engineering, Jaysingpur, India
Nujhat Shaikh, Dept. of AI & Data Science, Dr. J. J. Magdum College of Engineering, Jaysingpur, India
Rajashri Yarakadavar, Dept. of AI & Data Science, Dr. J. J. Magdum College of Engineering, Jaysingpur, India

Keywords:

Deepfake Audio Detection; Voice Cloning; MFCC; Lip Synchronization; Anti-Spoofing; ASVspoof; FakeAVCeleb; Multimodal Detection; CNN; ECAPA-TDNN

Abstract

Voice cloning is no longer science fiction. Today's AI tools can copy someone's voice from just a few seconds of audio. This creates a new threat: take a real video of a trusted person, swap in a cloned voice saying something false, and share it. The face is real and the voice sounds real, but the message is fabricated, and existing speaker verification systems often miss it as well. This paper describes a two-phase detection system built for exactly this kind of attack. Phase I analyzes the audio track using MFCC features, Mel-spectrograms, and a CNN-based classifier. Phase II adds video analysis, checking whether the speaker's lip movements match the audio and looking for timing mismatches between the two. A custom dataset of real and cloned voice samples was built to supplement the ASVspoof 2019/2021 and FakeAVCeleb benchmarks. Results show that cloned voices leave detectable traces, and that combining both phases is noticeably more reliable than audio analysis alone.
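
The abstract does not say how Phase II measures timing mismatches, so the following is only a minimal sketch of one common approach, not the authors' method. It assumes two per-frame signals have already been extracted at the same frame rate: an audio energy envelope and a mouth-opening measure taken from lip landmarks. The function names (`pearson`, `estimate_lag`) and the sample values are hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)

def estimate_lag(audio_env, mouth_open, max_lag):
    """Return (lag, score) for the frame shift that best aligns the two signals.

    A positive lag means the mouth signal trails the audio by that many frames.
    The score is the Pearson correlation over the overlapping frames at that lag.
    """
    n = min(len(audio_env), len(mouth_open))
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        # Pair audio frame i with mouth frame i + lag, keeping only valid indices.
        pairs = [(audio_env[i], mouth_open[i + lag])
                 for i in range(n) if 0 <= i + lag < n]
        if len(pairs) < 2:
            continue
        score = pearson([a for a, _ in pairs], [m for _, m in pairs])
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score

# Hypothetical per-frame values, for illustration only.
audio_env = [0.1, 0.9, 0.3, 0.0, 0.7, 0.2, 0.8, 0.1, 0.0, 0.6,
             0.4, 0.0, 0.9, 0.2, 0.1, 0.5, 0.0, 0.3, 0.8, 0.1]
# Simulate lips that move three frames after the sound.
mouth_delayed = [0.0] * 3 + audio_env[:-3]

lag, score = estimate_lag(audio_env, mouth_delayed, max_lag=5)
print(f"estimated lag: {lag} frames, correlation: {score:.3f}")
```

In a genuine recording the best lag would be expected to sit near zero with a high correlation. A large offset, or a weak correlation at every lag, is the kind of timing mismatch a second phase could pass on as evidence; any decision threshold would have to be tuned on labelled data.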
