MRI
MRI India Journals Vol. 14 No. 1 (2025)

Federated Multimodal Language Recognition: A Deep Learning Approach for Real-Time Applications

Authors

  • M. Asha Aruna Sheela Assistant Professor & HOD,Department of Computer Science & Engineering ,Chalapathi Institute of Engineering and Technology, LAM, Guntur, AP, India
  • Konakanchi Amulya Department of Computer Science and Engineering,Chalapathi Institute of Engineering and Technology, LAM, Guntur, AP, India
  • Dova Lokesh Department of Computer Science and Engineering,Chalapathi Institute of Engineering and Technology, LAM, Guntur, AP, India
  • Karra Yesubabu Department of Computer Science and Engineering,Chalapathi Institute of Engineering and Technology, LAM, Guntur, AP, India
  • Palla Ajay Department of Computer Science and Engineering,Chalapathi Institute of Engineering and Technology, LAM, Guntur, AP, India

DOI:

https://doi.org/10.65521/intjournalrecadvengtech.v14i1.163

Keywords:

Language Detection N-gram Random Forest K-Nearest Neighbors Support Vector Machine Multilingual Identification

Abstract

In a world of growing linguistic diversity, multilingual identification systems are essential for seamless communication across digital platforms and real-world applications. This research presents a robust, deep learning-based multilingual identification framework capable of recognizing and translating languages from text, speech, and image modalities. The proposed system integrates machine learning classifiers—Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Random Forest—with N-gram-based feature extraction to build baseline models. Experimental results highlight SVM's superior performance, achieving 95% accuracy across Tamil, Hindi, and Marathi languages. In parallel, the framework extends to real-time multilingual detection by incorporating advanced deep learning techniques such as Transformers, YOLOv5, and Whisper AI for hybrid text, speech, and image inputs.A key innovation in the system is the integration of Federated Learning (FL), enabling decentralized model training while preserving user privacy. This enhances both scalability and security, particularly in applications such as missing child identification, multilingual surveillance, and cross-border intelligence analysis. The system also features a translation module using Google Translator to convert recognized languages into English, making outputs more accessible for non-native speakers. Evaluations conducted across benchmark datasets demonstrate high precision, recall, and low latency, affirming the system's potential for real-world deployment. Future enhancements will explore large-scale multilingual datasets, context-aware neural architectures, and further FL optimization for real-time, privacy-preserving language recognition.

Downloads

Download data is not yet available.

Downloads

Published

2025-04-14

How to Cite

Sheela, M. A. A., Amulya, K., Lokesh, D., Yesubabu, K., & Ajay, P. (2025). Federated Multimodal Language Recognition: A Deep Learning Approach for Real-Time Applications. International Journal of Recent Advances in Engineering and Technology, 14(1), 17–26. https://doi.org/10.65521/intjournalrecadvengtech.v14i1.163

Issue

Section

Articles

Most read articles by the same author(s)

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.