Overcoming Unimodal Challenges: A Survey of Multi-Modal Fusion for Mobile Interfaces

Harsh Wanjari
Danish Ayub Gaus
Uday Bhoyar
Dr. Komal K. Gehani
Sumit Prasad

Abstract

The proliferation of mobile devices has spurred the development of interaction paradigms that extend beyond traditional touch inputs, catering to users with motor impairments and situations requiring hands-free operation. This paper presents a comprehensive survey of the primary modalities for hands-free mobile interaction: head pose estimation, eye-gaze tracking, and voice command recognition. We conduct a comparative analysis of the algorithmic evolution within each modality, tracing the progression from classical computer vision techniques to modern deep learning architectures. For head pose estimation, we evaluate the trade-offs between landmark-based and landmark-free methods, with a focus on lightweight models suitable for on-device deployment. For eye-gaze tracking, we compare model-based and appearance-based approaches, highlighting the critical role of large-scale datasets in achieving robustness. For voice, we analyze the performance characteristics of on-device versus cloud-based speech recognition and the architectural necessity of low-power keyword spotting. Furthermore, we analyze the synergistic potential of multimodal fusion as a solution to inherent unimodal challenges, most notably the "Midas Touch problem." By synthesizing findings from across the field, this survey provides a structured overview of the state of the art and identifies key considerations for designing the next generation of effective and accessible hands-free systems.

Article Details

How to Cite
Wanjari, H., Gaus, D. A., Bhoyar, U., Gehani, K. K., & Prasad, S. (2025). Overcoming Unimodal Challenges: A Survey of Multi-Modal Fusion for Mobile Interfaces. International Journal of Recent Advances in Engineering and Technology, 14(3s), 159–165. https://doi.org/10.65521/intjournalrecadvengtech.v14i3s.1685
