Multimodal Depression Detection Using Textual and Visual Cues: A Machine Learning Approach with the DAIC-WOZ Dataset

Sapna Singh
Samin Raza
Ratan Rajan Srivastava
Samiksha Singh

Abstract

Depression is a prevalent mental health disorder affecting millions worldwide, yet it remains underdiagnosed because of various barriers in clinical settings. This research proposes a novel multimodal machine learning framework that uses both textual and visual behavioral indicators to detect depression. Using the Distress Analysis Interview Corpus Wizard-of-Oz (DAIC-WOZ) dataset, our approach combines natural language processing techniques for analyzing linguistic patterns with computer vision methods for capturing non-verbal cues. The multimodal model achieved a significantly higher F1-score (0.89) than the unimodal approaches (text-only: 0.76, visual-only: 0.72), demonstrating the effectiveness of integrating multiple data modalities. Key depression indicators identified include specific linguistic patterns (more negative emotion words and first-person singular pronouns) and visual markers (reduced facial expressivity and decreased eye contact). This research contributes to the emerging field of automated depression screening tools that could supplement clinical diagnostics, particularly in telehealth settings where in-person assessment is limited. Ethical considerations regarding privacy, bias, and appropriate implementation contexts are discussed.
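
To make the ideas in the abstract concrete, the following is a minimal Python sketch of how linguistic markers might be computed from a transcript, combined with visual features, and evaluated by F1-score. It is an illustration under stated assumptions, not the authors' pipeline: the word lists, the function names (text_features, fuse, f1_score), the use of simple feature concatenation as the fusion step, and the example visual values are all hypothetical, since the abstract does not specify how the modalities are combined.

```python
import re

# Hypothetical, deliberately tiny word lists for illustration only;
# they are NOT the lexicon used in the paper.
FIRST_PERSON_SINGULAR = {"i", "me", "my", "mine", "myself"}
NEGATIVE_EMOTION = {"sad", "tired", "hopeless", "worthless", "alone"}


def text_features(transcript: str) -> list[float]:
    """Return [first-person-singular rate, negative-emotion rate] per token."""
    tokens = re.findall(r"[a-z']+", transcript.lower())
    if not tokens:
        return [0.0, 0.0]
    n = len(tokens)
    fps_rate = sum(t in FIRST_PERSON_SINGULAR for t in tokens) / n
    neg_rate = sum(t in NEGATIVE_EMOTION for t in tokens) / n
    return [fps_rate, neg_rate]


def fuse(text_vec: list[float], visual_vec: list[float]) -> list[float]:
    """Feature-level fusion by concatenation (an assumed fusion strategy)."""
    return list(text_vec) + list(visual_vec)


def f1_score(y_true: list[int], y_pred: list[int]) -> float:
    """Binary F1 for the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    text_vec = text_features("I feel sad and I am tired")
    # Hypothetical visual features, e.g. expressivity and eye-contact scores.
    visual_vec = [0.3, 0.1]
    print(fuse(text_vec, visual_vec))
    print(f1_score([1, 1, 0, 0], [1, 0, 0, 1]))
```

In practice the fused vector would be passed to a classifier trained on labeled interviews. Concatenation is only one option; decision-level fusion, which combines the outputs of separate per-modality models, is a common alternative.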

Article Details

How to Cite
Singh, S., Raza, S., Srivastava, R. R., & Singh, S. (2025). Multimodal Depression Detection Using Textual and Visual Cues: A Machine Learning Approach with the DAIC-WOZ Dataset. International Journal of Recent Advances in Engineering and Technology, 14(2s), 99–105. Retrieved from https://journals.mriindia.com/index.php/ijraet/article/view/1444
Section
Articles
