MRI
MRI India Journals Vol. 15 No. 1S (2026): Special Issue on Cognition, Human and Artificial Intelligence

Evaluation of Artificial Intelligence in Answering Dermatological Medical Questions

Authors

  • Abdullah ALSarrajie Department of Computer Science, Faculty of Applied Sciences, Ibb University, Ibb, Yemen
  • Akram ALsubari Department of Computer Science, Faculty of Applied Sciences, Ibb University, Ibb, Yemen

DOI:

https://doi.org/10.65521/ijaece.v15i1S.1341

Keywords:

Arabic NLP Dermatology Question Answering Medical Question Answering Transformer Models Fine-tuning.

Abstract

This research presents a systematic evaluation of the adaptation of a large Arabic linguistic model (based on AraGPT2) to answer questions in the field of dermatology. The study aims to bridge the gap in specialized linguistic resources by fine-tuning the model to a newly created and purified dataset, collected using a hybrid methodology combining web scraping and filtered data enrichment. This dataset consists of 40,132 specialized question-answer pairs. The performance of the finely tuned model was quantitatively assessed using BERTScore, BLEU, Levenshtein distance, and two types of initial human evaluation. The quantitative results showed strong semantic performance, with the model achieving a BERTScore (F1) of 64.49%, confirming its ability to effectively understand medical context and meaning. In contrast, the verbal matching measures reflected a tendency toward free generation, scoring BLEU at 10.00% and Levenshtein distance at 28.13%. These results demonstrate that the model favors the free generation of new formulations over the verbatim retrieval of reference texts. Furthermore, the qualitative results of the proposed model showed a competitive overall performance of 4.01, achieving a high score of 4.55 in the criteria of linguistic clarity and readability for non-expert audiences. These results confirm that the model's primary contribution lies in its ability to enhance human comprehension (understanding) of complex medical data. The study also underscores the urgent need for subsequent clinical validation by field experts to ensure complete clinical accuracy and reliability.

Downloads

Published

2026-01-19

How to Cite

ALSarrajie , A., & ALsubari , A. (2026). Evaluation of Artificial Intelligence in Answering Dermatological Medical Questions . International Journal on Advanced Electrical and Computer Engineering, 15(1S), 24–41. https://doi.org/10.65521/ijaece.v15i1S.1341

Most read articles by the same author(s)

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.