Semantic Image Segmentation using Deep Learning Models

Main Article Content

Meeta B. Fadnavis

Abstract

Semantic image segmentation is a fundamental task in computer vision that involves assigning a class label to each pixel in an image, aiming to understand and interpret the visual content at a semantic level. Deep learning models, particularly convolutional neural networks (CNNs), have revolutionized this field, achieving significant advancements over traditional methods. This paper provides an overview of deep learning-based approaches for semantic image segmentation, highlighting key architectures such as Fully Convolutional Networks (FCNs), U-Net, DeepLab, and the use of advanced techniques like transformers and attention mechanisms. We explore various datasets and benchmark evaluation metrics that assess the performance of these models, particularly in domains such as medical image analysis, autonomous driving, and remote sensing. Additionally, we discuss the challenges faced in semantic segmentation, including class imbalance, high computational cost, and the need for large annotated datasets, and propose strategies for overcoming these issues. The integration of active learning techniques and semi-supervised learning in enhancing model performance with minimal labeled data is also covered. Finally, we present future directions, including the integration of multi-modal data and the development of more efficient, lightweight models for real-time applications.

Article Details

How to Cite
Fadnavis, M. B. (2024). Semantic Image Segmentation using Deep Learning Models. International Journal on Advanced Computer Theory and Engineering, 13(2), 48–53. Retrieved from https://journals.mriindia.com/index.php/ijacte/article/view/136
Section
Articles

Similar Articles

1 2 3 4 5 6 7 8 > >> 

You may also start an advanced similarity search for this article.