Self Healing Infrastructure System

R. V. Patil; Vrushali Kudande; Shweta Jagtap; Sakshi Jadhav; Aniket Jawalgekar

doi:10.65521/ijeecs.v14i1.190

Authors

R. V. Patil, Professor, PDEA’s College of Engineering, Pune.
Vrushali Kudande, Student, PDEA’s College of Engineering, Pune.
Shweta Jagtap, Student, PDEA’s College of Engineering, Pune.
Sakshi Jadhav, Student, PDEA’s College of Engineering, Pune.
Aniket Jawalgekar, Student, PDEA’s College of Engineering, Pune.

DOI:

https://doi.org/10.65521/ijeecs.v14i1.190

Keywords:

Real-Time Observability, DevOps Practices, ML-Driven Automation, Anomaly Detection, Automated Remediation

Abstract

Managing modern IT infrastructure has become increasingly complex with the adoption of containerized applications and distributed architectures. Traditional methods that rely on manual monitoring, fault detection, and recovery are no longer sufficient: they respond slowly, are error-prone, and prolong downtime. The demand for 24/7 system availability, combined with the unpredictability of failures, makes more resilient solutions urgent. Tools such as Prometheus offer real-time observability, anomaly detection, and system health monitoring, enabling quicker responses. However, these monitoring-centered approaches still lack automated remediation capabilities.

Integrating DevOps practices with ML-driven technologies addresses these gaps. Enhancing tools like Prometheus with self-healing capabilities allows infrastructure to autonomously detect anomalies, diagnose issues, and execute automated recovery workflows. This reduces downtime, optimizes resource use, and minimizes manual intervention. This research aims to develop a scalable, robust system that ensures business continuity, reliability, and resilience in dynamic, cloud-native environments, unlocking the potential of autonomous infrastructure management.
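
The detect-then-remediate loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the rolling z-score rule stands in for whatever ML model the system uses, and the names `ZScoreDetector`, `heal`, and `REMEDIATIONS`, along with the metric names and action labels, are hypothetical. The metric samples are assumed to have already been collected from a monitoring tool such as Prometheus.

```python
from collections import deque
from statistics import mean, pstdev

# Hypothetical mapping from a metric name to a recovery action label.
# A real system would trigger a workflow here instead of returning a string.
REMEDIATIONS = {
    "cpu_usage": "scale_out",
    "memory_usage": "restart_container",
}


class ZScoreDetector:
    """Flags a sample as anomalous when it deviates strongly from a recent window."""

    def __init__(self, window=30, threshold=3.0, min_samples=5):
        self.history = deque(maxlen=window)
        self.threshold = threshold
        self.min_samples = min_samples

    def observe(self, value):
        """Return True if `value` is anomalous relative to the recent baseline."""
        anomalous = False
        if len(self.history) >= self.min_samples:
            mu = mean(self.history)
            sigma = pstdev(self.history)
            if sigma > 0:
                anomalous = abs(value - mu) / sigma > self.threshold
            else:
                # A perfectly flat baseline: any change counts as a deviation.
                anomalous = value != mu
        # Only normal samples update the baseline, so a sustained fault
        # is not gradually learned as the new normal.
        if not anomalous:
            self.history.append(value)
        return anomalous


def heal(metric, samples, detector=None):
    """Scan samples in order and return (index, action) for each anomaly found."""
    if detector is None:
        detector = ZScoreDetector()
    actions = []
    for index, value in enumerate(samples):
        if detector.observe(value):
            # Unknown metrics fall back to notifying a human operator.
            actions.append((index, REMEDIATIONS.get(metric, "alert_operator")))
    return actions


if __name__ == "__main__":
    cpu = [50, 51, 49, 50, 52, 50, 51, 95, 50]
    print(heal("cpu_usage", cpu))
```

Keeping anomalous samples out of the baseline is a deliberate choice in this sketch: it trades slower adaptation to genuine load shifts for not masking an ongoing failure.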

