Unsupervised Learning Framework for Anomaly Detection in High-Dimensional Data Streams Using Clustering and Autoencoders

Jovencio Qureshi-Haq

Published: Oct 30, 2025

Keywords:

Unsupervised Learning, Anomaly Detection, High-Dimensional Data Streams, Autoencoders, Clustering, Deep Learning

Jovencio Qureshi-Haq

Senior Lecturer, Department of Electrical and Computer Engineering, Visayan Maritime Polytechnic University, Philippines

Abstract

The rapid growth of high-dimensional data streams generated by IoT systems, financial networks, cybersecurity infrastructures, healthcare monitoring platforms, and industrial sensor systems has made real-time anomaly detection increasingly important. Traditional supervised anomaly detection techniques often require large volumes of labeled data, which are difficult and expensive to obtain in dynamic environments. High-dimensional streaming data also introduces challenges related to feature complexity, noise, scalability, and evolving data distributions. Unsupervised learning approaches have therefore emerged as effective solutions for detecting anomalous patterns without labeled training datasets. This research proposes an unsupervised learning framework for anomaly detection in high-dimensional data streams using clustering and autoencoder-based deep learning techniques. The proposed framework integrates feature extraction, dimensionality reduction, distributed clustering, and deep autoencoder reconstruction to identify abnormal patterns and rare events in continuously evolving data streams. Clustering algorithms group normal behavioral patterns, while autoencoders learn compressed latent representations and flag anomalies through reconstruction error analysis. The framework supports real-time analytical processing, adaptive learning, and scalable anomaly detection in heterogeneous streaming environments. Experimental evaluation demonstrates that the proposed framework significantly improves anomaly detection accuracy, reduces false positives, and offers better scalability and computational efficiency than traditional statistical and distance-based anomaly detection approaches. Furthermore, the integration of clustering and deep autoencoder architectures enhances the framework’s capability to identify subtle and previously unseen anomalies in high-dimensional feature spaces.
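
The scoring idea described in the abstract (cluster the normal behavior, and flag points that an autoencoder reconstructs poorly) can be illustrated with a minimal sketch. It is not the paper's implementation, and it makes several assumptions: a closed-form linear autoencoder (equivalent to PCA) stands in for the deep autoencoder, a plain k-means with farthest-first initialisation stands in for the distributed clustering stage, the data are a synthetic batch rather than a stream, and thresholds are taken as the 99th percentile of the training scores. All function names (`fit_detector`, `detect`, and so on) are hypothetical.

```python
import numpy as np

def fit_linear_autoencoder(X, latent_dim):
    """Closed-form linear autoencoder (equivalent to PCA); assumed stand-in for a deep one."""
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:latent_dim]  # rows of vt span the latent subspace

def encode(X, mean, components):
    return (X - mean) @ components.T

def reconstruction_error(X, mean, components):
    X_hat = encode(X, mean, components) @ components + mean
    return np.sum((X - X_hat) ** 2, axis=1)

def pairwise_distances(Z, centers):
    return np.linalg.norm(Z[:, None, :] - centers[None, :, :], axis=2)

def fit_kmeans(Z, k, iters=25):
    """Lloyd's k-means with deterministic farthest-first initialisation."""
    centers = Z[:1].copy()
    for _ in range(k - 1):
        nearest = pairwise_distances(Z, centers).min(axis=1)
        centers = np.vstack([centers, Z[nearest.argmax()]])
    for _ in range(iters):
        labels = pairwise_distances(Z, centers).argmin(axis=1)
        for j in range(k):
            members = Z[labels == j]
            if len(members) > 0:
                centers[j] = members.mean(axis=0)
    return centers

def fit_detector(X_train, latent_dim=2, k=2, quantile=0.99):
    """Learn the latent space, the clusters, and one threshold per score."""
    mean, components = fit_linear_autoencoder(X_train, latent_dim)
    Z = encode(X_train, mean, components)
    centers = fit_kmeans(Z, k)
    err = reconstruction_error(X_train, mean, components)
    dist = pairwise_distances(Z, centers).min(axis=1)
    return {
        "mean": mean, "components": components, "centers": centers,
        "recon_threshold": np.quantile(err, quantile),
        "dist_threshold": np.quantile(dist, quantile),
    }

def detect(model, X):
    """Flag a point if either score exceeds its training-derived threshold."""
    err = reconstruction_error(X, model["mean"], model["components"])
    Z = encode(X, model["mean"], model["components"])
    dist = pairwise_distances(Z, model["centers"]).min(axis=1)
    return (err > model["recon_threshold"]) | (dist > model["dist_threshold"])

# Synthetic "normal" data: two clusters on a 2-D subspace of a 10-D space.
rng = np.random.default_rng(42)
basis = rng.normal(size=(2, 10))
latent = np.vstack([
    rng.normal(loc=(-5.0, 0.0), scale=0.5, size=(200, 2)),
    rng.normal(loc=(5.0, 0.0), scale=0.5, size=(200, 2)),
])
X_train = latent @ basis + 0.05 * rng.normal(size=(400, 10))
model = fit_detector(X_train)

X_test = np.vstack([
    np.array([5.0, 0.0]) @ basis,                             # typical point
    np.array([5.0, 0.0]) @ basis + 3.0 * rng.normal(size=10), # off the learned subspace
    np.array([0.0, 20.0]) @ basis,                            # on the subspace, far from both clusters
])
flags = detect(model, X_test)
print(flags)
```

The two scores are complementary in this sketch: the reconstruction error catches points that leave the learned low-dimensional structure, while the distance to the nearest centroid catches points that stay on that structure but fall outside every cluster of normal behavior. A streaming variant would need to update the model and thresholds incrementally, which this batch example does not attempt.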

How to Cite

Qureshi-Haq, J. (2025). Unsupervised Learning Framework for Anomaly Detection in High-Dimensional Data Streams Using Clustering and Autoencoders. International Journal on Advanced Computer Theory and Engineering, 14(2), 259–267. Retrieved from https://journals.mriindia.com/index.php/ijacte/article/view/2718

Issue

Vol. 14 No. 2 (2025)

Section

Articles
