The Industrial Internet o fThings(IoT) has revolutionized industrial operations by enabling real-time data collection and analysis through smart sensors and connected devices. This digital transformation is particularly crucial in the manufacturing industries, where predictive maintenance and efficiency optimization are essential. However, the massive volume of high frequency data collected by different sensors generated in IoT environments presents challenges related to data quality, dimensionality and computational efficiency. Addressing these challenges requires advanced data driven approaches, particularly in clustering and anomaly detection contexts. This thesis is inspired by the internship conducted at a leading company in the production of pumps, where clustering and deep learning techniques were applied to IoT data coming from a particular type of pumps. These smart pumps capture many operational parameters and thus generate a large amount of time-series and multivariate data, in particular, the dataset consists of millions of records and theretofore presented significant challenges due to missing or inconsistent values, sensor drift and data imbalance. The need to segment pump behaviors into distinct categories to extract meaningful insights suggest me to exploit different clustering algorithms. The second part of the thesis involves analyzing anomaly detection by exploring AT-DCAEP (Attention-based Dual-channel Autoencoder with External Prediction) model. This model is designed for multivariate time-series anomaly detection by combining spatial and temporalfeature learning. The results help to develop effective clustering and anomaly detection methodologies for IoT applications, which improve predictive maintenance strategies and operational efficiency in industrial settings.

The Industrial Internet o fThings(IoT) has revolutionized industrial operations by enabling real-time data collection and analysis through smart sensors and connected devices. This digital transformation is particularly crucial in the manufacturing industries, where predictive maintenance and efficiency optimization are essential. However, the massive volume of high frequency data collected by different sensors generated in IoT environments presents challenges related to data quality, dimensionality and computational efficiency. Addressing these challenges requires advanced data driven approaches, particularly in clustering and anomaly detection contexts. This thesis is inspired by the internship conducted at a leading company in the production of pumps, where clustering and deep learning techniques were applied to IoT data coming from a particular type of pumps. These smart pumps capture many operational parameters and thus generate a large amount of time-series and multivariate data, in particular, the dataset consists of millions of records and theretofore presented significant challenges due to missing or inconsistent values, sensor drift and data imbalance. The need to segment pump behaviors into distinct categories to extract meaningful insights suggest me to exploit different clustering algorithms. The second part of the thesis involves analyzing anomaly detection by exploring AT-DCAEP (Attention-based Dual-channel Autoencoder with External Prediction) model. This model is designed for multivariate time-series anomaly detection by combining spatial and temporalfeature learning. The results help to develop effective clustering and anomaly detection methodologies for IoT applications, which improve predictive maintenance strategies and operational efficiency in industrial settings.

Clustering and Anomaly Detection in Industrial IoT Data

COGO, GIULIO
2024/2025

Abstract

The Industrial Internet o fThings(IoT) has revolutionized industrial operations by enabling real-time data collection and analysis through smart sensors and connected devices. This digital transformation is particularly crucial in the manufacturing industries, where predictive maintenance and efficiency optimization are essential. However, the massive volume of high frequency data collected by different sensors generated in IoT environments presents challenges related to data quality, dimensionality and computational efficiency. Addressing these challenges requires advanced data driven approaches, particularly in clustering and anomaly detection contexts. This thesis is inspired by the internship conducted at a leading company in the production of pumps, where clustering and deep learning techniques were applied to IoT data coming from a particular type of pumps. These smart pumps capture many operational parameters and thus generate a large amount of time-series and multivariate data, in particular, the dataset consists of millions of records and theretofore presented significant challenges due to missing or inconsistent values, sensor drift and data imbalance. The need to segment pump behaviors into distinct categories to extract meaningful insights suggest me to exploit different clustering algorithms. The second part of the thesis involves analyzing anomaly detection by exploring AT-DCAEP (Attention-based Dual-channel Autoencoder with External Prediction) model. This model is designed for multivariate time-series anomaly detection by combining spatial and temporalfeature learning. The results help to develop effective clustering and anomaly detection methodologies for IoT applications, which improve predictive maintenance strategies and operational efficiency in industrial settings.
2024
Clustering and Anomaly Detection in Industrial IoT Data
The Industrial Internet o fThings(IoT) has revolutionized industrial operations by enabling real-time data collection and analysis through smart sensors and connected devices. This digital transformation is particularly crucial in the manufacturing industries, where predictive maintenance and efficiency optimization are essential. However, the massive volume of high frequency data collected by different sensors generated in IoT environments presents challenges related to data quality, dimensionality and computational efficiency. Addressing these challenges requires advanced data driven approaches, particularly in clustering and anomaly detection contexts. This thesis is inspired by the internship conducted at a leading company in the production of pumps, where clustering and deep learning techniques were applied to IoT data coming from a particular type of pumps. These smart pumps capture many operational parameters and thus generate a large amount of time-series and multivariate data, in particular, the dataset consists of millions of records and theretofore presented significant challenges due to missing or inconsistent values, sensor drift and data imbalance. The need to segment pump behaviors into distinct categories to extract meaningful insights suggest me to exploit different clustering algorithms. The second part of the thesis involves analyzing anomaly detection by exploring AT-DCAEP (Attention-based Dual-channel Autoencoder with External Prediction) model. This model is designed for multivariate time-series anomaly detection by combining spatial and temporalfeature learning. The results help to develop effective clustering and anomaly detection methodologies for IoT applications, which improve predictive maintenance strategies and operational efficiency in industrial settings.
Clustering
Anomaly detection
AT-DCAEP model
File in questo prodotto:
File Dimensione Formato  
Cogo_Giulio.pdf

Accesso riservato

Dimensione 12.17 MB
Formato Adobe PDF
12.17 MB Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/91825