Real - time explainability for preditive maintenance
Date
2026-03-20
Embargo
Advisor
Coadvisor
Journal Title
Journal ISSN
Volume Title
Publisher
Language
Portuguese
Alternative Title
Abstract
A deteção de anomalias em ambientes industriais de tempo real constitui um desafio multidimensional que envolve restrições de latência, ausência de rótulos fiáveis, variabilidade operacional e necessidade de interpretabilidade das decisões. Esta dissertação propõe uma arquitetura híbrida para manutenção preditiva baseada num ensemble heterogéneo de modelos não supervisionados, combinando métodos treinados em regime batch sobre um baseline nominal com uma componente incremental adaptativa para operação em fluxo contínuo.
O sistema integra seis paradigmas complementares de deteção: Isolation Forest, Local Outlier Factor, One-Class SVM, K-Means, Predictive Lag-1 baseado em regressão Ridge e Half-Space Trees. Os indicadores produzidos são normalizados através de uma abordagem robusta baseada na mediana e no desvio absoluto mediano (MAD), posteriormente agregados por média aritmética simples e estabilizados por suavização temporal exponencial e confirmação por persistência consecutiva.
Os limiares de decisão são calibrados de forma adaptativa com base em quantis da distribuição empírica observada em regime nominal, garantindo alinhamento com taxas alvo de ativação operacional. A estratégia de validação temporal segue um paradigma prequencial adaptado, preservando a ordem cronológica dos dados e simulando condições realistas de operação em streaming.
Adicionalmente, é proposto um sistema de explicabilidade estruturado em três camadas hieráricas, que fornece desde justificação estatística imediata até análise detalhada dos contributos individuais dos modelos e geração de recomendações acionáveis. A validação experimental demonstra robustez, complementaridade entre detetores e estabilidade decisional, evidenciando a adequação da solução a cenários industriais de monitorização contínua.
Real-time anomaly detection in industrial environments presents multidimensional challenges involving latency constraints, lack of reliable labels, operational variability, and the need for decision interpretability. This dissertation proposes a hybrid architecture for predictive maintenance based on a heterogeneous ensemble of unsupervised models, combining batch-trained detectors built on a stable nominal baseline with an adaptive incremental component designed for streaming operation. The system integrates six complementary detection paradigms: Isolation Forest, Local Outlier Factor, One-Class SVM, K-Means, Predictive Lag-1 based on Ridge regression, and Half-Space Trees. Individual anomaly indicators are normalized using a robust median and Median Absolute Deviation (MAD) approach, aggregated through simple arithmetic averaging, and stabilized using exponential temporal smoothing combined with persistence-based confirmation logic. Decision thresholds are calibrated adaptively using empirical quantiles derived from nominal operating conditions, ensuring alignment with operational activation targets. The temporal validation strategy follows an adapted prequential framework, preserving chronological data order and simulating realistic streaming conditions. Furthermore, a three-layer explainability framework is introduced, providing progressive interpretative support ranging from statistical justification to model-level contribution analysis and actionable insights. Experimental validation confirms detector complementarity, robustness under extreme anomalies, and decision stability, supporting the suitability of the proposed approach for continuous industrial monitoring scenarios.
Real-time anomaly detection in industrial environments presents multidimensional challenges involving latency constraints, lack of reliable labels, operational variability, and the need for decision interpretability. This dissertation proposes a hybrid architecture for predictive maintenance based on a heterogeneous ensemble of unsupervised models, combining batch-trained detectors built on a stable nominal baseline with an adaptive incremental component designed for streaming operation. The system integrates six complementary detection paradigms: Isolation Forest, Local Outlier Factor, One-Class SVM, K-Means, Predictive Lag-1 based on Ridge regression, and Half-Space Trees. Individual anomaly indicators are normalized using a robust median and Median Absolute Deviation (MAD) approach, aggregated through simple arithmetic averaging, and stabilized using exponential temporal smoothing combined with persistence-based confirmation logic. Decision thresholds are calibrated adaptively using empirical quantiles derived from nominal operating conditions, ensuring alignment with operational activation targets. The temporal validation strategy follows an adapted prequential framework, preserving chronological data order and simulating realistic streaming conditions. Furthermore, a three-layer explainability framework is introduced, providing progressive interpretative support ranging from statistical justification to model-level contribution analysis and actionable insights. Experimental validation confirms detector complementarity, robustness under extreme anomalies, and decision stability, supporting the suitability of the proposed approach for continuous industrial monitoring scenarios.
Keywords
Deteção de anomalias, Manutenção preditiva, Data streams, Ensemble não supervisionado, Explicabilidade, Edge computing
Document Type
Master thesis
Publisher Version
Dataset
Citation
Sousa, D. C. P. G. (2026). Real - time explainability for preditive maintenance [Dissertação de Mestrado em Ciência de Dados, Universidade Portucalense]. Repositório Institucional UPT. http://hdl.handle.net/11328/https://hdl.handle.net/11328/7027
Identifiers
TID
Designation
Mestrado em Ciência de Dados
Access Type
Open Access