20YY
E. Marcelli, T. Barbariol, G.A. Susto.
Active Learning-based Isolation Forest (ALIF): Enhancing Anomaly Detection in Decision Support Systems. 20YY [
url] [
BibTeX]
2023
D. Dandolo, C. Masiero, M. Carletti, D. Dalle Pezze, G.A. Susto.
AcME - Accelerated Model-agnostic Explanations: Fast Whitening of the Machine-Learning Black Box. Expert Systems with Applications, vol. 214, 2023 [
url] [
BibTeX]
M. Carletti, M. Terzi, G.A. Susto.
Interpretable Anomaly Detection with DIFFI: Depth-based Feature Importance for the Isolation Forest. Engineering Applications of Artificial Intelligence, vol. 119, 2023
Abstract:
Anomaly Detection is an unsupervised learning task aimed at detecting anomalous behaviors with respect to historical data. In particular, multivariate Anomaly Detection has an important role in many applications thanks to the capability of summarizing the status of a complex system or observed phenomenon with a single indicator (typically called ‘anomaly score’) and thanks to the unsupervised nature of the task that does not require human tagging. The Isolation Forest is one of the most commonly adopted algorithms in the field of Anomaly Detection due to its proven effectiveness and low computational complexity. A major problem affecting Isolation Forest is represented by the lack of interpretability, an effect of the inherent randomness governing the splits performed by the Isolation Trees, the building blocks of the Isolation Forest. In this paper, we propose effective yet computationally inexpensive methods to define feature importance scores at both global and local levels for the Isolation Forest. Moreover, we define a procedure to perform unsupervised feature selection for Anomaly Detection problems based on our interpretability method. Such a procedure also serves the purpose of tackling the challenging task of feature importance evaluation in unsupervised anomaly detection. We assess the performance on several synthetic and real-world datasets, including comparisons against state-of-the-art interpretability techniques, and make the code publicly available to enhance reproducibility and foster research in the field.
[ abstract ] [
url] [
BibTeX]
F. Zocco, M. Maggipinto, G.A. Susto, S. McLoone.
Lazy FSCA for Unsupervised Variable Selection. Engineering Applications of Artificial Intelligence, vol. 124, 2023
Abstract:
Dimensionality reduction is a important step in the development of scalable and interpretable data-driven models, especially when there are a large number of candidate variables. This paper focuses on unsupervised variable selection based dimensionality reduction, and in particular on unsupervised greedy selection methods, which have been proposed by various researchers as computationally tractable approximations to optimal subset selection. These methods are largely distinguished from each other by the selection criterion adopted, which include squared correlation, variance explained, mutual information and frame potential. Motivated by the absence in the literature of a systematic comparison of these different methods, we present a critical evaluation of seven unsupervised greedy variable selection algorithms considering both simulated and real world case studies. We also review the theoretical results that provide performance guarantees and enable efficient implementations for certain classes of greedy selection function, related to the concept of submodularity. Furthermore, we introduce and evaluate for the first time, a lazy implementation of the variance explained based forward selection component analysis (FSCA) algorithm. Our experimental results show that: (1) variance explained and mutual information based selection methods yield smaller approximation errors than frame potential; (2) the lazy FSCA implementation has similar performance to FSCA, while being an order of magnitude faster to compute, making it the algorithm of choice for unsupervised variable selection.
[ abstract ] [
url] [
BibTeX]
L. Cristaldi, P. Esmaili, G. Gruosso, A. La Bella, M. Mecella, R. Scattolini, A. Arman, G.A. Susto, L. Tanca.
The MICS Project. A Data Science Pipeline for Industry 4.0 Applications. 2023 IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence, and Neural Engineering (IEEE MetroXRAINE 2023), 2023 [
BibTeX]
2022
M. Maggipinto, A. Beghi, G.A. Susto.
A Deep Convolutional Autoencoder-based Approach for Anomaly Detection with Industrial, Non-images, 2-Dimensional data: a Semiconductor Manufacturing case study. IEEE Transactions on Automation Science and Engineering, vol. 19(3), pp. 1477-1490, 2022 [
url] [
BibTeX]
D. Dalle Pezze, D. Deronjic, C. Masiero, D. Tosato, A. Beghi, G.A. Susto.
A Multi-label Continual Learning Framework to Scale Deep Learning Approaches for Packaging Equipment Monitoring. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML-PKDD), 2022 [
url] [
BibTeX]
T. Barbariol, F. Dalla Chiara, D. Marcato, G.A. Susto.
A review of Tree-based approaches for Anomaly Detection. Control Charts and Machine Learning for Anomaly Detection in Manufacturing, 2022
Abstract:
Data-driven Anomaly Detection approaches have received increasing attention in many other application areas in the past fewyears as a tool for monitoring of complex systems in addition to classical univariatecontrol charts. Tree-based approaches have proven to be particularly effective when dealing with high-dimensional Anomaly Detection problems and with underlying non-gaussian data distributions. The most popular approach in this family is the Isolation Forest, which is currently one of the mostpopular choices for scientists and practitioners when dealing with Anomaly Detection tasks. The Isolation Forest represents a seminal algorithm upon which many extended approaches have been presented in the past years aiming at improving the original method or at dealing with peculiar applicationscenarios. In this work, we revise some of the most popular and powerful Tree-based approaches to Anomaly Detection (extensions of the Isolation Forest and other approaches), considering both batch and streaming datascenarios. This work will review several relevant aspects of the methods, like computational costs and interpretability traits. To help practitioners, we also report available relevant libraries and open implementations and we review real-world industrial applications of the considered approaches.
[ abstract ] [
BibTeX]
E. Marcelli, T. Barbariol, V. Savarino, A. Beghi, G.A. Susto.
A Revised Isolation Forest procedure for Anomaly Detection with High Number of Data Points. 23rd IEEE Latin-American Test Symposium (LATS2022), 2022 [
BibTeX]
F. Simmini, M. Rampazzo, F. Peterle, G.A. Susto, A. Beghi.
A Self-Tuning KPCA-based Approach to Fault Detection in Chiller Systems. IEEE Transactions on Control Systems Technology, vol. 30(4), 2022 [
BibTeX]
A. Fabris, S. Messina, G. Silvello, G.A. Susto.
Algorithmic Fairness Datasets: the Story so Far. Data Mining and Knowledge Discovery, 2022 [
url] [
BibTeX]
H.T. Jebril, M. Pleschberger, G.A. Susto.
An Autoencoder-based Approach for Fault Detection in Multi-stage Manufacturing: a Sputter Deposition and Rapid Thermal Processing case study. IEEE Transactions on Semiconductor Manufacturing, 2022 [
BibTeX]
L.C. Brito, G.A. Susto, J.N. Brito, M.A.V. Duarte.
An Explainable Artificial Intelligence Approach for Unsupervised Fault Detection and Diagnosis in Rotating Machinery. Mechanical Systems and Signal Processing, vol. 163, 2022
Abstract:
The monitoring of rotating machinery is an essential task in today's production processes. Currently, several machine learning and deep learning-based modules have achieved excellent results in fault detection and diagnosis. Nevertheless, to further increase user adoption and diffusion of such technologies, users and human experts must be provided with explanations and insights by the modules. Another issue is related, in most cases, with the unavailability of labeled historical data that makes the use of supervised models unfeasible. Therefore, a new approach for fault detection and diagnosis in rotating machinery is here proposed. The methodology consists of three parts: feature extraction, fault detection and fault diagnosis. In the first part, the vibration features in the time and frequency domains are extracted. Secondly, in the fault detection, the presence of fault is verified in an unsupervised manner based on anomaly detection algorithms. The modularity of the methodology allows different algorithms to be implemented. Finally, in fault diagnosis, Shapley Additive Explanations (SHAP), a technique to interpret black-box models, is used. Through the feature importance ranking obtained by the model explainability, the fault diagnosis is performed. Two tools for diagnosis are proposed, namely: unsupervised classification and root cause analysis. The effectiveness of the proposed approach is shown on three datasets containing different mechanical faults in rotating machinery. The study also presents a comparison between models used in machine learning explainability: SHAP and Local Depth-based Feature Importance for the Isolation Forest (Local- DIFFI). Lastly, an analysis of several state-of-art anomaly detection algorithms in rotating machinery is included.
[ abstract ] [
url] [
BibTeX]
E. Anello, M. Chiara, F. Ferro, F. Ferrari, B. Mukaj, A. Beghi, G.A. Susto.
Anomaly Detection for the Industrial Internet of Things: an Unsupervised Approach for Fast Root Cause Analysis. IEEE Conference on Control Technology and Applications (CCTA), 2022 [
BibTeX]
L. Lorenti, G. De Rossi, A. Annoni, S. Rigutto, G.A. Susto.
CUAD-Mo: Continuos Unsupervised Anomaly Detection on Machining Operations. IEEE Conference on Control Technology and Applications (CCTA), 2022 [
BibTeX]
D. Dalle Pezze, C. Masiero, D. Tosato, A. Beghi, G.A. Susto.
FORMULA: A Deep Learning Approach for Rare Alarms Predictions in Industrial Equipment. IEEE Transactions on Automation Science and Engineering, vol. 19(3), pp. 1491--1502, 2022 [
url] [
BibTeX]
G.A. Susto, A. Diebold, A. Kyek, C. Lee, N. Patel.
Guest Editorial: Process-Level Machine Learning Applications in Semiconductor Manufacturing. IEEE Transactions on Semiconductor Manufacturing, 2022 [
BibTeX]
M. Maggipinto, M. Terzi, G.A. Susto.
IntroVAC: Introspective Variational Classifiers for Learning Interpretable Latent Subspaces. Engineering Applications of Artificial Intelligence, vol. 109, 2022
Abstract:
Learning useful representations of complex data has been the subject of extensive research for many years. With the diffusion of Deep Neural Networks, Variational Autoencoders have gained lots of attention since they provide an explicit model of the data distribution based on an encoder/decoder architecture which is able to both generate images and encode them in a low-dimensional subspace. However, the latent space is not easily interpretable and the generation capabilities show some limitations since images typically look blurry and lack details. In this paper, we propose the Introspective Variational Classifier (IntroVAC), a model that learns interpretable latent subspaces by exploiting information from an additional label and provides improved image quality thanks to an adversarial training strategy.We show that IntroVAC is able to learn meaningful directions in the latent space enabling fine-grained manipulation of image attributes. We validate our approach on the CelebA dataset.
[ abstract ] [
url] [
BibTeX]
A. Purpura, G. Silvello, G.A. Susto.
Learning to Rank from Relevance Judgments Distributions. Journal of the Association for Information Science and Technology, 2022
Abstract:
LEarning TO Rank (LETOR) algorithms are usually trained on annotated corpora where a single relevance label is assigned to each available document-topic pair. Within the Cranfield framework, relevance labels result from merging either multiple expertly curated or crowdsourced human assessments. In this paper, we explore how to train LETOR models with relevance judgments distributions (either real or synthetically generated) assigned to document-topic pairs instead of single-valued relevance labels. We propose five new probabilistic loss functions to deal with the higher expressive power provided by relevance judgments distributions and show how they can be applied both to neural and gradient boosting machine (GBM) architectures. Moreover, we show how training a LETOR model on a sampled version of the relevance judgments from certain probability distributions can improve its performance when relying either on traditional or probabilistic loss functions. Finally, we validate our hypothesis on real-world crowdsourced relevance judgments distributions. Overall, we observe that relying on relevance judgments distributions to train different LETOR models can boost their performance and even outperform strong baselines such as LambdaMART on several test collections.
[ abstract ] [
BibTeX]
L.C. Brito, G.A. Susto, J.N. Brito, M.A.V. Duarte.
Mechanical faults in rotating machinery dataset (normal, unbalance, misalignment, looseness). Mendeley Data, 2022 [
url] [
BibTeX]
T. Barbariol, G.A. Susto.
TiWS-iForest: Isolation Forest in Weakly Supervised and Tiny ML scenarios. Information Sciences, vol. 610, pp. 126-143, 2022 [
url] [
BibTeX]
2021
M. Berno, M. Canil, N. Chiarello, L. Piazzon, F. Berti, F. Ferrari, A. Zaupa, N. Ferro, M. Rossi, G.A. Susto.
A Machine Learning-based Approach for Advanced Monitoring of Automated Equipment for the Entertainment Industry. International Workshop on Metrology for Industry 4.0 & IoT, 2021 [
BibTeX]
S. Tedesco, G.A. Susto, N. Gentner, A. Kyek, Y. Yang.
A Scalable Deep Learning-based Approach for Anomaly Detection in Semiconductor Manufacturing. Winter Simulation Conference, 2021
Abstract:
The diffusion of the Industry 4.0 paradigm lead to the creation and collection of huge manufacturing datasets; such datasets contain for example measurements coming from physical sensors located in different equipment or even in different productive manufacturing organizations. Such large and heterogeneous datasets represent a challenge when aiming for developing data-driven approaches like Anomaly Detection or Predictive Maintenance. In this work we present a new approach for performing Anomaly Detection that is able to handle heterogeneous data coming from different equipment, work centers or production sites.
[ abstract ] [
BibTeX]
M. Terzi, A. Achille, M. Maggipinto, G.A. Susto.
Adversarial Training Reduces Information and Improves Transferability. 35th AAAI Conference on Artificial Intelligence, (arXiv:2007.11259), 2021
Abstract:
Recent results show that features of adversarially trained networks for classification, in addition to being robust, enable desirable properties such as invertibility. The latter property may seem counter-intuitive as it is widely accepted by the community that classification models should only capture the minimal information (features) required for the task. Motivated by this discrepancy, we investigate the dual relationship between Adversarial Training and Information Theory. We show that the Adversarial Training can improve linear transferability to new tasks, from which arises a new trade-off between transferability of representations and accuracy on the source task. We validate our results employing robust networks trained on CIFAR-10, CIFAR-100 and ImageNet on several datasets. Moreover, we show that Adversarial Training reduces Fisher information of representations about the input and of the weights about the task, and we provide a theoretical argument which explains the invertibility of deterministic networks without violating the principle of minimality. Finally, we leverage our theoretical insights to remarkably improve the quality of reconstructed images through inversion.
[ abstract ] [
url] [
BibTeX]
A. Fabris, A. Mishler, S. Gottardi, M. Carletti, M. Daicampi, G.A. Susto, G. Silvello.
Algorithmic Audit of Italian Car Insurance: Evidence of Unfairness in Access and Pricing. Fourth AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society (AIES), 2021 [
BibTeX]
N. Bargellesi, A. Beghi, M. Rampazzo, G.A. Susto.
AutoSS: A Deep Learning-Based Soft Sensor for Handling Time-Series Input Data. IEEE Robotics and Automation Letters, vol. 6(3), pp. 6100--6107, 2021
Abstract:
Soft Sensors are data-driven technologies that allow to have estimations of quantities that are impossible or costly to be measured. Unfortunately, the design of effective soft sensors is heavily impacted by time-consuming feature engineering steps that may lead to sub-optimal information, especially when dealing with time-series input data. While domain knowledge may come into help when handling feature extraction in soft sensing applications, the feature extraction typically limits the adoption of such technologies: in this work, we propose AutoSS, a Deep-Learning based approach that allows to overcome such issue. By exploiting autoencoders, dilated convolutions and an ad-hoc defined architecture, AutoSS allows to develop effective soft sensing modules even with time-series input data. The effectiveness of AutoSS is demonstrated on a real-world case study related to Internet of Things equipment.
[ abstract ] [
url] [
BibTeX]
N. Gentner, M. Carletti, A. Kyek, G.A. Susto, Y. Yang.
DBAM: Making Virtual Metrology/Soft Sensing with Time Series Data Scalable Through Deep Learning. Control Engineering Practice, vol. 116, 2021
Abstract:
Machine Learning (ML) based technologies, like Virtual Metrology (VM)/Soft Sensing, Predictive Maintenance and Fault Detection, have been successfully applied in the past recent years in data intensive manufacturing industries, like semiconductor manufacturing, to improve process monitoring and related operations. Standardization and alignment over multiple equipment is a key element to ensure industry-wide adoption and scalability for ML-based technologies in complex production environment. In this work we address the topic of VM/Soft Sensing – a particular ML-based technology for process control – in the context of equipment matching and scalability. We present a Deep Learning-based domain adaptation approach, called DANN-Based Model Alignment (DBAM), that provides a common VM model for two identical-in-design systems whose data are following different distributions. The proposed approach has the merit of (i) exploiting directly raw sensor data (that typically present themselves in the form of time series) and (ii) offering interpretability of the features. The proposed approach is compared against other approaches in the literature for VM/Soft Sensing on a real-world case study from semiconductor manufacturing.
[ abstract ] [
BibTeX]
S. Arena, Y. Budrov, M. Carletti, N. Gentner, M. Maggipinto, Y. Yang, A. Beghi, A. Kyek, G.A. Susto.
Exploiting 2D Coordinates as Bayesian Priors for Deep Learning Defect Classification of SEM Images. IEEE Transactions on Semiconductor Manufacturing, 2021
Abstract:
Deep Learning approaches have revolutionized in the past decade the field of Computer Vision and, as a consequence, they are having a major impact in Industry 4.0 applications like automatic defect classification. Nevertheless, additional data, beside the image/video itself, is typically never exploited in a defect classification module: this aspect, given the abundance of data in data-intensive manufacturing environments (like semiconductor manufacturing) represents a missed opportunity. In this work we present a use case related to Scanning Electron Microscope (SEM) images where we exploit a Bayesian approach to improve defect classification. We validate our approach on a real-world case study and by employing modern Deep Learning architectures for classification.
[ abstract ] [
BibTeX]
L.C. Brito, G.A. Susto, J.N. Brito, M.A.V. Duarte.
Fault Detection of Bearing: an Unsupervised Machine Learning Approach Exploiting Feature Extraction and Dimensionality Reduction. Informatics, 2021 [
BibTeX]
R. Fantinel, A. Cenedese, G. Fadel.
Hybrid Learning Driven by Dynamic Descriptors for Video Classification of Reflective Surfaces. IEEE Transactions on Industrial Informatics, vol. 17(12), pp. 8102--8111, 2021
Abstract:
Visual inspection has recently gained increasing importance in the manufacturing industry and is often addressed by means of learning methodologies applied to data obtained from specific lighting and camera system setups. The industrial scenario becomes particularly challenging when the inspection regards reflective objects, which may affect both the data acquisition and the classification decision process, thus limiting the overall performance. In this context, we observe that the dynamics of the reflected light is the key aspect to characterize these surfaces and needs to be accurately exploited to improve the performances of the learning algorithms. To this aim, we propose a combined model-based and data-driven approach designed to detect defects on the reflective surfaces of industrial products, captured as video sequences under coaxial structured illumination. Specifically, a tunable spatial-temporal descriptor of the evolution of the reflected light (Dynamic Evolution of the Light, DEL) is designed and employed within a Hybrid Learning (HL) framework, where the learning process of a Convolutional Neural Network (CNN) is driven by the model-based descriptor. This approach is also extended by adopting the similar in nature descriptor Dynamic Image. The proposed HL solutions are validated against a whole spectrum of state-of-the-art learning procedures and different descriptors. Experiments run on a dataset coming from an actual industrial scenario confirm the ability of DEL to accurately characterize reflective surfaces and the validity of the HL method, which shows remarkably better performance in fault detection even with respect to modern 3D- CNNs with comparable computational effort.
[ abstract ] [
url] [
BibTeX]
M. Viola, L. Brunelli, G.A. Susto.
Instagram Images and Videos Popularity Prediction: a Deep Learning-Based Approach. Italian Workshop on Artificial Intelligence and Applications for Business and Industries, 2021 [
BibTeX]
D. Marcato, G. Arena, D. Bortolato, F. Gelain, V. Martinelli, E. Munaron, M. Roetta, G. Savarese, G.A. Susto.
Machine Learning-based Anomaly Detection for Particle Accelerators. 5th IEEE Conference on Control Technology and Applications (CCTA), 2021
Abstract:
Particle accelerators are complex systems composed of multiple subsystems that must work together to produce high quality beams employed for physics experiments. A fault or an anomalous behaviour in one of such subsystems can lead to expensive downtime for the whole facility. Thus, it is of paramount importance to be able to promptly detect anomalies.Given the vast amount of streaming data generated by accelerator field sensors, Machine Learning (ML)-based tools are promising candidates for efficient monitoring of such systems: an approach based on unsupervised ML techniques exploiting the data from a Radio Frequency tuning system is here proposed. Feature importance is exploited to guide the definition of the optimal windowing for feature extraction. The proposed approach is here validated on real-world data related to the ALPI accelerator at Legnaro National Laboratories in Italy.
[ abstract ] [
BibTeX]
A. Purpura, K. Buchner, G. Silvello, G.A. Susto.
Neural Feature Selection for Learning to Rank. Proceedings of the European Conference on Information Retrieval, 2021 [
BibTeX]
L. Mancin, I. Rollo, J.F. Mota, F. Piccini, M. Carletti, G.A. Susto, G. Valle, A. Paoli.
Optimizing Microbiota Profiles for Athletes: Dream or Reality?. Exercise and sport sciences reviews, vol. 49(1), pp. 42--49, 2021
Abstract:
Gut microbiome influences athletes’ physiology but, due to the complexity of sport performance and the great inter-variability of microbiome features, it is not reasonable to define a single healthy microbiota profile for athletes. We suggest the use of specific meta-omics analysis coupled with innovative computational systems to uncover the hidden relationship between microbes and athlete’s physiology and predicting personalized recommendation.
[ abstract ] [
url] [
BibTeX]
L. Frau, G.A. Susto, T. Barbariol, E. Feltresi.
Uncertainty estimation for Machine Learning models in Multiphase flow Applications. Informatics, vol. 8(3), 2021
Abstract:
In oil and gas production, it is essential to monitor some performance indicators that are related to the composition of the extracted mixture, such as the liquid and gas content of the flow. These indicators cannot be directly measured and must be inferred with other measurements by using soft sensor approaches that model the target quantity. For the purpose of production monitoring, point estimation alone is not enough, and a confidence interval is required in order to assess the uncertainty in the provided measure. Decisions based on these estimations can have a large impact on production costs; therefore, providing a quantification of uncertainty can help operators make the most correct choices. This paper focuses on the estimation of the performance indicator called the water-in-liquid ratio by using data-driven tools: firstly, anomaly detection techniques are employed to find data that can alter the performance of the subsequent model; then, different machine learning models, such as Gaussian processes, random forests, linear local forests, and neural networks, are tested and employed to perform uncertainty-aware predictions on data coming from an industrial tool, the multiphase flow meter, which collects multiple signals from the flow mixture. The reported results show the differences between the discussed approaches and the advantages of the uncertainty estimation; in particular, they show that methods such as the Gaussian process and linear local forest are capable of reaching competitive performance in terms of both RMSE (1.9–2.1) and estimated uncertainty (1.6–2.6).
[ abstract ] [
BibTeX]
2020
T. Barbariol, E. Feltresi, G.A. Susto.
A Machine Learning-based System for Self-diagnosis Multiphase Flow Meters. International Petroleum Technology Conference, 2020 [
BibTeX]
J.A. Mat Jizat, I.M. Khairuddin, A. Razman, A.F. Nasir, M.S.A. Karim, A.A. Jaafar, L. Wei Hong, A. Abdul Majeed, H. Myung, H. Choi, G.A. Susto.
Advances in Robotics, Automation and Data Analytics. Selected Papers from iCITES 2020. 2020
Abstract:
This book presents essentially a collection of proceedings that deliberate on the key challenges and recent trends on robotics, automation and data analytics which are the pillars of Industry 4.0. Solutions that are employed in the multitude spectra of innovative robotics & automation and data analytics are discussed. The readers are expected to gain an insightful view on the current trends, issues, mitigating factors as well as solutions from the book. This book consists of selected papers presented at the 2nd International Conference on Innovative Technology, Engineering and Sciences 2020 (iCITES) hosted virtually by Universiti Malaysia Pahang on 22nd December 2020. iCITES is a biennial conference, aimed at building a platform that allows relevant stakeholders to share and discuss their latest researches, ideas and survey reports from theoretical to a practical standpoint especially in the Innovative Robotics & Automation and Data Analytics tracks which was published in this book.
[ abstract ] [
url] [
BibTeX]
D. Tosato, D. Dalle Pezze, C. Masiero, G.A. Susto, A. Beghi.
Alarm Logs in Packaging Industry (ALPI). IEEEDataPort, 2020
Abstract:
The advent of the Industrial Internet of Things (IIoT) has led to the availability of huge amounts of data, that can be used to train advanced Machine Learning algorithms to perform tasks such as Anomaly Detection, Fault Classification and Predictive Maintenance. Even though not all pieces of equipment are equipped with sensors yet, usually most of them are already capable of logging warnings and alarms occurring during operation. Turning this data, which is easy to collect, into meaningful information about the health state of machinery can have a disruptive impact on the improvement of efficiency and up-time. The provided dataset consists of a sequence of alarms logged by packaging equipment in an industrial environment. The collection includes data logged by 20 machines, deployed in different plants around the world, from 2019-02-21 to 2020-06-17. There are 154 distinct alarm codes, whose distribution is highly unbalanced. This data can be used to address the following tasks:
- Next alarm forecasting: this problem can be framed as a supervised multi-class classification task, or a binary classification task when a specific alarm code is considered.
- Predicting alarms occurring in a future time frame: here the goal is to forecast the occurrence of certain alarm types in a future time window. Since many alarms can occur, this is a supervised multi-label classification.
- Future alarm sequence prediction: here the goal is predicting an ordered sequence of future alarms, in a sequence-to-sequence forecasting scenario.
- Anomaly Detection: the task is to detect abnormal equipment conditions, based on the pattern of alarms sequence. This task can be either unsupervised, if only the input sequence is considered, or supervised if future alarms are taken into account to assess whether or not there is an anomaly.
All of the above tasks can also be studied from a continual learning perspective. Indeed, information about the serial code of the specific piece of equipment can be used to train the model; however, a scalable model should also be easy to apply to new machines, without the need of a new training from scratch. The collection and release of this dataset has been supported by the Regione Veneto project PreMANI (MANIFATTURA PREDITTIVA: progettazione, sviluppo e implementazione di soluzioni di Digital Manufacturing per la previsione della Qualita e la Manutenzione Intelligente - PREDICTIVE MAINTENANCE: design, development and implementation of Digital Manufacturing solutions for the intelligent quality and maintenance systems).
[ abstract ] [
url] [
BibTeX]
M. Maggipinto, M. Terzi, G.A. Susto.
Beta-Variational Classifiers Under Attack. IFAC World Congress, 2020 [
BibTeX]
L. Meneghetti, M. Terzi, S. Del Favero, G.A. Susto, C. Cobelli.
Data-Driven Anomaly Recognition for Unsupervised Model-Free Fault Detection in Artificial Pancreas. IEEE Transactions on Control Systems Technology, vol. 28(1), pp. 33-47, 2020
Abstract:
The last decade has seen tremendous improvements in technologies for Type 1 Diabetes (T1D) management, in particular the so-called artificial pancreas (AP), a wearable closed-loop device modulating insulin injection based on glucose sensor readings. Unluckily, the AP actuator, an insulin pump, is subject to failures, with potentially serious consequences for subject safety. This calls for the development of advanced monitoring systems, leveraging the unprecedented data availability. This paper tackles for the first time the problem of automatically detecting pump faults with multidimensional data-driven anomaly detection (AD) methodologies. The approach allows to avoid the subtask of identifying a physiological model, typical of model-based approaches. Furthermore, we employ unsupervised methods, removing the need of labeled data for training, hardly available in practice. The adopted data-driven AD methods are local outlier factor, connectivity-based outlier factor, and isolation forest. Moreover, we propose a modification of these methods to cope with the dynamic nature of the underlying problem. The algorithms were tuned and tested on: 1) two-synthetic 100-patients' data set, of one-month data each, generated using the ``UVA/Padova T1D Simulator,'' a large-scale nonlinear computer simulator of T1D subject physiology, largely adopted in AP research and accepted by the American Food and Drug Administration as a replacement of preclinical animal trials for AP and 2) a real 7-patients' data set consisting of one month in free-living conditions. The satisfactory accuracy of the proposed approach paves the way to the embedding of these methodologies in AP systems or their deployment in remote monitoring systems.
[ abstract ] [
url] [
BibTeX]
M. Terzi, G.A. Susto, P. Chaudhari.
Directional Adversarial Training for Cost Sensitive Deep Learning Classification Applications. Engineering Applications of Artificial Intelligence, vol. 91, 2020
Abstract:
In many real-world applications of Machine Learning it is of paramount importance not only to provide accurate predictions, but also to ensure certain levels of robustness. Adversarial Training is a training procedure aiming at providing models that are robust to worst-case perturbations around predefined points. Unfortunately, one of the main issues in adversarial training is that robustness w.r.t. gradient-based attackers is always achieved at the cost of prediction accuracy. In this paper, a new algorithm, called Wasserstein Projected Gradient Descent (WPGD), for adversarial training is proposed. WPGD provides a simple way to obtain cost-sensitive robustness, resulting in a finer control of the robustness-accuracy trade-off. Moreover, WPGD solves an optimal transport problem on the output space of the network and it can efficiently discover directions where robustness is required, allowing to control the directional trade-off between accuracy and robustness. The proposed WPGD is validated in this work on image recognition tasks with different benchmark datasets and architectures. Moreover, real world-like datasets are often unbalanced: this paper shows that when dealing with such type of datasets, the performance of adversarial training are mainly affected in term of standard accuracy.
[ abstract ] [
url] [
BibTeX]
N. Gentner, M. Carletti, G.A. Susto, A. Kyek, Y. Yang.
Enhancing Scalability of Virtual Metrology: a Deep Learning-based Approach for Domain Adaptation. Winter Simulation Conference, 2020
Abstract:
One of the main challenges in developing Machine Learning-based solutions for Semiconductor Manu-facturing is the high number of machines in the production and their differences, even when consideringchambers of the same machine; this poses a challenge in the scalability of Machine Learning-based so-lutions in this context, since the development of chamber-specific models for all equipment in the fab isunsustainable. In this work, we present a domain adaptation approach for Virtual Metrology (VM), one ofthe most successful Machine Learning-based technology in this context. The approach provides a commonVM model for two identical-in-design chambers whose data follow different distributions. The approach isbased on Domain-Adversarial Neural Networks and it has the merit of exploiting raw trace data, avoidingthe loss of information that typically affects VM modules based on features. The effectiveness of theapproach is demonstrated on real-world Etching.
[ abstract ] [
BibTeX]
T. Barbariol, E. Feltresi, S. Galvanin, D. Tescaro, G.A. Susto.
How to improve Water Cut measurements in MPFM using a Sensor Fusion and Machine Learning-based Approach. North Sea Flow Measurement Workshop, 2020 [
BibTeX]
A. Razman, A. Majeed, R.M. Musa, Z. Taha, G.A. Susto, Y. Mukai.
Hyperparameter Tuning of the Model for Hunger State Classification. SpringerBriefs in Applied Sciences and Technology, pp. 49-57, 2020
Abstract:
To increase the classification, the rate of prediction based on existing models requires additional technique or in this case optimizing the model. Hyperparameter tuning is an optimization technique that evaluates and adjusts the free parameters that define the behaviour of classifiers. Data sets were classified practical with classifiers like SVM, k-NN, ANN and DA. To further improve the design efficiency, the secondary optimization level called hyperparameter tuning will be further investigated. DA, SVM, k-NN, decision tree (Tree), logistic regression (LR), random forest tree (RF) and neural network (NN) are evaluated. The k-NN provided 96.47% of the test sets with the best reliability in classifications. Bayesian optimization has been used to refine the hyperparameter; hence, standardize Euclidean distance metric with a k value of one is the ideal hyperparameters which could achieve classification performance of 97.16%.
[ abstract ] [
url] [
BibTeX]
A. Razman, A. Majeed, R.M. Musa, Z. Taha, G.A. Susto, Y. Mukai.
Image Processing Features Extraction on Fish Behaviour. SpringerBriefs in Applied Sciences and Technology, pp. 25-36, 2020
Abstract:
This chapter demonstrates the pipeline from data collection until classifier models that achieve the best possible model in identifying the disparity between hunger states. The pre-processing segment describes the features of the data sets obtained by means of image processing. The method includes the simple moving average (SMA), downsizing factors, dynamic time warping (DTW) and clustering by the k-means method. This is to rationally assign the necessary significant information from the data collected and processed the images captured for demand feeder and fish motion as a synthesis for anticipating the state of fish starvation. The selection of features in this study takes place via the boxplot analysis and the principal component analysis (PCA) on dimensionality reduction. Finally, the validation of the hunger state will be addressed by comparing machine learning (ML) classifiers, namely the discriminant analysis (DA), support vector machine (SVM) and k-nearest neighbour (k-NN). The outcome in this chapter will validate the features from image processing as a tool for identifying the behavioural changes of the fish in school size.
[ abstract ] [
url] [
BibTeX]
G.A. Susto, M. Maggipinto, F. Zocco, S. McLoone.
Induced Start Dynamic Sampling for Wafer Metrology Optimization. IEEE Transactions on Automation Science and Engineering, vol. 17(1), pp. 418-432, 2020 [
url] [
BibTeX]
M. Carletti, N. Gentner, Y. Yang, A. Kyek, M. Maggipinto, A. Beghi, G.A. Susto.
Interpretable Anomaly Detection for Knowledge Discovery in Semiconductor Manufacturing. Winter Simulation Conference, 2020
Abstract:
Machine Learning-based Anomaly Detection (AD) approaches are efficient tools to monitor complexprocesses. One of the advantages of such approaches is that they provide a unique anomaly indicator,a quantitative index that captures the degree of ’outlierness’ of the process at hand considering possiblyhundreds or more variables at the same time, the typical scenario in semiconductor manufacturing. Oneof the drawback of such approaches is that Root Cause Analysis is not guided by the system itself. Inthis work, we show the effectiveness of a method, called DIFFI, to equip Isolation Forest, one of themost popular AD algorithms, with interpretability traits that can help corrective actions and knowledgeunderstanding. Such approach is validated on real world semiconductor manufacturing data related to aChemical Vapor Deposition process.
[ abstract ] [
BibTeX]
A. Razman, A. Majeed, R.M. Musa, Z. Taha, G.A. Susto, Y. Mukai.
Machine Learning in Aquaculture Hunger Classification of Lates calcarifer. 2020
Abstract:
This book highlights the fundamental association between aquaculture and engineering in classifying fish hunger behaviour by means of machine learning techniques. Understanding the underlying factors that affect fish growth is essential, since they have implications for higher productivity in fish farms. Computer vision and machine learning techniques make it possible to quantify the subjective perception of hunger behaviour and so allow food to be provided as necessary. The book analyses the conceptual framework of motion tracking, feeding schedule and prediction classifiers in order to classify the hunger state, and proposes a system comprising an automated feeder system, image-processing module, as well as machine learning classifiers. Furthermore, the system substitutes conventional, complex modelling techniques with a robust, artificial intelligence approach. The findings presented are of interest to researchers, fish farmers, and aquaculture technologist wanting to gain insights into the productivity of fish and fish behaviour.
[ abstract ] [
url] [
BibTeX]
A. Razman, A. Majeed, R.M. Musa, Z. Taha, G.A. Susto, Y. Mukai.
Monitoring and Feeding Integration of Demand Feeder Systems. SpringerBriefs in Applied Sciences and Technology, pp. 11-24, 2020
Abstract:
This chapter highlights the findings of the developmental monitoring systems for swimming pattern or motion analysis with regard to feeding behaviour. A benchmark for examining the framework on how scientists control fish in animal variable function factors was gathered and referred to gauge the adequate design in constructing a viable device. The validation of image processing and automated demand feeder to determine the results will also be considered, as a validation aspect between the system of tracking and the behaviour of the Lates calcarifer where the pixel intensity will be extracted as the features. The results of this chapter will enable the reader on the development of an integrated feeder scheme that consolidates surveillance scheme to identify the feeding behaviour and relation towards the specific growth rate (SGR).
[ abstract ] [
url] [
BibTeX]
R. Fantinel, A. Cenedese.
Multistep hybrid learning: CNN driven by spatial–temporal features for faults detection on metallic surfaces. Journal of Electronic Imaging, vol. 4, pp. 29, 2020
Abstract:
Solutions for the quality control of metallic surfaces are proposed. Specifically, we study a deflectometric apparatus based on coaxial structured light and the related algorithmic procedure, which is able to detect the faulty surface of a sample captured by a video sequence. First, by considering the metallic surface a dynamic scene illuminated under different light conditions, we develop the descriptor residuals of linear evolution of light (RLEL) that extracts the defectiveness information starting from the movement of the object without explicitly considering the physical characteristics of the light structure. Then, leveraging on RLEL, we present a hybrid learning (HL) technique capable of overcoming the data-driven approach used in classic deep learning (DL). By exploiting a multisteps training process, we combine the model-based descriptor RLEL and a classical data-driven convolutional neural network (CNN) to obtain an unconventional gray-box CNN, which exceeds the performance of popular DL solutions such as 3-D-inception and 3-D-residual DL networks. Remarkably, HL also shows its validity in comparing the performance of the same network structures trained not in a hybrid way, namely without the injection of the model-based information given by RLEL.
[ abstract ] [
url] [
BibTeX]
M. Maggipinto, G.A. Susto, P. Chaudhari.
Proximal Deterministic Policy Gradient. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020 [
BibTeX]
T. Barbariol, E. Feltresi, G.A. Susto.
Self-Diagnosis of Multiphase Flow Meters through Machine Learning-based Anomaly Detection. Energies, vol. 12(13), pp. 1 -- 24, 2020
Abstract:
Measuring systems are becoming increasingly sophisticated in order to tackle the challenges of modern industrial problems. In particular, the Multiphase Flow Meter (MPFM) combines different sensors and data fusion techniques to estimate quantities that are difficult to be measured like the water or gas content of a multiphase flow, coming from an oil well. The evaluation of the flow composition is essential for the well productivity prediction and management, and for this reason, the quantification of the meter measurement quality is crucial. While instrument complexity is increasing, demands for confidence levels in the provided measures are becoming increasingly more common. In this work, we propose an Anomaly Detection approach, based on unsupervised Machine Learning algorithms, that enables the metrology system to detect outliers and to provide a statistical level of confidence in the measures. The proposed approach, called AD4MPFM (Anomaly Detection for Multiphase Flow Meters), is designed for embedded implementation and for multivariate time-series data streams. The approach is validated both on real and synthetic data.
[ abstract ] [
url] [
BibTeX]
T. Barbariol, E. Feltresi, G.A. Susto, D. Tescaro, S. Galvanin.
Sensor Fusion And Machine LearningTechniques To Improve Water Cut Measurements Accuracy In Multiphase Application. 2020 SPE Annual Technical Conference and Exhibition, 2020 [
BibTeX]
M. Zanon, G. Zambonin, G.A. Susto, S. McLoone.
Sparse Logistic Regression: Comparison of Regularization and Bayesian implementations. Algorithms, vol. 13(6), pp. 1 -- 24, 2020
Abstract:
In knowledge-based systems, besides obtaining good output prediction accuracy, it is crucial to understand the subset of input variables that have most influence on the output, with the goal of gaining deeper insight into the underlying process. These requirements call for logistic model estimation techniques that provide a sparse solution, i.e., where coefficients associated with non-important variables are set to zero. In this work we compare the performance of two methods: the first one is based on the well known Least Absolute Shrinkage and Selection Operator (LASSO) which involves regularization with an ?1 norm; the second one is the Relevance Vector Machine (RVM) which is based on a Bayesian implementation of the linear logistic model. The two methods are extensively compared in this paper, on real and simulated datasets. Results show that, in general, the two approaches are comparable in terms of prediction performance. RVM outperforms the LASSO both in term of structure recovery (estimation of the correct non-zero model coefficients) and prediction accuracy when the dimensionality of the data tends to increase. However, LASSO shows comparable performance to RVM when the dimensionality of the data is much higher than number of samples that is p>>n
[ abstract ] [
url] [
BibTeX]
T. Barbariol, D. Masiero, E. Feltresi, G.A. Susto.
Time series Forecasting to detect anomalous behaviours in Multiphase Flow Meter. North Sea Flow Measurement Workshop, 2020 [
BibTeX]
A. Razman, A. Majeed, R.M. Musa, Z. Taha, G.A. Susto, Y. Mukai.
Time-Series Identification on Fish Feeding Behaviour. SpringerBriefs in Applied Sciences and Technology, pp. 37-47, 2020
Abstract:
The identification of relevant parameters that could describe the state of fish hunger is vital for ensuring the appropriate allocation of food to the fish. The establishment of these relevant parameters is non-trivial, particularly when developing an automated demand feeder system. The present inquiry is being undertaken to determine the hunger state of Lates calcarifer. For data collection, a video analysis system is used, and the video was taken all day, where the fish was fed by an automatic feeding system. Sixteen characteristics of the raw data set have been extracted through feature engineering for 0.5 min, 1.0 min, 1.5 min and 2.0 min, respectively, in accordance with the mean, peak, minimum and variability of each of the different time window scales. Furthermore, the features extracted have been evaluated through principal component analysis (PCA) both for dimension reduction and PCA with varimax rotation. The details were then categorized using support vector machine (SVM), K-NN and random forest tree (RF) classifiers. The best identification accuracy was shown with eight described features in the varimax-based PCA. The forecast results based on the K-NN model built on selected data characteristics showed a level of 96.5% indicating that the characteristics analysed were crucial to classifying the actions of hunger among fisheries.
[ abstract ] [
url] [
BibTeX]
2019
M. Carletti, C. Masiero, A. Beghi, G.A. Susto.
A deep learning approach for anomaly detection with industrial time series data: a refrigerators manufacturing case study. Procedia Manufacturing, vol. 38, pp. 233-240, 2019
Abstract:
We propose a Deep Learning (DL)-based approach for production performance forecasting in fresh products packaging. On the one hand, this is a very demanding scenario where high throughput is mandatory; on the other, due to strict hygiene requirements, unexpected downtime caused by packaging machines can lead to huge product waste. Thus, our aim is predicting future values of key performance indexes such as Machine Mechanical Efficiency (MME) and Overall Equipment Effectiveness (OEE). We address this problem by leveraging DL-based approaches and historical production performance data related to measurements, warnings and alarms. Different architectures and prediction horizons are analyzed and compared to identify the most robust and effective solutions. We provide experimental results on a real industrial case, showing advantages with respect to current policies implemented by the industrial partner both in terms of forecasting accuracy and maintenance costs. The proposed architecture is shown to be effective on a real case study and it enables the development of predictive services in the area of Predictive Maintenance and Quality Monitoring for packaging equipment providers.
[ abstract ] [
url] [
BibTeX]
M. Maggipinto, A. Beghi, G.A. Susto.
A Deep Learning-based Approach to Anomaly Detection with 2-Dimensional Data in Manufacturing. International Conference on Industrial Informatics (INDIN), pp. 187 -- 191, 2019
Abstract:
In modern manufacturing scenarios, detecting
anomalies in production systems is pivotal to keep high-quality
standards and reduce costs. Even in the Industry 4.0 context, realworld monitoring systems are often simple and based on the use
of multiple univariate control charts. Data-driven technologies
offer a whole range of tools to perform multivariate data analysis
that allow to implement more effective monitoring procedures.
However, when dealing with complex data, common data-driven
methods cannot be directly used, and a feature extraction
phase must be employed. Feature extraction is a particularly
critical operation, especially in anomaly detection tasks, and it is
generally associated with information loss and low scalability. In
this paper we consider the task of Anomaly Detection with twodimensional, image-like input data, by adopting a Deep Learningbased monitoring procedure, that makes use of convolutional
autoencoders. The procedure is tested on real Optical Emission
Spectroscopy data, typical of semiconductor manufacturing. The
results show that the proposed approach outperforms classical
feature extraction procedures.
[ abstract ] [
BibTeX]
G.A. Susto, L. Vettore, G. Zambonin, F. Altinier, D. Beninato, T. Girotto, M. Rampazzo, A. Beghi.
A Machine Learning-based Soft Sensor for Laundry Load Fabric Typology Estimation in Household Washer-Dryers. 5th IFAC International Conference on Intelligent Control and Automation Sciences, 2019 [
BibTeX]
N. Bargellesi, M. Carletti, A. Cenedese, G.A. Susto, M. Terzi.
A Random Forest-based Approach for Hand Gesture Recognition with Wireless Wearable Motion Capture Sensors. 5th IFAC International Conference on Intelligent Control and Automation Sciences, 2019
Abstract:
Gesture Recognition has a prominent importance in smart environment and home automation. Thanks to the availability of Machine Learning approaches it is possible for users to define gestures that can be associated with commands for the smart environment. In this paper we propose a Random Forest-based approach for Gesture Recognition of hand movements starting from wireless wearable motion capture data. In the presented approach, we evaluate different feature extraction procedures to handle gestures and data with different duration. To enhance reproducibility of our results and to foster research in the Gesture Recognition area, we share the dataset that we have collected and exploited for the present work.
[ abstract ] [
url] [
BibTeX]
G. Zambonin, F. Altinier, A. Beghi, L.D.S. Coelho, T. Girotto, M. Rampazzo, G. Reynoso-Meza, G.A. Susto.
Data-Driven Models for the Determination of Laundry Moisture Content in a Household Laundry Treatment Dryer Appliance. Lecture Notes in Control and Information Sciences – Proceedings, 2019
Abstract:
Two methods based on Regression are presented to determine the moisture content of items, e.g. clothes and the like, which are introduced in a household laundry dryer appliance. The aim of this work is to develop Soft Sensors (SS) for a household Heat Pump Washer-Dryer (WD-HP) to provide an estimation of the desired signal (the laundry
moisture during drying) avoiding the use of additional physical sensors with the goal of improving the current performance in terms of precision and energy consumption of the automatic drying cycle end and using the machine equipment already available. On an algorithmic point of view, the SS developed in this work exploits regularization methods and Genetic Programming for Symbolic Regression
in order to find suitable models for the purpose at hand. Proposed approaches have been tested on real data provided by an industrial partner.
[ abstract ] [
pdf] [
BibTeX]
L. Brunelli, C. Masiero, D. Tosato, A. Beghi, G.A. Susto.
Deep Learning-based Production Forecasting in Manufacturing: a Packaging Equipment Case Study. Procedia Manufacturing, vol. 38, pp. 248-255, 2019
Abstract:
We propose a Deep Learning (DL)-based approach for production performance forecasting in fresh products packaging. On the one hand, this is a very demanding scenario where high throughput is mandatory; on the other, due to strict hygiene requirements, unexpected downtime caused by packaging machines can lead to huge product waste. Thus, our aim is predicting future values of key performance indexes such as Machine Mechanical Efficiency (MME) and Overall Equipment Effectiveness (OEE). We address this problem by leveraging DL-based approaches and historical production performance data related to measurements, warnings and alarms. Different architectures and prediction horizons are analyzed and compared to identify the most robust and effective solutions. We provide experimental results on a real industrial case, showing advantages with respect to current policies implemented by the industrial partner both in terms of forecasting accuracy and maintenance costs. The proposed architecture is shown to be effective on a real case study and it enables the development of predictive services in the area of Predictive Maintenance and Quality Monitoring for packaging equipment providers.
[ abstract ] [
BibTeX]
M. Maggipinto, A. Beghi, S. McLoone, G.A. Susto.
DeepVM: A Deep Learning-based Approach with Automatic Feature Extraction for 2D Input Data Virtual Metrology. Journal of Process Control, vol. 84, pp. 24-34, 2019
Abstract:
Industry 4.0 encapsulates methods, technologies, and procedures that transform data into informed decisions and added value inan industrial context. In this regard, technologies such as Virtual Metrology or Soft Sensing have gained much interest in thelast two decades due to their ability to provide valuable knowledge for production purposes at limited added expense. However,these technologies have struggled to achieve wide-scale industrial adoption, largely due to the challenges associated with handlingcomplex data structures and the feature extraction phase of model building. This phase is generally hand-engineered and basedon specific domain knowledge, making it time consuming, difficult to automate, and prone to loss of information, thus ultimatelylimiting portability. Moreover, in the presence of complex data structures, such as 2-dimensional input data, there are no establishedprocedures for feature extraction. In this paper, we present a Deep Learning approach for Virtual Metrology, called DeepVM,that exploits semi-supervised feature extraction based on Convolutional Autoencoders. The proposed approach is demonstratedusing a real world semiconductor manufacturing dataset where the Virtual Metrology input data is 2-dimensional Optical EmissionSpectrometry data. The feature extraction method is tested with different types of state-of-the-art autoencoder.
[ abstract ] [
url] [
BibTeX]
L. Meneghetti, G.A. Susto, S. Del Favero.
Detection of insulin pump malfunctioning to improve safety in artificial pancreas using unsupervised algorithms. Journal of Diabetes Science and Technology, 2019
Abstract:
Background:
Recent development of automated closed-loop (CL) insulin delivery systems, the so-called artificial pancreas (AP), improved the quality of type 1 diabetes (T1D) therapy. As new technologies emerge, patients put increasing trust in their therapeutic devices; therefore, it becomes increasingly important to detect malfunctioning affecting such devices. In this work, we explore a new paradigm to detect insulin pump faults (IPFs) that use unsupervised anomaly detection.
Methods:
We generated CL data corrupted with IPFs using the latest version of the T1D Padova/UVA simulator. From the data, we extracted several features capable to describe the patient dynamics and making more apparent suspicious data portions. Then, a feature selection is performed to determine the optimal feature set. Finally, the performance of several popular unsupervised anomaly detection algorithms is analyzed and compared on the identified optimal feature set.
Results:
Using the identified optimal configuration, the best performance is obtained by the Histogram-Based Outlier Score (HBOS) algorithm, which detected 87% of the IPF with only 0.08 false positives per day on average. Isolation forest is the best algorithm that offers more conservative performances, detection of 85% of the faults but only 0.06 false positives per day on average.
Conclusion:
Unsupervised anomaly detection algorithms can be used effectively to detect IPFs and improve the safety of the AP. Future studies will be dedicated to test the presented method inside dedicated clinical trials.
[ abstract ] [
url] [
BibTeX]
M. Carletti, C. Masiero, A. Beghi, G.A. Susto.
Explainable Machine Learning in Industry 4.0: Evaluating Feature Importance in Anomaly Detection to Enable Root Cause Analysis. 2019 IEEE International Conference on Systems, Man, and Cybernetics, 2019
Abstract:
In the past recent years, Machine Learning methodologies have been applied in countless application areas. In particular, they play a key role in enabling Industry4.0. However, one of the main obstacles to the diffusion of Machine Learning-based applications is related to the lack of interpretability of most of these methods. In this work, we propose an approach for defining a ‘feature importance’ in Anomaly Detection problems. Anomaly Detection is an important Machine Learning task that has an enormous applicability in industrial scenarios. Indeed, it is extremely relevant for the purpose of quality monitoring. Moreover, it is often the first step towards the design of a Machine Learning based smart monitoring solutions because Anomaly Detection can be implemented without the need of labelled data. The proposed feature importance evaluation approach is designed for Isolation Forest, one of the most commonly used algorithm for Anomaly Detection. The efficacy of the proposed method is tested on synthetic and real industrial datasets.
[ abstract ] [
url] [
BibTeX]
A. Purpura, C. Masiero, G. Silvello, G.A. Susto.
Feature Selection for Emotion Classification. 10th Italian Information Retrieval Workshop (IIR), 2019 [
BibTeX]
A. Razman, A.S.A. Ghani, A. Cenedese, F.A. Adnan, G.A. Susto, K.M. Ismail, R.M. Musa, Y. Mukai, Z. Taha, A. Majeed.
Hunger Classification of Lates Calcarifer by means of an automated feeder and image processing. Computers and Electronics in Agriculture, vol. 163, 2019
Abstract:
In an automated demand feeder system, underlining the parameters that contribute to fish hunger is crucial in order to facilitate an optimised food allocation to the fish. The present investigation is carried out to classify the hunger state of Lates calcarifer. A video surveillance technique is employed for data collection. The video was taken throughout the daytime, and the fish were fed through an automated feeding system. It was demonstrated through this investigation that the use of such automated system does contribute towards a higher specific growth rate percentage of body weight as well as the total length by approximately 26.00% and 15.00%, respectively against the conventional time-based method. Sixteen features were feature engineered from the raw dataset into window sizes ranging from 0.5?min, 1.0?min, 1.5?min and 2.0?min, respectively coupled with the mean, maximum, minimum and variance for each of the distinctive temporal window sizes. In addition, the extracted features were analysed through Principal Component Analysis (PCA) for dimensionality reduction as well as PCA with varimax rotation. The data were then classified using a Support Vector Machine (SVM), k-Nearest Neighbor (k-NN) and Random Forest Tree models. It was demonstrated that the varimax based PCA yielded the highest classification accuracy with eight identified features. The prediction results based of the developed k-NN model on the selected features on the test data exhibited a classification rate of 96.5% was achieved suggesting that the features examined are non-trivial in classifying the fish hunger behaviour.
[ abstract ] [
url] [
BibTeX]
M. Maggipinto, E. Pesavento, F. Altinier, G. Zambonin, A. Beghi, G.A. Susto.
Laundry Fabric Classification in Vertical AxisWashing Machines using Data-driven Soft Sensors. Energies, vol. 12(21), 2019
Abstract:
Embedding household appliances with smart capabilities is becoming common practice among major fabric-care producers that seek competitiveness on the market by providing more efficient and easy-to-use products. In Vertical Axis Washing Machines (VA-WM), knowing the laundry composition is fundamental to setting the washing cycle properly with positive impact both on energy/water consumption and on washing performance. An indication of the load typology composition (cotton, silk, etc.) is typically provided by the user through a physical selector that, unfortunately, is often placed by the user on the most general setting due to the discomfort of manually changing configurations. An automated mechanism to determine such key information would thus provide increased user experience, better washing performance, and reduced consumption; for this reason, we present here a data-driven soft sensor that exploits physical measurements already available on board a commercial VA-WM to provide an estimate of the load typology through a machine-learning-based statistical model of the process. The proposed method is able to work in a resource-constrained environment such as the firmware of a VA-WM.
[ abstract ] [
url] [
BibTeX]
T. Barbariol, E. Feltresi, G.A. Susto.
Machine Learning approaches for Anomaly Detection in Multiphase Flow Meters. 5th IFAC International Conference on Intelligent Control and Automation Sciences, 2019 [
BibTeX]
G. Zambonin, F. Altinier, A. Beghi, L.D.S. Coelho, N. Fiorella, T. Girotto, M. Rampazzo, G. Reynoso-Meza, G.A. Susto.
Machine Learning-based Soft Sensors for the Estimation of Laundry Moisture Content in Household Dryer Appliances. Energies, vol. 20(12), pp. 1 -- 24, 2019
Abstract:
The aim is to develop soft sensors (SSs) to provide an estimation of the laundry moisture of clothes introduced in a household Heat Pump Washer–Dryer (WD-HP) appliance. The developed SS represents a cost-effective alternative to physical sensors, and it aims at improving the WD-HP performance in terms of drying process efficiency of the automatic drying cycle. To this end, we make use of appropriate Machine Learning models, which are derived by means of Regularization and Symbolic Regression methods. These methods connect easy-to-measure variables with the laundry moisture content, which is a difficult and costly to measure variable. Thanks to the use of SSs, the laundry moisture estimation during the drying process is effectively available. The proposed models have been tested by exploiting real data through an experimental test campaign on household drying machines.
[ abstract ] [
url] [
BibTeX]
A. Purpura, M. Maggipinto, G. Silvello, G.A. Susto.
Probabilistic Word Embeddings in Neural IR: A Promising Model That Does Not Work as Expected (For Now). 5th ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR), 2019 [
BibTeX]
A. Purpura, C. Masiero, G. Silvello, G.A. Susto.
Supervised Lexicon Extraction for Emotion Classification. Proceedings of the 28th International Conference on World Wide Web Companion, pp. 1071 - 1078, 2019
Abstract:
Emotion Classification (EC) aims at assigning an emotion label to a textual document with two inputs – a set of emotion labels (e.g. anger, joy, sadness) and a document collection. The best performing approaches for EC are dictionary-based and suffer from two main limitations: (i) the out-of-vocabulary (OOV) keywords problem and (ii) they cannot be used across heterogeneous domains. In this work, we propose a way to overcome these limitations with a supervised approach based on TF-IDF indexing and Multinomial Linear Regression with Elastic-Net regularization to extract an emotion lexicon and classify short documents from diversified domains. We compare the proposed approach to state-of-the-art methods for document representation and classification by running an extensive experimental study on two shared and heterogeneous data sets.
[ abstract ] [
url] [
BibTeX]
T. Barbariol, E. Feltresi, G.A. Susto.
Validity and consistency of MPFM data through a Machine learning-based system. 37th International North Sea Flow Measurement Workshop, 2019 [
BibTeX]
M. Maggipinto, G.A. Susto, F. Zocco, S. McLoone.
What are the Most Informative Data for Virtual Metrology? A use case on Multi-Stage Processes Fault Prediction. IEEE Conference on Automation Science and Engineering, 2019 [
BibTeX]
2018
M. Maggipinto, M. Terzi, C. Masiero, A. Beghi, G.A. Susto.
A Computer Vision-inspired Deep Learning Architecture for Virtual Metrology modeling with 2-Dimensional Data. IEEE Transactions on Semiconductor Manufacturing, vol. 31(3), pp. 376 - 384, 2018
Abstract:
The rise of Industry 4.0 and data-intensive manufacturing makes Advanced Process Control (APC) applications more relevant than ever for process/production optimization, related costs reduction, and increased efficiency. One of the most important APC technologies is Virtual Metrology (VM). VM aims at exploiting information already available in the process/system under exam, to estimate quantities that are costly or impossible to measure. Machine Learning approaches are the foremost choice to design VM solutions. A serious drawback of traditional Machine Learning methodologies is that they require a features extraction phase that generally limits the scalability and performance of VM solutions. Particularly, in presence of multi-dimensional data, the feature extraction process is based on heuristic approaches that may capture features with poor predictive power. In this work, we exploit modern Deep Learning-based technologies that are able to automatically extract highly informative features from the data, providing more accurate and scalable VM solutions. In particular, we exploit Deep Learning architectures developed in the realm of Computer Vision to model data that have both spatial and time evolution. The proposed methodology is tested on a real industrial dataset related to Etching, one of the most important Semiconductor Manufacturing processes. The dataset at hand contains Optical Emission Spectroscopy data and it is paradigmatic of the feature extraction problem in VM under examination.
[ abstract ] [
url] [
BibTeX]
M. Maggipinto, C. Masiero, A. Beghi, G.A. Susto.
A Convolutional Autoencoder Approach for Feature Extraction in Virtual Metrology. Procedia Manufacturing, 28th International Conference on Flexible Automation and Intelligent Manufacturing, vol. 17, pp. 126-133, 2018
Abstract:
Exploiting the huge amount of data collected by industries is definitely one of the main challenges of the so-called Big Data era. In this sense, Machine Learning has gained growing attention in the scientific community, as it allows to extract valuable information by means of statistical predictive models trained on historical process data. In Semiconductor Manufacturing, one of the most extensively employed data-driven applications is Virtual Metrology, where a costly or unmeasurable variable is estimated by means of cheap and easy to obtain measures that are already available in the system. Often, these measures are multi-dimensional, so traditional Machine Learning algorithms cannot handle them directly. Instead, they require feature extraction, that is a preliminary step where relevant information is extracted from raw data and converted into a design matrix. Features are often hand-engineered and based on specific domain knowledge. Moreover, they may be difficult to scale and prone to information loss, affecting the effectiveness and maintainability of machine learning procedures. In this paper, we present a Deep Learning method for semi-supervised feature extraction based on Convolutional Autoencoders that is able to overcome the aforementioned problems. The proposed method is tested on a real dataset for Etch rate estimation. Optical Emission Spectrometry data, that exhibit a complex bi-dimensional time and wavelength evolution, are used as input.
[ abstract ] [
url] [
BibTeX]
G.A. Susto, M. Maggipinto, F. Zocco, S. McLoone.
A Dynamic Sampling Approach for Cost Reduction in Semiconductor Manufacturing. Procedia Manufacturing, 28th International Conference on Flexible Automation and Intelligent Manufacturing, vol. 17, pp. 1031-1038, 2018
Abstract:
In semiconductor manufacturing, metrology is generally a high cost, non-value added operation that impacts significantly on cycle time. As such, reducing wafer metrology continues to be a major target in semiconductor manufacturing efficiency initiatives. Data-driven spatial dynamic sampling methodologies are here compared. Such strategies aim at minimizing the number of sites that need to be measured across a wafer surface while maintaining an acceptable level of wafer profile reconstruction accuracy. The Spatial Dynamic Sampling approaches are based on analyzing historical metrology data to determine, from a set of candidate wafer sites, the minimum set of sites that need to be monitored in order to reconstruct the full wafer profile using statistical regression techniques. Spatial Dynamic sampling is then implemented in various strategies that guarantee coverage of all the possible sites in a given set of process iteration. In this way, the risk of not detecting previously unseen process behavior is mitigated. In this work, we demonstrate the efficacy of spatial dynamic sampling methodologies using both simulation studies and metrology data from a semiconductor manufacturing process.
[ abstract ] [
url] [
BibTeX]
G.A. Susto, M. Terzi, C. Masiero, S. Pampuri, A. Schirru.
A Fraud Detection Decision Support System via Human On-line Behavior Characterization and Machine Learning. 1st International Conference on Artificial Intelligence for Industries (AI4I), pp. 9-14, 2018
Abstract:
On-line and phone banking frauds are responsible for millions of dollars loss every year. In this work, we propose a Machine Learning-based Decision Support System to automatically associate a risk factor to each transaction performed through an on-line/mobile banking system. The proposed approach has a hierarchical architecture: First, an unsupervised Machine Learning module is used to detect abnormal patterns or wrongly labeled transactions; then, a supervised module provides a risk factor for the transactions that were not marked as anomalies in the previous step. Our solution exploits personal and historical information about the user, statistics that describe online traffic generated on the online/mobile banking system, and features extracted from motives of the transactions. The proposed approach deals with dataset unbalancing effectively. Moreover, it has been validated on a large database of transactions and on-line traffic provided by an industrial partner.
[ abstract ] [
url] [
BibTeX]
N. Normani, A. Urru, L. Abraham, M. Walsh, S. Tedesco, A. Cenedese, G.A. Susto, B. O'Flynn.
A Machine Learning Approach for Gesture Recognition with a Lensless Smart Sensor System. 15th International Conference on Wearable and Implantable Body Sensor Networks, pp. 136--139, 2018
Abstract:
Hand motion tracking traditionally re-
quires highly complex and expensive systems in terms
of energy and computational demands. A low-power,
low-cost system could lead to a revolution in this field
as it would not require complex hardware or additional
equipment. The present paper exploits the Multiple
Point Tracking algorithm developed at the Tyndall
National Institute as the basic algorithm to perform
a series of gesture recognition tasks. The hardware
relies upon the combination of a stereoscopic vision
of two novel Lensless Smart Sensors (LSS) combined
with IR filters and five hand-held LEDs to track. Track-
ing common gestures generates a six-gestures dataset,
which is then employed to train three Machine Learning
models: k-Nearest Neighbors, Support Vector Machine
and Random Forest. An offline analysis highlights how
different LEDs’ positions on the hand affect the clas-
sification accuracy. The comparison shows how the
Random Forest outperforms the other two models with
a classification accuracy of 90-91%.
[ abstract ] [
url] [
BibTeX]
S. McLoone, A.B. Johnston, G.A. Susto.
A Methodology for Efficient Dynamic Spatial Sampling and Reconstruction of Wafer Profiles. IEEE Transactions on Automation Science and Engineering, vol. 15(4), pp. 1692-1703, 2018
Abstract:
In semiconductor manufacturing, metrology is generally a high cost nonvalue-added operation that significantly impacts on cycle time. As such, reducing wafer metrology continues to be a major target in semiconductor manufacturing efficiency initiatives. A novel data-driven spatial dynamic sampling methodology is presented that minimizes the number of sites that need to be measured across a wafer surface while maintaining an acceptable level of wafer profile reconstruction accuracy. The methodology is based on analyzing historical metrology data using forward selection component analysis (FSCA) to determine, from a set of candidate wafer sites, the minimum set of sites that need to be monitored in order to reconstruct the full wafer profile using statistical regression techniques. Dynamic sampling is then implemented by clustering unmeasured sites in accordance with their similarity to the FSCA selected sites and temporally selecting a different sample from each cluster. In this way, the risk of not detecting previously unseen process behavior is mitigated. We demonstrate the efficacy of the proposed methodology using both simulation studies and metrology data from a semiconductor manufacturing process.
[ abstract ] [
url] [
pdf] [
BibTeX]
G.A. Susto, G. Zambonin, F. Altinier, E. Pesavento, A. Beghi.
A Soft Sensing approach for Clothes Load Estimation in Consumer Washing Machines. 2nd IEEE Conference on Control Technology and Applications (CCTA), 2018
Abstract:
Fabric care home appliances are pervasive inhouses worldwide and manufactures are constantly working for improving product performance, efficiency, and usability. From a manufacturing perspective, increase of performancehas to be attained while minimizing the increase of production costs. In this context, a Soft Sensor for estimating the clothes load weight in a horizontal axis household washing machines ishere presented. The proposed Soft Sensor is based on Machine Learning approaches. Several methodologies, both time-seriesand feature-based, are employed and compared. The approach has been tested on real world data on commercial household washing machines.
[ abstract ] [
url] [
BibTeX]
L. Meneghetti, M. Terzi, G.A. Susto, S. Del Favero, C. Cobelli.
Fault Detection in Artificial Pancreas: A Model-Free approach. Conference on Decision and Control (CDC), pp. 303-308, 2018
Abstract:
Subjects affected by Type I Diabetes (T1D) are constantly confronted with the complicated problem of administering themselves an adequate amount of insulin, so as to keep their blood-glucose concentration in a nearly physiological range. Recently, powerful technological tools have been developed to better face this challenge, in particular the so-called Artificial Pancreas (AP). Unluckily, the AP actuator, an insulin pump, is subject to faults, with potential serious consequences for subjects' safety. This calls for the development of advanced fault detection (FD) methods, leveraging the unprecedented data availability in this application. In this paper we tackle the problem of detecting insulin pump malfunctioning using a model-free approach, so that the complex sub-task of identifying a model of patients physiology is avoided. Moreover, we employed unsupervised methods since labeled data are hardly available in practice. The adopted data-driven Anomaly Detection (AD) methods are Local Outlier Factor and Connectivity-based Outlier Factor. The methods are applied on a feature set able to account for the physiological dynamics of T1D patients. The proposed algorithms are tested on a synthetic dataset, generated using the “UVA/Padova Type 1 Diabetic Simulator”, an accurate nonlinear computer simulator of the T1D subject physiology. Both methods show precision ~75% and recall ~60%• The described approach is suitable both for embedding in medical devices, such as the AP, and implementation in cloud-based remote monitoring systems.
[ abstract ] [
url] [
BibTeX]
G.A. Susto, M. Maggipinto, G. Zannon, F. Altinier, E. Pesavento, A. Beghi.
Machine Learning-based Laundry Weight Estimation for Vertical Axis Washing Machines. European Control Conference (ECC2018), pp. 3179 - 3184, 2018
Abstract:
In laundry treatment appliances, the weight ofthe laundry loaded by the user inside the drum dramaticallyaffects the operating behavior. Therefore, it is important toobtain a good estimate of the said quantity in order tocorrectly configure the machine before the washing/dryingstarts. In Vertical Axis Washing Machinesthe laundry weightis computed by exploiting the quantity of water absorbed bythe clothes. However, such approach does not grant accurateresults because the water absorption depends on the clothesfabric. For this reason, we propose a Soft Sensing approachfor weight estimation that exploits the information obtainedfrom physical sensors available on board without added costs.Data-driven Soft Sensors are developed, where, using MachineLearning techniques, a statistical model of the phenomenon ofinterest is created from a set of sample data.
[ abstract ] [
url] [
BibTeX]
G. Zambonin, F. Altinier, L. Corso, A. Beghi, G.A. Susto.
Soft Sensors for Estimating Laundry Weight in Household Heat Pump Tumble Dryers. Conference on Automation Science and Engineering (CASE), 2018
Abstract:
The laundry weight of the loaded in the drum of a laundry treatment machine is an important piece of information; laundry weight can be used to set various washing/drying cycle parameters and to optimize performances and efficiency. Unfortunately, dedicated weight sensors cannot be included in consumer laundry equipment given the related costs. For this reason, we present in this work a soft sensor approach for estimating laundry weight based on sensors already in place in a laundry treatment equipment; in particular, we consider here a heat pump tumble dryer as case study. The proposed soft sensor is based on regularization, a popular approach in Machine Learning to provide models without overfitting the training data. Different studies are provided in this work, by considering different constrains on timing and complexity of the Soft Sensor solution. The developed Soft Sensors have been tested on laboratory data provided by an industrial partner.
[ abstract ] [
url] [
BibTeX]
M. Zorzi, A. Chiuso.
The Harmonic Analysis of Kernel Functions. Automatica - accepted, 2018 [
BibTeX]
A. Purpura, C. Masiero, G.A. Susto.
WS4ABSA: an NMF-based Weakly-Supervised Approach for Aspect-Based Sentiment Analysis with Application to Online Reviews. Lecture Notes in Computer Science, pp. 386--401, 2018
Abstract:
The goal of Aspect-Based Sentiment Analysis is to identify opinions regarding specific targets and the corresponding sentiment polarity in a document. The proposed approach is designed for real-world scenarios, where the amount of available information and annotated data is often too limited to train supervised models. We focus on the two core tasks of Aspect-Based Sentiment Analysis: aspect and sentiment polarity classification. The first task – which consists in the identification of the opinion targets in a document – is tackled by means of a weakly-supervised technique based on Non-negative Matrix Factorization. This strategy allows users to easily embed some a priori domain knowledge by means of short seed terms lists. Experimental results on publicly available data sets related to online reviews suggest that the proposed approach is very flexible and can be easily adapted to different languages and domains.
[ abstract ] [
url] [
BibTeX]
2017
M. Terzi, A. Cenedese, G.A. Susto.
A multivariate symbolic approach to activity recognition for wearable applications. IFAC World Congress 2017, pp. 16435-16440, 2017
Abstract:
With the aim of monitoring human activities (in critical tasks as well as in leisure and
sport activities), wearable devices provide enhanced usability and seamless human experience
with respect to other portable devices (e.g. smartphones). At the same time, though, wearable
devices are more resource-constrained in terms of computational capability and memory, which
calls for the design of algorithmic solutions that explicitly take into account these issues. In
this paper, a symbolic approach for activity recognition with wearable devices is presented:
the Symbolic Aggregate approXimation technique is here extended to multi-dimensional time
series, in order to capture the mutual information of different dimensions. Moreover, a novel
approach to identify gestures within activities is here presented. The performance of the
proposed methodology is tested on the two heterogeneous datasets related to cross-country
skiing and daily activities.
[ abstract ] [
pdf] [
BibTeX]
G.A. Susto, M. Terzi, A. Beghi.
Anomaly Detection Approaches for Semiconductor Manufacturing. Procedia Manufacturing,
27th International Conference on Flexible Automation and Intelligent Manufacturing, vol. 11, pp. 2018-2024, 2017
Abstract:
Smart production monitoring is a crucial activity in advanced manufacturing for quality, control and maintenance purposes. Advanced Monitoring Systems aim to detect anomalies and trends; anomalies are data patterns that have different data characteristics from normal instances, while trends are tendencies of production to move in a particular direction over time. In this work, we compare state-of-the-art ML approaches (ABOD, LOF, onlinePCA and osPCA) to detect outliers and events in high-dimensional monitoring problems. The compared anomaly detection strategies have been tested on a real industrial dataset related to a Semiconductor Manufacturing Etching process
[ abstract ] [
url] [
BibTeX]
G.A. Susto, A. Beghi, S. McLoone.
Anomaly Detection through on-line Isolation Forest: an Application to Plasma Etching. IEEE/SEMI Advanced Semiconductor Manufacturing Conference, 2017
Abstract:
Advanced Monitoring Systems are fundamental in advanced manufacturing for control, quality and maintenance purposes. Nowadays, with the increasing availability of data in production and equipment, the need for high-dimensional Anomaly Detection techniques is thriving; anomalies are data patterns that have different data characteristics from normal production instances and that may be associated with faults or drifts in production. Tools for dealing with high-dimensional monitoring problems are provided by Machine Learning: in this paper, we test the performance of a state-of-the-art anomaly detection technique, called Isolation Forest, on a real industrial dataset related to Etching, one of the most important semiconductor manufacturing process. The monitoring has been performed exploiting Optical Spectroscopy Data.
[ abstract ] [
url] [
BibTeX]
M. Terzi, C. Masiero, A. Beghi, M. Maggipinto, G.A. Susto.
Deep Learning for Virtual Metrology: Modeling with Optical Emission Spectroscopy Data. IEEE 3rd International Forum on Research and Technologies for Society and Industry (RTSI), 2017
Abstract:
Virtual Metrology is one of the most prominentAdvanced Process Control applications in SemiconductorManufacturing. The goal of Virtual Metrology is to provideestimations of quantities that are important for production andto assess process quality, but are costly or impossible to bemeasured. Virtual Metrology solutions are based on MachineLearning approaches. The bottleneck of developing VirtualMetrology solutions is generally the feature extraction phase thatcan be time-consuming, and can deeply affect the estimationperformance. In particular, in presence of data with additionaldimensions, such as time, feature extraction is typicallyperformed by means of heuristic approaches that may pickfeatures with poor predictive capabilities. In this work, wepropose the usage of modern Deep Learning approaches tobypass manual feature extraction and to provide highperformanceautomatic Virtual Metrology modules. Theproposed methodology is tested on a real industrial datasetrelated to Etching. The dataset at hand contains OpticalEmission Spectroscopy data and it is paradigmatic of the featureextraction problem under examination.
[ abstract ] [
url] [
BibTeX]
M. Todescato, A. Dalla Libera, R. Carli, G. Pillonetto, L. Schenato.
Distributed Kalman Filtering for Time-Space Gaussian Processes (with proofs). 20th World Congress of International Federation of Automatic Control (IFAC), pp. 13234--13239, 2017 [
pdf] [
BibTeX]
G. Prando, M. Zorzi, A. Bertoldo, A. Chiuso.
Estimating effective connectivity in linear brain network models. 56th IEEE Conference on Decision and Control, pp. accepted, 2017 [
BibTeX]
F. Altinier, E. Pesavento, A. Beghi, G.A. Susto, G. Zambonin, G. Zannon.
Method for the Determination of a Laundry Weight in a Laundry Treatment Appliance. (Pub. No.: WO/2017/144085 International Application No.: PCT/EP2016/053788), 2017
Abstract:
The invention relates to a method for the determination of a laundry weight in a laundry treatment appliance comprising: Selecting a laundry program in the laundry treatment appliance; Starting the selected laundry program; Sensing a plurality of parameters indicating operating conditions of the laundry treatment appliance during the laundry program; and Predicting a weight of the laundry present within the laundry treatment appliance based on said plurality of parameters by means of a data-driven soft sensor.
[ abstract ] [
url] [
BibTeX]
M. Todescato, A. Carron, R. Carli, G. Pillonetto, L. Schenato.
Multi-Robots Gaussian Estimation and Coverage Control: from Server-based to Peer-to-Peer Architecture. Automatica, vol. 80, pp. 284--294, 2017 [
url] [
pdf] [
BibTeX]
M. Zorzi, A. Chiuso.
Sparse plus Low rank Network Identification: A Nonparamteric Approach. Automatica, vol. 53(2), 2017 [
BibTeX]
G.A. Susto, A. Cenedese, M. Terzi.
Big Data Application in Power Systems - Ch. 2.5. Time Series Classication Methods: Review and Applications to Power Systems Data. 2017
Abstract:
The diffusion in Power Systems of distributed renewable energy resources, electric vehicles and controllable loads has made advanced monitoring systems fundamental to cope with the consequent disturbances in power flows; advanced monitoring systems can be employed for Anomaly Detection, Root Cause Analysis and Control purposes.
Several Machine Learning-based approaches have been developed in the past recent years to detect if a power system is running under anomalous conditions and, eventually, to classify such situation with respect to known problems.
One of the aspects that makes Power Systems challenging to be tackled, is that the monitoring has to be performed on streams of data that have a time series evolution; this issue is generally tackled by performing a features extraction procedure before the classication phase. The features extraction phase consists of translating the informative content of time series data into scalar quantities: such procedure may be a time-consuming step that requires the involvement of process experts to avoid loss of information in the making; moreover, extracted features designed to capture certain behaviors of the system, may not be informative under unseen conditions leading to poor monitoring performances.
A different type of data-driven approaches, that will be reviewed in this chapter, allow to perform classication directly on the raw time series data, avoiding the features extraction phase: among these approaches, Dynamic Time Warping and Symbolic-based methodologies have been widely applied in many application areas.
In the following, pros and cons of each approach will be discussed and practical implementation guidelines will be provided.
[ abstract ] [
url] [
BibTeX]
2016
A. Cenedese, G.A. Susto, M. Terzi.
A Parsimonious Approach for Activity Recognition with Wearable Devices: an Application to Cross-country Skiing. European Control Conference 2016 (ECC'16), pp. 2541-2546, 2016
Abstract:
With the aim of monitoring the human activity,
wearable devices provide an enhanced usability and a seamless
human experience with respect to other portable devices (e.g.
smartphones) in critical tasks as well as in leisure and sport
activities. At the same time, though, wearable devices are more
resource-constrained in terms of computational capability and
memory, which calls for the design of algorithmic solutions
that explicitly take into account these issues. In this paper, a
parsimonious approach for activity recognition with wearable
devices is presented. The methodology is based on Relevant
Vector Machines (RVMs), a sparse machine learning framework
for classification, and allows to tackle the activity recognition
problem by identifying the two phases of Event Identification
and Gesture Recognition. The performance of the presented
methodology is tested on the interesting case study of cross-
country skiing (classic style): such a dataset presents three
different classes of gestures in addition to non-gesture activities
and has been obtained by recording the training sessions
of a heterogeneous set of executors in different environment
conditions.
[ abstract ] [
url] [
BibTeX]
T. Chen, G. Pillonetto, A. Chiuso, L. Ljung.
DC kernel - a stable generalized first order spline kernel. Proc. of CDC 2016 - accepted, 2016 [
BibTeX]
G.A. Susto, A. Beghi.
Dealing with Time-Series Data in Predictive Maintenance Problems. Emerging Technologies and Factory Automation, 2016
Abstract:
In this paper an approach to deal with Predictive Maintenance (PdM) problems with time-series data is discussed. PdM is a important approach to tackle maintenance and it is gaining an increasing attention in advanced manufacturing to minimize scrap materials, downtime, and associated costs. PdM approaches are generally based on Machine Learning tools that require the availability of historical process and maintenance data. Given the exponential growth in data logging in modern equipment, time series dataset are increasingly available in PdM applications. To exploit time series data for PdM, a functional learning methodology, namely Supervised Aggregative Feature Extraction (SAFE), is here employed on a semiconductor manufacturing maintenance problem.
[ abstract ] [
url] [
BibTeX]
A. Cenedese, L. Minetto, G.A. Susto, M. Terzi.
Human Activity Recognition with Wearable Devices: A Symbolic Approach. PsychNology, vol. 14(2-3), pp. 99-115, 2016
Abstract:
In the context of activity recognition, wearable devices are nowadays the preferable hardware
thanks to their usability, user experience and performances; at the same time, these devices
present limitations in terms of computational capability and memory, which force the algorithm
design to be at the same time efficient and simple. In this work, we adopt Symbolic Aggregate
Approximation (SAX), a symbolic approach for information retrieval in time series data that
allows dimensionality and numerosity reduction; SAX is employed here, in combination with
1-Nearest Neighbor classifier, to identify activity phases in continuous repetitive activities from
inertial time-series data. The proposed approach is validated on a cross-country skiing dataset
and on a daily living activities dataset.
[ abstract ] [
url] [
BibTeX]
M. Todescato, A. Carron, R. Carli, L. Schenato, G. Pillonetto.
Machine Learning meets Kalman Filtering (with proofs). 55th IEEE Conference on Decision and Control (CDC16), pp. 4594--4599, 2016 [
pdf] [
BibTeX]
S. Soatto, A. Chiuso.
Modeling Visual Representations:Defining Properties and Deep Approximations. International Conference on Learning Representation (ICLR), 2016 [
BibTeX]
G.A. Susto, A. Schirru, S. Pampuri, S. McLoone.
Supervised Aggregative Feature Extraction for Big Data Time Series Regression. IEEE Transactions on Industrial Informatics, vol. 12, pp. 1243 - 1252, 2016
Abstract:
In many applications, and especially thosewhere batch processes are involved, a target scalar outputof interest is often dependent on one or more time seriesof data. With the exponential growth in data logging inmodern industries such time series are increasingly availablefor statistical modeling in soft sensing applications. In orderto exploit time series data for predictive modelling, it isnecessary to summarise the information they contain as aset of features to use as model regressors. Typically thisis done in an unsupervised fashion using simple techniquessuch as computing statistical moments, principalcomponents or wavelet decompositions, often leading tosignificant information loss and hence suboptimal predictivemodels. In this paper, a functional learning paradigm isexploited in a supervised fashion to derive continuous,smooth estimates of time series data (yielding aggregatedlocal information), while simultaneously estimating a continuousshape function yielding optimal predictions. Theproposed Supervised Aggregative Feature Extraction (SAFE)methodology can be extended to support nonlinear predictivemodels by embedding the functional learning framework ina Reproducing Kernel Hilbert Spaces setting. SAFE has anumber of attractive features including closed form solutionand the ability to explicitly incorporate first and secondorder derivative information. Using simulation studies and apractical semiconductor manufacturing case study we highlightthe strengths of the new methodology with respect tostandard unsupervised feature extraction approaches.
[ abstract ] [
url] [
BibTeX]
2015
D. Romeres, G. Pillonetto, A. Chiuso.
Identification of stable models via nonparametric prediction error methods. Proc. of the European Control Conference, 2015
Abstract:
A new Bayesian approach to linear system identification
has been proposed in a series of recent papers. The
main idea is to frame linear system identification as predictor
estimation in an infinite dimensional space, with the aid of
regularization/Bayesian techniques. This approach guarantees
the identification of stable predictors based on the prediction
error minimization. Unluckily, the stability of the predictors
does not guarantee the stability of the impulse response of
the system. In this paper we propose and compare various
techniques to guarantee that the final model identified following
this Bayesian approach is stable. First, we consider the socalled
“LMI - constraint” approach and adapt it to constrain
the eigenvalues of the estimated model within the unit circle.
A second possibility which is being considered is to add to the
“classic” Stable-Spline algorithm a penalty term, depending on
the maximum absolute value of the eigenvalue of the system.
This last technique has the advantage of being integrated
directly inside the pre-existing optimization problem and not to
simply post-process the estimated model to guarantee stability.
Finally, we considered a Monte Carlo Markov Chain approach
sampling in both the space of hyper-parameters and of impulse
responses. Simulations results comparing these techniques will
be provided.
[ abstract ] [
BibTeX]
G.A. Susto, A. Schirru, S. Pampuri, S. McLoone, A. Beghi.
Machine Learning for Predictive Maintenance: a Multiple Classifiers Approach. IEEE Transactions on Industrial Informatics, vol. 11(3), pp. 812 - 820, 2015
Abstract:
In this paper a multiple classifier machine learning methodology for Predictive Maintenance (PdM) is presented. PdM is a prominent strategy for dealing with maintenance issues given the increasing need to minimize downtime and associated costs. One of the challenges with PdM is generating so called ’health factors’ or quantitative indicators of the status of a system associated with a given maintenance issue, and determining their relationship to operating costs and failure risk. The proposed PdM methodology allows dynamical decision rules to be adopted for maintenance management and can be used with high-dimensionaland censored data problems. This is achieved by training multiple classification modules with different prediction horizons to provide different performance trade-offs in terms of frequency of unexpected breaks and unexploited lifetime and then employing this information in an operating cost based maintenance decision system to minimise expected costs. The effectiveness of the methodology is demonstrated using a simulated example and a benchmark semiconductor manufacturing maintenance problem.
[ abstract ] [
url] [
BibTeX]
G.A. Susto, S. McLoone.
Slow Release Drug Dissolution Profile Prediction in Pharmaceutical Manufacturing: a Multivariate and Machine Learning Approach. 11th IEEE Conference on Automation Science and Engineering, pp. 1218-1223, 2015
Abstract:
Slow release drugs must be manufactured to meettarget speci?cations with respect to dissolution curve pro?les.In this paper we consider the problem of identifying thedrivers of dissolution curve variability of a drug from historicalmanufacturing data. Several data sources are considered: rawmaterial parameters, coating data, loss on drying and pellet sizestatistics. The methodology employed is to develop predictivemodels using LASSO, a powerful machine learning algorithmfor regression with high-dimensional datasets. LASSO providessparse solutions facilitating the identi?cation of the most importantcauses of variability in the drug fabrication process.The proposed methodology is illustrated using manufacturingdata for a slow release drug.
[ abstract ] [
url] [
BibTeX]
S. Soatto, A. Chiuso.
Visual Scene Representations: Sufficiency, Minimality, Invariance and Deep Approximation. International Conference on Learning Representation (ICLR), Workshop Track, 2015 [
BibTeX]
2014
G. Belgioioso, A. Cenedese, G.I. Cirillo, F. Fraccaroli, G.A. Susto.
A Machine Learning based Approach for Gesture Recognition from Inertial Measurements. IEEE 53rd Conference on Decision and Control, pp. 4899--4904, 2014
Abstract:
The interaction based on gestures has become a
prominent approach to interact with electronic devices. In this
paper a Machine Learning (ML) based approach to gesture
recognition (GR) is illustrated; the proposed tool is freestanding
from user, device and device orientation. The tool has been
tested on a heterogeneous dataset representative of a typical
application of gesture recognition. In the present work two novel
ML algorithms based on Sparse Bayesian Learning are tested
versus other classification approaches already employed in
literature (Support Vector Machine, Relevance Vector Machine,
k-Nearest Neighbor, Discriminant Analysis). A second element
of novelty is represented by a Principal Component Analysis-
based approach, called Pre-PCA, that is shown to enhance
gesture recognition with heterogeneous working conditions.
Feature extraction techniques are also investigated: a Principal
Component Analysis based approach is compared to Frame-
Based Description methods.
[ abstract ] [
url] [
pdf] [
BibTeX]
A. Chiuso.
System Identification Techniques: Convexification, Regularization, Relaxation. Springer Encyclopedia of Systems and Control, 2014 [
BibTeX]