Domain adaptation through active learning strategies for anomaly classification in wastewater treatment plants

© 2024 The Authors This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).

Bibliographische Detailangaben
Veröffentlicht in:Water science and technology : a journal of the International Association on Water Pollution Research. - 1986. - 90(2024), 11 vom: 29. Dez., Seite 3123-3138
1. Verfasser: Bellamoli, Francesca (VerfasserIn)
Weitere Verfasser: Vian, Marco, Di Iorio, Mattia, Melgani, Farid
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2024
Zugriff auf das übergeordnete Werk:Water science and technology : a journal of the International Association on Water Pollution Research
Schlagworte:Journal Article active learning domain adaptation gradient boosting intermittent aeration multiclass classification wastewater treatment plants Wastewater
Beschreibung
Zusammenfassung:© 2024 The Authors This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).
The increasing use of intermittent aeration controllers in wastewater treatment plants (WWTPs) aims to reduce aeration costs via continuous ammonia and oxygen measurements but faces challenges in detecting sensor and process anomalies. Applying machine learning to this unbalanced, multivariate, multiclass classification challenge requires much data, difficult to obtain from a new plant. This study develops a machine learning algorithm to identify anomalies in intermittent aeration WWTPs, adaptable to new plants with limited data. Utilizing active learning, the method iteratively selects samples from the target domain to fine-tune a gradient-boosting model initially trained on data from 17 plants. Three sampling strategies were tested, with low probability and high entropy sampling proving effective in early adaptation, achieving an F2-score close to the optimal with minimal sample use. The objective is to deploy these models as decision support systems for WWTP management, providing a strategy for efficient model adaptation to new plants, and optimizing labeling efforts
Beschreibung:Date Completed 14.12.2024
Date Revised 14.12.2024
published: Print-Electronic
Citation Status MEDLINE
ISSN:0273-1223
DOI:10.2166/wst.2024.387