Optimising Venturi flume oxygen transfer efficiency using uncertainty-aware decision trees
© 2024 The Authors This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).
Publié dans: | Water science and technology : a journal of the International Association on Water Pollution Research. - 1986. - 90(2024), 12 vom: 29. Dez., Seite 3210-3240 |
---|---|
Auteur principal: | |
Autres auteurs: | |
Format: | Article en ligne |
Langue: | English |
Publié: |
2024
|
Accès à la collection: | Water science and technology : a journal of the International Association on Water Pollution Research |
Sujets: | Journal Article MNLR) Shapley analysis Venturi flume machine learning (ML) and flume design parameters ( regression analysis (MLR standard oxygen transfer efficiency (SOTE) uncertainty analysis Oxygen S88TT14065 |
Résumé: | © 2024 The Authors This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/). This study optimizes standard oxygen transfer efficiency (SOTE) in Venturi flumes investigating the impact of key parameters such as discharge per unit width (q), throat width (W), throat length (F), upstream entrance width (E), and gauge readings (Ha and Hb). To achieve this, a comprehensive experimental dataset was analyzed using multiple linear regression (MLR), multiple nonlinear regression (MNLR), gradient boosting machine (GBM), extreme gradient boosting (XRT), random forest (RF), M5 (pruned and unpruned), random tree (RT), and reduced error pruning (REP). Model performance was evaluated based on key metrics: correlation coefficient (CC), root mean square error (RMSE), and mean absolute error (MAE). Among the proposed models, M5_Unprun emerged as the top performer, exhibiting the highest CC (0.9455), the lowest RMSE (0.1918), and the lowest MAE (0.0030). GBM followed closely with a CC value of 0.9372, an RMSE value of 0.2067, and an MAE value of 0.0006. Uncertainty analysis further solidified the superior performance of M5_Unpruned (0.7522) and GBM (0.8055), with narrower prediction bands compared to other models, including MLR, which exhibited the widest band (1.4320). One-way analysis of variance confirmed the reliability and robustness of the proposed models. Sensitivity, correlation, and SHapley Additive exPlanations analyses identified W and Hb as the most influencing factors |
---|---|
Description: | Date Completed 29.12.2024 Date Revised 29.12.2024 published: Print Citation Status MEDLINE |
ISSN: | 0273-1223 |
DOI: | 10.2166/wst.2024.393 |