Prediction of flood sensitivity based on Logistic Regression, eXtreme Gradient Boosting, and Random Forest modeling methods

© 2024 The Authors This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).

Détails bibliographiques
Publié dans:Water science and technology : a journal of the International Association on Water Pollution Research. - 1986. - 89(2024), 10 vom: 01. Mai, Seite 2605-2624
Auteur principal: Wu, Ying (Auteur)
Autres auteurs: Zhang, Zhiming, Qi, Xiaotian, Hu, Wenhan, Si, Shuai
Format: Article en ligne
Langue:English
Publié: 2024
Accès à la collection:Water science and technology : a journal of the International Association on Water Pollution Research
Sujets:Journal Article Logistic Regression (LR) Random Forest (RF) eXtreme Gradient Boosting (XGBoost) flood sensitivity assessment
Description
Résumé:© 2024 The Authors This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).
Floods are one of the most destructive disasters that cause loss of life and property worldwide every year. In this study, the aim was to find the best-performing model in flood sensitivity assessment and analyze key characteristic factors, the spatial pattern of flood sensitivity was evaluated using three machine learning (ML) models: Logistic Regression (LR), eXtreme Gradient Boosting (XGBoost), and Random Forest (RF). Suqian City in Jiangsu Province was selected as the study area, and a random sample dataset of historical flood points was constructed. Fifteen different meteorological, hydrological, and geographical spatial variables were considered in the flood sensitivity assessment, 12 variables were selected based on the multi-collinearity study. Among the results of comparing the selected ML models, the RF method had the highest AUC value, accuracy, and comprehensive evaluation effect, and is a reliable and effective flood risk assessment model. As the main output of this study, the flood sensitivity map is divided into five categories, ranging from very low to very high sensitivity. Using the RF model (i.e., the highest accuracy of the model), the high-risk area covers about 44% of the study area, mainly concentrated in the central, eastern, and southern parts of the old city area
Description:Date Completed 01.06.2024
Date Revised 01.06.2024
published: Print-Electronic
Citation Status MEDLINE
ISSN:0273-1223
DOI:10.2166/wst.2024.146