Stochastic Modeling of Groundwater Fluoride Contamination : Introducing Lazy Learners

© 2019, National Ground Water Association.

Bibliographische Detailangaben
Veröffentlicht in:Ground water. - 1979. - 58(2020), 5 vom: 24. Sept., Seite 723-734
1. Verfasser: Khosravi, Khabat (VerfasserIn)
Weitere Verfasser: Barzegar, Rahim, Miraki, Shaghayegh, Adamowski, Jan, Daggupati, Prasad, Alizadeh, Mohammad Reza, Pham, Binh Thai, Alami, Mohammad Taghi
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2020
Zugriff auf das übergeordnete Werk:Ground water
Schlagworte:Journal Article Research Support, Non-U.S. Gov't Water Pollutants, Chemical Fluorides Q80VPU408O
Beschreibung
Zusammenfassung:© 2019, National Ground Water Association.
While it remains the primary source of safe drinking and irrigation water in northwest Iran's Maku Plain, the region's groundwater is prone to fluoride contamination. Accordingly, modeling techniques to accurately predict groundwater fluoride concentration are required. The current paper advances several novel data mining algorithms including Lazy learners [instance-based K-nearest neighbors (IBK); locally weighted learning (LWL); and KStar], a tree-based algorithm (M5P), and a meta classifier algorithm [regression by discretization (RBD)] to predict groundwater fluoride concentration. Drawing on several groundwater quality variables (e.g., Ca 2 + , Mg 2 + , Na + , K + , HCO 3 - , CO 3 2 - , SO 4 2 - , and Cl - concentrations), measured in each of 143 samples collected between 2004 and 2008, several models predicting groundwater fluoride concentrations were developed. The full dataset was divided into two subsets: 70% for model training (calibration) and 30% for model evaluation (validation). Models were validated using several statistical evaluation criteria and three visual evaluation approaches (i.e., scatter plots, Taylor and Violin diagrams). Although Na+ and Ca2+ showed the greatest positive and negative correlations with fluoride (r = 0.59 and -0.39, respectively), they were insufficient to reliably predict fluoride levels; therefore, other water quality variables, including those weakly correlated with fluoride, should be considered as inputs for fluoride prediction. The IBK model outperformed other models in fluoride contamination prediction, followed by KStar, RBD, M5P, and LWL. The RBD and M5P models were the least accurate in terms of predicting peaks in fluoride concentration values. Results of the current study can be used to support practical and sustainable management of water and groundwater resources
Beschreibung:Date Completed 17.03.2021
Date Revised 17.03.2021
published: Print-Electronic
Citation Status MEDLINE
ISSN:1745-6584
DOI:10.1111/gwat.12963