1
|
Agbasi JC, Egbueri JC. Prediction of potentially toxic elements in water resources using MLP-NN, RBF-NN, and ANFIS: a comprehensive review. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024:10.1007/s11356-024-33350-6. [PMID: 38641692 DOI: 10.1007/s11356-024-33350-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 04/12/2024] [Indexed: 04/21/2024]
Abstract
Water resources are constantly threatened by pollution of potentially toxic elements (PTEs). In efforts to monitor and mitigate PTEs pollution in water resources, machine learning (ML) algorithms have been utilized to predict them. However, review studies have not paid attention to the suitability of input variables utilized for PTE prediction. Therefore, the present review analyzed studies that employed three ML algorithms: MLP-NN (multilayer perceptron neural network), RBF-NN (radial basis function neural network), and ANFIS (adaptive neuro-fuzzy inference system) to predict PTEs in water. A total of 139 models were analyzed to ascertain the input variables utilized, the suitability of the input variables, the trends of the ML model applications, and the comparison of their performances. The present study identified seven groups of input variables commonly used to predict PTEs in water. Group 1 comprised of physical parameters (P), chemical parameters (C), and metals (M). Group 2 contains only P and C; Group 3 contains only P and M; Group 4 contains only C and M; Group 5 contains only P; Group 6 contains only C; and Group 7 contains only M. Studies that employed the three algorithms proved that Groups 1, 2, 3, 5, and 7 parameters are suitable input variables for forecasting PTEs in water. The parameters of Groups 4 and 6 also proved to be suitable for the MLP-NN algorithm. However, their suitability with respect to the RBF-NN and ANFIS algorithms could not be ascertained. The most commonly predicted PTEs using the MLP-NN algorithm were Fe, Zn, and As. For the RBF-NN algorithm, they were NO3, Zn, and Pb, and for the ANFIS, they were NO3, Fe, and Mn. Based on correlation and determination coefficients (R, R2), the overall order of performance of the three ML algorithms was ANFIS > RBF-NN > MLP-NN, even though MLP-NN was the most commonly used algorithm.
Collapse
Affiliation(s)
- Johnson C Agbasi
- Department of Geology, Chukwuemeka Odumegwu Ojukwu University, Uli, Nigeria
| | - Johnbosco C Egbueri
- Department of Geology, Chukwuemeka Odumegwu Ojukwu University, Uli, Nigeria.
- Research Management Office (RMO), Chukwuemeka Odumegwu Ojukwu University, Anambra State, Nigeria.
| |
Collapse
|
2
|
Ahn SH, Jeong DH, Kim M, Lee TK, Kim HK. Prediction of groundwater quality index to assess suitability for drinking purpose using averaged neural network and geospatial analysis. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2023; 265:115485. [PMID: 37729698 DOI: 10.1016/j.ecoenv.2023.115485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 08/29/2023] [Accepted: 09/13/2023] [Indexed: 09/22/2023]
Abstract
Groundwater quality management is pivotal for ensuring public health and ecological resilience. However, the conventional water quality indices often face challenges related to parameter selection, geographic coverage, and scalability. The integration of machine learning and spatial analysis represents a promising methodological shift, allowing for high accuracy and adaptive management strategies. The Safe Groundwater Project in Unsupplied Areas (2017-2020) employed a comprehensive Groundwater Quality Index (GQI) to evaluate potable groundwater quality across South Korea, utilizing a large dataset comprising 28 water quality parameters and 3552 wells. This study revealed that over 50 % of the evaluated wells (Total 8326 wells) were inappropriate as sources of drinking water, indicating a pressing need for policy revision. The averaged neural network model achieved a high predictive accuracy of approximately 95 % for GQI grades, outperforming other classification models. The introduction of 2D spatial analysis in conjunction with machine learning algorithms notably increased the predictive accuracy for unevenly distributed groundwater samples. Moreover, this combined approach enabled the intuitive visualization of groundwater vulnerability across various regions, which can inform targeted interventions for effective resource allocation and management. This research represents a methodologically robust, interdisciplinary approach that holds significant implications for a framework for future groundwater quality management and vulnerability assessment.
Collapse
Affiliation(s)
- Seok Hyun Ahn
- Department of Environmental Engineering, Yonsei University, Wonju 26493, South Korea
| | - Do Hwan Jeong
- Soil and Groundwater Division, National Institute of Environmental Research, Incheon 22689, South Korea
| | - MoonSu Kim
- Soil and Groundwater Division, National Institute of Environmental Research, Incheon 22689, South Korea
| | - Tae Kwon Lee
- Department of Environmental Engineering, Yonsei University, Wonju 26493, South Korea.
| | - Hyun-Koo Kim
- Soil and Groundwater Division, National Institute of Environmental Research, Incheon 22689, South Korea.
| |
Collapse
|
3
|
Haggerty R, Sun J, Yu H, Li Y. Application of machine learning in groundwater quality modeling - A comprehensive review. WATER RESEARCH 2023; 233:119745. [PMID: 36812816 DOI: 10.1016/j.watres.2023.119745] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 11/30/2022] [Accepted: 02/13/2023] [Indexed: 06/18/2023]
Abstract
Groundwater is a crucial resource across agricultural, civil, and industrial sectors. The prediction of groundwater pollution due to various chemical components is vital for planning, policymaking, and management of groundwater resources. In the last two decades, the application of machine learning (ML) techniques for groundwater quality (GWQ) modeling has grown exponentially. This review assesses all supervised, semi-supervised, unsupervised, and ensemble ML models implemented to predict any groundwater quality parameter, making this the most extensive modern review on this topic. Neural networks are the most used ML model in GWQ modeling. Their usage has declined in recent years, giving rise to more accurate or advanced techniques such as deep learning or unsupervised algorithms. Iran and the United States lead the world in areas modeled, with a wealth of historical data available. Nitrate has been modeled most exhaustively, targeted by nearly half of all studies. Advancements in future work will be made with further implementation of deep learning and explainable artificial intelligence or other cutting-edge techniques, application of these techniques for sparsely studied variables, the modeling of new or unique study areas, and the implementation of ML techniques for groundwater quality management.
Collapse
Affiliation(s)
- Ryan Haggerty
- Department of Civil and Environmental Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Jianxin Sun
- School of Computing, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Hongfeng Yu
- School of Computing, University of Nebraska-Lincoln, Lincoln, NE 68588, United States; Holland Computing Center, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Yusong Li
- Department of Civil and Environmental Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588, United States.
| |
Collapse
|
4
|
El Amri A, M'nassri S, Nasri N, Nsir H, Majdoub R. Nitrate concentration analysis and prediction in a shallow aquifer in central-eastern Tunisia using artificial neural network and time series modelling. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:43300-43318. [PMID: 35091932 DOI: 10.1007/s11356-021-18174-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Accepted: 12/14/2021] [Indexed: 06/14/2023]
Abstract
Agricultural activities have become a major source of groundwater nitrate contamination. In this context, this study aims to analyse nitrate concentrations in a shallow aquifer of Mahdia-Kssour Essef in central-eastern Tunisia, identify the assignable sources, and predict the future levels using artificial neural network (ANN) and autoregressive integrated moving average (ARIMA) models. The results showed that nitrate concentrations measured in 21 pumping wells across the plain ranged from 17 to 521 mg L-1. A total of 67% of the monitoring points greatly exceed the standard guideline value of 50 mg L-1. The main relevant anthropogenic and natural factors, such as soil texture, land use, fertilizers application rates, livestock waste disposal, and groundwater table, are positively correlated with groundwater nitrate concentration. The ANN model showed good fitting between measured and simulated results with coefficient of determination (R2), root-mean-square error (RMSE), and mean absolute error (MAE) values of 0.88, 53.95, and 39.64, respectively. The ARIMA applied on annual average nitrate concentrations from 1998 to 2017 revealed that the best fitted model (p, d, q) is (1, 2, 1). The R2 value is approximately 0.36, and the Theil inequality coefficient and bias proportion values are small and close to zero. These results proved the ARIMA model's adequacy in forecasting annual average nitrate concentrations of 116 mg L-1 in 2025. These findings may be useful in making groundwater management decisions, particularly in rural and semi-arid areas, and the proposed ARIMA model could be used as a managed tool to monitor and reduce the nitrate intrusion into groundwater.
Collapse
Affiliation(s)
- Asma El Amri
- Laboratory of Research in Management and Control of Animal and Environmental Resources in Semi-aride Ecosystem, Higher Agronomic Institute of Chott Meriem, University of Sousse, BP 42, 4042, Chott Meriem, Sousse, Tunisia
| | - Soumaia M'nassri
- Laboratory of Research in Management and Control of Animal and Environmental Resources in Semi-aride Ecosystem, Higher Agronomic Institute of Chott Meriem, University of Sousse, BP 42, 4042, Chott Meriem, Sousse, Tunisia.
| | - Nessrine Nasri
- Higher Institute of Environmental Technologies, Urban Planning and Construction, University of Carthage, 2035, Charguia II, Tunis, Tunisia
- Laboratory in Hydraulic and Environmental Modelling, National Engineering School of Tunis, University of Tunis El Manar, BP 37, 1002, Belvedere, Tunis, Tunisia
| | - Hanen Nsir
- Laboratory of Research in Management and Control of Animal and Environmental Resources in Semi-aride Ecosystem, Higher Agronomic Institute of Chott Meriem, University of Sousse, BP 42, 4042, Chott Meriem, Sousse, Tunisia
| | - Rajouene Majdoub
- Laboratory of Research in Management and Control of Animal and Environmental Resources in Semi-aride Ecosystem, Higher Agronomic Institute of Chott Meriem, University of Sousse, BP 42, 4042, Chott Meriem, Sousse, Tunisia
| |
Collapse
|
5
|
Daily River Water Temperature Prediction: A Comparison between Neural Network and Stochastic Techniques. ATMOSPHERE 2021. [DOI: 10.3390/atmos12091154] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The temperature of river water (TRW) is an important factor in river ecosystem predictions. This study aims to compare two different types of numerical model for predicting daily TRW in the Warta River basin in Poland. The implemented models were of the stochastic type—Autoregressive (AR), Moving Average (MA), Autoregressive Moving Average (ARMA) and Autoregressive Integrated Moving Average (ARIMA)—and the artificial intelligence (AI) type—Adaptive Neuro Fuzzy Inference System (ANFIS), Radial Basis Function (RBF) and Group Method of Data Handling (GMDH). The ANFIS and RBF models had the most fitted outputs and the AR, ARMA and ARIMA patterns were the most accurate ones. The results showed that both of the model types can significantly present suitable predictions. The stochastic models have somewhat less error with respect to both the highest and lowest TRW deciles than the AIs and were found to be better for prediction studies, with the GMDH complex model in some cases reaching Root Mean Square Error (RMSE) = 0.619 °C and Nash-Sutcliff coefficient (NS) = 0.992, while the AR(2) simple linear model with just two inputs was partially able to achieve better results (RMSE = 0.606 °C and NS = 0.994). Due to these promising outcomes, it is suggested that this work be extended to other catchment areas to extend and generalize the results.
Collapse
|
6
|
Bedi S, Samal A, Ray C, Snow D. Comparative evaluation of machine learning models for groundwater quality assessment. ENVIRONMENTAL MONITORING AND ASSESSMENT 2020; 192:776. [PMID: 33219864 DOI: 10.1007/s10661-020-08695-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 10/20/2020] [Indexed: 06/11/2023]
Abstract
Contamination from pesticides and nitrate in groundwater is a significant threat to water quality in general and agriculturally intensive regions in particular. Three widely used machine learning models, namely, artificial neural networks (ANN), support vector machines (SVM), and extreme gradient boosting (XGB), were evaluated for their efficacy in predicting contamination levels using sparse data with non-linear relationships. The predictive ability of the models was assessed using a dataset consisting of 303 wells across 12 Midwestern states in the USA. Multiple hydrogeologic, water quality, and land use features were chosen as the independent variables, and classes were based on measured concentration ranges of nitrate and pesticide. This study evaluates the classification performance of the models for two, three, and four class scenarios and compares them with the corresponding regression models. The study also examines the issue of class imbalance and tests the efficacy of three class imbalance mitigation techniques: oversampling, weighting, and oversampling and weighting, for all the scenarios. The models' performance is reported using multiple metrics, both insensitive to class imbalance (accuracy) and sensitive to class imbalance (F1 score and MCC). Finally, the study assesses the importance of features using game-theoretic Shapley values to rank features consistently and offer model interpretability.
Collapse
Affiliation(s)
- Shine Bedi
- Computer Science and Engineering, University of Nebraska, Lincoln, NE, USA.
| | - Ashok Samal
- Computer Science and Engineering, University of Nebraska, Lincoln, NE, USA
| | | | - Daniel Snow
- Water Sciences Laboratory, University of Nebraska, Lincoln, NE, USA
| |
Collapse
|
7
|
Lee CM, Hamm SY, Cheong JY, Kim K, Yoon H, Kim M, Kim J. Contribution of nitrate-nitrogen concentration in groundwater to stream water in an agricultural head watershed. ENVIRONMENTAL RESEARCH 2020; 184:109313. [PMID: 32151840 DOI: 10.1016/j.envres.2020.109313] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Revised: 02/26/2020] [Accepted: 02/26/2020] [Indexed: 06/10/2023]
Abstract
This study characterized nitrate-nitrogen (NO3-N) concentrations in groundwater and stream water in an agricultural head watershed in South Korea and identified the pollution load of NO3-N as a result of the groundwater entering streams using field surveys, analyses of chemical constituents, and numerical modeling. The mean NO3-N concentration in groundwater was 7.373 mg/L, which is approximately 1.9 times higher than concentrations found in stream water. The groundwater and stream water samples belonged to the Ca-HCO3 type. The concentration of NO3-N in groundwater tended to increase in the lowland areas downstream. There was seasonal variations of NO3-N in both the groundwater and stream water samples, with increases in concentration during the dry season (January-April) and decreases during the wet season (June-October). The NO3-N load in stream water to that in groundwater (R) was higher during the wet season (September) than the dry season (March), with R distinctly increasing in upstream areas relative to downstream areas, indicating that during the wet season, a large amount of NO3-N is introduced into stream water from groundwater. By analyzing the relationship between groundwater and stream water and through NO3-N transport modeling, it was revealed that in the watershed, the nitrate-N load in stream water is greatly augmented by inputs from groundwater, particularly in the middle and downstream areas.
Collapse
Affiliation(s)
- Chung-Mo Lee
- Department of Geological Sciences, Pusan National University, Busan, 46241, South Korea
| | - Se-Yeong Hamm
- Department of Geological Sciences, Pusan National University, Busan, 46241, South Korea.
| | - Jae-Yeol Cheong
- Korea Radioactive Waste Agency, Gyeongju, 38062, South Korea
| | - Kangjoo Kim
- Department of Environmental Engineering, Kunsan National University, Kunsan, 54150, South Korea
| | - Heesung Yoon
- Groundwater Research Center, Korea Institute of Geoscience and Mineral Resources, Daejeon, 34132, South Korea
| | - MoonSu Kim
- Soil and Groundwater Division, National Institute of Environmental Research, Incheon, 22689, South Korea
| | - Jinsoo Kim
- Department of Spatial Information Engineering, Pukyong National University, Busan, 48513, South Korea
| |
Collapse
|
8
|
Quality assessment and artificial neural networks modeling for characterization of chemical and physical parameters of potable water. Food Chem Toxicol 2018; 118:212-219. [DOI: 10.1016/j.fct.2018.04.036] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Revised: 04/06/2018] [Accepted: 04/17/2018] [Indexed: 11/18/2022]
|
9
|
Application of artificial neural network in water quality index prediction: a case study in Little Akaki River, Addis Ababa, Ethiopia. ACTA ACUST UNITED AC 2018. [DOI: 10.1007/s40808-018-0437-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
10
|
Wagh VM, Panaskar DB, Muley AA. Estimation of nitrate concentration in groundwater of Kadava river basin-Nashik district, Maharashtra, India by using artificial neural network model. ACTA ACUST UNITED AC 2017. [DOI: 10.1007/s40808-017-0290-3] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
11
|
Groundwater flow modeling for impact assessment of port dredging works on coastal hydrogeology in the area of Al-Wakrah (Qatar). ACTA ACUST UNITED AC 2016. [DOI: 10.1007/s40808-016-0252-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
12
|
Simulation and analysis of temporal changes of groundwater depth using time series modeling. ACTA ACUST UNITED AC 2016. [DOI: 10.1007/s40808-016-0164-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|