1
|
Haggerty R, Sun J, Yu H, Li Y. Application of machine learning in groundwater quality modeling - A comprehensive review. WATER RESEARCH 2023; 233:119745. [PMID: 36812816 DOI: 10.1016/j.watres.2023.119745] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 11/30/2022] [Accepted: 02/13/2023] [Indexed: 06/18/2023]
Abstract
Groundwater is a crucial resource across agricultural, civil, and industrial sectors. The prediction of groundwater pollution due to various chemical components is vital for planning, policymaking, and management of groundwater resources. In the last two decades, the application of machine learning (ML) techniques for groundwater quality (GWQ) modeling has grown exponentially. This review assesses all supervised, semi-supervised, unsupervised, and ensemble ML models implemented to predict any groundwater quality parameter, making this the most extensive modern review on this topic. Neural networks are the most used ML model in GWQ modeling. Their usage has declined in recent years, giving rise to more accurate or advanced techniques such as deep learning or unsupervised algorithms. Iran and the United States lead the world in areas modeled, with a wealth of historical data available. Nitrate has been modeled most exhaustively, targeted by nearly half of all studies. Advancements in future work will be made with further implementation of deep learning and explainable artificial intelligence or other cutting-edge techniques, application of these techniques for sparsely studied variables, the modeling of new or unique study areas, and the implementation of ML techniques for groundwater quality management.
Collapse
Affiliation(s)
- Ryan Haggerty
- Department of Civil and Environmental Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Jianxin Sun
- School of Computing, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Hongfeng Yu
- School of Computing, University of Nebraska-Lincoln, Lincoln, NE 68588, United States; Holland Computing Center, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Yusong Li
- Department of Civil and Environmental Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588, United States.
| |
Collapse
|
2
|
Yang Z, Li B, Wu H, Li M, Fan J, Chen M, Long J. Water consumption prediction and influencing factor analysis based on PCA-BP neural network in karst regions: a case study of Guizhou Province. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:33504-33515. [PMID: 36480138 PMCID: PMC9734345 DOI: 10.1007/s11356-022-24604-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 12/01/2022] [Indexed: 06/17/2023]
Abstract
Water consumption prediction is an integral part of water resource planning and management. Constructing a highly precise water consumption prediction model is of great significance for promoting regional water resource planning and high-quality development of the socio-economy. This paper focuses on the case of the typical karst region in Guizhou Province in China. Based on data on water consumption and its influencing factors spanning 2000-2020, the principal component analysis method was applied to reduce the dimensionality of 16 influencing factors of water consumption in Guizhou; the principal components extracted were used as input samples of the BP neural network and a PCA-BP neural network water consumption prediction model was conducted to predict water consumption of Guizhou Province in the next 10 years. The results show that the mean absolute error and mean relative error of prediction based on the constructed PCA-BP neural network were 2.8% and 2.9%, respectively, with superior performance in terms of prediction error and trends compared with other models. This paper discusses the main influencing factors of water consumption and analyzes their influence on the water consumption forecasting model so that the parameters of the water consumption forecasting model can be selected more efficiently and provide a reference for regional water consumption analysis and water resource planning and management.
Collapse
Affiliation(s)
- Zhicheng Yang
- Key Laboratory of Karst Georesources and Environment, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| | - Bo Li
- Key Laboratory of Karst Georesources and Environment, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China.
- College of Resource and Environmental Engineering, Guizhou University, Guiyang, Guizhou, 550025, China.
| | - Huang Wu
- Key Laboratory of Karst Georesources and Environment, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| | - MengHua Li
- Key Laboratory of Karst Georesources and Environment, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| | - Juan Fan
- College of Geology and Environment, Xi'an University of Science and Technology, Xian, 710077, Shanxi, China
| | - Mengyu Chen
- Key Laboratory of Karst Georesources and Environment, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| | - Jie Long
- State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing, 100012, China
| |
Collapse
|
3
|
Wu W, Chen H, Xu S, Liu T, Wang H, Li G, Wang J. Water Environment Characteristics and Water Quality Assessment of Water Source of Diversion System of Project from Hanjiang to Weihe River. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:2890. [PMID: 36833585 PMCID: PMC9957252 DOI: 10.3390/ijerph20042890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 01/19/2023] [Accepted: 01/30/2023] [Indexed: 06/18/2023]
Abstract
The water source of the water diversion project from the Hanjiang River to the Weihe River is one of the most important drinking water sources in China. Its water quality is related to the water safety of the long-distance water diversion system from the Hanjiang to Weihe Rivers. In order to explore the spatiotemporal change trend of the water environment characteristics of the water source area and analyze the key factors that have a greater impact on it, this study collected 9 types of water environment physical and chemical parameters from 10 water quality monitoring sections from 2017 to 2019; the water environment characteristics of the water source area of the water diversion system from the Hanjiang River to the Weihe River were analyzed and evaluated by using the variance analysis method, the hierarchical cluster analysis method and the water quality identification index evaluation method. The results were as follows. (1) There was spatiotemporal heterogeneity in a number of physical and chemical parameters in the water body of the water source. In terms of time, the concentrations of CODMn, COD, BOD5 and F- were higher in the flood season (July-October) than in the non-flood season (November-June). The concentrations of DO, TP and TN in the non-flood season were higher than those in the flood season. Spatially, the concentration of physical and chemical parameters of the water body in the Huangjinxia Reservoir area was higher than that in the Sanhekou Reservoir area. (2) The water quality of the water source area was good. The comprehensive water quality reached the Class II water quality standard of surface water environmental quality. Time showed that the comprehensive water quality in the non-flood season was better than that in the flood season. Spatially, the overall water quality of the tributaries was better than that of the mainstream. TN is a key indicator that affects water quality. (3) The spatial and temporal differences in water quality in water source areas are mainly affected by factors such as rainfall, temperature and human activities. This study can provide a scientific and data basis for related research on maintaining and improving the quality of the ecological environment of the water source areas of the Hanjiang to Weihe River Water Diversion System.
Collapse
Affiliation(s)
- Wei Wu
- State Key Laboratory of Eco-Hydraulics in Northwest Arid Region, Xi’an University of Technology, Xi’an 710048, China
| | - Hang Chen
- State Key Laboratory of Eco-Hydraulics in Northwest Arid Region, Xi’an University of Technology, Xi’an 710048, China
| | - Sheng Xu
- State Key Laboratory of Eco-Hydraulics in Northwest Arid Region, Xi’an University of Technology, Xi’an 710048, China
| | - Ting Liu
- Shaanxi Han Weihe Water Diversion Engineering Construction Co., Ltd., Xi’an 710086, China
| | - Hao Wang
- Shaanxi Han Weihe Water Diversion Engineering Construction Co., Ltd., Xi’an 710086, China
| | - Gaoqing Li
- State Key Laboratory of Eco-Hydraulics in Northwest Arid Region, Xi’an University of Technology, Xi’an 710048, China
| | - Jiawei Wang
- State Key Laboratory of Eco-Hydraulics in Northwest Arid Region, Xi’an University of Technology, Xi’an 710048, China
| |
Collapse
|
4
|
Yang H, Liu S. Water quality prediction in sea cucumber farming based on a GRU neural network optimized by an improved whale optimization algorithm. PeerJ Comput Sci 2022; 8:e1000. [PMID: 35721411 PMCID: PMC9202628 DOI: 10.7717/peerj-cs.1000] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 05/13/2022] [Indexed: 06/15/2023]
Abstract
Sea cucumber farming is an important part of China's aquaculture industry, and sea cucumbers have higher requirements for aquaculture water quality. This article proposes a sea cucumber aquaculture water quality prediction model that uses an improved whale optimization algorithm to optimize the gated recurrent unit neural network(IWOA-GRU), which provides a reference for the water quality control in the sea cucumber growth environment. This model first applies variational mode decomposition (VMD) and the wavelet threshold joint denoising method to remove mixed noise in water quality time series. Then, by optimizing the convergence factor, the convergence speed and global optimization ability of the whale optimization algorithm are strengthened. Finally, the improved whale optimization algorithm is used to construct a GRU prediction model based on optimal network weights and thresholds to predict sea cucumber farming water quality. The model was trained and tested using three water quality indices (dissolved oxygen, temperature and salinity) of sea cucumber culture waters in Shandong Peninsula, China, and compared with prediction models such as support vector regression (SVR), random forest (RF), convolutional neural network (CNN), recurrent neural network (RNN), and long short-term memory neural network (LSTM). Experimental results show that the prediction accuracy and generalization performance of this model are better than those of the other compared models.
Collapse
Affiliation(s)
- Huanhai Yang
- School of Computer Science and Technology, Shandong Technology and Business University, Yantai, Shandong, China
| | - Shue Liu
- Binzhou Medical University, Yantai, Shandong, China
| |
Collapse
|
5
|
Multi-Expression Programming (MEP): Water Quality Assessment Using Water Quality Indices. WATER 2022. [DOI: 10.3390/w14060947] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Water contamination is indeed a worldwide problem that threatens public health, environmental protection, and agricultural productivity. The distinctive attributes of machine learning (ML)-based modelling can provide in-depth understanding into increasing water quality challenges. This study presents the development of a multi-expression programming (MEP) based predictive model for water quality parameters, i.e., electrical conductivity (EC) and total dissolved solids (TDS) in the upper Indus River at two different outlet locations using 360 readings collected on a monthly basis. The optimized MEP models were assessed using different statistical measurements i.e., coefficient-of-determination (R2), root-mean-square error (RMSE), mean-absolute error (MAE), root-mean-square-logarithmic error (RMSLE) and mean-absolute-percent error (MAPE). The results show that the R2 in the testing phase (subjected to unseen data) for EC-MEP and TDS-MEP models is above 0.90, i.e., 0.9674 and 0.9725, respectively, reflecting the higher accuracy and generalized performance. Also, the error measures are quite lower. In accordance with MAPE statistics, both the MEP models shows an “excellent” performance in all three stages. In comparison with traditional non-linear regression models (NLRMs), the developed machine learning models have good generalization capabilities. The sensitivity analysis of the developed MEP models with regard to the significance of each input on the forecasted water quality parameters suggests that Cl and HCO3 have substantial impacts on the predictions of MEP models (EC and TDS), with a sensitiveness index above 0.90, although the influence of the Na is the less prominent. The results of this research suggest that the development of intelligence models for EC and TDS are cost effective and viable for the evaluation and monitoring of the quality of river water.
Collapse
|
6
|
Shiri N, Shiri J, Yaseen ZM, Kim S, Chung IM, Nourani V, Zounemat-Kermani M. Development of artificial intelligence models for well groundwater quality simulation: Different modeling scenarios. PLoS One 2021; 16:e0251510. [PMID: 34043648 PMCID: PMC8158946 DOI: 10.1371/journal.pone.0251510] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Accepted: 04/27/2021] [Indexed: 11/19/2022] Open
Abstract
Groundwater is one of the most important freshwater resources, especially in arid and semi-arid regions where the annual amounts of precipitation are small with frequent drought durations. Information on qualitative parameters of these valuable resources is very crucial as it might affect its applicability from agricultural, drinking, and industrial aspects. Although geo-statistics methods can provide insight about spatial distribution of quality factors, applications of advanced artificial intelligence (AI) models can contribute to produce more accurate results as robust alternative for such a complex geo-science problem. The present research investigates the capacity of several types of AI models for modeling four key water quality variables namely electrical conductivity (EC), sodium adsorption ratio (SAR), total dissolved solid (TDS) and Sulfate (SO4) using dataset obtained from 90 wells in Tabriz Plain, Iran; assessed by k-fold testing. Two different modeling scenarios were established to make simulations using other quality parameters and the geographical information. The obtained results confirmed the capabilities of the AI models for modeling the well groundwater quality variables. Among all the applied AI models, the developed hybrid support vector machine-firefly algorithm (SVM-FFA) model achieved the best predictability performance for both investigated scenarios. The introduced computer aid methodology provided a reliable technology for groundwater monitoring and assessment.
Collapse
Affiliation(s)
- Naser Shiri
- Faculty of Civil Engineering, University of Tabriz, Tabriz, Iran
| | - Jalal Shiri
- Water Engineering Department, Faculty of Agriculture, University of Tabriz, Tabriz, Iran
- Center of Excellence in Hydroinformatics, Faculty of Civil Engineering, University of Tabriz, Tabriz, Iran
| | - Zaher Mundher Yaseen
- School of Civil Engineering, Faculty of Engineering, Universiti Teknologi Malaysia (UTM), Johor Bahru, Malaysia
- * E-mail: ,
| | - Sungwon Kim
- Department of Railroad Construction and Safety Engineering, Dongyang University, Yeongju, South Korea
| | - Il-Moon Chung
- Department of Land, Water and Environment Research, Korea Institute of Civil Engineering and Building Technology, Goyang, South Korea
| | - Vahid Nourani
- Center of Excellence in Hydroinformatics, Faculty of Civil Engineering, University of Tabriz, Tabriz, Iran
- Faculty of Civil and Environmental Engineering, Near East University, Near East Boulevard, Nicosia, N. Cyprus, via Mersin 10, Turkey
| | | |
Collapse
|
7
|
Artificial Neural Network Optimized with a Genetic Algorithm for Seasonal Groundwater Table Depth Prediction in Uttar Pradesh, India. SUSTAINABILITY 2020. [DOI: 10.3390/su12218932] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Accurate information about groundwater level prediction is crucial for effective planning and management of groundwater resources. In the present study, the Artificial Neural Network (ANN), optimized with a Genetic Algorithm (GA-ANN), was employed for seasonal groundwater table depth (GWTD) prediction in the area between the Ganga and Hindon rivers located in Uttar Pradesh State, India. A total of 18 models for both seasons (nine for the pre-monsoon and nine for the post-monsoon) have been formulated by using groundwater recharge (GWR), groundwater discharge (GWD), and previous groundwater level data from a 21-year period (1994–2014). The hybrid GA-ANN models’ predictive ability was evaluated against the traditional GA models based on statistical indicators and visual inspection. The results appraisal indicates that the hybrid GA-ANN models outperformed the GA models for predicting the seasonal GWTD in the study region. Overall, the hybrid GA-ANN-8 model with an 8-9-1 structure (i.e., 8: inputs, 9: neurons in the hidden layer, and 1: output) was nominated optimal for predicting the GWTD during pre- and post-monsoon seasons. Additionally, it was noted that the maximum number of input variables in the hybrid GA-ANN approach improved the prediction accuracy. In conclusion, the proposed hybrid GA-ANN model’s findings could be readily transferable or implemented in other parts of the world, specifically those with similar geology and hydrogeology conditions for sustainable planning and groundwater resources management.
Collapse
|
8
|
Susceptibility Prediction of Groundwater Hardness Using Ensemble Machine Learning Models. WATER 2020. [DOI: 10.3390/w12102770] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Groundwater resources, unlike surface water, are more vulnerable to disturbances and contaminations, as they take a very long time and significant cost to recover. So, predictive modeling and prevention strategies can empower policymakers for efficient groundwater governance through informed decisions and recommendations. Due to the importance of groundwater quality modeling, the hardness susceptibility mapping using machine learning (ML) models has not been explored. For the first time, the current research aimed to predict groundwater hardness susceptibility using the ML models. The performance of two ensemble models of boosted regression trees (BRT) and random forest (RF) is investigated through the arrangement of a comparative study with multivariate discriminant analysis (MDA). According to the hardness values in 135 groundwater quality monitoring wells, the hard and soft water are determined; then, 11 predictor variables including distance from the sea (DFS), land use, elevation, distance from the river (DFR), depth to groundwater (DTGW), pH, precipitation (PCP), evaporation (E), groundwater level (GWL), curvature, and lithology are used for predicting the groundwater hardness susceptibility map. Results indicated that the variables of DFR, DTGW, elevation, and DFS had a higher contribution to the modeling process. So, the high harness areas are mostly related to low elevations, low DTGW, and proximity to river and sea, which facilitate the percolation conditions for minerals containing calcium or magnesium into groundwater.
Collapse
|