1
|
Usman US, Salh YHM, Yan B, Namahoro JP, Zeng Q, Sallah I. Fluoride contamination in African groundwater: Predictive modeling using stacking ensemble techniques. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 957:177693. [PMID: 39577590 DOI: 10.1016/j.scitotenv.2024.177693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Revised: 11/15/2024] [Accepted: 11/19/2024] [Indexed: 11/24/2024]
Abstract
Fluoride contamination of groundwater is a severe public health problem in Africa due to natural factors that include geological weathering of fluoride-bearing minerals and climatic conditions characterized by high evaporation rates that highly elevate fluoride levels. Anthropogenic activities further aggravate the problem and have affected millions of people in countries such as; South Africa, Tanzania, Nigeria, Ethiopia, Ghana, Kenya, Mauritania, Botswana, and Egypt. High fluoride levels of up to 10 mg/L have been encountered in parts of the East African Rift Valley, above the WHO's recommended limit of 1.5 mg/L, causing serious dental and skeletal fluorosis among the affected people. In this study, the distributions of F- in groundwater of Africa were forecast using an advanced stacking ensemble learning model based on 11 crucial groundwater physiochemical variables and 6270 accessible statistics of observed concentrations. The enhanced algorithm incorporates randomized trees, Tree-Bag, RF, DT, XGB, and ET Machine as base trainees, with a simple Naïve Bayes as the meta-analyzer. The model's AUC score of 0.86 accurately represented the uneven distributions of groundwater fluoride. The results showed that 20-35 % of the continent's eastern part and 10 % of its western region are at risk of having fluoride levels exceeding WHO limits, with an expected population of around 80 million. Regionally, fluoride contamination ranges from 0.1 to 3 mg/L in West Africa was range from 0.0 to 13.29 mg/L, 0.01-588 mg/L in East Africa, 0.04-65.9 mg/L in South Africa, and 0.1-10.5 mg/L in North and 0.01-1.9 mg/L in Central Africa. Na+ and HCO3- are Africa's leading primary causes of fluoride contamination, with Ca2+ and Cl- contributing to fluoride influence in some parts of the continent. This study helped identify health concerns linked to groundwater fluoride and offered guidance on assessing health risks in areas with sparse sample sizes.
Collapse
Affiliation(s)
- Usman Sunusi Usman
- School of Environmental Studies, China University of Geosciences Wuhan, 388 Lumo Road, Wuhan 430074, China
| | - Yousif Hassan Mohamed Salh
- School of Environmental Studies, China University of Geosciences Wuhan, 388 Lumo Road, Wuhan 430074, China
| | - Bing Yan
- School of Environmental Studies, China University of Geosciences Wuhan, 388 Lumo Road, Wuhan 430074, China.
| | | | - Qian Zeng
- School of Environmental Studies, China University of Geosciences Wuhan, 388 Lumo Road, Wuhan 430074, China
| | - Ismaila Sallah
- School of Environmental Studies, China University of Geosciences Wuhan, 388 Lumo Road, Wuhan 430074, China
| |
Collapse
|
2
|
Al-Kindi KM. Assessing the environmental factors affecting the sustainability of Aini Falaj system. PLoS One 2024; 19:e0301832. [PMID: 38743772 PMCID: PMC11093386 DOI: 10.1371/journal.pone.0301832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 02/27/2024] [Indexed: 05/16/2024] Open
Abstract
This study investigates the spatial distribution patterns and environmental factors influencing the Aini Falaj system in a specific study area. The research findings are presented through the lens of the following four categories: collinearity diagnostics, spatial autocorrelation analysis, kernel density (KD) findings, and multivariate geographically weighted regression (MGWR) analysis. The collinearity diagnostics were applied to examine the interrelationships among 18 independent environmental variables. The results indicate the absence of significant multicollinearity concerns, with most variables showing values below the critical threshold of five for variance inflation factors (VIFs). The selected variables indicate minimal intercorrelation, suggesting that researchers should be confident utilizing them in subsequent modelling or regression analyses. A spatial autocorrelation analysis using Moran's Index revealed positive spatial autocorrelation and significant clustering patterns in the distribution of live and non-functional Aini Falajs. High concentrations of live or dead Falajs tended to be surrounded by neighbouring areas with similar characteristics. These findings provide insights into the ecological preferences and habitat associations of Aini Falajs, thereby aiding conservation strategies and targeted studies. The kernel density (KD) analysis depicted distribution patterns of live and dry Aini Falajs through hotspots and cold spots. Specific regions exhibited high-density areas of live Falajs, indicating favourable environmental conditions or historical factors contributing to their concentrated distribution. Identifying these high-density zones can enhance our understanding of the spatial patterns and potential factors influencing the prevalence and sustainability of Aini Falajs. The multivariate geographically weighted regression (MGWR) models revealed strong associations between the live or dead status of Aini Falajs and environmental factors. The precipitation, topographic wetness index (TWI), aspect and slope exerted positive impacts on the live status, while evaporation, solar radiation, distance to drains and drain density exerted negative influences. Similar associations were observed for the dead status, emphasising the importance of controlling evaporation, shading mechanisms, proper drainage planning and sustainable land-use practices. This study provides valuable insights into the spatial distributions and factors influencing the live and dead status of Aini Falajs, thereby contributing to our understanding of their ecological dynamics and guiding conservation efforts and management strategies.
Collapse
Affiliation(s)
- Khalifa M. Al-Kindi
- UNESCO Chair of Aflaj Studies, Arco-hydrology, University of Nizwa, Nizwa, Oman
| |
Collapse
|
3
|
Chu Y, He B, He J, Zou H, Sun J, Wen D. Revealing the drivers and genesis of NO 3-N pollution classification in shallow groundwater of the Shaying River Basin by explainable machine learning and pathway analysis method. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 918:170742. [PMID: 38336062 DOI: 10.1016/j.scitotenv.2024.170742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 02/04/2024] [Accepted: 02/04/2024] [Indexed: 02/12/2024]
Abstract
Nitrate (NO3-N), as one of the ubiquitous contaminants in groundwater worldwide, has posed a serious threat to public health and the ecological environment. Despite extensive research on its genesis, little is known about the differences in the genesis of NO3-N pollution across different concentrations. Herein, a study of NO3-N pollution concentration classification was conducted using the Shaying River Basin as a typical area, followed by examining the genesis differences across different pollution classifications. Results demonstrated that three classifications (0-9.98 mg/L, 10.14-27.44 mg/L, and 28.34-136.30 mg/L) were effectively identified for NO3-N pollution using Jenks natural breaks method. Random forest exhibited superior performance in describing NO3-N pollution and was thereby affirmed as the optimal explanatory method. With this method coupling SEMs, the genesis of different NO3-N pollution classifications was proven to be significantly different. Specifically, strongly reducing conditions represented by Mn2+, Eh, and NO2-N played a dominant role in causing residual NO3-N at low levels. Manure and sewage (represented by Cl-) leaching into groundwater through precipitation is mainly responsible for NO3-N in the 10-30 mg/L classification, with a cumulative contribution rate exceeding 80 %. NO3-N concentrations >30 mg/L are primarily caused by the anthropogenic loads stemming from manure, sewage, and agricultural fertilization (represented by Cl- and K+) infiltrating under precipitation in vulnerable hydrogeological conditions. Pathway analysis based on standardized effect and significance further confirmed the rationality and reliability of the above results. The findings will provide more accurate information for policymakers in groundwater resource management to implement effective strategies to mitigate NO3-N pollution.
Collapse
Affiliation(s)
- Yanjia Chu
- Key Laboratory of Groundwater Conservation of MWR, China University of Geosciences, Beijing 100083, PR China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, PR China
| | - Baonan He
- Key Laboratory of Groundwater Conservation of MWR, China University of Geosciences, Beijing 100083, PR China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, PR China.
| | - Jiangtao He
- Key Laboratory of Groundwater Conservation of MWR, China University of Geosciences, Beijing 100083, PR China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, PR China.
| | - Hua Zou
- Key Laboratory of Groundwater Conservation of MWR, China University of Geosciences, Beijing 100083, PR China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, PR China
| | - Jichao Sun
- Institute of Hydrogeology and Environmental Geology, Chinese Academy of Geological Sciences, Shijiazhuang 050061, PR China
| | - Dongguang Wen
- Development Research Center of the Ministry of Water Resources, Beijing 100038, PR China
| |
Collapse
|
4
|
Sahour S, Khanbeyki M, Gholami V, Sahour H, Kahvazade I, Karimi H. Evaluation of machine learning algorithms for groundwater quality modeling. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:46004-46021. [PMID: 36715809 DOI: 10.1007/s11356-023-25596-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 01/24/2023] [Indexed: 06/18/2023]
Abstract
Groundwater quality is typically measured through water sampling and lab analysis. The field-based measurements are costly and time-consuming when applied over a large domain. In this study, we developed a machine learning-based framework to map groundwater quality in an unconfined aquifer in the north of Iran. Groundwater samples were provided from 248 monitoring wells across the region. The groundwater quality index (GWQI) in each well was measured and classified into four classes: very poor, poor, good, and excellent, according to their cut-off values. Factors affecting groundwater quality, including distance to industrial centers, distance to residential areas, population density, aquifer transmissivity, precipitation, evaporation, geology, and elevation, were identified and prepared in the GIS environment. Six machine learning classifiers, including extreme gradient boosting (XGB), random forest (RF), support vector machine (SVM), artificial neural networks (ANN), k-nearest neighbor (KNN), and Gaussian classifier model (GCM), were used to establish relationships between GWQI and its controlling factors. The algorithms were evaluated using the receiver operating characteristic curve (ROC) and statistical efficiencies (overall accuracy, precision, recall, and F-1 score). Accuracy assessment showed that ML algorithms provided high accuracy in predicting groundwater quality. However, RF was selected as the optimum model given its higher accuracy (overall accuracy, precision, and recall = 0.92; ROC = 0.95). The trained RF model was used to map GWQI classes across the entire region. Results showed that the poor GWQI class is dominant in the study area (covering 66% of the study area), followed by good (19% of the area), very poor (14% of the area), and excellent (< 1% of the area) classes. An area of very poor GWQI was observed in the north. Feature analysis indicated that the distance to industrial locations is the main factor affecting groundwater quality in the region. The study provides a cost-effective methodology in groundwater quality modeling that can be duplicated in other regions with similar hydrological and geological settings.
Collapse
Affiliation(s)
| | - Matin Khanbeyki
- Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Vahid Gholami
- Department of Range and Watershed Management and Dept. of Water Eng. and Environment, Faculty of Natural Resources, University of Guilan, Sowmeh Sara 1144, Guilan, Iran.
| | - Hossein Sahour
- Department of Geological and Environmental Sciences, Western Michigan University, Kalamazoo, MI, 49008, USA
| | - Irene Kahvazade
- Department of Computer Sciences, Western Michigan University, Kalamazoo, MI, 49008, USA
| | - Hadi Karimi
- Department of Geological and Environmental Sciences, Western Michigan University, Kalamazoo, MI, 49008, USA
| |
Collapse
|
5
|
Nafouanti MB, Li J, Nyakilla EE, Mwakipunda GC, Mulashani A. A novel hybrid random forest linear model approach for forecasting groundwater fluoride contamination. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:50661-50674. [PMID: 36800089 DOI: 10.1007/s11356-023-25886-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 02/07/2023] [Indexed: 02/18/2023]
Abstract
Groundwater quality in the Datong basin is threatened by high fluoride contamination. Laboratory analysis is a standard method for estimating groundwater quality parameters, which is expensive and time-consuming. Therefore, this paper proposes a hybrid random forest linear model (HRFLM) as a novel approach for estimating groundwater fluoride contamination. Light gradient boosting (LightGBM), random forest (RF), and extreme gradient boosting (Xgboost) were also employed in comparison with HRFLM for predicting fluoride contamination in groundwater. 202 groundwater samples were collected to draw up the performance capability of several models in forecasting subsurface water fluoride contamination. The performance of the models was assessed utilizing the receiver operating characteristic (ROC) area under the curve (AUC) and the confusion matrix (CM). The CM results reveal that with nine predictor variables, the hybrid HRFLM achieved an accuracy of 95%, outperforming the Xgboost, LightGBM, and RF models, which attained 88%, 88%, and 85%, respectively. Likewise, the AUC results of the hybrid HRFLM show high performance with an AUC of 0.98 compared to Xgboost, LightGBM, and RF, which achieved an AUC of 0.95, 0.90, and 0.88, respectively. The study demonstrates that the HRFLM can be applied as an advanced approach for groundwater fluoride contamination prediction in the Datong basin and could be adopted in various areas facing a similar challenge.
Collapse
Affiliation(s)
- Mouigni Baraka Nafouanti
- State Key Laboratory of Biogeology and Environmental Geology, China University of Geosciences, Wuhan, 430074, China.
| | - Junxia Li
- State Key Laboratory of Biogeology and Environmental Geology, China University of Geosciences, Wuhan, 430074, China.,China Laboratory of Basin Hydrology and Wetland Eco-restoration, China University of Geosciences, Wuhan, 430074, China
| | - Edwin E Nyakilla
- Department of Petroleum Engineering, Faculty of Earth Resources, China University of Geosciences, Wuhan, 430074, China
| | - Grant Charles Mwakipunda
- Department of Petroleum Engineering, Faculty of Earth Resources, China University of Geosciences, Wuhan, 430074, China
| | - Alvin Mulashani
- Department of Geosciences and Mining Technology, College of Engineering and Technology, Mbeya University of Science and Technology, Box 131, Mbeya, Tanzania
| |
Collapse
|
6
|
Saha A, Pal SC, Chowdhuri I, Roy P, Chakrabortty R. Effect of hydrogeochemical behavior on groundwater resources in Holocene aquifers of moribund Ganges Delta, India: Infusing data-driven algorithms. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2022; 314:120203. [PMID: 36150620 DOI: 10.1016/j.envpol.2022.120203] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 08/16/2022] [Accepted: 09/15/2022] [Indexed: 06/16/2023]
Abstract
One of the fundamental sustainable development goals has been recognized as having access to clean water for drinking purposes. In the Anthropocene era, rapid urbanization put further stress on water resources, and associated groundwater contamination expanded into a significant global environmental issue. Natural arsenic and related water pollution have already caused a burden issue on groundwater vulnerability and corresponding health hazard in and around the Ganges delta. A field based hydrogeochemical analysis has been carried out in the elevated arsenic prone areas of moribund Ganges delta, West Bengal, a part of western Ganga- Brahmaputra delta (GBD). New data driven heuristic algorithms are rarely used in groundwater vulnerability studies, specifically not yet used in the elevated arsenic prone areas of Ganges delta, India. Therefore, in the current study, emphasis has been given on integration of heuristic algorithms and random forest (RF) i.e., "RF-particle swarm optimization (PSO)", "RF-grey wolf optimizer (GWO)" and "RF-grasshopper optimization algorithm (GOA)", to identify groundwater vulnerable zones on the basis of field based hydrogeochemical parameters. In addition, correspondence health hazard of this area was assessed through human health hazard index. The spatial distribution of groundwater vulnerability revealed that middle-eastern and north-western part of the study area covered by very high and high, whereas central, western and south-western part are covered by very low and low vulnerability zones in outcomes of all the applied models. The evaluation result indicates that RF-GOA (AUC = 0.911) model performed the best considering testing dataset, and thereafter RF-GWO, RF-PSO and RF with AUC value is 0.901, 0.892 and 0.812 respectively. Findings also revealed the groundwater in this study region is quite unfavorable for drinking and irrigation purposes. The suggested models demonstrate their usefulness in foretelling sustainable groundwater resource management in various deltaic regions of the world through taking appropriate measures by policy-makers.
Collapse
Affiliation(s)
- Asish Saha
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India
| | - Subodh Chandra Pal
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India.
| | - Indrajit Chowdhuri
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India
| | - Paramita Roy
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India
| | - Rabin Chakrabortty
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India
| |
Collapse
|