1
|
Díaz-González L, Aguilar-Rodríguez RA, Pérez-Sansalvador JC, Lakouari N. AQuA-P: A machine learning-based tool for water quality assessment. JOURNAL OF CONTAMINANT HYDROLOGY 2025; 269:104498. [PMID: 39787877 DOI: 10.1016/j.jconhyd.2025.104498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2024] [Revised: 12/28/2024] [Accepted: 01/02/2025] [Indexed: 01/12/2025]
Abstract
This study addresses the critical challenge of assessing the quality of groundwater and surface water, which are essential resources for various societal needs. The main contribution of this study is the application of machine learning models for evaluating water quality, using a national database from Mexico that includes groundwater, lotic (flowing), lentic (stagnant), and coastal water quality parameters. Notably, no comparable water quality classification system currently exists. Five advanced machine learning techniques were employed: extreme gradient boosting (XGB), support vector machines, K-nearest neighbors, decision trees, and multinomial logistic regression. The performance of the models was evaluated using the accuracy, precision, and F1 score metrics. The decision tree models emerged as the most effective across all water body types, closely followed by XGB. Therefore, the decision tree models were integrated into the AQuA-P software, which is currently the only software of its kind. It is recommended that these innovative water classification models be used through the AQuA-P software to facilitate informed decision-making in water quality management. This software provides a probability-based classification system that contributes to a deeper understanding of water quality dynamics. Lastly, an open-access repository containing all the datasets and Python notebooks used in our analysis is provided, allowing for easy adaptation and implementation of our methodology for other datasets worldwide.
Collapse
Affiliation(s)
- L Díaz-González
- Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, Cuernavaca, Morelos 62209, Mexico..
| | - R A Aguilar-Rodríguez
- Maestría en Optimización y Cómputo Aplicado, Universidad Autónoma del Estado de Morelos, Cuernavaca, Morelos 62209, Mexico
| | - J C Pérez-Sansalvador
- Department of Computer Science, Instituto Nacional de Astrofísica, Optica y Electrónica, Luis Enrique Erro 1, Tonantzintla, 72840 Puebla, México; Consejo Nacional de Humanidades, Ciencias y Tecnologías (Conahcyt), Insurgentes Sur 1582, Mexico City 03940, Mexico
| | - N Lakouari
- Department of Computer Science, Instituto Nacional de Astrofísica, Optica y Electrónica, Luis Enrique Erro 1, Tonantzintla, 72840 Puebla, México; Consejo Nacional de Humanidades, Ciencias y Tecnologías (Conahcyt), Insurgentes Sur 1582, Mexico City 03940, Mexico.
| |
Collapse
|
2
|
Wani OA, Mahdi SS, Yeasin M, Kumar SS, Gagnon AS, Danish F, Al-Ansari N, El-Hendawy S, Mattar MA. Predicting rainfall using machine learning, deep learning, and time series models across an altitudinal gradient in the North-Western Himalayas. Sci Rep 2024; 14:27876. [PMID: 39537701 PMCID: PMC11561348 DOI: 10.1038/s41598-024-77687-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Accepted: 10/24/2024] [Indexed: 11/16/2024] Open
Abstract
Predicting rainfall is a challenging and critical task due to its significant impact on society. Timely and accurate predictions are essential for minimizing human and financial losses. The dependence of approximately 60% of agricultural land in India on monsoon rainfall implies the crucial nature of accurate rainfall prediction. Precise rainfall forecasts can facilitate early preparedness for disasters associated with heavy rains, enabling the public and government to take necessary precautions. In the North-Western Himalayas, where meteorological data are limited, the need for improved accuracy in traditional modeling methods for rainfall forecasting is pressing. To address this, our study proposes the application of advanced machine learning (ML) algorithms, including random forest (RF), support vector regression (SVR), artificial neural network (ANN), and k-nearest neighbour (KNN) along with various deep learning (DL) algorithms such as long short-term memory (LSTM), bi-directional LSTM, deep LSTM, gated recurrent unit (GRU), and simple recurrent neural network (RNN). These advanced techniques hold the potential to significantly improve the accuracy of rainfall prediction, offering hope for more reliable forecasts. Additionally, time series techniques, including autoregressive integrated moving average (ARIMA) and trigonometric, Box-Cox transform, arma errors, trend, and seasonal components (TBATS), are proposed for predicting rainfall across the altitudinal gradients of India's North-Western Himalayas. This approach can potentially revolutionise how we approach rainfall forecasting, ushering in a new era of accuracy and reliability. The effectiveness and accuracy of the proposed algorithms were assessed using meteorological data obtained from six weather stations at different elevations spanning from 1980 to 2021. The results indicate that DL methods exhibit the highest accuracy in predicting rainfall, as measured by the root mean squared error (RMSE) and mean absolute error (MAE), followed by ML algorithms and time series techniques. Among the DL algorithms, the accuracy order was bi-directional LSTM, LSTM, RNN, deep LSTM, and GRU. For the ML algorithms, the accuracy order was ANN, KNN, SVR, and RF. These findings suggest that altitude significantly affects the accuracy of the models, highlighting the need for additional weather stations in this mountainous region to enhance the precision of rainfall prediction.
Collapse
Affiliation(s)
- Owais Ali Wani
- Division of Agronomy, Faculty of Agriculture Wadoora, Sher-e-Kashmir University of Agricultural Sciences & Technology of Kashmir (SKUAST-K), Jammu and Kashmir, 193201, India
| | - Syed Sheraz Mahdi
- Division of Agronomy, Faculty of Agriculture Wadoora, Sher-e-Kashmir University of Agricultural Sciences & Technology of Kashmir (SKUAST-K), Jammu and Kashmir, 193201, India.
- Advanced Centre for Rainfed Agriculture (ACRA), Dhiansar, Bari-Brahmana-181133, SKUAST-Jammu, UT-J&K, India.
| | - Md Yeasin
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi, 110 012, India
| | - Shamal Shasang Kumar
- Department of Agronomy (Rootcrops), Ministry of Agriculture & Waterways (MOA & W), Suva City, 679, Fiji
| | - Alexandre S Gagnon
- School of Biological and Environmental Sciences, Liverpool John Moores University, Liverpool, L3 3AF, UK
| | - Faizan Danish
- Department of Mathematics, School of Advanced Sciences, VIT-AP University, Inavolu, Andhra Pradesh, 522237, India
| | - Nadhir Al-Ansari
- Civil, Environmental and Natural Resources Engineering, Lulea University of Technology, 97187, Lulea, Sweden.
| | - Salah El-Hendawy
- Department of Plant Production, College of Food and Agricultural Sciences, King Saud University, P.O. Box 2460, Riyadh 11451, Saudi Arabia
| | - Mohamed A Mattar
- Department of Agricultural Engineering, College of Food and Agriculture Sciences, King Saud University, P.O. Box 2460, Riyadh 11451, Saudi Arabia.
| |
Collapse
|
3
|
Lee B, Im JK, Han JW, Kang T, Kim W, Kim M, Lee S. Multiple remotely sensed datasets and machine learning models to predict chlorophyll-a concentration in the Nakdong River, South Korea. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:58505-58526. [PMID: 39316212 DOI: 10.1007/s11356-024-35005-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Accepted: 09/13/2024] [Indexed: 09/25/2024]
Abstract
The Nakdong River is a crucial water resource in South Korea, supplying water for various purposes such as potable water, irrigation, and recreation. However, the river is vulnerable to algal blooms due to the inflow of pollutants from multiple points and non-point sources. Monitoring chlorophyll-a (Chl-a) concentrations, a proxy for algal biomass is essential for assessing the trophic status of the river and managing its ecological health. This study aimed to improve the accuracy and reliability of Chl-a estimation in the Nakdong River using machine learning models (MLMs) and simultaneous use of multiple remotely sensed datasets. This study compared the performances of four MLMs: multi-layer perceptron (MLP), support vector machine (SVM), random forest (RF), and eXetreme Gradient Boosting (XGB) using three different input datasets: (1) two remotely sensed datasets (Sentinel-2 and Landsat-8), (2) standalone Sentinel-2, and (3) standalone Landsat-8. The results showed that the MLP model with multiple remotely sensed datasets outperformed other MLMs with 0.43 - 0.86 greater in R2 and 0.36 - 5.88 lower in RMSE. The MLP model demonstrated the highest performance across the range of Chl-a concentrations and predicted peaks above 20 mg/m3 relatively well compared to other models. This was likely due to the capacity of MLP to handle imbalanced datasets. The predictive map of the spatial distribution of Chl-a generated by MLP well captured the areas with high and low Chl-a concentrations. This study pointed out the impacts of imbalanced Chl-a concentration observations (dominated by low Chl-a concentrations) on the performance of MLMs. The data imbalance likely led to MLMs poorly trained for high Chl-a values, producing low prediction accuracy. In conclusion, this study demonstrated the value of multiple remotely sensed datasets in enhancing the accuracy and reliability of Chl-a estimation, mainly when using the MLP model. These findings would provide valuable insights into utilizing MLMs effectively for Chl-a monitoring.
Collapse
Affiliation(s)
- Byeongwon Lee
- Department of Environmental Science & Ecological Engineering, College of Life Sciences & Biotechnology, Korea University, 145, Anam-Ro, Seongbuk-Gu, Seoul, 02841, South Korea
| | - Jong Kwon Im
- National Institute of Environmental Research, 42, Hwangyeong-Ro, Seo-Gu, Incheon, 22689, South Korea
| | - Ji Woo Han
- Han River Environment Research Center, National Institute of Environmental Research, 42, Dumulmeori-Gil 68Beon-Gil, Yangseo-Myeon, Yangpyeong-Gun, 12585, South Korea
| | - Taegu Kang
- Han River Environment Research Center, National Institute of Environmental Research, 42, Dumulmeori-Gil 68Beon-Gil, Yangseo-Myeon, Yangpyeong-Gun, 12585, South Korea
| | - Wonkook Kim
- Department of Civil and Environmental Engineering, Pusan National University, 2, Busandaehak-Ro 63Beon-Gil, Geumjeong-Gu, Busan, 46241, South Korea
| | - Moonil Kim
- Division of ICT-Integrated Environment, Pyeongtaek University, 3825, Seodong-Daero, Pyeongtaek-Si, 17869, Gyeonggi-Do, South Korea
| | - Sangchul Lee
- Department of Environmental Science & Ecological Engineering, College of Life Sciences & Biotechnology, Korea University, 145, Anam-Ro, Seongbuk-Gu, Seoul, 02841, South Korea.
| |
Collapse
|
4
|
Pathakamuri PC, Villuri VGK, Pasupuleti S, Banerjee A, Venkatesh AS. A holistic approach for understanding the status of water quality and causes of its deterioration in a drought-prone agricultural area of Southeastern India. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:116765-116780. [PMID: 36114973 DOI: 10.1007/s11356-022-22906-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 09/03/2022] [Indexed: 06/15/2023]
Abstract
This study investigates the groundwater quality in the Kadiri Basin, Ananthapuramu district of Andhra Pradesh, India. Groundwater samples from 77 locations were collected and tested for the concentration of various physicochemical parameters. The collected data were assimilated in the form of a groundwater quality index to estimate groundwater quality (drinking and irrigation) using an information entropy-based weight determination approach (EWQI). The water quality maps obtained from the study area suggest a definite trend in groundwater contamination of the study area. Furthermore, the influence of different physicochemical parameters on groundwater quality was determined using machine learning techniques. Learning and prediction accuracies of four different techniques, namely artificial neural network (ANN), deep learning (DL), random forest (RF), and gradient boosting machine (GBM), were investigated. The performance of the ANN model (MEA = 11.23, RSME = 21.22, MAPE = 7.48, and R2 = 0.91) was found to be highly effective for the present dataset. The ANN model was then used to understand the relative influence of physicochemical parameters on groundwater quality. It was observed that the deterioration in groundwater quality in the study area was primarily due to the excess concentration of turbidity and iron values. The relatively higher concentration of sulfate and nitrate had caused a significant impact on the groundwater quality. The study has wider implications for modeling in similar drought-prone agricultural areas elsewhere for assessing the groundwater quality.
Collapse
Affiliation(s)
- Prabhakara Chowdary Pathakamuri
- Department of Mining Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, Dhanbad, 826004, Jharkhand, India
| | - Vasanta Govind Kumar Villuri
- Department of Mining Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, Dhanbad, 826004, Jharkhand, India
| | - Srinivas Pasupuleti
- Department of Civil Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, Dhanbad, 826004, Jharkhand, India.
| | - Ashes Banerjee
- Department of Civil Engineering, Alliance University, Bangalore, 562106, Karnataka, India
| | - Akella Satya Venkatesh
- Department of Applied Geology, Indian Institute of Technology (Indian School of Mines)), Dhanbad, 826004, Jharkhand, India
| |
Collapse
|
5
|
Xu L, Hao G, Li S, Song F, Zhao Y, Guo P. Prediction and sensitivity analysis of chlorophyll a based on a support vector machine regression algorithm. ENVIRONMENTAL MONITORING AND ASSESSMENT 2023; 195:698. [PMID: 37209292 DOI: 10.1007/s10661-023-11276-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Accepted: 04/19/2023] [Indexed: 05/22/2023]
Abstract
Outbreaks of planktonic algae seriously affect the water quality of rivers and are difficult to control. Based on the analysis of the temporal and spatial variation characteristics of environmental factors, this study uses a support vector machine regression (SVR) algorithm to establish a chlorophyll a (Chl-a) prediction model and conduct Chl-a sensitivity analysis. In 2018, the average Chl-a content was 126.25 ug/L. The maximum total nitrogen (TN) content was 16.68 mg/L and high year-round. The average NH4+-N and total phosphorous (TP) contents were only 0.78 and 0.18 mg/L. The content of NH4+-N was higher in spring and increased significantly along the water flow, while TP decreased slightly along the water flow. We used a radial basis function kernel SVR model and tenfold cross-validation method to optimize parameters. The penalty parameter c was 1.4142, the kernel function parameter g was 1, and the training and verification errors were only 0.032 and 0.067, respectively, indicating a good model fit. Based on a sensitivity analysis of the SVR prediction model, the maximum sensitivity coefficients of Chl-a to TP and WT were 0.571 and 0.394, respectively, and the contributions were 33% and 22%, respectively. The next highest sensitivity coefficients were those of DO (0.28, 16%) and pH (0.243, 14%). The sensitivity coefficients of TN and NH4+-N were the lowest. According to the current water environment pollution conditions, TP is the limiting factor of Chl-a in the Qingshui River, and it is also the main prevention and control factor of phytoplankton outbreak.
Collapse
Affiliation(s)
- Li Xu
- School of Energy and Environmental Engineering, Hebei University of Engineering, Handan, 056038, China
- Hebei Key Laboratory of Water Quality Enginerring and Comprehensive Utilization of Water Resources, Zhangjiakou, 075000, China
| | - Guizhen Hao
- Hebei Key Laboratory of Water Quality Enginerring and Comprehensive Utilization of Water Resources, Zhangjiakou, 075000, China.
- Department of Municipal and Environmental Engineering, Hebei University of Architecture, Zhangjiakou, 075000, China.
| | - Simin Li
- School of Energy and Environmental Engineering, Hebei University of Engineering, Handan, 056038, China
| | - Fengzhi Song
- Linyi Architectural Design and Research Institute Co.Ltd, Linyi, 276000, China
| | - Yong Zhao
- School of Energy and Environmental Engineering, Hebei University of Engineering, Handan, 056038, China
- Department of Municipal and Environmental Engineering, Hebei University of Architecture, Zhangjiakou, 075000, China
| | - Peiran Guo
- School of Energy and Environmental Engineering, Hebei University of Engineering, Handan, 056038, China
| |
Collapse
|
6
|
Abu El-Magd SA, Ismael IS, El-Sabri MAS, Abdo MS, Farhat HI. Integrated machine learning-based model and WQI for groundwater quality assessment: ML, geospatial, and hydro-index approaches. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:53862-53875. [PMID: 36864333 PMCID: PMC10119052 DOI: 10.1007/s11356-023-25938-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 02/10/2023] [Indexed: 06/19/2023]
Abstract
The demands upon the arid area for water supply pose threats to both the quantity and quality of social and economic activities. Thus, a widely used machine learning model, namely the support vector machines (SVM) integrated with water quality indices (WQI), was used to assess the groundwater quality. The predictive ability of the SVM model was assessed using a field dataset for groundwater from Abu-Sweir and Abu-Hammad, Ismalia, Egypt. Multiple water quality parameters were chosen as independent variables to build the model. The results revealed that the permissible and unsuitable class values range from 36 to 27%, 45 to 36%, and 68 to 15% for the WQI approach, SVM method and SVM-WQI model respectively. Besides, the SVM-WQI model shows a low percentage of the area for excellent class compared to the SVM model and WQI. The SVM model trained with all predictors with a mean square error (MSE) of 0.002 and 0.41; the models that had higher accuracy reached 0.88. Moreover, the study highlighted that SVM-WQI can be successfully implemented for the assessment of groundwater quality (0.90 accuracy). The resulting groundwater model in the study sites indicates that the groundwater is influenced by rock-water interaction and the effect of leaching and dissolution. Overall, the integrated ML model and WQI give an understanding of water quality assessment, which may be helpful in the future development of such areas.
Collapse
Affiliation(s)
| | - Ismael S Ismael
- Geology Department, Faculty of Science, Suez University, Suez, 43518, Egypt
| | | | - Mohamed Sayed Abdo
- Geology Department, Faculty of Science, Suez University, Suez, 43518, Egypt
| | - Hassan I Farhat
- Geology Department, Faculty of Science, Suez University, Suez, 43518, Egypt
| |
Collapse
|
7
|
Ni Q, Cao X, Tan C, Peng W, Kang X. An improved graph convolutional network with feature and temporal attention for multivariate water quality prediction. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:11516-11529. [PMID: 36094707 DOI: 10.1007/s11356-022-22719-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Accepted: 08/22/2022] [Indexed: 06/15/2023]
Abstract
The analysis and prediction of water quality are of great significance to water quality management and pollution control. In general, current water quality prediction methods are often aimed at single indicator, while the prediction effect is not ideal for multivariate water quality data. At the same time, there may be some correlations between multiple indicators which the conventional prediction models cannot capture. To resolve these problems, this paper proposes a deep learning model: Graph Convolutional Network with Feature and Temporal Attention (FTGCN), realizing the prediction for multivariable water quality data. Firstly, a feature attention mechanism based on multi-head self-attention is designed to capture the potential correlations between water indicators. Then, a temporal prediction module including temporal convolution and bidirectional GRU with a temporal attention mechanism is designed to deal with temporal dependencies of time series. Moreover, an adaptive graph learning mechanism is introduced to extract hidden associations between water quality indicators. An auto-regression module is also added to solve the disadvantage of non-linear nature of neural networks. Finally, an evolutionary algorithm is adopted to optimize the parameters of the proposed model. Our model is applied on four real-world water quality datasets, compared with other models for multivariate time series forecasting. Experimental results demonstrate that the proposed model has a better performance in water quality prediction than others by two indices.
Collapse
Affiliation(s)
- Qingjian Ni
- School of Computer Science and Engineering, Southeast University, Nanjing, China.
| | - Xuehan Cao
- School of Computer Science and Engineering, Southeast University, Nanjing, China
| | - Chaoqun Tan
- School of Civil Engineering, Southeast University, Nanjing, China
| | - Wenqiang Peng
- School of Computer Science and Engineering, Southeast University, Nanjing, China
| | - Xuying Kang
- School of Computer Science and Engineering, Southeast University, Nanjing, China
| |
Collapse
|
8
|
A Review on Pollution Treatment in Cement Industrial Areas: From Prevention Techniques to Python-Based Monitoring and Controlling Models. Processes (Basel) 2022. [DOI: 10.3390/pr10122682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Anthropogenic climate change, global warming, environmental pollution, and fossil fuel depletion have been identified as critical current scenarios and future challenges. Cement plants are one of the most impressive zones, emitting 15% of the worldwide contaminations into the environment among various industries. These contaminants adversely affect human well-being, flora, and fauna. Meanwhile, the use of cement-based substances in various fields, such as civil engineering, medical applications, etc., is inevitable due to the continuous increment of population and urbanization. To cope with this challenge, numerous filtering methods, recycling techniques, and modeling approaches have been introduced. Among the various statistical, mathematical, and computational modeling solutions, Python has received tremendous attention because of the benefit of smart libraries, heterogeneous data integration, and meta-models. The Python-based models are able to optimize the raw material contents and monitor the released pollutants in cement complex outputs with intelligent predictions. Correspondingly, this paper aims to summarize the performed studies to illuminate the resultant emissions from the cement complexes, their treatment methods, and the crucial role of Python modeling toward the high-efficient production of cement via a green and eco-friendly procedure. This comprehensive review sheds light on applying smart modeling techniques rather than experimental analysis for fundamental and applied research and developing future opportunities.
Collapse
|
9
|
Dadhich AP, Dadhich PN, Goyal R. Synthesis of water, sanitation, and hygiene (WaSH) spatial pattern in rural India: an integrated interpretation of WaSH practices. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:86873-86886. [PMID: 35804230 PMCID: PMC9668241 DOI: 10.1007/s11356-022-21918-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Accepted: 07/04/2022] [Indexed: 06/15/2023]
Abstract
Rural areas largely lack access to improved drinking water, sanitation, and hygiene (WaSH) facilities in India. This requires documentation of WaSH practices at the local level for better understanding and sustainable development. In this paper, a global positioning system (GPS)-based household survey was carried out in 67 villages of Phagi tehsil using individual questionnaires to evaluate the existing WaSH conditions spatially at the panchayat level. Three sub-indices were used for WaSH risk areas mapping and prediction with the integration of machine learning algorithms. Survey results indicate the improvement in the availability of toilet facilities; however, a gap was found between toilet ownership and its usage by villagers. Data show that only six panchayats have almost zero open defecation practices among the 32 panchayats of Phagi tehsil. The findings highlight that presence of toilets in house, water supply in toilets, and high literacy rate lead to an increase in toilet usage by the population. WaSH index scores indicate that panchayats like Mandawari, Mendwas, Chandma Kalan, and Rotwara have worst conditions and fall in the high-risk category. Moreover, support vector machine regression (SVMR) results reveal that WaSH scores are mainly affected by open defecation (r = 0.94), water supply in toilets (r = 0.92), and female members' participation in sanitation facilities decision-making (r = 0.53), followed by literacy rate (r = 0.33). Findings demonstrate the association between gender inequalities and WaSH conditions, and the potential of the WaSH index as a monitoring tool by local policymakers to shrink the WaSH gaps.
Collapse
Affiliation(s)
- Ankita Pran Dadhich
- Department of Civil Engineering, Malaviya National Institute of Technology, J.L.N. Marg, Jaipur, 302017 Rajasthan India
| | - Pran N. Dadhich
- Department of Civil Engineering, Poornima College of Engineering, ISI-6, RIICO Institutional Area, Sitapura, Jaipur, 302022 Rajasthan India
| | - Rohit Goyal
- Department of Civil Engineering, Malaviya National Institute of Technology, J.L.N. Marg, Jaipur, 302017 Rajasthan India
| |
Collapse
|
10
|
Muhammad Z, Jailani NAJ, Leh NAM, Hamid SA. Classification of Drinking Water Quality using Support Vector Machine (SVM) Algorithm. 2022 IEEE 12TH INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE) 2022. [DOI: 10.1109/iccsce54767.2022.9935657] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Affiliation(s)
- Zuraida Muhammad
- Universiti Teknologi MARA,Center for Electrical Engineering Studies,Pulau Pinang,MALAYSIA,13500
| | - Nur Aqilah Jak Jailani
- Universiti Teknologi MARA,Center for Electrical Engineering Studies,Pulau Pinang,MALAYSIA,13500
| | - Nor Adni Mat Leh
- Universiti Teknologi MARA,Center for Electrical Engineering Studies,Pulau Pinang,MALAYSIA,13500
| | - Shabinar Abd Hamid
- Universiti Teknologi MARA,Center for Electrical Engineering Studies,Pulau Pinang,MALAYSIA,13500
| |
Collapse
|
11
|
A Machine Learning-Based Water Potability Prediction Model by Using Synthetic Minority Oversampling Technique and Explainable AI. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:9283293. [PMID: 36177311 PMCID: PMC9514946 DOI: 10.1155/2022/9283293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 08/09/2022] [Accepted: 08/17/2022] [Indexed: 11/24/2022]
Abstract
During the last few decades, the quality of water has deteriorated significantly due to pollution and many other issues. As a consequence of this, there is a need for a model that can make accurate projections about water quality. This work shows the comparative analysis of different machine learning approaches like Support Vector Machine (SVM), Decision Tree (DT), Random Forest, Gradient Boost, and Ada Boost, used for the water quality classification. The model is trained on the Water Quality Index dataset available on Kaggle. Z-score is used to normalize the dataset before beginning the training process for the model. Because the given dataset is unbalanced, Synthetic Minority Oversampling Technique (SMOTE) is used to balance the dataset. Experiments results depict that Random Forest and Gradient Boost give the highest accuracy of 81%. One of the major issues with the machine learning model is lack of transparency which makes it impossible to evaluate the results of the model. To address this issue, explainable AI (XAI) is used which assists us in determining which features are the most important. Within the context of this investigation, Local Interpretable Model-agnostic Explanations (LIME) is utilized to ascertain the significance of the features.
Collapse
|
12
|
Predicting Irrigation Water Quality Indices Based on Data-Driven Algorithms: Case Study in Semiarid Environment. J CHEM-NY 2022. [DOI: 10.1155/2022/4488446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Ascertaining water quality for irrigational use by employing conventional methods is often time taking and expensive due to the determination of multiple parameters needed, especially in developing countries. Therefore, constructing precise and adequate models may be beneficial in resolving this problem in agricultural water management to determine the suitable water quality classes for optimal crop yield production. To achieve this objective, five machine learning (ML) models, namely linear regression (LR), random subspace (RSS), additive regression (AR), reduced error pruning tree (REPTree), and support vector machine (SVM), have been developed and tested for predicting of six irrigation water quality (IWQ) indices such as sodium adsorption ratio (SAR), percent sodium (%Na), permeability index (PI), Kelly ratio (KR), soluble sodium percentage (SSP), and magnesium hazards (MH) in groundwater of the Nand Samand catchment of Rajasthan. The accuracy of these models was determined serially using the mean squared error (MSE), correlation coefficients (r), mean absolute error (MAE), and root mean square error (RMSE). The SVM model showed the best-fit model for all irrigation indices during testing, that is, RMSE: 0.0662, 4.0568, 3.0168, 0.1113, 3.7046, and 5.1066; r: 0.9364, 0.9618, 0.9588, 0.9819, 0.9547, and 0.8903; MSE: 0.004381, 16.45781, 9.101218, 0.012383, 13.72447, and 26.078; MAE: 0.042, 3.1999, 2.3584, 0.0726, 2.9603, and 4.0582 for KR, MH, SSP, SAR, %Na, and PI, respectively. The KR and SAR values were predicted accurately by the SVM model in comparison to the observed values. As a result, machine learning algorithms can improve irrigation water quality characteristics, which is critical for farmers and crop management in various irrigation procedures. Additionally, the findings of this research suggest that ML models are effective tools for reliably predicting groundwater quality using general water quality parameters that may be acquired directly on periodical basis. Assessment of water quality indices may also help in deriving optimal strategies to utilise inferior quality water conjunctively with fresh water resources in the water-limited areas.
Collapse
|
13
|
Comparison between Regression Models, Support Vector Machine (SVM), and Artificial Neural Network (ANN) in River Water Quality Prediction. Processes (Basel) 2022. [DOI: 10.3390/pr10081652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Both anthropogenic and natural sources of pollution are regionally significant. Therefore, in order to monitor and protect the quality of Langat River from deterioration, we use Artificial Intelligence (AI) to model the river water quality. This study has applied several machine learning models (two support vector machines (SVMs), six regression models, and artificial neural network (ANN)) to predict total suspended solids (TSS), total solids (TS), and dissolved solids (DS)) in Langat River, Malaysia. All of the models have been assessed using root mean square error (RMSE), mean square error (MSE) as well as the determination of coefficient (R2). Based on the model performance metrics, the ANN model outperformed all models, while the GPR and SVM models exhibited the characteristic of over-fitting. The remaining machine learning models exhibited fair to poor performances. Although there are a few researches conducted to predict TDS using ANN, however, there are less to no research conducted to predict TS and TSS in Langat River. Therefore, this is the first study to evaluate the water quality (TSS, TS, and DS) of Langat River using the aforementioned models (especially SVM and the six regression models).
Collapse
|
14
|
Ahmadianfar I, Shirvani-Hosseini S, Samadi-Koucheksaraee A, Yaseen ZM. Surface water sodium (Na +) concentration prediction using hybrid weighted exponential regression model with gradient-based optimization. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:53456-53481. [PMID: 35287188 DOI: 10.1007/s11356-022-19300-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 02/15/2022] [Indexed: 06/14/2023]
Abstract
Undeniably, there is a link between water resources and people's lives and, consequently, economic development, which makes them vital in health and the environment. Proper water quality forecasting time series has a crucial role in giving on-time warnings for water pollution and supporting the decision-making of water resource management. The principal aim of this study is to develop a novel and cutting-edge ensemble data intelligence model named the weighted exponential regression and hybridized by gradient-based optimization (WER-GBO). Indeed, this is to reach more meticulous sodium (Na+) prediction monthly at Maroon River in the southwest of Iran. This developed model has advantages over other previous methodologies thanks to the following merits: (i) it can improve the performance and ability by mixing the outputs of four distinct data intelligence (DI) models, i.e., adaptive neuro-fuzzy inference system (ANFIS), least square support vector regression (LSSVM), Bayesian linear regression (BLR), and response surface regression (RSR); (ii) the proposed model can employ a Cauchy weighted function combined with an exponential-based regression model being optimized by GBO algorithm. To evaluate the performance of these models, diverse statistical indices and graphical assessment including error distributions, box plots, scatter-plots with confidence bounds and Taylor diagrams were conducted. According to obtained statistical metrics and verified validation procedures, the proposed WER-GBO resulted in promising accuracy compared to other models. Furthermore, the outcomes revealed the WER-GBO (R = 0.9712, RMSE = 0.639, and KGE = 0.948) reached more accurate and reliable results than other methods such as the ANFIS, LSSVM, BLR, and RSR for Na prediction in this study. Hence, the WER-GBO model can be considered a constructive technique to forecast the water quality parameters.
Collapse
Affiliation(s)
- Iman Ahmadianfar
- Department of Civil Engineering, Behbahan Khatam Alanbia University of Technology, Behbahan, Iran
| | | | | | - Zaher Mundher Yaseen
- Adjunct Research Fellow, USQ's Advanced Data Analytics Research Group, School of Mathematics Physics and Computing, University of Southern Queensland, QLD, 4350, Queensland, Australia.
- New Era and Development in Civil Engineering Research Group, Scientific Research Center, Al-Ayen University, Thi-Qar, 64001, Iraq.
- Institute for Big Data Analytics and Artificial Intelligence (IBDAAI), Kompleks Al-Khawarizmi, Universiti Teknologi MARA, Shah Alam, Selangor, 40450, Malaysia.
| |
Collapse
|
15
|
Forecasting Daytime Ground-Level Ozone Concentration in Urbanized Areas of Malaysia Using Predictive Models. SUSTAINABILITY 2022. [DOI: 10.3390/su14137936] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Ground-level ozone (O3) is one of the most significant forms of air pollution around the world due to its ability to cause adverse effects on human health and environment. Understanding the variation and association of O3 level with its precursors and weather parameters is important for developing precise forecasting models that are needed for mitigation planning and early warning purposes. In this study, hourly air pollution data (O3, CO, NO2, PM10, NmHC, SO2) and weather parameters (relative humidity, temperature, UVB, wind speed and wind direction) covering a ten year period (2003–2012) in the selected urban areas in Malaysia were analyzed. The main aim of this research was to model O3 level in the band of greatest solar radiation with its precursors and meteorology parameters using the proposed predictive models. Six predictive models were developed which are Multiple Linear Regression (MLR), Feed-Forward Neural Network (FFANN), Radial Basis Function (RBFANN), and the three modified models, namely Principal Component Regression (PCR), PCA-FFANN, and PCA-RBFANN. The performances of the models were evaluated using four performance measures, i.e., Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Index of Agreement (IA), and Coefficient of Determination (R2). Surface O3 level was best described using linear regression model (MLR) with the smallest calculated error (MAE = 6.06; RMSE = 7.77) and the highest value of IA and R2 (0.85 and 0.91 respectively). The non-linear models (FFANN and RBFANN) fitted the observed O3 level well, but were slightly less accurate compared to MLR. Nonetheless, all the unmodified models (MLR, ANN, and RBF) outperformed the modified-version models (PCR, PCA-FFANN, and PCA-RBFANN). Verification of the best model (MLR) was done using air pollutant data in 2018. The MLR model fitted the dataset of 2018 very well in predicting the daily O3 level in the specified selected areas with the range of R2 values of 0.85 to 0.95. These indicate that MLR can be used as one of the reliable methods to predict daytime O3 level in Malaysia. Thus, it can be used as a predictive tool by the authority to forecast high ozone concentration in providing early warning to the population.
Collapse
|
16
|
Ziyad Sami BF, Latif SD, Ahmed AN, Chow MF, Murti MA, Suhendi A, Ziyad Sami BH, Wong JK, Birima AH, El-Shafie A. Machine learning algorithm as a sustainable tool for dissolved oxygen prediction: a case study of Feitsui Reservoir, Taiwan. Sci Rep 2022; 12:3649. [PMID: 35256619 PMCID: PMC8901922 DOI: 10.1038/s41598-022-06969-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2021] [Accepted: 01/24/2022] [Indexed: 11/09/2022] Open
Abstract
Water quality status in terms of one crucial parameter such as dissolved oxygen (D.O.) has been an important concern in the Fei-Tsui reservoir for decades since it's the primary water source for Taipei City. Therefore, this study aims to develop a reliable prediction model to predict D.O. in the Fei-Tsui reservoir for better water quality monitoring. The proposed model is an artificial neural network (ANN) with one hidden layer. Twenty-nine years of water quality data have been used to validate the accuracy of the proposed model. A different number of neurons have been investigated to optimize the model's accuracy. Statistical indices have been used to examine the reliability of the model. In addition to that, sensitivity analysis has been carried out to investigate the model's sensitivity to the input parameters. The results revealed the proposed model capable of capturing the dissolved oxygen's nonlinearity with an acceptable level of accuracy where the R-squared value was equal to 0.98. The optimum number of neurons was found to be equal to 15-neuron. Sensitivity analysis shows that the model can predict D.O. where four input parameters have been included as input where the d-factor value was equal to 0.010. This main achievement and finding will significantly impact the water quality status in reservoirs. Having such a simple and accurate model embedded in IoT devices to monitor and predict water quality parameters in real-time would ease the decision-makers and managers to control the pollution risk and support their decisions to improve water quality in reservoirs.
Collapse
Affiliation(s)
- Balahaha Fadi Ziyad Sami
- Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
| | - Sarmad Dashti Latif
- Civil Engineering Department, College of Engineering, Komar University of Science and Technology, Sulaimany, Kurdistan Region, 46001, Iraq
| | - Ali Najah Ahmed
- Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
| | - Ming Fai Chow
- Discipline of Civil Engineering, School of Engineering, Monash University Malaysia, Jalan Lagoon Selatan, 47500, Bandar Sunway, Selangor, Malaysia.
| | | | - Asep Suhendi
- School of Electrical Engineering, Telkom University, Bandung, Indonesia
| | - Balahaha Hadi Ziyad Sami
- Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
| | - Jee Khai Wong
- Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
| | - Ahmed H Birima
- Department of Civil Engineering, College of Engineering, Qassim University, Unaizah, Saudi Arabia
| | - Ahmed El-Shafie
- Department of Civil Engineering, Faculty of Engineering, University of Malaya, 50603, Kuala Lumpur, Malaysia.,National Water and Energy Center, United Arab Emirates University, Al Ain, United Arab Emirates
| |
Collapse
|
17
|
Khullar S, Singh N. Water quality assessment of a river using deep learning Bi-LSTM methodology: forecasting and validation. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:12875-12889. [PMID: 33988840 DOI: 10.1007/s11356-021-13875-w] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 04/06/2021] [Indexed: 06/12/2023]
Abstract
Water is a prime necessity for the survival and sustenance of all living beings. Over the past few years, the water quality of rivers is adversely affected due to harmful wastes and pollutants. This ever-increasing water pollution is a big matter of concern as it deteriorating the water quality, making it unfit for any type of use. Recently, water quality modelling using machine learning techniques has generated a lot of interest and can be very beneficial in ecological and water resources management. However, they suffer many times from high computational complexity and high prediction error. The good performance of a deep neural network like long short-term memory network (LSTM) has been exploited for the time-series data. In this paper, a deep learning-based Bi-LSTM model (DLBL-WQA) is introduced to forecast the water quality factors of Yamuna River, India. The existing schemes do not perform missing value imputation and focus only on the learning process without including a loss function pertaining to training error. The proposed model shows a novel scheme which includes missing value imputation in the first phase, the second phase generates the feature maps from the given input data, the third phase includes a Bi-LSTM architecture to improve the learning process, and finally, an optimized loss function is applied to reduce the training error. Thus, the proposed model improves forecasting accuracy. Data comprising monthly samples of different water quality factors were collected for 6 years (2013-2019) at several locations in the Delhi region. Experimental results reveal that predicted values of the model and the actual values were in a close agreement and could reveal a future trend. The performance of our model was compared with various state of the art techniques like SVR, random forest, artificial neural network, LSTM, and CNN-LSTM. To check the accuracy, metrics like root mean square errors (RMSE), the mean absolute error (MAE), mean square error (MSE), and mean absolute percentage error (MAPE) have been used. Experimental analysis is carried out by measuring the COD and BOD levels. COD analysis reveals the MSE, RMSE, MAE, and MAPE values as 0.015, 0.117, 0.115, and 20.32, respectively, for the Palla region. Similarly, BOD analysis indicates the MSE, RMSE, MAE, and MAPE values as 0.107, 0.108, 0.124, and 18.22, respectively. A comparative analysis reveals that the proposed model outperforms all other models in terms of the best forecasting accuracy and lowest error rates.
Collapse
Affiliation(s)
- Sakshi Khullar
- Guru Gobind Singh Indraprastha University, West Patel Nagar, New Delhi, 110008, India.
| | - Nanhey Singh
- CSE, GGSIPU, AIACTR, Krishna Nagar Road Chacha Nahru Bal Chikitsalaya, Geeta Colony, Delhi, New Delhi, 110031, India
| |
Collapse
|
18
|
Machine-learning algorithms for forecast-informed reservoir operation (FIRO) to reduce flood damages. Sci Rep 2021; 11:24295. [PMID: 34934081 PMCID: PMC8692612 DOI: 10.1038/s41598-021-03699-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 12/08/2021] [Indexed: 11/08/2022] Open
Abstract
Water is stored in reservoirs for various purposes, including regular distribution, flood control, hydropower generation, and meeting the environmental demands of downstream habitats and ecosystems. However, these objectives are often in conflict with each other and make the operation of reservoirs a complex task, particularly during flood periods. An accurate forecast of reservoir inflows is required to evaluate water releases from a reservoir seeking to provide safe space for capturing high flows without having to resort to hazardous and damaging releases. This study aims to improve the informed decisions for reservoirs management and water prerelease before a flood occurs by means of a method for forecasting reservoirs inflow. The forecasting method applies 1- and 2-month time-lag patterns with several Machine Learning (ML) algorithms, namely Support Vector Machine (SVM), Artificial Neural Network (ANN), Regression Tree (RT), and Genetic Programming (GP). The proposed method is applied to evaluate the performance of the algorithms in forecasting inflows into the Dez, Karkheh, and Gotvand reservoirs located in Iran during the flood of 2019. Results show that RT, with an average error of 0.43% in forecasting the largest reservoirs inflows in 2019, is superior to the other algorithms, with the Dez and Karkheh reservoir inflows forecasts obtained with the 2-month time-lag pattern, and the Gotvand reservoir inflow forecasts obtained with the 1-month time-lag pattern featuring the best forecasting accuracy. The proposed method exhibits accurate inflow forecasting using SVM and RT. The development of accurate flood-forecasting capability is valuable to reservoir operators and decision-makers who must deal with streamflow forecasts in their quest to reduce flood damages.
Collapse
|
19
|
Jamei M, Ahmadianfar I, Karbasi M, Jawad AH, Farooque AA, Yaseen ZM. The assessment of emerging data-intelligence technologies for modeling Mg +2 and SO 4-2 surface water quality. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2021; 300:113774. [PMID: 34560461 DOI: 10.1016/j.jenvman.2021.113774] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 09/08/2021] [Accepted: 09/16/2021] [Indexed: 06/13/2023]
Abstract
The concentration of soluble salts in surface water and rivers such as sodium, sulfate, chloride, magnesium ions, etc., plays an important role in the water salinity. Therefore, accurate determination of the distribution pattern of these ions can improve better management of drinking water resources and human health. The main goal of this research is to establish two novel wavelet-complementary intelligence paradigms so-called wavelet least square support vector machine coupled with improved simulated annealing (W-LSSVM-ISA) and the wavelet extended Kalman filter integrated with artificial neural network (W-EKF- ANN) for accurate forecasting of the monthly), magnesium (Mg+2), and sulfate (SO4-2) indices at Maroon River, in Southwest of Iran. The monthly River flow (Q), electrical conductivity (EC), Mg+2, and SO4-2 data recorded at Tange-Takab station for the period 1980-2016. Some preprocessing procedures consisting of specifying the number of lag times and decomposition of the existing original signals into multi-resolution sub-series using three mother wavelets were performed to develop predictive models. In addition, the best subset regression analysis was designed to separately assess the best selective combinations for Mg+2 and SO4-2. The statistical metrics and authoritative validation approaches showed that both complementary paradigms yielded promising accuracy compared with standalone artificial intelligence (AI) models. Furthermore, the results demonstrated that W-LSSVM-ISA-C1 (correlation coefficient (R) = 0.9521, root mean square error (RMSE) = 0.2637 mg/l, and Kling-Gupta efficiency (KGE) = 0.9361) and W-LSSVM-ISA-C4 (R = 0.9673, RMSE = 0.5534 mg/l and KGE = 0.9437), using Dmey mother that outperformed the W-EKF-ANN for predicting Mg+2 and SO4-2, respectively.
Collapse
Affiliation(s)
- Mehdi Jamei
- Faculty of Engineering, Shohadaye Hoveizeh Campus of Technology, Shahid Chamran University of Ahvaz, Dashte Azadegan, Iran.
| | - Iman Ahmadianfar
- Department of Civil Engineering, Behbahan Khatam Alanbia University of Technology, Behbahan, Iran.
| | - Masoud Karbasi
- Water Engineering Department, Faculty of Agriculture, University of Zanjan, Zanjan, Iran.
| | - Ali H Jawad
- Faculty of Applied Sciences, Universiti Teknologi MARA, 40450, Shah Alam, Selangor, Malaysia.
| | - Aitazaz A Farooque
- Faculty of Sustainable Design Engineering, University of Prince Edward Island, Charlottetown, PE C1A4P3, Canada; School of Climate Change and Adaptation, University of Prince Edward Island, Charlottetown, PE, C1A4P3, Canada.
| | - Zaher Mundher Yaseen
- New era and Development in Civil engineering Research Group, Scientific Research Center, Al-Ayen University, Thi-Qar, 64001, Iraq; College of Creative Design, Asia University, Taichung City, Taiwan.
| |
Collapse
|
20
|
Khosravi K, Barzegar R, Golkarian A, Busico G, Cuoco E, Mastrocicco M, Colombani N, Tedesco D, Ntona MM, Kazakis N. Predictive modeling of selected trace elements in groundwater using hybrid algorithms of iterative classifier optimizer. JOURNAL OF CONTAMINANT HYDROLOGY 2021; 242:103849. [PMID: 34147829 DOI: 10.1016/j.jconhyd.2021.103849] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2020] [Revised: 06/06/2021] [Accepted: 06/09/2021] [Indexed: 06/12/2023]
Abstract
Trace element (TE) pollution in groundwater resources is one of the major concerns in both developing and developed countries as it can directly affect human health. Arsenic (As), Barium (Ba), and Rubidium (Rb) can be considered as TEs naturally present in groundwater due to water-rock interactions in Campania Plain (CP) aquifers, in South Italy. Their concentration could be predicted via some readily available input variables using an algorithm like the iterative classifier optimizer (ICO) for regression, and novel hybrid algorithms with additive regression (AR-ICO), attribute selected classifier (ASC-ICO) and bagging (BA-ICO). In this regard, 244 groundwater samples were collected from water wells within the CP and analyzed with respect to the electrical conductivity, pH, major ions and selected TEs. To develop the models, the available dataset was divided randomly into two subsets for model training (70% of the dataset) and evaluation (30% of the dataset), respectively. Based on the correlation coefficient (r), different input variables combinations were constructed to find the most effective one. Each model's performance was evaluated using common statistical and visual metrics. Results indicated that the prediction of As and Ba concentrations strongly depends on HCO3-, while Na+ is the most effective variable on Rb prediction. Also, the findings showed that the most powerful predictive models were those that used all the available input variables. According to models' performance evaluation metrics, the hybrid ASC-ICO outperformed other hybrid (BA- and AR-ICO) and standalone (ICO) algorithms to predict As and Ba concentrations, while both hybrid ASC- and BA-ICO models had higher accuracy and lower error than other algorithms for Rb prediction.
Collapse
Affiliation(s)
- Khabat Khosravi
- Department of Watershed Management Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Rahim Barzegar
- Department of Bioresource Engineering, McGill University, Montreal, Canada; Department of Geography & Environmental Studies, Wilfrid Laurier University, Waterloo, Canada
| | - Ali Golkarian
- Department of Watershed Management Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Gianluigi Busico
- Department of Geology, Lab. of Engineering Geology & Hydrogeology, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece; Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, University of Campania "Luigi Vanvitelli", Via Vivaldi 43, Caserta 81100, Italy.
| | - Emilio Cuoco
- Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, University of Campania "Luigi Vanvitelli", Via Vivaldi 43, Caserta 81100, Italy
| | - Micòl Mastrocicco
- Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, University of Campania "Luigi Vanvitelli", Via Vivaldi 43, Caserta 81100, Italy
| | - Nicolò Colombani
- Department of Materials, Environmental Sciences and Urban Planning, Polytechnic University of Marche, Via Brecce Bianche 12, 60131, Italy
| | - Dario Tedesco
- Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, University of Campania "Luigi Vanvitelli", Via Vivaldi 43, Caserta 81100, Italy
| | - Maria Margarita Ntona
- Department of Geology, Lab. of Engineering Geology & Hydrogeology, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece; Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, University of Campania "Luigi Vanvitelli", Via Vivaldi 43, Caserta 81100, Italy
| | - Nerantzis Kazakis
- Department of Geology, Lab. of Engineering Geology & Hydrogeology, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece.
| |
Collapse
|
21
|
Developing machine learning algorithms for meteorological temperature and humidity forecasting at Terengganu state in Malaysia. Sci Rep 2021; 11:18935. [PMID: 34556676 PMCID: PMC8460791 DOI: 10.1038/s41598-021-96872-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 08/17/2021] [Indexed: 12/12/2022] Open
Abstract
Accurately predicting meteorological parameters such as air temperature and humidity plays a crucial role in air quality management. This study proposes different machine learning algorithms: Gradient Boosting Tree (G.B.T.), Random forest (R.F.), Linear regression (LR) and different artificial neural network (ANN) architectures (multi-layered perceptron, radial basis function) for prediction of such as air temperature (T) and relative humidity (Rh). Daily data over 24 years for Kula Terengganu station were obtained from the Malaysia Meteorological Department. Results showed that MLP-NN performs well among the others in predicting daily T and Rh with R of 0.7132 and 0.633, respectively. However, in monthly prediction T also MLP-NN model provided closer standards deviation to actual value and can be used to predict monthly T with R 0.8462. Whereas in prediction monthly Rh, the RBF-NN model's efficiency was higher than other models with R of 0.7113. To validate the performance of the trained both artificial neural network (ANN) architectures MLP-NN and RBF-NN, both were applied to an unseen data set from observation data in the region. The results indicated that on either architecture of ANN, there is good potential to predict daily and monthly T and Rh values with an acceptable range of accuracy.
Collapse
|
22
|
Wong YJ, Shimizu Y, Kamiya A, Maneechot L, Bharambe KP, Fong CS, Nik Sulaiman NM. Application of artificial intelligence methods for monsoonal river classification in Selangor river basin, Malaysia. ENVIRONMENTAL MONITORING AND ASSESSMENT 2021; 193:438. [PMID: 34159431 DOI: 10.1007/s10661-021-09202-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Accepted: 06/07/2021] [Indexed: 06/13/2023]
Abstract
Rivers in Malaysia are classified based on water quality index (WQI) that comprises of six parameters, namely, ammoniacal nitrogen (AN), biochemical oxygen demand (BOD), chemical oxygen demand (COD), dissolved oxygen (DO), pH, and suspended solids (SS). Due to its tropical climate, the impact of seasonal monsoons on river quality is significant, with the increased occurrence of extreme precipitation events; however, there has been little discussion on the application of artificial intelligence models for monsoonal river classification. In light of these, this study had applied artificial neural network (ANN) and support vector machine (SVM) models for monsoonal (dry and wet seasons) river classification using three of the water quality parameters to minimise the cost of river monitoring and associated errors in WQI computation. A structured trial-and-error approach was applied on input parameter selection and hyperparameter optimisation for both models. Accuracy, sensitivity, and precision were selected as the performance criteria. For dry season, BOD-DO-pH was selected as the optimum input combination by both ANN and SVM models, with testing accuracy of 88.7% and 82.1%, respectively. As for wet season, the optimum input combinations of ANN and SVM models were BOD-pH-SS and BOD-DO-pH with testing accuracy of 89.5% and 88.0%, respectively. As a result, both optimised ANN and SVM models have proven their prediction capacities for river classification, which may be deployed as effective and reliable tools in tropical regions. Notably, better learning and higher capacity of the ANN model for dataset characteristics extraction generated better predictability and generalisability than SVM model under imbalanced dataset.
Collapse
Affiliation(s)
- Yong Jie Wong
- Research Center for Environmental Quality Management, Graduate School of Engineering, Kyoto University, 1-2 Yumihama, Otsu, Shiga, 520-0811, Japan.
| | - Yoshihisa Shimizu
- Research Center for Environmental Quality Management, Graduate School of Engineering, Kyoto University, 1-2 Yumihama, Otsu, Shiga, 520-0811, Japan
| | - Akinori Kamiya
- Research Center for Environmental Quality Management, Graduate School of Engineering, Kyoto University, 1-2 Yumihama, Otsu, Shiga, 520-0811, Japan
- International Environment Department, Nippon Koei Co., Ltd, Tokyo, Japan
| | - Luksanaree Maneechot
- Research Center for Environmental Quality Management, Graduate School of Engineering, Kyoto University, 1-2 Yumihama, Otsu, Shiga, 520-0811, Japan
| | - Khagendra Pralhad Bharambe
- Research Center for Environmental Quality Management, Graduate School of Engineering, Kyoto University, 1-2 Yumihama, Otsu, Shiga, 520-0811, Japan
| | - Chng Saun Fong
- Institute for Advanced Studies, University of Malaya, 50603, Kuala Lumpur, Malaysia
| | - Nik Meriam Nik Sulaiman
- Department of Chemical Engineering, Faculty of Engineering, University of Malaya, 50603, Kuala Lumpur, Malaysia
| |
Collapse
|
23
|
Latif SD. Concrete compressive strength prediction modeling utilizing deep learning long short-term memory algorithm for a sustainable environment. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2021; 28:30294-30302. [PMID: 33590396 DOI: 10.1007/s11356-021-12877-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2021] [Accepted: 02/05/2021] [Indexed: 06/12/2023]
Abstract
One of the most critical parameters in concrete design is compressive strength. As the compressive strength of concrete is correctly measured, time and cost can be decreased. Concrete strength is relatively resilient to impacts on the environment. The production of concrete compressive strength is greatly influenced by severe weather conditions and increases in humidity rates. In this research, a model has been developed to predict concrete compressive strength utilizing a detailed dataset obtained from previously published studies based on a deep learning method, namely, long short-term memory (LSTM), and a conventional machine learning (ML) algorithm, namely, support vector machine (SVM). The input variables of the model include cement, blast furnace slag, fly ash, water, superplasticizer, coarse aggregate, fine aggregate, and age of specimens. To demonstrate the efficiency of the proposed models, three statistical indices, namely, the coefficient of determination (R2), mean absolute error (MAE), and root mean square error (RMSE), were used. Findings shows that LSTM outperformed SVM with R2=0.98, R2= 0.78, MAE=1.861, MAE=6.152, and RMSE=2.36, RMSE=7.93, respectively. The results of this study suggest that high-performance concrete (HPC) compressive strength can be reliably measured using the proposed LSTM model.
Collapse
Affiliation(s)
- Sarmad Dashti Latif
- Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Selangor, Malaysia.
| |
Collapse
|
24
|
Exploring Artificial Intelligence Techniques for Groundwater Quality Assessment. WATER 2021. [DOI: 10.3390/w13091172] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Freshwater quality and quantity are some of the fundamental requirements for sustaining human life and civilization. The Water Quality Index is the most extensively used parameter for determining water quality worldwide. However, the traditional approach for the calculation of the WQI is often complex and time consuming since it requires handling large data sets and involves the calculation of several subindices. We investigated the performance of artificial intelligence techniques, including particle swarm optimization (PSO), a naive Bayes classifier (NBC), and a support vector machine (SVM), for predicting the water quality index. We used an SVM and NBC for prediction, in conjunction with PSO for optimization. To validate the obtained results, groundwater water quality parameters and their corresponding water quality indices were found for water collected from the Pindrawan tank area in Chhattisgarh, India. Our results show that PSO–NBC provided a 92.8% prediction accuracy of the WQI indices, whereas the PSO–SVM accuracy was 77.60%. The study’s outcomes further suggest that ensemble machine learning (ML) algorithms can be used to estimate and predict the Water Quality Index with significant accuracy. Thus, the proposed framework can be directly used for the prediction of the WQI using the measured field parameters while saving significant time and effort.
Collapse
|
25
|
Najah A, Teo FY, Chow MF, Huang YF, Latif SD, Abdullah S, Ismail M, El-Shafie A. Surface water quality status and prediction during movement control operation order under COVID-19 pandemic: Case studies in Malaysia. INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY : IJEST 2021; 18:1009-1018. [PMID: 33558809 PMCID: PMC7857098 DOI: 10.1007/s13762-021-03139-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/09/2020] [Accepted: 01/06/2021] [Indexed: 05/06/2023]
Abstract
Global concerns have been observed due to the outbreak and lockdown causal-based COVID-19, and hence, a global pandemic was announced by the World Health Organization (WHO) in January 2020. The Movement Control Order (MCO) in Malaysia acts to moderate the spread of COVID-19 through the enacted measures. Furthermore, massive industrial, agricultural activities and human encroachment were significantly reduced following the MCO guidelines. In this study, first, a reconnaissance survey was carried out on the effects of MCO on the health conditions of two urban rivers (i.e., Rivers of Klang and Penang) in Malaysia. Secondly, the effect of MCO lockdown on the water quality index (WQI) of a lake (Putrajaya Lake) in Malaysia is considered in this study. Finally, four machine learning algorithms have been investigated to predict WQI and the class in Putrajaya Lake. The main observations based on the analysis showed that noticeable enhancements of varying degrees in the WQI had occurred in the two investigated rivers. With regard to Putrajaya Lake, there is a significant increase in the WQI Class I, from 24% in February 2020 to 94% during the MCO month of March 2020. For WQI prediction, Multi-layer Perceptron (MLP) outperformed other models in predicting the changes in the index with a high level of accuracy. For sensitivity analysis results, it is shown that NH3-N and COD play vital rule and contributing significantly to predicting the class of WQI, followed by BOD, while the remaining three parameters (i.e. pH, DO, and TSS) exhibit a low level of importance.
Collapse
Affiliation(s)
- A. Najah
- Institute of Energy Infrastructure (IEI), Universiti Tenaga Nasional (UNITEN), 43000 Kajang, Selangor Darul Ehsan Malaysia
| | - F. Y. Teo
- Faculty of Science and Engineering, University of Nottingham Malaysia, 43500 Semenyih, Selangor Malaysia
| | - M. F. Chow
- Institute of Sustainable Energy (ISE), Universiti Tenaga Nasional (UNITEN), 43000 Kajang, Selangor Malaysia
| | - Y. F. Huang
- Department of Civil Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman, Kajang, Selangor Malaysia
| | - S. D. Latif
- Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000 Kajang, Selangor Malaysia
| | - S. Abdullah
- Air Quality and Environment Research Group, Faculty of Ocean Engineering Technology and Informatics, Universiti Malaysia Terengganu, 21030 Kuala Nerus, Terengganu Malaysia
| | - M. Ismail
- Faculty of Science and Marine Environment, Universiti Malaysia Terengganu, 21030 Kuala Nerus, Terengganu Malaysia
- Institute of Tropical Biodiversity and Sustainable Development, Universiti Malaysia Terengganu, 21030 Kuala Nerus, Malaysia
| | - A. El-Shafie
- Department of Civil Engineering, Faculty of Engineering, University of Malaya (UM), 50603 Kuala Lumpur, Malaysia
- National Water and Energy Center (NWC), United Arab Emirates University, P.O. Box. 15551, Al Ain, United Arab Emirates
| |
Collapse
|
26
|
Abba SI, Pham QB, Saini G, Linh NTT, Ahmed AN, Mohajane M, Khaledian M, Abdulkadir RA, Bach QV. Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2020; 27:41524-41539. [PMID: 32686045 DOI: 10.1007/s11356-020-09689-x] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Accepted: 06/10/2020] [Indexed: 05/14/2023]
Abstract
In recent decades, various conventional techniques have been formulated around the world to evaluate the overall water quality (WQ) at particular locations. In the present study, back propagation neural network (BPNN) and adaptive neuro-fuzzy inference system (ANFIS), support vector regression (SVR), and one multilinear regression (MLR) are considered for the prediction of water quality index (WQI) at three stations, namely Nizamuddin, Palla, and Udi (Chambal), across the Yamuna River, India. The nonlinear ensemble technique was proposed using the neural network ensemble (NNE) approach to improve the performance accuracy of the single models. The observed WQ parameters were provided by the Central Pollution Control Board (CPCB) including dissolved oxygen (DO), pH, biological oxygen demand (BOD), ammonia (NH3), temperature (T), and WQI. The performance of the models was evaluated by various statistical indices. The obtained results indicated the feasibility of the developed data intelligence models for predicting the WQI at the three stations with the superior modelling results of the NNE. The results also showed that the minimum values for root mean square (RMS) varied between 0.1213 and 0.4107, 0.003 and 0.0367, and 0.002 and 0.0272 for Nizamuddin, Palla, and Udi (Chambal), respectively. ANFIS-M3, BPNN-M4, and BPNN-M3 improved the performance with regard to an absolute error by 41%, 4%, and 3%, over other models for Nizamuddin, Palla, and Udi (Chambal) stations, respectively. The predictive comparison demonstrated that NNE proved to be effective and can therefore serve as a reliable prediction approach. The inferences of this paper would be of interest to policymakers in terms of WQ for establishing sustainable management strategies of water resources.
Collapse
Affiliation(s)
- Sani Isah Abba
- Department of Physical Planning Development, Yusuf Maitama Sule University Kano, Kano, Nigeria
| | - Quoc Bao Pham
- Institute of Research and Development, Duy Tan University, Danang 550000, Vietnam
- Faculty of Environmental and Chemical Engineering, Duy Tan University, Danang, 550000, Vietnam
| | - Gaurav Saini
- Department of Civil Engineering, Sharda University, Greater Noida, Uttar Pradesh, India
| | | | - Ali Najah Ahmed
- Institute of Energy Infrastructure (IEI), Civil Engineering Department, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
| | - Meriame Mohajane
- Soil and Environment Microbiology Team, Department of Biology, Faculty of Sciences, Moulay Ismail University, Meknes, Morocco
- Water Sciences and Environment Engineering Team, Department of Geology, Faculty of Sciences, Moulay Ismail University, Meknes, Morocco
| | - Mohammadreza Khaledian
- Water Engineering Dept., Faculty of Agricultural Sciences, University of Guilan, Rasht, Iran
- Department of Water Engineering and Environment, Caspian Sea Basin Research Center, Rasht, Iran
| | - Rabiu Aliyu Abdulkadir
- Department of Electrical and Electronic, Kano University of Science & Technology, Wudil, Wudil, Nigeria
| | - Quang-Vu Bach
- Sustainable Management of Natural Resources and Environment Research Group, Faculty of Environment and Labour Safety, Ton Duc Thang University, Ho Chi Minh City, Vietnam.
| |
Collapse
|
27
|
Banadkooki FB, Ehteram M, Ahmed AN, Teo FY, Ebrahimi M, Fai CM, Huang YF, El-Shafie A. Suspended sediment load prediction using artificial neural network and ant lion optimization algorithm. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2020; 27:38094-38116. [PMID: 32621196 DOI: 10.1007/s11356-020-09876-w] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Accepted: 06/23/2020] [Indexed: 06/11/2023]
Abstract
Suspended sediment load (SSL) estimation is a required exercise in water resource management. This article proposes the use of hybrid artificial neural network (ANN) models, for the prediction of SSL, based on previous SSL values. Different input scenarios of daily SSL were used to evaluate the capacity of the ANN-ant lion optimization (ALO), ANN-bat algorithm (BA) and ANN-particle swarm optimization (PSO). The Goorganrood basin in Iran was selected for this study. First, the lagged SSL data were used as the inputs to the models. Next, the rainfall and temperature data were used. Optimization algorithms were used to fine-tune the parameters of the ANN model. Three statistical indexes were used to evaluate the accuracy of the models: the root-mean-square error (RMSE), mean absolute error (MAE) and Nash-Sutcliffe efficiency (NSE). An uncertainty analysis of the predicting models was performed to evaluate the capability of the hybrid ANN models. A comparison of models indicated that the ANN-ALO improved the RMSE accuracy of the ANN-BA and ANN-PSO models by 18% and 26%, respectively. Based on the uncertainty analysis, it can be surmised that the ANN-ALO has an acceptable degree of uncertainty in predicting daily SSL. Generally, the results indicate that the ANN-ALO is applicable for a variety of water resource management operations.
Collapse
Affiliation(s)
| | - Mohammad Ehteram
- Department of Water Engineering and Hydraulic Structures, Faculty of Civil Engineering, Semnan University, Semnan, Iran
| | - Ali Najah Ahmed
- Institute of Energy Infrastructure (IEI), Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia.
| | - Fang Yenn Teo
- Faculty of Science and Engineering, University of Nottingham Malaysia, 43500, Semenyih, Selangor, Malaysia
| | | | - Chow Ming Fai
- Institute of Sustainable Energy (ISE), Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
| | - Yuk Feng Huang
- Department of Civil Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman, 43200, Kajang, Selangor, Malaysia
| | - Ahmed El-Shafie
- Department of Civil Engineering, Faculty of Engineering, University of Malaya (UM), 50603, Kuala Lumpur, Malaysia
- National Water Center, United Arab Emirates University, 15551, Al Ain, United Arab Emirates
| |
Collapse
|
28
|
Kumar PGD, Viswanath NC, Cyrus S, Abraham BM. Mixing data for multivariate statistical study of groundwater quality. ENVIRONMENTAL MONITORING AND ASSESSMENT 2020; 192:506. [PMID: 32651635 DOI: 10.1007/s10661-020-08465-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2019] [Accepted: 06/28/2020] [Indexed: 06/11/2023]
Abstract
In the present paper, a multivariate statistical modeling study of water quality data from different places of Kozhikode Gity, Kerala, India, has been conducted applying multiple linear regression (MLR), structural equation modeling (SEM), and adaptive neuro-fuzzy inference system (ANFIS) modeling. First, we combined water quality data from different places in the study area over different time periods to obtain a unified multiple linear regression (MLR) model. By mixing three data sets from different places and time periods in four different ways, different regression models were formed with total dissolved solids (TDS) as the dependent variable and calcium, magnesium, nitrate, sodium, chloride, potassium, total hardness, and sulfate as independent variables. The effectiveness of each model was then tested against a data set, which corresponded to a different period and location. One unmixed model and three mixed models showed similar performance. An SEM was developed for the data set, which is obtained by mixing all the three data sets. The same regression coefficients are found for the SEM and the corresponding MLR. An improvement in the sample size as a result of mixing of data sets could be thought of as the reason for this phenomenon. We thus selected the MLR obtained by mixing all three data sets as our unified model. For the mixed data set, we then developed an ANFIS model with calcium, magnesium, nitrate, sodium, chloride, potassium, total hardness, and sulfate as input variables and TDS as the output variable. On the external data set, the ANFIS model showed a better performance than the MLR model.
Collapse
Affiliation(s)
- P G Dileep Kumar
- Division of Civil Engineering, School of Engineering, Cochin University of Science and Technology, Kochi, Kerala, 682022, India
| | - Narayanan C Viswanath
- Department of Mathematics, Government Engineering College, Thrissur, Thrissur, Kerala, 680009, India.
| | - Sobha Cyrus
- Division of Civil Engineering, School of Engineering, Cochin University of Science and Technology, Kochi, Kerala, 682022, India
| | - Benny Mathews Abraham
- Division of Civil Engineering, School of Engineering, Cochin University of Science and Technology, Kochi, Kerala, 682022, India
| |
Collapse
|
29
|
Investigating the Influence of Meteorological Parameters on the Accuracy of Sea-Level Prediction Models in Sabah, Malaysia. SUSTAINABILITY 2020. [DOI: 10.3390/su12031193] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
This study aims to investigate the impact of meteorological parameters such as wind direction, wind speed, rainfall, and mean cloud cover on sea-level rise projections for different time horizons—2019, 2023, 2028, 2048, and 2068—at three stations located in Kudat, Sandakan, and Kota Kinabalu, which are districts in the state of Sabah, Malaysia. Herein, two different scenarios, scenario1 (SC1) and scenario2 (SC2), were investigated, with each scenario comprising a different combination of input parameters. This study proposes two artificial intelligence techniques: a multilayer perceptron neural network (MLP-ANN) and an adaptive neuro-fuzzy inference system (ANFIS). Furthermore, three evaluation indexes were adopted to assess the performance of the proposed models. These indexes are the correlation coefficient, root mean square error, and scatter index. The trial and error method were used to tune the hyperparameters: the number of neurons in the hidden layer, training algorithms, transfer and activation functions, and number and shape of the membership function for the proposed models. Results show that for the above mentioned three stations, the ANFIS model outperformed MLP-ANN by 0.740%, 6.23%, and 9.39%, respectively. To assess the uncertainties of the best model, ANFIS, the percentage of observed data bracketed by 95 percent predicted uncertainties (95PPUs) and the band width of 95 percent confidence intervals (d-factors) are selected. The obtained values bracketed by 95PPUs are show about 75.2%, 77.4%, 76.8% and the d-factor has a value of 0.27, 0.21 and 0.23 at Kudat, Sandakan and Kota Kinabalu stations, respectively. A comparison between the two scenarios shows that SC1 achieved a high level of accuracy on Kudat and Sandakan data, whereas SC2 outperformed SC1 on Kota Kinabalu data.
Collapse
|
30
|
Investigation on the Potential to Integrate Different Artificial Intelligence Models with Metaheuristic Algorithms for Improving River Suspended Sediment Predictions. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9194149] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Suspended sediment load (SLL) prediction is a significant field in hydrology and hydraulic sciences, as sedimentation processes change the soil quality. Although the adaptive neuro fuzzy system (ANFIS) and multilayer feed-forward neural network (MFNN) have been widely used to simulate hydrological variables, improving the accuracy of the above models is an important issue for hydrologists. In this article, the ANFIS and MFNN models were improved by the bat algorithm (BA) and weed algorithm (WA). Thus, the current paper introduces improved ANFIS and MFNN models: ANFIS–BA, ANFIS–WA, MFNN–BA, and MFNN–WA. The models were validated by applying river discharge, rainfall, and monthly suspended sediment load (SSL) for the Atrek basin in Iran. In addition, seven input groups were used to predict monthly SSL. The best models were identified through root-mean-square error (RMSE), Nash–Sutcliff efficiency (NSE), standard deviation ratio (RSR), percent bias (PBIAS) indices, and uncertainty analysis. For the ANFIS–BA model, RMSE and RSR varied from 1.5 to 2.5 ton/d and from 5% to 25%, respectively. In addition, a variation range of NSE was between very good and good performance (0. 75 to 0.85 and 0.85 to 1). The uncertainty analysis showed that the ANFIS–BA had more reliable performance compared to other models. Thus, the ANFIS–BA model has high potential for predicting SSL.
Collapse
|