1
|
Miller T, Michoński G, Durlik I, Kozlovska P, Biczak P. Artificial Intelligence in Aquatic Biodiversity Research: A PRISMA-Based Systematic Review. BIOLOGY 2025; 14:520. [PMID: 40427709 PMCID: PMC12109572 DOI: 10.3390/biology14050520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2025] [Revised: 04/30/2025] [Accepted: 05/06/2025] [Indexed: 05/29/2025]
Abstract
Freshwater ecosystems are increasingly threatened by climate change and anthropogenic activities, necessitating innovative and scalable monitoring solutions. Artificial intelligence (AI) has emerged as a transformative tool in aquatic biodiversity research, enabling automated species identification, predictive habitat modeling, and conservation planning. This systematic review follows the PRISMA framework to analyze AI applications in freshwater biodiversity studies. Using a structured literature search across Scopus, Web of Science, and Google Scholar, we identified 312 relevant studies published between 2010 and 2024. This review categorizes AI applications into species identification, habitat assessment, ecological risk evaluation, and conservation strategies. A risk of bias assessment was conducted using QUADAS-2 and RoB 2 frameworks, highlighting methodological challenges, such as measurement bias and inconsistencies in the model validation. The citation trends demonstrate exponential growth in AI-driven biodiversity research, with leading contributions from China, the United States, and India. Despite the growing use of AI in this field, this review also reveals several persistent challenges, including limited data availability, regional imbalances, and concerns related to model generalizability and transparency. Our findings underscore AI's potential in revolutionizing biodiversity monitoring but also emphasize the need for standardized methodologies, improved data integration, and interdisciplinary collaboration to enhance ecological insights and conservation efforts.
Collapse
Affiliation(s)
- Tymoteusz Miller
- Institute of Marine and Environmental Sciences, University of Szczecin, 71-415 Szczecin, Poland;
| | - Grzegorz Michoński
- Institute of Marine and Environmental Sciences, University of Szczecin, 71-415 Szczecin, Poland;
| | - Irmina Durlik
- Polish Society of Bioinformatics and Data Science, Biodata, 71-214 Szczecin, Poland; (I.D.); (P.B.)
- Faculty of Navigation, Maritime University of Szczecin, 70-500 Szczecin, Poland
| | - Polina Kozlovska
- Faculty of Economics, Finance and Management, University of Szczecin, 71-412 Szczecin, Poland;
| | - Paweł Biczak
- Polish Society of Bioinformatics and Data Science, Biodata, 71-214 Szczecin, Poland; (I.D.); (P.B.)
| |
Collapse
|
2
|
Zhao F, Huang B, Wang J, Shao X, Wu Q, Xi D, Liu Y, Chen Y, Zhang G, Ren Z, Chen J, Mizuno K. Seafloor debris detection using underwater images and deep learning-driven image restoration: A case study from Koh Tao, Thailand. MARINE POLLUTION BULLETIN 2025; 214:117710. [PMID: 39978130 DOI: 10.1016/j.marpolbul.2025.117710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2024] [Revised: 02/13/2025] [Accepted: 02/17/2025] [Indexed: 02/22/2025]
Abstract
Traditional detection and monitoring of seafloor debris present considerable challenges due to the high costs associated with underwater imaging devices and the complex environmental conditions in marine ecosystems. In response to these challenges, this field study conducted in Koh Tao, Thailand, proposed an innovative and cost-effective approach that leverages super-resolution reconstruction (SRR) technology in conjunction with an optimized object detection model based on YOLOv8. Super-resolution (SR) images reconstructed by seven SRR models were fed into the proposed Seafloor-Debris-YOLO (SFD-YOLO) model for seafloor debris object detection. RDN model achieved the highest reconstruction results with a signal-to-noise ratio (PSNR) of 41.02 dB and structural similarity (SSIM) of 95.08 % and attained state-of-the-art (SOTA) accuracy in debris detection with a mean Average Precision (mAP) of 91.2 % using RDN-reconstructed images with a magnification factor of 4. Additionally, this study provided an in-depth analysis of the influence of magnification factors within the SRR process, offering a quantitative comparison of images before and after reconstruction, as well as a comparative evaluation across various detection algorithms with a novel pretraining strategy. This approach to underwater survey methods, combined with SRR technology, marks an advancement in the field of seafloor debris monitoring, presenting practical solutions to enhance image quality affected by field conditions and enabling the precise identification of marine debris.
Collapse
Affiliation(s)
- Fan Zhao
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan.
| | - Baoxi Huang
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan
| | - Jiaqi Wang
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan
| | - Xinlei Shao
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan
| | - Qingyang Wu
- Department of Environmental Health Sciences, University of California, Los Angeles, CA 90095, USA
| | - Dianhan Xi
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan
| | - Yongying Liu
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan
| | - Yijia Chen
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan
| | - Guochen Zhang
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan
| | - Zhiyan Ren
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan
| | - Jundong Chen
- Data Science and AI Innovation Research Promotion Center, Shiga University, Hikone, Shiga 5228522, Japan
| | - Katsunori Mizuno
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 2778563, Japan.
| |
Collapse
|
3
|
Jia F, Wang W, Han Y, Du J, Zhang Y, Liu Z, Gao H. Predictive framework of vegetation resistance in channel flow. Sci Rep 2025; 15:7593. [PMID: 40038439 PMCID: PMC11880560 DOI: 10.1038/s41598-025-91668-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2024] [Accepted: 02/21/2025] [Indexed: 03/06/2025] Open
Abstract
Predicting vegetation-induced flow resistance remains a significant challenge due to the diverse and dynamic nature of river vegetation. Although numerous empirical models are available, they often fail to generalize across different environmental conditions, leading to inaccurate predictions. This study introduces a machine learning-based framework for predicting vegetation flow resistance, incorporating nine ML methods, including SVM, XGBoost, and BP. To improve predictive performance, optimization algorithms such as PSO, WSO, and RIME were applied. A comprehensive dataset of 490 samples across multiple scales was used to evaluate model accuracy, indicated: (1) The submergence ratio α and Froude number Fr are the most sensitive parameters affecting Cd, while missing parameters such as vegetation density λ and blockage ratio β significantly reduce accuracy; (2) XGBoost outperforms other models, achieving the highest predictive accuracy (R2 = 0.9552); (3) The framework remains stable across six parameter deficiency scenarios, with XGBoost maintaining R2 > 0.85 in all cases. In conclusion, this study highlights the transformative potential of the proposed predictive framework in overcoming the long-standing challenges of estimating flow resistance in vegetated channels. It provides valuable insights for sustainable river management, bolsters restoration efforts, and enhances predictive accuracy in complex, dynamic environments.
Collapse
Affiliation(s)
- Fengcong Jia
- College of Water Resources and Civil Engineering, China Agricultural University, Beijing, 100083, China
| | - Weijie Wang
- State Key Laboratory of Simulation and Regulation of Water Cycle in River Basin, China Institute of Water Resources and Hydropower Research, Beijing, 100038, China
| | - Yu Han
- College of Water Resources and Civil Engineering, China Agricultural University, Beijing, 100083, China.
| | - Jiayu Du
- College of Water Resources and Civil Engineering, China Agricultural University, Beijing, 100083, China
| | - Yue Zhang
- College of Water Resources and Civil Engineering, China Agricultural University, Beijing, 100083, China
| | - Zihan Liu
- College of Water Resources and Civil Engineering, China Agricultural University, Beijing, 100083, China
| | - Hairong Gao
- College of Water Resources and Civil Engineering, China Agricultural University, Beijing, 100083, China
| |
Collapse
|
4
|
Elshewey AM, Youssef RY, El-Bakry HM, Osman AM. Water potability classification based on hybrid stacked model and feature selection. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2025; 32:7933-7949. [PMID: 40048059 DOI: 10.1007/s11356-025-36120-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2024] [Accepted: 02/13/2025] [Indexed: 03/29/2025]
Abstract
Clean water requires accurate water quality categorization. A water potability (WP) dataset with pH, hardness, solids, chloramines, sulfate, conductivity, and other metrics for 3276 water bodies was used in this paper. After median imputation for missing values, normalization for feature scaling, and class imbalance correction using SMOTE, the Kaggle public dataset was prepared. With binary particle swarm optimization (BPSO) and binary whale optimization algorithm (BWAO), feature selection (FS) was used to determine the most important features for classification. A subset of seven essential characteristics is selected with the lowest average error of 0.3745 by the BPSO. Random forest (RF), gradient boosting (GB), support vector machine (SVM), Extra Tree (ET), decision tree (DT), and XGBoost are tested for WP prediction. The ET classifier ranked first, with 70.63% accuracy and 71.17% F1-score. Predictive performance was improved by stacking random forest, extra trees, and XGBoost base learners with Logistic Regression meta-learner. The stacking model improved with 69.53% accuracy, 70.23% F1-score, and 77.62% AUC. We found that stacking uses high-performing models to create a strong and balanced categorization framework. This paper shows that ensemble learning can improve WP categorization and that stacking may be a feasible way for measuring and managing water quality.
Collapse
Affiliation(s)
- Ahmed M Elshewey
- Department of Computer Science, Faculty of Computers and Information, Suez University, P.O. Box: 43221, Suez, Egypt.
| | - Rasha Y Youssef
- Department of Information Systems, Faculty of Computers and Information, Suez University, P.O. Box: 43221, Suez, Egypt
| | - Hazem M El-Bakry
- Department of Information Systems, Faculty of Computers and Information, Mansoura University, P.O. Box: 35516, Mansoura, 35516, Egypt
| | - Ahmed M Osman
- Department of Information Systems, Faculty of Computers and Information, Suez University, P.O. Box: 43221, Suez, Egypt
- Department of Information Systems, Faculty of Computers and Information, Mansoura University, P.O. Box: 35516, Mansoura, 35516, Egypt
| |
Collapse
|
5
|
Younas F, Sardar MF, Ullah Z, Ali J, Yu X, Zhu P, Guo W, Al-Anazi KM, Farah MA, Cui Z. Assessment of groundwater chemistry to predict arsenic contamination from a canal commanded area: applications of different machine learning models. ENVIRONMENTAL GEOCHEMISTRY AND HEALTH 2025; 47:46. [PMID: 39786503 DOI: 10.1007/s10653-024-02334-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2024] [Accepted: 12/10/2024] [Indexed: 01/12/2025]
Abstract
Groundwater arsenic (As), contamination is a significant issue worldwide including China and Pakistan, particularly in canal command areas. In this study, 131 groundwater samples were collected, and three machine learning models [Random Forest (RF), Logistic Regression (LR), and Artificial Neural Network (ANN)] were employed to predict As concentration. Descriptive statistics helped to conclude that all of the samples were inside the permitted limit of WHO for pH, Ca, Mg, Turbidity, Cl, K, Na, SO4, NO3, F and beyond limit of WHO for EC, HCO3, TDS, and As. RF suggested a median drop in Gini node impurity across all tree divisions. This predicted As contamination in samples due to presence of TDS, EC, HCO3- and turbidity in upper end of graph which expressed significance of these factors in contaminating water with Arsenic. Moreover, these factors were found positively correlated with Ar contamination. LR model expressed about best fitness of model. ANN classified large data set into two classes i.e. (1) Inside limit of WHO and (2) and outside limit of WHO. Total dissolved solids (TDS), turbidity, sodium (Na) and electrical conductivity (EC) were positively correlated with Ar (Arsenic concentration) in the collected samples. pH and K were negatively associated with Arsenic concentration of the observed samples. Confusion matrices and ROC-AUC scores evaluated that RF, model outperforming than LR, and ANN, in accuracy and sensitivity. Key variables influencing As concentration in the groundwater resources of the study area were identified, such parameters include TDS, chloride (Cl), bicarbonate (HCO3-) and turbidity. The study provided the complete profile of the 131 water samples which can be used to make strategies for the minimization of ground Water contamination for Rohri canal command area. Moreover, the steps can be taken to control the discussed parameters inside the WHO limit.
Collapse
Affiliation(s)
- Fazila Younas
- School of Environmental Science and Engineering, Shandong University, Qingdao, 266237, China
| | - Muhammad Fahad Sardar
- Key Laboratory of Ecological Prewarning, Protection and Restoration of Bohai Sea, Ministry of Natural Resources, School of Life Sciences, Shandong University, Qingdao, 266237, China.
| | - Zahid Ullah
- School of Environmental Studies, China University of Geosciences, Wuhan, 430070, China
| | - Jawad Ali
- Environment Research Institute, Shandong University, Qingdao, 266237, China
| | - Xiaona Yu
- Key Laboratory of Ecological Prewarning, Protection and Restoration of Bohai Sea, Ministry of Natural Resources, School of Life Sciences, Shandong University, Qingdao, 266237, China
| | - Pengcheng Zhu
- Key Laboratory of Ecological Prewarning, Protection and Restoration of Bohai Sea, Ministry of Natural Resources, School of Life Sciences, Shandong University, Qingdao, 266237, China
| | - Weihua Guo
- Key Laboratory of Ecological Prewarning, Protection and Restoration of Bohai Sea, Ministry of Natural Resources, School of Life Sciences, Shandong University, Qingdao, 266237, China
| | - Khalid Mashay Al-Anazi
- Department of Zoology, College of Science, King Saud University, 11451, Riyadh, Saudi Arabia
| | - Mohammad Abul Farah
- Department of Zoology, College of Science, King Saud University, 11451, Riyadh, Saudi Arabia
| | - Zhaojie Cui
- School of Environmental Science and Engineering, Shandong University, Qingdao, 266237, China.
| |
Collapse
|
6
|
Zheng Y, Li C, Zhou W, Xu Z, Zhang X, Cao W, Yang Z, Liu C. A novel method to estimate the 3D chlorophyll a distribution in the South China Sea surface waters using hydrometeorological parameters. Sci Rep 2024; 14:25516. [PMID: 39462029 PMCID: PMC11513029 DOI: 10.1038/s41598-024-76748-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Accepted: 10/16/2024] [Indexed: 10/28/2024] Open
Abstract
Chlorophyll a (Chl-a) is a key indicator of marine ecosystems, and certain hydro-meteorological parameters (HMPs) are highly correlated with its fluctuations. Here, relevant and accessible HMPs were used as inputs, combined with machine learning (ML) algorithms for estimating 3D Chl-a in the South China Sea (SCS). With the inputs of temperature, salinity, depth, wind speed, wind direction, sea surface pressure, and relative humidity, the LightGBM-based model performed well, achieving high R2 values of 0.985 and 0.789 in validation and testing sets, respectively. Based on a large number of in situ measurements, this model enables the estimation of the 3D distribution of summer Chl-a in the SCS over the past fifteen years using a 3D hydrographic dataset combined with surface meteorological parameters. The results show that the 3D distribution of the model estimated Chl-a is characterized similarly to the previous studies and can capture the effect of hydro-meteorological conditions on Chl-a distribution. The environmental variables affecting Chl-a were considered more comprehensively in this study, and the methodological framework has the potential to be applied to the low-cost monitoring of the remaining water quality parameters.
Collapse
Affiliation(s)
- Yuanning Zheng
- State Key Laboratory of Tropical Oceanography, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510300, China
- College of Marine Sciences, University of Chinese Academy of Sciences, Qingdao, 266400, China
| | - Cai Li
- State Key Laboratory of Tropical Oceanography, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510300, China.
| | - Wen Zhou
- State Key Laboratory of Tropical Oceanography, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510300, China
| | - Zhantang Xu
- State Key Laboratory of Tropical Oceanography, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510300, China
| | - Xianqing Zhang
- State Key Laboratory of Tropical Oceanography, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510300, China
| | - Wenxi Cao
- State Key Laboratory of Tropical Oceanography, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510300, China
| | - Zeming Yang
- State Key Laboratory of Tropical Oceanography, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou, 510300, China
| | - Changjian Liu
- South China Sea Survey Center, South China Sea Bureau, Ministry of Natural Resources, Guangzhou, 510300, China
| |
Collapse
|
7
|
Robertson AM, Piggott JJ, Penk MR. Improving multiple stressor-response models through the inclusion of nonlinearity and interactions among stressor gradients. ENVIRONMENTAL MONITORING AND ASSESSMENT 2024; 196:1026. [PMID: 39373764 DOI: 10.1007/s10661-024-13169-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Accepted: 09/24/2024] [Indexed: 10/08/2024]
Abstract
Stressor-response models are used to detect and predict changes within ecosystems in response to anthropogenic and naturally occurring stressors. While nonlinear stressor-response relationships and interactions between stressors are common in nature, predictive models often do not account for them due to perceived difficulties in the interpretation of results. We used Irish river monitoring data from 177 river sites to investigate if multiple stressor-response models can be improved by accounting for nonlinearity, interactions in stressor-response relationships and environmental context dependencies. Out of the six models of distinct biological responses, five models benefited from the inclusion of nonlinearity while all six benefited from the inclusion of interactions. The addition of nonlinearity means that we can better see the exponential increase in Trophic Diatom Index (TDI3) as phosphorus increases, inferring ecological conditions deteriorating at a faster rate with increasing phosphorus. Furthermore, our results show that the relationship between stressor and response has the potential to be dependent on other variables, as seen in the interaction of elevation with both siltation and nutrients in relation to Ephemeroptera, Plecoptera and Trichoptera (EPT) richness. Both relationships weakened at higher elevations, perhaps demonstrating that there is a decreased capacity for resilience to stressors at lower elevations due to greater cumulative effects. Understanding interactions such as this is vital to managing ecosystems. Our findings provide empirical support for the need to further develop and employ more complex modelling techniques in environmental assessment and management.
Collapse
Affiliation(s)
- Aoife M Robertson
- School of Natural Sciences, Trinity College Dublin, The University of Dublin, Dublin, Ireland.
| | - Jeremy J Piggott
- School of Natural Sciences, Trinity College Dublin, The University of Dublin, Dublin, Ireland
| | - Marcin R Penk
- School of Natural Sciences, Trinity College Dublin, The University of Dublin, Dublin, Ireland
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
| |
Collapse
|
8
|
Zare A, Ablakimova N, Kaliyev AA, Mussin NM, Tanideh N, Rahmanifar F, Tamadon A. An update for various applications of Artificial Intelligence (AI) for detection and identification of marine environmental pollutions: A bibliometric analysis and systematic review. MARINE POLLUTION BULLETIN 2024; 206:116751. [PMID: 39053264 DOI: 10.1016/j.marpolbul.2024.116751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 07/16/2024] [Accepted: 07/16/2024] [Indexed: 07/27/2024]
Abstract
Marine environmental pollution is one of the growing concerns of humans all over the world. Therefore, managing these marine pollutants has been a crucial matter for scientists in recent decades. Thus, researchers have tried to implement artificial intelligence (AI) to handle marine environmental pollutants. Therefore, in this manuscript, we performed a bibliometric analysis to understand the main applications of AI for managing marine environments. Therefore, we examined both PubMed online database and Google Scholar to find any research articles that discuss the applications of AI in managing marine environmental pollution. Ultimately, we found that AI can detect, locate, and even predict aquatic contaminants like oil fingerprinting, oil spills, oil spill damage, oil slicks, forecasting marine water quality, water quality development, harmful algal blooms, benthic sediment toxicity, as well as detection of marine debris with high accuracy.
Collapse
Affiliation(s)
| | - Nurgul Ablakimova
- Department of Pharmacology, West Kazakhstan Marat Ospanov Medical University, Aktobe 030012, Kazakhstan
| | - Asset Askerovich Kaliyev
- Department of Surgery, West Kazakhstan Marat Ospanov Medical University, Aktobe 030012, Kazakhstan
| | - Nadiar Maratovich Mussin
- Department of Surgery, West Kazakhstan Marat Ospanov Medical University, Aktobe 030012, Kazakhstan.
| | - Nader Tanideh
- Stem Cells Technology Research Center, Shiraz University of Medical Sciences, Shiraz 71348-14336, Iran; Department of Pharmacology, Medical School, Shiraz University of Medical Sciences, Shiraz 71348-14336, Iran
| | - Farhad Rahmanifar
- Department of Basic Sciences, School of Veterinary Medicine, Shiraz University, Shiraz, Iran
| | - Amin Tamadon
- Department for Natural Sciences, West Kazakhstan Marat Ospanov Medical University, Aktobe, Kazakhstan
| |
Collapse
|
9
|
Meng R, Du X, Ge K, Wu C, Zhang Z, Liang X, Yang J, Zhang H. Does climate change increase the risk of marine toxins? Insights from changing seawater conditions. Arch Toxicol 2024; 98:2743-2762. [PMID: 38795135 DOI: 10.1007/s00204-024-03784-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 05/08/2024] [Indexed: 05/27/2024]
Abstract
Marine toxins produced by marine organisms threaten human health and impose a heavy public health burden on coastal countries. Lately, there has been an emergence of marine toxins in regions that were previously unaffected, and it is believed that climate change may be a significant factor. This paper systematically summarizes the impact of climate change on the risk of marine toxins in terms of changes in seawater conditions. From our findings, climate change can cause ocean warming, acidification, stratification, and sea-level rise. These climatic events can alter the surface temperature, salinity, pH, and nutrient conditions of seawater, which may promote the growth of various algae and bacteria, facilitating the production of marine toxins. On the other hand, climate change may expand the living ranges of marine organisms (such as algae, bacteria, and fish), thereby exacerbating the production and spread of marine toxins. In addition, the sources, distribution, and toxicity of ciguatoxin, tetrodotoxin, cyclic imines, and microcystin were described to improve public awareness of these emerging marine toxins. Looking ahead, developing interdisciplinary cooperation, strengthening monitoring of emerging marine toxins, and exploring more novel approaches are essential to better address the risks of marine toxins posed by climate change. Altogether, the interrelationships between climate, marine ecology, and marine toxins were analyzed in this study, providing a theoretical basis for preventing and managing future health risks from marine toxins.
Collapse
Affiliation(s)
- Ruiyang Meng
- College of Public Health, Zhengzhou University, Zhengzhou, 450001, China
| | - Xingde Du
- College of Public Health, Zhengzhou University, Zhengzhou, 450001, China
| | - Kangfeng Ge
- College of Public Health, Zhengzhou University, Zhengzhou, 450001, China
| | - Chunrui Wu
- College of Public Health, Zhengzhou University, Zhengzhou, 450001, China
| | - Zongxin Zhang
- College of Public Health, Zhengzhou University, Zhengzhou, 450001, China
| | - Xiao Liang
- College of Public Health, Zhengzhou University, Zhengzhou, 450001, China
| | - Jun Yang
- College of Public Health, Zhengzhou University, Zhengzhou, 450001, China
| | - Huizhen Zhang
- College of Public Health, Zhengzhou University, Zhengzhou, 450001, China.
| |
Collapse
|
10
|
Park J, Patel K, Lee WH. Recent advances in algal bloom detection and prediction technology using machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 938:173546. [PMID: 38810749 DOI: 10.1016/j.scitotenv.2024.173546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 05/18/2024] [Accepted: 05/24/2024] [Indexed: 05/31/2024]
Abstract
Harmful algal blooms (HAB) including red tides and cyanobacteria are a significant environmental issue that can have harmful effects on aquatic ecosystems and human health. Traditional methods of detecting and managing algal blooms have been limited by their reliance on manual observation and analysis, which can be time-consuming and costly. Recent advances in machine learning (ML) technology have shown promise in improving the accuracy and efficiency of algal bloom detection and prediction. This paper provides an overview of the latest developments in using ML for algal bloom detection and prediction using various water quality parameters and environmental factors. First, we introduced ML for algal bloom prediction using regression and classification models. Then we explored image-based ML for algae detection by utilizing satellite images, surveillance cameras, and microscopic images. This study also highlights several real-world examples of successful implementation of ML for algal bloom detection and prediction. These examples show how ML can enhance the accuracy and efficiency of detecting and predicting algal blooms, contributing to the protection of aquatic ecosystems and human health. The study also outlines recent efforts to enhance the field applicability of ML models and suggests future research directions. A recent interest in explainable artificial intelligence (XAI) was discussed in an effort to understand the most influencing environmental factors on algal blooms. XAI facilitates interpretations of ML model results, thereby enhancing the models' usability for decision-making in field management and improving their overall applicability in real-world settings. We also emphasize the significance of obtaining high-quality, field-representative data to enhance the efficiency of ML applications. The effectiveness of ML models in detecting and predicting algal blooms can be improved through management strategies for data quality, such as pre-treating missing data and integrating diverse datasets into a unified database. Overall, this paper presents a comprehensive review of the latest advancements in managing algal blooms using ML technology and proposes future research directions to enhance the utilization of ML techniques.
Collapse
Affiliation(s)
- Jungsu Park
- Department of Civil and Environmental Engineering, Hanbat National University,125, Dongseo-daero, Yuseong-gu, Daejeon 34158, Republic of Korea.
| | - Keval Patel
- Department of Civil, Environmental and Construction Engineering, University of Central Florida, 12800 Pegasus Dr., Orlando, FL 32816, United States.
| | - Woo Hyoung Lee
- Department of Civil, Environmental and Construction Engineering, University of Central Florida, 12800 Pegasus Dr., Orlando, FL 32816, United States.
| |
Collapse
|
11
|
Guo C, Lan W, Guo M, Lv X, Xu X, Lei K. Spatiotemporal distribution patterns and coupling effects of aquatic environmental factors in the dry-wet season over a decade from the Beibu Gulf, South China Sea. MARINE POLLUTION BULLETIN 2024; 205:116596. [PMID: 38905738 DOI: 10.1016/j.marpolbul.2024.116596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 06/10/2024] [Accepted: 06/11/2024] [Indexed: 06/23/2024]
Abstract
Since the 21st century, the Beibu Gulf area has been affected by increasing anthropogenic activities, which makes the coastal aquatic ecosystem extremely concerning. However, the comprehensive exploration and analysis of the long-term scale behavior change characteristics of various water quality environment factors is still limited. Through comprehensively detecting coastal surface water environmental behavior information from 33 locations in the Beibu Gulf from 2005 to 2015, we revealed and quantified mutual response characteristics and patterns of various environmental indicators. The main environmental pollution indicators (e.g., COD, NH4+, NO3-, and DIP) showed a gradual decrease in concentration from the coast to the offshore sea area, and significantly increases during the wet season. The semi-enclosed Maowei Sea exhibited the most prominent performance with significant differences compared to other regions in Beibu Gulf. The average Chlorophyll-a (Chla) content in the coastal area of the Beibu Gulf during the wet season was more than twice that of the dry season, yet the interaction pattern between Chla and environmental factors in the two seasons was opposite to its concentration behavior, accompanied by a closely significant relationship with thermohaline structure and the input of nitrogen and phosphorous nutrients. The multivariate statistical analysis results of total nutrient dynamics suggested that the Beibu Gulf was clearly divided into different regions in both dry and wet season clusters. The present study can provide a comprehensive perspective for the spatial and temporal migration patterns and transformation laws of coastal water environmental factor, which should contribute to improve the prevention countermeasure of nutrient pollution in coastal environment.
Collapse
Affiliation(s)
- Chaochen Guo
- State Environmental Protection Key Laboratory of Estuarine and Coastal Environment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China
| | - Wenlu Lan
- Beibu Gulf Marine Ecological Environment Field Observation and Research Station of Guangxi, Marine Environmental Monitoring Center of Guangxi, Beihai 536000, China
| | - Meixiu Guo
- Beibu Gulf Marine Ecological Environment Field Observation and Research Station of Guangxi, Marine Environmental Monitoring Center of Guangxi, Beihai 536000, China
| | - Xubo Lv
- State Environmental Protection Key Laboratory of Estuarine and Coastal Environment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China
| | - Xiangqin Xu
- State Environmental Protection Key Laboratory of Estuarine and Coastal Environment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China
| | - Kun Lei
- State Environmental Protection Key Laboratory of Estuarine and Coastal Environment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China.
| |
Collapse
|
12
|
Yan Z, Alamdari N. Integrating temporal decomposition and data-driven approaches for predicting coastal harmful algal blooms. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 364:121463. [PMID: 38878579 DOI: 10.1016/j.jenvman.2024.121463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 04/23/2024] [Accepted: 06/09/2024] [Indexed: 06/24/2024]
Abstract
Frequent coastal harmful algal blooms (HABs) threaten the ecological environment and human health. Biscayne Bay in southeastern Florida also faces algal bloom issues; however, the mechanisms driving these blooms are not fully understood, emphasizing the importance of HAB prediction for effective environmental management. The overarching goal of this study is to offer a robust HAB predictive framework and try to enhance the understanding of HAB dynamics. This study established three scenarios to predict chlorophyll-a concentrations, a recognized representative of HABs: Scenario 1 (S1) using single nonlinear machine learning (ML) algorithms, hybrid Scenario 2 (S2) combining linear models and nonlinear ML algorithms, and hybrid Scenario 3 (S3) combining temporal decomposition and ML (TD-ML) algorithms. The novel-developed S3 TD-ML hybrid models demonstrated superior predictive capabilities, achieving all R2 values above 0.9 and MAPE under 30% in tests, significantly outperforming the S1 with an average R2 of 0.16 and the S2 with an R2 of -0.06. S3 models effectively captured the algal dynamics, successfully predicting complex time series with extremes and noise. In addition, we unveiled the relationship between environmental variables and chlorophyll-a through correlation analysis and found that climate change might intensify the HABs in Biscayne Bay. This research developed a precise predictive framework for early warning and proactive management of HABs, offering potential global applicability and improved prediction accuracy to address HAB challenges.
Collapse
Affiliation(s)
- Zhengxiao Yan
- Department of Civil and Environmental Engineering, FAMU-FSU College of Engineering, Florida State University, Tallahassee, FL, 32310, USA
| | - Nasrin Alamdari
- Department of Civil and Environmental Engineering, FAMU-FSU College of Engineering, Florida State University, Tallahassee, FL, 32310, USA.
| |
Collapse
|
13
|
Sheik AG, Krishna SBN, Patnaik R, Ambati SR, Bux F, Kumari S. Digitalization of phosphorous removal process in biological wastewater treatment systems: Challenges, and way forward. ENVIRONMENTAL RESEARCH 2024; 252:119133. [PMID: 38735379 DOI: 10.1016/j.envres.2024.119133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 03/22/2024] [Accepted: 05/10/2024] [Indexed: 05/14/2024]
Abstract
Phosphorus in wastewater poses a significant environmental threat, leading to water pollution and eutrophication. However, it plays a crucial role in the water-energy-resource recovery-environment (WERE) nexus. Recovering Phosphorus from wastewater can close the phosphorus loop, supporting circular economy principles by reusing it as fertilizer or in industrial applications. Despite the recognized importance of phosphorus recovery, there is a lack of analysis of the cyber-physical framework concerning the WERE nexus. Advanced methods like automatic control, optimal process technologies, artificial intelligence (AI), and life cycle assessment (LCA) have emerged to enhance wastewater treatment plants (WWTPs) operations focusing on improving effluent quality, energy efficiency, resource recovery, and reducing greenhouse gas (GHG) emissions. Providing insights into implementing modeling and simulation platforms, control, and optimization systems for Phosphorus recovery in WERE (P-WERE) in WWTPs is extremely important in WWTPs. This review highlights the valuable applications of AI algorithms, such as machine learning, deep learning, and explainable AI, for predicting phosphorus (P) dynamics in WWTPs. It emphasizes the importance of using AI to analyze microbial communities and optimize WWTPs for different various objectives. Additionally, it discusses the benefits of integrating mechanistic and data-driven models into plant-wide frameworks, which can enhance GHG simulation and enable simultaneous nitrogen (N) and Phosphorus (P) removal. The review underscores the significance of prioritizing recovery actions to redirect Phosphorus from effluent to reusable products for future considerations.
Collapse
Affiliation(s)
- Abdul Gaffar Sheik
- Institute for Water and Wastewater Technology, Durban University of Technology, Durban, 4001, South Africa.
| | - Suresh Babu Naidu Krishna
- Institute for Water and Wastewater Technology, Durban University of Technology, Durban, 4001, South Africa
| | - Reeza Patnaik
- Institute for Water and Wastewater Technology, Durban University of Technology, Durban, 4001, South Africa
| | - Seshagiri Rao Ambati
- Department of Chemical Engineering, Indian Institute of Petroleum and Energy, Visakhapatnam, 530003, Andhra Pradesh, India
| | - Faizal Bux
- Institute for Water and Wastewater Technology, Durban University of Technology, Durban, 4001, South Africa
| | - Sheena Kumari
- Institute for Water and Wastewater Technology, Durban University of Technology, Durban, 4001, South Africa.
| |
Collapse
|
14
|
Essamlali I, Nhaila H, El Khaili M. Advances in machine learning and IoT for water quality monitoring: A comprehensive review. Heliyon 2024; 10:e27920. [PMID: 38533055 PMCID: PMC10963334 DOI: 10.1016/j.heliyon.2024.e27920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 02/22/2024] [Accepted: 03/08/2024] [Indexed: 03/28/2024] Open
Abstract
Water holds great significance as a vital resource in our everyday lives, highlighting the important to continuously monitor its quality to ensure its usability. The advent of the. The Internet of Things (IoT) has brought about a revolutionary shift by enabling real-time data collection from diverse sources, thereby facilitating efficient monitoring of water quality (WQ). By employing Machine learning (ML) techniques, this gathered data can be analyzed to make accurate predictions regarding water quality. These predictive insights play a crucial role in decision-making processes aimed at safeguarding water quality, such as identifying areas in need of immediate attention and implementing preventive measures to avert contamination. This paper aims to provide a comprehensive review of the current state of the art in water quality monitoring, with a specific focus on the employment of IoT wireless technologies and ML techniques. The study examines the utilization of a range of IoT wireless technologies, including Low-Power Wide Area Networks (LpWAN), Wi-Fi, Zigbee, Radio Frequency Identification (RFID), cellular networks, and Bluetooth, in the context of monitoring water quality. Furthermore, it explores the application of both supervised and unsupervised ML algorithms for analyzing and interpreting the collected data. In addition to discussing the current state of the art, this survey also addresses the challenges and open research questions involved in integrating IoT wireless technologies and ML for water quality monitoring (WQM).
Collapse
Affiliation(s)
- Ismail Essamlali
- Electrical Engineering and Intelligent Systems Laboratory, ENSET Mohammedia, Hassan 2nd University of Casablanca, Mail Box 159, Morocco
| | - Hasna Nhaila
- Electrical Engineering and Intelligent Systems Laboratory, ENSET Mohammedia, Hassan 2nd University of Casablanca, Mail Box 159, Morocco
| | - Mohamed El Khaili
- Electrical Engineering and Intelligent Systems Laboratory, ENSET Mohammedia, Hassan 2nd University of Casablanca, Mail Box 159, Morocco
| |
Collapse
|
15
|
Kim H, Lee G, Lee CG, Park SJ. Algae development in rivers with artificially constructed weirs: Dominant influence of discharge over temperature. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 355:120551. [PMID: 38460331 DOI: 10.1016/j.jenvman.2024.120551] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Revised: 02/05/2024] [Accepted: 03/04/2024] [Indexed: 03/11/2024]
Abstract
Algal blooms contribute to water quality degradation, unpleasant odors, taste issues, and the presence of harmful substances in artificially constructed weirs. Mitigating these adverse effects through effective algal bloom management requires identifying the contributing factors and predicting algal concentrations. This study focused on the upstream region of the Seungchon Weir in Korea, which is characterized by elevated levels of total nitrogen and phosphorus due to a significant influx of water from a sewage treatment plant. We employed four distinct machine learning models to predict chlorophyll-a (Chl-a) concentrations and identified the influential variables linked to local algal bloom events. The gradient boosting model enabled an in-depth exploration of the intricate relationships between algal occurrence and water quality parameters, enabling accurate identification of the causal factors. The models identified the discharge flow rate (D-Flow) and water temperature as the primary determinants of Chl-a levels, with feature importance values of 0.236 and 0.212, respectively. Enhanced model precision was achieved by utilizing daily average D-Flow values, with model accuracy and significance of the D-Flow amplifying as the temporal span of daily averaging increased. Elevated Chl-a concentrations correlated with diminished D-Flow and temperature, highlighting the pivotal role of D-Flow in regulating Chl-a concentration. This trend can be attributed to the constrained discharge of the Seungchon Weir during winter. Calculating the requisite D-Flow to maintain a desirable Chl-a concentration of up to 20 mg/m3 across varying temperatures revealed an escalating demand for D-Flow with rising temperatures. Specific D-Flow ranges, corresponding to each season and temperature condition, were identified as particularly influential on Chl-a concentration. Thus, optimizing Chl-a reduction can be achieved by strategically increasing D-Flow within these specified ranges for each season and temperature variation. This study highlights the importance of maintaining sufficient D-Flow levels to mitigate algal proliferation within river systems featuring weirs.
Collapse
Affiliation(s)
- Hyunju Kim
- Faculty of Liberal Education, Seoul National University, Seoul, 08826, Republic of Korea
| | - Gyesik Lee
- School of Computer Engineering and Applied Mathematics, Hankyong National University, Anseong, 17579, Republic of Korea.
| | - Chang-Gu Lee
- Department of Environmental and Safety Engineering, Ajou University, Suwon, 16499, Republic of Korea
| | - Seong-Jik Park
- Department of Bioresources and Rural System Engineering, Hankyong National University, Anseong, 17579, Republic of Korea.
| |
Collapse
|
16
|
Yan Z, Kamanmalek S, Alamdari N. Predicting coastal harmful algal blooms using integrated data-driven analysis of environmental factors. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 912:169253. [PMID: 38101630 DOI: 10.1016/j.scitotenv.2023.169253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 11/20/2023] [Accepted: 12/07/2023] [Indexed: 12/17/2023]
Abstract
Coastal harmful algal blooms (HABs) have become one of the challenging environmental problems in the world's thriving coastal cities due to the interference of multiple stressors from human activities and climate change. Past HAB predictions primarily relied on single-source data, overlooked upstream land use, and typically used a single prediction algorithm. To address these limitations, this study aims to develop predictive models to establish the relationship between the HAB indicator - chlorophyll-a (Chl-a) and various environmental stressors, under appropriate lagging predictive scenarios. To achieve this, we first applied the partial autocorrelation function (PACF) to Chl-a to precisely identify two prediction scenarios. We then combined multi-source data and several machine learning algorithms to predict harmful algae, using SHapley Additive exPlanations (SHAP) to extract key features influencing output from the prediction models. Our findings reveal an apparent 1-month autoregressive characteristic in Chl-a, leading us to create two scenarios: 1-month lead prediction and current-month prediction. The Extra Tree Regressor (ETR), with an R2 of 0.92, excelled in 1-month lead predictions, while the Random Forest Regressor (RFR) was most effective for current-month predictions with an R2 of 0.69. Additionally, we identified current month Chl-a, developed land use, total phosphorus, and nitrogen oxides (NOx) as critical features for accurate predictions. Our predictive framework, which can be applied to coastal regions worldwide, provides decision-makers with crucial tools for effectively predicting and mitigating HAB threats in major coastal cities.
Collapse
Affiliation(s)
- Zhengxiao Yan
- Department of Civil and Environmental Engineering, FAMU-FSU College of Engineering, Florida State University, Tallahassee, FL 32310, USA
| | - Sara Kamanmalek
- Department of Civil and Environmental Engineering, FAMU-FSU College of Engineering, Florida State University, Tallahassee, FL 32310, USA
| | - Nasrin Alamdari
- Department of Civil and Environmental Engineering, FAMU-FSU College of Engineering, Florida State University, Tallahassee, FL 32310, USA.
| |
Collapse
|
17
|
Singh RB, Patra KC, Pradhan B, Samantra A. HDTO-DeepAR: A novel hybrid approach to forecast surface water quality indicators. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 352:120091. [PMID: 38228048 DOI: 10.1016/j.jenvman.2024.120091] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 12/23/2023] [Accepted: 01/08/2024] [Indexed: 01/18/2024]
Abstract
Water is a vital resource supporting a broad spectrum of ecosystems and human activities. The quality of river water has declined in recent years due to the discharge of hazardous materials and toxins. Deep learning and machine learning have gained significant attention for analysing time-series data. However, these methods often suffer from high complexity and significant forecasting errors, primarily due to non-linear datasets and hyperparameter settings. To address these challenges, we have developed an innovative HDTO-DeepAR approach for predicting water quality indicators. This proposed approach is compared with standalone algorithms, including DeepAR, BiLSTM, GRU and XGBoost, using performance metrics such as MAE, MSE, MAPE, and NSE. The NSE of the hybrid approach ranges between 0.8 to 0.96. Given the value's proximity to 1, the model appears to be efficient. The PICP values (ranging from 95% to 98%) indicate that the model is highly reliable in forecasting water quality indicators. Experimental results reveal a close resemblance between the model's predictions and actual values, providing valuable insights for predicting future trends. The comparative study shows that the suggested model surpasses all existing, well-known models.
Collapse
Affiliation(s)
- Rosysmita Bikram Singh
- Department of Civil Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India.
| | - Kanhu Charan Patra
- Department of Civil Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India.
| | - Biswajeet Pradhan
- Centre for Advanced Modelling and Geospatial Information System, School of Civil and Environmental Engineering, University of Technology Sydney, Sydney, Australia; Institute of Climate Change, Universiti Kebangsaan Malaysia, Bangi, Malaysia.
| | - Avinash Samantra
- Department of Computer Science & Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India.
| |
Collapse
|
18
|
Li Y, Tao C, Fu D, Jafvert CT, Zhu T. Integrating molecular descriptors for enhanced prediction: Shedding light on the potential of pH to model hydrated electron reaction rates for organic compounds. CHEMOSPHERE 2024; 349:140984. [PMID: 38122944 DOI: 10.1016/j.chemosphere.2023.140984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/23/2023]
Abstract
Hydrated electron reaction rate constant (ke-aq) is an important parameter to determine reductive degradation efficiency and to mitigate the ecological risk of organic compounds (OCs). However, OC species morphology and the concentration of hydrated electrons (e-aq) in water vary with pH, complicating OC fate assessment. This study introduced the environmental variable of pH, to develop models for ke-aq for 701 data points using 3 descriptor types: (i) molecular descriptors (MD), (ii) quantum chemical descriptors (QCD), and (iii) the combination of both (MD + QCD). Models were screened using 2 descriptor screening methods (MLR and RF) and 14 machine learning (ML) algorithms. The introduction of QCDs that characterized the electronic structure of OCs greatly improved the performance of models while ensuring the need for fewer descriptors. The optimal model MLR-XGBoost(MD + QCD), which included pH, achieved the most satisfactory prediction: R2tra = 0.988, Q2boot = 0.861, R2test = 0.875 and Q2test = 0.873. The mechanistic interpretation using the SHAP method further revealed that QCDs, polarizability, volume, and pH had a great influence on the reductive degradation of OCs by e-aq. Overall, the electrochemical parameters (QCDs, pH) related to the solvent and solute are of significance and should be considered in any future ML modeling that assesses the fate of OCs in aquatic environment.
Collapse
Affiliation(s)
- Yi Li
- School of Environmental Science and Engineering, Yangzhou University, Yangzhou, 225127, Jiangsu, China
| | - Cuicui Tao
- School of Environmental Science and Engineering, Yangzhou University, Yangzhou, 225127, Jiangsu, China
| | - Dafang Fu
- School of Civil Engineering, Southeast University, Nanjing, 210096, China
| | - Chad T Jafvert
- Lyles School of Civil Engineering, and Environmental & Ecological Engineering, Purdue University, West Lafayette, IN, 47907, USA
| | - Tengyi Zhu
- School of Environmental Science and Engineering, Yangzhou University, Yangzhou, 225127, Jiangsu, China.
| |
Collapse
|
19
|
Chen Z, Wang S, Xu J, He L, Liu Q, Wang Y. Assessment and machine learning prediction of heavy metals fate in mining farmland assisted by Positive Matrix Factorization. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 350:119587. [PMID: 38000273 DOI: 10.1016/j.jenvman.2023.119587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 10/24/2023] [Accepted: 11/03/2023] [Indexed: 11/26/2023]
Abstract
The accurate pollutant prediction by Machine Learning (ML) is significant to efficient environmental monitoring and risk assessment. However, application of ML in soil is under studied. In this study, a Positive Matrix Factorization (PMF) assisted prediction method was developed with Support Vector Machine (SVM) and Random Forest (RF) for heavy metals (HMs) prediction in mining farmland. Principal Component Analysis (PCA) and Redundancy Analysis (RDA) were selected to pretreat data. Experiment results illustrated Cd was the main pollutant with heavy risks in the study area and Pb was easy to migrate. The method effects of HMs total concentration predicting were PMF > Simple > PCA > PCA - PMF, and RF predicted better than SVM. Data pretreatment by RDA prior inspection improved the model results. Characteristic HMs Tessier fractions prediction received good effects with average R value as 0.86. Risk classification prediction performed good in Cd, Cu, Ni and Zn, however, Pb showed weak effect by simple model. The best classifier method for Pb was PMF - RF method with relatively good effect (Area under ROC Curve = 0.896). Overall, our study suggested the combination between PMF and ML can assist the prediction of HMs in soil. Spatial weighted attribute of HMs can be provided by PMF.
Collapse
Affiliation(s)
- Zhaoming Chen
- Technology Research Center for Pollution Control and Remediation of Northwest Soil and Groundwater, College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, 730000, China.
| | - Shengli Wang
- Technology Research Center for Pollution Control and Remediation of Northwest Soil and Groundwater, College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, 730000, China.
| | - Jun Xu
- Technology Research Center for Pollution Control and Remediation of Northwest Soil and Groundwater, College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, 730000, China.
| | - Liang He
- Technology Research Center for Pollution Control and Remediation of Northwest Soil and Groundwater, College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, 730000, China.
| | - Qi Liu
- Technology Research Center for Pollution Control and Remediation of Northwest Soil and Groundwater, College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, 730000, China.
| | - Yufan Wang
- Technology Research Center for Pollution Control and Remediation of Northwest Soil and Groundwater, College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, 730000, China.
| |
Collapse
|
20
|
Zhang C, Nong X, Behzadian K, Campos LC, Chen L, Shao D. A new framework for water quality forecasting coupling causal inference, time-frequency analysis and uncertainty quantification. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 350:119613. [PMID: 38007931 DOI: 10.1016/j.jenvman.2023.119613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 11/08/2023] [Accepted: 11/11/2023] [Indexed: 11/28/2023]
Abstract
Accurate forecasting of water quality variables in river systems is crucial for relevant administrators to identify potential water quality degradation issues and take countermeasures promptly. However, pure data-driven forecasting models are often insufficient to deal with the highly varying periodicity of water quality in today's more complex environment. This study presents a new holistic framework for time-series forecasting of water quality parameters by combining advanced deep learning algorithms (i.e., Long Short-Term Memory (LSTM) and Informer) with causal inference, time-frequency analysis, and uncertainty quantification. The framework was demonstrated for total nitrogen (TN) forecasting in the largest artificial lakes in Asia (i.e., the Danjiangkou Reservoir, China) with six-year monitoring data from January 2017 to June 2022. The results showed that the pre-processing techniques based on causal inference and wavelet decomposition can significantly improve the performance of deep learning algorithms. Compared to the individual LSTM and Informer models, wavelet-coupled approaches diminished well the apparent forecasting errors of TN concentrations, with 24.39%, 32.68%, and 41.26% reduction at most in the average, standard deviation, and maximum values of the errors, respectively. In addition, a post-processing algorithm based on the Copula function and Bayesian theory was designed to quantify the uncertainty of predictions. With the help of this algorithm, each deterministic prediction of our model can correspond to a range of possible outputs. The 95% forecast confidence interval covered almost all the observations, which proves a measure of the reliability and robustness of the predictions. This study provides rich scientific references for applying advanced data-driven methods in time-series forecasting tasks and a practical methodological framework for water resources management and similar projects.
Collapse
Affiliation(s)
- Chi Zhang
- State Key Laboratory of Water Resources Engineering and Management, Wuhan University, Wuhan, 430072, China
| | - Xizhi Nong
- State Key Laboratory of Water Resources Engineering and Management, Wuhan University, Wuhan, 430072, China; College of Civil Engineering and Architecture, Guangxi University, Nanning, 530004, China; The National Key Laboratory of Water Disaster Prevention, Nanjing Hydraulic Research Institute, Nanjing, 210029, China.
| | - Kourosh Behzadian
- Centre for Urban Sustainability and Resilience, Department of Civil, Environmental and Geomatic Engineering, University College London, London, WC1E 6BT, United Kingdom; School of Computing and Engineering, University of West London, London, W5 5RF, UK, United Kingdom
| | - Luiza C Campos
- Centre for Urban Sustainability and Resilience, Department of Civil, Environmental and Geomatic Engineering, University College London, London, WC1E 6BT, United Kingdom
| | - Lihua Chen
- College of Civil Engineering and Architecture, Guangxi University, Nanning, 530004, China
| | - Dongguo Shao
- State Key Laboratory of Water Resources Engineering and Management, Wuhan University, Wuhan, 430072, China.
| |
Collapse
|
21
|
Zhang Y, Wu H, Xu R, Wang Y, Chen L, Wei C. Machine learning modeling for the prediction of phosphorus and nitrogen removal efficiency and screening of crucial microorganisms in wastewater treatment plants. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 907:167730. [PMID: 37852495 DOI: 10.1016/j.scitotenv.2023.167730] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 10/08/2023] [Accepted: 10/08/2023] [Indexed: 10/20/2023]
Abstract
The effectiveness of wastewater treatment plants (WWTPs) is largely determined by the microbial community structure in their activated sludge (AS). Interactions among microbial communities in AS systems and their indirect effects on water quality changes are crucial for WWTP performance. However, there is currently no quantitative method to evaluate the contribution of microorganisms to the operating efficiency of WWTPs. Traditional assessments of WWTP performance are limited by experimental conditions, methods, and other factors, resulting in increased costs and experimental pollutants. Therefore, an effective method is needed to predict WWTP efficiency based on AS community structure and quantitatively evaluate the contribution of microorganisms in the AS system. This study evaluated and compared microbial communities and water quality changes from WWTPs worldwide by meta-analysis of published high-throughput sequencing data. Six machine learning (ML) models were utilized to predict the efficiency of phosphorus and nitrogen removal in WWTPs; among them, XGBoost showed the highest prediction accuracy. Cross-entropy was used to screen the crucial microorganisms related to phosphorus and nitrogen removal efficiency, and the modeling confirmed the reasonableness of the results. Thirteen genera with nitrogen and phosphorus cycling pathways obtained from the screening were considered highly appropriate for the simultaneous removal of phosphorus and nitrogen. The results showed that the microbes Haliangium, Vicinamibacteraceae, Tolumonas, and SWB02 are potentially crucial for phosphorus and nitrogen removal, as they may be involved in the process of phosphorus and nitrogen removal in sewage treatment plants. Overall, these findings have deepened our understanding of the relationship between microbial community structure and performance of WWTPs, indicating that microbial data should play a critical role in the future design of sewage treatment plants. The ML model of this study can efficiently screen crucial microbes associated with WWTP system performance, and it is promising for the discovery of potential microbial metabolic pathways.
Collapse
Affiliation(s)
- Yinan Zhang
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, PR China
| | - Haizhen Wu
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, PR China.
| | - Rui Xu
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, PR China
| | - Ying Wang
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, PR China
| | - Liping Chen
- School of Environment and Energy, South China University of Technology, Guangzhou Higher Education Mega Centre, Guangzhou 510006, PR China
| | - Chaohai Wei
- School of Environment and Energy, South China University of Technology, Guangzhou Higher Education Mega Centre, Guangzhou 510006, PR China
| |
Collapse
|
22
|
Wang C, Liu J, Qiu C, Su X, Ma N, Li J, Wang S, Qu S. Identifying the drivers of chlorophyll-a dynamics in a landscape lake recharged by reclaimed water using interpretable machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 906:167483. [PMID: 37832666 DOI: 10.1016/j.scitotenv.2023.167483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Revised: 09/21/2023] [Accepted: 09/28/2023] [Indexed: 10/15/2023]
Abstract
The water quality of lakes recharged by reclaimed water is affected by both the fluctuation of reclaimed water quality and the biochemical processes in the lakes, and therefore the main controlling factors of algal blooms are difficult to identify. Taking a typical landscape lake recharged by reclaimed water as an example and using the spatiotemporal distribution characteristics and correlation analysis of water quality indexes, we propose an interpretable machine learning framework based on random forest to predict chlorophyll-a (Chl-a). The model considered nutrient difference indexes between reclaimed water and lake water, and further used feature importance ranking and partial dependence plot to identify nutrient drivers. Results show that the NO3--N input from reclaimed water is the dominant nutrient driver for algal bloom especially at high temperatures, and the negative correlation between NO3--N and Chl-a in the lake water is the consequence of algal bloom rather than the cause. Our study provides new insights into the identification of eutrophication factors for lakes recharged by reclaimed water.
Collapse
Affiliation(s)
- Chenchen Wang
- School of Environmental and Municipal Engineering, Tianjin Chengjian University, Tianjin 300384, China; Tianjin Key Laboratory of Aquatic Science and Technology, Tianjin Chengjian University, Tianjin 300384, China; Key Laboratory of Drinking Water Science and Technology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| | - Juan Liu
- School of Environmental and Municipal Engineering, Tianjin Chengjian University, Tianjin 300384, China
| | - Chunsheng Qiu
- School of Environmental and Municipal Engineering, Tianjin Chengjian University, Tianjin 300384, China; Tianjin Key Laboratory of Aquatic Science and Technology, Tianjin Chengjian University, Tianjin 300384, China.
| | - Xiao Su
- Tianjin Water Group Co., Ltd, Tianjin 300042, China
| | - Ning Ma
- Tianjin Eco-City Water Investment and Construction Ltd, Tianjin 300467, China
| | - Jing Li
- School of Environmental and Municipal Engineering, Tianjin Chengjian University, Tianjin 300384, China
| | - Shaopo Wang
- School of Environmental and Municipal Engineering, Tianjin Chengjian University, Tianjin 300384, China; Tianjin Key Laboratory of Aquatic Science and Technology, Tianjin Chengjian University, Tianjin 300384, China
| | - Shen Qu
- Beijing Institute of Technology, Beijing 100081, China.
| |
Collapse
|
23
|
Gupta A, Hantush MM, Govindaraju RS. Sub-monthly time scale forecasting of harmful algal blooms intensity in Lake Erie using remote sensing and machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 900:165781. [PMID: 37499836 PMCID: PMC10552934 DOI: 10.1016/j.scitotenv.2023.165781] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 07/07/2023] [Accepted: 07/23/2023] [Indexed: 07/29/2023]
Abstract
Harmful algal blooms of cyanobacteria (CyanoHAB) have emerged as a serious environmental concern in large and small water bodies including many inland lakes. The growth dynamics of CyanoHAB can be chaotic at very short timescales but predictable at coarser timescales. In Lake Erie, cyanobacteria blooms occur in the spring-summer months, which, at annual timescale, are controlled by the total spring phosphorus (TP) load into the lake. This study aimed to forecast CyanoHAB cell count at sub-monthly (e.g., 10-day) timescales. Satellite-derived cyanobacterial index (CI) was used as a surrogate measure of CyanoHAB cell count. CI was related to the in-situ measured chlorophyll-a and phycocyanin concentrations and Microcystis biovolume in the lake. Using available data on environmental and lake hydrodynamics as predictor variables, four statistical models including LASSO (Least Absolute Shrinkage and Selection Operator), artificial neural network (ANN), random forest (RF), and an ensemble average of the three models (EA) were developed to forecast CI at 10-, 20- and 30-day lead times. The best predictions were obtained by using the RF and EA algorithms. It was found that CyanoHAB growth dynamics, even at sub-monthly timescales, are determined by coarser timescale variables. Meteorological, hydrological, and water quality variations at sub-monthly timescales exert lesser control over CyanoHAB growth dynamics. Nutrients discharged into the lake from rivers other than the Maumee River were also important in explaining the variations in CI. Surprisingly, to forecast CyanoHAB cell count, average solar radiation at 30 to 60 days lags were found to be more important than the average solar radiation at 0 to 30 days lag. Other important variables were TP discharged into the lake during the previous 10 years, TP and TKN discharged into the lake during the previous 120 days, the average water level at 10-day lag and 60-day lag.
Collapse
Affiliation(s)
- Abhinav Gupta
- Division of Hydrologic Sciences, Desert Research Institute, Las Vegas, NV, USA.
| | - Mohamed M Hantush
- U.S. EPA Center for Environmental Solutions and Emergency Response, Cincinnati, OH, USA
| | - Rao S Govindaraju
- Lyles School of Civil Engineering, Purdue University, West Lafayette, IN, USA
| |
Collapse
|
24
|
Manochkumar J, Cherukuri AK, Kumar RS, Almansour AI, Ramamoorthy S, Efferth T. A critical review of machine-learning for "multi-omics" marine metabolite datasets. Comput Biol Med 2023; 165:107425. [PMID: 37696182 DOI: 10.1016/j.compbiomed.2023.107425] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 07/12/2023] [Accepted: 08/28/2023] [Indexed: 09/13/2023]
Abstract
During the last decade, genomic, transcriptomic, proteomic, metabolomic, and other omics datasets have been generated for a wide range of marine organisms, and even more are still on the way. Marine organisms possess unique and diverse biosynthetic pathways contributing to the synthesis of novel secondary metabolites with significant bioactivities. As marine organisms have a greater tendency to adapt to stressed environmental conditions, the chance to identify novel bioactive metabolites with potential biotechnological application is very high. This review presents a comprehensive overview of the available "-omics" and "multi-omics" approaches employed for characterizing marine metabolites along with novel data integration tools. The need for the development of machine-learning algorithms for "multi-omics" approaches is briefly discussed. In addition, the challenges involved in the analysis of "multi-omics" data and recommendations for conducting "multi-omics" study were discussed.
Collapse
Affiliation(s)
- Janani Manochkumar
- School of Bio Sciences and Technology, Vellore Institute of Technology, Vellore, 632014, India
| | - Aswani Kumar Cherukuri
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, 632014, India
| | - Raju Suresh Kumar
- Department of Chemistry, College of Science, King Saud University, P. O. Box 2455, Riyadh, 11451, Saudi Arabia
| | - Abdulrahman I Almansour
- Department of Chemistry, College of Science, King Saud University, P. O. Box 2455, Riyadh, 11451, Saudi Arabia
| | - Siva Ramamoorthy
- School of Bio Sciences and Technology, Vellore Institute of Technology, Vellore, 632014, India.
| | - Thomas Efferth
- Department of Pharmaceutical Biology, Institute of Pharmaceutical and Biomedical Sciences, Johannes Gutenberg University, Mainz, Germany.
| |
Collapse
|
25
|
Wang X, Li Y, Qiao Q, Tavares A, Liang Y. Water Quality Prediction Based on Machine Learning and Comprehensive Weighting Methods. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1186. [PMID: 37628216 PMCID: PMC10453428 DOI: 10.3390/e25081186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 07/26/2023] [Accepted: 08/02/2023] [Indexed: 08/27/2023]
Abstract
In the context of escalating global environmental concerns, the importance of preserving water resources and upholding ecological equilibrium has become increasingly apparent. As a result, the monitoring and prediction of water quality have emerged as vital tasks in achieving these objectives. However, ensuring the accuracy and dependability of water quality prediction has proven to be a challenging endeavor. To address this issue, this study proposes a comprehensive weight-based approach that combines entropy weighting with the Pearson correlation coefficient to select crucial features in water quality prediction. This approach effectively considers both feature correlation and information content, avoiding excessive reliance on a single criterion for feature selection. Through the utilization of this comprehensive approach, a comprehensive evaluation of the contribution and importance of the features was achieved, thereby minimizing subjective bias and uncertainty. By striking a balance among various factors, features with stronger correlation and greater information content can be selected, leading to improved accuracy and robustness in the feature-selection process. Furthermore, this study explored several machine learning models for water quality prediction, including Support Vector Machines (SVMs), Multilayer Perceptron (MLP), Random Forest (RF), XGBoost, and Long Short-Term Memory (LSTM). SVM exhibited commendable performance in predicting Dissolved Oxygen (DO), showcasing excellent generalization capabilities and high prediction accuracy. MLP demonstrated its strength in nonlinear modeling and performed well in predicting multiple water quality parameters. Conversely, the RF and XGBoost models exhibited relatively inferior performance in water quality prediction. In contrast, the LSTM model, a recurrent neural network specialized in processing time series data, demonstrated exceptional abilities in water quality prediction. It effectively captured the dynamic patterns present in time series data, offering stable and accurate predictions for various water quality parameters.
Collapse
Affiliation(s)
- Xianhe Wang
- School of Applied Chemistry and Materials, Zhuhai College of Science and Technology, Zhuhai 519041, China; (X.W.); (Y.L.)
- Department of Industrial Electronics, School of Engineering, University of Minho, 4704-553 Braga, Portugal
| | - Ying Li
- School of Applied Chemistry and Materials, Zhuhai College of Science and Technology, Zhuhai 519041, China; (X.W.); (Y.L.)
- Department of Industrial Electronics, School of Engineering, University of Minho, 4704-553 Braga, Portugal
| | - Qian Qiao
- School of Applied Chemistry and Materials, Zhuhai College of Science and Technology, Zhuhai 519041, China; (X.W.); (Y.L.)
| | - Adriano Tavares
- Department of Industrial Electronics, School of Engineering, University of Minho, 4704-553 Braga, Portugal
| | - Yanchun Liang
- School of Computer Science, Zhuhai College of Science and Technology, Zhuhai 519041, China
- Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of Education, College of Computer Science and Technology, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| |
Collapse
|
26
|
Gai R, Yang J. Research on water quality spatiotemporal forecasting model based on ST-BIGRU-SVR neural network. WATER SCIENCE AND TECHNOLOGY : A JOURNAL OF THE INTERNATIONAL ASSOCIATION ON WATER POLLUTION RESEARCH 2023; 88:530-541. [PMID: 37578872 PMCID: wst_2023_156 DOI: 10.2166/wst.2023.156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/16/2023]
Abstract
With the serious deterioration of the water environment, accurate prediction of water quality changes has become a topic of increasing concern. To further improve the accuracy of water quality prediction and the stability and generalization ability of the model, we propose a new water quality spatiotemporal forecast model to predict future water quality. To capture the spatiotemporal characteristics of water quality pollution data, the three sites (station S1, station S2, station S4) with the highest temperature time series concentration correlation at the experimental sites were first extracted to predict the water temperature at station S1, and 17,380 records were collected at each monitoring station, and the spatiotemporal characteristics were extracted by BiGRU-SVR network model. This paper's prediction test is based on the actual water quality data of the Qinhuangdao sea area in Hebei province from 2 September to 26 September 2013 and compared with other baseline models. The experimental results show that the proposed model is better than other baseline models and effectively improves the accuracy of water quality prediction, and the mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R2) are 0.071, 0.076, and 0.957, respectively, which have good robustness.
Collapse
Affiliation(s)
- Rongli Gai
- School of Information Engineering, Dalian University, Dalian 116622, China E-mail:
| | - Jiahui Yang
- School of Information Engineering, Dalian University, Dalian 116622, China
| |
Collapse
|
27
|
Tselemponis A, Stefanis C, Giorgi E, Kalmpourtzi A, Olmpasalis I, Tselemponis A, Adam M, Kontogiorgis C, Dokas IM, Bezirtzoglou E, Constantinidis TC. Coastal Water Quality Modelling Using E. coli, Meteorological Parameters and Machine Learning Algorithms. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:6216. [PMID: 37444064 PMCID: PMC10341787 DOI: 10.3390/ijerph20136216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 06/19/2023] [Accepted: 06/21/2023] [Indexed: 07/15/2023]
Abstract
In this study, machine learning models were implemented to predict the classification of coastal waters in the region of Eastern Macedonia and Thrace (EMT) concerning Escherichia coli (E. coli) concentration and weather variables in the framework of the Directive 2006/7/EC. Six sampling stations of EMT, located on beaches of the regional units of Kavala, Xanthi, Rhodopi, Evros, Thasos and Samothraki, were selected. All 1039 samples were collected from May to September within a 14-year follow-up period (2009-2021). The weather parameters were acquired from nearby meteorological stations. The samples were analysed according to the ISO 9308-1 for the detection and the enumeration of E. coli. The vast majority of the samples fall into category 1 (Excellent), which is a mark of the high quality of the coastal waters of EMT. The experimental results disclose, additionally, that two-class classifiers, namely Decision Forest, Decision Jungle and Boosted Decision Tree, achieved high Accuracy scores over 99%. In addition, comparing our performance metrics with those of other researchers, diversity is observed in using algorithms for water quality prediction, with algorithms such as Decision Tree, Artificial Neural Networks and Bayesian Belief Networks demonstrating satisfactory results. Machine learning approaches can provide critical information about the dynamic of E. coli contamination and, concurrently, consider the meteorological parameters for coastal waters classification.
Collapse
Affiliation(s)
- Athanasios Tselemponis
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| | - Christos Stefanis
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| | - Elpida Giorgi
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| | - Aikaterini Kalmpourtzi
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| | - Ioannis Olmpasalis
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| | - Antonios Tselemponis
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| | - Maria Adam
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| | - Christos Kontogiorgis
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| | - Ioannis M. Dokas
- Department of Civil Engineering, Democritus University of Thrace, 69100 Komotini, Greece;
| | - Eugenia Bezirtzoglou
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| | - Theodoros C. Constantinidis
- Laboratory of Hygiene and Environmental Protection, Medical School, Democritus University of Thrace, 68100 Alexandroupoli, Greece; (A.T.); (E.G.); (A.K.); (I.O.); (A.T.); (M.A.); (C.K.); (E.B.); (T.C.C.)
| |
Collapse
|
28
|
Zamani MG, Nikoo MR, Rastad D, Nematollahi B. A comparative study of data-driven models for runoff, sediment, and nitrate forecasting. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 341:118006. [PMID: 37163836 DOI: 10.1016/j.jenvman.2023.118006] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 04/22/2023] [Accepted: 04/22/2023] [Indexed: 05/12/2023]
Abstract
Effective prediction of qualitative and quantitative indicators for runoff is quite essential in water resources planning and management. However, although several data-driven and model-driven forecasting approaches have been employed in the literature for streamflow forecasting, to our knowledge, the literature lacks a comprehensive comparison of well-known data-driven and model-driven forecasting techniques for runoff evaluation in terms of quality and quantity. This study filled this knowledge gap by comparing the accuracy of runoff, sediment, and nitrate forecasting using four robust data-driven techniques: artificial neural network (ANN), long short-term memory (LSTM), wavelet artificial neural network (WANN), and wavelet long short-term memory (WLSTM) models. These comparisons were performed in two main tiers: (1) Comparing the machine learning algorithms' results with the model-driven approach; In order to simulate the runoff, sediment, and nitrate loads, the Soil and Water Assessment Tool (SWAT) model was employed, and (2) Comparing the machine learning algorithms with each other; The wavelet function was utilized in the ANN and LSTM algorithms. These comparisons were assessed based on the substantial statistical indices of coefficient of determination (R-Squared), Nash-Sutcliff efficiency coefficient (NSE), mean absolute error (MAE), and root mean square error (RMSE). Finally, to prove the applicability and efficiency of the proposed novel framework, it was successfully applied to Eagle Creek Watershed (ECW), Indiana, U.S. Results demonstrated that the data-driven algorithms significantly outperformed the model-driven models for both the calibration/training and validation/testing phases. Furthermore, it was found that the coupled ANN and LSTM models with wavelet function led to more accurate results than those without this function.
Collapse
Affiliation(s)
- Mohammad G Zamani
- Department of Civil and Environmental Engineering, Amirkabir University of Technology, Tehran, Iran.
| | - Mohammad Reza Nikoo
- Department of Civil and Architectural Engineering, Sultan Qaboos University, Muscat, Oman.
| | - Dana Rastad
- Department of Civil and Environmental Engineering, Amirkabir University of Technology, Tehran, Iran.
| | | |
Collapse
|
29
|
Deng L, Gao X, Xia B, Wang J, Dai Q, Fan Y, Wang S, Li H, Qian X. Improving the efficiency of machine learning in simulating sedimentary heavy metal contamination by coupling preposing feature selection methods. CHEMOSPHERE 2023; 322:138205. [PMID: 36822525 DOI: 10.1016/j.chemosphere.2023.138205] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Revised: 01/10/2023] [Accepted: 02/20/2023] [Indexed: 06/18/2023]
Abstract
Sediment cores were collected from Taihu Lake in China. The chronology was determined by radionuclide. Heavy metals and magnetic properties of each core slice were assessed, respectively. The concentrations of most heavy metals in sediments surged at 20 cm from the surface, accompanying the increase in the concentrations of single-domain magnetic particles. This may be resulted from the influence of anthropic activities on the lake's environment after the 1970s. Two feature selection methods, random forest (RF) and maximal information coefficient (MIC), were combined with support vector machine (SVM) model to simulate heavy metals, with the inclusion of selected magnetic and physicochemical parameters. Compared with the modeling results obtained with the full set of parameters, a reasonable simulation performance was obtained with RF and MIC. RF performed better than MIC by increasing the R2 of simulation models for Cd, Cr, Cu, Pb, and Sb. For heavy metals with high ecological risks (As, Cd, Cr, Hg, Pb, Sb), the correlation coefficients for observed and predicted data ranged from 0.73 to 0.97 with only 14-27% of the parameters selected by RF as input variables. The RF-RBF-SVM enabled heavy metal predictions based on the magnetic properties of the lake sediments.
Collapse
Affiliation(s)
- Ligang Deng
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing, 210023, China
| | - Xiang Gao
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing, 210023, China; School of Environment, Nanjing Normal University, Nanjing, 210023, China
| | - Bisheng Xia
- College of Mathematics and Computer Science, Yan'an University, Yan'an, 716000, China
| | - Jinhua Wang
- Key Laboratory of Water Pollution Control and Wastewater Reuse of Anhui Province, Anhui Jianzhu University, Hefei, 230009, China
| | - Qianying Dai
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing, 210023, China
| | - Yifan Fan
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing, 210023, China
| | - Siyuan Wang
- College of Mathematics and Computer Science, Yan'an University, Yan'an, 716000, China
| | - Huiming Li
- School of Environment, Nanjing Normal University, Nanjing, 210023, China.
| | - Xin Qian
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing, 210023, China.
| |
Collapse
|
30
|
Zheng H, Liu Y, Wan W, Zhao J, Xie G. Large-scale prediction of stream water quality using an interpretable deep learning approach. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 331:117309. [PMID: 36657204 DOI: 10.1016/j.jenvman.2023.117309] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 01/13/2023] [Accepted: 01/14/2023] [Indexed: 06/17/2023]
Abstract
Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3-N), TN, TP, and turbidity in the stream water in the case area, respectively.
Collapse
Affiliation(s)
- Hang Zheng
- School of Environment and Civil Engineering, Dongguan University of Technology, Dongguan, 523808, China
| | - Yueyi Liu
- School of Environment and Civil Engineering, Dongguan University of Technology, Dongguan, 523808, China
| | - Wenhua Wan
- School of Environment and Civil Engineering, Dongguan University of Technology, Dongguan, 523808, China
| | - Jianshi Zhao
- Department of Hydraulic Engineering, Tsinghua University, Beijing, 100084, China
| | - Guanti Xie
- Dongguan Shigu Sewage Treatment Co., Ltd., Dongguan, 523808, China
| |
Collapse
|
31
|
Dimple, Singh PK, Rajput J, Kumar D, Gaddikeri V, Elbeltagi A. Combination of discretization regression with data-driven algorithms for modeling irrigation water quality indices. ECOL INFORM 2023. [DOI: 10.1016/j.ecoinf.2023.102093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2023]
|
32
|
Jia Z, Wang J, Liu X, Yan Z, Bai X, Zhou X, He X, Hou J. Sediment diffusion is feasible to simultaneously reduce nitrate discharge from recirculating aquaculture system and ammonium release from sediments in receiving intensive aquaculture pond. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 858:160017. [PMID: 36370792 DOI: 10.1016/j.scitotenv.2022.160017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 11/02/2022] [Accepted: 11/03/2022] [Indexed: 06/16/2023]
Abstract
Nitrogen accumulation has become one of the greatest unresolved challenges restricting the development of aquaculture worldwide. In recirculating aquaculture system (RAS), lack of organic matter (OM) and sensitive organisms makes it difficult to apply efficient denitrifying technology, thus leading to a high nitrate‑nitrogen (NO3--N) accumulation. In contrast, excess OM accumulation in intensive aquaculture pond sediments is associated with dissolved oxygen depletion and ammonium‑nitrogen (NH4+-N) accumulation in the sediments. Based on the opposing effects of OM on the nitrogen accumulation in RAS and intensive aquaculture ponds, this study assessed the feasibility of simultaneously reducing NO3--N discharge from RAS and controlling NH4+-N accumulation in intensive aquaculture ponds by in situ diffusing RAS tailwater containing NO3--N into intensive aquaculture pond sediments. The results showed that NO3--N diffusion strategy improved the native sediment denitrification capacity, thus increasing NO3--N removal efficiency from RAS tailwater and significantly decreasing the NH4+-N concentration in interstitial water and the total organic carbon content in intensive aquaculture pond sediments. High-throughput sequencing and quantitative real-time polymerase chain reaction (qPCR) results revealed that NO3--N addition significantly increased both nitrifying bacteria and denitrifying bacteria abundance. These results implied that NO3--N diffusion strategy could effectively stimulate microbial decomposition of OM, thus relieving the hypoxia limitation of sediment nitrification. Overall, this study offers a feasible method for simultaneous reduction of NO3--N from RAS tailwater and NH4+-N in intensive aquaculture ponds with low cost and high efficiency.
Collapse
Affiliation(s)
- Zhiming Jia
- College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
| | - Jie Wang
- College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
| | - Xueyu Liu
- College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
| | - Zuting Yan
- State key laboratory of Freshwater Ecology and Biotechnology, Key Laboratory of Algal Biology, Institute of Hydrobiology, the Chinese Academy of Sciences, Wuhan 430072, China
| | - Xuelan Bai
- College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
| | - Xiaodi Zhou
- College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China
| | - Xugang He
- College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China; Engineering Research Center of Green Development for Conventional Aquatic Biological Industry in the Yangtze River Economic Belt, Ministry of Education, Wuhan 430070, China; Freshwater Aquaculture Collaborative Innovation Center of Hubei Province, Wuhan 430070, China.
| | - Jie Hou
- College of Fisheries, Huazhong Agricultural University, Wuhan 430070, China; Engineering Research Center of Green Development for Conventional Aquatic Biological Industry in the Yangtze River Economic Belt, Ministry of Education, Wuhan 430070, China; Freshwater Aquaculture Collaborative Innovation Center of Hubei Province, Wuhan 430070, China.
| |
Collapse
|
33
|
Jadhav AR, Pathak PD, Raut RY. Water and wastewater quality prediction: current trends and challenges in the implementation of artificial neural network. ENVIRONMENTAL MONITORING AND ASSESSMENT 2023; 195:321. [PMID: 36689041 DOI: 10.1007/s10661-022-10904-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 12/28/2022] [Indexed: 06/17/2023]
Abstract
Traditional freshwater supplies have been over-abstracted in the current global problem of water scarcity. Through the analysis of complex experimental and real-time data, to improve the activity of water and wastewater treatment (WWT) systems, an artificial neural network (ANN), a computational model inspired by the human brain, and its variants were created. This review paper focuses on recent trends and advances in modeling and simulating different water and wastewater systems using ANN. This study uses ANN in watershed management, impurity removal from wastewater, and wastewater treatment plants. According to the literature review, ANN can predict nonlinear, linear, and complex systems with high accuracy and well control. Finally, the limitations and future scope of ANNs were discussed. ANN proved itself in the prediction of various water and WWT processes. Still, implementation has practical challenges, which include a lack of data availability, poorly built models, timely updates in developed models, and low repeatability. The use of a proper toolbox, faster computing power, and proper domain knowledge makes the practical implementation of ANN successful. As a result, ANN can build a solid foundation for attracting and motivating investigators to work in this region in the forthcoming.
Collapse
Affiliation(s)
| | - Pranav D Pathak
- MIT School of Bioengineering Sciences & Research, MIT-Art, Design and Technology University, Pune, Maharashtra, India.
| | | |
Collapse
|
34
|
Mallya G, Hantush MM, Govindaraju RS. A Machine Learning Approach to Predict Watershed Health Indices for Sediments and Nutrients at Ungauged Basins. WATER 2023; 15:1-23. [PMID: 37309416 PMCID: PMC10259765 DOI: 10.3390/w15030586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Effective water quality management and reliable environmental modeling depend on the availability, size, and quality of water quality (WQ) data. Observed stream water quality data are usuallEEy sparse in both time and space. Reconstruction of water quality time series using surrogate variables such as streamflow have been used to evaluate risk metrics such as reliability, resilience, vulnerability, and watershed health (WH) but only at gauged locations. Estimating these indices for ungauged watersheds has not been attempted because of the high-dimensional nature of the potential predictor space. In this study, machine learning (ML) models, namely random forest regression, AdaBoost, gradient boosting machines, and Bayesian ridge regression (along with an ensemble model), were evaluated to predict watershed health and other risk metrics at ungauged hydrologic unit code 10 (HUC-10) basins using watershed attributes, long-term climate data, soil data, land use and land cover data, fertilizer sales data, and geographic information as predictor variables. These ML models were tested over the Upper Mississippi River Basin, the Ohio River Basin, and the Maumee River Basin for water quality constituents such as suspended sediment concentration, nitrogen, and phosphorus. Random forest, AdaBoost, and gradient boosting regressors typically showed a coefficient of determination R 2 > 0.8 for suspended sediment concentration and nitrogen during the testing stage, while the ensemble model exhibited R 2 > 0.95 . Watershed health values with respect to suspended sediments and nitrogen predicted by all ML models including the ensemble model were lower for areas with larger agricultural land use, moderate for areas with predominant urban land use, and higher for forested areas; the trained ML models adequately predicted WH in ungauged basins. However, low WH values (with respect to phosphorus) were predicted at some basins in the Upper Mississippi River Basin that had dominant forest land use. Results suggest that the proposed ML models provide robust estimates at ungauged locations when sufficient training data are available for a WQ constituent. ML models may be used as quick screening tools by decision makers and water quality monitoring agencies for identifying critical source areas or hotspots with respect to different water quality constituents, even for ungauged watersheds.
Collapse
Affiliation(s)
- Ganeshchandra Mallya
- Lyles School of Civil Engineering, Purdue University, West Lafayette, IN 47907, USA
| | - Mohamed M Hantush
- U.S. EPA Center for Environmental Solutions and Emergency Response, 26 West Martin Luther King Dr., Cincinnati, OH 45268, USA
| | - Rao S Govindaraju
- Lyles School of Civil Engineering, Purdue University, West Lafayette, IN 47907, USA
| |
Collapse
|
35
|
Zhu X, Guo H, Huang JJ, Tian S, Xu W, Mai Y. An ensemble machine learning model for water quality estimation in coastal area based on remote sensing imagery. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2022; 323:116187. [PMID: 36261960 DOI: 10.1016/j.jenvman.2022.116187] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 09/01/2022] [Accepted: 09/02/2022] [Indexed: 06/16/2023]
Abstract
The accurate estimation of coastal water quality parameters (WQPs) is crucial for decision-makers to manage water resources. Although various machine learning (ML) models have been developed for coastal water quality estimation using remote sensing data, the performance of these models has significant uncertainties when applied to regional scales. To address this issue, an ensemble ML-based model was developed in this study. The ensemble ML model was applied to estimate chlorophyll-a (Chla), turbidity, and dissolved oxygen (DO) based on Sentinel-2 satellite images in Shenzhen Bay, China. The optimal input features for each WQP were selected from eight spectral bands and seven spectral indices. A local explanation strategy termed Shapley Additive Explanations (SHAP) was employed to quantify contributions of each feature to model outputs. In addition, the impacts of three climate factors on the variation of each WQP were analyzed. The results suggested that the ensemble ML models have satisfied performance for Chla (errors = 1.7%), turbidity (errors = 1.5%) and DO estimation (errors = 0.02%). Band 3 (B3) has the highest positive contribution to Chla estimation, while Band Ration Index2 (BR2) has the highest negative contribution to turbidity estimation, and Band 7 (B7) has the highest positive contribution to DO estimation. The spatial patterns of the three WQPs revealed that the water quality deterioration in Shenzhen Bay was mainly influenced by input of terrestrial pollutants from the estuary. Correlation analysis demonstrated that air temperature (Temp) and average air pressure (AAP) exhibited the closest relationship with Chla. DO showed the strongest negative correlation with Temp, while turbidity was not sensitive to Temp, average wind speed (AWS), and AAP. Overall, the ensemble ML model proposed in this study provides an accurate and practical method for long-term Chla, turbidity, and DO estimation in coastal waters.
Collapse
Affiliation(s)
- Xiaotong Zhu
- College of Environmental Science and Engineering/Sino-Canada Joint R&D Centre for Water and Environment Safety,Nankai University, Tianjin, 300071, PR China
| | - Hongwei Guo
- College of Environmental Science and Engineering/Sino-Canada Joint R&D Centre for Water and Environment Safety,Nankai University, Tianjin, 300071, PR China
| | - Jinhui Jeanne Huang
- College of Environmental Science and Engineering/Sino-Canada Joint R&D Centre for Water and Environment Safety,Nankai University, Tianjin, 300071, PR China.
| | - Shang Tian
- College of Environmental Science and Engineering/Sino-Canada Joint R&D Centre for Water and Environment Safety,Nankai University, Tianjin, 300071, PR China
| | - Wang Xu
- Shenzhen Environmental Monitoring Center, Shenzhen, Guangdong, 518049, PR China
| | - Youquan Mai
- Shenzhen Environmental Monitoring Center, Shenzhen, Guangdong, 518049, PR China
| |
Collapse
|
36
|
Sahoo DP, Sahoo B, Tiwari MK, Behera GK. Integrated remote sensing and machine learning tools for estimating ecological flow regimes in tropical river reaches. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2022; 322:116121. [PMID: 36070653 DOI: 10.1016/j.jenvman.2022.116121] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 08/12/2022] [Accepted: 08/24/2022] [Indexed: 06/15/2023]
Abstract
With the gradual declining streamflow gauging stations in many world-rivers, emphasis is given nowadays to develop remote sensing (RS)-based approaches as the next-generation hydrometry for estimating riverine ecological flow regimes (EFR). For constructing EFR based on daily-streamflow data in scantily-gauged reaches, use of RS techniques in narrow flow-width tropical rain-fed rivers is constrained with the non-availability of finer spatial satellite data at daily scale. To address these limitations, this study proposes a novel framework that integrates the enhanced spatiotemporal adaptive reflectance fusion (FUS) of the 250 m × 1-day resolution Aqua-MODIS and 30 m × 1-day resolution Landsat satellite-based remote sensing images in the near-infrared region with the machine learning algorithms. These developed frameworks are named as Artificial Neural Network-based ANNFUS, Random Forest Regression-based RFRFUS, and Support Vector Regression-based SVRFUS models, which were tested for daily-scale streamflow estimation in a typical Brahmani River Basin, India. The results reveal that by addressing the linear and nonlinear dynamism between the streamflow and satellite signals, all the developed models could simulate the streamflow very well with the Nash-Sutcliffe efficiency>0.8, Kling-Gupta efficiency>0.8, relative root mean square error (rRMSE) of 0.051-0.12, and normalized RMSE of 0.23-0.36. However, for reproducing the high, median, and low streamflow regimes, the SVRFUS model was found to be the best with the NSE>0.85 and KGE>0.8. Conclusively, the proposed approach is found to have the potential to be replicated in other world-river basins to estimate ecological flow regimes at defunct gauging stations facilitating the basin-scale aquatic environmental management.
Collapse
Affiliation(s)
- Debi Prasad Sahoo
- Research Scholar, School of Water Resources, Indian Institute of Technology Kharagpur, West Bengal-721302, India.
| | - Bhabagrahi Sahoo
- Associate Professor, School of Water Resources, Indian Institute of Technology Kharagpur, West Bengal-721302, India.
| | - Manoj Kumar Tiwari
- Assistant Professor, School of Water Resources, Indian Institute of Technology Kharagpur, West Bengal-721302, India.
| | - Goutam Kumar Behera
- Under Graduate Student, Department of Metallurgical and Materials Engineering, Indian Institute of Technology Kharagpur, West Bengal-721302, India.
| |
Collapse
|
37
|
Xia L, Han Q, Shang L, Wang Y, Li X, Zhang J, Yang T, Liu J, Liu L. Quality assessment and prediction of municipal drinking water using water quality index and artificial neural network: A case study of Wuhan, central China, from 2013 to 2019. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 844:157096. [PMID: 35779730 DOI: 10.1016/j.scitotenv.2022.157096] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2022] [Revised: 06/26/2022] [Accepted: 06/27/2022] [Indexed: 06/15/2023]
Abstract
The sanitary security of drinking water is closely related to human health, but its quality assessment mainly focused on limited types of indicators and relatively restricted time span. The current study was aimed to evaluate the long-term spatial-temporal distribution of municipal drinking water quality and explore the origin of water contamination based on multiple water indicators of 137 finished water samples and 863 tap water samples from Wuhan city, China. Water quality indexes (WQIs) were calculated to integrate the measured indicators. WQIs of the finished water samples ranged from 0.24 to 0.92, with the qualification rate and excellent rate of 100 % and 96.4 %, respectively, while those of the tap water samples ranged from 0.09 to 3.20, with the qualification rate of 99.9 %, and excellent rate of 95.5 %. Artificial neural network model was constructed based on the time series of WQIs from 2013 to 2019 to predict the water quality thereafter. The predicted WQIs of finished and tap water in 2020 and 2021 qualified on the whole, with the excellent rate of 87.5 % and 92.9 %, respectively. Except for three samples exceeding the limits of free chlorine residual, chloroform and fluoride, respectively, the majority of indicators reached the threshold values for drinking. Our study suggested that municipal drinking water quality in Wuhan was generally stable and in line with the national hygiene standards. Moreover, principal component analysis illustrated that the main potential sources of drinking water contamination were inorganic salts and organic matters, followed by pollution from distribution systems, the use of aluminum-containing coagulants and turbidity involved in water treatment, which need more attention.
Collapse
Affiliation(s)
- Lu Xia
- Department of Epidemiology and Biostatistics, Ministry of Education Key Lab of Environment and Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030, PR China
| | - Qing Han
- Wuhan Center for Disease Control and Prevention, Wuhan, Hubei 430024, PR China
| | - Lv Shang
- Wuhan Center for Disease Control and Prevention, Wuhan, Hubei 430024, PR China
| | - Yao Wang
- Wuhan Center for Disease Control and Prevention, Wuhan, Hubei 430024, PR China
| | - Xinying Li
- Department of Epidemiology and Biostatistics, Ministry of Education Key Lab of Environment and Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030, PR China
| | - Jia Zhang
- Department of Epidemiology and Biostatistics, Ministry of Education Key Lab of Environment and Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030, PR China
| | - Tingting Yang
- Department of Epidemiology and Biostatistics, Ministry of Education Key Lab of Environment and Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030, PR China
| | - Junling Liu
- Wuhan Center for Disease Control and Prevention, Wuhan, Hubei 430024, PR China.
| | - Li Liu
- Department of Epidemiology and Biostatistics, Ministry of Education Key Lab of Environment and Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030, PR China.
| |
Collapse
|
38
|
Predicting Irrigation Water Quality Indices Based on Data-Driven Algorithms: Case Study in Semiarid Environment. J CHEM-NY 2022. [DOI: 10.1155/2022/4488446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Ascertaining water quality for irrigational use by employing conventional methods is often time taking and expensive due to the determination of multiple parameters needed, especially in developing countries. Therefore, constructing precise and adequate models may be beneficial in resolving this problem in agricultural water management to determine the suitable water quality classes for optimal crop yield production. To achieve this objective, five machine learning (ML) models, namely linear regression (LR), random subspace (RSS), additive regression (AR), reduced error pruning tree (REPTree), and support vector machine (SVM), have been developed and tested for predicting of six irrigation water quality (IWQ) indices such as sodium adsorption ratio (SAR), percent sodium (%Na), permeability index (PI), Kelly ratio (KR), soluble sodium percentage (SSP), and magnesium hazards (MH) in groundwater of the Nand Samand catchment of Rajasthan. The accuracy of these models was determined serially using the mean squared error (MSE), correlation coefficients (r), mean absolute error (MAE), and root mean square error (RMSE). The SVM model showed the best-fit model for all irrigation indices during testing, that is, RMSE: 0.0662, 4.0568, 3.0168, 0.1113, 3.7046, and 5.1066; r: 0.9364, 0.9618, 0.9588, 0.9819, 0.9547, and 0.8903; MSE: 0.004381, 16.45781, 9.101218, 0.012383, 13.72447, and 26.078; MAE: 0.042, 3.1999, 2.3584, 0.0726, 2.9603, and 4.0582 for KR, MH, SSP, SAR, %Na, and PI, respectively. The KR and SAR values were predicted accurately by the SVM model in comparison to the observed values. As a result, machine learning algorithms can improve irrigation water quality characteristics, which is critical for farmers and crop management in various irrigation procedures. Additionally, the findings of this research suggest that ML models are effective tools for reliably predicting groundwater quality using general water quality parameters that may be acquired directly on periodical basis. Assessment of water quality indices may also help in deriving optimal strategies to utilise inferior quality water conjunctively with fresh water resources in the water-limited areas.
Collapse
|
39
|
Inversion and Driving Force Analysis of Nutrient Concentrations in the Ecosystem of the Shenzhen-Hong Kong Bay Area. REMOTE SENSING 2022. [DOI: 10.3390/rs14153694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Although satellite remote sensing technology is intensively used for the monitoring of water quality, the inversion of coastal water bodies and non-optically active parameters is still a challenging issue. Few ongoing studies use remote sensing technology to analyze the driving forces of changes in water quality from multiple aspects based on inversion results. By the use of Landsat 5/8 imagery and measured in situ data of the total nitrogen (TN) and total phosphorus (TP) in the Shenzhen-Hong Kong Bay area from 1986 to 2020, this study evaluated the modeling effects of four machine learning methods named Tree Embedding (TE), Support Vector Regression (SVR), Gaussian Process Regression (GPR), and Back-propagation Neural Network (BPNN). The results show that the BPNN creates the most reliable and robust results. The values of the obtained correlation coefficients (r) are 0.83, 0.92, 0.84, and 0.90, and that of the coefficients of determination (R2) are 0.70, 0.84, 0.70, and 0.81. The calculated mean absolute errors (MAEs) are 0.41, 0.16, 0.06, and 0.02, while the root mean square errors (RMSEs) are 0.78, 0.29, 0.12, and 0.03. The concentrations of TN and TP (CTN, CTP) in the Shenzhen Bay, the Starling Inlet, and the Tolo Harbor were relatively high, fluctuated from 1986 to 2010, and decreased significantly after 2010. The CTN and CTP in the Mirs Bay kept continuously at a low level. We found that urbanization and polluted river discharges were the main drivers of spatial and inter-annual differences of CTN and CTP. Temperature, precipitation, and wind are further factors that influenced the intra-annual changes of CTN and CTP in the Shenzhen Bay, whilethe expansion of oyster rafts and mangroves had little effect. Our research confirms that machine learning algorithms are well suited for the inversion of non-optical activity parameters of coastal water bodies, and also shows the potential of remote sensing for large-scale, long-term monitoring of water quality and the subsequent comprehensive analysis of the driving forces.
Collapse
|
40
|
Mori M, Gonzalez Flores R, Suzuki Y, Nukazawa K, Hiraoka T, Nonaka H. Prediction of Microcystis Occurrences and Analysis Using Machine Learning in High-Dimension, Low-Sample-Size and Imbalanced Water Quality Data. HARMFUL ALGAE 2022; 117:102273. [PMID: 35944960 DOI: 10.1016/j.hal.2022.102273] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 05/20/2022] [Accepted: 06/06/2022] [Indexed: 06/15/2023]
Abstract
Machine learning, Deep learning, and water quality data have been used in recent years to predict the outbreak of harmful algae, especially Microcystis, and analyze outbreak causes. However, for various reasons, water quality data are often High-Dimension, Low-Sample- Size (HDLSS), meaning the sample size is lower than the number of dimensions. Moreover, imbalance problems may arise due to bias in the occurrence frequency of Microcystis. These problems make predicting the occurrence of Microcystis and analyzing its causes with machine learning difficult. In this study, a machine learning model that applies Feature Engineering (FE) and Feature Selection (FS) algorithms are used to predict outbreaks of Microcystis and analyze the outbreak factors from imbalanced HDLSS water quality data. The prediction performance was verified with binary classification to determine whether Microcystis would occur in the future by applying three machine learning models to four data patterns. The cause analysis of Microcystis occurrence was performed by visualizing the results of applying FE and FS. For the test data, the predictive performance of FE and FS methods was significantly better than that of the conventional method, with an accuracy of .108 points and an F-value of .691 points higher than the conventional method. A prediction performance increase was observed with a smaller model capacity. Data-driven analysis suggested that total nitrogen, chemical oxygen demand, chlorophyll-a, dissolved oxygen saturation, and water temperature are associated with Microcystis occurrences. The results also indicated that basic statistics of the water quality distribution (especially mean, standard deviation, and skewness) over a year, not the concentrations of water components, are related to the occurrence of Microcystis. These are new findings not found in previous studies and are expected to contribute significantly to future studies of algae. This study provides a method for analyzing water quality data with high-dimensionality and small samples, imbalance problems, or both.
Collapse
Affiliation(s)
- Masaya Mori
- Nagaoka University of Technology, 1603-1, Kamitomioka, Nagaoka-shi, 940-2188, Niigata, Japan.
| | - Roberto Gonzalez Flores
- Nagaoka University of Technology, 1603-1, Kamitomioka, Nagaoka-shi, 940-2188, Niigata, Japan
| | - Yoshihiro Suzuki
- University of Miyazaki, 1-1, Gakuen Kibanadai-nishi, Miyazaki-shi, 889-2192, Miyazaki, Japan
| | - Kei Nukazawa
- University of Miyazaki, 1-1, Gakuen Kibanadai-nishi, Miyazaki-shi, 889-2192, Miyazaki, Japan
| | - Toru Hiraoka
- University of Nagasaki, 1-1-1, Manabino, Nagayo, Nishi-Sonogi, 851-2130, Nagasaki, Japan
| | - Hirofumi Nonaka
- Nagaoka University of Technology, 1603-1, Kamitomioka, Nagaoka-shi, 940-2188, Niigata, Japan
| |
Collapse
|
41
|
A Review of Hybrid Soft Computing and Data Pre-Processing Techniques to Forecast Freshwater Quality’s Parameters: Current Trends and Future Directions. ENVIRONMENTS 2022. [DOI: 10.3390/environments9070085] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Water quality has a significant influence on human health. As a result, water quality parameter modelling is one of the most challenging problems in the water sector. Therefore, the major factor in choosing an appropriate prediction model is accuracy. This research aims to analyse hybrid techniques and pre-processing data methods in freshwater quality modelling and forecasting. Hybrid approaches have generally been seen as a potential way of improving the accuracy of water quality modelling and forecasting compared with individual models. Consequently, recent studies have focused on using hybrid models to enhance forecasting accuracy. The modelling of dissolved oxygen is receiving more attention. From a review of relevant articles, it is clear that hybrid techniques are viable and precise methods for water quality prediction. Additionally, this paper presents future research directions to help researchers predict freshwater quality variables.
Collapse
|
42
|
Liu M, He J, Huang Y, Tang T, Hu J, Xiao X. Algal bloom forecasting with time-frequency analysis: A hybrid deep learning approach. WATER RESEARCH 2022; 219:118591. [PMID: 35598469 DOI: 10.1016/j.watres.2022.118591] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 04/30/2022] [Accepted: 05/11/2022] [Indexed: 06/15/2023]
Abstract
The rapid emergence of deep learning long-short-term-memory (LSTM) technique presents a promising solution to algal bloom forecasting. However, the discontinuous and non-stationary processes within algal dynamics still largely limit the functions of LSTMs. To overcome this challenge, an advanced time-frequency wavelet analysis (WA) technique was introduced to enhance the prediction accuracy of LSTMs. Herein, the novel hybrid approach (named WLSTM) successfully decreased the algal forecasting inaccuracy of classic LSTMs by 41% ± 8% in Lake Mendota (Wisconsin, USA), with powerful one-step-ahead predictions at hourly, daily, and monthly time resolutions (R2 = 0.976, 0.878, and 0.814, respectively). In addition, the WLSTM outperformed the other two widely used algal forecasting approaches - deep neural network (DNN), and autoregressive-integrated-moving-average (ARIMA) model, represented by average 72% and 85% decrease in root-mean-square-error, respectively. Furthermore, the WLSTM was implemented in an experimentally fertilized lake (Lake Tuesday, Michigan) for a multi-step forecasting examination. It satisfactorily forecasted the algal fluctuations involving substantial peak and extreme values (average R2 > 0.900) and presented accurate judgment outcomes to their bloom levels with high accuracy > 95% on average. This work highlighted the utility of deep learning approaches in effective early-warning for algal blooms, and demonstrated an important direction for improving the adaptability of conventional deep learning approaches to the aquatic problems.
Collapse
Affiliation(s)
- Muyuan Liu
- Ocean College, Zhejiang University, #1 Zheda Road, Zhoushan, Zhejiang 316021, China
| | - Junyu He
- Ocean College, Zhejiang University, #1 Zheda Road, Zhoushan, Zhejiang 316021, China; Ocean Academy, Zhejiang University, #1 Zheda Road, Zhoushan, Zhejiang 316021, China
| | - Yuzhou Huang
- Ocean College, Zhejiang University, #1 Zheda Road, Zhoushan, Zhejiang 316021, China
| | - Tao Tang
- Ocean College, Zhejiang University, #1 Zheda Road, Zhoushan, Zhejiang 316021, China
| | - Jing Hu
- Ocean College, Zhejiang University, #1 Zheda Road, Zhoushan, Zhejiang 316021, China
| | - Xi Xiao
- Ocean College, Zhejiang University, #1 Zheda Road, Zhoushan, Zhejiang 316021, China; Key Laboratory of Watershed Non-point Source Pollution Control and Water Eco-security of Ministry of Water Resources, College of Environmental and Resources Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China.
| |
Collapse
|
43
|
Application of Machine Learning and Process-Based Models for Rainfall-Runoff Simulation in DuPage River Basin, Illinois. HYDROLOGY 2022. [DOI: 10.3390/hydrology9070117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Rainfall-runoff simulation is vital for planning and controlling flood control events. Hydrology modeling using Hydrological Engineering Center—Hydrologic Modeling System (HEC-HMS) is accepted globally for event-based or continuous simulation of the rainfall-runoff operation. Similarly, machine learning is a fast-growing discipline that offers numerous alternatives suitable for hydrology research’s high demands and limitations. Conventional and process-based models such as HEC-HMS are typically created at specific spatiotemporal scales and do not easily fit the diversified and complex input parameters. Therefore, in this research, the effectiveness of Random Forest, a machine learning model, was compared with HEC-HMS for the rainfall-runoff process. Furthermore, we also performed a hydraulic simulation in Hydrological Engineering Center—Geospatial River Analysis System (HEC-RAS) using the input discharge obtained from the Random Forest model. The reliability of the Random Forest model and the HEC-HMS model was evaluated using different statistical indexes. The coefficient of determination (R2), standard deviation ratio (RSR), and normalized root mean square error (NRMSE) were 0.94, 0.23, and 0.17 for the training data and 0.72, 0.56, and 0.26 for the testing data, respectively, for the Random Forest model. Similarly, the R2, RSR, and NRMSE were 0.99, 0.16, and 0.06 for the calibration period and 0.96, 0.35, and 0.10 for the validation period, respectively, for the HEC-HMS model. The Random Forest model slightly underestimated peak discharge values, whereas the HEC-HMS model slightly overestimated the peak discharge value. Statistical index values illustrated the good performance of the Random Forest and HEC-HMS models, which revealed the suitability of both models for hydrology analysis. In addition, the flood depth generated by HEC-RAS using the Random Forest predicted discharge underestimated the flood depth during the peak flooding event. This result proves that HEC-HMS could compensate Random Forest for the peak discharge and flood depth during extreme events. In conclusion, the integrated machine learning and physical-based model can provide more confidence in rainfall-runoff and flood depth prediction.
Collapse
|
44
|
Evaluating the Performance of Water Quality Indices: Application in Surface Water of Lake Union, Washington State-USA. HYDROLOGY 2022. [DOI: 10.3390/hydrology9070116] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Water quality indices (WQIs) are practical and versatile instruments for assessing, organizing, and disseminating information about the overall quality status of surface water bodies. The use of these indices may be beneficial in evaluating aquatic system water quality. The CCME (Canadian Council of Ministers of the Environment) and NSF (National Science Foundation) WQIs were used for the assessment of surface water (depth = 1 m) in Lake Union, Washington State. These WQIs were used in surface water at Lake Union, Seattle. The modified versions of the applied WQIs incorporate a varied number of the investigated parameters. The two WQIs were implemented utilizing specialized, publicly accessible software tools. A comparison of their performance is offered, along with a qualitative assessment of their appropriateness for describing the quality of a surface water body. Practical conclusions were generated and addressed based on the applicability and disadvantages of the evaluated indexes. When compared to the CCME-WQI, it is found that the NSF-WQI is a more robust index that yields a categorization stricter than CCME-WQI.
Collapse
|
45
|
Abstract
In this paper, the quality of a source of drinking water is assessed by measuring eight water quality (WQ) parameters using 710 samples collected from a water-stressed region of India, Jodhpur Rajasthan. The entire sample was divided into ten groups representing different geographic locations. Using American Public Health Association (APHA) specified methodology, eight WQ parameters, viz., pH, total dissolved solids (TDS), total alkalinity (TA), total hardness (TH), calcium hardness (Ca-H), residual chlorine, nitrate (as NO3−), and chloride (Cl−), were selected for describing the water quality for potability use. The quality of each parameter is examined as a function of the zone. Taking the average parametric values of different zones, a unique number was used to describe the overall quality of water. It was found that the average value of each parameter varies significantly with zones. Further, we used neural network (NN) modeling to map the nonlinear relationship between the above eight parametric inputs and the water quality index as the output. It can be observed that the NN designed in the present work acquired sufficient learning and can be satisfactorily used to predict the relational pattern between the input and the output. It can further be observed that the water quality index (WQI) from this work is highly efficient for a successful assessment of water quality in the study area. The major challenge to uniquely describing the drinking water quality lies in understanding the cumulative effect of various parameters affecting the quality of water; the quantified figure is subjected to debate, and this paper addresses the difficulty through a novel approach. The framework presented in this work can be automated with appropriate equipment and shall help government agencies understand changing water quality for better management.
Collapse
|
46
|
Elbeltagi A, Pande CB, Kouadri S, Islam ARMT. Applications of various data-driven models for the prediction of groundwater quality index in the Akot basin, Maharashtra, India. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:17591-17605. [PMID: 34671905 DOI: 10.1007/s11356-021-17064-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Accepted: 10/12/2021] [Indexed: 06/13/2023]
Abstract
Data-driven models are important to predict groundwater quality which is controlling human health. The water quality index (WQI) has been developed based on the physicochemical parameters of water samples. In this area, water quality is medium to poor and is found in saline zones; very high pH ranges are directly affected on the water quality in this study area. Conventional WQI computation demands more time and is often observed with enormous errors during the calculation of sub-indices. In the present work, four standalone methods such as additive regression (AR), M5P tree model (M5P), random subspace (RSS), and support vector machine (SVM) were employed to predict WQI based on variable elimination technique. The groundwater samples were collected from the Akot basin area, located in the Akola district, Maharashtra, in India. A total of nine different input combinations were developed in this study. The datasets were demarcated into two classes (ratio 80:20) for model construction (training dataset) and model verification (testing dataset) using a fivefold cross-validation approach. The models were assessed using statistical and graphical appraisal metrics. The best input combinations varied among the model, generally, the optimal input variables (EC, pH, TDS, Ca, Mg, and Cl) during the training and validation stages. Results show that AR outperformed the other data-driven models (R2 = 0.9993, MAE = 0.5243, RMSE = 0.0.6356, %RAE = 3.8449, and RRSE% = 3.9925). The AR is proposed as an ideal model with satisfactory results due to enhanced prediction precision with the minimum number of input parameters and can thus act as the reliable and precise method in the prediction of WQI at the Akot basin.
Collapse
Affiliation(s)
- Ahmed Elbeltagi
- Agricultural Engineering Department, Faculty of Agriculture, Mansoura University, Mansoura, 35516, Egypt
| | - Chaitanya B Pande
- Sant Gadge Baba Amravati University, Amravati, MS, 444602, India.
- CAAST-CSAWM, MPKV Rahuri, Rahuri, India.
| | - Saber Kouadri
- Laboratory of Water and Environment Engineering in Sahara Milieu (GEEMS), Department of Civil Engineering, Hydraulics Faculty of Applied Sciences, Kasdi Merbah University Ouargla, Ouargla, Algeria
| | | |
Collapse
|
47
|
SmartWater: A Service-Oriented and Sensor Cloud-Based Framework for Smart Monitoring of Water Environments. REMOTE SENSING 2022. [DOI: 10.3390/rs14040922] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Due to the sharp increase in global industrial production, as well as the over-exploitation of land and sea resources, the quality of drinking water has deteriorated considerably. Furthermore, nowadays, many water supply systems serving growing human populations suffer from shortages since many rivers, lakes, and aquifers are drying up because of global climate change. To cope with these serious threats, smart water management systems are in great demand to ensure vigorous control of the quality and quantity of drinking water. Indeed, water monitoring is essential today since it allows to ensure the real-time control of water quality indicators and the appropriate management of resources in cities to provide an adequate water supply to citizens. In this context, a novel IoT-based framework is proposed to support smart water monitoring and management. The proposed framework, named SmartWater, combines cutting-edge technologies in the field of sensor clouds, deep learning, knowledge reasoning, and data processing and analytics. First, knowledge graphs are exploited to model the water network in a semantic and multi-relational manner. Then, incremental network embedding is performed to learn rich representations of water entities, in particular the affected water zones. Finally, a decision mechanism is defined to generate a water management plan depending on the water zones’ current states. A real-world dataset has been used in this study to experimentally validate the major features of the proposed smart water monitoring framework.
Collapse
|
48
|
Liu H, Li J, Cao H, Xie X, Wang Y. Prediction modeling of geogenic iodine contaminated groundwater throughout China. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2022; 303:114249. [PMID: 34891008 DOI: 10.1016/j.jenvman.2021.114249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2021] [Revised: 11/10/2021] [Accepted: 12/04/2021] [Indexed: 06/13/2023]
Abstract
Geogenic iodine-contaminated groundwater represents a threat to public health in China. Identifying high-iodine areas is essential to guide the mitigation of this problem. Considering that traditional analytical techniques for iodine testing are generally time-consuming, laborious, and expensive, alternative methods are needed to supplement and enhance existing approaches. Therefore, we developed an artificial neural network (ANN) model and assessed its feasibility in terms of predicting high iodine levels in groundwater in China. A total of 22 indicators (including climate, topography, geology, and soil properties) and 3185 aggregated samples (measured groundwater iodine concentrations) were utilized to develop the ANN model. The results showed that the accuracy and area under the receiver operating characteristic curve of the model on the test dataset are 90.9% and 0.972, respectively, and climate and soil variables are the most effective predictors. Based on the prediction results, a high-resolution (1-km) nationwide prediction map of high-iodine groundwater was produced. The high-risk areas are mainly concentrated in the central provinces of Henan, Shaanxi, and Shanxi, the eastern provinces of Henan, Shandong, and Hebei, and the northeastern provinces of Liaoning, Jilin, and Heilongjiang. The total number of people estimated to potentially be at high-risk areas because they use untreated high-iodine groundwater as drinking water is approximately 30 million. Considering the growing demand for groundwater in China, this work can guide the prioritization of groundwater contamination mitigation efforts based on regional groundwater quality levels to enhance environmental management.
Collapse
Affiliation(s)
- Hongxing Liu
- School of Environmental Studies & State Key Laboratory of Biogeology and Environmental Geology & Key Laboratory of Source Apportionment and Control of Aquatic Pollution, Ministry of Ecology and Environment, China University of Geosciences, 430074, Wuhan, China
| | - Junxia Li
- School of Environmental Studies & State Key Laboratory of Biogeology and Environmental Geology & Key Laboratory of Source Apportionment and Control of Aquatic Pollution, Ministry of Ecology and Environment, China University of Geosciences, 430074, Wuhan, China.
| | - Hailong Cao
- School of Environmental Studies & State Key Laboratory of Biogeology and Environmental Geology & Key Laboratory of Source Apportionment and Control of Aquatic Pollution, Ministry of Ecology and Environment, China University of Geosciences, 430074, Wuhan, China
| | - Xianjun Xie
- School of Environmental Studies & State Key Laboratory of Biogeology and Environmental Geology & Key Laboratory of Source Apportionment and Control of Aquatic Pollution, Ministry of Ecology and Environment, China University of Geosciences, 430074, Wuhan, China
| | - Yanxin Wang
- School of Environmental Studies & State Key Laboratory of Biogeology and Environmental Geology & Key Laboratory of Source Apportionment and Control of Aquatic Pollution, Ministry of Ecology and Environment, China University of Geosciences, 430074, Wuhan, China
| |
Collapse
|
49
|
Wang X, Bouzembrak Y, Marvin HJP, Clarke D, Butler F. Bayesian Networks modeling of diarrhetic shellfish poisoning in Mytilus edulis harvested in Bantry Bay, Ireland. HARMFUL ALGAE 2022; 112:102171. [PMID: 35144818 DOI: 10.1016/j.hal.2021.102171] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Revised: 12/20/2021] [Accepted: 12/21/2021] [Indexed: 06/14/2023]
Abstract
Diarrhetic Shellfish Poisoning (DSP) results from the human consumption of contaminated shellfish with marine biotoxins, which are produced by some species of marine dinoflagellates, mainly belonging to the genus Dinophysis. Shellfish contamination with marine biotoxins not only pose a threat to human health, but also lead to financial loss to aquaculture operations from the temporary closure of production areas when toxin concentrations exceed regulatory levels. In this study, we developed a Bayesian Network (BN) model for forecasting the short-term variations of DSP toxins in blue mussels (Mytilus edulis) from Bantry Bay, Southwest Ireland. Data inputs to a BN model from 10 production sites in Bantry Bay included plankton cell densities in sea water, DSP toxin concentration in mussels and sea surface temperature. The model was trained with data from 2014 to 2018, and validated with data of 2019. Validation consisted of predicting the DSP toxin concentration at one production site using the model parameters from the other locations as input values. Model validation showed that the prediction accuracy was higher than 86%. Sensitivity analysis indicated that in general, DSP toxin concentration was more relevant than plankton abundance. This initial work has demonstrated the usefulness of BN modeling as an approach to short term forecasting. Further work is ongoing to use the model for scenario testing and to increase the number of environmental parameters used as inputs to the model.
Collapse
Affiliation(s)
- Xiyao Wang
- UCD School of Biosystems and Food Engineering, University College Dublin, Belfield, Dublin 4, Ireland.
| | - Yamine Bouzembrak
- Wageningen Food Safety Research, P.O. Box 230, 6700 AE Wageningen, The Netherlands
| | - Hans J P Marvin
- Wageningen Food Safety Research, P.O. Box 230, 6700 AE Wageningen, The Netherlands
| | - Dave Clarke
- Marine Institute, Rinville, Oranmore, Co. Galway, H91 R673, Ireland
| | - Francis Butler
- UCD School of Biosystems and Food Engineering, University College Dublin, Belfield, Dublin 4, Ireland
| |
Collapse
|
50
|
Necessary Conditions for Successful Application of Intra- and Inter-class Common Vector Classifiers. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2022. [DOI: 10.1007/s13369-021-06509-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|