1
|
Wang P, Zheng T, Hu B, Yin J, Qian J, Guo W, Wang B. Integrating external stressors in supervised machine learning algorithm achieves high accuracy to predict multi-species biological integrity index of aquaculture wastewater. JOURNAL OF HAZARDOUS MATERIALS 2024; 480:136366. [PMID: 39486337 DOI: 10.1016/j.jhazmat.2024.136366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 10/28/2024] [Accepted: 10/29/2024] [Indexed: 11/04/2024]
Abstract
Monitoring and predicting the environmental impact of wastewater is essential for sustainable aquaculture. The environmental DNA metabarcoding-integrated supervised machine learning (SML) algorithm is an alternative method for ecological quality assessment and prediction. However, the ecological integrity of aquaculture wastewater and available effective input features for prediction remain unclear. Here, we used the multispecies biological integrity index (Ms-IBI) to provide a detailed categorization, identifying over half of the samples (53.85 %) as highly impacted, emphasizing the urgency to address ecological degradation; Ms-IBI emerged as a reliable label for SML models. By condensing 410 effective indicators and integrating 25 core operational taxonomic unit features with external stressors, an accuracy of 0.78 and R2 of 0.96 was achieved. Utilizing only external stressors yielded a comparably good performance with fewer input features, obtaining an accuracy of 0.74 and an R2 of 0.91. The integration of external stressors in this study highlights a practical predictive method that meets the ecological quality requirements of aquaculture wastewater, aiding the reversal of global ecological decline.
Collapse
Affiliation(s)
- Peifang Wang
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, China.
| | - Tianming Zheng
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, China
| | - Bin Hu
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, China
| | - Jinbao Yin
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, China
| | - Jin Qian
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, China
| | - Wenzhou Guo
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, China
| | - Beibei Wang
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing 210098, China
| |
Collapse
|
2
|
Wang K, Liu L, Ben X, Jin D, Zhu Y, Wang F. Hybrid deep learning based prediction for water quality of plain watershed. ENVIRONMENTAL RESEARCH 2024; 262:119911. [PMID: 39233036 DOI: 10.1016/j.envres.2024.119911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 08/30/2024] [Accepted: 08/31/2024] [Indexed: 09/06/2024]
Abstract
Establishing a highly reliable and accurate water quality prediction model is critical for effective water environment management. However, enhancing the performance of these predictive models continues to pose challenges, especially in the plain watershed with complex hydraulic conditions. This study aims to evaluate the efficacy of three traditional machine learning models versus three deep learning models in predicting the water quality of plain river networks and to develop a novel hybrid deep learning model to further improve prediction accuracy. The performance of the proposed model was assessed under various input feature sets and data temporal frequencies. The findings indicated that deep learning models outperformed traditional machine learning models in handling complex time series data. Long Short-Term Memory (LSTM) models improved the R2 by approximately 29% and lowered the Root Mean Square Error (RMSE) by about 48.6% on average. The hybrid Bayes-LSTM-GRU (Gated Recurrent Unit) model significantly enhanced prediction accuracy, reducing the average RMSE by 18.1% compared to the single LSTM model. Models trained on feature-selected datasets exhibited superior performance compared to those trained on original datasets. Higher temporal frequencies of input data generally provide more useful information. However, in datasets with numerous abrupt changes, increasing the temporal interval proves beneficial. Overall, the proposed hybrid deep learning model demonstrates an efficient and cost-effective method for improving water quality prediction performance, showing significant potential for application in managing water quality in plain watershed.
Collapse
Affiliation(s)
- Kefan Wang
- College of Environmental & Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Lei Liu
- College of Environmental & Resource Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Xuechen Ben
- Zhejiang Zone-King Environmental Sci&Tech Co. Ltd., Hangzhou, 310064, China
| | - Danjun Jin
- Zhejiang Zone-King Environmental Sci&Tech Co. Ltd., Hangzhou, 310064, China
| | - Yao Zhu
- Taizhou Ecology and Environment Bureau Wenling Branch, Wenling, Zhejiang, 317599, China
| | - Feier Wang
- College of Environmental & Resource Sciences, Zhejiang University, Hangzhou, 310058, China; Zhejiang Ecological Civilization Academy, Anji, Zhejiang, 313300, China.
| |
Collapse
|
3
|
Huang Y, Chen S, Tang X, Sun C, Zhang Z, Huang J. Dynamic patterns and potential drivers of river water quality in a coastal city: Insights from a machine-learning-based framework and water management. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 370:122911. [PMID: 39405891 DOI: 10.1016/j.jenvman.2024.122911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 09/18/2024] [Accepted: 10/10/2024] [Indexed: 11/17/2024]
Abstract
River water quality continues to deteriorate under the coupled effects of climate change and human activities. Machine learning (ML) is a promising approach for analyzing water quality. Nevertheless, the spatiotemporal dynamics of river water quality and their potential mechanisms in changing environments remain incomprehensively understood through available ML-based researches. Here, we developed a ML-based framework integrating a self-organizing map (SOM) model with a random forest (RF) model. This framework was applied to simultaneously investigate the spatiotemporal patterns and potential drivers of river permanganate (CODMn), ammonia nitrogen (NH3-N), and total phosphorus (TP) dynamics across 34 sites from 2010 to 2020 in a coastal city threatened by deteriorating water environment in southeastern China. The sites were divided into two clusters in the spatial context with different water quality conditions. The year of 2015 for NH3-N and 2018 for CODMn and TP were identified as the key turning points of water quality variations. Features including sewage discharge, population dynamics, percentage of cultivated land, and fertilizer application contributed greatly to water quality deterioration. The increase in forest vegetation reflected by percentage of forest and leaf area index may improve water quality. The ML-based modeling framework demonstrated a promising way to address the spatiotemporal dynamics of river water quality, and provided insights for water management in a coastal city with intensifying human-nature interactions.
Collapse
Affiliation(s)
- Yicheng Huang
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, 361102, Xiamen, China
| | - Shengyue Chen
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, 361102, Xiamen, China
| | - Xi Tang
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, 361102, Xiamen, China
| | - Changyang Sun
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, 361102, Xiamen, China
| | - Zhenyu Zhang
- School of Geographical Sciences, Fujian Normal University, Fuzhou, 350007, China
| | - Jinliang Huang
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, 361102, Xiamen, China.
| |
Collapse
|
4
|
Huang S, Wang Y, Xia J. Which riverine water quality parameters can be predicted by meteorologically-driven deep learning? THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 946:174357. [PMID: 38945234 DOI: 10.1016/j.scitotenv.2024.174357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 06/26/2024] [Accepted: 06/27/2024] [Indexed: 07/02/2024]
Abstract
River water quality has been significantly impacted by climate change and extreme weather events worldwide. Despite increasing studies on deep learning techniques for river water quality management, understanding which riverine water quality parameters can be well predicted by meteorologically-driven deep learning still requires further investigation. Here we explored the prediction performance of a traditional Recurrent Neural Network, a Long Short-Term Memory network (LSTM), and a Gated Recurrent Unit (GRU) using meteorological conditions as inputs in the Dahei River basin. We found that deep learning models (i.e., LSTM and GRU) demonstrated remarkable effectiveness in predicting multiple water quality parameters at daily scale, including water temperature, dissolved oxygen, electrical conductivity, chemical oxygen demand, ammonia nitrogen, total phosphorous, and total nitrogen, but not turbidity. The GRU model performed best with an average determination coefficient of 0.94. Compared to the daily-average prediction, the GRU model exhibited limited error increment of 10-40 % for most water quality parameters when predicting daily extreme values (i.e., the maximum and minimum). Moreover, deep learning showed superior performance in collective prediction for multiple water quality parameters than individual ones, enabling a more comprehensive understanding of the river water quality dynamics from meteorological data. This study holds the promise of applying meteorologically-driven deep learning techniques for water quality prediction to a broader range of watersheds, particularly in chemically ungauged areas.
Collapse
Affiliation(s)
- Sheng Huang
- State Key Laboratory of Water Resources Engineering and Management, Wuhan University, Wuhan 430072, China
| | - Yueling Wang
- Key Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China.
| | - Jun Xia
- State Key Laboratory of Water Resources Engineering and Management, Wuhan University, Wuhan 430072, China; Key Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
5
|
Lin Z, Lim JY, Oh JM. Innovative interpretable AI-guided water quality evaluation with risk adversarial analysis in river streams considering spatial-temporal effects. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2024; 350:124015. [PMID: 38657892 DOI: 10.1016/j.envpol.2024.124015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 04/17/2024] [Accepted: 04/18/2024] [Indexed: 04/26/2024]
Abstract
Water security remains a critical issue given the looming threats of industrial pollution, necessitating comprehensive assessments of water quality to address seasonal fluctuations and influential factors while formulating effective strategies for decision makers. This study introduces a novel approach for evaluating water quality within a complex riverine zone in South Korea: Han River that encompasses five river streams situated at each junction of North and South streams (including Gyeongan Stream) that ultimately leading towards Paldang Lake. By utilizing the monthly water characteristic data from the year 2013-2022 across 14 different locations, the significant seasonal trends and potential influences on water quality are identified. The water quality here is calculated with the proposed method of sub-index water quality index (s-WQI). A combinatorial prediction approach of s-WQI for each location is conducted through a collective of data preprocessing approaches including Hampel filtering and feature selection in prior to the machine learning predictions. In return, light gradient boosting (LGB) is the most accurate predictor by outperforming other prediction algorithms, especially through LGB-Pearson and LGB-Spearman combinations for North and South stream intersections, and LGB-Pearson for Paldang Lake. To further evaluate the robustness of this evaluation and extending the results to a foreseeable scenario, a seasonal based Monte-Carlo Simulation with 10,000 attempts targeting the water characteristic distributions obtained from each location considered are carried out to identify the risk bounds within. The results are further interpreted with SHAP analysis on identifying the contributions of each water characteristics towards the water quality through local and global spectrum. This research yields practical implications, offering tailored strategies for water quality enhancement and early warning systems. The integration of AI-based prediction and feature selection underscores the transformative potential of computational techniques in advancing data-driven water quality assessments, shaping the future of environmental science research.
Collapse
Affiliation(s)
- ZiYu Lin
- Department of Environmental Science and Engineering, Kyung Hee University, Yongin-si, 17104, Gyeonggi, Republic of Korea
| | - Juin Yau Lim
- Korea Biochar Research Center & APRU Sustainable Waste Management Program & Division of Environmental Science and Ecological Engineering, Korea University, Seoul, 02841, Republic of Korea; School of Business Administration, Korea University, Seoul, 02841, Republic of Korea
| | - Jong-Min Oh
- Department of Environmental Science and Engineering, Kyung Hee University, Yongin-si, 17104, Gyeonggi, Republic of Korea.
| |
Collapse
|
6
|
Zhi W, Appling AP, Golden HE, Podgorski J, Li L. Deep learning for water quality. NATURE WATER 2024; 2:228-241. [PMID: 38846520 PMCID: PMC11151732 DOI: 10.1038/s44221-024-00202-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 01/10/2024] [Indexed: 06/09/2024]
Abstract
Understanding and predicting the quality of inland waters are challenging, particularly in the context of intensifying climate extremes expected in the future. These challenges arise partly due to complex processes that regulate water quality, and arduous and expensive data collection that exacerbate the issue of data scarcity. Traditional process-based and statistical models often fall short in predicting water quality. In this Review, we posit that deep learning represents an underutilized yet promising approach that can unravel intricate structures and relationships in high-dimensional data. We demonstrate that deep learning methods can help address data scarcity by filling temporal and spatial gaps and aid in formulating and testing hypotheses via identifying influential drivers of water quality. This Review highlights the strengths and limitations of deep learning methods relative to traditional approaches, and underscores its potential as an emerging and indispensable approach in overcoming challenges and discovering new knowledge in water-quality sciences.
Collapse
Affiliation(s)
- Wei Zhi
- The National Key Laboratory of Water Disaster Prevention, Yangtze Institute for Conservation and Development, Key Laboratory of Hydrologic-Cycle and Hydrodynamic-System of Ministry of Water Resources, Hohai University, Nanjing, China
- Department of Civil and Environmental Engineering, The Pennsylvania State University, University Park, PA, USA
| | | | - Heather E Golden
- Office of Research and Development, US Environmental Protection Agency, Cincinnati, OH, USA
| | - Joel Podgorski
- Department of Water Resources and Drinking Water, Swiss Federal Institute of Aquatic Science and Technology (EAWAG), Dübendorf, Switzerland
| | - Li Li
- Department of Civil and Environmental Engineering, The Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
7
|
Singh RB, Patra KC, Pradhan B, Samantra A. HDTO-DeepAR: A novel hybrid approach to forecast surface water quality indicators. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 352:120091. [PMID: 38228048 DOI: 10.1016/j.jenvman.2024.120091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 12/23/2023] [Accepted: 01/08/2024] [Indexed: 01/18/2024]
Abstract
Water is a vital resource supporting a broad spectrum of ecosystems and human activities. The quality of river water has declined in recent years due to the discharge of hazardous materials and toxins. Deep learning and machine learning have gained significant attention for analysing time-series data. However, these methods often suffer from high complexity and significant forecasting errors, primarily due to non-linear datasets and hyperparameter settings. To address these challenges, we have developed an innovative HDTO-DeepAR approach for predicting water quality indicators. This proposed approach is compared with standalone algorithms, including DeepAR, BiLSTM, GRU and XGBoost, using performance metrics such as MAE, MSE, MAPE, and NSE. The NSE of the hybrid approach ranges between 0.8 to 0.96. Given the value's proximity to 1, the model appears to be efficient. The PICP values (ranging from 95% to 98%) indicate that the model is highly reliable in forecasting water quality indicators. Experimental results reveal a close resemblance between the model's predictions and actual values, providing valuable insights for predicting future trends. The comparative study shows that the suggested model surpasses all existing, well-known models.
Collapse
Affiliation(s)
- Rosysmita Bikram Singh
- Department of Civil Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India.
| | - Kanhu Charan Patra
- Department of Civil Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India.
| | - Biswajeet Pradhan
- Centre for Advanced Modelling and Geospatial Information System, School of Civil and Environmental Engineering, University of Technology Sydney, Sydney, Australia; Institute of Climate Change, Universiti Kebangsaan Malaysia, Bangi, Malaysia.
| | - Avinash Samantra
- Department of Computer Science & Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India.
| |
Collapse
|
8
|
Chen S, Huang J, Wang P, Tang X, Zhang Z. A coupled model to improve river water quality prediction towards addressing non-stationarity and data limitation. WATER RESEARCH 2024; 248:120895. [PMID: 38000228 DOI: 10.1016/j.watres.2023.120895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Revised: 10/24/2023] [Accepted: 11/17/2023] [Indexed: 11/26/2023]
Abstract
Accurate predictions of river water quality are vital for sustainable water management. However, even the powerful deep learning model, i.e., long short-term memory (LSTM), has difficulty in accurately predicting water quality dynamics owing to the high non-stationarity and data limitation in a changing environment. To wiggle out of quagmires, wavelet analysis (WA) and transfer learning (TL) techniques were introduced in this study to assist LSTM modeling, termed WA-LSTM-TL. Total phosphorus, total nitrogen, ammonia nitrogen, and permanganate index were predicted in a 4 h step within 49 water quality monitoring sites in a coastal province of China. We selected suitable source domains for each target domain using an innovatively proposed regionalization approach that included 20 attributes to improve the prediction efficiency of WA-LSTM-TL. The coupled WA-LSTM facilitated capturing non-stationary patterns of water quality dynamics and improved the performance by 53 % during testing phase compared to conventional LSTM. The WA-LSTM-TL, aided by the knowledge of source domain, obtained a 17 % higher performance compared to locally trained WA-LSTM, and such improvement was more impressive when local data was limited (+66 %). The benefit of TL-based modeling diminished as data quantity increased; however, it outperformed locally direct modeling regardless of whether target domain data was limited or sufficient. This study demonstrates the reasoning for coupling WA and TL techniques with LSTM models and provides a newly coupled modeling approach for improving short-term prediction of river water quality from the perspectives of non-stationarity and data limitation.
Collapse
Affiliation(s)
- Shengyue Chen
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, Xiamen 361102, China
| | - Jinliang Huang
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, Xiamen 361102, China.
| | - Peng Wang
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, Xiamen 361102, China
| | - Xi Tang
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, Xiamen 361102, China
| | - Zhenyu Zhang
- Fujian Key Laboratory of Coastal Pollution Prevention and Control, Xiamen University, Xiamen 361102, China; Department of Hydrology and Water Resources Management, Institute for Natural Resource Conservation, Kiel University, Kiel D-24118, Federal Republic of Germany
| |
Collapse
|