1
|
Ma Y, Qiao Y, Chen M, Rui D, Zhang X, Liu W, Ye L. How small is big enough? Big data-driven machine learning predictions for a full-scale wastewater treatment plant. WATER RESEARCH 2025; 274:123041. [PMID: 39740325 DOI: 10.1016/j.watres.2024.123041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2024] [Revised: 12/03/2024] [Accepted: 12/23/2024] [Indexed: 01/02/2025]
Abstract
Wastewater treatment plants (WWTPs) generate vast amounts of water quality, operational, and biological data. The potential of these big data, particularly through machine learning (ML), to improve WWTP management is increasingly recognized. However, the costs associated with data collection and processing can rise sharply as datasets grow larger, and research on determining the optimal data volume for effective ML application remains limited. In this study, we comprehensively analyzed water quality, operational, and biological data collected from a full-scale WWTP over 970 days. Our results demonstrate that ML models can predict not only operational and water quality parameters (concentrations of dissolved oxygen and effluent chemical oxygen demand) but also the abundances of functional bacteria. Notably, we discovered that increasing data volume does not always improve model performance, and that data collection intervals do not need to be excessively small, as moderate intervals can still yield reliable predictions. These findings suggest that excessively large datasets may not be necessary for effective ML predictions in WWTPs. Overall, this study underscores the importance of optimizing dataset size to balance computation efficiency and prediction accuracy, providing valuable insights into data management strategies that can enhance the operational efficiency and sustainability of WWTPs.
Collapse
Affiliation(s)
- Yanyan Ma
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Yiheng Qiao
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Mengxue Chen
- Nanjing Gaoke Environmental Technology Co., Ltd., Nanjing 210038, China
| | - Dongni Rui
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Xuxiang Zhang
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Weijing Liu
- Jiangsu Provincial Key Laboratory of Environment Engineering, Jiangsu Provincial Academy of Environmental Science, Nanjing 210036, China
| | - Lin Ye
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China.
| |
Collapse
|
2
|
Belachqer-El Attar S, Rodríguez-García D, Soriano-Molina P, García Sánchez JL, Casas López JL, Sánchez Pérez JA. Model-based scenario analysis to support the operation of solar photo-Fenton plants. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2025; 373:123886. [PMID: 39756291 DOI: 10.1016/j.jenvman.2024.123886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 11/28/2024] [Accepted: 12/24/2024] [Indexed: 01/07/2025]
Abstract
Model-based tools applied to wastewater management have been identified as an emerging solution to address the associated challenges related to the optimization of the technologies, meeting more restricted water quality standards. Thus, for the first time, the demonstration of the solar photo-Fenton process for microcontaminant removal in the operating environment of a model-based tool is reported. This tool aids in determining the right cost-effective seasonal strategy for a 37-m2 demonstration-scale photoreactor operating in a rural wastewater treatment plant. It was developed using a model tuned adequately with experimental data obtained at lab scale and then validated in the solar photo-Fenton demonstration plant, proving its reliability, and enveloping a robust operation. Imidacloprid removal was the treatment target, and reagent concentrations were 0.1 mM for ferric nitrilotriacetate and 0.73 mM for hydrogen peroxide. According to the model-based tool, to attain the maximum treatment capacity, the best operating conditions were a liquid depth of 20-cm, and hydraulic residence time of 45 and 60-min in summer and winter, respectively, augmenting the treatment cost by 25% (0.49 €∙m-3 vs. 0.65 €∙m-3). This model-based tool allows the control and optimization of the technology to be improved, while promoting its attractiveness in the market.
Collapse
Affiliation(s)
- S Belachqer-El Attar
- Solar Energy Research Centre (CIESOL), Joint Centre University of Almería-CIEMAT, Ctra. de Sacramento s/n, Almería, 04120, Spain; Chemical Engineering Department, University of Almería, Carretera de Sacramento s/n, Almería, 04120, Spain
| | - D Rodríguez-García
- Solar Energy Research Centre (CIESOL), Joint Centre University of Almería-CIEMAT, Ctra. de Sacramento s/n, Almería, 04120, Spain; Chemical Engineering Department, University of Almería, Carretera de Sacramento s/n, Almería, 04120, Spain
| | - P Soriano-Molina
- Solar Energy Research Centre (CIESOL), Joint Centre University of Almería-CIEMAT, Ctra. de Sacramento s/n, Almería, 04120, Spain; Chemical Engineering Department, University of Almería, Carretera de Sacramento s/n, Almería, 04120, Spain.
| | - J L García Sánchez
- Solar Energy Research Centre (CIESOL), Joint Centre University of Almería-CIEMAT, Ctra. de Sacramento s/n, Almería, 04120, Spain; Chemical Engineering Department, University of Almería, Carretera de Sacramento s/n, Almería, 04120, Spain
| | - J L Casas López
- Solar Energy Research Centre (CIESOL), Joint Centre University of Almería-CIEMAT, Ctra. de Sacramento s/n, Almería, 04120, Spain; Chemical Engineering Department, University of Almería, Carretera de Sacramento s/n, Almería, 04120, Spain.
| | - J A Sánchez Pérez
- Solar Energy Research Centre (CIESOL), Joint Centre University of Almería-CIEMAT, Ctra. de Sacramento s/n, Almería, 04120, Spain; Chemical Engineering Department, University of Almería, Carretera de Sacramento s/n, Almería, 04120, Spain
| |
Collapse
|
3
|
Nasir FB, Li J. Understanding machine learning predictions of wastewater treatment plant sludge with explainable artificial intelligence. WATER ENVIRONMENT RESEARCH : A RESEARCH PUBLICATION OF THE WATER ENVIRONMENT FEDERATION 2024; 96:e11136. [PMID: 39322560 DOI: 10.1002/wer.11136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 08/29/2024] [Accepted: 09/07/2024] [Indexed: 09/27/2024]
Abstract
This study investigates the use of machine learning (ML) models for wastewater treatment plant (WWTP) sludge predictions and explainable artificial intelligence (XAI) techniques for understanding the impact of variables behind the prediction. Three ML models, random forest (RF), gradient boosting machine (GBM), and gradient boosting tree (GBT), were evaluated for their performance using statistical indicators. Input variable combinations were selected through different feature selection (FS) methods. XAI techniques were employed to enhance the interpretability and transparency of ML models. The results suggest that prediction accuracy depends on the choice of model and the number of variables. XAI techniques were found to be effective in interpreting the decisions made by each ML model. This study provides an example of using ML models in sludge production prediction and interpreting models applying XAI to understand the factors influencing it. Understandable interpretation of ML model prediction can facilitate targeted interventions for process optimization and improve the efficiency and sustainability of wastewater treatment processes. PRACTITIONER POINTS: Explainable artificial intelligence can play a crucial role in promoting trust between machine learning models and their real-world applications. Widely practiced machine learning models were used to predict sludge production of a United States wastewater treatment plant. Feature selection methods can reduce the required number of input variables without compromising model accuracy. Explainable artificial intelligence techniques can explain driving variables behind machine learning prediction.
Collapse
Affiliation(s)
- Fuad Bin Nasir
- Department of Civil and Environ Engineering, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
| | - Jin Li
- Department of Civil and Environ Engineering, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
| |
Collapse
|
4
|
Kumari S, Chowdhry J, Kumar M, Garg MC. Machine learning (ML): An emerging tool to access the production and application of biochar in the treatment of contaminated water and wastewater. GROUNDWATER FOR SUSTAINABLE DEVELOPMENT 2024; 26:101243. [DOI: 10.1016/j.gsd.2024.101243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/20/2024]
|
5
|
Puri R, Emaminejad SA, Cusick RD. Mechanistic and data-driven modeling of carbon respiration with bio-electrochemical sensors. Curr Opin Biotechnol 2024; 88:103173. [PMID: 39033647 DOI: 10.1016/j.copbio.2024.103173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 06/27/2024] [Accepted: 06/28/2024] [Indexed: 07/23/2024]
Abstract
Bioelectrochemical sensor (BES) technologies have been developed to measure soluble carbon concentrations in wastewater. However, architectures and analytical methods developed in controlled laboratory environments fail to predict BES behavior during field deployments at water resource recovery facilities (WRRFs). Here, we examine the possibilities and obstacles associated with integrating BESs into environmental sensing networks and machine learning algorithms to monitor the biodegradable carbon dynamics and microbial metabolism at WRRFs. This approach highlights the potential of BESs to provide real-time insights into full-scale biodegradable carbon consumption across WRRFs.
Collapse
Affiliation(s)
- Rishabh Puri
- Department of Civil and Environmental Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, United States
| | - Seyed A Emaminejad
- Department of Civil and Environmental Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, United States; Black & Veatch, 180 N Wacker Dr Suite 550, Chicago, IL 60606, United States
| | - Roland D Cusick
- Department of Civil and Environmental Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, United States.
| |
Collapse
|
6
|
Çimen Mesutoğlu Ö, Gök O. Prediction of COD in industrial wastewater treatment plant using an artificial neural network. Sci Rep 2024; 14:13750. [PMID: 38877150 PMCID: PMC11178879 DOI: 10.1038/s41598-024-64634-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 06/11/2024] [Indexed: 06/16/2024] Open
Abstract
In this investigation, the modeling of the Aksaray industrial wastewater treatment plant was performed using artificial neural networks with various architectures in the MATLAB software. The dataset utilized in this study was collected from the Aksaray wastewater treatment plant over a 9-month period through daily records. The treatment efficiency of the plants was assessed based on the output values of chemical oxygen demand (COD) output. Principal component analysis (PCA) was applied to furnish input for the Feedforward Backpropagation Artificial Neural Networks (FFBANN). The model's performance was evaluated using the Mean Squared Error (MSE), the Mean Absolute Error (MAE) and correlation coefficient (R2) parameters. The optimal architecture for the neural network model was determined through several trial and error iterations. According to the modeling results, the ANN exhibited a high predictive capability for plant performance, with an R2 reaching up to 0.9997 when comparing the observed and predicted output variables.
Collapse
Affiliation(s)
| | - Oğuzhan Gök
- Environmental Engineering Department, Aksaray University, Aksaray, Turkey
| |
Collapse
|
7
|
Mathaba M, Banza J. A comprehensive review on artificial intelligence in water treatment for optimization. Clean water now and the future. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART A, TOXIC/HAZARDOUS SUBSTANCES & ENVIRONMENTAL ENGINEERING 2024; 58:1047-1060. [PMID: 38293764 DOI: 10.1080/10934529.2024.2309102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 01/13/2024] [Indexed: 02/01/2024]
Abstract
Given the severe effects that toxic compounds present in wastewater streams have on humans, it is imperative that water and wastewater streams pollution be addressed globally. This review comprehensively examines various water and wastewater treatment methods and water quality management methods based on artificial intelligence (AI). Machine learning (ML) and AI have become a powerful tool for addressing problems in the real world and has gained a lot of interest since it can be used for a wide range of activities. The foundation of ML techniques involves training of a network with collected data, followed by application of learned network to the process simulation and prediction. The creation of ML models for process simulations requires measured data. In order to forecast and simulate chemical and physical processes such chemical reactions, heat transfer, mass transfer, energy, pharmaceutics and separation, a variety of machine-learning algorithms have recently been developed. These models have shown to be more adept at simulating and modeling processes than traditional models. Although AI offers many advantages, a number of disadvantages have kept these methods from being extensively applied in actual water treatment systems. Lack of evidence of application in actual water treatment scenarios, poor repeatability and data availability and selection are a few of the main problems that need to be resolved.
Collapse
Affiliation(s)
- Machodi Mathaba
- Department of Chemical Engineering, Faculty of Engineering and the Built Environment, University of Johannesburg, Johannesburg, South Africa
| | - JeanClaude Banza
- Department of Chemical Engineering, Faculty of Engineering and the Built Environment, University of Johannesburg, Johannesburg, South Africa
| |
Collapse
|