1
|
Jun BM, Chae SH, Kim D, Jung JY, Kim TJ, Nam SN, Yoon Y, Park C, Rho H. Adsorption of uranyl ion on hexagonal boron nitride for remediation of real U-contaminated soil and its interpretation using random forest. JOURNAL OF HAZARDOUS MATERIALS 2024; 469:134072. [PMID: 38522201 DOI: 10.1016/j.jhazmat.2024.134072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/09/2024] [Accepted: 03/16/2024] [Indexed: 03/26/2024]
Abstract
Acid leaching has been widely applied to treat contaminated soil, however, it contains several inorganic pollutants. The decommissioning of nuclear power plants introduces radioactive and soluble U(VI), a substance posing chemical toxicity to humans. Our investigation sought to ascertain the efficacy of hexagonal boron nitride (h-BN), an highly efficient adsorbent, in treating U(VI) in wastewater. The adsorption equilibrium of U(VI) by h-BN reached saturation within a mere 2 h. The adsorption of U(VI) by h-BN appears to be facilitated through electrostatic attraction, as evidenced by the observed impact of pH variations, acidic agents (i.e., HCl or H2SO4), and the presence of background ions on the adsorption performance. A reusability test demonstrated the successful completion of five cycles of adsorption/desorption, relying on the surface characteristics of h-BN as influenced by solution pH. Based on the experimental variables of initial U(VI) concentration, exposure time, temperature, pH, and the presence of background ions/organic matter, a feature importance analysis using random forest (RF) was carried out to evaluate the correlation between performances and conditions. To the best of our knowledge, this study is the first attempt to conduct the adsorption of U(VI) generated from real contaminated soil by h-BN, followed by interpretation of the correlation between performance and conditions using RF. Lastly, a. plausible adsorption mechanism between U(VI) and h-BN was explained based on the experimental results, characterizations, and a. comparison with previous adsorption studies on the removal of heavy metals by h-BN.
Collapse
Affiliation(s)
- Byung-Moon Jun
- Radwaste Management Center, Korea Atomic Energy Research Institute (KAERI), 111 Daedeok-Daero 989beon-gil, Yuseong-Gu, Daejeon 34057, Republic of Korea
| | - Sung Ho Chae
- Center for Water Cycle Research, Korea Institute of Science and Technology (KIST), Seoul 02792, Republic of Korea
| | - Deokhwan Kim
- Department of Environment Research, Korea Institute of Civil Engineering and Building Technology (KICT), 283 Goyang-Daero, Ilsanseo-Gu, Goyang-si, Gyeonggi-do 10223, Republic of Korea; Department of Civil and Environment Engineering, University of Science and Technology (UST), 217 Gajeong-Ro, Yuseong-Gu, Daejeon 34113, Republic of Korea
| | - Jun-Young Jung
- Radwaste Management Center, Korea Atomic Energy Research Institute (KAERI), 111 Daedeok-Daero 989beon-gil, Yuseong-Gu, Daejeon 34057, Republic of Korea
| | - Tack-Jin Kim
- Radwaste Management Center, Korea Atomic Energy Research Institute (KAERI), 111 Daedeok-Daero 989beon-gil, Yuseong-Gu, Daejeon 34057, Republic of Korea
| | - Seong-Nam Nam
- Department of Chemical and Environmental Science, Korea Army Academy, Yeong-Cheon 495 Hoguk-ro, Gokyeong-myeon, Yeongcheon-si, Gyeongsangbuk-do, Republic of Korea
| | - Yeomin Yoon
- Department of Environmental Science and Engineering, Ewha Womans University, 52 Ewhayeodae-gil, Seodaemun-gu, Seoul 03760, Republic of Korea
| | - Chanhyuk Park
- Department of Environmental Science and Engineering, Ewha Womans University, 52 Ewhayeodae-gil, Seodaemun-gu, Seoul 03760, Republic of Korea
| | - Hojung Rho
- Department of Environment Research, Korea Institute of Civil Engineering and Building Technology (KICT), 283 Goyang-Daero, Ilsanseo-Gu, Goyang-si, Gyeonggi-do 10223, Republic of Korea; Department of Civil and Environment Engineering, University of Science and Technology (UST), 217 Gajeong-Ro, Yuseong-Gu, Daejeon 34113, Republic of Korea.
| |
Collapse
|
2
|
Lloyd SD, Carvajal G, Campey M, Taylor N, Osmond P, Roser DJ, Khan SJ. Predicting recreational water quality and public health safety in urban estuaries using Bayesian Networks. WATER RESEARCH 2024; 254:121319. [PMID: 38422692 DOI: 10.1016/j.watres.2024.121319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 02/05/2024] [Accepted: 02/14/2024] [Indexed: 03/02/2024]
Abstract
To support the reactivation of urban rivers and estuaries for bathing while ensuring public safety, it is critical to have access to real-time information on microbial water quality and associated health risks. Predictive modelling can provide this information, though challenges concerning the optimal size of training data, model transferability, and communication of uncertainty still need attention. Further, urban estuaries undergo distinctive hydrological variations requiring tailored modelling approaches. This study assessed the use of Bayesian Networks (BNs) for the prediction of enterococci exceedances and extrapolation of health risks at planned bathing sites in an urban estuary in Sydney, Australia. The transferability of network structures between sites was assessed. Models were validated using a novel application of the k-fold walk-forward validation procedure and further tested using independent compliance and event-based sampling datasets. Learning curves indicated the model's sensitivity reached a minimum performance threshold of 0.8 once training data included ≥ 400 observations. It was demonstrated that Semi-Naïve BN structures can be transferred while maintaining stable predictive performance. In all sites, salinity and solar exposure had the greatest influence on Posterior Probability Distributions (PPDs), when combined with antecedent rainfall. The BNs provided a novel and transparent framework to quantify and visualise enterococci, stormwater impact, health risks, and associated uncertainty under varying environmental conditions. This study has advanced the application of BNs in predicting recreational water quality and providing decision support in urban estuarine settings, proposed for bathing, where uncertainty is high.
Collapse
Affiliation(s)
- Simon D Lloyd
- School of Built Environment, University of New South Wales, NSW, Australia.
| | - Guido Carvajal
- Facultad de Ingeniería, Universidad Andrés Bello, Antonio Varas 880, Providencia, Santiago, Chile
| | - Meredith Campey
- Beachwatch, NSW Department of Planning and Environment, NSW, Australia
| | | | - Paul Osmond
- School of Built Environment, University of New South Wales, NSW, Australia
| | - David J Roser
- School of Civil and Environmental Engineering, University of New South Wales, NSW, Australia
| | - Stuart J Khan
- School of Civil Engineering, University of Sydney, NSW, Australia
| |
Collapse
|
3
|
Zeinolabedini Rezaabad M, Lacey H, Marshall L, Johnson F. Influence of resampling techniques on Bayesian network performance in predicting increased algal activity. WATER RESEARCH 2023; 244:120558. [PMID: 37666153 DOI: 10.1016/j.watres.2023.120558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 08/10/2023] [Accepted: 08/30/2023] [Indexed: 09/06/2023]
Abstract
Early warning of increased algal activity is important to mitigate potential impacts on aquatic life and human health. While many methods have been developed to predict increased algal activity, an ongoing issue is that severe algal blooms often occur with low frequency in water bodies. This results in imbalanced data sets available for model specification, leading to poor predictions of the frequency of increased algal activity. One approach to address this is to resample data sets of increased algal activity to increase the prevalence of higher than normal algal activity in calibration data and ultimately improve model predictions. This study aims to investigate the use of resampling techniques to address the imbalanced dataset and determine if such methods can improve the prediction of increased algal activity. Three techniques were investigated, Kmeans under-sampling (US_Kmeans), synthetic minority over-sampling technique (SMOTE), and 'SMOTE and cluster-based under-sampling technique' (SCUT). The resampling methods were applied to a Bayesian network (BN) model of Lake Burragorang in New South Wales, Australia. The model was developed to predict chlorophyll-a (chl-a) using a range of water quality parameters as predictors. The original data and each of the balanced datasets were used for BN structures and parameter learning. The results showed that the best graphical structure was obtained by adding synthetic data from SMOTE with the highest true positive rate (TPR) and area under the curve (AUC). When compared using a fixed graphical structure for the BN, all resampling techniques increased the ability of the BN to detect events with higher probability of increased algal activity. The resampling model results can also be used to better understand the most important influences on high chl-a concentrations and suggest future data collection and model development priorities.
Collapse
Affiliation(s)
- Maryam Zeinolabedini Rezaabad
- Water Research Centre, School of Civil and Environmental Engineering, University of New South Wales, Kensington, New South Wales, Australia; ARC Training Centre Data Analytics for Resources and Environments, School of Life and Environmental Sciences, The University of Sydney, Camperdown, New South Wales, Australia.
| | | | - Lucy Marshall
- Water Research Centre, School of Civil and Environmental Engineering, University of New South Wales, Kensington, New South Wales, Australia; ARC Training Centre Data Analytics for Resources and Environments, School of Life and Environmental Sciences, The University of Sydney, Camperdown, New South Wales, Australia; Faculty of Science and Engineering, Macquarie University, North Ryde, New South Wales, Australia
| | - Fiona Johnson
- Water Research Centre, School of Civil and Environmental Engineering, University of New South Wales, Kensington, New South Wales, Australia; ARC Training Centre Data Analytics for Resources and Environments, School of Life and Environmental Sciences, The University of Sydney, Camperdown, New South Wales, Australia
| |
Collapse
|
4
|
Fuentes-Cortés LF, Flores-Tlacuahuac A, Nigam KDP. Machine Learning Algorithms Used in PSE Environments: A Didactic Approach and Critical Perspective. Ind Eng Chem Res 2022. [DOI: 10.1021/acs.iecr.2c00335] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Luis Fabián Fuentes-Cortés
- Departamento de Ingeniería Química, Tecnologico Nacional de México - Instituto Tecnológico de Celaya, Celaya, Guanajuato 38010, Mexico
| | - Antonio Flores-Tlacuahuac
- Tecnologico de Monterrey, Escuela de Ingeniería y Ciencias Ave. Eugenio Garza Sada 2501, Monterrey, N.L. 64849, Mexico
| | - Krishna D. P. Nigam
- Tecnologico de Monterrey, Escuela de Ingeniería y Ciencias Ave. Eugenio Garza Sada 2501, Monterrey, N.L. 64849, Mexico
- Department of Chemical Engineering, Indian Institute of Technology Delhi 600036, India
| |
Collapse
|
5
|
Zhu M, Wang J, Yang X, Zhang Y, Zhang L, Ren H, Wu B, Ye L. A review of the application of machine learning in water quality evaluation. ECO-ENVIRONMENT & HEALTH (ONLINE) 2022; 1:107-116. [PMID: 38075524 PMCID: PMC10702893 DOI: 10.1016/j.eehl.2022.06.001] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 05/19/2022] [Accepted: 06/01/2022] [Indexed: 12/31/2023]
Abstract
With the rapid increase in the volume of data on the aquatic environment, machine learning has become an important tool for data analysis, classification, and prediction. Unlike traditional models used in water-related research, data-driven models based on machine learning can efficiently solve more complex nonlinear problems. In water environment research, models and conclusions derived from machine learning have been applied to the construction, monitoring, simulation, evaluation, and optimization of various water treatment and management systems. Additionally, machine learning can provide solutions for water pollution control, water quality improvement, and watershed ecosystem security management. In this review, we describe the cases in which machine learning algorithms have been applied to evaluate the water quality in different water environments, such as surface water, groundwater, drinking water, sewage, and seawater. Furthermore, we propose possible future applications of machine learning approaches to water environments.
Collapse
Affiliation(s)
- Mengyuan Zhu
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Jiawei Wang
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Xiao Yang
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Yu Zhang
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Linyu Zhang
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Hongqiang Ren
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Bing Wu
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Lin Ye
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| |
Collapse
|
6
|
De Giglio O, Napoli C, Diella G, Fasano F, Lopuzzo M, Apollonio F, D'Ambrosio M, Campanale C, Triggiano F, Caggiano G, Montagna MT. Integrated approach for legionellosis risk analysis in touristic-recreational facilities. ENVIRONMENTAL RESEARCH 2021; 202:111649. [PMID: 34252427 DOI: 10.1016/j.envres.2021.111649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 06/30/2021] [Accepted: 07/02/2021] [Indexed: 06/13/2023]
Abstract
Legionellosis is a severe pneumonia caused by the inhalation of aerosols containing Legionella, Gram-negative bacteria present in the water systems of touristic-recreational facilities. The purpose of this study was to develop a scoring tool to predict the risk of both environmental contamination and Legionnaires' disease cases in such facilities in the Apulia region of southern Italy. We analyzed 47 structural and management parameters/risk factors related to the buildings, water systems, and air conditioning at the facilities. A Poisson regression model was used to compute an overall risk score for each facility with respect to three outcomes: water samples positive for Legionella (risk score range: 7-54), water samples positive for Legionella with an average load exceeding 1000 colony-forming units per liter (CFU/L) (risk score range: 22-179,871), and clinical cases of Legionnaire's disease (risk score range: 6-31). The cut-off values for three outcomes were determined by receiver operating characteristic curves (first outcome, samples positive for Legionella in a touristic-recreational facility: 19; second outcome, samples positive for Legionella in a touristic-recreational facility with an average load exceeding 1000 CFU/L: 2062; third outcome, clinical cases of Legionnaire's disease in a touristic-recreational facility: 22). Above these values, there was a significant probability of observing the outcome. We constructed this predictive model using 70% of a large dataset (18 years of clinical and environmental surveillance) and tested the model on the remaining 30% of the dataset to demonstrate its reliability. Our model enables the assessment of risk for a touristic facility and the creation of a conceptual framework to link the risk analysis with prevention measures.
Collapse
Affiliation(s)
- Osvalda De Giglio
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| | - Christian Napoli
- Department of Medical Surgical Sciences and Translational Medicine, "Sapienza" University of Rome, Via di Grottarossa 1035/1039, 00189, Rome, Italy.
| | - Giusy Diella
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| | - Fabrizio Fasano
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| | - Marco Lopuzzo
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| | - Francesca Apollonio
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| | - Marilena D'Ambrosio
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| | - Carmen Campanale
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| | - Francesco Triggiano
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| | - Giuseppina Caggiano
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| | - Maria Teresa Montagna
- Regional Reference Laboratory of Clinical and Environmental Surveillance of Legionellosis, Department of Biomedical Science and Human Oncology, University of Bari Aldo Moro, Piazza G. Cesare 11, 70124, Bari, Italy.
| |
Collapse
|
7
|
Graphical Trajectory Comparison to Identify Errors in Data of COVID-19: A Cross-Country Analysis. J Pers Med 2021; 11:jpm11100955. [PMID: 34683095 PMCID: PMC8537769 DOI: 10.3390/jpm11100955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 09/20/2021] [Accepted: 09/23/2021] [Indexed: 11/17/2022] Open
Abstract
Data from the early stage of a novel infectious disease outbreak provide vital information in risk assessment, prediction, and precise disease management. Since the first reported case of COVID-19, the pattern of the novel coronavirus transmission in Wuhan has become the interest of researchers in epidemiology and public health. To thoroughly map the mechanism of viral spreading, we used the patterns of data at the early onset of COVID-19 from seven countries to estimate the time lag between peak days of cases and deaths. This study compared these data with those of Wuhan and estimated the natural history of disease across the infected population and the time lag. The findings suggest that comparative analyses of data from different regions and countries reveal the differences between peaks of cases and deaths caused by COVID-19 and the incomplete and underestimated cases in Wuhan. Different countries may show different patterns of cases peak days, deaths peak days, and peak periods. Error in the early COVID-19 statistics in Brazil was identified. This study provides sound evidence for policymakers to understand the local circumstances in diagnosing the health of a population and propose precise and timely public health interventions to control and prevent infectious diseases.
Collapse
|
8
|
Yunana D, Maclaine S, Tng KH, Zappia L, Bradley I, Roser D, Leslie G, MacIntyre CR, Le-Clech P. Developing Bayesian networks in managing the risk of Legionella colonisation of groundwater aeration systems. WATER RESEARCH 2021; 193:116854. [PMID: 33550171 DOI: 10.1016/j.watres.2021.116854] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 12/23/2020] [Accepted: 01/17/2021] [Indexed: 06/12/2023]
Abstract
An Australian water utility has developed a Legionella High Level Risk Assessment (LHLRA) which provides a semi-qualitative assessment of the risk of Legionella proliferation and human exposure in engineered water systems using a combination of empirical observation and expert knowledge. Expanding on this LHLRA, we propose two iterative Bayesian network (BN) models to reduce uncertainty and allow for a probabilistic representation of the mechanistic interaction of the variables, built using data from 25 groundwater treatment plants. The risk of Legionella exposure in groundwater aeration units was quantified as a function of five critical areas including hydraulic conditions, nutrient availability and growth, water quality, system design (and maintenance), and location and access. First, the mechanistic relationship of the variables was conceptually mapped into a fishbone diagram, parameterised deterministically using an expert elicited weighted scoring system and translated into BN. The "sensitivity to findings" analysis of the BN indicated that system design was the most influential variable while elemental accumulation thresholds were the least influential variable for Legionella exposure. The diagnostic inference was used in high and low-risk scenarios to demonstrate the capabilities of the BNs to examine probable causes for diverse conditions. Subsequently, the causal relationship of Legionella growth and human exposure were improved through a conceptual bowtie representation. Finally, an improved model developed the predictors of Legionella growth and the risk of human exposure through the interaction of operational, water quality monitoring, operational parameters, and asset conditions. The use of BNs modelling based on risk estimation and improved functional decision outputs offer a complementary and more transparent alternative approach to quantitative analysis of uncertainties than the current LHLRA.
Collapse
Affiliation(s)
- Danladi Yunana
- UNESCO Centre for Membrane Science and Technology, School of Chemical Engineering, University of New South Wales (UNSW), Sydney, NSW2052, Australia
| | - Stuart Maclaine
- UNESCO Centre for Membrane Science and Technology, School of Chemical Engineering, University of New South Wales (UNSW), Sydney, NSW2052, Australia
| | - Keng Han Tng
- UNESCO Centre for Membrane Science and Technology, School of Chemical Engineering, University of New South Wales (UNSW), Sydney, NSW2052, Australia
| | - Luke Zappia
- Water Corporation of Western Australia, WCWA, Leederville, WA6007, Australia
| | - Ian Bradley
- Water Corporation of Western Australia, WCWA, Leederville, WA6007, Australia
| | - David Roser
- Water Research Centre (WRC), Civil and Environmental Engineering, UNSW, Sydney, Australia
| | - Greg Leslie
- UNESCO Centre for Membrane Science and Technology, School of Chemical Engineering, University of New South Wales (UNSW), Sydney, NSW2052, Australia
| | - C Raina MacIntyre
- The Biosecurity Program, The Kirby Institute, UNSW Medicine, UNSW, Sydney, Australia
| | - Pierre Le-Clech
- UNESCO Centre for Membrane Science and Technology, School of Chemical Engineering, University of New South Wales (UNSW), Sydney, NSW2052, Australia.
| |
Collapse
|
9
|
Li RA, McDonald JA, Sathasivan A, Khan SJ. A multivariate Bayesian network analysis of water quality factors influencing trihalomethanes formation in drinking water distribution systems. WATER RESEARCH 2021; 190:116712. [PMID: 33310438 DOI: 10.1016/j.watres.2020.116712] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 11/10/2020] [Accepted: 11/29/2020] [Indexed: 06/12/2023]
Abstract
Controlling disinfection by-products formation while ensuring effective drinking water disinfection is important for protecting public health. However, understanding and predicting disinfection by-product formation under a variety of conditions in drinking water distribution systems remains challenging as disinfection by-product formation is a multifactorial phenomenon. This study aimed to assess the application of Bayesian Network models to predict the concentration of trihalomethanes, the dominant halogenated disinfection by-product class, using various water quality parameters. Naïve Bayesian and semi-naïve Bayesian models were constructed from Sydney and South East Queensland datasets across 15 drinking water distribution systems in Australia. The targeted variable, total trihalomethanes concentration, was discretised into 3 bins (<0.1 mg L-1, 0.1 - 0.2 mg L-1 and >0.2 mg L-1). The Bayesian network structures were built using water quality parameters including concentrations of individual and total trihalomethanes, disinfectant species (free chlorine, monochloramine, dichloramine, total chlorine), nitrogen species (free ammonia, total ammonia, nitrate, nitrite), and other physical/chemical parameters (temperature, pH, dissolved organic carbon, total dissolved solids, conductivity and turbidity). Seven performance parameters, including predictive accuracy and the rates of true and false positive and negative results, were used to assess the accuracy and precision of the Bayesian network models. After evaluating the model performance, the optimum models were selected to be Bayesian network augmented naïve models. These were observed to have the highest predictive accuracies for Sydney (78%) and South East Queensland (94%). Although disinfectant residuals are among the key variables that lead to trihalomethanes formation, potential concentrations of trihalomethanes in distribution systems can be more confidently predicted, in terms of probability associated with a wider range of water quality variables, using Bayesian networks. The modelling procedure developed in this work can now be applied to develop system-specific Bayesian network models for trihalomethanes prediction in other drinking water distribution systems.
Collapse
Affiliation(s)
- Rebecca A Li
- UNSW Water Research Centre, School of Civil & Environmental Engineering, University of New South Wales, NSW, 2052, Australia.
| | - James A McDonald
- UNSW Water Research Centre, School of Civil & Environmental Engineering, University of New South Wales, NSW, 2052, Australia.
| | - Arumugam Sathasivan
- School of Computing Engineering and Mathematics, University of Western Sydney, Kingswood, NSW, 2747, Australia.
| | - Stuart J Khan
- UNSW Water Research Centre, School of Civil & Environmental Engineering, University of New South Wales, NSW, 2052, Australia.
| |
Collapse
|
10
|
Wickramasinghe I, Kalutarage H. Naive Bayes: applications, variations and vulnerabilities: a review of literature with code snippets for implementation. Soft comput 2020. [DOI: 10.1007/s00500-020-05297-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
11
|
Figueiredo GVC, Fantin LH, Canteri MG, Ferreira da Rocha JC, Filho DDSJ. A Bayesian Probability Model Can Simulate the Knowledge of Soybean Rust Researchers to Optimize the Application of Fungicides. INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS 2019. [DOI: 10.4018/ijaeis.2019100103] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Asian rust is the main soybean disease in Brazil, causing up to 80% of yield reduction. The use of fungicides is the main form of control; however, due to farmer's concern with outbreaks many unnecessary applications are performed. The present study aims to verify the usefulness of a probability model to estimate the timing and the number of fungicides sprays required to control Asian soybean rust, using Bayesian networks and knowledge engineering. The model was developed through interviews with rust researchers and a literature review. The Bayesian network was constructed with the GeNIe 2.0 software. The validation process was performed by 42 farmers and 10 rust researchers, using 28 test cases. Among the 28 tested cases, generated by the system, the agreement with the model was 47.5% for the farmers and 89.3% for the rust researchers. In general, the farmers overestimate the number. The results showed that the Bayesian network has accurately represented the knowledge of the expert, and also could help the farmers to avoid the unnecessary applications.
Collapse
|
12
|
Bertone E, Purandare J, Durand B. Spatiotemporal prediction of Escherichia coli and Enterococci for the Commonwealth Games triathlon event using Bayesian Networks. MARINE POLLUTION BULLETIN 2019; 146:11-21. [PMID: 31426138 DOI: 10.1016/j.marpolbul.2019.05.066] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 05/28/2019] [Accepted: 05/29/2019] [Indexed: 06/10/2023]
Abstract
A number of Bayesian Networks were developed in order to nowcast and forecast, up to 4 days ahead and in different locations, the likelihood of water quality within the 2018 Commonwealth Games Triathlon swim course exceeding the critical limits for Enterococci and Escherichia coli. The models are data-driven, but the identification of potential inputs and optimal model structure was performed through the parallel contribution of several stakeholders and experts, consulted through workshops. The models, whose main nodes were discretised with a customised discretisation algorithm, were validated over a test set of data and deployed in real-time during the Commonwealth Games in support to a traditional water quality monitoring program. The proposed modelling framework proved to be cost-effective and less time-consuming than process-based models while still achieving high accuracy; in addition, the added value of a continuous stakeholder engagement guarantees a shared understanding of the model outputs and its future deployment.
Collapse
Affiliation(s)
- E Bertone
- School of Engineering and Built Environment, Griffith University, Gold Coast Campus, QLD 4222, Australia; Cities Research Institute, Griffith University, Gold Coast Campus, QLD 4222, Australia.
| | - J Purandare
- Cities Research Institute, Griffith University, Gold Coast Campus, QLD 4222, Australia; Gold Coast Water and Waste, City of Gold Coast, QLD 4211, Australia
| | - B Durand
- Gold Coast Water and Waste, City of Gold Coast, QLD 4211, Australia
| |
Collapse
|
13
|
Shan K, Shang M, Zhou B, Li L, Wang X, Yang H, Song L. Application of Bayesian network including Microcystis morphospecies for microcystin risk assessment in three cyanobacterial bloom-plagued lakes, China. HARMFUL ALGAE 2019; 83:14-24. [PMID: 31097252 DOI: 10.1016/j.hal.2019.01.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2018] [Revised: 12/12/2018] [Accepted: 01/09/2019] [Indexed: 05/23/2023]
Abstract
Microcystis spp., which occur as colonies of different sizes under natural conditions, have expanded in temperate and tropical freshwater ecosystems and caused seriously environmental and ecological problems. In the current study, a Bayesian network (BN) framework was developed to access the probability of microcystins (MCs) risk in large shallow eutrophic lakes in China, namely, Taihu Lake, Chaohu Lake, and Dianchi Lake. By means of a knowledge-supported way, physicochemical factors, Microcystis morphospecies, and MCs were integrated into different network structures. The sensitive analysis illustrated that Microcystis aeruginosa biomass was overall the best predictor of MCs risk, and its high biomass relied on the combined condition that water temperature exceeded 24 °C and total phosphorus was above 0.2 mg/L. Simulated scenarios suggested that the probability of hazardous MCs (≥1.0 μg/L) was higher under interactive effect of temperature increase and nutrients (nitrogen and phosphorus) imbalance than that of warming alone. Likewise, data-driven model development using a naïve Bayes classifier and equal frequency discretization resulted in a substantial technical performance (CCI = 0.83, K = 0.60), but the performance significantly decreased when model excluded species-specific biomasses from input variables (CCI = 0.76, K = 0.40). The BN framework provided a useful screening tool to evaluate cyanotoxin in three studied lakes in China, and it can also be used in other lakes suffering from cyanobacterial blooms dominated by Microcystis.
Collapse
Affiliation(s)
- Kun Shan
- Big Data Mining and Applications Center, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China; CAS Key Lab on Reservoir Environment, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China.
| | - Mingsheng Shang
- Big Data Mining and Applications Center, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China; CAS Key Lab on Reservoir Environment, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
| | - Botian Zhou
- Big Data Mining and Applications Center, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China; CAS Key Lab on Reservoir Environment, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China
| | - Lin Li
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Xiaoxiao Wang
- CAS Key Lab on Reservoir Environment, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Hong Yang
- Department of Geography and Environmental Science, University of Reading, Whiteknights, Reading, RG6 6AB, UK
| | - Lirong Song
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
14
|
Evaluating the influential priority of the factors on insurance loss of public transit. PLoS One 2018; 13:e0190103. [PMID: 29298337 PMCID: PMC5752032 DOI: 10.1371/journal.pone.0190103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Accepted: 12/10/2017] [Indexed: 12/04/2022] Open
Abstract
Understanding correlation between influential factors and insurance losses is beneficial for insurers to accurately price and modify the bonus-malus system. Although there have been a certain number of achievements in insurance losses and claims modeling, limited efforts focus on exploring the relative role of accidents characteristics in insurance losses. The primary objective of this study is to evaluate the influential priority of transit accidents attributes, such as the time, location and type of accidents. Based on the dataset from Washington State Transit Insurance Pool (WSTIP) in USA, we implement several key algorithms to achieve the objectives. First, K-means algorithm contributes to cluster the insurance loss data into 6 intervals; second, Grey Relational Analysis (GCA) model is applied to calculate grey relational grades of the influential factors in each interval; in addition, we implement Naive Bayes model to compute the posterior probability of factors values falling in each interval. The results show that the time, location and type of accidents significantly influence the insurance loss in the first five intervals, but their grey relational grades show no significantly difference. In the last interval which represents the highest insurance loss, the grey relational grade of the time is significant higher than that of the location and type of accidents. For each value of the time and location, the insurance loss most likely falls in the first and second intervals which refers to the lower loss. However, for accidents between buses and non-motorized road users, the probability of insurance loss falling in the interval 6 tends to be highest.
Collapse
|
15
|
Carvajal G, Roser DJ, Sisson SA, Keegan A, Khan SJ. Bayesian belief network modelling of chlorine disinfection for human pathogenic viruses in municipal wastewater. WATER RESEARCH 2017; 109:144-154. [PMID: 27883919 DOI: 10.1016/j.watres.2016.11.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Revised: 10/07/2016] [Accepted: 11/02/2016] [Indexed: 05/24/2023]
Abstract
Chlorine disinfection of biologically treated wastewater is practiced in many locations prior to environmental discharge or beneficial reuse. The effectiveness of chlorine disinfection processes may be influenced by several factors, such as pH, temperature, ionic strength, organic carbon concentration, and suspended solids. We investigated the use of Bayesian multilayer perceptron (BMLP) models as efficient and practical tools for compiling and analysing free chlorine and monochloramine virus disinfection performance as a multivariate problem. Corresponding to their relative susceptibility, Adenovirus 2 was used to assess disinfection by monochloramine and Coxsackievirus B5 was used for free chlorine. A BMLP model was constructed to relate key disinfection conditions (CT, pH, turbidity) to observed Log Reduction Values (LRVs) for these viruses at constant temperature. The models proved to be valuable for incorporating uncertainty in the chlor(am)ination performance estimation and interpolating between operating conditions. Various types of queries could be performed with this model including the identification of target CT for a particular combination of LRV, pH and turbidity. Similarly, it was possible to derive achievable LRVs for combinations of CT, pH and turbidity. These queries yielded probability density functions for the target variable reflecting the uncertainty in the model parameters and variability of the input variables. The disinfection efficacy was greatly impacted by pH and to a lesser extent by turbidity for both types of disinfections. Non-linear relationships were observed between pH and target CT, and turbidity and target CT, with compound effects on target CT also evidenced. This work demonstrated that the use of BMLP models had considerable ability to improve the resolution and understanding of the multivariate relationships between operational parameters and disinfection outcomes for wastewater treatment.
Collapse
Affiliation(s)
- Guido Carvajal
- UNSW Water Research Centre, School of Civil & Environmental Engineering, University of New South Wales, NSW 2052, Australia.
| | - David J Roser
- UNSW Water Research Centre, School of Civil & Environmental Engineering, University of New South Wales, NSW 2052, Australia.
| | - Scott A Sisson
- School of Mathematics & Statistics, University of New South Wales, NSW 2052, Australia.
| | - Alexandra Keegan
- Research and Innovation Services, SA Water Corporation, Adelaide, SA 5000, Australia.
| | - Stuart J Khan
- UNSW Water Research Centre, School of Civil & Environmental Engineering, University of New South Wales, NSW 2052, Australia.
| |
Collapse
|