1
|
Islam ARMT, Al Awadh M, Mallick J, Pal SC, Chakraborty R, Fattah MA, Ghose B, Kakoli MKA, Islam MA, Naqvi HR, Bilal M, Elbeltagi A. Estimating ground-level PM 2.5 using subset regression model and machine learning algorithms in Asian megacity, Dhaka, Bangladesh. AIR QUALITY, ATMOSPHERE, & HEALTH 2023; 16:1117-1139. [PMID: 37303964 PMCID: PMC9961308 DOI: 10.1007/s11869-023-01329-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Accepted: 02/16/2023] [Indexed: 06/13/2023]
Abstract
Fine particulate matter (PM2.5) has become a prominent pollutant due to rapid economic development, urbanization, industrialization, and transport activities, which has serious adverse effects on human health and the environment. Many studies have employed traditional statistical models and remote-sensing technologies to estimate PM2.5 concentrations. However, statistical models have shown inconsistency in PM2.5 concentration predictions, while machine learning algorithms have excellent predictive capacity, but little research has been done on the complementary advantages of diverse approaches. The present study proposed the best subset regression model and machine learning approaches, including random tree, additive regression, reduced error pruning tree, and random subspace, to estimate the ground-level PM2.5 concentrations over Dhaka. This study used advanced machine learning algorithms to measure the effects of meteorological factors and air pollutants (NOX, SO2, CO, and O3) on the dynamics of PM2.5 in Dhaka from 2012 to 2020. Results showed that the best subset regression model was well-performed for forecasting PM2.5 concentrations for all sites based on the integration of precipitation, relative humidity, temperature, wind speed, SO2, NOX, and O3. Precipitation, relative humidity, and temperature have negative correlations with PM2.5. The concentration levels of pollutants are much higher at the beginning and end of the year. Random subspace is the optimal model for estimating PM2.5 because it has the least statistical error metrics compared to other models. This study suggests ensemble learning models to estimate PM2.5 concentrations. This study will help quantify ground-level PM2.5 concentration exposure and recommend regional government actions to prevent and regulate PM2.5 air pollution. Supplementary Information The online version contains supplementary material available at 10.1007/s11869-023-01329-w.
Collapse
Affiliation(s)
| | - Mohammed Al Awadh
- Department of Industrial Engineering, College of Engineering, King Khalid University, Abha, 61421 Saudi Arabia
| | - Javed Mallick
- Department of Civil Engineering, King Khalid University, Abha, Saudi Arabia
| | - Subodh Chandra Pal
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal 713104 India
| | - Rabin Chakraborty
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal 713104 India
| | - Md. Abdul Fattah
- Department of Urban and Regional Planning, Khulna University of Engineering and Technology, Khulna, Bangladesh
| | - Bonosri Ghose
- Department of Disaster Management, Begum Rokeya University, Rangpur, Rangpur, 5400 Bangladesh
| | | | - Md. Aminul Islam
- Department of Disaster Management, Begum Rokeya University, Rangpur, Rangpur, 5400 Bangladesh
| | - Hasan Raja Naqvi
- Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia (A Central University), New Delhi, 110025 India
| | - Muhammad Bilal
- School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo, 45003 China
| | - Ahmed Elbeltagi
- Agricultural Engineering Dept., Faculty of Agriculture, Mansoura University, Mansoura, 35516 Egypt
| |
Collapse
|
2
|
Ibrahim A, Ismail A, Juahir H, Iliyasu AB, Wailare BT, Mukhtar M, Aminu H. Water quality modelling using principal component analysis and artificial neural network. MARINE POLLUTION BULLETIN 2023; 187:114493. [PMID: 36566515 DOI: 10.1016/j.marpolbul.2022.114493] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 12/10/2022] [Accepted: 12/12/2022] [Indexed: 06/17/2023]
Abstract
The study investigates the latent pollution sources and most significant parameters that cause spatial variation and develops the best input for water quality modelling using principal component analysis (PCA) and artificial neural network (ANN). The dataset, 22 water quality parameters were obtained from Department of Environment Malaysia (DOE). The PCA generated six significant principal component scores (PCs) which explained 65.40 % of the total variance. Parameters for water quality variation are mainlyrelated to mineral components, anthropogenic activities, and natural processes. However, in ANN three input combination models (ANN A, B, and C) were developed to identify the best model that can predict water quality index (WQI) with very high precision. ANN A model appears to have the best prediction capacity with a coefficient of determination (R2) = 0.9999 and root mean square error (RMSE) = 0.0537. These results proved that the PCA and ANN methods can be applied as tools for decision-making and problem-solving for better managing of river quality.
Collapse
Affiliation(s)
- Aminu Ibrahim
- East Coast Environmental Research Institute Universiti Sultan Zainal Abidin Gong Badak, 21300 Terengganu, Malaysia; Department of Forestry Technology, Audu Bako College of Agriculture Dambatta, P.M.B 3159 Kano State, Nigeria.
| | - Azimah Ismail
- East Coast Environmental Research Institute Universiti Sultan Zainal Abidin Gong Badak, 21300 Terengganu, Malaysia
| | - Hafizan Juahir
- East Coast Environmental Research Institute Universiti Sultan Zainal Abidin Gong Badak, 21300 Terengganu, Malaysia
| | - Aisha B Iliyasu
- Department of Forestry Technology, Audu Bako College of Agriculture Dambatta, P.M.B 3159 Kano State, Nigeria
| | - Balarabe T Wailare
- Department of Remedial and General Studies, Audu Bako College of Agriculture Dambatta, P.M.B 3159 Kano State, Nigeria
| | - Mustapha Mukhtar
- Department of Remedial and General Studies, Audu Bako College of Agriculture Dambatta, P.M.B 3159 Kano State, Nigeria
| | - Hassan Aminu
- Department of Remedial and General Studies, Audu Bako College of Agriculture Dambatta, P.M.B 3159 Kano State, Nigeria
| |
Collapse
|
3
|
Ahmed AAM, Jui SJJ, Chowdhury MAI, Ahmed O, Sutradha A. The development of dissolved oxygen forecast model using hybrid machine learning algorithm with hydro-meteorological variables. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:7851-7873. [PMID: 36045185 PMCID: PMC9894995 DOI: 10.1007/s11356-022-22601-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 08/15/2022] [Indexed: 06/15/2023]
Abstract
Dissolved oxygen (DO) forecasting is essential for aquatic managers responsible for maintaining ecosystem health and the management of water bodies affected by water quality parameters. This paper aims to forecast dissolved oxygen (DO) concentration using a multivariate adaptive regression spline (MARS) hybrid model coupled with maximum overlap discrete wavelet transformation (MODWT) as a feature decomposition approach for Surma River water using a set of water quality hydro-meteorological variables. The proposed hybrid model is compared with numerous machine learning methods, namely Bayesian ridge regression (BNR), k-nearest neighbourhood (KNN), kernel ridge regression (KRR), random forest (RF), and support vector regression (SVR). The investigational results show that the proposed model of MODWT-MARS has a better prediction than the comparing benchmark models and individual standalone counter parts. The result shows that the hybrid algorithms (i.e. MODWT-MARS) outperformed the other models (r = 0.981, WI = 0.990, RMAE = 2.47%, and MAE = 0.089). This hybrid method may serve to forecast water quality variables with fewer predictor variables.
Collapse
Affiliation(s)
- Abul Abrar Masrur Ahmed
- Department of Infrastructure Engineering, University of Melbourne, Parkville, VIC 3010 Australia
- School of Mathematics Physics and Computing, University of Southern Queensland, Springfield, QLD 4300 Australia
| | - S. Janifer Jabin Jui
- School of Mathematics Physics and Computing, University of Southern Queensland, Springfield, QLD 4300 Australia
| | | | - Oli Ahmed
- School of Modern Sciences, Leading University, Sylhet, 3112 Bangladesh
| | - Ambica Sutradha
- School of Modern Sciences, Leading University, Sylhet, 3112 Bangladesh
| |
Collapse
|
4
|
Xiong Q, Wang W, Wang M, Zhang C, Zhang X, Chen C, Wang M. Prediction of ground-level ozone by SOM-NARX hybrid neural network based on the correlation of predictors. iScience 2022; 25:105658. [PMID: 36505938 PMCID: PMC9732375 DOI: 10.1016/j.isci.2022.105658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 10/22/2022] [Accepted: 11/21/2022] [Indexed: 11/27/2022] Open
Abstract
Current approaches to ozone prediction using hybrid neural networks are numerous but not perfect. Decomposition algorithms ignore the correlation between predictors and ozone, and feature extraction methods rarely select appropriate predictors in terms of correlation, especially for VOCs. Therefore, this study proposes a hybrid neural network model SOM-NARX based on the correlation of predictors. The model is based on MIC to filter predictors, using SOM to make predictors as feature sequences and using NARX networks to make predictions. Data from the JCDZURI site were used for training, testing, and validation. The results show that the correlation of the predictors, classification numbers of SOM, neuron numbers, and delay steps can affect prediction accuracy. Model comparison shows that the SOM-NARX model has 13.82, 10.60, 6.58% and 12.05, 9.44, 68.14% RMSE, MAE, and MAEP in winter and summer, which is smaller than CNN-LSTM, CNN-BiLSTM, CNN-GRU, SOM-LSTM, SOM-BiLSTM, and SOM-GRU.
Collapse
Affiliation(s)
- Qinqing Xiong
- College of Resource and Environment, Henan Polytechnic University, Jiaozuo 454003, China
| | - Wenju Wang
- College of Resource and Environment, Henan Polytechnic University, Jiaozuo 454003, China
| | - Mingya Wang
- College of Resource and Environment, Henan Polytechnic University, Jiaozuo 454003, China
| | - Chunhui Zhang
- College of Resource and Environment, Henan Polytechnic University, Jiaozuo 454003, China
| | - Xuechun Zhang
- College of Resource and Environment, Henan Polytechnic University, Jiaozuo 454003, China
| | - Chun Chen
- Henan Key Laboratory for Environmental Monitoring Technology, Zhengzhou 450004, China
| | - Mingshi Wang
- College of Resource and Environment, Henan Polytechnic University, Jiaozuo 454003, China,Corresponding author
| |
Collapse
|
5
|
Yang R, Yin L, Hao X, Liu L, Wang C, Li X, Liu Q. Identifying a suitable model for predicting hourly pollutant concentrations by using low-cost microstation data and machine learning. Sci Rep 2022; 12:19949. [PMID: 36402807 PMCID: PMC9675857 DOI: 10.1038/s41598-022-24470-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 11/15/2022] [Indexed: 11/21/2022] Open
Abstract
Accurately predicting the concentration of PM2.5 (fine particles with a diameter of 2.5 μm or less) is essential for health risk assessment and formulation of air pollution control strategies. At present, there is also a large amount of air pollution data. How to efficiently mine its hidden features to obtain the future concentration of pollutants is very important for the prevention and control of air pollution. Therefore we build a pollutant prediction model based on Lightweight Gradient Boosting Model (LightGBM) shallow machine learning and Long Short-Term Memory (LSTM) neural network. Firstly, the PM2.5 pollutant concentration data of 34 air quality stations in Beijing and the data of 18 weather stations were matched in time and space to obtain an input data set. Subsequently, the input data set was cleaned and preprocessed, and the training set was obtained by methods such as input feature extraction, input factor normalization, and data outlier processing. The hourly PM2.5 concentration value prediction was achieved in accordance with experiments conducted with the hourly PM2.5 data of Beijing from January 1, 2018 to October 1, 2020. Ultimately, the optimal hourly series prediction results were obtained after model comparisons. Through the comparison of these two models, it is found that the RMSE predicted by LSTM model for each pollutant is nearly 50% lower than that of LightGBM, and is more consistent with the fitting curve between the actual observations. The exploration of the input step size of LSTM model found that the accuracy of 3-h input data was higher than that of 12-h input data. It can be used for the management and decision-making of environmental protection departments and the formulation of preventive measures for emergency pollution incidents.
Collapse
Affiliation(s)
- Rongjin Yang
- grid.418569.70000 0001 2166 1076Chinese Research Academy of Environmental Sciences, Beijing, China
| | - Lizeyan Yin
- grid.494717.80000000115480420Higher Institute of Computer Modeling and Their Applications, Clermont Auvergne University, Clermont-Ferrand, France
| | - Xuejie Hao
- grid.20513.350000 0004 1789 9964State Key Laboratory of Remote Sensing Science, College of Global Change and Earth System Science, Beijing Normal University, Beijing, China
| | - Lu Liu
- grid.20513.350000 0004 1789 9964State Key Laboratory of Remote Sensing Science, College of Global Change and Earth System Science, Beijing Normal University, Beijing, China
| | - Chen Wang
- grid.20513.350000 0004 1789 9964State Key Laboratory of Remote Sensing Science, College of Global Change and Earth System Science, Beijing Normal University, Beijing, China
| | - Xiuhong Li
- grid.20513.350000 0004 1789 9964State Key Laboratory of Remote Sensing Science, College of Global Change and Earth System Science, Beijing Normal University, Beijing, China
| | - Qiang Liu
- grid.20513.350000 0004 1789 9964State Key Laboratory of Remote Sensing Science, College of Global Change and Earth System Science, Beijing Normal University, Beijing, China
| |
Collapse
|
6
|
An Improved Air Quality Index Machine Learning-Based Forecasting with Multivariate Data Imputation Approach. ATMOSPHERE 2022. [DOI: 10.3390/atmos13071144] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Accurate, timely air quality index (AQI) forecasting helps industries in selecting the most suitable air pollution control measures and the public in reducing harmful exposure to pollution. This article proposes a comprehensive method to forecast AQIs. Initially, the work focused on predicting hourly ambient concentrations of PM2.5 and PM10 using artificial neural networks. Once the method was developed, the work was extended to the prediction of other criteria pollutants, i.e., O3, SO2, NO2, and CO, which fed into the process of estimating AQI. The prediction of the AQI not only requires the selection of a robust forecasting model, it also heavily relies on a sequence of pre-processing steps to select predictors and handle different issues in data, including gaps. The presented method dealt with this by imputing missing entries using missForest, a machine learning-based imputation technique which employed the random forest (RF) algorithm. Unlike the usual practice of using RF at the final forecasting stage, we utilized RF at the data pre-processing stage, i.e., missing data imputation and feature selection, and we obtained promising results. The effectiveness of this imputation method was examined against a linear imputation method for the six criteria pollutants and the AQI. The proposed approach was validated against ambient air quality observations for Al-Jahra, a major city in Kuwait. Results obtained showed that models trained using missForest-imputed data could generalize AQI forecasting and with a prediction accuracy of 92.41% when tested on new unseen data, which is better than earlier findings.
Collapse
|
7
|
The Relationship between PM2.5 and PM10 in Central Italy: Application of Machine Learning Model to Segregate Anthropogenic from Natural Sources. ATMOSPHERE 2022. [DOI: 10.3390/atmos13030484] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Particular Matter (PM) data are the most used for the assessment of air quality, but it is also useful to monitor VOC and CO. The health impact of PM increases with decreasing aerodynamic dimensions, therefore most of the monitoring is aimed at PM10 (fraction of PM with aerodynamic dimensions smaller than 10 µm) and PM2.5 (fraction with aerodynamic dimensions lower than 2.5 µm). Generally, anthropogenic emissions contribute mainly to PM2.5 levels, whereas natural sources can largely affect PM10 concentrations. PM2.5/PM10 ratio can be used as a proxy of the origin (anthropogenic vs natural) of the PM, providing a useful indication about the main sources of PM that characterizes a specific geographical or urban setting. This paper presents the results of the analysis of continuous measurements of PM10 and PM2.5 concentrations at eight stations of the regional air quality monitoring network in Abruzzo (Central Italy), in the period 2017–2018. The application of models based on machine learning technique shows that PM2.5/PM10 ratio can be used to classify PM emissions and to know the nature of the emission source (natural and anthropogenic), under determinate conditions, and properly taking into account the meteorological parameters.
Collapse
|
8
|
Asadi M, McPhedran K. Biogas maximization using data-driven modelling with uncertainty analysis and genetic algorithm for municipal wastewater anaerobic digestion. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2021; 293:112875. [PMID: 34062425 DOI: 10.1016/j.jenvman.2021.112875] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Revised: 05/19/2021] [Accepted: 05/21/2021] [Indexed: 06/12/2023]
Abstract
Anaerobic digestion processes create biogases that can be useful sources of energy. The development of data-driven models of anaerobic digestion processes via operating parameters can lead to increased biogas production rates, resulting in greater energy production, through process modification and optimization. This study assessed processed and unprocessed input operating parameter variables for the development of regression models with transparent structures ('white-box' models) to: (1) estimate biogas production rates from municipal wastewater treatment plant (MWTP) anaerobic digestors; (2) compare their performances to artificial neural network (ANN) and adaptive network-based fuzzy inference system (ANFIS) models with opaque structures ('black-box' models) using Monte Carlo Simulation for uncertainty analysis; and (3) integrate the models with a genetic algorithm (GA) to optimize operating parameters for maximization of MWTP biogas production rates. The input variables were anaerobic digestion operating parameters from a MWTP including volatile fatty acids, total/fixed/volatile solids, pH, and inflow rate, which were processed via correlation tests and principal component analysis. Overall, the results indicated that the processed data did not improve regression model performances. Additionally, the developed non-linear regression model with the unprocessed inputs had the best performance based on values including R = 0.81, RMSE = 0.95, and IA = 0.89. However, this model was less accurate, but interestingly had less uncertainty, as compared to ANN and ANFIS models which indicates the compromise between model accuracy and uncertainty. Thus, all three models were coupled with GA optimization with maximum biogas production rate estimates of 22.0, 23.1, and 28.6 m3/min for ANN, ANFIS, and non-linear regression models, respectively.
Collapse
Affiliation(s)
- Mohsen Asadi
- Department of Civil, Geological & Environmental Engineering, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Kerry McPhedran
- Department of Civil, Geological & Environmental Engineering, University of Saskatchewan, Saskatoon, Saskatchewan, Canada.
| |
Collapse
|
9
|
Umar IK, Nourani V, Gökçekuş H. A novel multi-model data-driven ensemble approach for the prediction of particulate matter concentration. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2021; 28:49663-49677. [PMID: 33939094 DOI: 10.1007/s11356-021-14133-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 04/22/2021] [Indexed: 06/12/2023]
Abstract
Accuracy in the prediction of the particulate matter (PM2.5 and PM10) concentration in the atmosphere is essential for both its monitoring and control. In this study, a novel neuro fuzzy ensemble (NF-E) model was proposed for prediction of hourly PM2.5 and PM10 concentration. The NF-E involves careful selection of relevant input parameters for base modelling and using an adaptive neuro fuzzy inference system (ANFIS) model as a nonlinear kernel for obtaining ensemble output. The four base models used include ANFIS, artificial neural network (ANN), support vector regression (SVR) and multilinear regression (MLR). The dominant input parameters for developing the base models were selected using two nonlinear approaches (mutual information and single-input single-output ANN-based sensitivity analysis) and a conventional Pearson correlation coefficient. The NF-E model was found to predict both PM2.5 and PM10 with higher generalization ability and least error. The NF-E model outperformed all the single base models and other linear ensemble techniques with a Nash-Sutcliffe efficiency (NSE) of 0.9594 and 0.9865, mean absolute error (MAE) of 1.63 μg/m3 and 1.66 μg/m3 and BIAS of 0.0760 and 0.0340 in the testing stage for PM2.5 and PM10, respectively. The NF-E could improve the efficiency of other models by 4-22% for PM2.5 and 3-20% for PM10 depending on the model.
Collapse
Affiliation(s)
- Ibrahim Khalil Umar
- Faculty of Civil and Environmental Engineering, Near East University, Near East Boulevard, Via Mersin, 99138, Nicosia, North Cyprus, Turkey.
| | - Vahid Nourani
- Center of Excellence in Hydroinformatics, Faculty of Civil Engineering, University of Tabriz, Tabriz, Iran
- Faculty of Civil and Environmental Engineering, Near East University, Near East Boulevard, Via Mersin, 99138, Nicosia, North Cyprus, Turkey
| | - Hüseyin Gökçekuş
- Faculty of Civil and Environmental Engineering, Near East University, Near East Boulevard, Via Mersin, 99138, Nicosia, North Cyprus, Turkey
| |
Collapse
|
10
|
Torkashvand J, Jafari AJ, Hopke PK, Shahsavani A, Hadei M, Kermani M. Airborne particulate matter in Tehran's ambient air. JOURNAL OF ENVIRONMENTAL HEALTH SCIENCE & ENGINEERING 2021; 19:1179-1191. [PMID: 34150304 PMCID: PMC8172739 DOI: 10.1007/s40201-020-00573-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2020] [Accepted: 10/15/2020] [Indexed: 05/09/2023]
Abstract
In recent decades, particulate matter (PM) concentrations in Tehran have exceeded the World Health Organization's (WHO) guideline on most days. In this study, a search protocol was defined by identifying the keywords, to carry out a systematic review of the concentrations and composition of PM in Tehran's ambient air. For this purpose, searches were done in Scopus, PubMed, and Web of Science in 2019. Among the founded articles (197 in Scopus, 61 in PubMed, and 153 in Web of Science). The results show that in Tehran, the annual average PM10 exceeded the WHO guidelines and for more than 50.0% of the days, the PM2.5 concentration was more than WHO 24-h guidance value. The PM concentration in Tehran has two seasonal peaks due to poorer dispersion and suspension from dry land, respectively. Tehran has two daily PM peaks due to traffic and changes in boundary-layer heights; one just after midnight and the other during morning rush hour. Indoor concentrations of PM10 and PM2.5 in Tehran were 10.6 and 21.8 times higher than the corresponding values in ambient air. Tehran represents a unique case of problems of controlling PM because of its geographical setting, emission sources, and land use. This review provided a comprehensive assessment for decision makers to assist them in making appropriate policy decisions to improve the air quality. Considering factors such as diversity of resources, temporal and spatial variations, and urban location is essential in developing control plans. Also future studies should focus more on PM reduction plans.
Collapse
Affiliation(s)
- Javad Torkashvand
- Research Center for Environmental Health Technology, Iran University of Medical Sciences, Tehran, Iran
- Department of Environmental Health Engineering, School of Public Health, Iran University of Medical Sciences, Tehran, IR Iran
| | - Ahamd Jonidi Jafari
- Department of Environmental Health Engineering, School of Public Health, Iran University of Medical Sciences, Tehran, IR Iran
| | - Philip K. Hopke
- Center for Air Resources Engineering and Science, Clarkson University, Potsdam, NY USA
- Department of Public Health Sciences, University of Rochester School of Medicine and Dentistry, Rochester, NY USA
| | - Abbas Shahsavani
- Environmental and Occupational Hazards Control Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran
- Department of Environmental Health Engineering, School of Public Health and Safety, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mostafa Hadei
- Department of Environmental Health Engineering, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
| | - Majid Kermani
- Research Center for Environmental Health Technology, Iran University of Medical Sciences, Tehran, Iran
- Department of Environmental Health Engineering, School of Public Health, Iran University of Medical Sciences, Tehran, IR Iran
| |
Collapse
|
11
|
Fallah B, Torabi F. Application of periodic parameters and their effects on the ANN landfill gas modeling. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2021; 28:28490-28506. [PMID: 33538970 DOI: 10.1007/s11356-021-12498-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 01/12/2021] [Indexed: 06/12/2023]
Abstract
To reach a practical landfill gas management system and to diminish the negative environmental impacts from landfills, accurate methane (CH4) prediction is essential. In this study, the preprocessing steps including minimizing multicollinearity, removal of outliers, and errors with missing data imputation are applied to enhance the data quality. This study is the first at employing periodic parameters in the two-stage non-linear auto-regressive model with exogenous inputs (NARX) with the aim of providing a convenient and precise approach to predict the daily CH4 collection rate from a municipal landfill in Regina, SK, Canada. Using a stepwise procedure, various volumes of training data were assessed, and concluded that employing the 3-year training data reduced the mean absolute percentage error (MAPE) of the CH4 prediction model by 26.97% at the testing stage. The favorable artificial neural network model performance was obtained using the day of the year (DOY) as a sole input of the time series model with MAPE of 2.12% showing its acceptable ability in CH4 prediction. Using an only DOY-based model is especially remarkable because of its simplicity and high accuracy showing a convenient and effective approach in time landfill gas modeling, particularly for the landfills with no reliable climatic data.
Collapse
Affiliation(s)
- Bahareh Fallah
- Environmental Systems Engineering, University of Regina, 3737 Wascana Parkway, Regina, Saskatchewan, S4S 0A2, Canada
| | - Farshid Torabi
- Petroleum Systems Engineering, University of Regina, 3737 Wascana Parkway, Regina, Saskatchewan, S4S 0A2, Canada.
| |
Collapse
|
12
|
Deep Learning Forecasts of Soil Moisture: Convolutional Neural Network and Gated Recurrent Unit Models Coupled with Satellite-Derived MODIS, Observations and Synoptic-Scale Climate Index Data. REMOTE SENSING 2021. [DOI: 10.3390/rs13040554] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Remotely sensed soil moisture forecasting through satellite-based sensors to estimate the future state of the underlying soils plays a critical role in planning and managing water resources and sustainable agricultural practices. In this paper, Deep Learning (DL) hybrid models (i.e., CEEMDAN-CNN-GRU) are designed for daily time-step surface soil moisture (SSM) forecasts, employing the gated recurrent unit (GRU), complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), and convolutional neural network (CNN). To establish the objective model’s viability for SSM forecasting at multi-step daily horizons, the hybrid CEEMDAN-CNN-GRU model is tested at 1st, 5th, 7th, 14th, 21st, and 30th day ahead period by assimilating a comprehensive pool of 52 predictor dataset obtained from three distinct data sources. Data comprise satellite-derived Global Land Data Assimilation System (GLDAS) repository a global, high-temporal resolution, unique terrestrial modelling system, and ground-based variables from Scientific Information Landowners (SILO) and synoptic-scale climate indices. The results demonstrate the forecasting capability of the hybrid CEEMDAN-CNN-GRU model with respect to the counterpart comparative models. This is supported by a relatively lower value of the mean absolute percentage and root mean square error. In terms of the statistical score metrics and infographics employed to test the final model’s utility, the proposed CEEMDAN-CNN-GRU models are considerably superior compared to a standalone and other hybrid method tested on independent SSM data developed through feature selection approaches. Thus, the proposed approach can be successfully implemented in hydrology and agriculture management.
Collapse
|
13
|
Ucun Ozel H, Gemici BT, Gemici E, Ozel HB, Cetin M, Sevik H. Application of artificial neural networks to predict the heavy metal contamination in the Bartin River. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2020; 27:42495-42512. [PMID: 32705560 DOI: 10.1007/s11356-020-10156-w] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2020] [Accepted: 07/15/2020] [Indexed: 06/11/2023]
Abstract
In this study, copper (Cu), iron (Fe), zinc (Zn), manganese (Mn), nickel (Ni), and lead (Pb) analyses were performed, and the results were modelled by artificial neural networks (ANN) and adaptive neuro-fuzzy inference system (ANFIS). Samples were taken from 3 stations selected on the Bartin River for 1 year between December 2012 and December 2013. Radial basis neural network (RBANN), multilayer perceptron (MLP) neural networks models, and adaptive neuro-fuzzy inference system (ANFIS) were applied to the data in order to predict the heavy metal concentrations. As a result of the study, the RMSE and MAE values of all the heavy metal models were found to have very low error values during the test phase, and it was found that the models created using MLP had R2 values higher than 0.77 during the test phase; the test phase R2 values of the models using RBN method were found to be ranging between 0.773 and 0.989, and the test phase R2 value of the ANFIS model was higher than 0.80. If sorted from the best model to the worst by taking the MAE and RMSE values into consideration based on the test evaluation results, according to the heavy metal types, where all of the MLP, RBN, and ANFIS models were generally approximate to each other, RBN was successful for Cu, Zn, and Mn, while MLP model was successful for Ni and ANFIS model for Fe and Pb. According to the results, it can be inferred that the heavy metal contents can be estimated approximately with artificial intelligence models and relatively easy-to-measure parameters; it will be possible to detect heavy metals which are harmful to the viability of the rivers, both quickly and economically.
Collapse
Affiliation(s)
- Handan Ucun Ozel
- Faculty of Engineering, Architecture and Design, Department of Environmental Engineering, Bartin University, Bartin, Turkey
| | - Betul Tuba Gemici
- Faculty of Engineering, Architecture and Design, Department of Environmental Engineering, Bartin University, Bartin, Turkey
| | - Ercan Gemici
- Faculty of Engineering, Architecture and Design, Department of Civil Engineering, Bartin University, Bartin, Turkey
| | - Halil Baris Ozel
- Faculty of Forestry, Department of Forest Engineering, Bartin University, Bartin, Turkey
| | - Mehmet Cetin
- Faculty of Engineering and Architecture, Department of Landscape Architecture, Kastamonu University, Kastamonu, Turkey.
| | - Hakan Sevik
- Faculty of Engineering and Architecture, Department of Environmental Engineering, Kastamonu University, Kastamonu, Turkey
| |
Collapse
|
14
|
Dupont MF, Elbourne A, Cozzolino D, Chapman J, Truong VK, Crawford RJ, Latham K. Chemometrics for environmental monitoring: a review. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2020; 12:4597-4620. [PMID: 32966380 DOI: 10.1039/d0ay01389g] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Environmental monitoring is necessary to ensure the overall health and conservation of an ecosystem. However, ecosystems (e.g. air, water, soil), are complex, involving numerous processes (both native and external), inputs, contaminants, and living organisms. As such, monitoring an environmental system is not a trivial task. The data obtained from natural systems is often multifaceted and convoluted, as a multitude of inputs can be intertwined within the matrix of the information obtained as part of a study. This means that trends and important results can be easily overlooked by conventional and single dimensional data analysis protocols. Recently, chemometric methods have emerged as a powerful method for maximizing the details contained within a chemical data set. Specifically, chemometrics refers to the use of mathematical and statistical analysis methods to evaluate chemical data, beyond univariant analysis. This type of analysis can provide a quantitative description of environmental measurements, while also having the capacity to reveal previously overlooked trends in data sets. Applying chemometrics to environmental data allows us to identify and describe the inter-relationship of environmental drivers, sources of contamination, and their potential impact upon the environment. This review aims to provide a detailed understanding of chemometric techniques, how they are currently used in environmental monitoring, and how these techniques can be used to improve current practices. An enhanced ability to monitor environmental conditions and to predict trends would be greatly beneficial to government and research agencies in their ability to develop environmental policies and analytical procedures.
Collapse
|
15
|
Fallah B, Ng KTW, Vu HL, Torabi F. Application of a multi-stage neural network approach for time-series landfill gas modeling with missing data imputation. WASTE MANAGEMENT (NEW YORK, N.Y.) 2020; 116:66-78. [PMID: 32784123 DOI: 10.1016/j.wasman.2020.07.034] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Revised: 05/06/2020] [Accepted: 07/20/2020] [Indexed: 05/20/2023]
Abstract
To mitigate the greenhouse gas effect, accurate and precise landfill gas prediction models are required for more precise prediction of the amount and recovery time of methane gas from landfills. When the study associates to greenhouse gas emissions problems, time series prediction models are of considerable interests, in which significant past records of gas data are required. This study is the first to specially impute the missing methane (CH4) data for applying in time series artificial neural network (ANN) model in an attempt to predict daily CH4 generation rate from a landfill in Regina, SK, Canada. Pre-processing was conducted on data to evaluate independent and significant meteorological input variables and provide suitable dataset for developing CH4 generation models. A two-stage time series model proposed in this study was performed by missing data imputation at the first stage, followed by a neural network auto-regressive model with exogenous inputs (NARX) at the second stage. The model with 3 layers, 5 climatic variables and 9 neurons in the hidden layer was the optimal structure. This model shows the high performance in CH4 prediction with the average index of agreement of 0.92 and the average mean absolute percentage error (MAPE) of 3.03% during the testing stage. Missing data imputation coupled with NARX method decreased the mean squared error (MSE) of the model by 84% (compared to Multilayer Perceptrons neural network model) in the testing period representing the effectiveness of missing data estimation coupling with time series ANN models in daily CH4 generation prediction.
Collapse
Affiliation(s)
- Bahareh Fallah
- Environmental Systems Engineering, University of Regina, Saskatchewan, Canada
| | - Kelvin Tsun Wai Ng
- Environmental Systems Engineering, University of Regina, Saskatchewan, Canada
| | - Hoang Lan Vu
- Environmental Systems Engineering, University of Regina, Saskatchewan, Canada
| | - Farshid Torabi
- Environmental Systems Engineering, University of Regina, Saskatchewan, Canada.
| |
Collapse
|
16
|
Tefera W, Kumie A, Berhane K, Gilliland F, Lai A, Sricharoenvech P, Samet J, Patz J, Schauer JJ. Chemical Characterization and Seasonality of Ambient Particles (PM 2.5) in the City Centre of Addis Ababa. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:E6998. [PMID: 32987918 PMCID: PMC7579520 DOI: 10.3390/ijerph17196998] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 09/13/2020] [Accepted: 09/19/2020] [Indexed: 11/25/2022]
Abstract
Ambient air pollution is a growing public health concern in major African cities, including Addis Ababa (Ethiopia), where little information is available on fine particulate matter (PM2.5, with aerodynamic diameter <2.5 µm) pollution. This paper aims to characterize annual PM2.5, including bulk composition and seasonal patterns, in Addis Ababa. We collected 24-h PM2.5 samples in the central city every 6 days from November 2015 to November 2016. The mean (±SD) daily PM2.5 concentration was 53.8 (±25.0) µg/m3, with 90% of sampled days exceeding the World Health Organization's guidelines. Principal components were organic matter (OM, 44.5%), elemental carbon (EC, 25.4%), soil dust (13.5%), and SNA (sulfate, nitrate, and ammonium ions, 8.2%). Higher PM2.5 concentrations were observed during the heavy rain season, while crustal dust concentrations ranged from 2.9 to 37.6%, with higher levels during dry months. Meteorological variables, vehicle emissions, biomass fuels, unpaved roads, and construction activity contribute to poor air quality. Compared to the Air Quality Index (AQI), 31% and 36% of observed days were unhealthy for everyone and unhealthy for sensitive groups, respectively. We recommend adopting effective prevention strategies and pursuing research on vehicle emissions, biomass burning, and dust control to curb air pollution in the city.
Collapse
Affiliation(s)
- Worku Tefera
- School of Public Health, College of Health Sciences, Addis Ababa University, Addis Ababa 9086, Ethiopia; or
| | - Abera Kumie
- School of Public Health, College of Health Sciences, Addis Ababa University, Addis Ababa 9086, Ethiopia; or
| | - Kiros Berhane
- Department of Biostatistics, Columbia University, New York, NY 10032, USA;
| | - Frank Gilliland
- Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA;
| | - Alexandra Lai
- Environmental Chemistry and Technology Program, University of Wisconsin-Madison, Madison, WI 53706, USA; (A.L.); (P.S.); (J.J.S.)
| | - Piyaporn Sricharoenvech
- Environmental Chemistry and Technology Program, University of Wisconsin-Madison, Madison, WI 53706, USA; (A.L.); (P.S.); (J.J.S.)
| | - Jonathan Samet
- Office of the Dean, Colorado School of Public Health, Aurora, CO 80045, USA;
| | - Jonathan Patz
- Global Health Institute, University of Wisconsin, Madison, WI 53706, USA;
| | - James J. Schauer
- Environmental Chemistry and Technology Program, University of Wisconsin-Madison, Madison, WI 53706, USA; (A.L.); (P.S.); (J.J.S.)
- Wisconsin State Hygiene Laboratory, University of Wisconsin-Madison, Madison, WI 53706, USA
| |
Collapse
|
17
|
Goulier L, Paas B, Ehrnsperger L, Klemm O. Modelling of Urban Air Pollutant Concentrations with Artificial Neural Networks Using Novel Input Variables. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:E2025. [PMID: 32204378 PMCID: PMC7143381 DOI: 10.3390/ijerph17062025] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Revised: 03/11/2020] [Accepted: 03/13/2020] [Indexed: 11/16/2022]
Abstract
Since operating urban air quality stations is not only time consuming but also costly, and because air pollutants can cause serious health problems, this paper presents the hourly prediction of ten air pollutant concentrations (CO2, NH3, NO, NO2, NOx, O3, PM1, PM2.5, PM10 and PN10) in a street canyon in Münster using an artificial neural network (ANN) approach. Special attention was paid to comparing three predictor options representing the traffic volume: we included acoustic sound measurements (sound), the total number of vehicles (traffic), and the hour of the day and the day of the week (time) as input variables and then compared their prediction powers. The models were trained, validated and tested to evaluate their performance. Results showed that the predictions of the gaseous air pollutants NO, NO2, NOx, and O3 reveal very good agreement with observations, whereas predictions for particle concentrations and NH3 were less successful, indicating that these models can be improved. All three input variable options (sound, traffic and time) proved to be suitable and showed distinct strengths for modelling various air pollutant concentrations.
Collapse
Affiliation(s)
- Laura Goulier
- Climatology Research Group, University of Münster, Heisenbergstraße 2, 48149 Münster, Germany; (B.P.); (L.E.); (O.K.)
| | | | | | | |
Collapse
|
18
|
Ye Z, Yang J, Zhong N, Tu X, Jia J, Wang J. Tackling environmental challenges in pollution controls using artificial intelligence: A review. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 699:134279. [PMID: 33736193 DOI: 10.1016/j.scitotenv.2019.134279] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2019] [Revised: 09/02/2019] [Accepted: 09/03/2019] [Indexed: 06/12/2023]
Abstract
This review presents the developments in artificial intelligence technologies for environmental pollution controls. A number of AI approaches, which start with the reliable mapping of nonlinear behavior between inputs and outputs in chemical and biological processes in terms of prediction models to the emerging optimization and control algorithms that study the pollutants removal processes and intelligent control systems, have been developed for environmental clean-ups. The characteristics, advantages and limitations of AI methods, including single and hybrid AI methods, were overviewed. Hybrid AI methods exhibited synergistic effects, but with computational heaviness. The up-to-date review summarizes i) Various artificial neural networks employed in wastewater degradation process for the prediction of removal efficiency of pollutants and the search of optimizing experimental conditions; ii) Evaluation of fuzzy logic used for intelligent control of aerobic stage of wastewater treatment process; iii) AI-aided soft-sensors for precisely on-line/off-line estimation of hard-to-measure parameters in wastewater treatment plants; iv) Single and hybrid AI methods applied to estimate pollutants concentrations and design monitoring and early-warning systems for both aquatic and atmospheric environments; v) AI modelings of short-term, mid-term and long-term solid waste generations, and various ANNs for solid waste recycling and reduction. Finally, the future challenges of AI-based models employed in the environmental fields are discussed and proposed.
Collapse
Affiliation(s)
- Zhiping Ye
- College of Environment, Zhejiang University of Technology, Hangzhou 310014, PR China
| | - Jiaqian Yang
- College of Environment, Zhejiang University of Technology, Hangzhou 310014, PR China
| | - Na Zhong
- College of Environment, Zhejiang University of Technology, Hangzhou 310014, PR China
| | - Xin Tu
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool L69 3GJ, United Kingdom
| | - Jining Jia
- College of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310014, PR China
| | - Jiade Wang
- College of Environment, Zhejiang University of Technology, Hangzhou 310014, PR China.
| |
Collapse
|
19
|
Pak U, Ma J, Ryu U, Ryom K, Juhyok U, Pak K, Pak C. Deep learning-based PM2.5 prediction considering the spatiotemporal correlations: A case study of Beijing, China. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 699:133561. [PMID: 31689669 DOI: 10.1016/j.scitotenv.2019.07.367] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Revised: 06/28/2019] [Accepted: 07/22/2019] [Indexed: 05/26/2023]
Abstract
Air pollution is one of the serious environmental problems that humankind faces and also a hot topic in Northeastern Asia. Therefore, the accurate prediction of PM2.5 (particulate matter with an aerodynamic diameter of ≤2.5 μm) is very significant in the management of human health and the decision-making of government for the environmental management. In this study, a spatiotemporal convolutional neural network (CNN) and long short-term (LSTM) memory (CNN-LSTM) model (also called PM (particulate matter) predictor) was proposed and used to predict the next day's daily average PM2.5 concentration in Beijing City. The spatiotemporal correlation analysis using the mutual information (MI) was performed, considering not only the linear correlation but also nonlinear correlation between target and observation parameters; in addition, it was fully considered for the whole area of China with the target monitoring station as the center and also for the historic air quality and meteorological data. As a result, the spatiotemporal feature vector (STFV) which reflects both linear and nonlinear correlations between parameters was effectively constructed. The PM predictor secured a fast and accurate prediction performance by efficiently extracting the inherent features of the latent air quality and meteorological input data associated with PM2.5 through CNN and by fully reflecting the long-term historic process of input time series data through LSTM. The air quality and meteorological data from the 384 monitoring stations which represents the whole area of China with Beijing City as the center during the 3 years (Jan. 1st, 2015 to Dec. 31th, 2017) were used to verify the validity of the proposed method. In conclusion, the proposed method was proved to have a better stability and prediction performance compared to multi-layer perceptron (MLP) and LSTM models.
Collapse
Affiliation(s)
- Unjin Pak
- Department of Automation Engineering, Kim Chaek University of Technology, Pyongyang 950003, Democratic People's Republic of Korea.
| | - Jun Ma
- Department of Geology, Kim Il Sung University, Pyongyang 999093, Democratic People's Republic of Korea
| | - Unsok Ryu
- School of Information Science, Kim Il Sung University, Pyongyang 999093, Democratic People's Republic of Korea
| | - Kwangchol Ryom
- Department of Metallurgical Engineering, Kim Chaek University of Technology, Pyongyang 950003, Democratic People's Republic of Korea
| | - U Juhyok
- Digital Library, Kim Chaek University of Technology, Pyongyang 950003, Democratic People's Republic of Korea
| | - Kyongsok Pak
- School of Information Science, Kim Il Sung University, Pyongyang 999093, Democratic People's Republic of Korea
| | - Chanil Pak
- Information Center, Kim Chaek University of Technology, Pyongyang 950003, Democratic People's Republic of Korea
| |
Collapse
|
20
|
Li X, Zhang X. Predicting ground-level PM 2.5 concentrations in the Beijing-Tianjin-Hebei region: A hybrid remote sensing and machine learning approach. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2019; 249:735-749. [PMID: 30933771 DOI: 10.1016/j.envpol.2019.03.068] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Revised: 02/13/2019] [Accepted: 03/17/2019] [Indexed: 06/09/2023]
Abstract
An accurate estimation of PM2.5 (fine particulate matters with diameters ≤ 2.5 μm) concentration is critical for health risk assessment and generating air pollution control strategies. In this study, a hybrid remote sensing and machine learning approach, named RSRF model is proposed to estimate daily ground-level PM2.5 concentrations, which integrates Random Forest (RF), one of machine learning (ML) models, and aerosol optical depth (AOD), one of remote sensing (RS) products. The proposed RSRF model provides an opportunity for an adequate characterization of real-time spatiotemporal PM2.5 distributions at uninhabited places and complex surfaces. It also offers advantages in handling complicated non-linear relationships among a large number of meteorological, environmental and air pollutant factors, as well as ever-increasing environmental data sets. The applicability of the proposed RSRF model is tested in the Beijing-Tianjin-Hebei region (BTH region) during 2015-2017. Deep Blue (DB) AOD from Aqua-retrieved Collection 6.1 (C_61) aerosol products of Moderate Resolution Imaging Spectroradiometer (MODIS) is validated with Aerosol Robotic Network. The validation results indicate C_61 DB AOD has a high correlation with ground based AOD in the BTH region. The proposed RSRF model performed well in characterizing spatiotemporal variations of annual and seasonal PM2.5 concentrations. It not only is useful to quantify the relationships between PM2.5 and relevant factors such as DB AOD, meteorological and air pollutant variables, but also can provide decision support for air pollution control at a regional environment during haze periods.
Collapse
Affiliation(s)
- Xintong Li
- School of Environmental Science and Engineering, Shandong University, Qingdao, Shandong, 266237, China
| | - Xiaodong Zhang
- School of Environmental Science and Engineering, Shandong University, Qingdao, Shandong, 266237, China.
| |
Collapse
|
21
|
Singh SK, Bejagam KK, An Y, Deshmukh SA. Machine-Learning Based Stacked Ensemble Model for Accurate Analysis of Molecular Dynamics Simulations. J Phys Chem A 2019; 123:5190-5198. [DOI: 10.1021/acs.jpca.9b03420] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
| | - Karteek K. Bejagam
- Department of Chemical Engineering, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Yaxin An
- Department of Chemical Engineering, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Sanket A. Deshmukh
- Department of Chemical Engineering, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
22
|
Ansari M, Ehrampoush MH. Meteorological correlates and AirQ + health risk assessment of ambient fine particulate matter in Tehran, Iran. ENVIRONMENTAL RESEARCH 2019; 170:141-150. [PMID: 30579988 DOI: 10.1016/j.envres.2018.11.046] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Revised: 11/13/2018] [Accepted: 11/29/2018] [Indexed: 06/09/2023]
Abstract
There are few studies in the world that have been evaluated the mortality due to exposure to particulate matter smaller than 2.5 µm by AirQ+ software. Therefore, the study aimed to correlate between fine particulate matter (PM2.5) and meteorological variables and estimate all-cause annual mortality and mortality from cerebrovascular disease (stroke), ischemic heart disease (IHD), acute lower respiratory infection (ALRI), lung cancer (LC), chronic obstructive pulmonary disease (COPD) attributed to long-term exposure to ambient PM2.5 in Tehran from March 2017 to March 2018 using the WHO AirQ+ software. Data related to air quality, meteorological condition, population and the baseline incidence rates of health endpoints in Tehran were gathered from government agencies. The association between the PM2.5 concentrations and meteorological variables in the period of study were assessed by correlation analysis. The results of correlation analysis showed a weak positive correlation between PM2.5 concentrations and average monthly temperature (r = 0.42, P < 0.05) and average monthly humidity (r = 0.37, P < 0.05) in Tehran. The quantitative risk assessment related to all-cause annual mortality, the mortality of IHD, stroke, COPD, LC and ALRI were estimated 6710, 3797, 1145, 172, 135 and 27 cases, respectively. The results of regression association analysis between PM2.5 and the number of recorded deaths was showed that with an increase of one microgram per cubic meter of PM2.5, it is expected that about 27 cases will be added to air pollution mortality in Tehran.
Collapse
Affiliation(s)
- Mohsen Ansari
- Student Research Committee, Shahid Sadoughi University of Medical Sciences, Yazd, Iran; Environmental Science and Technology Research Center, Department of Environmental Health Engineering, Shahid Sadoughi University of Medical Sciences, Yazd, Iran; Air Quality Control Company, Municipality of Tehran, Tehran, Iran
| | - Mohammad Hassan Ehrampoush
- Environmental Science and Technology Research Center, Department of Environmental Health Engineering, Shahid Sadoughi University of Medical Sciences, Yazd, Iran.
| |
Collapse
|
23
|
Nayeb Yazdi M, Arhami M, Delavarrafiee M, Ketabchy M. Developing air exchange rate models by evaluating vehicle in-cabin air pollutant exposures in a highway and tunnel setting: case study of Tehran, Iran. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2019; 26:501-513. [PMID: 30406592 DOI: 10.1007/s11356-018-3611-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Accepted: 10/26/2018] [Indexed: 06/08/2023]
Abstract
The passengers inside vehicles could be exposed to high levels of air pollutants particularly while driving on highly polluted and congested traffic roadways. In order to study such exposure levels and its relation to the cabin ventilation condition, a monitoring campaign was conducted to measure the levels inside the three most common types of vehicles in Tehran, Iran (a highly air polluted megacity). In this regard, carbon monoxide (CO) and particulate matter (PM) were measured for various ventilation settings, window positions, and vehicle speeds while driving on the Resalat Highway and through the Resalat Tunnel. Results showed on average in-cabin exposure to particle number and PM10 for the open windows condition was seven times greater when compared to closed windows and air conditioning on. When the vehicle was passing through the tunnel, in-cabin CO and particle number increased 100 and 30%, respectively, compared to driving on highway. Air exchange rate (AER) is a significant factor when evaluating in-cabin air pollutants level. AER was measured and simulated by a model developed through a Monte Carlo analysis of uncertainty and considering two main affecting variables, vehicle speed and fan speed. The lowest AER was 7 h-1 for the closed window and AC on conditions, whereas the highest AER was measured 70 h-1 for an open window condition and speed of 90 km h-1. The results of our study can assist policy makers in controlling in-cabin pollutant exposure and in planning effective strategies for the protection of public health.
Collapse
Affiliation(s)
- Mohammad Nayeb Yazdi
- Department of Civil Engineering, Sharif University of Technology, Azadi Avenue, P.O. Box 11365-8639, Tehran, Iran
- Department of Biological Systems Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | - Mohammad Arhami
- Department of Civil Engineering, Sharif University of Technology, Azadi Avenue, P.O. Box 11365-8639, Tehran, Iran.
| | - Maryam Delavarrafiee
- Department of Civil Engineering, Sharif University of Technology, Azadi Avenue, P.O. Box 11365-8639, Tehran, Iran
- Department of Civil, Construction, and Environmental Engineering, North Carolina State University, Raleigh, NC, USA
| | - Mehdi Ketabchy
- Department of Civil Engineering, Sharif University of Technology, Azadi Avenue, P.O. Box 11365-8639, Tehran, Iran
- Department of Biological Systems Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
- Transportation Business Line, Gannett Fleming, Fairfax, USA
| |
Collapse
|
24
|
Weichenthal S, Hatzopoulou M, Brauer M. A picture tells a thousand…exposures: Opportunities and challenges of deep learning image analyses in exposure science and environmental epidemiology. ENVIRONMENT INTERNATIONAL 2019; 122:3-10. [PMID: 30473381 PMCID: PMC7615261 DOI: 10.1016/j.envint.2018.11.042] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Revised: 11/16/2018] [Accepted: 11/17/2018] [Indexed: 05/11/2023]
Abstract
BACKGROUND Artificial intelligence (AI) is revolutionizing our world, with applications ranging from medicine to engineering. OBJECTIVES Here we discuss the promise, challenges, and probable data sources needed to apply AI in the fields of exposure science and environmental health. In particular, we focus on the use of deep convolutional neural networks to estimate environmental exposures using images and other complementary data sources such as cell phone mobility and social media information. DISCUSSION Characterizing the health impacts of multiple spatially-correlated exposures remains a challenge in environmental epidemiology. A shift toward integrated measures that simultaneously capture multiple aspects of the urban built environment could improve efficiency and provide important insights into how our collective environments influence population health. The widespread adoption of AI in exposure science is on the frontier. This will likely result in new ways of understanding environmental impacts on health and may allow for analyses to be efficiently scaled for broad coverage. Image-based convolutional neural networks may also offer a cost-effective means of estimating local environmental exposures in low and middle-income countries where monitoring and surveillance infrastructure is limited. However, suitable databases must first be assembled to train and evaluate these models and these novel approaches should be complemented with traditional exposure metrics. CONCLUSIONS The promise of deep learning in environmental health is great and will complement existing measurements for data-rich settings and could enhance the resolution and accuracy of estimates in data poor scenarios. Interdisciplinary partnerships will be needed to fully realize this potential.
Collapse
Affiliation(s)
- Scott Weichenthal
- McGill University, Department of Epidemiology, Biostatistics and Occupational Health, Montreal, QC, Canada.
| | | | - Michael Brauer
- University of British Columbia, School of Population and Public Health, Vancouver, BC, Canada
| |
Collapse
|
25
|
Isiyaka HA, Mustapha A, Juahir H, Phil-Eze P. Water quality modelling using artificial neural network and multivariate statistical techniques. ACTA ACUST UNITED AC 2018. [DOI: 10.1007/s40808-018-0551-9] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
26
|
Freeman BS, Taylor G, Gharabaghi B, Thé J. Forecasting air quality time series using deep learning. JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION (1995) 2018; 68:866-886. [PMID: 29652217 DOI: 10.1080/10962247.2018.1459956] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
UNLABELLED This paper presents one of the first applications of deep learning (DL) techniques to predict air pollution time series. Air quality management relies extensively on time series data captured at air monitoring stations as the basis of identifying population exposure to airborne pollutants and determining compliance with local ambient air standards. In this paper, 8 hr averaged surface ozone (O3) concentrations were predicted using deep learning consisting of a recurrent neural network (RNN) with long short-term memory (LSTM). Hourly air quality and meteorological data were used to train and forecast values up to 72 hours with low error rates. The LSTM was able to forecast the duration of continuous O3 exceedances as well. Prior to training the network, the dataset was reviewed for missing data and outliers. Missing data were imputed using a novel technique that averaged gaps less than eight time steps with incremental steps based on first-order differences of neighboring time periods. Data were then used to train decision trees to evaluate input feature importance over different time prediction horizons. The number of features used to train the LSTM model was reduced from 25 features to 5 features, resulting in improved accuracy as measured by Mean Absolute Error (MAE). Parameter sensitivity analysis identified look-back nodes associated with the RNN proved to be a significant source of error if not aligned with the prediction horizon. Overall, MAE's less than 2 were calculated for predictions out to 72 hours. IMPLICATIONS Novel deep learning techniques were used to train an 8-hour averaged ozone forecast model. Missing data and outliers within the captured data set were replaced using a new imputation method that generated calculated values closer to the expected value based on the time and season. Decision trees were used to identify input variables with the greatest importance. The methods presented in this paper allow air managers to forecast long range air pollution concentration while only monitoring key parameters and without transforming the data set in its entirety, thus allowing real time inputs and continuous prediction.
Collapse
Affiliation(s)
- Brian S Freeman
- a School of Engineering , University of Guelph , Guelph , Ontario , Canada
| | - Graham Taylor
- a School of Engineering , University of Guelph , Guelph , Ontario , Canada
| | - Bahram Gharabaghi
- a School of Engineering , University of Guelph , Guelph , Ontario , Canada
| | - Jesse Thé
- a School of Engineering , University of Guelph , Guelph , Ontario , Canada
- b Lakes Environmental , Waterloo , Ontario , Canada
| |
Collapse
|
27
|
Fan M, Hu J, Cao R, Ruan W, Wei X. A review on experimental design for pollutants removal in water treatment with the aid of artificial intelligence. CHEMOSPHERE 2018; 200:330-343. [PMID: 29494914 DOI: 10.1016/j.chemosphere.2018.02.111] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2017] [Revised: 01/27/2018] [Accepted: 02/19/2018] [Indexed: 06/08/2023]
Abstract
Water pollution occurs mainly due to inorganic and organic pollutants, such as nutrients, heavy metals and persistent organic pollutants. For the modeling and optimization of pollutants removal, artificial intelligence (AI) has been used as a major tool in the experimental design that can generate the optimal operational variables, since AI has recently gained a tremendous advance. The present review describes the fundamentals, advantages and limitations of AI tools. Artificial neural networks (ANNs) are the AI tools frequently adopted to predict the pollutants removal processes because of their capabilities of self-learning and self-adapting, while genetic algorithm (GA) and particle swarm optimization (PSO) are also useful AI methodologies in efficient search for the global optima. This article summarizes the modeling and optimization of pollutants removal processes in water treatment by using multilayer perception, fuzzy neural, radial basis function and self-organizing map networks. Furthermore, the results conclude that the hybrid models of ANNs with GA and PSO can be successfully applied in water treatment with satisfactory accuracies. Finally, the limitations of current AI tools and their new developments are also highlighted for prospective applications in the environmental protection.
Collapse
Affiliation(s)
- Mingyi Fan
- Guizhou Provincial Key Laboratory for Information Systems of Mountainous Areas and Protection of Ecological Environment, Guizhou Normal University, Guiyang 550001, Guizhou, China
| | - Jiwei Hu
- Guizhou Provincial Key Laboratory for Information Systems of Mountainous Areas and Protection of Ecological Environment, Guizhou Normal University, Guiyang 550001, Guizhou, China; Cultivation Base of Guizhou National Key Laboratory of Mountainous Karst Eco-environment, Guizhou Normal University, Guiyang 550001, Guizhou, China.
| | - Rensheng Cao
- Guizhou Provincial Key Laboratory for Information Systems of Mountainous Areas and Protection of Ecological Environment, Guizhou Normal University, Guiyang 550001, Guizhou, China
| | - Wenqian Ruan
- Guizhou Provincial Key Laboratory for Information Systems of Mountainous Areas and Protection of Ecological Environment, Guizhou Normal University, Guiyang 550001, Guizhou, China
| | - Xionghui Wei
- Department of Applied Chemistry, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| |
Collapse
|
28
|
Khajehpour H, Saboohi Y, Tsatsaronis G. Permissible emission limit estimation via iterative back-calculation: Case of Assaluyeh energy zone, southern Iran. INTEGRATED ENVIRONMENTAL ASSESSMENT AND MANAGEMENT 2018; 14:130-138. [PMID: 28815869 DOI: 10.1002/ieam.1970] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2016] [Revised: 06/26/2017] [Accepted: 08/14/2017] [Indexed: 06/07/2023]
Abstract
In the present research work, an environmental policy procedure for setting a cap on emissions, as a crucial step in any total emission control system, has been provided and evaluated. It is shown that general regulations on emission intensities and rates do not guarantee that ambient air quality standards are met in intense industrial zones. Local emission limits are necessary to meet ambient air quality standards in these zones. To that end, we used dispersion simulators to back-calculate pollutant concentration thresholds for a large and intense energy system in the Assaluyeh region of southern Iran. Verified modeling results indicate 218 d of pollutant concentration threshold exceedance in Assaluyeh in a simulated year. Back-calculation to assess the total permissible emission level indicates the need for 68% reduction in total emission to meet ambient air quality standards. We then used the model to help identify effective control strategies, including emission reductions and appropriate timing of specific operations. Integr Environ Assess Manag 2018;14:130-138. © 2017 SETAC.
Collapse
Affiliation(s)
- Hossein Khajehpour
- Department of Energy Engineering, Sharif University of Technology, Tehran, Iran
- Sharif Energy Research Institute, Tehran, Iran
- Institute for Energy Engineering, Technical University of Berlin, Germany
| | | | - George Tsatsaronis
- Institute for Energy Engineering, Technical University of Berlin, Germany
| |
Collapse
|
29
|
Modeling PM2.5 Urban Pollution Using Machine Learning and Selected Meteorological Parameters. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING 2017. [DOI: 10.1155/2017/5106045] [Citation(s) in RCA: 73] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Outdoor air pollution costs millions of premature deaths annually, mostly due to anthropogenic fine particulate matter (or PM2.5). Quito, the capital city of Ecuador, is no exception in exceeding the healthy levels of pollution. In addition to the impact of urbanization, motorization, and rapid population growth, particulate pollution is modulated by meteorological factors and geophysical characteristics, which complicate the implementation of the most advanced models of weather forecast. Thus, this paper proposes a machine learning approach based on six years of meteorological and pollution data analyses to predict the concentrations of PM2.5 from wind (speed and direction) and precipitation levels. The results of the classification model show a high reliability in the classification of low (<10 µg/m3) versus high (>25 µg/m3) and low (<10 µg/m3) versus moderate (10–25 µg/m3) concentrations of PM2.5. A regression analysis suggests a better prediction of PM2.5 when the climatic conditions are getting more extreme (strong winds or high levels of precipitation). The high correlation between estimated and real data for a time series analysis during the wet season confirms this finding. The study demonstrates that the use of statistical models based on machine learning is relevant to predict PM2.5 concentrations from meteorological data.
Collapse
|
30
|
Šiljić A, Antanasijević D, Perić-Grujić A, Ristić M, Pocajt V. Artificial neural network modelling of biological oxygen demand in rivers at the national level with input selection based on Monte Carlo simulations. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2015; 22:4230-4241. [PMID: 25280507 DOI: 10.1007/s11356-014-3669-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Accepted: 09/29/2014] [Indexed: 06/03/2023]
Abstract
Biological oxygen demand (BOD) is the most significant water quality parameter and indicates water pollution with respect to the present biodegradable organic matter content. European countries are therefore obliged to report annual BOD values to Eurostat; however, BOD data at the national level is only available for 28 of 35 listed European countries for the period prior to 2008, among which 46% of data is missing. This paper describes the development of an artificial neural network model for the forecasting of annual BOD values at the national level, using widely available sustainability and economical/industrial parameters as inputs. The initial general regression neural network (GRNN) model was trained, validated and tested utilizing 20 inputs. The number of inputs was reduced to 15 using the Monte Carlo simulation technique as the input selection method. The best results were achieved with the GRNN model utilizing 25% less inputs than the initial model and a comparison with a multiple linear regression model trained and tested using the same input variables using multiple statistical performance indicators confirmed the advantage of the GRNN model. Sensitivity analysis has shown that inputs with the greatest effect on the GRNN model were (in descending order) precipitation, rural population with access to improved water sources, treatment capacity of wastewater treatment plants (urban) and treatment of municipal waste, with the last two having an equal effect. Finally, it was concluded that the developed GRNN model can be useful as a tool to support the decision-making process on sustainable development at a regional, national and international level.
Collapse
Affiliation(s)
- Aleksandra Šiljić
- Faculty of Technology and Metallurgy, University of Belgrade, Karnegijeva 4, 11120, Belgrade, Serbia
| | | | | | | | | |
Collapse
|
31
|
Ren H, Li J, Yuan ZA, Hu JY, Yu Y, Lu YH. The development of a combined mathematical model to forecast the incidence of hepatitis E in Shanghai, China. BMC Infect Dis 2013; 13:421. [PMID: 24010871 PMCID: PMC3847129 DOI: 10.1186/1471-2334-13-421] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2013] [Accepted: 09/04/2013] [Indexed: 01/17/2023] Open
Abstract
Background Sporadic hepatitis E has become an important public health concern in China. Accurate forecasting of the incidence of hepatitis E is needed to better plan future medical needs. Few mathematical models can be used because hepatitis E morbidity data has both linear and nonlinear patterns. We developed a combined mathematical model using an autoregressive integrated moving average model (ARIMA) and a back propagation neural network (BPNN) to forecast the incidence of hepatitis E. Methods The morbidity data of hepatitis E in Shanghai from 2000 to 2012 were retrieved from the China Information System for Disease Control and Prevention. The ARIMA-BPNN combined model was trained with 144 months of morbidity data from January 2000 to December 2011, validated with 12 months of data January 2012 to December 2012, and then employed to forecast hepatitis E incidence January 2013 to December 2013 in Shanghai. Residual analysis, Root Mean Square Error (RMSE), normalized Bayesian Information Criterion (BIC), and stationary R square methods were used to compare the goodness-of-fit among ARIMA models. The Bayesian regularization back-propagation algorithm was used to train the network. The mean error rate (MER) was used to assess the validity of the combined model. Results A total of 7,489 hepatitis E cases was reported in Shanghai from 2000 to 2012. Goodness-of-fit (stationary R2=0.531, BIC= −4.768, Ljung-Box Q statistics=15.59, P=0.482) and parameter estimates were used to determine the best-fitting model as ARIMA (0,1,1)×(0,1,1)12. Predicted morbidity values in 2012 from best-fitting ARIMA model and actual morbidity data from 2000 to 2011 were used to further construct the combined model. The MER of the ARIMA model and the ARIMA-BPNN combined model were 0.250 and 0.176, respectively. The forecasted incidence of hepatitis E in 2013 was 0.095 to 0.372 per 100,000 population. There was a seasonal variation with a peak during January-March and a nadir during August-October. Conclusions Time series analysis suggested a seasonal pattern of hepatitis E morbidity in Shanghai, China. An ARIMA-BPNN combined model was used to fit the linear and nonlinear patterns of time series data, and accurately forecast hepatitis E infections.
Collapse
Affiliation(s)
- Hong Ren
- The Key Laboratory of Public Health Safety of Minister of Education - Department of Epidemiology, Fudan University School of Public Health, Building 8 Room 441, 138 Yi Xue Yuan Road, Shanghai 200032, China.
| | | | | | | | | | | |
Collapse
|