1
|
Méndez M, Merayo MG, Núñez M. Machine learning algorithms to forecast air quality: a survey. Artif Intell Rev 2023; 56:1-36. [PMID: 36820441 PMCID: PMC9933038 DOI: 10.1007/s10462-023-10424-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/01/2023] [Indexed: 02/18/2023]
Abstract
Air pollution is a risk factor for many diseases that can lead to death. Therefore, it is important to develop forecasting mechanisms that can be used by the authorities, so that they can anticipate measures when high concentrations of certain pollutants are expected in the near future. Machine Learning models, in particular, Deep Learning models, have been widely used to forecast air quality. In this paper we present a comprehensive review of the main contributions in the field during the period 2011-2021. We have searched the main scientific publications databases and, after a careful selection, we have considered a total of 155 papers. The papers are classified in terms of geographical distribution, predicted values, predictor variables, evaluation metrics and Machine Learning model.
Collapse
Affiliation(s)
- Manuel Méndez
- Design and Testing of Reliable Systems Research Group, Universidad Complutense de Madrid, C/ Profesor José García Santesmases, 9, 28040 Madrid, Madrid Spain
| | - Mercedes G. Merayo
- Design and Testing of Reliable Systems Research Group, Universidad Complutense de Madrid, C/ Profesor José García Santesmases, 9, 28040 Madrid, Madrid Spain
| | - Manuel Núñez
- Design and Testing of Reliable Systems Research Group, Universidad Complutense de Madrid, C/ Profesor José García Santesmases, 9, 28040 Madrid, Madrid Spain
| |
Collapse
|
2
|
Particulate Matter in an Urban–Industrial Environment: Comparing Data of Dispersion Modeling with Tree Leaves Deposition. SUSTAINABILITY 2022. [DOI: 10.3390/su14020793] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Particulate matter represents a serious hazard to human health, and air quality models contribute to the understanding of its dispersion. This study describes particulate matter with a ≤10 μm diameter (PM10) dynamics in an urban–industrial area, through the comparison of three datasets: modeled (TAPM—The Air Pollution Model), measured concentration (environmental control stations—ECS), and leaf deposition values. Results showed a good agreement between ECS and TAPM data. A steel plant area was used as a PM10 emissions reference source, in relation to the four sampling areas, and a distance/wind-based factor was introduced (Steel Factor, SF). Through SF, the three datasets were compared. The SF was able to describe the PM10 dispersion values for ECS and leaf deposition (r2 = 0.61–0.94 for ECS; r2 = 0.45–0.70 for leaf); no relationship was found for TAPM results. Differences between measured and modeled data can be due to discrepancies in one district and explained by a lack of PM10 inventory for the steel plant emissions. The study suggests the use of TAPM as a suitable tool for PM10 modeling at the urban scale. Moreover, tree leaves are a low-cost tool to evaluate the urban environmental quality, by providing information on whether and when data from leaf deposition can be used as a proxy for air pollution concentration. Further studies to include the re-suspension of particles as a PM10 source within emission inventories are suggested.
Collapse
|
3
|
Estimation of Surface NO2 Concentrations over Germany from TROPOMI Satellite Observations Using a Machine Learning Method. REMOTE SENSING 2021. [DOI: 10.3390/rs13050969] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
In this paper, we present the estimation of surface NO2 concentrations over Germany using a machine learning approach. TROPOMI satellite observations of tropospheric NO2 vertical column densities (VCDs) and several meteorological parameters are used to train the neural network model for the prediction of surface NO2 concentrations. The neural network model is validated against ground-based in situ air quality monitoring network measurements and regional chemical transport model (CTM) simulations. Neural network estimation of surface NO2 concentrations show good agreement with in situ monitor data with Pearson correlation coefficient (R) of 0.80. The results also show that the machine learning approach is performing better than regional CTM simulations in predicting surface NO2 concentrations. We also performed a sensitivity analysis for each input parameter of the neural network model. The validated neural network model is then used to estimate surface NO2 concentrations over Germany from 2018 to 2020. Estimated surface NO2 concentrations are used to investigate the spatio-temporal characteristics, such as seasonal and weekly variations of NO2 in Germany. The estimated surface NO2 concentrations provide comprehensive information of NO2 spatial distribution which is very useful for exposure estimation. We estimated the annual average NO2 exposure for 2018, 2019 and 2020 is 15.53, 15.24 and 13.27 µµg/m3, respectively. While the annual average NO2 concentration of 2018, 2019 and 2020 is only 12.79, 12.60 and 11.15 µµg/m3. In addition, we used the surface NO2 data set to investigate the impacts of the coronavirus disease 2019 (COVID-19) pandemic on ambient NO2 levels in Germany. In general, 10–30% lower surface NO2 concentrations are observed in 2020 compared to 2018 and 2019, indicating the significant impacts of a series of restriction measures to reduce the spread of the virus.
Collapse
|
4
|
Dupont MF, Elbourne A, Cozzolino D, Chapman J, Truong VK, Crawford RJ, Latham K. Chemometrics for environmental monitoring: a review. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2020; 12:4597-4620. [PMID: 32966380 DOI: 10.1039/d0ay01389g] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Environmental monitoring is necessary to ensure the overall health and conservation of an ecosystem. However, ecosystems (e.g. air, water, soil), are complex, involving numerous processes (both native and external), inputs, contaminants, and living organisms. As such, monitoring an environmental system is not a trivial task. The data obtained from natural systems is often multifaceted and convoluted, as a multitude of inputs can be intertwined within the matrix of the information obtained as part of a study. This means that trends and important results can be easily overlooked by conventional and single dimensional data analysis protocols. Recently, chemometric methods have emerged as a powerful method for maximizing the details contained within a chemical data set. Specifically, chemometrics refers to the use of mathematical and statistical analysis methods to evaluate chemical data, beyond univariant analysis. This type of analysis can provide a quantitative description of environmental measurements, while also having the capacity to reveal previously overlooked trends in data sets. Applying chemometrics to environmental data allows us to identify and describe the inter-relationship of environmental drivers, sources of contamination, and their potential impact upon the environment. This review aims to provide a detailed understanding of chemometric techniques, how they are currently used in environmental monitoring, and how these techniques can be used to improve current practices. An enhanced ability to monitor environmental conditions and to predict trends would be greatly beneficial to government and research agencies in their ability to develop environmental policies and analytical procedures.
Collapse
|
5
|
Poor Air Quality and Its Association with Mortality in Ho Chi Minh City: Case Study. ATMOSPHERE 2020. [DOI: 10.3390/atmos11070750] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Along with its rapid urban development, Ho Chi Minh City (HCMC) in recent years has suffered a high concentration of air pollutants, especially fine particulate matters or PM2.5. A comprehensive study is required to evaluate the air quality conditions and their health impact in this city. Given the lack of adequate air quality monitoring data over a large area of the size of HCMC, an air quality modeling methodology is adopted to address the requirement. Here, by utilizing a corresponding emission inventory in combination with The Air Pollution Model-Chemical Transport Model (TAPM-CTM), the predicted concentration of air pollutants is first obtained for PM2.5, NOx, and SO2. Then by associating the pollutants exposed with the mortality rate from three causes, namely Ischemic Heart Disease (IHD), cardiopulmonary, and lung cancer, the impact of air pollution on human health is obtained for this purpose. Spatial distribution has shown a high amount of pollutants concentrated in the central city with a high density of combustion vehicles (motorcycles and automobiles). In addition, a significant amount of emissions can be observed from stevedoring and harbor activities, including ferries and cargo handling equipment located along the river. Other sources such as household activities also contribute to an even distribution of emission across the city. The results of air quality modeling showed that the annual average concentrations of NO2 were higher than the standard of Vietnam National Technical Regulation on Ambient Air Quality (QCVN 05: 2013 40 µg/m3) and World Health Organization (WHO) (40 µg/m3). The annual average concentrations of PM2.5 were 23 µg/m3 and were also much higher than the WHO (10 µg/m3) standard by about 2.3 times. In terms of public health impacts, PM2.5 was found to be responsible for about 1136 deaths, while the number of mortalities from exposure to NO2 and SO2 was 172 and 89 deaths, respectively. These figures demand some stringent measures from the authorities to potentially remedy the alarming situation of air pollution in HCM City.
Collapse
|
6
|
Soft Computing Applications in Air Quality Modeling: Past, Present, and Future. SUSTAINABILITY 2020. [DOI: 10.3390/su12104045] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Air quality models simulate the atmospheric environment systems and provide increased domain knowledge and reliable forecasting. They provide early warnings to the population and reduce the number of measuring stations. Due to the complexity and non-linear behavior associated with air quality data, soft computing models became popular in air quality modeling (AQM). This study critically investigates, analyses, and summarizes the existing soft computing modeling approaches. Among the many soft computing techniques in AQM, this article reviews and discusses artificial neural network (ANN), support vector machine (SVM), evolutionary ANN and SVM, the fuzzy logic model, neuro-fuzzy systems, the deep learning model, ensemble, and other hybrid models. Besides, it sheds light on employed input variables, data processing approaches, and targeted objective functions during modeling. It was observed that many advanced, reliable, and self-organized soft computing models like functional network, genetic programming, type-2 fuzzy logic, genetic fuzzy, genetic neuro-fuzzy, and case-based reasoning are rarely explored in AQM. Therefore, the partially explored and unexplored soft computing techniques can be appropriate choices for research in the field of air quality modeling. The discussion in this paper will help to determine the suitability and appropriateness of a particular model for a specific modeling context.
Collapse
|
7
|
|
8
|
Ghaemi Z, Alimohammadi A, Farnaghi M. LaSVM-based big data learning system for dynamic prediction of air pollution in Tehran. ENVIRONMENTAL MONITORING AND ASSESSMENT 2018; 190:300. [PMID: 29679160 PMCID: PMC5910457 DOI: 10.1007/s10661-018-6659-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Accepted: 04/02/2018] [Indexed: 06/08/2023]
Abstract
Due to critical impacts of air pollution, prediction and monitoring of air quality in urban areas are important tasks. However, because of the dynamic nature and high spatio-temporal variability, prediction of the air pollutant concentrations is a complex spatio-temporal problem. Distribution of pollutant concentration is influenced by various factors such as the historical pollution data and weather conditions. Conventional methods such as the support vector machine (SVM) or artificial neural networks (ANN) show some deficiencies when huge amount of streaming data have to be analyzed for urban air pollution prediction. In order to overcome the limitations of the conventional methods and improve the performance of urban air pollution prediction in Tehran, a spatio-temporal system is designed using a LaSVM-based online algorithm. Pollutant concentration and meteorological data along with geographical parameters are continually fed to the developed online forecasting system. Performance of the system is evaluated by comparing the prediction results of the Air Quality Index (AQI) with those of a traditional SVM algorithm. Results show an outstanding increase of speed by the online algorithm while preserving the accuracy of the SVM classifier. Comparison of the hourly predictions for next coming 24 h, with those of the measured pollution data in Tehran pollution monitoring stations shows an overall accuracy of 0.71, root mean square error of 0.54 and coefficient of determination of 0.81. These results are indicators of the practical usefulness of the online algorithm for real-time spatial and temporal prediction of the urban air quality.
Collapse
Affiliation(s)
- Z. Ghaemi
- Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, No. 1346, ValiAsr Street, Mirdamad cross, Tehran, 19967-15433 Iran
| | - A. Alimohammadi
- Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, No. 1346, ValiAsr Street, Mirdamad cross, Tehran, 19967-15433 Iran
| | - M. Farnaghi
- Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, No. 1346, ValiAsr Street, Mirdamad cross, Tehran, 19967-15433 Iran
- GIS Center, Department of Physical Geography and Ecosystem Science, Lund University, Lund, Sweden
| |
Collapse
|
9
|
Blanes-Vidal V, Cantuaria ML, Nadimi ES. A novel approach for exposure assessment in air pollution epidemiological studies using neuro-fuzzy inference systems: Comparison of exposure estimates and exposure-health associations. ENVIRONMENTAL RESEARCH 2017; 154:196-203. [PMID: 28092762 DOI: 10.1016/j.envres.2016.12.028] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Revised: 11/24/2016] [Accepted: 12/26/2016] [Indexed: 06/06/2023]
Abstract
Many epidemiological studies have used proximity to sources as air pollution exposure assessment method. However, proximity measures are not generally good surrogates because of their complex non-linear relationship with exposures. Neuro-fuzzy inference systems (NFIS) can be used to map complex non-linear systems, but its usefulness in exposure assessment has not been extensively explored. We present a novel approach for exposure assessment using NFIS, where the inputs of the model were easily-obtainable proximity measures, and the output was residential exposure to an air pollutant. We applied it to a case-study on NH3 pollution, and compared health effects and exposures estimated from NFIS, with those obtained from emission-dispersion models, and linear and non-linear regression proximity models, using 10-fold cross validation. The agreement between emission-dispersion and NFIS exposures was high (Root-mean-square error (RMSE) =0.275, correlation coefficient (r)=0.91) and resulted in similar health effect estimates. Linear models showed poor performance (RMSE=0.527, r=0.59), while non-linear regression models resulted in heterocedasticity, non-normality and clustered data. NFIS could be a useful tool for estimating individual air pollution exposures in epidemiological studies on large populations, when emission-dispersion data are not available. The tradeoff between simplicity and accuracy needs to be considered.
Collapse
Affiliation(s)
- Victoria Blanes-Vidal
- The Mærsk Mc-Kinney Møller Institute, University of Southern Denmark, Campusvej 55, DK-5230 Odense, Denmark.
| | - Manuella Lech Cantuaria
- The Mærsk Mc-Kinney Møller Institute, University of Southern Denmark, Campusvej 55, DK-5230 Odense, Denmark
| | - Esmaeil S Nadimi
- The Mærsk Mc-Kinney Møller Institute, University of Southern Denmark, Campusvej 55, DK-5230 Odense, Denmark
| |
Collapse
|
10
|
Application Study of Comprehensive Forecasting Model Based on Entropy Weighting Method on Trend of PM2.5 Concentration in Guangzhou, China. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2015; 12:7085-99. [PMID: 26110332 PMCID: PMC4483750 DOI: 10.3390/ijerph120607085] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2015] [Revised: 06/03/2015] [Accepted: 06/17/2015] [Indexed: 11/17/2022]
Abstract
For the issue of haze-fog, PM2.5 is the main influence factor of haze-fog pollution in China. The trend of PM2.5 concentration was analyzed from a qualitative point of view based on mathematical models and simulation in this study. The comprehensive forecasting model (CFM) was developed based on the combination forecasting ideas. Autoregressive Integrated Moving Average Model (ARIMA), Artificial Neural Networks (ANNs) model and Exponential Smoothing Method (ESM) were used to predict the time series data of PM2.5 concentration. The results of the comprehensive forecasting model were obtained by combining the results of three methods based on the weights from the Entropy Weighting Method. The trend of PM2.5 concentration in Guangzhou China was quantitatively forecasted based on the comprehensive forecasting model. The results were compared with those of three single models, and PM2.5 concentration values in the next ten days were predicted. The comprehensive forecasting model balanced the deviation of each single prediction method, and had better applicability. It broadens a new prediction method for the air quality forecasting field.
Collapse
|
11
|
Ha Q, Wahid H, Duc H, Azzi M. Enhanced radial basis function neural networks for ozone level estimation. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.12.048] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
12
|
Carnevale C, Finzi G, Pederzoli A, Turrini E, Volta M, Guariso G, Gianfreda R, Maffeis G, Pisoni E, Thunis P, Markl-Hummel L, Blond N, Clappier A, Dujardin V, Weber C, Perron G. Exploring trade-offs between air pollutants through an Integrated Assessment Model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2014; 481:7-16. [PMID: 24572927 DOI: 10.1016/j.scitotenv.2014.02.016] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2013] [Revised: 02/05/2014] [Accepted: 02/05/2014] [Indexed: 06/03/2023]
Abstract
When designing air pollution reduction policies, regional decision makers face a limited budget to choose the most efficient measures which will have impacts on several pollutants in different ways. RIAT+ is a regional integrated assessment tool that supports the policy maker in this selection of the optimal emission reduction technologies, to improve air quality at minimum costs. In this paper, this tool is formalized and applied to the specific case of a French region (Alsace), to illustrate how focusing on one single pollutant may exacerbate problems related to other pollutants, on top of conflicts related to budget allocation. In our case, results are shown for possible trade-offs between NO2 and O3 control policies. The paper suggests an approach to prioritize policy maker objectives when planning air pollution policies at regional scale.
Collapse
Affiliation(s)
- Claudio Carnevale
- Department of Mechanical and Industrial Engineering, University of Brescia, Via Branze 38, 25123 Brescia, Italy
| | - Giovanna Finzi
- Department of Mechanical and Industrial Engineering, University of Brescia, Via Branze 38, 25123 Brescia, Italy
| | - Anna Pederzoli
- Department of Mechanical and Industrial Engineering, University of Brescia, Via Branze 38, 25123 Brescia, Italy
| | - Enrico Turrini
- Department of Mechanical and Industrial Engineering, University of Brescia, Via Branze 38, 25123 Brescia, Italy
| | - Marialuisa Volta
- Department of Mechanical and Industrial Engineering, University of Brescia, Via Branze 38, 25123 Brescia, Italy
| | - Giorgio Guariso
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Piazza Leonardo Di Vinci 32, 20133 Milan, Italy
| | | | | | - Enrico Pisoni
- European Commission - Joint Research Centre, Institute for Environment and Sustainability, Via E. Fermi 2749, I-21027 Ispra, VA, Italy.
| | - Philippe Thunis
- European Commission - Joint Research Centre, Institute for Environment and Sustainability, Via E. Fermi 2749, I-21027 Ispra, VA, Italy
| | - Lioba Markl-Hummel
- Laboratoire Image Ville Environnement, UMR7362, CNRS, 3, rue de l'Argonne, F-67000 Strasbourg, France; ASPA, 5, rue de Madrid, F-67300 Schiltigheim, France
| | - Nadège Blond
- Laboratoire Image Ville Environnement, UMR7362, CNRS, 3, rue de l'Argonne, F-67000 Strasbourg, France
| | - Alain Clappier
- Laboratoire Image Ville Environnement, UMR7362, University of Strasbourg, 3, rue de l'Argonne, F-67000 Strasbourg, France
| | - Vincent Dujardin
- Laboratoire Image Ville Environnement, UMR7362, CNRS, 3, rue de l'Argonne, F-67000 Strasbourg, France; Laboratoire Image Ville Environnement, UMR7362, University of Strasbourg, 3, rue de l'Argonne, F-67000 Strasbourg, France
| | - Christiane Weber
- Laboratoire Image Ville Environnement, UMR7362, CNRS, 3, rue de l'Argonne, F-67000 Strasbourg, France
| | - Gilles Perron
- ASPA, 5, rue de Madrid, F-67300 Schiltigheim, France
| |
Collapse
|