1
|
Lyu Y, Xu H, Wu H, Han F, Lv F, Kang A, Pang X. Spatiotemporal variations of PM 2.5 and ozone in urban agglomerations of China and meteorological drivers for ozone using explainable machine learning. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2025; 365:125380. [PMID: 39581363 DOI: 10.1016/j.envpol.2024.125380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2024] [Revised: 11/02/2024] [Accepted: 11/21/2024] [Indexed: 11/26/2024]
Abstract
Ozone pollution was widely reported along with PM2.5 reduction since 2013 in China. However, the meteorological drivers for ozone varying with different regions of China remains unknown using explainable machine learning, especially during the COVID-19 pandemic. Here we first analyzed spatiotemporal variations of PM2.5 and ozone from 2015 to 2022 in eleven urban agglomerations of China. PM2.5 decreased in all regions, with the largest drop in Beijing-Tianjin-Hebei (BTH). In contrast, ozone declined initially but rose during the pandemic in most regions, especially in Cheng-Yu. Probability density curves showed pronounced increase (24.7%) and slight change in the proportion of PM2.5 and ozone meeting the pollution criterions during the pandemic, respectively. Leveraging Random Forest with SHAP analysis, we further established ozone models in typical urban agglomerations with good performance (CV-R2 = 0.80-0.90; CV-RMSE = 8.52-19.20 μg/m3) during the pandemic, and compared their relative importance of meteorological variables. Particularly, temperature and incoming shortwave flux at top of atmosphere were identified with high importance in high-ozone regions such as Middle Plain and BTH. Increasing importance of PM (e.g., PM10) was found in southern China, e.g., Yangtze River Delta and Pearl River Delta regions. The western China was characterized with more importance of meteorology, especially in Tibet. Surface albedo and sensible heat flux from turbulence were noted distinctively with high importance in Tibet, partly due to their impacts on ozone formation by generating heat source and sink. In addition, sea level pressure (SLP) was revealed with the highest importance (25.2%) in Cheng-Yu, consistent with the fact that synoptic patterns characterized by SLP field could affect ozone pollution in Sichuan Basin. Our results not only provide an understanding of meteorological factors in regional ozone formation in China, but also highlight the feasibility of explainable machine learning in ozone studies.
Collapse
Affiliation(s)
- Yan Lyu
- College of Environment, Zhejiang University of Technology, Hangzhou, 310014, China; School of Environment and Spatial Informatics, China University of Mining and Technology, Xuzhou, 221116, China; Shaoxing Research Institute, Zhejiang University of Technology, Shaoxing, 312077, China.
| | - Haonan Xu
- College of Environment, Zhejiang University of Technology, Hangzhou, 310014, China
| | - Haonan Wu
- College of Environment, Zhejiang University of Technology, Hangzhou, 310014, China
| | - Fuliang Han
- School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, 610000, China
| | - Fengmao Lv
- School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, 610000, China
| | - Azhen Kang
- School of Civil Engineering, Southwest Jiaotong University, Chengdu, 610000, China
| | - Xiaobing Pang
- College of Environment, Zhejiang University of Technology, Hangzhou, 310014, China
| |
Collapse
|
2
|
Li Y, Huang T, Lee HF, Heo Y, Ho KF, Yim SHL. Integrating Doppler LiDAR and machine learning into land-use regression model for assessing contribution of vertical atmospheric processes to urban PM 2.5 pollution. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 952:175632. [PMID: 39168320 DOI: 10.1016/j.scitotenv.2024.175632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 08/06/2024] [Accepted: 08/17/2024] [Indexed: 08/23/2024]
Abstract
Air pollution has been recognized as a global issue, through adverse effects on environment and health. While vertical atmospheric processes substantially affect urban air pollution, traditional epidemiological research using Land-use regression (LUR) modeling usually focused on ground-level attributes without considering upper-level atmospheric conditions. This study aimed to integrate Doppler LiDAR and machine learning techniques into LUR models (LURF-LiDAR) to comprehensively evaluate urban air pollution in Hong Kong, and to assess complex interactions between vertical atmospheric processes and urban air pollution from long-term (i.e., annual) and short-term (i.e., two air pollution episodes) views in 2021. The results demonstrated significant improvements in model performance, achieving CV R2 values of 0.81 (95 % CI: 0.75-0.86) for the long-term PM2.5 prediction model and 0.90 (95 % CI: 0.87-0.91) for the short-term models. Approximately 69 % of ground-level air pollution arose from the mixing of ground- and lower-level (105 m-225 m) particles, while 21 % was associated with upper-level (825 m-945 m) atmospheric processes. The identified transboundary air pollution (TAP) layer was located at ~900 m above the ground. The identified Episode one (E1: 7 Jan-22 Jan) was induced by the accumulation of local emissions under stable atmospheric conditions, whereas Episode two (E2: 13 Dec-24 Dec) was regulated by TAP under instable and turbulent conditions. Our improved air quality prediction model is accurate and comprehensive with high interpretability for supporting urban planning and air quality policies.
Collapse
Affiliation(s)
- Yue Li
- Department of Geography and Resource Management, The Chinese University of Hong Kong, Sha Tin, N.T., Hong Kong 999077, China
| | - Tao Huang
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore 639798, Singapore; Earth Observatory of Singapore, Nanyang Technological University, Singapore 639798, Singapore
| | - Harry Fung Lee
- Department of Geography and Resource Management, The Chinese University of Hong Kong, Sha Tin, N.T., Hong Kong 999077, China
| | - Yeonsook Heo
- School of Civil, Environmental and Architectural Engineering, Korea University, 145 Anam-ro, Seongbuk-gu, Seoul 02841, Republic of Korea
| | - Kin-Fai Ho
- The Jockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Sha Tin, N.T., Hong Kong 999077, China
| | - Steve H L Yim
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore 639798, Singapore; Earth Observatory of Singapore, Nanyang Technological University, Singapore 639798, Singapore; Asian School of the Environment, Nanyang Technological University, Singapore 639798, Singapore.
| |
Collapse
|
3
|
Araki S, Shimadera H, Chatani S, Kitayama K, Shima M. Long-term spatiotemporal variation of benzo[a]pyrene in Japan: Significant decrease in ambient concentrations, human exposure, and health risk. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2024; 360:124650. [PMID: 39111529 DOI: 10.1016/j.envpol.2024.124650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 07/24/2024] [Accepted: 07/30/2024] [Indexed: 08/15/2024]
Abstract
Although Benzo[a]pyrene (BaP) is considered carcinogenic to humans, the health effects of exposure to ambient levels have not been sufficiently investigated. This study estimated the long-term spatiotemporal variation of BaP in Japan over nearly two decades at a fine spatial resolution of 1 km. This study aimed to obtain an accurate spatiotemporal distribution of BaP that can be used in epidemiological studies on the health effects of ambient BaP exposure. The annual BaP concentrations were estimated using an ensemble machine learning approach using various predictors, including the concentrations and emission intensities of the criteria air pollutants, and meteorological, land use, and traffic-related variables. The model performance, evaluated by location-based cross-validation, exhibited satisfactory accuracy (R2 of 0.693). Densely populated areas showed higher BaP levels and greater temporal reduction, whereas BaP levels remained higher in some industrial areas. The population-weighted BaP in 2018 was 0.12 ng m-3, a decrease of approximately 70% from its 2000 value of 0.44 ng m-3, which was also reflected in the estimated excess number of lung cancer incidences. Accordingly, the proportion of BaP exposure below 0.12 ng m-3, which is the BaP concentration associated with an excess lifetime cancer risk of 10-5, reached 67% in 2018. Our estimates can be used in epidemiological studies to assess the health effects of BaP exposure at ambient concentrations.
Collapse
Affiliation(s)
- Shin Araki
- Graduate School of Engineering, Osaka University, Suita, 565-0871, Japan.
| | - Hikari Shimadera
- Graduate School of Engineering, Osaka University, Suita, 565-0871, Japan.
| | - Satoru Chatani
- National Institute for Environmental Studies, Tsukuba, 305-8506, Japan.
| | - Kyo Kitayama
- National Institute for Environmental Studies, Tsukuba, 305-8506, Japan.
| | - Masayuki Shima
- Department of Public Health, School of Medicine, Hyogo Medical University, Nishinomiya, 663-8501, Japan.
| |
Collapse
|
4
|
Predicting Daily PM2.5 Exposure with Spatially Invariant Accuracy Using Co-Existing Pollutant Concentrations as Predictors. ATMOSPHERE 2022. [DOI: 10.3390/atmos13050782] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The spatiotemporal variation of PM2.5 should be accurately estimated for epidemiological studies. However, the accuracy of prediction models may change over geographical space, which is not conducive for proper exposure assessment. In this study, we developed a prediction model to estimate daily PM2.5 concentrations from 2010 to 2017 in the Kansai region of Japan with co-existing pollutant concentrations as predictors. The overall objective was to obtain daily estimates over the study domain with spatially homogeneous accuracy. We used random forest algorithm to model the relationship between the daily PM2.5 concentrations and various predictors. The model performance was evaluated via spatial and temporal cross-validation and the daily PM2.5 surface was estimated from 2010 to 2017 at a 1 km × 1 km resolution. We achieved R2 values of 0.91 and 0.92 for spatial and temporal cross-validation, respectively. The prediction accuracy for each monitoring site was found to be consistently high, regardless of the distance to the nearest monitoring location, up to 10 km. Even for distances greater than 10 km, the mean R2 value was 0.88. Our approach yielded spatially homogeneous prediction accuracy, which is beneficial for epidemiological studies. The daily PM2.5 estimates will be used in a related birth cohort study to evaluate the potential impact on human health.
Collapse
|
5
|
A National-Scale 1-km Resolution PM2.5 Estimation Model over Japan Using MAIAC AOD and a Two-Stage Random Forest Model. REMOTE SENSING 2021. [DOI: 10.3390/rs13183657] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Satellite-based models for estimating concentrations of particulate matter with an aerodynamic diameter less than 2.5 μm (PM2.5) have seldom been developed in islands with complex topography over the monsoon area, where the transport of PM2.5 is influenced by both the synoptic-scale winds and local-scale circulations compared with the continental regions. We validated Multi-Angle Implementation of Atmospheric Correction (MAIAC) aerosol optical depth (AOD) with ground observations in Japan and developed a 1-km-resolution national-scale model between 2011 and 2016 to estimate daily PM2.5 concentrations. A two-stage random forest model integrating MAIAC AOD with meteorological variables and land use data was applied to develop the model. The first-stage random forest model was used to impute the missing AOD values. The second-stage random forest model was then utilised to estimate ground PM2.5 concentrations. Ten-fold cross-validation was performed to evaluate the model performance. There was good consistency between MAIAC AOD and ground truth in Japan (correlation coefficient = 0.82 and 74.62% of data falling within the expected error). For model training, the model showed a training coefficient of determination (R2) of 0.98 and a root mean square error (RMSE) of 1.22 μg/m3. For the 10-fold cross-validation, the cross-validation R2 and RMSE of the model were 0.86 and 3.02 μg/m3, respectively. A subsite validation was used to validate the model at the grids overlapping with the AERONET sites, and the model performance was excellent at these sites with a validation R2 (RMSE) of 0.94 (1.78 μg/m3). Additionally, the model performance increased as increased AOD coverage. The top-ten important predictors for estimating ground PM2.5 concentrations were day of the year, temperature, AOD, relative humidity, 10-m-height zonal wind, 10-m-height meridional wind, boundary layer height, precipitation, surface pressure, and population density. MAIAC AOD showed high retrieval accuracy in Japan. The performance of the satellite-based model was excellent, which showed that PM2.5 estimates derived from the model were reliable and accurate. These estimates can be used to assess both the short-term and long-term effects of PM2.5 on health outcomes in epidemiological studies.
Collapse
|