1
|
Ravindiran G, Karthick K, Rajamanickam S, Datta D, Das B, Shyamala G, Hayder G, Maria A. Ensemble stacking of machine learning models for air quality prediction for Hyderabad city in India. iScience 2025; 28:111894. [PMID: 40051831 PMCID: PMC11883379 DOI: 10.1016/j.isci.2025.111894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2024] [Revised: 11/27/2024] [Accepted: 01/22/2025] [Indexed: 03/09/2025] Open
Abstract
Hyderabad, one of the rapidly developing cities in India, is facing with severe air pollution due to rapid urbanization, industrial operations, and climatic factors. To alleviate the negative impact on human health and the environment, accurate monitoring and forecasting of air quality are essential. This research utilized various machine learning models, such as XGBoost, LarsCV, Bayesian Ridge, AdaBoost, and ensemble stacking methods, to forecast the air quality index (AQI) using data from August 2016 to October 2023, which included 18 different air pollutants, including meteorological parameters. The ensemble stacking method showed excellent performance, attaining high training (R2 = 0.994) and validation (R2 = 0.999) accuracy with low error metrics (mean absolute error [MAE] = 0.496, mean square error [MSE] = 0.429, root-mean-square error [RMSE] = 0.655). These results highlight the efficacy of ensemble stacking for AQI prediction, providing crucial information for policymakers to formulate strategies to reduce air pollution's effects on public health and environmental sustainability.
Collapse
Affiliation(s)
- Gokulan Ravindiran
- Department of Civil Engineering, Dayananda Sagar College of Engineering, Bengaluru, Karnataka 560078, India
- Institute of Energy Infrastructure, Universiti Tenaga Nasional (UNITEN), 43000 Kajang, Selangor Darul Ehsan, Malaysia
| | - K. Karthick
- Department of Electrical and Electronics Engineering, GMR Institute of Technology, Rajam 532 127, Andhra Pradesh, India
| | - Sivarethinamohan Rajamanickam
- Symbiosis Centre for Management Studies, Bengaluru Campus, Symbiosis International (Deemed University), Bengaluru, Karnataka, India
| | - Deepshikha Datta
- Department of Chemistry, Brainware University, Barasat, Kolkata, West Bengal, India
| | - Bimal Das
- Department of Chemical Engineering, National Institute of Technology, Durgapur, Durgapur, West Bengal, India
| | - G. Shyamala
- Department of Civil Engineering, SR University, Warangal 506371, Telangana, India
| | - Gasim Hayder
- Institute of Energy Infrastructure, Universiti Tenaga Nasional (UNITEN), 43000 Kajang, Selangor Darul Ehsan, Malaysia
- Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000 Kajang, Selangor Darul Ehsan, Malaysia
| | - Azees Maria
- School of Computer Science and Engineering, VIT-AP University, Amaravati, Andhra Pradesh 522 237, India
| |
Collapse
|
2
|
Vo HHP, Nguyen TM, Bui KA, Yoo M. Traffic Flow Prediction in 5G-Enabled Intelligent Transportation Systems Using Parameter Optimization and Adaptive Model Selection. SENSORS (BASEL, SWITZERLAND) 2024; 24:6529. [PMID: 39460009 PMCID: PMC11510918 DOI: 10.3390/s24206529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Revised: 10/07/2024] [Accepted: 10/08/2024] [Indexed: 10/28/2024]
Abstract
This study proposes a novel hybrid method, FVMD-WOA-GA, for enhancing traffic flow prediction in 5G-enabled intelligent transportation systems. The method integrates fast variational mode decomposition (FVMD) with optimization techniques, namely, the whale optimization algorithm (WOA) and genetic algorithm (GA), to improve the accuracy of overall traffic flow based on models tailored for each decomposed sub-sequence. The selected predictive models-long short-term memory (LSTM), bidirectional LSTM (BiLSTM), gated recurrent unit (GRU), and bidirectional GRU (BiGRU)-were considered to capture diverse temporal dependencies in traffic data. This research explored a multi-stage approach, where the decomposition, optimization, and selection of models are performed systematically to improve prediction performance. Experimental validation on two real-world traffic datasets further underscores the method's efficacy, achieving root mean squared errors (RMSEs) of 152.43 and 7.91 on the respective datasets, which marks improvements of 3.44% and 12.87% compared to the existing methods. These results highlight the ability of the FVMD-WOA-GA approach to improve prediction accuracy significantly, reduce inference time, enhance system adaptability, and contribute to more efficient traffic management.
Collapse
Affiliation(s)
- Hanh Hong-Phuc Vo
- Department of Electronic Engineering, Soongsil University, Seoul 06978, Republic of Korea; (H.H.-P.V.); (T.M.N.); (K.A.B.)
| | - Thuan Minh Nguyen
- Department of Electronic Engineering, Soongsil University, Seoul 06978, Republic of Korea; (H.H.-P.V.); (T.M.N.); (K.A.B.)
| | - Khoi Anh Bui
- Department of Electronic Engineering, Soongsil University, Seoul 06978, Republic of Korea; (H.H.-P.V.); (T.M.N.); (K.A.B.)
| | - Myungsik Yoo
- School of Electronic Engineering, Soongsil University, Seoul 06978, Republic of Korea
| |
Collapse
|
3
|
Peng T, Xiong J, Sun K, Qian S, Tao Z, Nazir MS, Zhang C. Research and application of a novel selective stacking ensemble model based on error compensation and parameter optimization for AQI prediction. ENVIRONMENTAL RESEARCH 2024; 247:118176. [PMID: 38215922 DOI: 10.1016/j.envres.2024.118176] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 12/11/2023] [Accepted: 01/09/2024] [Indexed: 01/14/2024]
Abstract
With the ongoing process of industrialization, the issue of declining air quality is increasingly becoming a critical concern. Accurate prediction of the Air Quality Index (AQI), considered as an all-inclusive measure representing the extent of pollutants present in the atmosphere, is of paramount importance. This study introduces a novel methodology that combines stacking ensemble and error correction to improve AQI prediction. Additionally, the reptile search algorithm (RSA) is employed for optimizing model parameters. In this study, four distinct regional AQI data containing a collection of 34864 data samples are collected. Initially, we perform cross-validation on ten commonly used single models to obtain prediction results. Then, based on evaluation indices, five models are selected for ensemble. The results of the study show that the model proposed in this paper achieves an improvement of around 10% in terms of accuracy when compared to the conventional model. Thus, the model introduced in this study offers a more scientifically grounded approach in tackling air pollution.
Collapse
Affiliation(s)
- Tian Peng
- Faculty of Automation, Huaiyin Institute of Technology, Huai'an, 223003, China; Jiangsu Permanent Magnet Motor Engineering Research Center, Huaiyin Institute of Technology, Huai'an, 223003, China.
| | - Jinlin Xiong
- Faculty of Automation, Huaiyin Institute of Technology, Huai'an, 223003, China
| | - Kai Sun
- Faculty of Automation, Huaiyin Institute of Technology, Huai'an, 223003, China
| | - Shijie Qian
- Faculty of Automation, Huaiyin Institute of Technology, Huai'an, 223003, China
| | - Zihan Tao
- Faculty of Automation, Huaiyin Institute of Technology, Huai'an, 223003, China
| | | | - Chu Zhang
- Faculty of Automation, Huaiyin Institute of Technology, Huai'an, 223003, China; Jiangsu Permanent Magnet Motor Engineering Research Center, Huaiyin Institute of Technology, Huai'an, 223003, China.
| |
Collapse
|
4
|
Natarajan SK, Shanmurthy P, Arockiam D, Balusamy B, Selvarajan S. Optimized machine learning model for air quality index prediction in major cities in India. Sci Rep 2024; 14:6795. [PMID: 38514669 PMCID: PMC10958024 DOI: 10.1038/s41598-024-54807-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 02/16/2024] [Indexed: 03/23/2024] Open
Abstract
Industrial advancements and utilization of large amount of fossil fuels, vehicle pollution, and other calamities increases the Air Quality Index (AQI) of major cities in a drastic manner. Major cities AQI analysis is essential so that the government can take proper preventive, proactive measures to reduce air pollution. This research incorporates artificial intelligence in AQI prediction based on air pollution data. An optimized machine learning model which combines Grey Wolf Optimization (GWO) with the Decision Tree (DT) algorithm for accurate prediction of AQI in major cities of India. Air quality data available in the Kaggle repository is used for experimentation, and major cities like Delhi, Hyderabad, Kolkata, Bangalore, Visakhapatnam, and Chennai are considered for analysis. The proposed model performance is experimentally verified through metrics like R-Square, RMSE, MSE, MAE, and accuracy. Existing machine learning models, like k-nearest Neighbor, Random Forest regressor, and Support vector regressor, are compared with the proposed model. The proposed model attains better prediction performance compared to traditional machine learning algorithms with maximum accuracy of 88.98% for New Delhi city, 91.49% for Bangalore city, 94.48% for Kolkata, 97.66% for Hyderabad, 95.22% for Chennai and 97.68% for Visakhapatnam city.
Collapse
Affiliation(s)
- Suresh Kumar Natarajan
- School of Computer Science and Engineering, Jain (Deemed-to-be University), Bengaluru, Karnataka, India
| | - Prakash Shanmurthy
- School of Computer Science and Engineering and Information Science, Presidency University, Bengaluru, Karnataka, India
| | | | | | - Shitharth Selvarajan
- School of Built Environment, Engineering and Computing, Leeds Beckett University, Leeds, LS1 3HE, UK.
| |
Collapse
|
5
|
Pari P, Abbasi T, Abbasi SA. AI-based prediction of the improvement in air quality induced by emergency measures. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 351:119716. [PMID: 38064985 DOI: 10.1016/j.jenvman.2023.119716] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 11/16/2023] [Accepted: 11/23/2023] [Indexed: 01/14/2024]
Abstract
Several cities in the developing world, of which the capital city of India, New Delhi, is an example, often experience air quality in which pollutant levels go way above the levels considered hazardous for human health. To bring down the air quality to within permissible limits quickly, the measures typically taken involve shutting down certain high-polluting activities for some time to enable the air quality to recover temporarily. This paper presents a first-ever model based on artificial neural networks to forecast the extent of reduction in air quality parameters that can be achieved and the time period within which a change can be experienced when the source of the emissions is cut off temporarily. The model is based on the extensive data on the extent of reduction in air quality parameters that occurred during the lockdown that was imposed during the COVID-19 pandemic. The non-linear autoregressive exogenous network-based model chosen for the purpose employs the hour since stopping of emissions, relative humidity, wind speed, wind direction, and ambient temperature as input parameters to predict the rate of change of PM2.5 with respect to the concentration at the start of the stopping of the emissions. Air quality data from a key monitoring station in New Delhi was used to develop the model. The model predicted the rate of drop in PM2.5 with an R and MSE of 0.0044 and 0.9736, respectively, while training and 0.0095 and 0.9583 while testing. The model was then tested with data from 19 other stations in New Delhi, and accuracy of the model was found to be exceptionally accurate, with the correlation between the measured and the predicted PM2.5 levels ranging from 0.74 to 0.94 and the MSE ranging from 0.0110 to 1.0746. Thus, the model can be employed to determine the number of hours of temporary stoppage of emissions required for the PM2.5 concentration to reach safe levels. The methodology of development of the model can be extrapolated to construct models tailored for use in other parts of the world as well.
Collapse
Affiliation(s)
- Pavithra Pari
- Centre for Pollution Control and Environmental Engineering, Pondicherry University, Pondicherry, 605014, India
| | - Tasneem Abbasi
- Centre for Pollution Control and Environmental Engineering, Pondicherry University, Pondicherry, 605014, India.
| | - S A Abbasi
- Centre for Pollution Control and Environmental Engineering, Pondicherry University, Pondicherry, 605014, India
| |
Collapse
|