1
|
Cheng Y, Cheng R, Xu T, Tan X, Bai Y, Yang J. Integrating meteorological data and hybrid intelligent models for dengue fever prediction. BMC Public Health 2025; 25:1516. [PMID: 40269831 PMCID: PMC12020127 DOI: 10.1186/s12889-025-22375-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2024] [Accepted: 03/18/2025] [Indexed: 04/25/2025] Open
Abstract
BACKGROUND Dengue fever is a globally prevalent arbovirus disease that poses a serious challenge to global health. Therefore, analyzing the relationship between dengue fever incidence and meteorological factors and developing a more effective prediction model based on this relationship can provide a theoretical basis for public health departments to formulate reasonable prevention strategies. METHODS We collected dengue fever cases and meteorological data, including temperature, humidity, sunshine duration, etc., from Guangdong and Zhejiang Provinces in China from 2005-2024. A distributed lag nonlinear model (DLNM) was used to analyze the exposure-response relationship between meteorological factors and dengue incidence. Moreover, the raw case data were classified into dengue warning levels using a fuzzy clustering algorithm. The improved horned lizard optimization algorithm (IHLOA) was then combined with support vector machine (SVM), random forest (RF) and k-nearest neighbor (KNN) for dengue prediction. The average accuracy (Avg acc ), average fitness value (Avg fit ), average feature reduction rate (Avg feature ), standard deviation (STD) andF 1 _ s c o r e micro were used to evaluate prediction performance. RESULTS The incidence risk of dengue fever was positively correlated with temperature, relative humidity, sunshine duration and the vegetation index but negatively correlated with visibility, wind speed and sea level pressure. Meteorological factors had a lag effect on the risk of dengue fever, and the magnitude of the effect varies dynamically with lag time. Compared with the other prediction models, our proposed hybrid prediction models exhibited relatively lowAvg feature values and relatively high Avg acc values, indicating the best prediction results. CONCLUSION Our experiment revealed the correlation and lag effect between meteorological factors and the incidence of dengue fever, indicating that meteorological factors have important value in predicting dengue fever. In addition, the hybrid prediction models constructed in this article can accurately predict outbreaks of dengue fever, which can lay a theoretical foundation for the construction of monitoring and early warning systems and improve the ability of relevant government departments to detect and identify dengue fever outbreaks in a timely manner.
Collapse
Affiliation(s)
- Yunyun Cheng
- School of Information and Communication Engineering, North University of China, Taiyuan, 030051, China
| | - Rong Cheng
- School of Mathematics, North University of China, Taiyuan, 030051, China
| | - Ting Xu
- School of Mathematics, North University of China, Taiyuan, 030051, China
| | - Xiuhui Tan
- School of Mathematics, North University of China, Taiyuan, 030051, China
| | - Yanping Bai
- School of Mathematics, North University of China, Taiyuan, 030051, China.
| | - Jing Yang
- Department of Science, Taiyuan Institute of Technology, Taiyuan, 030008, China
| |
Collapse
|
2
|
Chen X, Moraga P. Assessing dengue forecasting methods: a comparative study of statistical models and machine learning techniques in Rio de Janeiro, Brazil. Trop Med Health 2025; 53:52. [PMID: 40211309 PMCID: PMC11984044 DOI: 10.1186/s41182-025-00723-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2024] [Accepted: 03/04/2025] [Indexed: 04/14/2025] Open
Abstract
BACKGROUND Dengue is a mosquito-borne viral disease that poses a significant public health threat in tropical and subtropical regions worldwide. Accurate forecasting of dengue outbreaks is crucial for effective public health planning and intervention. This study aims to assess the predictive performance and computational efficiency of a number of statistical models and machine learning techniques for dengue forecasting, both with and without the inclusion of climate factors, to inform the design of dengue surveillance systems. METHODS The dengue forecasting methods comparison in this study considers dengue cases in Rio de Janeiro, Brazil, as well as climate factors known to affect disease transmission. Employing a dynamic window approach, various statistical methods and machine learning techniques were used to generate weekly forecasts at several time horizons. Error measures, uncertainty intervals, and computational efficiency obtained with each method were compared. Statistical models considered were Autoregressive (AR), Moving Average (MA), Autoregressive Integrated Moving Average (ARIMA), and Exponential Smoothing State Space Model (ETS). In addition, models incorporating temperature and humidity as covariates, such as Vector Autoregression (VAR) and Seasonal ARIMAX (SARIMAX), were employed. Machine learning techniques evaluated were Random Forest, XGBoost, Support Vector Machine (SVM), Long-Short-Term Memory (LSTM) networks, and Prophet. Ensemble approaches that integrated the top performing methods were also considered. The evaluated methods also incorporated lagged climatic variables to account for delayed effects. RESULTS Among the statistical models, ARIMA demonstrated the best performance using only historical case data, while SARIMAX significantly improved predictive accuracy by incorporating climate covariates. In general, the LSTM model, particularly when combined with climate covariates, proved to be the most accurate machine learning model, despite being slower to train and predict. For long-term forecasts, Prophet with climate covariates was the most effective. Ensemble models, such as the combination of LSTM and ARIMA, showed substantial improvements over individual models. CONCLUSIONS This study demonstrates the strengths and limitations of various methods for dengue forecasting across multiple timeframes. It highlights the best-performing statistical and machine learning methods, including their computational efficiency, underscoring the significance of machine learning techniques and the integration of climate covariates to improve forecasts. These findings offer valuable insights for public health officials, facilitating the development of dengue surveillance systems for more accurate forecasting and timely allocation of resources to mitigate dengue outbreaks.
Collapse
Affiliation(s)
- Xiang Chen
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), 23955-6900, Thuwal, Saudi Arabia.
| | - Paula Moraga
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), 23955-6900, Thuwal, Saudi Arabia
| |
Collapse
|
3
|
Chen X, Moraga P. Forecasting dengue across Brazil with LSTM neural networks and SHAP-driven lagged climate and spatial effects. BMC Public Health 2025; 25:973. [PMID: 40075398 PMCID: PMC11900637 DOI: 10.1186/s12889-025-22106-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2025] [Accepted: 02/26/2025] [Indexed: 03/14/2025] Open
Abstract
BACKGROUND Dengue fever is a mosquito-borne viral disease that poses significant health risks and socioeconomic challenges in Brazil, necessitating accurate forecasting across its 27 federal states. With the country's diverse climate and geographical spread, effective dengue prediction requires models that can account for both climate variations and spatial dynamics. This study addresses these needs by using Long Short-Term Memory (LSTM) neural networks enhanced with SHapley Additive exPlanations (SHAP) integrating optimal lagged climate variables and spatial influence from neighboring states. METHOD An LSTM-based model was developed to forecast dengue cases across Brazil's 27 federal states, incorporating a comprehensive set of climate and spatial variables. SHAP was used to identify and select the most important lagged climate predictors. Additionally, lagged dengue cases from neighboring states were included to capture spatial dependencies. Model performance was evaluated using MAE, MAPE, and CRPS, with comparisons to baseline models. RESULTS The LSTM-Climate-Spatial model consistently demonstrated superior performance, effectively integrating temporal, climatic, and spatial information to capture the complex dynamics of dengue transmission. SHAP-enhanced variable selection improved accuracy by focusing on key drivers such as temperature, precipitation and humidity. The inclusion of spatial effects further strengthened forecasts in highly connected states showcasing the model's adaptability and robustness. CONCLUSION This study presents a scalable and robust framework for dengue forecasting across Brazil, effectively integrating temporal, climatic, and spatial information into an LSTM-based model. The model's successful application across Brazil's diverse regions demonstrates its generalizability to other dengue-endemic areas with varying climatic and epidemiological conditions. By integrating diverse data sources, the framework captures key transmission drivers, demonstrating the potential of LSTM neural networks for robust predictions. These findings provide valuable insights to enhance public health strategies and outbreak preparedness in Brazil.
Collapse
Affiliation(s)
- Xiang Chen
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
| | - Paula Moraga
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| |
Collapse
|
4
|
Lonlab P, Anupong S, Jainonthee C, Chadsuthi S. Seasonal and Meteorological Drivers of Hand, Foot, and Mouth Disease Outbreaks Using Data-Driven Machine Learning Models. Trop Med Infect Dis 2025; 10:48. [PMID: 39998051 PMCID: PMC11860531 DOI: 10.3390/tropicalmed10020048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2025] [Revised: 02/01/2025] [Accepted: 02/04/2025] [Indexed: 02/26/2025] Open
Abstract
Hand, Foot, and Mouth Disease (HFMD) predominantly affects children under the age of five and remains a significant public health concern in the Asia-Pacific region. HFMD outbreaks are closely linked to seasonal changes and meteorological factors, particularly in tropical and subtropical areas. In Thailand, a total of 657,570 HFMD cases were reported between 2011 and 2022 (12 years). This study aimed to identify the high- and low-risk HFMD outbreak areas using machine learning models: Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Random Forests (RF), Gradient Boosting Machine (GBM), and Extreme Gradient Boosting (XGBoost). Our findings showed that the XGBoost model outperformed the other models in predicting unseen data and defining the best model. The best model can be used to detect high-risk outbreak areas and to explore the relationship between meteorological factors and HFMD outbreaks. The results highlighted the seasonal distribution of high-risk HFMD outbreak months across different provinces in Thailand, with average maximum temperature, average rainfall, and average vapor pressure identified as the most influential factors. Furthermore, the best model was used to analyze HFMD outbreaks during the COVID-19 pandemic, showing a notable reduction in high-risk outbreak months and areas, likely due to the control measures implemented during this period. Overall, our model shows great potential as a tool for warnings, providing useful insights to help public health officials reduce the impact of HFMD outbreaks.
Collapse
Affiliation(s)
- Pakorn Lonlab
- Department of Physics, Faculty of Science, Naresuan University, Phitsanulok 65000, Thailand;
| | - Suparinthon Anupong
- Department of Chemistry, Mahidol Wittayanusorn School (MWIT), Salaya, Nakhon Pathom 73170, Thailand;
| | - Chalita Jainonthee
- Research Center for Veterinary Biosciences and Veterinary Public Health, Faculty of Veterinary Medicine, Chiang Mai University, Chiang Mai 50100, Thailand;
| | - Sudarat Chadsuthi
- Department of Physics, Faculty of Science, Naresuan University, Phitsanulok 65000, Thailand;
| |
Collapse
|
5
|
Bilal H, Khan MN, Khan S, Shafiq M, Fang W, Khan RU, Rahman MU, Li X, Lv QL, Xu B. The role of artificial intelligence and machine learning in predicting and combating antimicrobial resistance. Comput Struct Biotechnol J 2025; 27:423-439. [PMID: 39906157 PMCID: PMC11791014 DOI: 10.1016/j.csbj.2025.01.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2024] [Revised: 01/06/2025] [Accepted: 01/13/2025] [Indexed: 02/06/2025] Open
Abstract
Antimicrobial resistance (AMR) is a major threat to global public health. The current review synthesizes to address the possible role of Artificial Intelligence and Machine Learning (AI/ML) in mitigating AMR. Supervised learning, unsupervised learning, deep learning, reinforcement learning, and natural language processing are some of the main tools used in this domain. AI/ML models can use various data sources, such as clinical information, genomic sequences, microbiome insights, and epidemiological data for predicting AMR outbreaks. Although AI/ML are relatively new fields, numerous case studies offer substantial evidence of their successful application in predicting AMR outbreaks with greater accuracy. These models can provide insights into the discovery of novel antimicrobials, the repurposing of existing drugs, and combination therapy through the analysis of their molecular structures. In addition, AI-based clinical decision support systems in real-time guide healthcare professionals to improve prescribing of antibiotics. The review also outlines how can AI improve AMR surveillance, analyze resistance trends, and enable early outbreak identification. Challenges, such as ethical considerations, data privacy, and model biases exist, however, the continuous development of novel methodologies enables AI/ML to play a significant role in combating AMR.
Collapse
Affiliation(s)
- Hazrat Bilal
- Jiangxi Key Laboratory of oncology (2024SSY06041), JXHC Key Laboratory of Tumour Metastasis, NHC Key Laboratory of Personalized Diagnosis and Treatment of Nasopharyngeal Carcinoma, Jiangxi Cancer Hospital & Institute, The Second Affiliated Hospital of Nanchang Medical College, Nanchang, Jiangxi 330029, PR China
| | - Muhammad Nadeem Khan
- Department of Cell Biology and Genetics, Shantou University Medical College, Shantou 515041, China
| | - Sabir Khan
- Department of Dermatology, The Second Affiliated Hospital of Shantou University Medical College, Shantou 515041, China
| | - Muhammad Shafiq
- Research Institute of Clinical Pharmacy, Department of Pharmacology, Shantou University Medical College, Shantou 515041, China
| | - Wenjie Fang
- Department of Dermatology, Changzheng Hospital, Second Military Medical University, Shanghai 200003, China
| | - Rahat Ullah Khan
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 101408, China
- CAS Key Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Center for Influenza Research and Early-warning (CASCIRE), CAS-TWAS Center of Excellence for Emerging Infectious Diseases (CEEID), Chinese Academy of Sciences, Beijing 100101, China
| | - Mujeeb Ur Rahman
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, Zhenjiang 212013, China
| | - Xiaohui Li
- Jiangxi Key Laboratory of oncology (2024SSY06041), JXHC Key Laboratory of Tumour Metastasis, NHC Key Laboratory of Personalized Diagnosis and Treatment of Nasopharyngeal Carcinoma, Jiangxi Cancer Hospital & Institute, The Second Affiliated Hospital of Nanchang Medical College, Nanchang, Jiangxi 330029, PR China
| | - Qiao-Li Lv
- Jiangxi Key Laboratory of oncology (2024SSY06041), JXHC Key Laboratory of Tumour Metastasis, NHC Key Laboratory of Personalized Diagnosis and Treatment of Nasopharyngeal Carcinoma, Jiangxi Cancer Hospital & Institute, The Second Affiliated Hospital of Nanchang Medical College, Nanchang, Jiangxi 330029, PR China
| | - Bin Xu
- Jiangxi Key Laboratory of oncology (2024SSY06041), JXHC Key Laboratory of Tumour Metastasis, NHC Key Laboratory of Personalized Diagnosis and Treatment of Nasopharyngeal Carcinoma, Jiangxi Cancer Hospital & Institute, The Second Affiliated Hospital of Nanchang Medical College, Nanchang, Jiangxi 330029, PR China
| |
Collapse
|
6
|
Al Mobin M. Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach. Sci Rep 2024; 14:32073. [PMID: 39738719 PMCID: PMC11685631 DOI: 10.1038/s41598-024-83770-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2024] [Accepted: 12/17/2024] [Indexed: 01/02/2025] Open
Abstract
Dengue, a mosquito-borne viral disease, continues to pose severe risks to public health and economic stability in tropical and subtropical regions, particularly in developing nations like Bangladesh. The necessity for advanced forecasting mechanisms has never been more critical to enhance the effectiveness of vector control strategies and resource allocations. This study formulates a dynamic data pipeline to forecast dengue incidence based on 13 meteorological variables using a suite of state-of-the-art machine learning models and custom features engineering, achieving an accuracy of 84.02%, marking a substantial improvement over existing studies. A novel wrapper feature selection algorithm employing a custom objective function is proposed in this study, which significantly improves model accuracy by 12.63% and reduces the mean absolute percentage error by 70.82%. The custom objective function's output can be transformed to quantify the contribution of each variable to the target variable's variability, providing deeper insights into the workings of black box models. The study concludes that relative humidity is redundant in predicting dengue infection, while meteorological factors exhibit more significant short-term impacts compared to long-term and immediate impacts. Sunshine (hours) emerges as the meteorological factor with the most immediate impact, whereas precipitation is the most impactful predictor over both short-term (8-month lag) and long-term (26-30-month lag) periods. Forecasts for 2024 using the best-performing model predict a rise in dengue cases starting in May, peaking at 24,000 cases per month by August and persisting at high levels through October before declining to half by year-end. These findings offer critical insights into temporal climate effects on dengue transmission, aiding the development of effective forecasting systems.
Collapse
Affiliation(s)
- Mahadee Al Mobin
- Bangladesh Institute of Governance and Management, Dhaka, 1207, Bangladesh.
| |
Collapse
|
7
|
Manocha A, Bhatia M, Kumar G. Smart monitoring solution for dengue infection control: A digital twin-inspired approach. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 257:108459. [PMID: 39426139 DOI: 10.1016/j.cmpb.2024.108459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Revised: 07/07/2024] [Accepted: 10/08/2024] [Indexed: 10/21/2024]
Abstract
BACKGROUND AND OBJECTIVE In the realm of smart healthcare, precise monitoring and prediction services are crucial for mitigating the impact of infectious diseases. This study introduces an innovative digital twin technology-inspired monitoring architecture that employs a similarity-based hybrid modeling scheme to significantly enhance accuracy in the smart healthcare domain. The research also delves into the potential of IoT technology in delivering advanced technological healthcare solutions, with a specific focus on the rapid expansion of dengue fever. METHODS The proposed digital twin-inspired healthcare system is designed to proactively combat the spread of dengue virus by enabling ubiquitous monitoring and forecasting of individuals' susceptibility to dengue infection. The system utilizes digital twin technology to observe the status of healthcare and generate likely predictions about the vulnerability to the virus by employing k-means Clustering and Artificial Neural Networks. RESULTS The proposed system has been validated and its effectiveness has been demonstrated through experimental evaluation using carefully defined methods. The results of the experimental assessment confirm that the system performs optimally in terms of Temporal Delay (14.15 s), Classification Accuracy (92.86%), Sensitivity (92.43%), Specificity (91.52%), F-measure (90.86%), and Prediction Effectiveness. Moreover, by integrating a hybrid model that corrects errors in physics-based predictions employing a model for error correction driven by data, this approach has demonstrated a noteworthy 48% reduction in prediction errors, particularly in health monitoring scenarios. CONCLUSIONS The digital twin-inspired healthcare system proposed in this study can assist healthcare providers in assessing the health vulnerability of the dengue virus, thereby reducing the likelihood of long-term or catastrophic health consequences. The integration of a hybrid modeling approach and the utilization of IoT technology has shown promising results in enhancing the accuracy and effectiveness of smart health monitoring and prediction services.
Collapse
Affiliation(s)
- Ankush Manocha
- National Institute of Technology, Kurukshetra, 136119, Haryana, India; Lovely Professional University, Jalandhar, 144001, Punjab, India.
| | - Munish Bhatia
- National Institute of Technology, Kurukshetra, 136119, Haryana, India.
| | - Gulshan Kumar
- Lovely Professional University, Jalandhar, 144001, Punjab, India.
| |
Collapse
|
8
|
Tuan DA, Dang TN. Leveraging Climate Data for Dengue Forecasting in Ba Ria Vung Tau Province, Vietnam: An Advanced Machine Learning Approach. Trop Med Infect Dis 2024; 9:250. [PMID: 39453277 PMCID: PMC11511084 DOI: 10.3390/tropicalmed9100250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Revised: 10/10/2024] [Accepted: 10/17/2024] [Indexed: 10/26/2024] Open
Abstract
Dengue fever is a persistent public health issue in tropical regions, including Vietnam, where climate variability plays a crucial role in disease transmission dynamics. This study focuses on developing climate-based machine learning models to forecast dengue outbreaks in Ba Ria Vung Tau (BRVT) province, Vietnam, using meteorological data from 2003 to 2022. We utilized four predictive models-Negative Binomial Regression (NBR), Seasonal AutoRegressive Integrated Moving Average with Exogenous Regressors (SARIMAX), Extreme Gradient Boosting (XGBoost) v2.0.3, and long short-term memory (LSTM)-to predict weekly dengue incidence. Key climate variables, including temperature, humidity, precipitation, and wind speed, were integrated into these models, with lagged variables included to capture delayed climatic effects on dengue transmission. The NBR model demonstrated the best performance in terms of predictive accuracy, achieving the lowest Mean Absolute Error (MAE), compared to other models. The inclusion of lagged climate variables significantly enhanced the model's ability to predict dengue cases. Although effective in capturing seasonal trends, SARIMAX and LSTM models struggled with overfitting and failed to accurately predict short-term outbreaks. XGBoost exhibited moderate predictive power but was sensitive to overfitting, particularly without fine-tuning. Our findings confirm that climate-based machine learning models, particularly the NBR model, offer valuable tools for forecasting dengue outbreaks in BRVT. However, improving the models' ability to predict short-term peaks remains a challenge. The integration of meteorological data into early warning systems is crucial for public health authorities to plan timely and effective interventions. This research contributes to the growing body of literature on climate-based disease forecasting and underscores the need for further model refinement to address the complexities of dengue transmission in highly endemic regions.
Collapse
Affiliation(s)
| | - Tran Ngoc Dang
- Faculty of Public Health, University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City 70000, Vietnam;
| |
Collapse
|
9
|
Qaiser A, Manzoor S, Hashmi AH, Javed H, Zafar A, Ashraf J. Support Vector Machine Outperforms Other Machine Learning Models in Early Diagnosis of Dengue Using Routine Clinical Data. Adv Virol 2024; 2024:5588127. [PMID: 39435048 PMCID: PMC11493476 DOI: 10.1155/2024/5588127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 09/28/2024] [Indexed: 10/23/2024] Open
Abstract
Background: There is a dire need for the establishment of active dengue surveillance to continuously detect cases, circulating serotypes, and determine the disease burden of dengue fever (DF) in the country and region. Predicting dengue PCR results using machine learning (ML) models represents a significant advancement in pre-emptive healthcare measures. This study outlines the comprehensive process of data preprocessing, model selection, and the underlying mechanisms of each algorithm employed to accurately predict dengue PCR outcomes. Methods: We analyzed data from 300 suspected dengue patients in Islamabad and Rawalpindi, Pakistan, from August to October 2023. NS1 antigen ELISA, IgM and IgG antibody tests, and serotype-specific real-time polymerase chain reaction (RT-PCR) were used to detect the dengue virus (DENV). Representative PCR-positive samples were sequenced by Sanger sequencing to confirm the circulation of various dengue serotypes. Demographic information, serological test results, and hematological parameters were used as inputs to the ML models, with the dengue PCR result serving as the output to be predicted. The models used were logistic regression, XGBoost, LightGBM, random forest, support vector machine (SVM), and CatBoost. Results: Of the 300 patients, 184 (61.33%) were PCR positive. Among the total positive cases detected by PCR, 9 (4.89%), 171 (92.93%), and 4 (2.17%) were infected with serotypes 1, 2, and 3, respectively. A total of 147 (79.89%) males and 37 (20.11%) females were infected, with a mean age of 33 ± 16 years. In addition, the mean platelet and leukocyte counts and the hematocrit percentages were 75,447%, 4189.02%, and 46.05%, respectively. The SVM was the best-performing ML model for predicting RT-PCR results, with 71.4% accuracy, 97.4% recall, and 71.6% precision. Hyperparameter tuning improved the recall to 100%. Conclusion: Our study documents three circulating serotypes in the capital territory of Pakistan and highlights that the SVM outperformed other models, potentially serving as a valuable tool in clinical settings to aid in the rapid diagnosis of DF.
Collapse
Affiliation(s)
- Ariba Qaiser
- Molecular Virology Lab, National University of Science and Technology (NUST), Atta-ur-Rehman School of Applied Biosciences (ASAB), Islamabad, Pakistan
| | - Sobia Manzoor
- Molecular Virology Lab, National University of Science and Technology (NUST), Atta-ur-Rehman School of Applied Biosciences (ASAB), Islamabad, Pakistan
| | - Asraf Hussain Hashmi
- Institute of Biomedical and Genetic Engineering (IBGE), KRL Hospital, Islamabad, Pakistan
| | - Hasnain Javed
- Provincial Public Health Reference Lab, Punjab AIDS Control Programe, Lahore, Pakistan
| | - Anam Zafar
- Department of Pediatrics, Avicenna Medical Complex, Lahore, Pakistan
| | - Javed Ashraf
- Department of Community Dentistry, Riphah International University, Islamabad, Pakistan
- Institute of Dentistry, University of Eastern Finland, Kuopio, Finland
| |
Collapse
|
10
|
Mazhar B, Ali NM, Manzoor F, Khan MK, Nasir M, Ramzan M. Development of data-driven machine learning models and their potential role in predicting dengue outbreak. J Vector Borne Dis 2024; 61:503-514. [PMID: 38238798 DOI: 10.4103/0972-9062.393976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 12/11/2023] [Indexed: 11/29/2024] Open
Abstract
Dengue fever is one of the most widespread vector-borne viral infections in the world, resulting in increased socio-economic burden. WHO has reported that 2.5 billion people are infected with dengue fever across the world, resulting in high mortalities in tropical and subtropical regions. The current article endeavors to present an overview of predicting dengue outbreaks through data-based machine-learning models. This artificial intelligence model uses real world data such as dengue surveillance, climatic variables, and epidemiological data and combines big data with machine learning algorithms to forecast dengue. Monitoring and predicting dengue incidences has been significantly enhanced through innovative approaches. This involves gathering data on various climatic factors, including temperature, rainfall, relative humidity, and wind speed, along with monthly records of dengue cases. The study functions as an efficient warning system, enabling the anticipation of dengue outbreaks. This early warning system not only alerts communities but also aids relevant authorities in implementing crucial preventive measures.
Collapse
Affiliation(s)
- Bushra Mazhar
- Department of Zoology, Government College University, Lahore, Pakistan
| | - Nazish Mazhar Ali
- Department of Zoology, Government College University, Lahore, Pakistan
| | - Farkhanda Manzoor
- Department of Zoology, Lahore College for Women University, Lahore, Pakistan
| | | | - Muhammad Nasir
- Department of Zoology, Government College University, Lahore, Pakistan
| | - Muhammad Ramzan
- Department of Chemistry, Government College University, Lahore, Pakistan
| |
Collapse
|
11
|
Ren H, Xu N. Forecasting and mapping dengue fever epidemics in China: a spatiotemporal analysis. Infect Dis Poverty 2024; 13:50. [PMID: 38956632 PMCID: PMC11221048 DOI: 10.1186/s40249-024-01219-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Accepted: 06/20/2024] [Indexed: 07/04/2024] Open
Abstract
BACKGROUND Dengue fever (DF) has emerged as a significant public health concern in China. The spatiotemporal patterns and underlying influencing its spread, however, remain elusive. This study aims to identify the factors driving these variations and to assess the city-level risk of DF epidemics in China. METHODS We analyzed the frequency, intensity, and distribution of DF cases in China from 2003 to 2022 and evaluated 11 natural and socioeconomic factors as potential drivers. Using the random forest (RF) model, we assessed the contributions of these factors to local DF epidemics and predicted the corresponding city-level risk. RESULTS Between 2003 and 2022, there was a notable correlation between local and imported DF epidemics in case numbers (r = 0.41, P < 0.01) and affected cities (r = 0.79, P < 0.01). With the increase in the frequency and intensity of imported epidemics, local epidemics have become more severe. Their occurrence has increased from five to eight months per year, with case numbers spanning from 14 to 6641 per month. The spatial distribution of city-level DF epidemics aligns with the geographical divisions defined by the Huhuanyong Line (Hu Line) and Qin Mountain-Huai River Line (Q-H Line) and matched well with the city-level time windows for either mosquito vector activity (83.59%) or DF transmission (95.74%). The RF models achieved a high performance (AUC = 0.92) when considering the time windows. Importantly, they identified imported cases as the primary influencing factor, contributing significantly (24.82%) to local DF epidemics at the city level in the eastern region of the Hu Line (E-H region). Moreover, imported cases were found to have a linear promoting impact on local epidemics, while five climatic and six socioeconomic factors exhibited nonlinear effects (promoting or inhibiting) with varying inflection values. Additionally, this model demonstrated outstanding accuracy (hitting ratio = 95.56%) in predicting the city-level risks of local epidemics in China. CONCLUSIONS China is experiencing an increasing occurrence of sporadic local DF epidemics driven by an unavoidably higher frequency and intensity of imported DF epidemics. This research offers valuable insights for health authorities to strengthen their intervention capabilities against this disease.
Collapse
Affiliation(s)
- Hongyan Ren
- State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, 100101, China.
| | - Nankang Xu
- State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, 100101, China
- College of Resources and Environment, University of Chinese Academy of Sciences, Beijing, 100049, China
| |
Collapse
|
12
|
Parthasarathi KTS, Gaikwad KB, Rajesh S, Rana S, Pandey A, Singh H, Sharma J. A machine learning-based strategy to elucidate the identification of antibiotic resistance in bacteria. FRONTIERS IN ANTIBIOTICS 2024; 3:1405296. [PMID: 39816256 PMCID: PMC11732175 DOI: 10.3389/frabi.2024.1405296] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 05/28/2024] [Indexed: 01/18/2025]
Abstract
Microorganisms, crucial for environmental equilibrium, could be destructive, resulting in detrimental pathophysiology to the human host. Moreover, with the emergence of antibiotic resistance (ABR), the microbial communities pose the century's largest public health challenges in terms of effective treatment strategies. Furthermore, given the large diversity and number of known bacterial strains, describing treatment choices for infected patients using experimental methodologies is time-consuming. An alternative technique, gaining popularity as sequencing prices fall and technology advances, is to use bacterial genotype rather than phenotype to determine ABR. Complementing machine learning into clinical practice provides a data-driven platform for categorization and interpretation of bacterial datasets. In the present study, k-mers were generated from nucleotide sequences of pathogenic bacteria resistant to antibiotics. Subsequently, they were clustered into groups of bacteria sharing similar genomic features using the Affinity propagation algorithm with a Silhouette coefficient of 0.82. Thereafter, a prediction model based on Random Forest algorithm was developed to explore the prediction capability of the k-mers. It yielded an overall specificity of 0.99 and a sensitivity of 0.98. Additionally, the genes and ABR drivers related to the k-mers were identified to explore their biological relevance. Furthermore, a multilayer perceptron model with a hamming loss of 0.05 was built to classify the bacterial strains into resistant and non-resistant strains against various antibiotics. Segregating pathogenic bacteria based on genomic similarities could be a valuable approach for assessing the severity of diseases caused by new bacterial strains. Utilization of this strategy could aid in enhancing our understanding of ABR patterns, paving the way for more informed and effective treatment options.
Collapse
Affiliation(s)
- K. T. Shreya Parthasarathi
- Manipal Academy of Higher Education (MAHE), Manipal, Karnataka, India
- Institute of Bioinformatics, Bangalore, India
| | - Kiran Bharat Gaikwad
- Manipal Academy of Higher Education (MAHE), Manipal, Karnataka, India
- Institute of Bioinformatics, Bangalore, India
| | | | - Shweta Rana
- Division of Biomedical Informatics, Indian Council of Medical Research, New Delhi, India
| | - Akhilesh Pandey
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
- Center for Individualized Medicine, Mayo Clinic, Rochester, MN, United States
| | - Harpreet Singh
- Division of Biomedical Informatics, Indian Council of Medical Research, New Delhi, India
| | - Jyoti Sharma
- Manipal Academy of Higher Education (MAHE), Manipal, Karnataka, India
- Institute of Bioinformatics, Bangalore, India
| |
Collapse
|
13
|
Moukheiber D, Restrepo D, Cajas SA, Montoya MPA, Celi LA, Kuo KT, López DM, Moukheiber L, Moukheiber M, Moukheiber S, Osorio-Valencia JS, Purkayastha S, Paddo AR, Wu C, Kuo PC. A multimodal framework for extraction and fusion of satellite images and public health data. Sci Data 2024; 11:634. [PMID: 38879585 PMCID: PMC11180113 DOI: 10.1038/s41597-024-03366-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 05/10/2024] [Indexed: 06/19/2024] Open
Abstract
In low- and middle-income countries, the substantial costs associated with traditional data collection pose an obstacle to facilitating decision-making in the field of public health. Satellite imagery offers a potential solution, but the image extraction and analysis can be costly and requires specialized expertise. We introduce SatelliteBench, a scalable framework for satellite image extraction and vector embeddings generation. We also propose a novel multimodal fusion pipeline that utilizes a series of satellite imagery and metadata. The framework was evaluated generating a dataset with a collection of 12,636 images and embeddings accompanied by comprehensive metadata, from 81 municipalities in Colombia between 2016 and 2018. The dataset was then evaluated in 3 tasks: including dengue case prediction, poverty assessment, and access to education. The performance showcases the versatility and practicality of SatelliteBench, offering a reproducible, accessible and open tool to enhance decision-making in public health.
Collapse
Affiliation(s)
- Dana Moukheiber
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - David Restrepo
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.
- Departamento de Telemática, Universidad del Cauca, Popayán, Cauca, Colombia.
| | - Sebastián Andrés Cajas
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Boston, Massachusetts, USA
- School of Computer Science, University College Dublin, Dublin, Ireland
| | | | - Leo Anthony Celi
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
- Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, Massachusetts, USA
- Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, USA
| | - Kuan-Ting Kuo
- Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan
| | - Diego M López
- Departamento de Telemática, Universidad del Cauca, Popayán, Cauca, Colombia
| | - Lama Moukheiber
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Mira Moukheiber
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Sulaiman Moukheiber
- Department of Computer Science, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
| | | | - Saptarshi Purkayastha
- Department of BioHealth Informatics, Indiana University Luddy School of Informatics, Computing, and Engineering, Indianapolis, Indiana, USA
| | - Atika Rahman Paddo
- Department of BioHealth Informatics, Indiana University Luddy School of Informatics, Computing, and Engineering, Indianapolis, Indiana, USA
| | - Chenwei Wu
- Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, Michigan, USA
| | - Po-Chih Kuo
- Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan.
| |
Collapse
|
14
|
Bohm BC, Borges FEDM, Silva SCM, Soares AT, Ferreira DD, Belo VS, Lignon JS, Bruhn FRP. Utilization of machine learning for dengue case screening. BMC Public Health 2024; 24:1573. [PMID: 38862945 PMCID: PMC11167742 DOI: 10.1186/s12889-024-19083-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 06/07/2024] [Indexed: 06/13/2024] Open
Abstract
Dengue causes approximately 10.000 deaths and 100 million symptomatic infections annually worldwide, making it a significant public health concern. To address this, artificial intelligence tools like machine learning can play a crucial role in developing more effective strategies for control, diagnosis, and treatment. This study identifies relevant variables for the screening of dengue cases through machine learning models and evaluates the accuracy of the models. Data from reported dengue cases in the states of Rio de Janeiro and Minas Gerais for the years 2016 and 2019 were obtained through the National Notifiable Diseases Surveillance System (SINAN). The mutual information technique was used to assess which variables were most related to laboratory-confirmed dengue cases. Next, a random selection of 10,000 confirmed cases and 10,000 discarded cases was performed, and the dataset was divided into training (70%) and testing (30%). Machine learning models were then tested to classify the cases. It was found that the logistic regression model with 10 variables (gender, age, fever, myalgia, headache, vomiting, nausea, back pain, rash, retro-orbital pain) and the Decision Tree and Multilayer Perceptron (MLP) models achieved the best results in decision metrics, with an accuracy of 98%. Therefore, a tree-based model would be suitable for building an application and implementing it on smartphones. This resource would be available to healthcare professionals such as doctors and nurses.
Collapse
Affiliation(s)
- Bianca Conrad Bohm
- Laboratory of Veterinary Epidemiology, Postgraduate Program in Veterinary, Federal University of Pelotas (UFPel), Capão do Leão, RS, Brazil.
| | | | - Suellen Caroline Matos Silva
- Laboratory of Veterinary Epidemiology, Postgraduate Program in Veterinary, Federal University of Pelotas (UFPel), Capão do Leão, RS, Brazil
| | - Alessandra Talaska Soares
- Laboratory of Veterinary Epidemiology, Graduate Program in Microbiology and Parasitology, Federal University of Pelotas, Capão do Leão, Rio Grande do Sul, Brazil
| | | | - Vinícius Silva Belo
- Federal University of São, João del-Rei, Midwest Dona Lindu campus, Divinópolis, Minas Gerais, Brazil
| | - Julia Somavilla Lignon
- Laboratory of Veterinary Epidemiology, Postgraduate Program in Veterinary, Federal University of Pelotas (UFPel), Capão do Leão, RS, Brazil
| | - Fábio Raphael Pascoti Bruhn
- Laboratory of Veterinary Epidemiology, Preventive Veterinary Department, Federal University of Pelotas,, Capão do Leão, Rio Grande do Sul, Brazil
| |
Collapse
|
15
|
Chowdhury AH, Rad D, Rahman MS. Predicting anxiety, depression, and insomnia among Bangladeshi university students using tree-based machine learning models. Health Sci Rep 2024; 7:e2037. [PMID: 38650723 PMCID: PMC11033350 DOI: 10.1002/hsr2.2037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 02/21/2024] [Accepted: 03/26/2024] [Indexed: 04/25/2024] Open
Abstract
Background and Aims Mental health problem is a rising public health concern. People of all ages, specially Bangladeshi university students, are more affected by this burden. Thus, the objective of the study was to use tree-based machine learning (ML) models to identify major risk factors and predict anxiety, depression, and insomnia in university students. Methods A social media-based cross-sectional survey was employed for data collection. We used Generalized Anxiety Disorder (GAD-7), Patient Health Questionnaire (PHQ-9) and Insomnia Severity Index (ISI-7) scale for measuring students' anxiety, depression and insomnia problems. The tree-based supervised decision tree (DT), random forest (RF) and robust eXtreme Gradient Boosting (XGBoost) ML algorithms were used to build the prediction models and their predictive performance was evaluated using confusion matrix and receiver operating characteristic (ROC) curves. Results Of the 1250 students surveyed, 64.7% were male and 35.3% were female. The students' ages ranged from 18 to 26 years old, with an average age of 22.24 years (SD = 1.30). Majority of the students (72.6%) were from rural areas and social media addicted (56.6%). Almost 83.3% of the students had moderate to severe anxiety, 84.7% had moderate to severe depression and 76.5% had moderate to severe insomnia problems. Students' social media addiction, age, academic performance, smoking status, monthly family income and morningness-eveningness are the main risk factors of anxiety, depression and insomnia. The highest predictive performance was observed from the XGBoost model for anxiety, depression and insomnia. Conclusion The study findings offer valuable insights for stakeholders, families and policymakers enabling a more profound comprehension of the pressing mental health disorders. This understanding can guide the formulation of improved policy strategies, initiatives for mental health promotion, and the development of effective counseling services within university campus. Additionally, our proposed model might play a critical role in diagnosing and predicting mental health problems among Bangladeshi university students and similar settings.
Collapse
Affiliation(s)
| | - Dana Rad
- Center of Research Development and Innovation in PsychologyAurel Vlaicu University of AradAradRomania
| | | |
Collapse
|
16
|
Kuo CY, Yang WW, Su ECY. Improving dengue fever predictions in Taiwan based on feature selection and random forests. BMC Infect Dis 2024; 24:334. [PMID: 38509486 PMCID: PMC10953060 DOI: 10.1186/s12879-024-09220-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 03/12/2024] [Indexed: 03/22/2024] Open
Abstract
BACKGROUND Dengue fever is a well-studied vector-borne disease in tropical and subtropical areas of the world. Several methods for predicting the occurrence of dengue fever in Taiwan have been proposed. However, to the best of our knowledge, no study has investigated the relationship between air quality indices (AQIs) and dengue fever in Taiwan. RESULTS This study aimed to develop a dengue fever prediction model in which meteorological factors, a vector index, and AQIs were incorporated into different machine learning algorithms. A total of 805 meteorological records from 2013 to 2015 were collected from government open-source data after preprocessing. In addition to well-known dengue-related factors, we investigated the effects of novel variables, including particulate matter with an aerodynamic diameter < 10 µm (PM10), PM2.5, and an ultraviolet index, for predicting dengue fever occurrence. The collected dataset was randomly divided into an 80% training set and a 20% test set. The experimental results showed that the random forests achieved an area under the receiver operating characteristic curve of 0.9547 for the test set, which was the best compared with the other machine learning algorithms. In addition, the temperature was the most important factor in our variable importance analysis, and it showed a positive effect on dengue fever at < 30 °C but had less of an effect at > 30 °C. The AQIs were not as important as temperature, but one was selected in the process of filtering the variables and showed a certain influence on the final results. CONCLUSIONS Our study is the first to demonstrate that AQI negatively affects dengue fever occurrence in Taiwan. The proposed prediction model can be used as an early warning system for public health to prevent dengue fever outbreaks.
Collapse
Affiliation(s)
- Chao-Yang Kuo
- Smart Healthcare Interdisciplinary College, National Taipei University of Nursing and Health Sciences, No.365, Mingde Road, Beitou District, Taipei City, 112303, Taiwan
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, No.301, Yuantong Road, Zhonghe District, New Taipei City, 23564, Taiwan
| | - Wei-Wen Yang
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, No.301, Yuantong Road, Zhonghe District, New Taipei City, 23564, Taiwan
| | - Emily Chia-Yu Su
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, No.301, Yuantong Road, Zhonghe District, New Taipei City, 23564, Taiwan.
- Clinical Big Data Research Center, Taipei Medical University Hospital, No.252 Wuxing Street, Xinyi District, Taipei City, 110, Taiwan.
| |
Collapse
|
17
|
Fang L, Hu W, Pan G. Meteorological factors cannot be ignored in machine learning-based methods for predicting dengue, a systematic review. INTERNATIONAL JOURNAL OF BIOMETEOROLOGY 2024; 68:401-410. [PMID: 38150020 DOI: 10.1007/s00484-023-02605-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 09/18/2023] [Accepted: 12/13/2023] [Indexed: 12/28/2023]
Abstract
In recent years, there has been a rapid increase in the application of machine learning methods about predicting the incidence of dengue fever. However, the predictive factors and models employed in different studies vary greatly. Hence, we conducted a systematic review to summarize machine learning methods and predictors in previous studies. We searched PubMed, ScienceDirect, and Web of Science databases for articles published up to July 2023. The selected papers included not only the forecast of dengue incidence but also machine learning methods. A total of 23 papers were included in this study. Predictive factors included meteorological factors (22, 95.7%), historical dengue data (14, 60.9%), environmental factors (4, 17.4%), socioeconomic factors (4, 17.4%), vector surveillance data (2, 8.7%), and internet search data (3, 13.0%). Among meteorological factors, temperature (20, 87.0%), rainfall (20, 87.0%), and relative humidity (14, 60.9%) were the most commonly used. We found that Support Vector Machine (SVM) (6, 26.1%), Long Short-Term Memory (LSTM) (5, 21.7%), Random Forest (RF) (4, 17.4%), Least Absolute Shrinkage and Selection Operator (LASSO) (2, 8.7%), ensemble model (2, 8.7%), and other models (4, 17.4%) were identified as the best models based on evaluation metrics used in each article. These results indicate that meteorological factors are important predictors that cannot be ignored and SVM and LSTM algorithms are the most commonly used models in dengue fever prediction with good predictive performance. This review will contribute to the development of more robust early dengue warning systems and promote the application of machine learning methods in predicting climate-related infectious diseases.
Collapse
Affiliation(s)
- Lanlan Fang
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, 230032, Anhui, China
| | - Wan Hu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, 230032, Anhui, China
| | - Guixia Pan
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, 230032, Anhui, China.
- The Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Anhui Medical University, Hefei, China.
| |
Collapse
|
18
|
Zhang T, Rabhi F, Chen X, Paik HY, MacIntyre CR. A machine learning-based universal outbreak risk prediction tool. Comput Biol Med 2024; 169:107876. [PMID: 38176209 DOI: 10.1016/j.compbiomed.2023.107876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 12/12/2023] [Accepted: 12/18/2023] [Indexed: 01/06/2024]
Abstract
In order to prevent and control the increasing number of serious epidemics, the ability to predict the risk caused by emerging outbreaks is essential. However, most current risk prediction tools, except EPIRISK, are limited by being designed for targeting only one specific disease and one country. Differences between countries and diseases (e.g., different economic conditions, different modes of transmission, etc.) pose challenges for building models with cross-country and cross-disease prediction capabilities. The limitation of universality affects domestic and international efforts to control and prevent pandemic outbreaks. To address this problem, we used outbreak data from 43 diseases in 206 countries to develop a universal risk prediction system that can be used across countries and diseases. This system used five machine learning models (including Neural Network XGBoost, Logistic Boost, Random Forest and Kernel SVM) to predict and vote together to make ensemble predictions. It can make predictions with around 80%-90 % accuracy from economic, cultural, social, and epidemiological factors. Three different datasets were designed to test the performance of ML models under different realistic situations. This prediction system has strong predictive ability, adaptability, and generality. It can give universal outbreak risk assessment that are not limited by border or disease type, facilitate rapid response to pandemic outbreaks, government decision-making and international cooperation.
Collapse
Affiliation(s)
- Tianyu Zhang
- FinanceIT Research Group, University of New South Wales, Sydney, NSW, Australia.
| | - Fethi Rabhi
- FinanceIT Research Group, University of New South Wales, Sydney, NSW, Australia
| | - Xin Chen
- Biosecurity Program, The Kirby Institute, University of New South Wales, Sydney, NSW, 2052, Australia
| | - Hye-Young Paik
- School of Computer Science and Engineering, Faulty of Engineering, University of New South Wales, Sydney, NSW, 2052, Australia
| | - Chandini Raina MacIntyre
- Biosecurity Program, The Kirby Institute, University of New South Wales, Sydney, NSW, 2052, Australia; College of Public Service & Community Solutions, Arizona State University, Tempe, AZ, 85004, United States
| |
Collapse
|
19
|
Ong SQ, Isawasan P, Ngesom AMM, Shahar H, Lasim AM, Nair G. Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data. Sci Rep 2023; 13:19129. [PMID: 37926755 PMCID: PMC10625978 DOI: 10.1038/s41598-023-46342-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Accepted: 10/31/2023] [Indexed: 11/07/2023] Open
Abstract
Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorithms that have higher performance. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including an ensemble ML method, and compared their performance using the receiver operating characteristic (ROC) with the area under the curve (AUC), accuracy and F1 score. Our results show that an ensemble ML such as XG Boost, AdaBoost and Random Forest perform better than the logistics regression, Naïve Bayens, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our result provides a framework for future studies on the use of predictive models in the development of an early warning system.
Collapse
Affiliation(s)
- Song Quan Ong
- Entomology Laboratory, Institute for Tropical Biology and Conservation, Universiti Malaysia Sabah, Jalan UMS, 88400, Kota Kinabalu, Sabah, Malaysia.
| | - Pradeep Isawasan
- Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Perak Branch, Tapah Campus, 35400, Tapah, Malaysia
| | - Ahmad Mohiddin Mohd Ngesom
- Centre for Communicable Diseases Research, Institute for Public Health, National Institutes of Health, Ministry of Health, Shah Alam, Malaysia
| | - Hanipah Shahar
- Entomology and Pest Unit, Federal Territory of Kuala Lumpur and Putrajaya Health Department, Jalan Cenderasari, 50590, Kuala Lumpur, Malaysia
| | - As'malia Md Lasim
- Phytochemistry Unit, Herbal Medicine Research Centre, Institute for Medical Research, National Health Institute, Setia Alam, Malaysia
| | - Gomesh Nair
- School of Electrical and Electronics Engineering, Universiti Sains Malaysia, Penang, Malaysia
| |
Collapse
|
20
|
Majeed MA, Shafri HZM, Zulkafli Z, Wayayok A. A Deep Learning Approach for Dengue Fever Prediction in Malaysia Using LSTM with Spatial Attention. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:4130. [PMID: 36901139 PMCID: PMC10002017 DOI: 10.3390/ijerph20054130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 02/19/2023] [Accepted: 02/21/2023] [Indexed: 06/18/2023]
Abstract
This research aims to predict dengue fever cases in Malaysia using machine learning techniques. A dataset consisting of weekly dengue cases at the state level in Malaysia from 2010 to 2016 was obtained from the Malaysia Open Data website and includes variables such as climate, geography, and demographics. Six different long short-term memory (LSTM) models were developed and compared for dengue prediction in Malaysia: LSTM, stacked LSTM (S-LSTM), LSTM with temporal attention (TA-LSTM), S-LSTM with temporal attention (STA-LSTM), LSTM with spatial attention (SA-LSTM), and S-LSTM with spatial attention (SSA-LSTM). The models were trained and evaluated on a dataset of monthly dengue cases in Malaysia from 2010 to 2016, with the task of predicting the number of dengue cases based on various climate, topographic, demographic, and land-use variables. The SSA-LSTM model, which used both stacked LSTM layers and spatial attention, performed the best, with an average root mean squared error (RMSE) of 3.17 across all lookback periods. When compared to three benchmark models (SVM, DT, ANN), the SSA-LSTM model had a significantly lower average RMSE. The SSA-LSTM model also performed well in different states in Malaysia, with RMSE values ranging from 2.91 to 4.55. When comparing temporal and spatial attention models, the spatial models generally performed better at predicting dengue cases. The SSA-LSTM model was also found to perform well at different prediction horizons, with the lowest RMSE at 4- and 5-month lookback periods. Overall, the results suggest that the SSA-LSTM model is effective at predicting dengue cases in Malaysia.
Collapse
Affiliation(s)
- Mokhalad A. Majeed
- Department of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
| | - Helmi Zulhaidi Mohd Shafri
- Department of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
- Geospatial Information Science Research Centre (GISRC), Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
| | - Zed Zulkafli
- Department of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
| | - Aimrun Wayayok
- Department of Biological and Agricultural Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
| |
Collapse
|
21
|
Leung XY, Islam RM, Adhami M, Ilic D, McDonald L, Palawaththa S, Diug B, Munshi SU, Karim MN. A systematic review of dengue outbreak prediction models: Current scenario and future directions. PLoS Negl Trop Dis 2023; 17:e0010631. [PMID: 36780568 PMCID: PMC9956653 DOI: 10.1371/journal.pntd.0010631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 02/24/2023] [Accepted: 01/29/2023] [Indexed: 02/15/2023] Open
Abstract
Dengue is among the fastest-spreading vector-borne infectious disease, with outbreaks often overwhelm the health system and result in huge morbidity and mortality in its endemic populations in the absence of an efficient warning system. A large number of prediction models are currently in use globally. As such, this study aimed to systematically review the published literature that used quantitative models to predict dengue outbreaks and provide insights about the current practices. A systematic search was undertaken, using the Ovid MEDLINE, EMBASE, Scopus and Web of Science databases for published citations, without time or geographical restrictions. Study selection, data extraction and management process were devised in accordance with the 'Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies' ('CHARMS') framework. A total of 99 models were included in the review from 64 studies. Most models sourced climate (94.7%) and climate change (77.8%) data from agency reports and only 59.6% of the models adjusted for reporting time lag. All included models used climate predictors; 70.7% of them were built with only climate factors. Climate factors were used in combination with climate change factors (13.4%), both climate change and demographic factors (3.1%), vector factors (6.3%), and demographic factors (5.2%). Machine learning techniques were used for 39.4% of the models. Of these, random forest (15.4%), neural networks (23.1%) and ensemble models (10.3%) were notable. Among the statistical (60.6%) models, linear regression (18.3%), Poisson regression (18.3%), generalized additive models (16.7%) and time series/autoregressive models (26.7%) were notable. Around 20.2% of the models reported no validation at all and only 5.2% reported external validation. The reporting of methodology and model performance measures were inadequate in many of the existing prediction models. This review collates plausible predictors and methodological approaches, which will contribute to robust modelling in diverse settings and populations.
Collapse
Affiliation(s)
- Xing Yu Leung
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Rakibul M. Islam
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Mohammadmehdi Adhami
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Dragan Ilic
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Lara McDonald
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Shanika Palawaththa
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Basia Diug
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Saif U. Munshi
- Department of Virology, Bangabandhu Sheikh Mujib Medical University, Dhaka, Bangladesh
| | - Md Nazmul Karim
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
- * E-mail:
| |
Collapse
|
22
|
Bhattamisra SK, Banerjee P, Gupta P, Mayuren J, Patra S, Candasamy M. Artificial Intelligence in Pharmaceutical and Healthcare Research. BIG DATA AND COGNITIVE COMPUTING 2023; 7:10. [DOI: 10.3390/bdcc7010010] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Abstract
Artificial intelligence (AI) is a branch of computer science that allows machines to work efficiently, can analyze complex data. The research focused on AI has increased tremendously, and its role in healthcare service and research is emerging at a greater pace. This review elaborates on the opportunities and challenges of AI in healthcare and pharmaceutical research. The literature was collected from domains such as PubMed, Science Direct and Google scholar using specific keywords and phrases such as ‘Artificial intelligence’, ‘Pharmaceutical research’, ‘drug discovery’, ‘clinical trial’, ‘disease diagnosis’, etc. to select the research and review articles published within the last five years. The application of AI in disease diagnosis, digital therapy, personalized treatment, drug discovery and forecasting epidemics or pandemics was extensively reviewed in this article. Deep learning and neural networks are the most used AI technologies; Bayesian nonparametric models are the potential technologies for clinical trial design; natural language processing and wearable devices are used in patient identification and clinical trial monitoring. Deep learning and neural networks were applied in predicting the outbreak of seasonal influenza, Zika, Ebola, Tuberculosis and COVID-19. With the advancement of AI technologies, the scientific community may witness rapid and cost-effective healthcare and pharmaceutical research as well as provide improved service to the general public.
Collapse
Affiliation(s)
- Subrat Kumar Bhattamisra
- Department of Pharmacology, GITAM School of Pharmacy, GITAM (Deemed to Be University), Visakhapatnam 530045, Andhra Pradesh, India
| | - Priyanka Banerjee
- Department of Pharmaceutical Technology, School of Medical Sciences, Adamas University, Kolkata 700126, West Bengal, India
| | - Pratibha Gupta
- Department of Pharmaceutical Technology, School of Medical Sciences, Adamas University, Kolkata 700126, West Bengal, India
| | - Jayashree Mayuren
- Department of Pharmaceutical Technology, School of Pharmacy, International Medical University, Kuala Lumpur 57000, Malaysia
| | - Susmita Patra
- Department of Pharmaceutical Technology, School of Medical Sciences, Adamas University, Kolkata 700126, West Bengal, India
| | - Mayuren Candasamy
- Department of Life Sciences, School of Pharmacy, International Medical University, Kuala Lumpur 57000, Malaysia
| |
Collapse
|
23
|
Annan E, Bukhari MH, Treviño J, Abad ZSH, Lubinda J, da Silva EA, Haque U. The ecological determinants of severe dengue: A Bayesian inferential model. ECOL INFORM 2023. [DOI: 10.1016/j.ecoinf.2023.101986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
24
|
Labadin J, Hong BH, Tiong WK, Gill BS, Perera D, Rigit ARH, Singh S, Tan CV, Ghazali SM, Jelip J, Mokhtar N, Rashid NBA, Bakar HBA, Lim JH, Taib NM, George A. Development and user testing study of MozzHub: a bipartite network-based dengue hotspot detector. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:17415-17436. [PMID: 36404933 PMCID: PMC9649007 DOI: 10.1007/s11042-022-14120-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 10/14/2022] [Accepted: 10/25/2022] [Indexed: 06/16/2023]
Abstract
Traditionally, dengue is controlled by fogging, and the prime location for the control measure is at the patient's residence. However, when Malaysia was hit by the first wave of the Coronavirus disease (COVID-19), and the government-imposed movement control order, dengue cases have decreased by more than 30% from the previous year. This implies that residential areas may not be the prime locations for dengue-infected mosquitoes. The existing early warning system was focused on temporal prediction wherein the lack of consideration for spatial component at the microlevel and human mobility were not considered. Thus, we developed MozzHub, which is a web-based application system based on the bipartite network-based dengue model that is focused on identifying the source of dengue infection at a small spatial level (400 m) by integrating human mobility and environmental predictors. The model was earlier developed and validated; therefore, this study presents the design and implementation of the MozzHub system and the results of a preliminary pilot test and user acceptance of MozzHub in six district health offices in Malaysia. It was found that the MozzHub system is well received by the sample of end-users as it was demonstrated as a useful (77.4%), easy-to-operate system (80.6%), and has achieved adequate client satisfaction for its use (74.2%).
Collapse
Affiliation(s)
- Jane Labadin
- Faculty of Computer Science and Information Technology, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Malaysia
| | - Boon Hao Hong
- Faculty of Computer Science and Information Technology, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Malaysia
| | - Wei King Tiong
- Faculty of Computer Science and Information Technology, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Malaysia
| | | | - David Perera
- Institute for Health and Community Medicine, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Malaysia
| | | | - Sarbhan Singh
- Institute for Medical Research, Ministry of Health, Kuala Lumpur, Malaysia
| | - Cia Vei Tan
- Institute for Medical Research, Ministry of Health, Kuala Lumpur, Malaysia
| | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Epidemiology (2012-2019) and costs (2009-2019) of dengue in Malaysia: a systematic literature review. Int J Infect Dis 2022; 124:240-247. [PMID: 36089149 DOI: 10.1016/j.ijid.2022.09.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 08/10/2022] [Accepted: 09/05/2022] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND A systematic literature review was conducted to assess the epidemiology and economic burden of dengue in Malaysia. METHODS Embase, MEDLINE, Evidence-Based Reviews databases, and grey literature sources were searched for English and Malay studies and surveillance reports on the epidemiology (between 2012-2019) and costs (between 2009-2019) of dengue in Malaysia. Independent screening of titles/abstracts followed by full texts was performed using pre-specified criteria. RESULTS A total of 198 publications were included (55 peer-reviewed and 143 grey literature). Dengue incidence has been increasing in recent years, with 130,101 cases (dengue fever [DF] 129,578 cases; dengue haemorrhagic fever [DHF] 523 cases) reported in 2019, which is the highest since 2012. All DENV serotypes co-circulated between 2004 and 2017, and major outbreaks occurred in a cyclical pattern, often associated with a change in the predominant circulating serotype. Economic impacts are substantial, including the societal impact of lost work (7.2-8.8 days) and school days (3.2-4.1 days) due to dengue. CONCLUSIONS The rising incidence and high cost of dengue, coupled with overlapping diseases, will likely result in further pressures on the healthcare system. To appropriately mitigate and control dengue, it is critical to implement integrated strategies, including vaccination, to reduce the burden of dengue.
Collapse
|
26
|
Ismail S, Fildes R, Ahmad R, Wan Mohamad Ali WN, Omar T. The practicality of Malaysia dengue outbreak forecasting model as an early warning system. Infect Dis Model 2022; 7:510-525. [PMID: 36091345 PMCID: PMC9418377 DOI: 10.1016/j.idm.2022.07.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 07/07/2022] [Accepted: 07/30/2022] [Indexed: 11/26/2022] Open
Abstract
Dengue is a harmful tropical disease that causes death to many people. Currently, the dengue vaccine development is still at an early stage, and only intervention methods exist after dengue cases increase. Thus, previously, two scientific experimental field studies were conducted in producing a dengue outbreak forecasting model as an early warning system. Successfully, an Autoregressive Distributed Lag (ADL) Model was developed using three factors: the epidemiological, entomological, and environmental with an accuracy of 85%; but a higher percentage is required in minimizing the error for the model to be useful. Hence, this study aimed to develop a practical and cost-effective dengue outbreak forecasting model with at least 90% accuracy to be embedded in an early warning computer system using the Internet of Things (IoT) approach. Eighty-one weeks of time series data of the three factors were used in six forecasting models, which were Autoregressive Distributed Lag (ADL), Hierarchical Forecasting (Bottom-up and Optimal combination) and three Machine Learning methods: (Artificial Neural Network (ANN), Support Vector Machine (SVM) and Random Forest). Five error measures were used to evaluate the consistency performance of the models in order to ensure model performance. The findings indicated Random Forest outperformed the other models with an accuracy of 95% when including all three factors. But practically, collecting mosquito related data (the entomological factor) was very costly and time consuming. Thus, it was removed from the model, and the accuracy dropped to 92% but still high enough to be of practical use, i.e., beyond 90%. However, the practical ground operationalization of the early warning system also requires several rain gauges to be located at the dengue hot spots due to localized rainfall. Hence, further analysis was conducted in determining the location of the rain gauges. This has led to the recommendation that the rain gauges should be located about 3–4 km apart at the dengue hot spots to ensure the accuracy of the rainfall data to be included in the dengue outbreak forecasting model so that it can be embedded in the early warning system. Therefore, this early warning system can save lives, and prevention is better than cure.
Collapse
|
27
|
Nordin NRM, Arsad FS, Mahmud MH, Kamaruddin PSNM, Amir SM, Bahari NI, Hassan MR, Rahim SSSA, Lukman KA, Jeffree MS. Wolbachia in Dengue Control: A Systematic Review. Open Access Maced J Med Sci 2022. [DOI: 10.3889/oamjms.2022.9014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
BACKGROUND: Dengue fever outbreaks have been an important public health issue causing high morbidity and mortality, and serious economic effects, particularly in Asia. Control strategies are a challenge to be implemented due to a variety of factors. However, new approaches such as Wolbachia-infected Aedes aegypti have been shown to successfully lowering the life spans of the mosquito, eggs resistance, and disease transmission capabilities. Field trials are still on-going, and there are data to support its benefit in a large population. This systematic review aims to determine the current progress and impact of using Wolbachia in curbing dengue cases in high dengue case locations worldwide.
METHODOLOGY: The study uses the Preferred Reporting Items for Systematic reviews and Meta-Analyses review protocol, while the formulation of the research question was based on population of interest, comparison, and outcome. The selected databases include Web of Science, Scopus, PubMed, SAGE, and EBSCOhost. A thorough identification, screening, and included process were done and the results retrieved four articles. These articles were then ranked based on quality using mixed methods appraisal tool.
RESULTS: A total of four articles were included from 2019 and 2020 reports in both dengue- and non-dengue-endemic settings. In this review, comparisons in terms of the hierarchy of the study design, community engagement and acceptance, Wolbachia-infected A. aegypti deployment, entomological outcome, and epidemiological outcomes were detailed. All four studies showed a decrease in dengue incidence in Wolbachia-intervention populations.
CONCLUSION: Wolbachia programs have been shown to be an effective method in combating dengue diseases. Strong community engagement and involvement from multidisciplinary teams are important factors to ensure the effectiveness and good outcomes of the program.
Collapse
|
28
|
Dey SK, Rahman MM, Howlader A, Siddiqi UR, Uddin KMM, Borhan R, Rahman EU. Prediction of dengue incidents using hospitalized patients, metrological and socio-economic data in Bangladesh: A machine learning approach. PLoS One 2022; 17:e0270933. [PMID: 35857776 PMCID: PMC9299345 DOI: 10.1371/journal.pone.0270933] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 06/21/2022] [Indexed: 11/20/2022] Open
Abstract
Dengue fever is a severe disease spread by Aedes mosquito-borne dengue viruses (DENVs) in tropical areas such as Bangladesh. Since its breakout in the 1960s, dengue fever has been endemic in Bangladesh, with the highest concentration of infections in the capital, Dhaka. This study aims to develop a machine learning model that can use relevant information about the factors that cause Dengue outbreaks within a geographic region. To predict dengue cases in 11 different districts of Bangladesh, we created a DengueBD dataset and employed two machine learning algorithms, Multiple Linear Regression (MLR) and Support Vector Regression (SVR). This research also explores the correlation among environmental factors like temperature, rainfall, and humidity with the rise and decline trend of Dengue cases in different cities of Bangladesh. The entire dataset was divided into an 80:20 ratio, with 80 percent used for training and 20% used for testing. The research findings imply that, for both the MLR with 67% accuracy along with Mean Absolute Error (MAE) of 4.57 and SVR models with 75% accuracy along with Mean Absolute Error (MAE) of 4.95, the number of dengue cases reduces throughout the winter season in the country and increases mainly during the rainy season in the next ten months, from August 2021 to May 2022. Importantly, Dhaka, Bangladesh's capital, will see the maximum number of dengue patients during this period. Overall, the results of this data-driven analysis show that machine learning algorithms have enormous potential for predicting dengue epidemics.
Collapse
Affiliation(s)
- Samrat Kumar Dey
- School of Science and Technology (SST), Bangladesh Open University (BOU), Gazipur, Bangladesh
| | - Md. Mahbubur Rahman
- Department of Computer Science and Engineering (CSE), Military Institute of Science and Technology (MIST), Dhaka, Bangladesh
| | - Arpita Howlader
- Department of Computer and Communication Engineering (CCE), Patuakhali Science and Technology University (PSTU), Dumki, Patuakhali, Bangladesh
| | - Umme Raihan Siddiqi
- Department of Physiology, Shaheed Suhrawardy Medical College (ShSMC), Dhaka, Bangladesh
| | | | - Rownak Borhan
- Department of Computer Science and Engineering (CSE), Dhaka International University (DIU), Dhaka, Bangladesh
| | - Elias Ur Rahman
- Department of Computer Science and Engineering (CSE), Dhaka International University (DIU), Dhaka, Bangladesh
| |
Collapse
|
29
|
Exploring the predictive capability of machine learning models in identifying foot and mouth disease outbreak occurrences in cattle farms in an endemic setting of Thailand. Prev Vet Med 2022; 207:105706. [DOI: 10.1016/j.prevetmed.2022.105706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 06/09/2022] [Accepted: 07/01/2022] [Indexed: 11/20/2022]
|
30
|
Annan E, Guo J, Angulo-Molina A, Yaacob WFW, Aghamohammadi N, C Guetterman T, Yavaşoglu Sİ, Bardosh K, Dom NC, Zhao B, Lopez-Lemus UA, Khan L, Nguyen USDT, Haque U. Community acceptability of dengue fever surveillance using unmanned aerial vehicles: A cross-sectional study in Malaysia, Mexico, and Turkey. Travel Med Infect Dis 2022; 49:102360. [PMID: 35644475 DOI: 10.1016/j.tmaid.2022.102360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 05/01/2022] [Accepted: 05/19/2022] [Indexed: 11/29/2022]
Abstract
Surveillance is a critical component of any dengue prevention and control program. There is an increasing effort to use drones in mosquito control surveillance. Due to the novelty of drones, data are scarce on the impact and acceptance of their use in the communities to collect health-related data. The use of drones raises concerns about the protection of human privacy. Here, we show how willingness to be trained and acceptance of drone use in tech-savvy communities can help further discussions in mosquito surveillance. A cross-sectional study was conducted in Malaysia, Mexico, and Turkey to assess knowledge of diseases caused by Aedes mosquitoes, perceptions about drone use for data collection, and acceptance of drones for Aedes mosquito surveillance around homes. Compared with people living in Turkey, Mexicans had 14.3 (p < 0.0001) times higher odds and Malaysians had 4.0 (p = 0.7030) times the odds of being willing to download a mosquito surveillance app. Compared to urban dwellers, rural dwellers had 1.56 times the odds of being willing to be trained. There is widespread community support for drone use in mosquito surveillance and this community buy-in suggests a potential for success in mosquito surveillance using drones. A successful surveillance and community engagement system may be used to monitor a variety of mosquito spp. Future research should include qualitative interview data to add context to these findings.
Collapse
Affiliation(s)
- Esther Annan
- Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA.
| | - Jinghui Guo
- Department of Computer Science, The University of Texas at Dallas, Richardson, TX, 75080, USA
| | - Aracely Angulo-Molina
- Department of Chemical and Biological Sciences, University of Sonora, Hermosillo, 83000, Sonora, Mexico
| | - Wan Fairos Wan Yaacob
- Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Cawangan Kelantan, Kampus Kota Bharu, Lembah Sireh, 15050, Kota Bharu, Kelantan, Malaysia; Institute for Big Data Analytics and Artificial Intelligence (IBDAAI), Kompleks Al-Khawarizmi, Universiti Teknologi MARA, 40450, Shah Alam, Selangor, Malaysia
| | - Nasrin Aghamohammadi
- Centre for Epidemiology and Evidence-Based Practice, Department of Social and Preventive Medicine, Faculty of Medicine, University of Malaya, Kuala Lumpur, 50603, Malaysia
| | | | - Sare İlknur Yavaşoglu
- Department of Biology, Faculty of Science and Arts, Aydın Adnan Menderes University, Aydın, 09010, Turkey
| | - Kevin Bardosh
- Center for One Health Research, School of Public Health, University of Washington, USA
| | - Nazri Che Dom
- Faculty of Health Sciences, Universiti Teknologi MARA Cawangan Selangor, Selangor, Malaysia
| | - Bingxin Zhao
- Department of Statistics, Purdue University, 250 N. University St, West Lafayette, IN, 47907, USA
| | - Uriel A Lopez-Lemus
- Department of Health Sciences, Center for Biodefense and Global Infectious Diseases, Colima, 28078, Mexico
| | - Latifur Khan
- Department of Computer Science, The University of Texas at Dallas, Richardson, TX, 75080, USA
| | - Uyen-Sa D T Nguyen
- Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
| | - Ubydul Haque
- Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
| |
Collapse
|
31
|
Baharom M, Ahmad N, Hod R, Abdul Manaf MR. Dengue Early Warning System as Outbreak Prediction Tool: A Systematic Review. Healthc Policy 2022; 15:871-886. [PMID: 35535237 PMCID: PMC9078425 DOI: 10.2147/rmhp.s361106] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 04/16/2022] [Indexed: 12/01/2022] Open
Abstract
Early warning system (EWS) for vector-borne diseases is incredibly complex due to numerous factors originating from human, environmental, vector and the disease itself. Dengue EWS aims to collect data that leads to prompt decision-making processes that trigger disease intervention strategies to minimize the impact on a specific population. Dengue EWS may have a similar structural design, functions, and analytical approaches but different performance and ability to predict outbreaks. Hence, this review aims to summarise and discuss the evidence of different EWSs, their performance, and their ability to predict dengue outbreaks. A systematic literature search was performed of four primary databases: Scopus, Web of Science, Ovid MEDLINE, and EBSCOhost. Eligible articles were evaluated using a checklist for assessing the quality of the studies. A total of 17 studies were included in this systematic review. All EWS models demonstrated reasonably good predictive abilities to predict dengue outbreaks. However, the accuracy of their predictions varied greatly depending on the model used and the data quality. The reported sensitivity ranged from 50 to 100%, while specificity was 74 to 94.7%. A range between 70 to 96.3% was reported for prediction model accuracy and 43 to 86% for PPV. Overall, meteorological alarm indicators (temperatures and rainfall) were the most frequently used and displayed the best performing indicator. Other potential alarm indicators are entomology (female mosquito infection rate), epidemiology, population and socioeconomic factors. EWS is an essential tool to support district health managers and national health planners to mitigate or prevent disease outbreaks. This systematic review highlights the benefits of integrating several epidemiological tools focusing on incorporating climatic, environmental, epidemiological and socioeconomic factors to create an early warning system. The early warning system relies heavily on the country surveillance system. The lack of timely and high-quality data is critical for developing an effective EWS.
Collapse
Affiliation(s)
- Mazni Baharom
- Department of Community Health, Faculty of Medicine, Universiti Kebangsaan Malaysia, Bandar Tun Razak, Kuala Lumpur, 56000, Malaysia
| | - Norfazilah Ahmad
- Department of Community Health, Faculty of Medicine, Universiti Kebangsaan Malaysia, Bandar Tun Razak, Kuala Lumpur, 56000, Malaysia
- Correspondence: Norfazilah Ahmad, Department of Community Health, Faculty of Medicine, Universiti Kebangsaan Malaysia, Bandar Tun Razak, Kuala Lumpur, 56000, Malaysia, Tel +60391458781, Fax +60391456670, Email
| | - Rozita Hod
- Department of Community Health, Faculty of Medicine, Universiti Kebangsaan Malaysia, Bandar Tun Razak, Kuala Lumpur, 56000, Malaysia
| | - Mohd Rizal Abdul Manaf
- Department of Community Health, Faculty of Medicine, Universiti Kebangsaan Malaysia, Bandar Tun Razak, Kuala Lumpur, 56000, Malaysia
| |
Collapse
|
32
|
Payus AO, Ibrahim A, Liew Sat Lin C, Hui Jan T. Sensory Predominant Guillain-Barré Syndrome Concomitant with Dengue Infection: A Case Report. Case Rep Neurol 2022; 14:281-285. [PMID: 35949203 PMCID: PMC9251453 DOI: 10.1159/000524865] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 04/26/2022] [Indexed: 12/05/2022] Open
Abstract
Guillain-Barre syndrome is an acute demyelinating polyneuropathy disease which is autoimmune in nature and usually follows gastrointestinal or respiratory infections. Dengue fever is however not a common trigger to the condition. Here, we report a patient who developed sensory predominant demyelinating polyradiculopathy during febrile phase of dengue fever. It was later confirmed with serology test and nerve conduction study. He was successfully treated with intravenous immunoglobulin and discharged home well. The purpose of this case report is to highlight that Guillain-Barré syndrome can occur as an uncommon neurological complication of dengue fever which can occur during any phase of the illness.
Collapse
Affiliation(s)
- Alvin Oliver Payus
- Faculty of Medicine and Health Science, Universiti Malaysia Sabah (UMS), Kota Kinabalu, Malaysia
| | - Azliza Ibrahim
- Department of Neurology, Hospital Pengajar Universiti Putra Malaysia, Serdang, Malaysia
| | - Constance Liew Sat Lin
- Faculty of Medicine and Health Science, Universiti Malaysia Sabah (UMS), Kota Kinabalu, Malaysia
| | - Tan Hui Jan
- Department of Internal Medicine, Universiti Kebangsaan Malaysia Medical Centre (UKMMC), Kuala Lumpur, Malaysia
| |
Collapse
|
33
|
Li Z, Gurgel H, Xu L, Yang L, Dong J. Improving Dengue Forecasts by Using Geospatial Big Data Analysis in Google Earth Engine and the Historical Dengue Information-Aided Long Short Term Memory Modeling. BIOLOGY 2022; 11:biology11020169. [PMID: 35205036 PMCID: PMC8869738 DOI: 10.3390/biology11020169] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 01/04/2022] [Accepted: 01/17/2022] [Indexed: 11/26/2022]
Abstract
Simple Summary Forecasting dengue cases often face challenges from (1) time-effectiveness due to time-consuming satellite data downloading and processing, (2) weak spatial representation due to data dependence on administrative unit-based statistics or weather station-based observations, and (3) stagnant accuracy without historical dengue cases. With the advance of the geospatial big data cloud computing in Google Earth Engine and deep learning, this study proposed an efficient framework of dengue prediction at an epidemiological week basis using geospatial big data analysis in Google Earth Engine and Long Short Term Memory modeling. We focused on the dengue epidemics in the Federal District of Brazil during 2007–2019. Based on Google Earth Engine and epidemiological calendar, we computed the weekly composite for each dengue driving factor, and spatially aggregated the pixel values into dengue transmission areas to generate the time series of driving factors. A multi-step-ahead Long Short Term Memory modeling was used, and the time-differenced natural log-transformed dengue cases and the time series of driving factors were considered as outcomes and explantary factors, respectively, with two modeling scenarios (with and without historical cases). The performance is better when historical cases were used, and the 5-weeks-ahead forecast has the best performance. Abstract Timely and accurate forecasts of dengue cases are of great importance for guiding disease prevention strategies, but still face challenges from (1) time-effectiveness due to time-consuming satellite data downloading and processing, (2) weak spatial representation capability due to data dependence on administrative unit-based statistics or weather station-based observations, and (3) stagnant accuracy without the application of historical case information. Geospatial big data, cloud computing platforms (e.g., Google Earth Engine, GEE), and emerging deep learning algorithms (e.g., long short term memory, LSTM) provide new opportunities for advancing these efforts. Here, we focused on the dengue epidemics in the urban agglomeration of the Federal District of Brazil (FDB) during 2007–2019. A new framework was proposed using geospatial big data analysis in the Google Earth Engine (GEE) platform and long short term memory (LSTM) modeling for dengue case forecasts over an epidemiological week basis. We first defined a buffer zone around an impervious area as the main area of dengue transmission by considering the impervious area as a human-dominated area and used the maximum distance of the flight range of Aedes aegypti and Aedes albopictus as a buffer distance. Those zones were used as units for further attribution analyses of dengue epidemics by aggregating the pixel values into the zones. The near weekly composite of potential driving factors was generated in GEE using the epidemiological weeks during 2007–2019, from the relevant geospatial data with daily or sub-daily temporal resolution. A multi-step-ahead LSTM model was used, and the time-differenced natural log-transformed dengue cases were used as outcomes. Two modeling scenarios (with and without historical dengue cases) were set to examine the potential of historical information on dengue forecasts. The results indicate that the performance was better when historical dengue cases were used and the 5-weeks-ahead forecast had the best performance, and the peak of a large outbreak in 2019 was accurately forecasted. The proposed framework in this study suggests the potential of the GEE platform, the LSTM algorithm, as well as historical information for dengue risk forecasting, which can easily be extensively applied to other regions or globally for timely and practical dengue forecasts.
Collapse
Affiliation(s)
- Zhichao Li
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; (Z.L.); (L.Y.)
| | - Helen Gurgel
- Department of Geography, University of Brasilia (UnB), Brasilia 70910-900, Brazil;
| | - Lei Xu
- Vanke School of Public Health, Tsinghua University, Beijing 100084, China;
| | - Linsheng Yang
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; (Z.L.); (L.Y.)
| | - Jinwei Dong
- Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; (Z.L.); (L.Y.)
- Correspondence:
| |
Collapse
|
34
|
Chattopadhyay AK, Chattopadhyay S. VIRDOCD: A VIRtual DOCtor to predict dengue fatality. EXPERT SYSTEMS 2022; 39. [DOI: 10.1111/exsy.12796] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Accepted: 08/06/2021] [Indexed: 02/05/2023]
Abstract
AbstractClinicians make routine diagnosis by scrutinizing patients' medical signs and symptoms, a skill popularly referred to as ‘Clinical Eye’. This skill evolves through trial‐and‐error and improves with time. The success of the therapeutic regime relies largely on the accuracy of interpretation of such sign‐symptoms, analysing which a clinician assesses the severity of the illness. The present study is an attempt to propose a complementary medical front by mathematically modelling the ‘Clinical Eye’ of a VIRtual DOCtor, using statistical and machine intelligence tools (SMI), to analyse Dengue epidemic infected patients (100 case studies with 11 weighted sign‐symptoms). The SMI in VIRDOCD reads medical data and translates these into a vector comprising multiple linear regression (MLR) coefficients to predict infection severity grades of dengue patients that clone the clinician's experience‐based assessment. Risk managed through ANOVA, the dengue severity grade prediction accuracy from VIRDOCD is found higher (ca 75%) than conventional clinical practice (ca 71.4%, mean accuracy profile assessed by a team of 10 senior consultants). Free of human errors and capable of deciphering even minute differences from almost identical symptoms (to the Clinical eye), VIRDOCD is uniquely individualized in its decision‐making ability. The algorithm has been validated against Random Forest classification (RF, ca 63%), another regression‐based classifier similar to MLR that can be trained through supervised learning. We find that MLR‐based VIRDOCD is superior to RF in predicting the grade of Dengue morbidity. VIRDOCD can be further extended to analyse other epidemic infections, such as COVID‐19.
Collapse
|
35
|
Zhang T, Rabhi F, Behnaz A, Chen X, Paik HY, Yao L, MacIntyre CR. Use of automated machine learning for an outbreak risk prediction tool. INFORMATICS IN MEDICINE UNLOCKED 2022. [DOI: 10.1016/j.imu.2022.101121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
36
|
Patil S, Pandya S. Forecasting Dengue Hotspots Associated With Variation in Meteorological Parameters Using Regression and Time Series Models. Front Public Health 2021; 9:798034. [PMID: 34900929 PMCID: PMC8661059 DOI: 10.3389/fpubh.2021.798034] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 11/04/2021] [Indexed: 11/13/2022] Open
Abstract
For forecasting the spread of dengue, monitoring climate change and its effects specific to the disease is necessary. Dengue is one of the most rapidly spreading vector-borne infectious diseases. This paper proposes a forecasting model for predicting dengue incidences considering climatic variability across nine cities of Maharashtra state of India over 10 years. The work involves the collection of five climatic factors such as mean minimum temperature, mean maximum temperature, relative humidity, rainfall, and mean wind speed for 10 years. Monthly incidences of dengue for the same locations are also collected. Different regression models such as random forest regression, decision trees regression, support vector regress, multiple linear regression, elastic net regression, and polynomial regression are used. Time-series forecasting models such as holt's forecasting, autoregressive, Moving average, ARIMA, SARIMA, and Facebook prophet are implemented and compared to forecast the dengue outbreak accurately. The research shows that humidity and mean maximum temperature are the major climate factors and exhibit strong positive and negative correlation, respectively, with dengue incidences for all locations of Maharashtra state. Mean minimum temperature and rainfall are moderately positively correlated with dengue incidences. Mean wind speed is a less significant factor and is weakly negatively correlated with dengue incidences. Root mean square error (RMSE), mean absolute error (MAE), and R square error (R 2) evaluation metrics are used to compare the performance of the prediction model. Random Forest Regression is the best-fit regression model for five out of nine cities, while Support Vector Regression is for two cities. Facebook Prophet Model is the best fit time series forecasting model for six out of nine cities. Based on the prediction, Mumbai, Thane, Nashik, and Pune are the high-risk regions, especially in August, September, and October. The findings exhibit an effective early warning system that would predict the outbreak of other infectious diseases. It will help the relevant authorities to take accurate preventive measures.
Collapse
Affiliation(s)
- Seema Patil
- Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
| | - Sharnil Pandya
- Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
| |
Collapse
|
37
|
Rahman M, Pientong C, Zafar S, Ekalaksananan T, Paul RE, Haque U, Rocklöv J, Overgaard HJ. Mapping the spatial distribution of the dengue vector Aedes aegypti and predicting its abundance in northeastern Thailand using machine-learning approach. One Health 2021; 13:100358. [PMID: 34934797 PMCID: PMC8661047 DOI: 10.1016/j.onehlt.2021.100358] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 12/02/2021] [Accepted: 12/02/2021] [Indexed: 10/19/2022] Open
Abstract
BACKGROUND Mapping the spatial distribution of the dengue vector Aedes (Ae.) aegypti and accurately predicting its abundance are crucial for designing effective vector control strategies and early warning tools for dengue epidemic prevention. Socio-ecological and landscape factors influence Ae. aegypti abundance. Therefore, we aimed to map the spatial distribution of female adult Ae. aegypti and predict its abundance in northeastern Thailand based on socioeconomic, climate change, and dengue knowledge, attitude and practices (KAP) and/or landscape factors using machine learning (ML)-based system. METHOD A total of 1066 females adult Ae. aegypti were collected from four villages in northeastern Thailand during January-December 2019. Information on household socioeconomics, KAP regarding climate change and dengue, and satellite-based landscape data were also acquired. Geographic information systems (GIS) were used to map the household-based spatial distribution of female adult Ae. aegypti abundance (high/low). Five popular supervised learning models, logistic regression (LR), support vector machine (SVM), k-nearest neighbor (kNN), artificial neural network (ANN), and random forest (RF), were used to predict females adult Ae. aegypti abundance (high/low). The predictive accuracy of each modeling technique was calculated and evaluated. Important variables for predicting female adult Ae. aegypti abundance were also identified using the best-fitted model. RESULTS Urban areas had higher abundance of female adult Ae. aegypti compared to rural areas. Overall, study respondents in both urban and rural areas had inadequate KAP regarding climate change and dengue. The average landscape factors per household in urban areas were rice crop (47.4%), natural tree cover (17.8%), built-up area (13.2%), permanent wetlands (21.2%), and rubber plantation (0%), and the corresponding figures for rural areas were 12.1, 2.0, 38.7, 40.1 and 0.1% respectively. Among all assessed models, RF showed the best prediction performance (socioeconomics: area under curve, AUC = 0.93, classification accuracy, CA = 0.86, F1 score = 0.85; KAP: AUC = 0.95, CA = 0.92, F1 = 0.90; landscape: AUC = 0.96, CA = 0.89, F1 = 0.87) for female adult Ae. aegypti abundance. The combined influences of all factors further improved the predictive accuracy in RF model (socioeconomics + KAP + landscape: AUC = 0.99, CA = 0.96 and F1 = 0.95). Dengue prevention practices were shown to be the most important predictor in the RF model for female adult Ae. aegypti abundance in northeastern Thailand. CONCLUSION The RF model is more suitable for the prediction of Ae. aegypti abundance in northeastern Thailand. Our study exemplifies that the application of GIS and machine learning systems has significant potential for understanding the spatial distribution of dengue vectors and predicting its abundance. The study findings might help optimize vector control strategies, future mosquito suppression, prediction and control strategies of epidemic arboviral diseases (dengue, chikungunya, and Zika). Such strategies can be incorporated into One Health approaches applying transdisciplinary approaches considering human-vector and agro-environmental interrelationships.
Collapse
Key Words
- ANN, Artificial neural network
- AUC, Area under curve
- Aedes aegypti
- CA, Classification accuracy.
- DENV, Dengue virus
- Dengue
- Early warning
- GIS, Geographic information systems
- HCI, Household crowding index
- KAP, Knowledge, attitude, and practice
- LR, logistic regression
- ML, Machine learning
- PCI, Premise condition index
- Prediction
- RF, Random forest
- SES, Socioeconomic status
- SVM, Support vector machine
- Supervised learning
- kNN, k-nearest neighbor
Collapse
Affiliation(s)
- M.S. Rahman
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Department of Statistics, Begum Rokeya University, Rangpur, Rangpur-5404, Bangladesh
| | - Chamsai Pientong
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- HPV & EBV and Carcinogenesis Research Group, Khon Kaen University, Khon Kaen, Thailand
| | - Sumaira Zafar
- Environmental Engineering and Management Program, Asian Institute of Technology, Pathumthani, Thailand
| | - Tipaya Ekalaksananan
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- HPV & EBV and Carcinogenesis Research Group, Khon Kaen University, Khon Kaen, Thailand
| | - Richard E. Paul
- Unité de la Génétique Fonctionnelle des Maladies Infectieuses, Institut Pasteur, CNRS UMR 2000, 75015 Paris, France
| | - Ubydul Haque
- Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX 76177, USA
| | - Joacim Rocklöv
- Department of Public Health and Clinical Medicine, Umeå University, 90187 Umeå, Sweden
| | - Hans J. Overgaard
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Faculty of Science and Technology, Norwegian University of Life Sciences, P.O. Box 5003, Ås, Norway
| |
Collapse
|
38
|
Hoyos W, Aguilar J, Toro M. Dengue models based on machine learning techniques: A systematic literature review. Artif Intell Med 2021; 119:102157. [PMID: 34531010 DOI: 10.1016/j.artmed.2021.102157] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 05/08/2021] [Accepted: 08/17/2021] [Indexed: 12/16/2022]
Abstract
BACKGROUND Dengue modeling is a research topic that has increased in recent years. Early prediction and decision-making are key factors to control dengue. This Systematic Literature Review (SLR) analyzes three modeling approaches of dengue: diagnostic, epidemic, intervention. These approaches require models of prediction, prescription and optimization. This SLR establishes the state-of-the-art in dengue modeling, using machine learning, in the last years. METHODS Several databases were selected to search the articles. The selection was made based on Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) methodology. Sixty-four articles were obtained and analyzed to describe their strengths and limitations. Finally, challenges and opportunities for research on machine-learning for dengue modeling were identified. RESULTS Logistic regression was the most used modeling approach for the diagnosis of dengue (59.1%). The analysis of the epidemic approach showed that linear regression (17.4%) is the most used technique within the spatial analysis. Finally, the most used intervention modeling is General Linear Model with 70%. CONCLUSIONS We conclude that cause-effect models may improve diagnosis and understanding of dengue. Models that manage uncertainty can also be helpful, because of low data-quality in healthcare. Finally, decentralization of data, using federated learning, may decrease computational costs and allow model building without compromising data security.
Collapse
Affiliation(s)
- William Hoyos
- Grupo de Investigaciones Microbiológicas y Biomédicas de Córdoba, Universidad de Córdoba, Montería, Colombia; Grupo de Investigación en I+D+i en TIC, Universidad EAFIT, Medellín, Colombia.
| | - Jose Aguilar
- Grupo de Investigación en I+D+i en TIC, Universidad EAFIT, Medellín, Colombia; Centro de Estudios en Microelectrónica y Sistemas Distribuidos, Universidad de Los Andes, Mérida, Venezuela; Universidad de Alcalá, Depto. de Automática, Alcalá de Henares, Spain
| | - Mauricio Toro
- Grupo de Investigación en I+D+i en TIC, Universidad EAFIT, Medellín, Colombia
| |
Collapse
|
39
|
McGough SF, Clemente L, Kutz JN, Santillana M. A dynamic, ensemble learning approach to forecast dengue fever epidemic years in Brazil using weather and population susceptibility cycles. J R Soc Interface 2021; 18:20201006. [PMID: 34129785 PMCID: PMC8205538 DOI: 10.1098/rsif.2020.1006] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Transmission of dengue fever depends on a complex interplay of human, climate and mosquito dynamics, which often change in time and space. It is well known that its disease dynamics are highly influenced by multiple factors including population susceptibility to infection as well as by microclimates: small-area climatic conditions which create environments favourable for the breeding and survival of mosquitoes. Here, we present a novel machine learning dengue forecasting approach, which, dynamically in time and space, identifies local patterns in weather and population susceptibility to make epidemic predictions at the city level in Brazil, months ahead of the occurrence of disease outbreaks. Weather-based predictions are improved when information on population susceptibility is incorporated, indicating that immunity is an important predictor neglected by most dengue forecast models. Given the generalizability of our methodology to any location or input data, it may prove valuable for public health decision-making aimed at mitigating the effects of seasonal dengue outbreaks in locations globally.
Collapse
Affiliation(s)
- Sarah F McGough
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA 02115, USA.,Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA 02115, USA
| | - Leonardo Clemente
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA 02115, USA.,Tecnológico de Monterrey, 64849 Monterrey, Nuevo León, Mexico
| | - J Nathan Kutz
- Department of Applied Mathematics, University of Washington, Seattle, WA 98195, USA
| | - Mauricio Santillana
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA 02115, USA.,Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA 02115, USA.,Department of Pediatrics, Harvard Medical School, Harvard University, Boston, MA 02115, USA
| |
Collapse
|
40
|
Thiruchelvam L, Dass SC, Asirvadam VS, Daud H, Gill BS. Determine neighboring region spatial effect on dengue cases using ensemble ARIMA models. Sci Rep 2021; 11:5873. [PMID: 33712664 PMCID: PMC7955078 DOI: 10.1038/s41598-021-84176-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 02/11/2021] [Indexed: 01/06/2023] Open
Abstract
The state of Selangor, in Malaysia consist of urban and peri-urban centres with good transportation system, and suitable temperature levels with high precipitations and humidity which make the state ideal for high number of dengue cases, annually. This study investigates if districts within the Selangor state do influence each other in determining pattern of dengue cases. Study compares two different models; the Autoregressive Integrated Moving Average (ARIMA) and Ensemble ARIMA models, using the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) measurement to gauge their performance tools. ARIMA model is developed using the epidemiological data of dengue cases, whereas ensemble ARIMA incorporates the neighbouring regions' dengue models as the exogenous variable (X), into traditional ARIMA model. Ensemble ARIMA models have better model fit compared to the basic ARIMA models by incorporating neighbuoring effects of seven districts which made of state of Selangor. The AIC and BIC values of ensemble ARIMA models to be smaller compared to traditional ARIMA counterpart models. Thus, study concludes that pattern of dengue cases for a district is subject to spatial effects of its neighbouring districts and number of dengue cases in the surrounding areas.
Collapse
Affiliation(s)
- Loshini Thiruchelvam
- Insititute of Autonomous Systems, Universiti Teknologi PETRONAS, Seri Iskandar, Perak, Malaysia
| | - Sarat Chandra Dass
- School of Mathematical and Computer Sciences, Heriot-Watt University Malaysia, Putrajaya, Malaysia
| | - Vijanth Sagayan Asirvadam
- Department of Electric and Electronic Engineering, Universiti Teknologi PETRONAS, Seri Iskandar, Perak, Malaysia.
| | - Hanita Daud
- Department of Fundamental and Applied Sciences, Universiti Teknologi PETRONAS, Seri Iskandar, Perak, Malaysia
| | - Balvinder Singh Gill
- Institute for Medical Research (IMR), Ministry of Health, Kuala Lumpur, Malaysia
| |
Collapse
|