1
|
Kumar P, Chaudhary B, Arya P, Chauhan R, Devi S, Parejiya PB, Gupta MM. Advanced Artificial Intelligence Technologies Transforming Contemporary Pharmaceutical Research. Bioengineering (Basel) 2025; 12:363. [PMID: 40281723 PMCID: PMC12024664 DOI: 10.3390/bioengineering12040363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2025] [Revised: 03/02/2025] [Accepted: 03/05/2025] [Indexed: 04/29/2025] Open
Abstract
One area of study within machine learning and artificial intelligence (AI) seeks to create computer programs with intelligence that can mimic human focal processes in order to produce results. This technique includes data collection, effective data usage system development, conclusion illustration, and arrangements. Analysis algorithms that are learning to mimic human cognitive activities are the most widespread application of AI. Artificial intelligence (AI) studies have proliferated, and the field is quickly beginning to understand its potential impact on medical services and investigation. This review delves deeper into the pros and cons of AI across the healthcare and pharmaceutical research industries. Research and review articles published throughout the last few years were selected from PubMed, Google Scholar, and Science Direct, using search terms like 'artificial intelligence', 'drug discovery', 'pharmacy research', 'clinical trial', etc. This article provides a comprehensive overview of how artificial intelligence (AI) is being used to diagnose diseases, treat patients digitally, find new drugs, and predict when outbreaks or pandemics may occur. In artificial intelligence, neural networks and deep learning are some of the most popular tools; in clinical research, Bayesian non-parametric approaches hold promise for better results, while smartphones and the processing of natural languages are employed in recognizing patients and trial monitoring. Seasonal flu, Ebola, Zika, COVID-19, tuberculosis, and outbreak predictions were made using deep computation and artificial intelligence. The academic world is hopeful that AI development will lead to more efficient and less expensive medical and pharmaceutical investigations and better public services.
Collapse
Affiliation(s)
- Parveen Kumar
- Department of Pharmaceutics, NIMS Institute of Pharmacy, NIMS University, Jaipur 303121, Rajasthan, India;
| | - Benu Chaudhary
- Shri Ram College of Pharmacy, Karnal 132001, Haryana, India; (B.C.); (P.A.)
| | - Preeti Arya
- Shri Ram College of Pharmacy, Karnal 132001, Haryana, India; (B.C.); (P.A.)
| | - Rupali Chauhan
- Chitkara College of Pharmacy, Chitkara University, Rajpura 140401, Punjab, India; (R.C.); (S.D.)
| | - Sushma Devi
- Chitkara College of Pharmacy, Chitkara University, Rajpura 140401, Punjab, India; (R.C.); (S.D.)
| | - Punit B. Parejiya
- Department of Pharmaceutics, K.B. Institute of Pharmaceutical Education and Research, Kadi Sarva Vishwavidyalaya, Gandhinagar 382 023, Gujarat, India;
| | - Madan Mohan Gupta
- Department of Pharmaceutics, NIMS Institute of Pharmacy, NIMS University, Jaipur 303121, Rajasthan, India;
| |
Collapse
|
2
|
Zeng Q, Zhou H, Long J, Jian Y, Feng L, Hu L, Zhou H, Zhu W, Yuan Z, Chen Y, Yi G. Developing Machine Learning Models Based on Clinical Manifestations to Predict Influenza - Chongqing Municipality, China, 2022-2023. China CDC Wkly 2025; 7:363-367. [PMID: 40226218 PMCID: PMC11983152 DOI: 10.46234/ccdcw2025.059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Accepted: 01/27/2025] [Indexed: 04/15/2025] Open
Abstract
What is already known about this topic? Current evidence regarding which clinical manifestations best predict influenza requires refinement, particularly considering regional variations in disease presentation and their importance for early diagnosis and surveillance. What is added by this report? The optimal machine learning model identified key influenza predictors, including epidemiological characteristics, critical symptoms and signs, and age. Based on this model, we introduced a new influenza-like illness (ILI) definition characterized by fever (≥37.9 °C) with either cough or rhinorrhea. What are the implications for public health practice? These findings provide evidence-based clinical manifestations for influenza prediction and offer an optimized definition of ILI for improved surveillance and early detection.
Collapse
Affiliation(s)
- Qianqian Zeng
- The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Hongyu Zhou
- Chongqing Medical University, Chongqing, China
| | - Jiang Long
- Chongqing Center for Disease Control and Prevention, Chongqing, China
| | - Yi Jian
- Cloudwalk Technology, Chongqing, China
| | - Li Feng
- People's Hospital of Chongqing Banan District, Chongqing, China
| | - Liangbo Hu
- The Affiliated Yongchuan Hospital of Chongqing Medical University, Chongqing, China
| | - Hongyu Zhou
- The Affiliated Yongchuan Hospital of Chongqing Medical University, Chongqing, China
| | - Weimin Zhu
- The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Zhe Yuan
- The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Yajuan Chen
- The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Guangzhao Yi
- The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| |
Collapse
|
3
|
Luo J, Wang X, Fan X, He Y, Du X, Chen YQ, Zhao Y. A novel graph neural network based approach for influenza-like illness nowcasting: exploring the interplay of temporal, geographical, and functional spatial features. BMC Public Health 2025; 25:408. [PMID: 39893390 PMCID: PMC11786584 DOI: 10.1186/s12889-025-21618-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Accepted: 01/24/2025] [Indexed: 02/04/2025] Open
Abstract
BACKGROUND Accurate and timely monitoring of influenza prevalence is essential for effective healthcare interventions. This study proposes a graph neural network (GNN)-based method to address the issue of cross-regional connectivity in predicting influenza outbreaks, aiming to achieve real-time and accurate influenza prediction. METHODS We proposed a GNN-based approach with dual topology processing, capturing both geographical and socio-economic associations among counties/cities. The model inputs consist of weekly matrices of influenza-like illness (ILI) rates at city level, along with geographical topology and functional topology. The model construction involves temporal feature extraction through 1-dimensional gated causal convolution, spatial feature embedding through graph convolution, and additional adjustments to enhance spatiotemporal interaction exploration. Evaluation metrics include four commonly used measures: root mean square error (RMSE), mean absolute percentage error (MAPE), mean absolute error (MAE), and Pearson correlation (Corr). RESULTS Our approach for predicting influenza outbreaks achieves competitive performance on real-world datasets (Corr = 0.8202; RMSE = 0.0017; MAE = 0.0013; MAPE = 0.0966), surpassing established baselines. Notably, our approach exhibits excellent capability in accurately and timely capturing short-term influenza outbreaks during the flu season, outperforming competitors across all evaluation metrics. CONCLUSION The incorporation of dual topology processing and the subsequent fusion mechanism allows the model to explore in-depth spatiotemporal feature interactions. Demonstrating superior performance, our approach shows great potential in early detection of flu trends for facilitating public health decisions and resource optimization.
Collapse
Affiliation(s)
- Jiajia Luo
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, 518107, Guangdong, China
| | - Xuan Wang
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, 518107, Guangdong, China
| | - Xiaomao Fan
- College of Big Data and Internet, Shenzhen Technology University, Shenzhen, 518118, Guangdong, China
| | - Yuxin He
- College of Urban Transportation and Logistics, Shenzhen Technology University, Shenzhen, 518118, Guangdong, China
| | - Xiangjun Du
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, 518107, Guangdong, China
| | - Yao-Qing Chen
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, 518107, Guangdong, China
| | - Yang Zhao
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, 518107, Guangdong, China.
| |
Collapse
|
4
|
Luo C, Yang Y, Jiang C, Lv A, Zuo W, Ye Y, Ke J. Influenza and the gut microbiota: A hidden therapeutic link. Heliyon 2024; 10:e37661. [PMID: 39315196 PMCID: PMC11417228 DOI: 10.1016/j.heliyon.2024.e37661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 07/31/2024] [Accepted: 09/07/2024] [Indexed: 09/25/2024] Open
Abstract
Background The extensive community of gut microbiota significantly influences various biological functions throughout the body, making its characterization a focal point in biomedicine research. Over the past few decades, studies have revealed a potential link between specific gut bacteria, their associated metabolic pathways, and influenza. Bacterial metabolites can communicate directly or indirectly with organs beyond the gut via the intestinal barrier, thereby impacting the physiological functions of the host. As the microbiota increasingly emerges as a 'gut signature' in influenza, gaining a deeper understanding of its role may offer new insights into its pathophysiological relevance and open avenues for novel therapeutic targets. In this Review, we explore the differences in gut microbiota between healthy individuals and those with influenza, the relationship between gut microbiota metabolites and influenza, and potential strategies for preventing and treating influenza through the regulation of gut microbiota and its metabolites, including fecal microbiota transplantation and microecological preparations. Methods We utilized PubMed and Web of Science as our search databases, employing keywords such as "influenza," "gut microbiota," "traditional Chinese medicine," "metabolites," "prebiotics," "probiotics," and "machine learning" to retrieve studies examining the potential therapeutic connections between the modulation of gut microbiota and its metabolites in the treatment of influenza. The search encompassed literature from the inception of the databases up to December 2023. Results Fecal microbiota transplantation (FMT), microbial preparations (probiotics and prebiotics), and traditional Chinese medicine have unique advantages in regulating intestinal microbiota and its metabolites to improve influenza outcomes. The primary mechanism involves increasing beneficial intestinal bacteria such as Bacteroidetes and Bifidobacterium while reducing harmful bacteria such as Proteobacteria. These interventions act directly or indirectly on metabolites such as short-chain fatty acids (SCFAs), amino acids (AAs), bile acids, and monoamines to alleviate lung inflammation, reduce viral load, and exert anti-influenza virus effects. Conclusion The gut microbiota and its metabolites have direct or indirect therapeutic effects on influenza, presenting broad research potential for providing new directions in influenza research and offering references for clinical prevention and treatment. Future research should focus on identifying key strains, specific metabolites, and immune regulation mechanisms within the gut microbiota to accurately target microbiota interventions and prevent respiratory viral infections such as influenza.
Collapse
Affiliation(s)
- Cheng Luo
- Chengdu University of Traditional Chinese Medicine, Chengdu, 610032, China
| | - Yi Yang
- Hubei Provincial Hospital of Traditional Chinese Medicine, Hubei Academy of Traditional Chinese Medicine, Affiliated Hospital of Hubei University of Traditional Chinese Medicine, Wuhan, 430074, China
| | - Cheng Jiang
- Hubei Provincial Hospital of Traditional Chinese Medicine, Hubei Academy of Traditional Chinese Medicine, Affiliated Hospital of Hubei University of Traditional Chinese Medicine, Wuhan, 430074, China
| | - Anqi Lv
- College of Traditional Chinese Medicine, Hubei University of Chinese Medicine, Wuhan, 430061, China
| | - Wanzhao Zuo
- College of Traditional Chinese Medicine, Hubei University of Chinese Medicine, Wuhan, 430061, China
| | - Yuanhang Ye
- College of Traditional Chinese Medicine, Hubei University of Chinese Medicine, Wuhan, 430061, China
| | - Jia Ke
- Hubei Provincial Hospital of Traditional Chinese Medicine, Hubei Academy of Traditional Chinese Medicine, Affiliated Hospital of Hubei University of Traditional Chinese Medicine, Wuhan, 430074, China
| |
Collapse
|
5
|
Zhu H, Chen S, Qin W, Aynur J, Chen Y, Wang X, Chen K, Xie Z, Li L, Liu Y, Chen G, Ou J, Zheng K. Study on the impact of meteorological factors on influenza in different periods and prediction based on artificial intelligence RF-Bi-LSTM algorithm: to compare the COVID-19 period with the non-COVID-19 period. BMC Infect Dis 2024; 24:878. [PMID: 39198754 PMCID: PMC11360838 DOI: 10.1186/s12879-024-09750-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 08/12/2024] [Indexed: 09/01/2024] Open
Abstract
OBJECTIVE At different times, public health faces various challenges and the degree of intervention measures varies. The research on the impact and prediction of meteorology factors on influenza is increasing gradually, however, there is currently no evidence on whether its research results are affected by different periods. This study aims to provide limited evidence to reveal this issue. METHODS Daily data on influencing factors and influenza in Xiamen were divided into three parts: overall period (phase AB), non-COVID-19 epidemic period (phase A), and COVID-19 epidemic period (phase B). The association between influencing factors and influenza was analysed using generalized additive models (GAMs). The excess risk (ER) was used to represent the percentage change in influenza as the interquartile interval (IQR) of meteorology factors increases. The 7-day average daily influenza cases were predicted using the combination of bi-directional long short memory (Bi-LSTM) and random forest (RF) through multi-step rolling input of the daily multifactor values of the previous 7-day. RESULTS In periods A and AB, air temperature below 22 °C was a risk factor for influenza. However, in phase B, temperature showed a U-shaped effect on it. Relative humidity had a more significant cumulative effect on influenza in phase AB than in phase A (peak: accumulate 14d, AB: ER = 281.54, 95% CI = 245.47 ~ 321.37; A: ER = 120.48, 95% CI = 100.37 ~ 142.60). Compared to other age groups, children aged 4-12 were more affected by pressure, precipitation, sunshine, and day light, while those aged ≥ 13 were more affected by the accumulation of humidity over multiple days. The accuracy of predicting influenza was highest in phase A and lowest in phase B. CONCLUSIONS The varying degrees of intervention measures adopted during different phases led to significant differences in the impact of meteorology factors on influenza and in the influenza prediction. In association studies of respiratory infectious diseases, especially influenza, and environmental factors, it is advisable to exclude periods with more external interventions to reduce interference with environmental factors and influenza related research, or to refine the model to accommodate the alterations brought about by intervention measures. In addition, the RF-Bi-LSTM model has good predictive performance for influenza.
Collapse
Affiliation(s)
- Hansong Zhu
- Fujian Provincial Center for Disease Control and Prevention, Fuzhou, Fujian, 350012, China.
- School of Public Health, Fujian Medical University, Fuzhou, Fujian, 350011, China.
| | - Si Chen
- Fujian Institute of Meteorological Sciences, Fuzhou, Fujian, 350007, China
- Fujian Key Laboratory of Severe Weather, Fuzhou, Fujian, 350007, China
- Key Laboratory of Straits Severe Weather, China Meteorological Administration, Fuzhou, Fujian, 350007, China
| | - Weixia Qin
- The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, 361003, China
| | - Joldosh Aynur
- School of Public Health, Xiamen University, Xiamen, Fujian, 361100, China
| | - Yuyan Chen
- Fujian Provincial Judicial Drug Rehabilitation Hospital, Fuzhou, Fujian, 350007, China
| | - Xiaoying Wang
- School of Public Health, Xiamen University, Xiamen, Fujian, 361100, China
| | - Kaizhi Chen
- Fuzhou University, Fuzhou, Fujian, 350108, China
| | - Zhonghang Xie
- Fujian Provincial Center for Disease Control and Prevention, Fuzhou, Fujian, 350012, China
- School of Public Health, Fujian Medical University, Fuzhou, Fujian, 350011, China
| | - Lingfang Li
- Fujian Provincial Center for Disease Control and Prevention, Fuzhou, Fujian, 350012, China
| | - Yu Liu
- Xiangnan University, Chenzhou, Hunan, 423001, China.
| | - Guangmin Chen
- Fujian Provincial Center for Disease Control and Prevention, Fuzhou, Fujian, 350012, China.
- School of Public Health, Fujian Medical University, Fuzhou, Fujian, 350011, China.
| | - Jianming Ou
- Fujian Provincial Center for Disease Control and Prevention, Fuzhou, Fujian, 350012, China.
- School of Public Health, Fujian Medical University, Fuzhou, Fujian, 350011, China.
| | - Kuicheng Zheng
- Fujian Provincial Center for Disease Control and Prevention, Fuzhou, Fujian, 350012, China.
- School of Public Health, Fujian Medical University, Fuzhou, Fujian, 350011, China.
| |
Collapse
|
6
|
Botz J, Valderrama D, Guski J, Fröhlich H. A dynamic ensemble model for short-term forecasting in pandemic situations. PLOS GLOBAL PUBLIC HEALTH 2024; 4:e0003058. [PMID: 39172923 PMCID: PMC11340948 DOI: 10.1371/journal.pgph.0003058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Accepted: 07/24/2024] [Indexed: 08/24/2024]
Abstract
During the COVID-19 pandemic, many hospitals reached their capacity limits and could no longer guarantee treatment of all patients. At the same time, governments endeavored to take sensible measures to stop the spread of the virus while at the same time trying to keep the economy afloat. Many models extrapolating confirmed cases and hospitalization rate over short periods of time have been proposed, including several ones coming from the field of machine learning. However, the highly dynamic nature of the pandemic with rapidly introduced interventions and new circulating variants imposed non-trivial challenges for the generalizability of such models. In the context of this paper, we propose the use of ensemble models, which are allowed to change in their composition or weighting of base models over time and could thus better adapt to highly dynamic pandemic or epidemic situations. In that regard, we also explored the use of secondary metadata-Google searches-to inform the ensemble model. We tested our approach using surveillance data from COVID-19, Influenza, and hospital syndromic surveillance of severe acute respiratory infections (SARI). In general, we found ensembles to be more robust than the individual models. Altogether we see our work as a contribution to enhance the preparedness for future pandemic situations.
Collapse
Affiliation(s)
- Jonas Botz
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
| | - Diego Valderrama
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
| | - Jannis Guski
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
| | - Holger Fröhlich
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
- Bonn-Aachen International Center for IT, University of Bonn, Bonn, Germany
| |
Collapse
|
7
|
Hinson JS, Zhao X, Klein E, Badaki‐Makun O, Rothman R, Copenhaver M, Smith A, Fenstermacher K, Toerper M, Pekosz A, Levin S. Multisite development and validation of machine learning models to predict severe outcomes and guide decision-making for emergency department patients with influenza. J Am Coll Emerg Physicians Open 2024; 5:e13117. [PMID: 38500599 PMCID: PMC10945311 DOI: 10.1002/emp2.13117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 01/10/2024] [Accepted: 01/25/2024] [Indexed: 03/20/2024] Open
Abstract
Objective Millions of Americans are infected by influenza annually. A minority seek care in the emergency department (ED) and, of those, only a limited number experience severe disease or death. ED clinicians must distinguish those at risk for deterioration from those who can be safely discharged. Methods We developed random forest machine learning (ML) models to estimate needs for critical care within 24 h and inpatient care within 72 h in ED patients with influenza. Predictor data were limited to those recorded prior to ED disposition decision: demographics, ED complaint, medical problems, vital signs, supplemental oxygen use, and laboratory results. Our study population was comprised of adults diagnosed with influenza at one of five EDs in our university health system between January 1, 2017 and May 18, 2022; visits were divided into two cohorts to facilitate model development and validation. Prediction performance was assessed by the area under the receiver operating characteristic curve (AUC) and the Brier score. Results Among 8032 patients with laboratory-confirmed influenza, incidence of critical care needs was 6.3% and incidence of inpatient care needs was 19.6%. The most common reasons for ED visit were symptoms of respiratory tract infection, fever, and shortness of breath. Model AUCs were 0.89 (95% CI 0.86-0.93) for prediction of critical care and 0.90 (95% CI 0.88-0.93) for inpatient care needs; Brier scores were 0.026 and 0.042, respectively. Importantpredictors included shortness of breath, increasing respiratory rate, and a high number of comorbid diseases. Conclusions ML methods can be used to accurately predict clinical deterioration in ED patients with influenza and have potential to support ED disposition decision-making.
Collapse
Affiliation(s)
- Jeremiah S. Hinson
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
- Malone Center for Engineering in HealthcareJohns Hopkins University Whiting School of EngineeringBaltimoreMarylandUSA
| | - Xihan Zhao
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
| | - Eili Klein
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
- One Health TrustWashingtonDistrict of ColumbiaUSA
| | - Oluwakemi Badaki‐Makun
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
- Department of PediatricsJohns Hopkins University School of MedicineBaltimoreMarylandUSA
| | - Richard Rothman
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
| | - Martin Copenhaver
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
| | - Aria Smith
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
- Malone Center for Engineering in HealthcareJohns Hopkins University Whiting School of EngineeringBaltimoreMarylandUSA
| | - Katherine Fenstermacher
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
| | - Matthew Toerper
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
| | - Andrew Pekosz
- Department of Microbiology and ImmunologyJohns Hopkins University School of MedicineBaltimoreMarylandUSA
| | - Scott Levin
- Department of Emergency MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
- Malone Center for Engineering in HealthcareJohns Hopkins University Whiting School of EngineeringBaltimoreMarylandUSA
| |
Collapse
|
8
|
Wang P, Zhang W, Wang H, Shi C, Li Z, Wang D, Luo L, Du Z, Hao Y. Predicting the incidence of infectious diarrhea with symptom surveillance data using a stacking-based ensembled model. BMC Infect Dis 2024; 24:265. [PMID: 38408967 PMCID: PMC10898154 DOI: 10.1186/s12879-024-09138-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 02/14/2024] [Indexed: 02/28/2024] Open
Abstract
BACKGROUND Infectious diarrhea remains a major public health problem worldwide. This study used stacking ensemble to developed a predictive model for the incidence of infectious diarrhea, aiming to achieve better prediction performance. METHODS Based on the surveillance data of infectious diarrhea cases, relevant symptoms and meteorological factors of Guangzhou from 2016 to 2021, we developed four base prediction models using artificial neural networks (ANN), Long Short-Term Memory networks (LSTM), support vector regression (SVR) and extreme gradient boosting regression trees (XGBoost), which were then ensembled using stacking to obtain the final prediction model. All the models were evaluated with three metrics: mean absolute percentage error (MAPE), root mean square error (RMSE), and mean absolute error (MAE). RESULTS Base models that incorporated symptom surveillance data and weekly number of infectious diarrhea cases were able to achieve lower RMSEs, MAEs, and MAPEs than models that added meteorological data and weekly number of infectious diarrhea cases. The LSTM had the best prediction performance among the four base models, and its RMSE, MAE, and MAPE were: 84.85, 57.50 and 15.92%, respectively. The stacking ensembled model outperformed the four base models, whose RMSE, MAE, and MAPE were 75.82, 55.93, and 15.70%, respectively. CONCLUSIONS The incorporation of symptom surveillance data could improve the predictive accuracy of infectious diarrhea prediction models, and symptom surveillance data was more effective than meteorological data in enhancing model performance. Using stacking to combine multiple prediction models were able to alleviate the difficulty in selecting the optimal model, and could obtain a model with better performance than base models.
Collapse
Affiliation(s)
- Pengyu Wang
- Department of Medical Statistics, School of Public Health & Center for Health Information Research & Sun Yat-sen Global Health Institute, Sun Yat-sen University, Guangzhou, China
| | - Wangjian Zhang
- Department of Medical Statistics, School of Public Health & Center for Health Information Research & Sun Yat-sen Global Health Institute, Sun Yat-sen University, Guangzhou, China
| | - Hui Wang
- Department of Infectious Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| | - Congxing Shi
- Department of Medical Statistics, School of Public Health & Center for Health Information Research & Sun Yat-sen Global Health Institute, Sun Yat-sen University, Guangzhou, China
| | - Zhiqiang Li
- Department of Medical Statistics, School of Public Health & Center for Health Information Research & Sun Yat-sen Global Health Institute, Sun Yat-sen University, Guangzhou, China
| | - Dahu Wang
- Department of Infectious Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| | - Lei Luo
- Department of Infectious Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China.
| | - Zhicheng Du
- Department of Medical Statistics, School of Public Health & Center for Health Information Research & Sun Yat-sen Global Health Institute, Sun Yat-sen University, Guangzhou, China.
- Guangzhou Joint Research Center for Disease Surveillance and Risk Assessment, Sun Yat-sen University & Guangzhou Center for Disease Control and Prevention, Guangzhou, China.
| | - Yuantao Hao
- Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China.
- Department of Epidemiology & Biostatistics, School of Public Health, Peking University, Beijing, China.
- Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China.
| |
Collapse
|
9
|
Hongliang G, Zhiyao Z, Ahmadianfar I, Escorcia-Gutierrez J, Aljehane NO, Li C. Multi-step influenza forecasting through singular value decomposition and kernel ridge regression with MARCOS-guided gradient-based optimization. Comput Biol Med 2024; 169:107888. [PMID: 38157778 DOI: 10.1016/j.compbiomed.2023.107888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 11/28/2023] [Accepted: 12/18/2023] [Indexed: 01/03/2024]
Abstract
This research delves into the significance of influenza outbreaks in public health, particularly the importance of accurate forecasts using weekly Influenza-like illness (ILI) rates. The present work develops a novel hybrid machine-learning model by combining singular value decomposition with kernel ridge regression (SKRR). In this context, a novel hybrid model known as H-SKRR is developed by combining two robust forecasting approaches, SKRR and ridge regression, which aims to improve multi-step-ahead predictions for weekly ILI rates in Southern and Northern China. The study begins with feature selection via XGBoost in the preprocessing phase, identifying optimal precursor information guided by importance factors. It decomposes the original signal using multivariate variational mode decomposition (MVMD) to address non-stationarity and complexity. H-SKRR is implemented by incorporating significant lagged-time components across sub-components. The aggregated forecasted values from these sub-components generate ILI values for two horizons (i.e., 4-and 7-weekly ahead). Employing the gradient-based optimization (GBO) algorithm fine-tunes model parameters. Furthermore, the deep random vector functional link (dRVFL), Ridge regression, and gated recurrent unit neural network (GRU) models were employed to validate the MVMD-H-SKRR-GBO paradigm's effectiveness. The outcomes, assessed using the MARCOS (Measurement of alternatives and ranking according to compromise solution) method as a multi-criteria decision-making method, highlight the superior accuracy of the MVMD-H-SKRR-GBO model in predicting ILI rates. The results clearly highlight the exceptional performance of the MVMD-H-SKRR-GBO model, with outstanding precision demonstrated by impressive R, RMSE, IA, and U95 % values of 0.946, 0.388, 0.970, and 1.075, respectively, at t + 7.
Collapse
Affiliation(s)
- Guo Hongliang
- College of Information Technology, Jilin Agricultural University, Changchun, 130118, China.
| | - Zhang Zhiyao
- College of Information Technology, Jilin Agricultural University, Changchun, 130118, China.
| | - Iman Ahmadianfar
- Information and Communication Technology Research Group, Scientific Research Center, Al-Ayen University, Thi-Qar, Nasiriyah, 64001, Iraq.
| | - José Escorcia-Gutierrez
- Department of Computational Science and Electronics, Universidad de La Costa, CUC, Barranquilla, 080002, Colombia.
| | - Nojood O Aljehane
- Faculty of Computers and Information Technology, University of Tabuk, Tabuk, Saudi Arabia, Tabuk University, KSA.
| | - Chengye Li
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| |
Collapse
|
10
|
Mellor J, Christie R, Overton CE, Paton RS, Leslie R, Tang M, Deeny S, Ward T. Forecasting influenza hospital admissions within English sub-regions using hierarchical generalised additive models. COMMUNICATIONS MEDICINE 2023; 3:190. [PMID: 38123630 PMCID: PMC10733380 DOI: 10.1038/s43856-023-00424-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 12/06/2023] [Indexed: 12/23/2023] Open
Abstract
BACKGROUND Seasonal influenza places a substantial burden annually on healthcare services. Policies during the COVID-19 pandemic limited the transmission of seasonal influenza, making the timing and magnitude of a potential resurgence difficult to ascertain and its impact important to forecast. METHODS We have developed a hierarchical generalised additive model (GAM) for the short-term forecasting of hospital admissions with a positive test for the influenza virus sub-regionally across England. The model incorporates a multi-level structure of spatio-temporal splines, weekly cycles in admissions, and spatial correlation. Using multiple performance metrics including interval score, coverage, bias, and median absolute error, the predictive performance is evaluated for the 2022-2023 seasonal wave. Performance is measured against autoregressive integrated moving average (ARIMA) and Prophet time series models. RESULTS Across the epidemic phases the hierarchical GAM shows improved performance, at all geographic scales relative to the ARIMA and Prophet models. Temporally, the hierarchical GAM has overall an improved performance at 7 and 14 day time horizons. The performance of the GAM is most sensitive to the flexibility of the smoothing function that measures the national epidemic trend. CONCLUSIONS This study introduces an approach to short-term forecasting of hospital admissions for the influenza virus using hierarchical, spatial, and temporal components. The methodology was designed for the real time forecasting of epidemics. This modelling framework was used across the 2022-2023 winter for healthcare operational planning by the UK Health Security Agency and the National Health Service in England.
Collapse
Affiliation(s)
- Jonathon Mellor
- UK Health Security Agency, Data Analytics and Surveillance, 10 South Colonnade, London, United Kingdom.
| | - Rachel Christie
- UK Health Security Agency, Data Analytics and Surveillance, 10 South Colonnade, London, United Kingdom
| | - Christopher E Overton
- UK Health Security Agency, Data Analytics and Surveillance, 10 South Colonnade, London, United Kingdom
- University of Liverpool, Department of Mathematical Sciences, Liverpool, United Kingdom
| | - Robert S Paton
- UK Health Security Agency, Data Analytics and Surveillance, 10 South Colonnade, London, United Kingdom
| | - Rhianna Leslie
- UK Health Security Agency, Data Analytics and Surveillance, 10 South Colonnade, London, United Kingdom
| | - Maria Tang
- UK Health Security Agency, Data Analytics and Surveillance, 10 South Colonnade, London, United Kingdom
| | - Sarah Deeny
- UK Health Security Agency, Data Analytics and Surveillance, 10 South Colonnade, London, United Kingdom
| | - Thomas Ward
- UK Health Security Agency, Data Analytics and Surveillance, 10 South Colonnade, London, United Kingdom
| |
Collapse
|
11
|
Ondrikova N, Clough H, Douglas A, Vivancos R, Itturiza-Gomara M, Cunliffe N, Harris JP. Comparison of statistical approaches to predicting norovirus laboratory reports before and during COVID-19: insights to inform public health surveillance. Sci Rep 2023; 13:21457. [PMID: 38052922 PMCID: PMC10697939 DOI: 10.1038/s41598-023-48069-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 11/22/2023] [Indexed: 12/07/2023] Open
Abstract
Social distancing interrupted transmission patterns of contact-driven infectious agents such as norovirus during the Covid-19 pandemic. Since routine surveillance of norovirus was additionally disrupted during the pandemic, traditional naïve forecasts that rely only on past public health surveillance data may not reliably represent norovirus activity. This study investigates the use of statistical modelling to predict the number of norovirus laboratory reports in England 4-weeks ahead of time before and during Covid-19 pandemic thus providing insights to inform existing practices in norovirus surveillance in England. We compare the predictive performance from three forecasting approaches that assume different underlying structure of the norovirus data and utilized various external data sources including mobility, air temperature and relative internet searches (Time Series and Regularized Generalized Linear Model, and Quantile Regression Forest). The performance of each approach was evaluated using multiple metrics, including a relative prediction error against the traditional naive forecast of a five-season mean. Our data suggest that all three forecasting approaches improve predictive performance over the naïve forecasts, especially in the 2020/21 season (30-45% relative improvement) when the number of norovirus reports reduced. The improvement ranged from 7 to 22% before the pandemic. However, performance varied: regularized regression incorporating internet searches showed the best forecasting score pre-pandemic and the time series approach achieved the best results post pandemic onset without external data. Overall, our results demonstrate that there is a significant value for public health in considering the adoption of more sophisticated forecasting tools, moving beyond traditional naïve methods, and utilizing available software to enhance the precision and timeliness of norovirus surveillance in England.
Collapse
Affiliation(s)
- Nikola Ondrikova
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK.
- Institute for Risk and Uncertainty, University of Liverpool, Liverpool, UK.
- NIHR Health Protection Research Unit in Gastrointestinal Infections, University of Liverpool, Liverpool, UK.
| | - Helen Clough
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
- NIHR Health Protection Research Unit in Gastrointestinal Infections, University of Liverpool, Liverpool, UK
| | - Amy Douglas
- National Surveillance Gastrointestinal Pathogens Unit, UK Health Security Agency, London, UK
| | - Roberto Vivancos
- NIHR Health Protection Research Unit in Gastrointestinal Infections, University of Liverpool, Liverpool, UK
- Health Protection Operations, UK Health Security Agency, Liverpool, UK
- NIHR Health Protection Research Unit in Emerging and Zoonotic Infections, University of Liverpool, Liverpool, UK
| | | | - Nigel Cunliffe
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
- NIHR Health Protection Research Unit in Gastrointestinal Infections, University of Liverpool, Liverpool, UK
| | - John P Harris
- Health Protection Operations, UK Health Security Agency, Liverpool, UK
| |
Collapse
|
12
|
Marquez E, Barrón-Palma EV, Rodríguez K, Savage J, Sanchez-Sandoval AL. Supervised Machine Learning Methods for Seasonal Influenza Diagnosis. Diagnostics (Basel) 2023; 13:3352. [PMID: 37958248 PMCID: PMC10647880 DOI: 10.3390/diagnostics13213352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 10/24/2023] [Accepted: 10/25/2023] [Indexed: 11/15/2023] Open
Abstract
Influenza has been a stationary disease in Mexico since 2009, and this causes a high cost for the national public health system, including its detection using RT-qPCR tests, treatments, and absenteeism in the workplace. Despite influenza's relevance, the main clinical features to detect the disease defined by international institutions like the World Health Organization (WHO) and the United States Centers for Disease Control and Prevention (CDC) do not follow the same pattern in all populations. The aim of this work is to find a machine learning method to facilitate decision making in the clinical differentiation between positive and negative influenza patients, based on their symptoms and demographic features. The research sample consisted of 15480 records, including clinical and demographic data of patients with a positive/negative RT-qPCR influenza tests, from 2010 to 2020 in the public healthcare institutions of Mexico City. The performance of the methods for classifying influenza cases were evaluated with indices like accuracy, specificity, sensitivity, precision, the f1-measure and the area under the curve (AUC). Results indicate that random forest and bagging classifiers were the best supervised methods; they showed promise in supporting clinical diagnosis, especially in places where performing molecular tests might be challenging or not feasible.
Collapse
Affiliation(s)
- Edna Marquez
- Genomic Medicine Department, General Hospital of México “Dr. Eduardo Liceaga”, Mexico City 06726, Mexico; (E.V.B.-P.)
| | - Eira Valeria Barrón-Palma
- Genomic Medicine Department, General Hospital of México “Dr. Eduardo Liceaga”, Mexico City 06726, Mexico; (E.V.B.-P.)
| | - Katya Rodríguez
- Institute for Research in Applied Mathematics and Systems, National Autonomous University of Mexico, Mexico City 04510, Mexico;
| | - Jesus Savage
- Signal Processing Department, Engineering School, National Autonomous University of Mexico, Mexico City 04510, Mexico;
| | - Ana Laura Sanchez-Sandoval
- Genomic Medicine Department, General Hospital of México “Dr. Eduardo Liceaga”, Mexico City 06726, Mexico; (E.V.B.-P.)
| |
Collapse
|
13
|
Chong KC, Chan PKS, Lee TC, Lau SYF, Wu P, Lai CKC, Fung KSC, Tse CWS, Leung SY, Kwok KL, Li C, Jiang X, Wei Y. Determining meteorologically-favorable zones for seasonal influenza activity in Hong Kong. INTERNATIONAL JOURNAL OF BIOMETEOROLOGY 2023; 67:609-619. [PMID: 36847884 DOI: 10.1007/s00484-023-02439-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 02/03/2023] [Accepted: 02/08/2023] [Indexed: 06/18/2023]
Abstract
Investigations of simple and accurate meteorology classification systems for influenza epidemics, particularly in subtropical regions, are limited. To assist in preparing for potential upsurges in the demand on healthcare facilities during influenza seasons, our study aims to develop a set of meteorologically-favorable zones for epidemics of influenza A and B, defined as the intervals of meteorological variables with prediction performance optimized. We collected weekly detection rates of laboratory-confirmed influenza cases from four local major hospitals in Hong Kong between 2004 and 2019. Meteorological and air quality records for hospitals were collected from their closest monitoring stations. We employed classification and regression trees to identify zones that optimize the prediction performance of meteorological data in influenza epidemics, defined as a weekly rate > 50th percentile over a year. According to the results, a combination of temperature > 25.1℃ and relative humidity > 79% was favorable to epidemics in hot seasons, whereas either temperature < 16.4℃ or a combination of < 20.4℃ and relative humidity > 76% was favorable to epidemics in cold seasons. The area under the receiver operating characteristic curve (AUC) in model training achieved 0.80 (95% confidence interval [CI], 0.76-0.83) and was kept at 0.71 (95%CI, 0.65-0.77) in validation. The meteorologically-favorable zones for predicting influenza A or A and B epidemics together were similar, but the AUC for predicting influenza B epidemics was comparatively lower. In conclusion, we established meteorologically-favorable zones for influenza A and B epidemics with a satisfactory prediction performance, even though the influenza seasonality in this subtropical setting was weak and type-specific.
Collapse
Affiliation(s)
- Ka Chun Chong
- Jockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
- Centre for Health Systems and Policy Research, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Paul K S Chan
- Department of Microbiology, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Tsz Cheung Lee
- Hong Kong Observatory, Hong Kong Special Administrative Region, China
| | - Steven Y F Lau
- Jockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Peng Wu
- School of Public Health, The University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Christopher K C Lai
- Department of Microbiology, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Kitty S C Fung
- Department of Pathology, United Christian Hospital, Hong Kong Special Administrative Region, China
| | - Cindy W S Tse
- Department of Pathology, Kwong Wah Hospital, Hong Kong Special Administrative Region, China
| | - Shuk Yu Leung
- Department of Paediatrics, Kwong Wah Hospital, Hong Kong Special Administrative Region, China
| | - Ka Li Kwok
- Department of Paediatrics, Kwong Wah Hospital, Hong Kong Special Administrative Region, China
| | - Conglu Li
- Jockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Xiaoting Jiang
- Jockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Yuchen Wei
- Jockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China.
- Centre for Health Systems and Policy Research, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China.
| |
Collapse
|
14
|
Bhattamisra SK, Banerjee P, Gupta P, Mayuren J, Patra S, Candasamy M. Artificial Intelligence in Pharmaceutical and Healthcare Research. BIG DATA AND COGNITIVE COMPUTING 2023; 7:10. [DOI: 10.3390/bdcc7010010] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Abstract
Artificial intelligence (AI) is a branch of computer science that allows machines to work efficiently, can analyze complex data. The research focused on AI has increased tremendously, and its role in healthcare service and research is emerging at a greater pace. This review elaborates on the opportunities and challenges of AI in healthcare and pharmaceutical research. The literature was collected from domains such as PubMed, Science Direct and Google scholar using specific keywords and phrases such as ‘Artificial intelligence’, ‘Pharmaceutical research’, ‘drug discovery’, ‘clinical trial’, ‘disease diagnosis’, etc. to select the research and review articles published within the last five years. The application of AI in disease diagnosis, digital therapy, personalized treatment, drug discovery and forecasting epidemics or pandemics was extensively reviewed in this article. Deep learning and neural networks are the most used AI technologies; Bayesian nonparametric models are the potential technologies for clinical trial design; natural language processing and wearable devices are used in patient identification and clinical trial monitoring. Deep learning and neural networks were applied in predicting the outbreak of seasonal influenza, Zika, Ebola, Tuberculosis and COVID-19. With the advancement of AI technologies, the scientific community may witness rapid and cost-effective healthcare and pharmaceutical research as well as provide improved service to the general public.
Collapse
Affiliation(s)
- Subrat Kumar Bhattamisra
- Department of Pharmacology, GITAM School of Pharmacy, GITAM (Deemed to Be University), Visakhapatnam 530045, Andhra Pradesh, India
| | - Priyanka Banerjee
- Department of Pharmaceutical Technology, School of Medical Sciences, Adamas University, Kolkata 700126, West Bengal, India
| | - Pratibha Gupta
- Department of Pharmaceutical Technology, School of Medical Sciences, Adamas University, Kolkata 700126, West Bengal, India
| | - Jayashree Mayuren
- Department of Pharmaceutical Technology, School of Pharmacy, International Medical University, Kuala Lumpur 57000, Malaysia
| | - Susmita Patra
- Department of Pharmaceutical Technology, School of Medical Sciences, Adamas University, Kolkata 700126, West Bengal, India
| | - Mayuren Candasamy
- Department of Life Sciences, School of Pharmacy, International Medical University, Kuala Lumpur 57000, Malaysia
| |
Collapse
|
15
|
Moon J, Noh Y, Park S, Hwang E. Model-Agnostic Meta-Learning-based Region-Adaptive Parameter Adjustment Scheme for Influenza Forecasting. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2022. [DOI: 10.1016/j.jksuci.2022.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
16
|
Forecasting the Potential Number of Influenza-like Illness Cases by Fusing Internet Public Opinion. SUSTAINABILITY 2022. [DOI: 10.3390/su14052803] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
As influenza viruses mutate rapidly, a prediction model for potential outbreaks of influenza-like illnesses helps detect the spread of the illnesses in real time. In order to create a better prediction model, in this study, in addition to using the traditional hydrological and atmospheric data, features, such as popular search keywords on Google Trends, public holiday information, population density, air quality indices, and the numbers of COVID-19 confirmed cases, were also used to train the model in this research. Furthermore, Random Forest and XGBoost were combined and used in the proposed prediction model to increase the prediction accuracy. The training data used in this research were the historical data taken from 2016 to 2021. In our experiments, different combinations of features were tested. The results show that features, such as popular search keywords on Google Trends, the numbers of COVID-19 confirmed cases, and air quality indices can improve the outcome of the prediction model. The evaluation results showed that the error rate between the predicted results and the actual number of influenza-like cases form Week 15 to Week 18 fell to less than 5%. The outbreak of COVID-19 in Taiwan began in Week 19 and resulted in a sharp rise in the number of clinic or hospital visits by patients of influenza-like illnesses. After that, from Week 21 to Week 26, the error rate between the predicted and actual numbers of influenza-like cases in the later period dropped down to 13%. It can be confirmed from the actual experimental results in this research that the use of the ensemble learning prediction model proposed in this research can accurately predict the trend of influenza-like cases.
Collapse
|
17
|
A Decision-Level Fusion Method for COVID-19 Patient Health Prediction. BIG DATA RESEARCH 2022; 27:100287. [PMCID: PMC8574072 DOI: 10.1016/j.bdr.2021.100287] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 08/11/2021] [Accepted: 10/28/2021] [Indexed: 06/16/2023]
Abstract
With the continuous attempts to develop effective machine learning methods, information fusion approaches play an important role in integrating data from multiple sources and improving these methods' performance. Among the different fusion techniques, decision-level fusion has unique advantages to fuse the decisions of various classifiers and getting an effective outcome. In this paper, we propose a decision-level fusion method that combines three well-calibrated ensemble classifiers, namely, a random forest (RF), gradient boosting (GB), and extreme gradient boosting (XGB) methods. It is used to predict the COVID-19 patient health for early monitoring and efficient treatment. A soft voting technique is used to generate the final decision result from the predictions of these calibrated classifiers. The method uses the COVID-19 patient's health information, travel demographic, and geographical data to predict the possible outcome of the COVID-19 case, recovered, or death. A different set of experiments is conducted on a public novel Corona Virus 2019 dataset using a different ratio of test sets. The experimental results show that the proposed fusion method achieved an accuracy of 97.24% and an F1-score of 0.97, which is higher than the current related work that has an accuracy of 94% and an F1-score 0.86, on 20% test set taken from the dataset.
Collapse
|
18
|
Dixon S, Keshavamurthy R, Farber DH, Stevens A, Pazdernik KT, Charles LE. A Comparison of Infectious Disease Forecasting Methods across Locations, Diseases, and Time. Pathogens 2022; 11:185. [PMID: 35215129 PMCID: PMC8875569 DOI: 10.3390/pathogens11020185] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 01/23/2022] [Accepted: 01/27/2022] [Indexed: 02/04/2023] Open
Abstract
Accurate infectious disease forecasting can inform efforts to prevent outbreaks and mitigate adverse impacts. This study compares the performance of statistical, machine learning (ML), and deep learning (DL) approaches in forecasting infectious disease incidences across different countries and time intervals. We forecasted three diverse diseases: campylobacteriosis, typhoid, and Q-fever, using a wide variety of features (n = 46) from public datasets, e.g., landscape, climate, and socioeconomic factors. We compared autoregressive statistical models to two tree-based ML models (extreme gradient boosted trees [XGB] and random forest [RF]) and two DL models (multi-layer perceptron and encoder-decoder model). The disease models were trained on data from seven different countries at the region-level between 2009-2017. Forecasting performance of all models was assessed using mean absolute error, root mean square error, and Poisson deviance across Australia, Israel, and the United States for the months of January through August of 2018. The overall model results were compared across diseases as well as various data splits, including country, regions with highest and lowest cases, and the forecasted months out (i.e., nowcasting, short-term, and long-term forecasting). Overall, the XGB models performed the best for all diseases and, in general, tree-based ML models performed the best when looking at data splits. There were a few instances where the statistical or DL models had minutely smaller error metrics for specific subsets of typhoid, which is a disease with very low case counts. Feature importance per disease was measured by using four tree-based ML models (i.e., XGB and RF with and without region name as a feature). The most important feature groups included previous case counts, region name, population counts and density, mortality causes of neonatal to under 5 years of age, sanitation factors, and elevation. This study demonstrates the power of ML approaches to incorporate a wide range of factors to forecast various diseases, regardless of location, more accurately than traditional statistical approaches.
Collapse
Affiliation(s)
- Samuel Dixon
- Pacific Northwest National Laboratory, Richland, WA 99354, USA; (S.D.); (R.K.); (D.H.F.); (A.S.); (K.T.P.)
| | - Ravikiran Keshavamurthy
- Pacific Northwest National Laboratory, Richland, WA 99354, USA; (S.D.); (R.K.); (D.H.F.); (A.S.); (K.T.P.)
- Paul G. Allen School for Global Health, Washington State University, Pullman, WA 99164, USA
| | - Daniel H. Farber
- Pacific Northwest National Laboratory, Richland, WA 99354, USA; (S.D.); (R.K.); (D.H.F.); (A.S.); (K.T.P.)
| | - Andrew Stevens
- Pacific Northwest National Laboratory, Richland, WA 99354, USA; (S.D.); (R.K.); (D.H.F.); (A.S.); (K.T.P.)
| | - Karl T. Pazdernik
- Pacific Northwest National Laboratory, Richland, WA 99354, USA; (S.D.); (R.K.); (D.H.F.); (A.S.); (K.T.P.)
- Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA
| | - Lauren E. Charles
- Pacific Northwest National Laboratory, Richland, WA 99354, USA; (S.D.); (R.K.); (D.H.F.); (A.S.); (K.T.P.)
- Paul G. Allen School for Global Health, Washington State University, Pullman, WA 99164, USA
| |
Collapse
|
19
|
He Y, Zhao Y, Chen Y, Yuan H, Tsui K. Nowcasting influenza‐like illness (ILI) via a deep learning approach using google search data: An empirical study on Taiwan ILI. INT J INTELL SYST 2021. [DOI: 10.1002/int.22788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Yuxin He
- College of Urban Transportation and Logistics Shenzhen Technology University Shenzhen China
| | - Yang Zhao
- School of Public Health (Shenzhen) Sun Yat‐Sen University Guangzhou China
| | - Yupeng Chen
- Trial Retail Engineering (T. R. E. China) Yantai China
| | - Hsiang‐Yu Yuan
- Department of Biomedical Sciences City University of Hong Kong Hong Kong China
| | - Kwok‐Leung Tsui
- Department of Industrial and Systems Engineering Virginia Polytechnic Institute and State University Blacksburg Virginia USA
| |
Collapse
|
20
|
Yang S, Bao Y. Comprehensive learning particle swarm optimization enabled modeling framework for multi-step-ahead influenza prediction. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107994] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
21
|
Lv CX, An SY, Qiao BJ, Wu W. Time series analysis of hemorrhagic fever with renal syndrome in mainland China by using an XGBoost forecasting model. BMC Infect Dis 2021; 21:839. [PMID: 34412581 PMCID: PMC8377883 DOI: 10.1186/s12879-021-06503-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 07/30/2021] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND Hemorrhagic fever with renal syndrome (HFRS) is still attracting public attention because of its outbreak in various cities in China. Predicting future outbreaks or epidemics disease based on past incidence data can help health departments take targeted measures to prevent diseases in advance. In this study, we propose a multistep prediction strategy based on extreme gradient boosting (XGBoost) for HFRS as an extension of the one-step prediction model. Moreover, the fitting and prediction accuracy of the XGBoost model will be compared with the autoregressive integrated moving average (ARIMA) model by different evaluation indicators. METHODS We collected HFRS incidence data from 2004 to 2018 of mainland China. The data from 2004 to 2017 were divided into training sets to establish the seasonal ARIMA model and XGBoost model, while the 2018 data were used to test the prediction performance. In the multistep XGBoost forecasting model, one-hot encoding was used to handle seasonal features. Furthermore, a series of evaluation indices were performed to evaluate the accuracy of the multistep forecast XGBoost model. RESULTS There were 200,237 HFRS cases in China from 2004 to 2018. A long-term downward trend and bimodal seasonality were identified in the original time series. According to the minimum corrected akaike information criterion (CAIC) value, the optimal ARIMA (3, 1, 0) × (1, 1, 0)12 model is selected. The index ME, RMSE, MAE, MPE, MAPE, and MASE indices of the XGBoost model were higher than those of the ARIMA model in the fitting part, whereas the RMSE of the XGBoost model was lower. The prediction performance evaluation indicators (MAE, MPE, MAPE, RMSE and MASE) of the one-step prediction and multistep prediction XGBoost model were all notably lower than those of the ARIMA model. CONCLUSIONS The multistep XGBoost prediction model showed a much better prediction accuracy and model stability than the multistep ARIMA prediction model. The XGBoost model performed better in predicting complicated and nonlinear data like HFRS. Additionally, Multistep prediction models are more practical than one-step prediction models in forecasting infectious diseases.
Collapse
Affiliation(s)
- Cai-Xia Lv
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, Liaoning China
| | - Shu-Yi An
- Liaoning Provincial Center for Disease Control and Prevention, Shenyang, Liaoning China
| | - Bao-Jun Qiao
- Liaoning Provincial Center for Disease Control and Prevention, Shenyang, Liaoning China
| | - Wei Wu
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, Liaoning China
| |
Collapse
|
22
|
Jung S, Moon J, Park S, Hwang E. Self-attention-based Deep Learning Network for Regional Influenza Forecasting. IEEE J Biomed Health Inform 2021; 26:922-933. [PMID: 34197330 DOI: 10.1109/jbhi.2021.3093897] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Early prediction of influenza plays an important role in minimizing the damage caused, as it provides the resources and time needed to formulate preventive measures. Compared to traditional mechanistic approach, deep/machine learning-based models have demonstrated excellent forecasting performance by efficiently handling various data such as weather and internet data. However, due to the limited availability and reliability of such data, many forecasting models use only historical occurrence data and formulate the influenza forecasting as a multivariate time-series task. Recently, attention mechanisms have been exploited to deal with this issue by selecting valuable data in the input data and giving them high weights. Particularly, self-attention has shown its potential in various forecasting tasks by utilizing the predictive relationship between objects from the input data describing target objects. Hence, in this study, we propose a forecasting model based on self-attention for regional influenza forecasting, called SAIFlu-Net. The model exploits a long short-term memory network for extracting time-series patterns of each region and the self-attention mechanism to find the similarities between the occurrence patterns. To evaluate its performance, we conducted extensive experiments with existing forecasting models using weekly regional influenza datasets. The results show that the proposed model outperforms other models in terms of root mean square error and Pearson correlation coefficient.
Collapse
|
23
|
Wen A, Wang L, He H, Liu S, Fu S, Sohn S, Kugel JA, Kaggal VC, Huang M, Wang Y, Shen F, Fan J, Liu H. An aberration detection-based approach for sentinel syndromic surveillance of COVID-19 and other novel influenza-like illnesses. J Biomed Inform 2021; 113:103660. [PMID: 33321199 PMCID: PMC7832634 DOI: 10.1016/j.jbi.2020.103660] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Revised: 11/06/2020] [Accepted: 12/09/2020] [Indexed: 02/08/2023]
Abstract
Coronavirus Disease 2019 has emerged as a significant global concern, triggering harsh public health restrictions in a successful bid to curb its exponential growth. As discussion shifts towards relaxation of these restrictions, there is significant concern of second-wave resurgence. The key to managing these outbreaks is early detection and intervention, and yet there is a significant lag time associated with usage of laboratory confirmed cases for surveillance purposes. To address this, syndromic surveillance can be considered to provide a timelier alternative for first-line screening. Existing syndromic surveillance solutions are however typically focused around a known disease and have limited capability to distinguish between outbreaks of individual diseases sharing similar syndromes. This poses a challenge for surveillance of COVID-19 as its active periods tend to overlap temporally with other influenza-like illnesses. In this study we explore performing sentinel syndromic surveillance for COVID-19 and other influenza-like illnesses using a deep learning-based approach. Our methods are based on aberration detection utilizing autoencoders that leverages symptom prevalence distributions to distinguish outbreaks of two ongoing diseases that share similar syndromes, even if they occur concurrently. We first demonstrate that this approach works for detection of outbreaks of influenza, which has known temporal boundaries. We then demonstrate that the autoencoder can be trained to not alert on known and well-managed influenza-like illnesses such as the common cold and influenza. Finally, we applied our approach to 2019-2020 data in the context of a COVID-19 syndromic surveillance task to demonstrate how implementation of such a system could have provided early warning of an outbreak of a novel influenza-like illness that did not match the symptom prevalence profile of influenza and other known influenza-like illnesses.
Collapse
Affiliation(s)
- Andrew Wen
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Liwei Wang
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Huan He
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Sijia Liu
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Sunyang Fu
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Sunghwan Sohn
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Jacob A Kugel
- Advanced Analytics Service Unit, Department of Information Technology, Mayo Clinic, Rochester, MN, USA
| | - Vinod C Kaggal
- Advanced Analytics Service Unit, Department of Information Technology, Mayo Clinic, Rochester, MN, USA
| | - Ming Huang
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Yanshan Wang
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Feichen Shen
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Jungwei Fan
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.
| | - Hongfang Liu
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|