Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Taft LM, Evans RS, Shyu CR, Egger MJ, Chawla N, Mitchell JA, Thornton SN, Bray B, Varner M. Countering imbalanced datasets to improve adverse drug event predictive models in labor and delivery. J Biomed Inform 2009;42:356-64. [PMID: 18824133 PMCID: PMC2692750 DOI: 10.1016/j.jbi.2008.09.001] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2008] [Revised: 09/05/2008] [Accepted: 09/07/2008] [Indexed: 11/30/2022]

Number

Cited by Other Article(s)

Niceta M, Ciolfi A, Ferilli M, Pedace L, Cappelletti C, Nardini C, Hildonen M, Chiriatti L, Miele E, Dentici ML, Gnazzo M, Cesario C, Pisaneschi E, Baban A, Novelli A, Maitz S, Selicorni A, Squeo GM, Merla G, Dallapiccola B, Tumer Z, Digilio MC, Priolo M, Tartaglia M. DNA methylation profiling in Kabuki syndrome: reclassification of germline KMT2D VUS and sensitivity in validating postzygotic mosaicism. Eur J Hum Genet 2024:10.1038/s41431-024-01597-9. [PMID: 38528056 DOI: 10.1038/s41431-024-01597-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 03/05/2024] [Accepted: 03/13/2024] [Indexed: 03/27/2024] Open

Affiliation(s)

Marcello Niceta Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Andrea Ciolfi Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Marco Ferilli Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy Department of Computer, Control and Management Engineering, Sapienza University, 00185, Rome, Italy
Lucia Pedace Department of Pediatric Hematology/Oncology, Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00165, Rome, Italy
Camilla Cappelletti Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Claudia Nardini Department of Pediatric Hematology/Oncology, Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00165, Rome, Italy
Mathis Hildonen Department of Clinical Genetics, Kennedy Center, Copenhagen University Hospital, Rigshopsitalet, 2600, Glostrup, Denmark
Luigi Chiriatti Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Evelina Miele Department of Pediatric Hematology/Oncology, Cell and Gene Therapy, Bambino Gesù Children's Hospital, IRCCS, 00165, Rome, Italy
Maria Lisa Dentici Medical Genetics Unit, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Maria Gnazzo Laboratory of Medical Genetics, Translational Cytogenomics Research Unit, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Claudia Cesario Laboratory of Medical Genetics, Translational Cytogenomics Research Unit, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Elisa Pisaneschi Laboratory of Medical Genetics, Translational Cytogenomics Research Unit, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Anwar Baban Pediatric Cardiology and Cardiac Arrhythmias Unit, Department of Pediatric Cardiology and Cardiac Surgery, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Antonio Novelli Laboratory of Medical Genetics, Translational Cytogenomics Research Unit, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Silvia Maitz Genetica Clinica Pediatrica, Fondazione MBBM, ASST Monza Ospedale San Gerardo, 20900, Monza, Italy
Angelo Selicorni Pediatria, Ospedale Sant'Anna, ASST Lariana, 22100, Como, Italy
Gabriella Maria Squeo Laboratory of Regulatory and Functional Genomics, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, 71013, Foggia, Italy
Giuseppe Merla Laboratory of Regulatory and Functional Genomics, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, 71013, Foggia, Italy Department of Molecular Medicine and Medical Biotechnology, University of Naples Federico II, 80131, Naples, Italy
Bruno Dallapiccola Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Zeynep Tumer Department of Clinical Genetics, Kennedy Center, Copenhagen University Hospital, Rigshopsitalet, 2600, Glostrup, Denmark Department of Clinical Medicine, Faculty of Medicine and Health Sciences, University of Copenhagen, 2200, Copenhagen, Denmark
Maria Cristina Digilio Medical Genetics Unit, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy
Manuela Priolo Medical and Laboratory Genetics, Antonio Cardarelli Hospital, 80131, Naples, Italy
Marco Tartaglia Molecular Genetics and Functional Genomics, Bambino Gesù Children's Hospital, IRCCS, 00146, Rome, Italy.

Collapse

Aeberhard JL, Radan AP, Delgado-Gonzalo R, Strahm KM, Sigurthorsdottir HB, Schneider S, Surbek D. Artificial intelligence and machine learning in cardiotocography: A scoping review. Eur J Obstet Gynecol Reprod Biol 2023;281:54-62. [PMID: 36535071 DOI: 10.1016/j.ejogrb.2022.12.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Revised: 10/19/2022] [Accepted: 12/05/2022] [Indexed: 12/13/2022]

Li X, Yan L, Wang X, Ouyang C, Wang C, Chao J, Zhang J, Lian G. Predictive models for endoscopic disease activity in patients with ulcerative colitis: Practical machine learning-based modeling and interpretation. Front Med (Lausanne) 2022;9:1043412. [PMID: 36619650 PMCID: PMC9810755 DOI: 10.3389/fmed.2022.1043412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 12/07/2022] [Indexed: 12/24/2022] Open

Abstract

Background

Endoscopic disease activity monitoring is important for the long-term management of patients with ulcerative colitis (UC), there is currently no widely accepted non-invasive method that can effectively predict endoscopic disease activity. We aimed to develop and validate machine learning (ML) models for predicting it, which are desired to reduce the frequency of endoscopic examinations and related costs.

Methods

The patients with a diagnosis of UC in two hospitals from January 2016 to January 2021 were enrolled in this study. Thirty nine clinical and laboratory variables were collected. All patients were divided into four groups based on MES or UCEIS scores. Logistic regression (LR) and four ML algorithms were applied to construct the prediction models. The performance of models was evaluated in terms of accuracy, sensitivity, precision, F1 score, and area under the receiver-operating characteristic curve (AUC). Then Shapley additive explanations (SHAP) was applied to determine the importance of the selected variables and interpret the ML models.

Results

A total of 420 patients were entered into the study. Twenty four variables showed statistical differences among the groups. After synthetic minority oversampling technique (SMOTE) oversampling and RFE variables selection, the random forests (RF) model with 23 variables in MES and the extreme gradient boosting (XGBoost) model with 21 variables in USEIS, had the greatest discriminatory ability (AUC = 0.8192 in MES and 0.8006 in UCEIS in the test set). The results obtained from SHAP showed that albumin, rectal bleeding, and CRP/ALB contributed the most to the overall model. In addition, the above three variables had a more balanced contribution to each classification under the MES than the UCEIS according to the SHAP values.

Conclusion

This proof-of-concept study demonstrated that the ML model could serve as an effective non-invasive approach to predicting endoscopic disease activity for patients with UC. RF and XGBoost, which were first introduced into data-based endoscopic disease activity prediction, are suitable for the present prediction modeling.

Collapse

Syrowatka A, Song W, Amato MG, Foer D, Edrees H, Co Z, Kuznetsova M, Dulgarian S, Seger DL, Simona A, Bain PA, Purcell Jackson G, Rhee K, Bates DW. Key use cases for artificial intelligence to reduce the frequency of adverse drug events: a scoping review. Lancet Digit Health 2021;4:e137-e148. [PMID: 34836823 DOI: 10.1016/s2589-7500(21)00229-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Revised: 08/13/2021] [Accepted: 09/10/2021] [Indexed: 12/31/2022]

Affiliation(s)

Ania Syrowatka Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA.
Wenyu Song Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA
Mary G Amato Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA; Massachusetts College of Pharmacy and Health Sciences, Boston, MA, USA
Dinah Foer Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA; Division of Allergy and Clinical Immunology, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA
Heba Edrees Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA; Massachusetts College of Pharmacy and Health Sciences, Boston, MA, USA
Zoe Co Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA
Masha Kuznetsova Harvard Business School, Boston, MA, USA
Sevan Dulgarian Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA
Diane L Seger Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA
Aurélien Simona Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA
Paul A Bain Countway Library of Medicine, Harvard Medical School, Boston, MA, USA
Gretchen Purcell Jackson IBM Watson Health, Cambridge, MA, USA; Department of Pediatric Surgery, Vanderbilt University Medical Center, Nashville, TN, USA
Kyu Rhee IBM Watson Health, Cambridge, MA, USA; CVS Health, Wellesley Hills, MA, USA
David W Bates Division of General Internal Medicine, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA; Harvard T H Chan School of Public Health, Boston, MA, USA

Collapse

Cheng Y, Chen C, Yang J, Yang H, Fu M, Zhong X, Wang B, He M, Hu Z, Zhang Z, Jin X, Kang Y, Wu Q. Using Machine Learning Algorithms to Predict Hospital Acquired Thrombocytopenia after Operation in the Intensive Care Unit: A Retrospective Cohort Study. Diagnostics (Basel) 2021;11:diagnostics11091614. [PMID: 34573956 PMCID: PMC8466367 DOI: 10.3390/diagnostics11091614] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 08/25/2021] [Accepted: 09/01/2021] [Indexed: 02/05/2023] Open

Affiliation(s)

Yisong Cheng Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Chaoyue Chen Department of Neurosurgery, West China Hospital, Sichuan University, Chengdu 610041, China;
Jie Yang Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Hao Yang Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Min Fu Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Xi Zhong Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Bo Wang Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Min He Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Zhi Hu Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Zhongwei Zhang Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Xiaodong Jin Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Yan Kang Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
Qin Wu Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.) Correspondence: ; Tel.: +86-028-8542-2506

Collapse

Vepa A, Saleem A, Rakhshan K, Daneshkhah A, Sedighi T, Shohaimi S, Omar A, Salari N, Chatrabgoun O, Dharmaraj D, Sami J, Parekh S, Ibrahim M, Raza M, Kapila P, Chakrabarti P. Using Machine Learning Algorithms to Develop a Clinical Decision-Making Tool for COVID-19 Inpatients. Int J Environ Res Public Health 2021;18:6228. [PMID: 34207560 DOI: 10.3390/ijerph18126228] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/18/2021] [Revised: 05/28/2021] [Accepted: 06/01/2021] [Indexed: 12/21/2022]

Karabulut OC, Karpuzcu BA, Türk E, Ibrahim AH, Süzek BE. ML-AdVInfect: A Machine-Learning Based Adenoviral Infection Predictor. Front Mol Biosci 2021;8:647424. [PMID: 34026828 PMCID: PMC8139618 DOI: 10.3389/fmolb.2021.647424] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Accepted: 04/22/2021] [Indexed: 01/08/2023] Open

Abstract

Adenoviruses (AdVs) constitute a diverse family with many pathogenic types that infect a broad range of hosts. Understanding the pathogenesis of adenoviral infections is not only clinically relevant but also important to elucidate the potential use of AdVs as vectors in therapeutic applications. For an adenoviral infection to occur, attachment of the viral ligand to a cellular receptor on the host organism is a prerequisite and, in this sense, it is a criterion to decide whether an adenoviral infection can potentially happen. The interaction between any virus and its corresponding host organism is a specific kind of protein-protein interaction (PPI) and several experimental techniques, including high-throughput methods are being used in exploring such interactions. As a result, there has been accumulating data on virus-host interactions including a significant portion reported at publicly available bioinformatics resources. There is not, however, a computational model to integrate and interpret the existing data to draw out concise decisions, such as whether an infection happens or not. In this study, accepting the cellular entry of AdV as a decisive parameter for infectivity, we have developed a machine learning, more precisely support vector machine (SVM), based methodology to predict whether adenoviral infection can take place in a given host. For this purpose, we used the sequence data of the known receptors of AdVs, we identified sets of adenoviral ligands and their respective host species, and eventually, we have constructed a comprehensive adenovirus–host interaction dataset. Then, we committed interaction predictions through publicly available virus-host PPI tools and constructed an AdV infection predictor model using SVM with RBF kernel, with the overall sensitivity, specificity, and AUC of 0.88 ± 0.011, 0.83 ± 0.064, and 0.86 ± 0.030, respectively. ML-AdVInfect is the first of its kind as an effective predictor to screen the infection capacity along with anticipating any cross-species shifts. We anticipate our approach led to ML-AdVInfect can be adapted in making predictions for other viral infections.

Collapse

Tan TH, Hsu CC, Chen CJ, Hsu SL, Liu TL, Lin HJ, Wang JJ, Liu CF, Huang CC. Predicting outcomes in older ED patients with influenza in real time using a big data-driven and machine learning approach to the hospital information system. BMC Geriatr 2021;21:280. [PMID: 33902485 PMCID: PMC8077903 DOI: 10.1186/s12877-021-02229-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 04/19/2021] [Indexed: 11/30/2022] Open

Vijayvargiya A, Prakash C, Kumar R, Bansal S, R.s. Tavares JM. Human knee abnormality detection from imbalanced sEMG data. Biomed Signal Process Control 2021;66:102406. [DOI: 10.1016/j.bspc.2021.102406] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Antaki F, Kahwati G, Sebag J, Coussa RG, Fanous A, Duval R, Sebag M. Predictive modeling of proliferative vitreoretinopathy using automated machine learning by ophthalmologists without coding experience. Sci Rep 2020;10:19528. [PMID: 33177614 PMCID: PMC7658348 DOI: 10.1038/s41598-020-76665-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Accepted: 11/01/2020] [Indexed: 11/23/2022] Open

Na KS, Kim E. A Machine Learning-Based Predictive Model of Return to Work After Sick Leave. J Occup Environ Med 2019;61:e191-9. [PMID: 30829888 DOI: 10.1097/JOM.0000000000001567] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Zhang PI, Hsu CC, Kao Y, Chen CJ, Kuo YW, Hsu SL, Liu TL, Lin HJ, Wang JJ, Liu CF, Huang CC. Real-time AI prediction for major adverse cardiac events in emergency department patients with chest pain. Scand J Trauma Resusc Emerg Med 2020;28:93. [PMID: 32917261 DOI: 10.1186/s13049-020-00786-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2020] [Accepted: 09/02/2020] [Indexed: 02/07/2023] Open

Park YW, Choi D, Lee J, Ahn SS, Lee SK, Lee SH, Bang M. Differentiating patients with schizophrenia from healthy controls by hippocampal subfields using radiomics. Schizophr Res 2020;223:337-44. [PMID: 32988740 DOI: 10.1016/j.schres.2020.09.009] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Revised: 08/11/2020] [Accepted: 09/14/2020] [Indexed: 12/16/2022]

Mishra S, Mallick PK, Jena L, Chae GS. Optimization of Skewed Data Using Sampling-Based Preprocessing Approach. Front Public Health 2020;8:274. [PMID: 32766193 PMCID: PMC7378392 DOI: 10.3389/fpubh.2020.00274] [Citation(s) in RCA: 71] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 05/26/2020] [Indexed: 11/26/2022] Open

Gao L, Wu S. Response score of deep learning for out-of-distribution sample detection of medical images. J Biomed Inform 2020;107:103442. [PMID: 32450299 DOI: 10.1016/j.jbi.2020.103442] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 05/02/2020] [Accepted: 05/05/2020] [Indexed: 02/07/2023]

Rahman R, Kodesh A, Levine SZ, Sandin S, Reichenberg A, Schlessinger A. Identification of newborns at risk for autism using electronic medical records and machine learning. Eur Psychiatry 2020;63:e22. [PMID: 32100657 PMCID: PMC7315872 DOI: 10.1192/j.eurpsy.2020.17] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open

Abstract

BACKGROUND

Current approaches for early identification of individuals at high risk for autism spectrum disorder (ASD) in the general population are limited, and most ASD patients are not identified until after the age of 4. This is despite substantial evidence suggesting that early diagnosis and intervention improves developmental course and outcome. The aim of the current study was to test the ability of machine learning (ML) models applied to electronic medical records (EMRs) to predict ASD early in life, in a general population sample.

METHODS

We used EMR data from a single Israeli Health Maintenance Organization, including EMR information for parents of 1,397 ASD children (ICD-9/10) and 94,741 non-ASD children born between January 1st, 1997 and December 31st, 2008. Routinely available parental sociodemographic information, parental medical histories, and prescribed medications data were used to generate features to train various ML algorithms, including multivariate logistic regression, artificial neural networks, and random forest. Prediction performance was evaluated with 10-fold cross-validation by computing the area under the receiver operating characteristic curve (AUC; C-statistic), sensitivity, specificity, accuracy, false positive rate, and precision (positive predictive value [PPV]).

RESULTS

All ML models tested had similar performance. The average performance across all models had C-statistic of 0.709, sensitivity of 29.93%, specificity of 98.18%, accuracy of 95.62%, false positive rate of 1.81%, and PPV of 43.35% for predicting ASD in this dataset.

CONCLUSIONS

We conclude that ML algorithms combined with EMR capture early life ASD risk as well as reveal previously unknown features to be associated with ASD-risk. Such approaches may be able to enhance the ability for accurate and efficient early detection of ASD in large populations of children.

Collapse

Girault JB, Piven J. The Neurodevelopment of Autism from Infancy Through Toddlerhood. Neuroimaging Clin N Am 2019;30:97-114. [PMID: 31759576 DOI: 10.1016/j.nic.2019.09.009] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

Guo H, Zhou J, Wu C. Ensemble learning via constraint projection and undersampling technique for class-imbalance problem. Soft comput 2020;24:4711-27. [DOI: 10.1007/s00500-019-04501-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Atallah DM, Badawy M, El-sayed A. A new proposed feature selection method to predict kidney transplantation outcome. Health Technol 2019;9:847-856. [DOI: 10.1007/s12553-019-00369-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

Li J, Ogrodnik M, Kolachalama VB, Lin H, Au R. Assessment of the Mid-Life Demographic and Lifestyle Risk Factors of Dementia Using Data from the Framingham Heart Study Offspring Cohort. J Alzheimers Dis 2019;63:1119-1127. [PMID: 29710704 DOI: 10.3233/jad-170917] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Punitha N, Ramakrishnan S. Multifractal analysis of uterine electromyography signals to differentiate term and preterm conditions. Proc Inst Mech Eng H 2019;233:362-371. [PMID: 30706756 DOI: 10.1177/0954411919827323] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Tan X, Su S, Huang Z, Guo X, Zuo Z, Sun X, Li L. Wireless Sensor Networks Intrusion Detection Based on SMOTE and the Random Forest Algorithm. Sensors (Basel) 2019;19:E203. [PMID: 30626020 DOI: 10.3390/s19010203] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 12/27/2018] [Accepted: 01/04/2019] [Indexed: 11/21/2022]

Rodriguez LM, Fushman DD. Finding Understudied Disorders Potentially Associated with Maternal Morbidity and Mortality. AJP Rep 2019;9:e36-e43. [PMID: 30838163 PMCID: PMC6398998 DOI: 10.1055/s-0039-1683363] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 01/26/2019] [Indexed: 11/03/2022] Open

Fergus P, Selvaraj M, Chalmers C. Machine learning ensemble modelling to classify caesarean section and vaginal delivery types using Cardiotocography traces. Comput Biol Med 2017;93:7-16. [PMID: 29248699 DOI: 10.1016/j.compbiomed.2017.12.002] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2017] [Revised: 12/06/2017] [Accepted: 12/07/2017] [Indexed: 10/18/2022]

Abstract

Human visual inspection of Cardiotocography traces is used to monitor the foetus during labour and avoid neonatal mortality and morbidity. The problem, however, is that visual interpretation of Cardiotocography traces is subject to high inter and intra observer variability. Incorrect decisions, caused by miss-interpretation, can lead to adverse perinatal outcomes and in severe cases death. This study presents a review of human Cardiotocography trace interpretation and argues that machine learning, used as a decision support system by obstetricians and midwives, may provide an objective measure alongside normal practices. This will help to increase predictive capacity and reduce negative outcomes. A robust methodology is presented for feature set engineering using an open database comprising 552 intrapartum recordings. State-of-the-art in signal processing techniques is applied to raw Cardiotocography foetal heart rate traces to extract 13 features. Those with low discriminative capacity are removed using Recursive Feature Elimination. The dataset is imbalanced with significant differences between the prior probabilities of both normal deliveries and those delivered by caesarean section. This issue is addressed by oversampling the training instances using a synthetic minority oversampling technique to provide a balanced class distribution. Several simple, yet powerful, machine-learning algorithms are trained, using the feature set, and their performance is evaluated with real test data. The results are encouraging using an ensemble classifier comprising Fishers Linear Discriminant Analysis, Random Forest and Support Vector Machine classifiers, with 87% (95% Confidence Interval: 86%, 88%) for Sensitivity, 90% (95% CI: 89%, 91%) for Specificity, and 96% (95% CI: 96%, 97%) for the Area Under the Curve, with a 9% (95% CI: 9%, 10%) Mean Square Error.

Collapse

Fergus P, Hussain A, Al-Jumeily D, Huang DS, Bouguila N. Classification of caesarean section and normal vaginal deliveries using foetal heart rate signals and advanced machine learning algorithms. Biomed Eng Online 2017;16:89. [PMID: 28679415 PMCID: PMC5498914 DOI: 10.1186/s12938-017-0378-z] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2017] [Accepted: 06/26/2017] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Visual inspection of cardiotocography traces by obstetricians and midwives is the gold standard for monitoring the wellbeing of the foetus during antenatal care. However, inter- and intra-observer variability is high with only a 30% positive predictive value for the classification of pathological outcomes. This has a significant negative impact on the perinatal foetus and often results in cardio-pulmonary arrest, brain and vital organ damage, cerebral palsy, hearing, visual and cognitive defects and in severe cases, death. This paper shows that using machine learning and foetal heart rate signals provides direct information about the foetal state and helps to filter the subjective opinions of medical practitioners when used as a decision support tool. The primary aim is to provide a proof-of-concept that demonstrates how machine learning can be used to objectively determine when medical intervention, such as caesarean section, is required and help avoid preventable perinatal deaths.

METHODS

This is evidenced using an open dataset that comprises 506 controls (normal virginal deliveries) and 46 cases (caesarean due to pH ≤ 7.20-acidosis, n = 18; pH > 7.20 and pH < 7.25-foetal deterioration, n = 4; or clinical decision without evidence of pathological outcome measures, n = 24). Several machine-learning algorithms are trained, and validated, using binary classifier performance measures.

RESULTS

The findings show that deep learning classification achieves sensitivity = 94%, specificity = 91%, Area under the curve = 99%, F-score = 100%, and mean square error = 1%.

CONCLUSIONS

The results demonstrate that machine learning significantly improves the efficiency for the detection of caesarean section and normal vaginal deliveries using foetal heart rate signals compared with obstetrician and midwife predictions and systems reported in previous studies.

Collapse

Blagus R, Goeman JJ. What (not) to expect when classifying rare events. Brief Bioinform 2016;19:341-349. [DOI: 10.1093/bib/bbw107] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Indexed: 01/23/2023] Open

Blagus R, Lusa L. Joint use of over- and under-sampling techniques and cross-validation for the development and assessment of prediction models. BMC Bioinformatics 2015;16:363. [PMID: 26537827 PMCID: PMC4634915 DOI: 10.1186/s12859-015-0784-9] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2015] [Accepted: 10/17/2015] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Prediction models are used in clinical research to develop rules that can be used to accurately predict the outcome of the patients based on some of their characteristics. They represent a valuable tool in the decision making process of clinicians and health policy makers, as they enable them to estimate the probability that patients have or will develop a disease, will respond to a treatment, or that their disease will recur. The interest devoted to prediction models in the biomedical community has been growing in the last few years. Often the data used to develop the prediction models are class-imbalanced as only few patients experience the event (and therefore belong to minority class).

RESULTS

Prediction models developed using class-imbalanced data tend to achieve sub-optimal predictive accuracy in the minority class. This problem can be diminished by using sampling techniques aimed at balancing the class distribution. These techniques include under- and oversampling, where a fraction of the majority class samples are retained in the analysis or new samples from the minority class are generated. The correct assessment of how the prediction model is likely to perform on independent data is of crucial importance; in the absence of an independent data set, cross-validation is normally used. While the importance of correct cross-validation is well documented in the biomedical literature, the challenges posed by the joint use of sampling techniques and cross-validation have not been addressed.

CONCLUSIONS

We show that care must be taken to ensure that cross-validation is performed correctly on sampled data, and that the risk of overestimating the predictive accuracy is greater when oversampling techniques are used. Examples based on the re-analysis of real datasets and simulation studies are provided. We identify some results from the biomedical literature where the incorrect cross-validation was performed, where we expect that the performance of oversampling techniques was heavily overestimated.

Collapse

Pai PF, Chen LC, Lin KP. A hybrid data mining model in analyzing corporate social responsibility. Neural Comput Appl 2015. [DOI: 10.1007/s00521-015-1893-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Fergus P, Hignett D, Hussain A, Al-Jumeily D, Abdel-Aziz K. Automatic epileptic seizure detection using scalp EEG and advanced artificial intelligence techniques. Biomed Res Int 2015;2015:986736. [PMID: 25710040 DOI: 10.1155/2015/986736] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/21/2014] [Revised: 12/09/2014] [Accepted: 12/23/2014] [Indexed: 11/17/2022]

Ramezankhani A, Pournik O, Shahrabi J, Azizi F, Hadaegh F, Khalili D. The Impact of Oversampling with SMOTE on the Performance of 3 Classifiers in Prediction of Type 2 Diabetes. Med Decis Making 2014;36:137-44. [PMID: 25449060 DOI: 10.1177/0272989x14560647] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2013] [Accepted: 10/23/2014] [Indexed: 11/15/2022]

Breathett K, Muhlestein D, Foraker R, Gulati M. Differences in Preeclampsia Rates Between African American and Caucasian Women: Trends from the National Hospital Discharge Survey. J Womens Health (Larchmt) 2014;23:886-93. [DOI: 10.1089/jwh.2014.4749] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Lee PH. Resampling methods improve the predictive power of modeling in class-imbalanced datasets. Int J Environ Res Public Health 2014;11:9776-89. [PMID: 25238271 PMCID: PMC4199049 DOI: 10.3390/ijerph110909776] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2014] [Revised: 09/04/2014] [Accepted: 09/12/2014] [Indexed: 11/20/2022]

Abstract

In the medical field, many outcome variables are dichotomized, and the two possible values of a dichotomized variable are referred to as classes. A dichotomized dataset is class-imbalanced if it consists mostly of one class, and performance of common classification models on this type of dataset tends to be suboptimal. To tackle such a problem, resampling methods, including oversampling and undersampling can be used. This paper aims at illustrating the effect of resampling methods using the National Health and Nutrition Examination Survey (NHANES) wave 2009–2010 dataset. A total of 4677 participants aged ≥20 without self-reported diabetes and with valid blood test results were analyzed. The Classification and Regression Tree (CART) procedure was used to build a classification model on undiagnosed diabetes. A participant demonstrated evidence of diabetes according to WHO diabetes criteria. Exposure variables included demographics and socio-economic status. CART models were fitted using a randomly selected 70% of the data (training dataset), and area under the receiver operating characteristic curve (AUC) was computed using the remaining 30% of the sample for evaluation (testing dataset). CART models were fitted using the training dataset, the oversampled training dataset, the weighted training dataset, and the undersampled training dataset. In addition, resampling case-to-control ratio of 1:1, 1:2, and 1:4 were examined. Resampling methods on the performance of other extensions of CART (random forests and generalized boosted trees) were also examined. CARTs fitted on the oversampled (AUC = 0.70) and undersampled training data (AUC = 0.74) yielded a better classification power than that on the training data (AUC = 0.65). Resampling could also improve the classification power of random forests and generalized boosted trees. To conclude, applying resampling methods in a class-imbalanced dataset improved the classification power of CART, random forests, and generalized boosted trees.

Collapse

Fergus P, Cheung P, Hussain A, Al-Jumeily D, Dobbins C, Iram S. Prediction of preterm deliveries from EHG signals using machine learning. PLoS One 2013;8:e77154. [PMID: 24204760 PMCID: PMC3810473 DOI: 10.1371/journal.pone.0077154] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2013] [Accepted: 08/30/2013] [Indexed: 12/16/2022] Open

Abstract

There has been some improvement in the treatment of preterm infants, which has helped to increase their chance of survival. However, the rate of premature births is still globally increasing. As a result, this group of infants are most at risk of developing severe medical conditions that can affect the respiratory, gastrointestinal, immune, central nervous, auditory and visual systems. In extreme cases, this can also lead to long-term conditions, such as cerebral palsy, mental retardation, learning difficulties, including poor health and growth. In the US alone, the societal and economic cost of preterm births, in 2005, was estimated to be $26.2 billion, per annum. In the UK, this value was close to £2.95 billion, in 2009. Many believe that a better understanding of why preterm births occur, and a strategic focus on prevention, will help to improve the health of children and reduce healthcare costs. At present, most methods of preterm birth prediction are subjective. However, a strong body of evidence suggests the analysis of uterine electrical signals (Electrohysterography), could provide a viable way of diagnosing true labour and predict preterm deliveries. Most Electrohysterography studies focus on true labour detection during the final seven days, before labour. The challenge is to utilise Electrohysterography techniques to predict preterm delivery earlier in the pregnancy. This paper explores this idea further and presents a supervised machine learning approach that classifies term and preterm records, using an open source dataset containing 300 records (38 preterm and 262 term). The synthetic minority oversampling technique is used to oversample the minority preterm class, and cross validation techniques, are used to evaluate the dataset against other similar studies. Our approach shows an improvement on existing studies with 96% sensitivity, 90% specificity, and a 95% area under the curve value with 8% global error using the polynomial classifier.

Collapse

Afzal Z, Schuemie MJ, van Blijderveen JC, Sen EF, Sturkenboom MCJM, Kors JA. Improving sensitivity of machine learning methods for automated case identification from free-text electronic medical records. BMC Med Inform Decis Mak 2013;13:30. [PMID: 23452306 PMCID: PMC3602667 DOI: 10.1186/1472-6947-13-30] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2012] [Accepted: 02/27/2013] [Indexed: 01/18/2023] Open

Yajuan Wang, Simon M, Bonde P, Harris BU, Teuteberg JJ, Kormos RL, Antaki JF. Prognosis of Right Ventricular Failure in Patients With Left Ventricular Assist Device Based on Decision Tree With SMOTE. ACTA ACUST UNITED AC 2012;16:383-90. [DOI: 10.1109/titb.2012.2187458] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Raeder T, Forman G, Chawla NV. Learning from Imbalanced Data: Evaluation Matters. Intelligent Systems Reference Library 2012. [DOI: 10.1007/978-3-642-23166-7_12] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Wang Y, Simon MA, Bonde P, Harris BU, Teuteberg JJ, Kormos RL, Antaki JF. Decision tree for adjuvant right ventricular support in patients receiving a left ventricular assist device. J Heart Lung Transplant 2011;31:140-9. [PMID: 22168963 DOI: 10.1016/j.healun.2011.11.003] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2011] [Revised: 10/12/2011] [Accepted: 11/07/2011] [Indexed: 11/26/2022] Open

Current awareness: Pharmacoepidemiology and drug safety. Pharmacoepidemiol Drug Saf 2009;18:i-x. [DOI: 10.1002/pds.1653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]