1
|
La Paglia L, Vazzana M, Mauro M, Urso A, Arizza V, Vizzini A. Bioactive Molecules from the Innate Immunity of Ascidians and Innovative Methods of Drug Discovery: A Computational Approach Based on Artificial Intelligence. Mar Drugs 2023; 22:6. [PMID: 38276644 PMCID: PMC10817596 DOI: 10.3390/md22010006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 12/12/2023] [Accepted: 12/17/2023] [Indexed: 01/27/2024] Open
Abstract
The study of bioactive molecules of marine origin has created an important bridge between biological knowledge and its applications in biotechnology and biomedicine. Current studies in different research fields, such as biomedicine, aim to discover marine molecules characterized by biological activities that can be used to produce potential drugs for human use. In recent decades, increasing attention has been paid to a particular group of marine invertebrates, the Ascidians, as they are a source of bioactive products. We describe omics data and computational methods relevant to identifying the mechanisms and processes of innate immunity underlying the biosynthesis of bioactive molecules, focusing on innovative computational approaches based on Artificial Intelligence. Since there is increasing attention on finding new solutions for a sustainable supply of bioactive compounds, we propose that a possible improvement in the biodiscovery pipeline might also come from the study and utilization of marine invertebrates' innate immunity.
Collapse
Affiliation(s)
- Laura La Paglia
- Istituto di Calcolo e Reti ad Alte Prestazioni–Consiglio Nazionale delle Ricerche, Via Ugo La Malfa 153, 90146 Palermo, Italy; (L.L.P.); (A.U.)
| | - Mirella Vazzana
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche–Università di Palermo, Via Archirafi 18, 90100 Palermo, Italy; (M.V.); (M.M.); (V.A.)
| | - Manuela Mauro
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche–Università di Palermo, Via Archirafi 18, 90100 Palermo, Italy; (M.V.); (M.M.); (V.A.)
| | - Alfonso Urso
- Istituto di Calcolo e Reti ad Alte Prestazioni–Consiglio Nazionale delle Ricerche, Via Ugo La Malfa 153, 90146 Palermo, Italy; (L.L.P.); (A.U.)
| | - Vincenzo Arizza
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche–Università di Palermo, Via Archirafi 18, 90100 Palermo, Italy; (M.V.); (M.M.); (V.A.)
| | - Aiti Vizzini
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche–Università di Palermo, Via Archirafi 18, 90100 Palermo, Italy; (M.V.); (M.M.); (V.A.)
| |
Collapse
|
2
|
Pantazis LJ, García RA. Detection of atypical response trajectories in biomedical longitudinal databases. Int J Biostat 2023; 19:389-415. [PMID: 36279154 DOI: 10.1515/ijb-2020-0076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 10/03/2022] [Indexed: 11/15/2023]
Abstract
Many health care professionals and institutions manage longitudinal databases, involving follow-ups for different patients over time. Longitudinal data frequently manifest additional complexities such as high variability, correlated measurements and missing data. Mixed effects models have been widely used to overcome these difficulties. This work proposes the use of linear mixed effects models as a tool that allows to search conceptually different types of anomalies in the data simultaneously.
Collapse
Affiliation(s)
- Lucio José Pantazis
- ITBA, Buenos Aires, Lavardén 315, CP 1437, Argentina
- CESyC, Department of Mathematics, Instituto Tecnológico de Buenos Aires, Lavardén 315, Buenos Aires, 1437, Argentina
| | - Rafael Antonio García
- ITBA, Buenos Aires, Lavardén 315, CP 1437, Argentina
- CESyC, Department of Mathematics, Instituto Tecnológico de Buenos Aires, Lavardén 315, Buenos Aires, 1437, Argentina
| |
Collapse
|
3
|
Zhu S, Bai Q, Li L, Xu T. Drug repositioning in drug discovery of T2DM and repositioning potential of antidiabetic agents. Comput Struct Biotechnol J 2022; 20:2839-2847. [PMID: 35765655 PMCID: PMC9189996 DOI: 10.1016/j.csbj.2022.05.057] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 05/30/2022] [Accepted: 05/30/2022] [Indexed: 12/19/2022] Open
Abstract
Repositioning or repurposing drugs account for a substantial part of entering approval pipeline drugs, which indicates that drug repositioning has huge market potential and value. Computational technologies such as machine learning methods have accelerated the process of drug repositioning in the last few decades years. The repositioning potential of type 2 diabetes mellitus (T2DM) drugs for various diseases such as cancer, neurodegenerative diseases, and cardiovascular diseases have been widely studied. Hence, the related summary about repurposing antidiabetic drugs is of great significance. In this review, we focus on the machine learning methods for the development of new T2DM drugs and give an overview of the repurposing potential of the existing antidiabetic agents.
Collapse
Affiliation(s)
- Sha Zhu
- Key Lab of Preclinical Study for New Drugs of Gansu Province, Institute of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu 730000, PR China
| | - Qifeng Bai
- Key Lab of Preclinical Study for New Drugs of Gansu Province, Institute of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu 730000, PR China
- Corresponding author.
| | | | | |
Collapse
|
4
|
Veltri P. Guest Editorial Innovative Data Analysis Methods for Biomedicine. IEEE J Biomed Health Inform 2021. [DOI: 10.1109/jbhi.2021.3116336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
5
|
Farrahi V, Niemelä M, Kärmeniemi M, Puhakka S, Kangas M, Korpelainen R, Jämsä T. Correlates of physical activity behavior in adults: a data mining approach. Int J Behav Nutr Phys Act 2020; 17:94. [PMID: 32703217 PMCID: PMC7376928 DOI: 10.1186/s12966-020-00996-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Accepted: 07/14/2020] [Indexed: 12/31/2022] Open
Abstract
PURPOSE A data mining approach was applied to establish a multilevel hierarchy predicting physical activity (PA) behavior, and to methodologically identify the correlates of PA behavior. METHODS Cross-sectional data from the population-based Northern Finland Birth Cohort 1966 study, collected in the most recent follow-up at age 46, were used to create a hierarchy using the chi-square automatic interaction detection (CHAID) decision tree technique for predicting PA behavior. PA behavior is defined as active or inactive based on machine-learned activity profiles, which were previously created through a multidimensional (clustering) approach on continuous accelerometer-measured activity intensities in one week. The input variables (predictors) used for decision tree fitting consisted of individual, demographical, psychological, behavioral, environmental, and physical factors. Using generalized linear mixed models, we also analyzed how factors emerging from the model were associated with three PA metrics, including daily time (minutes per day) in sedentary (SED), light PA (LPA), and moderate-to-vigorous PA (MVPA), to assure the relative importance of methodologically identified factors. RESULTS Of the 4582 participants with valid accelerometer data at the latest follow-up, 2701 and 1881 had active and inactive profiles, respectively. We used a total of 168 factors as input variables to classify these two PA behaviors. Out of these 168 factors, the decision tree selected 36 factors of different domains from which 54 subgroups of participants were formed. The emerging factors from the model explained minutes per day in SED, LPA, and/or MVPA, including body fat percentage (SED: B = 26.5, LPA: B = - 16.1, and MVPA: B = - 11.7), normalized heart rate recovery 60 s after exercise (SED: B = -16.1, LPA: B = 9.9, and MVPA: B = 9.6), average weekday total sitting time (SED: B = 34.1, LPA: B = -25.3, and MVPA: B = -5.8), and extravagance score (SED: B = 6.3 and LPA: B = - 3.7). CONCLUSIONS Using data mining, we established a data-driven model composed of 36 different factors of relative importance from empirical data. This model may be used to identify subgroups for multilevel intervention allocation and design. Additionally, this study methodologically discovered an extensive set of factors that can be a basis for additional hypothesis testing in PA correlates research.
Collapse
Affiliation(s)
- Vahid Farrahi
- Research Unit of Medical Imaging, Physics and Technology, University of Oulu, P.O. 5000, FI-90014, Oulu, Finland.
| | - Maisa Niemelä
- Research Unit of Medical Imaging, Physics and Technology, University of Oulu, P.O. 5000, FI-90014, Oulu, Finland
| | - Mikko Kärmeniemi
- Medical Research Center, Oulu University Hospital and University of Oulu, Oulu, Finland
- Center for Life Course Health Research, University of Oulu, Oulu, Finland
- Department of Sports and Exercise Medicine, Oulu Deaconess Institute Foundation sr, Oulu, Finland
| | - Soile Puhakka
- Center for Life Course Health Research, University of Oulu, Oulu, Finland
- Department of Sports and Exercise Medicine, Oulu Deaconess Institute Foundation sr, Oulu, Finland
- Geography Research Unit, University of Oulu, Oulu, Finland
| | - Maarit Kangas
- Research Unit of Medical Imaging, Physics and Technology, University of Oulu, P.O. 5000, FI-90014, Oulu, Finland
- Medical Research Center, Oulu University Hospital and University of Oulu, Oulu, Finland
| | - Raija Korpelainen
- Medical Research Center, Oulu University Hospital and University of Oulu, Oulu, Finland
- Center for Life Course Health Research, University of Oulu, Oulu, Finland
- Department of Sports and Exercise Medicine, Oulu Deaconess Institute Foundation sr, Oulu, Finland
| | - Timo Jämsä
- Research Unit of Medical Imaging, Physics and Technology, University of Oulu, P.O. 5000, FI-90014, Oulu, Finland
- Medical Research Center, Oulu University Hospital and University of Oulu, Oulu, Finland
- Diagnostic Radiology, Oulu University Hospital, Oulu, Finland
| |
Collapse
|
6
|
Goedel WC, Jin H, Sutten Coats C, Ogunbajo A, Restar AJ. Predictors of User Engagement With Facebook Posts Generated by a National Sample of Lesbian, Gay, Bisexual, Transgender, and Queer Community Centers in the United States: Content Analysis. JMIR Public Health Surveill 2020; 6:e16382. [PMID: 32012104 PMCID: PMC7013651 DOI: 10.2196/16382] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 10/28/2019] [Accepted: 11/12/2019] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Lesbian, gay, bisexual, transgender, and queer (LGBTQ) community centers remain important venues for reaching and providing crucial health and social services to LGBTQ individuals in the United States. These organizations commonly use Facebook to reach their target audiences, but little is known about factors associated with user engagement with their social media presence. OBJECTIVE This study aimed to identify factors associated with engagement with Facebook content generated by LGBTQ community centers in the United States. METHODS Content generated by LGBTQ community centers in 2017 was downloaded using Facebook's application programming interface. Posts were classified by their content and sentiment. Correlates of user engagement were identified using negative binomial regression. RESULTS A total of 32,014 posts from 175 community centers were collected. Posts with photos (incidence rate ratio, [IRR] 1.07; 95% CI 1.06-1.09) and videos (IRR 1.54; 95% CI 1.52-1.56) that contained a direct invitation for engagement (IRR 1.03; 95% CI 1.02-1.04), that expressed a positive sentiment (IRR 1.11; 95% CI 1.10-1.12), and that contained content related to stigma (IRR 1.16; 95% CI 1.14-1.17), mental health (IRR 1.33; 95% CI 1.31-1.35), and politics (IRR 1.28; 95% CI 1.27-1.29) received higher levels of engagement. CONCLUSIONS The results of this study provide support for the use of Facebook to extend the reach of LGBTQ community centers and highlight multiple factors that can be leveraged to optimize engagement.
Collapse
Affiliation(s)
- William C Goedel
- Department of Epidemiology, School of Public Health, Brown University, Providence, RI, United States
| | - Harry Jin
- Department of Epidemiology, School of Public Health, Brown University, Providence, RI, United States
| | - Cassandra Sutten Coats
- Department of Behavioral and Social Sciences, School of Public Health, Brown University, Providence, RI, United States
| | - Adedotun Ogunbajo
- Department of Behavioral and Social Sciences, School of Public Health, Brown University, Providence, RI, United States
| | - Arjee J Restar
- Department of Behavioral and Social Sciences, School of Public Health, Brown University, Providence, RI, United States
| |
Collapse
|
7
|
Ricciardi C, Valente AS, Edmund K, Cantoni V, Green R, Fiorillo A, Picone I, Santini S, Cesarelli M. Linear discriminant analysis and principal component analysis to predict coronary artery disease. Health Informatics J 2020; 26:2181-2192. [PMID: 31969043 DOI: 10.1177/1460458219899210] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Coronary artery disease is one of the most prevalent chronic pathologies in the modern world, leading to the deaths of thousands of people, both in the United States and in Europe. This article reports the use of data mining techniques to analyse a population of 10,265 people who were evaluated by the Department of Advanced Biomedical Sciences for myocardial ischaemia. Overall, 22 features are extracted, and linear discriminant analysis is implemented twice through both the Knime analytics platform and R statistical programming language to classify patients as either normal or pathological. The former of these analyses includes only classification, while the latter method includes principal component analysis before classification to create new features. The classification accuracies obtained for these methods were 84.5 and 86.0 per cent, respectively, with a specificity over 97 per cent and a sensitivity between 62 and 66 per cent. This article presents a practical implementation of traditional data mining techniques that can be used to help clinicians in decision-making; moreover, principal component analysis is used as an algorithm for feature reduction.
Collapse
Affiliation(s)
| | | | - Kyle Edmund
- Reykjavík University, Iceland; University of Oxford, UK
| | | | | | | | | | | | | |
Collapse
|
8
|
Yang X, Wang Y, Byrne R, Schneider G, Yang S. Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery. Chem Rev 2019; 119:10520-10594. [PMID: 31294972 DOI: 10.1021/acs.chemrev.8b00728] [Citation(s) in RCA: 342] [Impact Index Per Article: 68.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Artificial intelligence (AI), and, in particular, deep learning as a subcategory of AI, provides opportunities for the discovery and development of innovative drugs. Various machine learning approaches have recently (re)emerged, some of which may be considered instances of domain-specific AI which have been successfully employed for drug discovery and design. This review provides a comprehensive portrayal of these machine learning techniques and of their applications in medicinal chemistry. After introducing the basic principles, alongside some application notes, of the various machine learning algorithms, the current state-of-the art of AI-assisted pharmaceutical discovery is discussed, including applications in structure- and ligand-based virtual screening, de novo drug design, physicochemical and pharmacokinetic property prediction, drug repurposing, and related aspects. Finally, several challenges and limitations of the current methods are summarized, with a view to potential future directions for AI-assisted drug discovery and design.
Collapse
Affiliation(s)
- Xin Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Yifei Wang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Ryan Byrne
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Gisbert Schneider
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Shengyong Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| |
Collapse
|
9
|
An Enhanced Symptom Clustering with Profile Based Prescription Suggestion in Biomedical application. J Med Syst 2019; 43:172. [PMID: 31065809 DOI: 10.1007/s10916-019-1311-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Accepted: 04/25/2019] [Indexed: 12/25/2022]
Abstract
The application of data mining has been increasing day to day whereas the data base is also enhancing simultaneously. Hence retrieving required content from a huge data base is a critical task. This paper focus on biomedical engineering field, it concentrates on initial stage of database such as data preprocessing and cleansing to deal with noise and missing data in large biomedical data sets. The database of biomedical is huge and enhancing nature retrieving of specific content will be a critical task. Suggesting prescription with respect to identified disease based on profile analysis of specific patient is not available in current system. This paper proposes a recommendation system of prescription based on disease identification is done by combining user and professional suggestion with profile based analysis. Hence this focuses on profile based suggestions and report will be generated. The retrieving of specific suggestion from a huge database is done by hybrid feature selection algorithm. This approach focuses on enabling recommendation based on user profile and implementing Hybrid feature selection algorithm to retrieve specific content from a huge database. Hence it attains better retrieval of required content from a huge database compared to other existing approaches and suggests better recommendation with respect to user profile.
Collapse
|
10
|
Piwowar M, Kocemba-Pilarczyk KA, Piwowar P. Regularization and grouping -omics data by GCA method: A transcriptomic case. PLoS One 2018; 13:e0206608. [PMID: 30383819 PMCID: PMC6211732 DOI: 10.1371/journal.pone.0206608] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Accepted: 10/16/2018] [Indexed: 11/22/2022] Open
Abstract
The paper presents the application of Grade Correspondence Analysis (GCA) and Grade Correspondence Cluster Analysis (GCCA) for ordering and grouping -omics datasets, using transcriptomic data as an example. Based on gene expression data describing 256 patients with Multiple Myeloma it was shown that the GCA method could be used to find regularities in the analyzed collections and to create characteristic gene expression profiles for individual groups of patients. GCA iteratively permutes rows and columns to maximize the tau-Kendall or rho-Spearman coefficients, which makes it possible to arrange rows and columns in such a way that the most similar ones remain in each other’s neighbourhood. In this way, the GCA algorithm highlights regularities in the data matrix. The ranked data can then be grouped using the GCCA method, and after that aggregated in clusters, providing a representation that is easier to analyze–especially in the case of large sets of gene expression profiles. Regularization of transcriptomic data, which is presented in this manuscript, has enabled division of the data set into column clusters (representing genes) and row clusters (representing patients). Subsequently, rows were aggregated (based on medians) to visualise the gene expression profiles for patients with Multiple Myeloma in each collection. The presented analysis became the starting point for characterisation of differentiated genes and biochemical processes in which they are involved. GCA analysis may provide an alternative analytical method to support differentiation and analysis of gene expression profiles characterising individual groups of patients.
Collapse
Affiliation(s)
- Monika Piwowar
- Department of Bioinformatics and Telemedicine, Jagiellonian University–Medical College, Krakow, Poland
- * E-mail:
| | | | - Piotr Piwowar
- AGH University of Science and Technology, Faculty of Electrical Engineering, Automatics, Computer Science and Biomedical Engineering, Department of Measurements and Electronic, Krakow, Poland
| |
Collapse
|
11
|
Abrams ZB, Zucker M, Wang M, Asiaee Taheri A, Abruzzo LV, Coombes KR. Thirty biologically interpretable clusters of transcription factors distinguish cancer type. BMC Genomics 2018; 19:738. [PMID: 30305013 PMCID: PMC6180590 DOI: 10.1186/s12864-018-5093-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Accepted: 09/19/2018] [Indexed: 12/27/2022] Open
Abstract
Background Transcription factors are essential regulators of gene expression and play critical roles in development, differentiation, and in many cancers. To carry out their regulatory programs, they must cooperate in networks and bind simultaneously to sites in promoter or enhancer regions of genes. We hypothesize that the mRNA co-expression patterns of transcription factors can be used both to learn how they cooperate in networks and to distinguish between cancer types. Results We recently developed a new algorithm, Thresher, that combines principal component analysis, outlier filtering, and von Mises-Fisher mixture models to cluster genes (in this case, transcription factors) based on expression, determining the optimal number of clusters in the process. We applied Thresher to the RNA-Seq expression data of 486 transcription factors from more than 10,000 samples of 33 kinds of cancer studied in The Cancer Genome Atlas (TCGA). We found that 30 clusters of transcription factors from a 29-dimensional principal component space were able to distinguish between most cancer types, and could separate tumor samples from normal controls. Moreover, each cluster of transcription factors could be either (i) linked to a tissue-specific expression pattern or (ii) associated with a fundamental biological process such as cell cycle, angiogenesis, apoptosis, or cytoskeleton. Clusters of the second type were more likely also to be associated with embryonically lethal mouse phenotypes. Conclusions Using our approach, we have shown that the mRNA expression patterns of transcription factors contain most of the information needed to distinguish different cancer types. The Thresher method is capable of discovering biologically interpretable clusters of genes. It can potentially be applied to other gene sets, such as signaling pathways, to decompose them into simpler, yet biologically meaningful, components. Electronic supplementary material The online version of this article (10.1186/s12864-018-5093-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Zachary B Abrams
- Department of Biomedical Informatics, The Ohio State University, 1800 Cannon Drive, Columbus, 43210, OH, USA
| | - Mark Zucker
- Department of Biomedical Informatics, The Ohio State University, 1800 Cannon Drive, Columbus, 43210, OH, USA
| | - Min Wang
- Department of Biomedical Informatics, The Ohio State University, 1800 Cannon Drive, Columbus, 43210, OH, USA.,Mathematical Biosciences Institute, The Ohio State University, 1735 Neil Avenue, Columbus, 43210, OH, USA
| | - Amir Asiaee Taheri
- Department of Biomedical Informatics, The Ohio State University, 1800 Cannon Drive, Columbus, 43210, OH, USA.,Mathematical Biosciences Institute, The Ohio State University, 1735 Neil Avenue, Columbus, 43210, OH, USA
| | - Lynne V Abruzzo
- Department of Pathology, The Ohio State University, 129 Hamilton Hall, 1645 Neil Avenue, Columbus, 43210, OH, USA
| | - Kevin R Coombes
- Department of Biomedical Informatics, The Ohio State University, 1800 Cannon Drive, Columbus, 43210, OH, USA.
| |
Collapse
|
12
|
Harrison E, Dreisbach C, Basit N, Keim-Malpass J. An Application of Data Mining Techniques to Explore Congressional Lobbying Records for Patterns in Pediatric Special Interest Expenditures Prior to the Affordable Care Act. Front Big Data 2018; 1:3. [PMID: 33693319 PMCID: PMC7931899 DOI: 10.3389/fdata.2018.00003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Accepted: 08/02/2018] [Indexed: 11/13/2022] Open
Affiliation(s)
- Elizabeth Harrison
- Data Science Institute, University of Virginia, Charlottesville, VA, United States
- School of Medicine, University of Virginia, Charlottesville, VA, United States
| | - Caitlin Dreisbach
- Data Science Institute, University of Virginia, Charlottesville, VA, United States
- School of Nursing, University of Virginia, Charlottesville, VA, United States
- *Correspondence: Caitlin Dreisbach
| | - Nada Basit
- Data Science Institute, University of Virginia, Charlottesville, VA, United States
- Department of Computer Science, University of Virginia, Charlottesville, VA, United States
| | | |
Collapse
|
13
|
Carter J, Tribe RM, Sandall J, Shennan AH. The Preterm Clinical Network (PCN) Database: a web-based systematic method of collecting data on the care of women at risk of preterm birth. BMC Pregnancy Childbirth 2018; 18:335. [PMID: 30119660 PMCID: PMC6098573 DOI: 10.1186/s12884-018-1967-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2017] [Accepted: 08/06/2018] [Indexed: 11/10/2022] Open
Abstract
Background Despite much research effort, there is a paucity of conclusive evidence in the field of preterm birth prediction and prevention. The methods of monitoring and prevention strategies offered to women at risk vary considerably around the UK and depend on local maternity care provision. It is becoming increasingly recognised that this experience and knowledge, if captured on a larger scale, could be a utilized as a valuable source of evidence for others. The UK Preterm Clinical Network (UKPCN) was established with the aim of improving care and outcomes for women at risk of preterm birth through the sharing of a wealth of experience and knowledge, as well as the building of clinical and research collaboration. The design and development of a bespoke internet-based database was fundamental to achieving this aim. Method Following consultation with UKPCN members and agreement on a minimal dataset, the Preterm Clinical Network (PCN) Database was constructed to collect data from women at risk of preterm birth and their children. Information Governance and research ethics committee approval was given for the storage of historical as well as prospectively collected data. Collaborating centres have instant access to their own records, while use of pooled data is governed by the PCN Database Access Committee. Applications are welcomed from UKPCN members and other established research groups. The results of investigations using the data are expected to provide insights into the effectiveness of current surveillance practices and preterm birth interventions on a national and international scale, as well as the generation of ideas for innovation and research. To date, 31 sites are registered as Data Collection Centres, four of which are outside the UK. Conclusion This paper outlines the aims of the PCN Database along with the development process undertaken from the initial idea to live launch.
Collapse
Affiliation(s)
- Jenny Carter
- Department of Women and Children's Health, School of Life Course Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK.
| | - Rachel M Tribe
- Department of Women and Children's Health, School of Life Course Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | - Jane Sandall
- Department of Women and Children's Health, School of Life Course Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | - Andrew H Shennan
- Department of Women and Children's Health, School of Life Course Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | | |
Collapse
|
14
|
Card KG, Lachowsky N, Hawkins BW, Jollimore J, Baharuddin F, Hogg RS. Predictors of Facebook User Engagement With Health-Related Content for Gay, Bisexual, and Other Men Who Have Sex With Men: Content Analysis. JMIR Public Health Surveill 2018; 4:e38. [PMID: 29625953 PMCID: PMC5910534 DOI: 10.2196/publichealth.8145] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 01/24/2018] [Accepted: 02/12/2018] [Indexed: 11/17/2022] Open
Abstract
Background Social media is used by community-based organizations (CBOs) to promote the well-being of gay and bisexual men (GBM). However, few studies have quantified which factors facilitate the diffusion of health content tailored for sexual minorities. Objective The aim of this study was to identify post characteristics that can be leveraged to optimize the health promotion efforts of CBOs on Facebook. Methods The Facebook application programming interface was used to collect 5 years’ of posts shared across 10 Facebook pages administered by Vancouver-based CBOs promoting GBM health. Network analysis assessed basic indicators of network structure. Content analyses were conducted using informatics-based approaches. Hierarchical negative binomial regression of post engagement data was used to identify meaningful covariates of engagement. Results In total, 14,071 posts were shared and 21,537 users engaged with these posts. Most users (n=13,315) engaged only once. There was moderate correlation between the number of posts and the number of CBOs users engaged with (r=.53, P<.001). Higher user engagement was positively associated with positive sentiment, sharing multimedia, and posting about pre-exposure prophylaxis, stigma, and mental health. Engagement was negatively associated with asking questions, posting about dating, and sharing posts during or after work (versus before). Conclusions Results highlight the existence of a core group of Facebook users who facilitate diffusion. Factors associated with greater user engagement present CBOs with a number of strategies for improving the diffusion of health content.
Collapse
Affiliation(s)
- Kiffer George Card
- Faculty of Health Science, Simon Fraser University, Burnaby, BC, Canada.,British Columbia Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
| | - Nathan Lachowsky
- British Columbia Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada.,School of Public Health and Social Policy, University of Victoria, Victoria, BC, Canada
| | - Blake W Hawkins
- Interdisciplinary Studies Graduate Program, University of British Columbia, Vancouver, BC, Canada
| | - Jody Jollimore
- Community-Based Research Centre for Gay Men's Health, Vancouver, BC, Canada
| | | | - Robert S Hogg
- Faculty of Health Science, Simon Fraser University, Burnaby, BC, Canada.,British Columbia Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
| |
Collapse
|
15
|
Adeola HA, Van Wyk JC, Arowolo A, Ngwanya RM, Mkentane K, Khumalo NP. Emerging Diagnostic and Therapeutic Potentials of Human Hair Proteomics. Proteomics Clin Appl 2017; 12. [PMID: 28960873 DOI: 10.1002/prca.201700048] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Revised: 06/09/2017] [Indexed: 01/22/2023]
Abstract
The use of noninvasive human substrates to interrogate pathophysiological conditions has become essential in the post- Human Genome Project era. Due to its high turnover rate, and its long term capability to incorporate exogenous and endogenous substances from the circulation, hair testing is emerging as a key player in monitoring long term drug compliance, chronic alcohol abuse, forensic toxicology, and biomarker discovery, among other things. Novel high-throughput 'omics based approaches like proteomics have been underutilized globally in comprehending human hair morphology and its evolving use as a diagnostic testing substrate in the era of precision medicine. There is paucity of scientific evidence that evaluates the difference in drug incorporation into hair based on lipid content, and very few studies have addressed hair growth rates, hair forms, and the biological consequences of hair grooming or bleaching. It is apparent that protein-based identification using the human hair proteome would play a major role in understanding these parameters akin to DNA single nucleotide polymorphism profiling, up to single amino acid polymorphism resolution. Hence, this work seeks to identify and discuss the progress made thus far in the field of molecular hair testing using proteomic approaches, and identify ways in which proteomics would improve the field of hair research, considering that the human hair is mostly composed of proteins. Gaps in hair proteomics research are identified and the potential of hair proteomics in establishing a historic medical repository of normal and disease-specific proteome is also discussed.
Collapse
Affiliation(s)
- Henry A Adeola
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| | - Jennifer C Van Wyk
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| | - Afolake Arowolo
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| | - Reginald M Ngwanya
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa
| | - Khwezikazi Mkentane
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| | - Nonhlanhla P Khumalo
- Division of Dermatology, Department of Medicine, Faculty of Health Sciences and Groote Schuur Hospital, University of Cape Town, Cape Town, South Africa.,Hair and Skin Research Laboratory, Groote Schuur Hospital, Cape Town, South Africa
| |
Collapse
|
16
|
Baier AW, Snyder DJ, Leahy IC, Patak LS, Brustowicz RM. A Shared Opportunity for Improving Electronic Medical Record Data. Anesth Analg 2017. [PMID: 28632540 DOI: 10.1213/ane.0000000000002134] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
With the recent rapid adoption of electronic medical records (EMRs), studies reporting results based on EMR data have become increasingly common. While analyzing data extracted from our EMR for a retrospective study, we identified various types of erroneous data entries. This report investigates the root causes of the incompleteness, inconsistency, and inaccuracy of the medical records analyzed in our study. While experienced health information management professionals are well aware of the many shortcomings with EMR data, the aims of this case study are to highlight the significance of the negative impact of erroneous EMR data, to provide fundamental principles for managing EMRs, and to provide recommendations to help facilitate the successful use of electronic health data, whether to inform clinical decisions or for clinical research.
Collapse
Affiliation(s)
- Amanda W Baier
- From the *Department of Anesthesiology, Perioperative and Pain Medicine, Boston Children's Hospital, Boston, Massachusetts; †Department of Anaesthesia, Harvard Medical School, Boston, Massachusetts; and ‡Department of Anesthesiology and Pain Medicine, Seattle Children's Hospital, Seattle, Washington
| | | | | | | | | |
Collapse
|
17
|
Kwon Y, Natori Y, Tanokura M. New approach to generating insights for aging research based on literature mining and knowledge integration. PLoS One 2017; 12:e0183534. [PMID: 28817730 PMCID: PMC5560588 DOI: 10.1371/journal.pone.0183534] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Accepted: 08/05/2017] [Indexed: 01/01/2023] Open
Abstract
The proportion of the elderly population in most countries worldwide is increasing dramatically. Therefore, social interest in the fields of health, longevity, and anti-aging has been increasing as well. However, the basic research results obtained from a reductionist approach in biology and a bioinformatic approach in genome science have limited usefulness for generating insights on future health, longevity, and anti-aging-related research on a case by case basis. We propose a new approach that uses our literature mining technique and bioinformatics, which lead to a better perspective on research trends by providing an expanded knowledge base to work from. We demonstrate that our approach provides useful information that deepens insights on future trends which differs from data obtained conventionally, and this methodology is already paving the way for a new field in aging-related research based on literature mining. One compelling example of this is how our new approach can be a useful tool in drug repositioning.
Collapse
Affiliation(s)
- Yeondae Kwon
- Laboratory of Basic Science on Healthy Longevity, Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Yukikazu Natori
- Laboratory of Basic Science on Healthy Longevity, Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Masaru Tanokura
- Laboratory of Basic Science on Healthy Longevity, Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
- * E-mail:
| |
Collapse
|
18
|
Mudumbai S, Ayer F, Stefanko J. Perioperative and ICU Healthcare Analytics within a Veterans Integrated System Network: a Qualitative Gap Analysis. J Med Syst 2017; 41:118. [PMID: 28685304 DOI: 10.1007/s10916-017-0762-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 06/09/2017] [Indexed: 11/24/2022]
Abstract
Health care facilities are implementing analytics platforms as a way to document quality of care. However, few gap analyses exist on platforms specifically designed for patients treated in the Operating Room, Post-Anesthesia Care Unit, and Intensive Care Unit (ICU). As part of a quality improvement effort, we undertook a gap analysis of an existing analytics platform within the Veterans Healthcare Administration. The objectives were to identify themes associated with 1) current clinical use cases and stakeholder needs; 2) information flow and pain points; and 3) recommendations for future analytics development. Methods consisted of semi-structured interviews in 2 phases with a diverse set (n = 9) of support personnel and end users from five facilities across a Veterans Integrated Service Network. Phase 1 identified underlying needs and previous experiences with the analytics platform across various roles and operational responsibilities. Phase 2 validated preliminary feedback, lessons learned, and recommendations for improvement. Emerging themes suggested that the existing system met a small pool of national reporting requirements. However, pain points were identified with accessing data in several information system silos and performing multiple manual validation steps of data content. Notable recommendations included enhancing systems integration to create "one-stop shopping" for data, and developing a capability to perform trends analysis. Our gap analysis suggests that analytics platforms designed for surgical and ICU patients should employ approaches similar to those being used for primary care patients.
Collapse
Affiliation(s)
- Seshadri Mudumbai
- VA Palo Alto Health Care System, Anesthesiology and Perioperative Care Service, Palo Alto, CA, USA. .,Department of Anesthesiology, Perioperative, and Pain Medicine, Stanford University School of Medicine, Stanford, CA, USA.
| | - Ferenc Ayer
- VA Office of Quality, Safety and Value, Product Effectiveness Program, Washington, DC, USA
| | | |
Collapse
|
19
|
Yang K, Peretz-Soroka H, Wu J, Zhu L, Cui X, Zhang M, Rigatto C, Liu Y, Lin F. Fibroblast growth factor 23 weakens chemotaxis of human blood neutrophils in microfluidic devices. Sci Rep 2017; 7:3100. [PMID: 28596573 PMCID: PMC5465076 DOI: 10.1038/s41598-017-03210-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2017] [Accepted: 04/26/2017] [Indexed: 01/22/2023] Open
Abstract
Neutrophil trafficking in tissues critically regulates the body’s immune response. Neutrophil migration can either play a protective role in host defense or cause health problems. Fibroblast growth factor 23 (FGF23) is a known biomarker for chronic kidney disease (CKD) and was recently shown to impair neutrophil arrest on endothelium and transendothelial migration. In the present study, we further examined the effect of FGF23 on human blood neutrophil chemotaxis using two new microfluidic devices. Our results showed that chemotaxis of FGF23 pre-treated neutrophils to a fMLP gradient, in the presence or absence of a uniform FGF23 background, is quantitatively lower compared to the control cells. This effect is accompanied with a stronger drifting of FGF23 pre-treated cells along the flow. However, without the FGF23 pre-treatment, the FGF23 background only reduces chemotaxis of transmigrated cells through the thin barrier channel to the fMLP gradient. The effect of FGF23 on neutrophil migration and the correlation between multiple cell migration parameters are further revealed by chemotactic entropy and principle component analysis. Collectively, these results revealed the effect of FGF23 on weakening neutrophil chemotaxis, which shed light on FGF23 mediated neutrophil migration with direct disease relevance such as CKD.
Collapse
Affiliation(s)
- Ke Yang
- Institute of Applied Technology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, Anhui, P.R. China.,Department of Physics and Astronomy, University of Manitoba, Winnipeg, MB, Canada
| | - Hagit Peretz-Soroka
- Department of Physics and Astronomy, University of Manitoba, Winnipeg, MB, Canada
| | - Jiandong Wu
- Department of Physics and Astronomy, University of Manitoba, Winnipeg, MB, Canada
| | - Ling Zhu
- Institute of Applied Technology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, Anhui, P.R. China
| | - Xueling Cui
- Department of Genetics, Jilin University, Jilin Sheng, China
| | | | | | - Yong Liu
- Institute of Applied Technology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, Anhui, P.R. China
| | - Francis Lin
- Department of Physics and Astronomy, University of Manitoba, Winnipeg, MB, Canada. .,Department of Biosystems Engineering, University of Manitoba, Winnipeg, MB, Canada. .,Department of Immunology, University of Manitoba, Winnipeg, MB, Canada. .,Department of Biological Sciences, University of Manitoba, Winnipeg, MB, Canada.
| |
Collapse
|
20
|
|
21
|
Sedlmayr M, Würfl T, Maier C, Häberle L, Fasching P, Prokosch HU, Christoph J. Optimizing R with SparkR on a commodity cluster for biomedical research. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2016; 137:321-328. [PMID: 28110735 DOI: 10.1016/j.cmpb.2016.10.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2016] [Revised: 08/16/2016] [Accepted: 10/06/2016] [Indexed: 06/06/2023]
Abstract
BACKGROUND AND OBJECTIVES Medical researchers are challenged today by the enormous amount of data collected in healthcare. Analysis methods such as genome-wide association studies (GWAS) are often computationally intensive and thus require enormous resources to be performed in a reasonable amount of time. While dedicated clusters and public clouds may deliver the desired performance, their use requires upfront financial efforts or anonymous data, which is often not possible for preliminary or occasional tasks. We explored the possibilities to build a private, flexible cluster for processing scripts in R based on commodity, non-dedicated hardware of our department. METHODS For this, a GWAS-calculation in R on a single desktop computer, a Message Passing Interface (MPI)-cluster, and a SparkR-cluster were compared with regards to the performance, scalability, quality, and simplicity. RESULTS The original script had a projected runtime of three years on a single desktop computer. Optimizing the script in R already yielded a significant reduction in computing time (2 weeks). By using R-MPI and SparkR, we were able to parallelize the computation and reduce the time to less than three hours (2.6 h) on already available, standard office computers. While MPI is a proven approach in high-performance clusters, it requires rather static, dedicated nodes. SparkR and its Hadoop siblings allow for a dynamic, elastic environment with automated failure handling. SparkR also scales better with the number of nodes in the cluster than MPI due to optimized data communication. CONCLUSION R is a popular environment for clinical data analysis. The new SparkR solution offers elastic resources and allows supporting big data analysis using R even on non-dedicated resources with minimal change to the original code. To unleash the full potential, additional efforts should be invested to customize and improve the algorithms, especially with regards to data distribution.
Collapse
Affiliation(s)
- Martin Sedlmayr
- Friedrich-Alexander University Erlangen-Nürnberg, Wetterkreuz 13, 91058 Erlangen, Germany.
| | - Tobias Würfl
- Friedrich-Alexander University Erlangen-Nürnberg, Wetterkreuz 13, 91058 Erlangen, Germany
| | - Christian Maier
- Friedrich-Alexander University Erlangen-Nürnberg, Wetterkreuz 13, 91058 Erlangen, Germany
| | - Lothar Häberle
- Department of Gynecology and Obstetrics, Erlangen University Hospital, Universitätsstrasse 21-23, 91054 Erlangen, Germany
| | - Peter Fasching
- Department of Gynecology and Obstetrics, Erlangen University Hospital, Universitätsstrasse 21-23, 91054 Erlangen, Germany
| | - Hans-Ulrich Prokosch
- Friedrich-Alexander University Erlangen-Nürnberg, Wetterkreuz 13, 91058 Erlangen, Germany
| | - Jan Christoph
- Friedrich-Alexander University Erlangen-Nürnberg, Wetterkreuz 13, 91058 Erlangen, Germany
| |
Collapse
|
22
|
A systematic literature review on the ethics of palliative sedation: an update (2016). Curr Opin Support Palliat Care 2016; 10:201-7. [DOI: 10.1097/spc.0000000000000224] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
23
|
Middleton B, Sittig DF, Wright A. Clinical Decision Support: a 25 Year Retrospective and a 25 Year Vision. Yearb Med Inform 2016; Suppl 1:S103-16. [PMID: 27488402 DOI: 10.15265/iys-2016-s034] [Citation(s) in RCA: 92] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
OBJECTIVE The objective of this review is to summarize the state of the art of clinical decision support (CDS) circa 1990, review progress in the 25 year interval from that time, and provide a vision of what CDS might look like 25 years hence, or circa 2040. METHOD Informal review of the medical literature with iterative review and discussion among the authors to arrive at six axes (data, knowledge, inference, architecture and technology, implementation and integration, and users) to frame the review and discussion of selected barriers and facilitators to the effective use of CDS. RESULT In each of the six axes, significant progress has been made. Key advances in structuring and encoding standardized data with an increased availability of data, development of knowledge bases for CDS, and improvement of capabilities to share knowledge artifacts, explosion of methods analyzing and inferring from clinical data, evolution of information technologies and architectures to facilitate the broad application of CDS, improvement of methods to implement CDS and integrate CDS into the clinical workflow, and increasing sophistication of the end-user, all have played a role in improving the effective use of CDS in healthcare delivery. CONCLUSION CDS has evolved dramatically over the past 25 years and will likely evolve just as dramatically or more so over the next 25 years. Increasingly, the clinical encounter between a clinician and a patient will be supported by a wide variety of cognitive aides to support diagnosis, treatment, care-coordination, surveillance and prevention, and health maintenance or wellness.
Collapse
Affiliation(s)
- B Middleton
- Blackford Middleton, Cell: +1 617 335 7098, E-Mail:
| | | | | |
Collapse
|
24
|
Hodos RA, Kidd BA, Khader S, Readhead BP, Dudley JT. In silico methods for drug repurposing and pharmacology. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2016; 8:186-210. [PMID: 27080087 PMCID: PMC4845762 DOI: 10.1002/wsbm.1337] [Citation(s) in RCA: 168] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Revised: 02/08/2016] [Accepted: 02/11/2016] [Indexed: 12/18/2022]
Abstract
Data in the biological, chemical, and clinical domains are accumulating at ever-increasing rates and have the potential to accelerate and inform drug development in new ways. Challenges and opportunities now lie in developing analytic tools to transform these often complex and heterogeneous data into testable hypotheses and actionable insights. This is the aim of computational pharmacology, which uses in silico techniques to better understand and predict how drugs affect biological systems, which can in turn improve clinical use, avoid unwanted side effects, and guide selection and development of better treatments. One exciting application of computational pharmacology is drug repurposing-finding new uses for existing drugs. Already yielding many promising candidates, this strategy has the potential to improve the efficiency of the drug development process and reach patient populations with previously unmet needs such as those with rare diseases. While current techniques in computational pharmacology and drug repurposing often focus on just a single data modality such as gene expression or drug-target interactions, we argue that methods such as matrix factorization that can integrate data within and across diverse data types have the potential to improve predictive performance and provide a fuller picture of a drug's pharmacological action. WIREs Syst Biol Med 2016, 8:186-210. doi: 10.1002/wsbm.1337 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Rachel A Hodos
- New York University and Icahn School of Medicine at Mt. Sinai, New York, NY
| | - Brian A Kidd
- Icahn School of Medicine at Mt. Sinai, New York, NY
| | | | | | | |
Collapse
|
25
|
Schmidt JM, De Georgia M. Multimodality monitoring: informatics, integration data display and analysis. Neurocrit Care 2015; 21 Suppl 2:S229-38. [PMID: 25208675 DOI: 10.1007/s12028-014-0037-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
The goal of multimodality neuromonitoring is to provide continuous, real-time assessment of brain physiology to prevent, detect, and attenuate secondary brain injury. Clinical informatics deals with biomedical data, information, and knowledge including their acquisition, storage, retrieval, and optimal use for clinical decision-making. An electronic literature search was conducted for English language articles describing the use of informatics in the intensive care unit setting from January 1990 to August 2013. A total of 64 studies were included in this review. Clinical informatics infrastructure should be adopted that enables a wide range of linear and nonlinear analytical methods be applied to patient data. Specific time epochs of clinical interest should be reviewable. Analysis strategies of monitor alarms may help address alarm fatigue. Ergonomic data display that present results from analyses with clinical information in a sensible uncomplicated manner improve clinical decision-making. Collecting and archiving the highest resolution physiologic and phenotypic data in a comprehensive open format data warehouse is a crucial first step toward information management and two-way translational research for multimodality monitoring. The infrastructure required is largely the same as that needed for telemedicine intensive care applications, which under the right circumstances improves care quality while reducing cost.
Collapse
Affiliation(s)
- J Michael Schmidt
- Division of Critical Care Neurology, Neurological Institute, Columbia University College of Physicians and Surgeons, 177 Fort Washington Avenue, MHB Suite 8-300, New York, NY, 10032, USA,
| | | | | |
Collapse
|
26
|
Martín-González F, González-Robledo J, Sánchez-Hernández F, Moreno-García MN. Success/Failure Prediction of Noninvasive Mechanical Ventilation in Intensive Care Units. Using Multiclassifiers and Feature Selection Methods. Methods Inf Med 2015; 55:234-41. [PMID: 25925616 DOI: 10.3414/me14-01-0015] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2014] [Accepted: 03/18/2015] [Indexed: 11/09/2022]
Abstract
OBJECTIVES This paper addresses the problem of decision-making in relation to the administration of noninvasive mechanical ventilation (NIMV) in intensive care units. METHODS Data mining methods were employed to find out the factors influencing the success/failure of NIMV and to predict its results in future patients. These artificial intelligence-based methods have not been applied in this field in spite of the good results obtained in other medical areas. RESULTS Feature selection methods provided the most influential variables in the success/failure of NIMV, such as NIMV hours, PaCO2 at the start, PaO2 / FiO2 ratio at the start, hematocrit at the start or PaO2 / FiO2 ratio after two hours. These methods were also used in the preprocessing step with the aim of improving the results of the classifiers. The algorithms provided the best results when the dataset used as input was the one containing the attributes selected with the CFS method. CONCLUSIONS Data mining methods can be successfully applied to determine the most influential factors in the success/failure of NIMV and also to predict NIMV results in future patients. The results provided by classifiers can be improved by preprocessing the data with feature selection techniques.
Collapse
Affiliation(s)
| | | | | | - María N Moreno-García
- María N. Moreno-García, University of Salamanca, Department of Computing and Automation, Plaza de los Caídos s/n, 37008 Salamanca, Spain, E-mail:
| |
Collapse
|
27
|
Lotfnezhad Afshar H, Ahmadi M, Roudbari M, Sadoughi F. Prediction of breast cancer survival through knowledge discovery in databases. Glob J Health Sci 2015; 7:392-8. [PMID: 25946945 PMCID: PMC4802184 DOI: 10.5539/gjhs.v7n4p392] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2014] [Accepted: 11/25/2014] [Indexed: 11/12/2022] Open
Abstract
The collection of large volumes of medical data has offered an opportunity to develop prediction models for survival by the medical research community. Medical researchers who seek to discover and extract hidden patterns and relationships among large number of variables use knowledge discovery in databases (KDD) to predict the outcome of a disease. The study was conducted to develop predictive models and discover relationships between certain predictor variables and survival in the context of breast cancer. This study is Cross sectional. After data preparation, data of 22,763 female patients, mean age 59.4 years, stored in the Surveillance Epidemiology and End Results (SEER) breast cancer dataset were analyzed anonymously. IBM SPSS Statistics 16, Access 2003 and Excel 2003 were used in the data preparation and IBM SPSS Modeler 14.2 was used in the model design. Support Vector Machine (SVM) model outperformed other models in the prediction of breast cancer survival. Analysis showed SVM model detected ten important predictor variables contributing mostly to prediction of breast cancer survival. Among important variables, behavior of tumor as the most important variable and stage of malignancy as the least important variable were identified. In current study, applying of the knowledge discovery method in the breast cancer dataset predicted the survival condition of breast cancer patients with high confidence and identified the most important variables participating in breast cancer survival.
Collapse
Affiliation(s)
- Hadi Lotfnezhad Afshar
- 1. Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran 2. Department of Health Information Technology (HIT), Faculty of Paramedicine, Urmia University of Medical Sciences, Urmia, Iran.
| | | | | | | |
Collapse
|
28
|
|
29
|
Partington SN, Papakroni V, Menzies T. Optimizing data collection for public health decisions: a data mining approach. BMC Public Health 2014; 14:593. [PMID: 24919484 PMCID: PMC4077265 DOI: 10.1186/1471-2458-14-593] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2013] [Accepted: 06/02/2014] [Indexed: 11/21/2022] Open
Abstract
Background Collecting data can be cumbersome and expensive. Lack of relevant, accurate and timely data for research to inform policy may negatively impact public health. The aim of this study was to test if the careful removal of items from two community nutrition surveys guided by a data mining technique called feature selection, can (a) identify a reduced dataset, while (b) not damaging the signal inside that data. Methods The Nutrition Environment Measures Surveys for stores (NEMS-S) and restaurants (NEMS-R) were completed on 885 retail food outlets in two counties in West Virginia between May and November of 2011. A reduced dataset was identified for each outlet type using feature selection. Coefficients from linear regression modeling were used to weight items in the reduced datasets. Weighted item values were summed with the error term to compute reduced item survey scores. Scores produced by the full survey were compared to the reduced item scores using a Wilcoxon rank-sum test. Results Feature selection identified 9 store and 16 restaurant survey items as significant predictors of the score produced from the full survey. The linear regression models built from the reduced feature sets had R2 values of 92% and 94% for restaurant and grocery store data, respectively. Conclusions While there are many potentially important variables in any domain, the most useful set may only be a small subset. The use of feature selection in the initial phase of data collection to identify the most influential variables may be a useful tool to greatly reduce the amount of data needed thereby reducing cost.
Collapse
Affiliation(s)
- Susan N Partington
- Division of Animal and Nutritional Sciences, West Virginia University, Morgantown, WV, USA.
| | | | | |
Collapse
|
30
|
Haux R. On determining factors for good research in biomedical and health informatics. Some lessons learned. Yearb Med Inform 2014; 9:255-64. [PMID: 24853031 DOI: 10.15265/iy-2014-0025] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
OBJECTIVE What are the determining factors for good research in medical informatics or, from a broader perspective, in biomedical and health informatics? METHOD From the many lessons learned during my professional career, I tried to identify a fair sampling of such factors. On the occasion of giving the IMIA Award of Excellence lecture during MedInfo 2013, they were presented for discussion. RESULTS Sixteen determining factors (df) have been identified: early identification and promotion (df1), appropriate education (df2), stimulating persons and environments (df3), sufficient time and backtracking opportunities (df4), breadth of medical informatics competencies (df5), considering the necessary preconditions for good medical informatics research (df6), easy access to high-quality knowledge (df7), sufficient scientific career opportunities (df8), appropriate conditions for sustainable research (df9), ability to communicate and to solve problems (df10), as well as to convey research results (df11) in a highly inter- and multidisciplinary environment, ability to think for all and, when needed, taking the lead (df12), always staying unbiased (df13), always keeping doubt (df14), but also always trying to provide solutions (df15), and, finally, being aware that life is more (df16). CONCLUSIONS Medical Informatics is an inter- and multidisciplinary discipline "avant la lettre". Compared to monodisciplinary research, inter- and multidisciplinary research does not only provide significant opportunities for solving major problems in science and in society. It also faces considerable additional challenges for medical informatics as a scientific field. The determining factors, presented here, are in my opinion crucial for conducting successful research and for developing a research career. Since medical informatics as a field has today become an important driving force for research progress, especially in biomedicine and health care, but also in fields like computer science, it may be helpful to consider such factors in relation with research and education in our discipline.
Collapse
Affiliation(s)
- R Haux
- Prof. Dr. Reinhold Haux, Peter L. Reichertz Institute for Medical Informatics, University of Braunschweig - Institute of Technology and Hannover Medical School, Muehlenpfordtstr. 23, D-38106 Braunschweig, Germany, Tel: +49(0)531 391 9500, Fax: +49(0)531 391 9502, E-mail: , www.plri.de
| |
Collapse
|
31
|
Geldermann I, Grouls C, Kuhl C, Deserno TM, Spreckelsen C. Black box integration of computer-aided diagnosis into PACS deserves a second chance: results of a usability study concerning bone age assessment. J Digit Imaging 2014; 26:698-708. [PMID: 23529647 DOI: 10.1007/s10278-013-9590-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022] Open
Abstract
Usability aspects of different integration concepts for picture archiving and communication systems (PACS) and computer-aided diagnosis (CAD) were inquired on the example of BoneXpert, a program determining the skeletal age from a left hand's radiograph. CAD-PACS integration was assessed according to its levels: data, function, presentation, and context integration focusing on usability aspects. A user-based study design was selected. Statements of seven experienced radiologists using two alternative types of integration provided by BoneXpert were acquired and analyzed using a mixed-methods approach based on think-aloud records and a questionnaire. In both variants, the CAD module (BoneXpert) was easily integrated in the workflow, found comprehensible and fitting in the conceptual framework of the radiologists. Weak points of the software integration referred to data and context integration. Surprisingly, visualization of intermediate image processing states (presentation integration) was found less important as compared to efficient handling and fast computation. Seamlessly integrating CAD into the PACS without additional work steps or unnecessary interrupts and without visualizing intermediate images may considerably improve software performance and user acceptance with efforts in time.
Collapse
Affiliation(s)
- Ina Geldermann
- Department of Medical Informatics, RWTH Aachen University, Pauwelsstr. 30, 52057 Aachen, Germany.
| | | | | | | | | |
Collapse
|
32
|
|
33
|
Liu KE, Lo CL, Hu YH. Improvement of adequate use of warfarin for the elderly using decision tree-based approaches. Methods Inf Med 2013; 53:47-53. [PMID: 24136011 DOI: 10.3414/me13-01-0027] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 09/16/2013] [Indexed: 11/09/2022]
Abstract
OBJECTIVES Due to the narrow therapeutic range and high drug-to-drug interactions (DDIs), improving the adequate use of warfarin for the elderly is crucial in clinical practice. This study examines whether the effectiveness of using warfarin among elderly inpatients can be improved when machine learning techniques and data from the laboratory information system are incorporated. METHODS Having employed 288 validated clinical cases in the DDI group and 89 cases in the non-DDI group, we evaluate the prediction performance of seven classification techniques, with and without an Adaptive Boosting (AdaBoost) algorithm. Measures including accuracy, sensitivity, specificity and area under the curve are used to evaluate model performance. RESULTS Decision tree-based classifiers outperform other investigated classifiers in all evaluation measures. The classifiers supplemented with AdaBoost can generally improve the performance. In addition, weight, congestive heart failure, and gender are among the top three critical variables affecting prediction accuracy for the non-DDI group, while age, ALT, and warfarin doses are the most influential factors for the DDI group. CONCLUSION Medical decision support systems incorporating decision tree-based approaches improve predicting performance and thus may serve as a supplementary tool in clinical practice. Information from laboratory tests and inpatients' history should not be ignored because related variables are shown to be decisive in our prediction models, especially when the DDIs exist.
Collapse
Affiliation(s)
| | | | - Y-H Hu
- Ya-Han Hu, Department of Information Management and Graduate Institute of Healthcare Information Management, National Chung Cheng University, 168 University Road, Min-Hsiung Chia-Yi 62102, Taiwan, E-mail:
| |
Collapse
|
34
|
Alonso F, Lara JA, Martinez L, Pérez A, Valente JP. Generating reference models for structurally complex data. Application to the stabilometry medical domain. Methods Inf Med 2013; 52:441-53. [PMID: 24008894 DOI: 10.3414/me12-01-0106] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Accepted: 04/16/2013] [Indexed: 11/09/2022]
Abstract
OBJECTIVES We present a framework specially designed to deal with structurally complex data, where all individuals have the same structure, as is the case in many medical domains. A structurally complex individual may be composed of any type of single-valued or multivalued attributes, including time series, for example. These attributes are structured according to domain-dependent hierarchies. Our aim is to generate reference models of population groups. These models represent the population archetype and are very useful for supporting such important tasks as diagnosis, detecting fraud, analyzing patient evolution, identifying control groups, etc. METHODS We have developed a conceptual model to represent structurally complex data hierarchically. Additionally, we have devised a method that uses the similarity tree concept to measure how similar two structurally complex individuals are, plus an outlier detection and filtering method. These methods provide the groundwork for the method that we have designed for generating reference models of a set of structurally complex individuals. A key idea of this method is to use event-based analysis for modeling time series. RESULTS The proposed framework has been applied to the medical field of stabilometry. To validate the outlier detection method we used 142 individuals, and there was a match between the outlier ratings by the experts and by the system for 139 individuals (97.8%). To validate the reference model generation method, we applied k-fold cross validation (k = 5) with 60 athletes (basketball players and ice-skaters), and the system correctly classified 55 (91.7%). We then added 30 non-athletes as a control group, and the method output the correct result in a very high percentage of cases (96.6%). CONCLUSIONS We have achieved very satisfactory results for the tests on data from such a complex domain as stabilometry and for the comparison of the reference model generation method with other methods. This supports the validity of this framework.
Collapse
Affiliation(s)
- F Alonso
- Fernando Alonso, Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo, 28660 Boadilla del Monte, Madrid, Spain, E-mail:
| | | | | | | | | |
Collapse
|
35
|
Fialho AS, Celi LA, Cismondi F, Vieira SM, Reti SR, Sousa JMC, Finkelstein SN. Disease-based modeling to predict fluid response in intensive care units. Methods Inf Med 2013; 52:494-502. [PMID: 23986268 DOI: 10.3414/me12-01-0093] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2012] [Accepted: 05/30/2013] [Indexed: 11/09/2022]
Abstract
OBJECTIVE To compare general and disease-based modeling for fluid resuscitation and vasopressor use in intensive care units. METHODS Retrospective cohort study involving 2944 adult medical and surgical intensive care unit (ICU) patients receiving fluid resuscitation. Within this cohort there were two disease-based groups, 802 patients with a diagnosis of pneumonia, and 143 patients with a diagnosis of pancreatitis. Fluid resuscitation either progressing to subsequent vasopressor administration or not was used as the primary outcome variable to compare general and disease-based modeling. RESULTS Patients with pancreatitis, pneumonia and the general group all shared three common predictive features as core variables, arterial base excess, lactic acid and platelets. Patients with pneumonia also had non-invasive systolic blood pressure and white blood cells added to the core model, and pancreatitis patients additionally had temperature. Disease-based models had significantly higher values of AUC (p < 0.05) than the general group (0.82 ± 0.02 for pneumonia and 0.83 ± 0.03 for pancreatitis vs. 0.79 ± 0.02 for general patients). CONCLUSIONS Disease-based predictive modeling reveals a different set of predictive variables compared to general modeling and improved performance. Our findings add support to the growing body of evidence advantaging disease specific predictive modeling.
Collapse
Affiliation(s)
- A S Fialho
- André S. Fialho, PhD, Massachusetts Institute of Technology, Engineering Systems Division, 77 Massachusetts Avenue, 02139 Cambridge, MA, USA, E-mail:
| | | | | | | | | | | | | |
Collapse
|
36
|
Franz NM, Cardona-Duque* J. Description of two new species and phylogenetic reassessment of Perelleschus O’Brien & Wibmer, 1986 (Coleoptera: Curculionidae), with a complete taxonomic concept history of Perelleschus sec. Franz & Cardona-Duque, 2013. SYST BIODIVERS 2013. [DOI: 10.1080/14772000.2013.806371] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Affiliation(s)
- Nico M. Franz
- a School of Life Sciences, PO Box 874501 , Arizona State University , Tempe , AZ , 85287-4501 , USA
| | - Juliana Cardona-Duque*
- b Grupo de Entomología , Universidad de Antioquia (GEUA) , Medellín , AA , 1226 , Colombia
| |
Collapse
|
37
|
Takabayashi K. [The cutting-edge of medicine; application of medical informatics in internal medicine]. NIHON NAIKA GAKKAI ZASSHI. THE JOURNAL OF THE JAPANESE SOCIETY OF INTERNAL MEDICINE 2012; 101:3239-3246. [PMID: 23342599 DOI: 10.2169/naika.101.3239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
|
38
|
Paterson A, Ashtari M, Ribé D, Stenbeck G, Tucker A. Intelligent data analysis to model and understand live cell time-lapse sequences. Methods Inf Med 2012; 51:332-40. [PMID: 22814575 DOI: 10.3414/me11-02-0041] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2011] [Accepted: 04/27/2012] [Indexed: 01/10/2023]
Abstract
BACKGROUND One important aspect of cellular function, which is at the basis of tissue homeostasis, is the delivery of proteins to their correct destinations. Significant advances in live cell microscopy have allowed tracking of these pathways by following the dynamics of fluorescently labelled proteins in living cells. OBJECTIVES This paper explores intelligent data analysis techniques to model the dynamic behavior of proteins in living cells as well as to classify different experimental conditions. METHODS We use a combination of decision tree classification and hidden Markov models. In particular, we introduce a novel approach to "align" hidden Markov models so that hidden states from different models can be cross-compared. RESULTS Our models capture the dynamics of two experimental conditions accurately with a stable hidden state for control data and multiple (less stable) states for the experimental data recapitulating the behaviour of particle trajectories within live cell time-lapse data. CONCLUSIONS In addition to having successfully developed an automated framework for the classification of protein transport dynamics from live cell time-lapse data our model allows us to understand the dynamics of a complex trafficking pathway in living cells in culture.
Collapse
Affiliation(s)
- Allan Paterson
- School of Information Systems Computing and Mathematics, Brunel University, West London, UK.
| | | | | | | | | |
Collapse
|
39
|
Minne L, Eslami S, de Keizer N, de Jonge E, de Rooij SE, Abu-Hanna A. Statistical process control for monitoring standardized mortality ratios of a classification tree model. Methods Inf Med 2012; 51:353-8. [PMID: 22773038 DOI: 10.3414/me11-02-0044] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Accepted: 05/04/2012] [Indexed: 11/09/2022]
Abstract
OBJECTIVES The ratio of observed to expected mortality (standardized mortality ratio, SMR), is a key indicator of quality of care. We use PreControl Charts to investigate SMR behavior over time of an existing tree-model for predicting mortality in intensive care units (ICUs) and its implications for hospital ranking. We compare the results to those of a logistic regression model. METHODS We calculated SMRs of 30 equally-sized consecutive subsets from a total of 12,143 ICU patients aged 80 years or older and plotted them on a PreControl Chart. We calculated individual hospital SMRs in 2009, with and without repeated recalibration of the models on earlier data. RESULTS The overall SMR of the tree-model was stable over time, in contrast to logistic regression. Both models were stable after repeated recalibration. The overall SMR of the tree on the whole validation set was statistically significantly different (SMR 1.00 ± 0.012 vs. 0.94 ± 0.01) and worse in performance than the logistic regression model (AUC 0.76 ± 0.005 vs. 0.79 ± 0.004; Brier score 0.17 ± 0.012 vs. 0.16 ± 0.010). The individual SMRs' range in 2009 was 0.53-1.31 for the tree and 0.64-1.27 for logistic regression. The proportion of individual hospitals with SMR >1, hinting at poor quality of care, reduced from 38% to 29% after recalibration for the tree, and increased from 15% to 35% for logistic regression. CONCLUSIONS Although the tree-model has seemingly a longer shelf life than the logistic regression model, its SMR may be less useful for quality of care assessment as it insufficiently responds to changes in the population over time.
Collapse
Affiliation(s)
- Lilian Minne
- Academic Medical Center, Department of Medical Informatics, Amsterdam, The Netherlands.
| | | | | | | | | | | |
Collapse
|