1
|
Jin C, Li F, Ma S, Wang Y. Sampling scheme-based classification rule mining method using decision tree in big data environment. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108522] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
|
3
|
De Oliveira H, Augusto V, Jouaneton B, Lamarsalle L, Prodel M, Xie X. Automatic and Explainable Labeling of Medical Event Logs With Autoencoding. IEEE J Biomed Health Inform 2020; 24:3076-3084. [PMID: 32886615 DOI: 10.1109/jbhi.2020.3021790] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Process mining is a suitable method for knowledge extraction from patient pathways. Structured in event logs, medical events are complex, often described using various medical codes. An efficient labeling of these events before applying process mining analysis is challenging. This paper presents an innovative methodology to handle the complexity of events in medical event logs. Based on autoencoding, accurate labels are created by clustering similar events in latent space. Moreover, the explanation of created labels is provided by the decoding of its corresponding events. Tested on synthetic events, the method is able to find hidden clusters on sparse binary data, as well as accurately explain created labels. A case study on real healthcare data is performed. Results confirm the suitability of the method to extract knowledge from complex event logs representing patient pathways.
Collapse
|
4
|
Jacques J, Martin-Huyghe H, Lemtiri-Florek J, Taillard J, Jourdan L, Dhaenens C, Delerue D, Hansske A, Leclercq V. The detection of hospitalized patients at risk of testing positive to multi-drug resistant bacteria using MOCA-I, a rule-based "white-box" classification algorithm for medical data. Int J Med Inform 2020; 142:104242. [PMID: 32853975 DOI: 10.1016/j.ijmedinf.2020.104242] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 07/19/2020] [Accepted: 07/25/2020] [Indexed: 10/23/2022]
Abstract
BACKGROUND Multi-drug resistant (MDR) bacteria are a major health concern. In this retrospective study, a rule-based classification algorithm, MOCA-I (Multi-Objective Classification Algorithm for Imbalanced data) is used to identify hospitalized patients at risk of testing positive for multidrug-resistant (MDR) bacteria, including Methicillin-resistant Staphylococcus aureus (MRSA), before or during their stay. METHODS Applied to a data set of 48,945 hospital stays (including known cases of carriage) with up to 16,325 attributes per stay, MOCA-I generated alert rules for risk of carriage or infection. A risk score was then computed from each stay according to the triggered rules.Recall and precision curves were plotted. RESULTS The classification can be focused on specifically detecting high risk of having a positive test, or identifying large numbers of at-risk patients by modulating the risk score cut-off level. For a risk score above 0.85,recall (sensitivity) is 62 % with 69 % precision (confidence) for MDR bacteria, recall is 58 % with 88 % precision for MRSA. In addition, MOCA-I identifies 38 and 21 cases of previously unknown MDR and MRSA respectively. CONCLUSIONS MOCA-I generates medically pertinent alert rules. This classification algorithm can be used to detect patients with high risk of testing positive to MDR bacteria (including MRSA). Classification can be modulated by appropriately setting the risk score cut-off level to favor specific detection of small numbers of patients at very high risk or identification of large numbers of patients at risk. MOCA-I can thus contribute to more adapted treatments and preventive measures from admission, depending on the clinical setting or management strategy.
Collapse
Affiliation(s)
- Julie Jacques
- Lille Catholic University, Faculté de Gestion, Economie et Sciences, France; Univ. Lille, CNRS, Centrale Lille, UMR 9189, CRIStAL, Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France.
| | - Hélène Martin-Huyghe
- Lille Catholic Hospitals, Infection Control Department, Lille Catholic University, KASHMIR, Lille, France; CH Arras, Pharmacy Department, Arras, France
| | - Justine Lemtiri-Florek
- Lille Catholic Hospitals, Infection Control Department, Lille Catholic University, KASHMIR, Lille, France; CH Valenciennes, Intensive Care Department, F-59322 Valenciennes, France
| | | | - Laetitia Jourdan
- Univ. Lille, CNRS, Centrale Lille, UMR 9189, CRIStAL, Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France
| | - Clarisse Dhaenens
- Univ. Lille, CNRS, Centrale Lille, UMR 9189, CRIStAL, Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France
| | | | - Arnaud Hansske
- Lille Catholic Hospitals, IT System Department, Lille Catholic University, KASHMIR, Lille, France
| | - Valérie Leclercq
- Lille Catholic Hospitals, Infection Control Department, Lille Catholic University, KASHMIR, Lille, France
| |
Collapse
|
5
|
Hoffman RA, Venugopalan J, Qu L, Wu H, Wang MD. Improving Validity of Cause of Death on Death Certificates. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2018; 2018:178-183. [PMID: 32558825 PMCID: PMC7302107 DOI: 10.1145/3233547.3233581] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Accurate reporting of causes of death on death certificates is essential to formulate appropriate disease control, prevention and emergency response by national health-protection institutions such as Center for disease prevention and control (CDC). In this study, we utilize knowledge from publicly available expert-formulated rules for the cause of death to determine the extent of discordance in the death certificates in national mortality data with the expert knowledge base. We also report the most commonly occurring invalid causal pairs which physicians put in the death certificates. We use sequence rule mining to find patterns that are most frequent on death certificates and compare them with the rules from the expert knowledge based. Based on our results, 20.1% of the common patterns derived from entries into death certificates were discordant. The most probable causes of these discordance or invalid rules are missing steps and non-specific ICD-10 codes on the death certificates.
Collapse
Affiliation(s)
- Ryan A Hoffman
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | - Janani Venugopalan
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | - Li Qu
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | - Hang Wu
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | - May D Wang
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| |
Collapse
|
6
|
Dhaenens C, Jacques J, Vandewalle V, Vandromme M, Chazard E, Preda C, Amarioarei A, Chaiwuttisak P, Cozma C, Ficheur G, Kessaci ME, Perichon R, Taillard J, Bordet R, Lansiaux A, Jourdan L, Delerue D, Hansske A. ClinMine: Optimizing the Management of Patients in Hospital. Ing Rech Biomed 2018. [DOI: 10.1016/j.irbm.2017.12.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|