1
|
Li G, Yu Z, Yang K, Chen CLP, Li X. Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2025; 47:1103-1119. [PMID: 39446542 DOI: 10.1109/tpami.2024.3486319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/26/2024]
Abstract
Graph-based methods have demonstrated exceptional performance in semi-supervised classification. However, existing graph-based methods typically construct either a predefined graph in the original space or an adaptive graph within the output space, which often limits their ability to fully utilize prior information and capture the optimal intrinsic data distribution, particularly in high-dimensional data with abundant redundant and noisy features. This paper introduces a novel approach: Semi-Supervised Classification with Optimized Graph Construction (SSC-OGC). SSC-OGC leverages both predefined and adaptive graphs to explore intrinsic data distribution and effectively employ prior information. Additionally, a graph constraint regularization term (GCR) and a collaborative constraint regularization term (CCR) are incorporated to further enhance the quality of the adaptive graph structure and the learned subspace, respectively. To eliminate the negative effect of constructing a predefined graph in the original data space, we further propose a Hybrid Subspace Ensemble-enhanced framework based on the proposed Optimized Graph Construction method (HSE-OGC). Specifically, we construct multiple hybrid subspaces, which consist of meticulously chosen features from the original data to achieve high-quality and diverse space representations. Then, HSE-OGC constructs multiple predefined graphs within hybrid subspaces and trains multiple SSC-OGC classifiers to complement each other, significantly improving the overall performance. Experimental results conducted on various high-dimensional datasets demonstrate that HSE-OGC exhibits outstanding performance.
Collapse
|
2
|
Hector I, Panjanathan R. Predictive maintenance in Industry 4.0: a survey of planning models and machine learning techniques. PeerJ Comput Sci 2024; 10:e2016. [PMID: 38855197 PMCID: PMC11157603 DOI: 10.7717/peerj-cs.2016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 04/02/2024] [Indexed: 06/11/2024]
Abstract
Equipment downtime resulting from maintenance in various sectors around the globe has become a major concern. The effectiveness of conventional reactive maintenance methods in addressing interruptions and enhancing operational efficiency has become inadequate. Therefore, acknowledging the constraints associated with reactive maintenance and the growing need for proactive approaches to proactively detect possible breakdowns is necessary. The need for optimisation of asset management and reduction of costly downtime emerges from the demand for industries. The work highlights the use of Internet of Things (IoT)-enabled Predictive Maintenance (PdM) as a revolutionary strategy across many sectors. This article presents a picture of a future in which the use of IoT technology and sophisticated analytics will enable the prediction and proactive mitigation of probable equipment failures. This literature study has great importance as it thoroughly explores the complex steps and techniques necessary for the development and implementation of efficient PdM solutions. The study offers useful insights into the optimisation of maintenance methods and the enhancement of operational efficiency by analysing current information and approaches. The article outlines essential stages in the application of PdM, encompassing underlying design factors, data preparation, feature selection, and decision modelling. Additionally, the study discusses a range of ML models and methodologies for monitoring conditions. In order to enhance maintenance plans, it is necessary to prioritise ongoing study and improvement in the field of PdM. The potential for boosting PdM skills and guaranteeing the competitiveness of companies in the global economy is significant through the incorporation of IoT, Artificial Intelligence (AI), and advanced analytics.
Collapse
Affiliation(s)
- Ida Hector
- School of Computer Science and Engineering, Vellore Institute of Technology Chennai, Chennai, Tamil Nadu, India
| | - Rukmani Panjanathan
- School of Computer Science and Engineering, Vellore Institute of Technology Chennai, Chennai, Tamil Nadu, India
| |
Collapse
|
3
|
Niemantsverdriet MSA, de Hond TAP, Hoefer IE, van Solinge WW, Bellomo D, Oosterheert JJ, Kaasjager KAH, Haitjema S. A machine learning approach using endpoint adjudication committee labels for the identification of sepsis predictors at the emergency department. BMC Emerg Med 2022; 22:208. [PMID: 36550392 PMCID: PMC9784058 DOI: 10.1186/s12873-022-00764-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 12/06/2022] [Indexed: 12/24/2022] Open
Abstract
Accurate sepsis diagnosis is paramount for treatment decisions, especially at the emergency department (ED). To improve diagnosis, clinical decision support (CDS) tools are being developed with machine learning (ML) algorithms, using a wide range of variable groups. ML models can find patterns in Electronic Health Record (EHR) data that are unseen by the human eye. A prerequisite for a good model is the use of high-quality labels. Sepsis gold-standard labels are hard to define due to a lack of reliable diagnostic tools for sepsis at the ED. Therefore, standard clinical tools, such as clinical prediction scores (e.g. modified early warning score and quick sequential organ failure assessment), and claims-based methods (e.g. ICD-10) are used to generate suboptimal labels. As a consequence, models trained with these "silver" labels result in ill-trained models. In this study, we trained ML models for sepsis diagnosis at the ED with labels of 375 ED visits assigned by an endpoint adjudication committee (EAC) that consisted of 18 independent experts. Our objective was to evaluate which routinely measured variables show diagnostic value for sepsis. We performed univariate testing and trained multiple ML models with 95 routinely measured variables of three variable groups; demographic and vital, laboratory and advanced haematological variables. Apart from known diagnostic variables, we identified added diagnostic value for less conventional variables such as eosinophil count and platelet distribution width. In this explorative study, we show that the use of an EAC together with ML can identify new targets for future sepsis diagnosis research.
Collapse
Affiliation(s)
- Michael S. A. Niemantsverdriet
- grid.7692.a0000000090126352Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht University, Room Number G03.551, Heidelberglaan 100, 3584 CX Utrecht, The Netherlands ,SkylineDx, Rotterdam, The Netherlands
| | - Titus A. P. de Hond
- grid.7692.a0000000090126352Department of Internal Medicine and Acute Medicine, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Imo E. Hoefer
- grid.7692.a0000000090126352Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht University, Room Number G03.551, Heidelberglaan 100, 3584 CX Utrecht, The Netherlands
| | - Wouter W. van Solinge
- grid.7692.a0000000090126352Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht University, Room Number G03.551, Heidelberglaan 100, 3584 CX Utrecht, The Netherlands
| | | | - Jan Jelrik Oosterheert
- grid.7692.a0000000090126352Department of Internal Medicine, Infectious Diseases, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Karin A. H. Kaasjager
- grid.7692.a0000000090126352Department of Internal Medicine and Acute Medicine, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Saskia Haitjema
- grid.7692.a0000000090126352Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht University, Room Number G03.551, Heidelberglaan 100, 3584 CX Utrecht, The Netherlands
| |
Collapse
|
4
|
Yue X, Liu S, Qian Q, Miao D, Gao C. Semi-supervised shadowed sets for three-way classification on partial labeled data. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.06.062] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
5
|
de Vries S, Ten Doesschate T, Totté JEE, Heutz JW, Loeffen YGT, Oosterheert JJ, Thierens D, Boel E. A semi-supervised decision support system to facilitate antibiotic stewardship for urinary tract infections. Comput Biol Med 2022; 146:105621. [PMID: 35617725 DOI: 10.1016/j.compbiomed.2022.105621] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 02/18/2022] [Accepted: 03/19/2022] [Indexed: 11/15/2022]
Abstract
Urinary Tract Infections (UTIs) are among the most frequently occurring infections in the hospital. Urinalysis and urine culture are the main tools used for diagnosis. Whereas urinalysis is sufficiently sensitive for detecting UTI, it has a relatively low specificity, leading to unnecessary treatment with antibiotics and the risk of increasing antibiotic resistance. We performed an evaluation of the current diagnostic process with an expert-based label for UTI as outcome, retrospectively established using data from the Electronic Health Records. We found that the combination of urinalysis results with the Gram stain and other readily available parameters can be used effectively for predicting UTI. Based on the obtained information, we engineered a clinical decision support system (CDSS) using the reliable semi-supervised ensemble learning (RESSEL) method, and found it to be more accurate than urinalysis or the urine culture for prediction of UTI. The CDSS provides clinicians with this prediction within hours of ordering a culture and thereby enables them to hold off on prematurely prescribing antibiotics for UTI while awaiting the culture results.
Collapse
Affiliation(s)
- Sjoerd de Vries
- Department of Information and Computing Sciences, Utrecht University, Princetonplein 5, 3584 CC, Utrecht, the Netherlands; Department of Digital Health, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, the Netherlands.
| | - Thijs Ten Doesschate
- Department of Internal Medicine, Infectious Diseases, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, the Netherlands
| | - Joan E E Totté
- Department of Medical Microbiology, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, the Netherlands
| | - Judith W Heutz
- Department of Medical Microbiology, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, the Netherlands; Department of Rheumatology, Erasmus Medical Center, Dr. Molewaterplein 40, 3015 GD, Rotterdam, the Netherlands
| | - Yvette G T Loeffen
- Division of Pediatric Immunology and Infectious Diseases, Wilhelmina Children's Hospital Utrecht, Lundlaan 6, 3584 EA, Utrecht, the Netherlands
| | - Jan Jelrik Oosterheert
- Department of Internal Medicine, Infectious Diseases, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, the Netherlands
| | - Dirk Thierens
- Department of Information and Computing Sciences, Utrecht University, Princetonplein 5, 3584 CC, Utrecht, the Netherlands
| | - Edwin Boel
- Department of Medical Microbiology, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, the Netherlands
| |
Collapse
|