1
|
Lillo-Castellano JM, Mora-Jiménez I, Martín-Méndez M, Cerdá L, García-Alberola A, Rojo-Álvarez JL, Tuia D. Active learning and margin strategies for arrhythmia classification in implantable devices. Comput Biol Med 2025; 188:109747. [PMID: 39951979 DOI: 10.1016/j.compbiomed.2025.109747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Revised: 10/26/2024] [Accepted: 01/22/2025] [Indexed: 02/17/2025]
Abstract
BACKGROUND AND OBJECTIVES The massive storage of cardiac arrhythmic episodes from Implantable Cardioverter Defibrillators (ICD) and the advent of new artificial intelligence algorithms are opening up new opportunities for electrophysiological knowledge extraction. However, in this context, accurate and reliable episode labeling by expert cardiologists still remains a manual, costly, and time-consuming process. METHODS In this work, we propose using Active Learning (AL) to design classification models that streamline the manual labeling of cardiac arrhythmic episodes. When AL is used, relevant episodes for classification are selected and then presented to the human expert for labeling, thereby dramatically reducing the manual labeling burden. RESULTS We adapted four large-margin-based AL strategies to a previously proposed classification methodology. We benchmarked them on problems involving 3 and 8 arrhythmia types using 9908 episodes from a massive national ICD data repository. Specifically, the relevance of episode-patient diversity for classification was evaluated. Results showed that the gold standard performance, achieved using all episodes, was reached by using approximately 20% (50%) of episodes from 60% (85%) of patients in the 3-class (8-class) model design. CONCLUSIONS We can conclude that AL techniques are advantageous for designing classification models and can streamline the human labeling process of large ICD datasets.
Collapse
Affiliation(s)
- José-María Lillo-Castellano
- Universidad Rey Juan Carlos, Department of Signal Theory and Communications, Telematics and Computing Systems, Camino del Molino, 5. 28942, Fuenlabrada, Madrid, Spain
| | - Inmaculada Mora-Jiménez
- Universidad Rey Juan Carlos, Department of Signal Theory and Communications, Telematics and Computing Systems, Camino del Molino, 5. 28942, Fuenlabrada, Madrid, Spain
| | - María Martín-Méndez
- Medtronic Ibérica ®S.A, Dep. Cardiac Rhythm and Heart Failure, C/ María de Portugal 9, 28050 Madrid, Spain
| | - Laia Cerdá
- Medtronic Ibérica ®S.A, Dep. Cardiac Rhythm and Heart Failure, C/ María de Portugal 9, 28050 Madrid, Spain
| | - Arcadi García-Alberola
- Hospital CU Virgen de la Arrixaca. Arrhythmia Unit, Ctra. Madrid-Cartagena, s/n. 30120-El Palmar Murcia, Spain
| | - José Luis Rojo-Álvarez
- Universidad Rey Juan Carlos, Department of Signal Theory and Communications, Telematics and Computing Systems, Camino del Molino, 5. 28942, Fuenlabrada, Madrid, Spain; D!lemma Ltd startup, Camino del Molino, 5. 28942-Fuenlabrada Madrid, Spain.
| | - Devis Tuia
- Ecole Polytechnique Fédérale de Lausanne, Environmental Computational Science and Earth Observation Laboratory, 1950-Sion, Switzerland
| |
Collapse
|
2
|
Yuan H, Zhu M, Yang R, Liu H, Li I, Hong C. Rethinking Domain-Specific Pretraining by Supervised or Self-Supervised Learning for Chest Radiograph Classification: A Comparative Study Against ImageNet Counterparts in Cold-Start Active Learning. HEALTH CARE SCIENCE 2025; 4:110-143. [PMID: 40241982 PMCID: PMC11997468 DOI: 10.1002/hcs2.70009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2024] [Revised: 01/05/2025] [Accepted: 01/26/2025] [Indexed: 04/18/2025]
Abstract
Objective Deep learning (DL) has become the prevailing method in chest radiograph analysis, yet its performance heavily depends on large quantities of annotated images. To mitigate the cost, cold-start active learning (AL), comprising an initialization followed by subsequent learning, selects a small subset of informative data points for labeling. Recent advancements in pretrained models by supervised or self-supervised learning tailored to chest radiograph have shown broad applicability to diverse downstream tasks. However, their potential in cold-start AL remains unexplored. Methods To validate the efficacy of domain-specific pretraining, we compared two foundation models: supervised TXRV and self-supervised REMEDIS with their general domain counterparts pretrained on ImageNet. Model performance was evaluated at both initialization and subsequent learning stages on two diagnostic tasks: psychiatric pneumonia and COVID-19. For initialization, we assessed their integration with three strategies: diversity, uncertainty, and hybrid sampling. For subsequent learning, we focused on uncertainty sampling powered by different pretrained models. We also conducted statistical tests to compare the foundation models with ImageNet counterparts, investigate the relationship between initialization and subsequent learning, examine the performance of one-shot initialization against the full AL process, and investigate the influence of class balance in initialization samples on initialization and subsequent learning. Results First, domain-specific foundation models failed to outperform ImageNet counterparts in six out of eight experiments on informative sample selection. Both domain-specific and general pretrained models were unable to generate representations that could substitute for the original images as model inputs in seven of the eight scenarios. However, pretrained model-based initialization surpassed random sampling, the default approach in cold-start AL. Second, initialization performance was positively correlated with subsequent learning performance, highlighting the importance of initialization strategies. Third, one-shot initialization performed comparably to the full AL process, demonstrating the potential of reducing experts' repeated waiting during AL iterations. Last, a U-shaped correlation was observed between the class balance of initialization samples and model performance, suggesting that the class balance is more strongly associated with performance at middle budget levels than at low or high budgets. Conclusions In this study, we highlighted the limitations of medical pretraining compared to general pretraining in the context of cold-start AL. We also identified promising outcomes related to cold-start AL, including initialization based on pretrained models, the positive influence of initialization on subsequent learning, the potential for one-shot initialization, and the influence of class balance on middle-budget AL. Researchers are encouraged to improve medical pretraining for versatile DL foundations and explore novel AL methods.
Collapse
Affiliation(s)
- Han Yuan
- Duke‐NUS Medical School, Centre for Quantitative MedicineSingaporeSingapore
| | - Mingcheng Zhu
- Duke‐NUS Medical School, Centre for Quantitative MedicineSingaporeSingapore
- Department of Engineering ScienceUniversity of OxfordOxfordUK
| | - Rui Yang
- Duke‐NUS Medical School, Centre for Quantitative MedicineSingaporeSingapore
| | - Han Liu
- Department of Computer ScienceVanderbilt UniversityNashvilleTennesseeUSA
| | - Irene Li
- Information Technology CenterUniversity of TokyoBunkyo‐kuJapan
| | - Chuan Hong
- Department of Biostatistics and BioinformaticsDuke UniversityDurhamNorth CarolinaUSA
| |
Collapse
|
3
|
Xu H, Zhao Y, Zhang Y, Han J, Zan P, He S, Bo X. Deep active learning with high structural discriminability for molecular mutagenicity prediction. Commun Biol 2024; 7:1071. [PMID: 39217273 PMCID: PMC11366013 DOI: 10.1038/s42003-024-06758-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 08/21/2024] [Indexed: 09/04/2024] Open
Abstract
The assessment of mutagenicity is essential in drug discovery, as it may lead to cancer and germ cells damage. Although in silico methods have been proposed for mutagenicity prediction, their performance is hindered by the scarcity of labeled molecules. However, experimental mutagenicity testing can be time-consuming and costly. One solution to reduce the annotation cost is active learning, where the algorithm actively selects the most valuable molecules from a vast chemical space and presents them to the oracle (e.g., a human expert) for annotation, thereby rapidly improving the model's predictive performance with a smaller annotation cost. In this paper, we propose muTOX-AL, a deep active learning framework, which can actively explore the chemical space and identify the most valuable molecules, resulting in competitive performance with a small number of labeled samples. The experimental results show that, compared to the random sampling strategy, muTOX-AL can reduce the number of training molecules by about 57%. Additionally, muTOX-AL exhibits outstanding molecular structural discriminability, allowing it to pick molecules with high structural similarity but opposite properties.
Collapse
Affiliation(s)
- Huiyan Xu
- Shanghai Key Laboratory of Power Station Automation Technology, School of Mechatronics Engineering and Automation, Shanghai University, Shanghai, China
- Academy of Military Medical Sciences, Beijing, China
| | - Yanpeng Zhao
- Academy of Military Medical Sciences, Beijing, China
| | - Yixin Zhang
- Academy of Military Medical Sciences, Beijing, China
| | - Junshan Han
- Academy of Military Medical Sciences, Beijing, China
| | - Peng Zan
- Shanghai Key Laboratory of Power Station Automation Technology, School of Mechatronics Engineering and Automation, Shanghai University, Shanghai, China.
| | - Song He
- Academy of Military Medical Sciences, Beijing, China.
| | - Xiaochen Bo
- Academy of Military Medical Sciences, Beijing, China.
| |
Collapse
|
4
|
de Oliveira ER, Bugatti PH, Saito PTM. Assessment of clustering techniques to support the analyses of soybean seed vigor. PLoS One 2023; 18:e0285566. [PMID: 37624819 PMCID: PMC10456222 DOI: 10.1371/journal.pone.0285566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 04/26/2023] [Indexed: 08/27/2023] Open
Abstract
Soy is the main product of Brazilian agriculture and the fourth most cultivated bean globally. Since soy cultivation tends to increase and due to this large market, the guarantee of product quality is an indispensable factor for enterprises to stay competitive. Industries perform vigor tests to acquire information and evaluate the quality of soy planting. The tetrazolium test, for example, provides information about moisture damage, bedbugs, or mechanical damage. However, the verification of the damage reason and its severity are done by an analyst, one by one. Since this is massive and exhausting work, it is susceptible to mistakes. Proposals involving different supervised learning approaches, including active learning strategies, have already been used, and have brought significant results. Therefore, this paper analyzes the performance of non-supervised techniques for classifying soybeans. An extensive experimental evaluation was performed, considering (9) different clustering algorithms (partitional, hierarchical, and density-based) applied to 5 image datasets of soybean seeds submitted to the tetrazolium test, including different damages and/or their levels. To describe those images, we considered 18 extractors of traditional features. We also considered four metrics (accuracy, FOWLKES, DAVIES, and CALINSKI) and two-dimensionality reduction techniques (principal component analysis and t-distributed stochastic neighbor embedding) for validation. Results show that this paper presents essential contributions since it makes it possible to identify descriptors and clustering algorithms that shall be used as preprocessing in other learning processes, accelerating and improving the classification process of key agricultural problems.
Collapse
Affiliation(s)
- Eduardo R. de Oliveira
- Department of Computing, Federal University of Technology - Parana, Cornelio Procopio, PR, Brazil
| | - Pedro H. Bugatti
- Department of Computing, Federal University of Technology - Parana, Cornelio Procopio, PR, Brazil
- Department of Computing, Federal University of Sao Carlos, Sao Carlos, SP, Brazil
| | - Priscila T. M. Saito
- Department of Computing, Federal University of Technology - Parana, Cornelio Procopio, PR, Brazil
- Department of Computing, Federal University of Sao Carlos, Sao Carlos, SP, Brazil
| |
Collapse
|
5
|
Tang S, Yu X, Cheang CF, Liang Y, Zhao P, Yu HH, Choi IC. Transformer-based multi-task learning for classification and segmentation of gastrointestinal tract endoscopic images. Comput Biol Med 2023; 157:106723. [PMID: 36907035 DOI: 10.1016/j.compbiomed.2023.106723] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 02/04/2023] [Accepted: 02/26/2023] [Indexed: 03/07/2023]
Abstract
Despite being widely utilized to help endoscopists identify gastrointestinal (GI) tract diseases using classification and segmentation, models based on convolutional neural network (CNN) have difficulties in distinguishing the similarities among some ambiguous types of lesions presented in endoscopic images, and in the training when lacking labeled datasets. Those will prevent CNN from further improving the accuracy of diagnosis. To address these challenges, we first proposed a Multi-task Network (TransMT-Net) capable of simultaneously learning two tasks (classification and segmentation), which has the transformer designed to learn global features and can combine the advantages of CNN in learning local features so that to achieve a more accurate prediction in identifying the lesion types and regions in GI tract endoscopic images. We further adopted the active learning in TransMT-Net to tackle the labeled image-hungry problem. A dataset was created from the CVC-ClinicDB dataset, Macau Kiang Wu Hospital, and Zhongshan Hospital to evaluate the model performance. Then, the experimental results show that our model not only achieved 96.94% accuracy in the classification task and 77.76% Dice Similarity Coefficient in the segmentation task but also outperformed those of other models on our test set. Meanwhile, active learning also produced positive results for the performance of our model with a small-scale initial training set, and even its performance with 30% of the initial training set was comparable to that of most comparable models with the full training set. Consequently, the proposed TransMT-Net has demonstrated its potential performance in GI tract endoscopic images and it through active learning can alleviate the shortage of labeled images.
Collapse
Affiliation(s)
- Suigu Tang
- Faculty of Innovation Engineering-School of Computer Science and Engineering, Macau University of Science and Technology, Macao Special Administrative Region of China
| | - Xiaoyuan Yu
- Faculty of Innovation Engineering-School of Computer Science and Engineering, Macau University of Science and Technology, Macao Special Administrative Region of China
| | - Chak Fong Cheang
- Faculty of Innovation Engineering-School of Computer Science and Engineering, Macau University of Science and Technology, Macao Special Administrative Region of China.
| | - Yanyan Liang
- Faculty of Innovation Engineering-School of Computer Science and Engineering, Macau University of Science and Technology, Macao Special Administrative Region of China
| | - Penghui Zhao
- Faculty of Innovation Engineering-School of Computer Science and Engineering, Macau University of Science and Technology, Macao Special Administrative Region of China
| | - Hon Ho Yu
- Kiang Wu Hospital, Macao Special Administrative Region of China
| | - I Cheong Choi
- Kiang Wu Hospital, Macao Special Administrative Region of China
| |
Collapse
|
6
|
Xie W, Wang L, Cheng Q, Wang X, Wang Y, Bi H, He B, Feng W. Integrated Random Negative Sampling and Uncertainty Sampling in Active Learning Improve Clinical Drug Safety Drug-Drug Interaction Information Retrieval. Front Pharmacol 2021; 11:582470. [PMID: 34017245 PMCID: PMC8130007 DOI: 10.3389/fphar.2020.582470] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 11/30/2020] [Indexed: 11/13/2022] Open
Abstract
Clinical drug-drug interactions (DDIs) have been a major cause for not only medical error but also adverse drug events (ADEs). The published literature on DDI clinical toxicity continues to grow significantly, and high-performance DDI information retrieval (IR) text mining methods are in high demand. The effectiveness of IR and its machine learning (ML) algorithm depends on the availability of a large amount of training and validation data that have been manually reviewed and annotated. In this study, we investigated how active learning (AL) might improve ML performance in clinical safety DDI IR analysis. We recognized that a direct application of AL would not address several primary challenges in DDI IR from the literature. For instance, the vast majority of abstracts in PubMed will be negative, existing positive and negative labeled samples do not represent the general sample distributions, and potentially biased samples may arise during uncertainty sampling in an AL algorithm. Therefore, we developed several novel sampling and ML schemes to improve AL performance in DDI IR analysis. In particular, random negative sampling was added as a part of AL since it has no expanse in the manual data label. We also used two ML algorithms in an AL process to differentiate random negative samples from manually labeled negative samples, and updated both the training and validation samples during the AL process to avoid or reduce biased sampling. Two supervised ML algorithms, support vector machine (SVM) and logistic regression (LR), were used to investigate the consistency of our proposed AL algorithm. Because the ultimate goal of clinical safety DDI IR is to retrieve all DDI toxicity-relevant abstracts, a recall rate of 0.99 was set in developing the AL methods. When we used our newly proposed AL method with SVM, the precision in differentiating the positive samples from manually labeled negative samples improved from 0.45 in the first round to 0.83 in the second round, and the precision in differentiating the positive samples from random negative samples improved from 0.70 to 0.82 in the first and second rounds, respectively. When our proposed AL method was used with LR, the improvements in precision followed a similar trend. However, the other AL algorithms tested did not show improved precision largely because of biased samples caused by the uncertainty sampling or differences between training and validation data sets.
Collapse
Affiliation(s)
- Weixin Xie
- Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, China
| | - Limei Wang
- Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, China.,Key Laboratory of Tropical Translational Medicine of Ministry of Education, College of Biomedical Information and Engineering, Hainan Medical University, Haikou, China
| | - Qi Cheng
- Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, China
| | - Xueying Wang
- Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, China
| | - Ying Wang
- Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, China
| | - Hongyuan Bi
- The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Bo He
- Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, China
| | - Weixing Feng
- Institute of Intelligent System and Bioinformatics, College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, China
| |
Collapse
|
7
|
Kiyasseh D, Zhu T, Clifton D. The Promise of Clinical Decision Support Systems Targetting Low-Resource Settings. IEEE Rev Biomed Eng 2020; 15:354-371. [PMID: 32813662 DOI: 10.1109/rbme.2020.3017868] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Low-resource clinical settings are plagued by low physician-to-patient ratios and a shortage of high-quality medical expertise and infrastructure. Together, these phenomena lead to over-burdened healthcare systems that under-serve the needs of the community. Alleviating this burden can be undertaken by the introduction of clinical decision support systems (CDSSs); systems that support stakeholders (ranging from physicians to patients) within the clinical setting in their day-to-day activities. Such systems, which have proven to be effective in the developed world, remain to be under-explored in low-resource settings. This review attempts to summarize the research focused on clinical decision support systems that either target stakeholders within low-resource clinical settings or diseases commonly found in such environments. When categorizing our findings according to disease applications, we find that CDSSs are predominantly focused on dealing with bacterial infections and maternal care, do not leverage deep learning, and have not been evaluated prospectively. Together, these highlight the need for increased research in this domain in order to impact a diverse set of medical conditions and ultimately improve patient outcomes.
Collapse
|
8
|
Pereira T, Tran N, Gadhoumi K, Pelter MM, Do DH, Lee RJ, Colorado R, Meisel K, Hu X. Photoplethysmography based atrial fibrillation detection: a review. NPJ Digit Med 2020; 3:3. [PMID: 31934647 PMCID: PMC6954115 DOI: 10.1038/s41746-019-0207-9] [Citation(s) in RCA: 139] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 11/22/2019] [Indexed: 01/04/2023] Open
Abstract
Atrial fibrillation (AF) is a cardiac rhythm disorder associated with increased morbidity and mortality. It is the leading risk factor for cardioembolic stroke and its early detection is crucial in both primary and secondary stroke prevention. Continuous monitoring of cardiac rhythm is today possible thanks to consumer-grade wearable devices, enabling transformative diagnostic and patient management tools. Such monitoring is possible using low-cost easy-to-implement optical sensors that today equip the majority of wearables. These sensors record blood volume variations-a technology known as photoplethysmography (PPG)-from which the heart rate and other physiological parameters can be extracted to inform about user activity, fitness, sleep, and health. Recently, new wearable devices were introduced as being capable of AF detection, evidenced by large prospective trials in some cases. Such devices would allow for early screening of AF and initiation of therapy to prevent stroke. This review is a summary of a body of work on AF detection using PPG. A thorough account of the signal processing, machine learning, and deep learning approaches used in these studies is presented, followed by a discussion of their limitations and challenges towards clinical applications.
Collapse
Affiliation(s)
- Tania Pereira
- Department of Physiological Nursing, University of California, San Francisco, CA USA
| | - Nate Tran
- Department of Physiological Nursing, University of California, San Francisco, CA USA
| | - Kais Gadhoumi
- Department of Physiological Nursing, University of California, San Francisco, CA USA
| | - Michele M. Pelter
- Department of Physiological Nursing, University of California, San Francisco, CA USA
| | - Duc H. Do
- David Geffen School of Medicine, University of California, Los Angeles, CA USA
| | - Randall J. Lee
- Cardiovascular Research Institute, Department of Medicine, Institute for Regeneration Medicine, University of California, San Francisco, CA USA
| | - Rene Colorado
- Department of Neurology, School of Medicine, University of California, San Francisco, CA USA
| | - Karl Meisel
- Department of Neurology, School of Medicine, University of California, San Francisco, CA USA
| | - Xiao Hu
- Department of Physiological Nursing, University of California, San Francisco, CA USA
- Department of Neurosurgery, School of Medicine, University of California, Los Angeles, CA USA
- Department of Neurological Surgery, University of California, San Francisco, CA USA
- Institute of Computational Health Sciences, University of California, San Francisco, CA USA
| |
Collapse
|