101
|
Jeon GW, Lee YS, Hahn WH, Jun YH. A Predictive Model for Perinatal Brain Injury Using Machine Learning Based on Early Birth Data. CHILDREN (BASEL, SWITZERLAND) 2024; 11:1313. [PMID: 39594888 PMCID: PMC11592682 DOI: 10.3390/children11111313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2024] [Revised: 10/25/2024] [Accepted: 10/28/2024] [Indexed: 11/28/2024]
Abstract
BACKGROUND/OBJECTIVE It is difficult to predict perinatal brain injury, and performing brain magnetic resonance imaging (MRI) based on suspected injury remains a clinical challenge. Therefore, we aimed to develop a reliable method for predicting perinatal brain injury using a machine learning model with early birth data. METHODS Neonates admitted to our institution from January 2017 to June 2024 with a gestational age of ≥36 weeks, a birth weight of ≥1800 g, admission within 6 h of birth, and who underwent brain MRI to confirm perinatal brain injury were included. Various machine learning models, including gradient boosting, were trained using early birth data to predict perinatal brain injury. Synthetic minority over-sampling and adaptive synthetic sampling (ADASYN) were applied to address class imbalance. Model performance was evaluated using accuracy, F1 score, and ROC curves. Feature importance scores and Shapley additive explanations (SHAP) values were also calculated. RESULTS Among 179 neonates, 39 had perinatal brain injury. There were significant differences between the injury and non-injury groups in mode of delivery, Apgar scores, capillary pH, lactate dehydrogenase (LDH) levels, and whether therapeutic hypothermia was performed. The gradient boosting model with the ADASYN method achieved the best performance. In terms of feature importance scores, the 1 min Apgar score was the most influential predictor. Additionally, SHAP analysis showed that LDH levels had the highest SHAP values. CONCLUSION the gradient boosting model with ADASYN oversampling effectively predicts perinatal brain injury, potentially improving early detection for predicting long-term outcomes, reducing unnecessary MRI scans, and lowering healthcare costs.
Collapse
Affiliation(s)
| | | | | | - Yong Hoon Jun
- Department of Pediatrics, Inha University Hospital, Inha University College of Medicine, Incheon 22332, Republic of Korea; (G.W.J.); (Y.S.L.); (W.-H.H.)
| |
Collapse
|
102
|
Nansen C, Savi PJ, Mantri A. Methods to optimize optical sensing of biotic plant stress - combined effects of hyperspectral imaging at night and spatial binning. PLANT METHODS 2024; 20:163. [PMID: 39468668 PMCID: PMC11520384 DOI: 10.1186/s13007-024-01292-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Accepted: 10/21/2024] [Indexed: 10/30/2024]
Abstract
In spatio-temporal plant monitoring, optical sensing (including hyperspectral imaging), is being deployed to, non-invasively, detect and diagnose plant responses to abiotic and biotic stressors. Early and accurate detection and diagnosis of stressors are key objectives. Level of radiometric repeatability of optical sensing data and ability to accurately detect and diagnose biotic stress are inversely correlated. Accordingly, it may be argued that one of the most significant frontiers and challenges regarding widespread adoption of optical sensing in plant research and crop production hinges on methods to maximize radiometric repeatability. In this study, we acquired hyperspectral optical sensing data at noon and midnight from soybean (Glycine max) and coleus wizard velvet red (Solenostemon scutellarioides) plants with/without experimentally infestation of two-spotted spider mites (Tetranychus urticae). We addressed three questions related to optimization of radiometric repeatability: (1) are reflectance-based plant responses affected by time of optical sensing? (2) if so, are plant responses to two-spotted spider mite infestations (biotic stressor) more pronounced at midnight versus at noon? (3) Is detection of biotic stress enhanced by spatial binning (smoothing) of hyperspectral imaging data? Results from this study provide insight into calculations of radiometric repeatability. Results strongly support claims that acquisition of optical sensing data to detect and characterize stress responses by plants to detect biotic stressors should be performed at night. Moreover, the combination of midnight imaging and spatial binning increased classification accuracies with 29% and 31% for soybean and coleus, respectively. Practical implications of these findings are discussed. Study results are relevant to virtually all applications of optical sensing to detect and diagnose abiotic and biotic stress responses by plants in both controlled environments and in outdoor crop production systems.
Collapse
Affiliation(s)
- Christian Nansen
- Department of Entomology and Nematology, University of California, UC Davis Briggs Hall, Room 367, Davis, CA, 95616, USA.
| | - Patrice J Savi
- Department of Entomology and Nematology, University of California, UC Davis Briggs Hall, Room 367, Davis, CA, 95616, USA
| | - Anil Mantri
- Department of Entomology and Nematology, University of California, UC Davis Briggs Hall, Room 367, Davis, CA, 95616, USA
| |
Collapse
|
103
|
Maeda-Minami A, Yoshino T, Katayama K, Horiba Y, Hikiami H, Shimada Y, Namiki T, Tahara E, Minamizawa K, Muramatsu SI, Yamaguchi R, Imoto S, Miyano S, Mima H, Uneda K, Nogami T, Fukunaga K, Watanabe K. Machine learning model for predicting the cold-heat pattern in Kampo medicine: a multicenter prospective observational study. Front Pharmacol 2024; 15:1412593. [PMID: 39525633 PMCID: PMC11543495 DOI: 10.3389/fphar.2024.1412593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Accepted: 09/30/2024] [Indexed: 11/16/2024] Open
Abstract
Objective The purpose of this study was to predict the four cold-heat patterns in patients who have the subjective symptoms of the cold-heat pattern described in the International Classification of Diseases Traditional Medicine Conditions - Module 1 by applying a machine learning algorithm. Methods Subjects were first-visit Kampo outpatients at six institutions who agreed to participate in this multicenter prospective observational study. The cold pattern model and the heat pattern model were created separately with 148 symptoms, body mass index, blood pressure (systolic and diastolic), age, and sex. Along with a single cold or heat pattern, the tangled heat/cold pattern is defined as being predicted by both cold and heat patterns, while the moderate (heat/cold) pattern is defined as being predicted by neither the cold pattern nor the heat pattern. Results We included 622 participants (mean age ±standard deviation, 54.4 ± 16.9; with female 501). The accuracy, macro-recall, precision, and F1-score of a combination of the two prediction models were 96.7%, 93.2%, 85.6%, and 88.5% respectively. The important items were compatible with the definitions of the cold-heat pattern. Conclusion We developed a prediction model on cold-heat patterns with data from patients whose subjective cold/heat-related symptoms matched the cold-heat pattern diagnosis by the physician.
Collapse
Affiliation(s)
- Ayako Maeda-Minami
- Faculty of Pharmaceutical Sciences, Tokyo University of Science, Chiba, Japan
| | - Tetsuhiro Yoshino
- Center for Kampo Medicine, Keio University School of Medicine, Shinjuku, Tokyo, Japan
| | - Kotoe Katayama
- Human Genome Center, the Institute of Medical Science, University of Tokyo, Minato, Tokyo, Japan
| | - Yuko Horiba
- Center for Kampo Medicine, Keio University School of Medicine, Shinjuku, Tokyo, Japan
| | | | | | - Takao Namiki
- Department of Japanese Oriental (Kampo) Medicine, Graduate School of Medicine, Chiba University, Chiba, Japan
| | - Eiichi Tahara
- Department of Kampo Medicine, Aizu Medical Center, Fukushima Medical University, Fukushima, Japan
| | | | - Shin-Ichi Muramatsu
- Division of Oriental Medicine, Center of Community Medicine, Jichi Medical University, Tochigi, Japan
| | - Rui Yamaguchi
- Human Genome Center, the Institute of Medical Science, University of Tokyo, Minato, Tokyo, Japan
- Division of Cancer Systems Biology, Aichi Cancer Center Research Institute, Nagoya, Aichi, Japan
| | - Seiya Imoto
- Human Genome Center, the Institute of Medical Science, University of Tokyo, Minato, Tokyo, Japan
| | - Satoru Miyano
- Human Genome Center, the Institute of Medical Science, University of Tokyo, Minato, Tokyo, Japan
- Department of Integrated Analytics, M&D Data Science Center, Tokyo Medical and Dental University, Bunkyo, Tokyo, Japan
| | - Hideki Mima
- Promoting Organization for Future Creators, Kyushu University, Fukuoka, Japan
| | - Kazushi Uneda
- Department of Kampo Medicine, Aizu Medical Center, Fukushima Medical University, Fukushima, Japan
| | - Tatsuya Nogami
- Department of Kampo Medicine, Tokai University School of Medicine, Isehara, Kanagawa, Japan
| | - Koichi Fukunaga
- Center for Kampo Medicine, Keio University School of Medicine, Shinjuku, Tokyo, Japan
- Department of Medicine, Division of Pulmonary Medicine, Keio University School of Medicine, Shinjuku, Tokyo, Japan
| | - Kenji Watanabe
- Center for Kampo Medicine, Keio University School of Medicine, Shinjuku, Tokyo, Japan
| |
Collapse
|
104
|
Suaza-Medina M, Peñabaena-Niebles R, Jubiz-Diaz M. A model for predicting academic performance on standardised tests for lagging regions based on machine learning and Shapley additive explanations. Sci Rep 2024; 14:25306. [PMID: 39455844 PMCID: PMC11511897 DOI: 10.1038/s41598-024-76596-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 10/15/2024] [Indexed: 10/28/2024] Open
Abstract
Data are becoming more important in education since they allow for the analysis and prediction of future behaviour to improve academic performance and quality at educational institutions. However, academic performance is affected by regions' conditions, such as demographic, psychographic, socioeconomic and behavioural variables, especially in lagging regions. This paper presents a methodology based on applying nine classification algorithms and Shapley values to identify the variables that influence the performance of the Colombian standardised test: the Saber 11 exam. This study is innovative because, unlike others, it applies to lagging regions and combines the use of EDM and Shapley values to predict students' academic performance and analyse the influence of each variable on academic performance. The results show that the algorithms with the best accuracy are Extreme Gradient Boosting Machine, Light Gradient Boosting Machine, and Gradient Boosting Machine. According to the Shapley values, the most influential variables are the socioeconomic level index, gender, region, location of the educational institution, and age. For Colombia, the results showed that male students from urban educational institutions over 18 years have the best academic performance. Moreover, there are differences in educational quality among the lagging regions. Students from Nariño have advantages over ones from other departments. The proposed methodology allows for generating public policies better aligned with the reality of lagging regions and achieving equity in access to education.
Collapse
Affiliation(s)
- Mario Suaza-Medina
- Department of Informatics and Computer Science, Universidad de Zaragoza, Maria de Luna 1, Zaragoza, 50018, Spain
- Department of Industrial Engineering, Universidad del Norte, Km 5 Via Puerto Colombia, Barranquilla, 081007, Atlántico, Colombia
| | - Rita Peñabaena-Niebles
- Department of Industrial Engineering, Universidad del Norte, Km 5 Via Puerto Colombia, Barranquilla, 081007, Atlántico, Colombia.
| | - Maria Jubiz-Diaz
- Department of Industrial Engineering, Universidad del Norte, Km 5 Via Puerto Colombia, Barranquilla, 081007, Atlántico, Colombia
| |
Collapse
|
105
|
Adegbenjo AO, Ngadi MO. Handling the Imbalanced Problem in Agri-Food Data Analysis. Foods 2024; 13:3300. [PMID: 39456362 PMCID: PMC11507408 DOI: 10.3390/foods13203300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Revised: 09/07/2024] [Accepted: 10/15/2024] [Indexed: 10/28/2024] Open
Abstract
Imbalanced data situations exist in most fields of endeavor. The problem has been identified as a major bottleneck in machine learning/data mining and is becoming a serious issue of concern in food processing applications. Inappropriate analysis of agricultural and food processing data was identified as limiting the robustness of predictive models built from agri-food applications. As a result of rare cases occurring infrequently, classification rules that detect small groups are scarce, so samples belonging to small classes are largely misclassified. Most existing machine learning algorithms including the K-means, decision trees, and support vector machines (SVMs) are not optimal in handling imbalanced data. Consequently, models developed from the analysis of such data are very prone to rejection and non-adoptability in real industrial and commercial settings. This paper showcases the reality of the imbalanced data problem in agri-food applications and therefore proposes some state-of-the-art artificial intelligence algorithm approaches for handling the problem using methods including data resampling, one-class learning, ensemble methods, feature selection, and deep learning techniques. This paper further evaluates existing and newer metrics that are well suited for handling imbalanced data. Rightly analyzing imbalanced data from food processing application research works will improve the accuracy of results and model developments. This will consequently enhance the acceptability and adoptability of innovations/inventions.
Collapse
Affiliation(s)
- Adeyemi O. Adegbenjo
- Department of Bioresource Engineering, McGill University, 21111 Lakeshore Road, Ste-Anne-de-Bellevue, Montreal, QC H9X 3V9, Canada
- Process Quality Engineering, School of Engineering and Technology, Conestoga College Institute of Technology and Advanced Learning, 299 Doon Valley Drive, Kitchener, ON N2G 4M4, Canada
| | - Michael O. Ngadi
- Department of Bioresource Engineering, McGill University, 21111 Lakeshore Road, Ste-Anne-de-Bellevue, Montreal, QC H9X 3V9, Canada
| |
Collapse
|
106
|
Reis LFM, Nascimento DC, Ferreira PH, Louzada F. Fixing imbalanced binary classification: An asymmetric Bayesian learning approach. PLoS One 2024; 19:e0311246. [PMID: 39413090 PMCID: PMC11482710 DOI: 10.1371/journal.pone.0311246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Accepted: 09/16/2024] [Indexed: 10/18/2024] Open
Abstract
Most statistical and machine learning models used for binary data modeling and classification assume that the data are balanced. However, this assumption can lead to poor predictive performance and bias in parameter estimation when there is an imbalance in the data due to the threshold election for the binary classification. To address this challenge, several authors suggest using asymmetric link functions in binary regression, instead of the traditional symmetric functions such as logit or probit, aiming to highlight characteristics that would help the classification task. Therefore, this study aims to introduce new classification functions based on the Lomax distribution (and its variations; including power and reverse versions). The proposed Bayesian functions have proven asymmetry and were implemented in a Stan program into the R workflow. Additionally, these functions showed promising results in real-world data applications, outperforming classical link functions in terms of metrics. For instance, in the first example, comparing the reverse power double Lomax (RPDLomax) with the logit link showed that, regardless of the data imbalance, the RPDLomax model assigns effectively lower mean posterior predictive probabilities to failure and higher probabilities to success (21.4% and 63.7%, respectively), unlike Logistic regression, which does not clearly distinguish between the mean posterior predictive probabilities for these two classes (36.0% and 39.5% for failure and success, respectively). That is, the proposed asymmetric Lomax approach is a competitive model for differentiating binary data classification in imbalanced tasks against the Logistic approach.
Collapse
Affiliation(s)
- Letícia F. M. Reis
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, São Paulo, Brazil
| | | | - Paulo H. Ferreira
- Department of Statistics, Federal University of Bahia, Salvador, Bahia, Brazil
| | - Francisco Louzada
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, São Paulo, Brazil
| |
Collapse
|
107
|
Teimouri H, Ghoreyshi ZS, Kolomeisky AB, George JT. Feature Selection Enhances Peptide Binding Predictions for TCR-Specific Interactions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.11.617901. [PMID: 39416168 PMCID: PMC11482946 DOI: 10.1101/2024.10.11.617901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
T-cell receptors (TCRs) play a critical role in the immune response by recognizing specific ligand peptides presented by major histocompatibility complex (MHC) molecules. Accurate prediction of peptide binding to TCRs is essential for advancing immunotherapy, vaccine design, and understanding mechanisms of autoimmune disorders. This study presents a novel theoretical method that explores the impact of feature selection techniques on enhancing the predictive accuracy of peptide binding models tailored for specific TCRs. To evaluate the universality of our approach across different TCR systems, we utilized a dataset that includes peptide libraries tested against three distinct murine TCRs. A broad range of physicochemical properties, including amino acid composition, dipeptide composition, and tripeptide features, were integrated into the machine learning-based feature selection framework to identify key features contributing to binding affinity. Our analysis reveals that leveraging optimized feature subsets not only simplifies the model complexity but also enhances predictive performance, enabling more precise identification of TCR-peptide interactions. The results of our feature selection method are consistent with findings from hybrid approaches that utilize both sequence and structural data as input as well as experimental data. Our theoretical approach highlights the role of feature selection in peptide-TCR interactions, providing a powerful tool for uncovering the molecular mechanisms of the T-cell response and assisting in the design of more advanced targeted therapeutics.
Collapse
Affiliation(s)
- Hamid Teimouri
- Department of Chemistry, Rice University, Houston, TX, 77005, USA
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA
| | - Zahra S Ghoreyshi
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA
- Department of Biomedical Engineering, Texas A&M University, College Station, TX, 77843, USA
| | - Anatoly B Kolomeisky
- Department of Chemistry, Rice University, Houston, TX, 77005, USA
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA
- Department of Chemical and Biomolecular Engineering, Rice University, Houston, TX, 77005, USA
| | - Jason T George
- Center for Theoretical Biological Physics, Rice University, Houston, TX, 77005, USA
- Department of Biomedical Engineering, Texas A&M University, College Station, TX, 77843, USA
- Department of Hematopoietic Biology and Malignancy, MD Anderson Cancer Center, Houston, TX, 77030, USA
| |
Collapse
|
108
|
Kim J, Youn K, Park J. Risk Factors for Musculoskeletal Disorders in Korean Farmers: Survey on Occupational Diseases in 2020 and 2022. Healthcare (Basel) 2024; 12:2026. [PMID: 39451441 PMCID: PMC11507647 DOI: 10.3390/healthcare12202026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Revised: 09/27/2024] [Accepted: 10/02/2024] [Indexed: 10/26/2024] Open
Abstract
BACKGROUND/OBJECTIVES This study investigated factors influencing the prevalence of musculoskeletal disorders (MSDs) resulting from agricultural work, utilizing the 2020 and 2022 occupational disease survey data collected by the Rural Development Administration. The combined data from these years indicated a 6.02% prevalence of MSDs, reflecting a significant class imbalance in the binary response variables. This imbalance could lead to classifiers overlooking rare events, potentially inflating accuracy assessments. METHODS We evaluated five distinct models to compare their performance using both original and synthetic data and assessing the models' performance based on synthetic data generation. In the multivariate logistic model, we focused on the main effects of the covariates as there were no statistically significant second-order interactions. RESULTS Focusing on the random over-sampling examples (ROSE) method, gender, age, and pesticide use were particularly impactful. The odds of experiencing MSDs were 1.29 times higher for females than males. The odds increased with age: 2.66 times higher for those aged 50-59, 4.60 times higher for those aged 60-69, and 7.16 times higher for those aged 70 or older, compared to those under 50. Pesticide use was associated with 1.26 times higher odds of developing MSDs. Among body part usage variables, all except wrists and knees were significant. Farmers who frequently used their necks, arms, and waist showed 1.27, 1.11, and 1.23 times higher odds of developing MSDs, respectively. CONCLUSIONS The accuracy of the raw method was high, but the ROSE method outperformed it for precision and F1 score, and both methods showed similar AUC.
Collapse
Affiliation(s)
- Jinheum Kim
- Department of Applied Statistics, University of Suwon, Hwaseong 18323, Republic of Korea;
| | - Kanwoo Youn
- Department of Occupational & Environmental Medicine, Wonjin Green Hospital, Seoul 02221, Republic of Korea;
| | - Jinwoo Park
- Department of Data Science, University of Suwon, Hwaseong 18323, Republic of Korea
| |
Collapse
|
109
|
Çetin-Kaya Y. Equilibrium Optimization-Based Ensemble CNN Framework for Breast Cancer Multiclass Classification Using Histopathological Image. Diagnostics (Basel) 2024; 14:2253. [PMID: 39410657 PMCID: PMC11475610 DOI: 10.3390/diagnostics14192253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 09/12/2024] [Accepted: 10/08/2024] [Indexed: 10/20/2024] Open
Abstract
Background: Breast cancer is one of the most lethal cancers among women. Early detection and proper treatment reduce mortality rates. Histopathological images provide detailed information for diagnosing and staging breast cancer disease. Methods: The BreakHis dataset, which includes histopathological images, is used in this study. Medical images are prone to problems such as different textural backgrounds and overlapping cell structures, unbalanced class distribution, and insufficiently labeled data. In addition to these, the limitations of deep learning models in overfitting and insufficient feature extraction make it extremely difficult to obtain a high-performance model in this dataset. In this study, 20 state-of-the-art models are trained to diagnose eight types of breast cancer using the fine-tuning method. In addition, a comprehensive experimental study was conducted to determine the most successful new model, with 20 different custom models reported. As a result, we propose a novel model called MultiHisNet. Results: The most effective new model, which included a pointwise convolution layer, residual link, channel, and spatial attention module, achieved 94.69% accuracy in multi-class breast cancer classification. An ensemble model was created with the best-performing transfer learning and custom models obtained in the study, and model weights were determined with an Equilibrium Optimizer. The proposed ensemble model achieved 96.71% accuracy in eight-class breast cancer detection. Conclusions: The results show that the proposed model will support pathologists in successfully diagnosing breast cancer.
Collapse
Affiliation(s)
- Yasemin Çetin-Kaya
- Department of Computer Engineering, Faculty of Engineering and Architecture, Tokat Gaziosmanpasa University, Tokat 60250, Turkey
| |
Collapse
|
110
|
Feng S, Wang C. When an extra rejection class meets out-of-distribution detection in long-tailed image classification. Neural Netw 2024; 178:106485. [PMID: 38959597 DOI: 10.1016/j.neunet.2024.106485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 04/13/2024] [Accepted: 06/19/2024] [Indexed: 07/05/2024]
Abstract
Detecting Out-of-Distribution (OOD) inputs is essential for reliable deep learning in the open world. However, most existing OOD detection methods have been developed based on training sets that exhibit balanced class distributions, making them susceptible when confronted with training sets following a long-tailed distribution. To alleviate this problem, we propose an effective three-branch training framework, which demonstrates the efficacy of incorporating an extra rejection class along with auxiliary outlier training data for effective OOD detection in long-tailed image classification. In our proposed framework, all outlier training samples are assigned the label of the rejection class. We employ an inlier loss, an outlier loss, and a Tail-class prototype induced Supervised Contrastive Loss (TSCL) to train both the in-distribution classifier and OOD detector within one network. During inference, the OOD detector is constructed using the rejection class. Extensive experimental results demonstrate that the superior OOD detection performance of our proposed method in long-tailed image classification. For example, in the more challenging case where CIFAR100-LT is used as in-distribution, our method improves the average AUROC by 1.23% and reduces the average FPR95 by 3.18% compared to the baseline method utilizing Outlier Exposure (OE). Code is available at github.
Collapse
Affiliation(s)
- Shuai Feng
- State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, 210023, China; Department of Computer Science and Technology, Nanjing University, Nanjing, 210023, China.
| | - Chongjun Wang
- State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, 210023, China; Department of Computer Science and Technology, Nanjing University, Nanjing, 210023, China.
| |
Collapse
|
111
|
Bae Y, Byun J, Lee H, Han B. Comparative analysis of chronic progressive nephropathy (CPN) diagnosis in rat kidneys using an artificial intelligence deep learning model. Toxicol Res 2024; 40:551-559. [PMID: 39345736 PMCID: PMC11436530 DOI: 10.1007/s43188-024-00247-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 05/04/2024] [Accepted: 05/15/2024] [Indexed: 10/01/2024] Open
Abstract
With the development of artificial intelligence (AI), technologies based on machines and deep learning are being used in many academic fields. In toxicopathology, research is actively underway to analyze whole slide image (WSI)-level images using AI deep-learning models. However, few studies have been conducted on models for diagnosing complex lesions comprising multiple lesions. Therefore, this study used deep learning segmentation models (YOLOv8, Mask R-CNN, and SOLOv2) to identify three representative lesions (tubular basophilia with atrophy, mononuclear cell infiltration, and hyaline casts) of chronic progressive nephropathy of the kidney, a complex lesion observed in a non-clinical test using rats and selected an initial model appropriate for diagnosing complex lesions by analyzing the characteristics of each algorithm. Approximately 2000 images containing three lesions were extracted using 33 WSI of rat kidneys with chronic progressive nephropathy. Among them, 1701 images were divided into first and second rounds of learning. The loss and mAP50 values were measured twice to confirm the performances of the three algorithms. Loss measurements were stopped at an appropriate epoch to prevent overfitting, and the loss value decreased in the second round based on the data learned in the first round. After measuring the accuracy twice, detection using Mask R-CNN showed the highest mAP50 in all lesions among the three models and was considered sufficient as an initial model for diagnosing complex lesions. By contrast, the YOLOv8 and SOLOv2 models showed low accuracy for all three lesions and had difficulty with segmentation tasks. Therefore, this paper proposes a Mask R-CNN as the initial model for segmenting complex lesions. Precise diagnosis is possible if the model can be trained by increasing the input data, thereby providing greater accuracy in diagnosing pathological images.
Collapse
Affiliation(s)
- Yeji Bae
- Department of Pharmaceutical Engineering, Life Health College, Hoseo University, Asan City, Republic of Korea
| | - Jongsu Byun
- Pathology Team, Microscopic Examination, Dt&CRO, Yongin City, Republic of Korea
| | - Hangyu Lee
- Program Development Team, DeepSoft, Seoul, Republic of Korea
| | - Beomseok Han
- Department of Pharmaceutical Engineering, Life Health College, Hoseo University, Asan City, Republic of Korea
| |
Collapse
|
112
|
Shuqair M, Jimenez-Shahed J, Ghoraani B. Reinforcement Learning-Based Adaptive Classification for Medication State Monitoring in Parkinson's Disease. IEEE J Biomed Health Inform 2024; 28:6168-6179. [PMID: 38968013 DOI: 10.1109/jbhi.2024.3423708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/07/2024]
Abstract
Parkinson's Disease (PD) patients frequently transition between the 'ON' state, where medication is effective, and the 'OFF' state, affecting their quality of life. Monitoring these transitions is vital for personalized therapy. We introduced a framework based on Reinforcement Learning (RL) to detect transitions between medication states by learning from continuous movement data. Unlike traditional approaches that typically identify each state based on static data patterns, our approach focuses on understanding the dynamic patterns of change throughout the transitions, providing a more generalizable medication state monitoring method. We integrated a deep Long Short-Term Memory (LSTM) neural network and three one-class unsupervised classifiers to implement an RL-based adaptive classifier. We tested on two PD datasets: Dataset PD1 with 12 subjects (14-minute average recording) and Dataset PD2 with seven subjects (120-minute average recording). Data from wrist and ankle wearables captured transitions during 2 to 4-hour daily activities. The algorithm demonstrated its effectiveness in detecting medication states, achieving an average weighted F1-score of 82.94% when trained and tested on Dataset PD1. It performed well when trained on Dataset PD1 and tested on Dataset PD2, with a weighted F1-score of 76.67%. It surpassed other models, was resilient to severe PD symptoms, and performed well with imbalanced data. Notably, prior work has not addressed the generalizability from one dataset to another, essential for real-world applications with varied sensors. Our innovative framework revolutionizes PD monitoring, setting the stage for advanced therapeutic methods and greatly enhancing the life quality of PD patients.
Collapse
|
113
|
Gao Z, Cheng S, Wittrup E, Gryak J, Najarian K. Learning using privileged information with logistic regression on acute respiratory distress syndrome detection. Artif Intell Med 2024; 156:102947. [PMID: 39208711 DOI: 10.1016/j.artmed.2024.102947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 07/02/2024] [Accepted: 08/13/2024] [Indexed: 09/04/2024]
Abstract
The advanced learning paradigm, learning using privileged information (LUPI), leverages information in training that is not present at the time of prediction. In this study, we developed privileged logistic regression (PLR) models under the LUPI paradigm to detect acute respiratory distress syndrome (ARDS), with mechanical ventilation variables or chest x-ray image features employed in the privileged domain and electronic health records in the base domain. In model training, the objective of privileged logistic regression was designed to incorporate data from the privileged domain and encourage knowledge transfer across the privileged and base domains. An asymptotic analysis was also performed, yielding sufficient conditions under which the addition of privileged information increases the rate of convergence in the proposed model. Results for ARDS detection show that PLR models achieve better classification performances than logistic regression models trained solely on the base domain, even when privileged information is partially available. Furthermore, PLR models demonstrate performance on par with or superior to state-of-the-art models under the LUPI paradigm. As the proposed models are effective, easy to interpret, and highly explainable, they are ideal for other clinical applications where privileged information is at least partially available.
Collapse
Affiliation(s)
- Zijun Gao
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, 48109, MI, USA.
| | - Shuyang Cheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, 48109, MI, USA.
| | - Emily Wittrup
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, 48109, MI, USA.
| | - Jonathan Gryak
- Queens College, City University of New York, New York, 11367, NY, USA.
| | - Kayvan Najarian
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, 48109, MI, USA; Michigan Institute for Data Science (MIDAS), University of Michigan, Ann Arbor, 48109, MI, USA; Department of Emergency Medicine, University of Michigan, Ann Arbor, 48109, MI, USA; Max Harry Weil Institute for Critical Care Research and Innovation, University of Michigan, Ann Arbor, 48109, MI, USA; Queens College, City University of New York, New York, 11367, NY, USA.
| |
Collapse
|
114
|
Marzano L. Predicting the resolution of hypertension following adrenalectomy in primary aldosteronism: Controversies and unresolved issues a narrative review. Langenbecks Arch Surg 2024; 409:295. [PMID: 39354235 DOI: 10.1007/s00423-024-03486-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2024] [Accepted: 09/23/2024] [Indexed: 10/03/2024]
Abstract
BACKGROUND Hypertension resolution following adrenalectomy in patients with primary aldosteronism (PA) remains a critical clinical challenge. Identifying preoperatively which patients will become normotensive is both a priority and a point of contention. In this narrative review, we explore the controversies and unresolved issues surrounding the prediction of hypertension resolution after adrenalectomy in PA. METHODS A comprehensive literature review was conducted, focusing on studies published between 1954 and 2024 that evaluated all studies that discussed predictive models for hypertension resolution post-adrenalectomy in PA patients. Databases searched included MEDLINE®, Ovid Embase, and Web of Science databases. RESULTS The review identified several predictors and predictive models of hypertension resolution, including female sex, duration of hypertension, antihypertensive medication, and BMI. However, inconsistencies in study designs and patient populations led to varied conclusions. CONCLUSIONS Although certain predictors and predictive models of hypertension resolution post-adrenalectomy in PA patients are supported by evidence, significant controversies and unresolved issues remain. While the current predictive models provide valuable insights, there is a clear need for further research in this area. Future studies should focus on validating and refining these models.
Collapse
Affiliation(s)
- Luigi Marzano
- Centro Per Lo Studio E La Cura Dell'Ipertensione Arteriosa, Internal Medicine Unit, San Bortolo Hospital, U.L.S.S. 8 Berica, Vicenza, Italy.
- Internal Medicine Unit, San Bortolo Hospital, U.L.S.S. 8 Berica, 36100, Vicenza, Italy.
| |
Collapse
|
115
|
Nakayama LF, Matos J, Quion J, Novaes F, Mitchell WG, Mwavu R, Hung CJYJ, Santiago APD, Phanphruk W, Cardoso JS, Celi LA. Unmasking biases and navigating pitfalls in the ophthalmic artificial intelligence lifecycle: A narrative review. PLOS DIGITAL HEALTH 2024; 3:e0000618. [PMID: 39378192 PMCID: PMC11460710 DOI: 10.1371/journal.pdig.0000618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/10/2024]
Abstract
Over the past 2 decades, exponential growth in data availability, computational power, and newly available modeling techniques has led to an expansion in interest, investment, and research in Artificial Intelligence (AI) applications. Ophthalmology is one of many fields that seek to benefit from AI given the advent of telemedicine screening programs and the use of ancillary imaging. However, before AI can be widely deployed, further work must be done to avoid the pitfalls within the AI lifecycle. This review article breaks down the AI lifecycle into seven steps-data collection; defining the model task; data preprocessing and labeling; model development; model evaluation and validation; deployment; and finally, post-deployment evaluation, monitoring, and system recalibration-and delves into the risks for harm at each step and strategies for mitigating them.
Collapse
Affiliation(s)
- Luis Filipe Nakayama
- Department of Ophthalmology, Sao Paulo Federal University, Sao Paulo, Sao Paulo, Brazil
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - João Matos
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Faculty of Engineering (FEUP), University of Porto, Porto, Portugal
- Institute for Systems and Computer Engineering (INESC TEC), Technology and Science, Porto, Portugal
| | - Justin Quion
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Frederico Novaes
- Department of Ophthalmology, Sao Paulo Federal University, Sao Paulo, Sao Paulo, Brazil
| | | | - Rogers Mwavu
- Department of Information Technology, Mbarara University of Science and Technology, Mbarara, Uganda
| | - Claudia Ju-Yi Ji Hung
- Department of Ophthalmology, Byers Eye Institute at Stanford, California, United States of America
- Department of Computer Science and Information Engineering, National Taiwan University, Taiwan
| | - Alvina Pauline Dy Santiago
- University of the Philippines Manila College of Medicine, Manila, Philippines
- Division of Pediatric Ophthalmology, Department of Ophthalmology & Visual Sciences, Philippine General Hospital, Manila, Philippines
- Section of Pediatric Ophthalmology, Eye and Vision Institute, The Medical City, Pasig, Philippines
- Section of Pediatric Ophthalmology, International Eye and Institute, St. Luke’s Medical Center, Quezon City, Philippines
| | - Warachaya Phanphruk
- Department of Ophthalmology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
| | - Jaime S. Cardoso
- Faculty of Engineering (FEUP), University of Porto, Porto, Portugal
- Institute for Systems and Computer Engineering (INESC TEC), Technology and Science, Porto, Portugal
| | - Leo Anthony Celi
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, Massachusetts, United States of America
- Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| |
Collapse
|
116
|
Francisco ME, Carvajal TM, Watanabe K. Hybrid Machine Learning Approach to Zero-Inflated Data Improves Accuracy of Dengue Prediction. PLoS Negl Trop Dis 2024; 18:e0012599. [PMID: 39432557 PMCID: PMC11527386 DOI: 10.1371/journal.pntd.0012599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 10/31/2024] [Accepted: 10/01/2024] [Indexed: 10/23/2024] Open
Abstract
BACKGROUND Spatiotemporal dengue forecasting using machine learning (ML) can contribute to the development of prevention and control strategies for impending dengue outbreaks. However, training data for dengue incidence may be inflated with frequent zero values because of the rarity of cases, which lowers the prediction accuracy. This study aimed to understand the influence of spatiotemporal resolutions of data on the accuracy of dengue incidence prediction using ML models, to understand how the influence of spatiotemporal resolution differs between quantitative and qualitative predictions of dengue incidence, and to improve the accuracy of dengue incidence prediction with zero-inflated data. METHODOLOGY We predicted dengue incidence at six spatiotemporal resolutions and compared their prediction accuracy. Six ML algorithms were compared: generalized additive models, random forests, conditional inference forest, artificial neural networks, support vector machines and regression, and extreme gradient boosting. Data from 2009 to 2012 were used for training, and data from 2013 were used for model validation with quantitative and qualitative dengue variables. To address the inaccuracy in the quantitative prediction of dengue incidence due to zero-inflated data at fine spatiotemporal scales, we developed a hybrid approach in which the second-stage quantitative prediction is performed only when/where the first-stage qualitative model predicts the occurrence of dengue cases. PRINCIPAL FINDINGS At higher resolutions, the dengue incidence data were zero-inflated, which was insufficient for quantitative pattern extraction of relationships between dengue incidence and environmental variables by ML. Qualitative models, used as binary variables, eased the effect of data distribution. Our novel hybrid approach of combining qualitative and quantitative predictions demonstrated high potential for predicting zero-inflated or rare phenomena, such as dengue. SIGNIFICANCE Our research contributes valuable insights to the field of spatiotemporal dengue prediction and provides an alternative solution to enhance prediction accuracy in zero-inflated data where hurdle or zero-inflated models cannot be applied.
Collapse
Affiliation(s)
- Micanaldo Ernesto Francisco
- Center for Marine Environmental Studies (CMES), Ehime University, Matsuyama, Japan
- Graduate School of Science and Engineering, Ehime University, Matsuyama, Ehime, Japan
- Faculty of Architecture and Physical Planning (FAPF), Lurio University, Nampula, Mozambique
| | - Thaddeus M. Carvajal
- Center for Marine Environmental Studies (CMES), Ehime University, Matsuyama, Japan
- Department of Biology De La Salle University, Taft Ave Manila, Philippines
| | - Kozo Watanabe
- Center for Marine Environmental Studies (CMES), Ehime University, Matsuyama, Japan
| |
Collapse
|
117
|
Agraz M, Deng Y, Karniadakis GE, Mantzoros CS. Enhancing severe hypoglycemia prediction in type 2 diabetes mellitus through multi-view co-training machine learning model for imbalanced dataset. Sci Rep 2024; 14:22741. [PMID: 39349500 PMCID: PMC11444036 DOI: 10.1038/s41598-024-69844-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2024] [Accepted: 08/09/2024] [Indexed: 10/02/2024] Open
Abstract
Patients with type 2 diabetes mellitus (T2DM) who have severe hypoglycemia (SH) poses a considerable risk of long-term death, especially among the elderly, demanding urgent medical attention. Accurate prediction of SH remains challenging due to its multifaced nature, contributed from factors such as medications, lifestyle choices, and metabolic measurements. In this study, we propose a systematic approach to improve the robustness and accuracy of SH predictions using machine learning models, guided by clinical feature selection. Our focus is on developing long-term SH prediction models using both semi-supervised learning and supervised learning algorithms. Using the action to control cardiovascular risk in diabetes trial, which includes electronic health records for over 10,000 individuals, we focus on studying adults with T2DM. Our results indicate that the application of a multi-view co-training method, incorporating the random forest algorithm, improves the specificity of SH prediction, while the same setup with Naive Bayes replacing random forest demonstrates better sensitivity. Our framework also provides interpretability of machine learning models by identifying key predictors for hypoglycemia, including fasting plasma glucose, hemoglobin A1c, general diabetes education, and NPH or L insulins. The integration of data routinely available in electronic health records significantly enhances our model's capability to predict SH events, showcasing its potential to transform clinical practice by facilitating early interventions and optimizing patient management. By enhancing prediction accuracy and identifying crucial predictive features, our study contributes to advancing the understanding and management of hypoglycemia in this population.
Collapse
Affiliation(s)
- Melih Agraz
- Division of Applied Mathematics, Brown University, Providence, RI, 02912, USA
- Department of Statistics, Giresun University, Giresun, 28200, Turkey
- Department of Endocrinology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, 02215, USA
| | - Yixiang Deng
- Department of Computer and Information Science, College of Engineering, University of Delaware, Newark, DE, 19716, USA
- Ragon Institute of Mass General, MIT and Harvard, Cambridge, MA, 02142, USA
| | - George Em Karniadakis
- Division of Applied Mathematics, Brown University, Providence, RI, 02912, USA
- School of Engineering, Brown University, Providence, RI, 02912, USA
| | - Christos Socrates Mantzoros
- Department of Endocrinology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, 02215, USA.
| |
Collapse
|
118
|
Talebi Moghaddam M, Jahani Y, Arefzadeh Z, Dehghan A, Khaleghi M, Sharafi M, Nikfar G. Predicting diabetes in adults: identifying important features in unbalanced data over a 5-year cohort study using machine learning algorithm. BMC Med Res Methodol 2024; 24:220. [PMID: 39333899 PMCID: PMC11430121 DOI: 10.1186/s12874-024-02341-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2024] [Accepted: 09/16/2024] [Indexed: 09/30/2024] Open
Abstract
BACKGROUND Imbalanced datasets pose significant challenges in predictive modeling, leading to biased outcomes and reduced model reliability. This study addresses data imbalance in diabetes prediction using machine learning techniques. Utilizing data from the Fasa Adult Cohort Study (FACS) with a 5-year follow-up of 10,000 participants, we developed predictive models for Type 2 diabetes. METHODS We employed various data-level and algorithm-level interventions, including SMOTE, ADASYN, SMOTEENN, Random Over Sampling and KMeansSMOTE, paired with Random Forest, Gradient Boosting, Decision Tree and Multi-Layer Perceptron (MLP) classifier. We evaluated model performance using F1 score, AUC, and G-means-metrics chosen to provide a comprehensive assessment of model accuracy, discrimination ability, and overall balance in performance, particularly in the context of imbalanced datasets. RESULTS our study uncovered key factors influencing diabetes risk and evaluated the performance of various machine learning models. Feature importance analysis revealed that the most influential predictors of diabetes differ between males and females. For females, the most important factors are triglyceride (TG), basal metabolic rate (BMR), and total cholesterol (CHOL), whereas for males, the key predictors are body Mass Index (BMI), serum glutamate Oxaloacetate Transaminase (SGOT), and Gamma-Glutamyl (GGT). Across the entire dataset, BMI remains the most important variable, followed by SGOT, BMR, and energy intake. These insights suggest that gender-specific risk profiles should be considered in diabetes prevention and management strategies. In terms of model performance, our results show that ADASYN with MLP classifier achieved an F1 score of 82.17 ± 3.38, AUC of 89.61 ± 2.09, and G-means of 89.15 ± 2.31. SMOTE with MLP followed closely with an F1 score of 79.85 ± 3.91, AUC of 89.7 ± 2.54, and G-means of 89.31 ± 2.78. The SMOTEENN with Random Forest combination achieved an F1 score of 78.27 ± 1.54, AUC of 87.18 ± 1.12, and G-means of 86.47 ± 1.28. CONCLUSION These combinations effectively address class imbalance, improving the accuracy and reliability of diabetes predictions. The findings highlight the importance of using appropriate data-balancing techniques in medical data analysis.
Collapse
Affiliation(s)
- Maryam Talebi Moghaddam
- Noncommunicable Diseases Research Center, Fasa University of Medical Sciences, Fasa, Iran
- Student of Biostatistics, Department of Biostatistics and Epidemiology, School of Public Health, Kerman University of Medical Sciences, Kerman, Iran
| | - Yones Jahani
- Modeling in Health Research Center Institute for Futures Studies in Health, Kerman University of Medical Sciences, Kerman, Iran
| | - Zahra Arefzadeh
- Faculty of Data Science and Intelligent Systems, Persian Gulf University, Bushehr, Iran
| | - Azizallah Dehghan
- Noncommunicable Diseases Research Center, Fasa University of Medical Sciences, Fasa, Iran
- Department of Epidemiology and Biostatistics, School of Health, Fasa University of Medical Sciences, Fasa, Iran
| | - Mohsen Khaleghi
- Department of Mathematics and Computer Science, Fasa Branch, Islamic Azad University, Fasa, Iran.
| | - Mehdi Sharafi
- Endocrinology and Metabolism Research Center, Hormozgan University of Medical Sciences, Bandar, Abbas, Iran.
| | - Ghasem Nikfar
- Research Development Unit Valiasr Hospital, Fasa University of Medical Sciences, Fasa, Iran
| |
Collapse
|
119
|
Li B, Zhu X, Zhao D, Li Y, Yang Y, Li J, Bi C, Zhang X. igRNA Prediction and Selection AI Models (igRNA-PS) for Bystander-less ABE Base Editing. J Mol Biol 2024; 436:168714. [PMID: 39029887 DOI: 10.1016/j.jmb.2024.168714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 06/28/2024] [Accepted: 07/13/2024] [Indexed: 07/21/2024]
Abstract
CRISPR derived base editing techniques tend to edit multiple bases in the targeted region, which impedes precise reversion of disease-associated single nucleotide variations (SNVs). We designed an imperfect gRNA (igRNA) editing strategy to achieve bystander-less single-base editing. To predict the performance and provide ready-to-use igRNAs, we employed a high-throughput method to edit 5000 loci, each with approximate 19 systematically designed ABE igRNAs. Through deep learning of the relationship of editing efficiency, original gRNA sequence and igRNA sequence, AI models were constructed and tested, designated igRNA Prediction and Selection AI models (igRNA-PS). The models have three functions, First, they can identify the major editing site from the bystanders on a gRNA protospacer with a near 90% accuracy. second, a modified single-base editing efficiency (SBE), considering both single-base editing efficiency and product purity, can be predicted for any given igRNAs. Third, for an editing locus, a set of 64 igRNAs derived from a gRNA can be generated, evaluated through igRNA-PS to select for the best performer, and provided to the user. In this work, we overcome one of the most significant obstacles of base editors, and provide a convenient and efficient approach for single-base bystander-less ABE base editing.
Collapse
Affiliation(s)
- Bo Li
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300000, China; National Center of Technology Innovation for Synthetic Biology, Tianjin 300000, China
| | - Xiagu Zhu
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300000, China; National Center of Technology Innovation for Synthetic Biology, Tianjin 300000, China; College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300222, China
| | - Dongdong Zhao
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300000, China; National Center of Technology Innovation for Synthetic Biology, Tianjin 300000, China
| | - Yaqiu Li
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300000, China; National Center of Technology Innovation for Synthetic Biology, Tianjin 300000, China
| | - Yuanzhao Yang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300000, China; National Center of Technology Innovation for Synthetic Biology, Tianjin 300000, China; College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300222, China
| | - Ju Li
- College of Life Science, Tianjin Normal University, Tianjin, China
| | - Changhao Bi
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300000, China; National Center of Technology Innovation for Synthetic Biology, Tianjin 300000, China.
| | - Xueli Zhang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300000, China; National Center of Technology Innovation for Synthetic Biology, Tianjin 300000, China.
| |
Collapse
|
120
|
Wei S, Richard R, Hogue D, Mondal I, Xu T, Boyer T, Hamilton K. High resolution data visualization and machine learning prediction of free chlorine residual in a green building water system. WATER RESEARCH X 2024; 24:100244. [PMID: 39188328 PMCID: PMC11345929 DOI: 10.1016/j.wroa.2024.100244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/09/2024] [Revised: 07/23/2024] [Accepted: 07/25/2024] [Indexed: 08/28/2024]
Abstract
People spend most of their time indoors and are exposed to numerous contaminants in the built environment. Water management plans implemented in buildings are designed to manage the risks of preventable diseases caused by drinking water contaminants such as opportunistic pathogens (e.g., Legionella spp.), metals, and disinfection by-products (DBPs). However, specialized training required to implement water management plans and heterogeneity in building characteristics limit their widespread adoption. Implementation of machine learning and artificial intelligence (ML/AI) models in building water settings presents an opportunity for faster, more widespread use of data-driven water quality management approaches. We demonstrate the utility of Random Forest and Long Short-Term Memory (LSTM) ML models for predicting a key public health parameter, free chlorine residual, as a function of data collected from building water quality sensors (ORP, pH, conductivity, and temperature) as well as WiFi signals as a proxy for building occupancy and water usage in a "green" Leadership in Energy and Environmental Design (LEED) commercial and institutional building. The models successfully predicted free chlorine residual declines below 0.2 ppm, a common minimum reference level for public health protection in drinking water distribution systems. The predictions were valid up to 5 min in advance, and in some cases reasonably accurate up to 24 h in advance, presenting opportunities for proactive water quality management as part of a sense-analyze-decide framework. An online data dashboard for visualizing water quality in the building is presented, with the potential to link these approaches for real-time water quality management.
Collapse
Affiliation(s)
- S. Wei
- School of Sustainable Engineering and the Built Environment, Arizona State University, Tempe, AZ 85281, United States
| | - R. Richard
- Wilson & Company Engineers, United States
| | - D. Hogue
- School of Sustainable Engineering and the Built Environment, Arizona State University, Tempe, AZ 85281, United States
| | - I. Mondal
- School of Sustainable Engineering and the Built Environment, Arizona State University, Tempe, AZ 85281, United States
- Biodesign Center for Environmental Health Engineering, Arizona State University, Tempe, AZ 85281, United States
| | - T. Xu
- School of Sustainable Engineering and the Built Environment, Arizona State University, Tempe, AZ 85281, United States
| | - T.H. Boyer
- School of Sustainable Engineering and the Built Environment, Arizona State University, Tempe, AZ 85281, United States
| | - K.A. Hamilton
- School of Sustainable Engineering and the Built Environment, Arizona State University, Tempe, AZ 85281, United States
- Biodesign Center for Environmental Health Engineering, Arizona State University, Tempe, AZ 85281, United States
| |
Collapse
|
121
|
Islam S, Alfred M, Wilson D, Cohen E. Evaluating Active Learning Strategies for Automated Classification of Patient Safety Event Reports in Hospitals. PROCEEDINGS OF THE HUMAN FACTORS AND ERGONOMICS SOCIETY ... ANNUAL MEETING. HUMAN FACTORS AND ERGONOMICS SOCIETY. ANNUAL MEETING 2024; 68:465-472. [PMID: 39713192 PMCID: PMC11655274 DOI: 10.1177/10711813241260676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/24/2024]
Abstract
Patient safety event (PSE) reports, which document incidents that compromise patient safety, are fundamental for improving healthcare quality. Accurate classification of these reports is crucial for analyzing trends, guiding interventions, and supporting organizational learning. However, this process is labor-intensive due to the high volume and complex taxonomy of reports. Previous work has shown that machine learning (ML) can automate PSE report classification; however, its success depends on large manually-labeled datasets. This study leverages Active Learning (AL) strategies with human expertise to streamline PSE-report labeling. We utilize pool-based AL sampling to selectively query reports for human annotation, developing a robust dataset for training ML classifiers. Our experiments demonstrate that AL significantly outperforms random sampling in accuracy across various text representations, reducing the need for labeled samples by 24% to 69%. Based on these findings, we suggest that incorporating AL strategies into PSE-report labeling can effectively reduce manual workload while maintaining high classification accuracy.
Collapse
Affiliation(s)
- Shehnaz Islam
- Mechanical and Industrial Engineering, University of Toronto, Toronto, ON, Canada
| | - Myrtede Alfred
- Mechanical and Industrial Engineering, University of Toronto, Toronto, ON, Canada
| | - Dulaney Wilson
- Public Health Sciences, Medical University of South Carolina, Charleston, USA
| | - Eldan Cohen
- Mechanical and Industrial Engineering, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
122
|
Zahergivar A, Anari PY, Mendhiratta N, Lay N, Singh S, Firouzabadi FD, Chaurasia A, Golagha M, Homayounieh F, Gautam R, Harmon S, Turkbey E, Merino M, Jones EC, Ball MW, Turkbey B, Marston Linehan W, Malayeri AA. Non-Invasive Tumor Grade Evaluation in Von Hippel-Lindau-Associated Clear Cell Renal Cell Carcinoma: A Magnetic Resonance Imaging-Based Study. J Magn Reson Imaging 2024; 60:1076-1081. [PMID: 38299714 PMCID: PMC11291699 DOI: 10.1002/jmri.29222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/01/2023] [Accepted: 12/02/2023] [Indexed: 02/02/2024] Open
Abstract
BACKGROUND Pathology grading is an essential step for the treatment and evaluation of the prognosis in patients with clear cell renal cell carcinoma (ccRCC). PURPOSE To investigate the utility of texture analysis in evaluating Fuhrman grades of renal tumors in patients with Von Hippel-Lindau (VHL)-associated ccRCC, aiming to improve non-invasive diagnosis and personalized treatment. STUDY TYPE Retrospective analysis of a prospectively maintained cohort. POPULATION One hundred and thirty-six patients, 84 (61%) males and 52 (39%) females with pathology-proven ccRCC with a mean age of 52.8 ± 12.7 from 2010 to 2023. FIELD STRENGTH AND SEQUENCES 1.5 and 3 T MRIs. Segmentations were performed on the T1-weighted 3-minute delayed sequence and then registered on pre-contrast, T1-weighted arterial and venous sequences. ASSESSMENT A total of 404 lesions, 345 low-grade tumors, and 59 high-grade tumors were segmented using ITK-SNAP on a T1-weighted 3-minute delayed sequence of MRI. Radiomics features were extracted from pre-contrast, T1-weighted arterial, venous, and delayed post-contrast sequences. Preprocessing techniques were employed to address class imbalances. Features were then rescaled to normalize the numeric values. We developed a stacked model combining random forest and XGBoost to assess tumor grades using radiomics signatures. STATISTICAL TESTS The model's performance was evaluated using positive predictive value (PPV), sensitivity, F1 score, area under the curve of receiver operating characteristic curve, and Matthews correlation coefficient. Using Monte Carlo technique, the average performance of 100 benchmarks of 85% train and 15% test was reported. RESULTS The best model displayed an accuracy of 0.79. For low-grade tumor detection, a sensitivity of 0.79, a PPV of 0.95, and an F1 score of 0.86 were obtained. For high-grade tumor detection, a sensitivity of 0.78, PPV of 0.39, and F1 score of 0.52 were reported. DATA CONCLUSION Radiomics analysis shows promise in classifying pathology grades non-invasively for patients with VHL-associated ccRCC, potentially leading to better diagnosis and personalized treatment. LEVEL OF EVIDENCE 1 TECHNICAL EFFICACY: Stage 2.
Collapse
Affiliation(s)
- Aryan Zahergivar
- Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, USA
| | - Pouria Yazdian Anari
- Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, USA
| | - Neil Mendhiratta
- Urology Oncology Branch, National cancer institutes, National Institutes of Health, USA
| | - Nathan Lay
- Artificial Intelligence Resource, National Cancer Institute, National Institutes of Health, USA
| | - Shiva Singh
- Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, USA
| | | | - Aditi Chaurasia
- Urology Oncology Branch, National cancer institutes, National Institutes of Health, USA
| | - Mahshid Golagha
- Urology Oncology Branch, National cancer institutes, National Institutes of Health, USA
| | - Fatemeh Homayounieh
- Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, USA
| | - Rabindra Gautam
- Urology Oncology Branch, National cancer institutes, National Institutes of Health, USA
| | - Stephanie Harmon
- Artificial Intelligence Resource, National Cancer Institute, National Institutes of Health, USA
| | - Evrim Turkbey
- Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, USA
| | - Maria Merino
- Pathology Department, National Cancer Institute, National Institutes of Health, USA
| | - Elizabeth C. Jones
- Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, USA
| | - Mark W. Ball
- Urology Oncology Branch, National cancer institutes, National Institutes of Health, USA
| | - Baris Turkbey
- Artificial Intelligence Resource, National Cancer Institute, National Institutes of Health, USA
| | - W. Marston Linehan
- Urology Oncology Branch, National cancer institutes, National Institutes of Health, USA
| | - Ashkan A. Malayeri
- Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, USA
| |
Collapse
|
123
|
Kawai Y, Yamamoto K, Miyazaki K, Asai H, Fukushima H. Effects of Post-Hospital Arrival Factors on Out-of-Hospital Cardiac Arrest Outcomes During the COVID-19 Pandemic. Crit Care Explor 2024; 6:e1154. [PMID: 39254650 PMCID: PMC11390052 DOI: 10.1097/cce.0000000000001154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/11/2024] Open
Abstract
IMPORTANCE The relationship between post-hospital arrival factors and out-of-hospital cardiac arrest (OHCA) outcomes remains unclear. OBJECTIVES This study assessed the impact of post-hospital arrival factors on OHCA outcomes during the COVID-19 pandemic using a prediction model. DESIGN, SETTING, AND PARTICIPANTS In this cohort study, data from the All-Japan Utstein Registry, a nationwide population-based database, between 2015 and 2021 were used. A total of 541,781 patients older than 18 years old who experienced OHCA of cardiac origin were included. MAIN OUTCOMES AND MEASURES The primary exposure was trends in COVID-19 cases. The study compared the predicted proportion of favorable neurologic outcomes 1 month after resuscitation with the actual outcomes. Neurologic outcomes were categorized based on the Cerebral Performance Category score (1, good cerebral function; 2, moderate cerebral function). RESULTS The prediction model, which had an area under the curve of 0.96, closely matched actual outcomes in 2019. However, a significant discrepancy emerged after the pandemic began in 2020, where outcomes continued to deteriorate as the virus spread, exacerbated by both pre- and post-hospital arrival factors. CONCLUSIONS AND RELEVANCE Post-hospital arrival factors were as important as pre-hospital factors in adversely affecting the prognosis of patients following OHCA during the COVID-19 pandemic. The results suggest that the overall response of the healthcare system needs to be improved during infectious disease outbreaks to improve outcomes.
Collapse
Affiliation(s)
- Yasuyuki Kawai
- All authors: Department of Emergency and Critical Care Medicine, Nara Medical University, Nara, Japan
| | | | | | | | | |
Collapse
|
124
|
Askar M, Tafavvoghi M, Småbrekke L, Bongo LA, Svendsen K. Using machine learning methods to predict all-cause somatic hospitalizations in adults: A systematic review. PLoS One 2024; 19:e0309175. [PMID: 39178283 PMCID: PMC11343463 DOI: 10.1371/journal.pone.0309175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 08/06/2024] [Indexed: 08/25/2024] Open
Abstract
AIM In this review, we investigated how Machine Learning (ML) was utilized to predict all-cause somatic hospital admissions and readmissions in adults. METHODS We searched eight databases (PubMed, Embase, Web of Science, CINAHL, ProQuest, OpenGrey, WorldCat, and MedNar) from their inception date to October 2023, and included records that predicted all-cause somatic hospital admissions and readmissions of adults using ML methodology. We used the CHARMS checklist for data extraction, PROBAST for bias and applicability assessment, and TRIPOD for reporting quality. RESULTS We screened 7,543 studies of which 163 full-text records were read and 116 met the review inclusion criteria. Among these, 45 predicted admission, 70 predicted readmission, and one study predicted both. There was a substantial variety in the types of datasets, algorithms, features, data preprocessing steps, evaluation, and validation methods. The most used types of features were demographics, diagnoses, vital signs, and laboratory tests. Area Under the ROC curve (AUC) was the most used evaluation metric. Models trained using boosting tree-based algorithms often performed better compared to others. ML algorithms commonly outperformed traditional regression techniques. Sixteen studies used Natural language processing (NLP) of clinical notes for prediction, all studies yielded good results. The overall adherence to reporting quality was poor in the review studies. Only five percent of models were implemented in clinical practice. The most frequently inadequately addressed methodological aspects were: providing model interpretations on the individual patient level, full code availability, performing external validation, calibrating models, and handling class imbalance. CONCLUSION This review has identified considerable concerns regarding methodological issues and reporting quality in studies investigating ML to predict hospitalizations. To ensure the acceptability of these models in clinical settings, it is crucial to improve the quality of future studies.
Collapse
Affiliation(s)
- Mohsen Askar
- Faculty of Health Sciences, Department of Pharmacy, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Masoud Tafavvoghi
- Faculty of Science and Technology, Department of Computer Science, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Lars Småbrekke
- Faculty of Health Sciences, Department of Pharmacy, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Lars Ailo Bongo
- Faculty of Science and Technology, Department of Computer Science, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Kristian Svendsen
- Faculty of Health Sciences, Department of Pharmacy, UiT-The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
125
|
Butner JD, Dogra P, Chung C, Koay EJ, Welsh JW, Hong DS, Cristini V, Wang Z. Hybridizing mechanistic modeling and deep learning for personalized survival prediction after immune checkpoint inhibitor immunotherapy. NPJ Syst Biol Appl 2024; 10:88. [PMID: 39143136 PMCID: PMC11324794 DOI: 10.1038/s41540-024-00415-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 07/29/2024] [Indexed: 08/16/2024] Open
Abstract
We present a study where predictive mechanistic modeling is combined with deep learning methods to predict individual patient survival probabilities under immune checkpoint inhibitor (ICI) immunotherapy. This hybrid approach enables prediction based on both measures that are calculable from mechanistic models of key mechanisms underlying ICI therapy that may not be directly measurable in the clinic and easily measurable quantities or patient characteristics that are not always readily incorporated into predictive mechanistic models. A deep learning time-to-event predictive model trained on a hybrid mechanistic + clinical data set from 93 patients achieved higher per-patient predictive accuracy based on event-time concordance, Brier score, and negative binomial log-likelihood-based criteria than when trained on only mechanistic model-derived values or only clinical data. Feature importance analysis revealed that both clinical and model-derived parameters play prominent roles in increasing prediction accuracy, further supporting the advantage of our hybrid approach.
Collapse
Affiliation(s)
- Joseph D Butner
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
- Institute for Data Science in Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
- The Cameron School of Business, University of St. Thomas, Houston, TX, USA.
| | - Prashant Dogra
- Mathematics in Medicine Program, Houston Methodist Research Institute, Houston, TX, USA
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Caroline Chung
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Institute for Data Science in Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Eugene J Koay
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - James W Welsh
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - David S Hong
- Department of Investigational Cancer Therapeutics, University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Vittorio Cristini
- Mathematics in Medicine Program, Houston Methodist Research Institute, Houston, TX, USA
- Neal Cancer Center, Houston Methodist Research Institute, Houston, TX, USA
- Physiology, Biophysics, and Systems Biology Program, Graduate School of Medical Sciences, Weill Cornell Medicine, New York, NY, USA
- Department of Imaging Physics, University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Zhihui Wang
- Mathematics in Medicine Program, Houston Methodist Research Institute, Houston, TX, USA.
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA.
- Neal Cancer Center, Houston Methodist Research Institute, Houston, TX, USA.
- Department of Medical Education, Texas A&M University School of Medicine, Bryan, TX, USA.
| |
Collapse
|
126
|
Latifi M, Beig Zali R, Javadi AA, Farmani R. Customised-sampling approach for pipe failure prediction in water distribution networks. Sci Rep 2024; 14:18224. [PMID: 39107389 PMCID: PMC11303377 DOI: 10.1038/s41598-024-69109-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Accepted: 07/31/2024] [Indexed: 08/10/2024] Open
Abstract
This paper presents a new methodology for addressing imbalanced class data for failure prediction in Water Distribution Networks (WDNs). The proposed methodology relies on existing approaches including under-sampling, over-sampling, and class weighting as primary strategies. These techniques aim to treat the imbalanced datasets by adjusting the representation of minority and majority classes. Under-sampling reduces data in the majority class, over-sampling adds data to the minority class, and class weighting assigns unequal weights based on class counts to balance the influence of each class during machine learning (ML) model training. In this paper, the mentioned approaches were used at levels other than "balance point" to construct pipe failure prediction models for a WDN with highly imbalanced data. F1-score, and AUC-ROC, were selected to evaluate model performance. Results revealed that under-sampling above the balance point yields the highest F1-score, while over-sampling below the balance point achieves optimal results. Employing class weights during training and prediction emphasises the efficacy of lower weights than the balance. Combining under-sampling and over-sampling to the same ratio for both majority and minority classes showed limited improvement. However, a more effective predictive model emerged when over-sampling the minority class and under-sampling the majority class to different ratios, followed by applying class weights to balance data.
Collapse
Affiliation(s)
- Milad Latifi
- Centre for Water Systems, University of Exeter, Exeter, UK.
| | | | - Akbar A Javadi
- Centre for Water Systems, University of Exeter, Exeter, UK
| | | |
Collapse
|
127
|
Singleton AL, Glidden CK, Chamberlin AJ, Tuan R, Palasio RGS, Pinter A, Caldeira RL, Mendonça CLF, Carvalho OS, Monteiro MV, Athni TS, Sokolow SH, Mordecai EA, De Leo GA. Species distribution modeling for disease ecology: A multi-scale case study for schistosomiasis host snails in Brazil. PLOS GLOBAL PUBLIC HEALTH 2024; 4:e0002224. [PMID: 39093879 PMCID: PMC11296653 DOI: 10.1371/journal.pgph.0002224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 07/17/2024] [Indexed: 08/04/2024]
Abstract
Species distribution models (SDMs) are increasingly popular tools for profiling disease risk in ecology, particularly for infectious diseases of public health importance that include an obligate non-human host in their transmission cycle. SDMs can create high-resolution maps of host distribution across geographical scales, reflecting baseline risk of disease. However, as SDM computational methods have rapidly expanded, there are many outstanding methodological questions. Here we address key questions about SDM application, using schistosomiasis risk in Brazil as a case study. Schistosomiasis is transmitted to humans through contact with the free-living infectious stage of Schistosoma spp. parasites released from freshwater snails, the parasite's obligate intermediate hosts. In this study, we compared snail SDM performance across machine learning (ML) approaches (MaxEnt, Random Forest, and Boosted Regression Trees), geographic extents (national, regional, and state), types of presence data (expert-collected and publicly-available), and snail species (Biomphalaria glabrata, B. straminea, and B. tenagophila). We used high-resolution (1km) climate, hydrology, land-use/land-cover (LULC), and soil property data to describe the snails' ecological niche and evaluated models on multiple criteria. Although all ML approaches produced comparable spatially cross-validated performance metrics, their suitability maps showed major qualitative differences that required validation based on local expert knowledge. Additionally, our findings revealed varying importance of LULC and bioclimatic variables for different snail species at different spatial scales. Finally, we found that models using publicly-available data predicted snail distribution with comparable AUC values to models using expert-collected data. This work serves as an instructional guide to SDM methods that can be applied to a range of vector-borne and zoonotic diseases. In addition, it advances our understanding of the relevant environment and bioclimatic determinants of schistosomiasis risk in Brazil.
Collapse
Affiliation(s)
- Alyson L. Singleton
- Emmett Interdisciplinary Program in Environment and Resources, Stanford University, Stanford, California, United States of America
| | - Caroline K. Glidden
- Department of Biology, Stanford University, Stanford, California, United States of America
- Institute for Human-centered Artificial Intelligence, Stanford University, Stanford, California, United States of America
| | - Andrew J. Chamberlin
- Department of Oceans, Hopkins Marine Station, Stanford University, Pacific Grove, California, United States of America
| | | | | | | | | | | | - Omar S. Carvalho
- Fiocruz Minas/Belo Horizonte-Minas Gerais, Belo Horizonte, Brazil
| | - Miguel V. Monteiro
- Geoinformation & Earth Observation Division, National Institute for Space Research (INPE), São Paulo, Brazil
| | - Tejas S. Athni
- Department of Biology, Stanford University, Stanford, California, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Susanne H. Sokolow
- Department of Oceans, Hopkins Marine Station, Stanford University, Pacific Grove, California, United States of America
- Marine Science Institute, University of California Santa Barbara, Santa Barbara, California, United States of America
| | - Erin A. Mordecai
- Department of Biology, Stanford University, Stanford, California, United States of America
- Woods Institute for the Environment, Stanford University, Stanford, California, United States of America
| | - Giulio A. De Leo
- Department of Oceans, Hopkins Marine Station, Stanford University, Pacific Grove, California, United States of America
- Woods Institute for the Environment, Stanford University, Stanford, California, United States of America
| |
Collapse
|
128
|
Sonmez ME, Gumus NE, Eczacioglu N, Develi EE, Yücel K, Yildiz HB. Enhancing microalgae classification accuracy in marine ecosystems through convolutional neural networks and support vector machines. MARINE POLLUTION BULLETIN 2024; 205:116616. [PMID: 38936001 DOI: 10.1016/j.marpolbul.2024.116616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 06/13/2024] [Accepted: 06/16/2024] [Indexed: 06/29/2024]
Abstract
Accurately classifying microalgae species is vital for monitoring marine ecosystems and managing the emergence of marine mucilage, which is crucial for monitoring mucilage phenomena in marine environments. Traditional methods have been inadequate due to time-consuming processes and the need for expert knowledge. The purpose of this article is to employ convolutional neural networks (CNNs) and support vector machines (SVMs) to improve classification accuracy and efficiency. By employing advanced computational techniques, including MobileNet and GoogleNet models, alongside SVM classification, the study demonstrates significant advancements over conventional identification methods. In the classification of a dataset consisting of 7820 images using four different SVM kernel functions, the linear kernel achieved the highest success rate at 98.79 %. It is followed by the RBF kernel at 98.73 %, the polynomial kernel at 97.84 %, and the sigmoid kernel at 97.20 %. This research not only provides a methodological framework for future studies in marine biodiversity monitoring but also highlights the potential for real-time applications in ecological conservation and understanding mucilage dynamics amidst climate change and environmental pollution.
Collapse
Affiliation(s)
- Mesut Ersin Sonmez
- Department of Bioengineering, Faculty of Engineering, Karamanoglu Mehmetbey University, Karaman, Türkiye
| | - Numan Emre Gumus
- Department of Environmental Protection Technology, Kazım Karabekir Vocational School, Karamanoglu Mehmetbey University, Karaman, Türkiye.
| | - Numan Eczacioglu
- Department of Bioengineering, Faculty of Engineering, Karamanoglu Mehmetbey University, Karaman, Türkiye
| | | | - Kamile Yücel
- Department of Medical Biochemistry, Faculty of Medicine, KTO, Karatay University, Konya, Türkiye
| | | |
Collapse
|
129
|
Truong B, Zapala M, Kammen B, Luu K. Automated Detection of Pediatric Foreign Body Aspiration from Chest X-rays Using Machine Learning. Laryngoscope 2024; 134:3807-3814. [PMID: 38366768 DOI: 10.1002/lary.31338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 01/19/2024] [Accepted: 01/26/2024] [Indexed: 02/18/2024]
Abstract
OBJECTIVE/HYPOTHESIS Standard chest radiographs are a poor diagnostic tool for pediatric foreign body aspiration. Machine learning may improve upon the diagnostic capabilities of chest radiographs. The objective is to develop a machine learning algorithm that improves the diagnostic capabilities of chest radiographs in pediatric foreign body aspiration. METHOD This retrospective, diagnostic study included a retrospective chart review of patients with a potential diagnosis of FBA from 2010 to 2020. Frontal view chest radiographs were extracted, processed, and uploaded to Google AutoML Vision. The developed algorithm was then evaluated against a pediatric radiologist. RESULTS The study selected 566 patients who were presented with a suspected diagnosis of foreign body aspiration. One thousand six hundred and eighty eight chest radiograph images were collected. The sensitivity and specificity of the radiologist interpretation were 50.6% (43.1-58.0) and 88.7% (85.3-91.5), respectively. The sensitivity and specificity of the algorithm were 66.7% (43.0-85.4) and 95.3% (90.6-98.1), respectively. The precision and recall of the algorithm were both 91.8% with an AuPRC of 98.3%. CONCLUSION Chest radiograph analysis augmented with machine learning can diagnose foreign body aspiration in pediatric patients at a level similar to a read performed by a pediatric radiologist despite only using single-view, fixed images. Overall, this study highlights the potential and capabilities of machine learning in diagnosing conditions with a wide range of clinical presentations. LEVEL OF EVIDENCE 3 Laryngoscope, 134:3807-3814, 2024.
Collapse
Affiliation(s)
- Brandon Truong
- School of Medicine, University of California, San Francisco, California, U.S.A
| | - Matthew Zapala
- Division of Pediatric Radiology, Department of Radiology and Biomedical Imaging, University of California, San Francisco, California, U.S.A
| | - Bamidele Kammen
- Division of Pediatric Radiology, Department of Radiology and Biomedical Imaging, University of California, San Francisco, California, U.S.A
| | - Kimberly Luu
- Division of Pediatric Otolaryngology, Department of Otolaryngology-Head and Neck Surgery, University of California, San Francisco, California, U.S.A
| |
Collapse
|
130
|
Schumann P, Trentzsch K, Stölzer-Hutsch H, Jochim T, Scholz M, Malberg H, Ziemssen T. Using machine learning algorithms to detect fear of falling in people with multiple sclerosis in standardized gait analysis. Mult Scler Relat Disord 2024; 88:105721. [PMID: 38885599 DOI: 10.1016/j.msard.2024.105721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 06/04/2024] [Accepted: 06/10/2024] [Indexed: 06/20/2024]
Abstract
INTRODUCTION Multiple sclerosis (MS) is the most common chronic inflammatory disease of the central nervous system. The progressive impairment of gait is one of the most important pathognomic symptoms which are associated with falls and fear of falling (FOF) in people with MS (pwMS). 60 % of pwMS show a FOF, which leads to restrictions in mobility as well as physical activity and reduces the quality of life in general. Therefore, early detection of FOF is crucial because it enables early implementation of rehabilitation strategies as well as clinical decision-making to reduce progression. Qualitative and quantitative evaluation of gait pattern is an essential aspect of disease assessment and can provide valuable insights for personalized treatment decisions in pwMS. Our objective was to identify the most appropriate clinical gait analysis methods to identify FOF in pwMS and to detect the optimal machine learning (ML) algorithms to predict FOF using the complex multidimensional data from gait analysis. METHODS Data of 1240 pwMS was recorded at the MS Centre of the University Hospital Dresden between November 2020 and September 2021. Patients performed a multidimensional gait analysis with pressure and motion sensors, as well as patient-reported outcomes (PROs), according to a standardized protocol. A feature selection ensemble (FS-Ensemble) was developed to improve the classification performance. The FS-Ensemble consisted of four filtering methods: Chi-square test, information gain, minimum redundancy maximum relevance and ReliefF. Gaussian Naive Bayes, Decision Tree, k-Nearest Neighbor, and Support Vector Machines (SVM) were used to identify FOF. RESULTS The descriptive analysis showed that 37 % of the 1240 pwMS had a FOF (n = 458; age: 51 ± 16 years, 76 % women, median EDSS: 4.0). The FS-Ensemble improved classification performance in most cases. The SVM showed the best performance of the four classification models in detecting FOF. The PROs showed the best F1 scores (Early Mobility Impairment Questionnaire F1 = 0.81 ± 0.00 and 12-item Multiple Sclerosis Scale F1 = 0.80 ± 0.00). CONCLUSION FOF is an important psychological risk factor associated with an increased risk of falls. To integrate a functional early warning system for fall detection into MS management and progression monitoring, it is necessary to detect the relevant gait parameters as well as assessment methods. In this context, ML strategies allow the integration of gait parameters from clinical routine to support the initiation of early rehabilitation measures and adaptation of course-modifying therapeutics. The results of this study confirm that patients' self-assessments play an important role in disease management.
Collapse
Affiliation(s)
- Paula Schumann
- Institute of Biomedical Engineering, TU Dresden, Fetscherstr. 29, Dresden 01307, Germany
| | - Katrin Trentzsch
- Center of Clinical Neuroscience, Neurological Clinic, University Hospital Carl Gustav Carus, TU Dresden, Fetscherstr. 74, Dresden 01307, Germany
| | - Heidi Stölzer-Hutsch
- Center of Clinical Neuroscience, Neurological Clinic, University Hospital Carl Gustav Carus, TU Dresden, Fetscherstr. 74, Dresden 01307, Germany
| | - Thurid Jochim
- Institute of Biomedical Engineering, TU Dresden, Fetscherstr. 29, Dresden 01307, Germany
| | - Maria Scholz
- Center of Clinical Neuroscience, Neurological Clinic, University Hospital Carl Gustav Carus, TU Dresden, Fetscherstr. 74, Dresden 01307, Germany
| | - Hagen Malberg
- Institute of Biomedical Engineering, TU Dresden, Fetscherstr. 29, Dresden 01307, Germany
| | - Tjalf Ziemssen
- Center of Clinical Neuroscience, Neurological Clinic, University Hospital Carl Gustav Carus, TU Dresden, Fetscherstr. 74, Dresden 01307, Germany.
| |
Collapse
|
131
|
Teng S, Wang B, Yang F, Yi X, Zhang X, Sun Y. MediDRNet: Tackling category imbalance in diabetic retinopathy classification with dual-branch learning and prototypical contrastive learning. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 253:108230. [PMID: 38810377 DOI: 10.1016/j.cmpb.2024.108230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 04/17/2024] [Accepted: 05/14/2024] [Indexed: 05/31/2024]
Abstract
BACKGROUND AND OBJECTIVE The classification of diabetic retinopathy (DR) aims to utilize the implicit information in images for early diagnosis, to prevent and mitigate the further worsening of the condition. However, existing methods are often limited by the need to operate within large, annotated datasets to show significant advantages. Additionally, the number of samples for different categories within the dataset needs to be evenly distributed, because the characteristic of sample imbalance distribution can lead to an excessive focus on high-frequency disease categories, while neglecting the less common but equally important disease categories. Therefore, there is an urgent need to develop a new classification method that can effectively alleviate the issue of sample distribution imbalance, thereby enhancing the accuracy of diabetic retinopathy classification. METHODS In this work, we propose MediDRNet, a dual-branch network model based on prototypical contrastive learning. This model adopts prototype contrastive learning, creating prototypes for different levels of lesions, ensuring they represent the core features of each lesion level. It classifies by comparing the similarity between data points and their category prototypes. Our dual-branch network structure effectively resolves the issue of category imbalance and improves classification accuracy by emphasizing subtle differences in retinal lesions. Moreover, our approach combines a dual-branch network with specific lesion-level prototypes for core feature representation and incorporates the convolutional block attention module for enhanced lesion feature identification. RESULTS Our experiments using both the Kaggle and UWF classification datasets have demonstrated that MediDRNet exhibits exceptional performance compared to other advanced models in the industry, especially on the UWF DR classification dataset where it achieved state-of-the-art performance across all metrics. On the Kaggle DR classification dataset, it achieved the highest average classification accuracy (0.6327) and Macro-F1 score (0.6361). Particularly in the classification tasks for minority categories of diabetic retinopathy on the Kaggle dataset (Grades 1, 2, 3, and 4), the model reached high classification accuracies of 58.08%, 55.32%, 69.73%, and 90.21%, respectively. In the ablation study, the MediDRNet model proved to be more effective in feature extraction from diabetic retinal fundus images compared to other feature extraction methods. CONCLUSIONS This study employed prototype contrastive learning and bidirectional branch learning strategies, successfully constructing a grading system for diabetic retinopathy lesions within imbalanced diabetic retinopathy datasets. Through a dual-branch network, the feature learning branch effectively facilitated a smooth transition of features from the grading network to the classification learning branch, accurately identifying minority sample categories. This method not only effectively resolved the issue of sample imbalance but also provided strong support for the precise grading and early diagnosis of diabetic retinopathy in clinical applications, showcasing exceptional performance in handling complex diabetic retinopathy datasets. Moreover, this research significantly improved the efficiency of prevention and management of disease progression in diabetic retinopathy patients within medical practice. We encourage the use and modification of our code, which is publicly accessible on GitHub: https://github.com/ReinforceLove/MediDRNet.
Collapse
Affiliation(s)
- Siying Teng
- Department of Ophthalmology, the First Hospital of Jilin University, Changchun, 130021, Jilin, China
| | - Bo Wang
- University of Minho, Braga, 4710-057, Braga District, Portugal
| | - Feiyang Yang
- College of Computer Science and Technology, Jilin University, Changchun, 130012, Jilin, China
| | - Xingcheng Yi
- Laboratory of Cancer Precision Medicine, the First Hospital of Jilin University, Changchun, 130013, Jilin, China
| | - Xinmin Zhang
- Department of Regenerative Medicine, School of Pharmaceutical Science, Jilin University, Changchun, 130021, Jilin, China
| | - Yabin Sun
- Department of Ophthalmology, the First Hospital of Jilin University, Changchun, 130021, Jilin, China.
| |
Collapse
|
132
|
Palmieri F, Akhtar NF, Pané A, Jiménez A, Olbeyra RP, Viaplana J, Vidal J, de Hollanda A, Gama-Perez P, Jiménez-Chillarón JC, Garcia-Roves PM. Machine learning allows robust classification of visceral fat in women with obesity using common laboratory metrics. Sci Rep 2024; 14:17263. [PMID: 39068287 PMCID: PMC11283481 DOI: 10.1038/s41598-024-68269-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 07/22/2024] [Indexed: 07/30/2024] Open
Abstract
The excessive accumulation and malfunctioning of visceral adipose tissue (VAT) is a major determinant of increased risk of obesity-related comorbidities. Thus, risk stratification of people living with obesity according to their amount of VAT is of clinical interest. Currently, the most common VAT measurement methods include mathematical formulae based on anthropometric dimensions, often biased by human measurement errors, bio-impedance, and image techniques such as X-ray absorptiometry (DXA) analysis, which requires specialized equipment. However, previous studies showed the possibility of classifying people living with obesity according to their VAT through blood chemical concentrations by applying machine learning techniques. In addition, most of the efforts were spent on men living with obesity while little was done for women. Therefore, this study aims to compare the performance of the multilinear regression model (MLR) in estimating VAT and six different supervised machine learning classifiers, including logistic regression (LR), support vector machine and decision tree-based models, to categorize 149 women living with obesity. For clustering, the study population was categorized into classes 0, 1, and 2 according to their VAT and the accuracy of each MLR and classification model was evaluated using DXA-data (DXAdata), blood chemical concentrations (BLDdata), and both DXAdata and BLDdata together (ALLdata). Estimation error and R 2 were computed for MLR, while receiver operating characteristic (ROC) and precision-recall curves (PR) area under the curve (AUC) were used to assess the performance of every classification model. MLR models showed a poor ability to estimate VAT with mean absolute error ≥ 401.40 andR 2 ≤ 0.62 in all the datasets. The highest accuracy was found for LR with values of 0.57, 0.63, and 0.53 for ALLdata, DXAdata, and BLDdata, respectively. The ROC AUC showed a poor ability of both ALLdata and DXAdata to distinguish class 1 from classes 0 and 2 (AUC = 0.31, 0.71, and 0.85, respectively) as also confirmed by PR (AUC = 0.24, 0.57, and 0.73, respectively). However, improved performances were obtained when applying LR model to BLDdata (ROC AUC ≥ 0.61 and PR AUC ≥ 0.42), especially for class 1. These results seem to suggest that, while a direct and reliable estimation of VAT was not possible in our cohort, blood sample-derived information can robustly classify women living with obesity by machine learning-based classifiers, a fact that could benefit the clinical practice, especially in those health centres where medical imaging devices are not available. Nonetheless, these promising findings should be further validated over a larger population.
Collapse
Affiliation(s)
- Flavio Palmieri
- Biophysics unit, Department of Physiological Sciences, Faculty of Medicine and Health, Universitat de Barcelona, Bellvitge campus, 08907, Barcelona, Spain.
- Nutrition, Metabolism and Gene Therapy Group; Diabetes and Metabolism Program; Bellvitge Biomedical Research Institute (IDIBELL), 08908, Barcelona, Spain.
| | - Nidà Farooq Akhtar
- Escola d'Enginyeria de Barcelona Est (EEBE) Universitat Politècnica De Catalunya. Barcelona Tech-UPC, 08019, Barcelona, Spain
| | - Adriana Pané
- Obesity Unit, Endocrinology and Nutrition Department, Hospital Clínic de Barcelona, 08036, Barcelona, Spain
- Centro de Investigación Biomédica en Red de la Fisiopatología de la Obesidad y Nutrición (CIBEROBN), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain
| | - Amanda Jiménez
- Obesity Unit, Endocrinology and Nutrition Department, Hospital Clínic de Barcelona, 08036, Barcelona, Spain
- Centro de Investigación Biomédica en Red de la Fisiopatología de la Obesidad y Nutrición (CIBEROBN), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain
- Fundació Clínic per a la Recerca Biomèdica (FCRB)-Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), 08036, Barcelona, Spain
| | - Romina Paula Olbeyra
- Fundació Clínic per a la Recerca Biomèdica (FCRB)-Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), 08036, Barcelona, Spain
| | - Judith Viaplana
- Fundació Clínic per a la Recerca Biomèdica (FCRB)-Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), 08036, Barcelona, Spain
| | - Josep Vidal
- Obesity Unit, Endocrinology and Nutrition Department, Hospital Clínic de Barcelona, 08036, Barcelona, Spain
- Fundació Clínic per a la Recerca Biomèdica (FCRB)-Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), 08036, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Diabetes y Enfermedades Metabólicas Asociadas (CIBERDEM), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain
| | - Ana de Hollanda
- Obesity Unit, Endocrinology and Nutrition Department, Hospital Clínic de Barcelona, 08036, Barcelona, Spain
- Centro de Investigación Biomédica en Red de la Fisiopatología de la Obesidad y Nutrición (CIBEROBN), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain
- Fundació Clínic per a la Recerca Biomèdica (FCRB)-Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), 08036, Barcelona, Spain
| | - Pau Gama-Perez
- Biophysics unit, Department of Physiological Sciences, Faculty of Medicine and Health, Universitat de Barcelona, Bellvitge campus, 08907, Barcelona, Spain
| | - Josep C Jiménez-Chillarón
- Biophysics unit, Department of Physiological Sciences, Faculty of Medicine and Health, Universitat de Barcelona, Bellvitge campus, 08907, Barcelona, Spain
- Metabolic diseases of pediatric origin unit, Institut de Recerca Sant Joan de Déu - Barcelona Children's Hospital, 08950, Esplugues del Llobregat, Spain
| | - Pablo M Garcia-Roves
- Biophysics unit, Department of Physiological Sciences, Faculty of Medicine and Health, Universitat de Barcelona, Bellvitge campus, 08907, Barcelona, Spain.
- Nutrition, Metabolism and Gene Therapy Group; Diabetes and Metabolism Program; Bellvitge Biomedical Research Institute (IDIBELL), 08908, Barcelona, Spain.
- Centro de Investigación Biomédica en Red de la Fisiopatología de la Obesidad y Nutrición (CIBEROBN), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain.
| |
Collapse
|
133
|
Kim YS, Park SH, Lee IY, Son GM, Baek KR. AI-driven automatic compression system for colorectal anastomosis. J Robot Surg 2024; 18:290. [PMID: 39039393 DOI: 10.1007/s11701-024-02015-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 06/12/2024] [Indexed: 07/24/2024]
Abstract
Although circular staplers offer technical advancements over traditional hand-sewn techniques, their use remains challenging for unskilled users, necessitating substantial time and experience for mastery. In particular, it is challenging to apply a consistent pressure of an appropriate magnitude. We developed an automated circular anastomosis device using artificial intelligence (AI) to solve this problem. Automation through AI reduces experiential factors during the anastomosis process. We defined damage occurring during the anastomosis process, noting that a greater depth of damage indicated a more severe injury. For automated anastomosis, data at a tissue strain of 40% were used for the AI model, as this strain level showed optimal performance based on the accuracy and cost matrix. We compared the outcomes of automated anastomosis using a trained AI with those of unskilled users. The results were validated using the Shapiro-Wilk test and t tests. Compression damage was verified on collagen sheets. The AI-driven automatic compression system resulted in less damage compared to unskilled users. In particular, a more significant difference in damage was observed in poor-condition collagen than in good-condition collagen. Damage to the collagen under poor conditions was 54.8% when handled by unskilled users, while the AI-driven automatic compression system resulted in 38.9% damage. This study confirmed that novices' use of AI for automated anastomosis reduces the risk of damage, especially for tissues in poor condition.
Collapse
Affiliation(s)
- Yong Seop Kim
- Department of Electrical and Electronics Engineering, Pusan National University, Busan, South Korea
| | - Sang Ho Park
- Department of Electrical and Electronics Engineering, Pusan National University, Busan, South Korea
| | - In Young Lee
- Department of Surgery, Pusan National University School of Medicine and Research Institute for Convergence of Biomedical Science and Technology, Pusan National University Yangsan Hospital, Yangsan, Korea
| | - Gyung Mo Son
- Department of Surgery, Pusan National University School of Medicine and Research Institute for Convergence of Biomedical Science and Technology, Pusan National University Yangsan Hospital, Yangsan, Korea.
| | - Kwang Ryul Baek
- Department of Electrical and Electronics Engineering, Pusan National University, Busan, South Korea.
| |
Collapse
|
134
|
Irani Shemirani M. Transcriptional markers classifying Escherichia coli and Staphylococcus aureus induced sepsis in adults: A data-driven approach. PLoS One 2024; 19:e0305920. [PMID: 38968271 PMCID: PMC11226107 DOI: 10.1371/journal.pone.0305920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Accepted: 06/06/2024] [Indexed: 07/07/2024] Open
Abstract
Sepsis is a life-threatening condition mainly caused by gram-negative and gram-positive bacteria. Understanding the type of causative agent in the early stages is essential for precise antibiotic therapy. This study sought to identify a host gene set capable of distinguishing between sepsis induced by gram-negative bacteria; Escherichia coli and gram-positive bacteria; Staphylococcus aureus in community-onset adult patients. In the present study, microarray expression information was used to apply the Least Absolute Shrinkage and Selection Operator (Lasso) technique to select the predictive gene set for classifying sepsis induced by E. coli or S. aureus pathogens. We identified 25 predictive genes, including LILRA5 and TNFAIP6, which had previously been associated with sepsis in other research. Using these genes, we trained a logistic regression classifier to distinguish whether a sample contains an E. coli or S. aureus infection or belongs to a healthy control group, and subsequently assessed its performance. The classifier achieved an Area Under the Curve (AUC) of 0.96 for E. coli and 0.98 for S. aureus-induced sepsis, and perfect discrimination (AUC of 1) for healthy controls from the other conditions in a 10-fold cross-validation. The genes demonstrated an AUC of 0.75 in distinguishing between sepsis patients with E. coli and S. aureus pathogens. These findings were further confirmed in two distinct independent validation datasets which gave high prediction AUC ranging from 0.72-0.87 and 0.62 in distinguishing three groups of participants and two groups of patients respectively. These genes were significantly enriched in the immune system, cytokine signaling in immune system, innate immune system, and interferon signaling. Transcriptional patterns in blood can differentiate patients with E. coli-induced sepsis from those with S. aureus-induced sepsis. These diagnostic markers, upon validation in larger trials, may serve as a foundation for a reliable differential diagnostics assay.
Collapse
Affiliation(s)
- Mahnaz Irani Shemirani
- Department of Laboratory Medicine, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| |
Collapse
|
135
|
Oku T, Furuya S, Lee A, Altenmüller E. Video-based diagnosis support system for pianists with Musician's dystonia. Front Neurol 2024; 15:1409962. [PMID: 39015318 PMCID: PMC11250081 DOI: 10.3389/fneur.2024.1409962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2024] [Accepted: 06/18/2024] [Indexed: 07/18/2024] Open
Abstract
Background Musician's dystonia is a task-specific movement disorder that deteriorates fine motor control of skilled movements in musical performance. Although this disorder threatens professional careers, its diagnosis is challenging for clinicians who have no specialized knowledge of musical performance. Objectives To support diagnostic evaluation, the present study proposes a novel approach using a machine learning-based algorithm to identify the symptomatic movements of Musician's dystonia. Methods We propose an algorithm that identifies the dystonic movements using the anomaly detection method with an autoencoder trained with the hand kinematics of healthy pianists. A unique feature of the algorithm is that it requires only the video image of the hand, which can be derived by a commercially available camera. We also measured the hand biomechanical functions to assess the contribution of peripheral factors and improve the identification of dystonic symptoms. Results The proposed algorithm successfully identified Musician's dystonia with an accuracy and specificity of 90% based only on video footages of the hands. In addition, we identified the degradation of biomechanical functions involved in controlling multiple fingers, which is not specific to musical performance. By contrast, there were no dystonia-specific malfunctions of hand biomechanics, including the strength and agility of individual digits. Conclusion These findings demonstrate the effectiveness of the present technique in aiding in the accurate diagnosis of Musician's dystonia.
Collapse
Affiliation(s)
- Takanori Oku
- College of Engineering and Design, Shibaura Institute of Technology, Tokyo, Japan
- Sony Computer Science Laboratories, Inc., Tokyo, Japan
- NeuroPiano Institute, Kyoto, Japan
| | - Shinichi Furuya
- Sony Computer Science Laboratories, Inc., Tokyo, Japan
- NeuroPiano Institute, Kyoto, Japan
- Institute of Music Physiology and Musicians’ Medicine, University of Music, Drama and Media, Hanover, Germany
| | - André Lee
- Institute of Music Physiology and Musicians’ Medicine, University of Music, Drama and Media, Hanover, Germany
- Department of Neurology, Klinikum rechts der Isar, Technical University of Munich, München, Germany
| | - Eckart Altenmüller
- Institute of Music Physiology and Musicians’ Medicine, University of Music, Drama and Media, Hanover, Germany
| |
Collapse
|
136
|
Chen TLW, Shimizu MR, Buddhiraju A, Seo HH, Subih MA, Chen SF, Kwon YM. Predicting 30-day unplanned hospital readmission after revision total knee arthroplasty: machine learning model analysis of a national patient cohort. Med Biol Eng Comput 2024; 62:2073-2086. [PMID: 38451418 DOI: 10.1007/s11517-024-03054-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 02/18/2024] [Indexed: 03/08/2024]
Abstract
Revision total knee arthroplasty (TKA) is associated with a higher risk of readmission than primary TKA. Identifying individual patients predisposed to readmission can facilitate proactive optimization and increase care efficiency. This study developed machine learning (ML) models to predict unplanned readmission following revision TKA using a national-scale patient dataset. A total of 17,443 revision TKA cases (2013-2020) were acquired from the ACS NSQIP database. Four ML models (artificial neural networks, random forest, histogram-based gradient boosting, and k-nearest neighbor) were developed on relevant patient variables to predict readmission following revision TKA. The length of stay, operation time, body mass index (BMI), and laboratory test results were the strongest predictors of readmission. Histogram-based gradient boosting was the best performer in distinguishing readmission (AUC: 0.95) and estimating the readmission probability for individual patients (calibration slope: 1.13; calibration intercept: -0.00; Brier score: 0.064). All models produced higher net benefit than the default strategies of treating all or no patients, supporting the clinical utility of the models. ML demonstrated excellent performance for the prediction of readmission following revision TKA. Optimization of important predictors highlighted by our model may decrease preventable hospital readmission following surgery, thereby leading to reduced financial burden and improved patient satisfaction.
Collapse
Affiliation(s)
- Tony Lin-Wei Chen
- Bioengineering Laboratory, Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Department of Biomedical Engineering, Faculty of Engineering, The Hong Kong Polytechnic University, Hong Kong SAR, China
| | - Michelle Riyo Shimizu
- Bioengineering Laboratory, Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Anirudh Buddhiraju
- Bioengineering Laboratory, Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Henry Hojoon Seo
- Bioengineering Laboratory, Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Murad Abdullah Subih
- Bioengineering Laboratory, Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Shane Fei Chen
- Bioengineering Laboratory, Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Department of Biomedical Engineering, Faculty of Engineering, The Hong Kong Polytechnic University, Hong Kong SAR, China
| | - Young-Min Kwon
- Bioengineering Laboratory, Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
137
|
Cai Z, Xu Z, Chen Y, Zhang R, Guo B, Chen H, Ouyang F, Chen X, Chen X, Liu D, Luo C, Li X, Liu W, Zhou C, Guan X, Liu Z, Zhao H, Hu Q. Multiparametric MRI subregion radiomics for preoperative assessment of high-risk subregions in microsatellite instability of rectal cancer patients: a multicenter study. Int J Surg 2024; 110:4310-4319. [PMID: 38498392 PMCID: PMC11254239 DOI: 10.1097/js9.0000000000001335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 03/04/2024] [Indexed: 03/20/2024]
Abstract
BACKGROUND Microsatellite instability (MSI) is associated with treatment response and prognosis in patients with rectal cancer (RC). However, intratumoral heterogeneity limits MSI testing in patients with RC. The authors developed a subregion radiomics model based on multiparametric MRI to preoperatively assess high-risk subregions with MSI and predict the MSI status of patients with RC. METHODS This retrospective study included 475 patients (training cohort, 382; external test cohort, 93) with RC from two participating hospitals between April 2017 and June 2023. In the training cohort, subregion radiomic features were extracted from multiparametric MRI, which included T2-weighted, T1-weighted, diffusion-weighted, and contrast-enhanced T1-weighted imaging. MSI-related subregion radiomic features, classical radiomic features, and clinicoradiological variables were gathered to build five predictive models using logistic regression. Kaplan-Meier survival analysis was conducted to explore the prognostic information. RESULTS Among the 475 patients [median age, 64 years (interquartile range, IQR: 55-70 years); 304 men and 171 women], the prevalence of MSI was 11.16% (53/475). The subregion radiomics model outperformed the classical radiomics and clinicoradiological models in both training [area under the curve (AUC)=0.86, 0.72, and 0.59, respectively] and external test cohorts (AUC=0.83, 0.73, and 0.62, respectively). The subregion-clinicoradiological model combining clinicoradiological variables and subregion radiomic features performed the optimal, with AUCs of 0.87 and 0.85 in the training and external test cohorts, respectively. The 3-year disease-free survival rate of MSI groups predicted based on the model was higher than that of the predicted microsatellite stability groups in both patient cohorts (training, P =0.032; external test, P =0.046). CONCLUSIONS The authors developed and validated a model based on subregion radiomic features of multiparametric MRI to evaluate high-risk subregions with MSI and predict the MSI status of RC preoperatively, which may assist in individualized treatment decisions and positioning for biopsy.
Collapse
Affiliation(s)
- Zhiping Cai
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Zhenyu Xu
- Department of Radiology, The First People’s Hospital of Foshan, Foshan
| | - Yifan Chen
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Rong Zhang
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Baoliang Guo
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Haixiong Chen
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Fusheng Ouyang
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Xinjie Chen
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Xiaobo Chen
- Department of Radiology, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University
- Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application, Guangzhou, People’s Republic of China
| | - Dechao Liu
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Chun Luo
- Department of Radiology, The First People’s Hospital of Foshan, Foshan
| | - Xiaohong Li
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Wei Liu
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Cuiru Zhou
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Xinqun Guan
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Ziwei Liu
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| | - Hai Zhao
- Department of Radiology, The First People’s Hospital of Foshan, Foshan
| | - Qiugen Hu
- Department of Radiology, Shunde Hospital, Southern Medical University (The First People’s Hospital of Shunde)
| |
Collapse
|
138
|
Park SJ, Yang S, Kim JM, Kang JH, Kim JE, Huh KH, Lee SS, Yi WJ, Heo MS. Automatic and robust estimation of sex and chronological age from panoramic radiographs using a multi-task deep learning network: a study on a South Korean population. Int J Legal Med 2024; 138:1741-1757. [PMID: 38467754 PMCID: PMC11164743 DOI: 10.1007/s00414-024-03204-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 02/27/2024] [Indexed: 03/13/2024]
Abstract
Sex and chronological age estimation are crucial in forensic investigations and research on individual identification. Although manual methods for sex and age estimation have been proposed, these processes are labor-intensive, time-consuming, and error-prone. The purpose of this study was to estimate sex and chronological age from panoramic radiographs automatically and robustly using a multi-task deep learning network (ForensicNet). ForensicNet consists of a backbone and both sex and age attention branches to learn anatomical context features of sex and chronological age from panoramic radiographs and enables the multi-task estimation of sex and chronological age in an end-to-end manner. To mitigate bias in the data distribution, our dataset was built using 13,200 images with 100 images for each sex and age range of 15-80 years. The ForensicNet with EfficientNet-B3 exhibited superior estimation performance with mean absolute errors of 2.93 ± 2.61 years and a coefficient of determination of 0.957 for chronological age, and achieved accuracy, specificity, and sensitivity values of 0.992, 0.993, and 0.990, respectively, for sex prediction. The network demonstrated that the proposed sex and age attention branches with a convolutional block attention module significantly improved the estimation performance for both sex and chronological age from panoramic radiographs of elderly patients. Consequently, we expect that ForensicNet will contribute to the automatic and accurate estimation of both sex and chronological age from panoramic radiographs.
Collapse
Affiliation(s)
- Se-Jin Park
- Department of Oral and Maxillofacial Radiology and Dental Research Institute, School of Dentistry, Seoul National University, Seoul, 03080, South Korea
| | - Su Yang
- Department of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, 03080, South Korea
| | - Jun-Min Kim
- Department of Electronics and Information Engineering, Hansung University, Seoul, 03080, South Korea
| | - Ju-Hee Kang
- Department of Oral and Maxillofacial Radiology and Dental Research Institute, School of Dentistry, Seoul National University, Seoul, 03080, South Korea
| | - Jo-Eun Kim
- Department of Oral and Maxillofacial Radiology and Dental Research Institute, School of Dentistry, Seoul National University, Seoul, 03080, South Korea
| | - Kyung-Hoe Huh
- Department of Oral and Maxillofacial Radiology and Dental Research Institute, School of Dentistry, Seoul National University, Seoul, 03080, South Korea
| | - Sam-Sun Lee
- Department of Oral and Maxillofacial Radiology and Dental Research Institute, School of Dentistry, Seoul National University, Seoul, 03080, South Korea
| | - Won-Jin Yi
- Department of Oral and Maxillofacial Radiology and Dental Research Institute, School of Dentistry, Seoul National University, Seoul, 03080, South Korea.
- Department of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, 03080, South Korea.
- Department of Oral and Maxillofacial Radiology, School of Dentistry, Seoul National University, 101 Daehak-ro, Jongno-gu, Seoul, 03080, Korea.
| | - Min-Suk Heo
- Department of Oral and Maxillofacial Radiology and Dental Research Institute, School of Dentistry, Seoul National University, Seoul, 03080, South Korea.
- Department of Oral and Maxillofacial Radiology, School of Dentistry, Seoul National University, 101 Daehak-ro, Jongno-gu, Seoul, 03080, Korea.
| |
Collapse
|
139
|
Kim JK, Park D, Chang MC. Assessment of Bone Age Based on Hand Radiographs Using Regression-Based Multi-Modal Deep Learning. Life (Basel) 2024; 14:774. [PMID: 38929756 PMCID: PMC11204652 DOI: 10.3390/life14060774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Revised: 06/11/2024] [Accepted: 06/16/2024] [Indexed: 06/28/2024] Open
Abstract
(1) Objective: In this study, a regression-based multi-modal deep learning model was developed for use in bone age assessment (BAA) utilizing hand radiographic images and clinical data, including patient gender and chronological age, as input data. (2) Methods: A dataset of hand radiographic images from 2974 pediatric patients was used to develop a regression-based multi-modal BAA model. This model integrates hand radiographs using EfficientNetV2S convolutional neural networks (CNNs) and clinical data (gender and chronological age) processed by a simple deep neural network (DNN). This approach enhances the model's robustness and diagnostic precision, addressing challenges related to imbalanced data distribution and limited sample sizes. (3) Results: The model exhibited good performance on BAA, with an overall mean absolute error (MAE) of 0.410, root mean square error (RMSE) of 0.637, and accuracy of 91.1%. Subgroup analysis revealed higher accuracy in females ≤ 11 years (MAE: 0.267, RMSE: 0.453, accuracy: 95.0%) and >11 years (MAE: 0.402, RMSE: 0.634, accuracy 92.4%) compared to males ≤ 13 years (MAE: 0.665, RMSE: 0.912, accuracy: 79.7%) and >13 years (MAE: 0.647, RMSE: 1.302, accuracy: 84.6%). (4) Conclusion: This model showed a generally good performance on BAA, showing a better performance in female pediatrics compared to male pediatrics and an especially robust performance in female pediatrics ≤ 11 years.
Collapse
Affiliation(s)
- Jeoung Kun Kim
- Department of Business Administration, School of Business, Yeungnam University, Gyeongsan-si 38541, Republic of Korea;
| | - Donghwi Park
- Seoul Spine Rehabilitation Clinic, Ulsan-si, Republic of Korea;
| | - Min Cheol Chang
- Department of Rehabilitation Medicine, College of Medicine, Yeungnam University, Daegu 42415, Republic of Korea
| |
Collapse
|
140
|
Wang R, Chen ZS. Large-scale foundation models and generative AI for BigData neuroscience. Neurosci Res 2024:S0168-0102(24)00075-0. [PMID: 38897235 PMCID: PMC11649861 DOI: 10.1016/j.neures.2024.06.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 04/15/2024] [Accepted: 05/15/2024] [Indexed: 06/21/2024]
Abstract
Recent advances in machine learning have led to revolutionary breakthroughs in computer games, image and natural language understanding, and scientific discovery. Foundation models and large-scale language models (LLMs) have recently achieved human-like intelligence thanks to BigData. With the help of self-supervised learning (SSL) and transfer learning, these models may potentially reshape the landscapes of neuroscience research and make a significant impact on the future. Here we present a mini-review on recent advances in foundation models and generative AI models as well as their applications in neuroscience, including natural language and speech, semantic memory, brain-machine interfaces (BMIs), and data augmentation. We argue that this paradigm-shift framework will open new avenues for many neuroscience research directions and discuss the accompanying challenges and opportunities.
Collapse
Affiliation(s)
- Ran Wang
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY 10016, USA
| | - Zhe Sage Chen
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY 10016, USA; Department of Neuroscience and Physiology, Neuroscience Institute, New York University Grossman School of Medicine, New York, NY 10016, USA; Department of Biomedical Engineering, New York University Tandon School of Engineering, Brooklyn, NY 11201, USA.
| |
Collapse
|
141
|
Richardson E, Trevizani R, Greenbaum JA, Carter H, Nielsen M, Peters B. The receiver operating characteristic curve accurately assesses imbalanced datasets. PATTERNS (NEW YORK, N.Y.) 2024; 5:100994. [PMID: 39005487 PMCID: PMC11240176 DOI: 10.1016/j.patter.2024.100994] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 03/05/2024] [Accepted: 05/03/2024] [Indexed: 07/16/2024]
Abstract
Many problems in biology require looking for a "needle in a haystack," corresponding to a binary classification where there are a few positives within a much larger set of negatives, which is referred to as a class imbalance. The receiver operating characteristic (ROC) curve and the associated area under the curve (AUC) have been reported as ill-suited to evaluate prediction performance on imbalanced problems where there is more interest in performance on the positive minority class, while the precision-recall (PR) curve is preferable. We show via simulation and a real case study that this is a misinterpretation of the difference between the ROC and PR spaces, showing that the ROC curve is robust to class imbalance, while the PR curve is highly sensitive to class imbalance. Furthermore, we show that class imbalance cannot be easily disentangled from classifier performance measured via PR-AUC.
Collapse
Affiliation(s)
- Eve Richardson
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Raphael Trevizani
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
- Fiocruz Ceará, Fundação Oswaldo Cruz, Rua São José s/n, Precabura, Eusébio/CE, Brazil
| | - Jason A Greenbaum
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Hannah Carter
- Department of Medicine, University of California, La Jolla, CA, USA
| | - Morten Nielsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Lyngby, Denmark
| | - Bjoern Peters
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| |
Collapse
|
142
|
Qiu C, Su K, Luo Z, Tian Q, Zhao L, Wu L, Deng H, Shen H. Developing and comparing deep learning and machine learning algorithms for osteoporosis risk prediction. Front Artif Intell 2024; 7:1355287. [PMID: 38919268 PMCID: PMC11196804 DOI: 10.3389/frai.2024.1355287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 05/31/2024] [Indexed: 06/27/2024] Open
Abstract
Introduction Osteoporosis, characterized by low bone mineral density (BMD), is an increasingly serious public health issue. So far, several traditional regression models and machine learning (ML) algorithms have been proposed for predicting osteoporosis risk. However, these models have shown relatively low accuracy in clinical implementation. Recently proposed deep learning (DL) approaches, such as deep neural network (DNN), which can discover knowledge from complex hidden interactions, offer a new opportunity to improve predictive performance. In this study, we aimed to assess whether DNN can achieve a better performance in osteoporosis risk prediction. Methods By utilizing hip BMD and extensive demographic and routine clinical data of 8,134 subjects with age more than 40 from the Louisiana Osteoporosis Study (LOS), we developed and constructed a novel DNN framework for predicting osteoporosis risk and compared its performance in osteoporosis risk prediction with four conventional ML models, namely random forest (RF), artificial neural network (ANN), k-nearest neighbor (KNN), and support vector machine (SVM), as well as a traditional regression model termed osteoporosis self-assessment tool (OST). Model performance was assessed by area under 'receiver operating curve' (AUC) and accuracy. Results By using 16 discriminative variables, we observed that the DNN approach achieved the best predictive performance (AUC = 0.848) in classifying osteoporosis (hip BMD T-score ≤ -1.0) and non-osteoporosis risk (hip BMD T-score > -1.0) subjects, compared to the other approaches. Feature importance analysis showed that the top 10 most important variables identified by the DNN model were weight, age, gender, grip strength, height, beer drinking, diastolic pressure, alcohol drinking, smoke years, and economic level. Furthermore, we performed subsampling analysis to assess the effects of varying number of sample size and variables on the predictive performance of these tested models. Notably, we observed that the DNN model performed equally well (AUC = 0.846) even by utilizing only the top 10 most important variables for osteoporosis risk prediction. Meanwhile, the DNN model can still achieve a high predictive performance (AUC = 0.826) when sample size was reduced to 50% of the original dataset. Conclusion In conclusion, we developed a novel DNN model which was considered to be an effective algorithm for early diagnosis and intervention of osteoporosis in the aging population.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Hongwen Deng
- Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, United States
| | - Hui Shen
- Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, United States
| |
Collapse
|
143
|
Ahmmed B, Rau EG, Mudunuru MK, Karra S, Tempelman JR, Wachtor AJ, Forien JB, Guss GM, Calta NP, DePond PJ, Matthews MJ. Deep learning with mixup augmentation for improved pore detection during additive manufacturing. Sci Rep 2024; 14:13365. [PMID: 38862686 PMCID: PMC11166652 DOI: 10.1038/s41598-024-63288-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 05/27/2024] [Indexed: 06/13/2024] Open
Abstract
In additive manufacturing (AM), process defects such as keyhole pores are difficult to anticipate, affecting the quality and integrity of the AM-produced materials. Hence, considerable efforts have aimed to predict these process defects by training machine learning (ML) models using passive measurements such as acoustic emissions. This work considered a dataset in which keyhole pores of a laser powder bed fusion (LPBF) experiment were identified using X-ray radiography and then registered both in space and time to acoustic measurements recorded during the LPBF experiment. Due to AM's intrinsic process controls, where a pore-forming event is relatively rare, the acoustic datasets collected during monitoring include more non-pores than pores. In other words, the dataset for ML model development is imbalanced. Moreover, this imbalanced and sparse data phenomenon remains ubiquitous across many AM monitoring schemes since training data is nontrivial to collect. Hence, we propose a machine learning approach to improve this dataset imbalance and enhance the prediction accuracy of pore-labeled data. Specifically, we investigate how data augmentation helps predict pores and non-pores better. This imbalance is improved using recent advances in data augmentation called Mixup, a weak-supervised learning method. Convolutional neural networks (CNNs) are trained on original and augmented datasets, and an appreciable increase in performance is reported when testing on five different experimental trials. When ML models are trained on original and augmented datasets, they achieve an accuracy of 95% and 99% on test datasets, respectively. We also provide information on how dataset size affects model performance. Lastly, we investigate the optimal Mixup parameters for augmentation in the context of CNN performance.
Collapse
Affiliation(s)
- Bulbul Ahmmed
- Earth and Environmental Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
| | | | - Maruti K Mudunuru
- Subsurface Science Group, Pacific Northwest National Laboratory, Richland, WA, 99352, USA.
| | - Satish Karra
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | - Joshua R Tempelman
- Engineering Institute, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
- Mechanical Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Adam J Wachtor
- Engineering Institute, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
| | - Jean-Baptiste Forien
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, 94550, USA
| | - Gabe M Guss
- Engineering Directorate, Lawrence Livermore National Laboratory, Livermore, CA, 94550, USA
| | - Nicholas P Calta
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, 94550, USA
| | - Phillip J DePond
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, 94550, USA
| | - Manyalibo J Matthews
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, 94550, USA
| |
Collapse
|
144
|
Dutta S, Mudaranthakam DP, Li Y, Sardiu ME. PerSEveML: a web-based tool to identify persistent biomarker structure for rare events using an integrative machine learning approach. Mol Omics 2024; 20:348-358. [PMID: 38690925 DOI: 10.1039/d4mo00008k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2024]
Abstract
Omics data sets often pose a computational challenge due to their high dimensionality, large size, and non-linear structures. Analyzing these data sets becomes especially daunting in the presence of rare events. Machine learning (ML) methods have gained traction for analyzing rare events, yet there has been limited exploration of bioinformatics tools that integrate ML techniques to comprehend the underlying biology. Expanding upon our previously developed computational framework of an integrative machine learning approach, we introduce PerSEveML, an interactive web-based tool that uses crowd-sourced intelligence to predict rare events and determine feature selection structures. PerSEveML provides a comprehensive overview of the integrative approach through evaluation metrics that help users understand the contribution of individual ML methods to the prediction process. Additionally, PerSEveML calculates entropy and rank scores, which visually organize input features into a persistent structure of selected, unselected, and fluctuating categories that help researchers uncover meaningful hypotheses regarding the underlying biology. We have evaluated PerSEveML on three diverse biologically complex data sets with extremely rare events from small to large scale and have demonstrated its ability to generate valid hypotheses. PerSEveML is available at https://biostats-shinyr.kumc.edu/PerSEveML/ and https://github.com/sreejatadutta/PerSEveML.
Collapse
Affiliation(s)
- Sreejata Dutta
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, Kansas, USA.
| | - Dinesh Pal Mudaranthakam
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, Kansas, USA.
- University of Kansas Cancer Center, Kansas City, USA
| | - Yanming Li
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, Kansas, USA.
- University of Kansas Cancer Center, Kansas City, USA
| | - Mihaela E Sardiu
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, Kansas, USA.
- University of Kansas Cancer Center, Kansas City, USA
- Kansas Institute for Precision Medicine, University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
145
|
Qu J, Liu S, Li H, Zhou J, Bian Z, Song Z, Jiang Z. Three-layer heterogeneous network based on the integration of CircRNA information for MiRNA-disease association prediction. PeerJ Comput Sci 2024; 10:e2070. [PMID: 38983241 PMCID: PMC11232581 DOI: 10.7717/peerj-cs.2070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 04/29/2024] [Indexed: 07/11/2024]
Abstract
Increasing research has shown that the abnormal expression of microRNA (miRNA) is associated with many complex diseases. However, biological experiments have many limitations in identifying the potential disease-miRNA associations. Therefore, we developed a computational model of Three-Layer Heterogeneous Network based on the Integration of CircRNA information for MiRNA-Disease Association prediction (TLHNICMDA). In the model, a disease-miRNA-circRNA heterogeneous network is built by known disease-miRNA associations, known miRNA-circRNA interactions, disease similarity, miRNA similarity, and circRNA similarity. Then, the potential disease-miRNA associations are identified by an update algorithm based on the global network. Finally, based on global and local leave-one-out cross validation (LOOCV), the values of AUCs in TLHNICMDA are 0.8795 and 0.7774. Moreover, the mean and standard deviation of AUC in 5-fold cross-validations is 0.8777+/-0.0010. Especially, the two types of case studies illustrated the usefulness of TLHNICMDA in predicting disease-miRNA interactions.
Collapse
Affiliation(s)
- Jia Qu
- Changzhou University, School of Computer Science and Artificial Intelligence, Changzhou, Jiangsu, China
| | - Shuting Liu
- Changzhou University, School of Computer Science and Artificial Intelligence, Changzhou, Jiangsu, China
| | - Han Li
- Changzhou University, School of Computer Science and Artificial Intelligence, Changzhou, Jiangsu, China
| | - Jie Zhou
- Shaoxing University, School of Computer Science and Engineering, Shaoxing, Zhejiang, China
| | - Zekang Bian
- Jiangnan University, School of AI & Computer Science, Wuxi, Jiangsu, China
| | - Zihao Song
- Changzhou University, School of Computer Science and Artificial Intelligence, Changzhou, Jiangsu, China
| | - Zhibin Jiang
- Shaoxing University, School of Computer Science and Engineering, Shaoxing, Zhejiang, China
| |
Collapse
|
146
|
Fatemi Y, Nikfar M, Oladazimi A, Zheng J, Hoy H, Ali H. Machine Learning Approach for Cardiovascular Death Prediction among Nonalcoholic Steatohepatitis (NASH) Liver Transplant Recipients. Healthcare (Basel) 2024; 12:1165. [PMID: 38921280 PMCID: PMC11202858 DOI: 10.3390/healthcare12121165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2024] [Revised: 05/30/2024] [Accepted: 06/06/2024] [Indexed: 06/27/2024] Open
Abstract
Cardiovascular disease is the leading cause of mortality among nonalcoholic steatohepatitis (NASH) patients who undergo liver transplants. In the present study, machine learning algorithms were used to identify important risk factors for cardiovascular death and to develop a prediction model. The Standard Transplant Analysis and Research data were gathered from the Organ Procurement and Transplantation Network. After cleaning and preprocessing, the dataset comprised 10,871 patients and 92 features. Recursive feature elimination (RFE) and select from model (SFM) were applied to select relevant features from the dataset and avoid overfitting. Multiple machine learning algorithms, including logistic regression, random forest, decision tree, and XGBoost, were used with RFE and SFM. Additionally, prediction models were developed using a support vector machine, Gaussian naïve Bayes, K-nearest neighbors, random forest, and XGBoost algorithms. Finally, SHapley Additive exPlanations (SHAP) were used to increase interpretability. The findings showed that the best feature selection method was RFE with a random forest estimator, and the most critical features were recipient and donor blood type, body mass index, recipient and donor state of residence, serum creatinine, and year of transplantation. Furthermore, among all the outcomes, the XGBoost model had the highest performance, with an accuracy value of 0.6909 and an area under the curve value of 0.86. The findings also revealed a predictive relationship between features and cardiovascular death after liver transplant among NASH patients. These insights may assist clinical decision-makers in devising strategies to prevent cardiovascular complications in post-liver transplant NASH patients.
Collapse
Affiliation(s)
- Yasin Fatemi
- Department of Industrial and Systems Engineering, Auburn University, Auburn, AL 36849, USA; (Y.F.); (M.N.); (A.O.)
| | - Mohsen Nikfar
- Department of Industrial and Systems Engineering, Auburn University, Auburn, AL 36849, USA; (Y.F.); (M.N.); (A.O.)
| | - Amir Oladazimi
- Department of Industrial and Systems Engineering, Auburn University, Auburn, AL 36849, USA; (Y.F.); (M.N.); (A.O.)
| | - Jingyi Zheng
- Department of Mathematics and Statistics, Auburn University, Auburn, AL 36849, USA;
| | - Haley Hoy
- College of Nursing, The University of Alabama in Huntsville, Huntsville, AL 35805, USA;
| | - Haneen Ali
- Department of Industrial and Systems Engineering, Auburn University, Auburn, AL 36849, USA; (Y.F.); (M.N.); (A.O.)
- Health Services Administration Program, Auburn University, Auburn, AL 36849, USA
| |
Collapse
|
147
|
Cao Y, Qiu B, Ning X, Fan L, Qin Y, Yu D, Yang C, Ma H, Liao X, You C. Enhancing Machine-Learning Prediction of Enzyme Catalytic Temperature Optima through Amino Acid Conservation Analysis. Int J Mol Sci 2024; 25:6252. [PMID: 38892439 PMCID: PMC11173260 DOI: 10.3390/ijms25116252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 05/22/2024] [Accepted: 05/30/2024] [Indexed: 06/21/2024] Open
Abstract
Enzymes play a crucial role in various industrial production and pharmaceutical developments, serving as catalysts for numerous biochemical reactions. Determining the optimal catalytic temperature (Topt) of enzymes is crucial for optimizing reaction conditions, enhancing catalytic efficiency, and accelerating the industrial processes. However, due to the limited availability of experimentally determined Topt data and the insufficient accuracy of existing computational methods in predicting Topt, there is an urgent need for a computational approach to predict the Topt values of enzymes accurately. In this study, using phosphatase (EC 3.1.3.X) as an example, we constructed a machine learning model utilizing amino acid frequency and protein molecular weight information as features and employing the K-nearest neighbors regression algorithm to predict the Topt of enzymes. Usually, when conducting engineering for enzyme thermostability, researchers tend not to modify conserved amino acids. Therefore, we utilized this machine learning model to predict the Topt of phosphatase sequences after removing conserved amino acids. We found that the predictive model's mean coefficient of determination (R2) value increased from 0.599 to 0.755 compared to the model based on the complete sequences. Subsequently, experimental validation on 10 phosphatase enzymes with undetermined optimal catalytic temperatures shows that the predicted values of most phosphatase enzymes based on the sequence without conservative amino acids are closer to the experimental optimal catalytic temperature values. This study lays the foundation for the rapid selection of enzymes suitable for industrial conditions.
Collapse
Affiliation(s)
- Yinyin Cao
- College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China; (Y.C.)
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
| | - Boyu Qiu
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
- Department of Life Sciences and Medicine, University of Science and Technology of China, Hefei 230022, China
| | - Xiao Ning
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Lin Fan
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yanmei Qin
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Dong Yu
- College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China; (Y.C.)
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
| | - Chunhe Yang
- College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China; (Y.C.)
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
| | - Hongwu Ma
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
- National Center of Technology Innovation for Synthetic Biology, Tianjin 300308, China
| | - Xiaoping Liao
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
- National Center of Technology Innovation for Synthetic Biology, Tianjin 300308, China
| | - Chun You
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China; (B.Q.); (H.M.)
- University of Chinese Academy of Sciences, Beijing 100049, China
- National Center of Technology Innovation for Synthetic Biology, Tianjin 300308, China
| |
Collapse
|
148
|
Askar M, Småbrekke L, Holsbø E, Bongo LA, Svendsen K. "Using network analysis modularity to group health code systems and decrease dimensionality in machine learning models". EXPLORATORY RESEARCH IN CLINICAL AND SOCIAL PHARMACY 2024; 14:100463. [PMID: 38974056 PMCID: PMC11227014 DOI: 10.1016/j.rcsop.2024.100463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 06/03/2024] [Accepted: 06/08/2024] [Indexed: 07/09/2024] Open
Abstract
Background Machine learning (ML) prediction models in healthcare and pharmacy-related research face challenges with encoding high-dimensional Healthcare Coding Systems (HCSs) such as ICD, ATC, and DRG codes, given the trade-off between reducing model dimensionality and minimizing information loss. Objectives To investigate using Network Analysis modularity as a method to group HCSs to improve encoding in ML models. Methods The MIMIC-III dataset was utilized to create a multimorbidity network in which ICD-9 codes are the nodes and the edges are the number of patients sharing the same ICD-9 code pairs. A modularity detection algorithm was applied using different resolution thresholds to generate 6 sets of modules. The impact of four grouping strategies on the performance of predicting 90-day Intensive Care Unit readmissions was assessed. The grouping strategies compared: 1) binary encoding of codes, 2) encoding codes grouped by network modules, 3) grouping codes to the highest level of ICD-9 hierarchy, and 4) grouping using the single-level Clinical Classification Software (CCS). The same methodology was also applied to encode DRG codes but limiting the comparison to a single modularity threshold to binary encoding.The performance was assessed using Logistic Regression, Support Vector Machine with a non-linear kernel, and Gradient Boosting Machines algorithms. Accuracy, Precision, Recall, AUC, and F1-score with 95% confidence intervals were reported. Results Models utilized modularity encoding outperformed ungrouped codes binary encoding models. The accuracy improved across all algorithms ranging from 0.736 to 0.78 for the modularity encoding, to 0.727 to 0.779 for binary encoding. AUC, recall, and precision also improved across almost all algorithms. In comparison with other grouping approaches, modularity encoding generally showed slightly higher performance in AUC, ranging from 0.813 to 0.837, and precision, ranging from 0.752 to 0.782. Conclusions Modularity encoding enhances the performance of ML models in pharmacy research by effectively reducing dimensionality and retaining necessary information. Across the three algorithms used, models utilizing modularity encoding showed superior or comparable performance to other encoding approaches. Modularity encoding introduces other advantages such as it can be used for both hierarchical and non-hierarchical HCSs, the approach is clinically relevant, and can enhance ML models' clinical interpretation. A Python package has been developed to facilitate the use of the approach for future research.
Collapse
Affiliation(s)
- Mohsen Askar
- Department of Pharmacy, Faculty of Health Sciences, UiT-The Arctic University of Norway, PO Box 6050, Stakkevollan, N-9037 Tromsø, Norway
| | - Lars Småbrekke
- Department of Pharmacy, Faculty of Health Sciences, UiT-The Arctic University of Norway, PO Box 6050, Stakkevollan, N-9037 Tromsø, Norway
| | - Einar Holsbø
- Department of Computer Science, Faculty of Science and Technology, UiT-The Arctic University of Norway, PO, Box 6050 Stakkevollan, N-9037 Tromsø, Norway
| | - Lars Ailo Bongo
- Department of Computer Science, Faculty of Science and Technology, UiT-The Arctic University of Norway, PO, Box 6050 Stakkevollan, N-9037 Tromsø, Norway
| | - Kristian Svendsen
- Department of Pharmacy, Faculty of Health Sciences, UiT-The Arctic University of Norway, PO Box 6050, Stakkevollan, N-9037 Tromsø, Norway
| |
Collapse
|
149
|
Yang H, Zhu D, He S, Xu Z, Liu Z, Zhang W, Cai J. Enhancing psychiatric rehabilitation outcomes through a multimodal multitask learning model based on BERT and TabNet: An approach for personalized treatment and improved decision-making. Psychiatry Res 2024; 336:115896. [PMID: 38626625 DOI: 10.1016/j.psychres.2024.115896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 04/03/2024] [Accepted: 04/05/2024] [Indexed: 04/18/2024]
Abstract
Evaluating the rehabilitation status of individuals with serious mental illnesses (SMI) necessitates a comprehensive analysis of multimodal data, including unstructured text records and structured diagnostic data. However, progress in the effective assessment of rehabilitation status remains limited. Our study develops a deep learning model integrating Bidirectional Encoder Representations from Transformers (BERT) and TabNet through a late fusion strategy to enhance rehabilitation prediction, including referral risk, dangerous behaviors, self-awareness, and medication adherence, in patients with SMI. BERT processes unstructured textual data, such as doctor's notes, whereas TabNet manages structured diagnostic information. The model's interpretability function serves to assist healthcare professionals in understanding the model's predictive decisions, improving patient care. Our model exhibited excellent predictive performance for all four tasks, with an accuracy exceeding 0.78 and an area under the curve of 0.70. In addition, a series of tests proved the model's robustness, fairness, and interpretability. This study combines multimodal and multitask learning strategies into a model and applies it to rehabilitation assessment tasks, offering a promising new tool that can be seamlessly integrated with the clinical workflow to support the provision of optimized patient care.
Collapse
Affiliation(s)
- Hongyi Yang
- School of Design, Shanghai Jiao Tong University, Shanghai, China
| | - Dian Zhu
- School of Design, Shanghai Jiao Tong University, Shanghai, China
| | - Siyuan He
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Zhiqi Xu
- School of Design, Shanghai Jiao Tong University, Shanghai, China
| | - Zhao Liu
- School of Design, Shanghai Jiao Tong University, Shanghai, China.
| | - Weibo Zhang
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, China; Mental Health Branch, China Hospital Development Institute, Shanghai Jiao Tong University, Shanghai, China.
| | - Jun Cai
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Mental Health Branch, China Hospital Development Institute, Shanghai Jiao Tong University, Shanghai, China.
| |
Collapse
|
150
|
Zhu S, Wu C, Du B, Zhang L. Adversarial pair-wise distribution matching for remote sensing image cross-scene classification. Neural Netw 2024; 174:106241. [PMID: 38508050 DOI: 10.1016/j.neunet.2024.106241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 10/18/2023] [Accepted: 03/13/2024] [Indexed: 03/22/2024]
Abstract
Remarkable achievements have been made in the field of remote sensing cross-scene classification in recent years. However, most methods directly align the entire image features for cross-scene knowledge transfer. They usually ignore the high background complexity and low category consistency of remote sensing images, which can significantly impair the performance of distribution alignment. Besides, shortcomings of the adversarial training paradigm and the inability to guarantee the prediction discriminability and diversity can also hinder cross-scene classification performance. To alleviate the above problems, we propose a novel cross-scene classification framework in a discriminator-free adversarial paradigm, called Adversarial Pair-wise Distribution Matching (APDM), to avoid irrelevant knowledge transfer and enable effective cross-domain modeling. Specifically, we propose the pair-wise cosine discrepancy for both inter-domain and intra-domain prediction measurements to fully leverage the prediction information, which can suppress negative semantic features and implicitly align the cross-scene distributions. Nuclear-norm maximization and minimization are introduced to enhance the target prediction quality and increase the applicability of the source knowledge, respectively. As a general cross-scene framework, APDM can be easily embedded with existing methods to boost the performance. Experimental results and analyses demonstrate that APDM can achieve competitive and effective performance on cross-scene classification tasks.
Collapse
Affiliation(s)
- Sihan Zhu
- The State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan 430079, China.
| | - Chen Wu
- The State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan 430079, China.
| | - Bo Du
- The National Engineering Research Center for Multimedia Software, Institute of Artificial Intelligence, School of Computer Science and Hubei Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, Wuhan 430079, China.
| | - Liangpei Zhang
- The State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan 430079, China.
| |
Collapse
|