1
|
Ma Y, Lyu C, Li L, Wei Y, Xu Y. Algorithm of face anti-spoofing based on pseudo-negative features generation. Front Neurosci 2024; 18:1362286. [PMID: 38680444 PMCID: PMC11047124 DOI: 10.3389/fnins.2024.1362286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Accepted: 03/21/2024] [Indexed: 05/01/2024] Open
Abstract
Introduction Despite advancements in face anti-spoofing technology, attackers continue to pose challenges with their evolving deceptive methods. This is primarily due to the increased complexity of their attacks, coupled with a diversity in presentation modes, acquisition devices, and prosthetic materials. Furthermore, the scarcity of negative sample data exacerbates the situation by causing domain shift issues and impeding robust generalization. Hence, there is a pressing need for more effective cross-domain approaches to bolster the model's capability to generalize across different scenarios. Methods This method improves the effectiveness of face anti-spoofing systems by analyzing pseudo-negative sample features, expanding the training dataset, and boosting cross-domain generalization. By generating pseudo-negative features with a new algorithm and aligning these features with the use of KL divergence loss, we enrich the negative sample dataset, aiding the training of a more robust feature classifier and broadening the range of attacks that the system can defend against. Results Through experiments on four public datasets (MSU-MFSD, OULU-NPU, Replay-Attack, and CASIA-FASD), we assess the model's performance within and across datasets by controlling variables. Our method delivers positive results in multiple experiments, including those conducted on smaller datasets. Discussion Through controlled experiments, we demonstrate the effectiveness of our method. Furthermore, our approach consistently yields favorable results in both intra-dataset and cross-dataset evaluations, thereby highlighting its excellent generalization capabilities. The superior performance on small datasets further underscores our method's remarkable ability to handle unseen data beyond the training set.
Collapse
Affiliation(s)
- Yukun Ma
- School of Software, Henan Institute of Science and Technology, Xinxiang, China
| | - Chengzhen Lyu
- School of Information Engineering, Henan Institute of Science and Technology, Xinxiang, China
| | - Liangliang Li
- School of Information and Electronics, Beijing Institute of Technology, Beijing, China
| | - Yajun Wei
- School of Information Engineering, Henan Institute of Science and Technology, Xinxiang, China
| | - Yaowen Xu
- Data and AI Technology Company, China Telecom Corporation Ltd., Beijing, China
| |
Collapse
|
2
|
Knab A, Anwer AG, Pedersen B, Handley S, Marupally AG, Habibalahi A, Goldys EM. Towards label-free non-invasive autofluorescence multispectral imaging for melanoma diagnosis. J Biophotonics 2024; 17:e202300402. [PMID: 38247053 DOI: 10.1002/jbio.202300402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 12/11/2023] [Accepted: 12/31/2023] [Indexed: 01/23/2024]
Abstract
This study focuses on the use of cellular autofluorescence which visualizes the cell metabolism by monitoring endogenous fluorophores including NAD(P)H and flavins. It explores the potential of multispectral imaging of native fluorophores in melanoma diagnostics using excitation wavelengths ranging from 340 nm to 510 nm and emission wavelengths above 391 nm. Cultured immortalized cells are utilized to compare the autofluorescent signatures of two melanoma cell lines to one fibroblast cell line. Feature analysis identifies the most significant and least correlated features for differentiating the cells. The investigation successfully applies this analysis to pre-processed, noise-removed images and original background-corrupted data. Furthermore, the applicability of distinguishing melanomas and healthy fibroblasts based on their autofluorescent characteristics is validated using the same evaluation technique on patient cells. Additionally, the study tentatively maps the detected features to underlying biological processes. This research demonstrates the potential of cellular autofluorescence as a promising tool for melanoma diagnostics.
Collapse
Affiliation(s)
- Aline Knab
- Graduate School of Biomedical Engineering, Faculty of Engineering, University of New South Wales, Sydney, Australia
- ARC Centre of Excellence for Nanoscale Biophotonics, University of New South Wales, Sydney, Australia
| | - Ayad G Anwer
- Graduate School of Biomedical Engineering, Faculty of Engineering, University of New South Wales, Sydney, Australia
- ARC Centre of Excellence for Nanoscale Biophotonics, University of New South Wales, Sydney, Australia
| | - Bernadette Pedersen
- Macquarie Medical School, Faculty of Medicine, Health and Human Sciences, Macquarie University, Sydney, New South Wales, Australia
- Melanoma Institute Australia, The University of Sydney, Sydney, New South Wales, Australia
| | - Shannon Handley
- Graduate School of Biomedical Engineering, Faculty of Engineering, University of New South Wales, Sydney, Australia
- ARC Centre of Excellence for Nanoscale Biophotonics, University of New South Wales, Sydney, Australia
| | - Abhilash Goud Marupally
- Graduate School of Biomedical Engineering, Faculty of Engineering, University of New South Wales, Sydney, Australia
- ARC Centre of Excellence for Nanoscale Biophotonics, University of New South Wales, Sydney, Australia
| | - Abbas Habibalahi
- Graduate School of Biomedical Engineering, Faculty of Engineering, University of New South Wales, Sydney, Australia
- ARC Centre of Excellence for Nanoscale Biophotonics, University of New South Wales, Sydney, Australia
| | - Ewa M Goldys
- Graduate School of Biomedical Engineering, Faculty of Engineering, University of New South Wales, Sydney, Australia
- ARC Centre of Excellence for Nanoscale Biophotonics, University of New South Wales, Sydney, Australia
| |
Collapse
|
3
|
Bocanegra-Pérez ÁJ, Piella G, Sebastian R, Jimenez-Perez G, Falasconi G, Saglietto A, Soto-Iglesias D, Berruezo A, Penela D, Camara O. Automatic and interpretable prediction of the site of origin in outflow tract ventricular arrhythmias: machine learning integrating electrocardiograms and clinical data. Front Cardiovasc Med 2024; 11:1353096. [PMID: 38572307 PMCID: PMC10987867 DOI: 10.3389/fcvm.2024.1353096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Accepted: 03/07/2024] [Indexed: 04/05/2024] Open
Abstract
The treatment of outflow tract ventricular arrhythmias (OTVA) through radiofrequency ablation requires the precise identification of the site of origin (SOO). Pinpointing the SOO enhances the likelihood of a successful procedure, reducing intervention times and recurrence rates. Current clinical methods to identify the SOO are based on qualitative analysis of pre-operative electrocardiograms (ECG), heavily relying on physician's expertise. Although computational models and machine learning (ML) approaches have been proposed to assist OTVA procedures, they either consume substantial time, lack interpretability or do not use clinical information. Here, we propose an alternative strategy for automatically predicting the ventricular origin of OTVA patients using ML. Our objective was to classify ventricular (left/right) origin in the outflow tracts (LVOT and RVOT, respectively), integrating ECG and clinical data from each patient. Extending beyond differentiating ventricle origin, we explored specific SOO characterization. Utilizing four databases, we also trained supervised learning models on the QRS complexes of the ECGs, clinical data, and their combinations. The best model achieved an accuracy of 89%, highlighting the significance of precordial leads V1-V4, especially in the R/S transition and initiation of the QRS complex in V2. Unsupervised analysis revealed that some origins tended to group closer than others, e.g., right coronary cusp (RCC) with a less sparse group than the aortic cusp origins, suggesting identifiable patterns for specific SOOs.
Collapse
Affiliation(s)
- Álvaro J. Bocanegra-Pérez
- Physense, BCN Medtech, Department of Information and Communication Technologies, Universitat Pompeu Fabra, Barcelona, Spain
| | - Gemma Piella
- Physense, BCN Medtech, Department of Information and Communication Technologies, Universitat Pompeu Fabra, Barcelona, Spain
| | - Rafael Sebastian
- Computational Multiscale Simulation Lab (CoMMLab), Department of Computer Science, Universitat de Valencia, Valencia, Spain
| | - Guillermo Jimenez-Perez
- Physense, BCN Medtech, Department of Information and Communication Technologies, Universitat Pompeu Fabra, Barcelona, Spain
| | - Giulio Falasconi
- Cardiology Department, Heart Institute, Teknon Medical Center, Barcelona, Spain
| | - Andrea Saglietto
- Division of Cardiology, Department of Medical Sciences, University of Turin, Turin, Italy
| | - David Soto-Iglesias
- Cardiology Department, Heart Institute, Teknon Medical Center, Barcelona, Spain
| | - Antonio Berruezo
- Cardiology Department, Heart Institute, Teknon Medical Center, Barcelona, Spain
| | - Diego Penela
- Department of Arrhythmology, Humanitas Research Hospital, Milan, Italy
| | - Oscar Camara
- Physense, BCN Medtech, Department of Information and Communication Technologies, Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
4
|
Liao R, Zhuang Y, Li X, Chen K, Wang X, Feng C, Yin G, Zhu X, Lin J, Zhang X. Unveiling protein corona composition: predicting with resampling embedding and machine learning. Regen Biomater 2023; 11:rbad082. [PMID: 38213739 PMCID: PMC10781662 DOI: 10.1093/rb/rbad082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 08/25/2023] [Accepted: 09/02/2023] [Indexed: 01/13/2024] Open
Abstract
Biomaterials with surface nanostructures effectively enhance protein secretion and stimulate tissue regeneration. When nanoparticles (NPs) enter the living system, they quickly interact with proteins in the body fluid, forming the protein corona (PC). The accurate prediction of the PC composition is critical for analyzing the osteoinductivity of biomaterials and guiding the reverse design of NPs. However, achieving accurate predictions remains a significant challenge. Although several machine learning (ML) models like Random Forest (RF) have been used for PC prediction, they often fail to consider the extreme values in the abundance region of PC absorption and struggle to improve accuracy due to the imbalanced data distribution. In this study, resampling embedding was introduced to resolve the issue of imbalanced distribution in PC data. Various ML models were evaluated, and RF model was finally used for prediction, and good correlation coefficient (R2) and root-mean-square deviation (RMSE) values were obtained. Our ablation experiments demonstrated that the proposed method achieved an R2 of 0.68, indicating an improvement of approximately 10%, and an RMSE of 0.90, representing a reduction of approximately 10%. Furthermore, through the verification of label-free quantification of four NPs: hydroxyapatite (HA), titanium dioxide (TiO2), silicon dioxide (SiO2) and silver (Ag), and we achieved a prediction performance with an R2 value >0.70 using Random Oversampling. Additionally, the feature analysis revealed that the composition of the PC is most significantly influenced by the incubation plasma concentration, PDI and surface modification.
Collapse
Affiliation(s)
- Rong Liao
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| | - Yan Zhuang
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| | - Xiangfeng Li
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| | - Ke Chen
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| | - Xingming Wang
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| | - Cong Feng
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| | - Guangfu Yin
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| | - Xiangdong Zhu
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| | - Jiangli Lin
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| | - Xingdong Zhang
- College of Biomedical Engineering, National Engineering Research Centre for Biomaterials, Sichuan University, Chengdu, 610065, China
| |
Collapse
|
5
|
Wu J, Yang S, Wang W, Jaeger C. How effective are community-based disaster reduction strategies? Evidence from the largest-scale program so far. Risk Anal 2023; 43:1667-1681. [PMID: 36347524 DOI: 10.1111/risa.14043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Revised: 03/27/2022] [Accepted: 09/12/2022] [Indexed: 06/16/2023]
Abstract
Strategies of community-based disaster risk reduction have been advocated for more than 2 decades. However, we still lack in-depth quantitative assessments of the effectiveness of such strategies. Our research is based on a national experiment in this domain: the "Comprehensive Disaster Reduction Demonstration Community" project, a governmental program running in China since 2007. Information on more than 11,000 demonstration communities was collected. Combined with the local disaster information and socioeconomic conditions, the spatiotemporal characteristics of these communities over 12 years and their differences in performance by region and income group were analyzed. We performed an attribution analysis for disaster risk reduction effectiveness. This is the first time a series of quantitative evaluation methods have been applied to verify the effectiveness of a large-scale community-based disaster risk reduction project, both from the perspective of demonstrative effects and loss reduction benefits. Here, we find that the project is obviously effective from these two perspectives, and the disaster loss reduction effectiveness illustrates clear regional differences, where the regional economic level and hazard severity act as important drivers. Significant differences of urban-rural and income call for matching fortification measures, and the dynamic management of demonstration community size is required, since the loss reduction benefit converges when the penetration rate of the demonstration community reaches approximately 4% in a province. These and further results provide diverse implications for community-based disaster risk reduction policies and practices.
Collapse
Affiliation(s)
- Jingyan Wu
- Key Laboratory of Environmental Change and Natural Disaster of Ministry of Education, Beijing Normal University, Beijing, China
- Academy of Disaster Reduction and Emergency Management, Faculty of Geographical Science, Beijing Normal University, Beijing, China
| | - Saini Yang
- Key Laboratory of Environmental Change and Natural Disaster of Ministry of Education, Beijing Normal University, Beijing, China
- Academy of Disaster Reduction and Emergency Management, Faculty of Geographical Science, Beijing Normal University, Beijing, China
- School of National Safety and Emergency Management, Beijing Normal University, Beijing, China
| | - Weiping Wang
- School of National Safety and Emergency Management, Beijing Normal University, Beijing, China
| | - Carlo Jaeger
- Global Futures Laboratory, Arizona State University, Tempe, Arizona, USA
- Global Climate Forum, Berlin, Germany
| |
Collapse
|
6
|
Lutzenberger H, Fikkert P, DE Vos C, Crasborn O. Development of sign phonology in Kata Kolok. J Child Lang 2023:1-34. [PMID: 36891925 DOI: 10.1017/s0305000922000745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Much like early speech, early signing is characterised by modifications. Sign language phonology has been analysed on the feature level since the 1980s, yet acquisition studies predominately examine handshape, location, and movement. This study is the first to analyse the acquisition of phonology in the sign language of a Balinese village with a vibrant signing community and applies the same feature analysis to adult and child data. We analyse longitudinal data of four deaf children from the Kata Kolok Child Signing Corpus. The form comparison of child productions and adult targets yields three main findings: i) handshape modifications are most frequent, echoing cross-linguistic patterns; ii) modification rates of other features differ from previous studies, possibly due to differences in methodology or KK's phonology; iii) co-occurrence of modifications within a sign suggest feature interdependencies. We argue that nuanced approaches to child signing are necessary to understand the complexity of early signing.
Collapse
Affiliation(s)
- Hannah Lutzenberger
- Department of English Language and Linguistics, University of Birmingham, UK
| | - Paula Fikkert
- Centre for Language Studies, Radboud University, Netherlands
| | - Connie DE Vos
- Tilburg Center for Cognition and Communication, Tilburg University, Netherlands
| | - Onno Crasborn
- Centre for Language Studies, Radboud University, Netherlands
| |
Collapse
|
7
|
Trigka M, Dritsas E. Long-Term Coronary Artery Disease Risk Prediction with Machine Learning Models. Sensors (Basel) 2023; 23:1193. [PMID: 36772237 PMCID: PMC9920214 DOI: 10.3390/s23031193] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 01/17/2023] [Accepted: 01/18/2023] [Indexed: 06/18/2023]
Abstract
The heart is the most vital organ of the human body; thus, its improper functioning has a significant impact on human life. Coronary artery disease (CAD) is a disease of the coronary arteries through which the heart is nourished and oxygenated. It is due to the formation of atherosclerotic plaques on the wall of the epicardial coronary arteries, resulting in the narrowing of their lumen and the obstruction of blood flow through them. Coronary artery disease can be delayed or even prevented with lifestyle changes and medical intervention. Long-term risk prediction of coronary artery disease will be the area of interest in this work. In this specific research paper, we experimented with various machine learning (ML) models after the use or non-use of the synthetic minority oversampling technique (SMOTE), evaluating and comparing them in terms of accuracy, precision, recall and an area under the curve (AUC). The results showed that the stacking ensemble model after the SMOTE with 10-fold cross-validation prevailed over the other models, achieving an accuracy of 90.9 %, a precision of 96.7%, a recall of 87.6% and an AUC equal to 96.1%.
Collapse
|
8
|
Lamens A, Bajorath J. Explaining Accurate Predictions of Multitarget Compounds with Machine Learning Models Derived for Individual Targets. Molecules 2023; 28:molecules28020825. [PMID: 36677879 PMCID: PMC9860926 DOI: 10.3390/molecules28020825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Revised: 01/09/2023] [Accepted: 01/11/2023] [Indexed: 01/18/2023]
Abstract
In drug discovery, compounds with well-defined activity against multiple targets (multitarget compounds, MT-CPDs) provide the basis for polypharmacology and are thus of high interest. Typically, MT-CPDs for polypharmacology have been discovered serendipitously. Therefore, over the past decade, computational approaches have also been adapted for the design of MT-CPDs or their identification via computational screening. Such approaches continue to be under development and are far from being routine. Recently, different machine learning (ML) models have been derived to distinguish between MT-CPDs and corresponding compounds with activity against the individual targets (single-target compounds, ST-CPDs). When evaluating alternative models for predicting MT-CPDs, we discovered that MT-CPDs could also be accurately predicted with models derived for corresponding ST-CPDs; this was an unexpected finding that we further investigated using explainable ML. The analysis revealed that accurate predictions of ST-CPDs were determined by subsets of structural features of MT-CPDs required for their prediction. These findings provided a chemically intuitive rationale for the successful prediction of MT-CPDs using different ML models and uncovered general-feature subset relationships between MT- and ST-CPDs with activities against different targets.
Collapse
|
9
|
Rout RK, Umer S, Khandelwal M, Pati S, Mallik S, Balabantaray BK, Qin H. Identification of discriminant features from stationary pattern of nucleotide bases and their application to essential gene classification. Front Genet 2023; 14:1154120. [PMID: 37152988 PMCID: PMC10156977 DOI: 10.3389/fgene.2023.1154120] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 04/04/2023] [Indexed: 05/09/2023] Open
Abstract
Introduction: Essential genes are essential for the survival of various species. These genes are a family linked to critical cellular activities for species survival. These genes are coded for proteins that regulate central metabolism, gene translation, deoxyribonucleic acid replication, and fundamental cellular structure and facilitate intracellular and extracellular transport. Essential genes preserve crucial genomics information that may hold the key to a detailed knowledge of life and evolution. Essential gene studies have long been regarded as a vital topic in computational biology due to their relevance. An essential gene is composed of adenine, guanine, cytosine, and thymine and its various combinations. Methods: This paper presents a novel method of extracting information on the stationary patterns of nucleotides such as adenine, guanine, cytosine, and thymine in each gene. For this purpose, some co-occurrence matrices are derived that provide the statistical distribution of stationary patterns of nucleotides in the genes, which is helpful in establishing the relationship between the nucleotides. For extracting discriminant features from each co-occurrence matrix, energy, entropy, homogeneity, contrast, and dissimilarity features are computed, which are extracted from all co-occurrence matrices and then concatenated to form a feature vector representing each essential gene. Finally, supervised machine learning algorithms are applied for essential gene classification based on the extracted fixed-dimensional feature vectors. Results: For comparison, some existing state-of-the-art feature representation techniques such as Shannon entropy (SE), Hurst exponent (HE), fractal dimension (FD), and their combinations have been utilized. Discussion: An extensive experiment has been performed for classifying the essential genes of five species that show the robustness and effectiveness of the proposed methodology.
Collapse
Affiliation(s)
- Ranjeet Kumar Rout
- National Institute of Technology Srinagar, Hazratbal, Jammu and Kashmir, India
| | - Saiyed Umer
- Aliah University, Kolkata, West Bengal, India
| | - Monika Khandelwal
- National Institute of Technology Srinagar, Hazratbal, Jammu and Kashmir, India
| | - Smitarani Pati
- Dr. B R Ambedkar National Institute of Technology Jalandhar, Jalandhar, Punjab, India
| | - Saurav Mallik
- Harvard T H Chan School of Public Health, Boston, United States
- Department of Pharmacology and Toxicology, University of Arizona, Tucson, AZ, United States
- *Correspondence: Saurav Mallik, , ; Hong Qin,
| | | | - Hong Qin
- Department of Computer Science and Engineering, University of Tennessee at Chattanooga, Chattanooga, TN, United States
- *Correspondence: Saurav Mallik, , ; Hong Qin,
| |
Collapse
|
10
|
Ren J, Guo W, Feng K, Huang T, Cai Y. Identifying MicroRNA Markers That Predict COVID-19 Severity Using Machine Learning Methods. Life (Basel) 2022; 12. [PMID: 36556329 DOI: 10.3390/life12121964] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 11/21/2022] [Accepted: 11/21/2022] [Indexed: 11/25/2022]
Abstract
Individuals with the SARS-CoV-2 infection may experience a wide range of symptoms, from being asymptomatic to having a mild fever and cough to a severe respiratory impairment that results in death. MicroRNA (miRNA), which plays a role in the antiviral effects of SARS-CoV-2 infection, has the potential to be used as a novel marker to distinguish between patients who have various COVID-19 clinical severities. In the current study, the existing blood expression profiles reported in two previous studies were combined for deep analyses. The final profiles contained 1444 miRNAs in 375 patients from six categories, which were as follows: 30 patients with mild COVID-19 symptoms, 81 patients with moderate COVID-19 symptoms, 30 non-COVID-19 patients with mild symptoms, 137 patients with severe COVID-19 symptoms, 31 non-COVID-19 patients with severe symptoms, and 66 healthy controls. An efficient computational framework containing four feature selection methods (LASSO, LightGBM, MCFS, and mRMR) and four classification algorithms (DT, KNN, RF, and SVM) was designed to screen clinical miRNA markers, and a high-precision RF model with a 0.780 weighted F1 was constructed. Some miRNAs, including miR-24-3p, whose differential expression was discovered in patients with acute lung injury complications brought on by severe COVID-19, and miR-148a-3p, differentially expressed against SARS-CoV-2 structural proteins, were identified, thereby suggesting the effectiveness and accuracy of our framework. Meanwhile, we extracted classification rules based on the DT model for the quantitative representation of the role of miRNA expression in differentiating COVID-19 patients with different severities. The search for novel biomarkers that could predict the severity of the disease could aid in the clinical diagnosis of COVID-19 and in exploring the specific mechanisms of the complications caused by SARS-CoV-2 infection. Moreover, new therapeutic targets for the disease may be found.
Collapse
|
11
|
Tian J, Wang H, Zheng S, Ning Y, Zhang X, Niu J, Vladareanu L. sEMG-Based Gain-Tuned Compliance Control for the Lower Limb Rehabilitation Robot during Passive Training. Sensors (Basel) 2022; 22:7890. [PMID: 36298256 PMCID: PMC9611623 DOI: 10.3390/s22207890] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/13/2022] [Accepted: 10/14/2022] [Indexed: 06/16/2023]
Abstract
The lower limb rehabilitation robot is a typical man-machine coupling system. Aiming at the problems of insufficient physiological information and unsatisfactory safety performance in the compliance control strategy for the lower limb rehabilitation robot during passive training, this study developed a surface electromyography-based gain-tuned compliance control (EGCC) strategy for the lower limb rehabilitation robot. First, the mapping function relationship between the normalized surface electromyography (sEMG) signal and the gain parameter was established and an overall EGCC strategy proposed. Next, the EGCC strategy without sEMG information was simulated and analyzed. The effects of the impedance control parameters on the position correction amount were studied, and the change rules of the robot end trajectory, man-machine contact force, and position correction amount analyzed in different training modes. Then, the sEMG signal acquisition and feature analysis of target muscle groups under different training modes were carried out. Finally, based on the lower limb rehabilitation robot control system, the influence of normalized sEMG threshold on the robot end trajectory and gain parameters under different training modes was experimentally studied. The simulation and experimental results show that the adoption of the EGCC strategy can significantly enhance the compliance of the robot end-effector by detecting the sEMG signal and improve the safety of the robot in different training modes, indicating the EGCC strategy has good application prospects in the rehabilitation robot field.
Collapse
Affiliation(s)
- Junjie Tian
- Parallel Robot and Mechatronic System Laboratory of Hebei Province, Yanshan University, Qinhuangdao 066004, China
| | - Hongbo Wang
- Parallel Robot and Mechatronic System Laboratory of Hebei Province, Yanshan University, Qinhuangdao 066004, China
- Academy for Engineering & Technology, Fudan University, Shanghai 200433, China
| | - Siyuan Zheng
- Parallel Robot and Mechatronic System Laboratory of Hebei Province, Yanshan University, Qinhuangdao 066004, China
| | - Yuansheng Ning
- Parallel Robot and Mechatronic System Laboratory of Hebei Province, Yanshan University, Qinhuangdao 066004, China
| | - Xingchao Zhang
- Parallel Robot and Mechatronic System Laboratory of Hebei Province, Yanshan University, Qinhuangdao 066004, China
| | - Jianye Niu
- Parallel Robot and Mechatronic System Laboratory of Hebei Province, Yanshan University, Qinhuangdao 066004, China
| | - Luige Vladareanu
- Institute of Solid Mechanics of the Romanian Academy, 010141 Bucharest, Romania
| |
Collapse
|
12
|
Lyu N, Wen J, Hao W. Lane-Level Regional Risk Prediction of Mainline at Freeway Diverge Area. Int J Environ Res Public Health 2022; 19:ijerph19105867. [PMID: 35627404 PMCID: PMC9141005 DOI: 10.3390/ijerph19105867] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 05/06/2022] [Accepted: 05/09/2022] [Indexed: 12/31/2022]
Abstract
Real-time regional risk prediction can play a crucial role in preventing traffic accidents. Thus, this study established a lane-level real-time regional risk prediction model. Based on observed data, the least squares-support vector machines (LS-SVM) algorithm was used to identify each lane region of the mainline, and the initial traffic parameters and surrogate safety measures (SSMs) were extracted and aggregated. The negative samples that characterized normal traffic and the positive samples that characterized regional risk were identified. Mutual information (MI) was used to determine the information gain of various feature variables in the samples, and the key feature variables affecting the regional conditions were tested and screened by means of binary logit regression analysis. Upon screening the variables and corresponding labels, the construction and verification of a lane-level regional risk prediction model was completed using the catastrophe theory. The results showed that lane difference is an important parameter to reduce the uncertainty of regional risk, and its odds ratio (OR) was 16.30 at the 95% confidence level. The 10%-quantile modified time to collision (MTTC) inverse, the speed difference between lanes, and 10%-quantile headway (DHW) had an obvious influence on regional status. The model achieved an overall accuracy of 86.50%, predicting 84.78% of regional risks with a false positive rate of 13.37% and 86.63% of normal traffic with a false positive rate of 15.22%. The proposed model can provide a basis for formulating individualized active traffic control strategies for different lanes.
Collapse
Affiliation(s)
- Nengchao Lyu
- Intelligent Transportation Systems Research Center, Wuhan University of Technology, Wuhan 430063, China; (N.L.); (J.W.)
- National Engineering Research Center for Water Transport Safety, Wuhan 430063, China
| | - Jiaqiang Wen
- Intelligent Transportation Systems Research Center, Wuhan University of Technology, Wuhan 430063, China; (N.L.); (J.W.)
- National Engineering Research Center for Water Transport Safety, Wuhan 430063, China
| | - Wei Hao
- Hunan Key Laboratory of Smart Roadway and Cooperative Vehicle-Infrastructure Systems, Changsha University of Science and Technology, Changsha 410205, China
- Correspondence:
| |
Collapse
|
13
|
Gascón A, Casas R, Buldain D, Marco Á. Providing Fault Detection from Sensor Data in Complex Machines That Build the Smart City. Sensors (Basel) 2022; 22:s22020586. [PMID: 35062547 PMCID: PMC8781749 DOI: 10.3390/s22020586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 01/07/2022] [Accepted: 01/11/2022] [Indexed: 02/06/2023]
Abstract
Household appliances, climate control machines, vehicles, elevators, cash counting machines, etc., are complex machines with key contributions to the smart city. Those devices have limited memory and processing power, but they are not just actuators; they embed tens of sensors and actuators managed by several microcontrollers and microprocessors communicated by control buses. On the other hand, predictive maintenance and the capability of identifying failures to avoid greater damage of machines is becoming a topic of great relevance in Industry 4.0, and the large amount of data to be processed is a concern. This article proposes a layered methodology to enable complex machines with automatic fault detection or predictive maintenance. It presents a layered structure to perform the collection, filtering and extraction of indicators, along with their processing. The aim is to reduce the amount of data to work with, and to optimize them by generating indicators that concentrate the information provided by data. To test its applicability, a prototype of a cash counting machine has been used. With this prototype, different failure cases have been simulated by introducing defective elements. After the extraction of the indicators, using the Kullback–Liebler divergence, it has been possible to visualize the differences between the data associated with normal and failure operation. Subsequently, using a neural network, good results have been obtained, being able to correctly classify the failure in 90% of the cases. The result of this application demonstrates the proper functioning of the proposed approach in complex machines.
Collapse
Affiliation(s)
- Alberto Gascón
- Aragon Institute of Engineering Research, University of Zaragoza, 50018 Zaragoza, Spain; (A.G.); (D.B.); (Á.M.)
| | - Roberto Casas
- Aragon Institute of Engineering Research, University of Zaragoza, 50018 Zaragoza, Spain; (A.G.); (D.B.); (Á.M.)
- Correspondence: ; Tel.: +34-976-762-856
| | - David Buldain
- Aragon Institute of Engineering Research, University of Zaragoza, 50018 Zaragoza, Spain; (A.G.); (D.B.); (Á.M.)
| | - Álvaro Marco
- Aragon Institute of Engineering Research, University of Zaragoza, 50018 Zaragoza, Spain; (A.G.); (D.B.); (Á.M.)
- GeoSpatium Lab S.L., Carlos Marx 6, 50015 Zaragoza, Spain
| |
Collapse
|
14
|
Yan S, Wang Y, Aghaei F, Qiu Y, Zheng B. Improving Performance of Breast Cancer Risk Prediction by Incorporating Optical Density Image Feature Analysis: An Assessment. Acad Radiol 2022; 29 Suppl 1:S199-S210. [PMID: 28985925 PMCID: PMC5882616 DOI: 10.1016/j.acra.2017.08.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2017] [Revised: 07/22/2017] [Accepted: 08/07/2017] [Indexed: 01/03/2023]
Abstract
RATIONALE AND OBJECTIVES The purpose of this study is to improve accuracy of near-term breast cancer risk prediction by applying a new mammographic image conversion method combined with a two-stage artificial neural network (ANN)-based classification scheme. MATERIALS AND METHODS The dataset included 168 negative mammography screening cases. In developing and testing our new risk model, we first converted the original grayscale value (GV)-based mammographic images into optical density (OD)-based images. For each case, our computer-aided scheme then computed two types of image features representing bilateral asymmetry and the maximum of the image features computed from GV and OD images, respectively. A two-stage classification scheme consisting of three ANNs was developed. The first stage included two ANNs trained using features computed separately from GV and OD images of 138 cases. The second stage included another ANN to fuse the prediction scores produced by two ANNs in the first stage. The risk prediction performance was tested using the rest 30 cases. RESULTS With the two-stage classification scheme, the computed area under the receiver operating characteristic curve (AUC) was 0.816 ± 0.071, which was significantly higher than the AUC values of 0.669 ± 0.099 and 0.646 ± 0.099 achieved using two ANNs trained using GV features and OD features, respectively (P < .05). CONCLUSION This study demonstrated that applying an OD image conversion method can acquire new complimentary information to those acquired from the original images. As a result, fusion image features computed from these two types of images yielded significantly higher performance in near-term breast cancer risk prediction.
Collapse
Affiliation(s)
- Shiju Yan
- School of Medical Instrument and Food Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China,School of Electrical and Computer Engineering, University of Oklahoma, Norman, Oklahoma 73019, USA
| | - Yunzhi Wang
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, Oklahoma 73019, USA
| | - Faranak Aghaei
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, Oklahoma 73019, USA
| | - Yuchen Qiu
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, Oklahoma 73019, USA
| | - Bin Zheng
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, Oklahoma 73019, USA
| |
Collapse
|
15
|
Liu S, Yuan H, Liu J, Lin H, Yang C, Cai X. Comprehensive analysis of resting tremor based on acceleration signals of patients with Parkinson's disease. Technol Health Care 2021; 30:895-907. [PMID: 34657861 DOI: 10.3233/thc-213205] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND Resting tremor is an essential characteristic in patients suffering from Parkinson's disease (PD). OBJECTIVE Quantification and monitoring of tremor severity is clinically important to help achieve medication or rehabilitation guidance in daily monitoring. METHODS Wrist-worn tri-axial accelerometers were utilized to record the long-term acceleration signals of PD patients with different tremor severities rated by Unified Parkinson's Disease Rating Scale (UPDRS). Based on the extracted features, three kinds of classifiers were used to identify different tremor severities. Statistical tests were further designed for the feature analysis. RESULTS The support vector machine (SVM) achieved the best performance with an overall accuracy of 94.84%. Additional feature analysis indicated the validity of the proposed feature combination and revealed the importance of different features in differentiating tremor severities. CONCLUSION The present work obtains a high-accuracy classification in tremor severity, which is expected to play a crucial role in PD treatment and symptom monitoring in real life.
Collapse
Affiliation(s)
- Sen Liu
- Center for Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai, China.,Center for Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai, China
| | - Han Yuan
- Center for Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai, China.,Center for Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai, China
| | - Jiali Liu
- Department of Neurosurgery, Shenzhen Second People's Hospital, the First Affiliated Hospital of Shenzhen University, Shenzhen, Guangdong, China.,Shenzhen University School of Medicine, Shenzhen, Guangdong, China
| | - Hai Lin
- Department of Neurosurgery, Shenzhen Second People's Hospital, the First Affiliated Hospital of Shenzhen University, Shenzhen, Guangdong, China.,Shenzhen University School of Medicine, Shenzhen, Guangdong, China
| | - Cuiwei Yang
- Center for Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai, China.,Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention of Shanghai, Shanghai Engineering Research Center of Assistive Devices, Shanghai, China.,Center for Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai, China
| | - Xiaodong Cai
- Department of Neurosurgery, Shenzhen Second People's Hospital, the First Affiliated Hospital of Shenzhen University, Shenzhen, Guangdong, China.,Shenzhen University School of Medicine, Shenzhen, Guangdong, China
| |
Collapse
|
16
|
Jiang M, Zhao B, Luo S, Wang Q, Chu Y, Chen T, Mao X, Liu Y, Wang Y, Jiang X, Wei DQ, Xiong Y. NeuroPpred-Fuse: an interpretable stacking model for prediction of neuropeptides by fusing sequence information and feature selection methods. Brief Bioinform 2021; 22:6350884. [PMID: 34396388 DOI: 10.1093/bib/bbab310] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 07/01/2021] [Accepted: 07/18/2021] [Indexed: 12/13/2022] Open
Abstract
Neuropeptides acting as signaling molecules in the nervous system of various animals play crucial roles in a wide range of physiological functions and hormone regulation behaviors. Neuropeptides offer many opportunities for the discovery of new drugs and targets for the treatment of neurological diseases. In recent years, there have been several data-driven computational predictors of various types of bioactive peptides, but the relevant work about neuropeptides is little at present. In this work, we developed an interpretable stacking model, named NeuroPpred-Fuse, for the prediction of neuropeptides through fusing a variety of sequence-derived features and feature selection methods. Specifically, we used six types of sequence-derived features to encode the peptide sequences and then combined them. In the first layer, we ensembled three base classifiers and four feature selection algorithms, which select non-redundant important features complementarily. In the second layer, the output of the first layer was merged and fed into logistic regression (LR) classifier to train the model. Moreover, we analyzed the selected features and explained the feasibility of the selected features. Experimental results show that our model achieved 90.6% accuracy and 95.8% AUC on the independent test set, outperforming the state-of-the-art models. In addition, we exhibited the distribution of selected features by these tree models and compared the results on the training set to that on the test set. These results fully showed that our model has a certain generalization ability. Therefore, we expect that our model would provide important advances in the discovery of neuropeptides as new drugs for the treatment of neurological diseases.
Collapse
Affiliation(s)
- Mingming Jiang
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Bowen Zhao
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Shenggan Luo
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Qiankun Wang
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yanyi Chu
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Tianhang Chen
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xueying Mao
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yatong Liu
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yanjing Wang
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xue Jiang
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| |
Collapse
|
17
|
Matin SS, Pradhan B. Earthquake-Induced Building-Damage Mapping Using Explainable AI (XAI). Sensors (Basel) 2021; 21:4489. [PMID: 34209169 PMCID: PMC8271973 DOI: 10.3390/s21134489] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 06/15/2021] [Accepted: 06/25/2021] [Indexed: 11/21/2022]
Abstract
Building-damage mapping using remote sensing images plays a critical role in providing quick and accurate information for the first responders after major earthquakes. In recent years, there has been an increasing interest in generating post-earthquake building-damage maps automatically using different artificial intelligence (AI)-based frameworks. These frameworks in this domain are promising, yet not reliable for several reasons, including but not limited to the site-specific design of the methods, the lack of transparency in the AI-model, the lack of quality in the labelled image, and the use of irrelevant descriptor features in building the AI-model. Using explainable AI (XAI) can lead us to gain insight into identifying these limitations and therefore, to modify the training dataset and the model accordingly. This paper proposes the use of SHAP (Shapley additive explanation) to interpret the outputs of a multilayer perceptron (MLP)-a machine learning model-and analyse the impact of each feature descriptor included in the model for building-damage assessment to examine the reliability of the model. In this study, a post-event satellite image from the 2018 Palu earthquake was used. The results show that MLP can classify the collapsed and non-collapsed buildings with an overall accuracy of 84% after removing the redundant features. Further, spectral features are found to be more important than texture features in distinguishing the collapsed and non-collapsed buildings. Finally, we argue that constructing an explainable model would help to understand the model's decision to classify the buildings as collapsed and non-collapsed and open avenues to build a transferable AI model.
Collapse
Affiliation(s)
- Sahar S. Matin
- Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Ultimo, NSW 2007, Australia;
| | - Biswajeet Pradhan
- Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Ultimo, NSW 2007, Australia;
- Department of Energy and Mineral Resources Engineering, Sejong University, Choongmu-gwan, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea
- Center of Excellence for Climate Change Research, King Abdulaziz University, P.O. Box 80234, Jeddah 21589, Saudi Arabia
- Earth Observation Center, Institute of Climate Change, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| |
Collapse
|
18
|
Ko H, Rim K, Praça I. Influence of Features on Accuracy of Anomaly Detection for an Energy Trading System. Sensors (Basel) 2021; 21:s21124237. [PMID: 34205584 PMCID: PMC8235115 DOI: 10.3390/s21124237] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Revised: 06/17/2021] [Accepted: 06/17/2021] [Indexed: 11/16/2022]
Abstract
The biggest problem with conventional anomaly signal detection using features was that it was difficult to use it in real time and it requires processing of network signals. Furthermore, analyzing network signals in real-time required vast amounts of processing for each signal, as each protocol contained various pieces of information. This paper suggests anomaly detection by analyzing the relationship among each feature to the anomaly detection model. The model analyzes the anomaly of network signals based on anomaly feature detection. The selected feature for anomaly detection does not require constant network signal updates and real-time processing of these signals. When the selected features are found in the received signal, the signal is registered as a potential anomaly signal and is then steadily monitored until it is determined as either an anomaly or normal signal. In terms of the results, it determined the anomaly with 99.7% (0.997) accuracy in f(4)(S0) and in case f(4)(REJ) received 11,233 signals with a normal or 171anomaly judgment accuracy of 98.7% (0.987).
Collapse
Affiliation(s)
- Hoon Ko
- Instituto Superior de Engenharia do Porto, Instituto Politecnico do Porto, R. Dr. Antonio Bernardino de Almeida, 431, 4249-015 Porto, Portugal;
| | - Kwangcheol Rim
- College of Basic & General Education, Chosun University, 309 Pilmundae-ro, Dong-Gu, Gwangju 61452, Korea;
| | - Isabel Praça
- Instituto Superior de Engenharia do Porto, Instituto Politecnico do Porto, R. Dr. Antonio Bernardino de Almeida, 431, 4249-015 Porto, Portugal;
- Correspondence:
| |
Collapse
|
19
|
Pan T, Wang H, Si H, Li Y, Shang L. Identification of Pilots' Fatigue Status Based on Electrocardiogram Signals. Sensors (Basel) 2021; 21:3003. [PMID: 33922915 DOI: 10.3390/s21093003] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Revised: 04/20/2021] [Accepted: 04/21/2021] [Indexed: 11/25/2022]
Abstract
Fatigue is an important factor affecting modern flight safety. It can easily lead to a decline in pilots’ operational ability, misjudgments, and flight illusions. Moreover, it can even trigger serious flight accidents. In this paper, a wearable wireless physiological device was used to obtain pilots’ electrocardiogram (ECG) data in a simulated flight experiment, and 1440 effective samples were determined. The Friedman test was adopted to select the characteristic indexes that reflect the fatigue state of the pilot from the time domain, frequency domain, and non-linear characteristics of the effective samples. Furthermore, the variation rules of the characteristic indexes were analyzed. Principal component analysis (PCA) was utilized to extract the features of the selected feature indexes, and the feature parameter set representing the fatigue state of the pilot was established. For the study on pilots’ fatigue state identification, the feature parameter set was used as the input of the learning vector quantization (LVQ) algorithm to train the pilots’ fatigue state identification model. Results show that the recognition accuracy of the LVQ model reached 81.94%, which is 12.84% and 9.02% higher than that of traditional back propagation neural network (BPNN) and support vector machine (SVM) model, respectively. The identification model based on the LVQ established in this paper is suitable for identifying pilots’ fatigue states. This is of great practical significance to reduce flight accidents caused by pilot fatigue, thus providing a theoretical foundation for pilot fatigue risk management and the development of intelligent aircraft autopilot systems.
Collapse
|
20
|
Liu F, Long Y, Luo J, Pu H, Duan C, Zhong S. Active Fault Localization of Actuators on Torpedo-Shaped Autonomous Underwater Vehicles. Sensors (Basel) 2021; 21:E476. [PMID: 33440899 DOI: 10.3390/s21020476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 01/07/2021] [Accepted: 01/08/2021] [Indexed: 11/16/2022]
Abstract
To ensure the mission implementation of Autonomous Underwater Vehicles (AUVs), faults occurring on actuators should be detected and located promptly; therefore, reliable control strategies and inputs can be effectively provided. In this paper, faults occurring on the propulsion and attitude control systems of a torpedo-shaped AUV are analyzed and located while fault features may induce confusions for conventional fault localization (FL). Selective features of defined fault parameters are assorted as necessary conditions against different faulty actuators and synthesized in a fault tree subsequently to state the sufficiency towards possible abnormal parts. By matching fault features with those of estimated fault parameters, suspected faulty sections are located. Thereafter, active FL strategies that analyze the related fault parameters after executing purposive actuator control are proposed to provide precise fault location. Moreover, the generality of the proposed methods is analyzed to support extensive implementations. Simulations based on finite element analysis against a torpedo-shaped AUV with actuator faults are carried out to illustrate the effectiveness of the proposed methods.
Collapse
|
21
|
Adedolapo O, Huichen Y, Avishek B, William H, Dan A, Mohammed T. Feature Selection for Learning to Predict Outcomes of Compute Cluster Jobs with Application to Decision Support. Proc (Int Conf Comput Sci Comput Intell) 2020; 2020:1231-1236. [PMID: 35382513 PMCID: PMC8979371 DOI: 10.1109/csci51800.2020.00230] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
We present a machine learning framework and a new test bed for data mining from the Slurm Workload Manager for high-performance computing (HPC) clusters. The focus was to find a method for selecting features to support decisions: helping users decide whether to resubmit failed jobs with boosted CPU and memory allocations or migrate them to a computing cloud. This task was cast as both supervised classification and regression learning, specifically, sequential problem solving suitable for reinforcement learning. Selecting relevant features can improve training accuracy, reduce training time, and produce a more comprehensible model, with an intelligent system that can explain predictions and inferences. We present a supervised learning model trained on a Simple Linux Utility for Resource Management (Slurm) data set of HPC jobs using three different techniques for selecting features: linear regression, lasso, and ridge regression. Our data set represented both HPC jobs that failed and those that succeeded, so our model was reliable, less likely to overfit, and generalizable. Our model achieved an R2 of 95% with 99% accuracy. We identified five predictors for both CPU and memory properties.
Collapse
Affiliation(s)
- Okanlawon Adedolapo
- Department of Computer Science, Kansas State University, Manhattan, Kansas, USA
| | - Yang Huichen
- Department of Computer Science, Kansas State University, Manhattan, Kansas, USA
| | - Bose Avishek
- Department of Computer Science, Kansas State University, Manhattan, Kansas, USA
| | - Hsu William
- Department of Computer Science, Kansas State University, Manhattan, Kansas, USA
| | - Andresen Dan
- Department of Computer Science, Kansas State University, Manhattan, Kansas, USA
| | - Tanash Mohammed
- Department of Computer Science, Kansas State University, Manhattan, Kansas, USA
| |
Collapse
|
22
|
Sunarya U, Sun Hariyani Y, Cho T, Roh J, Hyeong J, Sohn I, Kim S, Park C. Feature Analysis of Smart Shoe Sensors for Classification of Gait Patterns. Sensors (Basel) 2020; 20:s20216253. [PMID: 33147794 PMCID: PMC7662266 DOI: 10.3390/s20216253] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 10/25/2020] [Accepted: 10/28/2020] [Indexed: 12/30/2022]
Abstract
Gait analysis is commonly used to detect foot disorders and abnormalities such as supination, pronation, unstable left foot and unstable right foot. Early detection of these abnormalities could help us to correct the walking posture and avoid getting injuries. This paper presents extensive feature analyses on smart shoes sensor data, including pressure sensors, accelerometer and gyroscope signals, to obtain the optimum combination of the sensors for gait classification, which is crucial to implement a power-efficient mobile smart shoes system. In addition, we investigated the optimal length of data segmentation based on the gait cycle parameters, reduction of the feature dimensions and feature selection for the classification of the gait patterns. Benchmark tests among several machine learning algorithms were conducted using random forest, k-nearest neighbor (KNN), logistic regression and support vector machine (SVM) algorithms for the classification task. Our experiments demonstrated the combination of accelerometer and gyroscope sensor features with SVM achieved the best performance with 89.36% accuracy, 89.76% precision and 88.44% recall. This research suggests a new state-of-the-art gait classification approach, specifically on detecting human gait abnormalities.
Collapse
Affiliation(s)
- Unang Sunarya
- Department of Computer Engineering, Kwangwoon University, Seoul 01897, Korea; (U.S.); (Y.S.H.)
- School of Applied Science, Telkom University, Bandung 40257, Indonesia
| | - Yuli Sun Hariyani
- Department of Computer Engineering, Kwangwoon University, Seoul 01897, Korea; (U.S.); (Y.S.H.)
- School of Applied Science, Telkom University, Bandung 40257, Indonesia
| | - Taeheum Cho
- Department of Intelligent Information and Embedded Software Engineering, Kwangwoon University, Seoul 01897, Korea;
| | - Jongryun Roh
- Human Convergence Technology R&D Department, Korea Institute of Industrial Technology, Ansan 15588, Korea; (J.R.); (J.H.)
| | - Joonho Hyeong
- Human Convergence Technology R&D Department, Korea Institute of Industrial Technology, Ansan 15588, Korea; (J.R.); (J.H.)
| | - Illsoo Sohn
- Department of Computer Science and Engineering Seoul National University of Science and Technology, Seoul 01811, Korea;
| | - Sayup Kim
- Human Convergence Technology R&D Department, Korea Institute of Industrial Technology, Ansan 15588, Korea; (J.R.); (J.H.)
- Correspondence: (S.K.); (C.P.); Tel.: +82-2-940-8251 (C.P.)
| | - Cheolsoo Park
- Department of Computer Engineering, Kwangwoon University, Seoul 01897, Korea; (U.S.); (Y.S.H.)
- Correspondence: (S.K.); (C.P.); Tel.: +82-2-940-8251 (C.P.)
| |
Collapse
|
23
|
Zhang D, Guan ZX, Zhang ZM, Li SH, Dao FY, Tang H, Lin H. Recent Development of Computational Predicting Bioluminescent Proteins. Curr Pharm Des 2020; 25:4264-4273. [PMID: 31696804 DOI: 10.2174/1381612825666191107100758] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 11/04/2019] [Indexed: 12/22/2022]
Abstract
Bioluminescent Proteins (BLPs) are widely distributed in many living organisms that act as a key role of light emission in bioluminescence. Bioluminescence serves various functions in finding food and protecting the organisms from predators. With the routine biotechnological application of bioluminescence, it is recognized to be essential for many medical, commercial and other general technological advances. Therefore, the prediction and characterization of BLPs are significant and can help to explore more secrets about bioluminescence and promote the development of application of bioluminescence. Since the experimental methods are money and time-consuming for BLPs identification, bioinformatics tools have played important role in fast and accurate prediction of BLPs by combining their sequences information with machine learning methods. In this review, we summarized and compared the application of machine learning methods in the prediction of BLPs from different aspects. We wish that this review will provide insights and inspirations for researches on BLPs.
Collapse
Affiliation(s)
- Dan Zhang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Zheng-Xing Guan
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Zi-Mei Zhang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Shi-Hao Li
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Fu-Ying Dao
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Hua Tang
- Department of Pathophysiology, Southwest Medical University, Luzhou 646000, China
| | - Hao Lin
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| |
Collapse
|
24
|
de Moura J, Vidal PL, Novo J, Rouco J, Penedo MG, Ortega M. Intraretinal Fluid Pattern Characterization in Optical Coherence Tomography Images. Sensors (Basel) 2020; 20:s20072004. [PMID: 32260062 PMCID: PMC7180444 DOI: 10.3390/s20072004] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 03/27/2020] [Accepted: 03/31/2020] [Indexed: 12/20/2022]
Abstract
Optical Coherence Tomography (OCT) has become a relevant image modality in the ophthalmological clinical practice, as it offers a detailed representation of the eye fundus. This medical imaging modality is currently one of the main means of identification and characterization of intraretinal cystoid regions, a crucial task in the diagnosis of exudative macular disease or macular edema, among the main causes of blindness in developed countries. This work presents an exhaustive analysis of intensity and texture-based descriptors for its identification and classification, using a complete set of 510 texture features, three state-of-the-art feature selection strategies, and seven representative classifier strategies. The methodology validation and the analysis were performed using an image dataset of 83 OCT scans. From these images, 1609 samples were extracted from both cystoid and non-cystoid regions. The different tested configurations provided satisfactory results, reaching a mean cross-validation test accuracy of 92.69%. The most promising feature categories identified for the issue were the Gabor filters, the Histogram of Oriented Gradients (HOG), the Gray-Level Run-Length matrix (GLRL), and the Laws’ texture filters (LAWS), being consistently and considerably selected along all feature selector algorithms in the top positions of different relevance rankings.
Collapse
Affiliation(s)
- Joaquim de Moura
- Centro de investigación CITIC, Universidade da Coruña, 15071 A Coruña, Spain; (J.d.M.); (J.N.); (J.R.); (M.G.P.); (M.O.)
- Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, 15006 A Coruña, Spain
| | - Plácido L. Vidal
- Centro de investigación CITIC, Universidade da Coruña, 15071 A Coruña, Spain; (J.d.M.); (J.N.); (J.R.); (M.G.P.); (M.O.)
- Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, 15006 A Coruña, Spain
- Correspondence:
| | - Jorge Novo
- Centro de investigación CITIC, Universidade da Coruña, 15071 A Coruña, Spain; (J.d.M.); (J.N.); (J.R.); (M.G.P.); (M.O.)
- Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, 15006 A Coruña, Spain
| | - José Rouco
- Centro de investigación CITIC, Universidade da Coruña, 15071 A Coruña, Spain; (J.d.M.); (J.N.); (J.R.); (M.G.P.); (M.O.)
- Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, 15006 A Coruña, Spain
| | - Manuel G. Penedo
- Centro de investigación CITIC, Universidade da Coruña, 15071 A Coruña, Spain; (J.d.M.); (J.N.); (J.R.); (M.G.P.); (M.O.)
- Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, 15006 A Coruña, Spain
| | - Marcos Ortega
- Centro de investigación CITIC, Universidade da Coruña, 15071 A Coruña, Spain; (J.d.M.); (J.N.); (J.R.); (M.G.P.); (M.O.)
- Grupo VARPA, Instituto de Investigación Biomédica de A Coruña (INIBIC), Universidade da Coruña, 15006 A Coruña, Spain
| |
Collapse
|
25
|
Wang J, Yang B, An Y, Marquez-Lago T, Leier A, Wilksch J, Hong Q, Zhang Y, Hayashida M, Akutsu T, Webb GI, Strugnell RA, Song J, Lithgow T. Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches. Brief Bioinform 2019; 20:931-951. [PMID: 29186295 PMCID: PMC6585386 DOI: 10.1093/bib/bbx164] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2017] [Revised: 11/08/2017] [Indexed: 12/13/2022] Open
Abstract
In the course of infecting their hosts, pathogenic bacteria secrete numerous effectors, namely, bacterial proteins that pervert host cell biology. Many Gram-negative bacteria, including context-dependent human pathogens, use a type IV secretion system (T4SS) to translocate effectors directly into the cytosol of host cells. Various type IV secreted effectors (T4SEs) have been experimentally validated to play crucial roles in virulence by manipulating host cell gene expression and other processes. Consequently, the identification of novel effector proteins is an important step in increasing our understanding of host-pathogen interactions and bacterial pathogenesis. Here, we train and compare six machine learning models, namely, Naïve Bayes (NB), K-nearest neighbor (KNN), logistic regression (LR), random forest (RF), support vector machines (SVMs) and multilayer perceptron (MLP), for the identification of T4SEs using 10 types of selected features and 5-fold cross-validation. Our study shows that: (1) including different but complementary features generally enhance the predictive performance of T4SEs; (2) ensemble models, obtained by integrating individual single-feature models, exhibit a significantly improved predictive performance and (3) the 'majority voting strategy' led to a more stable and accurate classification performance when applied to predicting an ensemble learning model with distinct single features. We further developed a new method to effectively predict T4SEs, Bastion4 (Bacterial secretion effector predictor for T4SS), and we show our ensemble classifier clearly outperforms two recent prediction tools. In summary, we developed a state-of-the-art T4SE predictor by conducting a comprehensive performance evaluation of different machine learning algorithms along with a detailed analysis of single- and multi-feature selections.
Collapse
Affiliation(s)
- Jiawei Wang
- Biomedicine Discovery Institute and the Department of Microbiology at Monash University, Australia
| | - Bingjiao Yang
- National Engineering Research Center for Equipment and Technology of Cold Strip Rolling, College of Mechanical Engineering from Yanshan University, China
| | - Yi An
- College of Information Engineering, Northwest A&F University, China
| | - Tatiana Marquez-Lago
- Department of Genetics, University of Alabama at Birmingham (UAB) School of Medicine, USA
| | - André Leier
- Department of Genetics and the Informatics Institute, University of Alabama at Birmingham (UAB) School of Medicine, USA
| | - Jonathan Wilksch
- Department of Microbiology and Immunology at the University of Melbourne, Australia
| | | | - Yang Zhang
- Computer Science and Engineering in 2015 fromNorthwestern Polytechnical University, China
| | | | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Japan
| | - Geoffrey I Webb
- Faculty of Information Technology, Monash Centre for Data Science, Monash University
| | - Richard A Strugnell
- Department of Microbiology and Immunology, Faculty of Medicine Dentistry and Health Sciences, University of Melbourne
| | - Jiangning Song
- Department of Biochemistry and Molecular Biology, Monash University, Australia
| | - Trevor Lithgow
- Department of Microbiology at Monash University, Australia
| |
Collapse
|
26
|
Van der Mierden S, Tsaioun K, Bleich A, Leenaars CHC. Software tools for literature screening in systematic reviews in biomedical research. ALTEX 2019; 36:508-517. [PMID: 31113000 DOI: 10.14573/altex.1902131] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 05/07/2019] [Indexed: 11/23/2022]
Abstract
Systematic Reviews (SRs) hold promise for implementing the 3Rs in animal sciences: they can retrieve available alternative models, help refining experiments, and identify insufficiencies, or an excess of, scientific knowledge on a particular topic. Unfortunately, SRs can be labour- and time-intensive, especially the reference screening and data extraction phases. Fortunately, there are several software tools available that help make screening faster and easier. However, it is not always clear which features the tools offer. Therefore, a feature analysis was performed to compare different reference screening tools as objectively as possible. This analysis enables researchers to select the most appropriate tool for their needs. Fifteen different tools were compared: CADIMA, Covidence, DistillerSR, Endnote, Endnote using Bramer's method, EROS, HAWC, Microsoft Excel, Excel using VonVille's method, Microsoft Word, Rayyan, RevMan, SyRF, SysRev.com, and SWIFT Active Screener. Their support of 21 features was tested. Features were categorised as mandatory, desirable, and optional. DistillerSR, Covidence, and SWIFT Active Screener are the tools that support all mandatory features. These tools are preferred for screening references, but none of them are free. The best scoring free tool is Rayyan, which lacks one mandatory function: distinct title/abstract and full-text phases. The lowest scoring tools are those not specifically designed for SRs, like Microsoft Word and Endnote. Their use can only be advised for small and simple SRs. A well-informed selection of SR screening tools will benefit review quality and speed, which can contribute to the advancement of the 3Rs in animal studies.
Collapse
Affiliation(s)
| | - Katya Tsaioun
- Evidence-based Toxicology Collaboration at Johns Hopkins Bloomberg School of Public Health (EBTC), Baltimore, MD, USA
| | - André Bleich
- Institute for Laboratory Animal Science, Hannover Medical School, Hannover, Germany
| | - Cathalijn H C Leenaars
- Institute for Laboratory Animal Science, Hannover Medical School, Hannover, Germany.,Faculty of Veterinary Sciences, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
27
|
Chirra P, Leo P, Yim M, Bloch BN, Rastinehad AR, Purysko A, Rosen M, Madabhushi A, Viswanath SE. Multisite evaluation of radiomic feature reproducibility and discriminability for identifying peripheral zone prostate tumors on MRI. J Med Imaging (Bellingham) 2019; 6:024502. [PMID: 31259199 PMCID: PMC6566001 DOI: 10.1117/1.jmi.6.2.024502] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2018] [Accepted: 05/15/2019] [Indexed: 12/18/2022] Open
Abstract
Recent advances in the field of radiomics have enabled the development of a number of prognostic and predictive imaging-based tools for a variety of diseases. However, wider clinical adoption of these tools is contingent on their generalizability across multiple sites and scanners. This may be particularly relevant in the context of radiomic features derived from T1- or T2-weighted magnetic resonance images (MRIs), where signal intensity values are known to lack tissue-specific meaning and vary based on differing acquisition protocols between institutions. We present the first empirical study of benchmarking five different radiomic feature families in terms of both reproducibility and discriminability in a multisite setting, specifically, for identifying prostate tumors in the peripheral zone on MRI. Our cohort comprised 147 patient T2-weighted MRI datasets from four different sites, all of which are first preprocessed to correct for acquisition-related artifacts such as bias field, differing voxel resolutions, and intensity drift (nonstandardness). About 406 three-dimensional voxel-wise radiomic features from five different families (gray, Haralick, gradient, Laws, and Gabor) were evaluated in a cross-site setting to determine (a) how reproducible they are within a relatively homogeneous nontumor tissue region and (b) how well they could discriminate tumor regions from nontumor regions. Our results demonstrate that a majority of the popular Haralick features are reproducible in over 99% of all cross-site comparisons, as well as achieve excellent cross-site discriminability (classification accuracy of ≈ 0.8 ). By contrast, a majority of Laws features are highly variable across sites (reproducible in < 75 % of all cross-site comparisons) as well as resulting in low cross-site classifier accuracies ( < 0.6 ), likely due to a large number of noisy filter responses that can be extracted. These trends suggest that only a subset of radiomic features and associated parameters may be both reproducible and discriminable enough for use within machine learning classifier schemes.
Collapse
Affiliation(s)
- Prathyush Chirra
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
| | - Patrick Leo
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
| | - Michael Yim
- Northeast Ohio Medical University, College of Medicine, Rootstown, Ohio, United States
| | - B. Nicolas Bloch
- Boston University School of Medicine, Department of Radiology, Boston, Massachusetts, United States
| | - Ardeshir R. Rastinehad
- Icahn School of Medicine at Mount Sinai, Department of Urology, New York, New York, United States
| | - Andrei Purysko
- Cleveland Clinic, Department of Radiology, Cleveland, Ohio, United States
| | - Mark Rosen
- Hospital of the University of Pennsylvania, Department of Radiology, Philadelphia, Pennsylvania, United States
| | - Anant Madabhushi
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
- Louis Stokes Cleveland Veterans Administration Medical Center, Cleveland, Ohio, United States
| | - Satish E. Viswanath
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
| |
Collapse
|
28
|
Mukundan R. Analysis of Image Feature Characteristics for Automated Scoring of HER2 in Histology Slides. J Imaging 2019; 5:jimaging5030035. [PMID: 34460463 PMCID: PMC8320919 DOI: 10.3390/jimaging5030035] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2018] [Revised: 03/01/2019] [Accepted: 03/06/2019] [Indexed: 12/12/2022] Open
Abstract
The evaluation of breast cancer grades in immunohistochemistry (IHC) slides takes into account various types of visual markers and morphological features of stained membrane regions. Digital pathology algorithms using whole slide images (WSIs) of histology slides have recently been finding several applications in such computer-assisted evaluations. Features that are directly related to biomarkers used by pathologists are generally preferred over the pixel values of entire images, even though the latter has more information content. This paper explores in detail various types of feature measurements that are suitable for the automated scoring of human epidermal growth factor receptor 2 (HER2) in histology slides. These are intensity features known as characteristic curves, texture features in the form of uniform local binary patterns (ULBPs), morphological features specifying connectivity of regions, and first-order statistical features of the overall intensity distribution. This paper considers important properties of the above features and outlines methods for reducing information redundancy, maximizing inter-class separability, and improving classification accuracy in the combined feature set. This paper also presents a detailed experimental analysis performed using the aforementioned features on a WSI dataset of IHC stained slides.
Collapse
Affiliation(s)
- Ramakrishnan Mukundan
- Department of Computer Science and Software Engineering, University of Canterbury, Christchurch 8140, New Zealand
| |
Collapse
|
29
|
Banerjee A, Levy Y, Mitra P. Analyzing Change in Protein Stability Associated with Single Point Deletions in a Newly Defined Protein Structure Database. J Proteome Res 2019; 18:1402-1410. [PMID: 30735617 DOI: 10.1021/acs.jproteome.9b00048] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Protein backbone alternation due to insertion/deletion or mutation operation often results in a change of fundamental biophysical properties of proteins. The proposed work intends to encode the protein stability changes associated with single point deletions (SPDs) of amino acids in proteins. The encoding will help in the primary screening of detrimental backbone modifications before opting for expensive in vitro experimentations. In the absence of any benchmark database documenting SPDs, we curate a data set containing SPDs that lead to both folded conformations and unfolded state. We differentiate these SPD instances with the help of simple structural and physicochemical features and eventually classify the foldability resulting out of SPDs using a Random Forest classifier and an Elliptic Envelope based outlier detector. Adhering to leave one out cross validation, the accuracy of the Random Forest classifier and the Elliptic Envelope is of 99.4% and 98.1%, respectively. The newly defined database and the delineation of SPD instances based on its resulting foldability provide a head start toward finding a solution to the given problem.
Collapse
Affiliation(s)
| | - Yaakov Levy
- Department of Structural Biology , Weizmann Institute of Science , Rehovot 76100 , Israel
| | | |
Collapse
|
30
|
Wu J, Zhang B, Zhou J, Xiong Y, Gu B, Yang X. Automatic Recognition of Ripening Tomatoes by Combining Multi-Feature Fusion with a Bi-Layer Classification Strategy for Harvesting Robots. Sensors (Basel) 2019; 19:s19030612. [PMID: 30717147 PMCID: PMC6387124 DOI: 10.3390/s19030612] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Revised: 01/30/2019] [Accepted: 01/30/2019] [Indexed: 12/03/2022]
Abstract
Automatic recognition of ripening tomatoes is a main hurdle precluding the replacement of manual labour by robotic harvesting. In this paper, we present a novel automatic algorithm for recognition of ripening tomatoes using an improved method that combines multiple features, feature analysis and selection, a weighted relevance vector machine (RVM) classifier, and a bi-layer classification strategy. The algorithm operates using a two-layer strategy. The first-layer classification strategy aims to identify tomato-containing regions in images using the colour difference information. The second classification strategy is based on a classifier that is trained on multi-medium features. In our proposed algorithm, to simplify the calculation and to improve the recognition efficiency, the processed images are divided into 9 × 9 pixel blocks, and these blocks, rather than single pixels, are considered as the basic units in the classification task. Six colour-related features, namely the Red (R), Green (G), Blue (B), Hue (H), Saturation (S) and Intensity (I) components, respectively, colour components, and five textural features (entropy, energy, correlation, inertial moment and local smoothing) were extracted from pixel blocks. Relevant features and their weights were analysed using the iterative RELIEF (I-RELIEF) algorithm. The image blocks were classified into different categories using a weighted RVM classifier based on the selected relevant features. The final results of tomato recognition were determined by combining the block classification results and the bi-layer classification strategy. The algorithm demonstrated the detection accuracy of 94.90% on 120 images, this suggests that the proposed algorithm is effective and suitable for tomato detection
Collapse
Affiliation(s)
- Jingui Wu
- College of Engineering, Nanjing Agricultural University, Nanjing 210031, China.
| | - Baohua Zhang
- College of Engineering, Nanjing Agricultural University, Nanjing 210031, China.
| | - Jun Zhou
- College of Engineering, Nanjing Agricultural University, Nanjing 210031, China.
| | - Yingjun Xiong
- College of Engineering, Nanjing Agricultural University, Nanjing 210031, China.
| | - Baoxing Gu
- College of Engineering, Nanjing Agricultural University, Nanjing 210031, China.
| | - Xiaolong Yang
- College of Horticulture, Shenyang Agricultural University, Shenyang 110866, China.
| |
Collapse
|
31
|
Gu J, Zhu J, Qiu Q, Wang Y, Bai T, Duan J, Yin Y. The Feasibility Study of Megavoltage Computed Tomographic (MVCT) Image for Texture Feature Analysis. Front Oncol 2018; 8:586. [PMID: 30568918 PMCID: PMC6290333 DOI: 10.3389/fonc.2018.00586] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Accepted: 11/21/2018] [Indexed: 11/13/2022] Open
Abstract
Purpose: To determine whether radiomics texture features can be reproducibly obtained from megavoltage computed tomographic (MVCT) images acquired by Helical TomoTherapy (HT) with different imaging conditions. Methods: For each of the 195 textures enrolled, the mean intrapatient difference, which is considered to be the benchmark for reproducibility, was calculated from the MVCT images of 22 patients with early-stage non-small-cell lung cancer. Test–retest MVCT images of an in-house designed phantom were acquired to determine the concordance correlation coefficient (CCC) for these 195 texture features. Features with high reproducibility (CCC > 0.9) in the phantom test–retest set were investigated for sensitivities to different imaging protocols, scatter levels, and motion frequencies using a wood phantom and in-vitro animal tissues. Results: Of the 195 features, 165 (85%) features had CCC > 0.9. For the wood phantom, 124 features were reproducible in two kinds of scatter materials, and further investigations were performed on these features. For animal tissues, 108 features passed the criteria for reproducibility when one layer of scatter was covered, while 106 and 108 features of in-vitro liver and bone passed with two layers of scatter, respectively. Considering the effect of differing acquisition pitch (AcP), 97 features extracted from wood passed, while 103 and 59 features extracted from in-vitro liver and bone passed, respectively. Different reconstruction intervals (RI) had a small effect on the stability of the feature value. When AcP and RI were held consistent without motion, all 124 features calculated from wood passed, and a majority (122 of 124) of the features passed when imaging with a “fine” AcP with different RIs. However, only 55 and 40 features passed with motion frequencies of 20 and 25 beats per minute, respectively. Conclusion: Motion frequency has a significant impact on MVCT texture features, and features from MVCT were more reproducibility in different scatter conditions than those from CBCT. Considering the effects of AcP and RI, the scanning protocols should be kept consistent when MVCT images are used for feature analysis. Some radiomics features from HT MVCT images are reproducible and could be used for creating clinical prediction models in the future.
Collapse
Affiliation(s)
- Jiabing Gu
- School of Medicine and Life Sciences, University of Jinan-Shandong Academy of Medical Sciences, Jinan, China.,Department of Radiation Oncology Physics and Technology, Shandong Cancer Hospital Affiliated to Shandong University, Shandong Academy of Medical Sciences, Jinan, China
| | - Jian Zhu
- Department of Radiation Oncology Physics and Technology, Shandong Cancer Hospital Affiliated to Shandong University, Shandong Academy of Medical Sciences, Jinan, China
| | - Qingtao Qiu
- Department of Radiation Oncology Physics and Technology, Shandong Cancer Hospital Affiliated to Shandong University, Shandong Academy of Medical Sciences, Jinan, China
| | - Yungang Wang
- Department of Radiation Oncology Physics and Technology, Shandong Cancer Hospital Affiliated to Shandong University, Shandong Academy of Medical Sciences, Jinan, China
| | - Tong Bai
- Department of Radiation Oncology Physics and Technology, Shandong Cancer Hospital Affiliated to Shandong University, Shandong Academy of Medical Sciences, Jinan, China
| | - Jinghao Duan
- Department of Radiation Oncology Physics and Technology, Shandong Cancer Hospital Affiliated to Shandong University, Shandong Academy of Medical Sciences, Jinan, China
| | - Yong Yin
- Department of Radiation Oncology Physics and Technology, Shandong Cancer Hospital Affiliated to Shandong University, Shandong Academy of Medical Sciences, Jinan, China
| |
Collapse
|
32
|
Zhao Z, Zhang Y, Deng Y. A Comprehensive Feature Analysis of the Fetal Heart Rate Signal for the Intelligent Assessment of Fetal State. J Clin Med 2018; 7:jcm7080223. [PMID: 30127256 PMCID: PMC6111566 DOI: 10.3390/jcm7080223] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Revised: 08/14/2018] [Accepted: 08/16/2018] [Indexed: 11/16/2022] Open
Abstract
Continuous monitoring of the fetal heart rate (FHR) signal has been widely used to allow obstetricians to obtain detailed physiological information about newborns. However, visual interpretation of FHR traces causes inter-observer and intra-observer variability. Therefore, this study proposed a novel computerized analysis software of the FHR signal (CAS-FHR), aimed at providing medical decision support. First, to the best of our knowledge, the software extracted the most comprehensive features (47) from different domains, including morphological, time, and frequency and nonlinear domains. Then, for the intelligent assessment of fetal state, three representative machine learning algorithms (decision tree (DT), support vector machine (SVM), and adaptive boosting (AdaBoost)) were chosen to execute the classification stage. To improve the performance, feature selection/dimensionality reduction methods (statistical test (ST), area under the curve (AUC), and principal component analysis (PCA)) were designed to determine informative features. Finally, the experimental results showed that AdaBoost had stronger classification ability, and the performance of the selected feature set using ST was better than that of the original dataset with accuracies of 92% and 89%, sensitivities of 92% and 89%, specificities of 90% and 88%, and F-measures of 95% and 92%, respectively. In summary, the results proved the effectiveness of our proposed approach involving the comprehensive analysis of the FHR signal for the intelligent prediction of fetal asphyxia accurately in clinical practice.
Collapse
Affiliation(s)
- Zhidong Zhao
- Hangdian Smart City Research Center of Zhejiang Province, Hangzhou Dianzi University, 310018 Hangzhou, China.
| | - Yang Zhang
- Hangdian Smart City Research Center of Zhejiang Province, Hangzhou Dianzi University, 310018 Hangzhou, China.
| | - Yanjun Deng
- Hangdian Smart City Research Center of Zhejiang Province, Hangzhou Dianzi University, 310018 Hangzhou, China.
| |
Collapse
|
33
|
Zhao X, Zhao X, Bao L, Zhang Y, Dai J, Yin M. Glypre: In Silico Prediction of Protein Glycation Sites by Fusing Multiple Features and Support Vector Machine. Molecules 2017; 22:molecules22111891. [PMID: 29099805 PMCID: PMC6150326 DOI: 10.3390/molecules22111891] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Accepted: 10/26/2017] [Indexed: 12/22/2022] Open
Abstract
Glycation is a non-enzymatic process occurring inside or outside the host body by attaching a sugar molecule to a protein or lipid molecule. It is an important form of post-translational modification (PTM), which impairs the function and changes the characteristics of the proteins so that the identification of the glycation sites may provide some useful guidelines to understand various biological functions of proteins. In this study, we proposed an accurate prediction tool, named Glypre, for lysine glycation. Firstly, we used multiple informative features to encode the peptides. These features included the position scoring function, secondary structure, AAindex, and the composition of k-spaced amino acid pairs. Secondly, the distribution of distinctive features of the residues surrounding the glycation and non-glycation sites was statistically analysed. Thirdly, based on the distribution of these features, we developed a new predictor by using different optimal window sizes for different properties and a two-step feature selection method, which utilized the maximum relevance minimum redundancy method followed by a greedy feature selection procedure. The performance of Glypre was measured with a sensitivity of 57.47%, a specificity of 90.78%, an accuracy of 79.68%, area under the receiver-operating characteristic (ROC) curve (AUC) of 0.86, and a Matthews’s correlation coefficient (MCC) of 0.52 by 10-fold cross-validation. The detailed analysis results showed that our predictor may play a complementary role to other existing methods for identifying protein lysine glycation. The source code and datasets of the Glypre are available in the Supplementary File.
Collapse
Affiliation(s)
- Xiaowei Zhao
- School of Computer Science and Information Technology, Northeast Normal University, Changchun 130117, China.
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China.
| | - Xiaosa Zhao
- School of Computer Science and Information Technology, Northeast Normal University, Changchun 130117, China.
| | - Lingling Bao
- School of Computer Science and Information Technology, Northeast Normal University, Changchun 130117, China.
| | - Yonggang Zhang
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China.
| | - Jiangyan Dai
- School of Computer Engineering, Weifang University, Weifang 261061, China.
| | - Minghao Yin
- School of Computer Science and Information Technology, Northeast Normal University, Changchun 130117, China.
| |
Collapse
|
34
|
Komagata H, Ichimura T, Matsuta Y, Ishikawa M, Shinoda K, Kobayashi N, Sasaki A. Feature analysis of cell nuclear chromatin distribution in support of cervical cytology. J Med Imaging (Bellingham) 2017; 4:047501. [PMID: 29057290 PMCID: PMC5644512 DOI: 10.1117/1.jmi.4.4.047501] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2017] [Accepted: 09/26/2017] [Indexed: 11/20/2022] Open
Abstract
Cytology, a method of estimating cancer or cellular atypia from microscopic images of scraped specimens, is used according to the pathologist’s experience to diagnose cases based on the degree of structural changes and atypia. Several methods of cell feature quantification, including nuclear size, nuclear shape, cytoplasm size, and chromatin texture, have been studied. We focus on chromatin distribution in the cell nucleus and propose new feature values that indicate the chromatin complexity, spreading, and bias, including convex hull ratio on multiple binary images, intensity distribution from the gravity center, and tangential component intensity and texture biases. The characteristics and cellular classification accuracies of the proposed features were verified through experiments using cervical smear samples, for which clear nuclear morphologic diagnostic criteria are available. In this experiment, we also used a stepwise support vector machine to create a machine learning model and a cross-validation algorithm with which to derive identification accuracy. Our results demonstrate the effectiveness of our proposed feature values.
Collapse
Affiliation(s)
- Hideki Komagata
- Saitama Medical University, Faculty of Health and Medical Care, Saitama, Japan
| | - Takaya Ichimura
- Saitama Medical University, Department of Pathology, Saitama, Japan
| | | | - Masahiro Ishikawa
- Saitama Medical University, Faculty of Health and Medical Care, Saitama, Japan
| | - Kazuma Shinoda
- Utsunomiya University, Graduate School of Engineering, Tochigi, Japan
| | - Naoki Kobayashi
- Saitama Medical University, Faculty of Health and Medical Care, Saitama, Japan
| | - Atsushi Sasaki
- Saitama Medical University, Department of Pathology, Saitama, Japan
| |
Collapse
|
35
|
Elkerton JS, Xu Y, Pickering JG, Ward AD. Differentiation of arterioles from venules in mouse histology images using machine learning. J Med Imaging (Bellingham) 2017; 4:021104. [PMID: 28331891 PMCID: PMC5330885 DOI: 10.1117/1.jmi.4.2.021104] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 12/12/2016] [Indexed: 11/14/2022] Open
Abstract
Analysis and morphological comparison of the arteriolar and venular components of a microvascular network are essential to our understanding of multiple diseases affecting every organ system. We have developed and evaluated the first fully automatic software system for differentiation of arterioles from venules on high-resolution digital histology images of the mouse hind limb immunostained with smooth muscle [Formula: see text]-actin. Classifiers trained on statistical and morphological features by supervised machine learning provided useful classification accuracy for differentiation of arterioles from venules, achieving an area under the receiver operating characteristic curve of 0.89. Feature selection was consistent across cross validation iterations, and a small set of two features was required to achieve the reported performance, suggesting the generalizability of the system. This system eliminates the need for laborious manual classification of the hundreds of microvessels occurring in a typical sample and paves the way for high-throughput analysis of the arteriolar and venular networks in the mouse.
Collapse
Affiliation(s)
- J. Sachi Elkerton
- Western University, Department of Medical Biophysics, 1151 Richmond Street, London, Ontario N6A 3K7, Canada
- Baines Imaging Research Laboratory, London Regional Cancer Program, 800 Commissioners Road East, London, Ontario N6A 5W9, Canada
| | - Yiwen Xu
- Western University, Department of Medical Biophysics, 1151 Richmond Street, London, Ontario N6A 3K7, Canada
- Baines Imaging Research Laboratory, London Regional Cancer Program, 800 Commissioners Road East, London, Ontario N6A 5W9, Canada
- Western University, Robarts Research Institute, 1151 Richmond Street, London, Ontario N6A 3K7, Canada
| | - J. Geoffrey Pickering
- Western University, Department of Medical Biophysics, 1151 Richmond Street, London, Ontario N6A 3K7, Canada
- Western University, Robarts Research Institute, 1151 Richmond Street, London, Ontario N6A 3K7, Canada
| | - Aaron D. Ward
- Western University, Department of Medical Biophysics, 1151 Richmond Street, London, Ontario N6A 3K7, Canada
- Baines Imaging Research Laboratory, London Regional Cancer Program, 800 Commissioners Road East, London, Ontario N6A 5W9, Canada
| |
Collapse
|
36
|
Kao EF, Lin WC, Jaw TS, Liu GC, Wu JS, Lee CN. Automated patient identity recognition by analysis of chest radiograph features. Acad Radiol 2013; 20:1024-31. [PMID: 23830608 DOI: 10.1016/j.acra.2013.04.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2012] [Revised: 10/17/2012] [Accepted: 04/24/2013] [Indexed: 11/16/2022]
Abstract
RATIONALE AND OBJECTIVES The aim of this study was to develop a computerized scheme for automated identity recognition based on chest radiograph features. MATERIALS AND METHODS The proposed method was evaluated on a database consisting of 1000 pairs of posteroanterior chest radiographs. The method was based on six features: length of the lung field, size of the heart, area of the body, and widths of the upper, middle, and lower thoracic cage. The values for the six features were determined from a chest image, and absolute differences in feature values between the two images (feature errors) were used as indices of image similarity. The performance of the proposed method was evaluated by receiver operating characteristic (ROC) analysis. The discriminant performance was evaluated as the area Az under the ROC curve. RESULTS The discriminant performance Az of the feature errors for lung field length, heart size, body area, upper cage width, middle cage width, and lower cage width were 0.794 ± 0.005, 0.737 ± 0.007, 0.820 ± 0.008, 0.860 ± 0.005, 0.894 ± 0.006, and 0.873 ± 0.006, respectively. The combination of the six feature errors obtained an Az value of 0.963 ± 0.002. CONCLUSION The results indicate that combining the six features yields a high discriminant performance in recognizing patient identity. The method has potential usefulness for automated identity recognition to ensure that chest radiographs are associated with the correct patient.
Collapse
Affiliation(s)
- E-Fong Kao
- Department of Medical Imaging and Radiological Sciences, Kaohsiung Medical University, Kaohsiung, Taiwan.
| | | | | | | | | | | |
Collapse
|
37
|
Peng Y, Jiang Y, Eisengart L, Healy MA, Straus FH, Yang XJ. Computer-aided identification of prostatic adenocarcinoma: Segmentation of glandular structures. J Pathol Inform 2011; 2:33. [PMID: 21845231 PMCID: PMC3153693 DOI: 10.4103/2153-3539.83193] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2010] [Accepted: 04/24/2011] [Indexed: 11/24/2022] Open
Abstract
Background: Identification of individual prostatic glandular structures is an important prerequisite to quantitative histological analysis of prostate cancer with the aid of a computer. We have developed a computer method to segment individual glandular units and to extract quantitative image features, for computer identification of prostatic adenocarcinoma. Methods: Two sets of digital histology images were used: database I (n = 57) for developing and testing the computer technique, and database II (n = 116) for independent validation. The segmentation technique was based on a k-means clustering and a region-growing method. Computer segmentation results were evaluated subjectively and also compared quantitatively against manual gland outlines, using the Jaccard similarity measure. Quantitative features that were extracted from the computer segmentation results include average gland size, spatial gland density, and average gland circularity. Linear discriminant analysis (LDA) was used to combine quantitative image features. Classification performance was evaluated with receiver operating characteristic (ROC) analysis and the area under the ROC curve (AUC). Results: Jaccard similarity coefficients between computer segmentation and manual outlines of individual glands were between 0.63 and 0.72 for non-cancer and between 0.48 and 0.54 for malignant glands, respectively, similar to an interobserver agreement of 0.79 for non-cancer and 0.75 for malignant glands, respectively. The AUC value for the features of average gland size and gland density combined via LDA was 0.91 for database I and 0.96 for database II. Conclusions: Using a computer, we are able to delineate individual prostatic glands automatically and identify prostatic adenocarcinoma accurately, based on the quantitative image features extracted from computer-segmented glandular structures.
Collapse
Affiliation(s)
- Yahui Peng
- Department of Radiology, The University of Chicago, Chicago, IL 60637, USA
| | | | | | | | | | | |
Collapse
|