Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Auslander N, Gussow AB, Koonin EV. Incorporating Machine Learning into Established Bioinformatics Frameworks. Int J Mol Sci 2021;22:2903. [PMID: 33809353 PMCID: PMC8000113 DOI: 10.3390/ijms22062903] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 03/08/2021] [Accepted: 03/10/2021] [Indexed: 12/23/2022] Open

Number

Cited by Other Article(s)

Mi N, Li Z, Zhang X, Gao Y, Wang Y, Liu S, Wang S. Identification of potential immunotherapeutic targets and prognostic biomarkers in Graves' disease using weighted gene co-expression network analysis. Heliyon 2024;10:e27175. [PMID: 38468967 PMCID: PMC10926144 DOI: 10.1016/j.heliyon.2024.e27175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 12/11/2023] [Accepted: 02/26/2024] [Indexed: 03/13/2024] Open

Abstract

Graves' disease (GD) is an autoimmune disorder characterized by hyperthyroidism resulting from autoantibody-induced stimulation of the thyroid gland. Despite recent advancements in understanding GD's pathogenesis, the molecular processes driving disease progression and treatment response remain poorly understood. In this study, we aimed to identify crucial immunogenic factors associated with GD prognosis and immunotherapeutic response. To achieve this, we implemented a comprehensive screening strategy that combined computational immunogenicity-potential scoring with multi-parametric cluster analysis to assess the immunomodulatory genes in GD-related subtypes involving stromal and immune cells. Utilizing weighted gene co-expression network analysis (WGCNA), we identified co-expressed gene modules linked to cellular senescence and immune infiltration in CD4+ and CD8+ GD samples. Additionally, gene set enrichment analysis enabled the identification of hallmark pathways distinguishing high- and low-immune subtypes. Our WGCNA analysis revealed 21 gene co-expression modules comprising 1,541 genes associated with immune infiltration components in various stages of GD, including T cells, M1 and M2 macrophages, NK cells, and Tregs. These genes primarily participated in T cell proliferation through purinergic signaling pathways, particularly neuroactive ligand-receptor interactions, and DNA binding transcription factor activity. Three genes, namely PRSS1, HCRTR1, and P2RY4, exhibited robustness in GD patients across multiple stages and were involved in immune cell infiltration during the late stage of GD (p < 0.05). Importantly, HCRTR1 and P2RY4 emerged as potential prognostic signatures for predicting overall survival in high-immunocore GD patients (p < 0.05). Overall, our study provides novel insights into the molecular mechanisms driving GD progression and highlights potential key immunogens for further investigation. These findings underscore the significance of immune infiltration-related cellular senescence in GD therapy and present promising targets for the development of new immunotherapeutic strategies.

Collapse

Mokhtari M, Khoshbakht S, Ziyaei K, Akbari ME, Moravveji SS. New classifications for quantum bioinformatics: Q-bioinformatics, QCt-bioinformatics, QCg-bioinformatics, and QCr-bioinformatics. Brief Bioinform 2024;25:bbae074. [PMID: 38446742 PMCID: PMC10939336 DOI: 10.1093/bib/bbae074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 11/14/2023] [Accepted: 02/07/2021] [Indexed: 03/08/2024] Open

Chen Y, Zhang Z, Hu X, Zhang Y. Epigenetic characterization of sarcopenia-associated genes based on machine learning and network screening. Eur J Med Res 2024;29:54. [PMID: 38229116 PMCID: PMC10790491 DOI: 10.1186/s40001-023-01603-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 12/17/2023] [Indexed: 01/18/2024] Open

Rauluseviciute I, Riudavets-Puig R, Blanc-Mathieu R, Castro-Mondragon J, Ferenc K, Kumar V, Lemma RB, Lucas J, Chèneby J, Baranasic D, Khan A, Fornes O, Gundersen S, Johansen M, Hovig E, Lenhard B, Sandelin A, Wasserman W, Parcy F, Mathelier A. JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles. Nucleic Acids Res 2024;52:D174-D182. [PMID: 37962376 PMCID: PMC10767809 DOI: 10.1093/nar/gkad1059] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/20/2023] [Accepted: 10/31/2023] [Indexed: 11/15/2023] Open

Affiliation(s)

Ieva Rauluseviciute Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway
Rafael Riudavets-Puig Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway
Romain Blanc-Mathieu Laboratoire Physiologie Cellulaire et Végétale, Univ. Grenoble Alpes, CNRS, CEA, INRAE, IRIG-DBSCI-LPCV, 17 avenue des martyrs, F-38054, Grenoble, France
Jaime A Castro-Mondragon Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway
Katalin Ferenc Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway
Vipin Kumar Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway
Roza Berhanu Lemma Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway
Jérémy Lucas Laboratoire Physiologie Cellulaire et Végétale, Univ. Grenoble Alpes, CNRS, CEA, INRAE, IRIG-DBSCI-LPCV, 17 avenue des martyrs, F-38054, Grenoble, France
Jeanne Chèneby Center for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway
Damir Baranasic MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK Division of Electronics, Ruđer Bošković Institute, Bijenička cesta, 10000 Zagreb, Croatia
Aziz Khan Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
Oriol Fornes Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, BC Children's Hospital Research Institute, University of British Columbia, 950 W 28th Ave, Vancouver, BC V5Z 4H4, Canada
Sveinung Gundersen Center for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway
Morten Johansen Center for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway
Eivind Hovig Center for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, 0424 Oslo, Norway
Boris Lenhard MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
Albin Sandelin Department of Biology and Biotech Research and Innovation Centre, University of Copenhagen, Ole Maaløes Vej 5, DK2200 Copenhagen N, Denmark
Wyeth W Wasserman Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, BC Children's Hospital Research Institute, University of British Columbia, 950 W 28th Ave, Vancouver, BC V5Z 4H4, Canada
François Parcy Laboratoire Physiologie Cellulaire et Végétale, Univ. Grenoble Alpes, CNRS, CEA, INRAE, IRIG-DBSCI-LPCV, 17 avenue des martyrs, F-38054, Grenoble, France
Anthony Mathelier Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway Center for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway Department of Medical Genetics, Institute of Clinical Medicine, University of Oslo and Oslo University Hospital, Oslo, Norway

Collapse

Luo X, Wang R, Zhang X, Wen X, Deng S, Xie W. Identification CCL2,CXCR2,S100A9 of the immune-related gene markers and immune infiltration characteristics of inflammatory bowel disease and heart failure via bioinformatics analysis and machine learning. Front Cardiovasc Med 2023;10:1268675. [PMID: 38034382 PMCID: PMC10687362 DOI: 10.3389/fcvm.2023.1268675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 11/02/2023] [Indexed: 12/02/2023] Open

Abstract

Background

Recently, heart failure (HF) and inflammatory bowel disease (IBD) have been considered to be related diseases with increasing incidence rates; both diseases are related to immunity. This study aims to analyze and identify immune-related gene (IRG) markers of HF and IBD through bioinformatics and machine learning (ML) methods and to explore their immune infiltration characteristics.

Methods

This study used gene expressiondata (GSE120895, GSE21610, GSE4183) from the Gene Expression Omnibus (GEO) database to screen differentially expressed genes (DEGs) and compare them with IRGs from the ImmPort database to obtain differentially expressed immune-related genes (DIRGs). Functional enrichment analysis of IRGs was performed using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG). Subsequently, three machine models and protein-protein interactions (PPIs) were established to identify diagnostic biomarkers. The receiver operating characteristic (ROC) curves were applied to evaluate the diagnostic value of the candidate biomarkersin the validation set (GSE1145, GSE36807) and obtain their correlations with immune cells through the Spearman algorithm. Finally, the CIBERSORT algorithm was used to evaluate the immune cell infiltration of the two diseases.

Results

Thirty-four DIRGs were screened and GO and KEGG analysis results showed that these genes are mainly related to inflammatory and immune responses. CCL2, CXCR2 and S100A9 were identified as biomarkers.The immune correlation results indicated in both diseases that CCL2 is positively correlated with mast cell activation, CXCR2 is positively correlated with neutrophils and S100A9 is positively correlated with neutrophils and mast cell activation. Analysis of immune characteristics showed that macrophages M2, macrophages M0 and neutrophils were present in both diseases.

Conclusions

CCL2, CXCR2 and S100A9 are promising biomarkers that will become potential immunogenetic biomarkers for diagnosing comorbidities of HF and IBD. macrophages M2, macrophages M0, neutrophil-mediated inflammation and immune regulation play important roles in the development of HF and IBD and may become diagnostic and therapeutic targets.

Collapse

Ha AD, Aylward FO. Automated classification of giant virus genomes using a random forest model built on trademark protein families. bioRxiv 2023:2023.11.10.566645. [PMID: 38014039 PMCID: PMC10680617 DOI: 10.1101/2023.11.10.566645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]

Abstract

Viruses of the phylum Nucleocytoviricota , often referred to as "giant viruses," are prevalent in various environments around the globe and play significant roles in shaping eukaryotic diversity and activities in global ecosystems. Given the extensive phylogenetic diversity within this viral group and the highly complex composition of their genomes, taxonomic classification of giant viruses, particularly incomplete metagenome-assembled genomes (MAGs) can present a considerable challenge. Here we developed TIGTOG ( T axonomic Information of G iant viruses using T rademark O rthologous G roups), a machine learning-based approach to predict the taxonomic classification of novel giant virus MAGs based on profiles of protein family content. We applied a random forest algorithm to a training set of 1,531 quality-checked, phylogenetically diverse Nucleocytoviricota genomes using pre-selected sets of giant virus orthologous groups (GVOGs). The classification models were predictive of viral taxonomic assignments with a cross-validation accuracy of 99.6% to the order level and 97.3% to the family level. We found that no individual GVOGs or genome features significantly influenced the algorithm's performance or the models' predictions, indicating that classification predictions were based on a comprehensive genomic signature, which reduced the necessity of a fixed set of marker genes for taxonomic assigning purposes. Our classification models were validated with an independent test set of 823 giant virus genomes with varied genomic completeness and taxonomy and demonstrated an accuracy of 98.6% and 95.9% to the order and family level, respectively. Our results indicate that protein family profiles can be used to accurately classify large DNA viruses at different taxonomic levels and provide a fast and accurate method for the classification of giant viruses. This approach could easily be adapted to other viral groups.

Collapse

Gomes RAL, Zerbini FM. ConCreT, a 2D convolutional neural network for taxonomic classification applied to viruses in the phylum Cressdnaviricota. J Virol Methods 2023;320:114789. [PMID: 37536450 DOI: 10.1016/j.jviromet.2023.114789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 07/19/2023] [Accepted: 07/31/2023] [Indexed: 08/05/2023]

Tremmel R, Pirmann S, Zhou Y, Lauschke VM. Translating pharmacogenomic sequencing data into drug response predictions-How to interpret variants of unknown significance. Br J Clin Pharmacol 2023. [PMID: 37759374 DOI: 10.1111/bcp.15915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 09/20/2023] [Accepted: 09/22/2023] [Indexed: 09/29/2023] Open

Raslan MA, Raslan SA, Shehata EM, Mahmoud AS, Sabri NA. Advances in the Applications of Bioinformatics and Chemoinformatics. Pharmaceuticals (Basel) 2023;16:1050. [PMID: 37513961 PMCID: PMC10384252 DOI: 10.3390/ph16071050] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 07/19/2023] [Accepted: 07/20/2023] [Indexed: 07/30/2023] Open

Choi SR, Lee M. Transformer Architecture and Attention Mechanisms in Genome Data Analysis: A Comprehensive Review. Biology (Basel) 2023;12:1033. [PMID: 37508462 PMCID: PMC10376273 DOI: 10.3390/biology12071033] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 07/18/2023] [Accepted: 07/21/2023] [Indexed: 07/30/2023]

Aydin SG, Bilge HŞ. FPGA Implementation of Image Registration Using Accelerated CNN. Sensors (Basel) 2023;23:6590. [PMID: 37514883 PMCID: PMC10386551 DOI: 10.3390/s23146590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 07/17/2023] [Accepted: 07/19/2023] [Indexed: 07/30/2023]

Szulc NA, Mackiewicz Z, Bujnicki JM, Stefaniak F. Structural interaction fingerprints and machine learning for predicting and explaining binding of small molecule ligands to RNA. Brief Bioinform 2023;24:bbad187. [PMID: 37204195 DOI: 10.1093/bib/bbad187] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 04/07/2023] [Accepted: 04/25/2023] [Indexed: 05/20/2023] Open

Jin B, Cheng X, Fei G, Sang S, Zhong C. Identification of diagnostic biomarkers in Alzheimer's disease by integrated bioinformatic analysis and machine learning strategies. Front Aging Neurosci 2023;15:1169620. [PMID: 37434738 PMCID: PMC10331604 DOI: 10.3389/fnagi.2023.1169620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Accepted: 06/08/2023] [Indexed: 07/13/2023] Open

Abstract

Background

Alzheimer's disease (AD) is the most prevalent form of dementia, and is becoming one of the most burdening and lethal diseases. More useful biomarkers for diagnosing AD and reflecting the disease progression are in need and of significance.

Methods

The integrated bioinformatic analysis combined with machine-learning strategies was applied for exploring crucial functional pathways and identifying diagnostic biomarkers of AD. Four datasets (GSE5281, GSE131617, GSE48350, and GSE84422) with samples of AD frontal cortex are integrated as experimental datasets, and another two datasets (GSE33000 and GSE44772) with samples of AD frontal cortex were used to perform validation analyses. Functional Correlation enrichment analyses were conducted based on Gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and the Reactome database to reveal AD-associated biological functions and key pathways. Four models were employed to screen the potential diagnostic biomarkers, including one bioinformatic analysis of Weighted gene co-expression network analysis (WGCNA)and three machine-learning algorithms: Least absolute shrinkage and selection operator (LASSO), support vector machine-recursive feature elimination (SVM-RFE) and random forest (RF) analysis. The correlation analysis was performed to explore the correlation between the identified biomarkers with CDR scores and Braak staging.

Results

The pathways of the immune response and oxidative stress were identified as playing a crucial role during AD. Thioredoxin interacting protein (TXNIP), early growth response 1 (EGR1), and insulin-like growth factor binding protein 5 (IGFBP5) were screened as diagnostic markers of AD. The diagnostic efficacy of TXNIP, EGR1, and IGFBP5 was validated with corresponding AUCs of 0.857, 0.888, and 0.856 in dataset GSE33000, 0.867, 0.909, and 0.841 in dataset GSE44770. And the AUCs of the combination of these three biomarkers as a diagnostic tool for AD were 0.954 and 0.938 in the two verification datasets.

Conclusion

The pathways of immune response and oxidative stress can play a crucial role in the pathogenesis of AD. TXNIP, EGR1, and IGFBP5 are useful biomarkers for diagnosing AD and their mRNA level may reflect the development of the disease by correlation with the CDR scores and Breaking staging.

Collapse

Guzman NA, Guzman DE, Blanc T. Advancements in portable instruments based on affinity-capture-migration and affinity-capture-separation for use in clinical testing and life science applications. J Chromatogr A 2023;1704:464109. [PMID: 37315445 DOI: 10.1016/j.chroma.2023.464109] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Revised: 05/23/2023] [Accepted: 05/25/2023] [Indexed: 06/16/2023]

Hashizume T, Ozawa Y, Ying BW. Employing active learning in the optimization of culture medium for mammalian cells. NPJ Syst Biol Appl 2023;9:20. [PMID: 37253825 DOI: 10.1038/s41540-023-00284-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Accepted: 05/18/2023] [Indexed: 06/01/2023] Open

Rescifina A. Progress of the "Molecular Informatics" Section in 2022. Int J Mol Sci 2023;24:ijms24119442. [PMID: 37298393 DOI: 10.3390/ijms24119442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 05/19/2023] [Indexed: 06/12/2023] Open

Varshney N, Mishra AK. Deep Learning in Phosphoproteomics: Methods and Application in Cancer Drug Discovery. Proteomes 2023;11:proteomes11020016. [PMID: 37218921 DOI: 10.3390/proteomes11020016] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 04/24/2023] [Accepted: 04/25/2023] [Indexed: 05/24/2023] Open

Elhadary M, Elsabagh AA, Ferih K, Elsayed B, Elshoeibi AM, Kaddoura R, Akiki S, Ahmed K, Yassin M. Applications of Machine Learning in Chronic Myeloid Leukemia. Diagnostics (Basel) 2023;13:diagnostics13071330. [PMID: 37046547 PMCID: PMC10093579 DOI: 10.3390/diagnostics13071330] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Revised: 03/11/2023] [Accepted: 03/15/2023] [Indexed: 04/14/2023] Open

Sun Z, Lin J, Zhang T, Sun X, Wang T, Duan J, Yao K. Combining bioinformatics and machine learning to identify common mechanisms and biomarkers of chronic obstructive pulmonary disease and atrial fibrillation. Front Cardiovasc Med 2023;10:1121102. [PMID: 37057099 PMCID: PMC10086368 DOI: 10.3389/fcvm.2023.1121102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Accepted: 03/14/2023] [Indexed: 03/30/2023] Open

Abstract BackgroundPatients with chronic obstructive pulmonary disease (COPD) often present with atrial fibrillation (AF), but the common pathophysiological mechanisms between the two are unclear. This study aimed to investigate the common biological mechanisms of COPD and AF and to search for important biomarkers through bioinformatic analysis of public RNA sequencing databases.MethodsFour datasets of COPD and AF were downloaded from the Gene Expression Omnibus (GEO) database. The overlapping genes common to both diseases were screened by WGCNA analysis, followed by protein-protein interaction network construction and functional enrichment analysis to elucidate the common mechanisms of COPD and AF. Machine learning algorithms were also used to identify key biomarkers. Co-expression analysis, “transcription factor (TF)-mRNA-microRNA (miRNA)” regulatory networks and drug prediction were performed for key biomarkers. Finally, immune cell infiltration analysis was performed to evaluate further the immune cell changes in the COPD dataset and the correlation between key biomarkers and immune cells.ResultsA total of 133 overlapping genes for COPD and AF were obtained, and the enrichment was mainly focused on pathways associated with the inflammatory immune response. A key biomarker, cyclin dependent kinase 8 (CDK8), was identified through screening by machine learning algorithms and validated in the validation dataset. Twenty potential drugs capable of targeting CDK8 were obtained. Immune cell infiltration analysis revealed the presence of multiple immune cell dysregulation in COPD. Correlation analysis showed that CDK8 expression was significantly associated with CD8+ T cells, resting dendritic cell, macrophage M2, and monocytes.ConclusionsThis study highlights the role of the inflammatory immune response in COPD combined with AF. The prominent link between CDK8 and the inflammatory immune response and its characteristic of not affecting the basal expression level of nuclear factor kappa B (NF-kB) make it a possible promising therapeutic target for COPD combined with AF. Collapse

Patterson A, Elbasir A, Tian B, Auslander N. Computational Methods Summarizing Mutational Patterns in Cancer: Promise and Limitations for Clinical Applications. Cancers (Basel) 2023;15:cancers15071958. [PMID: 37046619 PMCID: PMC10093138 DOI: 10.3390/cancers15071958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 02/24/2023] [Accepted: 03/09/2023] [Indexed: 03/29/2023] Open

Deyneko IV. Guidelines on the performance evaluation of motif recognition methods in bioinformatics. Front Genet 2023;14:1135320. [PMID: 36824436 PMCID: PMC9941176 DOI: 10.3389/fgene.2023.1135320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 01/19/2023] [Indexed: 02/09/2023] Open

Li B, Li H, Zhang L, Ren T, Meng J. Expression analysis of human glioma susceptibility gene and P53 in human glioma and its clinical significance based on bioinformatics. Ann Transl Med 2023;11:53. [PMID: 36819578 PMCID: PMC9929792 DOI: 10.21037/atm-22-5646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Accepted: 12/07/2022] [Indexed: 01/18/2023]

Abstract

Background

The exact mechanism of glioblastoma multiforme (GBM) remains unclear. This study was to clarify the expression of P53 in glioma and its molecular mechanism, and to explore the possibility of P53 as a potential therapeutic target of glioma and its clinical application value, so as to provide a new theoretical basis for the treatment of glioma.

Methods

Firstly, a dataset was established to analyze the expression of P53 in different stages of glioma and its relationship with prognosis by using The Cancer Genome Atlas (TCGA) database, RNA-seq data, and survival data of glioma and normal control samples in gene expression profiling and interactive analysis (GEPIA). The genes co-expressed with P53 were screened out, their differential expression between glioma and normal control group was analyzed, and their functions were analyzed by enrichment analysis. The TGGA database was used for data verification and analysis. The correlation between P53 expression and clinicopathological parameters was analyzed. Kaplan-Meier survival analysis was used to analyze the relationship between P53 expression and overall survival (OS) and progression-free survival (PFS) of glioma patients, and Cox regression analysis was used to analyze the independent factors affecting OS and PFS of glioma patients.

Results

The results of TCGA data analysis were as follows: The expression level of P53 was different from that of different stages of glioma, namely, the expression level of P53 between grade II and grade III, grade III and grade IV, and grade II and grade IV were significantly different (P<0.05). The results of P53 gene-related survival analysis showed that KNL1 high expression and low expression were significantly different in OS, and the high expression group was associated with poor prognosis (P<0.05).

Conclusions

The P53 expression can be an effective biological indicator of poor prognosis of glioma.

Collapse

Lu Z, Xu J, Cao B, Jin C. Screening and identification of susceptibility genes for osteosarcoma based on bioinformatics analysis. Ann Transl Med 2023;11:87. [PMID: 36819543 PMCID: PMC9929789 DOI: 10.21037/atm-22-6369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 01/10/2023] [Indexed: 01/30/2023]

Feng H, Jin D, Li J, Li Y, Zou Q, Liu T. Matrix reconstruction with reliable neighbors for predicting potential MiRNA-disease associations. Brief Bioinform 2023;24:6960615. [PMID: 36567252 DOI: 10.1093/bib/bbac571] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 10/16/2022] [Accepted: 11/23/2022] [Indexed: 12/27/2022] Open

Tu DY, Cao J, Zhou J, Su BB, Wang SY, Jiang GQ, Jin SJ, Zhang C, Peng R, Bai DS. Identification of the mitophagy-related diagnostic biomarkers in hepatocellular carcinoma based on machine learning algorithm and construction of prognostic model. Front Oncol 2023;13:1132559. [PMID: 36937391 PMCID: PMC10014545 DOI: 10.3389/fonc.2023.1132559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Accepted: 02/15/2023] [Indexed: 03/05/2023] Open

Abstract

Background and aims

As a result of increasing numbers of studies most recently, mitophagy plays a vital function in the genesis of cancer. However, research on the predictive potential and clinical importance of mitophagy-related genes (MRGs) in hepatocellular carcinoma (HCC) is currently lacking. This study aimed to uncover and analyze the mitophagy-related diagnostic biomarkers in HCC using machine learning (ML), as well as to investigate its biological role, immune infiltration, and clinical significance.

Methods

In our research, by using Least absolute shrinkage and selection operator (LASSO) regression and support vector machine- (SVM-) recursive feature elimination (RFE) algorithm, six mitophagy genes (ATG12, CSNK2B, MTERF3, TOMM20, TOMM22, and TOMM40) were identified from twenty-nine mitophagy genes, next, the algorithm of non-negative matrix factorization (NMF) was used to separate the HCC patients into cluster A and B based on the six mitophagy genes. And there was evidence from multi-analysis that cluster A and B were associated with tumor immune microenvironment (TIME), clinicopathological features, and prognosis. After then, based on the DEGs (differentially expressed genes) between cluster A and cluster B, the prognostic model (riskScore) of mitophagy was constructed, including ten mitophagy-related genes (G6PD, KIF20A, SLC1A5, TPX2, ANXA10, TRNP1, ADH4, CYP2C9, CFHR3, and SPP1).

Results

This study uncovered and analyzed the mitophagy-related diagnostic biomarkers in HCC using machine learning (ML), as well as to investigate its biological role, immune infiltration, and clinical significance. Based on the mitophagy-related diagnostic biomarkers, we constructed a prognostic model(riskScore). Furthermore, we discovered that the riskScore was associated with somatic mutation, TIME, chemotherapy efficacy, TACE and immunotherapy effectiveness in HCC patients.

Conclusion

Mitophagy may play an important role in the development of HCC, and further research on this issue is necessary. Furthermore, the riskScore performed well as a standalone prognostic marker in terms of accuracy and stability. It can provide some guidance for the diagnosis and treatment of HCC patients.

Collapse

Chen ZF, Wu LZ, Chen ZT, Su LJ, Fu CJ. The potential mechanisms of neuroblastoma in children based on bioinformatics big data. Transl Pediatr 2022;11:1908-1919. [PMID: 36643678 PMCID: PMC9834953 DOI: 10.21037/tp-22-504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 11/29/2022] [Indexed: 12/15/2022] Open

Moris D, Henao R, Hensman H, Stempora L, Chasse S, Schobel S, Dente CJ, Kirk AD, Elster E. Multidimensional machine learning models predicting outcomes after trauma. Surgery 2022;172:1851-1859. [PMID: 36116976 DOI: 10.1016/j.surg.2022.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 08/01/2022] [Accepted: 08/04/2022] [Indexed: 01/07/2023]

Abstract

BACKGROUND

An emerging body of literature supports the role of individualized prognostic tools to guide the management of patients after trauma. The aim of this study was to develop advanced modeling tools from multidimensional data sources, including immunological analytes and clinical and administrative data, to predict outcomes in trauma patients.

METHODS

This was a prospective study of trauma patients at Level 1 centers from 2015 to 2019. Clinical, flow cytometry, and serum cytokine data were collected within 48 hours of admission. Sparse logistic regression models were developed, jointly selecting predictors and estimating the risk of ventilator-associated pneumonia, acute kidney injury, complicated disposition (death, rehabilitation, or nursing facility), and return to the operating room. Model parameters (regularization controlling model sparsity) and performance estimation were obtained via nested leave-one-out cross-validation.

RESULTS

A total of 179 patients were included. The incidences of ventilator-associated pneumonia, acute kidney injury, complicated disposition, and return to the operating room were 17.7%, 28.8%, 22.5%, and 12.3%, respectively. Regarding extensive resource use, 30.7% of patients had prolonged intensive care unit stay, 73.2% had prolonged length of stay, and 23.5% had need for prolonged ventilatory support. The models were developed and cross-validated for ventilator-associated pneumonia, acute kidney injury, complicated dispositions, and return to the operating room, yielding predictive areas under the curve from 0.70 to 0.91. Each model derived its optimal predictive value by combining clinical, administrative, and immunological analyte data.

CONCLUSION

Clinical, immunological, and administrative data can be combined to predict post-traumatic outcomes and resource use. Multidimensional machine learning modeling can identify trauma patients with complicated clinical trajectories and high resource needs.

Collapse

Ge S, Xu C, Li Y, Zhang Y, Li N, Wang F, Ding L, Niu J, Shi Z. Identification of the Diagnostic Biomarker VIPR1 in Hepatocellular Carcinoma Based on Machine Learning Algorithm. Journal of Oncology 2022;2022:1-13. [PMID: 36157238 PMCID: PMC9499748 DOI: 10.1155/2022/2469592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Revised: 08/21/2022] [Accepted: 08/23/2022] [Indexed: 12/24/2022]

Kang M, Oh JH. Editorial of Special Issue "Deep Learning and Machine Learning in Bioinformatics". Int J Mol Sci 2022;23:ijms23126610. [PMID: 35743052 PMCID: PMC9224509 DOI: 10.3390/ijms23126610] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 06/10/2022] [Indexed: 02/04/2023] Open

Sokhansanj BA, Rosen GL. Mapping Data to Deep Understanding: Making the Most of the Deluge of SARS-CoV-2 Genome Sequences. mSystems 2022;7:e0003522. [PMID: 35311562 PMCID: PMC9040592 DOI: 10.1128/msystems.00035-22] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/27/2022] [Indexed: 12/22/2022] Open

Kieft K, Anantharaman K. Virus genomics: what is being overlooked? Curr Opin Virol 2022;53:101200. [DOI: 10.1016/j.coviro.2022.101200] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 12/21/2021] [Accepted: 01/03/2022] [Indexed: 01/05/2023]

Khalili E, Ramazi S, Ghanati F, Kouchaki S. Predicting protein phosphorylation sites in soybean using interpretable deep tabular learning network. Brief Bioinform 2022;23:bbac015. [PMID: 35152280 DOI: 10.1093/bib/bbac015] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 12/17/2021] [Accepted: 01/12/2022] [Indexed: 12/17/2023] Open

Abstract

Phosphorylation of proteins is one of the most significant post-translational modifications (PTMs) and plays a crucial role in plant functionality due to its impact on signaling, gene expression, enzyme kinetics, protein stability and interactions. Accurate prediction of plant phosphorylation sites (p-sites) is vital as abnormal regulation of phosphorylation usually leads to plant diseases. However, current experimental methods for PTM prediction suffers from high-computational cost and are error-prone. The present study develops machine learning-based prediction techniques, including a high-performance interpretable deep tabular learning network (TabNet) to improve the prediction of protein p-sites in soybean. Moreover, we use a hybrid feature set of sequential-based features, physicochemical properties and position-specific scoring matrices to predict serine (Ser/S), threonine (Thr/T) and tyrosine (Tyr/Y) p-sites in soybean for the first time. The experimentally verified p-sites data of soybean proteins are collected from the eukaryotic phosphorylation sites database and database post-translational modification. We then remove the redundant set of positive and negative samples by dropping protein sequences with >40% similarity. It is found that the developed techniques perform >70% in terms of accuracy. The results demonstrate that the TabNet model is the best performing classifier using hybrid features and with window size of 13, resulted in 78.96 and 77.24% sensitivity and specificity, respectively. The results indicate that the TabNet method has advantages in terms of high-performance and interpretability. The proposed technique can automatically analyze the data without any measurement errors and any human intervention. Furthermore, it can be used to predict putative protein p-sites in plants effectively. The collected dataset and source code are publicly deposited at https://github.com/Elham-khalili/Soybean-P-sites-Prediction.

Collapse

Liu Z, Han N, Su T, Ji Y, Bao H, Zhou S, Luo S, Wang H, Liu J, Wang HJ. Interpretable machine learning to identify important predictors of birth weight: A prospective cohort study. Front Pediatr 2022;10:899954. [PMID: 36440327 PMCID: PMC9691849 DOI: 10.3389/fped.2022.899954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 10/24/2022] [Indexed: 11/29/2022] Open

Abstract

BACKGROUND

Predicting birth weight and identifying its risk factors are clinically important. This study aims to use interpretable machine learning to predict birth weight and identity important predictors.

METHODS

This prospective cohort study was conducted in Tongzhou Maternal and Child Health Care Hospital of Beijing, China, recruiting pregnant women between June 2018 and February 2019. We used 24 features to predict infant birth weight, including gestational age, mother's age, parity, history of macrosomia delivery, pre-pregnancy body mass index (BMI), height, father's BMI, lifestyle (diet, physical activity, smoking), and biomarker (fasting glucose and lipids) features. Study outcome was birth weight of infant. We used 8 supervised learning models including 4 individual [linear regression, ridge regression, lasso regression, support vector machines regression (SVR)], and 4 ensemble estimators (random forest, AdaBoost, gradient boosted trees, and voting ensemble for regression) to predict birth weight. Model accuracy was measured by root mean squared error (RMSE) of 10-fold cross validation on the training set and RMSE of prediction on the test set. We used permutation importance algorithm to understand the prediction from the models and what affected them.

RESULT

This study included 4,754 mother-child dyads. RMSEs were lower in voting ensemble for regression, linear regression, and SVR than random forest, AdaBoost, and gradient boosted tree. The 5 most important predictors for infant birth weight were gestational age, fetal sex, preterm birth, mother's height, and pre-pregnancy BMI. After adding ultrasound-measured indicators of fetal growth into predictors, mother's height and pre-pregnancy BMI remained the most important predictors in predicting the outcome.

CONCLUSION

Mother's height and pre-pregnancy BMI were identified as important predictors for infant birth weight. Interpretable machine learning is a promising tool in the prediction of birth weight.

Collapse

Hammad A, Elshaer M, Tang X. Identification of potential biomarkers with colorectal cancer based on bioinformatics analysis and machine learning. Math Biosci Eng 2021;18:8997-9015. [PMID: 34814332 DOI: 10.3934/mbe.2021443] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Abstract

Colorectal cancer (CRC) is one of the most common malignancies worldwide. Biomarker discovery is critical to improve CRC diagnosis, however, machine learning offers a new platform to study the etiology of CRC for this purpose. Therefore, the current study aimed to perform an integrated bioinformatics and machine learning analyses to explore novel biomarkers for CRC prognosis. In this study, we acquired gene expression microarray data from Gene Expression Omnibus (GEO) database. The microarray expressions GSE103512 dataset was downloaded and integrated. Subsequently, differentially expressed genes (DEGs) were identified and functionally analyzed via Gene Ontology (GO) and Kyoto Enrichment of Genes and Genomes (KEGG). Furthermore, protein protein interaction (PPI) network analysis was conducted using the STRING database and Cytoscape software to identify hub genes; however, the hub genes were subjected to Support Vector Machine (SVM), Receiver operating characteristic curve (ROC) and survival analyses to explore their diagnostic values. Meanwhile, TCGA transcriptomics data in Gene Expression Profiling Interactive Analysis (GEPIA) database and the pathology data presented by in the human protein atlas (HPA) database were used to verify our transcriptomic analyses. A total of 105 DEGs were identified in this study. Functional enrichment analysis showed that these genes were significantly enriched in biological processes related to cancer progression. Thereafter, PPI network explored a total of 10 significant hub genes. The ROC curve was used to predict the potential application of biomarkers in CRC diagnosis, with an area under ROC curve (AUC) of these genes exceeding 0.92 suggesting that this risk classifier can discriminate between CRC patients and normal controls. Moreover, the prognostic values of these hub genes were confirmed by survival analyses using different CRC patient cohorts. Our results demonstrated that these 10 differentially expressed hub genes could be used as potential biomarkers for CRC diagnosis.

Collapse

Lecca P. Machine Learning for Causal Inference in Biological Networks: Perspectives of This Challenge. Front Bioinform 2021;1:746712. [PMID: 36303798 PMCID: PMC9581010 DOI: 10.3389/fbinf.2021.746712] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Accepted: 09/08/2021] [Indexed: 11/13/2022] Open