1
|
Tamkini M, Nourbakhsh M, Movahedi M, Golestani A. Exploring genetic signatures of obesity: hub genes and miRNAs unveiled through comprehensive bioinformatic analysis. J Diabetes Metab Disord 2024; 23:2225-2232. [PMID: 39610518 PMCID: PMC11599662 DOI: 10.1007/s40200-024-01490-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/13/2024] [Accepted: 08/14/2024] [Indexed: 11/30/2024]
Abstract
Objectives Adipogenesis, the process of fat accumulation in adipose tissue, is closely linked to obesity, a condition characterized by excessive fat storage. Genetic factors significantly contribute to an individual's susceptibility to adipogenesis and the development of obesity. Methods In this study, we conducted a comprehensive bioinformatic analysis, including Weighted Gene Co-expression Analysis, differentially expressed gene analysis, and protein-protein interaction analysis, to identify hub genes and miRNAs associated with obesity. Results Our findings highlight the potential involvement of genes such as ATP5F1A, FN1, CCl2, RPS14, and RPS16, as well as miRNAs including hsa-miR-6844, hsa-miR-4528, hsa-miR-3686, hsa-miR-3124-3p, hsa-miR-381-3p, and hsa-miR-300 in obesity. Conclusions The findings from this study contribute to the growing knowledge of adipogenesis and obesity genetics, and provide potential biomarkers for further investigation and translation into clinical or research applications. Graphical abstract Supplementary Information The online version contains supplementary material available at 10.1007/s40200-024-01490-8.
Collapse
Affiliation(s)
- Mahdieh Tamkini
- Department of Biochemistry, Faculty of Biological Sciences, North Tehran Branch, Islamic Azad University, Tehran, Iran
| | - Mitra Nourbakhsh
- Finetech in Medicine Research Center, Iran University of Medical Sciences, Tehran, Iran
- Department of Clinical Biochemistry, School of Medicine, Iran University of Medical Sciences, Tehran, Iran
| | - Monireh Movahedi
- Department of Biochemistry, Faculty of Biological Sciences, North Tehran Branch, Islamic Azad University, Tehran, Iran
| | - Abolfazl Golestani
- Department of Biochemistry, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
2
|
Zabihi MR, Moradi Z, Safari N, Salehi Z, Kavousi K. Revealing disease subtypes and heterogeneity in common variable immunodeficiency through transcriptomic analysis. Sci Rep 2024; 14:23899. [PMID: 39396099 PMCID: PMC11470955 DOI: 10.1038/s41598-024-74728-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Accepted: 09/27/2024] [Indexed: 10/14/2024] Open
Abstract
Common Variable Immunodeficiency (CVID) is a primary immunodeficiency characterized by reduced levels of specific immunoglobulins, resulting in frequent infections, autoimmune disorders, increased cancer risk, and diminished antibody production despite an adequate B cell count. With its clinical manifestations being highly variable, the classification of CVID, including the widely recognized Freiburg classification, is primarily based on clinical symptoms and genetic variations. Our study aims to refine the classification of CVID by analyzing transcriptomics data to identify distinct disease subtypes. We utilized the GSE51405 dataset, examining transcriptomic profiles from 30 CVID patients without complications. Employing a combination of clustering techniques-KMeans, hierarchical agglomerative clustering, spectral clustering, and Gaussian Mixture models-and differential gene expression analysis with R's limma package, we integrated molecular findings with demographic data (age and gender) through correlation analysis and identified common genes among clusters. Three distinct clusters of CVID patients were identified using KMeans, Agglomerative Clustering, and Gaussian Mixture Models, highlighting the disease's heterogeneity. Differential expression analysis unveiled 31 genes with variable expression levels across these clusters. Notably, nine genes (EIF5A, RPL21, ANP32A, DTX3L, NCF2, CDC42EP3, CHP1, FOLR3, and DEFA4) exhibited consistent differential expression across all clusters, independent of demographic factors. The study recommends categorizing patients based on the four genes, NCF2, CHP1, FOLR3, and DEFA4-as they may assist in prognostic prediction. Transcriptomic analysis of common variable immunodeficiency (CVID) patients identified three distinct clusters based on gene expression, independent of age and gender. Nine differentially expressed genes were identified across these clusters, suggesting potential biomarkers for CVID subtype classification. These findings highlight the genetic heterogeneity of CVID and provide novel insights into disease classification and potential personalized treatment approaches.
Collapse
Affiliation(s)
- Mohammad Reza Zabihi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| | - Zahra Moradi
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Nima Safari
- School of Medicine, Islamic Azad University, Tehran Medical Branch, Tehran, Iran
| | - Zahra Salehi
- Hematology, Oncology and Stem Cell Transplantation Research Center, Research Institute for Oncology, Hematology and Cell Therapy, Tehran University of Medical Sciences, Tehran, Iran.
| | - Kaveh Kavousi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran.
| |
Collapse
|
3
|
Corell-Sierra J, Marquez-Molins J, Marqués MC, Hernandez-Azurdia AG, Montagud-Martínez R, Cebriá-Mendoza M, Cuevas JM, Albert E, Navarro D, Rodrigo G, Gómez G. SARS-CoV-2 remodels the landscape of small non-coding RNAs with infection time and symptom severity. NPJ Syst Biol Appl 2024; 10:41. [PMID: 38632240 PMCID: PMC11024147 DOI: 10.1038/s41540-024-00367-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 04/08/2024] [Indexed: 04/19/2024] Open
Abstract
The COVID-19 pandemic caused by the coronavirus SARS-CoV-2 has significantly impacted global health, stressing the necessity of basic understanding of the host response to this viral infection. In this study, we investigated how SARS-CoV-2 remodels the landscape of small non-coding RNAs (sncRNA) from a large collection of nasopharyngeal swab samples taken at various time points from patients with distinct symptom severity. High-throughput RNA sequencing analysis revealed a global alteration of the sncRNA landscape, with abundance peaks related to species of 21-23 and 32-33 nucleotides. Host-derived sncRNAs, including microRNAs (miRNAs), transfer RNA-derived small RNAs (tsRNAs), and small nucleolar RNA-derived small RNAs (sdRNAs) exhibited significant differential expression in infected patients compared to controls. Importantly, miRNA expression was predominantly down-regulated in response to SARS-CoV-2 infection, especially in patients with severe symptoms. Furthermore, we identified specific tsRNAs derived from Glu- and Gly-tRNAs as major altered elements upon infection, with 5' tRNA halves being the most abundant species and suggesting their potential as biomarkers for viral presence and disease severity prediction. Additionally, down-regulation of C/D-box sdRNAs and altered expression of tinyRNAs (tyRNAs) were observed in infected patients. These findings provide valuable insights into the host sncRNA response to SARS-CoV-2 infection and may contribute to the development of further diagnostic and therapeutic strategies in the clinic.
Collapse
Affiliation(s)
- Julia Corell-Sierra
- Institute for Integrative Systems Biology (I2SysBio), CSIC - University of Valencia, 46980, Paterna, Spain
| | - Joan Marquez-Molins
- Institute for Integrative Systems Biology (I2SysBio), CSIC - University of Valencia, 46980, Paterna, Spain
- Department of Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Sciences and Linnean Center for Plant Biology, Uppsala, Sweden
| | - María-Carmen Marqués
- Institute for Integrative Systems Biology (I2SysBio), CSIC - University of Valencia, 46980, Paterna, Spain
| | | | - Roser Montagud-Martínez
- Institute for Integrative Systems Biology (I2SysBio), CSIC - University of Valencia, 46980, Paterna, Spain
| | - María Cebriá-Mendoza
- Institute for Integrative Systems Biology (I2SysBio), CSIC - University of Valencia, 46980, Paterna, Spain
| | - José M Cuevas
- Institute for Integrative Systems Biology (I2SysBio), CSIC - University of Valencia, 46980, Paterna, Spain
| | - Eliseo Albert
- Microbiology Service, Clinic University Hospital, INCLIVA Biomedical Research Institute, 46010, Valencia, Spain
| | - David Navarro
- Microbiology Service, Clinic University Hospital, INCLIVA Biomedical Research Institute, 46010, Valencia, Spain
- Department of Microbiology, School of Medicine, University of Valencia, 46010, Valencia, Spain
| | - Guillermo Rodrigo
- Institute for Integrative Systems Biology (I2SysBio), CSIC - University of Valencia, 46980, Paterna, Spain.
| | - Gustavo Gómez
- Institute for Integrative Systems Biology (I2SysBio), CSIC - University of Valencia, 46980, Paterna, Spain.
| |
Collapse
|
4
|
Cunningham CL, Frye CJ, Makowski JA, Kensinger AH, Shine M, Milback EJ, Lackey PE, Evanseck JD, Mihailescu MR. Effect of the SARS-CoV-2 Delta-associated G15U mutation on the s2m element dimerization and its interactions with miR-1307-3p. RNA (NEW YORK, N.Y.) 2023; 29:1754-1771. [PMID: 37604684 PMCID: PMC10578481 DOI: 10.1261/rna.079627.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 07/30/2023] [Indexed: 08/23/2023]
Abstract
The s2m, a highly conserved 41-nt hairpin structure in the SARS-CoV-2 genome, serves as an attractive therapeutic target that may have important roles in the virus life cycle or interactions with the host. However, the conserved s2m in Delta SARS-CoV-2, a previously dominant variant characterized by high infectivity and disease severity, has received relatively less attention than that of the original SARS-CoV-2 virus. The focus of this work is to identify and define the s2m changes between Delta and SARS-CoV-2 and the subsequent impact of those changes upon the s2m dimerization and interactions with the host microRNA miR-1307-3p. Bioinformatics analysis of the GISAID database targeting the s2m element reveals a >99% correlation of a single nucleotide mutation at the 15th position (G15U) in Delta SARS-CoV-2. Based on 1H NMR spectroscopy assignments comparing the imino proton resonance region of s2m and the s2m G15U at 19°C, we show that the U15-A29 base pair closes, resulting in a stabilization of the upper stem without overall secondary structure deviation. Increased stability of the upper stem did not affect the chaperone activity of the viral N protein, as it was still able to convert the kissing dimers formed by s2m G15U into a stable duplex conformation, consistent with the s2m reference. However, we show that the s2m G15U mutation drastically impacts the binding of host miR-1307-3p. These findings demonstrate that the observed G15U mutation alters the secondary structure of s2m with subsequent impact on viral binding of host miR-1307-3p, with potential consequences on immune responses.
Collapse
Affiliation(s)
- Caylee L Cunningham
- Department of Chemistry and Biochemistry, Duquesne University, Pittsburgh, Pennsylvania 15282, USA
| | - Caleb J Frye
- Department of Chemistry and Biochemistry, Duquesne University, Pittsburgh, Pennsylvania 15282, USA
| | - Joseph A Makowski
- Department of Chemistry and Biochemistry, Duquesne University, Pittsburgh, Pennsylvania 15282, USA
| | - Adam H Kensinger
- Department of Chemistry and Biochemistry, Duquesne University, Pittsburgh, Pennsylvania 15282, USA
| | - Morgan Shine
- Department of Biochemistry and Chemistry, Westminster College, New Wilmington, Pennsylvania 16172, USA
| | - Ella J Milback
- Department of Chemistry and Biochemistry, Duquesne University, Pittsburgh, Pennsylvania 15282, USA
| | - Patrick E Lackey
- Department of Biochemistry and Chemistry, Westminster College, New Wilmington, Pennsylvania 16172, USA
| | - Jeffrey D Evanseck
- Department of Chemistry and Biochemistry, Duquesne University, Pittsburgh, Pennsylvania 15282, USA
| | - Mihaela-Rita Mihailescu
- Department of Chemistry and Biochemistry, Duquesne University, Pittsburgh, Pennsylvania 15282, USA
| |
Collapse
|
5
|
Zarei Ghobadi M, Afsaneh E, Emamzadeh R. Gene biomarkers and classifiers for various subtypes of HTLV-1-caused ATLL cancer identified by a combination of differential gene co‑expression and support vector machine algorithms. Med Microbiol Immunol 2023:10.1007/s00430-023-00767-8. [PMID: 37222763 DOI: 10.1007/s00430-023-00767-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 05/12/2023] [Indexed: 05/25/2023]
Abstract
Adult T-cell leukemia/lymphoma (ATLL) is pathogen-caused cancer that is progressed after the infection by human T-cell leukemia virus type 1. Four significant subtypes comprising acute, lymphoma, chronic, and smoldering have been identified for this cancer. However, there are no trustworthy prognostic biomarkers for these subtypes. We utilized a combination of two powerful network-based and machine-learning algorithms including differential co-expressed genes (DiffCoEx) and support vector machine-recursive feature elimination with cross-validation (SVM-RFECV) methods to categorize disparate ATLL subtypes from asymptomatic carriers (ACs). The results disclosed the significant involvement of CBX6, CNKSR1, and MAX in chronic, MYH10 and P2RY1 in acute, C22orf46 and HNRNPA0 in smoldering subtypes. These genes also can classify each ATLL subtype from AC carriers. The integration of the results of two powerful algorithms led to the identification of reliable gene classifiers and biomarkers for diverse ATLL subtypes.
Collapse
Affiliation(s)
- Mohadeseh Zarei Ghobadi
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| | | | - Rahman Emamzadeh
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| |
Collapse
|
6
|
Ghobadi MZ, Afsaneh E, Emamzadeh R, Soroush M. Potential miRNA-gene interactions determining progression of various ATLL cancer subtypes after infection by HTLV-1 oncovirus. BMC Med Genomics 2023; 16:62. [PMID: 36978083 PMCID: PMC10045051 DOI: 10.1186/s12920-023-01492-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 03/17/2023] [Indexed: 03/30/2023] Open
Abstract
BACKGROUND Adult T-cell Leukemia/Lymphoma (ATLL) is a rapidly progressing type of T-cell non-Hodgkin lymphoma that is developed after the infection by human T-cell leukemia virus type 1 (HTLV-1). It could be categorized into four major subtypes, acute, lymphoma, chronic, and smoldering. These different subtypes have some shared clinical manifestations, and there are no trustworthy biomarkers for diagnosis of them. METHODS We applied weighted-gene co-expression network analysis to find the potential gene and miRNA biomarkers for various ATLL subtypes. Afterward, we found reliable miRNA-gene interactions by identifying the experimentally validated-target genes of miRNAs. RESULTS The outcomes disclosed the interactions of miR-29b-2-5p and miR-342-3p with LSAMP in ATLL_acute, miR-575 with UBN2, miR-342-3p with ZNF280B, and miR-342-5p with FOXRED2 in ATLL_chronic, miR-940 and miR-423-3p with C6orf141, miR-940 and miR-1225-3p with CDCP1, and miR-324-3p with COL14A1 in ATLL_smoldering. These miRNA-gene interactions determine the molecular factors involved in the pathogenesis of each ATLL subtype and the unique ones could be considered biomarkers. CONCLUSION The above-mentioned miRNAs-genes interactions are suggested as diagnostic biomarkers for different ATLL subtypes.
Collapse
Affiliation(s)
- Mohadeseh Zarei Ghobadi
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| | | | - Rahman Emamzadeh
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| | - Mona Soroush
- Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| |
Collapse
|
7
|
Shafiee A, Rezaian S, Aliyu M, Shayeghpour A, Mokhames Z, Mohammadi H, Yaslianifard S, Soleimani A, Soleimanifar F, Tojari T, Qorbani M, Mozhgani SH. Immunologic Profile of Severe COVID-19 Patients in Alborz Province, Iran. Jundishapur J Microbiol 2023. [DOI: 10.5812/jjm-134264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023] Open
Abstract
Background: The coronavirus disease 2019 (COVID-19) pandemic has prompted researchers to look for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pathogenicity in depth. Immune system dysregulation was one of the major mechanisms in its pathogenesis. The evidence regarding the levels of interferons (IFNs) and pro- and anti-inflammatory cytokines in COVID-19 patients is not well-established. Objectives: Therefore, this study evaluated the expression level of type-I, II, III IFNs, along with interleukin-1 (IL-1), interleukin-6 (IL-6), interleukin-10 (IL-10), and FOXP3 genes in patients with severe COVID-19 to provide additional insights regarding the regulation of these cytokines during COVID-19 infection. Methods: Peripheral blood mononuclear cells were isolated from two groups, including severe COVID-19 patients and healthy controls. Ribonucleic acid was extracted to evaluate the expression level of IFN-a, IFN-b, IFN-g, IFN-la, IL-1, IL-6, IL-10, and FOXP3 genes using real-time polymerase chain reaction. The correlations between the expression levels of these genes were also assessed. Results: A total of 40 samples were divided into two groups, with each group consisting of 20 samples. When comparing the severe COVID-19 group to the controls, the expression levels of IFN-g, tumor necrosis factor-alpha (TNF-α), IL-6, and IL-10 genes were significantly higher in the severe COVID-19 group. The two groups had no significant differences in IFN-a, IFN-b, IFN-la, IL-1, and FOXP3 expression. The correlation analysis revealed a negative correlation between type I and type III IFNs (i.e., IFN-a and IFN-la) and pro-inflammatory cytokines (i.e., IL-1 and IL-10). Conclusions: This study suggests the possible upregulation of IFN-g, IL-6, IL-10, and TNF-α during SARS-CoV-2 pathogenicity. The preliminary findings of this study and those reported previously show that the levels of IFNs and pro- and anti-inflammatory cytokines are not uniformly expressed among all COVID-19 patients and might differ as the disease progresses to the severe stage.
Collapse
|
8
|
Classification of COVID-19 Patients into Clinically Relevant Subsets by a Novel Machine Learning Pipeline Using Transcriptomic Features. Int J Mol Sci 2023; 24:ijms24054905. [PMID: 36902333 PMCID: PMC10002748 DOI: 10.3390/ijms24054905] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Revised: 02/24/2023] [Accepted: 03/01/2023] [Indexed: 03/06/2023] Open
Abstract
The persistent impact of the COVID-19 pandemic and heterogeneity in disease manifestations point to a need for innovative approaches to identify drivers of immune pathology and predict whether infected patients will present with mild/moderate or severe disease. We have developed a novel iterative machine learning pipeline that utilizes gene enrichment profiles from blood transcriptome data to stratify COVID-19 patients based on disease severity and differentiate severe COVID cases from other patients with acute hypoxic respiratory failure. The pattern of gene module enrichment in COVID-19 patients overall reflected broad cellular expansion and metabolic dysfunction, whereas increased neutrophils, activated B cells, T-cell lymphopenia, and proinflammatory cytokine production were specific to severe COVID patients. Using this pipeline, we also identified small blood gene signatures indicative of COVID-19 diagnosis and severity that could be used as biomarker panels in the clinical setting.
Collapse
|
9
|
Shayeghpour A, Forghani-Ramandi MM, Solouki S, Hosseini A, Hosseini P, Khodayar S, Hasani M, Aghajanian S, Siami Z, Zarei Ghobadi M, Mozhgani SH. Identification of novel miRNAs potentially involved in the pathogenesis of adult T-cell leukemia/lymphoma using WGCNA followed by RT-qPCR test of hub genes. Infect Agent Cancer 2023; 18:12. [PMID: 36841815 PMCID: PMC9968414 DOI: 10.1186/s13027-023-00492-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Accepted: 02/17/2023] [Indexed: 02/27/2023] Open
Abstract
BACKGROUND Adult T-cell Lymphoma/Leukemia (ATLL) is characterized by the malignant proliferation of T-cells in Human T-Lymphotropic Virus Type 1 and a high mortality rate. Considering the emerging roles of microRNAs (miRNAs) in various malignancies, the analysis of high-throughput miRNA data employing computational algorithms helps to identify potential biomarkers. METHODS Weighted gene co-expression network analysis was utilized to analyze miRNA microarray data from ATLL and healthy uninfected samples. To identify miRNAs involved in the progression of ATLL, module preservation analysis was used. Subsequently, based on the target genes of the identified miRNAs, the STRING database was employed to construct protein-protein interaction networks (PPIN). Real-time quantitative PCR was also performed to validate the expression of identified hub genes in the PPIN network. RESULTS After constructing co-expression modules and then performing module preservation analysis, four out of 15 modules were determined as ATLL-specific modules. Next, the hub miRNA including hsa-miR-18a-3p, has-miR-187-5p, hsa-miR-196a-3p, and hsa-miR-346 were found as hub miRNAs. The protein-protein interaction networks were constructed for the target genes of each hub miRNA and hub genes were identified. Among them, UBB, RPS15A, and KMT2D were validated by Reverse-transcriptase PCR in ATLL patients. CONCLUSION The results of the network analysis of miRNAs and their target genes revealed the major players in the pathogenesis of ATLL. Further studies are required to confirm the role of these molecular factors and to discover their potential benefits as treatment targets and diagnostic biomarkers.
Collapse
Affiliation(s)
- Ali Shayeghpour
- grid.411705.60000 0001 0166 0922School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | | | - Setayesh Solouki
- grid.411705.60000 0001 0166 0922School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Amin Hosseini
- Department of Computer, Faculty of Engineering, Raja University, Qazvin, Iran
| | - Parastoo Hosseini
- grid.411705.60000 0001 0166 0922Department of Virology, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran ,grid.411705.60000 0001 0166 0922Research Center for Clinical Virology, Tehran University of Medical Sciences, Tehran, Iran
| | - Sara Khodayar
- grid.411705.60000 0001 0166 0922Department of Microbiology, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Mahsa Hasani
- grid.411705.60000 0001 0166 0922School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Sepehr Aghajanian
- grid.411705.60000 0001 0166 0922School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Zeinab Siami
- grid.411705.60000 0001 0166 0922Department of Infectious Diseases, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | | | - Sayed-Hamidreza Mozhgani
- Department of Microbiology, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran. .,Non-Communicable Disease Research Center, Alborz University of Medical Sciences, Karaj, Iran.
| |
Collapse
|
10
|
Huang AA, Huang SY. Increasing transparency in machine learning through bootstrap simulation and shapely additive explanations. PLoS One 2023; 18:e0281922. [PMID: 36821544 PMCID: PMC9949629 DOI: 10.1371/journal.pone.0281922] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 02/05/2023] [Indexed: 02/24/2023] Open
Abstract
Machine learning methods are widely used within the medical field. However, the reliability and efficacy of these models is difficult to assess, making it difficult for researchers to identify which machine-learning model to apply to their dataset. We assessed whether variance calculations of model metrics (e.g., AUROC, Sensitivity, Specificity) through bootstrap simulation and SHapely Additive exPlanations (SHAP) could increase model transparency and improve model selection. Data from the England National Health Services Heart Disease Prediction Cohort was used. After comparison of model metrics for XGBoost, Random Forest, Artificial Neural Network, and Adaptive Boosting, XGBoost was used as the machine-learning model of choice in this study. Boost-strap simulation (N = 10,000) was used to empirically derive the distribution of model metrics and covariate Gain statistics. SHapely Additive exPlanations (SHAP) to provide explanations to machine-learning output and simulation to evaluate the variance of model accuracy metrics. For the XGBoost modeling method, we observed (through 10,000 completed simulations) that the AUROC ranged from 0.771 to 0.947, a difference of 0.176, the balanced accuracy ranged from 0.688 to 0.894, a 0.205 difference, the sensitivity ranged from 0.632 to 0.939, a 0.307 difference, and the specificity ranged from 0.595 to 0.944, a 0.394 difference. Among 10,000 simulations completed, we observed that the gain for Angina ranged from 0.225 to 0.456, a difference of 0.231, for Cholesterol ranged from 0.148 to 0.326, a difference of 0.178, for maximum heart rate (MaxHR) ranged from 0.081 to 0.200, a range of 0.119, and for Age ranged from 0.059 to 0.157, difference of 0.098. Use of simulations to empirically evaluate the variability of model metrics and explanatory algorithms to observe if covariates match the literature are necessary for increased transparency, reliability, and utility of machine learning methods. These variance statistics, combined with model accuracy statistics can help researchers identify the best model for a given dataset.
Collapse
Affiliation(s)
- Alexander A. Huang
- Department of Statistics and Data Science, Cornell University, Ithaca, New York, United States of America
- Department of MD Education, Northwestern University Feinberg School of Medicine, Chicago, Illinois, United States of America
| | - Samuel Y. Huang
- Department of Statistics and Data Science, Cornell University, Ithaca, New York, United States of America
- Department of Internal Medicine, Virginia Commonwealth University School of Medicine, Richmond, Virginia, United States of America
| |
Collapse
|
11
|
Cunningham CL, Frye CJ, Makowski JA, Kensinger AH, Shine M, Milback EJ, Lackey PE, Evanseck JD, Mihailescu MR. Effect of the SARS-CoV-2 Delta-associated G15U mutation on the s2m element dimerization and its interactions with miR-1307-3p. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.10.528014. [PMID: 36798421 PMCID: PMC9934655 DOI: 10.1101/2023.02.10.528014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
Abstract
The stem loop 2 motif (s2m), a highly conserved 41-nucleotide hairpin structure in the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome, serves as an attractive therapeutic target that may have important roles in the virus life cycle or interactions with the host. However, the conserved s2m in Delta SARS-CoV-2, a previously dominant variant characterized by high infectivity and disease severity, has received relatively less attention than that of the original SARS-CoV-2 virus. The focus of this work is to identify and define the s2m changes between Delta and SARS-CoV-2 and subsequent impact of those changes upon the s2m dimerization and interactions with the host microRNA miR-1307-3p. Bioinformatics analysis of the GISAID database targeting the s2m element reveals a greater than 99% correlation of a single nucleotide mutation at the 15 th position (G15U) in Delta SARS-CoV-2. Based on 1 H NMR assignments comparing the imino proton resonance region of s2m and the G15U at 19°C, we find that the U15-A29 base pair closes resulting in a stabilization of the upper stem without overall secondary structure deviation. Increased stability of the upper stem did not affect the chaperone activity of the viral N protein, as it was still able to convert the kissing dimers formed by s2m G15U into a stable duplex conformation, consistent with the s2m reference. However, we find that the s2m G15U mutation drastically reduces the binding affinity of the host miR-1307-3p. These findings demonstrate that the observed G15U mutation alters the secondary structure of s2m with subsequent impact on viral binding of host miR-1307-3p, with potential consequences on the immune response.
Collapse
|
12
|
Afsaneh E, Sharifdini A, Ghazzaghi H, Ghobadi MZ. Recent applications of machine learning and deep learning models in the prediction, diagnosis, and management of diabetes: a comprehensive review. Diabetol Metab Syndr 2022; 14:196. [PMID: 36572938 PMCID: PMC9793536 DOI: 10.1186/s13098-022-00969-9] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 12/16/2022] [Indexed: 12/28/2022] Open
Abstract
Diabetes as a metabolic illness can be characterized by increased amounts of blood glucose. This abnormal increase can lead to critical detriment to the other organs such as the kidneys, eyes, heart, nerves, and blood vessels. Therefore, its prediction, prognosis, and management are essential to prevent harmful effects and also recommend more useful treatments. For these goals, machine learning algorithms have found considerable attention and have been developed successfully. This review surveys the recently proposed machine learning (ML) and deep learning (DL) models for the objectives mentioned earlier. The reported results disclose that the ML and DL algorithms are promising approaches for controlling blood glucose and diabetes. However, they should be improved and employed in large datasets to affirm their applicability.
Collapse
|