1
|
Li S, Chen J, Zhou B. The clinical significance of endoplasmic reticulum stress related genes in non-small cell lung cancer and analysis of single nucleotide polymorphism for CAV1. Front Mol Biosci 2024; 11:1414164. [PMID: 39165641 PMCID: PMC11334084 DOI: 10.3389/fmolb.2024.1414164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 07/09/2024] [Indexed: 08/22/2024] Open
Abstract
In recent years, protein homeostasis imbalance caused by endoplasmic reticulum stress has become a major hallmark of cancer. Studies have shown that endoplasmic reticulum stress is closely related to the occurrence, development, and drug resistance of non-small cell lung cancer, however, the role of various endoplasmic reticulum stress-related genes in non-small cell lung cancer is still unclear. In this study, we established an endoplasmic reticulum stress scores based on the Cancer Genome Atlas for non-small cell lung cancer to reflect patient features and predict prognosis. Survival analysis showed significant differences in overall survival among non-small cell lung cancer patients with different endoplasmic reticulum stress scores. In addition, endoplasmic reticulum stress scores was significantly correlated with the clinical features of non-small cell lung cancer patients, and can be served as an independent prognostic indicator. A nomogram based on endoplasmic reticulum stress scores indicated a certain clinical net benefit, while ssGSEA analysis demonstrated that there was a certain immunosuppressive microenvironment in high endoplasmic reticulum stress scores. Gene Set Enrichment Analysis showed that scores was associated with cancer pathways and metabolism. Finally, weighted gene co-expression network analysis displayed that CAV1 was closely related to the occurrence of non-small cell lung cancer. Therefore, in order to further analyze the role of this gene, Chinese non-smoking females were selected as the research subjects to investigate the relationship between CAV1 rs3779514 and susceptibility and prognosis of non-small cell lung cancer. The results showed that the mutation of rs3779514 significantly reduced the risk of non-small cell lung cancer in Chinese non-smoking females, but no prognostic effect was found. In summary, we proposed an endoplasmic reticulum stress scores, which was an independent prognostic factor and indicated immune characteristics in the microenvironment of non-small cell lung cancer. We also validated the relationship between single nucleotide polymorphism locus of core genes and susceptibility to non-small cell lung cancer.
Collapse
Affiliation(s)
| | | | - Baosen Zhou
- Department of Clinical Epidemiology and Center of Evidence-Based Medicine, The First Hospital of China Medical University, Shenyang, China
| |
Collapse
|
2
|
Li C, Huang J, Tang H, Liu B, Zhou X. Revealing Cavin-2 Gene Function in Lung Based on Multi-Omics Data Analysis Method. Front Cell Dev Biol 2022; 9:827108. [PMID: 35174175 PMCID: PMC8841408 DOI: 10.3389/fcell.2021.827108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 12/15/2021] [Indexed: 11/23/2022] Open
Abstract
Research points out that it is particularly important to comprehensively evaluate immune microenvironmental indicators and gene mutation characteristics to select the best treatment plan. Therefore, exploring the relevant genes of pulmonary injury is an important basis for the improvement of survival. In recent years, with the massive production of omics data, a large number of computational methods have been applied in the field of biomedicine. Most of these computational methods are devel-oped for a certain type of diseases or whole diseases. Algorithms that specifically identify genes associated with pulmonary injury have not yet been developed. To fill this gap, we developed a novel method, named AdaRVM, to identify pulmonary injury-related genes in large scale. AdaRVM is the fusion of Adaboost and Relevance Vector Machine (RVM) to achieve fast and high-precision pattern recognition of pulmonary injury genetic mechanism. AdaRVM found that Cavin-2 gene has strong potential to be related to pulmonary injury. As we known, the formation and function of Caveolae are mediated by two family proteins: Caveolin and Cavin. Many studies have explored the role of Caveolin proteins, but people still knew little about Cavin family members. To verify our method and reveal the functions of cavin-2, we integrated six genome-wide association studies (GWAS) data related to lung function traits, four expression Quantitative Trait Loci (eQTL) data, and one methylation Quantitative Trait Loci (mQTL) data by Summary data level Mendelian Randomization (SMR). We found strong relationship between cavin-2 and canonical signaling pathways ERK1/2, AKT, and STAT3 which are all known to be related to lung injury.
Collapse
Affiliation(s)
- Changsheng Li
- Department of Thoracic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Jingyu Huang
- Department of Thoracic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Hexiao Tang
- Department of Thoracic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Bing Liu
- Department of Pulmonary and Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan, China
- Wuhan Research Center for Infectious Diseases and Cancer, Chinese Academy of Medical Sciences, Wuhan, China
| | - Xuefeng Zhou
- Department of Thoracic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, China
- *Correspondence: Xuefeng Zhou,
| |
Collapse
|
3
|
Spurr LF, Alomran N, Bousounis P, Reece-Stremtan D, Prashant NM, Liu H, Słowiński P, Li M, Zhang Q, Sein J, Asher G, Crandall KA, Tsaneva-Atanasova K, Horvath A. ReQTL: identifying correlations between expressed SNVs and gene expression using RNA-sequencing data. Bioinformatics 2020; 36:1351-1359. [PMID: 31589315 PMCID: PMC7058180 DOI: 10.1093/bioinformatics/btz750] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2018] [Revised: 09/19/2019] [Accepted: 10/01/2019] [Indexed: 12/31/2022] Open
Abstract
MOTIVATION By testing for associations between DNA genotypes and gene expression levels, expression quantitative trait locus (eQTL) analyses have been instrumental in understanding how thousands of single nucleotide variants (SNVs) may affect gene expression. As compared to DNA genotypes, RNA genetic variation represents a phenotypic trait that reflects the actual allele content of the studied system. RNA genetic variation at expressed SNV loci can be estimated using the proportion of alleles bearing the variant nucleotide (variant allele fraction, VAFRNA). VAFRNA is a continuous measure which allows for precise allele quantitation in loci where the RNA alleles do not scale with the genotype count. We describe a method to correlate VAFRNA with gene expression and assess its ability to identify genetically regulated expression solely from RNA-sequencing (RNA-seq) datasets. RESULTS We introduce ReQTL, an eQTL modification which substitutes the DNA allele count for the variant allele fraction at expressed SNV loci in the transcriptome (VAFRNA). We exemplify the method on sets of RNA-seq data from human tissues obtained though the Genotype-Tissue Expression (GTEx) project and demonstrate that ReQTL analyses are computationally feasible and can identify a subset of expressed eQTL loci. AVAILABILITY AND IMPLEMENTATION A toolkit to perform ReQTL analyses is available at https://github.com/HorvathLab/ReQTL. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Liam F Spurr
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA.,Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,Biochemistry and Molecular Medicine, McCormick Genomics and Proteomics Center
| | - Nawaf Alomran
- Biochemistry and Molecular Medicine, McCormick Genomics and Proteomics Center
| | - Pavlos Bousounis
- Biochemistry and Molecular Medicine, McCormick Genomics and Proteomics Center
| | - Dacian Reece-Stremtan
- Computer Applications Support Services, School of Medicine and Health Sciences, The George Washington University, Washington, DC 20037, USA
| | - N M Prashant
- Biochemistry and Molecular Medicine, McCormick Genomics and Proteomics Center
| | - Hongyu Liu
- Biochemistry and Molecular Medicine, McCormick Genomics and Proteomics Center
| | - Piotr Słowiński
- Department of Mathematics & Living Systems Institute, University of Exeter, Exeter EX4 4QD, UK
| | - Muzi Li
- Biochemistry and Molecular Medicine, McCormick Genomics and Proteomics Center
| | - Qianqian Zhang
- Department of Biochemistry and Molecular Medicine.,Department of Biostatistics and Bioinformatics, School of Medicine and Health Sciences, George Washington University, Washington, DC 20037, USA
| | - Justin Sein
- Biochemistry and Molecular Medicine, McCormick Genomics and Proteomics Center
| | - Gabriel Asher
- Biochemistry and Molecular Medicine, McCormick Genomics and Proteomics Center
| | - Keith A Crandall
- Computational Biology Institute, Department of Biostatistics and Bioinformatics, Milken Institute School of Public Health, The George Washington University, Washington, DC 20052, USA
| | - Krasimira Tsaneva-Atanasova
- Department of Mathematics & Living Systems Institute, University of Exeter, Exeter EX4 4QD, UK.,EPSRC Centre for Predictive Modelling in Healthcare, University of Exeter, Exeter EX4 4QJ, UK
| | - Anelia Horvath
- Biochemistry and Molecular Medicine, McCormick Genomics and Proteomics Center.,Department of Biochemistry and Molecular Medicine.,Department of Pharmacology and Physiology, School of Medicine and Health Sciences, The George Washington University, Washington, DC 20037, USA
| |
Collapse
|
4
|
Gorlov I, Xiao X, Mayes M, Gorlova O, Amos C. SNP eQTL status and eQTL density in the adjacent region of the SNP are associated with its statistical significance in GWA studies. BMC Genet 2019; 20:85. [PMID: 31718536 PMCID: PMC6852916 DOI: 10.1186/s12863-019-0786-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Accepted: 10/18/2019] [Indexed: 01/05/2023] Open
Abstract
Background Over the relatively short history of Genome Wide Association Studies (GWASs), hundreds of GWASs have been published and thousands of disease risk-associated SNPs have been identified. Summary statistics from the conducted GWASs are often available and can be used to identify SNP features associated with the level of GWAS statistical significance. Those features could be used to select SNPs from gray zones (SNPs that are nominally significant but do not reach the genome-wide level of significance) for targeted analyses. Methods We used summary statistics from recently published breast and lung cancer and scleroderma GWASs to explore the association between the level of the GWAS statistical significance and the expression quantitative trait loci (eQTL) status of the SNP. Data from the Genotype-Tissue Expression Project (GTEx) were used to identify eQTL SNPs. Results We found that SNPs reported as eQTLs were more significant in GWAS (higher -log10p) regardless of the tissue specificity of the eQTL. Pan-tissue eQTLs (those reported as eQTLs in multiple tissues) tended to be more significant in the GWAS compared to those reported as eQTL in only one tissue type. eQTL density in the ±5 kb adjacent region of a given SNP was also positively associated with the level of GWAS statistical significance regardless of the eQTL status of the SNP. We found that SNPs located in the regions of high eQTL density were more likely to be located in regulatory elements (transcription factor or miRNA binding sites). When SNPs were stratified by the level of statistical significance, the proportion of eQTLs was positively associated with the mean level of statistical significance in the group. The association curve reaches a plateau around -log10p ≈ 5. The observed associations suggest that quasi-significant SNPs (10− 5 < p < 5 × 10− 8) and SNPs at the genome wide level of statistical significance (p < 5 × 10− 8) may have a similar proportions of risk associated SNPs. Conclusions The results of this study indicate that the SNP’s eQTL status, as well as eQTL density in the adjacent region are positively associated with the level of statistical significance of the SNP in GWAS.
Collapse
Affiliation(s)
- Ivan Gorlov
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA.
| | - Xiangjun Xiao
- Department of Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA
| | - Maureen Mayes
- Department of Internal Medicine, Division of Rheumatology, University of Texas McGovern Medical School, Houston, TX, USA
| | - Olga Gorlova
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA
| | - Christopher Amos
- Department of Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA
| |
Collapse
|
5
|
Han J, Li J, Achour I, Pesce L, Foster I, Li H, Lussier YA. Convergent downstream candidate mechanisms of independent intergenic polymorphisms between co-classified diseases implicate epistasis among noncoding elements. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018; 23:524-535. [PMID: 29218911 PMCID: PMC5730078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Eighty percent of DNA outside protein coding regions was shown biochemically functional by the ENCODE project, enabling studies of their interactions. Studies have since explored how convergent downstream mechanisms arise from independent genetic risks of one complex disease. However, the cross-talk and epistasis between intergenic risks associated with distinct complex diseases have not been comprehensively characterized. Our recent integrative genomic analysis unveiled downstream biological effectors of disease-specific polymorphisms buried in intergenic regions, and we then validated their genetic synergy and antagonism in distinct GWAS. We extend this approach to characterize convergent downstream candidate mechanisms of distinct intergenic SNPs across distinct diseases within the same clinical classification. We construct a multipartite network consisting of 467 diseases organized in 15 classes, 2,358 disease-associated SNPs, 6,301 SNPassociated mRNAs by eQTL, and mRNA annotations to 4,538 Gene Ontology mechanisms. Functional similarity between two SNPs (similar SNP pairs) is imputed using a nested information theoretic distance model for which p-values are assigned by conservative scale-free permutation of network edges without replacement (node degrees constant). At FDR≤5%, we prioritized 3,870 intergenic SNP pairs associated, among which 755 are associated with distinct diseases sharing the same disease class, implicating 167 intergenic SNPs, 14 classes, 230 mRNAs, and 134 GO terms. Co-classified SNP pairs were more likely to be prioritized as compared to those of distinct classes confirming a noncoding genetic underpinning to clinical classification (odds ratio ∼3.8; p≤10-25). The prioritized pairs were also enriched in regions bound to the same/interacting transcription factors and/or interacting in long-range chromatin interactions suggestive of epistasis (odds ratio ∼ 2,500; p≤10-25). This prioritized network implicates complex epistasis between intergenic polymorphisms of co-classified diseases and offers a roadmap for a novel therapeutic paradigm: repositioning medications that target proteins within downstream mechanisms of intergenic disease-associated SNPs. Supplementary information and software: http://lussiergroup.org/publications/disease_class.
Collapse
Affiliation(s)
- Jiali Han
- Center for Biomedical Informatics and Biostatistics (CB2) and Departments of Medicine and of Systems and Industrial Engineering, The University of Arizona, Tucson, AZ 85721, USA,
| | | | | | | | | | | | | |
Collapse
|
6
|
Yu CH, Pal LR, Moult J. Consensus Genome-Wide Expression Quantitative Trait Loci and Their Relationship with Human Complex Trait Disease. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2017; 20:400-14. [PMID: 27428252 DOI: 10.1089/omi.2016.0063] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Most of the risk loci identified from genome-wide association (GWA) studies do not provide direct information on the biological basis of a disease or on the underlying mechanisms. Recent expression quantitative trait locus (eQTL) association studies have provided information on genetic factors associated with gene expression variation. These eQTLs might contribute to phenotype diversity and disease susceptibility, but interpretation is handicapped by low reproducibility of the expression results. To address this issue, we have generated a set of consensus eQTLs by integrating publicly available data for specific human populations and cell types. Overall, we find over 4000 genes that are involved in high-confidence eQTL relationships. To elucidate the role that eQTLs play in human common diseases, we matched the high-confidence eQTLs to a set of 335 disease risk loci identified from the Wellcome Trust Case Control Consortium GWA study and follow-up studies for 7 human complex trait diseases-bipolar disorder (BD), coronary artery disease (CAD), Crohn's disease (CD), hypertension (HT), rheumatoid arthritis (RA), type 1 diabetes (T1D), and type 2 diabetes (T2D). The results show that the data are consistent with ∼50% of these disease loci arising from an underlying expression change mechanism.
Collapse
Affiliation(s)
- Chen-Hsin Yu
- 1 Institute for Bioscience and Biotechnology Research, University of Maryland , Rockville, Maryland.,2 Molecular and Cell Biology Concentration Area, Biological Sciences Graduate Program, University of Maryland , College Park, Maryland
| | - Lipika R Pal
- 1 Institute for Bioscience and Biotechnology Research, University of Maryland , Rockville, Maryland
| | - John Moult
- 1 Institute for Bioscience and Biotechnology Research, University of Maryland , Rockville, Maryland.,3 Department of Cell Biology and Molecular Genetics, University of Maryland , College Park, Maryland
| |
Collapse
|
7
|
Odhams CA, Cortini A, Chen L, Roberts AL, Viñuela A, Buil A, Small KS, Dermitzakis ET, Morris DL, Vyse TJ, Cunninghame Graham DS. Mapping eQTLs with RNA-seq reveals novel susceptibility genes, non-coding RNAs and alternative-splicing events in systemic lupus erythematosus. Hum Mol Genet 2017; 26:1003-1017. [PMID: 28062664 PMCID: PMC5409091 DOI: 10.1093/hmg/ddw417] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2016] [Accepted: 12/05/2016] [Indexed: 12/19/2022] Open
Abstract
Studies attempting to functionally interpret complex-disease susceptibility loci by GWAS and eQTL integration have predominantly employed microarrays to quantify gene-expression. RNA-Seq has the potential to discover a more comprehensive set of eQTLs and illuminate the underlying molecular consequence. We examine the functional outcome of 39 variants associated with Systemic Lupus Erythematosus (SLE) through the integration of GWAS and eQTL data from the TwinsUK microarray and RNA-Seq cohort in lymphoblastoid cell lines. We use conditional analysis and a Bayesian colocalisation method to provide evidence of a shared causal-variant, then compare the ability of each quantification type to detect disease relevant eQTLs and eGenes. We discovered the greatest frequency of candidate-causal eQTLs using exon-level RNA-Seq, and identified novel SLE susceptibility genes (e.g. NADSYN1 and TCF7) that were concealed using microarrays, including four non-coding RNAs. Many of these eQTLs were found to influence the expression of several genes, supporting the notion that risk haplotypes may harbour multiple functional effects. Novel SLE associated splicing events were identified in the T-reg restricted transcription factor, IKZF2, and other candidate genes (e.g. WDFY4) through asQTL mapping using the Geuvadis cohort. We have significantly increased our understanding of the genetic control of gene-expression in SLE by maximising the leverage of RNA-Seq and performing integrative GWAS-eQTL analysis against gene, exon, and splice-junction quantifications. We conclude that to better understand the true functional consequence of regulatory variants, quantification by RNA-Seq should be performed at the exon-level as a minimum, and run in parallel with gene and splice-junction level quantification.
Collapse
Affiliation(s)
| | - Andrea Cortini
- Department of Medical & Molecular Genetics, King's College London, London, UK
| | - Lingyan Chen
- Department of Medical & Molecular Genetics, King's College London, London, UK
| | - Amy L Roberts
- Department of Medical & Molecular Genetics, King's College London, London, UK
| | - Ana Viñuela
- Department of Twin Research, King's College London, London, UK
| | | | - Kerrin S Small
- Department of Twin Research, King's College London, London, UK
| | | | - David L Morris
- Department of Medical & Molecular Genetics, King's College London, London, UK
| | - Timothy J Vyse
- Department of Medical & Molecular Genetics, King's College London, London, UK.,Division of Immunology, Infection and Inflammatory Disease, King's College London, London, UK
| | | |
Collapse
|