1
|
Mahmood MS, Afzal M, Batool H, Saif A, Aqdas T, Ashraf NM, Saleem M. Screening of Pathogenic Missense Single Nucleotide Variants From LHPP Gene Associated With the Hepatocellular Carcinoma: An In silico Approach. Bioinform Biol Insights 2022; 16:11779322221115547. [PMID: 35966807 PMCID: PMC9373111 DOI: 10.1177/11779322221115547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 06/11/2022] [Indexed: 11/15/2022] Open
Abstract
LHPP gene encodes a phospholysine phosphohistidine inorganic pyrophosphate phosphatase, which functions as a tumor-suppressor protein. The tumor suppression by this protein has been confirmed in various cancers, including hepatocellular carcinoma (HCC). LHPP downregulation promotes cell growth and proliferation by modulating the PI3K/AKT signaling pathway. This study identifies potentially deleterious missense single nucleotide variants (SNVs) associated with the LHPP gene using multiple computational tools based on different algorithms. A total of 4 destabilizing mutants are identified as L22P, I212T, G227R, and G236R, from the conserved region of the phosphatase. The 3-dimensional (3D) modeling and structural comparison of variants with the native protein reveals significant structural and conformational variations after mutations, suggesting disruption in the function of phospholysine phosphohistidine inorganic pyrophosphate phosphatase. The identified mutations might, therefore, participate in the cause of HCC.
Collapse
Affiliation(s)
- Malik Siddique Mahmood
- School of Biochemistry & Biotechnology, University of the Punjab, Lahore, Pakistan.,Department of Biochemistry, NUR International University, Lahore, Pakistan
| | - Maryam Afzal
- School of Biochemistry & Biotechnology, University of the Punjab, Lahore, Pakistan
| | - Hina Batool
- Department of Life Sciences, University of Management and Technology, Lahore, Pakistan
| | - Amara Saif
- Department of Life Sciences, University of Management and Technology, Lahore, Pakistan
| | - Tahreem Aqdas
- School of Biochemistry & Biotechnology, University of the Punjab, Lahore, Pakistan
| | - Naeem Mahmood Ashraf
- Department of Biochemistry & Biotechnology, University of Gujrat, Gujrat, Pakistan
| | - Mahjabeen Saleem
- School of Biochemistry & Biotechnology, University of the Punjab, Lahore, Pakistan
| |
Collapse
|
2
|
SNP characteristics and validation success in genome wide association studies. Hum Genet 2022; 141:229-238. [PMID: 34981173 PMCID: PMC8855685 DOI: 10.1007/s00439-021-02407-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 11/27/2021] [Indexed: 02/03/2023]
Abstract
Genome wide association studies (GWASs) have identified tens of thousands of single nucleotide polymorphisms (SNPs) associated with human diseases and characteristics. A significant fraction of GWAS findings can be false positives. The gold standard for true positives is an independent validation. The goal of this study was to identify SNP features associated with validation success. Summary statistics from the Catalog of Published GWASs were used in the analysis. Since our goal was an analysis of reproducibility, we focused on the diseases/phenotypes targeted by at least 10 GWASs. GWASs were arranged in discovery-validation pairs based on the time of publication, with the discovery GWAS published before validation. We used four definitions of the validation success that differ by stringency. Associations of SNP features with validation success were consistent across the definitions. The strongest predictor of SNP validation was the level of statistical significance in the discovery GWAS. The magnitude of the effect size was associated with validation success in a non-linear manner. SNPs with risk allele frequencies in the range 30-70% showed a higher validation success rate compared to rarer or more common SNPs. Missense, 5'UTR, stop gained, and SNPs located in transcription factor binding sites had a higher validation success rate compared to intergenic, intronic and synonymous SNPs. There was a positive association between validation success and the level of evolutionary conservation of the sites. In addition, validation success was higher when discovery and validation GWASs targeted the same ethnicity. All predictors of validation success remained significant in a multivariate logistic regression model indicating their independent contribution. To conclude, we identified SNP features predicting validation success of GWAS hits. These features can be used to select SNPs for validation and downstream functional studies.
Collapse
|
3
|
O'Sullivan JW, Ioannidis JPA. Reproducibility in the UK biobank of genome-wide significant signals discovered in earlier genome-wide association studies. Sci Rep 2021; 11:18625. [PMID: 34545148 PMCID: PMC8452698 DOI: 10.1038/s41598-021-97896-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 08/31/2021] [Indexed: 12/20/2022] Open
Abstract
With the establishment of large biobanks, discovery of single nucleotide variants (SNVs, also known as single nucleotide polymorphisms (SNVs)) associated with various phenotypes has accelerated. An open question is whether genome-wide significant SNVs identified in earlier genome-wide association studies (GWAS) are replicated in later GWAS conducted in biobanks. To address this, we examined a publicly available GWAS database and identified two, independent GWAS on the same phenotype (an earlier, “discovery” GWAS and a later, “replication” GWAS done in the UK biobank). The analysis evaluated 136,318,924 SNVs (of which 6289 reached P < 5e−8 in the discovery GWAS) from 4,397,962 participants across nine phenotypes. The overall replication rate was 85.0%; although lower for binary than quantitative phenotypes (58.1% versus 94.8% respectively). There was a 18.0% decrease in SNV effect size for binary phenotypes, but a 12.0% increase for quantitative phenotypes. Using the discovery SNV effect size, phenotype trait (binary or quantitative), and discovery P value, we built and validated a model that predicted SNV replication with area under the Receiver Operator Curve = 0.90. While non-replication may reflect lack of power rather than genuine false-positives, these results provide insights about which discovered associations are likely to be replicated across subsequent GWAS.
Collapse
Affiliation(s)
- Jack W O'Sullivan
- Division of Cardiovascular Medicine, Stanford University, Stanford, CA, USA. .,Meta-Research Innovation Center at Stanford (METRICS), Stanford University, Stanford, CA, USA.
| | - John P A Ioannidis
- Meta-Research Innovation Center at Stanford (METRICS), Stanford University, Stanford, CA, USA.,Departments of Medicine, of Epidemiology and Population Health, of Biomedical Data Science, and of Statistics, Stanford University, Stanford, CA, USA
| |
Collapse
|
4
|
Perng W, Aslibekyan S. Find the Needle in the Haystack, Then Find It Again: Replication and Validation in the 'Omics Era. Metabolites 2020; 10:metabo10070286. [PMID: 32664690 PMCID: PMC7408356 DOI: 10.3390/metabo10070286] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 07/01/2020] [Accepted: 07/10/2020] [Indexed: 01/25/2023] Open
Abstract
Advancements in high-throughput technologies have made it feasible to study thousands of biological pathways simultaneously for a holistic assessment of health and disease risk via ‘omics platforms. A major challenge in ‘omics research revolves around the reproducibility of findings—a feat that hinges upon balancing false-positive associations with generalizability. Given the foundational role of reproducibility in scientific inference, replication and validation of ‘omics findings are cornerstones of this effort. In this narrative review, we define key terms relevant to replication and validation, present issues surrounding each concept with historical and contemporary examples from genomics (the most well-established and upstream ‘omics), discuss special issues and unique considerations for replication and validation in metabolomics (an emerging field and most downstream ‘omics for which best practices remain yet to be established), and make suggestions for future research leveraging multiple ‘omics datasets.
Collapse
Affiliation(s)
- Wei Perng
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Denver Anschutz Medical Campus, Aurora, CO 80045, USA
- Lifecourse Epidemiology of Adiposity and Diabetes (LEAD) Center, University of Colorado Denver Anschutz Medical Campus, Aurora, CO 80045, USA
- Correspondence:
| | - Stella Aslibekyan
- Department of Epidemiology, University of Alabama at Birmingham, Birmingham, AL 35294, USA;
| |
Collapse
|
5
|
Bakhashab S, Filimban N, Altall RM, Nassir R, Qusti SY, Alqahtani MH, Abuzenadah AM, Dallol A. The Effect Sizes of PPARγ rs1801282 , FTO rs9939609, and MC4R rs2229616 Variants on Type 2 Diabetes Mellitus Risk among the Western Saudi Population: A Cross-Sectional Prospective Study. Genes (Basel) 2020; 11:genes11010098. [PMID: 31947684 PMCID: PMC7017045 DOI: 10.3390/genes11010098] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2019] [Revised: 01/05/2020] [Accepted: 01/09/2020] [Indexed: 12/20/2022] Open
Abstract
Type 2 diabetes mellitus (T2DM) is a common polygenic disease with associated comorbidities. Obesity is a major risk factor for the development of T2DM. The aim of this study is to determine the allele and genotype frequency of peroxisome proliferator-activated receptor-γ (PPARγ) rs1801282, fat mass and obesity-associated protein (FTO) rs9939609, and melanocortin 4 receptor (MC4R) rs2229616 polymorphisms and their association with risk of T2DM in the western Saudi population as mediators of adiposity phenotypes. In a cross-sectional prospective study, genomic DNA from control and T2DM patients were isolated and genotyped for these single-nucleotide polymorphisms. There was a significant association of the MC4R rs2229616 variant with T2DM, but no association with T2DM was detected with PPARγ rs1801282 or FTO rs9939609. The combination of C/C for PPARγ rs1801282, A/A for FTO rs9939609, and C/C for MC4R rs2229616 increased the risk of T2DM by 1.82. The A/T genotype for FTO rs9939609 was predicted to decrease the risk of T2DM when combined with C/C for PPARγ rs1801282 and C/C for MC4R rs2229616 or C/C for PPARγ rs1801282 and C/T MC4R rs2229616. In conclusion, our study showed the risk of the assessed variants for the development of T2DM in the Saudi population.
Collapse
Affiliation(s)
- Sherin Bakhashab
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, P.O Box 80218, Jeddah 21589, Saudi Arabia; (N.F.); (R.M.A.); (S.Y.Q.)
- Center of Innovation in Personalized Medicine, King Abdulaziz University, P.O Box 80216, Jeddah 21589, Saudi Arabia; (A.M.A.); (A.D.)
- Correspondence: ; Tel.: +966126400000
| | - Najlaa Filimban
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, P.O Box 80218, Jeddah 21589, Saudi Arabia; (N.F.); (R.M.A.); (S.Y.Q.)
- King Faisal Specialist Hospital and Research Center, Clinical Genomics, Department of Genetics, P.O. Box 3354, Riyadh 11211, Saudi Arabia
| | - Rana M. Altall
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, P.O Box 80218, Jeddah 21589, Saudi Arabia; (N.F.); (R.M.A.); (S.Y.Q.)
| | - Rami Nassir
- Department of Pathology, Faculty of Medicine, Umm Al-Qura University, P.O. Box 715, Makkah 21955, Saudi Arabia;
| | - Safaa Y. Qusti
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, P.O Box 80218, Jeddah 21589, Saudi Arabia; (N.F.); (R.M.A.); (S.Y.Q.)
| | - Mohammed H. Alqahtani
- Department of Medical Laboratory Technology, Faculty of Applied Medical Sciences, King Abdulaziz University, P.O Box 80216, Jeddah 21589, Saudi Arabia;
- Center of Excellence in Genomic Medicine Research, King Abdulaziz University, P.O Box 80216, Jeddah 21589, Saudi Arabia
| | - Adel M. Abuzenadah
- Center of Innovation in Personalized Medicine, King Abdulaziz University, P.O Box 80216, Jeddah 21589, Saudi Arabia; (A.M.A.); (A.D.)
- Department of Medical Laboratory Technology, Faculty of Applied Medical Sciences, King Abdulaziz University, P.O Box 80216, Jeddah 21589, Saudi Arabia;
| | - Ashraf Dallol
- Center of Innovation in Personalized Medicine, King Abdulaziz University, P.O Box 80216, Jeddah 21589, Saudi Arabia; (A.M.A.); (A.D.)
- Department of Medical Laboratory Technology, Faculty of Applied Medical Sciences, King Abdulaziz University, P.O Box 80216, Jeddah 21589, Saudi Arabia;
| |
Collapse
|
6
|
Gorlov I, Xiao X, Mayes M, Gorlova O, Amos C. SNP eQTL status and eQTL density in the adjacent region of the SNP are associated with its statistical significance in GWA studies. BMC Genet 2019; 20:85. [PMID: 31718536 PMCID: PMC6852916 DOI: 10.1186/s12863-019-0786-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Accepted: 10/18/2019] [Indexed: 01/05/2023] Open
Abstract
Background Over the relatively short history of Genome Wide Association Studies (GWASs), hundreds of GWASs have been published and thousands of disease risk-associated SNPs have been identified. Summary statistics from the conducted GWASs are often available and can be used to identify SNP features associated with the level of GWAS statistical significance. Those features could be used to select SNPs from gray zones (SNPs that are nominally significant but do not reach the genome-wide level of significance) for targeted analyses. Methods We used summary statistics from recently published breast and lung cancer and scleroderma GWASs to explore the association between the level of the GWAS statistical significance and the expression quantitative trait loci (eQTL) status of the SNP. Data from the Genotype-Tissue Expression Project (GTEx) were used to identify eQTL SNPs. Results We found that SNPs reported as eQTLs were more significant in GWAS (higher -log10p) regardless of the tissue specificity of the eQTL. Pan-tissue eQTLs (those reported as eQTLs in multiple tissues) tended to be more significant in the GWAS compared to those reported as eQTL in only one tissue type. eQTL density in the ±5 kb adjacent region of a given SNP was also positively associated with the level of GWAS statistical significance regardless of the eQTL status of the SNP. We found that SNPs located in the regions of high eQTL density were more likely to be located in regulatory elements (transcription factor or miRNA binding sites). When SNPs were stratified by the level of statistical significance, the proportion of eQTLs was positively associated with the mean level of statistical significance in the group. The association curve reaches a plateau around -log10p ≈ 5. The observed associations suggest that quasi-significant SNPs (10− 5 < p < 5 × 10− 8) and SNPs at the genome wide level of statistical significance (p < 5 × 10− 8) may have a similar proportions of risk associated SNPs. Conclusions The results of this study indicate that the SNP’s eQTL status, as well as eQTL density in the adjacent region are positively associated with the level of statistical significance of the SNP in GWAS.
Collapse
Affiliation(s)
- Ivan Gorlov
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA.
| | - Xiangjun Xiao
- Department of Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA
| | - Maureen Mayes
- Department of Internal Medicine, Division of Rheumatology, University of Texas McGovern Medical School, Houston, TX, USA
| | - Olga Gorlova
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA
| | - Christopher Amos
- Department of Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA
| |
Collapse
|
7
|
Gorlov IP, Gorlova OY, Amos CI. Untouchable genes in the human genome: Identifying ideal targets for cancer treatment. Cancer Genet 2019; 231-232:67-79. [PMID: 30803560 DOI: 10.1016/j.cancergen.2019.01.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Revised: 10/30/2018] [Accepted: 01/18/2019] [Indexed: 12/12/2022]
Abstract
BACKGROUND Usually, genes with a higher-than-expected number of somatic mutations in tumor samples are assumed to be cancer related. We identified genes with a fewer-than-expected number of somatic mutations - "untouchable genes". METHODS To predict the expected number of somatic mutations, we used a linear regression model with the number of mutations in the gene as an outcome, and gene characteristics, including gene size, nucleotide composition, level of evolutionary conservation, expression level and others, as predictors. Analysis of residuals from the regression model was used to compare the observed and predicted number of mutations. RESULTS We have identified 19 genes with a less-than-expected number of loss-off-function (nonsense, frameshift or pathogenic missense) mutations - i.e., untouchable genes. The number of silent or neutral missense mutations in untouchable genes was equal or higher than the expected number. Many mucins, including MUC16, MUC17, MUC6, MUC5AC, MUC5B, and MUC12, are untouchable. We hypothesized that untouchable mucins help tumor cells to avoid immune response by providing a protective coat that prevents direct contact between effector immune cells, e.g., cytotoxic T-cells, and tumor cells. Survival analysis of available TCGA data demonstrated that overall survival of patients with low (below the median) expression of untouchable mucins was better compared to patients with high expression of untouchable mucins. Aside from mucins, we have identified a number of other untouchable genes. CONCLUSIONS Untouchable genes may be ideal targets for cancer treatment since suppression of untouchable genes is expected to inhibit survival of tumor cells.
Collapse
Affiliation(s)
- Ivan P Gorlov
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH 03756, United States.
| | - Olga Y Gorlova
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH 03756, United States
| | - Christopher I Amos
- Department of Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| |
Collapse
|
8
|
Gorlov IP, Pikielny CW, Frost HR, Her SC, Cole MD, Strohbehn SD, Wallace-Bradley D, Kimmel M, Gorlova OY, Amos CI. Gene characteristics predicting missense, nonsense and frameshift mutations in tumor samples. BMC Bioinformatics 2018; 19:430. [PMID: 30453881 PMCID: PMC6245819 DOI: 10.1186/s12859-018-2455-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Accepted: 10/31/2018] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Because driver mutations provide selective advantage to the mutant clone, they tend to occur at a higher frequency in tumor samples compared to selectively neutral (passenger) mutations. However, mutation frequency alone is insufficient to identify cancer genes because mutability is influenced by many gene characteristics, such as size, nucleotide composition, etc. The goal of this study was to identify gene characteristics associated with the frequency of somatic mutations in the gene in tumor samples. RESULTS We used data on somatic mutations detected by genome wide screens from the Catalog of Somatic Mutations in Cancer (COSMIC). Gene size, nucleotide composition, expression level of the gene, relative replication time in the cell cycle, level of evolutionary conservation and other gene characteristics (totaling 11) were used as predictors of the number of somatic mutations. We applied stepwise multiple linear regression to predict the number of mutations per gene. Because missense, nonsense, and frameshift mutations are associated with different sets of gene characteristics, they were modeled separately. Gene characteristics explain 88% of the variation in the number of missense, 40% of nonsense, and 23% of frameshift mutations. Comparisons of the observed and expected numbers of mutations identified genes with a higher than expected number of mutations- positive outliers. Many of these are known driver genes. A number of novel candidate driver genes was also identified. CONCLUSIONS By comparing the observed and predicted number of mutations in a gene, we have identified known cancer-associated genes as well as 111 novel cancer associated genes. We also showed that adding the number of silent mutations per gene reported by genome/exome wide screens across all cancer type (COSMIC data) as a predictor substantially exceeds predicting accuracy of the most popular cancer gene predicting tool - MutsigCV.
Collapse
Affiliation(s)
- Ivan P Gorlov
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Beirut, NH, 03756, Lebanon.
| | - Claudio W Pikielny
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Beirut, NH, 03756, Lebanon
| | - Hildreth R Frost
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Beirut, NH, 03756, Lebanon
| | - Stephanie C Her
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Beirut, NH, 03756, Lebanon
| | - Michael D Cole
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Beirut, NH, 03756, Lebanon
| | - Samuel D Strohbehn
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Beirut, NH, 03756, Lebanon
| | - David Wallace-Bradley
- Department of Statistics, Rice University, M.S. 138, 6100 Main Street, Houston, TX, 77005, USA
| | - Marek Kimmel
- Department of Statistics, Rice University, M.S. 138, 6100 Main Street, Houston, TX, 77005, USA
| | - Olga Y Gorlova
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Beirut, NH, 03756, Lebanon
| | - Christopher I Amos
- The Geisel School of Medicine, Department of Biomedical Data Science, Dartmouth College, HB7936, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Beirut, NH, 03756, Lebanon
| |
Collapse
|
9
|
El Rouby N, McDonough CW, Gong Y, McClure LA, Mitchell BD, Horenstein RB, Talbert RL, Crawford DC, Gitzendanner MA, Takahashi A, Tanaka T, Kubo M, Pepine CJ, Cooper-DeHoff RM, Benavente OR, Shuldiner AR, Johnson JA. Genome-wide association analysis of common genetic variants of resistant hypertension. THE PHARMACOGENOMICS JOURNAL 2018; 19:295-304. [PMID: 30237584 DOI: 10.1038/s41397-018-0049-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Revised: 08/02/2018] [Accepted: 08/10/2018] [Indexed: 12/24/2022]
Abstract
Resistant hypertension (RHTN), defined as uncontrolled blood pressure (BP) ≥ 140/90 using three or more drugs or controlled BP (<140/90) using four or more drugs, is associated with adverse outcomes, including decline in kidney function. We conducted a genome-wide association analysis in 1194 White and Hispanic participants with hypertension and coronary artery disease from the INternational VErapamil-SR Trandolapril STudy-GENEtic Substudy (INVEST-GENES). Top variants associated with RHTN at p < 10-4 were tested for replication in 585 White and Hispanic participants with hypertension and subcortical strokes from the Secondary Prevention of Subcortical Strokes GENEtic Substudy (SPS3-GENES). A genetic risk score for RHTN was created by summing the risk alleles of replicated RHTN signals. rs11749255 in MSX2 was associated with RHTN in INVEST (odds ratio (OR) (95% CI) = 1.50 (1.2-1.8), p = 7.3 × 10-5) and replicated in SPS3 (OR = 2.0 (1.4-2.8), p = 4.3 × 10-5), with genome-wide significance in meta-analysis (OR = 1.60 (1.3-1.9), p = 3.8 × 10-8). Other replicated signals were in IFLTD1 and PTPRD. IFLTD1 rs6487504 was associated with RHTN in INVEST (OR = 1.90 (1.4-2.5), p = 1.1 × 10-5) and SPS3 (OR = 1.70 (1.2-2.5), p = 4 × 10-3). PTPRD rs324498, a previously reported RHTN signal, was among the top signals in INVEST (OR = 1.60 (1.3-2.0), p = 3.4 × 10-5) and replicated in SPS3 (OR = 1.60 (1.1-2.4), one-sided p = 0.005). Participants with the highest number of risk alleles were at increased risk of RHTN compared to participants with a lower number (p-trend = 1.8 × 10-15). Overall, we identified and replicated associations with RHTN in the MSX2, IFLTD1, and PTPRD regions, and combined these associations to create a genetic risk score.
Collapse
Affiliation(s)
- Nihal El Rouby
- Department of Pharmacotherapy and Translational Research and Center for Pharmacogenomics, College of Pharmacy, University of Florida, Gainesville, FL, USA
| | - Caitrin W McDonough
- Department of Pharmacotherapy and Translational Research and Center for Pharmacogenomics, College of Pharmacy, University of Florida, Gainesville, FL, USA
| | - Yan Gong
- Department of Pharmacotherapy and Translational Research and Center for Pharmacogenomics, College of Pharmacy, University of Florida, Gainesville, FL, USA
| | - Leslie A McClure
- Department of Epidemiology and Biostatistics, Dornsife School of Public Health, Drexel University, Philadelphia, PA, USA
| | - Braxton D Mitchell
- Division of Endocrinology, Diabetes and Nutrition, University of Maryland School of Medicine, Baltimore, MD, USA.,Geriatric Research and Education Clinical Center, Veterans Administration Medical Center, Baltimore, MD, USA
| | - Richard B Horenstein
- Division of Endocrinology, Diabetes and Nutrition, University of Maryland School of Medicine, Baltimore, MD, USA.,Program for Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Robert L Talbert
- College of Pharmacy, University of Texas at Austin, Austin, TX, USA
| | - Dana C Crawford
- Epidemiology and Biostatistics, Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
| | | | | | | | | | - Michiaki Kubo
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Carl J Pepine
- Division of Cardiovascular Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Rhonda M Cooper-DeHoff
- Department of Pharmacotherapy and Translational Research and Center for Pharmacogenomics, College of Pharmacy, University of Florida, Gainesville, FL, USA.,Division of Cardiovascular Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Oscar R Benavente
- Department of Neurology, University of British Columbia, Vancouver, BC, Canada
| | - Alan R Shuldiner
- Division of Endocrinology, Diabetes and Nutrition, University of Maryland School of Medicine, Baltimore, MD, USA.,Geriatric Research and Education Clinical Center, Veterans Administration Medical Center, Baltimore, MD, USA.,Program for Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Julie A Johnson
- Department of Pharmacotherapy and Translational Research and Center for Pharmacogenomics, College of Pharmacy, University of Florida, Gainesville, FL, USA. .,Division of Cardiovascular Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
10
|
Lopez-Rodriguez R, Perez-Pampin E, Marquez A, Blanco FJ, Joven B, Carreira P, Ferrer MA, Caliz R, Valor L, Narvaez J, Cañete JD, Ordoñez MDC, Manrique-Arija S, Vasilopoulos Y, Balsa A, Pascual-Salcedo D, Moreno-Ramos MJ, Alegre-Sancho JJ, Navarro-Sarabia F, Moreira V, Garcia-Portales R, Raya E, Magro-Checa C, Martin J, Gomez-Reino JJ, Gonzalez A. Validation study of genetic biomarkers of response to TNF inhibitors in rheumatoid arthritis. PLoS One 2018; 13:e0196793. [PMID: 29734345 PMCID: PMC5937760 DOI: 10.1371/journal.pone.0196793] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Accepted: 04/19/2018] [Indexed: 11/19/2022] Open
Abstract
Genetic biomarkers are sought to personalize treatment of patients with rheumatoid arthritis (RA), given their variable response to TNF inhibitors (TNFi). However, no genetic biomaker is yet sufficiently validated. Here, we report a validation study of 18 previously reported genetic biomarkers, including 11 from GWAS of response to TNFi. The validation was attempted in 581 patients with RA that had not been treated with biologic antirheumatic drugs previously. Their response to TNFi was evaluated at 3, 6 and 12 months in two ways: change in the DAS28 measure of disease activity, and according to the EULAR criteria for response to antirheumatic drugs. Association of these parameters with the genotypes, obtained by PCR amplification followed by single-base extension, was tested with regression analysis. These analyses were adjusted for baseline DAS28, sex, and the specific TNFi. However, none of the proposed biomarkers was validated, as none showed association with response to TNFi in our study, even at the time of assessment and with the outcome that showed the most significant result in previous studies. These negative results are notable because this was the first independent validation study for 12 of the biomarkers, and because they indicate that prudence is needed in the interpretation of the proposed biomarkers of response to TNFi even when they are supported by very low p values. The results also emphasize the requirement of independent replication for validation, and the need to search protocols that could increase reproducibility of the biomarkers of response to TNFi.
Collapse
Affiliation(s)
- Rosario Lopez-Rodriguez
- Experimental and Observational Rheumatology and Rheumatology Unit, Instituto de Investigación Sanitaria, Hospital Clínico Universitario de Santiago, Santiago de Compostela, Spain
| | - Eva Perez-Pampin
- Experimental and Observational Rheumatology and Rheumatology Unit, Instituto de Investigación Sanitaria, Hospital Clínico Universitario de Santiago, Santiago de Compostela, Spain
| | - Ana Marquez
- Instituto de Parasitología y Biomedicina López-Neyra, CSIC, Granada, Spain
| | - Francisco J. Blanco
- Rheumatology Department, Instituto de Investigacion Biomedica–Complejo Hospitalario Universitario A Coruna, Coruna, Spain
| | | | | | - Miguel Angel Ferrer
- Rheumatology Unit, Hospital Universitario Virgen de las Nieves, Granada, Spain
| | - Rafael Caliz
- Rheumatology Unit, Hospital Universitario Virgen de las Nieves, Granada, Spain
| | - Lara Valor
- Rheumatology Unit, Hospital General Universitario Gregorio Marañón, Madrid, Spain
| | - Javier Narvaez
- Department of Rheumatology, Hospital Universitario de Bellvitge, Barcelona, Spain
| | - Juan D. Cañete
- Arthritis Unit, Rheumatology Dpt, Hospital Clinic and IDIBAPS, Barcelona, Spain
| | - Maria del Carmen Ordoñez
- Servicio de Reumatología, HRU Carlos Haya, Universidad de Málaga, Instituto de Investigación Biomédica de Málaga (IBIMA), Málaga Spain
| | - Sara Manrique-Arija
- Servicio de Reumatología, HRU Carlos Haya, Universidad de Málaga, Instituto de Investigación Biomédica de Málaga (IBIMA), Málaga Spain
| | - Yiannis Vasilopoulos
- Department of Biochemistry and Biotechnology, University of Thessaly, Larissa, Greece
| | - Alejandro Balsa
- Rheumatology Unit, Instituto de Investigación Sanitaria del Hospital Universitario La Paz (IdiPAZ), Hospital Universitario La Paz, Madrid, Spain
| | - Dora Pascual-Salcedo
- Department of Immunology, Instituto de Investigación Hospital Universitario La Paz (IdiPAZ), Madrid, Spain
| | | | | | | | - Virginia Moreira
- Rheumatology Unit, Hospital Universitario Virgen Macarena, Sevilla, Spain
| | | | - Enrique Raya
- Department of Rheumatology, Hospital Clínico San Cecilio, Granada, Spain
| | - Cesar Magro-Checa
- Department of Rheumatology, Hospital Clínico San Cecilio, Granada, Spain
- Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands
| | - Javier Martin
- Instituto de Parasitología y Biomedicina López-Neyra, CSIC, Granada, Spain
| | - Juan J. Gomez-Reino
- Experimental and Observational Rheumatology and Rheumatology Unit, Instituto de Investigación Sanitaria, Hospital Clínico Universitario de Santiago, Santiago de Compostela, Spain
| | - Antonio Gonzalez
- Experimental and Observational Rheumatology and Rheumatology Unit, Instituto de Investigación Sanitaria, Hospital Clínico Universitario de Santiago, Santiago de Compostela, Spain
- * E-mail:
| |
Collapse
|
11
|
Yilmaz AD, Yazicioğlu D, Tuzuner Oncul MA, Ereş G, Sayan NB. Association of Matrilin-3 Gene Polymorphism with Temporomandibular Joint Internal Derangement. Genet Test Mol Biomarkers 2016; 20:563-568. [PMID: 27533128 DOI: 10.1089/gtmb.2016.0037] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
AIMS Temporomandibular joint internal derangement (TMJ ID) is a multifactorial complex disease characterised by articular disc degeneration. Matrilin-3 is a cartilage and bone-specific adaptor protein, and amino-acid substitutions in the protein are associated with skeletal diseases and joint disorders. We aimed to detect the variants of Matrilin-3 gene (MATN3) in a TMJ ID case-control group and to investigate the risk association of the detected variants with TMJ ID. MATERIALS AND METHODS A case control study was conducted consisting of 57 unrelated TMJ ID patients (32.7 ± 8.2) and 96 unrelated healthy controls (26.63 ± 3.05) without TMJ ID to look for associations with variants of the MATN3 gene. DNA from individual subjects was extracted and each of the eight exons was amplified by polymerase chain reaction using and analyzed by single-strand conformational polymorphism (SSCP) analysis. SSCP variants were subjected to DNA sequence analysis, which yielded band pattern variations in exon 2 of the gene. We further analyzed exon 2 by DNA sequencing to determine the sequence of these variants. RESULTS We identified SSCP band patterns variants in exon 2 of the MATN3 gene which upon sequencing revealed a single C to T transition mutation (rs28598872) c.447 C>T (g.11608 C>T). This polymorphism is predicted to result in a synonymous mutation (pAla149 = ). The TT and CT genotypes were more prevalent than the CC genotype in TMJ ID patients compared to the control group with a risk factor of 2.12 (confidence intervals [CI] :0.88-5.08) and 2.0 (CI:0.726-5.508). In addition, TMJ ID patients were divided into two groups as anterior disc displacement with reduction (ADDWR) and anterior disc displacement without reduction (ADDWOR) and compared with the controls. The TT and CT genotypes were more prevalent than the CC genotype in ADDWR patients compared to the control group with a risk factor of 3.85 (CI:0.927-16.048) and 3.75 (1.02-13.786), respectively. We found that, among ADDWR patients, the T allele is a risk factor both in homozygous and heterozygous carriers (p < 0.052, p < 0.036). CONCLUSION The results of the study indicate a potential role for the MATN3 rs28598872 polymorphism in the pathogenesis of TMJ ID.
Collapse
Affiliation(s)
- Ayça Dilara Yilmaz
- 1 Molecular Biology Laboratory, Faculty of Dentistry, Ankara University , Ankara, Turkey
| | - Duygu Yazicioğlu
- 2 Private Practice , Oral and Maxillofacial Surgery, Ankara, Turkey
| | - Mine Aysegul Tuzuner Oncul
- 3 Department of Oral and Maxillofacial Surgery, Faculty of Dentistry, Ankara University , Ankara, Turkey
| | - Gülden Ereş
- 4 Department of Periodontology, Faculty of Dentistry, Ankara University , Ankara, Turkey
| | - Nejat Bora Sayan
- 2 Private Practice , Oral and Maxillofacial Surgery, Ankara, Turkey
| |
Collapse
|
12
|
Schrodi SJ. The Use of Multiplicity Corrections, Order Statistics and Generalized Family-Wise Statistics with Application to Genome-Wide Studies. PLoS One 2016; 11:e0154472. [PMID: 27128491 PMCID: PMC4851310 DOI: 10.1371/journal.pone.0154472] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2015] [Accepted: 04/14/2016] [Indexed: 01/02/2023] Open
Abstract
The most important decision faced by large-scale studies, such as those presently encountered in human genetics, is to distinguish between those tests that are true positives from those that are not. In the context of genetics, this entails the determination of genetic markers that actually underlie medically-relevant phenotypes from a vast number of makers typically interrogated in genome-wide studies. A critical part of these decisions relies on the appropriate statistical assessment of data obtained from tests across numerous markers. Several methods have been developed to aid with such analyses, with family-wise approaches, such as the Bonferroni and Dunn-Šidàk corrections, being popular. Conditions that motivate the use of family-wise corrections are explored. Although simple to implement, one major limitation of these approaches is that they assume that p-values are i.i.d. uniformly distributed under the null hypothesis. However, several factors may violate this assumption in genome-wide studies including effects from confounding by population stratification, the presence of related individuals, the correlational structure among genetic markers, and the use of limiting distributions for test statistics. Even after adjustment for such effects, the distribution of p-values can substantially depart from a uniform distribution under the null hypothesis. In this work, I present a decision theory for the use of family-wise corrections for multiplicity and a generalization of the Dunn-Šidàk correction that relaxes the assumption of uniformly-distributed null p-values. The independence assumption is also relaxed and handled through calculating the effective number of independent tests. I also explicitly show the relationship between order statistics and family-wise correction procedures. This generalization may be applicable to multiplicity problems outside of genomics.
Collapse
Affiliation(s)
- Steven J. Schrodi
- Center for Human Genetics, Marshfield Clinic Research Foundation, Marshfield, WI, United States of America
- Computation and Informatics in Biology and Medicine, University of Wisconsin-Madison, Madison, WI, United States of America
- * E-mail:
| |
Collapse
|
13
|
König IR, Auerbach J, Gola D, Held E, Holzinger ER, Legault MA, Sun R, Tintle N, Yang HC. Machine learning and data mining in complex genomic data--a review on the lessons learned in Genetic Analysis Workshop 19. BMC Genet 2016; 17 Suppl 2:1. [PMID: 26866367 PMCID: PMC4895282 DOI: 10.1186/s12863-015-0315-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
In the analysis of current genomic data, application of machine learning and data mining techniques has become more attractive given the rising complexity of the projects. As part of the Genetic Analysis Workshop 19, approaches from this domain were explored, mostly motivated from two starting points. First, assuming an underlying structure in the genomic data, data mining might identify this and thus improve downstream association analyses. Second, computational methods for machine learning need to be developed further to efficiently deal with the current wealth of data.In the course of discussing results and experiences from the machine learning and data mining approaches, six common messages were extracted. These depict the current state of these approaches in the application to complex genomic data. Although some challenges remain for future studies, important forward steps were taken in the integration of different data types and the evaluation of the evidence. Mining the data for underlying genetic or phenotypic structure and using this information in subsequent analyses proved to be extremely helpful and is likely to become of even greater use with more complex data sets.
Collapse
Affiliation(s)
- Inke R König
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany.
| | - Jonathan Auerbach
- Department of Statistics, Columbia University, New York, NY, 10027, USA.
| | - Damian Gola
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany.
| | - Elizabeth Held
- Department of Mathematics, Iowa State University, Ames, IA, 50011, USA.
| | - Emily R Holzinger
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD, 21224, USA.
| | - Marc-André Legault
- Université de Montréal, Faculty of Medicine, 2900 Chemin de la Tour, Montreal, QC, H3T 1N8, Canada.
| | - Rui Sun
- Division of Biostatistics, School of Public Health and Primary Care, the Chinese University of Hong Kong, Shatin, Hong Kong SAR.
| | - Nathan Tintle
- Department of Mathematics, Statistics and Computer Science, Dordt College, Sioux Center, IA, 51250, USA.
| | - Hsin-Chou Yang
- Institute of Statistical Science, Academia Sinica, Nankang 115, Taipei, Taiwan.
| |
Collapse
|
14
|
Endler L, Betancourt AJ, Nolte V, Schlötterer C. Reconciling Differences in Pool-GWAS Between Populations: A Case Study of Female Abdominal Pigmentation in Drosophila melanogaster. Genetics 2016; 202:843-55. [PMID: 26715669 PMCID: PMC4788253 DOI: 10.1534/genetics.115.183376] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Accepted: 12/21/2015] [Indexed: 12/16/2022] Open
Abstract
The degree of concordance between populations in the genetic architecture of a given trait is an important issue in medical and evolutionary genetics. Here, we address this problem, using a replicated pooled genome-wide association study approach (Pool-GWAS) to compare the genetic basis of variation in abdominal pigmentation in female European and South African Drosophila melanogaster. We find that, in both the European and the South African flies, variants near the tan and bric-à-brac 1 (bab1) genes are most strongly associated with pigmentation. However, the relative contribution of these loci differs: in the European populations, tan outranks bab1, while the converse is true for the South African flies. Using simulations, we show that this result can be explained parsimoniously, without invoking different causal variants between the populations, by a combination of frequency differences between the two populations and dominance for the causal alleles at the bab1 locus. Our results demonstrate the power of cost-effective, replicated Pool-GWAS to shed light on differences in the genetic architecture of a given trait between populations.
Collapse
Affiliation(s)
- Lukas Endler
- Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Vienna, Austria
| | | | - Viola Nolte
- Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Vienna, Austria
| | | |
Collapse
|
15
|
Halloran JW, Zhu D, Qian DC, Byun J, Gorlova OY, Amos CI, Gorlov IP. Prediction of the gene expression in normal lung tissue by the gene expression in blood. BMC Med Genomics 2015; 8:77. [PMID: 26576671 PMCID: PMC4650316 DOI: 10.1186/s12920-015-0152-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Accepted: 11/10/2015] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Comparative analysis of gene expression in human tissues is important for understanding the molecular mechanisms underlying tissue-specific control of gene expression. It can also open an avenue for using gene expression in blood (which is the most easily accessible human tissue) to predict gene expression in other (less accessible) tissues, which would facilitate the development of novel gene expression based models for assessing disease risk and progression. Until recently, direct comparative analysis across different tissues was not possible due to the scarcity of paired tissue samples from the same individuals. METHODS In this study we used paired whole blood/lung gene expression data from the Genotype-Tissue Expression (GTEx) project. We built a generalized linear regression model for each gene using gene expression in lung as the outcome and gene expression in blood, age and gender as predictors. RESULTS For ~18 % of the genes, gene expression in blood was a significant predictor of gene expression in lung. We found that the number of single nucleotide polymorphisms (SNPs) influencing expression of a given gene in either blood or lung, also known as the number of quantitative trait loci (eQTLs), was positively associated with efficacy of blood-based prediction of that gene's expression in lung. This association was strongest for shared eQTLs: those influencing gene expression in both blood and lung. CONCLUSIONS In conclusion, for a considerable number of human genes, their expression levels in lung can be predicted using observable gene expression in blood. An abundance of shared eQTLs may explain the strong blood/lung correlations in the gene expression.
Collapse
Affiliation(s)
- Justin W Halloran
- Department of Biological Sciences, Dartmouth College, 78 College St., Hanover, NH, 03755, USA.
| | - Dakai Zhu
- Department of Biomedical Data Science, The Geisel School of Medicine, Dartmouth College, HB7937, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA.
| | - David C Qian
- Department of Biomedical Data Science, The Geisel School of Medicine, Dartmouth College, HB7937, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA.
| | - Jinyoung Byun
- Department of Biomedical Data Science, The Geisel School of Medicine, Dartmouth College, HB7937, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA.
| | - Olga Y Gorlova
- Department of Biomedical Data Science, The Geisel School of Medicine, Dartmouth College, HB7937, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA.
| | - Christopher I Amos
- Department of Biomedical Data Science, The Geisel School of Medicine, Dartmouth College, HB7937, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA.
| | - Ivan P Gorlov
- Department of Biomedical Data Science, The Geisel School of Medicine, Dartmouth College, HB7937, One Medical Center Dr., Dartmouth-Hitchcock Medical Center, Lebanon, NH, 03756, USA.
| |
Collapse
|