1
|
Nisar A, Kayani MA, Nasir W, Mehmood A, Ahmed MW, Parvez A, Mahjabeen I. Fyn and Lyn gene polymorphisms impact the risk of thyroid cancer. Mol Genet Genomics 2022; 297:1649-1659. [PMID: 36058999 DOI: 10.1007/s00438-022-01946-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Accepted: 08/11/2022] [Indexed: 10/14/2022]
Abstract
Thyroid cancer is the most common malignancy of the endocrine glands, and during last couple of decades, its incidence has risen alarmingly, across the globe. Etiology of thyroid cancer is still debatable. There are a few worth mentioning risk factors which contribute to initiation of abnormalities in thyroid gland leading to cancer. Genetic instability is major risk factors in thyroid carcinogenesis. Among the genetic factors, the Src family of genes (Src, Yes1, Fyn and Lyn) have been implicated in many cancers but there is little data regarding the association of these (Src, Yes1, Fyn and Lyn) genes with thyroid carcinogenesis. Fyn and Lyn genes of Src family found engaged in proliferation, migration, invasion, angiogenesis, and metastasis in different cancers. This study was planned to examine the effect of Fyn and Lyn SNPs on thyroid cancer risk in Pakistani population in 500 patients and 500 controls. Three polymorphisms of Fyn gene (rs6916861, rs2182644 and rs12910) and three polymorphisms of Lyn gene (rs2668011, rs45587541 and rs45489500) were analyzed using Tetra-primer ARMS-PCR followed by DNA sequencing. SNP rs6916861 of Fyn gene mutant genotype (CC) showed statistically significant threefold increased risk of thyroid cancer (P < 0.0001). In case of rs2182644 of Fyn gene, mutant genotype (AA) indicated statistically significant 17-fold increased risk of thyroid cancer (P < 0.0001). Statistically significant threefold increased risk of thyroid cancer was observed in genotype AC (P < 0.0001) of Fyn gene polymorphism rs12910. In SNP rs2668011 of Lyn gene, TT genotype showed statistically significant threefold increased risk of thyroid cancer (P < 0.0001). In case of rs45587541 of Lyn gene, GA genotypes showed statistically significant 11-fold increased risk in thyroid cancer (P < 0.0001). Haplotype analysis revealed that AAATAG*, AGACAG*, AGCCAA*, AGCCAG*, CAATAG*, CGCCAG* and CGCCGA* haplotypes of Fyn and Lyn polymorphisms are associated with increased thyroid cancer risk. These results showed that genotypes and allele distribution of Fyn and Lyn are significantly linked with increased thyroid cancer risk and could be genetic adjuster for said disease.
Collapse
Affiliation(s)
- Asif Nisar
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan
| | - Mahmood Akhtar Kayani
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan
| | - Wajiha Nasir
- Department of Radiation, Nuclear Oncology Radiation Institute, Islamabad, Pakistan
| | - Azhar Mehmood
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan
| | - Malik Waqar Ahmed
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan.,Pakistan Institute of Rehabilitation Sciences (PIRS), Isra University Islamabad Campus, Islamabad, Pakistan
| | - Aamir Parvez
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan
| | - Ishrat Mahjabeen
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan.
| |
Collapse
|
2
|
Nguyen AP, Nicoletti P, Arnol D, Califano A, Rodríguez Martínez M. Identifying the Potential Mechanism of Action of SNPs Associated With Breast Cancer Susceptibility With GVITamIN. Front Bioeng Biotechnol 2020; 8:798. [PMID: 32850701 PMCID: PMC7417307 DOI: 10.3389/fbioe.2020.00798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Accepted: 06/22/2020] [Indexed: 11/24/2022] Open
Abstract
In the last decade, a large number of genome-wide association studies have uncovered many single-nucleotide polymorphisms (SNPs) that are associated with complex traits and confer susceptibility to diseases, such as cancer. However, so far only a few heritable traits with medium-to-high penetrance have been identified. The vast majority of the discovered variants only leads to disease in combination with other still unknown factors. Furthermore, while many studies aimed to link the effect of SNPs to changes in molecular phenotypes, the analysis has been often focused on testing associations between a single SNP and a transcript, hence disregarding the dysregulation of gene regulatory networks that has been shown to play an essential role in disease onset, notably in cancer. Here we take a systems biology approach and develop GVITamIN (Genetic VarIaTIoN functional analysis tool), a new statistical and computational approach to characterize the effect of a SNP on both genes and transcriptional regulatory programs. GVITamIN exploits a novel statistical approach to combine the usually small effect of disease-susceptibility SNPs, and reveals important potential oncogenic mechanisms, hence taking one step further in the direction of understanding the SNP mechanism of action. We apply GVITamIN on a breast cancer cohort and identify well-known cancer-related transcription factors, such as CTCF, LEF1, and FOXA1, as TFs dysregulated by breast cancer-associated SNPs. Furthermore, our results reveal that SNPs located on the RAD51B gene are significantly associated with an abnormal regulatory activity, suggesting a pivotal role for homologous recombination repair mechanisms in breast cancer.
Collapse
Affiliation(s)
- An-Phi Nguyen
- IBM Research-Zurich, Zurich, Switzerland.,ETH-Zürich, Zurich, Switzerland
| | - Paola Nicoletti
- Herbert Irving Cancer Research Center, Columbia University Medical Center, New York, NY, United States
| | | | - Andrea Califano
- Herbert Irving Cancer Research Center, Columbia University Medical Center, New York, NY, United States.,Department of Systems Biology, Columbia University, New York, NY, United States.,Herbert Irving Comprehensive Cancer Center, Columbia University, New York, NY, United States.,Department of Biomedical Informatics, Columbia University, New York, NY, United States.,Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, United States.,Department of Medicine, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, United States.,J.P. Sulzberger Columbia Genome Center, Columbia University, New York, NY, United States
| | - María Rodríguez Martínez
- Herbert Irving Cancer Research Center, Columbia University Medical Center, New York, NY, United States
| |
Collapse
|
3
|
Emerson MA, Reeder-Hayes KE, Tipaldos HJ, Bell ME, Sweeney MR, Carey LA, Earp HS, Olshan AF, Troester MA. Integrating biology and access to care in addressing breast cancer disparities: 25 years' research experience in the Carolina Breast Cancer Study. CURRENT BREAST CANCER REPORTS 2020; 12:149-160. [PMID: 33815665 DOI: 10.1007/s12609-020-00365-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Purpose of Review To review research on breast cancer mortality disparities, emphasizing research conducted in the Carolina Breast Cancer Study, with a focus on challenges and opportunities for integration of tumor biology and access characteristics across the cancer care continuum. Recent Findings Black women experience higher mortality following breast cancer diagnosis, despite lower incidence compared to white women. Biological factors, such as stage at diagnosis and breast cancer subtypes, play a role in these disparities. Simultaneously, social, behavioral, environmental, and access to care factors are important. However, integrated studies of biology and access are challenging and it is uncommon to have both data types available in the same study population. The central emphasis of Phase 3 of the Carolina Breast Cancer Study, initiated in 2008, was to collect rich data on biology (including germline and tumor genomics and pathology) and health care access in a diverse study population, with the long term goal of defining intervention opportunities to reduce disparities across the cancer care continuum. Summary Early and ongoing research from CBCS has identified important interactions between biology and access, leading to opportunities to build greater equity. However, sample size, population-specific relationships among variables, and complexities of treatment paths along the care continuum pose important research challenges. Interdisciplinary teams, including experts in novel data integration and causal inference, are needed to address gaps in our understanding of breast cancer disparities.
Collapse
Affiliation(s)
- Marc A Emerson
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, NC, USA.,Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Katherine E Reeder-Hayes
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Heather J Tipaldos
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Mary E Bell
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Marina R Sweeney
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, NC, USA
| | - Lisa A Carey
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - H Shelton Earp
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Andrew F Olshan
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, NC, USA.,Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Melissa A Troester
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, NC, USA.,Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
4
|
Maqbool SN, Nazeer HS, Rafiq M, Javed A, Hanif R. Bridging the gap by discerning SNPs in linkage disequilibrium and their role in breast cancer. Gene 2018; 679:44-56. [PMID: 30118891 DOI: 10.1016/j.gene.2018.06.102] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Revised: 06/21/2018] [Accepted: 06/28/2018] [Indexed: 12/22/2022]
Abstract
Breast Cancer is the most common cancer among women with several genes involved in disease susceptibility. As majority of genome-wide significant variants fall outside the coding region, it is likely that some of them alter specific gene functions. GWAS database was used to interpret the regulatory functions of these genetic variants. A total of 320 SNPs for breast cancer were selected via GWAS, which were entered into the SNAP web portal tool, to determine the one's found to be in Linkage Disequilibrium (r2 < 0.80). The resulting 2024 proxy SNP's were processed in RegulomeDB to predict their regulatory role. Of these, 1440 produced a score ranging from 1-6, whereas the remaining produced no data. Only the variants under score 4 (cut-off value) in RegulomeDB has been studied further. From these variants, 221 had scores of less than 4, indicating a high degree of potential regulatory role associated with them. Further study revealed that 61 of the 221 SNPs were reported to be genome-wide significant for breast cancer, 52 to be associated with other diseases, 99 as unconfirmed for association with breast cancer, leaving only 9 to be novel proxy SNPs linked to breast cancer. Therefore, the study further confirmed postulation of non-coding variants being linked to disease risk thereby, requiring additional validation through genome-wide association studies to substantiate their underlying mechanism.
Collapse
Affiliation(s)
- Sundus Naila Maqbool
- Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Sector: H-12, Islamabad 44000, Pakistan
| | - Haleema Saadiya Nazeer
- Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Sector: H-12, Islamabad 44000, Pakistan
| | - Mehak Rafiq
- Research Center for Modeling & Simulation (RCMS), National University of Sciences and Technology, Islamabad, Pakistan
| | - Aneela Javed
- Harvard Medical School, 65 Landsdowne's Street, Cambridge, MA 02139, United States
| | - Rumeza Hanif
- Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Sector: H-12, Islamabad 44000, Pakistan.
| |
Collapse
|
5
|
Lilyquist J, Ruddy KJ, Vachon CM, Couch FJ. Common Genetic Variation and Breast Cancer Risk-Past, Present, and Future. Cancer Epidemiol Biomarkers Prev 2018; 27:380-394. [PMID: 29382703 PMCID: PMC5884707 DOI: 10.1158/1055-9965.epi-17-1144] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 01/05/2018] [Accepted: 01/11/2018] [Indexed: 11/16/2022] Open
Abstract
Breast cancer is the most common cancer among women in the United States, with up to 30% of those diagnosed displaying a family history of breast cancer. To date, 18% of the familial risk of breast cancer can be explained by SNPs. This review summarizes the discovery of risk-associated SNPs using candidate gene and genome-wide association studies (GWAS), including discovery and replication in large collaborative efforts such as The Collaborative Oncologic Gene-environment Study and OncoArray. We discuss the evolution of GWAS studies, efforts to discover additional SNPs, and methods for identifying causal variants. We summarize findings associated with overall breast cancer, pathologic subtypes, and mutation carriers (BRCA1, BRCA2, and CHEK2). In addition, we summarize the development of polygenic risk scores (PRS) using the risk-associated SNPs and show how PRS can contribute to estimation of individual risks for developing breast cancer. Cancer Epidemiol Biomarkers Prev; 27(4); 380-94. ©2018 AACRSee all articles in this CEBP Focus section, "Genome-Wide Association Studies in Cancer."
Collapse
Affiliation(s)
- Jenna Lilyquist
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota
| | | | - Celine M Vachon
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota
| | - Fergus J Couch
- Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota.
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota
| |
Collapse
|
6
|
Accelerating Wright-Fisher Forward Simulations on the Graphics Processing Unit. G3-GENES GENOMES GENETICS 2017; 7:3229-3236. [PMID: 28768689 PMCID: PMC5592947 DOI: 10.1534/g3.117.300103] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Forward Wright–Fisher simulations are powerful in their ability to model complex demography and selection scenarios, but suffer from slow execution on the Central Processor Unit (CPU), thus limiting their usefulness. However, the single-locus Wright–Fisher forward algorithm is exceedingly parallelizable, with many steps that are so-called “embarrassingly parallel,” consisting of a vast number of individual computations that are all independent of each other and thus capable of being performed concurrently. The rise of modern Graphics Processing Units (GPUs) and programming languages designed to leverage the inherent parallel nature of these processors have allowed researchers to dramatically speed up many programs that have such high arithmetic intensity and intrinsic concurrency. The presented GPU Optimized Wright–Fisher simulation, or “GO Fish” for short, can be used to simulate arbitrary selection and demographic scenarios while running over 250-fold faster than its serial counterpart on the CPU. Even modest GPU hardware can achieve an impressive speedup of over two orders of magnitude. With simulations so accelerated, one can not only do quick parametric bootstrapping of previously estimated parameters, but also use simulated results to calculate the likelihoods and summary statistics of demographic and selection models against real polymorphism data, all without restricting the demographic and selection scenarios that can be modeled or requiring approximations to the single-locus forward algorithm for efficiency. Further, as many of the parallel programming techniques used in this simulation can be applied to other computationally intensive algorithms important in population genetics, GO Fish serves as an exciting template for future research into accelerating computation in evolution. GO Fish is part of the Parallel PopGen Package available at: http://dl42.github.io/ParallelPopGen/.
Collapse
|
7
|
Liu Y, Walavalkar NM, Dozmorov MG, Rich SS, Civelek M, Guertin MJ. Identification of breast cancer associated variants that modulate transcription factor binding. PLoS Genet 2017; 13:e1006761. [PMID: 28957321 PMCID: PMC5619690 DOI: 10.1371/journal.pgen.1006761] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Accepted: 04/12/2017] [Indexed: 01/11/2023] Open
Abstract
Genome-wide association studies (GWAS) have discovered thousands loci associated with disease risk and quantitative traits, yet most of the variants responsible for risk remain uncharacterized. The majority of GWAS-identified loci are enriched for non-coding single-nucleotide polymorphisms (SNPs) and defining the molecular mechanism of risk is challenging. Many non-coding causal SNPs are hypothesized to alter transcription factor (TF) binding sites as the mechanism by which they affect organismal phenotypes. We employed an integrative genomics approach to identify candidate TF binding motifs that confer breast cancer-specific phenotypes identified by GWAS. We performed de novo motif analysis of regulatory elements, analyzed evolutionary conservation of identified motifs, and assayed TF footprinting data to identify sequence elements that recruit TFs and maintain chromatin landscape in breast cancer-relevant tissue and cell lines. We identified candidate causal SNPs that are predicted to alter TF binding within breast cancer-relevant regulatory regions that are in strong linkage disequilibrium with significantly associated GWAS SNPs. We confirm that the TFs bind with predicted allele-specific preferences using CTCF ChIP-seq data. We used The Cancer Genome Atlas breast cancer patient data to identify ANKLE1 and ZNF404 as the target genes of candidate TF binding site SNPs in the 19p13.11 and 19q13.31 GWAS-identified loci. These SNPs are associated with the expression of ZNF404 and ANKLE1 in breast tissue. This integrative analysis pipeline is a general framework to identify candidate causal variants within regulatory regions and TF binding sites that confer phenotypic variation and disease risk.
Collapse
Affiliation(s)
- Yunxian Liu
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia, United States of America
| | - Ninad M. Walavalkar
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia, United States of America
| | - Mikhail G. Dozmorov
- Department of Biostatistics, Virginia Commonwealth University, Richmond, Virginia, United States of America
| | - Stephen S. Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia, United States of America
| | - Mete Civelek
- Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia, United States of America
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United Statess of America
| | - Michael J. Guertin
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia, United States of America
- Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia, United States of America
| |
Collapse
|
8
|
Shi M, O'Brien KM, Sandler DP, Taylor JA, Zaykin DV, Weinberg CR. Previous GWAS hits in relation to young-onset breast cancer. Breast Cancer Res Treat 2017; 161:333-344. [PMID: 27848153 PMCID: PMC5226879 DOI: 10.1007/s10549-016-4053-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Accepted: 11/09/2016] [Indexed: 12/17/2022]
Abstract
PURPOSE Genome-wide association studies (GWAS) have identified dozens of single-nucleotide polymorphisms (SNPs) associated with breast cancer. Few studies focused on young-onset breast cancer, which exhibits etiologic and tumor-type differences from older-onset disease. Possible confounding by prenatal effects of the maternal genome has also not been considered. METHODS Using a family-based design for breast cancer before age 50, we assessed the relationship between breast cancer and 77 GWAS-identified breast cancer risk SNPs. We estimated relative risks (RR) for inherited and maternally mediated genetic effects. We also used published RR estimates to calculate genetic risk scores and model joint effects. RESULTS Seventeen of the candidate SNPs were nominally associated with young-onset breast cancer in our 1296 non-Hispanic white affected families (uncorrected p value <0.05). Top-ranked SNPs included rs3803662-A (TOX3, RR = 1.39; p = 7.0 × 10-6), rs12662670-G (ESR1, RR = 1.56; p = 5.7 × 10-4), rs2981579-A (FGFR2, RR = 1.24; p = 0.002), and rs999737-G (RAD51B, RR = 1.37; p = 0.003). No maternally mediated effects were found. A risk score based on all 77 SNPs indicated that their overall relationship to young-onset breast cancer risk was more than additive (additive-fit p = 2.2 × 10-7) and consistent with a multiplicative joint effect (multiplicative-fit p = 0.27). With the multiplicative formulation, the case sister's genetic risk score exceeded that of her unaffected sister in 59% of families. CONCLUSIONS The results of this family-based study indicate that no effects of previously identified risk SNPs were explained by prenatal effects of maternal variants. Many of the known breast cancer risk variants were associated with young-onset breast cancer, with evidence that TOX3, ESR1, FGFR2, and RAD51B are important for young-onset disease.
Collapse
Affiliation(s)
- Min Shi
- Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, 111 TW Alexander Dr, Research Triangle Park, Durham, NC, 27709, USA
| | - Katie M O'Brien
- Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, 111 TW Alexander Dr, Research Triangle Park, Durham, NC, 27709, USA
| | - Dale P Sandler
- Epidemiology Branch, National Institute of Environmental Health Sciences, 111 TW Alexander Fr, Research Triangle Park, Durham, NC, 27709, USA
| | - Jack A Taylor
- Epidemiology Branch, National Institute of Environmental Health Sciences, 111 TW Alexander Fr, Research Triangle Park, Durham, NC, 27709, USA
| | - Dmitri V Zaykin
- Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, 111 TW Alexander Dr, Research Triangle Park, Durham, NC, 27709, USA
| | - Clarice R Weinberg
- Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, 111 TW Alexander Dr, Research Triangle Park, Durham, NC, 27709, USA.
| |
Collapse
|
9
|
Zhu Q, Shepherd L, Lunetta KL, Yao S, Liu Q, Hu Q, Haddad SA, Sucheston-Campbell L, Bensen JT, Bandera EV, Rosenberg L, Liu S, Haiman CA, Olshan AF, Palmer JR, Ambrosone CB. Trans-ethnic follow-up of breast cancer GWAS hits using the preferential linkage disequilibrium approach. Oncotarget 2016; 7:83160-83176. [PMID: 27825120 PMCID: PMC5341253 DOI: 10.18632/oncotarget.13075] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2016] [Accepted: 10/12/2016] [Indexed: 12/22/2022] Open
Abstract
Leveraging population-distinct linkage equilibrium (LD) patterns, trans-ethnic follow-up of variants discovered from genome-wide association studies (GWAS) has proved to be useful in facilitating the identification of bona fide causal variants. We previously developed the preferential LD approach, a novel method that successfully identified causal variants driving the GWAS signals within European-descent populations even when the causal variants were only weakly linked with the GWAS-discovered variants. To evaluate the performance of our approach in a trans-ethnic setting, we applied it to follow up breast cancer GWAS hits identified mostly from populations of European ancestry in African Americans (AA). We evaluated 74 breast cancer GWAS variants in 8,315 AA women from the African American Breast Cancer Epidemiology and Risk (AMBER) consortium. Only 27% of them were associated with breast cancer risk at significance level α=0.05, suggesting race-specificity of the identified breast cancer risk loci. We followed up on those replicated GWAS hits in the AMBER consortium utilizing the preferential LD approach, to search for causal variants or better breast cancer markers from the 1000 Genomes variant catalog. Our approach identified stronger breast cancer markers for 80% of the GWAS hits with at least nominal breast cancer association, and in 81% of these cases, the marker identified was among the top 10 of all 1000 Genomes variants in the corresponding locus. The results support trans-ethnic application of the preferential LD approach in search for candidate causal variants, and may have implications for future genetic research of breast cancer in AA women.
Collapse
Affiliation(s)
- Qianqian Zhu
- Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Lori Shepherd
- Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Kathryn L. Lunetta
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Song Yao
- Department of Cancer Prevention and Control, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Qian Liu
- Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Qiang Hu
- Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | | | - Lara Sucheston-Campbell
- Department of Cancer Prevention and Control, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Jeannette T. Bensen
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Elisa V. Bandera
- Cancer Prevention and Control, Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA
| | - Lynn Rosenberg
- Slone Epidemiology Center, Boston University, Boston, MA, USA
| | - Song Liu
- Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Christopher A. Haiman
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California/Norris Comprehensive Cancer Center, Los Angeles, CA, USA
| | - Andrew F. Olshan
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Julie R. Palmer
- Slone Epidemiology Center, Boston University, Boston, MA, USA
| | - Christine B. Ambrosone
- Department of Cancer Prevention and Control, Roswell Park Cancer Institute, Buffalo, NY, USA
| |
Collapse
|
10
|
Qian F, Feng Y, Zheng Y, Ogundiran TO, Ojengbede O, Zheng W, Blot W, Ambrosone CB, John EM, Bernstein L, Hu JJ, Ziegler RG, Nyante S, Bandera EV, Ingles SA, Press MF, Nathanson KL, Hennis A, Nemesure B, Ambs S, Kolonel LN, Olopade OI, Haiman CA, Huo D. Genetic variants in microRNA and microRNA biogenesis pathway genes and breast cancer risk among women of African ancestry. Hum Genet 2016; 135:1145-59. [PMID: 27380242 DOI: 10.1007/s00439-016-1707-1] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Accepted: 06/25/2016] [Indexed: 10/21/2022]
Abstract
MicroRNAs (miRNA) regulate breast biology by binding to specific RNA sequences, leading to RNA degradation and inhibition of translation of their target genes. While germline genetic variations may disrupt some of these interactions between miRNAs and their targets, studies assessing the relationship between genetic variations in the miRNA network and breast cancer risk are still limited, particularly among women of African ancestry. We systematically put together a list of 822 and 10,468 genetic variants among primary miRNA sequences and 38 genes in the miRNA biogenesis pathway, respectively; and examined their association with breast cancer risk in the ROOT consortium which includes women of African ancestry. Findings were replicated in an independent consortium. Logistic regression was used to estimate the odds ratio (OR) and 95 % confidence intervals (CI). For overall breast cancer risk, three single-nucleotide polymorphisms (SNPs) in miRNA biogenesis genes DROSHA rs78393591 (OR = 0.69, 95 % CI: 0.55-0.88, P = 0.003), ESR1 rs523736 (OR = 0.88, 95 % CI: 0.82-0.95, P = 3.99 × 10(-4)), and ZCCHC11 rs114101502 (OR = 1.33, 95 % CI: 1.11-1.59, P = 0.002), and one SNP in primary miRNA sequence (rs116159732 in miR-6826, OR = 0.74, 95 % CI: 0.63-0.89, P = 0.001) were found to have significant associations in both discovery and validation phases. In a subgroup analysis, two SNPs were associated with risk of estrogen receptor (ER)-negative breast cancer, and three SNPs were associated with risk of ER-positive breast cancer. Several variants in miRNA and miRNA biogenesis pathway genes were associated with breast cancer risk. Risk associations varied by ER status, suggesting potential new mechanisms in etiology.
Collapse
Affiliation(s)
- Frank Qian
- Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Ye Feng
- Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA
| | - Yonglan Zheng
- Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Temidayo O Ogundiran
- Department of Surgery, College of Medicine, University of Ibadan, Ibadan, Nigeria
| | - Oladosu Ojengbede
- Center for Population and Reproductive Health, College of Medicine, University of Ibadan, Ibadan, Nigeria
| | - Wei Zheng
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University, Nashville, TN, USA
| | - William Blot
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University, Nashville, TN, USA
| | | | - Esther M John
- Cancer Prevention Institute of California, Fremont, CA, USA.,Department of Health Research and Policy (Epidemiology) and Stanford Cancer Institute, Stanford University School of Medicine Stanford, Stanford, CA, USA
| | - Leslie Bernstein
- Division of Cancer Etiology, Department of Population Science, Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Jennifer J Hu
- Sylvester Comprehensive Cancer Center and Department of Epidemiology and Public Health, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Regina G Ziegler
- Epidemiology and Biostatistics Program, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, DC, USA
| | - Sarah Nyante
- Department of Epidemiology, Gillings School of Global Public Health and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, USA
| | - Elisa V Bandera
- Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA
| | - Sue A Ingles
- Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA
| | - Michael F Press
- Department of Pathology, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA
| | | | - Anselm Hennis
- Chronic Disease Research Centre and Tropical Medicine Research Institute, University of the West Indies, Bridgetown, Barbados
| | - Barbara Nemesure
- Department of Preventive Medicine, State University of New York at Stony Brook, Stony Brook, NY, USA
| | - Stefan Ambs
- Laboratory of Human Carcinogenesis, National Cancer Institute, Bethesda, MD, USA
| | - Laurence N Kolonel
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, USA
| | | | - Christopher A Haiman
- Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA
| | - Dezheng Huo
- Department of Public Health Sciences, University of Chicago, 5841 S. Maryland Ave., MC 2007, Chicago, IL, 60637, USA.
| |
Collapse
|
11
|
Fejerman L, Stern MC, John EM, Torres-Mejía G, Hines LM, Wolff RK, Baumgartner KB, Giuliano AR, Ziv E, Pérez-Stable EJ, Slattery ML. Interaction between common breast cancer susceptibility variants, genetic ancestry, and nongenetic risk factors in Hispanic women. Cancer Epidemiol Biomarkers Prev 2015; 24:1731-8. [PMID: 26364163 DOI: 10.1158/1055-9965.epi-15-0392] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2015] [Accepted: 08/14/2015] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Most genetic variants associated with breast cancer risk have been discovered in women of European ancestry, and only a few genome-wide association studies (GWAS) have been conducted in minority groups. This research disparity persists in post-GWAS gene-environment interaction analyses. We tested the interaction between hormonal and lifestyle risk factors for breast cancer, and ten GWAS-identified SNPs among 2,107 Hispanic women with breast cancer and 2,587 unaffected controls, to gain insight into a previously reported gene by ancestry interaction in this population. METHODS We estimated genetic ancestry with a set of 104 ancestry-informative markers selected to discriminate between Indigenous American and European ancestry. We used logistic regression models to evaluate main effects and interactions. RESULTS We found that the rs13387042-2q35(G/A) SNP was associated with breast cancer risk only among postmenopausal women who never used hormone therapy [per A allele OR: 0.94 (95% confidence intervals, 0.74-1.20), 1.20 (0.94-1.53), and 1.49 (1.28-1.75) for current, former, and never hormone therapy users, respectively, Pinteraction 0.002] and premenopausal women who breastfed >12 months [OR: 1.01 (0.72-1.42), 1.19 (0.98-1.45), and 1.69 (1.26-2.26) for never, <12 months, and >12 months breastfeeding, respectively, Pinteraction 0.014]. CONCLUSIONS The correlation between genetic ancestry, hormone replacement therapy use, and breastfeeding behavior partially explained a previously reported interaction between a breast cancer risk variant and genetic ancestry in Hispanic women. IMPACT These results highlight the importance of understanding the interplay between genetic ancestry, genetics, and nongenetic risk factors and their contribution to breast cancer risk.
Collapse
Affiliation(s)
- Laura Fejerman
- Division of General Internal Medicine, Department of Medicine, Institute for Human Genetics and Helen Diller Family Comprehensive Cancer Center, UCSF, San Francisco, California.
| | - Mariana C Stern
- Department of Preventive Medicine, Norris Comprehensive Cancer Center, Keck School of Medicine of USC, Los Angeles, California
| | - Esther M John
- Cancer Prevention Institute of California, Fremont, California and Department of Health Research and Policy (Epidemiology), and Stanford Cancer Institute, Stanford University School of Medicine, Stanford, California
| | - Gabriela Torres-Mejía
- Instituto Nacional de Salud Pública, Centro de Investigación en Salud Poblacional, Cuernavaca, Morelos, Mexico
| | - Lisa M Hines
- Department of Biology, University of Colorado at Colorado Springs, Colorado Springs, Colorado
| | - Roger K Wolff
- Department of Medicine, University of Utah, Salt Lake City, Utah
| | - Kathy B Baumgartner
- Department of Epidemiology and Population Health, James Graham Brown Cancer Center, University of Louisville, Louisville, Kentucky
| | | | - Elad Ziv
- Division of General Internal Medicine, Department of Medicine, Institute for Human Genetics and Helen Diller Family Comprehensive Cancer Center, UCSF, San Francisco, California
| | - Eliseo J Pérez-Stable
- Division of General Internal Medicine, Department of Medicine, Institute for Human Genetics and Helen Diller Family Comprehensive Cancer Center, UCSF, San Francisco, California
| | | |
Collapse
|
12
|
Peprah E, Xu H, Tekola-Ayele F, Royal CD. Genome-wide association studies in Africans and African Americans: expanding the framework of the genomics of human traits and disease. Public Health Genomics 2014; 18:40-51. [PMID: 25427668 DOI: 10.1159/000367962] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2014] [Accepted: 08/29/2014] [Indexed: 01/11/2023] Open
Abstract
Genomic research is one of the tools for elucidating the pathogenesis of diseases of global health relevance and paving the research dimension to clinical and public health translation. Recent advances in genomic research and technologies have increased our understanding of human diseases, genes associated with these disorders, and the relevant mechanisms. Genome-wide association studies (GWAS) have proliferated since the first studies were published several years ago and have become an important tool in helping researchers comprehend human variation and the role genetic variants play in disease. However, the need to expand the diversity of populations in GWAS has become increasingly apparent as new knowledge is gained about genetic variation. Inclusion of diverse populations in genomic studies is critical to a more complete understanding of human variation and elucidation of the underpinnings of complex diseases. In this review, we summarize the available data on GWAS in recent African ancestry populations within the western hemisphere (i.e. African Americans and peoples of the Caribbean) and continental African populations. Furthermore, we highlight ways in which genomic studies in populations of recent African ancestry have led to advances in the areas of malaria, HIV, prostate cancer, and other diseases. Finally, we discuss the advantages of conducting GWAS in recent African ancestry populations in the context of addressing existing and emerging global health conditions.
Collapse
|
13
|
Li S, Mukherjee B, Taylor JMG, Rice KM, Wen X, Rice JD, Stringham HM, Boehnke M. The role of environmental heterogeneity in meta-analysis of gene-environment interactions with quantitative traits. Genet Epidemiol 2014; 38:416-29. [PMID: 24801060 DOI: 10.1002/gepi.21810] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2013] [Revised: 02/07/2014] [Accepted: 03/25/2014] [Indexed: 11/11/2022]
Abstract
With challenges in data harmonization and environmental heterogeneity across various data sources, meta-analysis of gene-environment interaction studies can often involve subtle statistical issues. In this paper, we study the effect of environmental covariate heterogeneity (within and between cohorts) on two approaches for fixed-effect meta-analysis: the standard inverse-variance weighted meta-analysis and a meta-regression approach. Akin to the results in Simmonds and Higgins (), we obtain analytic efficiency results for both methods under certain assumptions. The relative efficiency of the two methods depends on the ratio of within versus between cohort variability of the environmental covariate. We propose to use an adaptively weighted estimator (AWE), between meta-analysis and meta-regression, for the interaction parameter. The AWE retains full efficiency of the joint analysis using individual level data under certain natural assumptions. Lin and Zeng (2010a, b) showed that a multivariate inverse-variance weighted estimator retains full efficiency as joint analysis using individual level data, if the estimates with full covariance matrices for all the common parameters are pooled across all studies. We show consistency of our work with Lin and Zeng (2010a, b). Without sacrificing much efficiency, the AWE uses only univariate summary statistics from each study, and bypasses issues with sharing individual level data or full covariance matrices across studies. We compare the performance of the methods both analytically and numerically. The methods are illustrated through meta-analysis of interaction between Single Nucleotide Polymorphisms in FTO gene and body mass index on high-density lipoprotein cholesterol data from a set of eight studies of type 2 diabetes.
Collapse
Affiliation(s)
- Shi Li
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America
| | | | | | | | | | | | | | | |
Collapse
|
14
|
Véron A, Blein S, Cox DG. Genome-wide association studies and the clinic: a focus on breast cancer. Biomark Med 2014; 8:287-96. [DOI: 10.2217/bmm.13.121] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Breast cancer is the most frequently diagnosed cancer among women worldwide, and has long been considered to be a genetic disease. A wide range of genetic variants, both rare mutations and more common variants, have been shown to influence breast cancer risk. In particular, recent studies have identified a number of common genetic variants, or single nucleotide polymorphisms, that are associated with breast cancer risk. In this review, we will briefly present the genetic epidemiology of breast cancer, genome-wide association study technology and how this technology may influence breast cancer screening in the clinic.
Collapse
Affiliation(s)
- Amélie Véron
- Université de Lyon, F-69000 Lyon, France
- Université Lyon 1, ISPB, Lyon, F-69622, France
- INSERM U1052, Centre de Recherche en Cancérologie de Lyon, F-69000 Lyon, France
- CNRS UMR5286, Centre de Recherche en Cancérologie de Lyon, F-69000 Lyon, France
- Centre Léon Bérard, F-69008 Lyon, France
| | - Sophie Blein
- Université de Lyon, F-69000 Lyon, France
- Université Lyon 1, ISPB, Lyon, F-69622, France
- INSERM U1052, Centre de Recherche en Cancérologie de Lyon, F-69000 Lyon, France
- CNRS UMR5286, Centre de Recherche en Cancérologie de Lyon, F-69000 Lyon, France
- Centre Léon Bérard, F-69008 Lyon, France
| | | |
Collapse
|
15
|
Taliun D, Gamper J, Pattaro C. Efficient haplotype block recognition of very long and dense genetic sequences. BMC Bioinformatics 2014; 15:10. [PMID: 24423111 PMCID: PMC3898000 DOI: 10.1186/1471-2105-15-10] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Accepted: 12/18/2013] [Indexed: 11/10/2022] Open
Abstract
Background The new sequencing technologies enable to scan very long and dense genetic sequences, obtaining datasets of genetic markers that are an order of magnitude larger than previously available. Such genetic sequences are characterized by common alleles interspersed with multiple rarer alleles. This situation has renewed the interest for the identification of haplotypes carrying the rare risk alleles. However, large scale explorations of the linkage-disequilibrium (LD) pattern to identify haplotype blocks are not easy to perform, because traditional algorithms have at least Θ(n2) time and memory complexity. Results We derived three incremental optimizations of the widely used haplotype block recognition algorithm proposed by Gabriel et al. in 2002. Our most efficient solution, called MIG ++, has only Θ(n) memory complexity and, on a genome-wide scale, it omits >80% of the calculations, which makes it an order of magnitude faster than the original algorithm. Differently from the existing software, the MIG ++ analyzes the LD between SNPs at any distance, avoiding restrictions on the maximal block length. The haplotype block partition of the entire HapMap II CEPH dataset was obtained in 457 hours. By replacing the standard likelihood-based D′ variance estimator with an approximated estimator, the runtime was further improved. While producing a coarser partition, the approximate method allowed to obtain the full-genome haplotype block partition of the entire 1000 Genomes Project CEPH dataset in 44 hours, with no restrictions on allele frequency or long-range correlations. These experiments showed that LD-based haplotype blocks can span more than one million base-pairs in both HapMap II and 1000 Genomes datasets. An application to the North American Rheumatoid Arthritis Consortium (NARAC) dataset shows how the MIG ++ can support genome-wide haplotype association studies. Conclusions The MIG ++ enables to perform LD-based haplotype block recognition on genetic sequences of any length and density. In the new generation sequencing era, this can help identify haplotypes that carry rare variants of interest. The low computational requirements open the possibility to include the haplotype block structure into genome-wide association scans, downstream analyses, and visual interfaces for online genome browsers.
Collapse
Affiliation(s)
- Daniel Taliun
- Center for Biomedicine, European Academy of Bolzano/Bozen (EURAC), Bozen-Bolzano, Italy.
| | | | | |
Collapse
|
16
|
Lee K, Han S, Tark Y, Kim S. Short Reads Phasing to Construct Haplotypes in Genomic Regions That Are Associated with Body Mass Index in Korean Individuals. Genomics Inform 2014; 12:165-70. [PMID: 25705154 PMCID: PMC4330250 DOI: 10.5808/gi.2014.12.4.165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Revised: 11/11/2014] [Accepted: 11/20/2014] [Indexed: 11/20/2022] Open
Abstract
Genome-wide association (GWA) studies have found many important genetic variants that affect various traits. Since these studies are useful to investigate untyped but causal variants using linkage disequilibrium (LD), it would be useful to explore the haplotypes of single-nucleotide polymorphisms (SNPs) within the same LD block of significant associations based on high-density variants from population references. Here, we tried to make a haplotype catalog affecting body mass index (BMI) through an integrative analysis of previously published whole-genome next-generation sequencing (NGS) data of 7 representative Korean individuals and previously known Korean GWA signals. We selected 435 SNPs that were significantly associated with BMI from the GWA analysis and searched 53 LD ranges nearby those SNPs. With the NGS data, the haplotypes were phased within the LDs. A total of 44 possible haplotype blocks for Korean BMI were cataloged. Although the current result constitutes little data, this study provides new insights that may help to identify important haplotypes for traits and low variants nearby significant SNPs. Furthermore, we can build a more comprehensive catalog as a larger dataset becomes available.
Collapse
Affiliation(s)
- Kichan Lee
- Department of Bioinformatics and Life Science, Soongsil University, Seoul 156-743, Korea
| | - Seonggyun Han
- Department of Bioinformatics and Life Science, Soongsil University, Seoul 156-743, Korea
| | - Yeonjeong Tark
- Department of Bioinformatics and Life Science, Soongsil University, Seoul 156-743, Korea
| | - Sangsoo Kim
- Department of Bioinformatics and Life Science, Soongsil University, Seoul 156-743, Korea
| |
Collapse
|