1
|
Simmons SK, Adiconis X, Haywood N, Parker J, Lin Z, Liao Z, Tuncali I, Al’Khafaji AM, Shin A, Jagadeesh K, Gosik K, Gatzen M, Smith JT, El Kodsi DN, Kuras Y, Baecher-Allan C, Serrano GE, Beach TG, Garimella K, Rozenblatt-Rosen O, Regev A, Dong X, Scherzer CR, Levin JZ. Experimental and Computational Methods for Allelic Imbalance Analysis from Single-Nucleus RNA-seq Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.08.13.607784. [PMID: 39185246 PMCID: PMC11343128 DOI: 10.1101/2024.08.13.607784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Single-cell RNA-seq (scRNA-seq) is emerging as a powerful tool for understanding gene function across diverse cells. Recently, this has included the use of allele-specific expression (ASE) analysis to better understand how variation in the human genome affects RNA expression at the single-cell level. We reasoned that because intronic reads are more prevalent in single-nucleus RNA-Seq (snRNA-Seq), and introns are under lower purifying selection and thus enriched for genetic variants, that snRNA-seq should facilitate single-cell analysis of ASE. Here we demonstrate how experimental and computational choices can improve the results of allelic imbalance analysis. We explore how experimental choices, such as RNA source, read length, sequencing depth, genotyping, etc., impact the power of ASE-based methods. We developed a new suite of computational tools to process and analyze scRNA-seq and snRNA-seq for ASE. As hypothesized, we extracted more ASE information from reads in intronic regions than those in exonic regions and show how read length can be set to increase power. Additionally, hybrid selection improved our power to detect allelic imbalance in genes of interest. We also explored methods to recover allele-specific isoform expression levels from both long- and short-read snRNA-seq. To further investigate ASE in the context of human disease, we applied our methods to a Parkinson's disease cohort of 94 individuals and show that ASE analysis had more power than eQTL analysis to identify significant SNP/gene pairs in our direct comparison of the two methods. Overall, we provide an end-to-end experimental and computational approach for future studies.
Collapse
Affiliation(s)
- Sean K. Simmons
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Xian Adiconis
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Nathan Haywood
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jacob Parker
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Stephen & Denise Adams Center for Parkinson’s Disease Research of Yale School of Medicine, New Haven, CT 06510, USA
| | - Zechuan Lin
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Stephen & Denise Adams Center for Parkinson’s Disease Research of Yale School of Medicine, New Haven, CT 06510, USA
| | - Zhixiang Liao
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Precision Neurology Program of Brigham & Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Idil Tuncali
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Precision Neurology Program of Brigham & Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Aziz M. Al’Khafaji
- Broad Clinical Labs, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Asa Shin
- Broad Clinical Labs, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Karthik Jagadeesh
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Kirk Gosik
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Michael Gatzen
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jonathan T. Smith
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Daniel N. El Kodsi
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Precision Neurology Program of Brigham & Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Yuliya Kuras
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Precision Neurology Program of Brigham & Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Clare Baecher-Allan
- Dept. of Neurology, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Geidy E. Serrano
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Banner Sun Health Research Institute, Sun City, AZ 85351, USA
| | - Thomas G. Beach
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Banner Sun Health Research Institute, Sun City, AZ 85351, USA
| | - Kiran Garimella
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Orit Rozenblatt-Rosen
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Present address: Genentech, South San Francisco, CA 94080, USA
| | - Aviv Regev
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Present address: Genentech, South San Francisco, CA 94080, USA
| | - Xianjun Dong
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Stephen & Denise Adams Center for Parkinson’s Disease Research of Yale School of Medicine, New Haven, CT 06510, USA
| | - Clemens R. Scherzer
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Stephen & Denise Adams Center for Parkinson’s Disease Research of Yale School of Medicine, New Haven, CT 06510, USA
| | - Joshua Z. Levin
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD 20815, USA
- Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| |
Collapse
|
2
|
Shao M, Chen K, Zhang S, Tian M, Shen Y, Cao C, Gu N. Multiome-wide Association Studies: Novel Approaches for Understanding Diseases. GENOMICS, PROTEOMICS & BIOINFORMATICS 2024; 22:qzae077. [PMID: 39471467 PMCID: PMC11630051 DOI: 10.1093/gpbjnl/qzae077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 10/06/2024] [Accepted: 10/23/2024] [Indexed: 11/01/2024]
Abstract
The rapid development of multiome (transcriptome, proteome, cistrome, imaging, and regulome)-wide association study methods have opened new avenues for biologists to understand the susceptibility genes underlying complex diseases. Thorough comparisons of these methods are essential for selecting the most appropriate tool for a given research objective. This review provides a detailed categorization and summary of the statistical models, use cases, and advantages of recent multiome-wide association studies. In addition, to illustrate gene-disease association studies based on transcriptome-wide association study (TWAS), we collected 478 disease entries across 22 categories from 235 manually reviewed publications. Our analysis reveals that mental disorders are the most frequently studied diseases by TWAS, indicating its potential to deepen our understanding of the genetic architecture of complex diseases. In summary, this review underscores the importance of multiome-wide association studies in elucidating complex diseases and highlights the significance of selecting the appropriate method for each study.
Collapse
Affiliation(s)
- Mengting Shao
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Kaiyang Chen
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Shuting Zhang
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Min Tian
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Yan Shen
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Chen Cao
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Ning Gu
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
- Nanjing Key Laboratory for Cardiovascular Information and Health Engineering Medicine, Institute of Clinical Medicine, Nanjing Drum Tower Hospital, Medical School, Nanjing University, Nanjing 210093, China
| |
Collapse
|
3
|
Bhattacharyya S, Ay F. Identifying genetic variants associated with chromatin looping and genome function. Nat Commun 2024; 15:8174. [PMID: 39289357 PMCID: PMC11408621 DOI: 10.1038/s41467-024-52296-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 08/30/2024] [Indexed: 09/19/2024] Open
Abstract
Here we present a comprehensive HiChIP dataset on naïve CD4 T cells (nCD4) from 30 donors and identify QTLs that associate with genotype-dependent and/or allele-specific variation of HiChIP contacts defining loops between active regulatory regions (iQTLs). We observe a substantial overlap between iQTLs and previously defined eQTLs and histone QTLs, and an enrichment for fine-mapped QTLs and GWAS variants. Furthermore, we describe a distinct subset of nCD4 iQTLs, for which the significant variation of chromatin contacts in nCD4 are translated into significant eQTL trends in CD4 T cell memory subsets. Finally, we define connectivity-QTLs as iQTLs that are significantly associated with concordant genotype-dependent changes in chromatin contacts over a broad genomic region (e.g., GWAS SNP in the RNASET2 locus). Our results demonstrate the importance of chromatin contacts as a complementary modality for QTL mapping and their power in identifying previously uncharacterized QTLs linked to cell-specific gene expression and connectivity.
Collapse
Affiliation(s)
| | - Ferhat Ay
- La Jolla Institute for Immunology, La Jolla, CA, USA.
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
4
|
Zeng B, Bendl J, Deng C, Lee D, Misir R, Reach SM, Kleopoulos SP, Auluck P, Marenco S, Lewis DA, Haroutunian V, Ahituv N, Fullard JF, Hoffman GE, Roussos P. Genetic regulation of cell type-specific chromatin accessibility shapes brain disease etiology. Science 2024; 384:eadh4265. [PMID: 38781378 DOI: 10.1126/science.adh4265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 12/20/2023] [Indexed: 05/25/2024]
Abstract
Nucleotide variants in cell type-specific gene regulatory elements in the human brain are risk factors for human disease. We measured chromatin accessibility in 1932 aliquots of sorted neurons and non-neurons from 616 human postmortem brains and identified 34,539 open chromatin regions with chromatin accessibility quantitative trait loci (caQTLs). Only 10.4% of caQTLs are shared between neurons and non-neurons, which supports cell type-specific genetic regulation of the brain regulome. Incorporating allele-specific chromatin accessibility improves statistical fine-mapping and refines molecular mechanisms that underlie disease risk. Using massively parallel reporter assays in induced excitatory neurons, we screened 19,893 brain QTLs and identified the functional impact of 476 regulatory variants. Combined, this comprehensive resource captures variation in the human brain regulome and provides insights into disease etiology.
Collapse
Affiliation(s)
- Biao Zeng
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jaroslav Bendl
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Chengyu Deng
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Donghoon Lee
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Ruth Misir
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Sarah M Reach
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Steven P Kleopoulos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Pavan Auluck
- Human Brain Collection Core, National Institute of Mental Health-Intramural Research Program, Bethesda, MD 20892, USA
| | - Stefano Marenco
- Human Brain Collection Core, National Institute of Mental Health-Intramural Research Program, Bethesda, MD 20892, USA
| | - David A Lewis
- Translational Neuroscience Program, Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Vahram Haroutunian
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Mental Illness Research, Education and Clinical Centers, James J. Peters VA Medical Center, Bronx, NY 10468, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94158, USA
| | - John F Fullard
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Gabriel E Hoffman
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Panos Roussos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Mental Illness Research, Education and Clinical Centers, James J. Peters VA Medical Center, Bronx, NY 10468, USA
| |
Collapse
|
5
|
Deshpande D, Chhugani K, Chang Y, Karlsberg A, Loeffler C, Zhang J, Muszyńska A, Munteanu V, Yang H, Rotman J, Tao L, Balliu B, Tseng E, Eskin E, Zhao F, Mohammadi P, P. Łabaj P, Mangul S. RNA-seq data science: From raw data to effective interpretation. Front Genet 2023; 14:997383. [PMID: 36999049 PMCID: PMC10043755 DOI: 10.3389/fgene.2023.997383] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 02/24/2023] [Indexed: 03/14/2023] Open
Abstract
RNA sequencing (RNA-seq) has become an exemplary technology in modern biology and clinical science. Its immense popularity is due in large part to the continuous efforts of the bioinformatics community to develop accurate and scalable computational tools to analyze the enormous amounts of transcriptomic data that it produces. RNA-seq analysis enables genes and their corresponding transcripts to be probed for a variety of purposes, such as detecting novel exons or whole transcripts, assessing expression of genes and alternative transcripts, and studying alternative splicing structure. It can be a challenge, however, to obtain meaningful biological signals from raw RNA-seq data because of the enormous scale of the data as well as the inherent limitations of different sequencing technologies, such as amplification bias or biases of library preparation. The need to overcome these technical challenges has pushed the rapid development of novel computational tools, which have evolved and diversified in accordance with technological advancements, leading to the current myriad of RNA-seq tools. These tools, combined with the diverse computational skill sets of biomedical researchers, help to unlock the full potential of RNA-seq. The purpose of this review is to explain basic concepts in the computational analysis of RNA-seq data and define discipline-specific jargon.
Collapse
Affiliation(s)
- Dhrithi Deshpande
- Department of Pharmacology and Pharmaceutical Sciences, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
| | - Karishma Chhugani
- Department of Pharmacology and Pharmaceutical Sciences, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
| | - Yutong Chang
- Department of Pharmacology and Pharmaceutical Sciences, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
| | - Aaron Karlsberg
- Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
| | - Caitlin Loeffler
- Department of Computer Science, University of California, Los Angeles, CA, United States
| | - Jinyang Zhang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
| | - Agata Muszyńska
- Małopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
- Institute of Automatic Control, Electronics and Computer Science, Silesian University of Technology, Gliwice, Poland
| | - Viorel Munteanu
- Department of Computers, Informatics and Microelectronics, Technical University of Moldova, Chisinau, Moldova
| | - Harry Yang
- Department of Microbiology, Immunology and Molecular Genetics, University of California Los Angeles, Los Angeles, CA, United States
| | - Jeremy Rotman
- Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
| | - Laura Tao
- Department of Computational Medicine, David Geffen School of Medicine at UCLA, CHS, Los Angeles, CA, United States
| | - Brunilda Balliu
- Department of Computational Medicine, David Geffen School of Medicine at UCLA, CHS, Los Angeles, CA, United States
| | | | - Eleazar Eskin
- Department of Computer Science, University of California, Los Angeles, CA, United States
- Department of Computational Medicine, David Geffen School of Medicine at UCLA, CHS, Los Angeles, CA, United States
- Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, United States
| | - Fangqing Zhao
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
- Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Pejman Mohammadi
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, United States
| | - Paweł P. Łabaj
- Małopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
- Department of Biotechnology, Boku University Vienna, Vienna, Austria
| | - Serghei Mangul
- Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, Los Angeles, CA, United States
- Department of Quantitative and Computational Biology, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, CA, United States
- *Correspondence: Serghei Mangul,
| |
Collapse
|
6
|
Zeng B, Bendl J, Deng C, Lee D, Misir R, Reach SM, Kleopoulos SP, Auluck P, Marenco S, Lewis DA, Haroutunian V, Ahituv N, Fullard JF, Hoffman GE, Roussos P. Genetic regulation of cell-type specific chromatin accessibility shapes the etiology of brain diseases. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.02.530826. [PMID: 37090548 PMCID: PMC10120699 DOI: 10.1101/2023.03.02.530826] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Nucleotide variants in cell type-specific gene regulatory elements in the human brain are major risk factors of human disease. We measured chromatin accessibility in sorted neurons and glia from 1,932 samples of human postmortem brain and identified 34,539 open chromatin regions with chromatin accessibility quantitative trait loci (caQTL). Only 10.4% of caQTL are shared between neurons and glia, supporting the cell type specificity of genetic regulation of the brain regulome. Incorporating allele specific chromatin accessibility improves statistical fine-mapping and refines molecular mechanisms underlying disease risk. Using massively parallel reporter assays in induced excitatory neurons, we screened 19,893 brain QTLs, identifying the functional impact of 476 regulatory variants. Combined, this comprehensive resource captures variation in the human brain regulome and provides novel insights into brain disease etiology.
Collapse
Affiliation(s)
- Biao Zeng
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jaroslav Bendl
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Chengyu Deng
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, 94158, USA
| | - Donghoon Lee
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ruth Misir
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Sarah M. Reach
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Steven P. Kleopoulos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Pavan Auluck
- Human Brain Collection Core, National Institute of Mental Health-Intramural Research Program, Bethesda, MD, USA
| | - Stefano Marenco
- Human Brain Collection Core, National Institute of Mental Health-Intramural Research Program, Bethesda, MD, USA
| | - David A. Lewis
- Translational Neuroscience Program, Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Vahram Haroutunian
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mental Illness Research, Education and Clinical Centers, James J. Peters VA Medical Center, Bronx, NY, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, 94158, USA
| | - John F. Fullard
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Gabriel E. Hoffman
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Panos Roussos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mental Illness Research, Education and Clinical Centers, James J. Peters VA Medical Center, Bronx, NY, USA
| |
Collapse
|
7
|
Nassar AH, Abou Alaiwi S, Baca SC, Adib E, Corona RI, Seo JH, Fonseca MAS, Spisak S, El Zarif T, Tisza V, Braun DA, Du H, He M, Flaifel A, Alchoueiry M, Denize T, Matar SG, Acosta A, Shukla S, Hou Y, Steinharter J, Bouchard G, Berchuck JE, O'Connor E, Bell C, Nuzzo PV, Mary Lee GS, Signoretti S, Hirsch MS, Pomerantz M, Henske E, Gusev A, Lawrenson K, Choueiri TK, Kwiatkowski DJ, Freedman ML. Epigenomic charting and functional annotation of risk loci in renal cell carcinoma. Nat Commun 2023; 14:346. [PMID: 36681680 PMCID: PMC9867739 DOI: 10.1038/s41467-023-35833-5] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Accepted: 01/04/2023] [Indexed: 01/22/2023] Open
Abstract
While the mutational and transcriptional landscapes of renal cell carcinoma (RCC) are well-known, the epigenome is poorly understood. We characterize the epigenome of clear cell (ccRCC), papillary (pRCC), and chromophobe RCC (chRCC) by using ChIP-seq, ATAC-Seq, RNA-seq, and SNP arrays. We integrate 153 individual data sets from 42 patients and nominate 50 histology-specific master transcription factors (MTF) to define RCC histologic subtypes, including EPAS1 and ETS-1 in ccRCC, HNF1B in pRCC, and FOXI1 in chRCC. We confirm histology-specific MTFs via immunohistochemistry including a ccRCC-specific TF, BHLHE41. FOXI1 overexpression with knock-down of EPAS1 in the 786-O ccRCC cell line induces transcriptional upregulation of chRCC-specific genes, TFCP2L1, ATP6V0D2, KIT, and INSRR, implicating FOXI1 as a MTF for chRCC. Integrating RCC GWAS risk SNPs with H3K27ac ChIP-seq and ATAC-seq data reveals that risk-variants are significantly enriched in allelically-imbalanced peaks. This epigenomic atlas in primary human samples provides a resource for future investigation.
Collapse
Affiliation(s)
- Amin H Nassar
- Department of Hematology/Oncology, Yale New Haven Hospital, New Haven, CT, 06510, USA
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Sarah Abou Alaiwi
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Sylvan C Baca
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Elio Adib
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Rosario I Corona
- Women's Cancer Research Program at the Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Center for Bioinformatics and Functional Genomics, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Ji-Heui Seo
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Marcos A S Fonseca
- Women's Cancer Research Program at the Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Sandor Spisak
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- The Eli and Edythe L. Broad Institute, Cambridge, MA, 02142, USA
| | - Talal El Zarif
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Viktoria Tisza
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- The Eli and Edythe L. Broad Institute, Cambridge, MA, 02142, USA
| | - David A Braun
- Department of Hematology/Oncology, Yale New Haven Hospital, New Haven, CT, 06510, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- The Eli and Edythe L. Broad Institute, Cambridge, MA, 02142, USA
| | - Heng Du
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA
| | - Monica He
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Abdallah Flaifel
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, 02115, USA
| | - Michel Alchoueiry
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA
| | - Thomas Denize
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, 02115, USA
| | - Sayed G Matar
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, 02115, USA
| | - Andres Acosta
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, 02115, USA
| | - Sachet Shukla
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Translational Immunogenomics Lab, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Yue Hou
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Translational Immunogenomics Lab, Dana-Farber Cancer Institute, Boston, MA, USA
| | - John Steinharter
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Gabrielle Bouchard
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Jacob E Berchuck
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Edward O'Connor
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Connor Bell
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Pier Vitale Nuzzo
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Gwo-Shu Mary Lee
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Sabina Signoretti
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, 02115, USA
| | - Michelle S Hirsch
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, 02115, USA
| | - Mark Pomerantz
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Elizabeth Henske
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Alexander Gusev
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- McGraw/Patterson Center for Population Sciences, Dana-Farber Cancer Institute, Boston, MA, 02115, USA
| | - Kate Lawrenson
- Women's Cancer Research Program at the Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Center for Bioinformatics and Functional Genomics, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Toni K Choueiri
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA.
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
| | - David J Kwiatkowski
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, 02115, USA.
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
| | - Matthew L Freedman
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
- The Eli and Edythe L. Broad Institute, Cambridge, MA, 02142, USA.
| |
Collapse
|
8
|
Baldrighi GN, Nova A, Bernardinelli L, Fazia T. A Pipeline for Phasing and Genotype Imputation on Mixed Human Data (Parents-Offspring Trios and Unrelated Subjects) by Reviewing Current Methods and Software. LIFE (BASEL, SWITZERLAND) 2022; 12:life12122030. [PMID: 36556394 PMCID: PMC9781110 DOI: 10.3390/life12122030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 12/01/2022] [Accepted: 12/02/2022] [Indexed: 12/09/2022]
Abstract
Genotype imputation has become an essential prerequisite when performing association analysis. It is a computational technique that allows us to infer genetic markers that have not been directly genotyped, thereby increasing statistical power in subsequent association studies, which consequently has a crucial impact on the identification of causal variants. Many features need to be considered when choosing the proper algorithm for imputation, including the target sample on which it is performed, i.e., related individuals, unrelated individuals, or both. Problems could arise when dealing with a target sample made up of mixed data, composed of both related and unrelated individuals, especially since the scientific literature on this topic is not sufficiently clear. To shed light on this issue, we examined existing algorithms and software for performing phasing and imputation on mixed human data from SNP arrays, specifically when related subjects belong to trios. By discussing the advantages and limitations of the current algorithms, we identified LD-based methods as being the most suitable for reconstruction of haplotypes in this specific context, and we proposed a feasible pipeline that can be used for imputing genotypes in both phased and unphased human data.
Collapse
|
9
|
Baca SC, Singler C, Zacharia S, Seo JH, Morova T, Hach F, Ding Y, Schwarz T, Huang CCF, Anderson J, Fay AP, Kalita C, Groha S, Pomerantz MM, Wang V, Linder S, Sweeney CJ, Zwart W, Lack NA, Pasaniuc B, Takeda DY, Gusev A, Freedman ML. Genetic determinants of chromatin reveal prostate cancer risk mediated by context-dependent gene regulation. Nat Genet 2022; 54:1364-1375. [PMID: 36071171 PMCID: PMC9784646 DOI: 10.1038/s41588-022-01168-y] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Accepted: 07/19/2022] [Indexed: 12/25/2022]
Abstract
Many genetic variants affect disease risk by altering context-dependent gene regulation. Such variants are difficult to study mechanistically using current methods that link genetic variation to steady-state gene expression levels, such as expression quantitative trait loci (eQTLs). To address this challenge, we developed the cistrome-wide association study (CWAS), a framework for identifying genotypic and allele-specific effects on chromatin that are also associated with disease. In prostate cancer, CWAS identified regulatory elements and androgen receptor-binding sites that explained the association at 52 of 98 known prostate cancer risk loci and discovered 17 additional risk loci. CWAS implicated key developmental transcription factors in prostate cancer risk that are overlooked by eQTL-based approaches due to context-dependent gene regulation. We experimentally validated associations and demonstrated the extensibility of CWAS to additional epigenomic datasets and phenotypes, including response to prostate cancer treatment. CWAS is a powerful and biologically interpretable paradigm for studying variants that influence traits by affecting transcriptional regulation.
Collapse
Affiliation(s)
- Sylvan C. Baca
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA,Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA,The Eli and Edythe L. Broad Institute, Cambridge, MA, USA
| | - Cassandra Singler
- Laboratory of Genitourinary Cancer Pathogenesis, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Soumya Zacharia
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA,Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Ji-Heui Seo
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA,Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Tunc Morova
- Vancouver Prostate Centre University of British Columbia, Vancouver, BC, Canada
| | - Faraz Hach
- Vancouver Prostate Centre University of British Columbia, Vancouver, BC, Canada
| | - Yi Ding
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA
| | - Tommer Schwarz
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA
| | | | - Jacob Anderson
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| | - André P. Fay
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Cynthia Kalita
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA,Division of Genetics, Brigham & Women’s Hospital, Boston, MA, USA
| | - Stefan Groha
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA,The Eli and Edythe L. Broad Institute, Cambridge, MA, USA
| | - Mark M. Pomerantz
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA,Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Victoria Wang
- Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA,Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
| | - Simon Linder
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands,Laboratory of Chemical Biology and Institute for Complex Molecular Systems, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
| | | | - Wilbert Zwart
- Division of Oncogenomics, Oncode Institute, The Netherlands Cancer Institute, Amsterdam, The Netherlands,Laboratory of Chemical Biology and Institute for Complex Molecular Systems, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
| | - Nathan A. Lack
- Vancouver Prostate Centre University of British Columbia, Vancouver, BC, Canada,School of Medicine, Koç University, Istanbul, Turkey
| | - Bogdan Pasaniuc
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA,Department of Computational Medicine, David Geffen School of Medicine at UCLA, Los Angeles, CA USA,Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, USA,Department of Pathology and Laboratory Medicine, David Geffen School of Medicine at UCLA, Los Angeles, CA, USA
| | - David Y. Takeda
- Laboratory of Genitourinary Cancer Pathogenesis, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Alexander Gusev
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA,The Eli and Edythe L. Broad Institute, Cambridge, MA, USA,Division of Genetics, Brigham & Women’s Hospital, Boston, MA, USA,These authors jointly supervised this work. Correspondence should be directed to M.L.F or A.G. ()
| | - Matthew L. Freedman
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA,Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA,The Eli and Edythe L. Broad Institute, Cambridge, MA, USA,These authors jointly supervised this work. Correspondence should be directed to M.L.F or A.G. ()
| |
Collapse
|
10
|
Grishin D, Gusev A. Allelic imbalance of chromatin accessibility in cancer identifies candidate causal risk variants and their mechanisms. Nat Genet 2022; 54:837-849. [PMID: 35697866 PMCID: PMC9886437 DOI: 10.1038/s41588-022-01075-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Accepted: 04/08/2022] [Indexed: 02/02/2023]
Abstract
While many germline cancer risk variants have been identified through genome-wide association studies (GWAS), the mechanisms by which these variants operate remain largely unknown. Here we used 406 cancer ATAC-Seq samples across 23 cancer types to identify 7,262 germline allele-specific accessibility QTLs (as-aQTLs). Cancer as-aQTLs had stronger enrichment for cancer risk heritability (up to 145 fold) than any other functional annotation across seven cancer GWAS. Most cancer as-aQTLs directly altered transcription factor (TF) motifs and exhibited differential TF binding and gene expression in functional screens. To connect as-aQTLs to putative risk mechanisms, we introduced the regulome-wide associations study (RWAS). RWAS identified genetically associated accessible peaks at >70% of known breast and prostate loci and discovered new risk loci in all examined cancer types. Integrating as-aQTL discovery, motif analysis and RWAS identified candidate causal regulatory elements and their probable upstream regulators. Our work establishes cancer as-aQTLs and RWAS analysis as powerful tools to study the genetic architecture of cancer risk.
Collapse
Affiliation(s)
- Dennis Grishin
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Alexander Gusev
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA. .,The Eli and Edythe L. Broad Institute, Cambridge, MA, USA. .,Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA.
| |
Collapse
|
11
|
Recknagel H, Trontelj P. From Cave Dragons to Genomics: Advancements in the Study of Subterranean Tetrapods. Bioscience 2022; 72:254-266. [PMID: 35241972 PMCID: PMC8888124 DOI: 10.1093/biosci/biab117] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Throughout most of the kingdom Animalia, evolutionary transitions from surface life to a life permanently bound to caves and other subterranean habitats have occurred innumerous times. Not so in tetrapods, where a mere 14 cave-obligate species-all plethodontid and proteid salamanders-are known. We discuss why cave tetrapods are so exceptional and why only salamanders have made the transition. Their evolution follows predictable and convergent, albeit independent pathways. Among the many known changes associated with transitions to subterranean life, eye degeneration, starvation resistance, and longevity are especially relevant to human biomedical research. Recently, sequences of salamander genomes have become available opening up genomic research for cave tetrapods. We discuss new genomic methods that can spur our understanding of the evolutionary mechanisms behind convergent phenotypic change, the relative roles of selective and neutral evolution, cryptic species diversity, and data relevant for conservation such as effective population size and demography.
Collapse
Affiliation(s)
- Hans Recknagel
- University of Ljubljana, Slovenia, working, Biotechnical Faculty, Dept. of Biology, Subterranean Biology Lab
| | - Peter Trontelj
- University of Ljubljana, Slovenia, working, Biotechnical Faculty, Dept. of Biology, Subterranean Biology Lab
| |
Collapse
|
12
|
Liang Y, Aguet F, Barbeira AN, Ardlie K, Im HK. A scalable unified framework of total and allele-specific counts for cis-QTL, fine-mapping, and prediction. Nat Commun 2021; 12:1424. [PMID: 33658504 PMCID: PMC7930098 DOI: 10.1038/s41467-021-21592-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 01/29/2021] [Indexed: 01/31/2023] Open
Abstract
Genetic studies of the transcriptome help bridge the gap between genetic variation and phenotypes. To maximize the potential of such studies, efficient methods to identify expression quantitative trait loci (eQTLs) and perform fine-mapping and genetic prediction of gene expression traits are needed. Current methods that leverage both total read counts and allele-specific expression to identify eQTLs are generally computationally intractable for large transcriptomic studies. Here, we describe a unified framework that addresses these needs and is scalable to thousands of samples. Using simulations and data from GTEx, we demonstrate its calibration and performance. For example, mixQTL shows a power gain equivalent to a 29% increase in sample size for genes with sufficient allele-specific read coverage. To showcase the potential of mixQTL, we apply it to 49 GTEx tissues and find 20% additional eQTLs (FDR < 0.05, per tissue) that are significantly more enriched among trait associated variants and candidate cis-regulatory elements comparing to the standard approach.
Collapse
Affiliation(s)
- Yanyu Liang
- Section of Genetic Medicine, The University of Chicago, Chicago, IL, USA.
| | - François Aguet
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Alvaro N Barbeira
- Section of Genetic Medicine, The University of Chicago, Chicago, IL, USA
| | - Kristin Ardlie
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Hae Kyung Im
- Section of Genetic Medicine, The University of Chicago, Chicago, IL, USA.
| |
Collapse
|
13
|
Robles-Espinoza CD, Mohammadi P, Bonilla X, Gutierrez-Arcelus M. Allele-specific expression: applications in cancer and technical considerations. Curr Opin Genet Dev 2021; 66:10-19. [PMID: 33383480 PMCID: PMC7985293 DOI: 10.1016/j.gde.2020.10.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 10/26/2020] [Accepted: 10/31/2020] [Indexed: 11/18/2022]
Abstract
Allele-specific gene expression can influence disease traits. Non-coding germline genetic variants that alter regulatory elements can cause allele-specific gene expression and contribute to cancer susceptibility. In tumors, both somatic copy number alterations and somatic single nucleotide variants have been shown to lead to allele-specific expression of genes, many of which are considered drivers of tumor growth. Here, we review recent studies revealing the pervasive presence of this phenomenon in cancer susceptibility and progression. Furthermore, we underscore the importance of careful experimental design and computational analysis for accurate allelic expression quantification and avoidance of false positives. Finally, we discuss additional methodological challenges encountered in cancer studies and in the burgeoning field of single-cell transcriptomics.
Collapse
Affiliation(s)
- Carla Daniela Robles-Espinoza
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Boulevard Juriquilla 3001, Santiago de Querétaro 76230, Mexico; Wellcome Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, UK
| | - Pejman Mohammadi
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, USA; Scripps Translational Science Institute, The Scripps Research Institute, La Jolla, CA, USA
| | - Ximena Bonilla
- Department of Computer Science, ETH Zurich, Universitätsstr. 6, 8092 Zürich, Switzerland; Swiss Institute of Bioinformatics, Quartier Sorge - Bâtiment Amphipôle, Lausanne 1015, Switzerland; University Hospital Zurich, Rämistrasse 100, 8091 Zürich, Switzerland
| | - Maria Gutierrez-Arcelus
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Division of Rheumatology, Inflammation and Immunity, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA; Division of Immunology, Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, USA.
| |
Collapse
|