1
|
Naqvi S, Kim S, Tabatabaee S, Pampari A, Kundaje A, Pritchard JK, Wysocka J. Transfer learning reveals sequence determinants of the quantitative response to transcription factor dosage. CELL GENOMICS 2025; 5:100780. [PMID: 40020686 PMCID: PMC11960506 DOI: 10.1016/j.xgen.2025.100780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2024] [Revised: 01/29/2025] [Accepted: 01/30/2025] [Indexed: 03/03/2025]
Abstract
Deep learning models have advanced our ability to predict cell-type-specific chromatin patterns from transcription factor (TF) binding motifs, but their application to perturbed contexts remains limited. We applied transfer learning to predict how concentrations of the dosage-sensitive TFs TWIST1 and SOX9 affect regulatory element (RE) chromatin accessibility in facial progenitor cells, achieving near-experimental accuracy. High-affinity motifs that allow for heterotypic TF co-binding and are concentrated at the center of REs buffer against quantitative changes in TF dosage and predict unperturbed accessibility. Conversely, low-affinity or homotypic binding motifs distributed throughout REs drive sensitive responses with minimal impact on unperturbed accessibility. Both buffering and sensitizing features display purifying selection signatures. We validated these sequence features through reporter assays and demonstrated that TF-nucleosome competition can explain low-affinity motifs' sensitizing effects. This combination of transfer learning and quantitative chromatin response measurements provides a novel approach for uncovering additional layers of the cis-regulatory code.
Collapse
Affiliation(s)
- Sahin Naqvi
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Genetics, Stanford University, Stanford, CA 94305, USA; Division of Gastroenterology, Hepatology, and Nutrition, Boston Children's Hospital, Boston, MA 02115, USA; Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA.
| | - Seungsoo Kim
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Saman Tabatabaee
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Anusri Pampari
- Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | - Anshul Kundaje
- Department of Genetics, Stanford University, Stanford, CA 94305, USA; Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | - Jonathan K Pritchard
- Department of Genetics, Stanford University, Stanford, CA 94305, USA; Department of Biology, Stanford University, Stanford, CA 94305, USA.
| | - Joanna Wysocka
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA.
| |
Collapse
|
2
|
Naqvi S, Kim S, Tabatabaee S, Pampari A, Kundaje A, Pritchard JK, Wysocka J. Transfer learning reveals sequence determinants of the quantitative response to transcription factor dosage. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.28.596078. [PMID: 38853998 PMCID: PMC11160683 DOI: 10.1101/2024.05.28.596078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
Deep learning approaches have made significant advances in predicting cell type-specific chromatin patterns from the identity and arrangement of transcription factor (TF) binding motifs. However, most models have been applied in unperturbed contexts, precluding a predictive understanding of how chromatin state responds to TF perturbation. Here, we used transfer learning to train and interpret deep learning models that use DNA sequence to predict, with accuracy approaching experimental reproducibility, how the concentration of two dosage-sensitive TFs (TWIST1, SOX9) affects regulatory element (RE) chromatin accessibility in facial progenitor cells. High-affinity motifs that allow for heterotypic TF co-binding and are concentrated at the center of REs buffer against quantitative changes in TF dosage and strongly predict unperturbed accessibility. In contrast, motifs with low-affinity or homotypic binding distributed throughout REs lead to sensitive responses with minimal contributions to unperturbed accessibility. Both buffering and sensitizing features show signatures of purifying selection. We validated these predictive sequence features using reporter assays and showed that a biophysical model of TF-nucleosome competition can explain the sensitizing effect of low-affinity motifs. Our approach of combining transfer learning and quantitative measurements of the chromatin response to TF dosage therefore represents a powerful method to reveal additional layers of the cis-regulatory code.
Collapse
Affiliation(s)
- Sahin Naqvi
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA
- Department of Genetics, Stanford University, Stanford, California, USA
- Division of Gastroenterology, Hepatology, and Nutrition, Boston Children’s Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
- Lead contact
| | - Seungsoo Kim
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA
- These authors contributed equally
| | - Saman Tabatabaee
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA
- These authors contributed equally
| | - Anusri Pampari
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Anshul Kundaje
- Department of Genetics, Stanford University, Stanford, California, USA
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Jonathan K Pritchard
- Department of Genetics, Stanford University, Stanford, California, USA
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Joanna Wysocka
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA
| |
Collapse
|
3
|
Wiśniewska K, Gaffke L, Żabińska M, Węgrzyn G, Pierzynowska K. Cellular Organelle-Related Transcriptomic Profile Abnormalities in Neuronopathic Types of Mucopolysaccharidosis: A Comparison with Other Neurodegenerative Diseases. Curr Issues Mol Biol 2024; 46:2678-2700. [PMID: 38534785 PMCID: PMC10968730 DOI: 10.3390/cimb46030169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 03/16/2024] [Accepted: 03/18/2024] [Indexed: 03/28/2024] Open
Abstract
Mucopolysaccharidoses (MPS) are a group of diseases caused by mutations in genes encoding lysosomal enzymes that catalyze reactions of glycosaminoglycan (GAG) degradation. As a result, GAGs accumulate in lysosomes, impairing the proper functioning of entire cells and tissues. There are 14 types/subtypes of MPS, which are differentiated by the kind(s) of accumulated GAG(s) and the type of a non-functional lysosomal enzyme. Some of these types (severe forms of MPS types I and II, MPS III, and MPS VII) are characterized by extensive central nervous system disorders. The aim of this work was to identify, using transcriptomic methods, organelle-related genes whose expression levels are changed in neuronopathic types of MPS compared to healthy cells while remaining unchanged in non-neuronopathic types of MPS. The study was conducted with fibroblast lines derived from patients with neuronopathic and non-neuronopathic types of MPS and control (healthy) fibroblasts. Transcriptomic analysis has identified genes related to cellular organelles whose expression is altered. Then, using fluorescence and electron microscopy, we assessed the morphology of selected structures. Our analyses indicated that the genes whose expression is affected in neuronopathic MPS are often associated with the structures or functions of the cell nucleus, endoplasmic reticulum, or Golgi apparatus. Electron microscopic studies confirmed disruptions in the structures of these organelles. Special attention was paid to up-regulated genes, such as PDIA3 and MFGE8, and down-regulated genes, such as ARL6IP6, ABHD5, PDE4DIP, YIPF5, and CLDN11. Of particular interest is also the GM130 (GOLGA2) gene, which encodes golgin A2, which revealed an increased expression in neuronopathic MPS types. We propose to consider the levels of mRNAs of these genes as candidates for biomarkers of neurodegeneration in MPS. These genes may also become potential targets for therapies under development for neurological disorders associated with MPS and candidates for markers of the effectiveness of these therapies. Although fibroblasts rather than nerve cells were used in this study, it is worth noting that potential genetic markers characteristic solely of neurons would be impractical in testing patients, contrary to somatic cells that can be relatively easily obtained from assessed persons.
Collapse
Affiliation(s)
| | | | | | | | - Karolina Pierzynowska
- Department of Molecular Biology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, 80-308 Gdansk, Poland; (K.W.); (L.G.); (M.Ż.); (G.W.)
| |
Collapse
|
4
|
Lin S, Lim B. Multifaceted effects on even-skipped transcriptional dynamics upon Krüppel dosage changes. Development 2024; 151:dev202132. [PMID: 38345298 PMCID: PMC10948998 DOI: 10.1242/dev.202132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 02/08/2024] [Indexed: 03/05/2024]
Abstract
Although fluctuations in transcription factor (TF) dosage are often well tolerated, TF dosage modulation can change the target gene expression dynamics and result in significant non-lethal developmental phenotypes. Using MS2/MCP-mediated quantitative live imaging in early Drosophila embryos, we analyzed how changing levels of the gap gene Krüppel (Kr) affects transcriptional dynamics of the pair-rule gene even-skipped (eve). Halving the Kr dosage leads to a transient posterior expansion of the eve stripe 2 and an anterior shift of stripe 5. Surprisingly, the most significant changes are observed in eve stripes 3 and 4, the enhancers of which do not contain Kr-binding sites. In Kr heterozygous embryos, both stripes 3 and 4 display narrower widths, anteriorly shifted boundaries and reduced mRNA production levels. We show that Kr dosage indirectly affects stripe 3 and 4 dynamics by modulating other gap gene dynamics. We quantitatively correlate moderate body segment phenotypes of Kr heterozygotes with spatiotemporal changes in eve expression. Our results indicate that nonlinear relationships between TF dosage and phenotypes underlie direct TF-DNA and indirect TF-TF interactions.
Collapse
Affiliation(s)
- Shufan Lin
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Bomyi Lim
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
5
|
Zilocchi M, Rahmatbakhsh M, Moutaoufik MT, Broderick K, Gagarinova A, Jessulat M, Phanse S, Aoki H, Aly KA, Babu M. Co-fractionation-mass spectrometry to characterize native mitochondrial protein assemblies in mammalian neurons and brain. Nat Protoc 2023; 18:3918-3973. [PMID: 37985878 DOI: 10.1038/s41596-023-00901-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 08/09/2023] [Indexed: 11/22/2023]
Abstract
Human mitochondrial (mt) protein assemblies are vital for neuronal and brain function, and their alteration contributes to many human disorders, e.g., neurodegenerative diseases resulting from abnormal protein-protein interactions (PPIs). Knowledge of the composition of mt protein complexes is, however, still limited. Affinity purification mass spectrometry (MS) and proximity-dependent biotinylation MS have defined protein partners of some mt proteins, but are too technically challenging and laborious to be practical for analyzing large numbers of samples at the proteome level, e.g., for the study of neuronal or brain-specific mt assemblies, as well as altered mtPPIs on a proteome-wide scale for a disease of interest in brain regions, disease tissues or neurons derived from patients. To address this challenge, we adapted a co-fractionation-MS platform to survey native mt assemblies in adult mouse brain and in human NTERA-2 embryonal carcinoma stem cells or differentiated neuronal-like cells. The workflow consists of orthogonal separations of mt extracts isolated from chemically cross-linked samples to stabilize PPIs, data-dependent acquisition MS to identify co-eluted mt protein profiles from collected fractions and a computational scoring pipeline to predict mtPPIs, followed by network partitioning to define complexes linked to mt functions as well as those essential for neuronal and brain physiological homeostasis. We developed an R/CRAN software package, Macromolecular Assemblies from Co-elution Profiles for automated scoring of co-fractionation-MS data to define complexes from mtPPI networks. Presently, the co-fractionation-MS procedure takes 1.5-3.5 d of proteomic sample preparation, 31 d of MS data acquisition and 8.5 d of data analyses to produce meaningful biological insights.
Collapse
Affiliation(s)
- Mara Zilocchi
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada
| | | | | | - Kirsten Broderick
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada
| | - Alla Gagarinova
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada
- Department of Biology, University of New Brunswick, Fredericton, New Brunswick, Canada
| | - Matthew Jessulat
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada
| | - Sadhna Phanse
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada
| | - Hiroyuki Aoki
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada
| | - Khaled A Aly
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada
| | - Mohan Babu
- Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada.
| |
Collapse
|
6
|
Gupta R, Kanai M, Durham TJ, Tsuo K, McCoy JG, Kotrys AV, Zhou W, Chinnery PF, Karczewski KJ, Calvo SE, Neale BM, Mootha VK. Nuclear genetic control of mtDNA copy number and heteroplasmy in humans. Nature 2023; 620:839-848. [PMID: 37587338 PMCID: PMC10447254 DOI: 10.1038/s41586-023-06426-5] [Citation(s) in RCA: 62] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Accepted: 07/11/2023] [Indexed: 08/18/2023]
Abstract
Mitochondrial DNA (mtDNA) is a maternally inherited, high-copy-number genome required for oxidative phosphorylation1. Heteroplasmy refers to the presence of a mixture of mtDNA alleles in an individual and has been associated with disease and ageing. Mechanisms underlying common variation in human heteroplasmy, and the influence of the nuclear genome on this variation, remain insufficiently explored. Here we quantify mtDNA copy number (mtCN) and heteroplasmy using blood-derived whole-genome sequences from 274,832 individuals and perform genome-wide association studies to identify associated nuclear loci. Following blood cell composition correction, we find that mtCN declines linearly with age and is associated with variants at 92 nuclear loci. We observe that nearly everyone harbours heteroplasmic mtDNA variants obeying two principles: (1) heteroplasmic single nucleotide variants tend to arise somatically and accumulate sharply after the age of 70 years, whereas (2) heteroplasmic indels are maternally inherited as mixtures with relative levels associated with 42 nuclear loci involved in mtDNA replication, maintenance and novel pathways. These loci may act by conferring a replicative advantage to certain mtDNA alleles. As an illustrative example, we identify a length variant carried by more than 50% of humans at position chrM:302 within a G-quadruplex previously proposed to mediate mtDNA transcription/replication switching2,3. We find that this variant exerts cis-acting genetic control over mtDNA abundance and is itself associated in-trans with nuclear loci encoding machinery for this regulatory switch. Our study suggests that common variation in the nuclear genome can shape variation in mtCN and heteroplasmy dynamics across the human population.
Collapse
Affiliation(s)
- Rahul Gupta
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
| | - Masahiro Kanai
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Timothy J Durham
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kristin Tsuo
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Jason G McCoy
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Anna V Kotrys
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Wei Zhou
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Patrick F Chinnery
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, UK
- MRC Mitochondrial Biology Unit, University of Cambridge, Cambridge, UK
| | - Konrad J Karczewski
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Sarah E Calvo
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Benjamin M Neale
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
| | - Vamsi K Mootha
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
7
|
Naqvi S, Kim S, Hoskens H, Matthews HS, Spritz RA, Klein OD, Hallgrímsson B, Swigut T, Claes P, Pritchard JK, Wysocka J. Precise modulation of transcription factor levels identifies features underlying dosage sensitivity. Nat Genet 2023; 55:841-851. [PMID: 37024583 PMCID: PMC10181932 DOI: 10.1038/s41588-023-01366-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Accepted: 03/07/2023] [Indexed: 04/08/2023]
Abstract
Transcriptional regulation exhibits extensive robustness, but human genetics indicates sensitivity to transcription factor (TF) dosage. Reconciling such observations requires quantitative studies of TF dosage effects at trait-relevant ranges, largely lacking so far. TFs play central roles in both normal-range and disease-associated variation in craniofacial morphology; we therefore developed an approach to precisely modulate TF levels in human facial progenitor cells and applied it to SOX9, a TF associated with craniofacial variation and disease (Pierre Robin sequence (PRS)). Most SOX9-dependent regulatory elements (REs) are buffered against small decreases in SOX9 dosage, but REs directly and primarily regulated by SOX9 show heightened sensitivity to SOX9 dosage; these RE responses partially predict gene expression responses. Sensitive REs and genes preferentially affect functional chondrogenesis and PRS-like craniofacial shape variation. We propose that such REs and genes underlie the sensitivity of specific phenotypes to TF dosage, while buffering of other genes leads to robust, nonlinear dosage-to-phenotype relationships.
Collapse
Affiliation(s)
- Sahin Naqvi
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA
- Departments of Genetics and Biology, Stanford University, Stanford, CA, USA
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Seungsoo Kim
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Hanne Hoskens
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- Medical Imaging Research Center, University Hospitals Leuven, Leuven, Belgium
- Department of Cell Biology & Anatomy, Alberta Children's Hospital Research Institute and McCaig Bone and Joint Institute, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Harold S Matthews
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- Medical Imaging Research Center, University Hospitals Leuven, Leuven, Belgium
| | - Richard A Spritz
- Human Medical Genetics and Genomics Program and Department of Pediatrics, University of Colorado School of Medicine, Aurora, CO, USA
| | - Ophir D Klein
- Departments of Orofacial Sciences and Pediatrics, Program in Craniofacial Biology, and Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, USA
- Department of Pediatrics, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Benedikt Hallgrímsson
- Department of Cell Biology & Anatomy, Alberta Children's Hospital Research Institute and McCaig Bone and Joint Institute, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Tomek Swigut
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Peter Claes
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- Medical Imaging Research Center, University Hospitals Leuven, Leuven, Belgium
- Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium
- Murdoch Children's Research Institute, Melbourne, Victoria, Australia
| | | | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA.
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA.
| |
Collapse
|
8
|
Gupta R, Kanai M, Durham TJ, Tsuo K, McCoy JG, Chinnery PF, Karczewski KJ, Calvo SE, Neale BM, Mootha VK. Nuclear genetic control of mtDNA copy number and heteroplasmy in humans. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.01.19.23284696. [PMID: 36711677 PMCID: PMC9882621 DOI: 10.1101/2023.01.19.23284696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Human mitochondria contain a high copy number, maternally transmitted genome (mtDNA) that encodes 13 proteins required for oxidative phosphorylation. Heteroplasmy arises when multiple mtDNA variants co-exist in an individual and can exhibit complex dynamics in disease and in aging. As all proteins involved in mtDNA replication and maintenance are nuclear-encoded, heteroplasmy levels can, in principle, be under nuclear genetic control, however this has never been shown in humans. Here, we develop algorithms to quantify mtDNA copy number (mtCN) and heteroplasmy levels using blood-derived whole genome sequences from 274,832 individuals of diverse ancestry and perform GWAS to identify nuclear loci controlling these traits. After careful correction for blood cell composition, we observe that mtCN declines linearly with age and is associated with 92 independent nuclear genetic loci. We find that nearly every individual carries heteroplasmic variants that obey two key patterns: (1) heteroplasmic single nucleotide variants are somatic mutations that accumulate sharply after age 70, while (2) heteroplasmic indels are maternally transmitted as mtDNA mixtures with resulting levels influenced by 42 independent nuclear loci involved in mtDNA replication, maintenance, and novel pathways. These nuclear loci do not appear to act by mtDNA mutagenesis, but rather, likely act by conferring a replicative advantage to specific mtDNA molecules. As an illustrative example, the most common heteroplasmy we identify is a length variant carried by >50% of humans at position m.302 within a G-quadruplex known to serve as a replication switch. We find that this heteroplasmic variant exerts cis -acting genetic control over mtDNA abundance and is itself under trans -acting genetic control of nuclear loci encoding protein components of this regulatory switch. Our study showcases how nuclear haplotype can privilege the replication of specific mtDNA molecules to shape mtCN and heteroplasmy dynamics in the human population.
Collapse
Affiliation(s)
- Rahul Gupta
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, United States
- Broad Institute of MIT and Harvard, United States
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, United States
| | - Masahiro Kanai
- Broad Institute of MIT and Harvard, United States
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, United States
| | - Timothy J Durham
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, United States
- Broad Institute of MIT and Harvard, United States
| | - Kristin Tsuo
- Broad Institute of MIT and Harvard, United States
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, United States
| | - Jason G McCoy
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, United States
- Broad Institute of MIT and Harvard, United States
| | - Patrick F Chinnery
- Department of Clinical Neurosciences & MRC Mitochondrial Biology Unit, University of Cambridge, United Kingdom
| | - Konrad J Karczewski
- Broad Institute of MIT and Harvard, United States
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, United States
| | - Sarah E Calvo
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, United States
- Broad Institute of MIT and Harvard, United States
| | - Benjamin M Neale
- Broad Institute of MIT and Harvard, United States
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, United States
| | - Vamsi K Mootha
- Howard Hughes Medical Institute and Department of Molecular Biology, Massachusetts General Hospital, United States
- Broad Institute of MIT and Harvard, United States
- Department of Systems Biology, Harvard Medical School, United States
| |
Collapse
|
9
|
Chinnery PF. Precision mitochondrial medicine. CAMBRIDGE PRISMS. PRECISION MEDICINE 2022; 1:e6. [PMID: 38550943 PMCID: PMC10953752 DOI: 10.1017/pcm.2022.8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 09/29/2022] [Accepted: 10/25/2022] [Indexed: 11/06/2024]
Abstract
Mitochondria play a key role in cell homeostasis as a major source of intracellular energy (adenosine triphosphate), and as metabolic hubs regulating many canonical cell processes. Mitochondrial dysfunction has been widely documented in many common diseases, and genetic studies point towards a causal role in the pathogenesis of specific late-onset disorder. Together this makes targeting mitochondrial genes an attractive strategy for precision medicine. However, the genetics of mitochondrial biogenesis is complex, with over 1,100 candidate genes found in two different genomes: the nuclear DNA and mitochondrial DNA (mtDNA). Here, we review the current evidence associating mitochondrial genetic variants with distinct clinical phenotypes, with some having clear therapeutic implications. The strongest evidence has emerged through the investigation of rare inherited mitochondrial disorders, but genome-wide association studies also implicate mtDNA variants in the risk of developing common diseases, opening to door for the incorporation of mitochondrial genetic variant analysis in population disease risk stratification.
Collapse
Affiliation(s)
- Patrick F. Chinnery
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
- Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| |
Collapse
|