1
|
English AC, Dolzhenko E, Ziaei Jam H, McKenzie SK, Olson ND, De Coster W, Park J, Gu B, Wagner J, Eberle MA, Gymrek M, Chaisson MJP, Zook JM, Sedlazeck FJ. Analysis and benchmarking of small and large genomic variants across tandem repeats. Nat Biotechnol 2024:10.1038/s41587-024-02225-z. [PMID: 38671154 DOI: 10.1038/s41587-024-02225-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 03/28/2024] [Indexed: 04/28/2024]
Abstract
Tandem repeats (TRs) are highly polymorphic in the human genome, have thousands of associated molecular traits and are linked to over 60 disease phenotypes. However, they are often excluded from at-scale studies because of challenges with variant calling and representation, as well as a lack of a genome-wide standard. Here, to promote the development of TR methods, we created a catalog of TR regions and explored TR properties across 86 haplotype-resolved long-read human assemblies. We curated variants from the Genome in a Bottle (GIAB) HG002 individual to create a TR dataset to benchmark existing and future TR analysis methods. We also present an improved variant comparison method that handles variants greater than 4 bp in length and varying allelic representation. The 8.1% of the genome covered by the TR catalog holds ~24.9% of variants per individual, including 124,728 small and 17,988 large variants for the GIAB HG002 'truth-set' TR benchmark. We demonstrate the utility of this pipeline across short-read and long-read technologies.
Collapse
Affiliation(s)
- Adam C English
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
| | | | - Helyaneh Ziaei Jam
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
| | | | - Nathan D Olson
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Wouter De Coster
- Applied and Translational Neurogenomics Group, VIB Center for Molecular Neurology, VIB, Antwerp, Belgium
- Applied and Translational Neurogenomics Group, Department of Biomedical Sciences, University of Antwerp, Antwerp, Belgium
| | - Jonghun Park
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
| | - Bida Gu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Justin Wagner
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | | | - Melissa Gymrek
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
- Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Justin M Zook
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
- Department of Computer Science, Rice University, Houston, TX, USA.
| |
Collapse
|
2
|
Tajeddin N, Arabfard M, Alizadeh S, Salesi M, Khamse S, Delbari A, Ohadi M. Novel islands of GGC and GCC repeats coincide with human evolution. Gene 2024; 902:148194. [PMID: 38262548 DOI: 10.1016/j.gene.2024.148194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 10/29/2023] [Accepted: 01/18/2024] [Indexed: 01/25/2024]
Abstract
BACKGROUND Because of high mutation rate, overrepresentation in genic regions, and link with various neurological, neurodegenerative, and movement disorders, GGC and GCC short tandem repeats (STRs) are prone to natural selection. Among a number of lacking data, the 3-repeats of these STRs remain widely unexplored. RESULTS In a genome-wide search in human, here we mapped GGC and GCC STRs of ≥3-repeats, and found novel islands of up to 45 of those STRs, populating spans of 1 to 2 kb of genomic DNA. RGPD4 and NOC4L harbored the densest (GGC)3 (probability 3.09061E-71) and (GCC)3 (probability 1.72376E-61) islands, respectively, and were human-specific. We also found prime instances of directional incremented density of STRs at specific loci in human versus other species, including the FOXK2 and SKI GGC islands. The genes containing those islands significantly diverged in expression in human versus other species, and the proteins encoded by those genes interact closely in a physical interaction network, consequence of which may be human-specific characteristics such as higher order brain functions. CONCLUSION We report novel islands of GGC and GCC STRs of evolutionary relevance to human. The density, and in some instances, periodicity of these islands support them as a novel genomic entity, which need to be further explored in evolutionary, mechanistic, and functional platforms.
Collapse
Affiliation(s)
- N Tajeddin
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Arabfard
- Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - S Alizadeh
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Salesi
- Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - S Khamse
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - A Delbari
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Ohadi
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| |
Collapse
|
3
|
Khamse S, Alizadeh S, Khorshid HRK, Delbari A, Tajeddin N, Ohadi M. A Hypermutable Region in the DISP2 Gene Links to Natural Selection and Late-Onset Neurocognitive Disorders in Humans. Mol Neurobiol 2024:10.1007/s12035-024-04155-y. [PMID: 38565786 DOI: 10.1007/s12035-024-04155-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Accepted: 03/25/2024] [Indexed: 04/04/2024]
Abstract
(CCG) short tandem repeats (STRs) are predominantly enriched in genic regions, mutation hotspots for C to T truncating substitutions, and involved in various neurological and neurodevelopmental disorders. However, intact blocks of this class of STRs are widely overlooked with respect to their link with natural selection. The human neuron-specific gene, DISP2 (dispatched RND transporter family member 2), contains a (CCG) repeat in its 5' untranslated region. Here, we sequenced this STR in a sample of 448 Iranian individuals, consisting of late-onset neurocognitive disorder (NCD) (N = 203) and controls (N = 245). We found that the region spanning the (CCG) repeat was highly mutated, resulting in several flanking (CCG) residues. However, an 8-repeat of the (CCG) repeat was predominantly abundant (frequency = 0.92) across the two groups. While the overall distribution of genotypes was not different between the two groups (p > 0.05), we detected four genotypes in the NCD group only (2% of the NCD genotypes, Mid-p = 0.02), consisting of extreme short alleles, 5- and 6-repeats, that were not detected in the control group. The patients harboring those genotypes received the diagnoses of probable Alzheimer's disease and vascular dementia. We also found six genotypes in the control group only (2.5% of the control genotypes, Mid-p = 0.01) that consisted of the 8-repeat and extreme long alleles, 9- and 10-repeats, of which the 10-repeat was not detected in the NCD group. The (CCG) repeat specifically expanded in primates. In conclusion, we report an indication of natural selection at a novel hypermutable region in the human genome and divergent alleles and genotypes in late-onset NhCDs and controls. These findings reinforce the hypothesis that a collection of rare alleles and genotypes in a number of genes may unambiguously contribute to the cognition impairment component of late-onset NCDs.
Collapse
Affiliation(s)
- S Khamse
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - S Alizadeh
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - H R Khorram Khorshid
- Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran
| | - A Delbari
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| | - N Tajeddin
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Department of Biology, Central Tehran Branch, Islamic Azad University, Tehran, Iran
| | - M Ohadi
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| |
Collapse
|
4
|
Arabfard M, Tajeddin N, Alizadeh S, Salesi M, Bayat H, Khorram Khorshid HR, Khamse S, Delbari A, Ohadi M. Dyads of GGC and GCC form hotspot colonies that coincide with the evolution of human and other great apes. BMC Genom Data 2024; 25:21. [PMID: 38383300 PMCID: PMC10880355 DOI: 10.1186/s12863-024-01207-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 02/11/2024] [Indexed: 02/23/2024] Open
Abstract
BACKGROUND GGC and GCC short tandem repeats (STRs) are of various evolutionary, biological, and pathological implications. However, the fundamental two-repeats (dyads) of these STRs are widely unexplored. RESULTS On a genome-wide scale, we mapped (GGC)2 and (GCC)2 dyads in human, and found monumental colonies (distance between each dyad < 500 bp) of extraordinary density, and in some instances periodicity. The largest (GCC)2 and (GGC)2 colonies were intergenic, homogeneous, and human-specific, consisting of 219 (GCC)2 on chromosome 2 (probability < 1.545E-219) and 70 (GGC)2 on chromosome 9 (probability = 1.809E-148). We also found that several colonies were shared in other great apes, and directionally increased in density and complexity in human, such as a colony of 99 (GCC)2 on chromosome 20, that specifically expanded in great apes, and reached maximum complexity in human (probability 1.545E-220). Numerous other colonies of evolutionary relevance in human were detected in other largely overlooked regions of the genome, such as chromosome Y and pseudogenes. Several of the genes containing or nearest to those colonies were divergently expressed in human. CONCLUSION In conclusion, (GCC)2 and (GGC)2 form unprecedented genomic colonies that coincide with the evolution of human and other great apes. The extent of the genomic rearrangements leading to those colonies support overlooked recombination hotspots, shared across great apes. The identified colonies deserve to be studied in mechanistic, evolutionary, and functional platforms.
Collapse
Affiliation(s)
- M Arabfard
- Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - N Tajeddin
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Department of Biology, Central Tehran Branch, Islamic Azad University, Tehran, Iran
| | - S Alizadeh
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Salesi
- Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
- Research Center for Prevention of Oral and Dental Diseases, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - H Bayat
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - H R Khorram Khorshid
- Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran
| | - S Khamse
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - A Delbari
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Ohadi
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| |
Collapse
|
5
|
Alizadeh S, Khamse S, Tajeddin N, Khorram Khorshid HR, Delbari A, Ohadi M. A GCC repeat in RAB26 undergoes natural selection in human and harbors divergent genotypes in late-onset Alzheimer's disease. Gene 2024; 893:147968. [PMID: 37931854 DOI: 10.1016/j.gene.2023.147968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Revised: 10/28/2023] [Accepted: 11/03/2023] [Indexed: 11/08/2023]
Abstract
Although mainly located in genic regions and being mutation hotspots, intact blocks of CG-rich trinucleotide short tandem repeats (STRs) are largely overlooked with respect to their link with natural selection. The human RAB26 (member RAS oncogene family) directs synaptic and secretory vesicles into preautophagosomal structures, inhibition of which specifically disrupts axonal transport of degradative organelles and leads to an axonal dystrophy, resembling Alzheimer's disease (AD). Human RAB26 contains a GCC repeat in the top 1st percent in respect of length. Here we sequenced this STR in 441 Iranian individuals, consisting of late-onset neurocognitive disorder (NCD) (N = 216) and controls (N = 225). In both groups, the 12-repeat allele and the 12/12 genotype were predominantly abundant. We found excess of homozygosity for non-12 alleles in the NCD group (Mid-P exact = 0.027). Furthermore, divergent genotypes were detected that were specific to the NCD group (2.8% of genotypes) (Mid-P exact = 0.006) or controls (3.1% of genotypes) (Mid-P exact = 0.004). The patients harboring divergent genotypes received the diagnosis of AD. Based on the predominant abundance of the 12-repeat and 12/12 genotype in both groups, excess of non-12 homozygosity in the NCD group, and divergent genotypes across the NCD and control groups, we propose natural selection at this locus and link with late-onset AD. Our findings strengthen the hypothesis that a collection of rare genotypes unambiguously contribute to the pathogenesis of late-onset NCDs, such as AD.
Collapse
Affiliation(s)
- S Alizadeh
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - S Khamse
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - N Tajeddin
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - H R Khorram Khorshid
- Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran
| | - A Delbari
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| | - M Ohadi
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| |
Collapse
|
6
|
Rafehi H, Bennett MF, Bahlo M. Detection and discovery of repeat expansions in ataxia enabled by next-generation sequencing: present and future. Emerg Top Life Sci 2023; 7:349-359. [PMID: 37733280 PMCID: PMC10754322 DOI: 10.1042/etls20230018] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 08/29/2023] [Accepted: 09/12/2023] [Indexed: 09/22/2023]
Abstract
Hereditary cerebellar ataxias are a heterogenous group of progressive neurological disorders that are disproportionately caused by repeat expansions (REs) of short tandem repeats (STRs). Genetic diagnosis for RE disorders such as ataxias are difficult as the current gold standard for diagnosis is repeat-primed PCR assays or Southern blots, neither of which are scalable nor readily available for all STR loci. In the last five years, significant advances have been made in our ability to detect STRs and REs in short-read sequencing data, especially whole-genome sequencing. Given the increasing reliance of genomics in diagnosis of rare diseases, the use of established RE detection pipelines for RE disorders is now a highly feasible and practical first-step alternative to molecular testing methods. In addition, many new pathogenic REs have been discovered in recent years by utilising WGS data. Collectively, genomes are an important resource/platform for further advancements in both the discovery and diagnosis of REs that cause ataxia and will lead to much needed improvement in diagnostic rates for patients with hereditary ataxia.
Collapse
Affiliation(s)
- Haloom Rafehi
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia
- Department of Medical Biology, University of Melbourne, Parkville, VIC, Australia
| | - Mark F Bennett
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia
- Department of Medical Biology, University of Melbourne, Parkville, VIC, Australia
- Epilepsy Research Centre, Department of Medicine, University of Melbourne, Austin Health, Heidelberg, VIC, Australia
| | - Melanie Bahlo
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia
- Department of Medical Biology, University of Melbourne, Parkville, VIC, Australia
| |
Collapse
|
7
|
Annear DJ, Kooy RF. Unravelling the link between neurodevelopmental disorders and short tandem CGG-repeat expansions. Emerg Top Life Sci 2023; 7:265-275. [PMID: 37768318 PMCID: PMC10754333 DOI: 10.1042/etls20230021] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 08/23/2023] [Accepted: 09/11/2023] [Indexed: 09/29/2023]
Abstract
Neurodevelopmental disorders (NDDs) encompass a diverse group of disorders characterised by impaired cognitive abilities and developmental challenges. Short tandem repeats (STRs), repetitive DNA sequences found throughout the human genome, have emerged as potential contributors to NDDs. Specifically, the CGG trinucleotide repeat has been implicated in a wide range of NDDs, including Fragile X Syndrome (FXS), the most common inherited form of intellectual disability and autism. This review focuses on CGG STR expansions associated with NDDs and their impact on gene expression through repeat expansion-mediated epigenetic silencing. We explore the molecular mechanisms underlying CGG-repeat expansion and the resulting epigenetic modifications, such as DNA hypermethylation and gene silencing. Additionally, we discuss the involvement of other CGG STRs in neurodevelopmental diseases. Several examples, including FMR1, AFF2, AFF3, XYLT1, FRA10AC1, CBL, and DIP2B, highlight the complex relationship between CGG STR expansions and NDDs. Furthermore, recent advancements in this field are highlighted, shedding light on potential future research directions. Understanding the role of STRs, particularly CGG-repeats, in NDDs has the potential to uncover novel diagnostic and therapeutic strategies for these challenging disorders.
Collapse
Affiliation(s)
- Dale J Annear
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| | - R Frank Kooy
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| |
Collapse
|
8
|
English A, Dolzhenko E, Jam HZ, Mckenzie S, Olson ND, De Coster W, Park J, Gu B, Wagner J, Eberle MA, Gymrek M, Chaisson MJP, Zook JM, Sedlazeck FJ. Benchmarking of small and large variants across tandem repeats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.29.564632. [PMID: 37961319 PMCID: PMC10634962 DOI: 10.1101/2023.10.29.564632] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Tandem repeats (TRs) are highly polymorphic in the human genome, have thousands of associated molecular traits, and are linked to over 60 disease phenotypes. However, their complexity often excludes them from at-scale studies due to challenges with variant calling, representation, and lack of a genome-wide standard. To promote TR methods development, we create a comprehensive catalog of TR regions and explore its properties across 86 samples. We then curate variants from the GIAB HG002 individual to create a tandem repeat benchmark. We also present a variant comparison method that handles small and large alleles and varying allelic representation. The 8.1% of the genome covered by the TR catalog holds ∼24.9% of variants per individual, including 124,728 small and 17,988 large variants for the GIAB HG002 TR benchmark. We work with the GIAB community to demonstrate the utility of this benchmark across short and long read technologies.
Collapse
|
9
|
Tassone F, Protic D, Allen EG, Archibald AD, Baud A, Brown TW, Budimirovic DB, Cohen J, Dufour B, Eiges R, Elvassore N, Gabis LV, Grudzien SJ, Hall DA, Hessl D, Hogan A, Hunter JE, Jin P, Jiraanont P, Klusek J, Kooy RF, Kraan CM, Laterza C, Lee A, Lipworth K, Losh M, Loesch D, Lozano R, Mailick MR, Manolopoulos A, Martinez-Cerdeno V, McLennan Y, Miller RM, Montanaro FAM, Mosconi MW, Potter SN, Raspa M, Rivera SM, Shelly K, Todd PK, Tutak K, Wang JY, Wheeler A, Winarni TI, Zafarullah M, Hagerman RJ. Insight and Recommendations for Fragile X-Premutation-Associated Conditions from the Fifth International Conference on FMR1 Premutation. Cells 2023; 12:2330. [PMID: 37759552 PMCID: PMC10529056 DOI: 10.3390/cells12182330] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 09/09/2023] [Accepted: 09/12/2023] [Indexed: 09/29/2023] Open
Abstract
The premutation of the fragile X messenger ribonucleoprotein 1 (FMR1) gene is characterized by an expansion of the CGG trinucleotide repeats (55 to 200 CGGs) in the 5' untranslated region and increased levels of FMR1 mRNA. Molecular mechanisms leading to fragile X-premutation-associated conditions (FXPAC) include cotranscriptional R-loop formations, FMR1 mRNA toxicity through both RNA gelation into nuclear foci and sequestration of various CGG-repeat-binding proteins, and the repeat-associated non-AUG (RAN)-initiated translation of potentially toxic proteins. Such molecular mechanisms contribute to subsequent consequences, including mitochondrial dysfunction and neuronal death. Clinically, premutation carriers may exhibit a wide range of symptoms and phenotypes. Any of the problems associated with the premutation can appropriately be called FXPAC. Fragile X-associated tremor/ataxia syndrome (FXTAS), fragile X-associated primary ovarian insufficiency (FXPOI), and fragile X-associated neuropsychiatric disorders (FXAND) can fall under FXPAC. Understanding the molecular and clinical aspects of the premutation of the FMR1 gene is crucial for the accurate diagnosis, genetic counseling, and appropriate management of affected individuals and families. This paper summarizes all the known problems associated with the premutation and documents the presentations and discussions that occurred at the International Premutation Conference, which took place in New Zealand in 2023.
Collapse
Affiliation(s)
- Flora Tassone
- Department of Biochemistry and Molecular Medicine, School of Medicine, University of California Davis, Sacramento, CA 95817, USA;
- MIND Institute, University of California Davis, Davis, CA 95817, USA; (B.D.); (D.H.); (V.M.-C.)
| | - Dragana Protic
- Department of Pharmacology, Clinical Pharmacology and Toxicology, Faculty of Medicine, University of Belgrade, 11129 Belgrade, Serbia;
- Fragile X Clinic, Special Hospital for Cerebral Palsy and Developmental Neurology, 11040 Belgrade, Serbia
| | - Emily Graves Allen
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA; (E.G.A.); (P.J.); (K.S.)
| | - Alison D. Archibald
- Victorian Clinical Genetics Services, Royal Children’s Hospital, Melbourne, VIC 3052, Australia;
- Department of Paediatrics, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Melbourne, VIC 3052, Australia;
- Genomics in Society Group, Murdoch Children’s Research Institute, Royal Children’s Hospital, Melbourne, VIC 3052, Australia
| | - Anna Baud
- Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Uniwersytetu Poznańskiego 6, 61-614 Poznan, Poland; (A.B.); (K.T.)
| | - Ted W. Brown
- Central Clinical School, University of Sydney, Sydney, NSW 2006, Australia;
- Fragile X Association of Australia, Brookvale, NSW 2100, Australia;
- NYS Institute for Basic Research in Developmental Disabilities, New York, NY 10314, USA
| | - Dejan B. Budimirovic
- Department of Psychiatry, Fragile X Clinic, Kennedy Krieger Institute, Baltimore, MD 21205, USA;
- Department of Psychiatry & Behavioral Sciences-Child Psychiatry, School of Medicine, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Jonathan Cohen
- Fragile X Alliance Clinic, Melbourne, VIC 3161, Australia;
| | - Brett Dufour
- MIND Institute, University of California Davis, Davis, CA 95817, USA; (B.D.); (D.H.); (V.M.-C.)
- Department of Pathology and Laboratory Medicine, Institute for Pediatric Regenerative Medicine, Shriners Hospitals for Children of Northern California, School of Medicine, University of California Davis, Sacramento, CA 95817, USA;
| | - Rachel Eiges
- Stem Cell Research Laboratory, Medical Genetics Institute, Shaare Zedek Medical Center Affiliated with the Hebrew University School of Medicine, Jerusalem 91031, Israel;
| | - Nicola Elvassore
- Veneto Institute of Molecular Medicine (VIMM), 35129 Padova, Italy; (N.E.); (C.L.)
- Department of Industrial Engineering, University of Padova, 35131 Padova, Italy
| | - Lidia V. Gabis
- Keshet Autism Center Maccabi Wolfson, Holon 5822012, Israel;
- Faculty of Medicine, Tel-Aviv University, Tel Aviv 6997801, Israel
| | - Samantha J. Grudzien
- Department of Neurology, University of Michigan, 4148 BSRB, 109 Zina Pitcher Place, Ann Arbor, MI 48109, USA; (S.J.G.); (P.K.T.)
- Neuroscience Graduate Program, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Deborah A. Hall
- Department of Neurological Sciences, Rush University, Chicago, IL 60612, USA;
| | - David Hessl
- MIND Institute, University of California Davis, Davis, CA 95817, USA; (B.D.); (D.H.); (V.M.-C.)
- Department of Psychiatry and Behavioral Sciences, School of Medicine, University of California Davis, Sacramento, CA 95817, USA
| | - Abigail Hogan
- Department of Communication Sciences and Disorders, Arnold School of Public Health, University of South Carolina, Columbia, SC 29208, USA; (A.H.); (J.K.)
| | - Jessica Ezzell Hunter
- RTI International, Research Triangle Park, NC 27709, USA; (J.E.H.); (S.N.P.); (M.R.); (A.W.)
| | - Peng Jin
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA; (E.G.A.); (P.J.); (K.S.)
| | - Poonnada Jiraanont
- Faculty of Medicine, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand;
| | - Jessica Klusek
- Department of Communication Sciences and Disorders, Arnold School of Public Health, University of South Carolina, Columbia, SC 29208, USA; (A.H.); (J.K.)
| | - R. Frank Kooy
- Department of Medical Genetics, University of Antwerp, 2000 Antwerp, Belgium;
| | - Claudine M. Kraan
- Department of Paediatrics, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Melbourne, VIC 3052, Australia;
- Diagnosis and Development, Murdoch Children’s Research Institute, Melbourne, VIC 3052, Australia
| | - Cecilia Laterza
- Veneto Institute of Molecular Medicine (VIMM), 35129 Padova, Italy; (N.E.); (C.L.)
- Department of Industrial Engineering, University of Padova, 35131 Padova, Italy
| | - Andrea Lee
- Fragile X New Zealand, Nelson 7040, New Zealand;
| | - Karen Lipworth
- Fragile X Association of Australia, Brookvale, NSW 2100, Australia;
| | - Molly Losh
- Roxelyn and Richard Pepper Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL 60201, USA;
| | - Danuta Loesch
- School of Psychology and Public Health, La Trobe University, Melbourne, VIC 3086, Australia;
| | - Reymundo Lozano
- Departments of Genetics and Genomic Sciences and Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA;
| | - Marsha R. Mailick
- Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, USA;
| | - Apostolos Manolopoulos
- Intramural Research Program, Laboratory of Clinical Investigation, National Institute on Aging, Baltimore, MD 21224, USA;
| | - Veronica Martinez-Cerdeno
- MIND Institute, University of California Davis, Davis, CA 95817, USA; (B.D.); (D.H.); (V.M.-C.)
- Department of Pathology and Laboratory Medicine, Institute for Pediatric Regenerative Medicine, Shriners Hospitals for Children of Northern California, School of Medicine, University of California Davis, Sacramento, CA 95817, USA;
| | - Yingratana McLennan
- Department of Pathology and Laboratory Medicine, Institute for Pediatric Regenerative Medicine, Shriners Hospitals for Children of Northern California, School of Medicine, University of California Davis, Sacramento, CA 95817, USA;
| | | | - Federica Alice Maria Montanaro
- Child and Adolescent Neuropsychiatry Unit, Department of Neuroscience, Bambino Gesù Children’s Hospital, IRCCS, 00165 Rome, Italy;
- Department of Education, Psychology, Communication, University of Bari Aldo Moro, 70121 Bari, Italy
| | - Matthew W. Mosconi
- Schiefelbusch Institute for Life Span Studies, University of Kansas, Lawrence, KS 66045, USA;
- Clinical Child Psychology Program, University of Kansas, Lawrence, KS 66045, USA
- Kansas Center for Autism Research and Training (K-CART), University of Kansas, Lawrence, KS 66045, USA
| | - Sarah Nelson Potter
- RTI International, Research Triangle Park, NC 27709, USA; (J.E.H.); (S.N.P.); (M.R.); (A.W.)
| | - Melissa Raspa
- RTI International, Research Triangle Park, NC 27709, USA; (J.E.H.); (S.N.P.); (M.R.); (A.W.)
| | - Susan M. Rivera
- Department of Psychology, University of Maryland, College Park, MD 20742, USA;
| | - Katharine Shelly
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA; (E.G.A.); (P.J.); (K.S.)
| | - Peter K. Todd
- Department of Neurology, University of Michigan, 4148 BSRB, 109 Zina Pitcher Place, Ann Arbor, MI 48109, USA; (S.J.G.); (P.K.T.)
- Ann Arbor Veterans Administration Healthcare, Ann Arbor, MI 48105, USA
| | - Katarzyna Tutak
- Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Uniwersytetu Poznańskiego 6, 61-614 Poznan, Poland; (A.B.); (K.T.)
| | - Jun Yi Wang
- Center for Mind and Brain, University of California Davis, Davis, CA 95618, USA;
| | - Anne Wheeler
- RTI International, Research Triangle Park, NC 27709, USA; (J.E.H.); (S.N.P.); (M.R.); (A.W.)
| | - Tri Indah Winarni
- Center for Biomedical Research (CEBIOR), Faculty of Medicine, Universitas Diponegoro, Semarang 502754, Central Java, Indonesia;
| | - Marwa Zafarullah
- Department of Biochemistry and Molecular Medicine, School of Medicine, University of California Davis, Sacramento, CA 95817, USA;
| | - Randi J. Hagerman
- MIND Institute, University of California Davis, Davis, CA 95817, USA; (B.D.); (D.H.); (V.M.-C.)
- Department of Pediatrics, School of Medicine, University of California Davis, Sacramento, CA 95817, USA
| |
Collapse
|
10
|
Global abundance of short tandem repeats is non-random in rodents and primates. BMC Genom Data 2022; 23:77. [PMID: 36329409 PMCID: PMC9635179 DOI: 10.1186/s12863-022-01092-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 10/18/2022] [Indexed: 11/06/2022] Open
Abstract
Background While of predominant abundance across vertebrate genomes and significant biological implications, the relevance of short tandem repeats (STRs) (also known as microsatellites) to speciation remains largely elusive and attributed to random coincidence for the most part. Here we collected data on the whole-genome abundance of mono-, di-, and trinucleotide STRs in nine species, encompassing rodents and primates, including rat, mouse, olive baboon, gelada, macaque, gorilla, chimpanzee, bonobo, and human. The collected data were used to analyze hierarchical clustering of the STR abundances in the selected species. Results We found massive differential STR abundances between the rodent and primate orders. In addition, while numerous STRs had random abundance across the nine selected species, the global abundance conformed to three consistent < clusters>, as follows: <rat, mouse>, <gelada, macaque, olive baboon>, and <gorilla, chimpanzee, bonobo, human>, which coincided with the phylogenetic distances of the selected species (p < 4E-05). Exceptionally, in the trinucleotide STR compartment, human was significantly distant from all other species. Conclusion Based on hierarchical clustering, we propose that the global abundance of STRs is non-random in rodents and primates, and probably had a determining impact on the speciation of the two orders. We also propose the STRs and STR lengths, which predominantly conformed to the phylogeny of the selected species, exemplified by (t)10, (ct)6, and (taa4). Phylogenetic and experimental platforms are warranted to further examine the observed patterns and the biological mechanisms associated with those STRs.
Collapse
|
11
|
A (GCC) repeat in SBF1 reveals a novel biological phenomenon in human and links to late onset neurocognitive disorder. Sci Rep 2022; 12:15480. [PMID: 36104480 PMCID: PMC9474449 DOI: 10.1038/s41598-022-19878-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2022] [Accepted: 09/06/2022] [Indexed: 12/22/2022] Open
Abstract
The human SBF1 (SET binding factor 1) gene, alternatively known as MTMR5, is predominantly expressed in the brain, and its epigenetic dysregulation is linked to late-onset neurocognitive disorders (NCDs), such as Alzheimer’s disease. This gene contains a (GCC)-repeat at the interval between + 1 and + 60 of the transcription start site (SBF1-202 ENST00000380817.8). We sequenced the SBF1 (GCC)-repeat in a sample of 542 Iranian individuals, consisting of late-onset NCDs (N = 260) and controls (N = 282). While multiple alleles were detected at this locus, the 8 and 9 repeats were predominantly abundant, forming > 95% of the allele pool across the two groups. Among a number of anomalies, the allele distribution was significantly different in the NCD group versus controls (Fisher’s exact p = 0.006), primarily as a result of enrichment of the 8-repeat in the former. The genotype distribution departed from the Hardy–Weinberg principle in both groups (p < 0.001), and was significantly different between the two groups (Fisher’s exact p = 0.001). We detected significantly low frequency of the 8/9 genotype in both groups, higher frequency of this genotype in the NCD group, and reverse order of 8/8 versus 9/9 genotypes in the NCD group versus controls. Biased heterozygous/heterozygous ratios were also detected for the 6/8 versus 6/9 genotypes (in favor of 6/8) across the human samples studied (Fisher’s exact p = 0.0001). Bioinformatics studies revealed that the number of (GCC)-repeats may change the RNA secondary structure and interaction sites at least across human exon 1. This STR was specifically expanded beyond 2-repeats in primates. In conclusion, we report indication of a novel biological phenomenon, in which there is selection against certain heterozygous genotypes at a STR locus in human. We also report different allele and genotype distribution at this STR locus in late-onset NCD versus controls. In view of the location of this STR in the 5′ untranslated region, RNA/RNA or RNA/DNA heterodimer formation of the involved genotypes and alternative RNA processing and/or translation should be considered.
Collapse
|
12
|
Yousuf A, Ahmed N, Qurashi A. Non-canonical DNA/RNA structures associated with the pathogenesis of Fragile X-associated tremor/ataxia syndrome and Fragile X syndrome. Front Genet 2022; 13:866021. [PMID: 36110216 PMCID: PMC9468596 DOI: 10.3389/fgene.2022.866021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Accepted: 07/22/2022] [Indexed: 11/13/2022] Open
Abstract
Fragile X-associated tremor/ataxia syndrome (FXTAS) and fragile X syndrome (FXS) are primary examples of fragile X-related disorders (FXDs) caused by abnormal expansion of CGG repeats above a certain threshold in the 5′-untranslated region of the fragile X mental retardation (FMR1) gene. Both diseases have distinct clinical manifestations and molecular pathogenesis. FXTAS is a late-adult-onset neurodegenerative disorder caused by a premutation (PM) allele (CGG expansion of 55–200 repeats), resulting in FMR1 gene hyperexpression. On the other hand, FXS is a neurodevelopmental disorder that results from a full mutation (FM) allele (CGG expansions of ≥200 repeats) leading to heterochromatization and transcriptional silencing of the FMR1 gene. The main challenge is to determine how CGG repeat expansion affects the fundamentally distinct nature of FMR1 expression in FM and PM ranges. Abnormal CGG repeat expansions form a variety of non-canonical DNA and RNA structures that can disrupt various cellular processes and cause distinct effects in PM and FM alleles. Here, we review these structures and how they are related to underlying mutations and disease pathology in FXS and FXTAS. Finally, as new CGG expansions within the genome have been identified, it will be interesting to determine their implications in disease pathology and treatment.
Collapse
|
13
|
Zeng YH, Yang K, Du GQ, Chen YK, Cao CY, Qiu YS, He J, Lv HD, Qu QQ, Chen JN, Xu GR, Chen L, Zheng FZ, Zhao M, Lin MT, Chen WJ, Hu J, Wang ZQ, Wang N. GGC repeat expansion of RILPL1 is associated with oculopharyngodistal myopathy. Ann Neurol 2022; 92:512-526. [PMID: 35700120 DOI: 10.1002/ana.26436] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 05/04/2022] [Accepted: 05/23/2022] [Indexed: 11/11/2022]
Abstract
OBJECTIVE Oculopharyngodistal myopathy (OPDM) is an adult-onset neuromuscular disease characterized by progressive ptosis, dysarthria, ophthalmoplegia, and distal muscle weakness. Recent studies revealed GGC repeat expansions in 5'-UTR of LRP12, GIPC1, and NOTCH2NLC are associated with OPDM. Despite these advances, around 30% of OPDM patients remain genetically undiagnosed. Herein, we aim to investigate genetic basis for undiagnosed OPDM patients in two unrelated Chinese Han families. METHODS Parametric linkage analysis was performed. Long-read sequencing followed by repeat-primed polymerase chain reaction (RP-PCR) and amplicon length polymerase chain reaction (AL-PCR) were used to determine the genetic cause. Targeted methylation sequencing was implemented to detect epigenetic changes. The possible pathogenesis mechanism was investigated by qPCR, immunoblotting, RNA FISH, and immunofluorescence staining of muscle biopsy samples. RESULTS The disease locus was mapped to 12q24.3. Subsequently, GGC repeat expansion in the promoter region of RILPL1 was identified in six OPDM patients from two families, findings consistent with a founder effect, designated as OPDM type 4 (OPDM4). Targeted methylation sequencing revealed hypermethylation at RILPL1 locus in unaffected individuals with ultralong expansion. Analysis of muscle samples showed no significant differences in RILPL1 mRNA or RILPL1 protein levels between patients and controls. Public CAGE-seq data indicated that alternative TSSs exist upstream of the RefSeq-annotated RILPL1 TSS. Strand-specific RNAseq data revealed bidirectional transcription from the RILPL1 locus. Finally, FISH/IF indicated that both sense and antisense transcripts formed RNA foci and were co-localized with hnRNPA2B1 and p62 in the intranuclear inclusions of OPDM4 patients. INTERPRETATION Our findings implicate abnormal GGC repeat expansions in the promoter region of RILPL1 as a novel genetic cause for OPDM, and suggest a methylation mechanism and a potential RNA toxicity mechanism are involved in OPDM4 pathogenesis. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Yi-Heng Zeng
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Kang Yang
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Gan-Qin Du
- The First Affiliated Hospital, College of Clinical Medicine of Henan University of Science and Technology, Luoyang, 471000, China
| | - Yi-Kun Chen
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Chun-Yan Cao
- The First Affiliated Hospital, College of Clinical Medicine of Henan University of Science and Technology, Luoyang, 471000, China
| | - Yu-Sen Qiu
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Jin He
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Hai-Dong Lv
- Department of Neurology, The People's Hospital of Jiaozuo City, Jiaozuo, 454150, China
| | - Qian-Qian Qu
- Department of Neurology, The People's Hospital of Jiaozuo City, Jiaozuo, 454150, China
| | - Jian-Nan Chen
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Guo-Rong Xu
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Long Chen
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Fu-Ze Zheng
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Miao Zhao
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Min-Ting Lin
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Wan-Jin Chen
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Jing Hu
- Department of Neuromuscular Disorders, The Third Hospital of Hebei Medical University, Shijiazhuang, 050000, China
| | - Zhi-Qiang Wang
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| | - Ning Wang
- Department of Neurology and Institute of Neurology of First Affiliated Hospital, Institute of Neuroscience, and Fujian Key Laboratory of Molecular Neurology, Fujian Medical University, Fuzhou, 350005, China
| |
Collapse
|
14
|
Boldyreva LV, Andreyeva EN, Pindyurin AV. Position Effect Variegation: Role of the Local Chromatin Context in Gene Expression Regulation. Mol Biol 2022. [DOI: 10.1134/s0026893322030049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
15
|
Neurodegenerative diseases associated with non-coding CGG tandem repeat expansions. Nat Rev Neurol 2022; 18:145-157. [PMID: 35022573 DOI: 10.1038/s41582-021-00612-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/15/2021] [Indexed: 02/07/2023]
Abstract
Non-coding CGG repeat expansions cause multiple neurodegenerative disorders, including fragile X-associated tremor/ataxia syndrome, neuronal intranuclear inclusion disease, oculopharyngeal myopathy with leukodystrophy, and oculopharyngodistal myopathy. The underlying genetic causes of several of these diseases have been identified only in the past 2-3 years. These expansion disorders have substantial overlapping clinical, neuroimaging and histopathological features. The shared features suggest common mechanisms that could have implications for the development of therapies for this group of diseases - similar therapeutic strategies or drugs may be effective for various neurodegenerative disorders induced by non-coding CGG expansions. In this Review, we provide an overview of clinical and pathological features of these CGG repeat expansion diseases and consider the likely pathological mechanisms, including RNA toxicity, CGG repeat-associated non-AUG-initiated translation, protein aggregation and mitochondrial impairment. We then discuss future research needed to improve the identification and diagnosis of CGG repeat expansion diseases, to improve modelling of these diseases and to understand their pathogenesis. We also consider possible therapeutic strategies. Finally, we propose that CGG repeat expansion diseases may represent manifestations of a single underlying neuromyodegenerative syndrome in which different organs are affected to different extents depending on the gene location of the repeat expansion.
Collapse
|
16
|
Annear DJ, Vandeweyer G, Sanchis-Juan A, Raymond FL, Kooy RF. Non-Mendelian inheritance patterns and extreme deviation rates of CGG repeats in autism. Genome Res 2022; 32:1967-1980. [PMID: 36351771 PMCID: PMC9808627 DOI: 10.1101/gr.277011.122] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 10/14/2022] [Indexed: 11/10/2022]
Abstract
As expansions of CGG short tandem repeats (STRs) are established as the genetic etiology of many neurodevelopmental disorders, we aimed to elucidate the inheritance patterns and role of CGG STRs in autism-spectrum disorder (ASD). By genotyping 6063 CGG STR loci in a large cohort of trios and quads with an ASD-affected proband, we determined an unprecedented rate of CGG repeat length deviation across a single generation. Although the concept of repeat length being linked to deviation rate was solidified, we show how shorter STRs display greater degrees of size variation. We observed that CGG STRs did not segregate by Mendelian principles but with a bias against longer repeats, which appeared to magnify as repeat length increased. Through logistic regression, we identified 19 genes that displayed significantly higher rates and degrees of CGG STR expansion within the ASD-affected probands (P < 1 × 10-5). This study not only highlights novel repeat expansions that may play a role in ASD but also reinforces the hypothesis that CGG STRs are specifically linked to human cognition.
Collapse
Affiliation(s)
- Dale J. Annear
- Department of Medical Genetics, University of Antwerp, 2600 Antwerp, Belgium
| | - Geert Vandeweyer
- Department of Medical Genetics, University of Antwerp, 2600 Antwerp, Belgium
| | - Alba Sanchis-Juan
- NIHR BioResource, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, United Kingdom;,Department of Haematology, University of Cambridge, NHS Blood and Transplant Centre, Cambridge, CB2 0PT, United Kingdom
| | - F. Lucy Raymond
- NIHR BioResource, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, United Kingdom;,Department of Medical Genetics, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, CB2 0XY, United Kingdom
| | - R. Frank Kooy
- Department of Medical Genetics, University of Antwerp, 2600 Antwerp, Belgium
| |
Collapse
|
17
|
Gall-Duncan T, Sato N, Yuen RKC, Pearson CE. Advancing genomic technologies and clinical awareness accelerates discovery of disease-associated tandem repeat sequences. Genome Res 2022; 32:1-27. [PMID: 34965938 PMCID: PMC8744678 DOI: 10.1101/gr.269530.120] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Accepted: 11/29/2021] [Indexed: 11/25/2022]
Abstract
Expansions of gene-specific DNA tandem repeats (TRs), first described in 1991 as a disease-causing mutation in humans, are now known to cause >60 phenotypes, not just disease, and not only in humans. TRs are a common form of genetic variation with biological consequences, observed, so far, in humans, dogs, plants, oysters, and yeast. Repeat diseases show atypical clinical features, genetic anticipation, and multiple and partially penetrant phenotypes among family members. Discovery of disease-causing repeat expansion loci accelerated through technological advances in DNA sequencing and computational analyses. Between 2019 and 2021, 17 new disease-causing TR expansions were reported, totaling 63 TR loci (>69 diseases), with a likelihood of more discoveries, and in more organisms. Recent and historical lessons reveal that properly assessed clinical presentations, coupled with genetic and biological awareness, can guide discovery of disease-causing unstable TRs. We highlight critical but underrecognized aspects of TR mutations. Repeat motifs may not be present in current reference genomes but will be in forthcoming gapless long-read references. Repeat motif size can be a single nucleotide to kilobases/unit. At a given locus, repeat motif sequence purity can vary with consequence. Pathogenic repeats can be "insertions" within nonpathogenic TRs. Expansions, contractions, and somatic length variations of TRs can have clinical/biological consequences. TR instabilities occur in humans and other organisms. TRs can be epigenetically modified and/or chromosomal fragile sites. We discuss the expanding field of disease-associated TR instabilities, highlighting prospects, clinical and genetic clues, tools, and challenges for further discoveries of disease-causing TR instabilities and understanding their biological and pathological impacts-a vista that is about to expand.
Collapse
Affiliation(s)
- Terence Gall-Duncan
- Program of Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario M5G 1L7, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Nozomu Sato
- Program of Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario M5G 1L7, Canada
| | - Ryan K C Yuen
- Program of Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario M5G 1L7, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Christopher E Pearson
- Program of Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario M5G 1L7, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| |
Collapse
|
18
|
Deshmukh AL, Caron MC, Mohiuddin M, Lanni S, Panigrahi GB, Khan M, Engchuan W, Shum N, Faruqui A, Wang P, Yuen RKC, Nakamori M, Nakatani K, Masson JY, Pearson CE. FAN1 exo- not endo-nuclease pausing on disease-associated slipped-DNA repeats: A mechanism of repeat instability. Cell Rep 2021; 37:110078. [PMID: 34879276 DOI: 10.1016/j.celrep.2021.110078] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 07/02/2021] [Accepted: 11/09/2021] [Indexed: 12/19/2022] Open
Abstract
Ongoing inchworm-like CAG and CGG repeat expansions in brains, arising by aberrant processing of slipped DNAs, may drive Huntington's disease, fragile X syndrome, and autism. FAN1 nuclease modifies hyper-expansion rates by unknown means. We show that FAN1, through iterative cycles, binds, dimerizes, and cleaves slipped DNAs, yielding striking exo-nuclease pauses along slip-outs: 5'-C↓A↓GC↓A↓G-3' and 5'-C↓T↓G↓C↓T↓G-3'. CAG excision is slower than CTG and requires intra-strand A·A and T·T mismatches. Fully paired hairpins arrested excision, whereas disease-delaying CAA interruptions further slowed excision. Endo-nucleolytic cleavage is insensitive to slip-outs. Rare FAN1 variants are found in individuals with autism with CGG/CCG expansions, and CGG/CCG slip-outs show exo-nuclease pauses. The slip-out-specific ligand, naphthyridine-azaquinolone, which induces contractions of expanded repeats in vivo, requires FAN1 for its effect, and protects slip-outs from FAN1 exo-, but not endo-, nucleolytic digestion. FAN1's inchworm pausing of slip-out excision rates is well suited to modify inchworm expansion rates, which modify disease onset and progression.
Collapse
Affiliation(s)
- Amit Laxmikant Deshmukh
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada
| | - Marie-Christine Caron
- Genome Stability Laboratory, CHU de Québec Research Center, HDQ Pavilion, Oncology Division, Québec City, QC G1R 3S3, Canada; Department of Molecular Biology, Medical Biochemistry, and Pathology, Laval University Cancer Research Center, Québec City, QC G1R 3S3, Canada
| | - Mohiuddin Mohiuddin
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada
| | - Stella Lanni
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada
| | - Gagan B Panigrahi
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada
| | - Mahreen Khan
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada; Program of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Worrawat Engchuan
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada
| | - Natalie Shum
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada; Program of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Aisha Faruqui
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada; Program of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Peixiang Wang
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada
| | - Ryan K C Yuen
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada; Program of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Masayuki Nakamori
- Department of Neurology, Osaka University Graduate School of Medicine, Osaka 565-0871, Japan
| | - Kazuhiko Nakatani
- Department of Regulatory Bioorganic Chemistry, the Institute of Scientific and Industrial Research, Osaka University, Osaka 567-0047, Japan
| | - Jean-Yves Masson
- Genome Stability Laboratory, CHU de Québec Research Center, HDQ Pavilion, Oncology Division, Québec City, QC G1R 3S3, Canada; Department of Molecular Biology, Medical Biochemistry, and Pathology, Laval University Cancer Research Center, Québec City, QC G1R 3S3, Canada
| | - Christopher E Pearson
- Program of Genetics & Genome Biology, The Hospital for Sick Children, PGCRL, Toronto, Canada, 686 Bay Street, Toronto, ON M5G 0A4, Canada; Program of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada.
| |
Collapse
|
19
|
Natural selection at the RASGEF1C (GGC) repeat in human and divergent genotypes in late-onset neurocognitive disorder. Sci Rep 2021; 11:19235. [PMID: 34584172 PMCID: PMC8479062 DOI: 10.1038/s41598-021-98725-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Accepted: 09/14/2021] [Indexed: 12/17/2022] Open
Abstract
Expression dysregulation of the neuron-specific gene, RASGEF1C (RasGEF Domain Family Member 1C), occurs in late-onset neurocognitive disorders (NCDs), such as Alzheimer's disease. This gene contains a (GGC)13, spanning its core promoter and 5' untranslated region (RASGEF1C-201 ENST00000361132.9). Here we sequenced the (GGC)-repeat in a sample of human subjects (N = 269), consisting of late-onset NCDs (N = 115) and controls (N = 154). We also studied the status of this STR across various primate and non-primate species based on Ensembl 103. The 6-repeat allele was the predominant allele in the controls (frequency = 0.85) and NCD patients (frequency = 0.78). The NCD genotype compartment consisted of an excess of genotypes that lacked the 6-repeat (divergent genotypes) (Mid-P exact = 0.004). A number of those genotypes were not detected in the control group (Mid-P exact = 0.007). The RASGEF1C (GGC)-repeat expanded beyond 2-repeats specifically in primates, and was at maximum length in human. We conclude that there is natural selection for the 6-repeat allele of the RASGEF1C (GGC)-repeat in human, and significant divergence from that allele in late-onset NCDs. STR alleles that are predominantly abundant and genotypes that deviate from those alleles are underappreciated features, which may have deep evolutionary and pathological consequences.
Collapse
|