1
Ohadi M, Arabfard M, Khamse S, Alizadeh S, Vafadar S, Bayat H, Tajeddin N, Maddi AMA, Delbari A, Khorram Khorshid HR. Novel crossover and recombination hotspots massively spread across primate genomes. Biol Direct 2024; 19:70. [PMID: 39169390 PMCID: PMC11340189 DOI: 10.1186/s13062-024-00508-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/09/2024] [Accepted: 07/29/2024] [Indexed: 08/23/2024] Open
Abstract
BACKGROUND The recombination landscape and subsequent natural selection have vast consequences for evolution and speciation. However, most of the crossover and recombination hotspots are yet to be discovered. We previously reported the relevance of C and G trinucleotide two-repeat units (CG-TTUs) in crossovers and recombination. METHODS On a genome-wide scale, here we mapped all combinations of A and T trinucleotide two-repeat units (AT-TTUs) in human, consisting of AATAAT, ATAATA, ATTATT, TTATTA, TATTAT, and TAATAA. We also compared a number of the colonies formed by the AT-TTUs (distance between consecutive AT-TTUs < 500 bp) in several other primates and mouse. RESULTS We found that the majority of the AT-TTUs (> 96%) resided in approximately 1.4 million colonies, spread throughout the human genome. In comparison to the CG-TTU colonies, the AT-TTU colonies were significantly more abundant and larger in size. Pure units and overlapping units of the pure units were readily detectable in the same colonies, signifying that the units were the sites of unequal crossover. We discovered dynamic sharedness of several of the colonies across the primate species studied, which mainly reached maximum complexity and size in human. CONCLUSIONS We report novel crossover and recombination hotspots of the finest molecular resolution, massively spread and shared across the genomes of human and several other primates. With respect to crossovers and recombination, these genomes are far more dynamic than previously envisioned.
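The colony definition in the METHODS above (consecutive AT-TTUs less than 500 bp apart) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names `find_units` and `group_colonies`, the choice to measure distance between unit start positions, and the minimum of two units per colony are all assumptions made for illustration.

```python
import re

# The six AT-TTU hexamers listed in the abstract.
AT_TTUS = ("AATAAT", "ATAATA", "ATTATT", "TTATTA", "TATTAT", "TAATAA")

def find_units(seq, units=AT_TTUS):
    """Return sorted 0-based start positions of every unit, overlaps included."""
    # A zero-width lookahead lets the regex report overlapping units too.
    pattern = re.compile("(?=(" + "|".join(units) + "))")
    return [m.start() for m in pattern.finditer(seq.upper())]

def group_colonies(starts, max_gap=500, min_units=2):
    """Group sorted start positions into colonies.

    Assumption: two consecutive units belong to the same colony when their
    start positions are less than max_gap bp apart, and a colony needs at
    least min_units units.
    """
    colonies, current = [], []
    for pos in starts:
        if current and pos - current[-1] >= max_gap:
            if len(current) >= min_units:
                colonies.append(current)
            current = []
        current.append(pos)
    if len(current) >= min_units:
        colonies.append(current)
    return colonies
```

Calling `group_colonies(find_units(sequence))` on a chromosome string would return one list of start positions per colony. Scanning both strands and handling soft-masked or N-containing regions are left out of this sketch.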
Affiliation(s)
- Mina Ohadi
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Masoud Arabfard
  - Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
- Safoura Khamse
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Samira Alizadeh
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Sara Vafadar
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Hadi Bayat
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
  - Biochemical Neuroendocrinology, Montreal Clinical and Research Institute (IRCM), affiliated with McGill University, Montreal, QC, H2W 1R7, Canada
- Nahid Tajeddin
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
  - Department of Biology, Central Tehran Branch, Islamic Azad University, Tehran, Iran
- Ali M A Maddi
  - Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
- Ahmad Delbari
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Hamid R Khorram Khorshid
  - Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran
2
Tajeddin N, Arabfard M, Alizadeh S, Salesi M, Khamse S, Delbari A, Ohadi M. Novel islands of GGC and GCC repeats coincide with human evolution. Gene 2024; 902:148194. [PMID: 38262548 DOI: 10.1016/j.gene.2024.148194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/21/2023] [Revised: 10/29/2023] [Accepted: 01/18/2024] [Indexed: 01/25/2024]
Abstract
BACKGROUND Because of their high mutation rate, overrepresentation in genic regions, and link with various neurological, neurodegenerative, and movement disorders, GGC and GCC short tandem repeats (STRs) are prone to natural selection. Among several gaps in the available data, the 3-repeats of these STRs remain largely unexplored. RESULTS In a genome-wide search in human, here we mapped GGC and GCC STRs of ≥3-repeats, and found novel islands of up to 45 of those STRs, populating spans of 1 to 2 kb of genomic DNA. RGPD4 and NOC4L harbored the densest (GGC)3 (probability 3.09061E-71) and (GCC)3 (probability 1.72376E-61) islands, respectively, and were human-specific. We also found prime instances of directional incremented density of STRs at specific loci in human versus other species, including the FOXK2 and SKI GGC islands. The genes containing those islands significantly diverged in expression in human versus other species, and the proteins encoded by those genes interact closely in a physical interaction network, a consequence of which may be human-specific characteristics such as higher order brain functions. CONCLUSION We report novel islands of GGC and GCC STRs of evolutionary relevance to human. The density, and in some instances, periodicity of these islands support them as a novel genomic entity, which needs to be further explored in evolutionary, mechanistic, and functional platforms.
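As a rough illustration of the search described in the RESULTS above, the following Python sketch finds tracts of a motif repeated at least three times and then counts how many tracts fall inside a fixed window. It is not the authors' method: the helper names `find_str_tracts` and `densest_window` are hypothetical, and the 2000 bp window is an assumption taken from the "1 to 2 kb" span mentioned in the abstract.

```python
import re

def find_str_tracts(seq, motif, min_repeats=3):
    """Return (start, repeat_count) for each run of motif with >= min_repeats copies."""
    pattern = re.compile("(?:%s){%d,}" % (re.escape(motif), min_repeats))
    return [
        (m.start(), (m.end() - m.start()) // len(motif))
        for m in pattern.finditer(seq.upper())
    ]

def densest_window(starts, window=2000):
    """Return (window_start, tract_count) for the densest window.

    starts must be sorted. A window covers positions p with
    window_start <= p < window_start + window (an assumed definition).
    """
    best_count, best_start, left = 0, None, 0
    for right, pos in enumerate(starts):
        # Slide the left edge until the window again spans fewer than `window` bp.
        while pos - starts[left] >= window:
            left += 1
        count = right - left + 1
        if count > best_count:
            best_count, best_start = count, starts[left]
    return best_start, best_count
```

For example, `densest_window([s for s, _ in find_str_tracts(sequence, "GGC")])` would report the most tract-dense 2 kb stretch of `sequence`. The probability values quoted in the abstract come from the authors' own statistics and are not reproduced here.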
Affiliation(s)
- N Tajeddin
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- M Arabfard
  - Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
- S Alizadeh
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- M Salesi
  - Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
- S Khamse
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- A Delbari
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- M Ohadi
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
3
Khamse S, Alizadeh S, Khorshid HRK, Delbari A, Tajeddin N, Ohadi M. A Hypermutable Region in the DISP2 Gene Links to Natural Selection and Late-Onset Neurocognitive Disorders in Humans. Mol Neurobiol 2024:10.1007/s12035-024-04155-y. [PMID: 38565786 DOI: 10.1007/s12035-024-04155-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/28/2023] [Accepted: 03/25/2024] [Indexed: 04/04/2024]
Abstract
(CCG) short tandem repeats (STRs) are predominantly enriched in genic regions, mutation hotspots for C to T truncating substitutions, and involved in various neurological and neurodevelopmental disorders. However, intact blocks of this class of STRs are widely overlooked with respect to their link with natural selection. The human neuron-specific gene, DISP2 (dispatched RND transporter family member 2), contains a (CCG) repeat in its 5' untranslated region. Here, we sequenced this STR in a sample of 448 Iranian individuals, consisting of late-onset neurocognitive disorder (NCD) (N = 203) and controls (N = 245). We found that the region spanning the (CCG) repeat was highly mutated, resulting in several flanking (CCG) residues. However, an 8-repeat of the (CCG) repeat was predominantly abundant (frequency = 0.92) across the two groups. While the overall distribution of genotypes was not different between the two groups (p > 0.05), we detected four genotypes in the NCD group only (2% of the NCD genotypes, Mid-p = 0.02), consisting of extreme short alleles, 5- and 6-repeats, that were not detected in the control group. The patients harboring those genotypes received the diagnoses of probable Alzheimer's disease and vascular dementia. We also found six genotypes in the control group only (2.5% of the control genotypes, Mid-p = 0.01) that consisted of the 8-repeat and extreme long alleles, 9- and 10-repeats, of which the 10-repeat was not detected in the NCD group. The (CCG) repeat specifically expanded in primates. In conclusion, we report an indication of natural selection at a novel hypermutable region in the human genome and divergent alleles and genotypes in late-onset NCDs and controls. These findings reinforce the hypothesis that a collection of rare alleles and genotypes in a number of genes may unambiguously contribute to the cognitive impairment component of late-onset NCDs.
Affiliation(s)
- S Khamse
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- S Alizadeh
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- H R Khorram Khorshid
  - Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran
- A Delbari
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- N Tajeddin
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
  - Department of Biology, Central Tehran Branch, Islamic Azad University, Tehran, Iran
- M Ohadi
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
4
Arabfard M, Tajeddin N, Alizadeh S, Salesi M, Bayat H, Khorram Khorshid HR, Khamse S, Delbari A, Ohadi M. Dyads of GGC and GCC form hotspot colonies that coincide with the evolution of human and other great apes. BMC Genom Data 2024; 25:21. [PMID: 38383300 PMCID: PMC10880355 DOI: 10.1186/s12863-024-01207-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/31/2023] [Accepted: 02/11/2024] [Indexed: 02/23/2024] Open
Abstract
BACKGROUND GGC and GCC short tandem repeats (STRs) have various evolutionary, biological, and pathological implications. However, the fundamental two-repeats (dyads) of these STRs are largely unexplored. RESULTS On a genome-wide scale, we mapped (GGC)2 and (GCC)2 dyads in human, and found monumental colonies (distance between each dyad < 500 bp) of extraordinary density, and in some instances periodicity. The largest (GCC)2 and (GGC)2 colonies were intergenic, homogeneous, and human-specific, consisting of 219 (GCC)2 on chromosome 2 (probability < 1.545E-219) and 70 (GGC)2 on chromosome 9 (probability = 1.809E-148). We also found that several colonies were shared in other great apes, and directionally increased in density and complexity in human, such as a colony of 99 (GCC)2 on chromosome 20, that specifically expanded in great apes, and reached maximum complexity in human (probability 1.545E-220). Numerous other colonies of evolutionary relevance in human were detected in other largely overlooked regions of the genome, such as chromosome Y and pseudogenes. Several of the genes containing or nearest to those colonies were divergently expressed in human. CONCLUSION (GCC)2 and (GGC)2 form unprecedented genomic colonies that coincide with the evolution of human and other great apes. The extent of the genomic rearrangements leading to those colonies supports overlooked recombination hotspots, shared across great apes. The identified colonies deserve to be studied in mechanistic, evolutionary, and functional platforms.
Affiliation(s)
- M Arabfard
  - Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
- N Tajeddin
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
  - Department of Biology, Central Tehran Branch, Islamic Azad University, Tehran, Iran
- S Alizadeh
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- M Salesi
  - Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
  - Research Center for Prevention of Oral and Dental Diseases, Baqiyatallah University of Medical Sciences, Tehran, Iran
- H Bayat
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- H R Khorram Khorshid
  - Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran
- S Khamse
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- A Delbari
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- M Ohadi
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
5
Alizadeh S, Khamse S, Tajeddin N, Khorram Khorshid HR, Delbari A, Ohadi M. A GCC repeat in RAB26 undergoes natural selection in human and harbors divergent genotypes in late-onset Alzheimer's disease. Gene 2024; 893:147968. [PMID: 37931854 DOI: 10.1016/j.gene.2023.147968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/26/2023] [Revised: 10/28/2023] [Accepted: 11/03/2023] [Indexed: 11/08/2023]
Abstract
Although mainly located in genic regions and being mutation hotspots, intact blocks of CG-rich trinucleotide short tandem repeats (STRs) are largely overlooked with respect to their link with natural selection. The human RAB26 (member RAS oncogene family) directs synaptic and secretory vesicles into preautophagosomal structures, inhibition of which specifically disrupts axonal transport of degradative organelles and leads to an axonal dystrophy, resembling Alzheimer's disease (AD). Human RAB26 contains a GCC repeat in the top 1 percent with respect to length. Here we sequenced this STR in 441 Iranian individuals, consisting of late-onset neurocognitive disorder (NCD) (N = 216) and controls (N = 225). In both groups, the 12-repeat allele and the 12/12 genotype were predominantly abundant. We found an excess of homozygosity for non-12 alleles in the NCD group (Mid-P exact = 0.027). Furthermore, divergent genotypes were detected that were specific to the NCD group (2.8% of genotypes) (Mid-P exact = 0.006) or controls (3.1% of genotypes) (Mid-P exact = 0.004). The patients harboring divergent genotypes received the diagnosis of AD. Based on the predominant abundance of the 12-repeat and 12/12 genotype in both groups, the excess of non-12 homozygosity in the NCD group, and divergent genotypes across the NCD and control groups, we propose natural selection at this locus and a link with late-onset AD. Our findings strengthen the hypothesis that a collection of rare genotypes unambiguously contributes to the pathogenesis of late-onset NCDs, such as AD.
Affiliation(s)
- S Alizadeh
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- S Khamse
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- N Tajeddin
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- H R Khorram Khorshid
  - Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran
- A Delbari
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- M Ohadi
  - Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
6
Panoyan MA, Wendt FR. The role of tandem repeat expansions in brain disorders. Emerg Top Life Sci 2023; 7:249-263. [PMID: 37401564 DOI: 10.1042/etls20230022] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 04/14/2023] [Revised: 06/05/2023] [Accepted: 06/19/2023] [Indexed: 07/05/2023]
Abstract
The human genome contains numerous genetic polymorphisms contributing to different health and disease outcomes. Tandem repeat (TR) loci are highly polymorphic yet under-investigated in large genomic studies, which has prompted research efforts to identify novel variations and gain a deeper understanding of their role in human biology and disease outcomes. We summarize the current understanding of TRs and their implications for human health and disease, including an overview of the challenges encountered when conducting TR analyses and potential solutions to overcome these challenges. By shedding light on these issues, this article aims to contribute to a better understanding of the impact of TRs on the development of new disease treatments.
Affiliation(s)
- Mary Anne Panoyan
  - Department of Anthropology, University of Toronto, Mississauga, ON, Canada
- Frank R Wendt
  - Department of Anthropology, University of Toronto, Mississauga, ON, Canada
  - Biostatistics Division, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
  - Forensic Science Program, University of Toronto, Mississauga, ON, Canada
7
Wang Q, Bao H, Li Z. Genomic comparison between two Inonotus hispidus strains isolated from growing in different tree species. Front Genet 2023; 14:1221491. [PMID: 37519891 PMCID: PMC10372432 DOI: 10.3389/fgene.2023.1221491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/12/2023] [Accepted: 07/03/2023] [Indexed: 08/01/2023] Open
Abstract
Inonotus hispidus grows mainly on broad-leaved trees, including Morus alba, Fraxinus mandshurica, and Ulmus macrocarpa. The fruiting body of I. hispidus growing on M. alba (hereafter MA) is used as the traditional Chinese medicine "Sanghuang". However, differences in the genetic material basis of I. hispidus growing on other tree species have not been reported. Therefore, in this paper, the genomes of MA and of I. hispidus growing on F. mandshurica (hereafter FM) were compared. The whole genome of the MA monokaryon was sequenced using the Illumina and PacBio platforms in combination. Next, genome assembly, genome component prediction, and genome functional annotation were performed. Comparative genomics analysis was performed between the FM monokaryon and the MA monokaryon, using MA as the reference. The results showed that MA had 24 contigs with an N50 length of 2.6 Mb. Specifically, 5,342, 6,564, 1,595, 383 and 123 genes were annotated from GO, KEGG, KOG, CAZymes and CYP450, respectively. Moreover, comparative genomics showed that the number of coding genes and the total number of genes annotated in the different databases were higher for FM than for MA. This study provides a foundation for the medicinal application of FM as MA from the perspective of genetic composition.
Affiliation(s)
- Qingchun Wang
  - Key Laboratory for Development and Utilization of Fungi Traditional Chinese Medicine Resources, Jilin Agricultural University, Changchun, Jilin, China
  - Key Laboratory of Edible Fungal Resources and Utilization (North), Ministry of Agriculture and Rural Affairs, Jilin Agricultural University, Changchun, Jilin, China
- Haiying Bao
  - Key Laboratory for Development and Utilization of Fungi Traditional Chinese Medicine Resources, Jilin Agricultural University, Changchun, Jilin, China
  - Key Laboratory of Edible Fungal Resources and Utilization (North), Ministry of Agriculture and Rural Affairs, Jilin Agricultural University, Changchun, Jilin, China
- Zhijun Li
  - Key Laboratory for Development and Utilization of Fungi Traditional Chinese Medicine Resources, Jilin Agricultural University, Changchun, Jilin, China
  - Key Laboratory of Edible Fungal Resources and Utilization (North), Ministry of Agriculture and Rural Affairs, Jilin Agricultural University, Changchun, Jilin, China
8
Global abundance of short tandem repeats is non-random in rodents and primates. BMC Genom Data 2022; 23:77. [PMID: 36329409 PMCID: PMC9635179 DOI: 10.1186/s12863-022-01092-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 06/29/2022] [Accepted: 10/18/2022] [Indexed: 11/06/2022] Open
Abstract
Background While of predominant abundance across vertebrate genomes and significant biological implications, the relevance of short tandem repeats (STRs) (also known as microsatellites) to speciation remains largely elusive and attributed to random coincidence for the most part. Here we collected data on the whole-genome abundance of mono-, di-, and trinucleotide STRs in nine species, encompassing rodents and primates, including rat, mouse, olive baboon, gelada, macaque, gorilla, chimpanzee, bonobo, and human. The collected data were used to analyze hierarchical clustering of the STR abundances in the selected species. Results We found massive differential STR abundances between the rodent and primate orders. In addition, while numerous STRs had random abundance across the nine selected species, the global abundance conformed to three consistent clusters, as follows: <rat, mouse>, <gelada, macaque, olive baboon>, and <gorilla, chimpanzee, bonobo, human>, which coincided with the phylogenetic distances of the selected species (p < 4E-05). Exceptionally, in the trinucleotide STR compartment, human was significantly distant from all other species. Conclusion Based on hierarchical clustering, we propose that the global abundance of STRs is non-random in rodents and primates, and probably had a determining impact on the speciation of the two orders. We also propose the STRs and STR lengths, which predominantly conformed to the phylogeny of the selected species, exemplified by (t)10, (ct)6, and (taa)4. Phylogenetic and experimental platforms are warranted to further examine the observed patterns and the biological mechanisms associated with those STRs.
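The hierarchical clustering step mentioned in the abstract above can be sketched in plain Python. The abstract does not state the distance metric or linkage rule, so Euclidean distance and average linkage are assumptions here, and the function names are hypothetical. Input profiles would be per-species vectors of STR abundances.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length abundance vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def average_linkage(profiles, n_clusters):
    """Agglomerative clustering with average linkage (assumed settings).

    profiles: dict mapping a species name to its list of STR abundances.
    Returns a sorted list of clusters, each a sorted list of names.
    """
    clusters = [[name] for name in sorted(profiles)]

    def linkage(c1, c2):
        # Mean pairwise distance between members of the two clusters.
        dists = [euclidean(profiles[a], profiles[b]) for a in c1 for b in c2]
        return sum(dists) / len(dists)

    while len(clusters) > n_clusters:
        # Find the closest pair of clusters and merge them.
        i, j = min(
            ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))),
            key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]),
        )
        merged = sorted(clusters[i] + clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return sorted(clusters)
```

With real abundance vectors for the nine species, `average_linkage(profiles, 3)` would return three groups that could be compared against the clusters reported in the abstract. No actual abundance values are given in the abstract, so none are included here.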