1
|
Tan X, Zeng W, Yang Y, Lin Z, Li F, Liu J, Chen S, Liu YG, Xie W, Xie X. Genome-wide profiling of polymorphic short tandem repeats and their influence on gene expression and trait variation in diverse rice populations. J Genet Genomics 2025:S1673-8527(25)00078-5. [PMID: 40089018 DOI: 10.1016/j.jgg.2025.03.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2025] [Revised: 03/10/2025] [Accepted: 03/10/2025] [Indexed: 03/17/2025]
Abstract
Short tandem repeats (STRs) modulate gene expression and contribute to trait variation. However, a systematic evaluation of the genomic characteristics of STRs has not been conducted, and their influence on gene expression in rice remains unclear. Here, we construct a map of 137,629 polymorphic STRs in the rice (Oryza sativa L.) genome using a population-scale resequencing dataset. A genome-wide survey encompassing 4726 accessions shows that the occurrence frequency, mutational patterns, chromosomal distribution, and functional properties of STRs are correlated with the sequences and lengths of repeat motifs. Leveraging a transcriptome dataset from 127 rice accessions, we identify 44,672 expression STRs (eSTRs) by modeling gene expression in response to the length variation of STRs. These eSTRs are notably enriched in the regulatory regions of genes with active transcriptional signatures. Population analysis identifies numerous STRs that have undergone genetic divergence among different rice groups and 1726 tagged STRs that may be associated with agronomic traits. By editing the (ACT)7 STR in OsFD1 promoter, we further experimentally validate its role in regulating gene expression and phenotype. Our study highlights the contribution of STRs to transcriptional regulation in plants and establishes the foundation for their potential use as alternative targets for genetic improvement.
Collapse
Affiliation(s)
- Xiyu Tan
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Wanyong Zeng
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Yujian Yang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei 430070, China
| | - Zhansheng Lin
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Fuquan Li
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Jianhong Liu
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Shaotong Chen
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Yao-Guang Liu
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, South China Agricultural University, Guangzhou, Guangdong 510642, China.
| | - Weibo Xie
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei 430070, China.
| | - Xianrong Xie
- Guangdong Basic Research Center of Excellence for Precise Breeding of Future Crops, State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, South China Agricultural University, Guangzhou, Guangdong 510642, China.
| |
Collapse
|
2
|
Eckman EA, Clausen DM, Solé-Domėnech S, Lee CW, Sinobas-Pereira C, Domalewski RJ, Nichols MR, Pacheco-Quinto J. Nascent Aβ42 Fibrillization in Synaptic Endosomes Precedes Plaque Formation in a Mouse Model of Alzheimer's-like β-Amyloidosis. J Neurosci 2023; 43:8812-8824. [PMID: 37884349 PMCID: PMC10727180 DOI: 10.1523/jneurosci.1318-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 10/05/2023] [Accepted: 10/23/2023] [Indexed: 10/28/2023] Open
Abstract
Accumulation of amyloid-β peptide (Aβ) aggregates in synapses may contribute to the profound synaptic loss characteristic of Alzheimer's disease (AD). The origin of synaptic Aβ aggregates remains elusive, but loss of endosomal proteostasis may trigger their formation. In this study, we identified the synaptic compartments where Aβ accumulates, and performed a longitudinal analysis of synaptosomes isolated from brains of TgCRND8 APP transgenic mice of either sex. To evaluate the specific contribution of Aβ-degrading protease endothelin-converting enzyme (ECE-1) to synaptic/endosomal Aβ homeostasis, we analyzed the effect of partial Ece1 KO in brain and complete ECE1 KO in SH-SY5Y cells. Global inhibition of ECE family members was used to further assess their role in preventing synaptic Aβ accumulation. Results showed that, before extracellular amyloid deposition, synapses were burdened with detergent-soluble Aβ monomers, oligomers, and fibrils. Levels of all soluble Aβ species declined thereafter, as Aβ42 turned progressively insoluble and accumulated in Aβ-producing synaptic endosomal vesicles with characteristics of multivesicular bodies. Accordingly, fibrillar Aβ was detected in brain exosomes. ECE-1-deficient mice had significantly increased endogenous synaptosomal Aβ42 levels, and protease inhibitor experiments showed that, in TgCRND8 mice, synaptic Aβ42 became nearly resistant to degradation by ECE-related proteases. Our study supports that Aβ accumulating in synapses is produced locally, within endosomes, and does not require the presence of amyloid plaques. ECE-1 is a determinant factor controlling the accumulation and fibrillization of nascent Aβ in endosomes and, in TgCRND8 mice, Aβ overproduction causes rapid loss of Aβ42 solubility that curtails ECE-mediated degradation.SIGNIFICANCE STATEMENT Deposition of aggregated Aβ in extracellular plaques is a defining feature of AD. Aβ aggregates also accumulate in synapses and may contribute to the profound synaptic loss and cognitive dysfunction typical of the disease. However, it is not clear whether synaptotoxic Aβ is mainly derived from plaques or if it is produced and aggregated locally, within affected synaptic compartments. Filling this knowledge gap is important for the development of an effective treatment for AD, as extracellular and intrasynaptic pools of Aβ may not be equally modulated by immunotherapies or other therapeutic approaches. In this manuscript, we provide evidence that Aβ aggregates building up in synapses are formed locally, within synaptic endosomes, because of disruptions in nascent Aβ proteostasis.
Collapse
Affiliation(s)
- Elizabeth A Eckman
- Biomedical Research Institute of New Jersey, Cedar Knolls, New Jersey 07927
| | - Dana M Clausen
- Biomedical Research Institute of New Jersey, Cedar Knolls, New Jersey 07927
| | | | - Chris W Lee
- Biomedical Research Institute of New Jersey, Cedar Knolls, New Jersey 07927
| | - Cristina Sinobas-Pereira
- Department of Chemistry & Biochemistry, University of Missouri-St. Louis, St. Louis, Missouri 63121
| | - Ryan J Domalewski
- Department of Chemistry & Biochemistry, University of Missouri-St. Louis, St. Louis, Missouri 63121
| | - Michael R Nichols
- Department of Chemistry & Biochemistry, University of Missouri-St. Louis, St. Louis, Missouri 63121
| | | |
Collapse
|
3
|
Shi Y, Niu Y, Zhang P, Luo H, Liu S, Zhang S, Wang J, Li Y, Liu X, Song T, Xu T, He S. Characterization of genome-wide STR variation in 6487 human genomes. Nat Commun 2023; 14:2092. [PMID: 37045857 PMCID: PMC10097659 DOI: 10.1038/s41467-023-37690-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 03/27/2023] [Indexed: 04/14/2023] Open
Abstract
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3'UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
Collapse
Affiliation(s)
- Yirong Shi
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yiwei Niu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Peng Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Huaxia Luo
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Shuai Liu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Sijia Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Jiajia Wang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yanyan Li
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Xinyue Liu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Tingrui Song
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Tao Xu
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
- Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, 250117, Shandong, China.
| | - Shunmin He
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
4
|
Dhaliwal J, Wagner J. STR-based feature extraction and selection for genetic feature discovery in neurological disease genes. Sci Rep 2023; 13:2480. [PMID: 36774368 PMCID: PMC9922266 DOI: 10.1038/s41598-023-29376-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 02/03/2023] [Indexed: 02/13/2023] Open
Abstract
Gene expression, often determined by single nucleotide polymorphisms, short repeated sequences known as short tandem repeats (STRs), structural variants, and environmental factors, provides means for an organism to produce gene products necessary to live. Variation in expression levels, sometimes known as enrichment patterns, has been associated with disease progression. Thus, the STR enrichment patterns have recently gained interest as potential genetic markers for disease progression. However, to the best of our knowledge, we are unaware of any study that evaluates and explores STRs, particularly trinucleotide sequences, as machine learning features for classifying neurological disease genes for the purpose of discovering genetic features. Thus, in this paper, we proposed a new metric and a novel feature extraction and selection algorithm based on statistically significant STR-based features and their respective enrichment patterns to create a statistically significant feature set. The proposed new metric has shown that the neurological disease family genes have a non-random AA, AT, TA, TG, and TT enrichment pattern. This is an important result, as it supports prior research that has established that certain trinucleotides, such as AAT, ATA, ATT, TAT, and TTA, are favored during protein misfolding. In contrast, trinucleotides, such as TAA, TAG, and TGA, are favored during premature termination codon mutations as they are stop codons. This suggests that the metric has the potential to identify patterns that may be genetic features in a sample of neurological genes. Moreover, the practical performance and high prediction results of the statistically significant STR-based feature set indicate that variations in STR enrichment patterns can distinguish neurological disease genes. In conclusion, the proposed approach may have the potential to discover differential genetic features for other diseases.
Collapse
Affiliation(s)
- Jasbir Dhaliwal
- Faculty of Information Technology, Monash University, Clayton, VIC, 3800, Australia.
| | - John Wagner
- PsychoGenics Inc., Paramus, New Jersey, 07652, United States of America
| |
Collapse
|
5
|
Gharesouran J, Hosseinzadeh H, Ghafouri-Fard S, Taheri M, Rezazadeh M. STRs: Ancient Architectures of the Genome beyond the Sequence. J Mol Neurosci 2021; 71:2441-2455. [PMID: 34056692 DOI: 10.1007/s12031-021-01850-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 04/22/2021] [Indexed: 01/24/2023]
Abstract
Short tandem repeats (STRs) are commonly defined as short runs of repetitive nucleotides, consisting of tandemly repeating 2-6- bp motif units, which are ubiquitously distributed throughout genomes. Functional STRs are polymorphic in the population, and their variations influence gene expression, which subsequently may result in pathogenic phenotypes. To understand STR phenotypic effects and their functional roles, we describe four different mutational mechanisms including the unequal crossing-over model, gene conversion, retrotransposition mechanism and replication slippage. Due to the multi-allelic nature, small length, abundance, high variability, codominant inheritance, nearly neutral evolution, extensive genome coverage and simple assaying of STRs, these markers are widely used in various types of biological research, including population genetics studies, genome mapping, molecular epidemiology, paternity analysis and gene flow studies. In this review, we focus on the current knowledge regarding STR genomic distribution, function, mutation and applications.
Collapse
Affiliation(s)
- Jalal Gharesouran
- Molecular Genetics Division, GMG center, Tabriz, Iran.,Division of Medical Genetics, Tabriz Childrens Hospital, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Hassan Hosseinzadeh
- Molecular Genetics Division, GMG center, Tabriz, Iran.,Division of Medical Genetics, Tabriz Childrens Hospital, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Soudeh Ghafouri-Fard
- Department of Medical Genetics, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mohammad Taheri
- Skull Base Research Center, Loghman Hakim Hospital, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - Maryam Rezazadeh
- Division of Medical Genetics, Tabriz Childrens Hospital, Tabriz University of Medical Sciences, Tabriz, Iran. .,Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran.
| |
Collapse
|
6
|
Lemche E. Early Life Stress and Epigenetics in Late-onset Alzheimer's Dementia: A Systematic Review. Curr Genomics 2018; 19:522-602. [PMID: 30386171 PMCID: PMC6194433 DOI: 10.2174/1389202919666171229145156] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2017] [Revised: 07/27/2017] [Accepted: 12/12/2017] [Indexed: 11/22/2022] Open
Abstract
Involvement of life stress in Late-Onset Alzheimer's Disease (LOAD) has been evinced in longitudinal cohort epidemiological studies, and endocrinologic evidence suggests involvements of catecholamine and corticosteroid systems in LOAD. Early Life Stress (ELS) rodent models have successfully demonstrated sequelae of maternal separation resulting in LOAD-analogous pathology, thereby supporting a role of insulin receptor signalling pertaining to GSK-3beta facilitated tau hyper-phosphorylation and amyloidogenic processing. Discussed are relevant ELS studies, and findings from three mitogen-activated protein kinase pathways (JNK/SAPK pathway, ERK pathway, p38/MAPK pathway) relevant for mediating environmental stresses. Further considered were the roles of autophagy impairment, neuroinflammation, and brain insulin resistance. For the meta-analytic evaluation, 224 candidate gene loci were extracted from reviews of animal studies of LOAD pathophysiological mechanisms, of which 60 had no positive results in human LOAD association studies. These loci were combined with 89 gene loci confirmed as LOAD risk genes in previous GWAS and WES. Of the 313 risk gene loci evaluated, there were 35 human reports on epigenomic modifications in terms of methylation or histone acetylation. 64 microRNA gene regulation mechanisms were published for the compiled loci. Genomic association studies support close relations of both noradrenergic and glucocorticoid systems with LOAD. For HPA involvement, a CRHR1 haplotype with MAPT was described, but further association of only HSD11B1 with LOAD found; however, association of FKBP1 and NC3R1 polymorphisms was documented in support of stress influence to LOAD. In the brain insulin system, IGF2R, INSR, INSRR, and plasticity regulator ARC, were associated with LOAD. Pertaining to compromised myelin stability in LOAD, relevant associations were found for BIN1, RELN, SORL1, SORCS1, CNP, MAG, and MOG. Regarding epigenetic modifications, both methylation variability and de-acetylation were reported for LOAD. The majority of up-to-date epigenomic findings include reported modifications in the well-known LOAD core pathology loci MAPT, BACE1, APP (with FOS, EGR1), PSEN1, PSEN2, and highlight a central role of BDNF. Pertaining to ELS, relevant loci are FKBP5, EGR1, GSK3B; critical roles of inflammation are indicated by CRP, TNFA, NFKB1 modifications; for cholesterol biosynthesis, DHCR24; for myelin stability BIN1, SORL1, CNP; pertaining to (epi)genetic mechanisms, hTERT, MBD2, DNMT1, MTHFR2. Findings on gene regulation were accumulated for BACE1, MAPK signalling, TLR4, BDNF, insulin signalling, with most reports for miR-132 and miR-27. Unclear in epigenomic studies remains the role of noradrenergic signalling, previously demonstrated by neuropathological findings of childhood nucleus caeruleus degeneration for LOAD tauopathy.
Collapse
Affiliation(s)
- Erwin Lemche
- Section of Cognitive Neuropsychiatry, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
| |
Collapse
|
7
|
Nazaripanah N, Adelirad F, Delbari A, Sahaf R, Abbasi-Asl T, Ohadi M. Genome-scale portrait and evolutionary significance of human-specific core promoter tri- and tetranucleotide short tandem repeats. Hum Genomics 2018; 12:17. [PMID: 29622039 PMCID: PMC5887250 DOI: 10.1186/s40246-018-0149-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2017] [Accepted: 03/20/2018] [Indexed: 03/05/2023] Open
Abstract
BACKGROUND While there is an ongoing trend to identify single nucleotide substitutions (SNSs) that are linked to inter/intra-species differences and disease phenotypes, short tandem repeats (STRs)/microsatellites may be of equal (if not more) importance in the above processes. Genes that contain STRs in their promoters have higher expression divergence compared to genes with fixed or no STRs in the gene promoters. In line with the above, recent reports indicate a role of repetitive sequences in the rise of young transcription start sites (TSSs) in human evolution. RESULTS Following a comparative genomics study of all human protein-coding genes annotated in the GeneCards database, here we provide a genome-scale portrait of human-specific short- and medium-size (≥ 3-repeats) tri- and tetranucleotide STRs and STR motifs in the critical core promoter region between - 120 and + 1 to the TSS and evidence of skewing of this compartment in reference to the STRs that are not human-specific (Levene's test p < 0.001). Twenty-five percent and 26% enrichment of human-specific transcripts was detected in the tri and tetra human-specific compartments (mid-p < 0.00002 and mid-p < 0.002, respectively). CONCLUSION Our findings provide the first evidence of genome-scale skewing of STRs at a specific region of the human genome and a link between a number of these STRs and TSS selection/transcript specificity. The STRs and genes listed here may have a role in the evolution and development of characteristics and phenotypes that are unique to the human species.
Collapse
Affiliation(s)
- N Nazaripanah
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - F Adelirad
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - A Delbari
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - R Sahaf
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - T Abbasi-Asl
- Department of Biostatistics, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Ohadi
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| |
Collapse
|
8
|
Bagshaw AT. Functional Mechanisms of Microsatellite DNA in Eukaryotic Genomes. Genome Biol Evol 2017; 9:2428-2443. [PMID: 28957459 PMCID: PMC5622345 DOI: 10.1093/gbe/evx164] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/23/2017] [Indexed: 02/06/2023] Open
Abstract
Microsatellite repeat DNA is best known for its length mutability, which is implicated in several neurological diseases and cancers, and often exploited as a genetic marker. Less well-known is the body of work exploring the widespread and surprisingly diverse functional roles of microsatellites. Recently, emerging evidence includes the finding that normal microsatellite polymorphism contributes substantially to the heritability of human gene expression on a genome-wide scale, calling attention to the task of elucidating the mechanisms involved. At present, these are underexplored, but several themes have emerged. I review evidence demonstrating roles for microsatellites in modulation of transcription factor binding, spacing between promoter elements, enhancers, cytosine methylation, alternative splicing, mRNA stability, selection of transcription start and termination sites, unusual structural conformations, nucleosome positioning and modification, higher order chromatin structure, noncoding RNA, and meiotic recombination hot spots.
Collapse
|
9
|
Nikkhah M, Rezazadeh M, Khorram Khorshid HR, Biglarian A, Ohadi M. An exceptionally long CA-repeat in the core promoter of SCGB2B2 links with the evolution of apes and Old World monkeys. Gene 2015; 576:109-14. [PMID: 26437309 DOI: 10.1016/j.gene.2015.09.070] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2015] [Revised: 09/25/2015] [Accepted: 09/28/2015] [Indexed: 12/31/2022]
Abstract
We have recently reported a genome-scale catalog of human protein-coding genes that contain "exceptionally long" STRs (≥6-repeats) in their core promoter, which may be of selective advantage in this species. At the top of that list, SCGB2B2 (also known as SCGBL), contains one of the longest CA-repeat STRs identified in a human gene core promoter, at 25-repeats. In the study reported here, we analyzed the conservation status of this CA-STR across evolution. The functional implication of this STR to alter gene expression activity was also analyzed in the HEK-293 cell line. We report that the SCGB2B2 core promoter CA-repeat reaches exceptional lengths, ranging from 9- to 25-repeats, across Apes (Hominoids) and the Old World monkeys (CA>2-repeats were not detected in any other species). The longest CA-repeats and highest identity in the SCGB2B2 protein sequence were observed between human and bonobo. A trend for increased gene expression activity was observed from the shorter to the longer CA-repeats (p<0.009), and the CA-repeat increased gene expression activity, per se (p<0.02). We propose that the SCGB2B2 gene core promoter CA-repeat functions as an expression code for the evolution of Apes and the Old World monkeys.
Collapse
Affiliation(s)
- M Nikkhah
- Genetics Research Center, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Rezazadeh
- Genetics Research Center, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - H R Khorram Khorshid
- Genetics Research Center, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - A Biglarian
- Department of Biostatistics, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - M Ohadi
- Genetics Research Center, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| |
Collapse
|
10
|
Wang YD, Zhang J, Li CH, Xu HP, Chen W, Zeng YQ, Wang H. Molecular cloning, sequence characteristics, and tissue expression analysis of ECE1 gene in Tibetan pig. Gene 2015; 571:237-44. [DOI: 10.1016/j.gene.2015.06.054] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2015] [Revised: 06/17/2015] [Accepted: 06/22/2015] [Indexed: 11/25/2022]
|
11
|
Barage SH, Sonawane KD. Amyloid cascade hypothesis: Pathogenesis and therapeutic strategies in Alzheimer's disease. Neuropeptides 2015; 52:1-18. [PMID: 26149638 DOI: 10.1016/j.npep.2015.06.008] [Citation(s) in RCA: 399] [Impact Index Per Article: 39.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Revised: 06/24/2015] [Accepted: 06/24/2015] [Indexed: 01/08/2023]
Abstract
Alzheimer's disease is an irreversible, progressive neurodegenerative disorder. Various therapeutic approaches are being used to improve the cholinergic neurotransmission, but their role in AD pathogenesis is still unknown. Although, an increase in tau protein concentration in CSF has been described in AD, but several issues remains unclear. Extensive and accurate analysis of CSF could be helpful to define presence of tau proteins in physiological conditions, or released during the progression of neurodegenerative disease. The amyloid cascade hypothesis postulates that the neurodegeneration in AD caused by abnormal accumulation of amyloid beta (Aβ) plaques in various areas of the brain. The amyloid hypothesis has continued to gain support over the last two decades, particularly from genetic studies. Therefore, current research progress in several areas of therapies shall provide an effective treatment to cure this devastating disease. This review critically evaluates general biochemical and physiological functions of Aβ directed therapeutics and their relevance.
Collapse
Affiliation(s)
- Sagar H Barage
- Department of Biotechnology, Shivaji University, Kolhapur 416004, Maharashtra (M.S.), India
| | - Kailas D Sonawane
- Structural Bioinformatics Unit, Department of Biochemistry, Shivaji University, Kolhapur 416004, Maharashtra (M.S.), India; Department of Microbiology, Shivaji University, Kolhapur 416004, Maharashtra (M.S.), India.
| |
Collapse
|
12
|
A primate-specific functional GTTT-repeat in the core promoter of CYTH4 is linked to bipolar disorder in human. Prog Neuropsychopharmacol Biol Psychiatry 2015; 56:161-7. [PMID: 25240857 DOI: 10.1016/j.pnpbp.2014.09.001] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Accepted: 09/10/2014] [Indexed: 12/20/2022]
Abstract
Evidence of primate-specific genes and gene regulatory mechanisms linked to bipolar disorder (BD) lend support to evolutionary/adaptive processes in the pathogenesis of this disorder. Following a genome-scale analysis of the entire protein coding genes annotated in the GeneCards database, we have recently reported that cytohesin-4 (CYTH4) contains the longest tetra-nucleotide short tandem repeat (STR) identified in a human protein-coding gene core promoter, which may be of adaptive advantage to this species. In the current study, we analyzed the evolutionary trend of this STR across evolution. We also analyzed the functional implication and distribution of this STR in a group of patients with type 1 BD (n=233) and controls (n=262). We found that this STR is exceptionally expanded in primates (Fisher exact p<0.00003). Association was observed between type I BD and the 6-repeat allele of this STR, (GTTT)₆ (Yates corrected Χ(2)=12.68, p<0.0001, OR: 1.68). This allele is the shortest length of the GTTT-repeat identified in the human subjects studied. Consistent with that finding, excess homozygosity was observed for the shorter alleles, (GTTT)₆ and (GTTT)₇, vs. the longer alleles, (GTTT)₈ and (GTTT)₉ in the BD group (Yates corrected Χ(2)=5.18, p<0.01, 1 df, OR: 1.96). Using Dual Glo luciferase system in HEK-293 cells, a trend for gene expression repression was observed from the 6- to the 9-repeat allele (p<0.003), and the GTTT-repeat significantly down-regulated gene expression, per se (p<0.0006). This is the first evidence of a link between a primate-specific STR and a major psychiatric disorder in human. It may be speculated that the CYTH4 GTTT-repeat in primates may have conferred selective advantage to this order, reflected in neural function and neurophenotypes. The role of the CYTH4 gene in the pathogenesis of type I BD remains to be clarified in the future studies.
Collapse
|
13
|
Robinson AS, Materna SC, Barnes RM, De Val S, Xu SM, Black BL. An arterial-specific enhancer of the human endothelin converting enzyme 1 (ECE1) gene is synergistically activated by Sox17, FoxC2, and Etv2. Dev Biol 2014; 395:379-389. [PMID: 25179465 DOI: 10.1016/j.ydbio.2014.08.027] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2014] [Accepted: 08/19/2014] [Indexed: 11/19/2022]
Abstract
Endothelin-converting enzyme-1 (Ece-1), a crucial component of the Endothelin signaling pathway, is required for embryonic development and is an important regulator of vascular tone, yet the transcriptional regulation of the ECE1 gene has remained largely unknown. Here, we define the activity and regulation of an enhancer from the human ECE1 locus in vivo. The enhancer identified here becomes active in endothelial progenitor cells shortly after their initial specification and is dependent on a conserved FOX:ETS motif, a composite binding site for Forkhead transcription factors and the Ets transcription factor Etv2, for activity in vivo. The ECE1 FOX:ETS motif is bound and cooperatively activated by FoxC2 and Etv2, but unlike other described FOX:ETS-dependent enhancers, ECE1 enhancer activity becomes restricted to arterial endothelium and endocardium by embryonic day 9.5 in transgenic mouse embryos. The ECE1 endothelial enhancer also contains an evolutionarily-conserved, consensus SOX binding site, which is required for activity in transgenic mouse embryos. Importantly, the ECE1 SOX site is bound and activated by Sox17, a transcription factor involved in endothelial cell differentiation and an important regulator of arterial identity. Moreover, the ECE1 enhancer is cooperatively activated by the combinatorial action of FoxC2, Etv2, and Sox17. Although Sox17 is required for arterial identity, few direct transcriptional targets have been identified in endothelial cells. Thus, this work has important implications for our understanding of endothelial specification and arterial subspecification.
Collapse
Affiliation(s)
- Ashley S Robinson
- Cardiovascular Research Institute, University of California, San Francisco, CA 94158-2517
| | - Stefan C Materna
- Cardiovascular Research Institute, University of California, San Francisco, CA 94158-2517
| | - Ralston M Barnes
- Cardiovascular Research Institute, University of California, San Francisco, CA 94158-2517
| | - Sarah De Val
- Cardiovascular Research Institute, University of California, San Francisco, CA 94158-2517
| | - Shan-Mei Xu
- Cardiovascular Research Institute, University of California, San Francisco, CA 94158-2517
| | - Brian L Black
- Cardiovascular Research Institute, University of California, San Francisco, CA 94158-2517
- Department of Biochemistry and Biophysics, University of California, San Francisco, CA 94158-2517
| |
Collapse
|
14
|
Ohadi M, Valipour E, Ghadimi-Haddadan S, Namdar-Aligoodarzi P, Bagheri A, Kowsari A, Rezazadeh M, Darvish H, Kazeminasab S. Core promoter short tandem repeats as evolutionary switch codes for primate speciation. Am J Primatol 2014; 77:34-43. [PMID: 25099915 DOI: 10.1002/ajp.22308] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Revised: 04/07/2014] [Accepted: 05/16/2014] [Indexed: 01/27/2023]
Abstract
Alteration in gene expression levels underlies many of the phenotypic differences across species. Because of their highly mutable nature, proximity to the +1 transcription start site (TSS), and the emerging evidence of functional impact on gene expression, core promoter short tandem repeats (STRs) may be considered an ideal source of variation across species. In a genome-scale analysis of the entire Homo sapiens protein-coding genes, we have previously identified core promoters with at least one STR of ≥ 6-repeats, with possible selective advantage in this species. In the current study, we performed reverse analysis of the entire Homo sapiens orthologous genes in mouse in the Ensembl database, in order to identify conserved STRs that have shrunk as an evolutionary advantage to humans. Two protocols were used to minimize ascertainment bias. Firstly, two species sharing a more recent ancestor with Homo sapiens (i.e. Pan troglodytes and Gorilla gorilla gorilla) were also included in the study. Secondly, four non-primate species encompassing the major orders across Mammals, including Scandentia, Laurasiatheria, Afrotheria, and Xenarthra were analyzed as out-groups. We introduce STR evolutionary events specifically identical in primates (i.e. Homo sapiens, Pan troglodytes, and Gorilla gorilla gorilla) vs. non-primate out-groups. The average frequency of the identically shared STR motifs across those primates ranged between 0.00005 and 0.06. The identified genes are involved in important evolutionary and developmental processes, such as normal craniofacial development (TFAP2B), regulation of cell shape (PALMD), learning and long-term memory (RGS14), nervous system development (GFRA2), embryonic limb morphogenesis (PBX2), and forebrain development (APAF1). We provide evidence of core promoter STRs as evolutionary switch codes for primate speciation, and the first instance of identity-by-descent for those motifs at the interspecies level.
Collapse
Affiliation(s)
- Mina Ohadi
- Genetics Research Center, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Mohammadparast S, Bayat H, Biglarian A, Ohadi M. Exceptional expansion and conservation of a CT-repeat complex in the core promoter of PAXBP1 in primates. Am J Primatol 2014; 76:747-56. [PMID: 24573656 DOI: 10.1002/ajp.22266] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2013] [Revised: 12/28/2013] [Accepted: 01/28/2014] [Indexed: 11/11/2022]
Abstract
Adaptive evolution may be linked with the genomic distribution and function of short tandem repeats (STRs). Proximity of the core promoter STRs to the +1 transcription start site (TSS), and their mutable nature are characteristics that highlight those STRs as a novel source of interspecies variation. The PAXBP1 gene (alternatively known as GCFC1) core promoter contains the longest STR identified in a Homo sapiens gene core promoter. Indeed, this core promoter is a stretch of four consecutive CT-STRs. In the current study, we used the Ensembl, NCBI, and UCSC databases to analyze the evolutionary trend and functional implication of this CT-STR complex in six major lineages across vertebrates, including primates, non-primate mammals, birds, reptiles, amphibians, and fish. We observed exceptional expansion (≥4-repeats) and conservation of this CT-STR complex across primates, except prosimians, Microcebus murinus and Otolemur garnettii (Fisher exact P<4.1×10(-7)). H. sapiens has the most complex STR formula, and longest repeats. Macaca mulatta and Callithrix jacchus monkeys have the simplest STR formulas, and shortest repeat numbers. CT≥4-repeats were not detected in non-primate lineages. Different length alleles across the PAXBP1 core promoter CT-STRs significantly altered gene expression in vitro (P<0.001, t-test). PAXBP1 has a crucial role in craniofacial development, myogenesis, and spine morphogenesis, properties that have been diverged between primates and non-primates. To our knowledge, this is the first instance of expansion and conservation of a STR complex co-occurring specifically with the primate lineage.
Collapse
Affiliation(s)
- Saeid Mohammadparast
- Genetics Research Center, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | | | | | | |
Collapse
|