1
|
Borovská I, Vořechovský I, Královičová J. Alu RNA fold links splicing with signal recognition particle proteins. Nucleic Acids Res 2023; 51:8199-8216. [PMID: 37309897 PMCID: PMC10450188 DOI: 10.1093/nar/gkad500] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 05/23/2023] [Accepted: 05/31/2023] [Indexed: 06/14/2023] Open
Abstract
Transcriptomic diversity in primates was considerably expanded by exonizations of intronic Alu elements. To better understand their cellular mechanisms we have used structure-based mutagenesis coupled with functional and proteomic assays to study the impact of successive primate mutations and their combinations on inclusion of a sense-oriented AluJ exon in the human F8 gene. We show that the splicing outcome was better predicted by consecutive RNA conformation changes than by computationally derived splicing regulatory motifs. We also demonstrate an involvement of SRP9/14 (signal recognition particle) heterodimer in splicing regulation of Alu-derived exons. Nucleotide substitutions that accumulated during primate evolution relaxed the conserved left-arm AluJ structure including helix H1 and reduced the capacity of SRP9/14 to stabilize the closed Alu conformation. RNA secondary structure-constrained mutations that promoted open Y-shaped conformations of the Alu made the Alu exon inclusion reliant on DHX9. Finally, we identified additional SRP9/14 sensitive Alu exons and predicted their functional roles in the cell. Together, these results provide unique insights into architectural elements required for sense Alu exonization, identify conserved pre-mRNA structures involved in exon selection and point to a possible chaperone activity of SRP9/14 outside the mammalian signal recognition particle.
Collapse
Affiliation(s)
- Ivana Borovská
- Institute of Molecular Physiology and Genetics, Centre of Biosciences, Slovak Academy of Sciences, Bratislava 840 05, Slovak Republic
| | - Igor Vořechovský
- Faculty of Medicine, University of Southampton, HDH, MP808, Southampton SO16 6YD, United Kingdom
| | - Jana Královičová
- Institute of Molecular Physiology and Genetics, Centre of Biosciences, Slovak Academy of Sciences, Bratislava 840 05, Slovak Republic
- Institute of Zoology, Slovak Academy of Sciences, Bratislava 845 06, Slovak Republic
| |
Collapse
|
2
|
Jiang L, Hao Y, Shao C, Wu Q, Prager BC, Gimple RC, Sulli G, Kim LJ, Zhang G, Qiu Z, Zhu Z, Fu XD, Rich JN. ADAR1-mediated RNA editing links ganglioside catabolism to glioblastoma stem cell maintenance. J Clin Invest 2022; 132:143397. [PMID: 35133980 PMCID: PMC8920333 DOI: 10.1172/jci143397] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 02/03/2022] [Indexed: 11/17/2022] Open
Abstract
Glioblastoma (GBM) is the most common and lethal primary malignant brain tumor, containing GBM stem cells (GSCs) that contribute to therapeutic resistance and relapse. Exposing potential GSC vulnerabilities may provide therapeutic strategies against GBM. Here, we interrogated the role of Adenosine-to-Inosine (A-to-I) RNA editing mediated by ADAR1 (adenosine deaminase acting on RNA 1) in GSCs and found that both ADAR1 and global RNA editomes were elevated in GSCs compared to normal neural stem cells (NSCs). ADAR1 inactivation or blocking the upstream JAK/STAT pathway through TYK2 inhibition impaired GSC self-renewal and stemness. Downstream of ADAR1, RNA editing of the 3'UTR of GM2A, a key ganglioside catabolism activator, proved to be critical, as interfering with ganglioside catabolism showed similar functional impact on GSCs as ADAR1 disruption. These findings reveal RNA editing links ganglioside catabolism to GSC self-renewal and stemness, exposing a potential vulnerability of GBM for therapeutic intervention.
Collapse
Affiliation(s)
- Li Jiang
- Department of Medicine, University of California, San Diego, San Diego, United States of America
| | - Yajing Hao
- Department of Cellular and Molecular Medicine, University of California, San Diego, San Diego, United States of America
| | - Changwei Shao
- Department of Cellular and Molecular Medicine, University of California, San Diego, San Diego, United States of America
| | - Qiulian Wu
- Hillman Cancer Center, Cancer Institute, University of Pittsburgh, Pittsburgh, United States of America
| | - Briana C Prager
- Stem Cell Biology, Cleveland Clinic, Cleveland, United States of America
| | - Ryan C Gimple
- Department of Medicine, University of California, San Diego, San Diego, United States of America
| | - Gabriele Sulli
- Department of Medicine, University of California, San Diego, San Diego, United States of America
| | - Leo Jk Kim
- Department of Medicine, University of California, San Diego, San Diego, United States of America
| | - Guoxin Zhang
- Department of Medicine, University of California, San Diego, San Diego, United States of America
| | - Zhixin Qiu
- Hillman Cancer Center, Cancer Institute, University of Pittsburgh, Pittsburgh, United States of America
| | - Zhe Zhu
- Department of Medicine, University of California, San Diego, San Diego, United States of America
| | - Xiang-Dong Fu
- Department of Cellular and Molecular Medicine, University of California, San Diego, San Diego, United States of America
| | - Jeremy N Rich
- Hillman Cancer Center, Cancer Institute, University of Pittsburgh, Pittsburgh, United States of America
| |
Collapse
|
3
|
Abstract
Transposable elements (TEs) are mobile DNA sequences that propagate within genomes. Through diverse invasion strategies, TEs have come to occupy a substantial fraction of nearly all eukaryotic genomes, and they represent a major source of genetic variation and novelty. Here we review the defining features of each major group of eukaryotic TEs and explore their evolutionary origins and relationships. We discuss how the unique biology of different TEs influences their propagation and distribution within and across genomes. Environmental and genetic factors acting at the level of the host species further modulate the activity, diversification, and fate of TEs, producing the dramatic variation in TE content observed across eukaryotes. We argue that cataloging TE diversity and dissecting the idiosyncratic behavior of individual elements are crucial to expanding our comprehension of their impact on the biology of genomes and the evolution of species.
Collapse
Affiliation(s)
- Jonathan N Wells
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850; ,
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850; ,
| |
Collapse
|
4
|
Yang N, Chen H, Hu M, Zhang G, Amanullah, Deng C. Evolution of a splice variant that acts as an endogenous antagonist of the original INSL3 in primates. Gene 2020; 754:144861. [PMID: 32531454 DOI: 10.1016/j.gene.2020.144861] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Revised: 05/03/2020] [Accepted: 06/05/2020] [Indexed: 02/08/2023]
Abstract
Alu sequences are the most abundant repetitive elements in the human genome, and have proliferated to more than one million copies in the human genome. Primate-specific Alu sequences account for ~10% of the human genome, and their spread within the genome has the potential to generate new exons. The new exons produced by Alu elements appear in various primate genes, and their functions have been elucidated. Here, we identified a new exon in the insulin-like 3 gene (INSL3), which evolved ~50 million years ago, and led to a splicing variant with 31 extra amino acid residues in addition to the original 95 nucleotides (NTs) of INSL3. The Alu-INSL3 isoform underwent diverse changes during primate evolution; we identified that human Alu-INSL3 might be on its way to functionality and has potential to antagonize LGR8-INSL3 function. Therefore, the present study is designed to provide an example of the evolutionary trajectory of a variant peptide hormone antagonist that caused by the insertion of an Alu element in primates.
Collapse
Affiliation(s)
- Na Yang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Haidi Chen
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China; Institutes for Systems Genetics, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Minghui Hu
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Geyu Zhang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China; Institutes for Systems Genetics, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Amanullah
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China
| | - Cheng Deng
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210023, China; Institutes for Systems Genetics, West China Hospital, Sichuan University, Chengdu 610041, China.
| |
Collapse
|
5
|
Samson J, Cronin S, Dean K. BC200 (BCYRN1) - The shortest, long, non-coding RNA associated with cancer. Noncoding RNA Res 2018; 3:131-143. [PMID: 30175286 PMCID: PMC6114260 DOI: 10.1016/j.ncrna.2018.05.003] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Revised: 05/14/2018] [Accepted: 05/17/2018] [Indexed: 12/22/2022] Open
Abstract
With the discovery that the level of RNA synthesis in human cells far exceeds what is required to express protein-coding genes, there has been a concerted scientific effort to identify, catalogue and uncover the biological functions of the non-coding transcriptome. Long, non-coding RNAs (lncRNAs) are a diverse group of RNAs with equally wide-ranging biological roles in the cell. An increasing number of studies have reported alterations in the expression of lncRNAs in various cancers, although unravelling how they contribute specifically to the disease is a bigger challenge. Originally described as a brain-specific, non-coding RNA, BC200 (BCYRN1) is a 200-nucleotide, predominantly cytoplasmic lncRNA that has been linked to neurodegenerative disease and several types of cancer. Here we summarise what is known about BC200, primarily from studies in neuronal systems, before turning to a review of recent work that aims to understand how this lncRNA contributes to cancer initiation, progression and metastasis, along with its possible clinical utility as a biomarker or therapeutic target.
Collapse
Affiliation(s)
| | | | - K. Dean
- School of Biochemistry and Cell Biology, Western Gateway Building, University College Cork, Cork, Ireland
| |
Collapse
|
6
|
Jung J, Lee S, Cho HS, Park K, Ryu JW, Jung M, Kim J, Kim H, Kim DS. Bioinformatic analysis of regulation of natural antisense transcripts by transposable elements in human mRNA. Genomics 2018; 111:159-166. [PMID: 29366860 DOI: 10.1016/j.ygeno.2018.01.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Revised: 01/16/2018] [Accepted: 01/17/2018] [Indexed: 12/19/2022]
Abstract
Non-coding RNA is no longer considered to be "junk" DNA, based on evidence uncovered in recent decades. In particular, the important role played by natural antisense transcripts (NATs) in regulating the expression of genes is receiving increasing attention. However, the regulatory mechanisms of NATs remain incompletely understood. It is well-known that the insertion of transposable elements (TEs) can affect gene transcription. Using a bioinformatics approach, we identified NATs using human mRNA sequences from the UCSC Genome Browser Database. Our in silico analysis identified 1079 NATs and 700 sense-antisense gene pairs. We identified 179 NATs that showed evidence of having been affected by TEs during cellular gene expression. These findings may provide an understanding of the complex regulation mechanisms of NATs. If our understanding of NATs as modulators of gene expression is further enhanced, we can develop ways to control gene expression.
Collapse
Affiliation(s)
- Jaeeun Jung
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Sugi Lee
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Hyun-Soo Cho
- Department of Stem Cell Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Kunhyang Park
- Department of Core Facility Management Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Jea-Woon Ryu
- Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Minah Jung
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Jeongkil Kim
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - HyeRan Kim
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Plant Systems Engineering Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea
| | - Dae-Soo Kim
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), 217 Gajeong-ro, Yuseong-gu, Daejeon, Republic of Korea; Department of Rare Disease Research Center, Korea Research Institute of Bioscience & Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, Republic of Korea.
| |
Collapse
|
7
|
Chen H, Chen L, Wu Y, Shen H, Yang G, Deng C. The Exonization and Functionalization of an Alu-J Element in the Protein Coding Region of Glycoprotein Hormone Alpha Gene Represent a Novel Mechanism to the Evolution of Hemochorial Placentation in Primates. Mol Biol Evol 2017; 34:3216-3231. [DOI: 10.1093/molbev/msx252] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
|
8
|
Urokinase-type plasminogen activator receptor (uPAR) expression enhances invasion and metastasis in RAS mutated tumors. Sci Rep 2017; 7:9388. [PMID: 28839232 PMCID: PMC5571185 DOI: 10.1038/s41598-017-10062-1] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Accepted: 07/21/2017] [Indexed: 01/08/2023] Open
Abstract
The urokinase-type plasminogen activator receptor (uPAR) is a GPI-anchored cell membrane receptor that focuses urokinase (uPA) proteolytic activity on the cell surface. Its expression is increased in many human cancers, including non-small cell lung cancer (NSCLC) and colorectal cancer (CRC), and correlates with a poor prognosis and early invasion and metastasis. uPAR is able to control, through a cross-talk with tyrosine kinase receptors, the shift between tumor dormancy and proliferation, that usually precedes metastasis formation. Therefore, we investigated the role of uPAR expression in RAS mutated NSCLC and CRC cells. In this study we provided evidence, for the first time, that RAS mutational condition is functionally correlated to uPAR overexpression in NSCLC and CRC cancer cell lines and patient-derived tissue samples. Moreover, oncogenic features related to uPAR overexpression in RAS mutated NSCLC and CRC, such as adhesion, migration and metastatic process may be targeted, in vitro and in vivo, by new anti-uPAR small molecules, specific inhibitors of uPAR-vitronectin interaction. Therefore, anti-uPAR drugs could represent an effective pharmacological strategy for NSCLC and CRC patients carrying RAS mutations.
Collapse
|
9
|
Sheinman M, Ramisch A, Massip F, Arndt PF. Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences. Sci Rep 2016; 6:30851. [PMID: 27488939 PMCID: PMC4973250 DOI: 10.1038/srep30851] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2016] [Accepted: 07/08/2016] [Indexed: 12/26/2022] Open
Abstract
Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf's law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes.
Collapse
Affiliation(s)
- Michael Sheinman
- Max Planck Institute for Molecular Genetics, Berlin, Germany
- Department of Biology, Faculty of Science, Utrecht University, Utrecht, the Netherlands
| | - Anna Ramisch
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Florian Massip
- Max Planck Institute for Molecular Genetics, Berlin, Germany
- INRA, UR1404 Mathématique Informatique Appliquées du Génome á l’Environnement-F-78350 Jouy-en Josas, France
| | - Peter F. Arndt
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| |
Collapse
|
10
|
Mameli E, Lepori MB, Chiappe F, Ranucci G, Di Dato F, Iorio R, Loudianos G. Wilson's disease caused by alternative splicing and Alu exonization due to a homozygous 3039-bp deletion spanning from intron 1 to exon 2 of the ATP7B gene. Gene 2015; 569:276-9. [PMID: 26031236 DOI: 10.1016/j.gene.2015.05.067] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2015] [Revised: 05/26/2015] [Accepted: 05/27/2015] [Indexed: 11/18/2022]
Abstract
We describe a case of Wilson's disease (WD) diagnosed at 5 years after routine biochemical test showed increased aminotransferases. Mutation analysis of the ATP7B gene revealed a 3039-bp deletion in the homozygous state spanning from the terminal part of intron 1 to nt position 368 of exon 2. This deletion results in the activation of 3 cryptic splice sites: an AG acceptor splice site in nt positions 578-579 producing a different breakpoint and removing the first 577 nts of exon 2, an acceptor and a donor splice site in nt positions 20363-4 and 20456-7, respectively, in intron 1, resulting in the activation of a 94-bp cryptic Alu exon being incorporated into the mature transcript. The resulting alternative transcript contains a TAG stop codon in the first amino acid position of the cryptic exon, likely producing a truncated, non-functional protein. This study shows that intron exonization can also occur in humans through naturally occurring gross deletions. The results suggest that the combination of DNA and RNA analyses can be used for molecular characterization of gross ATP7B deletions, thus improving genetic counseling and diagnosis of WD. Moreover these studies help to better establish new molecular mechanisms producing Wilson's disease.
Collapse
Affiliation(s)
- Eva Mameli
- Dipartimento di Sanita' Pubblica, Medicina Clinica e Molecolare, Universita' degli Studi di Cagliari, Italy
| | - Maria Barbara Lepori
- Dipartimento di Sanita' Pubblica, Medicina Clinica e Molecolare, Universita' degli Studi di Cagliari, Italy
| | - Francesca Chiappe
- Dipartimento di Sanita' Pubblica, Medicina Clinica e Molecolare, Universita' degli Studi di Cagliari, Italy
| | - Giusy Ranucci
- Department of Translational Medical Science, Section of Pediatrics, University Federico II, Naples, Italy
| | - Fabiola Di Dato
- Department of Translational Medical Science, Section of Pediatrics, University Federico II, Naples, Italy
| | - Raffaele Iorio
- Department of Translational Medical Science, Section of Pediatrics, University Federico II, Naples, Italy
| | | |
Collapse
|
11
|
Inference of transposable element ancestry. PLoS Genet 2014; 10:e1004482. [PMID: 25121584 PMCID: PMC4133154 DOI: 10.1371/journal.pgen.1004482] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2013] [Accepted: 05/16/2014] [Indexed: 01/11/2023] Open
Abstract
Most common methods for inferring transposable element (TE) evolutionary relationships are based on dividing TEs into subfamilies using shared diagnostic nucleotides. Although originally justified based on the “master gene” model of TE evolution, computational and experimental work indicates that many of the subfamilies generated by these methods contain multiple source elements. This implies that subfamily-based methods give an incomplete picture of TE relationships. Studies on selection, functional exaptation, and predictions of horizontal transfer may all be affected. Here, we develop a Bayesian method for inferring TE ancestry that gives the probability that each sequence was replicative, its frequency of replication, and the probability that each extant TE sequence came from each possible ancestral sequence. Applying our method to 986 members of the newly-discovered LAVA family of TEs, we show that there were far more source elements in the history of LAVA expansion than subfamilies identified using the CoSeg subfamily-classification program. We also identify multiple replicative elements in the AluSc subfamily in humans. Our results strongly indicate that a reassessment of subfamily structures is necessary to obtain accurate estimates of mutation processes, phylogenetic relationships and historical times of activity. The most common entities in vertebrate genomes are transposable elements (TEs), DNA sequences that have been repeatedly copied and inserted into new locations throughout the genome. Some TEs have been replicated hundreds of thousands of times, and their ecology and evolutionary history within a genome is thus critical to understanding how genome structure evolves. It was once thought that only a few “master gene” copies could replicate, while the rest were inactive (dead on arrival), but recent computational and laboratory studies have indicated that this is not the case. However, previous methods for reconstructing TE evolutionary history were not designed to solve the problem of determining the ancestral source sequence for large numbers of elements. Here, we present a new method that is. Our method surveys all likely TE ancestors and determines the probability that each modern element arose from each of its plausible ancestors. We applied our method to the gibbon-derived LAVA TE family and to the human AluSc subfamily and inferred many more source elements than indicated by previous methods. This new method will help us better understand TE evolution, including both the impact of sequence on replication and the substitution process after replication.
Collapse
|
12
|
Ideue T, Cho Y, Nishimura K, Tani T. Involvement of satellite I noncoding RNA in regulation of chromosome segregation. Genes Cells 2014; 19:528-38. [PMID: 24750444 DOI: 10.1111/gtc.12149] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Accepted: 03/12/2014] [Indexed: 12/30/2022]
Abstract
Human centromeres consist of repetitive sequences from which satellite I noncoding RNAs are transcribed. We found that knockdown of satellite I RNA causes abnormal chromosome segregation and generation of nuclei with a grape-shape phenotype. Co-immunoprecipitation experiments showed that satellite I RNA associates with Aurora B, a component of the chromosome passenger complex (CPC) regulating proper attachment of microtubules to kinetochores, in mitotic HeLa cells. Satellite I RNA was also shown to associate with INCENP, another component of the CPC. In addition, depletion of satellite I RNA resulted in up-regulation of kinase activity of Aurora B and delocalization of the CPC from the centromere region. These results suggest that satellite I RNA is involved in chromosome segregation through controlling activity and centromeric localization of Aurora B kinase.
Collapse
Affiliation(s)
- Takashi Ideue
- Department of Biological Sciences, Graduate School of Science Technology, Kumamoto University, Kumamoto, 860-8555, Japan
| | | | | | | |
Collapse
|
13
|
Gill N, Buti M, Kane N, Bellec A, Helmstetter N, Berges H, Rieseberg LH. Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome. BIOLOGY 2014; 3:295-319. [PMID: 24833511 PMCID: PMC4085609 DOI: 10.3390/biology3020295] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Revised: 03/16/2014] [Accepted: 03/25/2014] [Indexed: 12/19/2022]
Abstract
Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence.
Collapse
Affiliation(s)
- Navdeep Gill
- Department of Botany and The Biodiversity Research Centre, University of British Columbia, Vancouver V6T 1Z4, BC, Canada.
| | - Matteo Buti
- Applied Rosaceous Genomics Group, Centre for Research and Innovation, Michele all'Adige (TN) P.IVA 020384102, Italy.
| | - Nolan Kane
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO 80309, USA.
| | - Arnaud Bellec
- French Plant Genomic Resource Centre, INRA-CNRGV, Chemin de Borde Rouge, CS 52627, 31326 Castanet Tolosan, France.
| | - Nicolas Helmstetter
- French Plant Genomic Resource Centre, INRA-CNRGV, Chemin de Borde Rouge, CS 52627, 31326 Castanet Tolosan, France.
| | - Hélène Berges
- French Plant Genomic Resource Centre, INRA-CNRGV, Chemin de Borde Rouge, CS 52627, 31326 Castanet Tolosan, France.
| | - Loren H Rieseberg
- Department of Botany and The Biodiversity Research Centre, University of British Columbia, Vancouver V6T 1Z4, BC, Canada.
| |
Collapse
|
14
|
Ahmed M, Li W, Liang P. Identification of three new Alu Yb subfamilies by source tracking of recently integrated Alu Yb elements. Mob DNA 2013; 4:25. [PMID: 24216009 PMCID: PMC3831846 DOI: 10.1186/1759-8753-4-25] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2013] [Accepted: 10/09/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Alu elements are the most abundant mobile elements in the human genome, with over 1 million copies and constituting more than 10% of the genome. The majority of these Alu elements were inserted into the primate genome 35 to 60 million years ago, but certain subfamilies of Alu elements are relatively very new and suspected to be still evolving. We attempted to trace the source/master copies of all human-specific members of the Alu Yb lineage using a computational approach by clustering similar Yb elements and constructing an evolutionary relation among the members of a cluster. RESULTS We discovered that one copy of Yb8 at 10p14 is the source of several active Yb8 copies, which retrotransposed to generate 712 copies or 54% of all human-specific Yb8 elements. We detected eight other Yb8 elements that had generated ten or more copies, potentially acting as 'stealth drivers'. One Yb8 element at 14q32.31 seemed to act as the source copy for all Yb9 elements tested, having producing 13 active Yb9 elements, and subsequently generated a total of 131 full-length copies. We identified and characterized three new subclasses of Yb elements: Yb8a1, Yb10 and Yb11. Their copy numbers in the reference genome are 75, 8 and 16. We analysed personal genome data from the 1000 Genome Project and detected an additional 6 Yb8a1, 3 Yb10 and 15 Yb11 copies outside the reference genome. Our analysis indicates that the Yb8a1 subfamily has a similar age to Yb9 (1.93 million years and 2.15 million years, respectively), while Yb10 and Yb11 evolved only 1.4 and 0.71 million years ago, suggesting a linear evolutionary path from Yb8a1 to Yb10 and then to Yb11. Our preliminary data indicate that members in Yb10 and Yb11 are mostly polymorphic, indicating their young age. CONCLUSIONS Our findings suggest that the Yb lineage is still evolving with new subfamilies being formed. Due to their very young age and the high rate of being polymorphic, insertions from these young subfamilies are very useful genetic markers for studying human population genetics and migration patterns, and the trend for mobile element insertions in the human genome.
Collapse
Affiliation(s)
| | | | - Ping Liang
- Department of Biological Sciences, Brock University, St Catharines, Ontario L2S 3A1, Canada.
| |
Collapse
|
15
|
Transcriptional variations mediated by an alternative promoter of the FPR3 gene. Mamm Genome 2011; 22:621-33. [PMID: 21717223 DOI: 10.1007/s00335-011-9341-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2010] [Accepted: 05/20/2011] [Indexed: 10/18/2022]
Abstract
Formyl peptide receptor 3 (FPR3) is a potential player in innate immunity and appears with FPR2 as a FPR cluster during primate evolution. Comparative genome analyses indicate that a segmental duplication (SD) event upstream of the FPR3 gene after the divergence of New and Old World monkeys led to the emergence of an alternative promoter. In this study we combined computational and experimental approaches to identify a FPR3 gene that is controlled by an alternative promoter derived during a SD event. Its transcriptional activity was detected by quantitative reverse transcription polymerase chain reaction. Human alternative transcripts (FPR3-1 and FPR3-2) showed tissue-specific patterns with strong expressions in lung or uterus, while the FPR3-1 transcript of rhesus macaque is broadly expressed in various tissues. Overall, transcriptional variations of FPR3 occur by an alternative promoter during primate evolution.
Collapse
|
16
|
Bettecken T, Frenkel ZM, Trifonov EN. Human nucleosomes: special role of CG dinucleotides and Alu-nucleosomes. BMC Genomics 2011; 12:273. [PMID: 21627783 PMCID: PMC3117857 DOI: 10.1186/1471-2164-12-273] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2010] [Accepted: 05/31/2011] [Indexed: 11/28/2022] Open
Abstract
Background The periodical occurrence of dinucleotides with a period of 10.4 bases now is undeniably a hallmark of nucleosome positioning. Whereas many eukaryotic genomes contain visible and even strong signals for periodic distribution of dinucleotides, the human genome is rather featureless in this respect. The exact sequence features in the human genome that govern the nucleosome positioning remain largely unknown. Results When analyzing the human genome sequence with the positional autocorrelation method, we found that only the dinucleotide CG shows the 10.4 base periodicity, which is indicative of the presence of nucleosomes. There is a high occurrence of CG dinucleotides that are either 31 (10.4 × 3) or 62 (10.4 × 6) base pairs apart from one another - a sequence bias known to be characteristic of Alu-sequences. In a similar analysis with repetitive sequences removed, peaks of repeating CG motifs can be seen at positions 10, 21 and 31, the nearest integers of multiples of 10.4. Conclusions Although the CG dinucleotides are dominant, other elements of the standard nucleosome positioning pattern are present in the human genome as well. The positional autocorrelation analysis of the human genome demonstrates that the CG dinucleotide is, indeed, one visible element of the human nucleosome positioning pattern, which appears both in Alu sequences and in sequences without repeats. The dominant role that CG dinucleotides play in organizing human chromatin is to indicate the involvement of human nucleosomes in tuning the regulation of gene expression and chromatin structure, which is very likely due to cytosine-methylation/-demethylation in CG dinucleotides contained in the human nucleosomes. This is further confirmed by the positions of CG-periodical nucleosomes on Alu sequences. Alu repeats appear as monomers, dimers and trimers, harboring two to six nucleosomes in a run. Considering the exceptional role CG dinucleotides play in the nucleosome positioning, we hypothesize that Alu-nucleosomes, especially, those that form tightly positioned runs, could serve as "anchors" in organizing the chromatin in human cells.
Collapse
Affiliation(s)
- Thomas Bettecken
- CAGT-Center for Applied Genotyping, Max Planck Institute of Psychiatry, Kraepelinstr. 2-10, D-80804 Munich, Germany.
| | | | | |
Collapse
|
17
|
Artemis splice defects cause atypical SCID and can be restored in vitro by an antisense oligonucleotide. Genes Immun 2011; 12:434-44. [PMID: 21390052 DOI: 10.1038/gene.2011.16] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Artemis deficiency is known to result in classical T-B- severe combined immunodeficiency (SCID) in case of Artemis null mutations, or Omenn's syndrome in case of hypomorphic mutations in the Artemis gene. We describe two unrelated patients with a relatively mild clinical T-B- SCID phenotype, caused by different homozygous Artemis splice-site mutations. The splice-site mutations concern either dysfunction of a 5' splice-site or an intronic point mutation creating a novel 3' splice-site, resulting in mutated Artemis protein with residual activity or low levels of wild type (WT) Artemis transcripts. During the first 10 years of life, the patients suffered from recurrent infections necessitating antibiotic prophylaxis and intravenous immunoglobulins. Both mutations resulted in increased ionizing radiation sensitivity and insufficient variable, diversity and joining (V(D)J) recombination, causing B-lymphopenia and exhaustion of the naive T-cell compartment. The patient with the novel 3' splice-site had progressive granulomatous skin lesions, which disappeared after stem cell transplantation (SCT). We showed that an alternative approach to SCT can, in principle, be used in this case; an antisense oligonucleotide (AON) covering the intronic mutation restored WT Artemis transcript levels and non-homologous end-joining pathway activity in the patient fibroblasts.
Collapse
|
18
|
Berger A, Strub K. Multiple Roles of Alu-Related Noncoding RNAs. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2011; 51:119-46. [PMID: 21287136 DOI: 10.1007/978-3-642-16502-3_6] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Repetitive Alu and Alu-related elements are present in primates, tree shrews (Scandentia), and rodents and have expanded to 1.3 million copies in the human genome by nonautonomous retrotransposition. Pol III transcription from these elements occurs at low levels under normal conditions but increases transiently after stress, indicating a function of Alu RNAs in cellular stress response. Alu RNAs assemble with cellular proteins into ribonucleoprotein complexes and can be processed into the smaller scAlu RNAs. Alu and Alu-related RNAs play a role in regulating transcription and translation. They provide a source for the biogenesis of miRNAs and, embedded into mRNAs, can be targeted by miRNAs. When present as inverted repeats in mRNAs, they become substrates of the editing enzymes, and their modification causes the nuclear retention of these mRNAs. Certain Alu elements evolved into unique transcription units with specific expression profiles producing RNAs with highly specific cellular functions.
Collapse
Affiliation(s)
- Audrey Berger
- Department of Cell Biology, University of Geneva, 30 quai Ernest Ansermet, 1211, Geneva 4, Switzerland
| | | |
Collapse
|
19
|
Churakov G, Grundmann N, Kuritzin A, Brosius J, Makałowski W, Schmitz J. A novel web-based TinT application and the chronology of the Primate Alu retroposon activity. BMC Evol Biol 2010; 10:376. [PMID: 21126360 PMCID: PMC3014933 DOI: 10.1186/1471-2148-10-376] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2010] [Accepted: 12/02/2010] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND DNA sequences afford access to the evolutionary pathways of life. Particularly mobile elements that constantly co-evolve in genomes encrypt recent and ancient information of their host's history. In mammals there is an extraordinarily abundant activity of mobile elements that occurs in a dynamic succession of active families, subfamilies, types, and subtypes of retroposed elements. The high frequency of retroposons in mammals implies that, by chance, such elements also insert into each other. While inactive elements are no longer able to retropose, active elements retropose by chance into other active and inactive elements. Thousands of such directional, element-in-element insertions are found in present-day genomes. To help analyze these events, we developed a computational algorithm (Transpositions in Transpositions, or TinT) that examines the different frequencies of nested transpositions and reconstructs the chronological order of retroposon activities. RESULTS By examining the different frequencies of such nested transpositions, the TinT application reconstructs the chronological order of retroposon activities. We use such activity patterns as a comparative tool to (1) delineate the historical rise and fall of retroposons and their relations to each other, (2) understand the retroposon-induced complexity of recent genomes, and (3) find selective informative homoplasy-free markers of phylogeny. The efficiency of the new application is demonstrated by applying it to dimeric Alu Short INterspersed Elements (SINE) to derive a complete chronology of such elements in primates. CONCLUSION The user-friendly, web-based TinT interface presented here affords an easy, automated screening for nested transpositions from genome assemblies or trace data, assembles them in a frequency-matrix, and schematically displays their chronological activity history.
Collapse
Affiliation(s)
- Gennady Churakov
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| | - Norbert Grundmann
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Niels Stensen Str. 14, 48149 Münster, Germany
| | - Andrej Kuritzin
- Department of Physics and Mathematics, Saint Petersburg State Institute of Technology, 26 Moskovsky av., St.-Petersburg 198013, Russia
| | - Jürgen Brosius
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| | - Wojciech Makałowski
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Niels Stensen Str. 14, 48149 Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| |
Collapse
|
20
|
Venkatachalam G, Kumar AP, Yue LS, Pervaiz S, Clement MV, Sakharkar MK. Computational identification and experimental validation of PPRE motifs in NHE1 and MnSOD genes of human. BMC Genomics 2009; 10 Suppl 3:S5. [PMID: 19958503 PMCID: PMC2788392 DOI: 10.1186/1471-2164-10-s3-s5] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Activation of PPARs has been reported to inhibit the proliferation of malignant cells from different lineages. They are involved in transcription regulation of genes upon activation by a ligand. The binding of PPARs to the promoter sequence either represses or activates the gene. Hence, PPARs represent promising targets for cancer treatment because of their anti-proliferative and pro-apoptotic activities. Here we computationally identified PPAR binding regions in NHE1 and MnSOD. We further validated the predictions in vitro. RESULTS Our results computationally predicted the presence of 2 PPRE motifs in NHE1 and 3 PPRE motifs in MnSOD. We experimentally confirmed the true motifs and their regulation by PPAR. CONCLUSION Our results suggest that both NHE1 and MnSOD have PPRE binding motif in their upstream/promoter region and hence are regulated by PPAR upon ligand binding.
Collapse
Affiliation(s)
- Gireedhar Venkatachalam
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.
| | | | | | | | | | | |
Collapse
|
21
|
A shotgun approach to discovering and reconstructing consensus retrotransposons ex novo from dense contigs of short sequences derived from Genbank Genome Survey Sequence database records. Gene 2009; 448:168-73. [DOI: 10.1016/j.gene.2009.06.011] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Revised: 06/12/2009] [Accepted: 06/19/2009] [Indexed: 01/19/2023]
|
22
|
Kumar AP, Quake AL, Chang MKX, Zhou T, Lim KSY, Singh R, Hewitt RE, Salto-Tellez M, Pervaiz S, Clément MV. Repression of NHE1 expression by PPARgamma activation is a potential new approach for specific inhibition of the growth of tumor cells in vitro and in vivo. Cancer Res 2009; 69:8636-44. [PMID: 19887620 DOI: 10.1158/0008-5472.can-09-0219] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Ligand-induced activation of peroxisome proliferator-activated receptor gamma (PPARgamma) inhibits proliferation in cancer cells in vitro and in vivo; however, the downstream targets remain undefined. We report the identification of a peroxisome proliferator response element in the promoter region of the Na(+)/H(+) transporter gene NHE1, the overexpression of which has been associated with carcinogenesis. Exposure of breast cancer cells expressing high levels of PPARgamma to its natural and synthetic agonists resulted in downregulation of NHE1 transcription as well as protein expression. Furthermore, the inhibitory effect of activated PPARgamma on tumor colony-forming ability was abrogated on overexpression of NHE1, whereas small interfering RNA-mediated gene silencing of NHE1 significantly increased the sensitivity of cancer cells to growth-inhibitory stimuli. Finally, histopathologic analysis of breast cancer biopsies obtained from patients with type II diabetes treated with the synthetic agonist rosiglitazone showed significant repression of NHE1 in the tumor tissue. These data provide evidence for tumor-selective downregulation of NHE1 by activated PPARgamma in vitro and in pathologic specimens from breast cancer patients and could have potential implications for the judicious use of low doses of PPARgamma ligands in combination chemotherapy regimens for an effective therapeutic response.
Collapse
Affiliation(s)
- Alan Prem Kumar
- Departments of Physiology, National University Medical Institutes, Yong Loo Lin School of Medicine, Singapore
| | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Styles P, Brookfield JFY. Source gene composition and gene conversion of the AluYh and AluYi lineages of retrotransposons. BMC Evol Biol 2009; 9:102. [PMID: 19442302 PMCID: PMC2686708 DOI: 10.1186/1471-2148-9-102] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2008] [Accepted: 05/14/2009] [Indexed: 11/20/2022] Open
Abstract
Background Alu elements are a family of SINE retrotransposons in primates. They are classified into subfamilies according to specific diagnostic mutations from the general Alu consensus. It is now believed that there may be several retrotranspositionally-competent source genes within an Alu subfamily. In this study, subfamilies falling on the AluYi and AluYh lineages, and the AluYg6 subfamily, are assessed for the presence of secondary source genes, and the influence of gene conversion on the AluYh and AluYi lineages is also described. Results The AluYh7 and AluYi6 subfamilies appear to contain multiple source genes. The novel subfamilies AluYh3a1 and AluYh3a3 are described, for which there is no convincing evidence to suggest the presence of secondary sources. The mutational substructure of AluYh3a3 can be explained completely by inference of single master gene. A complete backwards gene conversion event appears to have inactivated the AluYh3a3 master gene in humans. Polymorphism data suggest a larger number of secondary source elements may be active in the AluYg6 family than previously thought. Conclusion It is clear that there is considerable variation in the number of source genes present in each of the young Alu subfamilies. This can range from a single master source gene, as for AluYh3a3, to as many as 14 source elements in AluYi6.
Collapse
Affiliation(s)
- Pamela Styles
- Institute of Genetics, School of Biology, University of Nottingham, Nottingham, UK.
| | | |
Collapse
|
24
|
Abstract
Eukaryotic genomes are full of repetitive DNA, transposable elements (TEs) in particular, and accordingly there are a number of computational methods that can be used to identify TEs from genomic sequences. We present here a survey of two of the most readily available and widely used bioinformatics applications for the detection, characterization, and analysis of TE sequences in eukaryotic genomes: CENSOR and RepeatMasker. For each program, information on availability, input, output, and the algorithmic methods used is provided. Specific examples of the use of CENSOR and RepeatMasker are also described. CENSOR and RepeatMasker both rely on homology-based methods for the detection of TE sequences. There are several other classes of methods available for the analysis of repetitive DNA sequences including de novo methods that compare genomic sequences against themselves, class-specific methods that use structural characteristics of specific classes of elements to aid in their identification, and pipeline methods that combine aspects of some or all of the aforementioned methods. We briefly consider the strengths and weaknesses of these different classes of methods with an emphasis on their complementary utility for the analysis of repetitive DNA in eukaryotes.
Collapse
|
25
|
Pidpala OV, Yatsishina AP, Lukash LL. Human mobile genetic elements: Structure, distribution and functional role. CYTOL GENET+ 2008. [DOI: 10.3103/s009545270806011x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
26
|
Salih F, Salih B, Kogan S, Trifonov EN. Epigenetic nucleosomes: Alu sequences and CG as nucleosome positioning element. J Biomol Struct Dyn 2008; 26:9-16. [PMID: 18533722 DOI: 10.1080/07391102.2008.10507219] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Alu sequences carry periodical pattern with CG dinucleotides (CpG) repeating every 31-32 bases. Similar distances are observed in distribution of DNA curvature in crystallized nucleosomes, at positions +/-1.5 and +/-4.5 periods of DNA from nucleosome DNA dyad. Since CG elements are also found to impart to nucleosomes higher stability when positioned at +/-1.5 sites, it suggests that CG dinucleotides may play a role in modulation of the nucleosome strength when the CG elements are methylated. Thus, Alu sequences may harbor special epigenetic nucleosomes with methylation-dependent regulatory functions. Nucleosome DNA sequence probe is suggested to detect locations of such regulatory nucleosomes in the sequences.
Collapse
Affiliation(s)
- F Salih
- Genome Diversity Center, Institute of Evolution, University of Haifa, Israel
| | | | | | | |
Collapse
|
27
|
Chernov I, Stukacheva E, Akopov S, Didych D, Nikolaev L, Sverdlov E. A new technique for selective identification and mapping of enhancers within long genomic sequences. Biotechniques 2008; 44:775-84. [PMID: 18476831 DOI: 10.2144/000112732] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We report a new experimental method of direct selection, identification, and mapping of potential enhancer sequences within extended stretches of genomic DNA. The method allows simultaneous cloning of a quantity of sequences instead of tedious screening of the separate ones, thus providing a robust and high-throughput approach to the mapping of enhancers. The selection procedure is based on the ability of such sequences to activate a minimal promoter that drives expression of a selective gene. To this end a mixture of short DNA fragments derived from the segment of interest was cloned in a retroviral vector containing the neomycin phosphotransferase II gene under control of a cytomegalovirus (CMV) minimal promoter. The pool of retroviruses obtained was used to infect HeLa cells and then to select neomycin-resistant colonies containing constructs with enhancer-like sequences. The pool of the genomic fragments was rescued by PCR and cloned, forming a library of the potential enhancers. Fifteen enhancer-like fragments were selected from 1-Mb human genome locus, and enhancer activity of 13 of them was verified in a transient transfection reporter gene assay. The sequences selected were found to be predominantly located near 5' regions of genes or within gene introns.
Collapse
Affiliation(s)
- Igor Chernov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | | | | | | | | | | |
Collapse
|
28
|
Urrutia AO, Ocaña LB, Hurst LD. Do Alu repeats drive the evolution of the primate transcriptome? Genome Biol 2008; 9:R25. [PMID: 18241332 PMCID: PMC2374697 DOI: 10.1186/gb-2008-9-2-r25] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2007] [Revised: 01/02/2008] [Accepted: 02/01/2008] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Of all repetitive elements in the human genome, Alus are unusual in being enriched near to genes that are expressed across a broad range of tissues. This has led to the proposal that Alus might be modifying the expression breadth of neighboring genes, possibly by providing CpG islands, modifying transcription factor binding, or altering chromatin structure. Here we consider whether Alus have increased expression breadth of genes in their vicinity. RESULTS Contrary to the modification hypothesis, we find that those genes that have always had broad expression are richest in Alus, whereas those that are more likely to have become more broadly expressed have lower enrichment. This finding is consistent with a model in which Alus accumulate near broadly expressed genes but do not affect their expression breadth. Furthermore, this model is consistent with the finding that expression breadth of mouse genes predicts Alu density near their human orthologs. However, Alus were found to be related to some alternative measures of transcription profile divergence, although evidence is contradictory as to whether Alus associate with lowly or highly diverged genes. If Alu have any effect it is not by provision of CpG islands, because they are especially rare near to transcriptional start sites. Previously reported Alu enrichment for genes serving certain cellular functions, suggested to be evidence of functional importance of Alus, appears to be partly a byproduct of the association with broadly expressed genes. CONCLUSION The abundance of Alu near broadly expressed genes is better explained by their preferential preservation near to housekeeping genes rather than by a modifying effect on expression of genes.
Collapse
Affiliation(s)
- Araxi O Urrutia
- Department of Biology and Biochemistry, University of Bath, Bath, BA4 7AY, UK.
| | | | | |
Collapse
|
29
|
Umylny B, Presting G, Efird JT, Klimovitsky BI, Ward WS. Most human Alu and murine B1 repeats are unique. J Cell Biochem 2007; 102:110-21. [PMID: 17407136 DOI: 10.1002/jcb.21278] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Alus and B1s are short interspersed repeat elements (SINEs) indirectly derived from the 7SL RNA gene. While most researchers recognize that there exists extensive variability between individual elements, the extent of this variability has never been systematically tested. We examined all Alu elements over 200 nucleotides and all B1 elements over 100 nucleotides in the human and mouse genomes, and analyzed the number of copies of each element at various stringencies from 22 nucleotides to full length. Over 98% of 923,277 Alus and 365,377 B1s examined were unique when queried at full length. When the criterion was reduced to half the length of the repeat, 97% of the Alus and 73% of the B1s were still found to be a single copy. All single and multi-copy sequences have been mapped and documented. Access to the data is possible using the AluPlus website http://www.ibr.hawaii.edu.
Collapse
Affiliation(s)
- Boris Umylny
- Institute of Biogenesis Research, John A. Burns School of Medicine, University of Hawaii, Honolulu, Hawaii 96822, USA
| | | | | | | | | |
Collapse
|
30
|
Abstract
Alus and B1s are short interspersed repeat elements (SINEs) derived from the 7SL RNA gene. Alus and B1s exist in the cytoplasm as non-coding RNA indicating that they are actively transcribed, but their function, if any, is unknown. Transcription of individual SINEs is a prerequisite for retroposition, but it is also possible that individual Alu and B1 elements have some cellular functions. Previous studies suggest that transcription of Alu elements depends on the presence of an RNA polymerase-III bipartite promoter and the poly-A tail. Sequencing of small RNAs has demonstrated that the members of the Y and S subfamily are expressed. We analyzed almost one million Alu sequences longer than 200 nucleotides for the presence of RNA polymerase-III bipartite promoter sequences. More than half contained a promoter indicating some potential for expression. We searched 7.7 million human EST sequences in dbEST for the presence of Alu non-coding RNAs and found evidence for the expression of 452. Analysis of mouse spermatogenic dbEST libraries revealed an apparent relationship between the level of differentiation and the level of B1-related sequences in the EST library.
Collapse
Affiliation(s)
- Boris Umylny
- Asia Pacific Bioinformatics Research Institute, Honolulu, HI, USA
| | | | | |
Collapse
|
31
|
Pelley JL, Nicholls CD, Beattie TL, Brown CB. Discovery and characterization of a novel splice variant of the GM-CSF receptor α subunit. Exp Hematol 2007; 35:1483-94. [PMID: 17681666 DOI: 10.1016/j.exphem.2007.06.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2006] [Revised: 06/06/2007] [Accepted: 06/06/2007] [Indexed: 10/23/2022]
Abstract
OBJECTIVE To characterize a novel splice variant of the alpha subunit of the granulocyte-macrophage colony-stimulating factor (GM-CSF) receptor (GMRalpha), which we discovered in human neutrophils. METHODS We used reverse transcriptase polymerase chain reaction to identify, characterize, and examine the expression of a novel splice variant of the GMRalpha transcript. At the protein level, surface plasmon resonance was used to measure the affinity of a recombinant soluble form of the novel GMRalpha protein for GM-CSF ligand. The full-length novel GMRalpha protein was expressed in a recombinant cell culture system, and its expression and localization were examined using Western blotting, I(125) GM-CSF binding assays, flow cytometry, and a soluble GMRalpha enzyme-linked immunosorbent assay. RESULTS The novel GMRalpha transcript identified herein contains a previously undescribed exon of the GMRalpha gene; this exon derives from an Alu DNA repeat element, and is alternatively spliced in the novel GMRalpha transcript. Inclusion of this 102 nucleotide exon results in translation of a protein product, which we have named Alu-GMRalpha. Alu-GMRalpha is identical to cell surface GMRalpha, but additionally contains a 34 amino-acid insert in the juxtamembrane region of the extracellular domain of GMRalpha. Functionally, the Alu-GMRalpha-specific epitope does not modify the ability of the protein to bind GM-CSF, but rather appears to be preferentially targeted by ectodomain proteases to mediate the release of a third soluble GM-CSF receptor into the extracellular space. CONCLUSIONS This study provides the first example of a cytokine receptor system in which soluble receptors are produced by three distinct mechanisms. Our results highlight the importance of soluble GMRalpha proteins in regulation of GM-CSF signaling.
Collapse
Affiliation(s)
- Jennifer L Pelley
- Department of Biochemistry and Molecular Biology and Southern Alberta Cancer Research Institute, University of Calgary, Calgary, Alberta, Canada
| | | | | | | |
Collapse
|
32
|
Analysis of the features and source gene composition of the AluYg6 subfamily of human retrotransposons. BMC Evol Biol 2007; 7:102. [PMID: 17603915 PMCID: PMC1925064 DOI: 10.1186/1471-2148-7-102] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2007] [Accepted: 07/01/2007] [Indexed: 11/19/2022] Open
Abstract
Background Alu elements are a family of SINE retrotransposons in primates. They are classified into subfamilies according to specific diagnostic mutations from the general Alu consensus. It is now believed that there may be several retrotranspositionally-competent source genes within an Alu subfamily. To investigate the evolution of young Alu elements it is critical to have access to complete subfamilies, which, following the release of the final human genome assembly, can now be obtained using in silico methods. Results 380 elements belonging to the young AluYg6 subfamily were identified in the human genome, a number significantly exceeding prior expectations. An AluYg6 element was also identified in the chimpanzee genome, indicating that the subfamily is older than previously estimated, and appears to have undergone a period of dormancy before its expansion. The relative contributions of back mutation and gene conversion to variation at the six diagnostic positions are examined, and cases of complete forward gene conversion events are reported. Two small subfamilies derived from AluYg6 have been identified, named AluYg6a2 and AluYg5b3, which contain 40 and 27 members, respectively. These small subfamilies are used to illustrate the ambiguity regarding Alu subfamily definition, and to assess the contribution of secondary source genes to the AluYg6 subfamily. Conclusion The number of elements in the AluYg6 subfamily greatly exceeds prior expectations, indicating that the current knowledge of young Alu subfamilies is incomplete, and that prior analyses that have been carried out using these data may have generated inaccurate results. A definition of primary and secondary source genes has been provided, and it has been shown that several source genes have contributed to the proliferation of the AluYg6 subfamily. Access to the sequence data for the complete AluYg6 subfamily will be invaluable in future computational analyses investigating the evolution of young Alu subfamilies.
Collapse
|
33
|
Netchvolodov KK, Boiko AV, Ryskov AP, Kupriyanova NS. Evolutionary divergence of the pre-promotor region of ribosomal DNA in the great apes. ACTA ACUST UNITED AC 2007; 17:378-91. [PMID: 17343212 DOI: 10.1080/10425170600752643] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The human ribosomal intergenic spacer (rIGS) differs considerably on nucleotide sequence and regulatory elements positioning from their counterparts in the mouse, rat and Xenopus laevis. In the present study, we have PCR amplified, cloned and sequenced the rIGS fragments of about 4.5 kb length, located approximately 2 kb upstream of the rRNA transcription start point for the great apes, Pan paniscus, Pan troglodytes, Gorilla gorilla and Pongo pygmaeus. Alignment of the primates' orthologic nucleotide sequences reveals high extent of similarity, with the exception of highly repetitious region between the two Alu repeats, nearest to the onset of transcription. Data obtained have been analyzed for further understanding of the evolution of repetitive sequences. We have also shown, that MARs/SARs distribution patterns in the pre-promoter rIGSs of the great apes and the mouse are surprisingly similar in spite of an absence of similarity in the primary structure and regulatory elements organization in the region under study.
Collapse
|
34
|
Hamada T, Murasawa S, Asahara T. Simple screening method for differentially methylated regions of the genome using a small number of cells. Biochem Biophys Res Commun 2007; 353:275-9. [PMID: 17178104 DOI: 10.1016/j.bbrc.2006.12.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2006] [Accepted: 12/04/2006] [Indexed: 10/23/2022]
Abstract
Genomic DNA methylation is a major epigenetic mechanism controlling the expression of genetic information. Therefore, identifying regions of the genome that are differentially methylated in different cells is a useful strategy for the study of biological phenomena. To date, several useful screening methods have been established for identifying differentially methylated genomic regions. However, it is impossible to use these methods in fields of study in which it is difficult to obtain a large number of uniform cells, because considerable amounts of genomic DNA are required. Given this situation, we developed a method for preparing large genomic DNA from a small number of cells, and a simple and highly sensitive method for screening for differentially methylated sites. Combined, these two methods comprise a simple screening method, which we named "Differential Methylation Site Scanning" (DMSS), for identifying differentially methylated regions of the genome from a small number of cells. Just 10 cells are sufficient for the method described here.
Collapse
Affiliation(s)
- Tsuyoshi Hamada
- Department of Anatomy and Neurobiology, National Defense Medical Collage, 3-2 Namiki, Tokorozawa, Saitama 359-8513, Japan.
| | | | | |
Collapse
|
35
|
Mola G, Vela E, Fernández-Figueras MT, Isamat M, Muñoz-Mármol AM. Exonization of Alu-generated splice variants in the survivin gene of human and non-human primates. J Mol Biol 2006; 366:1055-63. [PMID: 17204284 DOI: 10.1016/j.jmb.2006.11.089] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2006] [Revised: 11/28/2006] [Accepted: 11/28/2006] [Indexed: 11/23/2022]
Abstract
Survivin is a member of the inhibitor apoptosis family that is overexpressed in many malignancies. It has five known alternative splice forms, some of which differ in their antiapoptotic properties and expression levels in human cancers. Here we describe a novel donor splice site (DSS), 2B+32 DSS, which is used in conjunction with survivin alternative exon 2B, resulting in the inclusion of 32 additional nucleotides from intron 2 at the 3' end of this exon. Sequence analysis showed that both the classical exon 2B DSS and 2B+32 are provided by an Alu sequence, which is inserted in intron 2 downstream of a functional acceptor splice site, leading to the exonization of part of the repetitive element. Minor transcripts including the 2B+32 alternative exon, or retaining the whole intronic region comprised between exons 2B and 3, were detected in several human cell lines and in some human tissues. Survivin 2B+32 containing variants acquire a premature stop codon (PTC) and may therefore be degraded by the nonsense mediated decay pathway. The implication of these novel isoforms, as well as other PTC+ survivin variants, in the overall regulation of survivin expression is discussed. Sequence analysis of intron 2 which contains the Alu Y element was performed on different primate species in order to trace its insertion and exonization during primate evolution.
Collapse
Affiliation(s)
- Gemma Mola
- Department of Pathology, Hospital Universitari Germans Trias i Pujol, Autonomous University of Barcelona, Spain
| | | | | | | | | |
Collapse
|
36
|
Lefebvre J, Fan J, Chevalier S, Sullivan R, Carmona E, Manjunath P. Genomic structure and tissue-specific expression of human and mouse genes encoding homologues of the major bovine seminal plasma proteins. Mol Hum Reprod 2006; 13:45-53. [PMID: 17085770 DOI: 10.1093/molehr/gal098] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Sperm capacitation is a maturation event that takes place in the female reproductive tract and is essential for fertilization. A family of phospholipid-binding proteins present in bovine seminal plasma (BSP proteins) binds the sperm membrane at ejaculation and promotes bovine sperm capacitation. Homologues of these proteins have also been isolated from boar, ram, goat, bison and stallion seminal fluid, suggesting that BSP proteins and their homologues are conserved among mammals. However, there have been no reports on BSP-homologous proteins in mice and humans to date. A search of the mouse and human genomes, using the nucleic acid sequences of BSP proteins, revealed the presence of three BSP-like sequences in the mouse genome, named mouse BSP Homologue 1 (mBSPH1), mBSPH2 and mBSPH3, and one sequence in the human genome (hBSPH1). Mouse epididymal expressed sequence tags corresponding to partial sequences of mBSPH1 and mBSPH2 were identified. The entire complementary DNA (cDNA) sequences of mBSPH1 and mBSPH2 from mouse epididymis and hBSPH1 from human epididymis were obtained by 5'-/3'-rapid amplification of cDNA ends (RACE) and encode predicted proteins containing two tandemly repeated fibronectin type II domains, which is the signature of the BSP family of proteins. Using RT-PCR, it was revealed that mBSPH1, mBSPH2 and hBSPH1 mRNA are expressed only in the epididymis. Expression of mBSPH3 was not detected in any tissue and probably represents a pseudogene. This work shows, for the first time, that BSP homologues are expressed in mouse and human and may be involved in sperm capacitation in these species.
Collapse
Affiliation(s)
- J Lefebvre
- Guy-Bernier Research Centre, Maisonneuve-Rosemont Hospital and Department of Medicine, University of Montreal, Montreal, Québec, Canada
| | | | | | | | | | | |
Collapse
|
37
|
Abrusán G, Krambeck HJ. The distribution of L1 and Alu retroelements in relation to GC content on human sex chromosomes is consistent with the ectopic recombination model. J Mol Evol 2006; 63:484-92. [PMID: 16955238 DOI: 10.1007/s00239-005-0275-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2005] [Accepted: 05/30/2006] [Indexed: 10/24/2022]
Abstract
The distribution of Alu and L1 retroelements in the human genome changes with their age. Active retroelements target AT-rich regions, but their frequency increases in GC- and gene-rich regions of the genome with increasing age of the insertions. Currently there is no consensus on the mechanism generating this pattern. In this paper we test the hypothesis that selection against deleterious deletions caused by ectopic recombination between repeats is the main cause of the inhomogeneous distribution of L1s and Alus, by means of a detailed analysis of the GC distribution of the repeats on the sex chromosomes. We show that (1) unlike on the autosomes and X chromosome, L1s do not accumulate on the Y chromosome in GC-rich regions, whereas Alus accumulate there to a minor extent; (2) on the Y chromosome Alu and L1 densities are positively correlated, unlike the negative correlation on other chromosomes; and (3) in gene-poor regions of chromosome 4 and X, the distribution of Alus and L1s does not shift toward GC-rich regions. In addition, we show that although local GC content of long L1 insertions is lower than average, their selective loss from recombining chromosomes is not the main cause of the enrichment of ancient L1s in GC-rich regions. The results support the hypothesis that ectopic recombination causes the shift of Alu and L1 distributions toward the gene-rich regions of the genome.
Collapse
Affiliation(s)
- György Abrusán
- Max Planck Institute of Limnology, Department of Ecophysiology, August Thienemann Str. 2, 24306, Plön, Germany,
| | | |
Collapse
|
38
|
Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet Genome Res 2005; 110:462-7. [PMID: 16093699 DOI: 10.1159/000084979] [Citation(s) in RCA: 2338] [Impact Index Per Article: 123.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2003] [Accepted: 04/06/2004] [Indexed: 12/13/2022] Open
Abstract
Repbase Update is a comprehensive database of repetitive elements from diverse eukaryotic organisms. Currently, it contains over 3600 annotated sequences representing different families and subfamilies of repeats, many of which are unreported anywhere else. Each sequence is accompanied by a short description and references to the original contributors. Repbase Update includes Repbase Reports, an electronic journal publishing newly discovered transposable elements, and the Transposon Pub, a web-based browser of selected chromosomal maps of transposable elements. Sequences from Repbase Update are used to screen and annotate repetitive elements using programs such as Censor and RepeatMasker. Repbase Update is available on the worldwide web at http://www.girinst.org/Repbase_Update.html.
Collapse
Affiliation(s)
- J Jurka
- Genetic Information Research Institute, Mountain View, CA 94043, USA.
| | | | | | | | | | | |
Collapse
|
39
|
Abstract
Background Alu elements are Short INterspersed Elements (SINEs) in primate genomes that have proven useful as markers for studying genome evolution, population biology and phylogenetics. Most of these applications, however, have been limited to humans and their nearest relatives, chimpanzees. In an effort to expand our understanding of Alu sequence evolution and to increase the applicability of these markers to non-human primate biology, we have analyzed available Alu sequences for loci specific to platyrrhine (New World) primates. Results Branching patterns along an Alu sequence phylogeny indicate three major classes of platyrrhine-specific Alu sequences. Sequence comparisons further reveal at least three New World monkey-specific subfamilies; AluTa7, AluTa10, and AluTa15. Two of these subfamilies appear to be derived from a gene conversion event that has produced a recently active fusion of AluSc- and AluSp-type elements. This is a novel mode of origin for new Alu subfamilies. Conclusion The use of Alu elements as genetic markers in studies of genome evolution, phylogenetics, and population biology has been very productive when applied to humans. The characterization of these three new Alu subfamilies not only increases our understanding of Alu sequence evolution in primates, but also opens the door to the application of these genetic markers outside the hominid lineage.
Collapse
Affiliation(s)
- David A Ray
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for Bio-Modular Multiscale Systems, Louisiana State University, Baton Rouge, LA, 70803, USA
- Department of Biology, West Virginia University, Morgantown, WV, 26506, USA
| | - Mark A Batzer
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for Bio-Modular Multiscale Systems, Louisiana State University, Baton Rouge, LA, 70803, USA
| |
Collapse
|
40
|
Hedges DJ, Cordaux R, Xing J, Witherspoon DJ, Rogers AR, Jorde LB, Batzer MA. Modeling the amplification dynamics of human Alu retrotransposons. PLoS Comput Biol 2005; 1:e44. [PMID: 16201008 PMCID: PMC1239904 DOI: 10.1371/journal.pcbi.0010044] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2005] [Accepted: 08/24/2005] [Indexed: 11/19/2022] Open
Abstract
Retrotransposons have had a considerable impact on the overall architecture of the human genome. Currently, there are three lineages of retrotransposons (Alu, L1, and SVA) that are believed to be actively replicating in humans. While estimates of their copy number, sequence diversity, and levels of insertion polymorphism can readily be obtained from existing genomic sequence data and population sampling, a detailed understanding of the temporal pattern of retrotransposon amplification remains elusive. Here we pose the question of whether, using genomic sequence and population frequency data from extant taxa, one can adequately reconstruct historical amplification patterns. To this end, we developed a computer simulation that incorporates several known aspects of primate Alu retrotransposon biology and accommodates sampling effects resulting from the methods by which mobile elements are typically discovered and characterized. By modeling a number of amplification scenarios and comparing simulation-generated expectations to empirical data gathered from existing Alu subfamilies, we were able to statistically reject a number of amplification scenarios for individual subfamilies, including that of a rapid expansion or explosion of Alu amplification at the time of human–chimpanzee divergence. Nearly 50% of the human genome is composed of mobile elements. While much of this sequence consists of inactive “fossil” elements that are no longer actively moving or generating new copies, three families are currently proliferating in human genomes. Among these, the Alu lineage has reached a copy number of over 1 million and alone accounts for approximately 10% of the genome. While considerable evidence has been gathered concerning the underlying biological mechanisms of Alu mobilization and proliferation, a detailed understanding of Alu amplification history is currently lacking. Researchers are aware, for example, that several thousand Alu elements have inserted within the human genome since the divergence of humans and chimpanzees, but how those insertions were distributed over this ~6-million-year time period is currently unknown. In this work, the authors introduce a simulation framework that seeks to incorporate both sequence diversity and empirically gathered population data from human Alu elements, in order to provide a better understanding of the last several million years of human Alu evolution. The results suggest that a rapid explosion of Alu amplification at the time of the human–chimpanzee divergence is unlikely. Therefore, it is improbable that an increase in Alu retrotransposition activity was involved in the speciation of humans and chimpanzees.
Collapse
Affiliation(s)
- Dale J Hedges
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for Bio-Modular Microsystems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Richard Cordaux
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for Bio-Modular Microsystems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Jinchuan Xing
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for Bio-Modular Microsystems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - David J Witherspoon
- Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City, Utah, United States of America
| | - Alan R Rogers
- Department of Anthropology, University of Utah, Salt Lake City, Utah, United States of America
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City, Utah, United States of America
| | - Mark A Batzer
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for Bio-Modular Microsystems, Louisiana State University, Baton Rouge, Louisiana, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
41
|
Ludwig A, Rozhdestvensky TS, Kuryshev VY, Schmitz J, Brosius J. An Unusual Primate Locus that Attracted Two Independent Alu Insertions and Facilitates their Transcription. J Mol Biol 2005; 350:200-14. [PMID: 15922354 DOI: 10.1016/j.jmb.2005.03.058] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2005] [Revised: 03/18/2005] [Accepted: 03/21/2005] [Indexed: 10/25/2022]
Abstract
BC200 RNA, a neuronal, small non-messenger RNA that originated from a monomeric Alu element is specific to anthropoid primates. Tarsiers lack an insert at the orthologous genomic position, whereas strepsirrhines (Lemuriformes and Lorisiformes) acquired a dimeric Alu element, independently from anthropoids. In Galago moholi, the CpG dinucleotides are conspicuously conserved, while in Eulemur coronatus a large proportion is changed, indicating that the G.moholi Alu is under purifying selection and might be transcribed. Indeed, Northern blot analysis of total brain RNA from G.moholi with a specific probe revealed a prominent signal. In contrast, a corresponding signal was absent from brain RNA from E.coronatus. Isolation and sequence analysis of additional strepsirrhine loci confirmed the differential sequence conservation including CpG patterns of the orthologous dimeric Alu elements in Lorisiformes and Lemuriformes. Interestingly, all examined Alu elements from Lorisiformes were transcribed, while all from Lemuriformes were silent when transiently transfected into HeLa cells. Upstream sequences, especially those between the transcriptional start site and -22 upstream, were important for basal transcriptional activity. Thus, the BC200 RNA gene locus attracted two independent Alu insertions during its evolutionary history and provided upstream promoter elements required for their transcription.
Collapse
Affiliation(s)
- A Ludwig
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, D-48149 Münster, Germany
| | | | | | | | | |
Collapse
|
42
|
Krull M, Brosius J, Schmitz J. Alu-SINE exonization: en route to protein-coding function. Mol Biol Evol 2005; 22:1702-11. [PMID: 15901843 DOI: 10.1093/molbev/msi164] [Citation(s) in RCA: 114] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The majority of more than one million primate-specific Alu elements map to nonfunctional parts of introns or intergenic sequences. Once integrated, they have the potential to become exapted as functional modules, e.g., as protein-coding domains via alternative splicing. This particular process is also termed exonization and increases protein versatility. Here we investigate 153 human chromosomal loci where Alu elements were conceivably exonized. In four selected examples, we generated, with the aid of representatives of all primate infraorders, phylogenetic reconstructions of the evolutionary steps presumably leading to exonization of Alu elements. We observed a variety of possible scenarios in which Alu elements led to novel mRNA splice forms and which, like most evolutionary processes, took different courses in different lineages. Our data show that, once acquired, some exonizations were lost again in some lineages. In general, Alu exonization occurred at various time points over the evolutionary history of primate lineages, and protein-coding potential was acquired either relatively soon after integration or millions of years thereafter. The course of these paths can probably be generalized to the exonization of other elements as well.
Collapse
Affiliation(s)
- Maren Krull
- Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
| | | | | |
Collapse
|
43
|
Sawai H, Kawamoto Y, Takahata N, Satta Y. Evolutionary relationships of major histocompatibility complex class I genes in simian primates. Genetics 2005; 166:1897-907. [PMID: 15126407 PMCID: PMC1470823 DOI: 10.1534/genetics.166.4.1897] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
New World monkeys (NWMs) occupy a critical phylogenetic position in elucidating the evolutionary process of major histocompatibility complex (MHC) class I genes in primates. From three subfamilies of Aotinae, Cebinae, and Atelinae, the 5'-flanking regions of 18 class I genes are obtained and phylogenetically examined in terms of Alu/LINE insertion elements as well as the nucleotide substitutions. Two pairs of genes from Aotinae and Atelinae are clearly orthologous to human leukocyte antigen (HLA) -E and -F genes. Of the remaining 14 genes, 8 belong to the distinct group B, together with HLA-B and -C, to the exclusion of all other HLA class I genes. These NWM genes are classified into four groups, designated as NWM-B1, -B2, -B3, and -B4. Of these, NWM-B2 is orthologous to HLA-B/C. Also, orthologous relationships of NWM-B1, -B2, and -B3 exist among different families of Cebidae and Atelidae, which is in sharp contrast to the genus-specific gene organization within the subfamily Callitrichinae. The other six genes belong to the distinct group G. However, a clade of these NWM genes is almost equally related to HLA-A, -J, -G, and -K, and there is no evidence for their orthologous relationships to HLA-G. It is argued that class I genes in simian primates duplicated extensively in their common ancestral lineage and that subsequent evolution in descendant species has been facilitated mainly by independent loss of genes.
Collapse
Affiliation(s)
- Hiromi Sawai
- Department of Biosystems Science, Graduate University for Advanced Studies (Sokendai), Hayama, Kanagawa 240-0193, Japan
| | | | | | | |
Collapse
|
44
|
Price AL, Eskin E, Pevzner PA. Whole-genome analysis of Alu repeat elements reveals complex evolutionary history. Genome Res 2004; 14:2245-52. [PMID: 15520288 PMCID: PMC525682 DOI: 10.1101/gr.2693004] [Citation(s) in RCA: 164] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2004] [Accepted: 08/14/2004] [Indexed: 11/24/2022]
Abstract
Alu repeats are the most abundant family of repeats in the human genome, with over 1 million copies comprising 10% of the genome. They have been implicated in human genetic disease and in the enrichment of gene-rich segmental duplications in the human genome, and they form a rich fossil record of primate and human history. Alu repeat elements are believed to have arisen from the replication of a small number of source elements, whose evolution over time gives rise to the 31 Alu subfamilies currently reported in Repbase Update. We apply a novel method to identify and statistically validate 213 Alu subfamilies. We build an evolutionary tree of these subfamilies and conclude that the history of Alu evolution is more complex than previous studies had indicated.
Collapse
Affiliation(s)
- Alkes L Price
- Department of Computer Science and Engineering, University of California-San Diego, La Jolla, California 92093-0114, USA.
| | | | | |
Collapse
|
45
|
Evolution and distribution of RNA polymerase II regulatory sites from RNA polymerase III dependant mobile Alu elements. BMC Evol Biol 2004; 4:37. [PMID: 15461819 PMCID: PMC524483 DOI: 10.1186/1471-2148-4-37] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2004] [Accepted: 10/04/2004] [Indexed: 11/24/2022] Open
Abstract
Background The primate-specific Alu elements, which originated 65 million years ago, exist in over a million copies in the human genome. These elements have been involved in genome shuffling and various diseases not only through retrotransposition but also through large scale Alu-Alu mediated recombination. Only a few subfamilies of Alus are currently retropositionally active and show insertion/deletion polymorphisms with associated phenotypes. Retroposition occurs by means of RNA intermediates synthesised by a RNA polymerase III promoter residing in the A-Box and B-Box in these elements. Alus have also been shown to harbour a number of transcription factor binding sites, as well as hormone responsive elements. The distribution of Alus has been shown to be non-random in the human genome and these elements are increasingly being implicated in diverse functions such as transcription, translation, response to stress, nucleosome positioning and imprinting. Results We conducted a retrospective analysis of putative functional sites, such as the RNA pol III promoter elements, pol II regulatory elements like hormone responsive elements and ligand-activated receptor binding sites, in Alus of various evolutionary ages. We observe a progressive loss of the RNA pol III transcriptional potential with concomitant accumulation of RNA pol II regulatory sites. We also observe a significant over-representation of Alus harboring these sites in promoter regions of signaling and metabolism genes of chromosome 22, when compared to genes of information pathway components, structural and transport proteins. This difference is not so significant between functional categories in the intronic regions of the same genes. Conclusions Our study clearly suggests that Alu elements, through retrotransposition, could distribute functional and regulatable promoter elements, which in the course of subsequent selection might be stabilized in the genome. Exaptation of regulatory elements in the preexisting genes through Alus could thus have contributed to evolution of novel regulatory networks in the primate genomes. With such a wide spectrum of regulatory sites present in Alus, it also becomes imperative to screen for variations in these sites in candidate genes, which are otherwise repeat-masked in studies pertaining to identification of predisposition markers.
Collapse
|
46
|
Singer SS, Männel DN, Hehlgans T, Brosius J, Schmitz J. From “Junk” to Gene: Curriculum vitae of a Primate Receptor Isoform Gene. J Mol Biol 2004; 341:883-6. [PMID: 15328599 DOI: 10.1016/j.jmb.2004.06.070] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2004] [Revised: 06/28/2004] [Accepted: 06/28/2004] [Indexed: 10/26/2022]
Abstract
Exonization of Alu retroposons awakens public opinion, particularly when causing genetic diseases. However, often neglected, alternative "Alu-exons" also carry the potential to greatly enhance genetic diversity by increasing the transcriptome of primates chiefly via alternative splicing.Here, we report a 5' exon generated from one of the two alternative transcripts in human tumor necrosis factor receptor gene type 2 (p75TNFR) that contains an ancient Alu-SINE, which provides an alternative N-terminal protein-coding domain. We follow the primate evolution over the past 63 million years to reconstruct the key events that gave rise to a novel receptor isoform. The Alu integration and start codon formation occurred between 58 and 40 million years ago (MYA) in the common ancestor of anthropoid primates. Yet a functional gene product could not be generated until a novel splice site and an open reading frame were introduced between 40 and 25 MYA on the catarrhine lineage (Old World monkeys including apes).
Collapse
Affiliation(s)
- Silke S Singer
- Institute of Immunology, University of Regensburg, Germany.
| | | | | | | | | |
Collapse
|
47
|
Sorek R, Lev-Maor G, Reznik M, Dagan T, Belinky F, Graur D, Ast G. Minimal conditions for exonization of intronic sequences: 5' splice site formation in alu exons. Mol Cell 2004; 14:221-31. [PMID: 15099521 DOI: 10.1016/s1097-2765(04)00181-9] [Citation(s) in RCA: 131] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2004] [Revised: 03/17/2004] [Accepted: 03/23/2004] [Indexed: 02/05/2023]
Abstract
Alu exonization, which is an evolutionary pathway that creates primate-specific transcriptomic diversity, is a powerful tool for studying alternative-splicing regulation. Through bioinformatic analyses combined with experimental methodology, we identified the mutational changes needed to create functional 5' splice sites in Alu. We revealed a complex mechanism by which the sequence composition of the 5' splice site and its base pairing with the small nuclear RNA U1 govern alternative splicing. We show that in Alu-derived GC introns the strength of the base pairing between U1 snRNA and the 5' splice site controls the skipping/inclusion ratio of alternative splicing. Based on these findings, we identified 7810 Alus within the human genome that are prone to exonization. Mutations in these Alus may cause genetic disorders or contribute to human-specific protein diversity.
Collapse
Affiliation(s)
- Rotem Sorek
- Department of Human Genetics and Molecular Medicine, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | | | | | | | | | | | | |
Collapse
|
48
|
Sawai H, Kawamoto Y, Takahata N, Satta Y. Evolutionary Relationships of Major Histocompatibility Complex Class I Genes in Simian Primates. Genetics 2004. [DOI: 10.1093/genetics/166.4.1897] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Abstract
New World monkeys (NWMs) occupy a critical phylogenetic position in elucidating the evolutionary process of major histocompatibility complex (MHC) class I genes in primates. From three subfamilies of Aotinae, Cebinae, and Atelinae, the 5′-flanking regions of 18 class I genes are obtained and phylogenetically examined in terms of Alu/LINE insertion elements as well as the nucleotide substitutions. Two pairs of genes from Aotinae and Atelinae are clearly orthologous to human leukocyte antigen (HLA) -E and -F genes. Of the remaining 14 genes, 8 belong to the distinct group B, together with HLA-B and -C, to the exclusion of all other HLA class I genes. These NWM genes are classified into four groups, designated as NWM-B1, -B2, -B3, and -B4. Of these, NWM-B2 is orthologous to HLA-B/C. Also, orthologous relationships of NWM-B1, -B2, and -B3 exist among different families of Cebidae and Atelidae, which is in sharp contrast to the genus-specific gene organization within the subfamily Callitrichinae. The other six genes belong to the distinct group G. However, a clade of these NWM genes is almost equally related to HLA-A, -J, -G, and -K, and there is no evidence for their orthologous relationships to HLA-G. It is argued that class I genes in simian primates duplicated extensively in their common ancestral lineage and that subsequent evolution in descendant species has been facilitated mainly by independent loss of genes.
Collapse
Affiliation(s)
- Hiromi Sawai
- Department of Biosystems Science, Graduate University for Advanced Studies (Sokendai), Hayama, Kanagawa 240-0193, Japan
| | - Yoshi Kawamoto
- Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan
| | - Naoyuki Takahata
- Department of Biosystems Science, Graduate University for Advanced Studies (Sokendai), Hayama, Kanagawa 240-0193, Japan
| | - Yoko Satta
- Department of Biosystems Science, Graduate University for Advanced Studies (Sokendai), Hayama, Kanagawa 240-0193, Japan
| |
Collapse
|
49
|
Shigenari A, Ando A, Renard C, Chardon P, Shiina T, Kulski JK, Yasue H, Inoko H. Nucleotide sequencing analysis of the swine 433-kb genomic segment located between the non-classical and classical SLA class I gene clusters. Immunogenetics 2003; 55:695-705. [PMID: 14673549 DOI: 10.1007/s00251-003-0627-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2003] [Revised: 11/06/2003] [Indexed: 10/26/2022]
Abstract
Genome analysis of the swine leukocyte antigen ( SLA) region is needed to obtain information on the MHC genomic sequence similarities and differences between the swine and human, given the possible use of swine organs for xenotransplantation. Here, the genomic sequences of a 433-kb segment located between the non-classical and classical SLA class I gene clusters were determined and analyzed for gene organization and contents of repetitive sequences. The genomic organization and diversity of this swine non-class I gene region was compared with the orthologous region of the human leukocyte antigen ( HLA) complex. The length of the fully sequenced SLA genomic segment was 433 kb compared with 595 kb in the corresponding HLA class I region. This 162-kb difference in size between the swine and human genomic segments can be explained by indel activity, and the greater variety and density of repetitive sequences within the human MHC. Twenty-one swine genes with strong sequence similarity to the corresponding human genes were identified, with the gene order from the centromere to telomere of HCR - SPR1 - SEEK1 - CDSN - STG - DPCR1 - KIAA1885 - TFIIH - DDR - IER3 - FLOT1 - TUBB - KIAA0170 - NRM - KIAA1949 - DDX16 - FLJ13158 - MRPS18B - FB19 - ABCFI - CAT56. The human SEEK1 and DPCR1 genes are pseudogenes in swine. We conclude that the swine non-class I gene region that we have sequenced is highly conserved and therefore homologous to the corresponding region located between the HLA-C and HLA-E genes in the human.
Collapse
Affiliation(s)
- Atsuko Shigenari
- Department of Molecular Life Science, Tokai University School of Medicine, Bohseidai, Isehara, 259-1193, Kanagawa, Japan
| | | | | | | | | | | | | | | |
Collapse
|
50
|
Bailey JA, Liu G, Eichler EE. An Alu transposition model for the origin and expansion of human segmental duplications. Am J Hum Genet 2003; 73:823-34. [PMID: 14505274 PMCID: PMC1180605 DOI: 10.1086/378594] [Citation(s) in RCA: 323] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2003] [Accepted: 07/17/2003] [Indexed: 01/17/2023] Open
Abstract
Relative to genomes of other sequenced organisms, the human genome appears particularly enriched for large, highly homologous segmental duplications (> or =90% sequence identity and > or =10 kbp in length). The molecular basis for this enrichment is unknown. We sought to gain insight into the mechanism of origin, by systematically examining sequence features at the junctions of duplications. We analyzed 9,464 junctions within regions of high-quality finished sequence from a genomewide set of 2,366 duplication alignments. We observed a highly significant (P<.0001) enrichment of Alu short interspersed element (SINE) sequences near or within the junction. Twenty-seven percent of all segmental duplications terminated within an Alu repeat. The Alu junction enrichment was most pronounced for interspersed segmental duplications separated by > or =1 Mb of intervening sequence. Alu elements at the junctions showed higher levels of divergence, consistent with Alu-Alu-mediated recombination events. When we classified Alu elements into major subfamilies, younger elements (AluY and AluS) accounted for the enrichment, whereas the oldest primate family (AluJ) showed no enrichment. We propose that the primate-specific burst of Alu retroposition activity (which occurred 35-40 million years ago) sensitized the ancestral human genome for Alu-Alu-mediated recombination events, which, in turn, initiated the expansion of gene-rich segmental duplications and their subsequent role in nonallelic homologous recombination.
Collapse
Affiliation(s)
- Jeffrey A Bailey
- Department of Genetics, Center for Computational Genomics, Case Western Reserve University School of Medicine and University Hospitals of Cleveland, Cleveland, OH, 44106, USA
| | | | | |
Collapse
|