1
|
Lallemand T, Leduc M, Landès C, Rizzon C, Lerat E. An Overview of Duplicated Gene Detection Methods: Why the Duplication Mechanism Has to Be Accounted for in Their Choice. Genes (Basel) 2020; 11:E1046. [PMID: 32899740 PMCID: PMC7565063 DOI: 10.3390/genes11091046] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 09/01/2020] [Accepted: 09/02/2020] [Indexed: 12/11/2022] Open
Abstract
Gene duplication is an important evolutionary mechanism allowing to provide new genetic material and thus opportunities to acquire new gene functions for an organism, with major implications such as speciation events. Various processes are known to allow a gene to be duplicated and different models explain how duplicated genes can be maintained in genomes. Due to their particular importance, the identification of duplicated genes is essential when studying genome evolution but it can still be a challenge due to the various fates duplicated genes can encounter. In this review, we first describe the evolutionary processes allowing the formation of duplicated genes but also describe the various bioinformatic approaches that can be used to identify them in genome sequences. Indeed, these bioinformatic approaches differ according to the underlying duplication mechanism. Hence, understanding the specificity of the duplicated genes of interest is a great asset for tool selection and should be taken into account when exploring a biological question.
Collapse
Affiliation(s)
- Tanguy Lallemand
- IRHS, Agrocampus-Ouest, INRAE, Université d’Angers, SFR 4207 QuaSaV, 49071 Beaucouzé, France; (T.L.); (M.L.); (C.L.)
| | - Martin Leduc
- IRHS, Agrocampus-Ouest, INRAE, Université d’Angers, SFR 4207 QuaSaV, 49071 Beaucouzé, France; (T.L.); (M.L.); (C.L.)
| | - Claudine Landès
- IRHS, Agrocampus-Ouest, INRAE, Université d’Angers, SFR 4207 QuaSaV, 49071 Beaucouzé, France; (T.L.); (M.L.); (C.L.)
| | - Carène Rizzon
- Laboratoire de Mathématiques et Modélisation d’Evry (LaMME), Université d’Evry Val d’Essonne, Université Paris-Saclay, UMR CNRS 8071, ENSIIE, USC INRAE, 23 bvd de France, CEDEX, 91037 Evry Paris, France;
| | - Emmanuelle Lerat
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
| |
Collapse
|
2
|
Kim S, Park J, Yeom SI, Kim YM, Seo E, Kim KT, Kim MS, Lee JM, Cheong K, Shin HS, Kim SB, Han K, Lee J, Park M, Lee HA, Lee HY, Lee Y, Oh S, Lee JH, Choi E, Choi E, Lee SE, Jeon J, Kim H, Choi G, Song H, Lee J, Lee SC, Kwon JK, Lee HY, Koo N, Hong Y, Kim RW, Kang WH, Huh JH, Kang BC, Yang TJ, Lee YH, Bennetzen JL, Choi D. New reference genome sequences of hot pepper reveal the massive evolution of plant disease-resistance genes by retroduplication. Genome Biol 2017; 18:210. [PMID: 29089032 PMCID: PMC5664825 DOI: 10.1186/s13059-017-1341-9] [Citation(s) in RCA: 152] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Accepted: 10/06/2017] [Indexed: 01/08/2023] Open
Abstract
Background Transposable elements are major evolutionary forces which can cause new genome structure and species diversification. The role of transposable elements in the expansion of nucleotide-binding and leucine-rich-repeat proteins (NLRs), the major disease-resistance gene families, has been unexplored in plants. Results We report two high-quality de novo genomes (Capsicum baccatum and C. chinense) and an improved reference genome (C. annuum) for peppers. Dynamic genome rearrangements involving translocations among chromosomes 3, 5, and 9 were detected in comparison between C. baccatum and the two other peppers. The amplification of athila LTR-retrotransposons, members of the gypsy superfamily, led to genome expansion in C. baccatum. In-depth genome-wide comparison of genes and repeats unveiled that the copy numbers of NLRs were greatly increased by LTR-retrotransposon-mediated retroduplication. Moreover, retroduplicated NLRs are abundant across the angiosperms and, in most cases, are lineage-specific. Conclusions Our study reveals that retroduplication has played key roles for the massive emergence of NLR genes including functional disease-resistance genes in pepper plants. Electronic supplementary material The online version of this article (doi:10.1186/s13059-017-1341-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Seungill Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jieun Park
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Seon-In Yeom
- Department of Agricultural Plant Science, Institute of Agriculture & Life Science, Gyeongsang National University, Jinju, 52828, South Korea
| | - Yong-Min Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Eunyoung Seo
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Ki-Tae Kim
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | - Myung-Shin Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Je Min Lee
- Department of Horticultural Science, Kyungpook National University, Daegu, 41566, South Korea
| | - Kyeongchae Cheong
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea.,Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | - Ho-Sub Shin
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Saet-Byul Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Koeun Han
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Jundae Lee
- Department of Horticulture, Chonbuk National University, Jeonju, 54896, South Korea
| | - Minkyu Park
- Department of Genetics, University of Georgia, Athens, GA, 30602-7223, USA
| | - Hyun-Ah Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Hye-Young Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Youngsill Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Soohyun Oh
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Joo Hyun Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Eunhye Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Eunbi Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - So Eui Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jongbum Jeon
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Hyunbin Kim
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Gobong Choi
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Hyeunjeong Song
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - JunKi Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Sang-Choon Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jin-Kyung Kwon
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Hea-Young Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Namjin Koo
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Yunji Hong
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Ryan W Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Won-Hee Kang
- Department of Agricultural Plant Science, Institute of Agriculture & Life Science, Gyeongsang National University, Jinju, 52828, South Korea
| | - Jin Hoe Huh
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Byoung-Cheorl Kang
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Tae-Jin Yang
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Yong-Hwan Lee
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea.,Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | | | - Doil Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.
| |
Collapse
|
3
|
Kim S, Park J, Yeom SI, Kim YM, Seo E, Kim KT, Kim MS, Lee JM, Cheong K, Shin HS, Kim SB, Han K, Lee J, Park M, Lee HA, Lee HY, Lee Y, Oh S, Lee JH, Choi E, Choi E, Lee SE, Jeon J, Kim H, Choi G, Song H, Lee J, Lee SC, Kwon JK, Lee HY, Koo N, Hong Y, Kim RW, Kang WH, Huh JH, Kang BC, Yang TJ, Lee YH, Bennetzen JL, Choi D. New reference genome sequences of hot pepper reveal the massive evolution of plant disease-resistance genes by retroduplication. Genome Biol 2017; 18:210. [PMID: 29089032 DOI: 10.1007/s13580-019-00157-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Revised: 06/19/2019] [Accepted: 10/06/2017] [Indexed: 05/24/2023] Open
Abstract
BACKGROUND Transposable elements are major evolutionary forces which can cause new genome structure and species diversification. The role of transposable elements in the expansion of nucleotide-binding and leucine-rich-repeat proteins (NLRs), the major disease-resistance gene families, has been unexplored in plants. RESULTS We report two high-quality de novo genomes (Capsicum baccatum and C. chinense) and an improved reference genome (C. annuum) for peppers. Dynamic genome rearrangements involving translocations among chromosomes 3, 5, and 9 were detected in comparison between C. baccatum and the two other peppers. The amplification of athila LTR-retrotransposons, members of the gypsy superfamily, led to genome expansion in C. baccatum. In-depth genome-wide comparison of genes and repeats unveiled that the copy numbers of NLRs were greatly increased by LTR-retrotransposon-mediated retroduplication. Moreover, retroduplicated NLRs are abundant across the angiosperms and, in most cases, are lineage-specific. CONCLUSIONS Our study reveals that retroduplication has played key roles for the massive emergence of NLR genes including functional disease-resistance genes in pepper plants.
Collapse
Affiliation(s)
- Seungill Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jieun Park
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Seon-In Yeom
- Department of Agricultural Plant Science, Institute of Agriculture & Life Science, Gyeongsang National University, Jinju, 52828, South Korea
| | - Yong-Min Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Eunyoung Seo
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Ki-Tae Kim
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | - Myung-Shin Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Je Min Lee
- Department of Horticultural Science, Kyungpook National University, Daegu, 41566, South Korea
| | - Kyeongchae Cheong
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | - Ho-Sub Shin
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Saet-Byul Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Koeun Han
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Jundae Lee
- Department of Horticulture, Chonbuk National University, Jeonju, 54896, South Korea
| | - Minkyu Park
- Department of Genetics, University of Georgia, Athens, GA, 30602-7223, USA
| | - Hyun-Ah Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Hye-Young Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Youngsill Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Soohyun Oh
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Joo Hyun Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Eunhye Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Eunbi Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - So Eui Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jongbum Jeon
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Hyunbin Kim
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Gobong Choi
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Hyeunjeong Song
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - JunKi Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Sang-Choon Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jin-Kyung Kwon
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Hea-Young Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Namjin Koo
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Yunji Hong
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Ryan W Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Won-Hee Kang
- Department of Agricultural Plant Science, Institute of Agriculture & Life Science, Gyeongsang National University, Jinju, 52828, South Korea
| | - Jin Hoe Huh
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Byoung-Cheorl Kang
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Tae-Jin Yang
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Yong-Hwan Lee
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | | | - Doil Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.
| |
Collapse
|
4
|
Kubiak MR, Makałowska I. Protein-Coding Genes' Retrocopies and Their Functions. Viruses 2017; 9:E80. [PMID: 28406439 DOI: 10.3390/v9040080] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2017] [Revised: 04/07/2017] [Accepted: 04/11/2017] [Indexed: 12/11/2022] Open
Abstract
Transposable elements, often considered to be not important for survival, significantly contribute to the evolution of transcriptomes, promoters, and proteomes. Reverse transcriptase, encoded by some transposable elements, can be used in trans to produce a DNA copy of any RNA molecule in the cell. The retrotransposition of protein-coding genes requires the presence of reverse transcriptase, which could be delivered by either non-long terminal repeat (non-LTR) or LTR transposons. The majority of these copies are in a state of “relaxed” selection and remain “dormant” because they are lacking regulatory regions; however, many become functional. In the course of evolution, they may undergo subfunctionalization, neofunctionalization, or replace their progenitors. Functional retrocopies (retrogenes) can encode proteins, novel or similar to those encoded by their progenitors, can be used as alternative exons or create chimeric transcripts, and can also be involved in transcriptional interference and participate in the epigenetic regulation of parental gene expression. They can also act in trans as natural antisense transcripts, microRNA (miRNA) sponges, or a source of various small RNAs. Moreover, many retrocopies of protein-coding genes are linked to human diseases, especially various types of cancer.
Collapse
|
5
|
Tan S, Cardoso-Moreira M, Shi W, Zhang D, Huang J, Mao Y, Jia H, Zhang Y, Chen C, Shao Y, Leng L, Liu Z, Huang X, Long M, Zhang YE. LTR-mediated retroposition as a mechanism of RNA-based duplication in metazoans. Genome Res 2016; 26:1663-1675. [PMID: 27934698 PMCID: PMC5131818 DOI: 10.1101/gr.204925.116] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Accepted: 10/18/2016] [Indexed: 01/09/2023]
Abstract
In a broad range of taxa, genes can duplicate through an RNA intermediate in a process mediated by retrotransposons (retroposition). In mammals, L1 retrotransposons drive retroposition, but the elements responsible for retroposition in other animals have yet to be identified. Here, we examined young retrocopies from various animals that still retain the sequence features indicative of the underlying retroposition mechanism. In Drosophila melanogaster, we identified and de novo assembled 15 polymorphic retrocopies and found that all retroposed loci are chimeras of internal retrocopies flanked by discontinuous LTR retrotransposons. At the fusion points between the mRNAs and the LTR retrotransposons, we identified shared short similar sequences that suggest the involvement of microsimilarity-dependent template switches. By expanding our approach to mosquito, zebrafish, chicken, and mammals, we identified in all these species recently originated retrocopies with a similar chimeric structure and shared microsimilarities at the fusion points. We also identified several retrocopies that combine the sequences of two or more parental genes, demonstrating LTR-retroposition as a novel mechanism of exon shuffling. Finally, we found that LTR-mediated retrocopies are immediately cotranscribed with their flanking LTR retrotransposons. Transcriptional profiling coupled with sequence analyses revealed that the sense-strand transcription of the retrocopies often lead to the origination of in-frame proteins relative to the parental genes. Overall, our data show that LTR-mediated retroposition is highly conserved across a wide range of animal taxa; combined with previous work from plants and yeast, it represents an ancient and ongoing mechanism continuously shaping gene content evolution in eukaryotes.
Collapse
Affiliation(s)
- Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | | | - Wenwen Shi
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Dan Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jiawei Huang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yanan Mao
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Hangxing Jia
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yaqiong Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Chunyan Chen
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yi Shao
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Liang Leng
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhonghua Liu
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Xun Huang
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois 60637, USA
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
6
|
Abstract
The past decades have revealed an unexpected yet prominent role of so-called 'junk DNA' in the regulation of gene expression, thereby challenging our view of the mechanisms underlying phenotypic evolution. In particular, several mechanisms through which transposable elements (TEs) participate in functional genome diversity have been depicted, bringing to light the 'TEs bright side'. However, the relative contribution of those mechanisms and, more generally, the importance of TE-based polymorphisms on past and present phenotypic variation in crops species remain poorly understood. Here, we review current knowledge on both issues, and discuss how analyses of massively parallel sequencing data combined with statistical methodologies and functional validations will help unravelling the impact of TEs on crop evolution in a near future.
Collapse
|
7
|
Steinbauerová V, Neumann P, Novák P, Macas J. A widespread occurrence of extra open reading frames in plant Ty3/gypsy retrotransposons. Genetica 2012; 139:1543-55. [PMID: 22544262 DOI: 10.1007/s10709-012-9654-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2012] [Accepted: 04/16/2012] [Indexed: 01/21/2023]
Abstract
Long terminal repeat (LTR) retrotransposons make up substantial parts of most higher plant genomes where they accumulate due to their replicative mode of transposition. Although the transposition is facilitated by proteins encoded within the gag-pol region which is common to all autonomous elements, some LTR retrotransposons were found to potentially carry an additional protein coding capacity represented by extra open reading frames located upstream or downstream of gag-pol. In this study, we performed a comprehensive in silico survey and comparative analysis of these extra open reading frames (ORFs) in the group of Ty3/gypsy LTR retrotransposons as the first step towards our understanding of their origin and function. We found that extra ORFs occur in all three major lineages of plant Ty3/gypsy elements, being the most frequent in the Tat lineage where most (77 %) of identified elements contained extra ORFs. This lineage was also characterized by the highest diversity of extra ORF arrangement (position and orientation) within the elements. On the other hand, all of these ORFs could be classified into only two broad groups based on their mutual similarities or the presence of short conserved motifs in their inferred protein sequences. In the Athila lineage, the extra ORFs were confined to the element 3' regions but they displayed much higher sequence diversity compared to those found in Tat. In the lineage of Chromoviruses the extra ORFs were relatively rare, occurring only in 5' regions of a group of elements present in a single plant family (Poaceae). In all three lineages, most extra ORFs lacked sequence similarities to characterized gene sequences or functional protein domains, except for two Athila-like elements with similarities to LOGL4 gene and part of the Chromoviruses extra ORFs that displayed partial similarity to histone H3 gene. Thus, in these cases the extra ORFs most likely originated by transduction or recombination of cellular gene sequences. In addition, the protein domain which is otherwise associated with DNA transposons have been detected in part of the Tat-like extra ORFs, pointing to their origin from an insertion event of a mobile element.
Collapse
Affiliation(s)
- Veronika Steinbauerová
- Institute of Plant Molecular Biology, Biology Centre ASCR, Branišovská 31, Ceske Budejovice, Czech Republic
| | | | | | | |
Collapse
|
8
|
Abrarova N, Simonova L, Vinogradova T, Sverdlov E. Different transcription activity of HERV-K LTR-containing and LTR-lacking genes of the KIAA1245/NBPF gene subfamily. Genetica 2011; 139:733-41. [PMID: 21544646 DOI: 10.1007/s10709-011-9577-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2010] [Accepted: 04/15/2011] [Indexed: 10/18/2022]
Abstract
Long terminal repeats (LTRs) of human endogenous retroviruses (HERVs) located near or within genes might affect their expression. We used the KIAA1245/NBPF human gene subfamily in an attempt to assess the regulatory potential of HERV LTRs. The subfamily includes five closely related paralogous genes: three of them contain an LTR in the second intron, and two genes lack it. Earlier we reported that the second and third exons of only LTR-containing genes of this subfamily could be detected in mature mRNAs of various cell lines and human tissues. The corresponding parts of mRNA of LTR-lacking genes analyzed in our study were absent from EST libraries, but other fragments of their mRNAs were available in EST databases. For a more unbiased view on the correlation between gene transcription and the intronic LTRs, in the present work we analyzed non-spliced pre-mRNA thus avoiding splicing effects. Based on RT-PCR analysis, we demonstrated that the KIAA1245/NBPF LTR-lacking gene AL592309/NBPF3 was transcriptionally active, but the LTR-containing genes showed significantly higher transcription levels. The data are in agreement with the suggestion that HERV-K LTRs within the second intron of the KIAA1245/NBPF subfamily genes might affect their transcriptional activity. However, it still remains to be investigated whether the revealed effect is due just to the LTR insertion or other factors are responsible for the difference.
Collapse
|
9
|
Dhillon B, Cavaletto JR, Wood KV, Goodwin SB. Accidental amplification and inactivation of a methyltransferase gene eliminates cytosine methylation in Mycosphaerella graminicola. Genetics 2010; 186:67-77. [PMID: 20610411 PMCID: PMC2940312 DOI: 10.1534/genetics.110.117408] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A de novo search for repetitive elements in the genome sequence of the wheat pathogen Mycosphaerella graminicola identified a family of repeats containing a DNA cytosine methyltransferase sequence (MgDNMT). All 23 MgDNMT sequences identified carried signatures of repeat induced point mutation (RIP). All copies were subtelomeric in location except for one on chromosome 6. Synteny with M. fijiensis implied that the nontelomeric copy on chromosome 6 served as a template for subsequent amplifications. Southern analysis revealed that the MgDNMT sequence also was amplified in 15 additional M. graminicola isolates from various geographical regions. However, this amplification event was specific to M. graminicola; a search for MgDNMT homologs identified only a single, unmutated copy in the genomes of 11 other ascomycetes. A genome-wide methylation assay revealed that M. graminicola lacks cytosine methylation, as expected if its MgDNMT gene is inactivated. Methylation was present in several other species tested, including the closest known relatives of M. graminicola, species S1 and S2. Therefore, the observed changes most likely occurred within the past 10,500 years since the divergence between M. graminicola and S1. Our data indicate that the recent amplification of a single-copy MgDNMT gene made it susceptible to RIP, resulting in complete loss of cytosine methylation in M. graminicola.
Collapse
Affiliation(s)
- Braham Dhillon
- Department of Botany and Plant Pathology, Purdue University, West Lafayette, Indiana 47907, USA
| | | | | | | |
Collapse
|
10
|
Elrouby N, Bureau TE. Bs1, a new chimeric gene formed by retrotransposon-mediated exon shuffling in maize. Plant Physiol 2010; 153:1413-24. [PMID: 20488894 PMCID: PMC2899935 DOI: 10.1104/pp.110.157420] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2010] [Accepted: 05/16/2010] [Indexed: 05/19/2023]
Abstract
Transposons are major components of all eukaryotic genomes. Although traditionally regarded as causes of detrimental mutations, recent evidence suggests that transposons may play a role in host gene diversification and evolution. For example, host gene transduction by retroelements has been suggested to be both common and to have the potential to create new chimeric genes by the shuffling of existing sequences. We have previously shown that the maize (Zea mays subsp. mays) retrotransposon Bs1 has transduced sequences from three different host genes. Here, we provide evidence that these transduction events led to the generation of a chimeric new gene that is both transcribed and translated. Expression of Bs1 is tightly controlled and occurs during a narrow developmental window in early ear development. Although all Bs1-associated transduction events took place before Zea speciation, a full uninterrupted open reading frame encoding the BS1 protein may have arisen in domesticated maize or in the diverse populations of its progenitor Z. mays subsp. parviglumis. We discuss potential functions based on domain conservation and evidence for functional constraints between the transduced sequences and their host gene counterparts.
Collapse
Affiliation(s)
- Nabil Elrouby
- Department of Biology, McGill University, Montreal, Quebec, Canada.
| | | |
Collapse
|
11
|
Peng Z, Lu T, Li L, Liu X, Gao Z, Hu T, Yang X, Feng Q, Guan J, Weng Q, Fan D, Zhu C, Lu Y, Han B, Jiang Z. Genome-wide characterization of the biggest grass, bamboo, based on 10,608 putative full-length cDNA sequences. BMC Plant Biol 2010; 10:116. [PMID: 20565830 PMCID: PMC3017805 DOI: 10.1186/1471-2229-10-116] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Accepted: 06/18/2010] [Indexed: 05/04/2023]
Abstract
BACKGROUND With the availability of rice and sorghum genome sequences and ongoing efforts to sequence genomes of other cereal and energy crops, the grass family (Poaceae) has become a model system for comparative genomics and for better understanding gene and genome evolution that underlies phenotypic and ecological divergence of plants. While the genomic resources have accumulated rapidly for almost all major lineages of grasses, bamboo remains the only large subfamily of Poaceae with little genomic information available in databases, which seriously hampers our ability to take a full advantage of the wealth of grass genomic data for effective comparative studies. RESULTS Here we report the cloning and sequencing of 10,608 putative full length cDNAs (FL-cDNAs) primarily from Moso bamboo, Phyllostachys heterocycla cv. pubescens, a large woody bamboo with the highest ecological and economic values of all bamboos. This represents the third largest FL-cDNA collection to date of all plant species, and provides the first insight into the gene and genome structures of bamboos. We developed a Moso bamboo genomic resource database that so far contained the sequences of 10,608 putative FL-cDNAs and nearly 38,000 expressed sequence tags (ESTs) generated in this study. CONCLUSION Analysis of FL-cDNA sequences show that bamboo diverged from its close relatives such as rice, wheat, and barley through an adaptive radiation. A comparative analysis of the lignin biosynthesis pathway between bamboo and rice suggested that genes encoding caffeoyl-CoA O-methyltransferase may serve as targets for genetic manipulation of lignin content to reduce pollutants generated from bamboo pulping.
Collapse
Affiliation(s)
- Zhenhua Peng
- Chinese Academy of Forestry, Wanshou Shan, Beijing 100091, PR China
- International Network for Bamboo and Rattan, 8 Fu Tong Dong Da Jie, Chaoyang District, Beijing 100102, PR China
| | - Tingting Lu
- National Center for Gene Research & Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, PR China
| | - Lubin Li
- Chinese Academy of Forestry, Wanshou Shan, Beijing 100091, PR China
| | - Xiaohui Liu
- National Center for Gene Research & Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, PR China
| | - Zhimin Gao
- International Network for Bamboo and Rattan, 8 Fu Tong Dong Da Jie, Chaoyang District, Beijing 100102, PR China
| | - Tao Hu
- Chinese Academy of Forestry, Wanshou Shan, Beijing 100091, PR China
| | - Xuewen Yang
- International Network for Bamboo and Rattan, 8 Fu Tong Dong Da Jie, Chaoyang District, Beijing 100102, PR China
| | - Qi Feng
- National Center for Gene Research & Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, PR China
| | - Jianping Guan
- National Center for Gene Research & Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, PR China
| | - Qijun Weng
- National Center for Gene Research & Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, PR China
| | - Danlin Fan
- National Center for Gene Research & Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, PR China
| | - Chuanrang Zhu
- National Center for Gene Research & Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, PR China
| | - Ying Lu
- National Center for Gene Research & Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, PR China
| | - Bin Han
- National Center for Gene Research & Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, PR China
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, PR China
| | - Zehui Jiang
- Chinese Academy of Forestry, Wanshou Shan, Beijing 100091, PR China
- International Network for Bamboo and Rattan, 8 Fu Tong Dong Da Jie, Chaoyang District, Beijing 100102, PR China
| |
Collapse
|
12
|
Abstract
Determining the evolutionary forces responsible for the maintenance of gene duplicates is key to understanding the processes leading to evolutionary adaptation and novelty. In his highly prescient book, Susumu Ohno recognized that duplicate genes are fixed and maintained within a population with 3 distinct outcomes: neofunctionalization, subfunctionalization, and conservation of function. Subsequent researchers have proposed a multitude of population genetic models that lead to these outcomes, each differing largely in the role played by adaptive natural selection. In this paper, I present a nonmathematical review of these models, their predictions, and the evidence collected in support of each of them. Though the various outcomes of gene duplication are often strictly associated with the presence or absence of adaptive natural selection, I argue that determining the outcome of duplication is orthogonal to determining whether natural selection has acted. Despite an ever-growing field of research into the fate of gene duplicates, there is not yet clear evidence for the preponderance of one outcome over the others, much less evidence for the importance of adaptive or nonadaptive forces in maintaining these duplicates.
Collapse
Affiliation(s)
- Matthew W Hahn
- Department of Biology and School of Informatics, Indiana University, Bloomington, IN 47405, USA.
| |
Collapse
|
13
|
Jameson N, Georgelis N, Fouladbash E, Martens S, Hannah LC, Lal S. Helitron mediated amplification of cytochrome P450 monooxygenase gene in maize. Plant Mol Biol 2008; 67:295-304. [PMID: 18327644 DOI: 10.1007/s11103-008-9318-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2007] [Accepted: 02/22/2008] [Indexed: 05/13/2023]
Abstract
The mass movement of gene sequences by Helitrons has significantly contributed to the lack of gene collinearity reported between different maize inbred lines. However, Helitron captured-genes reported to date represent truncated versions of their progenitor genes. In this report, we provide evidence that maize CYP72A27-Zm gene represents a cytochrome P450 monooxygenase (P450) gene recently captured by a Helitron and transposed into an Opie-2 retroposon. The four exons of the CYP72A27 gene contained within the element contain a putative open reading frame (ORF) for 428 amino acid residues. We provide evidence that Helitron captured CYP72A27-Zm is transcribed. To identify the progenitor gene and the evolutionary time of capture, we searched the plant genome database and discovered other closely related CYP72A27-Zm genes in maize and grasses. Our analysis indicates that CYP72A27-Zm represents an almost complete copy of maize CYP72A26-Zm gene captured by a Helitron about 3.1 million years ago (mya). The Helitron-captured gene then duplicated twice, approximately 1.5-1.6 mya giving rise to CYP72A36-Zm and CYP72A37-Zm. These data provide evidence that Helitrons can capture and mobilize intact genes that are transcribed and potentially encode biologically relevant proteins.
Collapse
Affiliation(s)
- Natalie Jameson
- Department of Biological Sciences, Oakland University, Rochester, MI 48309-4401, USA
| | | | | | | | | | | |
Collapse
|
14
|
Krom N, Recla J, Ramakrishna W. Analysis of genes associated with retrotransposons in the rice genome. Genetica 2007; 134:297-310. [PMID: 18066688 DOI: 10.1007/s10709-007-9237-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2007] [Accepted: 11/28/2007] [Indexed: 12/24/2022]
Abstract
Retrotransposons comprise a significant fraction of the rice genome. Despite their prevalence, the effects of retrotransposon insertions are not well understood, especially with regard to how they affect the expression of genes. In this study, we identified one-sixth of rice genes as being associated with retrotransposons, with insertions either in the gene itself or within its putative promoter region. Among genes with insertions in the promoter region, the likelihood of the gene being expressed was shown to be directly proportional to the distance of the retrotransposon from the translation start site. In addition, retrotransposon insertions in the transcribed region of the gene were found to be positively correlated with the presence of alternative splicing forms. Furthermore, preferential association of retrotransposon insertions with genes in several functional classes was identified. Some of the retrotransposons that are part of full-length cDNA (fl-cDNA) contribute splice sites and give rise to novel exons. Several interesting trends concerning the effects of retrotransposon insertions on gene expression were identified. Taken together, our data suggests that retrotransposon association with genes have a role in gene regulation. The data presented in this study provides a foundation for experimental studies to determine the role of retrotransposons in gene regulation.
Collapse
Affiliation(s)
- Nicholas Krom
- Department of Biological Sciences, Michigan Technological University, Houghton, MI 49931, USA
| | | | | |
Collapse
|
15
|
Illarionova AE, Vinogradova TV, Sverdlov ED. Only those genes of the KIAA1245 gene subfamily that contain HERV(K) LTRs in their introns are transcriptionally active. Virology 2006; 358:39-47. [PMID: 16997346 DOI: 10.1016/j.virol.2006.06.027] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2006] [Revised: 04/13/2006] [Accepted: 06/21/2006] [Indexed: 11/26/2022]
Abstract
Insertion of LTRs into some genome locations might seriously affect regulation of the neighboring genes expression. This hypothesis is widely accepted but, however, not confirmed directly. Earlier, we have identified a family of closely related genes highly similar to the KIAA1245 mRNA counterpart. This family included a subfamily of genes some of which contained and the others lacked an LTR in their structure. We compared transcription of several closely related genes of the subfamily differing in the presence or absence of LTRs. Only LTR-containing genes were transcribed in transformed cell lines, tumorous and embryonic human tissues, whereas LTR-lacking genes remained silent. Since the genes were in the same intracellular microenvironment, we suggested that this effect was most probably due to intrinsic cis-characteristics of integrated LTRs and confirmed this by demonstrating high enhancer activity of KIAA1245 LTRs. The expression of the LTR-containing genes in embryonic tissues might suggest their involvement in evolutionary events during primate speciation.
Collapse
Affiliation(s)
- A E Illarionova
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10 Miklukho-Maklaya, Moscow 117997, Russia
| | | | | |
Collapse
|
16
|
Hoen DR, Park KC, Elrouby N, Yu Z, Mohabir N, Cowan RK, Bureau TE. Transposon-mediated expansion and diversification of a family of ULP-like genes. Mol Biol Evol 2006; 23:1254-68. [PMID: 16581939 DOI: 10.1093/molbev/msk015] [Citation(s) in RCA: 75] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Transposons comprise a major component of eukaryotic genomes, yet it remains controversial whether they are merely genetic parasites or instead significant contributors to organismal function and evolution. In plants, thousands of DNA transposons were recently shown to contain duplicated cellular gene fragments, a process termed transduplication. Although transduplication is a potentially rich source of novel coding sequences, virtually all appear to be pseudogenes in rice. Here we report the results of a genome-wide survey of transduplication in Mutator-like elements (MULEs) in Arabidopsis thaliana, which shows that the phenomenon is generally similar to rice transduplication, with one important exception: KAONASHI (KI). A family of more than 97 potentially functional genes and apparent pseudogenes, evidently derived at least 15 MYA from a cellular small ubiquitin-like modifier-specific protease gene, KI is predominantly located in potentially autonomous non-terminal inverted repeat MULEs and has evolved under purifying selection to maintain a conserved peptidase domain. Similar to the associated transposase gene but unlike cellular genes, KI is targeted by small RNAs and silenced in most tissues but has elevated expression in pollen. In an Arabidopsis double mutant deficient in histone and DNA methylation with elevated KI expression compared to wild type, at least one KI-MULE is mobile. The existence of KI demonstrates that transduplicated genes can retain protein-coding capacity and evolve novel functions. However, in this case, our evidence suggests that the function of KI may be selfish rather than cellular.
Collapse
Affiliation(s)
- Douglas R Hoen
- Department of Biology, McGill University, Montreal, Québec, Canada
| | | | | | | | | | | | | |
Collapse
|
17
|
Abstract
Different maize inbred lines are polymorphic for the presence or absence of genic sequences at various allelic chromosomal locations. In the bz genomic region, located in 9S, sequences homologous to four different genes from rice and Arabidopsis are present in line McC but absent from line B73. It is shown here that this apparent intraspecific violation of genetic colinearity arises from the movement of genes or gene fragments by Helitrons, a recently discovered class of eukaryotic transposons. Two Helitrons, HelA and HelB, account for all of the genic differences distinguishing the two bz locus haplotypes. HelA is 5.9 kb long and contains sequences for three of the four genes found only in the McC bz genomic region. A nearly identical copy of HelA was isolated from a 5S chromosomal location in B73. Both the 9S and 5S sites appear to be polymorphic in maize, suggesting that these Helitrons have been active recently. Helitrons lack the strong predictive terminal features of other transposons, so the definition of their ends is greatly facilitated by the identification of their vacant sites in Helitron-minus lines. The ends of the 2.7-kb HelB Helitron were discerned from a comparison of the McC haplotype sequence with that of yet a third line, Mo17, because the HelB vacant site is deleted in B73. Maize Helitrons resemble rice Pack-MULEs in their ability to capture genes or gene fragments from several loci and move them around the genome, features that confer on them a potential role in gene evolution.
Collapse
Affiliation(s)
- Jinsheng Lai
- The Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, NJ 08855, USA
| | | | | | | |
Collapse
|
18
|
Brunner S, Fengler K, Morgante M, Tingey S, Rafalski A. Evolution of DNA sequence nonhomologies among maize inbreds. Plant Cell 2005; 17:343-60. [PMID: 15659640 PMCID: PMC548811 DOI: 10.1105/tpc.104.025627] [Citation(s) in RCA: 211] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2004] [Accepted: 11/17/2004] [Indexed: 05/18/2023]
Abstract
Allelic chromosomal regions totaling more than 2.8 Mb and located on maize (Zea mays) chromosomes 1L, 2S, 7L, and 9S have been sequenced and compared over distances of 100 to 350 kb between the two maize inbred lines Mo17 and B73. The alleles contain extended regions of nonhomology. On average, more than 50% of the compared sequence is noncolinear, mainly because of the insertion of large numbers of long terminal repeat (LTR)-retrotransposons. Only 27 LTR-retroelements are shared between alleles, whereas 62 are allele specific. The insertion of LTR-retrotransposons into the maize genome is statistically more recent for nonshared than shared ones. Most surprisingly, more than one-third of the genes (27/72) are absent in one of the inbreds at the loci examined. Such nonshared genes usually appear to be truncated and form clusters in which they are oriented in the same direction. However, the nonshared genome segments are gene-poor, relative to regions shared by both inbreds, with up to 12-fold difference in gene density. By contrast, miniature inverted terminal repeats (MITEs) occur at a similar frequency in the shared and nonshared fractions. Many times, MITES are present in an identical position in both LTRs of a retroelement, indicating that their insertion occurred before the replication of the retroelement in question. Maize ESTs and/or maize massively parallel signature sequencing tags were identified for the majority of the nonshared genes or homologs of them. In contrast with shared genes, which are usually conserved in gene order and location relative to rice (Oryza sativa), nonshared genes violate the maize colinearity with rice. Based on this, insertion by a yet unknown mechanism, rather than deletion events, seems to be the origin of the nonshared genes. The intergenic space between conserved genes is enlarged up to sixfold in maize compared with rice. Frequently, retroelement insertions create a different sequence environment adjacent to conserved genes.
Collapse
Affiliation(s)
- Stephan Brunner
- DuPont Crop Genetics Research, Wilmington, Delaware 19880, USA.
| | | | | | | | | |
Collapse
|
19
|
Robinson SJ, Cram DJ, Lewis CT, Parkin IAP. Maximizing the efficacy of SAGE analysis identifies novel transcripts in Arabidopsis. Plant Physiol 2004; 136:3223-33. [PMID: 15489285 PMCID: PMC523381 DOI: 10.1104/pp.104.043406] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2004] [Revised: 07/14/2004] [Accepted: 07/16/2004] [Indexed: 05/19/2023]
Abstract
The efficacy of using Serial Analysis of Gene Expression (SAGE) to analyze the transcriptome of the model dicotyledonous plant Arabidopsis was assessed. We describe an iterative tag-to-gene matching process that exploits the availability of the whole genome sequence of Arabidopsis. The expression patterns of 98% of the annotated Arabidopsis genes could theoretically be evaluated through SAGE and using an iterative matching process 79% could be identified by a tag found at a unique site in the genome. A total of 145,170 reliable experimental tags from two Arabidopsis leaf tissue SAGE libraries were analyzed, of which 29,632 were distinct. The majority (93%) of the 12,988 experimental tags observed greater than once could be matched within the Arabidopsis genome. However, only 78% were matched to a single locus within the genome, reflecting the complexities associated with working in a highly duplicated genome. In addition to a comprehensive assessment of gene expression in Arabidopsis leaf tissue, we describe evidence of transcription from pseudo-genes as well as evidence of alternative mRNA processing and anti-sense transcription. This collection of experimental SAGE tags could be exploited to assist in the on-going annotation of the Arabidopsis genome.
Collapse
Affiliation(s)
- Stephen J Robinson
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, Saskatoon, Saskatchewan, Canada
| | | | | | | |
Collapse
|
20
|
Kuhl JC, Cheung F, Yuan Q, Martin W, Zewdie Y, McCallum J, Catanach A, Rutherford P, Sink KC, Jenderek M, Prince JP, Town CD, Havey MJ. A unique set of 11,008 onion expressed sequence tags reveals expressed sequence and genomic differences between the monocot orders Asparagales and Poales. Plant Cell 2004; 16:114-25. [PMID: 14671025 PMCID: PMC301399 DOI: 10.1105/tpc.017202] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2003] [Accepted: 11/05/2003] [Indexed: 05/18/2023]
Abstract
Enormous genomic resources have been developed for plants in the monocot order Poales; however, it is not clear how representative the Poales are for the monocots as a whole. The Asparagales are a monophyletic order sister to the lineage carrying the Poales and possess economically important plants such as asparagus, garlic, and onion. To assess the genomic differences between the Asparagales and Poales, we generated 11,008 unique ESTs from a normalized cDNA library of onion. Sequence analyses of these ESTs revealed microsatellite markers, single nucleotide polymorphisms, and homologs of transposable elements. Mean nucleotide similarity between rice and the Asparagales was 78% across coding regions. Expressed sequence and genomic comparisons revealed strong differences between the Asparagales and Poales for codon usage and mean GC content, GC distribution, and relative GC content at each codon position, indicating that genomic characteristics are not uniform across the monocots. The Asparagales were more similar to eudicots than to the Poales for these genomic characteristics.
Collapse
Affiliation(s)
- Joseph C Kuhl
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Wicker T, Yahiaoui N, Guyot R, Schlagenhauf E, Liu ZD, Dubcovsky J, Keller B. Rapid genome divergence at orthologous low molecular weight glutenin loci of the A and Am genomes of wheat. Plant Cell 2003; 15:1186-97. [PMID: 12724543 PMCID: PMC153725 DOI: 10.1105/tpc.011023] [Citation(s) in RCA: 144] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2003] [Accepted: 02/20/2003] [Indexed: 05/18/2023]
Abstract
To study genome evolution in wheat, we have sequenced and compared two large physical contigs of 285 and 142 kb covering orthologous low molecular weight (LMW) glutenin loci on chromosome 1AS of a diploid wheat species (Triticum monococcum subsp monococcum) and a tetraploid wheat species (Triticum turgidum subsp durum). Sequence conservation between the two species was restricted to small regions containing the orthologous LMW glutenin genes, whereas >90% of the compared sequences were not conserved. Dramatic sequence rearrangements occurred in the regions rich in repetitive elements. Dating of long terminal repeat retrotransposon insertions revealed different insertion events occurring during the last 5.5 million years in both species. These insertions are partially responsible for the lack of homology between the intergenic regions. In addition, the gene space was conserved only partially, because different predicted genes were identified on both contigs. Duplications and deletions of large fragments that might be attributable to illegitimate recombination also have contributed to the differentiation of this region in both species. The striking differences in the intergenic landscape between the A and A(m) genomes that diverged 1 to 3 million years ago provide evidence for a dynamic and rapid genome evolution in wheat species.
Collapse
Affiliation(s)
- Thomas Wicker
- Institute of Plant Biology, University of Zürich, Switzerland
| | | | | | | | | | | | | |
Collapse
|
22
|
Knappe S, Flügge UI, Fischer K. Analysis of the plastidic phosphate translocator gene family in Arabidopsis and identification of new phosphate translocator-homologous transporters, classified by their putative substrate-binding site. Plant Physiol 2003; 131:1178-90. [PMID: 12644669 PMCID: PMC166879 DOI: 10.1104/pp.016519] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2002] [Revised: 11/12/2002] [Accepted: 11/12/2002] [Indexed: 05/17/2023]
Abstract
Analysis of the Arabidopsis genome revealed the complete set of plastidic phosphate translocator (pPT) genes. The Arabidopsis genome contains 16 pPT genes: single copies of genes coding for the triose phosphate/phosphate translocator and the xylulose phosphate/phosphate translocator, and two genes coding for each the phosphoenolpyruvate/phosphate translocator and the glucose-6-phosphate/phosphate translocator. A relatively high number of truncated phosphoenolpyruvate/phosphate translocator genes (six) and glucose-6-phosphate/phosphate translocator genes (four) could be detected with almost conserved intron/exon structures as compared with the functional genes. In addition, a variety of PT-homologous (PTh) genes could be identified in Arabidopsis and other organisms. They all belong to the drug/metabolite transporter superfamily showing significant similarities to nucleotide sugar transporters (NSTs). The pPT, PTh, and NST proteins all possess six to eight transmembrane helices. According to the analysis of conserved motifs in these proteins, the PTh proteins can be divided into (a) the lysine (Lys)/arginine group comprising only non-plant proteins, (b) the Lys-valine/alanine/glycine group of Arabidopsis proteins, (c) the Lys/asparagine group of Arabidopsis proteins, and (d) the Lys/threonine group of plant and non-plant proteins. None of these proteins have been characterized so far. The analysis of the putative substrate-binding sites of the pPT, PTh, and NST proteins led to the suggestion that all these proteins share common substrate-binding sites on either side of the membrane each of which contain a conserved Lys residue.
Collapse
Affiliation(s)
- Silke Knappe
- Botanisches Institut der Universität zu Köln, Lehrstuhl II, Gyrhofstrasse 15, D-50931 Cologne, Germany
| | | | | |
Collapse
|
23
|
Cristofari G, Bampi C, Wilhelm M, Wilhelm FX, Darlix JL. A 5'-3' long-range interaction in Ty1 RNA controls its reverse transcription and retrotransposition. EMBO J 2002; 21:4368-79. [PMID: 12169639 PMCID: PMC126173 DOI: 10.1093/emboj/cdf436] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
LTR-retrotransposons are abundant components of all eukaryotic genomes and appear to be key players in their evolution. They share with retroviruses a reverse transcription step during their replication cycle. To better understand the replication of retrotransposons as well as their similarities to and differences from retroviruses, we set up an in vitro model system to examine minus-strand cDNA synthesis of the yeast Ty1 LTR-retrotransposon. Results show that the 5' and 3' ends of Ty1 genomic RNA interact through 14 nucleotide 5'-3' complementary sequences (CYC sequences). This 5'-3' base pairing results in an efficient initiation of reverse transcription in vitro. Transposition of a marked Ty1 element and Ty1 cDNA synthesis in yeast rely on the ability of the CYC sequences to base pair. This 5'-3' interaction is also supported by phylogenic analysis of all full-length Ty1 and Ty2 elements present in the Saccharomyces cerevisiae genome. These novel findings lead us to propose that circularization of the Ty1 genomic RNA controls initiation of reverse transcription and may limit reverse transcription of defective retroelements.
Collapse
Affiliation(s)
| | | | - Marcelle Wilhelm
- LaboRetro, INSERM U412, Ecole Normale Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon Cedex 07 and
Institut de Biologie Moléculaire et Cellulaire, 15, rue R. Descartes, 67084 Strasbourg, France Corresponding author e-mail:
| | - François-Xavier Wilhelm
- LaboRetro, INSERM U412, Ecole Normale Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon Cedex 07 and
Institut de Biologie Moléculaire et Cellulaire, 15, rue R. Descartes, 67084 Strasbourg, France Corresponding author e-mail:
| | - Jean-Luc Darlix
- LaboRetro, INSERM U412, Ecole Normale Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon Cedex 07 and
Institut de Biologie Moléculaire et Cellulaire, 15, rue R. Descartes, 67084 Strasbourg, France Corresponding author e-mail:
| |
Collapse
|
24
|
Shim S, Bae N, Han JK. Bone morphogenetic protein-4-induced activation of Xretpos is mediated by Smads and Olf-1/EBF associated zinc finger (OAZ). Nucleic Acids Res 2002; 30:3107-17. [PMID: 12136093 PMCID: PMC135757 DOI: 10.1093/nar/gkf437] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
We have previously isolated Xretpos, a novel family of long terminal repeat (LTR)-retrotransposons in Xenopus laevis, whose transcript is restricted to ventro-posterior-specific regions and induced by bone morphogenetic protein-4 (BMP-4) signaling. To explore the molecular mechanism of the transcriptional regulation, we identified and characterized Xretpos promoter regions consisting of LTRs and a 5'-untranslated region. We demonstrated that this promoter region contains all the necessary regulatory elements for the spatial and temporal expression of XRETPOS: Sequence analysis of the Xretpos promoter revealed multiple Smad-binding elements and Olf-1/EBF-associated zinc finger (OAZ) binding sites similar to BMP-4 response element, which were identified and proved to be required for BMP-4 induction in the Xvent2 promoter. We further demonstrated that Smads and OAZ proteins bind to their response elements in the promoter and these bindings are essential for the BMP-4-induced activation of the Xretpos promoter. Furthermore, we showed that the endogenous expression of Xretpos protein indeed occurred and was temporally regulated and BMP-4-inducible during the early Xenopus development. Finally, overexpression and partial loss-of-function study revealed that Xretpos has a posterio-ventralizing activity. Together, our results place Xretpos downstream of BMP-4 and provide evidence for the conserved mechanism of transcriptional regulation of the BMP-4 target genes.
Collapse
Affiliation(s)
- Sangwoo Shim
- Laboratory of Developmental Biology, Department of Life Science and Division of Molecular and Life Sciences, Pohang University of Science and Technology, San 31 Hyoja-Dong, Pohang, Kyungbuk, 790-784, South Korea
| | | | | |
Collapse
|