1
|
Tang Y, Li X, Hu C, Qiu X, Li J, Li X, Zhu H, Wang J, Sui J, Qiao L. Identification and characterization of transposable element AhMITE1 in the genomes of cultivated and two wild peanuts. BMC Genomics 2022; 23:500. [PMID: 35820800 PMCID: PMC9277781 DOI: 10.1186/s12864-022-08732-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 06/30/2022] [Indexed: 12/04/2022] Open
Abstract
Background The cultivated peanut (Arachis hypogaea L., AABB) is an allotetraploid hybrid between two diploid peanuts, A. duranensis (AA genome) and A. ipaensis (BB genome). Miniature inverted-repeat transposable elements (MITEs), some of which are known as active nonautonomous DNA transposons with high copy numbers, play important roles in genome evolution and diversification. AhMITE1, a member of the MITE family of transposons, but information on the peanut genomes is still limited. Here, we analyzed AhMITE1, AuMITE1 and ApMITE1 in the cultivated (A. hypogaea) and two wild peanut (A. duranensis and A. ipaensis) genomes. Results The cultivated and the two wild peanut genomes harbored 142, 14 and 21 AhMITE1, AuMITE1 and ApMITE1 family members, respectively. These three family members exhibited highly conserved TIR sequences, and insertions preferentially occurred within 2 kb upstream and downstream of gene-coding and AT-rich regions. Phylogenetic and pairwise nucleotide diversity analysis showed that AhMITE1 and ApMITE1 family members have undergone one round of amplification bursts during the evolution of the peanut genome. PCR analyses were performed in 23 peanut varieties and demonstrated that AhMITE1 is an active transposon and that hybridization or chemical mutagenesis can promote the mobilization of AhMITE1. Conclusions AhMITE1, AuMITE1 and ApMITE1 family members were identified based on local BLAST search with MAK between the cultivated and the two wild peanut genomes. The phylogenetic, nucleotide diversity and variation copy numbers of AhMITE1, AuMITE1 and ApMITE1 members provides opportunities for investigating their roles during peanut evolution. These findings will contribute to knowledge on diversity of AhMITE1, provide information about the potential impact on the gene expression and promote the development of DNA markers in peanut. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08732-0.
Collapse
Affiliation(s)
- Yanyan Tang
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China
| | - Xiaoting Li
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China
| | - Changli Hu
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China
| | - Xiaochen Qiu
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China
| | - Jingjing Li
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China
| | - Xin Li
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China
| | - Hong Zhu
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China
| | - Jingshan Wang
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China
| | - Jiongming Sui
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China.
| | - Lixian Qiao
- College of Agronomy, Dry-Land Farming Technology Laboratory of Shandong Province, Key Laboratory of Qingdao Major Crop Germplasm Resource Innovation and Application, Qingdao Agricultural University, Qingdao, 266109, China.
| |
Collapse
|
2
|
Lin L, Sharma A, Yu Q. Recent amplification of microsatellite-associated miniature inverted-repeat transposable elements in the pineapple genome. BMC PLANT BIOLOGY 2021; 21:424. [PMID: 34537020 PMCID: PMC8449440 DOI: 10.1186/s12870-021-03194-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 08/09/2021] [Indexed: 06/13/2023]
Abstract
BACKGROUND Miniature inverted-repeat transposable elements (MITEs) are non-autonomous DNA transposable elements that play important roles in genome organization and evolution. Genome-wide identification and characterization of MITEs provide essential information for understanding genome structure and evolution. RESULTS We performed genome-wide identification and characterization of MITEs in the pineapple genome. The top two MITE families, accounting for 29.39% of the total MITEs and 3.86% of the pineapple genome, have insertion preference in (TA) n dinucleotide microsatellite regions. We therefore named these MITEs A. comosus microsatellite-associated MITEs (Ac-mMITEs). The two Ac-mMITE families, Ac-mMITE-1 and Ac-mMITE-2, shared sequence similarity in the terminal inverted repeat (TIR) regions, suggesting that these two Ac-mMITE families might be derived from a common or closely related autonomous elements. The Ac-mMITEs are frequently clustered via adjacent insertions. Among the 21,994 full-length Ac-mMITEs, 46.1% of them were present in clusters. By analyzing the Ac-mMITEs without (TA) n microsatellite flanking sequences, we found that Ac-mMITEs were likely derived from Mutator-like DNA transposon. Ac-MITEs showed highly polymorphic insertion sites between cultivated pineapples and their wild relatives. To better understand the evolutionary history of Ac-mMITEs, we filtered and performed comparative analysis on the two distinct groups of Ac-mMITEs, microsatellite-targeting MITEs (mt-MITEs) that are flanked by dinucleotide microsatellites on both sides and mutator-like MITEs (ml-MITEs) that contain 9/10 bp TSDs. Epigenetic analysis revealed a lower level of host-induced silencing on the mt-MITEs in comparison to the ml-MITEs, which partially explained the significantly higher abundance of mt-MITEs in pineapple genome. The mt-MITEs and ml-MITEs exhibited differential insertion preference to gene-related regions and RNA-seq analysis revealed their differential influences on expression regulation of nearby genes. CONCLUSIONS Ac-mMITEs are the most abundant MITEs in the pineapple genome and they were likely derived from Mutator-like DNA transposon. Preferential insertion in (TA) n microsatellite regions of Ac-mMITEs occurred recently and is likely the result of damage-limiting strategy adapted by Ac-mMITEs during co-evolution with their host. Insertion in (TA) n microsatellite regions might also have promoted the amplification of mt-MITEs. In addition, mt-MITEs showed no or negligible impact on nearby gene expression, which may help them escape genome control and lead to their amplification.
Collapse
Affiliation(s)
- Lianyu Lin
- Texas A&M AgriLife Research Center at Dallas, Texas A&M University System, Dallas, TX, 75252, USA
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou, 350002, Fujian, China
| | - Anupma Sharma
- Texas A&M AgriLife Research Center at Dallas, Texas A&M University System, Dallas, TX, 75252, USA
| | - Qingyi Yu
- Texas A&M AgriLife Research Center at Dallas, Texas A&M University System, Dallas, TX, 75252, USA.
| |
Collapse
|
3
|
Hou J, Lu D, Mason AS, Li B, An S, Li G, Cai D. Distribution of MITE family Monkey King in rapeseed (Brassica napus L) and its influence on gene expression. Genomics 2021; 113:2934-2943. [PMID: 34182079 DOI: 10.1016/j.ygeno.2021.06.034] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 05/06/2021] [Accepted: 06/24/2021] [Indexed: 10/21/2022]
Abstract
Miniature inverted-repeat transposable elements (MITEs) are a group of class II transposable elements. The MITE Monkey King (MK) was first discovered upstream of BnFLC.A10. In this study, genome resequencing of four selected B. napus accessions, revealed more than 4000 distributed copies of MKs constituting ~2.4 Mb of the B. napus genomic sequence and caused 677 polymorphisms among the four accessions. MK -polymorphism-related markers across 128 natural and 58 synthetic accessions revealed more polymorphic MKs in natural than synthetic accessions. Ten MK -induced indels significantly affected the expression levels of the nearest gene based on RNAseq analysis, six of these effects were subsequently confirmed using qRT-PCR. Decreased expression pattern of MK -derived miRNA-bna-miR6031 was also observed under various stress treatments. Further research focused on the MITE families should promote not only our understanding of gene regulatory networks but also inform crop improvement efforts.
Collapse
Affiliation(s)
- Jinna Hou
- Henan Academy of Agricultural Sciences, Zhengzhou 450002, China.
| | - Dandan Lu
- Henan Academy of Agricultural Sciences, Zhengzhou 450002, China
| | - Annaliese S Mason
- Chair of Plant Breeding, Institute of Crop Science and Resource Conservation, University of Bonn, Bonn, Germany.
| | - Baoquan Li
- Henan Academy of Agricultural Sciences, Zhengzhou 450002, China
| | - Sufang An
- Henan Academy of Agricultural Sciences, Zhengzhou 450002, China
| | - Gaoyuan Li
- Bioinformatic Institute, Huazhong Agricultural University, Wuhan 430071, China.
| | - Dongfang Cai
- Henan Academy of Agricultural Sciences, Zhengzhou 450002, China.
| |
Collapse
|
4
|
Kundu S, Ray MD, Sharma A. Interplay between genome organization and epigenomic alterations of pericentromeric DNA in cancer. J Genet Genomics 2021; 48:184-197. [PMID: 33840602 DOI: 10.1016/j.jgg.2021.02.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2020] [Revised: 02/07/2021] [Accepted: 02/20/2021] [Indexed: 12/16/2022]
Abstract
In eukaryotic genome biology, the genomic organization inside the three-dimensional (3D) nucleus is highly complex, and whether this organization governs gene expression is poorly understood. Nuclear lamina (NL) is a filamentous meshwork of proteins present at the lining of inner nuclear membrane that serves as an anchoring platform for genome organization. Large chromatin domains termed as lamina-associated domains (LADs), play a major role in silencing genes at the nuclear periphery. The interaction of the NL and genome is dynamic and stochastic. Furthermore, many genes change their positions during developmental processes or under disease conditions such as cancer, to activate certain sorts of genes and/or silence others. Pericentromeric heterochromatin (PCH) is mostly in the silenced region within the genome, which localizes at the nuclear periphery. Studies show that several genes located at the PCH are aberrantly expressed in cancer. The interesting question is that despite being localized in the pericentromeric region, how these genes still manage to overcome pericentromeric repression. Although epigenetic mechanisms control the expression of the pericentromeric region, recent studies about genome organization and genome-nuclear lamina interaction have shed light on a new aspect of pericentromeric gene regulation through a complex and coordinated interplay between epigenomic remodeling and genomic organization in cancer.
Collapse
Affiliation(s)
- Subhadip Kundu
- Laboratory of Chromatin and Cancer Epigenetics, Department of Biochemistry, All India Institute of Medical Sciences, Ansari Nagar, New Delhi 110029, India
| | - M D Ray
- Department of Surgical Oncology, IRCH, All India Institute of Medical Sciences, Ansari Nagar, New Delhi 110029, India
| | - Ashok Sharma
- Laboratory of Chromatin and Cancer Epigenetics, Department of Biochemistry, All India Institute of Medical Sciences, Ansari Nagar, New Delhi 110029, India.
| |
Collapse
|
5
|
Bariah I, Keidar-Friedman D, Kashkush K. Identification and characterization of large-scale genomic rearrangements during wheat evolution. PLoS One 2020; 15:e0231323. [PMID: 32287287 PMCID: PMC7156093 DOI: 10.1371/journal.pone.0231323] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Accepted: 03/20/2020] [Indexed: 12/25/2022] Open
Abstract
Following allopolyploidization, nascent polyploid wheat species react with massive genomic rearrangements, including deletion of transposable element-containing sequences. While such massive rearrangements are considered to be a prominent process in wheat genome evolution and speciation, their structure, extent, and underlying mechanisms remain poorly understood. In this study, we retrieved ~3500 insertions of a specific variant of Fatima, one of the most dynamic gypsy long-terminal repeat retrotransposons in wheat from the recently available high-quality genome drafts of Triticum aestivum (bread wheat) and Triticum turgidum ssp. dicoccoides or wild emmer, the allotetraploid mother of all modern wheats. The dynamic nature of Fatima facilitated the identification of large (i.e., up to ~ 1 million bases) Fatima-containing insertions/deletions (indels) upon comparison of bread wheat and wild emmer genomes. We characterized 11 such indels using computer-assisted analysis followed by PCR validation, and found that they might have occurred via unequal intra-strand recombination or double-strand break (DSB) events. Additionally, we observed one case of introgression of novel DNA fragments from an unknown source into the wheat genome. Our data thus indicate that massive large-scale DNA rearrangements might play a prominent role in wheat speciation.
Collapse
Affiliation(s)
- Inbar Bariah
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, Israel
| | | | - Khalil Kashkush
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, Israel
- * E-mail:
| |
Collapse
|
6
|
Domb K, Keidar-Friedman D, Kashkush K. A novel miniature transposon-like element discovered in the coding sequence of a gene that encodes for 5-formyltetrahydrofolate in wheat. BMC PLANT BIOLOGY 2019; 19:461. [PMID: 31675912 PMCID: PMC6824096 DOI: 10.1186/s12870-019-2034-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Accepted: 09/12/2019] [Indexed: 05/02/2023]
Abstract
BACKGROUND Transposable elements (TEs) comprise over 80% of the wheat genome and usually possess unique features for specific super-families and families. However, the role of TEs in wheat evolution and reshaping the wheat genome remains largely unclear. RESULTS In this study, we discovered a miniature (307 bp in length) TE-like sequence in exon 6 of a gene that encodes for 5-formyltetrahydrofolate, in two accessions of wild emmer wheat (T. turgidum ssp. dicoccoides) and has interfered with the gene translation by creating a shorter reading frame as a result of a stop codon. The sequence that was termed Mariam, does not show any structural similarity to known TEs. It does not possess terminal inverted repeats (TIRs) that would allow us to assign this element to one of the TIR DNA super-families, and it does not possess characteristic features of SINE, such as a Pol-III promotor or a poly-A tail. In-silico analysis of five publicly available genome drafts of Triticum and Aegilops species revealed that Mariam element appears in a very low copy number (1-3 insertions) in diploid wheat species and ~ 12 insertions in tetraploid and hexaploidy wheat species. In addition, Mariam element was found to be unique to wheat, as it was not found in other plant genomes. The dynamic nature of Mariam in the wheat genome was assessed by site-specific PCR analysis and revealed that it retained activity in wild emmer populations in a population-specific manner. CONCLUSIONS This study provides additional insight into the evolutionary impact of TEs in wheat.
Collapse
Affiliation(s)
- Katherine Domb
- Present Address: Department of Molecular Biology and Ecology of Plants, Tel-Aviv University, Tel Aviv, Israel
| | | | - Khalil Kashkush
- Department of Life Sciences, Ben-Gurion University, 84105, Beer-Sheva, Israel.
| |
Collapse
|
7
|
Tang Y, Ma X, Zhao S, Xue W, Zheng X, Sun H, Gu P, Zhu Z, Sun C, Liu F, Tan L. Identification of an active miniature inverted-repeat transposable element mJing in rice. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2019; 98:639-653. [PMID: 30689248 PMCID: PMC6850418 DOI: 10.1111/tpj.14260] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2018] [Revised: 01/01/2019] [Accepted: 01/18/2019] [Indexed: 05/27/2023]
Abstract
Miniature inverted-repeat transposable elements (MITEs) are structurally homogeneous non-autonomous DNA transposons with high copy numbers that play important roles in genome evolution and diversification. Here, we analyzed the rice high-tillering dwarf (htd) mutant in an advanced backcross population between cultivated and wild rice, and identified an active MITE named miniature Jing (mJing). The mJing element belongs to the PIF/Harbinger superfamily. japonica rice var. Nipponbare and indica var. 93-11 harbor 72 and 79 mJing family members, respectively, have undergone multiple rounds of amplification bursts during the evolution of Asian cultivated rice (Oryza sativa L.). A heterologous transposition experiment in Arabidopsis thaliana indicated that the autonomous element Jing is likely to have provides the transposase needed for mJing mobilization. We identified 297 mJing insertion sites and their presence/absence polymorphism among 71 rice samples through targeted high-throughput sequencing. The results showed that the copy number of mJing varies dramatically among Asian cultivated rice (O. sativa), its wild ancestor (O. rufipogon), and African cultivated rice (O. glaberrima) and that some mJing insertions are subject to directional selection. These findings suggest that the amplification and removal of mJing elements have played an important role in rice genome evolution and species diversification.
Collapse
Affiliation(s)
- Yanyan Tang
- State Key Laboratory of Plant Physiology and BiochemistryChina Agricultural UniversityBeijing100193China
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Xin Ma
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Shuangshuang Zhao
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Wei Xue
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Xu Zheng
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Hongying Sun
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Ping Gu
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Zuofeng Zhu
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Chuanqing Sun
- State Key Laboratory of Plant Physiology and BiochemistryChina Agricultural UniversityBeijing100193China
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Fengxia Liu
- State Key Laboratory of Plant Physiology and BiochemistryChina Agricultural UniversityBeijing100193China
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| | - Lubin Tan
- National Center for Evaluation of Agricultural Wild Plants (Rice)MOE Laboratory of Crop Heterosis and UtilizationDepartment of Plant Genetics and BreedingChina Agricultural UniversityBeijing100193China
| |
Collapse
|
8
|
Keidar-Friedman D, Bariah I, Kashkush K. Genome-wide analyses of miniature inverted-repeat transposable elements reveals new insights into the evolution of the Triticum-Aegilops group. PLoS One 2018; 13:e0204972. [PMID: 30356268 PMCID: PMC6200218 DOI: 10.1371/journal.pone.0204972] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2018] [Accepted: 09/08/2018] [Indexed: 11/19/2022] Open
Abstract
The sequence drafts of wild emmer and bread wheat facilitated high resolution, genome-wide analysis of transposable elements (TEs), which account for up to 90% of the wheat genome. Despite extensive studies, the role of TEs in reshaping nascent polyploid genomes remains to be fully understood. In this study, we retrieved miniature inverted-repeat transposable elements (MITEs) from the recently published genome drafts of Triticum aestivum, Triticum turgidum ssp. dicoccoides, Aegilops tauschii and the available genome draft of Triticum urartu. Overall, 239,126 MITE insertions were retrieved, including 3,874 insertions of a newly identified, wheat-unique MITE family that we named "Inbar". The Stowaway superfamily accounts for ~80% of the retrieved MITE insertions, while Thalos is the most abundant family. MITE insertions are distributed in the seven homologous chromosomes of the wild emmer and bread wheat genomes. The remarkably high level of insertions in the B sub-genome (~59% of total retrieved MITE insertions in the wild emmer genome draft, and ~41% in the bread wheat genome draft), emphasize its highly repetitive nature. Nearly 52% of all MITE insertions were found within or close (less than 100bp) to coding genes, and ~400 MITE sequences were found in the bread wheat transcriptome, indicating that MITEs might have a strong impact on wheat genome expression. In addition, ~40% of MITE insertions were found within TE sequences, and remarkably, ~90% of Inbar insertions were located in retrotransposon sequences. Our data thus shed new light on the role of MITEs in the diversification of allopolyploid wheat species.
Collapse
Affiliation(s)
| | - Inbar Bariah
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, Israel
| | - Khalil Kashkush
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, Israel
- * E-mail:
| |
Collapse
|
9
|
Keidar D, Doron C, Kashkush K. Genome-wide analysis of a recently active retrotransposon, Au SINE, in wheat: content, distribution within subgenomes and chromosomes, and gene associations. PLANT CELL REPORTS 2018; 37:193-208. [PMID: 29164313 PMCID: PMC5787218 DOI: 10.1007/s00299-017-2213-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2017] [Accepted: 10/05/2017] [Indexed: 05/02/2023]
Abstract
Here, we show that Au SINE elements have strong associations with protein-coding genes in wheat. Most importantly Au SINE insertion within introns causes allelic variation and might induce intron retention. The impact of transposable elements (TEs) on genome structure and function is intensively studied in eukaryotes, especially in plants where TEs can reach up to 90% of the genome in some cases, such as in wheat. Here, we have performed a genome-wide in-silico analysis using the updated publicly available genome draft of bread wheat (T. aestivum), in addition to the updated genome drafts of the diploid donor species, T. urartu and Ae. tauschii, to retrieve and analyze a non-LTR retrotransposon family, termed Au SINE, which was found to be widespread in plant species. Then, we have performed site-specific PCR and realtime RT-PCR analyses to assess the possible impact of Au SINE on gene structure and function. To this end, we retrieved 133, 180 and 1886 intact Au SINE insertions from T. urartu, Ae. tauschii and T. aestivum genome drafts, respectively. The 1886 Au SINE insertions were distributed in the seven homoeologous chromosomes of T. aestivum, while ~ 67% of the insertions were associated with genes. Detailed analysis of 40 genes harboring Au SINE revealed allelic variation of those genes in the Triticum-Aegilops genus. In addition, expression analysis revealed that both regular transcripts and alternative Au SINE-containing transcripts were simultaneously amplified in the same tissue, indicating retention of Au SINE-containing introns. Analysis of the wheat transcriptome revealed that hundreds of protein-coding genes harbor Au SINE in at least one of their mature splice variants. Au SINE might play a prominent role in speciation by creating transcriptome variation.
Collapse
Affiliation(s)
- Danielle Keidar
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, 84105, Israel
| | - Chen Doron
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, 84105, Israel
| | - Khalil Kashkush
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, 84105, Israel.
| |
Collapse
|
10
|
Domb K, Keidar D, Yaakov B, Khasdan V, Kashkush K. Transposable elements generate population-specific insertional patterns and allelic variation in genes of wild emmer wheat (Triticum turgidum ssp. dicoccoides). BMC PLANT BIOLOGY 2017; 17:175. [PMID: 29078757 PMCID: PMC5659041 DOI: 10.1186/s12870-017-1134-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2016] [Accepted: 10/17/2017] [Indexed: 05/02/2023]
Abstract
BACKGROUND Natural populations of the tetraploid wild emmer wheat (genome AABB) were previously shown to demonstrate eco-geographically structured genetic and epigenetic diversity. Transposable elements (TEs) might make up a significant part of the genetic and epigenetic variation between individuals and populations because they comprise over 80% of the wild emmer wheat genome. In this study, we performed detailed analyses to assess the dynamics of transposable elements in 50 accessions of wild emmer wheat collected from 5 geographically isolated sites. The analyses included: the copy number variation of TEs among accessions in the five populations, population-unique insertional patterns, and the impact of population-unique/specific TE insertions on structure and expression of genes. RESULTS We assessed the copy numbers of 12 TE families using real-time quantitative PCR, and found significant copy number variation (CNV) in the 50 wild emmer wheat accessions, in a population-specific manner. In some cases, the CNV difference reached up to 6-fold. However, the CNV was TE-specific, namely some TE families showed higher copy numbers in one or more populations, and other TE families showed lower copy numbers in the same population(s). Furthermore, we assessed the insertional patterns of 6 TE families using transposon display (TD), and observed significant population-specific insertional patterns. The polymorphism levels of TE-insertional patterns reached 92% among all wild emmer wheat accessions, in some cases. In addition, we observed population-specific/unique TE insertions, some of which were located within or close to protein-coding genes, creating allelic variations in a population-specific manner. We also showed that those genes are differentially expressed in wild emmer wheat. CONCLUSIONS For the first time, this study shows that TEs proliferate in wild emmer wheat in a population-specific manner, creating new alleles of genes, which contribute to the divergent evolution of homeologous genes from the A and B subgenomes.
Collapse
Affiliation(s)
- Katherine Domb
- Department of Life Sciences, Ben-Gurion University, 84105 Beer-Sheva, Israel
| | - Danielle Keidar
- Department of Life Sciences, Ben-Gurion University, 84105 Beer-Sheva, Israel
| | - Beery Yaakov
- Department of Life Sciences, Ben-Gurion University, 84105 Beer-Sheva, Israel
| | - Vadim Khasdan
- Department of Life Sciences, Ben-Gurion University, 84105 Beer-Sheva, Israel
| | - Khalil Kashkush
- Department of Life Sciences, Ben-Gurion University, 84105 Beer-Sheva, Israel
| |
Collapse
|
11
|
Ge R, Mai G, Zhang R, Wu X, Wu Q, Zhou F. MUSTv2: An Improved De Novo Detection Program for Recently Active Miniature Inverted Repeat Transposable Elements (MITEs). J Integr Bioinform 2017; 14:/j/jib.ahead-of-print/jib-2017-0029/jib-2017-0029.xml. [PMID: 28796642 PMCID: PMC6042816 DOI: 10.1515/jib-2017-0029] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2017] [Accepted: 05/08/2017] [Indexed: 11/15/2022] Open
Abstract
Background Miniature inverted repeat transposable element (MITE) is a short transposable element, carrying no protein-coding regions. However, its high proliferation rate and sequence-specific insertion preference renders it as a good genetic tool for both natural evolution and experimental insertion mutagenesis. Recently active MITE copies are those with clear signals of Terminal Inverted Repeats (TIRs) and Direct Repeats (DRs), and are recently translocated into their current sites. Their proliferation ability renders them good candidates for the investigation of genomic evolution. Results This study optimizes the C++ code and running pipeline of the MITE Uncovering SysTem (MUST) by assuming no prior knowledge of MITEs required from the users, and the current version, MUSTv2, shows significantly increased detection accuracy for recently active MITEs, compared with similar programs. The running speed is also significantly increased compared with MUSTv1. We prepared a benchmark dataset, the simulated genome with 150 MITE copies for researchers who may be of interest. Conclusions MUSTv2 represents an accurate detection program of recently active MITE copies, which is complementary to the existing template-based MITE mapping programs. We believe that the release of MUSTv2 will greatly facilitate the genome annotation and structural analysis of the bioOMIC big data researchers.
Collapse
|
12
|
Genome-wide analysis of transposable elements in the coffee berry borer Hypothenemus hampei (Coleoptera: Curculionidae): description of novel families. Mol Genet Genomics 2017; 292:565-583. [PMID: 28204924 DOI: 10.1007/s00438-017-1291-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2016] [Accepted: 01/12/2017] [Indexed: 10/20/2022]
Abstract
The coffee berry borer (CBB) Hypothenemus hampei is the most limiting pest of coffee production worldwide. The CBB genome has been recently sequenced; however, information regarding the presence and characteristics of transposable elements (TEs) was not provided. Using systematic searching strategies based on both de novo and homology-based approaches, we present a library of TEs from the draft genome of CBB sequenced by the Colombian Coffee Growers Federation. The library consists of 880 sequences classified as 66% Class I (LTRs: 46%, non-LTRs: 20%) and 34% Class II (DNA transposons: 8%, Helitrons: 16% and MITEs: 10%) elements, including families of the three main LTR (Gypsy, Bel-Pao and Copia) and non-LTR (CR1, Daphne, I/Nimb, Jockey, Kiri, R1, R2 and R4) clades and DNA superfamilies (Tc1-mariner, hAT, Merlin, P, PIF-Harbinger, PiggyBac and Helitron). We propose the existence of novel families: Hypo, belonging to the LTR Gypsy superfamily; Hamp, belonging to non-LTRs; and rosa, belonging to Class II or DNA transposons. Although the rosa clade has been previously described, it was considered to be a basal subfamily of the mariner family. Based on our phylogenetic analysis, including Tc1, mariner, pogo, rosa and Lsra elements from other insects, we propose that rosa and Lsra elements are subfamilies of an independent family of Class II elements termed rosa. The annotations obtained indicate that a low percentage of the assembled CBB genome (approximately 8.2%) consists of TEs. Although these TEs display high diversity, most sequences are degenerate, with few full-length copies of LTR and DNA transposons and several complete and putatively active copies of non-LTR elements. MITEs constitute approximately 50% of the total TEs content, with a high proportion associated with DNA transposons in the Tc1-mariner superfamily.
Collapse
|
13
|
Han MJ, Zhou QZ, Zhang HH, Tong X, Lu C, Zhang Z, Dai F. iMITEdb: the genome-wide landscape of miniature inverted-repeat transposable elements in insects. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw148. [PMID: 28025339 PMCID: PMC5199201 DOI: 10.1093/database/baw148] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/13/2016] [Revised: 09/19/2016] [Accepted: 10/18/2016] [Indexed: 01/23/2023]
Abstract
Miniature inverted-repeat transposable elements (MITEs) have attracted much attention due to their widespread occurrence and high copy numbers in eukaryotic genomes. However, the systematic knowledge about MITEs in insects and other animals is still lacking. In this study, we identified 6012 MITE families from 98 insect species genomes. Comparison of these MITEs with known MITEs in the NCBI non-redundant database and Repbase showed that 5701(∼95%) of 6012 MITE families are novel. The abundance of MITEs varies drastically among different insect species, and significantly correlates with genome size. In general, larger genomes contain more MITEs than small genomes. Furthermore, all identified MITEs were included in a newly constructed database (iMITEdb) (http://gene.cqu.edu.cn/iMITEdb/), which has functions such as browse, search, BLAST and download. Overall, our results not only provide insight on insect MITEs but will also improve assembly and annotation of insect genomes. More importantly, the results presented in this study will promote studies of MITEs function, evolution and application in insects. Database URL: http://gene.cqu.edu.cn/iMITEdb/
Collapse
Affiliation(s)
- Min-Jin Han
- State Key Laboratory of Silkworm Genome Biology, Key Laboratory for Sericulture Functional Genomics and Biotechnology of Agricultural Ministry, Southwest University, Chongqing 400715, China
| | - Qiu-Zhong Zhou
- Laboratory of Evolutionary and Functional Genomics, School of Life Sciences, Chongqing University, Chongqing 401331, China
| | - Hua-Hao Zhang
- College of Pharmacy and Life Science, Jiujiang University, Jiujiang 332000, China
| | - Xiaoling Tong
- State Key Laboratory of Silkworm Genome Biology, Key Laboratory for Sericulture Functional Genomics and Biotechnology of Agricultural Ministry, Southwest University, Chongqing 400715, China
| | - Cheng Lu
- State Key Laboratory of Silkworm Genome Biology, Key Laboratory for Sericulture Functional Genomics and Biotechnology of Agricultural Ministry, Southwest University, Chongqing 400715, China
| | - Ze Zhang
- Laboratory of Evolutionary and Functional Genomics, School of Life Sciences, Chongqing University, Chongqing 401331, China
| | - Fangyin Dai
- State Key Laboratory of Silkworm Genome Biology, Key Laboratory for Sericulture Functional Genomics and Biotechnology of Agricultural Ministry, Southwest University, Chongqing 400715, China
| |
Collapse
|
14
|
Evolutionary genomics of miniature inverted-repeat transposable elements (MITEs) in Brassica. Mol Genet Genomics 2015; 290:2297-312. [DOI: 10.1007/s00438-015-1076-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2014] [Accepted: 05/29/2015] [Indexed: 11/26/2022]
|
15
|
Liu Y, Yang G. Tc1-like transposable elements in plant genomes. Mob DNA 2014; 5:17. [PMID: 24926322 PMCID: PMC4054914 DOI: 10.1186/1759-8753-5-17] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Accepted: 05/12/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The Tc1/mariner superfamily of transposable elements (TEs) is widespread in animal genomes. Mariner-like elements, which bear a DDD triad catalytic motif, have been identified in a wide range of flowering plant species. However, as the founding member of the superfamily, Tc1-like elements that bear a DD34E triad catalytic motif are only known to unikonts (animals, fungi, and Entamoeba). RESULTS Here we report the identification of Tc1-like elements (TLEs) in plant genomes. These elements bear the four terminal nucleotides and the characteristic DD34E triad motif of Tc1 element. The two TLE families (PpTc1, PpTc2) identified in the moss (Physcomitrella patens) genome contain highly similar copies. Multiple copies of PpTc1 are actively transcribed and the transcripts encode intact full length transposase coding sequences. TLEs are also found in angiosperm genome sequence databases of rice (Oryza sativa), dwarf birch (Betula nana), cabbage (Brassica rapa), hemp (Cannabis sativa), barley (Hordium valgare), lettuce (Lactuta sativa), poplar (Populus trichocarpa), pear (Pyrus x bretschneideri), and wheat (Triticum urartu). CONCLUSIONS This study extends the occurrence of TLEs to the plant phylum. The elements in the moss genome have amplified recently and may still be capable of transposition. The TLEs are also present in angiosperm genomes, but apparently much less abundant than in moss.
Collapse
Affiliation(s)
- Yuan Liu
- Department of Biology, University of Toronto at Mississauga, 3359 Mississauga Road, L5L 1C6 Mississauga, ON, Canada ; Cell and Systems Biology, University of Toronto, Toronto, Canada
| | - Guojun Yang
- Department of Biology, University of Toronto at Mississauga, 3359 Mississauga Road, L5L 1C6 Mississauga, ON, Canada ; Cell and Systems Biology, University of Toronto, Toronto, Canada
| |
Collapse
|
16
|
Yang G, Fattash I, Lee CN, Liu K, Cavinder B. Birth of three stowaway-like MITE families via microhomology-mediated miniaturization of a Tc1/Mariner element in the yellow fever mosquito. Genome Biol Evol 2014; 5:1937-48. [PMID: 24068652 PMCID: PMC3814204 DOI: 10.1093/gbe/evt146] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Eukaryotic genomes contain numerous DNA transposons that move by a cut-and-paste mechanism. The majority of these elements are self-insufficient and dependent on their autonomous relatives to transpose. Miniature inverted repeat transposable elements (MITEs) are often the most numerous nonautonomous DNA elements in a higher eukaryotic genome. Little is known about the origin of these MITE families as few of them are accompanied by their direct ancestral elements in a genome. Analyses of MITEs in the yellow fever mosquito identified its youngest MITE family, designated as Gnome, that contains at least 116 identical copies. Genome-wide search for direct ancestral autonomous elements of Gnome revealed an elusive single copy Tc1/Mariner-like element, named as Ozma, that encodes a transposase with a DD37E triad motif. Strikingly, Ozma also gave rise to two additional MITE families, designated as Elf and Goblin. These three MITE families were derived at different times during evolution and bear internal sequences originated from different regions of Ozma. Upon close inspection of the sequence junctions, the internal deletions during the formation of these three MITE families always occurred between two microhomologous sites (6–8 bp). These results suggest that multiple MITE families may originate from a single ancestral autonomous element, and formation of MITEs can be mediated by sequence microhomology. Ozma and its related MITEs are exceptional candidates for the long sought-after endogenous active transposon tool in genetic control of mosquitoes.
Collapse
Affiliation(s)
- Guojun Yang
- Department of Biology, University of Toronto Mississauga, Ontario, Canada
- *Corresponding author: E-mail:
| | - Isam Fattash
- Department of Biology, University of Toronto Mississauga, Ontario, Canada
| | - Chia-Ni Lee
- Department of Biology, University of Toronto Mississauga, Ontario, Canada
| | - Kun Liu
- Department of Botany and Plant Sciences, University of California Riverside
| | - Brad Cavinder
- Department of Plant Pathology and Microbiology, University of California Riverside
| |
Collapse
|
17
|
Genome-wide comparative analysis of 20 miniature inverted-repeat transposable element families in Brassica rapa and B. oleracea. PLoS One 2014; 9:e94499. [PMID: 24747717 PMCID: PMC3991616 DOI: 10.1371/journal.pone.0094499] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2014] [Accepted: 03/17/2014] [Indexed: 12/25/2022] Open
Abstract
Miniature inverted-repeat transposable elements (MITEs) are ubiquitous, non-autonomous class II transposable elements. Here, we conducted genome-wide comparative analysis of 20 MITE families in B. rapa, B. oleracea, and Arabidopsis thaliana. A total of 5894 and 6026 MITE members belonging to the 20 families were found in the whole genome pseudo-chromosome sequences of B. rapa and B. oleracea, respectively. Meanwhile, only four of the 20 families, comprising 573 members, were identified in the Arabidopsis genome, indicating that most of the families were activated in the Brassica genus after divergence from Arabidopsis. Copy numbers varied from 4 to 1459 for each MITE family, and there was up to 6-fold variation between B. rapa and B. oleracea. In particular, analysis of intact members showed that whereas eleven families were present in similar copy numbers in B. rapa and B. oleracea, nine families showed copy number variation ranging from 2- to 16-fold. Four of those families (BraSto-3, BraTo-3, 4, 5) were more abundant in B. rapa, and the other five (BraSto-1, BraSto-4, BraTo-1, 7 and BraHAT-1) were more abundant in B. oleracea. Overall, 54% and 51% of the MITEs resided in or within 2 kb of a gene in the B. rapa and B. oleracea genomes, respectively. Notably, 92 MITEs were found within the CDS of annotated genes, suggesting that MITEs might play roles in diversification of genes in the recently triplicated Brassica genome. MITE insertion polymorphism (MIP) analysis of 289 MITE members showed that 52% and 23% were polymorphic at the inter- and intra-species levels, respectively, indicating that there has been recent MITE activity in the Brassica genome. These recently activated MITE families with abundant MIP will provide useful resources for molecular breeding and identification of novel functional genes arising from MITE insertion.
Collapse
|
18
|
Ben-David S, Yaakov B, Kashkush K. Genome-wide analysis of short interspersed nuclear elements SINES revealed high sequence conservation, gene association and retrotranspositional activity in wheat. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 76:201-10. [PMID: 23855320 PMCID: PMC4223381 DOI: 10.1111/tpj.12285] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2013] [Revised: 06/04/2013] [Accepted: 07/03/2013] [Indexed: 05/02/2023]
Abstract
Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retroelements that are present in most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, they are poorly studied in plants, especially in wheat (Triticum aestivum). We used quantitative PCR of various wheat species to determine the copy number of a wheat SINE family, termed Au SINE, combined with computer-assisted analyses of the publicly available 454 pyrosequencing database of T. aestivum. In addition, we utilized site-specific PCR on 57 Au SINE insertions, transposon methylation display and transposon display on newly formed wheat polyploids to assess retrotranspositional activity, epigenetic status and genetic rearrangements in Au SINE, respectively. We retrieved 3706 different insertions of Au SINE from the 454 pyrosequencing database of T. aestivum, and found that most of the elements are inserted in A/T-rich regions, while approximately 38% of the insertions are associated with transcribed regions, including known wheat genes. We observed typical retrotransposition of Au SINE in the second generation of a newly formed wheat allohexaploid, and massive hypermethylation in CCGG sites surrounding Au SINE in the third generation. Finally, we observed huge differences in the copy numbers in diploid Triticum and Aegilops species, and a significant increase in the copy numbers in natural wheat polyploids, but no significant increase in the copy number of Au SINE in the first four generations for two of three newly formed allopolyploid species used in this study. Our data indicate that SINEs may play a prominent role in the genomic evolution of wheat through stress-induced activation.
Collapse
|
19
|
Yang G. MITE Digger, an efficient and accurate algorithm for genome wide discovery of miniature inverted repeat transposable elements. BMC Bioinformatics 2013; 14:186. [PMID: 23758809 PMCID: PMC3680318 DOI: 10.1186/1471-2105-14-186] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2013] [Accepted: 06/02/2013] [Indexed: 11/25/2022] Open
Abstract
Background Miniature inverted repeat transposable elements (MITEs) are abundant non-autonomous elements, playing important roles in shaping gene and genome evolution. Their characteristic structural features are suitable for automated identification by computational approaches, however, de novo MITE discovery at genomic levels is still resource expensive. Efficient and accurate computational tools are desirable. Existing algorithms process every member of a MITE family, therefore a major portion of the computing task is redundant. Results In this study, redundant computing steps were analyzed and a novel algorithm emphasizing on the reduction of such redundant computing was implemented in MITE Digger. It completed processing the whole rice genome sequence database in ~15 hours and produced 332 MITE candidates with low false positive (1.8%) and false negative (0.9%) rates. MITE Digger was also tested for genome wide MITE discovery with four other genomes. Conclusions MITE Digger is efficient and accurate for genome wide retrieval of MITEs. Its user friendly interface further facilitates genome wide analyses of MITEs on a routine basis. The MITE Digger program is available at: http://labs.csb.utoronto.ca/yang/MITEDigger.
Collapse
Affiliation(s)
- Guojun Yang
- Department of Biology, University of Toronto Mississauga, Mississauga, ON L5L 1C6, Canada.
| |
Collapse
|
20
|
Fattash I, Rooke R, Wong A, Hui C, Luu T, Bhardwaj P, Yang G. Miniature inverted-repeat transposable elements: discovery, distribution, and activity. Genome 2013; 56:475-86. [PMID: 24168668 DOI: 10.1139/gen-2012-0174] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Eukaryotic organisms have dynamic genomes, with transposable elements (TEs) as a major contributing factor. Although the large autonomous TEs can significantly shape genomic structures during evolution, genomes often harbor more miniature nonautonomous TEs that can infest genomic niches where large TEs are rare. In spite of their cut-and-paste transposition mechanisms that do not inherently favor copy number increase, miniature inverted-repeat transposable elements (MITEs) are abundant in eukaryotic genomes and exist in high copy numbers. Based on the large number of MITE families revealed in previous studies, accurate annotation of MITEs, particularly in newly sequenced genomes, will identify more genomes highly rich in these elements. Novel families identified from these analyses, together with the currently known families, will further deepen our understanding of the origins, transposase sources, and dramatic amplification of these elements.
Collapse
Affiliation(s)
- Isam Fattash
- a Department of Biology, University of Toronto at Mississauga, 3359 Mississauga Road, Mississauga, ON L5L 1C6, Canada
| | | | | | | | | | | | | |
Collapse
|
21
|
Gambin T, Startek M, Walczak K, Paszek J, Grzebelus D, Gambin A. TIRfinder: A Web Tool for Mining Class II Transposons Carrying Terminal Inverted Repeats. Evol Bioinform Online 2013. [PMCID: PMC3562082 DOI: 10.4137/ebo.s10619] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Transposable elements (TEs) can be found in virtually all known genomes; plant genomes are exceptionally rich in this kind of dispersed repetitive sequences. Current knowledge on TE proliferation dynamics places them among the main forces of molecular evolution. Therefore efficient tools to analyze TE distribution in genomes are needed that would allow for comparative genomics studies and for studying TE dynamics in a genome. This was our main motivation underpinning TIRfinder construction–-an efficient tool for mining class II TEs carrying terminal inverted repeats. TIRfinder takes as an input a genomic sequence and information on structural properties of a TE family, and identifies all TEs in the genome showing the desired structural characteristics. The efficiency and small memory requirements of our approach stem from the use of suffix trees to identify all DNA segments surrounded by user-specified terminal inverse repeats (TIR) and target site duplications (TSD) which together constitute a mask. On the other hand, the flexibility of the notion of the TIR/TSD mask makes it possible to use the tool for de novo detection. The main advantages of TIRfinder are its speed, accuracy and convenience of use for biologists. A web-based interface is freely available at http:/bioputer.mimuw.edu.pl/tirfindertool/ .
Collapse
Affiliation(s)
- Tomasz Gambin
- Institute of Computer Science, Warsaw University of Technology, Warsaw, Poland
| | - Michał Startek
- College of Inter-Faculty Individual Studies in Mathematics and Natural Sciences, University of Warsaw, Warsaw, Poland
| | - Krzysztof Walczak
- Institute of Computer Science, Warsaw University of Technology, Warsaw, Poland
| | - Jarosław Paszek
- Institute of Informatics, University of Warsaw, Warsaw, Poland
| | - Dariusz Grzebelus
- Department of Genetics, Plant Breeding and Seed Science, University of Agriculture in Krakow, Krakow, Poland
| | - Anna Gambin
- Institute of Informatics, University of Warsaw, Warsaw, Poland
- Mossakowski Medical Research Centre Polish Academy of Sciences, Warsaw, Poland
| |
Collapse
|
22
|
Yaakov B, Ben-David S, Kashkush K. Genome-wide analysis of Stowaway-like MITEs in wheat reveals high sequence conservation, gene association, and genomic diversification. PLANT PHYSIOLOGY 2013; 161:486-96. [PMID: 23104862 PMCID: PMC3532278 DOI: 10.1104/pp.112.204404] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2012] [Accepted: 10/24/2012] [Indexed: 05/18/2023]
Abstract
The diversity and evolution of wheat (Triticum-Aegilops group) genomes is determined, in part, by the activity of transposable elements that constitute a large fraction of the genome (up to 90%). In this study, we retrieved sequences from publicly available wheat databases, including a 454-pyrosequencing database, and analyzed 18,217 insertions of 18 Stowaway-like miniature inverted-repeat transposable element (MITE) families previously characterized in wheat that together account for approximately 1.3 Mb of sequence. All 18 families showed high conservation in length, sequence, and target site preference. Furthermore, approximately 55% of the elements were inserted in transcribed regions, into or near known wheat genes. Notably, we observed significant correlation between the mean length of the MITEs and their copy number. In addition, the genomic composition of nine MITE families was studied by real-time quantitative polymerase chain reaction analysis in 40 accessions of Triticum spp. and Aegilops spp., including diploids, tetraploids, and hexaploids. The quantitative polymerase chain reaction data showed massive and significant intraspecific and interspecific variation as well as genome-specific proliferation and nonadditive quantities in the polyploids. We also observed significant differences in the methylation status of the insertion sites among MITE families. Our data thus suggest a possible role for MITEs in generating genome diversification and in the establishment of nascent polyploid species in wheat.
Collapse
Affiliation(s)
| | | | - Khalil Kashkush
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva 84105, Israel
| |
Collapse
|
23
|
Abstract
The initial identification of transposable elements (TEs) was attributed to the activity of DNA transposable elements, which are prevalent in plants. Unlike RNA elements, which accumulate in the gene-poor heterochromatic regions, most DNA elements are located in the gene rich regions and many of them carry genes or gene fragments. As such, DNA elements have a more intimate relationship with genes and may have an immediate impact on gene expression and gene function. DNA elements are structurally distinct from RNA elements and most of them have terminal inverted repeats (TIRs). Such structural features have been used to identify the relevant elements from genomic sequences. Among the DNA elements in plants, the most abundant type is the miniature inverted repeat transposable elements (MITEs). This chapter discusses the methods to identify MITEs, Helitrons, and other DNA transposable elements.
Collapse
Affiliation(s)
- Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
24
|
Xu HE, Zhang HH, Han MJ, Shen YH, Huang XZ, Xiang ZH, Zhang Z. [Computational approaches for identification and classification of transposable elements in eukaryotic genomes]. YI CHUAN = HEREDITAS 2012; 34:1009-1019. [PMID: 22917906 DOI: 10.3724/sp.j.1005.2012.01009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Repetitive sequences (repeats) represent a significant fraction of the eukaryotic genomes and can be divided into tandem repeats, segmental duplications, and interspersed repeats on the basis of their sequence characteristics and how they are formed. Most interspersed repeats are derived from transposable elements (TEs). Eukaryotic TEs have been subdivided into two major classes according to the intermediate they use to move. The transposition and amplification of TEs have a great impact on the evolution of genes and the stability of genomes. However, identification and classification of TEs are complex and difficult due to the fact that their structure and classification are complex and diverse compared with those of other types of repeats. Here, we briefly introduced the function and classification of TEs, and summarized three different steps for identification, classification and annotation of TEs in eukaryotic genomes: (1) assembly of a repeat library, (2) repeat correction and classification, and (3) genome annotation. The existing computational approaches for each step were summarized and the advantages and disadvantages of the approaches were also highlighted in this review. To accurately identify, classify, and annotate the TEs in eukaryotic genomes requires combined methods. This review provides useful information for biologists who are not familiar with these approaches to find their way through the forest of programs.
Collapse
Affiliation(s)
- Hong-En Xu
- The Institute of Sericulture and Systems Biology, Southwest University, Chongqing, China.
| | | | | | | | | | | | | |
Collapse
|
25
|
Yang G, Wong A, Rooke R. ATon, abundant novel nonautonomous mobile genetic elements in yellow fever mosquito (Aedes aegypti). BMC Genomics 2012; 13:283. [PMID: 22738224 PMCID: PMC3422177 DOI: 10.1186/1471-2164-13-283] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2012] [Accepted: 06/27/2012] [Indexed: 12/16/2022] Open
Abstract
Background Mosquitoes are important pathogen vectors affecting human and other animals. Studies on genetic control of mosquito mediated disease transmission gained traction recently due to mosquito transgenesis technology. Active transposons are considered valuable tools to propagate pathogen resistance transgenes among mosquitoes, rendering the whole population recalcitrant to diseases. A major hurdle in this approach is the inefficient remobilization activity after the integration of heterologous transposon vectors bearing transgenes into chromosomes. Therefore, endogenous active transposons in mosquito genomes are highly desirable. Results Starting with the transposable element database of the yellow fever mosquito Aedes aegypti genome, detailed analyses of the members of each TE family were performed to identify sequences with multiple identical copies, an indicator of their latest or current transposition activity. Among a dozen of potentially active TE families, two DNA elements (TF000728 and TF000742 in TEfam) are short and nonautonomous. Close inspection of the elements revealed that these two families were previously mis-categorized and, unlike other known TEs, insert specifically at dinucleotide “AT”. These two families were therefore designated as ATon-I and ATon-II. ATon-I has a total copy number of 294, among which three elements have more than 10 identical copies (146, 61 and 17). ATon-II has a total copy number of 317, among which three elements have more than 10 identical copies (84, 15 and 12). Genome wide searches revealed additional 24 ATon families in A. aegypti genome with nearly 6500 copies in total. Transposon display analysis of ATon-1 family using different A. aegypti strains suggests that the elements are similarly abundant in the tested mosquito strains. Conclusion ATons are novel mobile genetic elements bearing terminal inverted repeats and insert specifically at dinucleotide “AT”. Five ATon families contain elements existing at more than 10 identical copies, suggesting very recent or current transposition activity. A total of 24 new TE families with nearly 6000 copies were identified in this study.
Collapse
Affiliation(s)
- Guojun Yang
- Department of Biology, University of Toronto Mississauga, SB3058, Mississauga, ON, L5L 1 C6, Canada.
| | | | | |
Collapse
|
26
|
Janicki M, Rooke R, Yang G. Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes. Chromosome Res 2012; 19:787-808. [PMID: 21850457 DOI: 10.1007/s10577-011-9230-7] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
A major portion of most eukaryotic genomes are transposable elements (TEs). During evolution, TEs have introduced profound changes to genome size, structure, and function. As integral parts of genomes, the dynamic presence of TEs will continue to be a major force in reshaping genomes. Early computational analyses of TEs in genome sequences focused on filtering out "junk" sequences to facilitate gene annotation. When the high abundance and diversity of TEs in eukaryotic genomes were recognized, these early efforts transformed into the systematic genome-wide categorization and classification of TEs. The availability of genomic sequence data reversed the classical genetic approaches to discovering new TE families and superfamilies. Curated TE databases and their accurate annotation of genome sequences in turn facilitated the studies on TEs in a number of frontiers including: (1) TE-mediated changes of genome size and structure, (2) the influence of TEs on genome and gene functions, (3) TE regulation by host, (4) the evolution of TEs and their population dynamics, and (5) genomic scale studies of TE activity. Bioinformatics and genomic approaches have become an integral part of large-scale studies on TEs to extract information with pure in silico analyses or to assist wet lab experimental studies. The current revolution in genome sequencing technology facilitates further progress in the existing frontiers of research and emergence of new initiatives. The rapid generation of large-sequence datasets at record low costs on a routine basis is challenging the computing industry on storage capacity and manipulation speed and the bioinformatics community for improvement in algorithms and their implementations.
Collapse
Affiliation(s)
- Mateusz Janicki
- Department of Biology, University of Toronto at Mississauga, 3359 Mississauga Road, Mississauga, ON L5L1C6, Canada
| | | | | |
Collapse
|
27
|
Flutre T, Permal E, Quesneville H. Transposable Element Annotation in Completely Sequenced Eukaryote Genomes. PLANT TRANSPOSABLE ELEMENTS 2012. [DOI: 10.1007/978-3-642-31842-9_2] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
28
|
Lu C, Chen J, Zhang Y, Hu Q, Su W, Kuang H. Miniature inverted-repeat transposable elements (MITEs) have been accumulated through amplification bursts and play important roles in gene expression and species diversity in Oryza sativa. Mol Biol Evol 2011; 29:1005-17. [PMID: 22096216 PMCID: PMC3278479 DOI: 10.1093/molbev/msr282] [Citation(s) in RCA: 147] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Miniature inverted–repeat transposable elements (MITEs) are predicted to play important roles on genome evolution. We developed a BLASTN-based approach for de novo identification of MITEs and systematically analyzed MITEs in rice genome. The genome of rice cultivar Nipponbare (Oryza sativa ssp. japonica) harbors 178,533 MITE-related sequences classified into 338 families. Pairwise nucleotide diversity and phylogenetic tree analysis indicated that individual MITE families were resulted from one or multiple rounds of amplification bursts. The timing of amplification burst varied considerably between different MITE families or subfamilies. MITEs are associated with 23,623 (58.2%) genes in rice genome. At least 7,887 MITEs are transcribed and more than 3,463 were transcribed with rice genes. The MITE sequences transcribed with rice coding genes form 1,130 pairs of potential natural sense/antisense transcripts. MITEs generate 23.5% (183,837 of 781,885) of all small RNAs identified from rice. Some MITE families generated small RNAs mainly from the terminals, while other families generated small RNAs predominantly from the central region. More than half (51.8%) of the MITE-derived small RNAs were generated exclusively by MITEs located away from genes. Genome-wide analysis showed that genes associated with MITEs have significantly lower expression than genes away from MITEs. Approximately 14.8% of loci with full-length MITEs have presence/absence polymorphism between rice cultivars 93-11 (O. sativa ssp. indica) and Nipponbare. Considering that different sets of genes may be regulated by MITE-derived small RNAs in different genotypes, MITEs provide considerable diversity for O. sativa.
Collapse
Affiliation(s)
- Chen Lu
- Key Laboratory of Horticulture Biology, Ministry of Education and Department of Vegetable Crops, College of Horticulture and Forestry, Huazhong Agricultural University, Wuhan, People's Republic of China
| | | | | | | | | | | |
Collapse
|
29
|
Abstract
Miniature inverted terminal repeat elements (MITEs) are nonautonomous mobile elements that have a significant impact on bacterial evolution. Here we characterize E622, a 611-bp virulence-associated MITE from Pseudomonas syringae, which contains no coding region but has almost perfect 168-bp inverted repeats. Using an antibiotic coupling assay, we show that E622 is transposable and can mobilize an antibiotic resistance gene contained between its borders. Its predicted parent element, designated TnE622, has a typical transposon structure with a three-gene operon, consisting of resolvase, integrase, and exeA-like genes, which is bounded by the same terminal inverted repeats as E622. A broader genome level survey of the E622/TnE622 inverted repeats identified homologs in Pseudomonas, Salmonella, Shewanella, Erwinia, Pantoea, and the cyanobacteria Nostoc and Cyanothece, many of which appear to encompass known virulence genes, including genes encoding toxins, enzymes, and type III secreted effectors. Its association with niche-specific genetic determinants, along with its persistence and evolutionary diversification, indicates that this mobile element family has played a prominent role in the evolution of many agriculturally and clinically relevant pathogenic bacteria.
Collapse
|
30
|
Abstract
Fungal genome annotation is the starting point for analysis of genome content. This generally involves the application of diverse methods to identify features on a genome assembly such as protein-coding and non-coding genes, repeats and transposable elements, and pseudogenes. Here we describe tools and methods leveraged for eukaryotic genome annotation with a focus on the annotation of fungal nuclear and mitochondrial genomes. We highlight the application of the latest technologies and tools to improve the quality of predicted gene sets. The Broad Institute eukaryotic genome annotation pipeline is described as one example of how such methods and tools are integrated into a sequencing center's production genome annotation environment.
Collapse
Affiliation(s)
- Brian J Haas
- Genome Sequencing and Analysis Program, Broad Institute, 7 Cambridge Center, Cambridge, MA 02142, U.S.A
| | | | | | | | | |
Collapse
|
31
|
Abstract
Advances in sequencing technologies have fundamentally changed the pace of genome sequencing projects and have contributed to the ever-increasing volume of genomic data. This has been paralleled by an increase in computational power and resources to process and translate raw sequence data into meaningful information. In addition to protein coding regions, an integral part of all the genomes studied so far has been the presence of repetitive sequences. Previously considered as "junk," numerous studies have implicated repetitive sequences in important biological and structural roles in the genome. Therefore, the identification and characterization of these repetitive sequences has become an indispensable part of genome sequencing projects. Numerous similarity-based and de novo methods have been developed to search for and annotate repeats in the genome, many of which have been discussed in this chapter.
Collapse
|
32
|
Identifying repeats and transposable elements in sequenced genomes: how to find your way through the dense forest of programs. Heredity (Edinb) 2009; 104:520-33. [PMID: 19935826 DOI: 10.1038/hdy.2009.165] [Citation(s) in RCA: 143] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The production of genome sequences has led to another important advance in their annotation, which is closely linked to the exact determination of their content in terms of repeats, among which are transposable elements (TEs). The evolutionary implications and the presence of coding regions in some TEs can confuse gene annotation, and also hinder the process of genome assembly, making particularly crucial to be able to annotate and classify them correctly in genome sequences. This review is intended to provide an overview as comprehensive as possible of the automated methods currently used to annotate and classify TEs in sequenced genomes. Different categories of programs exist according to their methodology and the repeat, which they can identify. I describe here the main characteristics of the programs, their main goals and the difficulties they can entail. The drawbacks of the different methods are also highlighted to help biologists who are unfamiliar with algorithmic methods to understand this methodology better. Globally, using several different programs and carrying out a cross comparison of their results has the best chance of finding reliable results as any single program. However, this makes it essential to verify the results provided by each program independently. The ideal solution would be to test all programs against the same data set to obtain a true comparison of their actual performance.
Collapse
|
33
|
Hikosaka A, Kawahara A. A systematic search and classification of T2 family miniature inverted-repeat transposable elements (MITEs) in Xenopus tropicalis suggests the existence of recently active MITE subfamilies. Mol Genet Genomics 2009; 283:49-62. [PMID: 19915867 DOI: 10.1007/s00438-009-0496-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2009] [Accepted: 10/23/2009] [Indexed: 01/06/2023]
Abstract
To reveal the genome-wide aspects of Xenopus T2 family miniature inverted-repeat transposable elements (MITEs), we performed a systematic search and classification of MITEs by a newly developed procedure. A terminal sequence motif (T2-motif: TTAAAGGRR) was retrieved from the Xenopus tropicalis genome database. We then selected 51- to 1,000-bp MITE candidates framed by an inverted pair of 2 T2-motifs. The 34,398 candidates were classified into possible clusters by a novel terminal sequence (TS)-clustering method on the basis of differences in their short terminal sequences. Finally, 19,242 MITEs were classified into 16 major MITE subfamilies (TS subfamilies), 10 of which showed apparent homologies to known T2 MITE subfamilies, and the rest were novel TS subfamilies. Intra- and inter-subfamily similarities or differences were investigated by analyses of diversity in GC content, total length, and sequence alignments. Furthermore, genome-wide conservation of the inverted pair structure of subfamily-specific TS stretches and their target site sequence (TTAA) were analyzed. The results suggested that some TS subfamilies might include active or at least recently active MITEs for transposition and/or amplification, but some others might have lost such activities a long time ago. The present methodology was efficient in identifying and classifying MITEs, thereby providing information on the evolutionary dynamics of MITEs.
Collapse
Affiliation(s)
- Akira Hikosaka
- Graduate School of Integrated Arts and Sciences, Hiroshima University, Higashi-Hiroshima, Hiroshima 739-8521, Japan.
| | | |
Collapse
|
34
|
Population dynamics of miniature inverted-repeat transposable elements (MITEs) in Medicago truncatula. Gene 2009; 448:214-20. [PMID: 19539732 DOI: 10.1016/j.gene.2009.06.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2009] [Revised: 06/09/2009] [Accepted: 06/11/2009] [Indexed: 11/21/2022]
Abstract
Miniature inverted-repeat transposable elements (MITEs) are small and high copy number transposons, related to and mobilized by some class II autonomous elements. New MITE families can be identified by computer-based mining of sequenced genomes. We describe four MITE families related to MtPH transposons mined de novo in the genome of Medicago truncatula, together with one previously described family MITRAV. Different levels of their intra-family sequence diversity and insertion polymorphism indicate that they were active at different evolutionary periods. MetMIT1 and MITRAV families were uniform in sequence and produced highly polymorphic insertion sites in 26 ecotypes representing a M. truncatula core collection. A subset of insertions was present only in the reference genome of A17 'Jemalong', suggesting that the two families might have been active in the course of domestication. In contrast, all investigated insertions of the MetMIT2 family were fixed, showing that it was not active after M. truncatula speciation. MetMIT1 elements were divided into three clusters, i.e. (I) relatively heterogenous copies fixed in the genome of M. truncatula, (II) uniform but also mostly fixed, and (III) uniform and polymorphic among the investigated accessions. It might reflect the evolutionary history of the MetMIT1 family, showing multiple bursts of activity. A number of MetMIT1 and MITRAV insertions were present within 1 kb upstream or downstream the ORF. A high proportion of insertions proximal to coding regions was unique to A17 'Jemalong'.
Collapse
|
35
|
Chen Y, Zhou F, Li G, Xu Y. MUST: A system for identification of miniature inverted-repeat transposable elements and applications to Anabaena variabilis and Haloquadratum walsbyi. Gene 2009; 436:1-7. [DOI: 10.1016/j.gene.2009.01.019] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2008] [Revised: 01/23/2009] [Accepted: 01/24/2009] [Indexed: 01/30/2023]
|
36
|
Abstract
Eukaryotic genomes are full of repetitive DNA, transposable elements (TEs) in particular, and accordingly there are a number of computational methods that can be used to identify TEs from genomic sequences. We present here a survey of two of the most readily available and widely used bioinformatics applications for the detection, characterization, and analysis of TE sequences in eukaryotic genomes: CENSOR and RepeatMasker. For each program, information on availability, input, output, and the algorithmic methods used is provided. Specific examples of the use of CENSOR and RepeatMasker are also described. CENSOR and RepeatMasker both rely on homology-based methods for the detection of TE sequences. There are several other classes of methods available for the analysis of repetitive DNA sequences including de novo methods that compare genomic sequences against themselves, class-specific methods that use structural characteristics of specific classes of elements to aid in their identification, and pipeline methods that combine aspects of some or all of the aforementioned methods. We briefly consider the strengths and weaknesses of these different classes of methods with an emphasis on their complementary utility for the analysis of repetitive DNA in eukaryotes.
Collapse
|
37
|
Zhou F, Olman V, Xu Y. Insertion Sequences show diverse recent activities in Cyanobacteria and Archaea. BMC Genomics 2008; 9:36. [PMID: 18218090 PMCID: PMC2246112 DOI: 10.1186/1471-2164-9-36] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2007] [Accepted: 01/24/2008] [Indexed: 11/11/2022] Open
Abstract
Background Mobile genetic elements (MGEs) play an essential role in genome rearrangement and evolution, and are widely used as an important genetic tool. Results In this article, we present genetic maps of recently active Insertion Sequence (IS) elements, the simplest form of MGEs, for all sequenced cyanobacteria and archaea, predicted based on the previously identified ~1,500 IS elements. Our predicted IS maps are consistent with the NCBI annotations of the IS elements. By linking the predicted IS elements to various characteristics of the organisms under study and the organism's living conditions, we found that (a) the activities of IS elements heavily depend on the environments where the host organisms live; (b) the number of recently active IS elements in a genome tends to increase with the genome size; (c) the flanking regions of the recently active IS elements are significantly enriched with genes encoding DNA binding factors, transporters and enzymes; and (d) IS movements show no tendency to disrupt operonic structures. Conclusion This is the first genome-scale maps of IS elements with detailed structural information on the sequence level. These genetic maps of recently active IS elements and the several interesting observations would help to improve our understanding of how IS elements proliferate and how they are involved in the evolution of the host genomes.
Collapse
Affiliation(s)
- Fengfeng Zhou
- Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.
| | | | | |
Collapse
|
38
|
Grzebelus D, Lasota S, Gambin T, Kucherov G, Gambin A. Diversity and structure of PIF/Harbinger-like elements in the genome of Medicago truncatula. BMC Genomics 2007; 8:409. [PMID: 17996080 PMCID: PMC2213677 DOI: 10.1186/1471-2164-8-409] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2007] [Accepted: 11/09/2007] [Indexed: 11/25/2022] Open
Abstract
Background Transposable elements constitute a significant fraction of plant genomes. The PIF/Harbinger superfamily includes DNA transposons (class II elements) carrying terminal inverted repeats and producing a 3 bp target site duplication upon insertion. The presence of an ORF coding for the DDE/DDD transposase, required for transposition, is characteristic for the autonomous PIF/Harbinger-like elements. Based on the above features, PIF/Harbinger-like elements were identified in several plant genomes and divided into several evolutionary lineages. Availability of a significant portion of Medicago truncatula genomic sequence allowed for mining PIF/Harbinger-like elements, starting from a single previously described element MtMaster. Results Twenty two putative autonomous, i.e. carrying an ORF coding for TPase and complete terminal inverted repeats, and 67 non-autonomous PIF/Harbinger-like elements were found in the genome of M. truncatula. They were divided into five families, MtPH-A5, MtPH-A6, MtPH-D,MtPH-E, and MtPH-M, corresponding to three previously identified and two new lineages. The largest families, MtPH-A6 and MtPH-M were further divided into four and three subfamilies, respectively. Non-autonomous elements were usually direct deletion derivatives of the putative autonomous element, however other types of rearrangements, including inversions and nested insertions were also observed. An interesting structural characteristic – the presence of 60 bp tandem repeats – was observed in a group of elements of subfamily MtPH-A6-4. Some families could be related to miniature inverted repeat elements (MITEs). The presence of empty loci (RESites), paralogous to those flanking the identified transposable elements, both autonomous and non-autonomous, as well as the presence of transposon insertion related size polymorphisms, confirmed that some of the mined elements were capable for transposition. Conclusion The population of PIF/Harbinger-like elements in the genome of M. truncatula is diverse. A detailed intra-family comparison of the elements' structure proved that they proliferated in the genome generally following the model of abortive gap repair. However, the presence of tandem repeats facilitated more pronounced rearrangements of the element internal regions. The insertion polymorphism of the MtPH elements and related MITE families in different populations of M. truncatula, if further confirmed experimentally, could be used as a source of molecular markers complementary to other marker systems.
Collapse
Affiliation(s)
- Dariusz Grzebelus
- Department of Genetics, Plant Breeding and Seed Science, Agricultural University of Krakow, Al, 29 Listopada 54, 31-425 Krakow, Poland.
| | | | | | | | | |
Collapse
|
39
|
Xu L, Wang L, Liu T, Qian W, Gao Y, An C. Triton, a novel family of miniature inverted-repeat transposable elements (MITEs) in Trichosanthes kirilowii Maximowicz and its effect on gene regulation. Biochem Biophys Res Commun 2007; 364:668-74. [PMID: 18028877 DOI: 10.1016/j.bbrc.2007.10.069] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2007] [Accepted: 10/12/2007] [Indexed: 11/29/2022]
Abstract
Miniature inverted-repeat transposable elements (MITEs) have a broad impact on genome structure and function. Although MITEs are found associated to genes, little is known about their effect on gene regulation. We have identified a novel MITE family, named Triton, whilst analyzing two independent trichosanthin (TCS) gene promoters (TP9 and TP12) cloned from Trichosanthes kirilowii Maximowicz. Triton1 and Triton2 are nested in TP9, and Triton3 (with 93% sequence similarity to Triton2) is in TP12. To assess the effect of MITE insertion on TCS promoters, we excised Triton1 from TP9 and inserted it into TP12. GUS activity analysis revealed that nested Triton1 is required for effective repression of promoter activity. Detailed analyses of a series of 5'-truncated promoters concerning Triton1 showed that a dark-specific repressor and some constitutive elements endow Triton1 with ability to response to light conditions. These results suggest that Triton1 MITE, which contains cis-regulatory elements, could mediate gene expression.
Collapse
Affiliation(s)
- Li Xu
- National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing 100871, China
| | | | | | | | | | | |
Collapse
|
40
|
Tóth G, Deák G, Barta E, Kiss GB. PLOTREP: a web tool for defragmentation and visual analysis of dispersed genomic repeats. Nucleic Acids Res 2006; 34:W708-13. [PMID: 16845104 PMCID: PMC1538846 DOI: 10.1093/nar/gkl263] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Identification of dispersed or interspersed repeats, most of which are derived from transposons, retrotransposons or retrovirus-like elements, is an important step in genome annotation. Software tools that compare genomic sequences with precompiled repeat reference libraries using sensitive similarity-based methods provide reliable means of finding the positions of fragments homologous to known repeats. However, their output is often incomplete and fragmented owing to the mutations (nucleotide substitutions, deletions or insertions) that can result in considerable divergence from the reference sequence. Merging these fragments to identify the whole region that represents an ancient copy of a mobile element is challenging, particularly if the element is large and suffered multiple deletions or insertions. Here we report PLOTREP, a tool designed to post-process results obtained by sequence similarity search and merge fragments belonging to the same copy of a repeat. The software allows rapid visual inspection of the results using a dot-plot like graphical output. The web implementation of PLOTREP is available at .
Collapse
Affiliation(s)
- Gábor Tóth
- Agricultural Biotechnology Center, Gödöllo Szent-Györgyi Albert u. 4, H-2100, Hungary.
| | | | | | | |
Collapse
|
41
|
Mennecier S, Servant P, Coste G, Bailone A, Sommer S. Mutagenesis via IS transposition in Deinococcus radiodurans. Mol Microbiol 2006; 59:317-25. [PMID: 16359337 DOI: 10.1111/j.1365-2958.2005.04936.x] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Analysis of the complete genome indicates that insertion sequences (ISs) are abundant in the radio-resistant bacterium Deinococcus radiodurans. By developing a forward mutagenesis assay to detect any inactivation events in D. radiodurans, we found that in the presence of an active mismatch repair system 75% of the mutations to trimethoprim-resistance (Tmp(R)) resulted from an IS insertion into the thyA coding region. Analysis of their distribution among the spontaneous Tmp(R) mutants indicated that five different ISs were transpositionally active. A type II Miniature Inverted-repeat Transposable Element (MITE), related to one of the deinococcal ISs, was also discovered as an insertion into thyA. Seven additional genomic copies of this MITE element were identified by BLASTN. Gamma-ray irradiation of D. radiodurans led to an increase of up to 10-fold in the frequency of Tmp(R) mutants. Analysis of the induced mutations in cells exposed to 10 kGy indicated that gamma-irradiation induced transposition of ISDra2 approximately 100-fold. A 50-fold induction of ISDra2 transposition was also observed in cells exposed to 600 J m(-2) UV-irradiation. Point mutations to rifampicin resistance (Rif(R)) were also induced by gamma-irradiation to reach a plateau at 2 kGy. The plateau value represented a 16-fold increase in the mutant frequency over the background. Although error-free repair strategies predominate in D. radiodurans, an upregulation of transposition, as well as induction of point mutations in cells recovering from DNA damage, provide a genetic variability that may have long-term evolutionary consequences on the fitness of this organism in its habitat.
Collapse
Affiliation(s)
- Samuel Mennecier
- Institut de Génétique et Microbiologie, CNRS UMR 8621, LRC CEA 42V, Bâtiment 409, Université Paris-Sud, F-91405 Orsay Cedex, France
| | | | | | | | | |
Collapse
|
42
|
Blount ZD, Grogan DW. New insertion sequences of Sulfolobus: functional properties and implications for genome evolution in hyperthermophilic archaea. Mol Microbiol 2005; 55:312-25. [PMID: 15612937 DOI: 10.1111/j.1365-2958.2004.04391.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Analyses of complete genomes indicate that insertion sequences (ISs) are abundant and widespread in hyperthermophilic archaea, but few experimental studies have measured their activities in these hosts. As a way to investigate the impact of ISs on Sulfolobus genomes, we identified seven transpositionally active ISs in a widely distributed Sulfolobus species, and measured their functional properties. Six of the seven were found to be distinct from previously described ISs of Sulfolobus, and one of the six could not be assigned to any known IS family. A type II 'Miniature Inverted-repeat Transposable Element' (MITE) related to one of the ISs was also recovered. Rates of transposition of the different ISs into the pyrEF region of their host strains varied over a 250-fold range. The Sulfolobus ISs also differed with respect to target-site selectivity, although several shared an apparent preference for the pyrEF promoter region. Despite the number of distinct ISs assayed and their molecular diversity, only one demonstrated precise excision from the chromosomal target region. The fact that this IS is the only one lacking inverted repeats and target-site duplication suggests that the observed precise excision may be promoted by the IS itself. Sequence searches revealed previously unidentified partial copies of the newly identified ISs in the Sulfolobus tokodaii and Sulfolobus solfataricus genomes. The structures of these fragmentary copies suggest several distinct molecular mechanisms which, in the absence of precise excision, inactivate ISs and gradually eliminate the defective copies from Sulfolobus genomes.
Collapse
Affiliation(s)
- Zachary D Blount
- Department of Biological Sciences, University of Cincinnati, Cincinnati, OH 45221, USA
| | | |
Collapse
|
43
|
Yang G, Lee YH, Jiang Y, Shi X, Kertbundit S, Hall TC. A two-edged role for the transposable element Kiddo in the rice ubiquitin2 promoter. THE PLANT CELL 2005; 17:1559-68. [PMID: 15805485 PMCID: PMC1091774 DOI: 10.1105/tpc.104.030528] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Miniature inverted repeat transposable elements (MITEs) are thought to be a driving force for genome evolution. Although numerous MITEs are found associated with genes, little is known about their function in gene regulation. Whereas the rice ubiquitin2 (rubq2) promoter in rice (Oryza sativa) line IR24 contains two nested MITEs (Kiddo and MDM1), that in line T309 has lost Kiddo, providing an opportunity to understand the role of MITEs in promoter function. No difference in endogenous rubq2 transcript levels between T309 and IR24 was evident using RT-PCR. However, promoter analysis using both transient and stably transformed calli revealed that Kiddo contributed some 20% of the total expression. Bisulfite genomic sequencing of the rubq2 promoters revealed specific DNA methylation at both symmetric and asymmetric cytosine residues on the MITE sequences, possibly induced by low levels of homologous transcripts. When methylation of the MITEs was blocked by 5-azacytidine treatment, a threefold increase in the endogenous rubq2 transcript level was detected in IR24 compared with that in T309. Together with the observed MITE methylation pattern, the detection of low levels of transcripts, but not small RNAs, corresponding to Kiddo and MDM1 suggested that RNA-dependent DNA methylation is induced by MITE transcripts. We conclude that, although Kiddo enhances transcription from the rubq2 promoter, this effect is mitigated by sequence-specific epigenetic modification.
Collapse
Affiliation(s)
- Guojun Yang
- Institute of Developmental and Molecular Biology, Texas A&M University, College Station, Texas 77843-3155, USA
| | | | | | | | | | | |
Collapse
|
44
|
Jiang N, Feschotte C, Zhang X, Wessler SR. Using rice to understand the origin and amplification of miniature inverted repeat transposable elements (MITEs). CURRENT OPINION IN PLANT BIOLOGY 2004; 7:115-9. [PMID: 15003209 DOI: 10.1016/j.pbi.2004.01.004] [Citation(s) in RCA: 137] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Recent studies of rice miniature inverted repeat transposable elements (MITEs), largely fueled by the availability of genomic sequence, have provided answers to many of the outstanding questions regarding the existence of active MITEs, their source of transposases (TPases) and their chromosomal distribution. Although many questions remain about MITE origins and mode of amplification, data accumulated over the past two years have led to the formulation of testable models.
Collapse
Affiliation(s)
- Ning Jiang
- Department of Plant Biology, University of Georgia, Athens, Georgia 30602, USA
| | | | | | | |
Collapse
|