1
|
Genome wide inherited modifications of the tomato epigenome by trans-activated bacterial CG methyltransferase. Cell Mol Life Sci 2024; 81:222. [PMID: 38767725 PMCID: PMC11106227 DOI: 10.1007/s00018-024-05255-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 04/16/2024] [Accepted: 04/25/2024] [Indexed: 05/22/2024]
Abstract
BACKGROUND Epigenetic variation is mediated by epigenetic marks such as DNA methylation occurring in all cytosine contexts in plants. CG methylation plays a critical role in silencing transposable elements and regulating gene expression. The establishment of CG methylation occurs via the RNA-directed DNA methylation pathway and CG methylation maintenance relies on METHYLTRANSFERASE1, the homologue of the mammalian DNMT1. PURPOSE Here, we examined the capacity to stably alter the tomato genome methylome by a bacterial CG-specific M.SssI methyltransferase expressed through the LhG4/pOP transactivation system. RESULTS Methylome analysis of M.SssI expressing plants revealed that their euchromatic genome regions are specifically hypermethylated in the CG context, and so are most of their genes. However, changes in gene expression were observed only with a set of genes exhibiting a greater susceptibility to CG hypermethylation near their transcription start site. Unlike gene rich genomic regions, our analysis revealed that heterochromatic regions are slightly hypomethylated at CGs only. Notably, some M.SssI-induced hypermethylation persisted even without the methylase or transgenes, indicating inheritable epigenetic modification. CONCLUSION Collectively our findings suggest that heterologous expression of M.SssI can create new inherited epigenetic variations and changes in the methylation profiles on a genome wide scale. This open avenues for the conception of epigenetic recombinant inbred line populations with the potential to unveil agriculturally valuable tomato epialleles.
Collapse
|
2
|
Teaching transposon classification as a means to crowd source the curation of repeat annotation - a tardigrade perspective. Mob DNA 2024; 15:10. [PMID: 38711146 DOI: 10.1186/s13100-024-00319-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 04/09/2024] [Indexed: 05/08/2024] Open
Abstract
BACKGROUND The advancement of sequencing technologies results in the rapid release of hundreds of new genome assemblies a year providing unprecedented resources for the study of genome evolution. Within this context, the significance of in-depth analyses of repetitive elements, transposable elements (TEs) in particular, is increasingly recognized in understanding genome evolution. Despite the plethora of available bioinformatic tools for identifying and annotating TEs, the phylogenetic distance of the target species from a curated and classified database of repetitive element sequences constrains any automated annotation effort. Moreover, manual curation of raw repeat libraries is deemed essential due to the frequent incompleteness of automatically generated consensus sequences. RESULTS Here, we present an example of a crowd-sourcing effort aimed at curating and annotating TE libraries of two non-model species built around a collaborative, peer-reviewed teaching process. Manual curation and classification are time-consuming processes that offer limited short-term academic rewards and are typically confined to a few research groups where methods are taught through hands-on experience. Crowd-sourcing efforts could therefore offer a significant opportunity to bridge the gap between learning the methods of curation effectively and empowering the scientific community with high-quality, reusable repeat libraries. CONCLUSIONS The collaborative manual curation of TEs from two tardigrade species, for which there were no TE libraries available, resulted in the successful characterization of hundreds of new and diverse TEs in a reasonable time frame. Our crowd-sourcing setting can be used as a teaching reference guide for similar projects: A hidden treasure awaits discovery within non-model organisms.
Collapse
|
3
|
Frequent horizontal chromosome transfer between asexual fungal insect pathogens. Proc Natl Acad Sci U S A 2024; 121:e2316284121. [PMID: 38442176 PMCID: PMC10945790 DOI: 10.1073/pnas.2316284121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Accepted: 01/24/2024] [Indexed: 03/07/2024] Open
Abstract
Entire chromosomes are typically only transmitted vertically from one generation to the next. The horizontal transfer of such chromosomes has long been considered improbable, yet gained recent support in several pathogenic fungi where it may affect the fitness or host specificity. To date, it is unknown how these transfers occur, how common they are, and whether they can occur between different species. In this study, we show multiple independent instances of horizontal transfers of the same accessory chromosome between two distinct strains of the asexual entomopathogenic fungus Metarhizium robertsii during experimental co-infection of its insect host, the Argentine ant. Notably, only the one chromosome-but no other-was transferred from the donor to the recipient strain. The recipient strain, now harboring the accessory chromosome, exhibited a competitive advantage under certain host conditions. By phylogenetic analysis, we further demonstrate that the same accessory chromosome was horizontally transferred in a natural environment between M. robertsii and another congeneric insect pathogen, Metarhizium guizhouense. Hence, horizontal chromosome transfer is not limited to the observed frequent events within species during experimental infections but also occurs naturally across species. The accessory chromosome that was transferred contains genes that may be involved in its preferential horizontal transfer or support its establishment. These genes encode putative histones and histone-modifying enzymes, as well as putative virulence factors. Our study reveals that both intra- and interspecies horizontal transfer of entire chromosomes is more frequent than previously assumed, likely representing a not uncommon mechanism for gene exchange.
Collapse
|
4
|
Quantitative pathogenicity and host adaptation in a fungal plant pathogen revealed by whole-genome sequencing. Nat Commun 2024; 15:1933. [PMID: 38431601 PMCID: PMC10908820 DOI: 10.1038/s41467-024-46191-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 02/14/2024] [Indexed: 03/05/2024] Open
Abstract
Knowledge of genetic determinism and evolutionary dynamics mediating host-pathogen interactions is essential to manage fungal plant diseases. Studies on the genetic architecture of fungal pathogenicity often focus on large-effect effector genes triggering strong, qualitative resistance. It is not clear how this translates to predominately quantitative interactions. Here, we use the Zymoseptoria tritici-wheat model to elucidate the genetic architecture of quantitative pathogenicity and mechanisms mediating host adaptation. With a multi-host genome-wide association study, we identify 19 high-confidence candidate genes associated with quantitative pathogenicity. Analysis of genetic diversity reveals that sequence polymorphism is the main evolutionary process mediating differences in quantitative pathogenicity, a process that is likely facilitated by genetic recombination and transposable element dynamics. Finally, we use functional approaches to confirm the role of an effector-like gene and a methyltransferase in phenotypic variation. This study highlights the complex genetic architecture of quantitative pathogenicity, extensive diversifying selection and plausible mechanisms facilitating pathogen adaptation.
Collapse
|
5
|
Hypometabolism to survive the long polar night and subsequent successful return to light in the diatom Fragilariopsis cylindrus. THE NEW PHYTOLOGIST 2024; 241:2193-2208. [PMID: 38095198 DOI: 10.1111/nph.19387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 10/17/2023] [Indexed: 02/09/2024]
Abstract
Diatoms, the main eukaryotic phytoplankton of the polar marine regions, are essential for the maintenance of food chains specific to Arctic and Antarctic ecosystems, and are experiencing major disturbances under current climate change. As such, it is fundamental to understand the physiological mechanisms and associated molecular basis of their endurance during the long polar night. Here, using the polar diatom Fragilariopsis cylindrus, we report an integrative analysis combining transcriptomic, microscopic and biochemical approaches to shed light on the strategies used to survive the polar night. We reveal that in prolonged darkness, diatom cells enter a state of quiescence with reduced metabolic and transcriptional activity, during which no cell division occurs. We propose that minimal energy is provided by respiration and degradation of protein, carbohydrate and lipid stores and that homeostasis is maintained by autophagy in prolonged darkness. We also report internal structural changes that manifest the morphological acclimation of cells to darkness, including the appearance of a large vacuole. Our results further show that immediately following a return to light, diatom cells are able to use photoprotective mechanisms and rapidly resume photosynthesis, demonstrating the remarkable robustness of polar diatoms to prolonged darkness at low temperature.
Collapse
|
6
|
Genome biology and evolution of mating-type loci in four cereal rust fungi. PLoS Genet 2024; 20:e1011207. [PMID: 38498573 PMCID: PMC10977897 DOI: 10.1371/journal.pgen.1011207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 03/28/2024] [Accepted: 03/04/2024] [Indexed: 03/20/2024] Open
Abstract
Permanent heterozygous loci, such as sex- or mating-compatibility regions, often display suppression of recombination and signals of genomic degeneration. In Basidiomycota, two distinct loci confer mating compatibility. These loci encode homeodomain (HD) transcription factors and pheromone receptor (Pra)-ligand allele pairs. To date, an analysis of genome level mating-type (MAT) loci is lacking for obligate biotrophic basidiomycetes in the Pucciniales, an order containing serious agricultural plant pathogens. Here, we focus on four species of Puccinia that infect oat and wheat, including P. coronata f. sp. avenae, P. graminis f. sp. tritici, P. triticina and P. striiformis f. sp. tritici. MAT loci are located on two separate chromosomes supporting previous hypotheses of a tetrapolar mating compatibility system in the Pucciniales. The HD genes are multiallelic in all four species while the PR locus appears biallelic, except for P. graminis f. sp. tritici, which potentially has multiple alleles. HD loci are largely conserved in their macrosynteny, both within and between species, without strong signals of recombination suppression. Regions proximal to the PR locus, however, displayed signs of recombination suppression and genomic degeneration in the three species with a biallelic PR locus. Our observations support a link between recombination suppression, genomic degeneration, and allele diversity of MAT loci that is consistent with recent mathematical modelling and simulations. Finally, we confirm that MAT genes are expressed during the asexual infection cycle, and we propose that this may support regulating nuclear maintenance and pairing during infection and spore formation. Our study provides insights into the evolution of MAT loci of key pathogenic Puccinia species. Understanding mating compatibility can help predict possible combinations of nuclear pairs, generated by sexual reproduction or somatic recombination, and the potential evolution of new virulent isolates of these important plant pathogens.
Collapse
|
7
|
The good, the bad and the ugly of transposable elements annotation tools. Genet Mol Biol 2024; 46:e20230138. [PMID: 38373163 PMCID: PMC10876081 DOI: 10.1590/1678-4685-gmb-2023-0138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 11/26/2023] [Indexed: 02/21/2024] Open
Abstract
Transposable elements are repetitive and mobile DNA segments that can be found in virtually all organisms investigated to date. Their complex structure and variable nature are particularly challenging from the genomic annotation point of view. Many softwares have been developed to automate and facilitate TEs annotation at the genomic level, but they are highly heterogeneous regarding documentation, usability and methods. In this review, we revisited the existing software for TE genomic annotation, concentrating on the most often used ones, the methodologies they apply, and usability. Building on the state of the art of TE annotation software we propose best practices and highlight the strengths and weaknesses from the available solutions.
Collapse
|
8
|
High-fidelity (repeat) consensus sequences from short reads using combined read clustering and assembly. BMC Genomics 2024; 25:109. [PMID: 38267856 PMCID: PMC10809544 DOI: 10.1186/s12864-023-09948-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 12/28/2023] [Indexed: 01/26/2024] Open
Abstract
BACKGROUND Despite the many cheap and fast ways to generate genomic data, good and exact genome assembly is still a problem, with especially the repeats being vastly underrepresented and often misassembled. As short reads in low coverage are already sufficient to represent the repeat landscape of any given genome, many read cluster algorithms were brought forward that provide repeat identification and classification. But how can trustworthy, reliable and representative repeat consensuses be derived from unassembled genomes? RESULTS Here, we combine methods from repeat identification and genome assembly to derive these robust consensuses. We test several use cases, such as (1) consensus building from clustered short reads of non-model genomes, (2) from genome-wide amplification setups, and (3) specific repeat-centred questions, such as the linked vs. unlinked arrangement of ribosomal genes. In all our use cases, the derived consensuses are robust and representative. To evaluate overall performance, we compare our high-fidelity repeat consensuses to RepeatExplorer2-derived contigs and check, if they represent real transposable elements as found in long reads. Our results demonstrate that it is possible to generate useful, reliable and trustworthy consensuses from short reads by a combination from read cluster and genome assembly methods in an automatable way. CONCLUSION We anticipate that our workflow opens the way towards more efficient and less manual repeat characterization and annotation, benefitting all genome studies, but especially those of non-model organisms.
Collapse
|
9
|
Chromosome-Level Assemblies for the Pine Pitch Canker Pathogen Fusarium circinatum. Pathogens 2024; 13:70. [PMID: 38251377 PMCID: PMC10819268 DOI: 10.3390/pathogens13010070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 12/26/2023] [Accepted: 01/09/2024] [Indexed: 01/23/2024] Open
Abstract
The pine pitch canker pathogen, Fusarium circinatum, is globally regarded as one of the most important threats to commercial pine-based forestry. Although genome sequences of this fungus are available, these remain highly fragmented or structurally ill-defined. Our overall goal was to provide high-quality assemblies for two notable strains of F. circinatum, and to characterize these in terms of coding content, repetitiveness and the position of telomeres and centromeres. For this purpose, we used Oxford Nanopore Technologies MinION long-read sequences, as well as Illumina short sequence reads. By leveraging the genomic synteny inherent to F. circinatum and its close relatives, these sequence reads were assembled to chromosome level, where contiguous sequences mostly spanned from telomere to telomere. Comparative analyses unveiled remarkable variability in the twelfth and smallest chromosome, which is known to be dispensable. It presented a striking length polymorphism, with one strain lacking substantial portions from the chromosome's distal and proximal regions. These regions, characterized by a lower gene density, G+C content and an increased prevalence of repetitive elements, contrast starkly with the syntenic segments of the chromosome, as well as with the core chromosomes. We propose that these unusual regions might have arisen or expanded due to the presence of transposable elements. A comparison of the overall chromosome structure revealed that centromeric elements often underpin intrachromosomal differences between F. circinatum strains, especially at chromosomal breakpoints. This suggests a potential role for centromeres in shaping the chromosomal architecture of F. circinatum and its relatives. The publicly available genome data generated here, together with the detailed metadata provided, represent essential resources for future studies of this important plant pathogen.
Collapse
|
10
|
A chromosome-scale assembly of the quinoa genome provides insights into the structure and dynamics of its subgenomes. Commun Biol 2023; 6:1263. [PMID: 38092895 PMCID: PMC10719370 DOI: 10.1038/s42003-023-05613-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 11/20/2023] [Indexed: 12/17/2023] Open
Abstract
Quinoa (Chenopodium quinoa Willd.) is an allotetraploid seed crop with the potential to help address global food security concerns. Genomes have been assembled for four accessions of quinoa; however, all assemblies are fragmented and do not reflect known chromosome biology. Here, we use in vitro and in vivo Hi-C data to produce a chromosome-scale assembly of the Chilean accession PI 614886 (QQ74). The final assembly spans 1.326 Gb, of which 90.5% is assembled into 18 chromosome-scale scaffolds. The genome is annotated with 54,499 protein-coding genes, 96.9% of which are located on the 18 largest scaffolds. We also report an updated genome assembly for the B-genome diploid C. suecicum and use it, together with the A-genome diploid C. pallidicaule, to identify genomic rearrangements within the quinoa genome, including a large pericentromeric inversion representing 71.7% of chromosome Cq3B. Repetitive sequences comprise 65.2%, 48.6%, and 57.9% of the quinoa, C. pallidicaule, and C. suecicum genomes, respectively. Evidence suggests that the B subgenome is more dynamic and has expanded more than the A subgenome. These genomic resources will enable more accurate assessments of genome evolution within the Amaranthaceae and will facilitate future efforts to identify variation in genes underlying important agronomic traits in quinoa.
Collapse
|
11
|
The highly improved genome of Ixodes scapularis with X and Y pseudochromosomes. Life Sci Alliance 2023; 6:e202302109. [PMID: 37813487 PMCID: PMC10561763 DOI: 10.26508/lsa.202302109] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 09/21/2023] [Accepted: 09/22/2023] [Indexed: 10/12/2023] Open
Abstract
Ixodes scapularis, the black-legged tick, is the principal vector of the Lyme disease spirochete, Borrelia burgdorferi, and is responsible for most of the ∼470,000 estimated Lyme disease cases annually in the USA. Ixodes scapularis can transmit six additional pathogens of human health significance. Because of its medical importance, I. scapularis was the first tick genome to be sequenced and annotated. However, the first assembly, I. scapularis Wikel (IscaW), was highly fragmented because of the technical challenges posed by the long, repetitive genome sequences characteristic of arthropod genomes and the lack of long-read sequencing techniques. Although I. scapularis has emerged as a model for tick research because of the availability of new tools such as embryo injection and CRISPR-Cas9-mediated gene editing yet the lack of chromosome-scale scaffolds has slowed progress in tick biology and the development of tools for their control. Here we combine diverse technologies to produce the I. scapularis Gulia-Nuss (IscGN) genome assembly and gene set. We used DNA from eggs and male and female adult ticks and took advantage of Hi-C, PacBio HiFi sequencing, and Illumina short-read sequencing technologies to produce a chromosome-level assembly. In this work, we present the predicted pseudochromosomes consisting of 13 autosomes and the sex pseudochromosomes: X and Y, and a markedly improved genome annotation compared with the existing assemblies and annotations.
Collapse
|
12
|
Introduction of Plant Transposon Annotation for Beginners. BIOLOGY 2023; 12:1468. [PMID: 38132293 PMCID: PMC10741241 DOI: 10.3390/biology12121468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 11/21/2023] [Accepted: 11/23/2023] [Indexed: 12/23/2023]
Abstract
Transposons are mobile DNA sequences that contribute large fractions of many plant genomes. They provide exclusive resources for tracking gene and genome evolution and for developing molecular tools for basic and applied research. Despite extensive efforts, it is still challenging to accurately annotate transposons, especially for beginners, as transposon prediction requires necessary expertise in both transposon biology and bioinformatics. Moreover, the complexity of plant genomes and the dynamic evolution of transposons also bring difficulties for genome-wide transposon discovery. This review summarizes the three major strategies for transposon detection including repeat-based, structure-based, and homology-based annotation, and introduces the transposon superfamilies identified in plants thus far, and some related bioinformatics resources for detecting plant transposons. Furthermore, it describes transposon classification and explains why the terms 'autonomous' and 'non-autonomous' cannot be used to classify the superfamilies of transposons. Lastly, this review also discusses how to identify misannotated transposons and improve the quality of the transposon database. This review provides helpful information about plant transposons and a beginner's guide on annotating these repetitive sequences.
Collapse
|
13
|
Unistrand piRNA clusters are an evolutionarily conserved mechanism to suppress endogenous retroviruses across the Drosophila genus. Nat Commun 2023; 14:7337. [PMID: 37957172 PMCID: PMC10643416 DOI: 10.1038/s41467-023-42787-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 10/18/2023] [Indexed: 11/15/2023] Open
Abstract
The PIWI-interacting RNA (piRNA) pathway prevents endogenous genomic parasites, i.e. transposable elements, from damaging the genetic material of animal gonadal cells. Specific regions in the genome, called piRNA clusters, are thought to define each species' piRNA repertoire and therefore its capacity to recognize and silence specific transposon families. The unistrand cluster flamenco (flam) is essential in the somatic compartment of the Drosophila ovary to restrict Gypsy-family transposons from infecting the neighbouring germ cells. Disruption of flam results in transposon de-repression and sterility, yet it remains unknown whether this silencing mechanism is present more widely. Here, we systematically characterise 119 Drosophila species and identify five additional flam-like clusters separated by up to 45 million years of evolution. Small RNA-sequencing validated these as bona-fide unistrand piRNA clusters expressed in somatic cells of the ovary, where they selectively target transposons of the Gypsy family. Together, our study provides compelling evidence of a widely conserved transposon silencing mechanism that co-evolved with virus-like Gypsy-family transposons.
Collapse
|
14
|
ChimeraTE: a pipeline to detect chimeric transcripts derived from genes and transposable elements. Nucleic Acids Res 2023; 51:9764-9784. [PMID: 37615575 PMCID: PMC10570057 DOI: 10.1093/nar/gkad671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 07/25/2023] [Accepted: 08/09/2023] [Indexed: 08/25/2023] Open
Abstract
Transposable elements (TEs) produce structural variants and are considered an important source of genetic diversity. Notably, TE-gene fusion transcripts, i.e. chimeric transcripts, have been associated with adaptation in several species. However, the identification of these chimeras remains hindered due to the lack of detection tools at a transcriptome-wide scale, and to the reliance on a reference genome, even though different individuals/cells/strains have different TE insertions. Therefore, we developed ChimeraTE, a pipeline that uses paired-end RNA-seq reads to identify chimeric transcripts through two different modes. Mode 1 is the reference-guided approach that employs canonical genome alignment, and Mode 2 identifies chimeras derived from fixed or insertionally polymorphic TEs without any reference genome. We have validated both modes using RNA-seq data from four Drosophila melanogaster wild-type strains. We found ∼1.12% of all genes generating chimeric transcripts, most of them from TE-exonized sequences. Approximately ∼23% of all detected chimeras were absent from the reference genome, indicating that TEs belonging to chimeric transcripts may be recent, polymorphic insertions. ChimeraTE is the first pipeline able to automatically uncover chimeric transcripts without a reference genome, consisting of two running Modes that can be used as a tool to investigate the contribution of TEs to transcriptome plasticity.
Collapse
|
15
|
Programmed DNA elimination in Mesorhabditis nematodes. Curr Biol 2023; 33:3711-3721.e5. [PMID: 37607549 DOI: 10.1016/j.cub.2023.07.058] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 07/04/2023] [Accepted: 07/26/2023] [Indexed: 08/24/2023]
Abstract
Some species undergo programmed DNA elimination (PDE), whereby portions of the genome are systematically destroyed in somatic cells. PDE has emerged independently in several phyla, but its function is unknown. Although the mechanisms are partially solved in ciliates, PDE remains mysterious in metazoans because the study species were not yet amenable to functional approaches. We fortuitously discovered massive PDE in the free-living nematode genus Mesorhabditis, from the same family as C. elegans. As such, these species offer many experimental advantages to start elucidating the PDE mechanisms in an animal. Here, we used cytology to describe the dynamics of chromosome fragmentation and destruction in early embryos. Elimination occurs once in development, at the third embryonic cell division in the somatic blastomeres. Chromosomes are first fragmented during S phase. Next, some of the fragments fail to align on the mitotic spindle and remain outside the re-assembled nuclei after mitosis. These fragments are gradually lost after a few cell cycles. The retained fragments form new mini chromosomes, which are properly segregated in the subsequent cell divisions. With genomic approaches, we found that Mesorhabditis mainly eliminate repeated regions and also about a hundred genes. Importantly, none of the eliminated protein-coding genes are shared between closely related Mesorhabditis species. Our results strongly suggest PDE has not been selected for regulating genes with important biological functions in Mesorhabditis but rather mainly to irreversibly remove repeated sequences in the soma. We propose that PDE may target genes, provided their elimination in the soma is invisible to selection.
Collapse
|
16
|
Abundance and Diversification of Repetitive Elements in Decapoda Genomes. Genes (Basel) 2023; 14:1627. [PMID: 37628678 PMCID: PMC10454600 DOI: 10.3390/genes14081627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 08/05/2023] [Accepted: 08/12/2023] [Indexed: 08/27/2023] Open
Abstract
Repetitive elements are a major component of DNA sequences due to their ability to propagate through the genome. Characterization of Metazoan repetitive profiles is improving; however, current pipelines fail to identify a significant proportion of divergent repeats in non-model organisms. The Decapoda order, for which repeat content analyses are largely lacking, is characterized by extremely variable genome sizes that suggest an important presence of repetitive elements. Here, we developed a new standardized pipeline to annotate repetitive elements in non-model organisms, which we applied to twenty Decapoda and six other Crustacea genomes. Using this new tool, we identified 10% more repetitive elements than standard pipelines. Repetitive elements were more abundant in Decapoda species than in other Crustacea, with a very large number of highly repeated satellite DNA families. Moreover, we demonstrated a high correlation between assembly size and transposable elements and different repeat dynamics between Dendrobranchiata and Reptantia. The patterns of repetitive elements largely reflect the phylogenetic relationships of Decapoda and the distinct evolutionary trajectories within Crustacea. In summary, our results highlight the impact of repetitive elements on genome evolution in Decapoda and the value of our novel annotation pipeline, which will provide a baseline for future comparative analyses.
Collapse
|
17
|
Efficient homology-based annotation of transposable elements using minimizers. APPLICATIONS IN PLANT SCIENCES 2023; 11:e11520. [PMID: 37601317 PMCID: PMC10439823 DOI: 10.1002/aps3.11520] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 03/02/2023] [Accepted: 03/04/2023] [Indexed: 08/22/2023]
Abstract
Premise Transposable elements (TEs) make up more than half of the genomes of complex plant species and can modulate the expression of neighboring genes, producing significant variability of agronomically relevant traits. The availability of long-read sequencing technologies allows the building of genome assemblies for plant species with large and complex genomes. Unfortunately, TE annotation currently represents a bottleneck in the annotation of genome assemblies. Methods and Results We present a new functionality of the Next-Generation Sequencing Experience Platform (NGSEP) to perform efficient homology-based TE annotation. Sequences in a reference library are treated as long reads and mapped to an input genome assembly. A hierarchical annotation is then assigned by homology using the annotation of the reference library. We tested the performance of our algorithm on genome assemblies of different plant species, including Arabidopsis thaliana, Oryza sativa, Coffea humblotiana, and Triticum aestivum (bread wheat). Our algorithm outperforms traditional homology-based annotation tools in speed by a factor of three to >20, reducing the annotation time of the T. aestivum genome from months to hours, and recovering up to 80% of TEs annotated with RepeatMasker with a precision of up to 0.95. Conclusions NGSEP allows rapid analysis of TEs, especially in very large and TE-rich plant genomes.
Collapse
|
18
|
A secreted protease-like protein in Zymoseptoria tritici is responsible for avirulence on Stb9 resistance gene in wheat. PLoS Pathog 2023; 19:e1011376. [PMID: 37172036 DOI: 10.1371/journal.ppat.1011376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 05/24/2023] [Accepted: 04/19/2023] [Indexed: 05/14/2023] Open
Abstract
Zymoseptoria tritici is the fungal pathogen responsible for Septoria tritici blotch on wheat. Disease outcome in this pathosystem is partly determined by isolate-specific resistance, where wheat resistance genes recognize specific fungal factors triggering an immune response. Despite the large number of known wheat resistance genes, fungal molecular determinants involved in such cultivar-specific resistance remain largely unknown. We identified the avirulence factor AvrStb9 using association mapping and functional validation approaches. Pathotyping AvrStb9 transgenic strains on Stb9 cultivars, near isogenic lines and wheat mapping populations, showed that AvrStb9 interacts with Stb9 resistance gene, triggering an immune response. AvrStb9 encodes an unusually large avirulence gene with a predicted secretion signal and a protease domain. It belongs to a S41 protease family conserved across different filamentous fungi in the Ascomycota class and may constitute a core effector. AvrStb9 is also conserved among a global Z. tritici population and carries multiple amino acid substitutions caused by strong positive diversifying selection. These results demonstrate the contribution of an 'atypical' conserved effector protein to fungal avirulence and the role of sequence diversification in the escape of host recognition, adding to our understanding of host-pathogen interactions and the evolutionary processes underlying pathogen adaptation.
Collapse
|
19
|
Uniparental Inheritance and Recombination as Strategies to Avoid Competition and Combat Muller's Ratchet among Mitochondria in Natural Populations of the Fungus Amanita phalloides. J Fungi (Basel) 2023; 9:476. [PMID: 37108928 PMCID: PMC10142858 DOI: 10.3390/jof9040476] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 04/03/2023] [Accepted: 04/13/2023] [Indexed: 04/29/2023] Open
Abstract
Uniparental inheritance of mitochondria enables organisms to avoid the costs of intracellular competition among potentially selfish organelles. By preventing recombination, uniparental inheritance may also render a mitochondrial lineage effectively asexual and expose mitochondria to the deleterious effects of Muller's ratchet. Even among animals and plants, the evolutionary dynamics of mitochondria remain obscure, and less is known about mitochondrial inheritance among fungi. To understand mitochondrial inheritance and test for mitochondrial recombination in one species of filamentous fungus, we took a population genomics approach. We assembled and analyzed 88 mitochondrial genomes from natural populations of the invasive death cap Amanita phalloides, sampling from both California (an invaded range) and Europe (its native range). The mitochondrial genomes clustered into two distinct groups made up of 57 and 31 mushrooms, but both mitochondrial types are geographically widespread. Multiple lines of evidence, including negative correlations between linkage disequilibrium and distances between sites and coalescent analysis, suggest low rates of recombination among the mitochondria (ρ = 3.54 × 10-4). Recombination requires genetically distinct mitochondria to inhabit a cell, and recombination among A. phalloides mitochondria provides evidence for heteroplasmy as a feature of the death cap life cycle. However, no mushroom houses more than one mitochondrial genome, suggesting that heteroplasmy is rare or transient. Uniparental inheritance emerges as the primary mode of mitochondrial inheritance, even as recombination appears as a strategy to alleviate Muller's ratchet.
Collapse
|
20
|
TrEMOLO: accurate transposable element allele frequency estimation using long-read sequencing data combining assembly and mapping-based approaches. Genome Biol 2023; 24:63. [PMID: 37013657 PMCID: PMC10069131 DOI: 10.1186/s13059-023-02911-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 03/23/2023] [Indexed: 04/05/2023] Open
Abstract
Transposable Element MOnitoring with LOng-reads (TrEMOLO) is a new software that combines assembly- and mapping-based approaches to robustly detect genetic elements called transposable elements (TEs). Using high- or low-quality genome assemblies, TrEMOLO can detect most TE insertions and deletions and estimate their allele frequency in populations. Benchmarking with simulated data revealed that TrEMOLO outperforms other state-of-the-art computational tools. TE detection and frequency estimation by TrEMOLO were validated using simulated and experimental datasets. Therefore, TrEMOLO is a comprehensive and suitable tool to accurately study TE dynamics. TrEMOLO is available under GNU GPL3.0 at https://github.com/DrosophilaGenomeEvolution/TrEMOLO .
Collapse
|
21
|
High resolution genomes of multiple Xiphophorus species provide new insights into microevolution, hybrid incompatibility, and epistasis. Genome Res 2023; 33:557-571. [PMID: 37147111 PMCID: PMC10234306 DOI: 10.1101/gr.277434.122] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 03/29/2023] [Indexed: 05/07/2023]
Abstract
Because of diverged adaptative phenotypes, fish species of the genus Xiphophorus have contributed to a wide range of research for a century. Existing Xiphophorus genome assemblies are not at the chromosomal level and are prone to sequence gaps, thus hindering advancement of the intra- and inter-species differences for evolutionary, comparative, and translational biomedical studies. Herein, we assembled high-quality chromosome-level genome assemblies for three distantly related Xiphophorus species, namely, X. maculatus, X. couchianus, and X. hellerii Our overall goal is to precisely assess microevolutionary processes in the clade to ascertain molecular events that led to the divergence of the Xiphophorus species and to progress understanding of genetic incompatibility to disease. In particular, we measured intra- and inter-species divergence and assessed gene expression dysregulation in reciprocal interspecies hybrids among the three species. We found expanded gene families and positively selected genes associated with live bearing, a special mode of reproduction. We also found positively selected gene families are significantly enriched in nonpolymorphic transposable elements, suggesting the dispersal of these nonpolymorphic transposable elements has accompanied the evolution of the genes, possibly by incorporating new regulatory elements in support of the Britten-Davidson hypothesis. We characterized inter-specific polymorphisms, structural variants, and polymorphic transposable element insertions and assessed their association to interspecies hybridization-induced gene expression dysregulation related to specific disease states in humans.
Collapse
|
22
|
Abstract
Effectors play a central role in determining the outcome of plant-pathogen interactions. As key virulence proteins, effectors are collectively indispensable for disease development. By understanding the virulence mechanisms of effectors, fundamental knowledge of microbial pathogenesis and disease resistance have been revealed. Effectors are also considered double-edged swords because some of them activate immunity in disease resistant plants after being recognized by specific immune receptors, which evolved to monitor pathogen presence or activity. Characterization of effector recognition by their cognate immune receptors and the downstream immune signaling pathways is instrumental in implementing resistance. Over the past decades, substantial research effort has focused on effector biology, especially concerning their interactions with virulence targets or immune receptors in plant cells. A foundation of this research is robust identification of the effector repertoire from a given pathogen, which depends heavily on bioinformatic prediction. In this review, we summarize methodologies that have been used for effector mining in various microbial pathogens which use different effector delivery mechanisms. We also discuss current limitations and provide perspectives on how recently developed analytic tools and technologies may facilitate effector identification and hence generation of a more complete vision of host-pathogen interactions. [Formula: see text] Copyright © 2023 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Collapse
|
23
|
Abundance of Transgene Transcript Variants Associated with Somatically Active Transgenic Helitrons from Multiple T-DNA Integration Sites in Maize. Int J Mol Sci 2023; 24:ijms24076574. [PMID: 37047545 PMCID: PMC10095026 DOI: 10.3390/ijms24076574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 03/27/2023] [Accepted: 03/28/2023] [Indexed: 04/05/2023] Open
Abstract
Helitrons, a novel type of mysterious DNA transposons discovered computationally prior to bench work confirmation, are components ubiquitous in most sequenced genomes of various eukaryotes, including plants, animals, and fungi. There is a paucity of empirical evidence to elucidate the mechanism of Helitrons transposition in plants. Here, by constructing several artificial defective Helitron (dHel) reporter systems, we aim to identify the autonomous Helitrons (aHel) in maize genetically and to demonstrate the transposition and repair mechanisms of Helitrons upon the dHel-GFP excision in maize. When crossing with various inbred lines, several transgenic lines produced progeny of segregated, purple-blotched kernels, resulting from a leaky expression of the C1 gene driven by the dHel-interrupted promoter. Transcription analysis indicated that the insertion of different dHels into the C1 promoter or exon would lead to multiple distinct mRNA transcripts corresponding to transgenes in the host genome. Simple excision products and circular intermediates of dHel-GFP transposition have been detected from the leaf tissue of the seedlings in F1 hybrids of transgenic lines with corresponding c1 tester, although they failed to be detected in all primary transgenic lines. These results revealed the transposition and repair mechanism of Helitrons in maize. It is strongly suggested that this reporter system can detect the genetic activity of autonomic Helitron at the molecular level. Sequence features of dHel itself, together with the flanking regions, impact the excision activity of dHel and the regulation of the dHel on the transcription level of the host gene.
Collapse
|
24
|
Hybrid de novo genome assembly and comparative genomics of three different isolates of Gnomoniopsis castaneae. Sci Rep 2023; 13:3356. [PMID: 36849528 PMCID: PMC9971261 DOI: 10.1038/s41598-023-30496-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 02/24/2023] [Indexed: 03/01/2023] Open
Abstract
The first genome assemblies of Gnomoniopsis castaneae (syn. G. smithogilvyi), the causal agent of chestnut brown rot of kernels, shoot blight and cankers, are provided here. Specifically, the complete genome of the Italian ex-type MUT401 isolate was compared to the draft genome of a second Italian isolate (GN01) and to the ICMP 14040 isolate from New Zealand. The three genome sequences were obtained through a hybrid assembly using both short Illumina reads and long Nanopore reads, their coding sequences were annotated and compared with each other and with other Diaporthales. The information offered by the genome assembly of the three isolates represents the base of data for further application related to -omics strategies of the fungus and to develop markers for population studies at a local and global scale.
Collapse
|
25
|
Gene expression differences consistent with water loss reduction underlie desiccation tolerance of natural Drosophila populations. BMC Biol 2023; 21:35. [PMID: 36797754 PMCID: PMC9933328 DOI: 10.1186/s12915-023-01530-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 01/27/2023] [Indexed: 02/18/2023] Open
Abstract
BACKGROUND Climate change is one of the main factors shaping the distribution and biodiversity of organisms, among others by greatly altering water availability, thus exposing species and ecosystems to harsh desiccation conditions. However, most of the studies so far have focused on the effects of increased temperature. Integrating transcriptomics and physiology is key to advancing our knowledge on how species cope with desiccation stress, and these studies are still best accomplished in model organisms. RESULTS Here, we characterized the natural variation of European D. melanogaster populations across climate zones and found that strains from arid regions were similar or more tolerant to desiccation compared with strains from temperate regions. Tolerant and sensitive strains differed not only in their transcriptomic response to stress but also in their basal expression levels. We further showed that gene expression changes in tolerant strains correlated with their physiological response to desiccation stress and with their cuticular hydrocarbon composition, and functionally validated three of the candidate genes identified. Transposable elements, which are known to influence stress response across organisms, were not found to be enriched nearby differentially expressed genes. Finally, we identified several tRNA-derived small RNA fragments that differentially targeted genes in response to desiccation stress. CONCLUSIONS Overall, our results showed that basal gene expression differences across individuals should be analyzed if we are to understand the genetic basis of differential stress survival. Moreover, tRNA-derived small RNA fragments appear to be relevant across stress responses and allow for the identification of stress-response genes not detected at the transcriptional level.
Collapse
|
26
|
Abstract
The detection and quantification of transposable elements (TE) are notoriously challenging despite their relevance in evolutionary genomics and molecular ecology. The main hurdle is caused by the dependence of numerous tools on genome assemblies, whose level of completion directly affects the comparability of the results across species or populations. dnaPipeTE, whose use is demonstrated here, tackles this issue by directly performing TE detection, classification, and quantification from unassembled short reads. This chapter details all the required steps to perform a comparative analysis of the TE content between two related species, starting from the installation of a recently containerized version of the program to the post-processing of the outputs.
Collapse
|
27
|
An Overview of Best Practices for Transposable Element Identification, Classification, and Annotation in Eukaryotic Genomes. Methods Mol Biol 2023; 2607:1-23. [PMID: 36449155 PMCID: PMC10149145 DOI: 10.1007/978-1-0716-2883-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
Transposable elements (TEs) exert an increasingly diverse spectrum of influences on eukaryotic genome structure, function, and evolution. A deluge of genomic, transcriptomic, and proteomic data provides the foundation for turning essentially any non-model eukaryotic species into an emerging model to study any and all aspects of organismal biology, ultimately shaping future directions for biomedical, environmental, and biodiversity research. However, identification and annotation of the mobile genome component still lags behind the standards accepted for host gene annotation. To achieve the objective of providing every genome project with a comprehensive description of its mobilome component in addition to the standard genic and transcriptomic datasets, each step of TE identification, classification, and annotation should be focused on improving TE boundary designation, reducing identification error rates, and providing accurate information on the type and integrity of TE insertions. Here, we offer practical advice for generating TE models in de novo assemblies for non-model organisms, provide step-by-step instructions to guide inexperienced TE annotators through some of the commonly utilized TE analysis pipelines, and entertain suggestions for tool improvement which could be implemented by interested developers.
Collapse
|
28
|
Detecting Horizontal Transfer of Transposons. Methods Mol Biol 2023; 2607:45-62. [PMID: 36449157 DOI: 10.1007/978-1-0716-2883-6_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Transposable elements (TEs) are prevalent genomic components which can replicate as a function of mobilization in eukaryotes. Not only do they alter genome structure, they also play regulatory functions or organize chromatin structure. In addition to vertical parent-to-offspring inheritance, TEs can also horizontally "jump" between species, known as horizontal transposon transfer (HTT). This can rapidly alter the course of genome evolution. In this chapter, we provide a practical framework to detect HTT events. Our HTT detection framework is based on the use of sequence alignment to determine the divergence/conservation profiles of TE families to determine the history of expansion events. In summary, it includes (a) workflow of HTT detection from Ab initio identified TEs; (b) workflow for detecting HTT for specific, curated TEs; and (c) workflow for validating detected HTT candidates. Our framework covers two common scenarios of HTT detection in the modern omics era, and we believe it will serve as a valuable toolbox for the TE and genomics research community.
Collapse
|
29
|
Transposable element and host silencing activity in gigantic genomes. Front Cell Dev Biol 2023; 11:1124374. [PMID: 36910142 PMCID: PMC9998948 DOI: 10.3389/fcell.2023.1124374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 02/09/2023] [Indexed: 02/26/2023] Open
Abstract
Transposable elements (TEs) and the silencing machinery of their hosts are engaged in a germline arms-race dynamic that shapes TE accumulation and, therefore, genome size. In animal species with extremely large genomes (>10 Gb), TE accumulation has been pushed to the extreme, prompting the question of whether TE silencing also deviates from typical conditions. To address this question, we characterize TE silencing via two pathways-the piRNA pathway and KRAB-ZFP transcriptional repression-in the male and female gonads of Ranodon sibiricus, a salamander species with a ∼21 Gb genome. We quantify 1) genomic TE diversity, 2) TE expression, and 3) small RNA expression and find a significant relationship between the expression of piRNAs and TEs they target for silencing in both ovaries and testes. We also quantified TE silencing pathway gene expression in R. sibiricus and 14 other vertebrates with genome sizes ranging from 1 to 130 Gb and find no association between pathway expression and genome size. Taken together, our results reveal that the gigantic R. sibiricus genome includes at least 19 putatively active TE superfamilies, all of which are targeted by the piRNA pathway in proportion to their expression levels, suggesting comprehensive piRNA-mediated silencing. Testes have higher TE expression than ovaries, suggesting that they may contribute more to the species' high genomic TE load. We posit that apparently conflicting interpretations of TE silencing and genomic gigantism in the literature, as well as the absence of a correlation between TE silencing pathway gene expression and genome size, can be reconciled by considering whether the TE community or the host is currently "on the attack" in the arms race dynamic.
Collapse
|
30
|
Complete Genome Sequences of Septoria linicola: A Resource for Studying a Damaging Flax Pathogen. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2023; 36:59-63. [PMID: 36537804 DOI: 10.1094/mpmi-09-22-0185-a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Fungal genus Septoria causes diseases in a wide range of plants. Here, we report the first genome sequences of two strains of Septoria linicola, the causal agent of the pasmo disease of flax (Linum usitatissimum). The genome of the first strain, SE15195, was fully assembled in 16 chromosomes, while 35 unitigs were obtained for a second strain, SE14017. Structural annotations predicted 13,096 and 13,085 protein-encoding genes and transposable elements content of 19.0 and 18.1% of the genome for SE15195 and SE14017, respectively. The four smaller chromosomes 13 to 16 show genomics features of potential accessory chromosomes. The assembly of these two genomes is a new resource for studying S. linicola and improving management of pasmo. [Formula: see text] Copyright © 2022 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Collapse
|
31
|
CAULIFINDER: a pipeline for the automated detection and annotation of caulimovirid endogenous viral elements in plant genomes. Mob DNA 2022; 13:31. [PMID: 36463202 PMCID: PMC9719215 DOI: 10.1186/s13100-022-00288-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 11/24/2022] [Indexed: 12/04/2022] Open
Abstract
Plant, animal and protist genomes often contain endogenous viral elements (EVEs), which correspond to partial and sometimes entire viral genomes that have been captured in the genome of their host organism through a variety of integration mechanisms. While the number of sequenced eukaryotic genomes is rapidly increasing, the annotation and characterization of EVEs remains largely overlooked. EVEs that derive from members of the family Caulimoviridae are widespread across tracheophyte plants, and sometimes they occur in very high copy numbers. However, existing programs for annotating repetitive DNA elements in plant genomes are poor at identifying and then classifying these EVEs. Other than accurately annotating plant genomes, there is intrinsic value in a tool that could identify caulimovirid EVEs as they testify to recent or ancient host-virus interactions and provide valuable insights into virus evolution. In response to this research need, we have developed CAULIFINDER, an automated and sensitive annotation software package. CAULIFINDER consists of two complementary workflows, one to reconstruct, annotate and group caulimovirid EVEs in a given plant genome and the second to classify these genetic elements into officially recognized or tentative genera in the Caulimoviridae. We have benchmarked the CAULIFINDER package using the Vitis vinifera reference genome, which contains a rich assortment of caulimovirid EVEs that have previously been characterized using manual methods. The CAULIFINDER package is distributed in the form of a Docker image.
Collapse
|
32
|
Transposable elements maintain genome-wide heterozygosity in inbred populations. Nat Commun 2022; 13:7022. [PMID: 36396660 PMCID: PMC9672359 DOI: 10.1038/s41467-022-34795-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 11/08/2022] [Indexed: 11/18/2022] Open
Abstract
Elevated levels of inbreeding increase the risk of inbreeding depression and extinction, yet many inbred species are widespread, suggesting that inbreeding has little impact on evolutionary potential. Here, we explore the potential for transposable elements (TEs) to maintain genetic variation in functional genomic regions under extreme inbreeding. Capitalizing on the mixed mating system of Arabidopsis lyrata, we assess genome-wide heterozygosity and signatures of selection at single nucleotide polymorphisms near transposable elements across an inbreeding gradient. Under intense inbreeding, we find systematically elevated heterozygosity downstream of several TE superfamilies, associated with signatures of balancing selection. In addition, we demonstrate increased heterozygosity in stress-responsive genes that consistently occur downstream of TEs. We finally reveal that TE superfamilies are associated with specific signatures of selection that are reproducible across independent evolutionary lineages of A. lyrata. Together, our study provides an important hypothesis for the success of self-fertilizing species.
Collapse
|
33
|
A chromosome-level, haplotype-phased Vanilla planifolia genome highlights the challenge of partial endoreplication for accurate whole-genome assembly. PLANT COMMUNICATIONS 2022; 3:100330. [PMID: 35617961 PMCID: PMC9482989 DOI: 10.1016/j.xplc.2022.100330] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Revised: 04/10/2022] [Accepted: 04/27/2022] [Indexed: 06/02/2023]
Abstract
Vanilla planifolia, the species cultivated to produce one of the world's most popular flavors, is highly prone to partial genome endoreplication, which leads to highly unbalanced DNA content in cells. We report here the first molecular evidence of partial endoreplication at the chromosome scale by the assembly and annotation of an accurate haplotype-phased genome of V. planifolia. Cytogenetic data demonstrated that the diploid genome size is 4.09 Gb, with 16 chromosome pairs, although aneuploid cells are frequently observed. Using PacBio HiFi and optical mapping, we assembled and phased a diploid genome of 3.4 Gb with a scaffold N50 of 1.2 Mb and 59 128 predicted protein-coding genes. The atypical k-mer frequencies and the uneven sequencing depth observed agreed with our expectation of unbalanced genome representation. Sixty-seven percent of the genes were scattered over only 30% of the genome, putatively linking gene-rich regions and the endoreplication phenomenon. By contrast, low-coverage regions (non-endoreplicated) were rich in repeated elements but also contained 33% of the annotated genes. Furthermore, this assembly showed distinct haplotype-specific sequencing depth variation patterns, suggesting complex molecular regulation of endoreplication along the chromosomes. This high-quality, anchored assembly represents 83% of the estimated V. planifolia genome. It provides a significant step toward the elucidation of this complex genome. To support post-genomics efforts, we developed the Vanilla Genome Hub, a user-friendly integrated web portal that enables centralized access to high-throughput genomic and other omics data and interoperable use of bioinformatics tools.
Collapse
|
34
|
Intergeneric hybridization of two stickleback species leads to introgression of membrane-associated genes and invasive TE expansion. Front Genet 2022; 13:863547. [PMID: 36092944 PMCID: PMC9452749 DOI: 10.3389/fgene.2022.863547] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 07/20/2022] [Indexed: 12/03/2022] Open
Abstract
Interspecific hybridization has occurred relatively frequently during the evolution of vertebrates. This process usually abolishes reproductive isolation between the parental species. Moreover, it results in the exchange of genetic material and can lead to hybridogenic speciation. Hybridization between species has predominately been observed at the interspecific level, whereas intergeneric hybridization is rarer. Here, using whole-genome sequencing analysis, we describe clear and reliable signals of intergeneric introgression between the three-spined stickleback (Gasterosteus aculeatus) and its distant mostly freshwater relative the nine-spined stickleback (Pungitius pungitius) that inhabit northwestern Russia. Through comparative analysis, we demonstrate that such introgression phenomena apparently take place in the moderate-salinity White Sea basin, although it is not detected in Japanese sea stickleback populations. Bioinformatical analysis of the sites influenced by introgression showed that they are located near transposable elements, whereas those in protein-coding sequences are mostly found in membrane-associated and alternative splicing-related genes.
Collapse
|
35
|
Revealing Landscapes of Transposable Elements in Apis Species by Meta-Analysis. INSECTS 2022; 13:insects13080698. [PMID: 36005323 PMCID: PMC9408917 DOI: 10.3390/insects13080698] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 07/29/2022] [Accepted: 08/01/2022] [Indexed: 12/04/2022]
Abstract
Transposable elements (TEs) are grouped into several families with diverse sequences. Owing to their diversity, studies involving the detection, classification, and annotation of TEs are difficult tasks. Moreover, simple comparisons of TEs among different species with different methods can lead to misinterpretations. The genome data of several honey bee (Apis) species are available in public databases. Therefore, we conducted a meta-analysis of TEs, using 11 sets of genome data for Apis species, in order to establish data of “landscape of TEs”. Consensus TE sequences were constructed and their distributions in the Apis genomes were determined. Our results showed that TEs belonged to four to seven TE families among 13 and 15 families of TEs detected in classes I and II respectively mainly consisted of Apis TEs and that more DNA/TcMar-Mariner consensus sequences and copies were present in all Apis genomes tested. In addition, more consensus sequences and copy numbers of DNA/TcMar-Mariner were detected in Apis mellifera than in other Apis species. These results suggest that TcMar-Mariner might exert A. mellifera-specific effects on the host A. mellifera species. In conclusion, our unified approach enabled comparison of Apis genome sequences to determine the TE landscape, which provide novel evolutionary insights into Apis species.
Collapse
|
36
|
Spodoptera littoralis genome mining brings insights on the dynamic of expansion of gustatory receptors in polyphagous noctuidae. G3 (BETHESDA, MD.) 2022; 12:6598846. [PMID: 35652787 PMCID: PMC9339325 DOI: 10.1093/g3journal/jkac131] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 05/17/2022] [Indexed: 11/13/2022]
Abstract
The bitter taste, triggered via gustatory receptors, serves as an important natural defense against the ingestion of poisonous foods in animals, and the increased host breadth is usually linked to an increase in the number of gustatory receptor genes. This has been especially observed in polyphagous insect species, such as noctuid species from the Spodoptera genus. However, the dynamic and physical mechanisms leading to these gene expansions and the evolutionary pressures behind them remain elusive. Among major drivers of genome dynamics are the transposable elements but, surprisingly, their potential role in insect gustatory receptor expansion has not been considered yet. In this work, we hypothesized that transposable elements and possibly positive selection would be involved in the highly dynamic evolution of gustatory receptor in Spodoptera spp. We first sequenced de novo the full 465 Mb genome of S. littoralis, and manually annotated the main chemosensory genes, including a large repertoire of 373 gustatory receptor genes (including 19 pseudogenes). We also improved the completeness of S. frugiperda and S. litura gustatory receptor gene repertoires. Then, we annotated transposable elements and revealed that a particular category of class I retrotransposons, the SINE transposons, was significantly enriched in the vicinity of gustatory receptor gene clusters, suggesting a transposon-mediated mechanism for the formation of these clusters. Selection pressure analyses indicated that positive selection within the gustatory receptor gene family is cryptic, only 7 receptors being identified as positively selected. Altogether, our data provide a new good quality Spodoptera genome, pinpoint interesting gustatory receptor candidates for further functional studies and bring valuable genomic information on the mechanisms of gustatory receptor expansions in polyphagous insect species.
Collapse
|
37
|
Insights into agar and secondary metabolite pathways from the genome of the red alga Gracilaria domingensis (Rhodophyta, Gracilariales). JOURNAL OF PHYCOLOGY 2022; 58:406-423. [PMID: 35090189 DOI: 10.1111/jpy.13238] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 11/24/2021] [Indexed: 06/14/2023]
Abstract
Gracilariales is a clade of florideophycean red macroalgae known for being the main source of agar. We present a de novo genome assembly and annotation of Gracilaria domingensis, an agarophyte alga with flattened thallus widely distributed along Central and South American Atlantic intertidal zones. In addition to structural analysis, an organizational comparison was done with other Rhodophyta genomes. The nuclear genome has 78 Mbp, with 11,437 predicted coding genes, 4,075 of which did not have hits in sequence databases. We also predicted 1,567 noncoding RNAs, distributed in 14 classes. The plastid and mitochondrion genome structures were also obtained. Genes related to agar synthesis were identified. Genes for type II galactose sulfurylases could not be found. Genes related to ascorbate synthesis were found. These results suggest an intricate connection of cell wall polysaccharide synthesis and the redox systems through the use of L-galactose in Rhodophyta. The genome of G. domingensis should be valuable to phycological and aquacultural research, as it is the first tropical and Western Atlantic red macroalgal genome to be sequenced.
Collapse
|
38
|
Whole-genome sequence and methylome profiling of the almond [Prunus dulcis (Mill.) D.A. Webb] cultivar 'Nonpareil'. G3 (BETHESDA, MD.) 2022; 12:jkac065. [PMID: 35325123 PMCID: PMC9073694 DOI: 10.1093/g3journal/jkac065] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 03/19/2022] [Indexed: 01/27/2023]
Abstract
Almond [Prunus dulcis (Mill.) D.A. Webb] is an economically important, specialty nut crop grown almost exclusively in the United States. Breeding and improvement efforts worldwide have led to the development of key, productive cultivars, including 'Nonpareil,' which is the most widely grown almond cultivar. Thus far, genomic resources for this species have been limited, and a whole-genome assembly for 'Nonpareil' is not currently available despite its economic importance and use in almond breeding worldwide. We generated a 571X coverage genome sequence using Illumina, PacBio, and optical mapping technologies. Gene prediction revealed 49,321 putative genes using MinION Oxford nanopore and Illumina RNA sequencing, and genome annotation found that 68% of predicted models are associated with at least one biological function. Furthermore, epigenetic signatures of almond, namely DNA cytosine methylation, have been implicated in a variety of phenotypes including self-compatibility, bud dormancy, and development of noninfectious bud failure. In addition to the genome sequence and annotation, this report also provides the complete methylome of several almond tissues, including leaf, flower, endocarp, mesocarp, exocarp, and seed coat. Comparisons between methylation profiles in these tissues revealed differences in genome-wide weighted % methylation and chromosome-level methylation enrichment.
Collapse
|
39
|
A chromosome-level reference genome of Ensete glaucum gives insight into diversity and chromosomal and repetitive sequence evolution in the Musaceae. Gigascience 2022; 11:6576245. [PMID: 35488861 PMCID: PMC9055855 DOI: 10.1093/gigascience/giac027] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 01/26/2022] [Accepted: 02/22/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Ensete glaucum (2n = 2x = 18) is a giant herbaceous monocotyledonous plant in the small Musaceae family along with banana (Musa). A high-quality reference genome sequence assembly of E. glaucum is a resource for functional and evolutionary studies of Ensete, Musaceae, and the Zingiberales. FINDINGS Using Oxford Nanopore Technologies, chromosome conformation capture (Hi-C), Illumina and RNA survey sequence, supported by molecular cytogenetics, we report a high-quality 481.5 Mb genome assembly with 9 pseudo-chromosomes and 36,836 genes. A total of 55% of the genome is composed of repetitive sequences with predominantly LTR-retroelements (37%) and DNA transposons (7%). The single 5S ribosomal DNA locus had an exceptionally long monomer length of 1,056 bp, more than twice that of the monomers at multiple loci in Musa. A tandemly repeated satellite (1.1% of the genome, with no similar sequence in Musa) was present around all centromeres, together with a few copies of a long interspersed nuclear element (LINE) retroelement. The assembly enabled us to characterize in detail the chromosomal rearrangements occurring between E. glaucum and the x = 11 species of Musa. One E. glaucum chromosome has the same gene content as Musa acuminata, while others show multiple, complex, but clearly defined evolutionary rearrangements in the change between x= 9 and 11. CONCLUSIONS The advance towards a Musaceae pangenome including E. glaucum, tolerant of extreme environments, makes a complete set of gene alleles, copy number variation, and a reference for structural variation available for crop breeding and understanding environmental responses. The chromosome-scale genome assembly shows the nature of chromosomal fusion and translocation events during speciation, and features of rapid repetitive DNA change in terms of copy number, sequence, and genomic location, critical to understanding its role in diversity and evolution.
Collapse
|
40
|
Abstract
Transposable elements (TEs) are major genomic components in most eukaryotic genomes and play an important role in genome evolution. However, despite their relevance the identification of TEs is not an easy task and a number of tools were developed to tackle this problem. To better understand how they perform, we tested several widely used tools for de novo TE detection and compared their performance on both simulated data and well curated genomic sequences. As expected, tools that build TE-models performed better than k-mer counting ones, with RepeatModeler beating competitors in most datasets. However, there is a tendency for most tools to identify TE-regions in a fragmented manner and it is also frequent that small TEs or fragmented TEs are not detected. Consequently, the identification of TEs is still a challenging endeavor and it requires a significant manual curation by an experienced expert. The results will be helpful for identifying common issues associated with TE-annotation and for evaluating how comparable are the results obtained with different tools.
Collapse
|
41
|
Genome-Wide Screening of Transposable Elements in the Whitefly, Bemisia tabaci (Hemiptera: Aleyrodidae), Revealed Insertions with Potential Insecticide Resistance Implications. INSECTS 2022; 13:insects13050396. [PMID: 35621732 PMCID: PMC9143410 DOI: 10.3390/insects13050396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 04/13/2022] [Accepted: 04/15/2022] [Indexed: 12/10/2022]
Abstract
Simple Summary Transposable elements (TEs) are mobile DNA sequences hosted in the genomes of various organisms. These elements have the ability to mediate regulatory changes, which can result in changes in gene expression. Bemisia tabaci is an important agricultural pest that has been linked to several cases of insecticide resistance. In this study, we conducted a genome-wide screening of TEs in the B. tabaci genome using bioinformatics tools. Results revealed a total of 1,292,393 TE copies clustered into 4872 lineages. The TE insertion site analysis revealed 94 insertions within or near defensome genes. Abstract Transposable elements (TEs) are genetically mobile units that move from one site to another within a genome. These units can mediate regulatory changes that can result in massive changes in genes expression. In fact, a precise identification of TEs can allow the detection of the mechanisms involving these elements in gene regulation and genome evolution. In the present study, a genome-wide analysis of the Hemipteran pest Bemisia tabaci was conducted using bioinformatics tools to identify, annotate and estimate the age of TEs, in addition to their insertion sites, within or near of the defensome genes involved in insecticide resistance. Overall, 1,292,393 TE copies were identified in the B. tabaci genome grouped into 4872 lineages. A total of 699 lineages were found to belong to Class I of TEs, 1348 belong to Class II, and 2825 were uncategorized and form the largest part of TEs (28.81%). The TE age estimation revealed that the oldest TEs invasion happened 14 million years ago (MYA) and the most recent occurred 0.2 MYA with the insertion of Class II TE elements. The analysis of TE insertion sites in defensome genes revealed 94 insertions. Six of these TE insertions were found within or near previously identified differentially expressed insecticide resistance genes. These insertions may have a potential role in the observed insecticide resistance in these pests.
Collapse
|
42
|
Population-scale long-read sequencing uncovers transposable elements associated with gene expression variation and adaptive signatures in Drosophila. Nat Commun 2022; 13:1948. [PMID: 35413957 PMCID: PMC9005704 DOI: 10.1038/s41467-022-29518-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 03/15/2022] [Indexed: 12/16/2022] Open
Abstract
High quality reference genomes are crucial to understanding genome function, structure and evolution. The availability of reference genomes has allowed us to start inferring the role of genetic variation in biology, disease, and biodiversity conservation. However, analyses across organisms demonstrate that a single reference genome is not enough to capture the global genetic diversity present in populations. In this work, we generate 32 high-quality reference genomes for the well-known model species D. melanogaster and focus on the identification and analysis of transposable element variation as they are the most common type of structural variant. We show that integrating the genetic variation across natural populations from five climatic regions increases the number of detected insertions by 58%. Moreover, 26% to 57% of the insertions identified using long-reads were missed by short-reads methods. We also identify hundreds of transposable elements associated with gene expression variation and new TE variants likely to contribute to adaptive evolution in this species. Our results highlight the importance of incorporating the genetic variation present in natural populations to genomic studies, which is essential if we are to understand how genomes function and evolve. Even in well-studied species, there is still substantial natural genetic variation that has not been characterized. Here, the authors use long read sequencing to discover transposable elements in the Drosophila genome not detected by short read sequencing, and link them to gene expression.
Collapse
|
43
|
Bacterial N4-methylcytosine as an epigenetic mark in eukaryotic DNA. Nat Commun 2022; 13:1072. [PMID: 35228526 PMCID: PMC8885841 DOI: 10.1038/s41467-022-28471-w] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 01/21/2022] [Indexed: 01/04/2023] Open
Abstract
DNA modifications are used to regulate gene expression and defend against invading genetic elements. In eukaryotes, modifications predominantly involve C5-methylcytosine (5mC) and occasionally N6-methyladenine (6mA), while bacteria frequently use N4-methylcytosine (4mC) in addition to 5mC and 6mA. Here we report that 4mC can serve as an epigenetic mark in eukaryotes. Bdelloid rotifers, tiny freshwater invertebrates with transposon-poor genomes rich in foreign genes, lack canonical eukaryotic C5-methyltransferases for 5mC addition, but encode an amino-methyltransferase, N4CMT, captured from bacteria >60 Mya. N4CMT deposits 4mC at active transposons and certain tandem repeats, and fusion to a chromodomain shapes its “histone-read-DNA-write” architecture recognizing silent chromatin marks. Furthermore, amplification of SETDB1 H3K9me3 histone methyltransferases yields variants preferentially binding 4mC-DNA, suggesting “DNA-read-histone-write” partnership to maintain chromatin-based silencing. Our results show how non-native DNA methyl groups can reshape epigenetic systems to silence transposons and demonstrate the potential of horizontal gene transfer to drive regulatory innovation in eukaryotes. Eukaryotic DNA can be methylated as 5-methylcytosine and N6-methyladenine, but whether other forms of DNA methylation occur has been controversial. Here the authors show that a bacterial DNA methyltransferase was acquired >60 Mya in bdelloid rotifers that catalyzes N4-methylcytosine addition and is involved in suppression of transposon proliferation.
Collapse
|
44
|
Small RNA pathways in the nematode Ascaris in the absence of piRNAs. Nat Commun 2022; 13:837. [PMID: 35149688 PMCID: PMC8837657 DOI: 10.1038/s41467-022-28482-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 01/13/2022] [Indexed: 02/07/2023] Open
Abstract
Small RNA pathways play key and diverse regulatory roles in C. elegans, but our understanding of their conservation and contributions in other nematodes is limited. We analyzed small RNA pathways in the divergent parasitic nematode Ascaris. Ascaris has ten Argonautes with five worm-specific Argonautes (WAGOs) that associate with secondary 5’-triphosphate 22-24G-RNAs. These small RNAs target repetitive sequences or mature mRNAs and are similar to the C. elegans mutator, nuclear, and CSR-1 small RNA pathways. Even in the absence of a piRNA pathway, Ascaris CSR-1 may still function to “license” as well as fine-tune or repress gene expression. Ascaris ALG-4 and its associated 26G-RNAs target and likely repress specific mRNAs during testis meiosis. Ascaris WAGO small RNAs demonstrate target plasticity changing their targets between repeats and mRNAs during development. We provide a unique and comprehensive view of mRNA and small RNA expression throughout spermatogenesis. Overall, our study illustrates the conservation, divergence, dynamics, and flexibility of small RNA pathways in nematodes. The parasitic nematode Ascaris lacks piRNAs. Here the authors compare Argonaute proteins and small RNAs from C. elegans and Ascaris, expanding our understanding of the conservation, divergence, and flexibility of Argonautes and small RNA pathways in nematodes.
Collapse
|
45
|
Mosquito long non-coding RNAs are enriched with Transposable Elements. Genet Mol Biol 2022; 45:e20210215. [PMID: 35088819 PMCID: PMC8796034 DOI: 10.1590/1678-4685-gmb-2021-0215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 11/29/2021] [Indexed: 11/22/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) lack coding capacity and mounting evidence suggests that they have a regulatory role in diverse organisms. Most knowledge about lncRNAs comes from studies on vertebrates, including a structural association between lncRNAs and transposable elements (TEs). TE sequences are genomic parasites found in all branches of life and are particularly active and abundant in insect genomes. Here we investigate the contribution of TEs to lncRNA biogenesis in Aedes albopictus and Culex quinquefasciatus. We found that a large fraction of lncRNA loci co-occurs with TE loci in both species. Around 40% of A. albopictus and 52% of C. quinquefasciatus lncRNAs show some association with TEs. Most of the lncRNA/TE associations are represented by TE-derived sequences that are expressed as one or all exons of lncRNAs, including five lncRNAs that seem to influence immune-related genes involved in antiviral response. The contribution of TEs to lncRNAs also varies among the different types of TEs. The Gypsi superfamily is particularly enriched in lncRNAs sequences. In sum, this study demonstrates that transposable elements substantially contribute to lncRNAs biogenesis in A. albopictus and C. quinquefasciatus and may have an impact on regulatory modulation in these species.
Collapse
|
46
|
Cantaloupe melon genome reveals 3D chromatin features and structural relationship with the ancestral cucurbitaceae karyotype. iScience 2022; 25:103696. [PMID: 35059606 PMCID: PMC8760558 DOI: 10.1016/j.isci.2021.103696] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 12/01/2021] [Accepted: 12/20/2021] [Indexed: 11/21/2022] Open
Abstract
Cucumis melo displays a large diversity of horticultural groups with cantaloupe melon the most cultivated type. Using a combination of single-molecule sequencing, 10X Genomics link-reads, high-density optical and genetic maps, and chromosome conformation capture (Hi-C), we assembled a chromosome scale C. melo var. cantalupensis Charentais mono genome. Integration of RNA-seq, MeDip-seq, ChIP-seq, and Hi-C data revealed a widespread compartmentalization of the melon genome, segregating constitutive heterochromatin and euchromatin. Genome-wide comparative and evolutionary analysis between melon botanical groups identified Charentais mono genome increasingly more divergent from Harukei-3 (reticulatus), Payzawat (inodorus), and HS (ssp. agrestis) genomes. To assess the paleohistory of the Cucurbitaceae, we reconstructed the ancestral Cucurbitaceae karyotype and compared it to sequenced cucurbit genomes. In contrast to other species that experienced massive chromosome shuffling, melon has retained the ancestral genome structure. We provide comprehensive genomic resources and new insights in the diversity of melon horticultural groups and evolution of cucurbits. We provide a chromosome scale C. melo var. cantalupensis Charentais mono genome Epigenomic analysis revealed a widespread compartmentalization of the melon genome We reconstructed the ancestral Cucurbitaceae karyotype Melon has retained the ancestral Cucurbitaceae genome structure
Collapse
|
47
|
Finding and Characterizing Repeats in Plant Genomes. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2443:327-385. [PMID: 35037215 DOI: 10.1007/978-1-0716-2067-0_18] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Plant genomes contain a particularly high proportion of repeated structures of various types. This chapter proposes a guided tour of the available software that can help biologists to scan automatically for these repeats in sequence data or check hypothetical models intended to characterize their structures. Since transposable elements (TEs) are a major source of repeats in plants, many methods have been used or developed for this broad class of sequences. They are representative of the range of tools available for other classes of repeats and we have provided two sections on this topic (for the analysis of genomes or directly of sequenced reads), as well as a selection of the main existing software. It may be hard to keep up with the profusion of proposals in this dynamic field and the rest of the chapter is devoted to the foundations of an efficient search for repeats and more complex patterns. We first introduce the key concepts of the art of indexing and mapping or querying sequences. We end the chapter with the more prospective issue of building models of repeat families. We present the Machine Learning approach first, seeking to build predictors automatically for some families of ET, from a set of sequences known to belong to this family. A second approach, the linguistic (or syntactic) approach, allows biologists to describe themselves and check the validity of models of their favorite repeat family.
Collapse
|
48
|
Abstract
The Fusarium oxysporum species complex (FOSC) is a group of closely related plant pathogens long-considered strictly clonal, as sexual stages have never been recorded. Several studies have questioned whether recombination occurs in FOSC, and if it occurs its nature and frequency are unknown. We analysed 410 assembled genomes to answer whether FOSC diversified by occasional sexual reproduction interspersed with numerous cycles of asexual reproduction akin to a model of predominant clonal evolution (PCE). We tested the hypothesis that sexual reproduction occurred in the evolutionary history of FOSC by examining the distribution of idiomorphs at the mating locus, phylogenetic conflict and independent measures of recombination from genome-wide SNPs and genes. A phylogenomic dataset of 40 single copy orthologs was used to define structure a priori within FOSC based on genealogical concordance. Recombination within FOSC was tested using the pairwise homoplasy index and divergence ages were estimated by molecular dating. We called SNPs from assembled genomes using a k-mer approach and tested for significant linkage disequilibrium as an indication of PCE. We clone-corrected and tested whether SNPs were randomly associated as an indication of recombination. Our analyses provide evidence for sexual or parasexual reproduction within, but not between, clades of FOSC that diversified from a most recent common ancestor about 500 000 years ago. There was no evidence of substructure based on geography or host that might indicate how clades diversified. Competing evolutionary hypotheses for FOSC are discussed in the context of our results.
Collapse
|
49
|
Bioinformatics Approaches for Determining the Functional Impact of Repetitive Elements on Non-coding RNAs. Methods Mol Biol 2022; 2509:315-340. [PMID: 35796972 DOI: 10.1007/978-1-0716-2380-0_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
With a large number of annotated non-coding RNAs (ncRNAs), repetitive sequences are found to constitute functional components (termed as repetitive elements) in ncRNAs that perform specific biological functions. Bioinformatics analysis is a powerful tool for improving our understanding of the role of repetitive elements in ncRNAs. This chapter summarizes recent findings that reveal the role of repetitive elements in ncRNAs. Furthermore, relevant bioinformatics approaches are systematically reviewed, which promises to provide valuable resources for studying the functional impact of repetitive elements on ncRNAs.
Collapse
|
50
|
Transposable element variants and their potential adaptive impact in urban populations of the malaria vector Anopheles coluzzii. Genome Res 2021; 32:189-202. [PMID: 34965939 PMCID: PMC8744685 DOI: 10.1101/gr.275761.121] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 11/24/2021] [Indexed: 11/28/2022]
Abstract
Anopheles coluzzii is one of the primary vectors of human malaria in sub-Saharan Africa. Recently, it has spread into the main cities of Central Africa threatening vector control programs. The adaptation of An. coluzzii to urban environments partly results from an increased tolerance to organic pollution and insecticides. Some of the molecular mechanisms for ecological adaptation are known, but the role of transposable elements (TEs) in the adaptive processes of this species has not been studied yet. As a first step toward assessing the role of TEs in rapid urban adaptation, we sequenced using long reads six An. coluzzii genomes from natural breeding sites in two major Central Africa cities. We de novo annotated TEs in these genomes and in an additional high-quality An. coluzzii genome, and we identified 64 new TE families. TEs were nonrandomly distributed throughout the genome with significant differences in the number of insertions of several superfamilies across the studied genomes. We identified seven putatively active families with insertions near genes with functions related to vectorial capacity, and several TEs that may provide promoter and transcription factor binding sites to insecticide resistance and immune-related genes. Overall, the analysis of multiple high-quality genomes allowed us to generate the most comprehensive TE annotation in this species to date and identify several TE insertions that could potentially impact both genome architecture and the regulation of functionally relevant genes. These results provide a basis for future studies of the impact of TEs on the biology of An. coluzzii.
Collapse
|