1
|
Faulk C. Genome skimming with nanopore sequencing precisely determines global and transposon DNA methylation in vertebrates. Genome Res 2023; 33:948-956. [PMID: 37442577 PMCID: PMC10519409 DOI: 10.1101/gr.277743.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 06/07/2023] [Indexed: 07/15/2023]
Abstract
Genome skimming is defined as low-pass sequencing below 0.05× coverage and is typically used for mitochondrial genome recovery and species identification. Long-read nanopore sequencers enable simultaneous reading of both DNA sequence and methylation and can multiplex samples for low-cost genome skimming. Here I present nanopore sequencing as a highly precise platform for global DNA methylation and transposon assessment. At coverage of just 0.001×, or 30 Mb of reads, accuracy is sub-1%. Biological and technical replicates validate high precision. Skimming 40 vertebrate species reveals conserved patterns of global methylation consistent with whole-genome bisulfite sequencing and an average mapping rate >97%. Genome size directly correlates to global DNA methylation, explaining 39% of its variance. Accurate SINE and LINE transposon methylation in both the mouse and primates can be obtained with just 0.0001× coverage, or 3 Mb of reads. Sample multiplexing, field portability, and the low price of this instrument combine to make genome skimming for DNA methylation an accessible method for epigenetic assessment from ecology to epidemiology and for low-resource groups.
Collapse
Affiliation(s)
- Christopher Faulk
- Department of Animal Science, College of Food, Agricultural and Natural Resource Sciences, University of Minnesota, Minneapolis, Minnesota 55455, USA
| |
Collapse
|
2
|
Faulk C. Genome Skimming with Nanopore Sequencing Precisely Determines Global and Transposon DNA Methylation in Vertebrates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.25.525540. [PMID: 36747817 PMCID: PMC9900854 DOI: 10.1101/2023.01.25.525540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Genome skimming is defined as low-pass sequencing below 0.05X coverage and is typically used for mitochondrial genome recovery and species identification. Long read nanopore sequencers enable simultaneous reading of both DNA sequence and methylation and can multiplex samples for low-cost genome skimming. Here I present nanopore sequencing as a highly precise platform for global DNA methylation and transposon assessment. At coverage of just 0.001X, or 30 Mb of reads, accuracy is sub-1%. Biological and technical replicates validate high precision. Skimming 40 vertebrate species reveals conserved patterns of global methylation consistent with whole genome bisulfite sequencing and an average mapping rate above 97%. Genome size directly correlates to global DNA methylation, explaining 44% of its variance. Accurate SINE and LINE transposon methylation in both mouse and primates can be obtained with just 0.0001X coverage, or 3 Mb of reads. Sample multiplexing, field portability, and the low price of this instrument combine to make genome skimming for DNA methylation an accessible method for epigenetic assessment from ecology to epidemiology, and by low resource groups.
Collapse
Affiliation(s)
- Christopher Faulk
- Department of Animal Science, College of Food, Agricultural and Natural Resource Sciences, University of Minnesota
| |
Collapse
|
3
|
Wötzel S, Andrello M, Albani MC, Koch MA, Coupland G, Gugerli F. Arabis alpina: A perennial model plant for ecological genomics and life-history evolution. Mol Ecol Resour 2021; 22:468-486. [PMID: 34415668 PMCID: PMC9293087 DOI: 10.1111/1755-0998.13490] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 07/28/2021] [Accepted: 08/16/2021] [Indexed: 01/03/2023]
Abstract
Many model organisms were chosen and achieved prominence because of an advantageous combination of their life‐history characteristics, genetic properties and also practical considerations. Discoveries made in Arabidopsis thaliana, the most renowned noncrop plant model species, have markedly stimulated studies in other species with different biology. Within the family Brassicaceae, the arctic–alpine Arabis alpina has become a model complementary to Arabidopsis thaliana to study the evolution of life‐history traits, such as perenniality, and ecological genomics in harsh environments. In this review, we provide an overview of the properties that facilitated the rapid emergence of A. alpina as a plant model. We summarize the evolutionary history of A. alpina, including genomic aspects, the diversification of its mating system and demographic properties, and we discuss recent progress in the molecular dissection of developmental traits that are related to its perennial life history and environmental adaptation. From this published knowledge, we derive open questions that might inspire future research in A. alpina, other Brassicaceae species or more distantly related plant families.
Collapse
Affiliation(s)
- Stefan Wötzel
- Institute of Ecology, Evolution and Diversity, Goethe University Frankfurt and Senckenberg Biodiversity and Climate Research Centre, Frankfurt (Main), Germany
| | - Marco Andrello
- Institute for the Study of Anthropic Impacts and Sustainability in the Marine Environment, National Research Council, CNR-IAS, Rome, Italy
| | - Maria C Albani
- Institute for Plant Sciences, University of Cologne, Cologne, Germany
| | - Marcus A Koch
- Biodiversity and Plant Systematics, Centre for Organismal Studies (COS), Heidelberg University, Heidelberg, Germany
| | - George Coupland
- Department of Plant Development Biology, MPI for Plant Breeding Research, Cologne, Germany
| | - Felix Gugerli
- WSL Swiss Federal Research Institute, Birmensdorf, Switzerland
| |
Collapse
|
4
|
Nowak MD, Birkeland S, Mandáková T, Roy Choudhury R, Guo X, Gustafsson ALS, Gizaw A, Schrøder‐Nielsen A, Fracassetti M, Brysting AK, Rieseberg L, Slotte T, Parisod C, Lysak MA, Brochmann C. The genome of Draba nivalis shows signatures of adaptation to the extreme environmental stresses of the Arctic. Mol Ecol Resour 2021; 21:661-676. [PMID: 33058468 PMCID: PMC7983928 DOI: 10.1111/1755-0998.13280] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Revised: 08/26/2020] [Accepted: 10/09/2020] [Indexed: 01/04/2023]
Abstract
The Arctic is one of the most extreme terrestrial environments on the planet. Here, we present the first chromosome-scale genome assembly of a plant adapted to the high Arctic, Draba nivalis (Brassicaceae), an attractive model species for studying plant adaptation to the stresses imposed by this harsh environment. We used an iterative scaffolding strategy with data from short-reads, single-molecule long reads, proximity ligation data, and a genetic map to produce a 302 Mb assembly that is highly contiguous with 91.6% assembled into eight chromosomes (the base chromosome number). To identify candidate genes and gene families that may have facilitated adaptation to Arctic environmental stresses, we performed comparative genomic analyses with nine non-Arctic Brassicaceae species. We show that the D. nivalis genome contains expanded suites of genes associated with drought and cold stress (e.g., related to the maintenance of oxidation-reduction homeostasis, meiosis, and signaling pathways). The expansions of gene families associated with these functions appear to be driven in part by the activity of transposable elements. Tests of positive selection identify suites of candidate genes associated with meiosis and photoperiodism, as well as cold, drought, and oxidative stress responses. Our results reveal a multifaceted landscape of stress adaptation in the D. nivalis genome, offering avenues for the continued development of this species as an Arctic model plant.
Collapse
Affiliation(s)
| | | | | | | | - Xinyi Guo
- CEITECMasaryk UniversityBrnoCzech Republic
| | | | - Abel Gizaw
- Natural History MuseumUniversity of OsloOsloNorway
| | | | - Marco Fracassetti
- Science for Life Laboratory and Department of EcologyEnvironment and Plant ScienceStockholm UniversityStockholmSweden
| | - Anne K. Brysting
- Centre for Ecological and Evolutionary SynthesisDepartment of BiosciencesUniversity of OsloOsloNorway
| | - Loren Rieseberg
- Department of BotanyThe University of British ColumbiaVancouverBCCanada
| | - Tanja Slotte
- Science for Life Laboratory and Department of EcologyEnvironment and Plant ScienceStockholm UniversityStockholmSweden
| | | | | | | |
Collapse
|
5
|
Wos G, Choudhury RR, Kolář F, Parisod C. Transcriptional activity of transposable elements along an elevational gradient in Arabidopsis arenosa. Mob DNA 2021; 12:7. [PMID: 33639991 PMCID: PMC7916287 DOI: 10.1186/s13100-021-00236-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 02/16/2021] [Indexed: 01/10/2023] Open
Abstract
Background Plant genomes can respond rapidly to environmental changes and transposable elements (TEs) arise as important drivers contributing to genome dynamics. Although some elements were reported to be induced by various abiotic or biotic factors, there is a lack of general understanding on how environment influences the activity and diversity of TEs. Here, we combined common garden experiment with short-read sequencing to investigate genomic abundance and expression of 2245 consensus TE sequences (containing retrotransposons and DNA transposons) in an alpine environment in Arabidopsis arenosa. To disentangle general trends from local differentiation, we leveraged four foothill-alpine population pairs from different mountain regions. Seeds of each of the eight populations were raised under four treatments that differed in temperature and irradiance, two factors varying with elevation. RNA-seq analysis was performed on leaves of young plants to test for the effect of elevation and subsequently of temperature and irradiance on expression of TE sequences. Results Genomic abundance of the 2245 consensus TE sequences varied greatly between the mountain regions in line with neutral divergence among the regions, representing distinct genetic lineages of A. arenosa. Accounting for intraspecific variation in abundance, we found consistent transcriptomic response for some TE sequences across the different pairs of foothill-alpine populations suggesting parallelism in TE expression. In particular expression of retrotransposon LTR Copia (e.g. Ivana and Ale clades) and LTR Gypsy (e.g. Athila and CRM clades) but also non-LTR LINE or DNA transposon TIR MuDR consistently varied with elevation of origin. TE sequences responding specifically to temperature and irradiance belonged to the same classes as well as additional TE clades containing potentially stress-responsive elements (e.g. LTR Copia Sire and Tar, LTR Gypsy Reina). Conclusions Our study demonstrated that the A. arenosa genome harbours a considerable diversity of TE sequences whose abundance and expression response varies across its native range. Some TE clades may contain transcriptionally active elements responding to a natural environmental gradient. This may further contribute to genetic variation between populations and may ultimately provide new regulatory mechanisms to face environmental challenges. Supplementary Information The online version contains supplementary material available at 10.1186/s13100-021-00236-0.
Collapse
Affiliation(s)
- Guillaume Wos
- Department of Botany, Charles University, 128 01, Prague, Czech Republic.
| | | | - Filip Kolář
- Department of Botany, Charles University, 128 01, Prague, Czech Republic
| | - Christian Parisod
- Institute of Plant Sciences, University of Bern, 3013, Bern, Switzerland
| |
Collapse
|
6
|
Choudhury RR, Parisod C. Jumping genes: Genomic ballast or powerhouse of biological diversification. Mol Ecol 2019; 26:4587-4590. [PMID: 28949090 DOI: 10.1111/mec.14247] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2017] [Revised: 07/04/2017] [Accepted: 07/06/2017] [Indexed: 01/08/2023]
Abstract
Studying hybridization has the potential to elucidate challenging questions in evolutionary biology such as the nature of adaptive genetic variation and reproductive isolation. A growing body of work highlights that the merging of divergent genomes goes beyond the reshuffling of standing variation from related species and promotes mutations (Abbott et al., ). However, to what extent such genome instability generates evolutionary significant variation remains largely elusive. In this issue of Molecular Ecology, Dennenmoser et al. () report considerable dynamics of transposable elements (TEs) in a recent invasive fish species of hybrid origin (Cottus; Figure ). It adds to the recent examples from plants to support TE-specific genome variation following hybridization. Insights from early, as well as established, hybrids are largely coherent with increased TE activity, and this fish system thus represents an inspiring opportunity to further address the possible association between genome dynamics and "rapid evolution of hybrid species." This work based on genome (re)sequencing contrasts with prior transcriptomics or PCR-based studies of TEs and illustrates how unprecedented amount of information promises a better understanding of the multiple patterns of variation across eukaryotic genomes; provided that we get the better of methodological advances. As discussed here, unbiased assessment of TE variation from genome surveys indeed remains a challenge precluding firm conclusions to be reached about the evolutionary significance of TEs. Despite methodological and conceptual developments that appear necessary to unambiguously uncover the unexplored iceberg below the known tip, the role of coding genes vs. TEs in promoting adaptation and speciation might be clarified in a not so remote future.
Collapse
Affiliation(s)
| | - Christian Parisod
- Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
7
|
Rogivue A, Choudhury RR, Zoller S, Joost S, Felber F, Kasser M, Parisod C, Gugerli F. Genome-wide variation in nucleotides and retrotransposons in alpine populations of Arabis alpina (Brassicaceae). Mol Ecol Resour 2019; 19:773-787. [PMID: 30636378 DOI: 10.1111/1755-0998.12991] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Revised: 12/14/2018] [Accepted: 12/17/2018] [Indexed: 02/01/2023]
Abstract
Advances in high-throughput sequencing have promoted the collection of reference genomes and genome-wide diversity. However, the assessment of genomic variation among populations has hitherto mainly been surveyed through single-nucleotide polymorphisms (SNPs) and largely ignored the often major fraction of genomes represented by transposable elements (TEs). Despite accumulating evidence supporting the evolutionary significance of TEs, comprehensive surveys remain scarce. Here, we sequenced the full genomes of 304 individuals of Arabis alpina sampled from four nearby natural populations to genotype SNPs as well as polymorphic long terminal repeat retrotransposons (polymorphic TEs; i.e., presence/absence of TE insertions at specific loci). We identified 291,396 SNPs and 20,548 polymorphic TEs, comparing their contributions to genomic diversity and divergence across populations. Few SNPs were shared among populations and overall showed high population-specific variation, whereas most polymorphic TEs segregated among populations. The genomic context of these two classes of variants further highlighted candidate adaptive loci having a putative impact on functional genes. In particular, 4.96% of the SNPs were identified as nonsynonymous or affecting start/stop codons. In contrast, 43% of the polymorphic TEs were present next to Arabis genes enriched in functional categories related to the regulation of reproduction and responses to biotic as well as abiotic stresses. This unprecedented data set, mapping variation gained from SNPs and complementary polymorphic TEs within and among populations, will serve as a rich resource for addressing microevolutionary processes shaping genome variation.
Collapse
Affiliation(s)
- Aude Rogivue
- WSL Swiss Federal Research Institute, Birmensdorf, Switzerland
| | - Rimjhim R Choudhury
- University of Neuchâtel, Neuchâtel, Switzerland.,Institute of Plant Sciences, University of Berne, Bern, Switzerland
| | - Stefan Zoller
- Genetic Diversity Centre, ETH Zürich, Zürich, Switzerland
| | - Stéphane Joost
- Laboratory of Geographic Information Systems (LASIG), School of Architecture, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - François Felber
- University of Neuchâtel, Neuchâtel, Switzerland.,Musée et Jardins botaniques cantonaux, Lausanne, Switzerland
| | | | | | - Felix Gugerli
- WSL Swiss Federal Research Institute, Birmensdorf, Switzerland
| |
Collapse
|
8
|
Choudhury RR, Rogivue A, Gugerli F, Parisod C. Impact of polymorphic transposable elements on linkage disequilibrium along chromosomes. Mol Ecol 2019; 28:1550-1562. [DOI: 10.1111/mec.15014] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 12/26/2018] [Indexed: 01/03/2023]
Affiliation(s)
| | - Aude Rogivue
- WSL Swiss Federal Research Institute Birmensdorf Switzerland
| | - Felix Gugerli
- WSL Swiss Federal Research Institute Birmensdorf Switzerland
| | | |
Collapse
|
9
|
Lee YI, Yap JW, Izan S, Leitch IJ, Fay MF, Lee YC, Hidalgo O, Dodsworth S, Smulders MJM, Gravendeel B, Leitch AR. Satellite DNA in Paphiopedilum subgenus Parvisepalum as revealed by high-throughput sequencing and fluorescent in situ hybridization. BMC Genomics 2018; 19:578. [PMID: 30068293 PMCID: PMC6090851 DOI: 10.1186/s12864-018-4956-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2017] [Accepted: 07/23/2018] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Satellite DNA is a rapidly diverging, largely repetitive DNA component of many eukaryotic genomes. Here we analyse the evolutionary dynamics of a satellite DNA repeat in the genomes of a group of Asian subtropical lady slipper orchids (Paphiopedilum subgenus Parvisepalum and representative species in the other subgenera/sections across the genus). A new satellite repeat in Paphiopedilum subgenus Parvisepalum, SatA, was identified and characterized using the RepeatExplorer pipeline in HiSeq Illumina reads from P. armeniacum (2n = 26). Reconstructed monomers were used to design a satellite-specific fluorescent in situ hybridization (FISH) probe. The data were also analysed within a phylogenetic framework built using the internal transcribed spacer (ITS) sequences of 45S nuclear ribosomal DNA. RESULTS SatA comprises c. 14.5% of the P. armeniacum genome and is specific to subgenus Parvisepalum. It is composed of four primary monomers that range from 230 to 359 bp and contains multiple inverted repeat regions with hairpin-loop motifs. A new karyotype of P. vietnamense (2n = 28) is presented and shows that the chromosome number in subgenus Parvisepalum is not conserved at 2n = 26, as previously reported. The physical locations of SatA sequences were visualised on the chromosomes of all seven Paphiopedilum species of subgenus Parvisepalum (2n = 26-28), together with the 5S and 45S rDNA loci using FISH. The SatA repeats were predominantly localisedin the centromeric, peri-centromeric and sub-telocentric chromosome regions, but the exact distribution pattern was species-specific. CONCLUSIONS We conclude that the newly discovered, highly abundant and rapidly evolving satellite sequence SatA is specific to Paphiopedilum subgenus Parvisepalum. SatA and rDNA chromosomal distributions are characteristic of species, and comparisons between species reveal that the distribution patterns generate a strong phylogenetic signal. We also conclude that the ancestral chromosome number of subgenus Parvisepalum and indeed of all Paphiopedilum could be either 2n = 26 or 28, if P. vietnamense is sister to all species in the subgenus as suggested by the ITS data.
Collapse
Affiliation(s)
- Yung-I Lee
- Biology Department, National Museum of Natural Science, No 1, Kuan-Chien Rd, 40453 Taichung, Taiwan, Republic of China
- Department of Life Sciences, National Chung Hsing University, 40227 Taichung, Taiwan, Republic of China
| | - Jing Wei Yap
- School of Biological and Chemical Sciences, Queen Mary University of London, London, E1 4NS UK
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AB UK
- Forest Research Institute Malaysia (FRIM), 52109 Kepong, Selangor Darul Ehsan Malaysia
| | - Shairul Izan
- Plant Breeding, Wageningen University & Research, P.O. Box 386, NL-6700 AJ Wageningen, The Netherlands
- Department of Crop Science, Faculty of Agriculture, University Putra Malaysia (UPM) Serdang, Serdang, Selangor Malaysia
| | - Ilia J. Leitch
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AB UK
| | - Michael F. Fay
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AB UK
- School of Plant Biology, University of Western Australia, Crawley, WA 6009 Australia
| | - Yi-Ching Lee
- Biology Department, National Museum of Natural Science, No 1, Kuan-Chien Rd, 40453 Taichung, Taiwan, Republic of China
| | - Oriane Hidalgo
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AB UK
| | - Steven Dodsworth
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AB UK
| | - Marinus J. M. Smulders
- Plant Breeding, Wageningen University & Research, P.O. Box 386, NL-6700 AJ Wageningen, The Netherlands
| | - Barbara Gravendeel
- Endless Forms Group, Naturalis Biodiversity Center, Vondellaan 55, 2332 AA Leiden, The Netherlands
- Faculty of Science and Technology, University of Applied Sciences Leiden, Zernikedreef 11, 2333 CK Leiden, The Netherlands
- Institute Biology Leiden, Leiden University, Sylviusweg 72, 2333 BE Leiden, The Netherlands
| | - Andrew R. Leitch
- School of Biological and Chemical Sciences, Queen Mary University of London, London, E1 4NS UK
| |
Collapse
|