1
|
Ernst E, Abramson B, Acosta K, Hoang PTN, Mateo-Elizalde C, Schubert V, Pasaribu B, Albert PS, Hartwick N, Colt K, Aylward A, Ramu U, Birchler JA, Schubert I, Lam E, Michael TP, Martienssen RA. Duckweed genomes and epigenomes underlie triploid hybridization and clonal reproduction. Curr Biol 2025; 35:1828-1847.e9. [PMID: 40174586 PMCID: PMC12015598 DOI: 10.1016/j.cub.2025.03.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2025] [Revised: 03/03/2025] [Accepted: 03/11/2025] [Indexed: 04/04/2025]
Abstract
The Lemnaceae (duckweeds) are the world's smallest but fastest-growing flowering plants. Prolific clonal propagation facilitates continuous micro-cropping for plant-based protein and starch production and holds tremendous promise for sequestration of atmospheric CO2. Here, we present chromosomal assemblies, annotations, and phylogenomic analysis of Lemna genomes that uncover candidate genes responsible for the unique metabolic and developmental traits of the family, such as anatomical reduction, adaxial stomata, lack of stomatal closure, and carbon sequestration via crystalline calcium oxalate. Lemnaceae have selectively lost genes required for RNA interference, including Argonaute genes required for reproductive isolation (the triploid block) and haploid gamete formation. Triploid hybrids arise commonly among Lemna, and we have found mutations in highly conserved meiotic crossover genes that could support polyploid meiosis. Further, mapping centromeres by chromatin immunoprecipitation suggests their epigenetic origin despite divergence of underlying tandem repeats and centromeric retrotransposons. Syntenic comparisons with Wolffia and Spirodela reveal that diversification of these genera coincided with the "Azolla event" in the mid-Eocene, during which aquatic macrophytes reduced high atmospheric CO2 levels to those of the current ice age. Facile regeneration of transgenic fronds from tissue culture, aided by reduced epigenetic silencing, makes Lemna a powerful biotechnological platform, as exemplified by recent engineering of high-oil Lemna that outperforms oil-seed crops.
Collapse
Affiliation(s)
- Evan Ernst
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA
| | - Bradley Abramson
- Plant Molecular and Cellular Biology Laboratory, the Salk Institute for Biological Studies, 10010 N Torrey Pines Rd, La Jolla, CA 92037, USA
| | - Kenneth Acosta
- Department of Plant Biology, Rutgers, The State University of New Jersey, 59 Dudley Road, New Brunswick, NJ 08901, USA
| | - Phuong T N Hoang
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Corrensstraße 3, D-06466 Stadt Seeland, Germany; Biology Faculty, Dalat University, 1 Phu Dong Thien Vuong, Dalat City 670000, Vietnam
| | - Cristian Mateo-Elizalde
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA
| | - Veit Schubert
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Corrensstraße 3, D-06466 Stadt Seeland, Germany
| | - Buntora Pasaribu
- Department of Plant Biology, Rutgers, The State University of New Jersey, 59 Dudley Road, New Brunswick, NJ 08901, USA; Department of Marine Sciences, Faculty of Fisheries and Marine Sciences, Universitas Padjadjaran, Bandung Sumedang Highway KM 21, Jatinangor 40600, Indonesia
| | - Patrice S Albert
- Biological Sciences, University of Missouri, Columbia, 105 Tucker Hall, Columbia, MO 65211, USA
| | - Nolan Hartwick
- Plant Molecular and Cellular Biology Laboratory, the Salk Institute for Biological Studies, 10010 N Torrey Pines Rd, La Jolla, CA 92037, USA
| | - Kelly Colt
- Plant Molecular and Cellular Biology Laboratory, the Salk Institute for Biological Studies, 10010 N Torrey Pines Rd, La Jolla, CA 92037, USA
| | - Anthony Aylward
- Plant Molecular and Cellular Biology Laboratory, the Salk Institute for Biological Studies, 10010 N Torrey Pines Rd, La Jolla, CA 92037, USA
| | - Umamaheswari Ramu
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA
| | - James A Birchler
- Biological Sciences, University of Missouri, Columbia, 105 Tucker Hall, Columbia, MO 65211, USA
| | - Ingo Schubert
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Corrensstraße 3, D-06466 Stadt Seeland, Germany
| | - Eric Lam
- Department of Plant Biology, Rutgers, The State University of New Jersey, 59 Dudley Road, New Brunswick, NJ 08901, USA.
| | - Todd P Michael
- Plant Molecular and Cellular Biology Laboratory, the Salk Institute for Biological Studies, 10010 N Torrey Pines Rd, La Jolla, CA 92037, USA.
| | - Robert A Martienssen
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA.
| |
Collapse
|
2
|
Langsiri N, Meyer W, Irinyi L, Worasilchai N, Pombubpa N, Wongsurawat T, Jenjaroenpun P, Luangsa-Ard JJ, Chindamporn A. Optimizing fungal DNA extraction and purification for Oxford Nanopore untargeted shotgun metagenomic sequencing from simulated hemoculture specimens. mSystems 2025:e0116624. [PMID: 40197053 DOI: 10.1128/msystems.01166-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Accepted: 03/06/2025] [Indexed: 04/09/2025] Open
Abstract
Long-read metagenomics provides a promising alternative approach to fungal identification, circumventing methodological biases, associated with DNA amplification, which is a prerequisite for DNA barcoding/metabarcoding based on the primary fungal DNA barcode (Internal Transcribed Spacer (ITS) region). However, DNA extraction for long-read sequencing-based fungal identification poses a significant challenge, as obtaining long and intact fungal DNA is imperative. Comparing different lysis methods showed that chemical lysis with CTAB/SDS generated DNA from pure fungal cultures with high yields (ranging from 11.20 ± 0.17 µg to 22.99 ± 2.22 µg depending on the species) while preserving integrity. Evaluating the efficacy of human DNA depletion protocols demonstrated an 88.73% reduction in human reads and a 99.53% increase in fungal reads compared to the untreated yeast-spiked human blood control. Evaluation of the developed DNA extraction protocol on simulated clinical hemocultures revealed that the obtained DNA sequences exceed 10 kb in length, enabling a highly efficient sequencing run with over 80% active pores. The quality of the DNA, as indicated by the 260/280 and 260/230 ratios obtained from NanoDrop spectrophotometer readings, exceeded 1.8 and 2.0, respectively. This demonstrated the great potential of the herein optimized protocol to extract high-quality fungal DNA from clinical specimens enabling long-read metagenomics sequencing. IMPORTANCE A novel streamlined DNA extraction protocol was developed to efficiently isolate high molecular weight fungal DNA from hemoculture samples, which is crucial for long-read sequencing applications. By eliminating the need for labor-intensive and shear-force-inducing steps, such as liquid nitrogen grinding or bead beating, the protocol is more user-friendly and better suited for clinical laboratory settings. The automation of cleanup and extraction steps further shortens the overall turnaround time to under 6 hours. Although not specifically designed for ultra-long DNA extraction, this protocol effectively supports fungal identification through Oxford Nanopore Technology (ONT) sequencing. It yields high molecular weight DNA, resulting in longer sequence fragments that improve the number of fungal reads over human reads. Future improvements, including adaptive sampling technology, could further simplify the process by reducing the need for human DNA depletion, paving the way for more automated, bioinformatics-driven workflows.
Collapse
Affiliation(s)
- Nattapong Langsiri
- Department of Microbiology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Wieland Meyer
- Westerdijk Fungal Biodiversity Institute, Utrecht, the Netherlands
- Molecular Mycology Research Laboratory, Centre for Infectious Diseases and Microbiology, Westmead Clinical School, Sydney Medical School, Faculty of Medicine and Health, Sydney Infectious Diseases Institute, University of Sydney, Westmead Hospital, Research and Education Network, Westmead, New South Wales, Australia
| | - Laszlo Irinyi
- Molecular Mycology Research Laboratory, Centre for Infectious Diseases and Microbiology, Westmead Clinical School, Sydney Medical School, Faculty of Medicine and Health, Sydney Infectious Diseases Institute, University of Sydney, Westmead Hospital, Research and Education Network, Westmead, New South Wales, Australia
| | - Navaporn Worasilchai
- Department of Transfusion Medicine and Clinical Microbiology, Faculty of Allied Health Sciences, Chulalongkorn University, Bangkok, Thailand
- Research Unit of Medical Mycology Diagnosis, Chulalongkorn University, Bangkok, Thailand
| | - Nuttapon Pombubpa
- Department of Microbiology, Faculty of Science, Chulalongkorn University, Bangkok, Thailand
- Department of Microbiology and Plant Pathology, University of California, Riverside, California, USA
| | - Thidathip Wongsurawat
- Department of Biomedical Informatics, College of Medicine, University of Arkansas for Medical Sciences, Little Rock, Arkansas, USA
- Division of Medical Bioinformatics, Faculty of Medicine, Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Piroon Jenjaroenpun
- Department of Biomedical Informatics, College of Medicine, University of Arkansas for Medical Sciences, Little Rock, Arkansas, USA
- Division of Medical Bioinformatics, Faculty of Medicine, Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - J Jennifer Luangsa-Ard
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Ariya Chindamporn
- Department of Microbiology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
- Center of Excellence in Antimicrobial Resistance and Stewardship, Chulalongkorn University, Bangkok, Thailand
| |
Collapse
|
3
|
Grewal S, Yang CY, Krasheninnikova K, Collins J, Wood JMD, Ashling S, Scholefield D, Kaithakottil GG, Swarbreck D, Yao E, Sen TZ, King IP, King J. Chromosome-level haplotype-resolved genome assembly of bread wheat's wild relative Aegilops mutica. Sci Data 2025; 12:438. [PMID: 40082453 PMCID: PMC11906796 DOI: 10.1038/s41597-025-04737-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2024] [Accepted: 02/28/2025] [Indexed: 03/16/2025] Open
Abstract
Bread wheat (Triticum aestivum) is a vital staple crop, with an urgent need for increased production to help feed the world's growing population. Aegilops mutica (2n = 2x = 14; T genome) is a diploid wild relative of wheat carrying valuable agronomic traits resulting in its extensive exploitation for wheat improvement. This paper reports a chromosome-scale, haplotype-resolved genome assembly of Ae. mutica using HiFi reads and Omni-C data. The final lengths for the curated genomes were ~4.65 Gb (haplotype 1) and 4.56 Gb (haplotype 2), featuring a contig N50 of ~4.35 Mb and ~4.60 Mb, respectively. Genome annotation predicted 96,723 gene models and repeats. In summary, the genome assembly of Ae. mutica provides a valuable resource for the wheat breeding community, facilitating faster and more efficient pre-breeding of wheat to enhance food security.
Collapse
Affiliation(s)
- Surbhi Grewal
- Wheat Research Centre, School of Biosciences, University of Nottingham, Loughborough, LE12 5RD, UK.
| | - Cai-Yun Yang
- Wheat Research Centre, School of Biosciences, University of Nottingham, Loughborough, LE12 5RD, UK
| | | | - Joanna Collins
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1RQ, UK
| | - Jonathan M D Wood
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1RQ, UK
| | - Stephen Ashling
- Wheat Research Centre, School of Biosciences, University of Nottingham, Loughborough, LE12 5RD, UK
| | - Duncan Scholefield
- Wheat Research Centre, School of Biosciences, University of Nottingham, Loughborough, LE12 5RD, UK
| | | | - David Swarbreck
- Earlham Institute, Norwich Research Park, Norwich, NR4 7UZ, UK
| | - Eric Yao
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Crop Improvement and Genetics Research Unit, 800 Buchanan St., Albany, CA, 94710, USA
| | - Taner Z Sen
- United States Department of Agriculture-Agricultural Research Service, Western Regional Research Center, Crop Improvement and Genetics Research Unit, 800 Buchanan St., Albany, CA, 94710, USA
- University of California, Department of Bioengineering, Berkeley, CA, 94720, USA
| | - Ian P King
- Wheat Research Centre, School of Biosciences, University of Nottingham, Loughborough, LE12 5RD, UK
| | - Julie King
- Wheat Research Centre, School of Biosciences, University of Nottingham, Loughborough, LE12 5RD, UK
| |
Collapse
|
4
|
Medhi U, Chaliha C, Singh A, Nath BK, Kalita E. Third generation sequencing transforming plant genome research: Current trends and challenges. Gene 2025; 940:149187. [PMID: 39724994 DOI: 10.1016/j.gene.2024.149187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 12/15/2024] [Accepted: 12/17/2024] [Indexed: 12/28/2024]
Abstract
In recent years, third-generation sequencing (TGS) technologies have transformed genomics and transcriptomics research, providing novel opportunities for significant discoveries. The long-read sequencing platforms, with their unique advantages over next-generation sequencing (NGS), including a definitive protocol, reduced operational time, and real-time sequencing, possess the potential to transform plant genomics. TGS optimizes and enhances the efficiency of data analysis by removing the necessity for time-consuming assembly tools. The current review examines the development and application of bioinformatics tools for data analysis and annotation, driven by the rapid advancement of TGS platforms like Oxford Nanopore Technologies and Pacific Biosciences. Transcriptome analysis utilizing TGS has been extensively employed to elucidate complex plant transcriptomes and genomes, particularly those characterized by high frequencies of duplicated genomes and repetitive sequences. As a result, current methodologies that allow for generating transcriptomes and comprehensive whole-genome sequences of complex plant genomes employing tailored hybrid sequencing techniques that integrate NGS and TGS technologies have been emphasized herein. This paper, thus, articulates a vision for a future in which TGS effectively addresses the challenges faced in plant research, offering a comprehensive understanding of its advantages, applications, limitations, and promising prospects.
Collapse
Affiliation(s)
- Upasana Medhi
- Department of Molecular Biology and Biotechnology, Cotton University, Panbazar, Guwahati, Assam, 781001, India
| | - Chayanika Chaliha
- School of Natural Resource Management, College of Post Graduate Studies in Agricultural Sciences-CAU Imphal, Umiam, Meghalaya, 793104, India
| | - Archana Singh
- Department of Plant Molecular Biology, University of Delhi South Campus, Benito Juarez Road, Dhaula Kuan, New Delhi, 110021, India
| | - Bikash K Nath
- Department of Molecular Biology and Biotechnology, Tezpur University, Assam, 784028, India
| | - Eeshan Kalita
- Department of Molecular Biology and Biotechnology, Cotton University, Panbazar, Guwahati, Assam, 781001, India.
| |
Collapse
|
5
|
Pu Y, Zhou X, Cai H, Lou T, Liu C, Kong M, Sun Z, Wang Y, Zhang R, Zhu Y, Ye L, Zheng Y, Zhu B, Quan Z, Zhao G, Zheng Y. Impact of DNA Extraction Methods on Gut Microbiome Profiles: A Comparative Metagenomic Study. PHENOMICS (CHAM, SWITZERLAND) 2025; 5:76-90. [PMID: 40313603 PMCID: PMC12040788 DOI: 10.1007/s43657-025-00232-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 01/21/2025] [Accepted: 01/27/2025] [Indexed: 05/03/2025]
Abstract
In gut microbial research, DNA extraction remarkably influences study outcomes and biological interpretations. Rapid advancements in the research scale and technological upgrades necessitate evaluating new methods to ensure reliability and precision in microbial community profiling. We systematically evaluated the performance of eight recent and commonly used extraction methods using a microbial mock community (MMC) and fecal samples from two healthy volunteers, incorporating bacterial, archaeal, and fungal constituents. Performance metrics included nucleic acid assessment, microbial profile assessment, and scalability for large-scale studies, leveraging shotgun metagenomics for in-depth analysis. Despite variations in DNA quantity and quality, all methods yielded sufficient DNA for shotgun metagenomic sequencing. In the MMC microbial profile assessment, the QIAamp PowerFecal pro Kit (PF) and DNeasy PowerSoil HTP kit (PS) methods exhibited higher similarity with the theoretical composition and lower variability across technical replicates compared to other methods. For fecal samples, the extraction method accounted for 21.4% of the overall microbiome variation and significantly affected the abundances of 32% of detected microbial species. Methods using mechanical lysis with small beads, such as PF and PS, demonstrated better efficiency, indicated by increased microbial diversity in extracting DNA from Gram-positive bacteria. Furthermore, the PF and PS methods are notably simple to execute and automation-friendly, though relatively costly. Our study underscores the importance of maintaining consistency in DNA extraction methods for reliable comparative metagenomic analyses. We recommend PF and PS methods as optimal for expansive gut metagenomic research, emphasizing the critical role of mechanical lysis in DNA extraction. Supplementary Information The online version contains supplementary material available at 10.1007/s43657-025-00232-x.
Collapse
Affiliation(s)
- Yanni Pu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
| | - Xiaofeng Zhou
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
| | - Hao Cai
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
| | - Tao Lou
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
| | - Chenglin Liu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
| | - Mengmeng Kong
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
| | - Zhonghan Sun
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
| | - Yanren Wang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Institute of Biodiversity Science, School of Life Sciences, Fudan University, Shanghai, 200438 China
| | - Ruyi Zhang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Institute of Biodiversity Science, School of Life Sciences, Fudan University, Shanghai, 200438 China
| | - Yuxuan Zhu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
| | - Lin Ye
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
| | - Yuanting Zheng
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Shanghai Cancer Center, Fudan University, Shanghai, 200438 China
| | - Baoli Zhu
- Key Laboratory of Agro-Ecological Processes in Subtropical Regions, Taoyuan Agro-Ecosystem Research Station, Institute of Subtropical Agriculture, Chinese Academy of Sciences, Changsha, 410125 China
| | - Zhexue Quan
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Institute of Biodiversity Science, School of Life Sciences, Fudan University, Shanghai, 200438 China
| | - Guoping Zhao
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
- Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200032 China
| | - Yan Zheng
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, and School of Life Sciences, Fudan University, Room C601, No. 2005 Songhu Road, Yangpu District, Shanghai, 200438 China
- Department of Cardiology, Shanghai Institute of Cardiovascular Disease, Zhongshan Hospital, Fudan University, Shanghai, 200032 China
- Yiwu Research Institute, Fudan University, Yiwu, 322000 China
| |
Collapse
|
6
|
Scott S, Westhaus A, Nazareth D, Cabanes-Creus M, Navarro RG, Chandra D, Zhu E, Venkateswaran A, Alexander IE, Bauer DC, Wilson LO, Lisowski L. AAVolve: Concatenated long-read deep sequencing enables whole capsid tracking during shuffled AAV library selection. Mol Ther Methods Clin Dev 2024; 32:101351. [PMID: 39498467 PMCID: PMC11532298 DOI: 10.1016/j.omtm.2024.101351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Accepted: 09/30/2024] [Indexed: 11/07/2024]
Abstract
Gene therapies using recombinant adeno-associated virus (AAV) vectors have demonstrated considerable clinical success in the treatment of genetic disorders. Improved vectors with favorable tropism profiles, decreased immunogenicity, and enhanced manufacturability are poised to further improve the state of gene therapies. Such vectors can be identified through directed evolution, a process of subjecting a diverse capsid library to a selection pressure to identify individual variants with a desired trait. Currently, libraries that involve changes distributed throughout the AAV capsid coding region, such as DNA family shuffled libraries, are largely characterized using low-throughput Sanger sequencing of individual clones. However, improvements in long-read sequencing technologies have increased their applicability to capsid libraries and evaluation of the selection process. Here, we explore the application of Oxford Nanopore Technologies refined by a concatemeric consensus method for initial library characterization and monitoring selection of a shuffled AAV capsid library. Furthermore, we present AAVolve, a bioinformatic pipeline for processing long-read data from AAV-directed evolution experiments. Our approach allows high-throughput characterization of AAV capsids in a streamlined manner, facilitating deeper insights into library composition through multiple rounds of selection, and generalization through training of machine learning models.
Collapse
Affiliation(s)
- Suzanne Scott
- Translational Vectorology Research Unit, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW 2145, Australia
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Adrian Westhaus
- Translational Vectorology Research Unit, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW 2145, Australia
| | - Deborah Nazareth
- Translational Vectorology Research Unit, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW 2145, Australia
| | - Marti Cabanes-Creus
- Translational Vectorology Research Unit, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW 2145, Australia
| | - Renina Gale Navarro
- Translational Vectorology Research Unit, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW 2145, Australia
| | - Deborah Chandra
- Translational Vectorology Research Unit, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW 2145, Australia
| | - Erhua Zhu
- Gene Therapy Research Unit, Children’s Medical Research Institute and The Children’s Hospital at Westmead, Faculty of Medicine and Health, The University of Sydney, and Sydney Children’s Hospitals Network, Westmead, NSW 2145, Australia
| | - Aravind Venkateswaran
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Ian E. Alexander
- Gene Therapy Research Unit, Children’s Medical Research Institute and The Children’s Hospital at Westmead, Faculty of Medicine and Health, The University of Sydney, and Sydney Children’s Hospitals Network, Westmead, NSW 2145, Australia
- Discipline of Child and Adolescent Health, The University of Sydney, Sydney Medical School, Faculty of Medicine and Health, Westmead, NSW 2145, Australia
| | - Denis C. Bauer
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
- Department of Biomedical Sciences, Faculty of Medicine and Health Science, Macquarie University, Macquarie Park, NSW 2113, Australia
- Applied BioSciences, Faculty of Science and Engineering, Macquarie University, Macquarie Park, NSW 2113, Australia
| | - Laurence O.W. Wilson
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
- Applied BioSciences, Faculty of Science and Engineering, Macquarie University, Macquarie Park, NSW 2113, Australia
| | - Leszek Lisowski
- Translational Vectorology Research Unit, Children’s Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW 2145, Australia
- Laboratory of Molecular Oncology and Innovative Therapies, Military Institute of Medicine – National Research Institute, 04-141 Warsaw, Poland
| |
Collapse
|
7
|
Chen SH, Jones A, Lu-Irving P, Yap JYS, van der Merwe M, Bragg JG, Edwards RJ. Chromosome-Level Genome Assembly of the Australian Rainforest Tree Rhodamnia argentea (Malletwood). Genome Biol Evol 2024; 16:evae238. [PMID: 39487819 PMCID: PMC11604068 DOI: 10.1093/gbe/evae238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Revised: 09/23/2024] [Accepted: 10/04/2024] [Indexed: 11/04/2024] Open
Abstract
Myrtaceae are a large family of woody plants, including hundreds that are currently under threat from the global spread of a fungal pathogen, Austropuccinia psidii (G. Winter) Beenken, which causes myrtle rust. A reference genome for the Australian native rainforest tree Rhodamnia argentea Benth. (malletwood) was assembled from Oxford Nanopore Technologies long-reads, 10x Genomics Chromium linked-reads, and Hi-C data (N50 = 32.3 Mb and BUSCO completeness 98.0%) with 99.0% of the 347 Mb assembly anchored to 11 chromosomes (2n = 22). The R. argentea genome will inform conservation efforts for Myrtaceae species threatened by myrtle rust, against which it shows variable resistance. We observed contamination in the sequencing data, and further investigation revealed an arthropod source. This study emphasizes the importance of checking sequencing data for contamination, especially when working with nonmodel organisms. It also enhances our understanding of a tree that faces conservation challenges, contributing to broader biodiversity initiatives.
Collapse
Affiliation(s)
- Stephanie H Chen
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Kensington, NSW 2052, Australia
- Research Centre for Ecosystem Resilience, Botanic Gardens of Sydney, Sydney, NSW 2000, Australia
- Centre for Australian National Biodiversity Research (a joint venture between Parks Australia and CSIRO), Canberra, ACT 2601, Australia
| | - Ashley Jones
- Research School of Biology, Australian National University, Canberra, ACT 2601, Australia
| | - Patricia Lu-Irving
- Research Centre for Ecosystem Resilience, Botanic Gardens of Sydney, Sydney, NSW 2000, Australia
| | - Jia-Yee S Yap
- Research Centre for Ecosystem Resilience, Botanic Gardens of Sydney, Sydney, NSW 2000, Australia
| | - Marlien van der Merwe
- Research Centre for Ecosystem Resilience, Botanic Gardens of Sydney, Sydney, NSW 2000, Australia
| | - Jason G Bragg
- Research Centre for Ecosystem Resilience, Botanic Gardens of Sydney, Sydney, NSW 2000, Australia
| | - Richard J Edwards
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Kensington, NSW 2052, Australia
- Minderoo OceanOmics Centre at UWA, Oceans Institute, University of Western Australia, Perth, WA 6009, Australia
| |
Collapse
|
8
|
Vigouroux M, Novák P, Oliveira LC, Santos C, Cheema J, Wouters RHM, Paajanen P, Vickers M, Koblížková A, Vaz Patto MC, Macas J, Steuernagel B, Martin C, Emmrich PMF. A chromosome-scale reference genome of grasspea (Lathyrus sativus). Sci Data 2024; 11:1035. [PMID: 39333203 PMCID: PMC11437036 DOI: 10.1038/s41597-024-03868-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 09/05/2024] [Indexed: 09/29/2024] Open
Abstract
Grasspea (Lathyrus sativus L.) is an underutilised but promising legume crop with tolerance to a wide range of abiotic and biotic stress factors, and potential for climate-resilient agriculture. Despite a long history and wide geographical distribution of cultivation, only limited breeding resources are available. This paper reports a 5.96 Gbp genome assembly of grasspea genotype LS007, of which 5.03 Gbp is scaffolded into 7 pseudo-chromosomes. The assembly has a BUSCO completeness score of 99.1% and is annotated with 31719 gene models and repeat elements. This represents the most contiguous and accurate assembly of the grasspea genome to date.
Collapse
Affiliation(s)
- Marielle Vigouroux
- John Innes Centre, Norwich Research Park, Colney Lane, Norwich, NR4 7UH, UK
| | - Petr Novák
- Institute of Plant Molecular Biology, Biology Centre CAS, Branisovska 31, Ceske Budejovice, CZ, 37005, Czech Republic
| | - Ludmila Cristina Oliveira
- Institute of Plant Molecular Biology, Biology Centre CAS, Branisovska 31, Ceske Budejovice, CZ, 37005, Czech Republic
| | - Carmen Santos
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, 2780-157, Portugal
| | - Jitender Cheema
- John Innes Centre, Norwich Research Park, Colney Lane, Norwich, NR4 7UH, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, CB10 1SD, Cambridge, United Kingdom
| | - Roland H M Wouters
- John Innes Centre, Norwich Research Park, Colney Lane, Norwich, NR4 7UH, UK
| | - Pirita Paajanen
- John Innes Centre, Norwich Research Park, Colney Lane, Norwich, NR4 7UH, UK
| | - Martin Vickers
- John Innes Centre, Norwich Research Park, Colney Lane, Norwich, NR4 7UH, UK
| | - Andrea Koblížková
- Institute of Plant Molecular Biology, Biology Centre CAS, Branisovska 31, Ceske Budejovice, CZ, 37005, Czech Republic
| | - Maria Carlota Vaz Patto
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, 2780-157, Portugal
| | - Jiří Macas
- Institute of Plant Molecular Biology, Biology Centre CAS, Branisovska 31, Ceske Budejovice, CZ, 37005, Czech Republic
| | | | - Cathie Martin
- John Innes Centre, Norwich Research Park, Colney Lane, Norwich, NR4 7UH, UK
| | - Peter M F Emmrich
- John Innes Centre, Norwich Research Park, Colney Lane, Norwich, NR4 7UH, UK.
- Norwich Institute for Sustainable Development, School of International Development, University of East Anglia, Norwich, NR4 7TJ, UK.
| |
Collapse
|
9
|
Berube B, Ernst E, Cahn J, Roche B, de Santis Alves C, Lynn J, Scheben A, Grimanelli D, Siepel A, Ross-Ibarra J, Kermicle J, Martienssen RA. Teosinte Pollen Drive guides maize diversification and domestication by RNAi. Nature 2024; 633:380-388. [PMID: 39112710 PMCID: PMC11390486 DOI: 10.1038/s41586-024-07788-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 07/04/2024] [Indexed: 08/17/2024]
Abstract
Selfish genetic elements contribute to hybrid incompatibility and bias or 'drive' their own transmission1,2. Chromosomal drive typically functions in asymmetric female meiosis, whereas gene drive is normally post-meiotic and typically found in males. Here, using single-molecule and single-pollen genome sequencing, we describe Teosinte Pollen Drive, an instance of gene drive in hybrids between maize (Zea mays ssp. mays) and teosinte mexicana (Z. mays ssp. mexicana) that depends on RNA interference (RNAi). 22-nucleotide small RNAs from a non-coding RNA hairpin in mexicana depend on Dicer-like 2 (Dcl2) and target Teosinte Drive Responder 1 (Tdr1), which encodes a lipase required for pollen viability. Dcl2, Tdr1 and the hairpin are in tight pseudolinkage on chromosome 5, but only when transmitted through the male. Introgression of mexicana into early cultivated maize is thought to have been critical to its geographical dispersal throughout the Americas3, and a tightly linked inversion in mexicana spans a major domestication sweep in modern maize4. A survey of maize traditional varieties and sympatric populations of teosinte mexicana reveals correlated patterns of admixture among unlinked genes required for RNAi on at least four chromosomes that are also subject to gene drive in pollen from synthetic hybrids. Teosinte Pollen Drive probably had a major role in maize domestication and diversification, and offers an explanation for the widespread abundance of 'self' small RNAs in the germ lines of plants and animals.
Collapse
Affiliation(s)
- Benjamin Berube
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Evan Ernst
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jonathan Cahn
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Benjamin Roche
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | | | - Jason Lynn
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Armin Scheben
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | | | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jeffrey Ross-Ibarra
- Department of Evolution and Ecology, Center for Population Biology and Genome Center, University of California at Davis, Davis, CA, USA
| | - Jerry Kermicle
- Laboratory of Genetics, University of Wisconsin, Madison, WI, USA
| | - Robert A Martienssen
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| |
Collapse
|
10
|
Zhang M, Huang X, Wu H. Application of Biological Nanopore Sequencing Technology in the Detection of Microorganisms †. CHINESE J CHEM 2023; 41:3473-3483. [DOI: 10.1002/cjoc.202300255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 07/14/2023] [Indexed: 01/05/2025]
Abstract
Comprehensive SummaryEnvironmental pollution and the spread of pathogenic microorganisms pose a significant threat to the health of humans and the planet. Thus, understanding and detecting microorganisms is crucial for maintaining a healthy living environment. Nanopore sequencing is a single‐molecule detection method developed in the 1990s that has revolutionized various research fields. It offers several advantages over traditional sequencing methods, including low cost, label‐free, time‐saving detection speed, long sequencing reading, real‐time monitoring, convenient carrying, and other significant advantages. In this review, we summarize the technical principles and characteristics of nanopore sequencing and discuss its applications in amplicon sequencing, metagenome sequencing, and whole‐genome sequencing of environmental microorganisms, as well as its in situ application under some special circumstances. We also analyze the advantages and challenges of nanopore sequencing in microbiology research. Overall, nanopore sequencing has the potential to greatly enhance the detection and understanding of microorganisms in environmental research, but further developments are needed to overcome the current challenges.
Collapse
Affiliation(s)
- Ming‐Qian Zhang
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Analytical Chemistry for Living Biosystems, Institute of Chemistry Chinese Academy of Sciences Beijing 100190 China
- University of Chinese Academy of Sciences Beijing 100049 China
| | - Xiao‐Bin Huang
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Analytical Chemistry for Living Biosystems, Institute of Chemistry Chinese Academy of Sciences Beijing 100190 China
- University of Chinese Academy of Sciences Beijing 100049 China
| | - Hai‐Chen Wu
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Analytical Chemistry for Living Biosystems, Institute of Chemistry Chinese Academy of Sciences Beijing 100190 China
- University of Chinese Academy of Sciences Beijing 100049 China
| |
Collapse
|
11
|
Lamas A, Garrido-Maestu A, Prieto A, Cepeda A, Franco CM. Whole genome sequencing in the palm of your hand: how to implement a MinION Galaxy-based workflow in a food safety laboratory for rapid Salmonella spp. serotyping, virulence, and antimicrobial resistance gene identification. Front Microbiol 2023; 14:1254692. [PMID: 38107857 PMCID: PMC10722185 DOI: 10.3389/fmicb.2023.1254692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 11/02/2023] [Indexed: 12/19/2023] Open
Abstract
Introduction Whole Genome Sequencing (WGS) implementation in food safety laboratories is a significant advancement in food pathogen control and outbreak tracking. However, the initial investment for acquiring next-generation sequencing platforms and the need for bioinformatic skills represented an obstacle for the widespread use of WGS. Long-reading technologies, such as the one developed by Oxford Nanopore Technologies, can be easily implemented with a minor initial investment and with simple protocols that can be performed with basic laboratory equipment. Methods Herein, we report a simple MinION Galaxy-based workflow with analysis parameters that allow its implementation in food safety laboratories with limited computer resources and without previous knowledge in bioinformatics for rapid Salmonella serotyping, virulence, and identification of antimicrobial resistance genes. For that purpose, the single use Flongle flow cells, along with the MinION Mk1B for WGS, and the community-driven web-based analysis platform Galaxy for bioinformatic analysis was used. Three strains belonging to three different serotypes, monophasic S. Typhimurium, S. Grancanaria, and S. Senftenberg, were sequenced. Results After 24 h of sequencing, enough coverage was achieved in order to perform de novo assembly in all three strains. After evaluating different tools, Flye de novo assemblies with medaka polishing were shown to be optimal for in silico Salmonella spp. serotyping with SISRT tool followed by antimicrobial and virulence gene identification with ABRicate. Discussion The implementation of the present workflow in food safety laboratories with limited computer resources allows a rapid characterization of Salmonella spp. isolates.
Collapse
Affiliation(s)
- Alexandre Lamas
- Food Hygiene, Inspection and Control Laboratory (Lhica), Department of Analytical Chemistry, Nutrition and Bromatology, Veterinary School, Universidade da Santiago de Compostela, Lugo, Spain
| | - Alejandro Garrido-Maestu
- Food Quality and Safety Research Group, International Iberian Nanotechnology Laboratory, Braga, Portugal
| | - Alberto Prieto
- Department of Animal Pathology (INVESAGA Group), Faculty of Veterinary Sciences, Universidade de Santiago de Compostela, Lugo, Spain
| | - Alberto Cepeda
- Food Hygiene, Inspection and Control Laboratory (Lhica), Department of Analytical Chemistry, Nutrition and Bromatology, Veterinary School, Universidade da Santiago de Compostela, Lugo, Spain
| | - Carlos Manuel Franco
- Food Hygiene, Inspection and Control Laboratory (Lhica), Department of Analytical Chemistry, Nutrition and Bromatology, Veterinary School, Universidade da Santiago de Compostela, Lugo, Spain
| |
Collapse
|
12
|
Berube B, Ernst E, Cahn J, Roche B, de Santis Alves C, Lynn J, Scheben A, Siepel A, Ross-Ibarra J, Kermicle J, Martienssen R. Teosinte Pollen Drive guides maize diversification and domestication by RNAi. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.12.548689. [PMID: 37503269 PMCID: PMC10370002 DOI: 10.1101/2023.07.12.548689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Meiotic drivers subvert Mendelian expectations by manipulating reproductive development to bias their own transmission. Chromosomal drive typically functions in asymmetric female meiosis, while gene drive is normally postmeiotic and typically found in males. Using single molecule and single-pollen genome sequencing, we describe Teosinte Pollen Drive, an instance of gene drive in hybrids between maize (Zea mays ssp. mays) and teosinte mexicana (Zea mays ssp. mexicana), that depends on RNA interference (RNAi). 22nt small RNAs from a non-coding RNA hairpin in mexicana depend on Dicer-Like 2 (Dcl2) and target Teosinte Drive Responder 1 (Tdr1), which encodes a lipase required for pollen viability. Dcl2, Tdr1, and the hairpin are in tight pseudolinkage on chromosome 5, but only when transmitted through the male. Introgression of mexicana into early cultivated maize is thought to have been critical to its geographical dispersal throughout the Americas, and a tightly linked inversion in mexicana spans a major domestication sweep in modern maize. A survey of maize landraces and sympatric populations of teosinte mexicana reveals correlated patterns of admixture among unlinked genes required for RNAi on at least 4 chromosomes that are also subject to gene drive in pollen from synthetic hybrids. Teosinte Pollen Drive likely played a major role in maize domestication and diversification, and offers an explanation for the widespread abundance of "self" small RNAs in the germlines of plants and animals.
Collapse
Affiliation(s)
- Benjamin Berube
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor NY11724
| | - Evan Ernst
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor NY11724
| | - Jonathan Cahn
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor NY11724
| | - Benjamin Roche
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor NY11724
| | | | - Jason Lynn
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor NY11724
| | - Armin Scheben
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor NY11724
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor NY11724
| | - Jeffrey Ross-Ibarra
- Dept. of Evolution & Ecology, Center for Population Biology and Genome Center, University of California, Davis CA
| | - Jerry Kermicle
- Laboratory of Genetics, University of Wisconsin, Madison WI
| | - Rob Martienssen
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor NY11724
| |
Collapse
|
13
|
Koo H, Lee GW, Ko SR, Go S, Kwon SY, Kim YM, Shin AY. Two long read-based genome assembly and annotation of polyploidy woody plants, Hibiscus syriacus L. using PacBio and Nanopore platforms. Sci Data 2023; 10:713. [PMID: 37853021 PMCID: PMC10584963 DOI: 10.1038/s41597-023-02631-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 10/11/2023] [Indexed: 10/20/2023] Open
Abstract
Improvements in long read DNA sequencing and related techniques facilitated the generation of complex eukaryotic genomes. Despite these advances, the quality of constructed plant reference genomes remains relatively poor due to the large size of genomes, high content of repetitive sequences, and wide variety of ploidy. Here, we developed the de novo sequencing and assembly of high polyploid plant genome, Hibiscus syriacus, a flowering plant species of the Malvaceae family, using the Oxford Nanopore Technologies and Pacific Biosciences Sequel sequencing platforms. We investigated an efficient combination of high-quality and high-molecular-weight DNA isolation procedure and suitable assembler to achieve optimal results using long read sequencing data. We found that abundant ultra-long reads allow for large and complex polyploid plant genome assemblies with great recovery of repetitive sequences and error correction even at relatively low depth Nanopore sequencing data and polishing compared to previous studies. Collectively, our combination provides cost effective methods to improve genome continuity and quality compared to the previously reported reference genome by accessing highly repetitive regions. The application of this combination may enable genetic research and breeding of polyploid crops, thus leading to improvements in crop production.
Collapse
Affiliation(s)
- Hyunjin Koo
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
| | - Gir-Won Lee
- SML Genetree Co. Ltd., Seoul, 05855, Republic of Korea
| | - Seo-Rin Ko
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
- Biosystems and Bioengineering Program, University of Science and Technology, Daejeon, 34113, Korea
| | - Sangjin Go
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
- Biosystems and Bioengineering Program, University of Science and Technology, Daejeon, 34113, Korea
| | - Suk-Yoon Kwon
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
- Biosystems and Bioengineering Program, University of Science and Technology, Daejeon, 34113, Korea
| | - Yong-Min Kim
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), Daejeon, 34141, Republic of Korea.
- Digital Biotech Innovation Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
| | - Ah-Young Shin
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
- Department of Bioinformatics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), Daejeon, 34141, Republic of Korea.
| |
Collapse
|
14
|
Merkulov P, Gvaramiya S, Dudnikov M, Komakhin R, Omarov M, Kocheshkova A, Konstantinov Z, Soloviev A, Karlov G, Divashuk M, Kirov I. Cas9-targeted Nanopore sequencing rapidly elucidates the transposition preferences and DNA methylation profiles of mobile elements in plants. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2023; 65:2242-2261. [PMID: 37555565 DOI: 10.1111/jipb.13555] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 07/27/2023] [Accepted: 08/08/2023] [Indexed: 08/10/2023]
Abstract
Transposable element insertions (TEIs) are an important source of genomic innovation by contributing to plant adaptation, speciation, and the production of new varieties. The often large, complex plant genomes make identifying TEIs from short reads difficult and expensive. Moreover, rare somatic insertions that reflect mobilome dynamics are difficult to track using short reads. To address these challenges, we combined Cas9-targeted Nanopore sequencing (CANS) with the novel pipeline NanoCasTE to trace both genetically inherited and somatic TEIs in plants. We performed CANS of the EVADÉ (EVD) retrotransposon in wild-type Arabidopsis thaliana and rapidly obtained up to 40× sequence coverage. Analysis of hemizygous T-DNA insertion sites and genetically inherited insertions of the EVD transposon in the ddm1 (decrease in DNA methylation 1) genome uncovered the crucial role of DNA methylation in shaping EVD insertion preference. We also investigated somatic transposition events of the ONSEN transposon family, finding that genes that are downregulated during heat stress are preferentially targeted by ONSENs. Finally, we detected hypomethylation of novel somatic insertions for two ONSENs. CANS and NanoCasTE are effective tools for detecting TEIs and exploring mobilome organization in plants in response to stress and in different genetic backgrounds, as well as screening T-DNA insertion mutants and transgenic plants.
Collapse
Affiliation(s)
- Pavel Merkulov
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, 141701, Russia
| | - Sofya Gvaramiya
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
| | - Maxim Dudnikov
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, 141701, Russia
| | - Roman Komakhin
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
| | - Murad Omarov
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
| | - Alina Kocheshkova
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
| | - Zakhar Konstantinov
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
| | - Alexander Soloviev
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
- All-Russia Center for Plant Quarantine, Ramenski, 140150, Russia
- N.V. Tsitsin Main Botanical Garden of the Russian Academy of Sciences, Moscow, 127276, Russia
| | - Gennady Karlov
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
| | - Mikhail Divashuk
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
| | - Ilya Kirov
- All-Russia Research Institute of Agricultural Biotechnology, Moscow, 127550, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, 141701, Russia
| |
Collapse
|
15
|
Cunha TJ, de Medeiros BAS, Lord A, Sørensen MV, Giribet G. Rampant loss of universal metazoan genes revealed by a chromosome-level genome assembly of the parasitic Nematomorpha. Curr Biol 2023; 33:3514-3521.e4. [PMID: 37467752 DOI: 10.1016/j.cub.2023.07.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 05/21/2023] [Accepted: 07/03/2023] [Indexed: 07/21/2023]
Abstract
Parasites may manipulate host behavior to increase the odds of transmission or to reach the proper environment to complete their life cycle.1,2 Members of the phylum Nematomorpha (known as horsehair worms, hairworms, or Gordian worms) are large endoparasites that affect the behavior of their arthropod hosts. In terrestrial hosts, they cause erratic movements toward bodies of water,3,4,5,6 where the adult worm emerges from the host to find mates for reproduction. We present a chromosome-level genome assembly for the freshwater Acutogordius australiensis and a draft assembly for one of the few known marine species, Nectonema munidae. The assemblies span 201 Mbp and 213 Mbp in length (N50: 38 Mbp and 716 Kbp), respectively, and reveal four chromosomes in Acutogordius, which are largely rearranged compared to the inferred ancestral condition in animals. Both nematomorph genomes have a relatively low number of genes (11,114 and 8,717, respectively) and lack a high proportion (∼30%) of universal single-copy metazoan orthologs (BUSCO genes7). We demonstrate that missing genes are not an artifact of the assembly process, with the majority of missing orthologs being shared by the two independent assemblies. Missing BUSCOs are enriched for Gene Ontology (GO) terms associated with the organization of cilia and cell projections in other animals. We show that most cilium-related genes conserved across eukaryotes have been lost in Nematomorpha, providing a molecular basis for the suspected absence of ciliary structures in these animals.
Collapse
Affiliation(s)
- Tauana J Cunha
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA; Field Museum of Natural History, 1400 S DuSable Lake Shore Drive, Chicago, IL 60605, USA.
| | - Bruno A S de Medeiros
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA; Field Museum of Natural History, 1400 S DuSable Lake Shore Drive, Chicago, IL 60605, USA
| | - Arianna Lord
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| | - Martin V Sørensen
- Natural History Museum of Denmark, University of Copenhagen, Universitetsparken 15, DK-2100 Copenhagen, Denmark
| | - Gonzalo Giribet
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| |
Collapse
|
16
|
Cordeiro D, Camelo A, Pedrosa AC, Brandão I, Canhoto J, Espírito Santo C, Correia S. An Efficient Method to Prepare Barcoded cDNA Libraries from Plant Callus for Long-Read Sequencing. Methods Protoc 2023; 6:mps6020031. [PMID: 36961051 PMCID: PMC10037601 DOI: 10.3390/mps6020031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 03/07/2023] [Accepted: 03/13/2023] [Indexed: 03/17/2023] Open
Abstract
Long-read sequencing methods allow a comprehensive analysis of transcriptomes in identifying full-length transcripts. This revolutionary method represents a considerable breakthrough for non-model species since it allows enhanced gene annotation and gene expression studies when compared to former sequencing methods. However, woody plant tissues are challenging to the successful preparation of cDNA libraries, thus, impairing further cutting-edge sequencing analyses. Here, a detailed protocol for preparing cDNA libraries suitable for high throughput RNA sequencing using Oxford Nanopore Technologies® is described. This method was used to prepare eight barcoded cDNA libraries from two Solanum betaceum cell lines: one with compact morphology and embryogenic competency (EC) and another with friable and non-embryogenic (NEC). The libraries were successfully sequenced, and data quality assessment showed high mean quality scores. Using this method, long-read sequencing will allow a comprehensive analysis of plant transcriptomes.
Collapse
Affiliation(s)
- Daniela Cordeiro
- Centre for Functional Ecology, TERRA Associate Laboratory, Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
| | - Alexandra Camelo
- Centro de Apoio Tecnológico Agro-Alimentar (CATAA) de Castelo Branco, 6000-459 Castelo Branco, Portugal
| | - Ana Carolina Pedrosa
- Centre for Functional Ecology, TERRA Associate Laboratory, Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
| | - Inês Brandão
- Centre for Functional Ecology, TERRA Associate Laboratory, Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- Centro de Apoio Tecnológico Agro-Alimentar (CATAA) de Castelo Branco, 6000-459 Castelo Branco, Portugal
| | - Jorge Canhoto
- Centre for Functional Ecology, TERRA Associate Laboratory, Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
| | - Christophe Espírito Santo
- Centre for Functional Ecology, TERRA Associate Laboratory, Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- Centro de Apoio Tecnológico Agro-Alimentar (CATAA) de Castelo Branco, 6000-459 Castelo Branco, Portugal
| | - Sandra Correia
- Centre for Functional Ecology, TERRA Associate Laboratory, Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- InnovPlantProtect CoLab, Estrada de Gil Vaz, 7350-478 Elvas, Portugal
| |
Collapse
|
17
|
Hasing T, Bombarely A. Genomic Approaches for the Study of Flower Development in Floriculture Crops. Methods Mol Biol 2023; 2686:453-494. [PMID: 37540373 DOI: 10.1007/978-1-0716-3299-4_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/05/2023]
Abstract
The advances in genomics and bioinformatics have made possible the study in non-model plants of phenotypes associated to flower development. Floriculture crops are an interesting source of traits associated to flower development such as the transition between zygomorphic and actinomorphic flowers or the production of flowers with double and triple corollas. In this chapter, we summarize the material and methods for the use of floriculture crops to study flower development using genomic tools, from the sequencing and assembly of a reference genome to QTL and RNA-Seq analysis to search candidate genes associated to specific traits.
Collapse
Affiliation(s)
| | - Aureliano Bombarely
- Instituto de Biología Molecular y Celular de Plantas (IBMCP) (UPV-CSIC), Valencia, Spain.
| |
Collapse
|
18
|
Valderrama E, Landis JB, Skinner D, Maas PJM, Maas-van de Kramer H, André T, Grunder N, Sass C, Pinilla-Vargas M, Guan CJ, Phillips HR, de Almeida AMR, Specht CD. The genetic mechanisms underlying the convergent evolution of pollination syndromes in the Neotropical radiation of Costus L. FRONTIERS IN PLANT SCIENCE 2022; 13:874322. [PMID: 36161003 PMCID: PMC9493542 DOI: 10.3389/fpls.2022.874322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Accepted: 06/27/2022] [Indexed: 06/16/2023]
Abstract
Selection together with variation in floral traits can act to mold floral form, often driven by a plant's predominant or most effective pollinators. To investigate the evolution of traits associated with pollination, we developed a phylogenetic framework for evaluating tempo and mode of pollination shifts across the genus Costus L., known for its evolutionary toggle between traits related to bee and bird pollination. Using a target enrichment approach, we obtained 957 loci for 171 accessions to expand the phylogenetic sampling of Neotropical Costus. In addition, we performed whole genome resequencing for a subset of 20 closely related species with contrasting pollination syndromes. For each of these 20 genomes, a high-quality assembled transcriptome was used as reference for consensus calling of candidate loci hypothesized to be associated with pollination-related traits of interest. To test for the role these candidate genes may play in evolutionary shifts in pollinators, signatures of selection were estimated as dN/dS across the identified candidate loci. We obtained a well-resolved phylogeny for Neotropical Costus despite conflict among gene trees that provide evidence of incomplete lineage sorting and/or reticulation. The overall topology and the network of genome-wide single nucleotide polymorphisms (SNPs) indicate that multiple shifts in pollination strategy have occurred across Costus, while also suggesting the presence of previously undetected signatures of hybridization between distantly related taxa. Traits related to pollination syndromes are strongly correlated and have been gained and lost in concert several times throughout the evolution of the genus. The presence of bract appendages is correlated with two traits associated with defenses against herbivory. Although labellum shape is strongly correlated with overall pollination syndrome, we found no significant impact of labellum shape on diversification rates. Evidence suggests an interplay of pollination success with other selective pressures shaping the evolution of the Costus inflorescence. Although most of the loci used for phylogenetic inference appear to be under purifying selection, many candidate genes associated with functional traits show evidence of being under positive selection. Together these results indicate an interplay of phylogenetic history with adaptive evolution leading to the diversification of pollination-associated traits in Neotropical Costus.
Collapse
Affiliation(s)
- Eugenio Valderrama
- School of Integrative Plant Science, Section of Plant Biology and the L.H. Bailey Hortorium, Cornell University, Ithaca, NY, United States
| | - Jacob B. Landis
- School of Integrative Plant Science, Section of Plant Biology and the L.H. Bailey Hortorium, Cornell University, Ithaca, NY, United States
- BTI Computational Biology Center, Boyce Thompson Institute, Ithaca, NY, United States
| | - Dave Skinner
- Le Jardin Ombragé, Tallahassee, FL, United States
| | - Paul J. M. Maas
- Section Botany, Naturalis Biodiversity Center, Leiden, Netherlands
| | | | - Thiago André
- Departamento de Botânica, Instituto de Ciências Biológicas, Universidade de Brasília, Brasília, DF, Brazil
| | - Nikolaus Grunder
- Department of Biological Sciences, California State University, East Bay, Hayward, CA, United States
| | - Chodon Sass
- University and Jepson Herbaria, University of California, Berkeley, Berkeley, CA, United States
| | - Maria Pinilla-Vargas
- School of Integrative Plant Science, Section of Plant Biology and the L.H. Bailey Hortorium, Cornell University, Ithaca, NY, United States
| | - Clarice J. Guan
- School of Integrative Plant Science, Section of Plant Biology and the L.H. Bailey Hortorium, Cornell University, Ithaca, NY, United States
| | - Heather R. Phillips
- School of Integrative Plant Science, Section of Plant Biology and the L.H. Bailey Hortorium, Cornell University, Ithaca, NY, United States
| | | | - Chelsea D. Specht
- School of Integrative Plant Science, Section of Plant Biology and the L.H. Bailey Hortorium, Cornell University, Ithaca, NY, United States
| |
Collapse
|
19
|
Mgwatyu Y, Cornelissen S, van Heusden P, Stander A, Ranketse M, Hesse U. Establishing MinION Sequencing and Genome Assembly Procedures for the Analysis of the Rooibos ( Aspalathus linearis) Genome. PLANTS (BASEL, SWITZERLAND) 2022; 11:2156. [PMID: 36015459 PMCID: PMC9416007 DOI: 10.3390/plants11162156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/25/2022] [Revised: 08/08/2022] [Accepted: 08/14/2022] [Indexed: 11/17/2022]
Abstract
While plant genome analysis is gaining speed worldwide, few plant genomes have been sequenced and analyzed on the African continent. Yet, this information holds the potential to transform diverse industries as it unlocks medicinally and industrially relevant biosynthesis pathways for bioprospecting. Considering that South Africa is home to the highly diverse Cape Floristic Region, local establishment of methods for plant genome analysis is essential. Long-read sequencing is becoming standard procedure for plant genome research, as these reads can span repetitive regions of the DNA, substantially facilitating reassembly of a contiguous genome. With the MinION, Oxford Nanopore offers a cost-efficient sequencing method to generate long reads; however, DNA purification protocols must be adapted for each plant species to generate ultra-pure DNA, essential for these analyses. Here, we describe a cost-effective procedure for the extraction and purification of plant DNA and evaluate diverse genome assembly approaches for the reconstruction of the genome of rooibos (Aspalathus linearis), an endemic South African medicinal plant widely used for tea production. We discuss the pros and cons of nine tested assembly programs, specifically Redbean and NextDenovo, which generated the most contiguous assemblies, and Flye, which produced an assembly closest to the predicted genome size.
Collapse
Affiliation(s)
- Yamkela Mgwatyu
- Department of Biotechnology, University of the Western Cape, Robert Sobukwe Road, Bellville 7535, South Africa
| | - Stephanie Cornelissen
- Agricultural Research Council, Biotechnology Platform, 100 Old Soutpans Road, Onderstepoort 0110, South Africa
| | - Peter van Heusden
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Robert Sobukwe Road, Bellville 7535, South Africa
| | - Allison Stander
- Department of Biotechnology, University of the Western Cape, Robert Sobukwe Road, Bellville 7535, South Africa
| | - Mary Ranketse
- Agricultural Research Council, Biotechnology Platform, 100 Old Soutpans Road, Onderstepoort 0110, South Africa
| | - Uljana Hesse
- Department of Biotechnology, University of the Western Cape, Robert Sobukwe Road, Bellville 7535, South Africa
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Robert Sobukwe Road, Bellville 7535, South Africa
- Institute for Microbial Biotechnology and Metagenomics, University of the Western Cape, Robert Sobukwe Road, Bellville 7535, South Africa
| |
Collapse
|
20
|
Dmitriev AA, Pushkova EN, Melnikova NV. Plant Genome Sequencing: Modern Technologies and Novel Opportunities for Breeding. Mol Biol 2022. [DOI: 10.1134/s0026893322040045] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
|
21
|
Jaudou S, Tran ML, Vorimore F, Fach P, Delannoy S. Evaluation of high molecular weight DNA extraction methods for long-read sequencing of Shiga toxin-producing Escherichia coli. PLoS One 2022; 17:e0270751. [PMID: 35830426 PMCID: PMC9278759 DOI: 10.1371/journal.pone.0270751] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 06/16/2022] [Indexed: 11/18/2022] Open
Abstract
Next generation sequencing has become essential for pathogen characterization and typing. The most popular second generation sequencing technique produces data of high quality with very low error rates and high depths. One major drawback of this technique is the short reads. Indeed, short-read sequencing data of Shiga toxin-producing Escherichia coli (STEC) are difficult to assemble because of the presence of numerous mobile genetic elements (MGEs), which contain repeated elements. The resulting draft assemblies are often highly fragmented, which results in a loss of information, especially concerning MGEs or large structural variations. The use of long-read sequencing can circumvent these problems and produce complete or nearly complete genomes. The ONT MinION, for its small size and minimal investment requirements, is particularly popular. The ultra-long reads generated with the MinION can easily span prophages and repeat regions. In order to take full advantage of this technology it requires High Molecular Weight (HMW) DNA of high quality in high quantity. In this study, we have tested three different extraction methods: bead-based, solid-phase and salting-out, and evaluated their impact on STEC DNA yield, quality and integrity as well as performance in MinION long-read sequencing. Both the bead-based and salting-out methods allowed the recovery of large quantities of HMW STEC DNA suitable for MinION library preparation. The DNA extracted using the salting-out method consistently produced longer reads in the subsequent MinION runs, compared with the bead-based methods. While both methods performed similarly in subsequent STEC genome assembly, DNA extraction based on salting-out appeared to be the overall best method to produce high quantity of pure HMW STEC DNA for MinION sequencing.
Collapse
Affiliation(s)
- Sandra Jaudou
- Pathogenic E. coli Unit, Laboratory for Food Safety, Anses, Maisons-Alfort, France
| | - Mai-Lan Tran
- Pathogenic E. coli Unit, Laboratory for Food Safety, Anses, Maisons-Alfort, France
- IdentyPath Platform, Laboratory for Food Safety, Anses, Maisons-Alfort, France
| | - Fabien Vorimore
- IdentyPath Platform, Laboratory for Food Safety, Anses, Maisons-Alfort, France
| | - Patrick Fach
- Pathogenic E. coli Unit, Laboratory for Food Safety, Anses, Maisons-Alfort, France
- IdentyPath Platform, Laboratory for Food Safety, Anses, Maisons-Alfort, France
| | - Sabine Delannoy
- Pathogenic E. coli Unit, Laboratory for Food Safety, Anses, Maisons-Alfort, France
- IdentyPath Platform, Laboratory for Food Safety, Anses, Maisons-Alfort, France
| |
Collapse
|
22
|
Russo A, Mayjonade B, Frei D, Potente G, Kellenberger RT, Frachon L, Copetti D, Studer B, Frey JE, Grossniklaus U, Schlüter PM. Low-Input High-Molecular-Weight DNA Extraction for Long-Read Sequencing From Plants of Diverse Families. FRONTIERS IN PLANT SCIENCE 2022; 13:883897. [PMID: 35665166 PMCID: PMC9161206 DOI: 10.3389/fpls.2022.883897] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 04/21/2022] [Indexed: 05/16/2023]
Abstract
Long-read DNA sequencing technologies require high molecular weight (HMW) DNA of adequate purity and integrity, which can be difficult to isolate from plant material. Plant leaves usually contain high levels of carbohydrates and secondary metabolites that can impact DNA purity, affecting downstream applications. Several protocols and kits are available for HMW DNA extraction, but they usually require a high amount of input material and often lead to substantial DNA fragmentation, making sequencing suboptimal in terms of read length and data yield. We here describe a protocol for plant HMW DNA extraction from low input material (0.1 g) which is easy to follow and quick (2.5 h). This method successfully enabled us to extract HMW from four species from different families (Orchidaceae, Poaceae, Brassicaceae, Asteraceae). In the case of recalcitrant species, we show that an additional purification step is sufficient to deliver a clean DNA sample. We demonstrate the suitability of our protocol for long-read sequencing on the Oxford Nanopore Technologies PromethION® platform, with and without the use of a short fragment depletion kit.
Collapse
Affiliation(s)
- Alessia Russo
- Department of Plant and Microbial Biology and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany
- Department of Systematic and Evolutionary Botany and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
| | - Baptiste Mayjonade
- Laboratoire des Interactions Plantes Microbes Environnement (LIPME), INRAE, Toulouse, France
| | - Daniel Frei
- Department of Method Development and Analytics, Agroscope, Wädenswil, Switzerland
| | - Giacomo Potente
- Department of Systematic and Evolutionary Botany and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
| | | | - Léa Frachon
- Department of Systematic and Evolutionary Botany and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
| | - Dario Copetti
- Institute of Agricultural Sciences and Zurich-Basel Plant Science Centre, ETH Zürich, Zurich, Switzerland
| | - Bruno Studer
- Institute of Agricultural Sciences and Zurich-Basel Plant Science Centre, ETH Zürich, Zurich, Switzerland
| | - Jürg E. Frey
- Department of Method Development and Analytics, Agroscope, Wädenswil, Switzerland
| | - Ueli Grossniklaus
- Department of Plant and Microbial Biology and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
| | - Philipp M. Schlüter
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany
- Department of Systematic and Evolutionary Botany and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
| |
Collapse
|
23
|
Complete Genome Sequences of Two Closely Related Paenarthrobacter nicotinovorans Strains. Microbiol Resour Announc 2022; 11:e0013322. [PMID: 35536014 PMCID: PMC9202424 DOI: 10.1128/mra.00133-22] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Paenarthrobacter nicotinovorans is a soil bacterium that uses the pyridine pathway to degrade nicotine. The genome of strain ATCC 49919 is composed of a ~4.3-Mbp chromosome and a ~165-kbp plasmid. The second strain, termed here nic-, is a cured derivative lacking the plasmid and not able to degrade nicotine.
Collapse
|
24
|
Ko KKK, Chng KR, Nagarajan N. Metagenomics-enabled microbial surveillance. Nat Microbiol 2022; 7:486-496. [PMID: 35365786 DOI: 10.1038/s41564-022-01089-w] [Citation(s) in RCA: 93] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Accepted: 02/22/2022] [Indexed: 12/13/2022]
Abstract
Lessons learnt from the COVID-19 pandemic include increased awareness of the potential for zoonoses and emerging infectious diseases that can adversely affect human health. Although emergent viruses are currently in the spotlight, we must not forget the ongoing toll of morbidity and mortality owing to antimicrobial resistance in bacterial pathogens and to vector-borne, foodborne and waterborne diseases. Population growth, planetary change, international travel and medical tourism all contribute to the increasing frequency of infectious disease outbreaks. Surveillance is therefore of crucial importance, but the diversity of microbial pathogens, coupled with resource-intensive methods, compromises our ability to scale-up such efforts. Innovative technologies that are both easy to use and able to simultaneously identify diverse microorganisms (viral, bacterial or fungal) with precision are necessary to enable informed public health decisions. Metagenomics-enabled surveillance methods offer the opportunity to improve detection of both known and yet-to-emerge pathogens.
Collapse
Affiliation(s)
- Karrie K K Ko
- Laboratory of Metagenomic Technologies and Microbial Systems, Genome Institute of Singapore, Singapore, Singapore.,Department of Microbiology, Singapore General Hospital, Singapore, Singapore.,Department of Molecular Pathology, Singapore General Hospital, Singapore, Singapore.,Duke-NUS Medical School, Singapore, Singapore.,Yong Loo Lin School of Medicine, National Univerisity of Singapore, Singapore, Singapore
| | - Kern Rei Chng
- Laboratory of Metagenomic Technologies and Microbial Systems, Genome Institute of Singapore, Singapore, Singapore.,National Centre for Food Science, Singapore Food Agency, Singapore, Singapore
| | - Niranjan Nagarajan
- Laboratory of Metagenomic Technologies and Microbial Systems, Genome Institute of Singapore, Singapore, Singapore. .,Yong Loo Lin School of Medicine, National Univerisity of Singapore, Singapore, Singapore.
| |
Collapse
|
25
|
Epichloë scottii sp. nov., a new endophyte isolated from Melica uniflora is the missing ancestor of Epichloë disjuncta. IMA Fungus 2022; 13:2. [PMID: 35109929 PMCID: PMC8812020 DOI: 10.1186/s43008-022-00088-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 01/25/2022] [Indexed: 11/13/2022] Open
Abstract
Here we describe a new, haploid and stroma forming species within the genus Epichloë, as Epichloë scottii sp. nov. The fungus was isolated from Melica uniflora growing in Bad Harzburg, Germany. Phylogenetic reconstruction using a combined dataset of the tubB and tefA genes strongly support that E. scottii is a distinct species and the so far unknown ancestor species of the hybrid E. disjuncta. A distribution analysis showed a high infection rate in close vicinity of the initial sampling site and only two more spots with low infection rates. Genetic variations in key genes required for alkaloid production suggested that E. scottii sp. nov. might not be capable of producing any of the major alkaloids including ergot alkaloid, loline, indole-diterpene and peramine. All isolates and individuals found in the distribution analysis were identified as mating-type B explaining the lack of mature stromata during this study. We further release a telomere-to-telomere de novo assembly of all seven chromosomes and the mitogenome of E. scottii sp. nov.
Collapse
|
26
|
Chen SH, Rossetto M, van der Merwe M, Lu-Irving P, Yap JYS, Sauquet H, Bourke G, Amos TG, Bragg JG, Edwards RJ. Chromosome-level de novo genome assembly of Telopea speciosissima (New South Wales waratah) using long-reads, linked-reads and Hi-C. Mol Ecol Resour 2022; 22:1836-1854. [PMID: 35016262 DOI: 10.1111/1755-0998.13574] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Revised: 12/03/2021] [Accepted: 12/03/2021] [Indexed: 11/29/2022]
Abstract
Telopea speciosissima, the New South Wales waratah, is an Australian endemic woody shrub in the family Proteaceae. Waratahs have great potential as a model clade to better understand processes of speciation, introgression and adaptation, and are significant from a horticultural perspective. Here, we report the first chromosome-level genome for T. speciosissima. Combining Oxford Nanopore long-reads, 10x Genomics Chromium linked-reads and Hi-C data, the assembly spans 823 Mb (scaffold N50 of 69.0 Mb) with 97.8% of Embryophyta BUSCOs "Complete". We present a new method in Diploidocus (https://github.com/slimsuite/diploidocus) for classifying, curating and QC-filtering scaffolds, which combines read depths, k-mer frequencies and BUSCO predictions. We also present a new tool, DepthSizer (https://github.com/slimsuite/depthsizer), for genome size estimation from the read depth of single-copy orthologues and estimate the genome size to be approximately 900 Mb. The largest 11 scaffolds contained 94.1% of the assembly, conforming to the expected number of chromosomes (2n = 22). Genome annotation predicted 40,158 protein-coding genes, 351 rRNAs and 728 tRNAs. We investigated CYCLOIDEA (CYC) genes, which have a role in determination of floral symmetry, and confirm the presence of two copies in the genome. Read depth analysis of 180 "Duplicated" BUSCO genes using a new tool, DepthKopy (https://github.com/slimsuite/depthkopy), suggests almost all are real duplications, increasing confidence in the annotation and highlighting a possible need to revise the BUSCO set for this lineage. The chromosome-level T. speciosissima reference genome (Tspe_v1) provides an important new genomic resource of Proteaceae to support the conservation of flora in Australia and further afield.
Collapse
Affiliation(s)
- Stephanie H Chen
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Kensington, New South Wales, Australia.,Research Centre for Ecosystem Resilience, Australian Institute of Botanical Science, The Royal Botanic Garden Sydney, Sydney, New South Wales, Australia
| | - Maurizio Rossetto
- Research Centre for Ecosystem Resilience, Australian Institute of Botanical Science, The Royal Botanic Garden Sydney, Sydney, New South Wales, Australia.,Queensland Alliance of Agriculture and Food Innovation, University of Queensland, St Lucia, Queensland, Australia
| | - Marlien van der Merwe
- Research Centre for Ecosystem Resilience, Australian Institute of Botanical Science, The Royal Botanic Garden Sydney, Sydney, New South Wales, Australia
| | - Patricia Lu-Irving
- Research Centre for Ecosystem Resilience, Australian Institute of Botanical Science, The Royal Botanic Garden Sydney, Sydney, New South Wales, Australia
| | - Jia-Yee S Yap
- Research Centre for Ecosystem Resilience, Australian Institute of Botanical Science, The Royal Botanic Garden Sydney, Sydney, New South Wales, Australia.,Queensland Alliance of Agriculture and Food Innovation, University of Queensland, St Lucia, Queensland, Australia
| | - Hervé Sauquet
- National Herbarium of New South Wales, Royal Botanic Gardens and Domain Trust, Sydney, New South Wales, Australia.,School of Biological, Earth and Environmental Sciences, UNSW Sydney, New South Wales, Australia
| | - Greg Bourke
- Blue Mountains Botanic Garden, Mount Tomah, New South Wales, Australia
| | - Timothy G Amos
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Kensington, New South Wales, Australia
| | - Jason G Bragg
- Research Centre for Ecosystem Resilience, Australian Institute of Botanical Science, The Royal Botanic Garden Sydney, Sydney, New South Wales, Australia.,School of Biological, Earth and Environmental Sciences, UNSW Sydney, New South Wales, Australia
| | - Richard J Edwards
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Kensington, New South Wales, Australia
| |
Collapse
|
27
|
Raymond-Bouchard I, Maggiori C, Brennan L, Altshuler I, Manchado JM, Parro V, Whyte LG. Assessment of Automated Nucleic Acid Extraction Systems in Combination with MinION Sequencing As Potential Tools for the Detection of Microbial Biosignatures. ASTROBIOLOGY 2022; 22:87-103. [PMID: 34962136 DOI: 10.1089/ast.2020.2349] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The utilization of nanopore technologies for the detection of organic biogenic compounds has garnered significant focus in recent years. Oxford Nanopore Technologies' (ONT) MinION instrument, which can detect and sequence nucleic acids (NAs), is one such example. These technologies have much promise for unambiguous life detection but require significant development in terms of methods for extraction and preparation of NAs for biosignature detection and their feasibility for use in astrobiology-focused field missions. In this study, we tested pre-existing, automated, or semiautomated NA extraction technologies, coupled with automated ONT VolTRAX NA sample preparation, and verification with Nanopore MinION sequencing. All of the extraction systems tested (SuperFastPrep2, ClaremontX1, and SOLID-Sample Preparation Unit) showed potential for extracting DNA from Canadian High Arctic environments analogous to Mars, Europa, and Enceladus, which could subsequently be detected and sequenced with the MinION. However, they differed with regard to efficacy, yield, purity, and sequencing and annotation quality. Overall, bead beating-based systems performed the best for these parameters. In addition, we showed that the MinION could sequence unpurified DNA contained in crude cell lysates. This is valuable from an astrobiology perspective because purification steps are time-consuming and complicate the requirements for an automated extraction and life detection system. Our results indicate that semiautomated NA extraction and preparation technologies hold much promise, and with increased optimization and automation could be coupled to a larger platform incorporating nanopore detection and sequencing of NAs for life detection applications.
Collapse
Affiliation(s)
| | - Catherine Maggiori
- Department of Natural Resource Sciences, McGill University, Quebec, Canada
| | - Laura Brennan
- Department of Natural Resource Sciences, McGill University, Quebec, Canada
| | - Ianina Altshuler
- Department of Natural Resource Sciences, McGill University, Quebec, Canada
| | | | - Victor Parro
- Centro de Astrobiología (CSIC-INTA), Torrejón de Ardoz, Madrid, Spain
| | - Lyle G Whyte
- Department of Natural Resource Sciences, McGill University, Quebec, Canada
| |
Collapse
|
28
|
Johnson LK, Sahasrabudhe R, Gill JA, Roach JL, Froenicke L, Brown CT, Whitehead A. Draft genome assemblies using sequencing reads from Oxford Nanopore Technology and Illumina platforms for four species of North American Fundulus killifish. Gigascience 2021; 9:5859380. [PMID: 32556169 PMCID: PMC7301629 DOI: 10.1093/gigascience/giaa067] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Revised: 04/16/2020] [Accepted: 05/27/2020] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Whole-genome sequencing data from wild-caught individuals of closely related North American killifish species (Fundulus xenicus, Fundulus catenatus, Fundulus nottii, and Fundulus olivaceus) were obtained using long-read Oxford Nanopore Technology (ONT) PromethION and short-read Illumina platforms. FINDINGS Draft de novo reference genome assemblies were generated using a combination of long and short sequencing reads. For each species, the PromethION platform was used to generate 30-45× sequence coverage, and the Illumina platform was used to generate 50-160× sequence coverage. Illumina-only assemblies were fragmented with high numbers of contigs, while ONT-only assemblies were error prone with low BUSCO scores. The highest N50 values, ranging from 0.4 to 2.7 Mb, were from assemblies generated using a combination of short- and long-read data. BUSCO scores were consistently >90% complete using the Eukaryota database. CONCLUSIONS High-quality genomes can be obtained from a combination of using short-read Illumina data to polish assemblies generated with long-read ONT data. Draft assemblies and raw sequencing data are available for public use. We encourage use and reuse of these data for assembly benchmarking and other analyses.
Collapse
Affiliation(s)
- Lisa K Johnson
- Department of Environmental Toxicology, University of California. 1 Shields Avenue, Davis, CA 95616, Davis, CA, USA
- Department of Population Health & Reproduction, School of Veterinary Medicine, University of California. 1 Shields Avenue, Davis, CA 95616, Davis, CA, USA
| | - Ruta Sahasrabudhe
- DNA Technologies Core, Genome Center, University of California, 1 Shields Avenue, Davis, CA 95616
| | - James Anthony Gill
- Department of Environmental Toxicology, University of California. 1 Shields Avenue, Davis, CA 95616, Davis, CA, USA
| | - Jennifer L Roach
- Department of Environmental Toxicology, University of California. 1 Shields Avenue, Davis, CA 95616, Davis, CA, USA
| | - Lutz Froenicke
- DNA Technologies Core, Genome Center, University of California, 1 Shields Avenue, Davis, CA 95616
| | - C Titus Brown
- Department of Population Health & Reproduction, School of Veterinary Medicine, University of California. 1 Shields Avenue, Davis, CA 95616, Davis, CA, USA
| | - Andrew Whitehead
- Correspondence address. Andrew Whitehead, Department of Environmental Toxicology, University of California. 1 Shields Avenue, Davis, CA 95616, USA, Davis, CA, USA. E-mail:
| |
Collapse
|
29
|
LeafGo: Leaf to Genome, a quick workflow to produce high-quality de novo plant genomes using long-read sequencing technology. Genome Biol 2021; 22:256. [PMID: 34479618 PMCID: PMC8414726 DOI: 10.1186/s13059-021-02475-z] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 08/20/2021] [Indexed: 02/06/2023] Open
Abstract
Currently, different sequencing platforms are used to generate plant genomes and no workflow has been properly developed to optimize time, cost, and assembly quality. We present LeafGo, a complete de novo plant genome workflow, that starts from tissue and produces genomes with modest laboratory and bioinformatic resources in approximately 7 days and using one long-read sequencing technology. LeafGo is optimized with ten different plant species, three of which are used to generate high-quality chromosome-level assemblies without any scaffolding technologies. Finally, we report the diploid genomes of Eucalyptus rudis and E. camaldulensis and the allotetraploid genome of Arachis hypogaea.
Collapse
|
30
|
Melo Clavijo J, Drews F, Pirritano M, Simon M, Salhab A, Donath A, Frankenbach S, Serôdio J, Bleidißel S, Preisfeld A, Christa G. The complete mitochondrial genome of the photosymbiotic sea slug Berghia stephanieae (Valdés, 2005) (Gastropoda, Nudibranchia). Mitochondrial DNA B Resour 2021; 6:2281-2284. [PMID: 34291161 PMCID: PMC8279152 DOI: 10.1080/23802359.2021.1914211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022] Open
Abstract
Berghia stephanieae (Nudibranchia, Cladobranchia) is a photosymbiotic sea slug that feeds exclusively on sea anemones from the genus Exaiptasia. It then specifically incorporates dinoflagellates belonging to the Symbiodiniaceae obtained from their prey. Here, we present the complete mitochondrial genome sequence of B. stephanieae combining Oxford Nanopore long read and Illumina short-read sequencing data. The mitochondrial genome has a total length of 14,786 bp, it contains the 13 protein-encoding genes, 23 tRNAs, and two rRNAs and is similar to other nudibranchs except for the presence of a duplicated tRNA-Ser 1.
Collapse
Affiliation(s)
- Jenny Melo Clavijo
- Bergische Universität Wuppertal, Fakultät für Mathematik und Naturwissenschaften, Zoologie und Biologiedidaktik, Wuppertal, Germany
| | - Franziska Drews
- Bergische Universität Wuppertal, Fakultät für Mathematik und Naturwissenschaften, Chemie und Biologie, Molekulare Zellbiologie und Mikrobiologie, Wuppertal, Germany
| | - Marcello Pirritano
- Bergische Universität Wuppertal, Fakultät für Mathematik und Naturwissenschaften, Chemie und Biologie, Molekulare Zellbiologie und Mikrobiologie, Wuppertal, Germany
| | - Martin Simon
- Bergische Universität Wuppertal, Fakultät für Mathematik und Naturwissenschaften, Chemie und Biologie, Molekulare Zellbiologie und Mikrobiologie, Wuppertal, Germany
| | | | - Alexander Donath
- Zoologisches Forschungsinstitut und Museum Alexander Koenig, Bonn, Germany
| | - Silja Frankenbach
- Department of Biology and CESAM, Centre for Environmental and Marine Studies, University of Aveiro, Aveiro, Portugal
| | - João Serôdio
- Department of Biology and CESAM, Centre for Environmental and Marine Studies, University of Aveiro, Aveiro, Portugal
| | - Sabrina Bleidißel
- Bergische Universität Wuppertal, Fakultät für Mathematik und Naturwissenschaften, Zoologie und Biologiedidaktik, Wuppertal, Germany
| | - Angelika Preisfeld
- Bergische Universität Wuppertal, Fakultät für Mathematik und Naturwissenschaften, Zoologie und Biologiedidaktik, Wuppertal, Germany
| | - Gregor Christa
- Bergische Universität Wuppertal, Fakultät für Mathematik und Naturwissenschaften, Zoologie und Biologiedidaktik, Wuppertal, Germany
| |
Collapse
|
31
|
Four Complete Genome Sequences for Bradyrhizobium sp. Strains Isolated from an Endemic Australian Acacia Legume Reveal Structural Variation. Microbiol Resour Announc 2021; 10:10/19/e00229-21. [PMID: 33986081 PMCID: PMC8142567 DOI: 10.1128/mra.00229-21] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Bradyrhizobium sp. strains were isolated from root nodules of the Australian legume, Acacia acuminata (Fabaceae). Here, we report the complete genome sequences of four strains using a hybrid long- and short-read assembly approach. The genome sizes range between ∼7.1 Mbp and ∼8.1 Mbp, each with one single circular chromosome. Whole-genome alignments show extensive structural rearrangement. Bradyrhizobium sp. strains were isolated from root nodules of the Australian legume, Acacia acuminata (Fabaceae). Here, we report the complete genome sequences of four strains using a hybrid long- and short-read assembly approach. The genome sizes range between ∼7.1 Mbp and ∼8.1 Mbp, each with one single circular chromosome. Whole-genome alignments show extensive structural rearrangement.
Collapse
|
32
|
Mathers TC, Wouters RHM, Mugford ST, Swarbreck D, van Oosterhout C, Hogenhout SA. Chromosome-Scale Genome Assemblies of Aphids Reveal Extensively Rearranged Autosomes and Long-Term Conservation of the X Chromosome. Mol Biol Evol 2021; 38:856-875. [PMID: 32966576 PMCID: PMC7947777 DOI: 10.1093/molbev/msaa246] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Chromosome rearrangements are arguably the most dramatic type of mutations, often leading to rapid evolution and speciation. However, chromosome dynamics have only been studied at the sequence level in a small number of model systems. In insects, Diptera and Lepidoptera have conserved genome structure at the scale of whole chromosomes or chromosome arms. Whether this reflects the diversity of insect genome evolution is questionable given that many species exhibit rapid karyotype evolution. Here, we investigate chromosome evolution in aphids-an important group of hemipteran plant pests-using newly generated chromosome-scale genome assemblies of the green peach aphid (Myzus persicae) and the pea aphid (Acyrthosiphon pisum), and a previously published assembly of the corn-leaf aphid (Rhopalosiphum maidis). We find that aphid autosomes have undergone dramatic reorganization over the last 30 My, to the extent that chromosome homology cannot be determined between aphids from the tribes Macrosiphini (Myzus persicae and Acyrthosiphon pisum) and Aphidini (Rhopalosiphum maidis). In contrast, gene content of the aphid sex (X) chromosome remained unchanged despite rapid sequence evolution, low gene expression, and high transposable element load. To test whether rapid evolution of genome structure is a hallmark of Hemiptera, we compared our aphid assemblies with chromosome-scale assemblies of two blood-feeding Hemiptera (Rhodnius prolixus and Triatoma rubrofasciata). Despite being more diverged, the blood-feeding hemipterans have conserved synteny. The exceptional rate of structural evolution of aphid autosomes renders them an important emerging model system for studying the role of large-scale genome rearrangements in evolution.
Collapse
Affiliation(s)
- Thomas C Mathers
- Department of Crop Genetics, John Innes Centre, Norwich Research Park, Norwich, United Kingdom
| | - Roland H M Wouters
- Department of Crop Genetics, John Innes Centre, Norwich Research Park, Norwich, United Kingdom
| | - Sam T Mugford
- Department of Crop Genetics, John Innes Centre, Norwich Research Park, Norwich, United Kingdom
| | - David Swarbreck
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | - Cock van Oosterhout
- School of Environmental Sciences, University of East Anglia, Norwich, United Kingdom
| | - Saskia A Hogenhout
- Department of Crop Genetics, John Innes Centre, Norwich Research Park, Norwich, United Kingdom
| |
Collapse
|
33
|
Mader M, Schroeder H, Schott T, Schöning-Stierand K, Leite Montalvão AP, Liesebach H, Liesebach M, Fussi B, Kersten B. Mitochondrial Genome of Fagus sylvatica L. as a Source for Taxonomic Marker Development in the Fagales. PLANTS (BASEL, SWITZERLAND) 2020; 9:E1274. [PMID: 32992588 PMCID: PMC7650814 DOI: 10.3390/plants9101274] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 09/23/2020] [Accepted: 09/24/2020] [Indexed: 11/16/2022]
Abstract
European beech, Fagus sylvatica L., is one of the most important and widespread deciduous tree species in Central Europe and is widely managed for its hard wood. The complete DNA sequence of the mitochondrial genome of Fagus sylvatica L. was assembled and annotated based on Illumina MiSeq reads and validated using long reads from nanopore MinION sequencing. The genome assembled into a single DNA sequence of 504,715 bp in length containing 58 genes with predicted function, including 35 protein-coding, 20 tRNA and three rRNA genes. Additionally, 23 putative protein-coding genes were predicted supported by RNA-Seq data. Aiming at the development of taxon-specific mitochondrial genetic markers, the tool SNPtax was developed and applied to select genic SNPs potentially specific for different taxa within the Fagales. Further validation of a small SNP set resulted in the development of four CAPS markers specific for Fagus, Fagaceae, or Fagales, respectively, when considering over 100 individuals from a total of 69 species of deciduous trees and conifers from up to 15 families included in the marker validation. The CAPS marker set is suitable to identify the genus Fagus in DNA samples from tree tissues or wood products, including wood composite products.
Collapse
Affiliation(s)
- Malte Mader
- Thünen Institute of Forest Genetics, D-22927 Grosshansdorf, Germany; (M.M.); (H.S.); (T.S.); (K.S.-S.); (A.P.L.M.); (H.L.); (M.L.)
| | - Hilke Schroeder
- Thünen Institute of Forest Genetics, D-22927 Grosshansdorf, Germany; (M.M.); (H.S.); (T.S.); (K.S.-S.); (A.P.L.M.); (H.L.); (M.L.)
| | - Thomas Schott
- Thünen Institute of Forest Genetics, D-22927 Grosshansdorf, Germany; (M.M.); (H.S.); (T.S.); (K.S.-S.); (A.P.L.M.); (H.L.); (M.L.)
| | - Katrin Schöning-Stierand
- Thünen Institute of Forest Genetics, D-22927 Grosshansdorf, Germany; (M.M.); (H.S.); (T.S.); (K.S.-S.); (A.P.L.M.); (H.L.); (M.L.)
- Center for Bioinformatics, Universität Hamburg, 20146 Hamburg, Germany
| | - Ana Paula Leite Montalvão
- Thünen Institute of Forest Genetics, D-22927 Grosshansdorf, Germany; (M.M.); (H.S.); (T.S.); (K.S.-S.); (A.P.L.M.); (H.L.); (M.L.)
| | - Heike Liesebach
- Thünen Institute of Forest Genetics, D-22927 Grosshansdorf, Germany; (M.M.); (H.S.); (T.S.); (K.S.-S.); (A.P.L.M.); (H.L.); (M.L.)
| | - Mirko Liesebach
- Thünen Institute of Forest Genetics, D-22927 Grosshansdorf, Germany; (M.M.); (H.S.); (T.S.); (K.S.-S.); (A.P.L.M.); (H.L.); (M.L.)
| | - Barbara Fussi
- Bavarian Office for Forest Genetics, 83317 Teisendorf, Germany;
| | - Birgit Kersten
- Thünen Institute of Forest Genetics, D-22927 Grosshansdorf, Germany; (M.M.); (H.S.); (T.S.); (K.S.-S.); (A.P.L.M.); (H.L.); (M.L.)
| |
Collapse
|
34
|
Dumschott K, Schmidt MHW, Chawla HS, Snowdon R, Usadel B. Oxford Nanopore sequencing: new opportunities for plant genomics? JOURNAL OF EXPERIMENTAL BOTANY 2020; 71:5313-5322. [PMID: 32459850 PMCID: PMC7501810 DOI: 10.1093/jxb/eraa263] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Accepted: 05/25/2020] [Indexed: 05/06/2023]
Abstract
DNA sequencing was dominated by Sanger's chain termination method until the mid-2000s, when it was progressively supplanted by new sequencing technologies that can generate much larger quantities of data in a shorter time. At the forefront of these developments, long-read sequencing technologies (third-generation sequencing) can produce reads that are several kilobases in length. This greatly improves the accuracy of genome assemblies by spanning the highly repetitive segments that cause difficulty for second-generation short-read technologies. Third-generation sequencing is especially appealing for plant genomes, which can be extremely large with long stretches of highly repetitive DNA. Until recently, the low basecalling accuracy of third-generation technologies meant that accurate genome assembly required expensive, high-coverage sequencing followed by computational analysis to correct for errors. However, today's long-read technologies are more accurate and less expensive, making them the method of choice for the assembly of complex genomes. Oxford Nanopore Technologies (ONT), a third-generation platform for the sequencing of native DNA strands, is particularly suitable for the generation of high-quality assemblies of highly repetitive plant genomes. Here we discuss the benefits of ONT, especially for the plant science community, and describe the issues that remain to be addressed when using ONT for plant genome sequencing.
Collapse
Affiliation(s)
- Kathryn Dumschott
- Institute for Biology I, BioSC, RWTH Aachen University, Aachen, Germany
- IBG-4 Bioinformatics, CEPLAS, Forschungszentrum Jülich, Jülich, Germany
| | - Maximilian H-W Schmidt
- Institute for Biology I, BioSC, RWTH Aachen University, Aachen, Germany
- IBG-4 Bioinformatics, CEPLAS, Forschungszentrum Jülich, Jülich, Germany
| | - Harmeet Singh Chawla
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| | - Rod Snowdon
- Department of Plant Breeding, Justus Liebig University Giessen, Giessen, Germany
| | - Björn Usadel
- Institute for Biology I, BioSC, RWTH Aachen University, Aachen, Germany
- IBG-4 Bioinformatics, CEPLAS, Forschungszentrum Jülich, Jülich, Germany
- Institute for Biological Data Science, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| |
Collapse
|
35
|
Müller NA, Kersten B, Leite Montalvão AP, Mähler N, Bernhardsson C, Bräutigam K, Carracedo Lorenzo Z, Hoenicka H, Kumar V, Mader M, Pakull B, Robinson KM, Sabatti M, Vettori C, Ingvarsson PK, Cronk Q, Street NR, Fladung M. A single gene underlies the dynamic evolution of poplar sex determination. NATURE PLANTS 2020; 6:630-637. [PMID: 32483326 DOI: 10.1038/s41477-020-0672-9] [Citation(s) in RCA: 119] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2019] [Accepted: 04/22/2020] [Indexed: 05/20/2023]
Abstract
Although hundreds of plant lineages have independently evolved dioecy (that is, separation of the sexes), the underlying genetic basis remains largely elusive1. Here we show that diverse poplar species carry partial duplicates of the ARABIDOPSIS RESPONSE REGULATOR 17 (ARR17) orthologue in the male-specific region of the Y chromosome. These duplicates give rise to small RNAs apparently causing male-specific DNA methylation and silencing of the ARR17 gene. CRISPR-Cas9-induced mutations demonstrate that ARR17 functions as a sex switch, triggering female development when on and male development when off. Despite repeated turnover events, including a transition from the XY system to a ZW system, the sex-specific regulation of ARR17 is conserved across the poplar genus and probably beyond. Our data reveal how a single-gene-based mechanism of dioecy can enable highly dynamic sex-linked regions and contribute to maintaining recombination and integrity of sex chromosomes.
Collapse
Affiliation(s)
- Niels A Müller
- Thünen Institute of Forest Genetics, Grosshansdorf, Germany.
| | - Birgit Kersten
- Thünen Institute of Forest Genetics, Grosshansdorf, Germany
| | | | - Niklas Mähler
- Department of Plant Physiology, Umeå Plant Science Centre, Umeå, Sweden
| | - Carolina Bernhardsson
- Department of Plant Biology, Linnean Centre for Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Katharina Bräutigam
- Department of Biology, University of Toronto Mississauga, Mississauga, Ontario, Canada
| | | | - Hans Hoenicka
- Thünen Institute of Forest Genetics, Grosshansdorf, Germany
| | - Vikash Kumar
- Department of Plant Physiology, Umeå Plant Science Centre, Umeå, Sweden
| | - Malte Mader
- Thünen Institute of Forest Genetics, Grosshansdorf, Germany
| | - Birte Pakull
- Thünen Institute of Forest Genetics, Grosshansdorf, Germany
| | | | - Maurizio Sabatti
- Department for Innovation in Biological, Agro-food and Forest Systems, University of Tuscia, Viterbo, Italy
| | - Cristina Vettori
- Institute of Biosciences and BioResources, Division of Florence, National Research Council, Sesto Fiorentino, Italy
| | - Pär K Ingvarsson
- Department of Plant Biology, Linnean Centre for Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Quentin Cronk
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada
| | | | | |
Collapse
|
36
|
Kersten B, Leite Montalvão AP, Hoenicka H, Vettori C, Paffetti D, Fladung M. Sequencing of two transgenic early-flowering poplar lines confirmed vector-free single-locus T-DNA integration. Transgenic Res 2020; 29:321-337. [PMID: 32356192 PMCID: PMC7283205 DOI: 10.1007/s11248-020-00203-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Accepted: 04/18/2020] [Indexed: 02/02/2023]
Abstract
Next-generation sequencing (NGS) approaches are attractive alternatives to the PCR-based characterisation of genetically modified plants for safety assessment and labelling since NGS is highly sensitive to the detection of T-DNA inserts as well as vector backbone sequences in transgenic plants. In this study, two independent transgenic male Populus tremula lines, T193-2 and T195-1, both carrying the FLOWERING LOCUS T gene from Arabidopsis thaliana under control of a heat-inducible promoter (pHSP::AtFT) and the non-transgenic control clone W52, were further characterised by NGS and third-generation sequencing. The results support previous findings that the T-DNA was hemizygously inserted in one genomic locus of each line. However, the T-DNA insertions consist of conglomerations of one or two T-DNA copies together with a small T-DNA fragment without AtFT parts. Based on NGS data, no additional T-DNA splinters or vector backbone sequences could be identified in the genome of the two transgenic lines. Seedlings derived from crosses between the pHSP::AtFT transgenic male parents and female wild type plants are therefore expected to be T-DNA splinter or vector backbone free. Thus, PCR analyses amplifying a partial T-DNA fragment with AtFT-specific primers are sufficient to determine whether the seedlings are transgenic or not. An analysis of 72 second generation-seedlings clearly showed that about 50% of them still reveal the presence of the T-DNA, confirming data already published. To prove if unanticipated genomic changes were induced by T-DNA integration, extended future studies using long-range sequencing technologies are required once a suitable chromosome-level P. tremula reference genome sequence is available.
Collapse
Affiliation(s)
- Birgit Kersten
- Thünen Institute of Forest Genetics, 22927, Grosshansdorf, Germany.
| | | | - Hans Hoenicka
- Thünen Institute of Forest Genetics, 22927, Grosshansdorf, Germany
| | - Cristina Vettori
- Institute of Bioscience and Bioresources (IBBR), National Research Council (CNR), Via Madonna del Piano 10, 50019, Sesto Fiorentino, FI, Italy
| | - Donatella Paffetti
- Department of Agriculture, Food, Environment and Forestry, Agricultural Genetics Section, University of Florence, P. le delle Cascine 18, 50144, Florence, Italy
| | - Matthias Fladung
- Thünen Institute of Forest Genetics, 22927, Grosshansdorf, Germany.
| |
Collapse
|
37
|
Hatfield RG, Batista FM, Bean TP, Fonseca VG, Santos A, Turner AD, Lewis A, Dean KJ, Martinez-Urtaza J. The Application of Nanopore Sequencing Technology to the Study of Dinoflagellates: A Proof of Concept Study for Rapid Sequence-Based Discrimination of Potentially Harmful Algae. Front Microbiol 2020; 11:844. [PMID: 32457722 PMCID: PMC7227484 DOI: 10.3389/fmicb.2020.00844] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 04/08/2020] [Indexed: 01/05/2023] Open
Abstract
Harmful algal blooms (HABs) are a naturally occurring global phenomena that have the potential to impact fisheries, leisure and ecosystems, as well as posing a significant hazard to animal and human health. There is significant interest in the development and application of methodologies to study all aspects of the causative organisms and toxins associated with these events. This paper reports the first application of nanopore sequencing technology for the detection of eukaryotic harmful algal bloom organisms. The MinION sequencing platform from Oxford Nanopore technologies provides long read sequencing capabilities in a compact, low cost, and portable format. In this study we used the MinION to sequence long-range PCR amplicons from multiple dinoflagellate species with a focus on the genus Alexandrium. Primers applicable to a wide range of dinoflagellates were selected, meaning that although the study was primarily focused on Alexandrium the applicability to three additional genera of toxic algae, namely; Gonyaulax, Prorocentrum, and Lingulodinium was also demonstrated. The amplicon generated here spanned approximately 3 kb of the rDNA cassette, including most of the 18S, the complete ITS1, 5.8S, ITS2 and regions D1 and D2 of the 28S. The inclusion of barcode genes as well as highly conserved regions resulted in identification of organisms to the species level. The analysis of reference cultures resulted in over 99% of all sequences being attributed to the correct species with an average identity above 95% from a reference list of over 200 species (see Supplementary Material 1). The use of mock community analysis within environmental samples highlighted that complex matrices did not prevent the ability to distinguish between phylogenetically similar species. Successful identification of causative organisms in environmental samples during natural toxic events further highlighted the potential of the assay. This study proves the suitability of nanopore sequencing technology for taxonomic identification of harmful algal bloom organisms and acquisition of data relevant to the World Health Organisations "one health" approach to marine monitoring.
Collapse
Affiliation(s)
- Robert G. Hatfield
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | - Frederico M. Batista
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | | | - Vera G. Fonseca
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | - Andres Santos
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
- Scientific and Technological Bioresource Nucleus (BIOREN), Universidad de La Frontera, Temuco, Chile
| | - Andrew D. Turner
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | - Adam Lewis
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | - Karl J. Dean
- Centre for Environment, Fisheries and Aquaculture Science, Dorset, United Kingdom
| | | |
Collapse
|
38
|
Genome Sequences of a Green-Colored Chlorobium phaeovibrioides Strain Containing Two Plasmids and a Closely Related Plasmid-Free Brown-Colored Strain. Microbiol Resour Announc 2020; 9:9/2/e01172-19. [PMID: 31919163 PMCID: PMC6952649 DOI: 10.1128/mra.01172-19] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Here, we report the draft genome sequences of the green sulfur bacterium Chlorobium phaeovibrioides strains GrTcv12 and PhvTcv-s14, isolated from the chemocline zone from meromictic Lake Trekhtzvetnoe, separated from the White Sea, in Russia. This is the first report showing the presence of plasmids containing antiphage systems in the Chlorobium sp. genome. Here, we report the draft genome sequences of the green sulfur bacterium Chlorobium phaeovibrioides strains GrTcv12 and PhvTcv-s14, isolated from the chemocline zone from meromictic Lake Trekhtzvetnoe, separated from the White Sea, in Russia. This is the first report showing the presence of plasmids containing antiphage systems in the Chlorobium sp. genome.
Collapse
|
39
|
Baldi P, La Porta N. Molecular Approaches for Low-Cost Point-of-Care Pathogen Detection in Agriculture and Forestry. FRONTIERS IN PLANT SCIENCE 2020; 11:570862. [PMID: 33193502 PMCID: PMC7655913 DOI: 10.3389/fpls.2020.570862] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 09/29/2020] [Indexed: 05/14/2023]
Abstract
Early detection of plant diseases is a crucial factor to prevent or limit the spread of a rising infection that could cause significant economic loss. Detection test on plant diseases in the laboratory can be laborious, time consuming, expensive, and normally requires specific technical expertise. Moreover, in the developing countries, it is often difficult to find laboratories equipped for this kind of analysis. Therefore, in the past years, a high effort has been made for the development of fast, specific, sensitive, and cost-effective tests that can be successfully used in plant pathology directly in the field by low-specialized personnel using minimal equipment. Nucleic acid-based methods have proven to be a good choice for the development of detection tools in several fields, such as human/animal health, food safety, and water analysis, and their application in plant pathogen detection is becoming more and more common. In the present review, the more recent nucleic acid-based protocols for point-of-care (POC) plant pathogen detection and identification are described and analyzed. All these methods have a high potential for early detection of destructive diseases in agriculture and forestry, they should help make molecular detection for plant pathogens accessible to anyone, anywhere, and at any time. We do not suggest that on-site methods should replace lab testing completely, which remains crucial for more complex researches, such as identification and classification of new pathogens or the study of plant defense mechanisms. Instead, POC analysis can provide a useful, fast, and efficient preliminary on-site screening that is crucial in the struggle against plant pathogens.
Collapse
Affiliation(s)
- Paolo Baldi
- IASMA Research and Innovation Centre, Fondazione Edmund Mach, Trento, Italy
- *Correspondence: Paolo Baldi,
| | - Nicola La Porta
- IASMA Research and Innovation Centre, Fondazione Edmund Mach, Trento, Italy
- The EFI Project Centre on Mountain Forests (MOUNTFOR), San Michele a/Adige, Trento, Italy
| |
Collapse
|
40
|
Narum S, Kelley J, Sibbett B. Editorial 2020. Mol Ecol Resour 2020; 20:1-7. [DOI: 10.1111/1755-0998.13125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Accepted: 12/09/2019] [Indexed: 11/29/2022]
|
41
|
Sauvage T, Schmidt WE, Yoon HS, Paul VJ, Fredericq S. Promising prospects of nanopore sequencing for algal hologenomics and structural variation discovery. BMC Genomics 2019; 20:850. [PMID: 31722669 PMCID: PMC6854639 DOI: 10.1186/s12864-019-6248-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Accepted: 10/30/2019] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND The MinION Access Program (MAP, 2014-2016) allowed selected users to test the prospects of long nanopore reads for diverse organisms and applications through the rapid development of improving chemistries. In 2014, faced with a fragmented Illumina assembly for the chloroplast genome of the green algal holobiont Caulerpa ashmeadii, we applied to the MAP to test the prospects of nanopore reads to investigate such intricacies, as well as further explore the hologenome of this species with native and hybrid approaches. RESULTS The chloroplast genome could only be resolved as a circular molecule in nanopore assemblies, which also revealed structural variants (i.e. chloroplast polymorphism or heteroplasmy). Signal and Illumina polishing of nanopore-assembled organelle genomes (chloroplast and mitochondrion) reflected the importance of coverage on final quality and current limitations. In hybrid assembly, our modest nanopore data sets showed encouraging results to improve assembly length, contiguity, repeat content, and binning of the larger nuclear and bacterial genomes. Profiling of the holobiont with nanopore or Illumina data unveiled a dominant Rhodospirillaceae (Alphaproteobacteria) species among six putative endosymbionts. While very fragmented, the cumulative hybrid assembly length of C. ashmeadii's nuclear genome reached 24.4 Mbp, including 2.1 Mbp in repeat, ranging closely with GenomeScope's estimate (> 26.3 Mbp, including 4.8 Mbp in repeat). CONCLUSION Our findings relying on a very modest number of nanopore R9 reads as compared to current output with newer chemistries demonstrate the promising prospects of the technology for the assembly and profiling of an algal hologenome and resolution of structural variation. The discovery of polymorphic 'chlorotypes' in C. ashmeadii, most likely mediated by homing endonucleases and/or retrohoming by reverse transcriptases, represents the first report of chloroplast heteroplasmy in the siphonous green algae. Improving contiguity of C. ashmeadii's nuclear and bacterial genomes will require deeper nanopore sequencing to greatly increase the coverage of these larger genomic compartments.
Collapse
Affiliation(s)
| | - William E. Schmidt
- Biology Department, University of Louisiana at Lafayette, Louisiana, USA
| | - Hwan Su Yoon
- Department of Biological Sciences, Sungkyunkwan University, Suwon, Korea
| | | | - Suzanne Fredericq
- Biology Department, University of Louisiana at Lafayette, Louisiana, USA
| |
Collapse
|
42
|
Jung H, Winefield C, Bombarely A, Prentis P, Waterhouse P. Tools and Strategies for Long-Read Sequencing and De Novo Assembly of Plant Genomes. TRENDS IN PLANT SCIENCE 2019; 24:700-724. [PMID: 31208890 DOI: 10.1016/j.tplants.2019.05.003] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2019] [Revised: 05/01/2019] [Accepted: 05/10/2019] [Indexed: 05/16/2023]
Abstract
The commercial release of third-generation sequencing technologies (TGSTs), giving long and ultra-long sequencing reads, has stimulated the development of new tools for assembling highly contiguous genome sequences with unprecedented accuracy across complex repeat regions. We survey here a wide range of emerging sequencing platforms and analytical tools for de novo assembly, provide background information for each of their steps, and discuss the spectrum of available options. Our decision tree recommends workflows for the generation of a high-quality genome assembly when used in combination with the specific needs and resources of a project.
Collapse
Affiliation(s)
- Hyungtaek Jung
- Centre for Tropical Crops and Biocommodities, Queensland University of Technology, Brisbane, QLD 4001, Australia.
| | - Christopher Winefield
- Department of Wine, Food, and Molecular Biosciences, Lincoln University, 7647 Christchurch, New Zealand
| | - Aureliano Bombarely
- Department of Bioscience, University of Milan, Milan 20133, Italy; School of Plants and Environmental Sciences, Virginia Tech, Blacksburg, VA 24061, USA
| | - Peter Prentis
- School of Earth, Environmental, and Biological Sciences, Queensland University of Technology, Brisbane, QLD, 4001, Australia
| | - Peter Waterhouse
- Centre for Tropical Crops and Biocommodities, Queensland University of Technology, Brisbane, QLD 4001, Australia; School of Biological Sciences, University of Sydney, Sydney, NSW 2006, Australia.
| |
Collapse
|
43
|
De Coster W, De Rijk P, De Roeck A, De Pooter T, D'Hert S, Strazisar M, Sleegers K, Van Broeckhoven C. Structural variants identified by Oxford Nanopore PromethION sequencing of the human genome. Genome Res 2019; 29:1178-1187. [PMID: 31186302 PMCID: PMC6633254 DOI: 10.1101/gr.244939.118] [Citation(s) in RCA: 92] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 06/06/2019] [Indexed: 01/17/2023]
Abstract
We sequenced the genome of the Yoruban reference individual NA19240 on the long-read sequencing platform Oxford Nanopore PromethION for evaluation and benchmarking of recently published aligners and germline structural variant calling tools, as well as a comparison with the performance of structural variant calling from short-read sequencing data. The structural variant caller Sniffles after NGMLR or minimap2 alignment provides the most accurate results, but additional confidence or sensitivity can be obtained by a combination of multiple variant callers. Sensitive and fast results can be obtained by minimap2 for alignment and a combination of Sniffles and SVIM for variant identification. We describe a scalable workflow for identification, annotation, and characterization of tens of thousands of structural variants from long-read genome sequencing of an individual or population. By discussing the results of this well-characterized reference individual, we provide an approximation of what can be expected in future long-read sequencing studies aiming for structural variant identification.
Collapse
Affiliation(s)
- Wouter De Coster
- Neurodegenerative Brain Diseases Group, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
| | - Peter De Rijk
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
- Neuromics Support Facility, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
| | - Arne De Roeck
- Neurodegenerative Brain Diseases Group, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
| | - Tim De Pooter
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
- Neuromics Support Facility, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
| | - Svenn D'Hert
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
- Neuromics Support Facility, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
| | - Mojca Strazisar
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
- Neuromics Support Facility, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
| | - Kristel Sleegers
- Neurodegenerative Brain Diseases Group, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
| | - Christine Van Broeckhoven
- Neurodegenerative Brain Diseases Group, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
| |
Collapse
|
44
|
Gan HM, Linton SM, Austin CM. Two reads to rule them all: Nanopore long read-guided assembly of the iconic Christmas Island red crab, Gecarcoidea natalis (Pocock, 1888), mitochondrial genome and the challenges of AT-rich mitogenomes. Mar Genomics 2019; 45:64-71. [DOI: 10.1016/j.margen.2019.02.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Revised: 02/10/2019] [Accepted: 02/11/2019] [Indexed: 01/01/2023]
|
45
|
Schalamun M, Nagar R, Kainer D, Beavan E, Eccles D, Rathjen JP, Lanfear R, Schwessinger B. Harnessing the MinION: An example of how to establish long-read sequencing in a laboratory using challenging plant tissue from Eucalyptus pauciflora. Mol Ecol Resour 2019; 19:77-89. [PMID: 30118581 PMCID: PMC7380007 DOI: 10.1111/1755-0998.12938] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Revised: 08/08/2018] [Accepted: 08/10/2018] [Indexed: 11/28/2022]
Abstract
Long-read sequencing technologies are transforming our ability to assemble highly complex genomes. Realizing their full potential is critically reliant on extracting high-quality, high-molecular-weight (HMW) DNA from the organisms of interest. This is especially the case for the portable MinION sequencer which enables all laboratories to undertake their own genome sequencing projects, due to its low entry cost and minimal spatial footprint. One challenge of the MinION is that each group has to independently establish effective protocols for using the instrument, which can be time-consuming and costly. Here, we present a workflow and protocols that enabled us to establish MinION sequencing in our own laboratories, based on optimizing DNA extraction from a challenging plant tissue as a case study. Following the workflow illustrated, we were able to reliably and repeatedly obtain >6.5 Gb of long-read sequencing data with a mean read length of 13 kb and an N50 of 26 kb. Our protocols are open source and can be performed in any laboratory without special equipment. We also illustrate some more elaborate workflows which can increase mean and average read lengths if this is desired. We envision that our workflow for establishing MinION sequencing, including the illustration of potential pitfalls and suggestions of how to adapt it to other tissue types, will be useful to others who plan to establish long-read sequencing in their own laboratories.
Collapse
Affiliation(s)
- Miriam Schalamun
- Research School of BiologyThe Australian National UniversityActonACTAustralia
- Present address:
University of Natural Resources and Life SciencesViennaAustria
| | - Ramawatar Nagar
- Research School of BiologyThe Australian National UniversityActonACTAustralia
| | - David Kainer
- Research School of BiologyThe Australian National UniversityActonACTAustralia
| | - Eleanor Beavan
- Research School of BiologyThe Australian National UniversityActonACTAustralia
| | - David Eccles
- Malaghan Institute of Medical ResearchWellingtonNew Zealand
- Present address:
Malaghan Institute of Medical ResearchWellingtonNew Zealand
| | - John P. Rathjen
- Research School of BiologyThe Australian National UniversityActonACTAustralia
| | - Robert Lanfear
- Research School of BiologyThe Australian National UniversityActonACTAustralia
| | | |
Collapse
|
46
|
Wang W, Schalamun M, Morales-Suarez A, Kainer D, Schwessinger B, Lanfear R. Assembly of chloroplast genomes with long- and short-read data: a comparison of approaches using Eucalyptus pauciflora as a test case. BMC Genomics 2018; 19:977. [PMID: 30594129 PMCID: PMC6311037 DOI: 10.1186/s12864-018-5348-8] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Accepted: 12/03/2018] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Chloroplasts are organelles that conduct photosynthesis in plant and algal cells. The information chloroplast genome contained is widely used in agriculture and studies of evolution and ecology. Correctly assembling chloroplast genomes can be challenging because the chloroplast genome contains a pair of long inverted repeats (10-30 kb). Typically, it is simply assumed that the gross structure of the chloroplast genome matches the most commonly observed structure of two single-copy regions separated by a pair of inverted repeats. The advent of long-read sequencing technologies should remove the need to make this assumption by providing sufficient information to completely span the inverted repeat regions. Yet, long-reads tend to have higher error rates than short-reads, and relatively little is known about the best way to combine long- and short-reads to obtain the most accurate chloroplast genome assemblies. Using Eucalyptus pauciflora, the snow gum, as a test case, we evaluated the effect of multiple parameters, such as different coverage of long-(Oxford nanopore) and short-(Illumina) reads, different long-read lengths, different assembly pipelines, with a view to determining the most accurate and efficient approach to chloroplast genome assembly. RESULTS Hybrid assemblies combining at least 20x coverage of both long-reads and short-reads generated a single contig spanning the entire chloroplast genome with few or no detectable errors. Short-read-only assemblies generated three contigs (the long single copy, short single copy and inverted repeat regions) of the chloroplast genome. These contigs contained few single-base errors but tended to exclude several bases at the beginning or end of each contig. Long-read-only assemblies tended to create multiple contigs with a much higher single-base error rate. The chloroplast genome of Eucalyptus pauciflora is 159,942 bp, contains 131 genes of known function. CONCLUSIONS Our results suggest that very accurate assemblies of chloroplast genomes can be achieved using a combination of at least 20x coverage of long- and short-reads respectively, provided that the long-reads contain at least ~5x coverage of reads longer than the inverted repeat region. We show that further increases in coverage give little or no improvement in accuracy, and that hybrid assemblies are more accurate than long-read-only or short-read-only assemblies.
Collapse
Affiliation(s)
- Weiwen Wang
- Research School of Biology, Australian National University, Canberra, Australia.
| | - Miriam Schalamun
- Research School of Biology, Australian National University, Canberra, Australia.,Institute of Applied Genetics and Cell Biology, University of Natural Resources and Life Sciences, Vienna, Austria
| | | | - David Kainer
- Research School of Biology, Australian National University, Canberra, Australia
| | | | - Robert Lanfear
- Research School of Biology, Australian National University, Canberra, Australia
| |
Collapse
|