1
|
Kawato S, Nozaki R, Kondo H, Hirono I. Integrase-associated niche differentiation of endogenous large DNA viruses in crustaceans. Microbiol Spectr 2024; 12:e0055923. [PMID: 38063384 PMCID: PMC10871703 DOI: 10.1128/spectrum.00559-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 11/15/2023] [Indexed: 01/13/2024] Open
Abstract
IMPORTANCE Crustacean genomes harbor sequences originating from a family of large DNA viruses called nimaviruses, but it is unclear why they are present. We show that endogenous nimaviruses selectively insert into repetitive sequences within the host genome, and this insertion specificity was correlated with different types of integrases, which are DNA recombination enzymes encoded by the nimaviruses themselves. This suggests that endogenous nimaviruses have colonized various genomic niches through the acquisition of integrases with different insertion specificities. Our results point to a novel survival strategy of endogenous large DNA viruses colonizing the host genomes. These findings may clarify the evolution and spread of nimaviruses in crustaceans and lead to measures to control and prevent the spread of pathogenic nimaviruses in aquaculture settings.
Collapse
Affiliation(s)
- Satoshi Kawato
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo, Japan
| | - Reiko Nozaki
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo, Japan
| | - Hidehiro Kondo
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo, Japan
| | - Ikuo Hirono
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo, Japan
| |
Collapse
|
2
|
Rutz C, Bonassin L, Kress A, Francesconi C, Boštjančić LL, Merlat D, Theissinger K, Lecompte O. Abundance and Diversification of Repetitive Elements in Decapoda Genomes. Genes (Basel) 2023; 14:1627. [PMID: 37628678 PMCID: PMC10454600 DOI: 10.3390/genes14081627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 08/05/2023] [Accepted: 08/12/2023] [Indexed: 08/27/2023] Open
Abstract
Repetitive elements are a major component of DNA sequences due to their ability to propagate through the genome. Characterization of Metazoan repetitive profiles is improving; however, current pipelines fail to identify a significant proportion of divergent repeats in non-model organisms. The Decapoda order, for which repeat content analyses are largely lacking, is characterized by extremely variable genome sizes that suggest an important presence of repetitive elements. Here, we developed a new standardized pipeline to annotate repetitive elements in non-model organisms, which we applied to twenty Decapoda and six other Crustacea genomes. Using this new tool, we identified 10% more repetitive elements than standard pipelines. Repetitive elements were more abundant in Decapoda species than in other Crustacea, with a very large number of highly repeated satellite DNA families. Moreover, we demonstrated a high correlation between assembly size and transposable elements and different repeat dynamics between Dendrobranchiata and Reptantia. The patterns of repetitive elements largely reflect the phylogenetic relationships of Decapoda and the distinct evolutionary trajectories within Crustacea. In summary, our results highlight the impact of repetitive elements on genome evolution in Decapoda and the value of our novel annotation pipeline, which will provide a baseline for future comparative analyses.
Collapse
Affiliation(s)
- Christelle Rutz
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| | - Lena Bonassin
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
- Department of Molecular Ecology, Institute for Environmental Sciences, Rhineland-Palatinate Technical University Kaiserslautern Landau, Fortstr. 7, 76829 Landau, Germany
| | - Arnaud Kress
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| | - Caterina Francesconi
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
- Department of Molecular Ecology, Institute for Environmental Sciences, Rhineland-Palatinate Technical University Kaiserslautern Landau, Fortstr. 7, 76829 Landau, Germany
| | - Ljudevit Luka Boštjančić
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
- Department of Molecular Ecology, Institute for Environmental Sciences, Rhineland-Palatinate Technical University Kaiserslautern Landau, Fortstr. 7, 76829 Landau, Germany
| | - Dorine Merlat
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| | - Kathrin Theissinger
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
| | - Odile Lecompte
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| |
Collapse
|
3
|
Kawato S, Nishitsuji K, Arimoto A, Hisata K, Kawamitsu M, Nozaki R, Kondo H, Shinzato C, Ohira T, Satoh N, Shoguchi E, Hirono I. Genome and transcriptome assemblies of the kuruma shrimp, Marsupenaeus japonicus. G3 (Bethesda) 2021; 11:jkab268. [PMID: 34515781 PMCID: PMC8527471 DOI: 10.1093/g3journal/jkab268] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 07/18/2021] [Indexed: 11/25/2022]
Abstract
The kuruma shrimp Marsupenaeus japonicus (order Decapoda, family Penaeidae) is an economically important crustacean that occurs in shallow, warm seas across the Indo-Pacific. Here, using a combination of Illumina and Oxford Nanopore Technologies platforms, we produced a draft genome assembly of M. japonicus (1.70 Gbp; 18,210 scaffolds; scaffold N50 = 234.9 kbp; 34.38% GC, 93.4% BUSCO completeness) and a complete mitochondrial genome sequence (15,969 bp). As with other penaeid shrimp genomes, the M. japonicus genome is extremely rich in simple repeats, which occupies 27.4% of the assembly. A total of 26,381 protein-coding gene models (94.7% BUSCO completeness) were predicted, of which 18,005 genes (68.2%) were assigned functional description by at least one method. We also produced an Illumina-based transcriptome shotgun assembly (40,991 entries; 93.0% BUSCO completeness) and a PacBio Iso-Seq transcriptome assembly (25,415 entries; 67.5% BUSCO completeness). We envision that the M. japonicus genome and transcriptome assemblies will serve as useful resources for the basic research, fisheries management, and breeding programs of M. japonicus.
Collapse
Affiliation(s)
- Satoshi Kawato
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo 108-8477, Japan
| | - Koki Nishitsuji
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0495, Japan
| | - Asuka Arimoto
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0495, Japan
| | - Kanako Hisata
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0495, Japan
| | - Mayumi Kawamitsu
- DNA Sequencing Section, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0495, Japan
| | - Reiko Nozaki
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo 108-8477, Japan
| | - Hidehiro Kondo
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo 108-8477, Japan
| | - Chuya Shinzato
- Atmosphere and Ocean Research Institute, The University of Tokyo, Chiba 277-0882, Japan
| | - Tsuyoshi Ohira
- Faculty of Science, Department of Biological Sciences, Kanagawa University, Kanagawa 221-8686, Japan
| | - Noriyuki Satoh
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0495, Japan
| | - Eiichi Shoguchi
- Marine Genomics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0495, Japan
| | - Ikuo Hirono
- Laboratory of Genome Science, Tokyo University of Marine Science and Technology, Tokyo 108-8477, Japan
| |
Collapse
|
4
|
Uengwetwanit T, Pootakham W, Nookaew I, Sonthirod C, Angthong P, Sittikankaew K, Rungrassamee W, Arayamethakorn S, Wongsurawat T, Jenjaroenpun P, Sangsrakru D, Leelatanawit R, Khudet J, Koehorst JJ, Schaap PJ, Martins dos Santos V, Tangy F, Karoonuthaisiri N. A chromosome-level assembly of the black tiger shrimp (Penaeus monodon) genome facilitates the identification of growth-associated genes. Mol Ecol Resour 2021; 21:1620-1640. [PMID: 33586292 PMCID: PMC8197738 DOI: 10.1111/1755-0998.13357] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2020] [Revised: 01/31/2021] [Accepted: 02/10/2021] [Indexed: 12/13/2022]
Abstract
To salvage marine ecosystems from fishery overexploitation, sustainable and efficient aquaculture must be emphasized. The knowledge obtained from available genome sequence of marine organisms has accelerated marine aquaculture in many cases. The black tiger shrimp (Penaeus monodon) is one of the most prominent cultured penaeid shrimps (Crustacean) with an average annual global production of half a million tons in the last decade. However, its currently available genome assemblies lack the contiguity and completeness required for accurate genome annotation due to the highly repetitive nature of the genome and technical difficulty in extracting high-quality, high-molecular weight DNA. Here, we report the first chromosome-level whole-genome assembly of P. monodon. The combination of long-read Pacific Biosciences (PacBio) and long-range Chicago and Hi-C technologies enabled a successful assembly of this first high-quality genome sequence. The final assembly covered 2.39 Gb (92.3% of the estimated genome size) and contained 44 pseudomolecules, corresponding to the haploid chromosome number. Repetitive elements occupied a substantial portion of the assembly (62.5%), the highest of the figures reported among crustacean species. The availability of this high-quality genome assembly enabled the identification of genes associated with rapid growth in the black tiger shrimp through the comparison of hepatopancreas transcriptome of slow-growing and fast-growing shrimps. The results highlighted several growth-associated genes. Our high-quality genome assembly provides an invaluable resource for genetic improvement and breeding penaeid shrimp in aquaculture. The availability of P. monodon genome enables analyses of ecological impact, environment adaptation and evolution, as well as the role of the genome to protect the ecological resources by promoting sustainable shrimp farming.
Collapse
Affiliation(s)
- Tanaporn Uengwetwanit
- National Center for Genetic Engineering and Biotechnology (BIOTEC)National Science and Technology Development Agency (NSTDA)Pathum Thani12120Thailand
| | - Wirulda Pootakham
- National Omics CenterNational Science and Technology Development AgencyPathum ThaniThailand
| | - Intawat Nookaew
- Department of Biomedical Informatics, College of MedicineUniversity of Arkansas for Medical SciencesLittle RockArkansasUSA
| | - Chutima Sonthirod
- National Omics CenterNational Science and Technology Development AgencyPathum ThaniThailand
| | - Pacharaporn Angthong
- National Center for Genetic Engineering and Biotechnology (BIOTEC)National Science and Technology Development Agency (NSTDA)Pathum Thani12120Thailand
| | - Kanchana Sittikankaew
- National Center for Genetic Engineering and Biotechnology (BIOTEC)National Science and Technology Development Agency (NSTDA)Pathum Thani12120Thailand
| | - Wanilada Rungrassamee
- National Center for Genetic Engineering and Biotechnology (BIOTEC)National Science and Technology Development Agency (NSTDA)Pathum Thani12120Thailand
| | - Sopacha Arayamethakorn
- National Center for Genetic Engineering and Biotechnology (BIOTEC)National Science and Technology Development Agency (NSTDA)Pathum Thani12120Thailand
| | - Thidathip Wongsurawat
- Department of Biomedical Informatics, College of MedicineUniversity of Arkansas for Medical SciencesLittle RockArkansasUSA
- Division of Bioinformatics and Data Management for ResearchDepartment of Research and DevelopmentFaculty of MedicineSiriraj HospitalMahidol UniversityBangkokThailand
| | - Piroon Jenjaroenpun
- Department of Biomedical Informatics, College of MedicineUniversity of Arkansas for Medical SciencesLittle RockArkansasUSA
- Division of Bioinformatics and Data Management for ResearchDepartment of Research and DevelopmentFaculty of MedicineSiriraj HospitalMahidol UniversityBangkokThailand
| | - Duangjai Sangsrakru
- National Omics CenterNational Science and Technology Development AgencyPathum ThaniThailand
| | - Rungnapa Leelatanawit
- National Center for Genetic Engineering and Biotechnology (BIOTEC)National Science and Technology Development Agency (NSTDA)Pathum Thani12120Thailand
| | - Jutatip Khudet
- Shrimp Genetic Improvement CenterIntegrative Aquaculture Biotechnology Research GroupSurat ThaniThailand
| | - Jasper J. Koehorst
- Laboratory of Systems and Synthetic BiologyDepartment of Agrotechnology and Food SciencesWageningen University and ResearchWageningenThe Netherlands
| | - Peter J. Schaap
- Laboratory of Systems and Synthetic BiologyDepartment of Agrotechnology and Food SciencesWageningen University and ResearchWageningenThe Netherlands
| | - Vitor Martins dos Santos
- Laboratory of Systems and Synthetic BiologyDepartment of Agrotechnology and Food SciencesWageningen University and ResearchWageningenThe Netherlands
| | - Frédéric Tangy
- Viral Genomics and Vaccination UnitUMR3569 CNRSVirology DepartmentInstitut PasteurParisFrance
| | - Nitsara Karoonuthaisiri
- National Center for Genetic Engineering and Biotechnology (BIOTEC)National Science and Technology Development Agency (NSTDA)Pathum Thani12120Thailand
| |
Collapse
|
5
|
Abstract
Eukaryotic genome sequencing and de novo assembly, once the exclusive domain of well-funded international consortia, have become increasingly affordable, thus fitting the budgets of individual research groups. Third-generation long-read DNA sequencing technologies are increasingly used, providing extensive genomic toolkits that were once reserved for a few select model organisms. Generating high-quality genome assemblies and annotations for many aquatic species still presents significant challenges due to their large genome sizes, complexity, and high chromosome numbers. Indeed, selecting the most appropriate sequencing and software platforms and annotation pipelines for a new genome project can be daunting because tools often only work in limited contexts. In genomics, generating a high-quality genome assembly/annotation has become an indispensable tool for better understanding the biology of any species. Herein, we state 12 steps to help researchers get started in genome projects by presenting guidelines that are broadly applicable (to any species), sustainable over time, and cover all aspects of genome assembly and annotation projects from start to finish. We review some commonly used approaches, including practical methods to extract high-quality DNA and choices for the best sequencing platforms and library preparations. In addition, we discuss the range of potential bioinformatics pipelines, including structural and functional annotations (e.g., transposable elements and repetitive sequences). This paper also includes information on how to build a wide community for a genome project, the importance of data management, and how to make the data and results Findable, Accessible, Interoperable, and Reusable (FAIR) by submitting them to a public repository and sharing them with the research community.
Collapse
Affiliation(s)
- Hyungtaek Jung
- School of Biological Sciences, The University of Queensland, St Lucia, Queensland, Australia
- Centre for Agriculture and Bioeconomy, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Tomer Ventura
- Genecology Research Centre, School of Science and Engineering, University of the Sunshine Coast, Sippy Downs, Queensland, Australia
| | - J. Sook Chung
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, Maryland, United States of America
| | - Woo-Jin Kim
- Genetics and Breeding Research Center, National Institute of Fisheries Science, Geoje, Korea
| | - Bo-Hye Nam
- Biotechnology Research Division, National Institute of Fisheries Science, Busan, Korea
| | - Hee Jeong Kong
- Biotechnology Research Division, National Institute of Fisheries Science, Busan, Korea
| | - Young-Ok Kim
- Biotechnology Research Division, National Institute of Fisheries Science, Busan, Korea
| | - Min-Seung Jeon
- Department of Life Science, Chung-Ang University, Seoul, Korea
| | - Seong-il Eyun
- Department of Life Science, Chung-Ang University, Seoul, Korea
| |
Collapse
|
6
|
Carducci F, Barucca M, Canapa A, Carotti E, Biscotti MA. Mobile Elements in Ray-Finned Fish Genomes. Life (Basel) 2020; 10:E221. [PMID: 32992841 PMCID: PMC7599744 DOI: 10.3390/life10100221] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Revised: 09/18/2020] [Accepted: 09/22/2020] [Indexed: 12/12/2022] Open
Abstract
Ray-finned fishes (Actinopterygii) are a very diverse group of vertebrates, encompassing species adapted to live in freshwater and marine environments, from the deep sea to high mountain streams. Genome sequencing offers a genetic resource for investigating the molecular bases of this phenotypic diversity and these adaptations to various habitats. The wide range of genome sizes observed in fishes is due to the role of transposable elements (TEs), which are powerful drivers of species diversity. Analyses performed to date provide evidence that class II DNA transposons are the most abundant component in most fish genomes and that compared to other vertebrate genomes, many TE superfamilies are present in actinopterygians. Moreover, specific TEs have been reported in ray-finned fishes as a possible result of an intricate relationship between TE evolution and the environment. The data summarized here underline the biological interest in Actinopterygii as a model group to investigate the mechanisms responsible for the high biodiversity observed in this taxon.
Collapse
Affiliation(s)
| | | | | | | | - Maria Assunta Biscotti
- Dipartimento di Scienze della Vita e dell’Ambiente, Università Politecnica delle Marche, 60131 Ancona, Italy; (F.C.); (M.B.); (A.C.); (E.C.)
| |
Collapse
|