1
|
Selwyn JD, Vollmer SV. Whole genome assembly and annotation of the endangered Caribbean coral Acropora cervicornis. G3 (BETHESDA, MD.) 2023; 13:jkad232. [PMID: 37804092 PMCID: PMC10700113 DOI: 10.1093/g3journal/jkad232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 09/25/2023] [Accepted: 09/27/2023] [Indexed: 10/08/2023]
Abstract
Coral species in the genus Acropora are key ecological components of coral reefs worldwide and represent the most diverse genus of scleractinian corals. While key species of Indo-Pacific Acropora have annotated genomes, no annotated genome has been published for either of the two species of Caribbean Acropora. Here we present the first fully annotated genome of the endangered Caribbean staghorn coral, Acropora cervicornis. We assembled and annotated this genome using high-fidelity nanopore long-read sequencing with gene annotations validated with mRNA sequencing. The assembled genome size is 318 Mb, with 28,059 validated genes. Comparative genomic analyses with other Acropora revealed unique features in A. cervicornis, including contractions in immune pathways and expansions in signaling pathways. Phylogenetic analysis confirms previous findings showing that A. cervicornis diverged from Indo-Pacific relatives around 41 million years ago, with the closure of the western Tethys Sea, prior to the primary radiation of Indo-Pacific Acropora. This new A. cervicornis genome enriches our understanding of the speciose Acropora and addresses evolutionary inquiries concerning speciation and hybridization in this diverse clade.
Collapse
Affiliation(s)
- Jason D Selwyn
- Department of Marine and Environmental Sciences, Northeastern University, Nahant, MA 01908, USA
| | - Steven V Vollmer
- Department of Marine and Environmental Sciences, Northeastern University, Nahant, MA 01908, USA
| |
Collapse
|
2
|
Ramos E, Selleghin-Veiga G, Magpali L, Daros B, Silva F, Picorelli A, Freitas L, Nery MF. Molecular Footprints on Osmoregulation-Related Genes Associated with Freshwater Colonization by Cetaceans and Sirenians. J Mol Evol 2023; 91:865-881. [PMID: 38010516 DOI: 10.1007/s00239-023-10141-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Accepted: 10/29/2023] [Indexed: 11/29/2023]
Abstract
The genetic basis underlying adaptive physiological mechanisms has been extensively explored in mammals after colonizing the seas. However, independent lineages of aquatic mammals exhibit complex patterns of secondary colonization in freshwater environments. This change in habitat represents new osmotic challenges, and additional changes in key systems, such as the osmoregulatory system, are expected. Here, we studied the selective regime on coding and regulatory regions of 20 genes related to the osmoregulation system in strict aquatic mammals from independent evolutionary lineages, cetaceans, and sirenians, with representatives in marine and freshwater aquatic environments. We identified positive selection signals in genes encoding the protein vasopressin (AVP) in mammalian lineages with secondary colonization in the fluvial environment and in aquaporins for lineages inhabiting the marine and fluvial environments. A greater number of sites with positive selection signals were found for the dolphin species compared to the Amazonian manatee. Only the AQP5 and AVP genes showed selection signals in more than one independent lineage of these mammals. Furthermore, the vasopressin gene tree indicates greater similarity in river dolphin sequences despite the independence of their lineages based on the species tree. Patterns of distribution and enrichment of Transcription Factors in the promoter regions of target genes were analyzed and appear to be phylogenetically conserved among sister species. We found accelerated evolution signs in genes ACE, AQP1, AQP5, AQP7, AVP, NPP4, and NPR1 for the fluvial mammals. Together, these results allow a greater understanding of the molecular bases of the evolution of genes responsible for osmotic control in aquatic mammals.
Collapse
Affiliation(s)
- Elisa Ramos
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Giovanna Selleghin-Veiga
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Letícia Magpali
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Beatriz Daros
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Felipe Silva
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Agnello Picorelli
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Lucas Freitas
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Mariana F Nery
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil.
| |
Collapse
|
3
|
Cao L, Dai Z, Tan H, Zheng H, Wang Y, Chen J, Kuang H, Chong RA, Han M, Hu F, Sun W, Sun C, Zhang Z. Population Structure, Demographic History, and Adaptation of Giant Honeybees in China Revealed by Population Genomic Data. Genome Biol Evol 2023; 15:7044694. [PMID: 36799935 PMCID: PMC9991589 DOI: 10.1093/gbe/evad025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Revised: 02/05/2023] [Accepted: 02/11/2023] [Indexed: 02/18/2023] Open
Abstract
There have been many population-based genomic studies on human-managed honeybees (Apis mellifera and Apis cerana), but there has been a notable lack of analysis with regard to wild honeybees, particularly in relation to their evolutionary history. Nevertheless, giant honeybees have been found to occupy distinct habitats and display remarkable characteristics, which are attracting an increased amount of attention. In this study, we de novo sequenced and then assembled the draft genome sequence of the Himalayan giant honeybee, Apis laboriosa. Phylogenetic analysis based on genomic information indicated that A. laboriosa and its tropical sister species Apis dorsata diverged ∼2.61 Ma, which supports the speciation hypothesis that links A. laboriosa to geological changes throughout history. Furthermore, we re-sequenced A. laboriosa and A. dorsata samples from five and six regions, respectively, across their population ranges in China. These analyses highlighted major genetic differences for Tibetan A. laboriosa as well as the Hainan Island A. dorsata. The demographic history of most giant honeybee populations has mirrored glacial cycles. More importantly, contrary to what has occurred among human-managed honeybees, the demographic history of these two wild honeybee species indicates a rapid decline in effective population size in the recent past, reflecting their differences in evolutionary histories. Several genes were found to be subject to selection, which may help giant honeybees to adapt to specific local conditions. In summary, our study sheds light on the evolutionary and adaptational characteristics of two wild giant honeybee species, which was useful for giant honeybee conservation.
Collapse
Affiliation(s)
- Lianfei Cao
- Institute of Animal Husbandry and Veterinary Science, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
| | - Zhijun Dai
- School of Life Sciences, Chongqing University, Chongqing, China
| | - Hongwei Tan
- School of Life Sciences, Chongqing University, Chongqing, China.,Chongqing General Station of Animal Husbandry Technology Extension, Chongqing, China
| | - Huoqing Zheng
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| | - Yun Wang
- School of Life Sciences, Chongqing University, Chongqing, China
| | - Jie Chen
- School of Life Sciences, Chongqing University, Chongqing, China
| | - Haiou Kuang
- Faculty of Animal Science and Technology, Yunnan Agricultural University, Kunming, China
| | - Rebecca A Chong
- School of Life Sciences, University of Hawai'i at Mānoa, Honolulu, Hawai'i, USA
| | - Minjin Han
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Chongqing, China
| | - Fuliang Hu
- College of Animal Sciences, Zhejiang University, Hangzhou, China
| | - Wei Sun
- School of Life Sciences, Chongqing University, Chongqing, China
| | - Cheng Sun
- College of Life Sciences, Capital Normal University, Beijing, China
| | - Ze Zhang
- School of Life Sciences, Chongqing University, Chongqing, China
| |
Collapse
|
4
|
Liedtke HC, Cruz F, Gómez-Garrido J, Fuentes Palacios D, Marcet-Houben M, Gut M, Alioto T, Gabaldón T, Gomez-Mestre I. Chromosome-level assembly, annotation and phylome of Pelobates cultripes, the western spadefoot toad. DNA Res 2022; 29:6588074. [PMID: 35583263 PMCID: PMC9164646 DOI: 10.1093/dnares/dsac013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Accepted: 05/12/2022] [Indexed: 11/14/2022] Open
Abstract
Abstract
Genomic resources for amphibians are still hugely under-represented in vertebrate genomic research, despite being a group of major interest for ecology, evolution and conservation. Amphibians constitute a highly threatened group of vertebrates, present a vast diversity in reproductive modes, are extremely diverse in morphology, occupy most ecoregions of the world, and present the widest range in genome sizes of any major group of vertebrates. We combined Illumina, Nanopore and Hi-C sequencing technologies to assemble a chromosome-level genome sequence for an anuran with a moderate genome size (assembly span 3.09 Gb); Pelobates cultripes, the western spadefoot toad. The genome has an N50 length of 330 Mb with 98.6% of the total sequence length assembled into 14 super scaffolds, and 87.7% complete BUSCO genes. We use published transcriptomic data to provide annotations, identifying 32,684 protein-coding genes. We also reconstruct the P. cultripes phylome and identify 2,527 gene expansions. We contribute the first draft of the genome of the western spadefoot toad, P. cultripes. This species represents a relatively basal lineage in the anuran tree with an interesting ecology and a high degree of developmental plasticity, and thus is an important resource for amphibian genomic research.
Collapse
Affiliation(s)
- Hans Christoph Liedtke
- Ecology, Evolution and Development Group, Department of Wetland Ecology, Estación Biológica de Doñana (CSIC) , 41092 Sevilla, Spain
| | - Fernando Cruz
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST) , 08028 Barcelona, Spain
| | - Jèssica Gómez-Garrido
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST) , 08028 Barcelona, Spain
| | - Diego Fuentes Palacios
- Barcelona Supercomputing Centre (BSC-CNS) , 08034 Barcelona, Spain
- Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology (BIST) , 08028 Barcelona, Spain
| | - Marina Marcet-Houben
- Barcelona Supercomputing Centre (BSC-CNS) , 08034 Barcelona, Spain
- Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology (BIST) , 08028 Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST) , 08028 Barcelona, Spain
| | - Tyler Alioto
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST) , 08028 Barcelona, Spain
- Universitat Pompeu Fabra (UPF) , Barcelona, Spain
| | - Toni Gabaldón
- Barcelona Supercomputing Centre (BSC-CNS) , 08034 Barcelona, Spain
- Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology (BIST) , 08028 Barcelona, Spain
- Catalan Institution for Research and Advanced Studies (ICREA) , Barcelona, Spain
| | - Ivan Gomez-Mestre
- Ecology, Evolution and Development Group, Department of Wetland Ecology, Estación Biológica de Doñana (CSIC) , 41092 Sevilla, Spain
| |
Collapse
|
5
|
Chen Y, Zhang T, Xian M, Zhang R, Yang W, Su B, Yang G, Sun L, Xu W, Xu S, Gao H, Xu L, Gao X, Li J. A draft genome of Drung cattle reveals clues to its chromosomal fusion and environmental adaptation. Commun Biol 2022; 5:353. [PMID: 35418663 PMCID: PMC9008013 DOI: 10.1038/s42003-022-03298-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 03/21/2022] [Indexed: 12/02/2022] Open
Abstract
Drung cattle (Bos frontalis) have 58 chromosomes, differing from the Bos taurus 2n = 60 karyotype. To date, its origin and evolution history have not been proven conclusively, and the mechanisms of chromosome fusion and environmental adaptation have not been clearly elucidated. Here, we assembled a high integrity and good contiguity genome of Drung cattle with 13.7-fold contig N50 and 4.1-fold scaffold N50 improvements over the recently published Indian mithun assembly, respectively. Speciation time estimation and phylogenetic analysis showed that Drung cattle diverged from Bos taurus into an independent evolutionary clade. Sequence evidence of centromere regions provides clues to the breakpoints in BTA2 and BTA28 centromere satellites. We furthermore integrated a circulation and contraction-related biological process involving 43 evolutionary genes that participated in pathways associated with the evolution of the cardiovascular system. These findings may have important implications for understanding the molecular mechanisms of chromosome fusion, alpine valleys adaptability and cardiovascular function.
Collapse
Affiliation(s)
- Yan Chen
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, 100193, Beijing, P.R. China
| | - Tianliu Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, 100193, Beijing, P.R. China
| | - Ming Xian
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, 100193, Beijing, P.R. China
| | - Rui Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, 100193, Beijing, P.R. China
| | - Weifei Yang
- 1 Gene Co., Ltd, 310051, Hangzhou, P.R. China
- Annoroad Gene Technology (Beijing) Co., Ltd, 100176, Beijing, P.R. China
| | - Baqi Su
- Drung Cattle Conservation Farm in Jiudang Wood, Drung and Nu Minority Autonomous County, Gongshan, 673500, Kunming, Yunnan, P.R. China
| | - Guoqiang Yang
- Livestock and Poultry Breed Improvement Center, Nujiang Lisu Minority Autonomous Prefecture, 673199, Kunming, Yunnan, P.R. China
| | - Limin Sun
- Yunnan Animal Husbandry Service, 650224, Kunming, Yunnan, P.R. China
| | - Wenkun Xu
- Yunnan Animal Husbandry Service, 650224, Kunming, Yunnan, P.R. China
| | - Shangzhong Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, 100193, Beijing, P.R. China
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, 100193, Beijing, P.R. China
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, 100193, Beijing, P.R. China
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, 100193, Beijing, P.R. China.
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, 100193, Beijing, P.R. China.
| |
Collapse
|
6
|
Buitrago-López C, Mariappan KG, Cárdenas A, Gegner HM, Voolstra CR. The Genome of the Cauliflower Coral Pocillopora verrucosa. Genome Biol Evol 2021; 12:1911-1917. [PMID: 32857844 PMCID: PMC7594246 DOI: 10.1093/gbe/evaa184] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/24/2020] [Indexed: 02/06/2023] Open
Abstract
Climate change and ocean warming threaten the persistence of corals worldwide. Genomic resources are critical to study the evolutionary trajectory, adaptive potential, and genetic distinctiveness of coral species. Here, we provide a reference genome of the cauliflower coral Pocillopora verrucosa, a broadly prevalent reef-building coral with important ecological roles in the maintenance of reefs across the Red Sea, the Indian Ocean, and the Pacific Ocean. The genome has an assembly size of 380,505,698 bp with a scaffold N50 of 333,696 bp and a contig N50 of 75,704 bp. The annotation of the assembled genome returned 27,439 gene models of which 89.88% have evidence of transcription from RNA-Seq data and 97.87% show homology to known genes. A high proportion of the genome (41.22%) comprised repetitive elements in comparison to other cnidarian genomes, in particular in relation to the small genome size of P. verrucosa.
Collapse
Affiliation(s)
- Carol Buitrago-López
- Red Sea Research Center, Division of Biological BESE, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Kiruthiga G Mariappan
- Red Sea Research Center, Division of Biological BESE, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Anny Cárdenas
- Department of Biology, University of Konstanz, Germany
| | - Hagen M Gegner
- Red Sea Research Center, Division of Biological BESE, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia.,Centre for Organismal Studies (COS), University of Heidelberg, Germany
| | - Christian R Voolstra
- Red Sea Research Center, Division of Biological BESE, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia.,Department of Biology, University of Konstanz, Germany
| |
Collapse
|
7
|
Sieriebriennikov B, Reinberg D, Desplan C. A molecular toolkit for superorganisms. Trends Genet 2021; 37:846-859. [PMID: 34116864 PMCID: PMC8355152 DOI: 10.1016/j.tig.2021.05.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 05/14/2021] [Accepted: 05/17/2021] [Indexed: 12/16/2022]
Abstract
Social insects, such as ants, bees, wasps, and termites, draw biologists' attention due to their distinctive lifestyles. As experimental systems, they provide unique opportunities to study organismal differentiation, division of labor, longevity, and the evolution of development. Ants are particularly attractive because several ant species can be propagated in the laboratory. However, the same lifestyle that makes social insects interesting also hampers the use of molecular genetic techniques. Here, we summarize the efforts of the ant research community to surmount these hurdles and obtain novel mechanistic insight into the biology of social insects. We review current approaches and propose novel ones involving genomics, transcriptomics, chromatin and DNA methylation profiling, RNA interference (RNAi), and genome editing in ants and discuss future experimental strategies.
Collapse
Affiliation(s)
- Bogdan Sieriebriennikov
- Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY, USA; Department of Biology, New York University, New York, NY, USA
| | - Danny Reinberg
- Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY, USA; Howard Hughes Medical Institute, NYU Grossman School of Medicine, New York, NY, USA.
| | - Claude Desplan
- Department of Biology, New York University, New York, NY, USA.
| |
Collapse
|
8
|
Ranavat S, Becher H, Newman MF, Gowda V, Twyford AD. A Draft Genome of the Ginger Species Alpinia nigra and New Insights into the Genetic Basis of Flexistyly. Genes (Basel) 2021; 12:1297. [PMID: 34573279 PMCID: PMC8468202 DOI: 10.3390/genes12091297] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 08/18/2021] [Accepted: 08/20/2021] [Indexed: 11/17/2022] Open
Abstract
Angiosperms possess various strategies to ensure reproductive success, such as stylar polymorphisms that encourage outcrossing. Here, we investigate the genetic basis of one such dimorphism that combines both temporal and spatial separation of sexual function, termed flexistyly. It is a floral strategy characterised by the presence of two morphs that differ in the timing of stylar movement. We performed a de novo assembly of the genome of Alpinia nigra using high-depth genomic sequencing. We then used Pool-seq to identify candidate regions for flexistyly based on allele frequency or coverage differences between pools of anaflexistylous and cataflexistylous morphs. The final genome assembly size was 2 Gb, and showed no evidence of recent polyploidy. The Pool-seq did not reveal large regions with high FST values, suggesting large structural chromosomal polymorphisms are unlikely to underlie differences between morphs. Similarly, no region had a 1:2 mapping depth ratio which would be indicative of hemizygosity. We propose that flexistyly is governed by a small genomic region that might be difficult to detect with Pool-seq, or a complex genomic region that proved difficult to assemble. Our genome will be a valuable resource for future studies of gingers, and provides the first steps towards characterising this complex floral phenotype.
Collapse
Affiliation(s)
- Surabhi Ranavat
- Institute of Evolutionary Biology, University of Edinburgh, Charlotte Auerbach Road, Edinburgh EH9 3FL, UK; (H.B.); (A.D.T.)
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh EH3 5LR, UK;
| | - Hannes Becher
- Institute of Evolutionary Biology, University of Edinburgh, Charlotte Auerbach Road, Edinburgh EH9 3FL, UK; (H.B.); (A.D.T.)
| | - Mark F. Newman
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh EH3 5LR, UK;
| | - Vinita Gowda
- Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal Bypass Road, Bhauri, Bhopal 462 066, Madhya Pradesh, India;
| | - Alex D. Twyford
- Institute of Evolutionary Biology, University of Edinburgh, Charlotte Auerbach Road, Edinburgh EH9 3FL, UK; (H.B.); (A.D.T.)
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh EH3 5LR, UK;
| |
Collapse
|
9
|
Genome Sequence of the Asian Honeybee in Pakistan Sheds Light on Its Phylogenetic Relationship with Other Honeybees. INSECTS 2021; 12:insects12070652. [PMID: 34357312 PMCID: PMC8305923 DOI: 10.3390/insects12070652] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/29/2021] [Revised: 07/06/2021] [Accepted: 07/15/2021] [Indexed: 11/17/2022]
Abstract
Simple Summary The Asian honeybee, Apis cerana, is used for honey production and pollination services in Pakistan. However, its genome sequence is still unknown. We collected A. cerana samples from its main rearing region in Pakistan and performed whole genome sequencing. We obtained a remarkably complete genome sequence for A. cerana in Pakistan, from which we identified a total of 11,864 protein-coding genes. Phylogeny analysis indicated an unexpectedly close relationship between A. cerana in Pakistan and those in China, suggesting a potential human introduction of the species between the two countries. Our results will facilitate the genetic improvement and conservation of A. cerana in Pakistan. Abstract In Pakistan, Apis cerana, the Asian honeybee, has been used for honey production and pollination services. However, its genomic makeup and phylogenetic relationship with those in other countries are still unknown. We collected A. cerana samples from the main cerana-keeping region in Pakistan and performed whole genome sequencing. A total of 28 Gb of Illumina shotgun reads were generated, which were used to assemble the genome. The obtained genome assembly had a total length of 214 Mb, with a GC content of 32.77%. The assembly had a scaffold N50 of 2.85 Mb and a BUSCO completeness score of 99%, suggesting a remarkably complete genome sequence for A. cerana in Pakistan. A MAKER pipeline was employed to annotate the genome sequence, and a total of 11,864 protein-coding genes were identified. Of them, 6750 genes were assigned at least one GO term, and 8813 genes were annotated with at least one protein domain. Genome-scale phylogeny analysis indicated an unexpectedly close relationship between A. cerana in Pakistan and those in China, suggesting a potential human introduction of the species between the two countries. Our results will facilitate the genetic improvement and conservation of A. cerana in Pakistan.
Collapse
|
10
|
Sun C, Huang J, Wang Y, Zhao X, Su L, Thomas GWC, Zhao M, Zhang X, Jungreis I, Kellis M, Vicario S, Sharakhov IV, Bondarenko SM, Hasselmann M, Kim CN, Paten B, Penso-Dolfin L, Wang L, Chang Y, Gao Q, Ma L, Ma L, Zhang Z, Zhang H, Zhang H, Ruzzante L, Robertson HM, Zhu Y, Liu Y, Yang H, Ding L, Wang Q, Ma D, Xu W, Liang C, Itgen MW, Mee L, Cao G, Zhang Z, Sadd BM, Hahn MW, Schaack S, Barribeau SM, Williams PH, Waterhouse RM, Mueller RL. Genus-Wide Characterization of Bumblebee Genomes Provides Insights into Their Evolution and Variation in Ecological and Behavioral Traits. Mol Biol Evol 2021; 38:486-501. [PMID: 32946576 PMCID: PMC7826183 DOI: 10.1093/molbev/msaa240] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Bumblebees are a diverse group of globally important pollinators in natural ecosystems and for agricultural food production. With both eusocial and solitary life-cycle phases, and some social parasite species, they are especially interesting models to understand social evolution, behavior, and ecology. Reports of many species in decline point to pathogen transmission, habitat loss, pesticide usage, and global climate change, as interconnected causes. These threats to bumblebee diversity make our reliance on a handful of well-studied species for agricultural pollination particularly precarious. To broadly sample bumblebee genomic and phenotypic diversity, we de novo sequenced and assembled the genomes of 17 species, representing all 15 subgenera, producing the first genus-wide quantification of genetic and genomic variation potentially underlying key ecological and behavioral traits. The species phylogeny resolves subgenera relationships, whereas incomplete lineage sorting likely drives high levels of gene tree discordance. Five chromosome-level assemblies show a stable 18-chromosome karyotype, with major rearrangements creating 25 chromosomes in social parasites. Differential transposable element activity drives changes in genome sizes, with putative domestications of repetitive sequences influencing gene coding and regulatory potential. Dynamically evolving gene families and signatures of positive selection point to genus-wide variation in processes linked to foraging, diet and metabolism, immunity and detoxification, as well as adaptations for life at high altitudes. Our study reveals how bumblebee genes and genomes have evolved across the Bombus phylogeny and identifies variations potentially linked to key ecological and behavioral traits of these important pollinators.
Collapse
Affiliation(s)
- Cheng Sun
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jiaxing Huang
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yun Wang
- School of Life Sciences, Chongqing University, Chongqing, China
| | - Xiaomeng Zhao
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Long Su
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Gregg W C Thomas
- Division of Biological Sciences, University of Montana, Missoula, MT
| | - Mengya Zhao
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, China
| | - Xingtan Zhang
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Irwin Jungreis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA.,Broad Institute of MIT and Harvard, Cambridge, MA
| | - Manolis Kellis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA.,Broad Institute of MIT and Harvard, Cambridge, MA
| | - Saverio Vicario
- Institute of Atmospheric Pollution Research-Italian National Research Council C/O Department of Physics, University of Bari, Bari, Italy
| | - Igor V Sharakhov
- Department of Entomology, Virginia Polytechnic and State University, Blacksburg, VA.,Department of Cytology and Genetics, Tomsk State University, Tomsk, Russian Federation
| | - Semen M Bondarenko
- Department of Entomology, Virginia Polytechnic and State University, Blacksburg, VA
| | - Martin Hasselmann
- Department of Livestock Population Genomics, Institute of Animal Science, University of Hohenheim, Stuttgart, Germany
| | - Chang N Kim
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA
| | | | - Li Wang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Yuxiao Chang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Qiang Gao
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Ling Ma
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Lina Ma
- China National Center for Bioinformation & Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Zhang Zhang
- China National Center for Bioinformation & Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Hongbo Zhang
- School of Life Sciences, Chongqing University, Chongqing, China
| | - Huahao Zhang
- College of Pharmacy and Life Science, Jiujiang University, Jiujiang, China
| | - Livio Ruzzante
- Department of Ecology and Evolution, University of Lausanne, and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Hugh M Robertson
- Department of Entomology, University of Illinois at Urbana-Champaign, Champaign, IL
| | - Yihui Zhu
- Department of Medical Microbiology and Immunology, Genome Center, and MIND Institute, University of California Davis, Davis, CA
| | - Yanjie Liu
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Huipeng Yang
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lele Ding
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Quangui Wang
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Dongna Ma
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Weilin Xu
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Cheng Liang
- Institute of Sericultural and Apiculture, Yunnan Academy of Agricultural Sciences, Mengzi, China
| | - Michael W Itgen
- Department of Biology, Colorado State University, Fort Collins, CO
| | - Lauren Mee
- Department of Ecology, Evolution and Behaviour, Institute of Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| | - Gang Cao
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, China
| | - Ze Zhang
- School of Life Sciences, Chongqing University, Chongqing, China
| | - Ben M Sadd
- School of Biological Sciences, Illinois State University, Normal, IL
| | - Matthew W Hahn
- Department of Biology, Indiana University, Bloomington, IN.,Department of Computer Science, Indiana University, Bloomington, IN
| | | | - Seth M Barribeau
- Department of Ecology, Evolution and Behaviour, Institute of Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| | - Paul H Williams
- Department of Life Sciences, Natural History Museum, London, United Kingdom
| | - Robert M Waterhouse
- Department of Ecology and Evolution, University of Lausanne, and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | |
Collapse
|
11
|
Seixas FA, Edelman NB, Mallet J. Synteny-Based Genome Assembly for 16 Species of Heliconius Butterflies, and an Assessment of Structural Variation across the Genus. Genome Biol Evol 2021; 13:6207971. [PMID: 33792688 PMCID: PMC8290116 DOI: 10.1093/gbe/evab069] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/29/2021] [Indexed: 12/11/2022] Open
Abstract
Heliconius butterflies (Lepidoptera: Nymphalidae) are a group of 48 neotropical species widely studied in evolutionary research. Despite the wealth of genomic data generated in past years, chromosomal level genome assemblies currently exist for only two species, Heliconius melpomene and Heliconius erato, each a representative of one of the two major clades of the genus. Here, we use these reference genomes to improve the contiguity of previously published draft genome assemblies of 16 Heliconius species. Using a reference-assisted scaffolding approach, we place and order the scaffolds of these genomes onto chromosomes, resulting in 95.7-99.9% of their genomes anchored to chromosomes. Genome sizes are somewhat variable among species (270-422 Mb) and in one small group of species (Heliconius hecale, Heliconius elevatus, and Heliconius pardalinus) expansions in genome size are driven mainly by repetitive sequences that map to four small regions in the H. melpomene reference genome. Genes from these repeat regions show an increase in exon copy number, an absence of internal stop codons, evidence of constraint on nonsynonymous changes, and increased expression, all of which suggest that at least some of the extra copies are functional. Finally, we conducted a systematic search for inversions and identified five moderately large inversions fixed between the two major Heliconius clades. We infer that one of these inversions was transferred by introgression between the lineages leading to the erato/sara and burneyi/doris clades. These reference-guided assemblies represent a major improvement in Heliconius genomic resources that enable further genetic and evolutionary discoveries in this genus.
Collapse
Affiliation(s)
- Fernando A Seixas
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Nathaniel B Edelman
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA.,Yale Institute for Biospheric Studies, Yale University, New Haven, Connecticut, USA
| | - James Mallet
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| |
Collapse
|
12
|
Zhou C, Olukolu B, Gemenet DC, Wu S, Gruneberg W, Cao MD, Fei Z, Zeng ZB, George AW, Khan A, Yencho GC, Coin LJM. Assembly of whole-chromosome pseudomolecules for polyploid plant genomes using outbred mapping populations. Nat Genet 2020; 52:1256-1264. [PMID: 33128049 DOI: 10.1038/s41588-020-00717-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 09/15/2020] [Indexed: 12/31/2022]
Abstract
Despite advances in sequencing technologies, assembly of complex plant genomes remains elusive due to polyploidy and high repeat content. Here we report PolyGembler for grouping and ordering contigs into pseudomolecules by genetic linkage analysis. Our approach also provides an accurate method with which to detect and fix assembly errors. Using simulated data, we demonstrate that our approach is of high accuracy and outperforms three existing state-of-the-art genetic mapping tools. Particularly, our approach is more robust to the presence of missing genotype data and genotyping errors. We used our method to construct pseudomolecules for allotetraploid lawn grass utilizing PacBio long reads in combination with restriction site-associated DNA sequencing, and for diploid Ipomoea trifida and autotetraploid potato utilizing contigs assembled from Illumina reads in combination with genotype data generated by single-nucleotide polymorphism arrays and genotyping by sequencing, respectively. We resolved 13 assembly errors for a published I. trifida genome assembly and anchored eight unplaced scaffolds in the published potato genome.
Collapse
Affiliation(s)
- Chenxi Zhou
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
- Department of Clinical Pathology, University of Melbourne, Melbourne, Victoria, Australia
| | - Bode Olukolu
- Department of Horticultural Science, North Carolina State University, Raleigh, NC, USA
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN, USA
| | - Dorcus C Gemenet
- International Potato Center, Lima, Peru
- CGIAR Excellence in Breeding Platform, International Maize and Wheat Improvement Center, Nairobi, Kenya
| | - Shan Wu
- Boyce Thompson Institute, Cornell University, Ithaca, NY, USA
| | | | - Minh Duc Cao
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
| | - Zhangjun Fei
- Boyce Thompson Institute, Cornell University, Ithaca, NY, USA
| | - Zhao-Bang Zeng
- Department of Statistics, North Carolina State University, Raleigh, NC, USA
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA
| | - Andrew W George
- Data61, Commonwealth Scientific and Industrial Research Organisation, Brisbane, Queensland, Australia
| | - Awais Khan
- International Potato Center, Lima, Peru
- Department of Plant Pathology and Plant-Microbe Biology, Cornell University, Geneva, NY, USA
| | - G Craig Yencho
- Department of Horticultural Science, North Carolina State University, Raleigh, NC, USA
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia.
- Department of Clinical Pathology, University of Melbourne, Melbourne, Victoria, Australia.
- The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia.
| |
Collapse
|
13
|
Etherington GJ, Heavens D, Baker D, Lister A, McNelly R, Garcia G, Clavijo B, Macaulay I, Haerty W, Di Palma F. Sequencing smart: De novo sequencing and assembly approaches for a non-model mammal. Gigascience 2020; 9:5836134. [PMID: 32396200 PMCID: PMC7216774 DOI: 10.1093/gigascience/giaa045] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Revised: 02/28/2020] [Accepted: 04/15/2020] [Indexed: 01/05/2023] Open
Abstract
Background Whilst much sequencing effort has focused on key mammalian model organisms such as mouse and human, little is known about the relationship between genome sequencing techniques for non-model mammals and genome assembly quality. This is especially relevant to non-model mammals, where the samples to be sequenced are often degraded and of low quality. A key aspect when planning a genome project is the choice of sequencing data to generate. This decision is driven by several factors, including the biological questions being asked, the quality of DNA available, and the availability of funds. Cutting-edge sequencing technologies now make it possible to achieve highly contiguous, chromosome-level genome assemblies, but rely on high-quality high molecular weight DNA. However, funding is often insufficient for many independent research groups to use these techniques. Here we use a range of different genomic technologies generated from a roadkill European polecat (Mustela putorius) to assess various assembly techniques on this low-quality sample. We evaluated different approaches for de novo assemblies and discuss their value in relation to biological analyses. Results Generally, assemblies containing more data types achieved better scores in our ranking system. However, when accounting for misassemblies, this was not always the case for Bionano and low-coverage 10x Genomics (for scaffolding only). We also find that the extra cost associated with combining multiple data types is not necessarily associated with better genome assemblies. Conclusions The high degree of variability between each de novo assembly method (assessed from the 7 key metrics) highlights the importance of carefully devising the sequencing strategy to be able to carry out the desired analysis. Adding more data to genome assemblies does not always result in better assemblies, so it is important to understand the nuances of genomic data integration explained here, in order to obtain cost-effective value for money when sequencing genomes.
Collapse
Affiliation(s)
| | - Darren Heavens
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - David Baker
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Ashleigh Lister
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Rose McNelly
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Gonzalo Garcia
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Bernardo Clavijo
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Iain Macaulay
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Wilfried Haerty
- The Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | | |
Collapse
|
14
|
Lindblad-Toh K. What animals can teach us about evolution, the human genome, and human disease. Ups J Med Sci 2020; 125:1-9. [PMID: 32054372 PMCID: PMC7054949 DOI: 10.1080/03009734.2020.1722298] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Accepted: 01/23/2020] [Indexed: 12/14/2022] Open
Abstract
During the past 20 years, since I started as a postdoc, the world of genetics and genomics has changed dramatically. My main research goal throughout my career has been to understand human disease genetics, and I have developed comparative genomics and comparative genetics to generate resources and tools for understanding human disease. Through comparative genomics I have worked to sequence enough mammals to understand the functional potential of each base in the human genome as well as chosen vertebrates to study the evolutionary changes that have given many species their key traits. Through comparative genetics, I have developed the dog as a model for human disease, characterising the genome itself and determining a list of germ-line loci and somatic mutations causing complex diseases and cancer in the dog. Pulling all these findings and resources together opens new doors for understanding genome evolution, the genetics of complex traits and cancer in man and his best friend.
Collapse
Affiliation(s)
- Kerstin Lindblad-Toh
- Department for Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| |
Collapse
|
15
|
Edelman NB, Frandsen PB, Miyagi M, Clavijo B, Davey J, Dikow RB, García-Accinelli G, Van Belleghem SM, Patterson N, Neafsey DE, Challis R, Kumar S, Moreira GRP, Salazar C, Chouteau M, Counterman BA, Papa R, Blaxter M, Reed RD, Dasmahapatra KK, Kronforst M, Joron M, Jiggins CD, McMillan WO, Di Palma F, Blumberg AJ, Wakeley J, Jaffe D, Mallet J. Genomic architecture and introgression shape a butterfly radiation. Science 2019; 366:594-599. [PMID: 31672890 PMCID: PMC7197882 DOI: 10.1126/science.aaw2090] [Citation(s) in RCA: 229] [Impact Index Per Article: 45.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Accepted: 09/16/2019] [Indexed: 12/26/2022]
Abstract
We used 20 de novo genome assemblies to probe the speciation history and architecture of gene flow in rapidly radiating Heliconius butterflies. Our tests to distinguish incomplete lineage sorting from introgression indicate that gene flow has obscured several ancient phylogenetic relationships in this group over large swathes of the genome. Introgressed loci are underrepresented in low-recombination and gene-rich regions, consistent with the purging of foreign alleles more tightly linked to incompatibility loci. Here, we identify a hitherto unknown inversion that traps a color pattern switch locus. We infer that this inversion was transferred between lineages by introgression and is convergent with a similar rearrangement in another part of the genus. These multiple de novo genome sequences enable improved understanding of the importance of introgression and selective processes in adaptive radiation.
Collapse
Affiliation(s)
- Nathaniel B Edelman
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
| | - Paul B Frandsen
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602, USA
- Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC 20560, USA
| | - Miriam Miyagi
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | | | - John Davey
- Bioscience Technology Facility, Department of Biology, University of York, York YO10 5DD, UK
- Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK
| | - Rebecca B Dikow
- Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC 20560, USA
| | | | - Steven M Van Belleghem
- Department of Biology, University of Puerto Rico, Río Piedras Campus, San Juan, PR 00931-3360, Puerto Rico
| | - Nick Patterson
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142 USA
| | - Daniel E Neafsey
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142 USA
- Harvard TH Chan School of Public Health, Boston, MA 02115, USA
| | - Richard Challis
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Sujai Kumar
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JT, UK
| | - Gilson R P Moreira
- Departamento de Zoologia, Universidade Federal do Rio Grande do Sul, Porto Alegre, 91501-970 Brasil
| | - Camilo Salazar
- Biology Program, Faculty of Natural Sciences and Mathematics, Universidad del Rosario, Carrera 24, No. 63C-69, Bogotá D.C. 111221, Colombia
| | - Mathieu Chouteau
- Laboratoire Ecologie, Evolution, Interactions des Systèmes Amazoniens (LEEISA), USR 3456, Université De Guyane, CNRS Guyane, 275 Route de Montabo, 97334 Cayenne, French Guiana
| | - Brian A Counterman
- Department of Biological Sciences, Mississippi State University, Starkville, MS 39762, USA
| | - Riccardo Papa
- Department of Biology, University of Puerto Rico, Río Piedras Campus, San Juan, PR 00931-3360, Puerto Rico
- Molecular Sciences and Research Center, University of Puerto Rico, San Juan, PR 00931-3360, Puerto Rico
| | - Mark Blaxter
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Robert D Reed
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY 14853, USA
| | - Kanchon K Dasmahapatra
- Bioscience Technology Facility, Department of Biology, University of York, York YO10 5DD, UK
| | - Marcus Kronforst
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA
| | - Mathieu Joron
- CEFE, CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, EPHE, IRD, 34090 Montpellier, France
| | - Chris D Jiggins
- Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK
| | - W Owen McMillan
- Smithsonian Tropical Research Institute, Apartado 0843-03092 Panamá, Panama
| | | | - Andrew J Blumberg
- Department of Mathematics, University of Texas, Austin, TX 78712, USA
| | - John Wakeley
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - David Jaffe
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142 USA
- 10x Genomics, Pleasanton, CA 94566, USA
| | - James Mallet
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
16
|
De Novo Assembly Discovered Novel Structures in Genome of Plastids and Revealed Divergent Inverted Repeats in Mammillaria (Cactaceae, Caryophyllales). PLANTS 2019; 8:plants8100392. [PMID: 31581555 PMCID: PMC6843559 DOI: 10.3390/plants8100392] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2019] [Revised: 09/20/2019] [Accepted: 09/22/2019] [Indexed: 11/17/2022]
Abstract
The complete sequence of chloroplast genome (cpDNA) has been documented for single large columnar species of Cactaceae, lacking inverted repeats (IRs). We sequenced cpDNA for seven species of the short-globose cacti of Mammillaria and de novo assembly revealed three novel structures in land plants. These structures have a large single copy (LSC) that is 2.5 to 10 times larger than the small single copy (SSC), and two IRs that contain strong differences in length and gene composition. Structure 1 is distinguished by short IRs of <1 kb composed by rpl23-trnI-CAU-ycf2; with a total length of 110,189 bp and 113 genes. In structure 2, each IR is approximately 7.2 kb and is composed of 11 genes and one Intergenic Spacer-(psbK-trnQ)-trnQ-UUG-rps16-trnK-UUU-matK-trnK-UUU-psbA-trnH-GUG-rpl2-rpl23-trnI-CAU-ycf2; with a total size of 116,175 bp and 120 genes. Structure 3 has divergent IRs of approximately 14.1 kb, where IRA is composed of 20 genes: psbA-trnH-GUG-rpl23-trnI-CAU-ycf2-ndhB-rps7-rps12-trnV-GAC-rrn16-ycf68-trnI-GAU-trnA-AGC-rrn23-rrn4.5-rrn5-trnR-ACG-trnN-GUU-ndhF-rpl32; and IRB is identical to the IRA, but lacks rpl23. This structure has 131 genes and, by pseudogenization, it is shown to have the shortest cpDNA, of just 107,343 bp. Our findings show that Mammillaria bears an unusual structural diversity of cpDNA, which supports the elucidation of the evolutionary processes involved in cacti lineages.
Collapse
|
17
|
Abstract
Affordable, high-throughput DNA sequencing has accelerated the pace of genome assembly over the past decade. Genome assemblies from high-throughput, short-read sequencing, however, are often not as contiguous as the first generation of genome assemblies. Whereas early genome assembly projects were often aided by clone maps or other mapping data, many current assembly projects forego these scaffolding data and only assemble genomes into smaller segments. Recently, new technologies have been invented that allow chromosome-scale assembly at a lower cost and faster speed than traditional methods. Here, we give an overview of the problem of chromosome-scale assembly and traditional methods for tackling this problem. We then review new technologies for chromosome-scale assembly and recent genome projects that used these technologies to create highly contiguous genome assemblies at low cost.
Collapse
Affiliation(s)
- Edward S. Rice
- Department of Biomolecular Engineering, University of California, Santa Cruz, California 95064, USA;,
| | - Richard E. Green
- Department of Biomolecular Engineering, University of California, Santa Cruz, California 95064, USA;,
- Dovetail Genomics, LLC, Santa Cruz, California 95060, USA
| |
Collapse
|
18
|
Ye G, Zhang H, Chen B, Nie S, Liu H, Gao W, Wang H, Gao Y, Gu L. De novo genome assembly of the stress tolerant forest species Casuarina equisetifolia provides insight into secondary growth. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2019; 97:779-794. [PMID: 30427081 DOI: 10.1111/tpj.14159] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Revised: 11/07/2018] [Accepted: 11/09/2018] [Indexed: 05/18/2023]
Abstract
Casuarina equisetifolia (C. equisetifolia), a conifer-like angiosperm with resistance to typhoon and stress tolerance, is mainly cultivated in the coastal areas of Australasia. C. equisetifolia, making it a valuable model to study secondary growth associated genes and stress-tolerance traits. However, the genome sequence is unavailable and therefore wood-associated growth rate and stress resistance at the molecular level is largely unexplored. We therefore constructed a high-quality draft genome sequence of C. equisetifolia by a combination of Illumina second-generation sequencing reads and Pacific Biosciences single-molecule real-time (SMRT) long reads to advance the investigation of this species. Here, we report the genome assembly, which contains approximately 300 megabases (Mb) and scaffold size of N50 is 1.06 Mb. Additionally, gene annotation, assisted by a combination of prediction and RNA-seq data, generated 29 827 annotated protein-coding genes and 1983 non-coding genes, respectively. Furthermore, we found that the total number of repetitive sequences account for one-third of the genome assembly. Here we also construct the genome-wide map of DNA modification, such as two novel forms N6 -adenine (6mA) and N4-methylcytosine (4mC) at the level of single-nucleotide resolution using single-molecule real-time (SMRT) sequencing. Interestingly, we found that 17% of 6mA modification genes and 15% of 4mC modification genes also included alternative splicing events. Finally, we investigated cellulose, hemicellulose, and lignin-related genes, which were associated with secondary growth and contained different DNA modifications. The high-quality genome sequence and annotation of C. equisetifolia in this study provide a valuable resource to strengthen our understanding of the diverse traits of trees.
Collapse
Affiliation(s)
- Gongfu Ye
- Fujian Academy of Forestry Sciences, Fuzhou, Fujian, 350012, China
- Fujian Casuarina Engineering Technology Research Center, Fuzhou, Fujian, 350012, China
| | - Hangxiao Zhang
- Basic Forestry and Proteomics Research Center, College of Forestry, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Bihua Chen
- Fujian Academy of Forestry Sciences, Fuzhou, Fujian, 350012, China
| | - Sen Nie
- Fujian Academy of Forestry Sciences, Fuzhou, Fujian, 350012, China
| | - Hai Liu
- Fujian Forestry Investigations and Planning Institute, Fuzhou, Fujian, 350003, China
| | - Wei Gao
- Fujian Academy of Forestry Sciences, Fuzhou, Fujian, 350012, China
| | - Huiyuan Wang
- Basic Forestry and Proteomics Research Center, College of Forestry, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Yubang Gao
- Basic Forestry and Proteomics Research Center, College of Forestry, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Lianfeng Gu
- Basic Forestry and Proteomics Research Center, College of Forestry, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| |
Collapse
|
19
|
Neller KCM, Diaz CA, Platts AE, Hudak KA. De novo Assembly of the Pokeweed Genome Provides Insight Into Pokeweed Antiviral Protein (PAP) Gene Expression. FRONTIERS IN PLANT SCIENCE 2019; 10:1002. [PMID: 31447869 PMCID: PMC6691146 DOI: 10.3389/fpls.2019.01002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Accepted: 07/17/2019] [Indexed: 05/21/2023]
Abstract
Ribosome-inactivating proteins (RIPs) are RNA glycosidases thought to function in defense against pathogens. These enzymes remove purine bases from RNAs, including rRNA; the latter activity decreases protein synthesis in vitro, which is hypothesized to limit pathogen proliferation by causing host cell death. Pokeweed antiviral protein (PAP) is a RIP synthesized by the American pokeweed plant (Phytolacca americana). PAP inhibits virus infection when expressed in crop plants, yet little is known about the function of PAP in pokeweed due to a lack of genomic tools for this non-model species. In this work, we de novo assembled the pokeweed genome and annotated protein-coding genes. Sequencing comprised paired-end reads from a short-insert library of 83X coverage, and our draft assembly (N50 = 42.5 Kb) accounted for 74% of the measured pokeweed genome size of 1.3 Gb. We obtained 29,773 genes, 73% of which contained known protein domains, and identified several PAP isoforms. Within the gene models of each PAP isoform, a long 5' UTR intron was discovered, which was validated by RT-PCR and sequencing. Presence of the intron stimulated reporter gene expression in tobacco. To gain further understanding of PAP regulation, we complemented this genomic resource with expression profiles of pokeweed plants subjected to stress treatments [jasmonic acid (JA), salicylic acid, polyethylene glycol, and wounding]. Cluster analysis of the top differentially expressed genes indicated that some PAP isoforms shared expression patterns with genes involved in terpenoid biosynthesis, JA-mediated signaling, and metabolism of amino acids and carbohydrates. The newly sequenced promoters of all PAP isoforms contained cis-regulatory elements associated with diverse biotic and abiotic stresses. These elements mediated response to JA in tobacco, based on reporter constructs containing promoter truncations of PAP-I, the most abundant isoform. Taken together, this first genomic resource for the Phytolaccaceae plant family provides new insight into the regulation and function of PAP in pokeweed.
Collapse
Affiliation(s)
| | | | - Adrian E. Platts
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY, United States
| | - Katalin A. Hudak
- Department of Biology, York University, Toronto, ON, Canada
- *Correspondence: Katalin A. Hudak,
| |
Collapse
|
20
|
Puerma E, Orengo DJ, Cruz F, Gómez-Garrido J, Librado P, Salguero D, Papaceit M, Gut M, Segarra C, Alioto TS, Aguadé M. The High-Quality Genome Sequence of the Oceanic Island Endemic Species Drosophila guanche Reveals Signals of Adaptive Evolution in Genes Related to Flight and Genome Stability. Genome Biol Evol 2018; 10:1956-1969. [PMID: 29947749 PMCID: PMC6101566 DOI: 10.1093/gbe/evy135] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/26/2018] [Indexed: 12/18/2022] Open
Abstract
Drosophila guanche is a member of the obscura group that originated in the Canary Islands archipelago upon its colonization by D. subobscura. It evolved into a new species in the laurisilva, a laurel forest present in wet regions that in the islands have only minor long-term weather fluctuations. Oceanic island endemic species such as D. guanche can become model species to investigate not only the relative role of drift and adaptation in speciation processes but also how population size affects nucleotide variation. Moreover, the previous identification of two satellite DNAs in D. guanche makes this species attractive for studying how centromeric DNA evolves. As a prerequisite for its establishment as a model species suitable to address all these questions, we generated a high-quality D. guanche genome sequence composed of 42 cytologically mapped scaffolds, which are assembled into six super-scaffolds (one per chromosome). The comparative analysis of the D. guanche proteome with that of twelve other Drosophila species identified 151 genes that were subject to adaptive evolution in the D. guanche lineage, with a subset of them being involved in flight and genome stability. For example, the Centromere Identifier (CID) protein, directly interacting with centromeric satellite DNA, shows signals of adaptation in this species. Both genomic analyses and FISH of the two satellites would support an ongoing replacement of centromeric satellite DNA in D. guanche.
Collapse
Affiliation(s)
- Eva Puerma
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, i Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Spain
| | - Dorcas J Orengo
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, i Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Spain
| | - Fernando Cruz
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Jèssica Gómez-Garrido
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Pablo Librado
- Centre for GeoGenetics, Natural History Museum of Denmark, Copenhagen, Denmark.,Laboratoire d'Anthropobiologie Moléculaire et d'Imagerie de Synthèse, CNRS UMR 5288, Université de Toulouse, Université Paul Sabatier, France
| | - David Salguero
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, i Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Spain
| | - Montserrat Papaceit
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, i Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Carmen Segarra
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, i Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Spain
| | - Tyler S Alioto
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Montserrat Aguadé
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, i Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Spain
| |
Collapse
|
21
|
Brand P, Robertson HM, Lin W, Pothula R, Klingeman WE, Jurat-Fuentes JL, Johnson BR. The origin of the odorant receptor gene family in insects. eLife 2018; 7:e38340. [PMID: 30063003 PMCID: PMC6080948 DOI: 10.7554/elife.38340] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2018] [Accepted: 07/24/2018] [Indexed: 02/04/2023] Open
Abstract
The origin of the insect odorant receptor (OR) gene family has been hypothesized to have coincided with the evolution of terrestriality in insects. Missbach et al. (2014) suggested that ORs instead evolved with an ancestral OR co-receptor (Orco) after the origin of terrestriality and the OR/Orco system is an adaptation to winged flight in insects. We investigated genomes of the Collembola, Diplura, Archaeognatha, Zygentoma, Odonata, and Ephemeroptera, and find ORs present in all insect genomes but absent from lineages predating the evolution of insects. Orco is absent only in the ancestrally wingless insect lineage Archaeognatha. Our new genome sequence of the zygentoman firebrat Thermobia domestica reveals a full OR/Orco system. We conclude that ORs evolved before winged flight, perhaps as an adaptation to terrestriality, representing a key evolutionary novelty in the ancestor of all insects, and hence a molecular synapomorphy for the Class Insecta.
Collapse
Affiliation(s)
- Philipp Brand
- Department of Evolution and EcologyCenter for Population Biology, University of California, DavisDavisUnited States
| | - Hugh M Robertson
- Department of EntomologyUniversity of Illinois at Urbana-ChampaignUrbanaUnited States
| | - Wei Lin
- Department of Entomology and NematologyUniversity of California, DavisDavisUnited States
| | - Ratnasri Pothula
- Department of Entomology and Plant PathologyUniversity of TennesseeKnoxvilleUnited States
| | | | | | - Brian R Johnson
- Department of Entomology and NematologyUniversity of California, DavisDavisUnited States
| |
Collapse
|
22
|
Choudhury O, Chakrabarty A, Emrich SJ. HECIL: A Hybrid Error Correction Algorithm for Long Reads with Iterative Learning. Sci Rep 2018; 8:9936. [PMID: 29967328 PMCID: PMC6028576 DOI: 10.1038/s41598-018-28364-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Accepted: 05/31/2018] [Indexed: 11/22/2022] Open
Abstract
Second-generation DNA sequencing techniques generate short reads that can result in fragmented genome assemblies. Third-generation sequencing platforms mitigate this limitation by producing longer reads that span across complex and repetitive regions. However, the usefulness of such long reads is limited because of high sequencing error rates. To exploit the full potential of these longer reads, it is imperative to correct the underlying errors. We propose HECIL-Hybrid Error Correction with Iterative Learning-a hybrid error correction framework that determines a correction policy for erroneous long reads, based on optimal combinations of decision weights obtained from short read alignments. We demonstrate that HECIL outperforms state-of-the-art error correction algorithms for an overwhelming majority of evaluation metrics on diverse, real-world data sets including E. coli, S. cerevisiae, and the malaria vector mosquito A. funestus. Additionally, we provide an optional avenue of improving the performance of HECIL's core algorithm by introducing an iterative learning paradigm that enhances the correction policy at each iteration by incorporating knowledge gathered from previous iterations via data-driven confidence metrics assigned to prior corrections.
Collapse
Affiliation(s)
- Olivia Choudhury
- Postdoctoral Researcher, IBM Research, Cambridge, MA, 02142, USA.
| | - Ankush Chakrabarty
- Visiting Research Scientist, Mitsubishi Electric Research Laboratories, Cambridge, MA, 02139, USA
| | - Scott J Emrich
- Associate Professor, Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, 37996, USA
| |
Collapse
|
23
|
Choque E, Klopp C, Valiere S, Raynal J, Mathieu F. Whole-genome sequencing of Aspergillus tubingensis G131 and overview of its secondary metabolism potential. BMC Genomics 2018; 19:200. [PMID: 29703136 PMCID: PMC6389250 DOI: 10.1186/s12864-018-4574-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2017] [Accepted: 03/02/2018] [Indexed: 01/01/2023] Open
Abstract
Background Black Aspergilli represent one of the most important fungal resources of primary and secondary metabolites for biotechnological industry. Having several black Aspergilli sequenced genomes should allow targeting the production of certain metabolites with bioactive properties. Results In this study, we report the draft genome of a black Aspergilli, A. tubingensis G131, isolated from a French Mediterranean vineyard. This 35 Mb genome includes 10,994 predicted genes. A genomic-based discovery identifies 80 secondary metabolites biosynthetic gene clusters. Genomic sequences of these clusters were blasted on 3 chosen black Aspergilli genomes: A. tubingensis CBS 134.48, A. niger CBS 513.88 and A. kawachii IFO 4308. This comparison highlights different levels of clusters conservation between the four strains. It also allows identifying seven unique clusters in A. tubingensis G131. Moreover, the putative secondary metabolites clusters for asperazine and naphtho-gamma-pyrones production were proposed based on this genomic analysis. Key biosynthetic genes required for the production of 2 mycotoxins, ochratoxin A and fumonisin, are absent from this draft genome. Even if intergenic sequences of these mycotoxins biosynthetic pathways are present, this could not lead to the production of those mycotoxins by A. tubingensis G131. Conclusions Functional and bioinformatics analyses of A. tubingensis G131 genome highlight its potential for metabolites production in particular for TAN-1612, asperazine and naphtho-gamma-pyrones presenting antioxidant, anticancer or antibiotic properties. Electronic supplementary material The online version of this article (10.1186/s12864-018-4574-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Elodie Choque
- Université de Toulouse, Laboratoire de Génie Chimique, UMR 5503 CNRS/INPT/UPS, INP-ENSAT, 1, avenue de l'Agrobiopôle, 31326, Castanet-Tolosan, France.,Present address: Unité de Recherche Biologie des Plantes et Innovation (BIOPI-EA 3900), Université de Picardie Jules Verne, 33 rue Saint Leu, 80039, Amiens Cedex, France
| | - Christophe Klopp
- Plate-forme Genotoul Bioinfo, UR875 Biométrie et Intelligence Artificielle, Institut National de la Recherche Agronomique, Castanet-Tolosan, France
| | - Sophie Valiere
- INRA, US 1426, GeT-PlaGe, Genotoul, Castanet-Tolosan, France
| | - José Raynal
- Université de Toulouse, Laboratoire de Génie Chimique, UMR 5503 CNRS/INPT/UPS, INP-ENSAT, 1, avenue de l'Agrobiopôle, 31326, Castanet-Tolosan, France
| | - Florence Mathieu
- Université de Toulouse, Laboratoire de Génie Chimique, UMR 5503 CNRS/INPT/UPS, INP-ENSAT, 1, avenue de l'Agrobiopôle, 31326, Castanet-Tolosan, France.
| |
Collapse
|
24
|
|
25
|
Mays HL, Hung CM, Shaner PJ, Denvir J, Justice M, Yang SF, Roth TL, Oehler DA, Fan J, Rekulapally S, Primerano DA. Genomic Analysis of Demographic History and Ecological Niche Modeling in the Endangered Sumatran Rhinoceros Dicerorhinus sumatrensis. Curr Biol 2017; 28:70-76.e4. [PMID: 29249659 DOI: 10.1016/j.cub.2017.11.021] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2017] [Revised: 10/11/2017] [Accepted: 11/07/2017] [Indexed: 12/30/2022]
Abstract
The vertebrate extinction rate over the past century is approximately 22-100 times greater than background extinction rates [1], and large mammals are particularly at risk [2, 3]. Quaternary megafaunal extinctions have been attributed to climate change [4], overexploitation [5], or a combination of the two [6]. Rhinoceroses (Family: Rhinocerotidae) have a rich fossil history replete with iconic examples of climate-induced extinctions [7], but current pressures threaten to eliminate this group entirely. The Sumatran rhinoceros (Dicerorhinus sumatrensis) is among the most imperiled mammals on earth. The 2011 population was estimated at ≤216 wild individuals [8], and currently the species is extirpated, or nearly so, throughout the majority of its former range [8-12]. Understanding demographic history is important in placing current population status into a broader ecological and evolutionary context. Analysis of the Sumatran rhinoceros genome reveals extreme changes in effective population size throughout the Pleistocene. Population expansion during the early to middle Pleistocene was followed by decline. Ecological niche modeling indicated that changing climate most likely played a role in the decline of the Sumatran rhinoceros, as less suitable habitat on an emergent Sundaland corridor isolated Sumatran rhinoceros populations. By the end of the Pleistocene, the Sundaland corridor was submerged, and populations were fragmented and consequently reduced to low Holocene levels from which they would never recover. Past events denuded the Sumatran rhinoceros of genetic diversity through population decline, fragmentation, or some combination of the two and most likely made the species even more susceptible to later exploitation and habitat loss. VIDEO ABSTRACT.
Collapse
Affiliation(s)
- Herman L Mays
- Marshall University, Department of Biological Sciences, Huntington, WV 25755, USA; Cincinnati Museum Center, Cincinnati, OH 45203, USA.
| | - Chih-Ming Hung
- Academia Sinica, Biodiversity Research Center, Taipei 11529, Taiwan
| | - Pei-Jen Shaner
- National Taiwan Normal University, Department of Life Sciences, Taipei 116, Taiwan
| | - James Denvir
- Marshall University, Department of Biomedical Sciences, Huntington, WV 25755, USA
| | - Megan Justice
- Marshall University, Department of Biomedical Sciences, Huntington, WV 25755, USA
| | - Shang-Fang Yang
- Academia Sinica, Biodiversity Research Center, Taipei 11529, Taiwan
| | - Terri L Roth
- Cincinnati Zoo and Botanical Garden, Center for Conservation and Research of Endangered Wildlife, Cincinnati, OH 45220, USA
| | - David A Oehler
- Wildlife Conservation Society, Bronx Zoo, New York, NY 10460, USA
| | - Jun Fan
- Marshall University, Department of Biomedical Sciences, Huntington, WV 25755, USA
| | | | - Donald A Primerano
- Marshall University, Department of Biomedical Sciences, Huntington, WV 25755, USA
| |
Collapse
|
26
|
Farrer RA, Fisher MC. Describing Genomic and Epigenomic Traits Underpinning Emerging Fungal Pathogens. ADVANCES IN GENETICS 2017; 100:73-140. [PMID: 29153405 DOI: 10.1016/bs.adgen.2017.09.009] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
An unprecedented number of pathogenic fungi are emerging and causing disease in animals and plants, putting the resilience of wild and managed ecosystems in jeopardy. While the past decades have seen an increase in the number of pathogenic fungi, they have also seen the birth of new big data technologies and analytical approaches to tackle these emerging pathogens. We review how the linked fields of genomics and epigenomics are transforming our ability to address the challenge of emerging fungal pathogens. We explore the methodologies and bioinformatic toolkits that currently exist to rapidly analyze the genomes of unknown fungi, then discuss how these data can be used to address key questions that shed light on their epidemiology. We show how genomic approaches are leading a revolution into our understanding of emerging fungal diseases and speculate on future approaches that will transform our ability to tackle this increasingly important class of emerging pathogens.
Collapse
|
27
|
Tamiru M, Natsume S, Takagi H, White B, Yaegashi H, Shimizu M, Yoshida K, Uemura A, Oikawa K, Abe A, Urasaki N, Matsumura H, Babil P, Yamanaka S, Matsumoto R, Muranaka S, Girma G, Lopez-Montes A, Gedil M, Bhattacharjee R, Abberton M, Kumar PL, Rabbi I, Tsujimura M, Terachi T, Haerty W, Corpas M, Kamoun S, Kahl G, Takagi H, Asiedu R, Terauchi R. Genome sequencing of the staple food crop white Guinea yam enables the development of a molecular marker for sex determination. BMC Biol 2017; 15:86. [PMID: 28927400 PMCID: PMC5604175 DOI: 10.1186/s12915-017-0419-x] [Citation(s) in RCA: 78] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2017] [Accepted: 08/10/2017] [Indexed: 11/10/2022] Open
Abstract
Background Root and tuber crops are a major food source in tropical Africa. Among these crops are several species in the monocotyledonous genus Dioscorea collectively known as yam, a staple tuber crop that contributes enormously to the subsistence and socio-cultural lives of millions of people, principally in West and Central Africa. Yam cultivation is constrained by several factors, and yam can be considered a neglected “orphan” crop that would benefit from crop improvement efforts. However, the lack of genetic and genomic tools has impeded the improvement of this staple crop. Results To accelerate marker-assisted breeding of yam, we performed genome analysis of white Guinea yam (Dioscorea rotundata) and assembled a 594-Mb genome, 76.4% of which was distributed among 21 linkage groups. In total, we predicted 26,198 genes. Phylogenetic analyses with 2381 conserved genes revealed that Dioscorea is a unique lineage of monocotyledons distinct from the Poales (rice), Arecales (palm), and Zingiberales (banana). The entire Dioscorea genus is characterized by the occurrence of separate male and female plants (dioecy), a feature that has limited efficient yam breeding. To infer the genetics of sex determination, we performed whole-genome resequencing of bulked segregants (quantitative trait locus sequencing [QTL-seq]) in F1 progeny segregating for male and female plants and identified a genomic region associated with female heterogametic (male = ZZ, female = ZW) sex determination. We further delineated the W locus and used it to develop a molecular marker for sex identification of Guinea yam plants at the seedling stage. Conclusions Guinea yam belongs to a unique and highly differentiated clade of monocotyledons. The genome analyses and sex-linked marker development performed in this study should greatly accelerate marker-assisted breeding of Guinea yam. In addition, our QTL-seq approach can be utilized in genetic studies of other outcrossing crops and organisms with highly heterozygous genomes. Genomic analysis of orphan crops such as yam promotes efforts to improve food security and the sustainability of tropical agriculture. Electronic supplementary material The online version of this article (doi:10.1186/s12915-017-0419-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | - Hiroki Takagi
- Iwate Biotechnology Research Center, Kitakami, Japan
| | | | | | | | | | - Aiko Uemura
- Iwate Biotechnology Research Center, Kitakami, Japan
| | - Kaori Oikawa
- Iwate Biotechnology Research Center, Kitakami, Japan
| | - Akira Abe
- Iwate Biotechnology Research Center, Kitakami, Japan
| | | | | | | | - Shinsuke Yamanaka
- Japan International Research Center for Agricultural Sciences, Tsukuba, Japan
| | - Ryo Matsumoto
- Japan International Research Center for Agricultural Sciences, Tsukuba, Japan
| | - Satoru Muranaka
- Japan International Research Center for Agricultural Sciences, Tsukuba, Japan
| | - Gezahegn Girma
- International Institute of Tropical Agriculture, Ibadan, Nigeria
| | | | - Melaku Gedil
- International Institute of Tropical Agriculture, Ibadan, Nigeria
| | | | - Michael Abberton
- International Institute of Tropical Agriculture, Ibadan, Nigeria
| | - P Lava Kumar
- International Institute of Tropical Agriculture, Ibadan, Nigeria
| | - Ismail Rabbi
- International Institute of Tropical Agriculture, Ibadan, Nigeria
| | | | | | | | | | | | | | - Hiroko Takagi
- Japan International Research Center for Agricultural Sciences, Tsukuba, Japan.
| | - Robert Asiedu
- International Institute of Tropical Agriculture, Ibadan, Nigeria.
| | - Ryohei Terauchi
- Iwate Biotechnology Research Center, Kitakami, Japan. .,Kyoto University, Kyoto, Japan.
| |
Collapse
|
28
|
Zadesenets KS, Ershov NI, Rubtsov NB. Whole-genome sequencing of eukaryotes: From sequencing of DNA fragments to a genome assembly. RUSS J GENET+ 2017. [DOI: 10.1134/s102279541705012x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
29
|
Clavijo BJ, Venturini L, Schudoma C, Accinelli GG, Kaithakottil G, Wright J, Borrill P, Kettleborough G, Heavens D, Chapman H, Lipscombe J, Barker T, Lu FH, McKenzie N, Raats D, Ramirez-Gonzalez RH, Coince A, Peel N, Percival-Alwyn L, Duncan O, Trösch J, Yu G, Bolser DM, Namaati G, Kerhornou A, Spannagl M, Gundlach H, Haberer G, Davey RP, Fosker C, Palma FD, Phillips AL, Millar AH, Kersey PJ, Uauy C, Krasileva KV, Swarbreck D, Bevan MW, Clark MD. An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations. Genome Res 2017; 27:885-896. [PMID: 28420692 PMCID: PMC5411782 DOI: 10.1101/gr.217117.116] [Citation(s) in RCA: 243] [Impact Index Per Article: 34.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 03/14/2017] [Indexed: 01/16/2023]
Abstract
Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - Tom Barker
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
| | - Fu-Hao Lu
- John Innes Centre, Norwich, NR4 7UH, United Kingdom
| | | | - Dina Raats
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
| | | | | | - Ned Peel
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
| | | | - Owen Duncan
- ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
| | - Josua Trösch
- ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
| | - Guotai Yu
- John Innes Centre, Norwich, NR4 7UH, United Kingdom
| | - Dan M Bolser
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | - Guy Namaati
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | - Arnaud Kerhornou
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | - Manuel Spannagl
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Heidrun Gundlach
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Georg Haberer
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Robert P Davey
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| | | | - Federica Di Palma
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| | | | - A Harvey Millar
- ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
| | - Paul J Kersey
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | | | - Ksenia V Krasileva
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
- The Sainsbury Laboratory, Norwich, NR4 7UH, United Kingdom
| | - David Swarbreck
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| | | | - Matthew D Clark
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| |
Collapse
|
30
|
Dudchenko O, Batra SS, Omer AD, Nyquist SK, Hoeger M, Durand NC, Shamim MS, Machol I, Lander ES, Aiden AP, Aiden EL. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 2017; 356:92-95. [PMID: 28336562 PMCID: PMC5635820 DOI: 10.1126/science.aal3327] [Citation(s) in RCA: 1149] [Impact Index Per Article: 164.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2016] [Accepted: 03/13/2017] [Indexed: 01/04/2023]
Abstract
The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective way. Here we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67× coverage). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Aeaegypti and Culex quinquefasciatus, each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that almost all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, and accurate, and can be applied to many species.
Collapse
Affiliation(s)
- Olga Dudchenko
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
- Center for Theoretical and Biological Physics, Rice University, Houston, TX 77030, USA
| | - Sanjit S Batra
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
| | - Arina D Omer
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
| | - Sarah K Nyquist
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
| | - Marie Hoeger
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
| | - Neva C Durand
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
| | - Muhammad S Shamim
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
| | - Ido Machol
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
| | - Eric S Lander
- Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA
- Department of Biology, MIT, Cambridge, MA 02139, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Aviva Presser Aiden
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Bioengineering, Rice University, Houston, TX 77030, USA
- Department of Pediatrics, Texas Children's Hospital, Houston, TX 77030, USA
| | - Erez Lieberman Aiden
- The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
- Center for Theoretical and Biological Physics, Rice University, Houston, TX 77030, USA
- Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA
| |
Collapse
|
31
|
Genomic innovation for crop improvement. Nature 2017; 543:346-354. [DOI: 10.1038/nature22011] [Citation(s) in RCA: 222] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2016] [Accepted: 02/01/2017] [Indexed: 12/24/2022]
|
32
|
Abstract
Identifying the genomic changes that control morphological variation and understanding how they generate diversity is a major goal of evolutionary biology. In Heliconius butterflies, a small number of genes control the development of diverse wing color patterns. Here, we used full genome sequencing of individuals across the Heliconius erato radiation and closely related species to characterize genomic variation associated with wing pattern diversity. We show that variation around color pattern genes is highly modular, with narrow genomic intervals associated with specific differences in color and pattern. This modular architecture explains the diversity of color patterns and provides a flexible mechanism for rapid morphological diversification.
Collapse
|
33
|
Abstract
Multiple technologies and software are now available facilitating the de novo sequencing and assembly of any vertebrate genome. Yet the quality of most available sequenced genomes is substantially poorer than that of the golden standard in the field: the human genome. Here, we present a step-by-step protocol for the successful sequencing and assembly of a high-quality snake genome that can be applied to any other reptilian or avian species. We combine the great sequencing depth and accuracy of short reads with the use of different insert size libraries for extended scaffolding followed by optical mapping. We show that this procedure improved the corn snake scaffold N50 from 3.7 kbp to 1.4 Mbp, currently making it one of the snake genomes with the longest scaffolds. Short guidelines are also given on the extraction of long DNA molecules from reptilian blood and the necessary modifications in DNA extraction protocols. This chapter is accompanied by a website ( www.reptilomics.org/stepbystep.html ), where we provide links to the suggested software, examples of input and output files, and running parameters.
Collapse
Affiliation(s)
- Asier Ullate-Agote
- Laboratory of Artificial and Natural Evolution (LANE), Department of Genetics and Evolution, University of Geneva, Sciences III, 30, Quai Ernest-Ansermet, 1211, Geneva, Switzerland
- SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, Geneva, Switzerland
| | - Yingguang Frank Chan
- Friedrich Miescher Laboratory of the Max Planck Society, Spemannstrasse 39, Tübingen, 72076, Germany
| | - Athanasia C Tzika
- Laboratory of Artificial and Natural Evolution (LANE), Department of Genetics and Evolution, University of Geneva, Sciences III, 30, Quai Ernest-Ansermet, 1211, Geneva, Switzerland.
- SIB Swiss Institute of Bioinformatics, Geneva, Switzerland.
- Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, Geneva, Switzerland.
| |
Collapse
|