1
|
Jiang L, Quail MA, Fraser-Govil J, Wang H, Shi X, Oliver K, Mellado Gomez E, Yang F, Ning Z. The Bioinformatic Applications of Hi-C and Linked Reads. GENOMICS, PROTEOMICS & BIOINFORMATICS 2024; 22:qzae048. [PMID: 38905513 PMCID: PMC11580686 DOI: 10.1093/gpbjnl/qzae048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 05/07/2024] [Accepted: 06/19/2024] [Indexed: 06/23/2024]
Abstract
Long-range sequencing grants insight into additional genetic information beyond what can be accessed by both short reads and modern long-read technology. Several new sequencing technologies, such as "Hi-C" and "Linked Reads", produce long-range datasets for high-throughput and high-resolution genome analyses, which are rapidly advancing the field of genome assembly, genome scaffolding, and more comprehensive variant identification. In this review, we focused on five major long-range sequencing technologies: high-throughput chromosome conformation capture (Hi-C), 10X Genomics Linked Reads, haplotagging, transposase enzyme linked long-read sequencing (TELL-seq), and single- tube long fragment read (stLFR). We detailed the mechanisms and data products of the five platforms and their important applications, evaluated the quality of sequencing data from different platforms, and discussed the currently available bioinformatics tools. This work will benefit the selection of appropriate long-range technology for specific biological studies.
Collapse
Affiliation(s)
- Libo Jiang
- School of Life Sciences and Medicine, Shandong University of Technology, Zibo 255049, China
| | - Michael A Quail
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Jack Fraser-Govil
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Haipeng Wang
- School of Life Sciences and Medicine, Shandong University of Technology, Zibo 255049, China
| | - Xuequn Shi
- College of Food Science and Technology, Hainan University, Haikou 570228, China
| | - Karen Oliver
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Esther Mellado Gomez
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Fengtang Yang
- School of Life Sciences and Medicine, Shandong University of Technology, Zibo 255049, China
| | - Zemin Ning
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| |
Collapse
|
2
|
Šimková H, Câmara AS, Mascher M. Hi-C techniques: from genome assemblies to transcription regulation. JOURNAL OF EXPERIMENTAL BOTANY 2024; 75:5357-5365. [PMID: 38430521 DOI: 10.1093/jxb/erae085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 02/28/2024] [Indexed: 03/04/2024]
Abstract
The invention of chromosome conformation capture (3C) techniques, in particular the key method Hi-C providing genome-wide information about chromatin contacts, revolutionized the way we study the three-dimensional organization of the nuclear genome and how it affects transcription, replication, and DNA repair. Because the frequency of chromatin contacts between pairs of genomic segments predictably relates to the distance in the linear genome, the information obtained by Hi-C has also proved useful for scaffolding genomic sequences. Here, we review recent improvements in experimental procedures of Hi-C and its various derivatives, such as Micro-C, HiChIP, and Capture Hi-C. We assess the advantages and limitations of the techniques, and present examples of their use in recent plant studies. We also report on progress in the development of computational tools used in assembling genome sequences.
Collapse
Affiliation(s)
- Hana Šimková
- Institute of Experimental Botany of the Czech Academy of Sciences, Slechtitelu 31, CZ-779 00 Olomouc, Czech Republic
| | - Amanda Souza Câmara
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstrasse 3, Gatersleben, D-06466 Seeland, Germany
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstrasse 3, Gatersleben, D-06466 Seeland, Germany
| |
Collapse
|
3
|
Sylvester T, Hoover Z, Hjelmen CE, Jonika MM, Blackmon LT, Alfieri JM, Johnston JS, Chien S, Esfandani T, Blackmon H. A reference quality genome assembly for the jewel scarab Chrysina gloriosa. G3 (BETHESDA, MD.) 2024; 14:jkae084. [PMID: 38630623 PMCID: PMC11152064 DOI: 10.1093/g3journal/jkae084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 01/23/2024] [Accepted: 03/29/2024] [Indexed: 04/19/2024]
Abstract
The jewel scarab Chrysina gloriosa is one of the most charismatic beetles in the United States and is found from the mountains of West Texas to the Southeastern Arizona sky islands. This species is highly sought by professional and amateur collectors worldwide due to its gleaming metallic coloration. However, the impact of the large-scale collection of this beetle on its populations is unknown, and there is a limited amount of genetic information available to make informed decisions about its conservation. As a first step, we present the genome of C. gloriosa, which we reconstructed using a single female specimen sampled from our ongoing effort to document population connectivity and the demographic history of this beetle. Using a combination of long-read sequencing and Omni-C data, we reconstructed the C. gloriosa genome at a near-chromosome level. Our genome assembly consisted of 454 scaffolds spanning 642 MB, with the 10 largest scaffolds capturing 98% of the genome. The scaffold N50 was 72 MB, and the BUSCO score was 95.5%. This genome assembly will be an essential tool to accelerate understanding C. gloriosa biology and help make informed decisions for the conservation of Chrysina and other species with similar distributions in this region. This genome assembly will further serve as a community resource for comparative genomic analysis.
Collapse
Affiliation(s)
- Terrence Sylvester
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
- Department of Biology, University of Memphis, Memphis, TN 38111, USA
| | - Zachary Hoover
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX 77843, USA
| | - Carl E Hjelmen
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
- Department of Biology, Utah Valley University, Orem, UT 84058, USA
| | - Michelle M Jonika
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
- Interdisciplinary Program in Genetics and Genomics, Texas A&M University, College Station, TX 77843, USA
| | - Leslie T Blackmon
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
| | - James M Alfieri
- Interdisciplinary Program in Ecology and Evolutionary Biology, Texas A&M University, College Station, TX 77843, USA
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712, USA
| | - J Spencer Johnston
- Department of Entomology, Texas A&M University, College Station, TX 77843, USA
| | - Sean Chien
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
| | - Tahmineh Esfandani
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
| | - Heath Blackmon
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
- Interdisciplinary Program in Genetics and Genomics, Texas A&M University, College Station, TX 77843, USA
- Interdisciplinary Program in Ecology and Evolutionary Biology, Texas A&M University, College Station, TX 77843, USA
| |
Collapse
|
4
|
Feng S, DeGrey SP, Guédot C, Schoville SD, Pool JE. Genomic Diversity Illuminates the Environmental Adaptation of Drosophila suzukii. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.03.547576. [PMID: 37461625 PMCID: PMC10349955 DOI: 10.1101/2023.07.03.547576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
Biological invasions carry substantial practical and scientific importance, and represent natural evolutionary experiments on contemporary timescales. Here, we investigated genomic diversity and environmental adaptation of the crop pest Drosophila suzukii using whole-genome sequencing data and environmental metadata for 29 population samples from its native and invasive range. Through a multifaceted analysis of this population genomic data, we increase our understanding of the D. suzukii genome, its diversity and its evolution, and we identify an appropriate genotype-environment association pipeline for our data set. Using this approach, we detect genetic signals of local adaptation associated with nine distinct environmental factors related to altitude, wind speed, precipitation, temperature, and human land use. We uncover unique functional signatures for each environmental variable, such as a prevalence of cuticular genes associated with annual precipitation. We also infer biological commonalities in the adaptation to diverse selective pressures, particularly in terms of the apparent contribution of nervous system evolution to enriched processes (ranging from neuron development to circadian behavior) and to top genes associated with all nine environmental variables. Our findings therefore depict a finer-scale adaptive landscape underlying the rapid invasion success of this agronomically important species.
Collapse
Affiliation(s)
- Siyuan Feng
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, USA
| | - Samuel P. DeGrey
- Department of Entomology, University of Wisconsin-Madison, Madison, WI, USA
| | - Christelle Guédot
- Department of Entomology, University of Wisconsin-Madison, Madison, WI, USA
| | - Sean D. Schoville
- Department of Entomology, University of Wisconsin-Madison, Madison, WI, USA
| | - John E. Pool
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
5
|
Ma X, Lu L, Yao F, Fang M, Wang P, Meng J, Shao K, Sun X, Zhang Y. High-quality genome assembly and multi-omics analysis of pigment synthesis pathway in Auricularia cornea. Front Microbiol 2023; 14:1211795. [PMID: 37396365 PMCID: PMC10308021 DOI: 10.3389/fmicb.2023.1211795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 05/22/2023] [Indexed: 07/04/2023] Open
Abstract
Owing to its great market potential for food and health care, white Auricularia cornea, a rare edible fungus, has received increased attention in recent years. This study presents a high-quality genome assembly of A. cornea and multi-omics analysis of its pigment synthesis pathway. Continuous Long Reads libraries, combined with Hi-C-assisted assembly were used to assemble of white A. cornea. Based on this data, we analyzed the transcriptome and metabolome of purple and white strains during the mycelium, primordium, and fruiting body stages. Finally, we obtained the genome of A.cornea assembled from 13 clusters. Comparative and evolutionary analysis suggests that A.cornea is more closely related to Auricularia subglabra than to Auricularia heimuer. The divergence of white/purple A.cornea occurred approximately 40,000 years ago, and there were numerous inversions and translocations between homologous regions of the two genomes. Purple strain synthesized pigment via the shikimate pathway. The pigment in the fruiting body of A. cornea was γ-glutaminyl-3,4-dihydroxy-benzoate. During pigment synthesis, α-D-glucose-1P, citrate, 2-Oxoglutarate, and glutamate were four important intermediate metabolites, whereas polyphenol oxidase and other 20 enzyme genes were the key enzymes. This study sheds light on the genetic blueprint and evolutionary history of the white A.cornea genome, revealing the mechanism of pigment synthesis in A.cornea. It has important theoretical and practical implications for understanding the evolution of basidiomycetes, molecular breeding of white A.cornea, and deciphering the genetic regulations of edible fungi. Additionally, it provides valuable insights for the study of phenotypic traits in other edible fungi.
Collapse
Affiliation(s)
- Xiaoxu Ma
- Lab of Genetic Breeding of Edible Fungi, Horticultural, College of Horticulture, Jilin Agricultural University, Changchun, China
- Guizhou Academy of Agricultural Sciences, Guizhou Key Laboratory of Edible Fungi Breeding, Guiyang, China
| | - Lixin Lu
- Lab of Genetic Breeding of Edible Fungi, Horticultural, College of Horticulture, Jilin Agricultural University, Changchun, China
| | - Fangjie Yao
- Lab of Genetic Breeding of Edible Fungi, Horticultural, College of Horticulture, Jilin Agricultural University, Changchun, China
- Country Engineering Research Centre of Chinese Ministry of Education for Edible and Medicinal Fungi, Jilin Agricultural University, Changchun, China
| | - Ming Fang
- Lab of Genetic Breeding of Edible Fungi, Horticultural, College of Horticulture, Jilin Agricultural University, Changchun, China
| | - Peng Wang
- Economic Plants Research Insitute, Jilin Academy of Agricultural Sciences, Changchun, China
| | - Jingjing Meng
- Lab of Genetic Breeding of Edible Fungi, Horticultural, College of Horticulture, Jilin Agricultural University, Changchun, China
| | - Kaisheng Shao
- Guizhou Academy of Agricultural Sciences, Guizhou Key Laboratory of Edible Fungi Breeding, Guiyang, China
| | - Xu Sun
- Guizhou Academy of Agricultural Sciences, Guizhou Key Laboratory of Edible Fungi Breeding, Guiyang, China
| | - Youmin Zhang
- Lab of Genetic Breeding of Edible Fungi, Horticultural, College of Horticulture, Jilin Agricultural University, Changchun, China
| |
Collapse
|
6
|
Ludt WB, Corbett EC, Kattawar J, Chakrabarty P, Faircloth BC. A reference genome for Bluegill (Centrarchidae: Lepomis macrochirus). G3 (BETHESDA, MD.) 2023; 13:6997878. [PMID: 36683458 PMCID: PMC9997549 DOI: 10.1093/g3journal/jkad019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 01/09/2023] [Accepted: 01/11/2023] [Indexed: 01/24/2023]
Abstract
North American sunfishes (Family Centrarchidae) are among the most popular sportfish throughout the United States and Canada. Despite the popularity of sunfishes, their ecological importance, and their extensive stocking and aquacultural history, few molecular studies have examined the evolutionary relationships and species boundaries among members of this group, many of which are known to hybridize. Here, we describe a chromosome-scale genome assembly representing Bluegill (Lepomis macrochirus), one of the most widespread centrarchid species. By combining long-read, Oxford Nanopore sequencing data with short-insert, whole-genome and HiC sequence reads, we produced an assembly (Lm_LA_1.1) having a total length of 889 Mb including 1,841 scaffolds and having a scaffold N50 of 36 Mb, L50 of 12, N90 of 29 Mb, and L90 of 22. We detected 99% (eukaryota_odb10) and 98% (actinopterygii_odb10) universal single-copy orthologs (BUSCOs), and ab initio gene prediction performed using this new assembly identified a set of 17,233 genes that were supported by external (OrthoDB v10) data. This new assembly provides an important addition to the growing set of assemblies already available for spiny-rayed fishes (Acanthomorpha), and it will serve as a resource for future studies that focus on the complex evolutionary history of centrarchids.
Collapse
Affiliation(s)
- William B Ludt
- Department of Ichthyology, Natural History Museum of Los Angeles County, Los Angeles, CA 90007, USA
| | - Eamon C Corbett
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Jerry Kattawar
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Prosanta Chakrabarty
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Brant C Faircloth
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| |
Collapse
|
7
|
Wang S, Wang H, Jiang F, Wang A, Liu H, Zhao H, Yang B, Xu D, Zhang Y, Fan W. EndHiC: assemble large contigs into chromosome-level scaffolds using the Hi-C links from contig ends. BMC Bioinformatics 2022; 23:528. [PMID: 36482318 PMCID: PMC9730666 DOI: 10.1186/s12859-022-05087-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 11/28/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND The application of PacBio HiFi and ultra-long ONT reads have enabled huge progress in the contig-level assembly, but it is still challenging to assemble large contigs into chromosomes with available Hi-C scaffolding tools, which count Hi-C links between contigs using the whole or a large part of contig regions. As the Hi-C links of two adjacent contigs concentrate only at the neighbor ends of the contigs, larger contig size will reduce the power to differentiate adjacent (signal) and non-adjacent (noise) contig linkages, leading to a higher rate of mis-assembly. RESULTS We design and develop a novel Hi-C based scaffolding tool EndHiC, which is suitable to assemble large contigs into chromosomal-level scaffolds. The core idea behind EndHiC, which distinguishes it from other Hi-C scaffolding tools, is using Hi-C links only from the most effective regions of contig ends. By this way, the signal neighbor contig linkages and noise non-neighbor contig linkages are separated more clearly. Benefiting from the increased signal to noise ratio, the reciprocal best requirement, as well as the robustness evaluation, EndHiC achieves higher accuracy for scaffolding large contigs compared to existing tools. EndHiC has been successfully applied in the Hi-C scaffolding of simulated data from human, rice and Arabidopsis, and real data from human, great burdock, water spinach, chicory, endive, yacon, and Ipomoea cairica, suggesting that EndHiC can be applied to a broad range of plant and animal genomes. CONCLUSIONS EndHiC is a novel Hi-C scaffolding tool, which is suitable for scaffolding of contig assemblies with contig N50 size near or over 10 Mb and N90 size near or over 1 Mb. EndHiC is efficient both in time and memory, and it is interface-friendly to the users. As more genome projects have been launched and the contig continuity constantly improved, we believe EndHiC has the potential to make a great contribution to the genomics field and liberate the scientists from labor-intensive manual curation works.
Collapse
Affiliation(s)
- Sen Wang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
| | - Hengchao Wang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
| | - Fan Jiang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
| | - Anqi Wang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
| | - Hangwei Liu
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
| | - Hanbo Zhao
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
| | - Boyuan Yang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
| | - Dong Xu
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China
| | - Yan Zhang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China.
| | - Wei Fan
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, Guangdong, China.
| |
Collapse
|
8
|
Geneva AJ, Park S, Bock DG, de Mello PLH, Sarigol F, Tollis M, Donihue CM, Reynolds RG, Feiner N, Rasys AM, Lauderdale JD, Minchey SG, Alcala AJ, Infante CR, Kolbe JJ, Schluter D, Menke DB, Losos JB. Chromosome-scale genome assembly of the brown anole (Anolis sagrei), an emerging model species. Commun Biol 2022; 5:1126. [PMID: 36284162 PMCID: PMC9596491 DOI: 10.1038/s42003-022-04074-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Accepted: 10/06/2022] [Indexed: 12/12/2022] Open
Abstract
Rapid technological improvements are democratizing access to high quality, chromosome-scale genome assemblies. No longer the domain of only the most highly studied model organisms, now non-traditional and emerging model species can be genome-enabled using a combination of sequencing technologies and assembly software. Consequently, old ideas built on sparse sampling across the tree of life have recently been amended in the face of genomic data drawn from a growing number of high-quality reference genomes. Arguably the most valuable are those long-studied species for which much is already known about their biology; what many term emerging model species. Here, we report a highly complete chromosome-scale genome assembly for the brown anole, Anolis sagrei - a lizard species widely studied across a variety of disciplines and for which a high-quality reference genome was long overdue. This assembly exceeds the vast majority of existing reptile and snake genomes in contiguity (N50 = 253.6 Mb) and annotation completeness. Through the analysis of this genome and population resequence data, we examine the history of repetitive element accumulation, identify the X chromosome, and propose a hypothesis for the evolutionary history of fusions between autosomes and the X that led to the sex chromosomes of A. sagrei.
Collapse
Affiliation(s)
- Anthony J Geneva
- Department of Biology, Center for Computational and Integrative Biology, Rutgers University-Camden, Camden, NJ, USA.
| | - Sungdae Park
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - Dan G Bock
- Department of Biology, Washington University in St. Louis, St. Louis, MO, USA
| | - Pietro L H de Mello
- Department of Ecology and Evolutionary Biology and Biodiversity Institute and Natural History Museum, University of Kansas, Lawrence, KS, USA
| | - Fatih Sarigol
- Max Perutz Labs, Medical University of Vienna, Vienna, Austria
| | - Marc Tollis
- School of Informatics, Computing and Cyber Systems, Northern Arizona University, Flagstaff, AZ, USA
| | - Colin M Donihue
- Institute at Brown for Environment and Society, Brown University, Providence, RI, USA
| | - R Graham Reynolds
- Department of Biology, University of North Carolina Asheville, Asheville, NC, USA
| | - Nathalie Feiner
- Department of Biology, Lund University, 223 62, Lund, Sweden
| | - Ashley M Rasys
- Department of Genetics, University of Georgia, Athens, GA, USA
- Department of Cellular Biology, University of Georgia, Athens, GA, USA
| | | | | | - Aaron J Alcala
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - Carlos R Infante
- Department of Integrative Biology, University of Colorado Denver, Denver, CO, USA
| | - Jason J Kolbe
- Department of Biological Sciences, University of Rhode Island, Kingston, RI, USA
| | - Dolph Schluter
- Department of Zoology, University of British Columbia, Vancouver, Canada
| | - Douglas B Menke
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - Jonathan B Losos
- Department of Biology, Washington University in St. Louis, St. Louis, MO, USA
- Living Earth Collaborative, Washington University in St. Louis, St. Louis, MO, USA
| |
Collapse
|
9
|
Yamaguchi K, Kadota M, Nishimura O, Ohishi Y, Naito Y, Kuraku S. Technical considerations in Hi-C scaffolding and evaluation of chromosome-scale genome assemblies. Mol Ecol 2021; 30:5923-5934. [PMID: 34432923 PMCID: PMC9292758 DOI: 10.1111/mec.16146] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 07/28/2021] [Accepted: 08/18/2021] [Indexed: 12/15/2022]
Abstract
The recent development of ecological studies has been fueled by the introduction of massive information based on chromosome-scale genome sequences, even for species for which genetic linkage is not accessible. This was enabled mainly by the application of Hi-C, a method for genome-wide chromosome conformation capture that was originally developed for investigating the long-range interaction of chromatins. Performing genomic scaffolding using Hi-C data is highly resource-demanding and employs elaborate laboratory steps for sample preparation. It starts with building a primary genome sequence assembly as an input, which is followed by computation for genome scaffolding using Hi-C data, requiring careful validation. This article presents technical considerations for obtaining optimal Hi-C scaffolding results and provides a test case of its application to a reptile species, the Madagascar ground gecko (Paroedura picta). Among the metrics that are frequently used for evaluating scaffolding results, we investigate the validity of the completeness assessment of chromosome-scale genome assemblies using single-copy reference orthologues.
Collapse
Affiliation(s)
- Kazuaki Yamaguchi
- Laboratory for PhyloinformaticsRIKEN Center for Biosystems Dynamics ResearchKobeJapan
| | - Mitsutaka Kadota
- Laboratory for PhyloinformaticsRIKEN Center for Biosystems Dynamics ResearchKobeJapan
| | - Osamu Nishimura
- Laboratory for PhyloinformaticsRIKEN Center for Biosystems Dynamics ResearchKobeJapan
| | - Yuta Ohishi
- Laboratory for PhyloinformaticsRIKEN Center for Biosystems Dynamics ResearchKobeJapan
| | - Yuki Naito
- Database Center for Life Science (DBCLS)MishimaJapan
| | - Shigehiro Kuraku
- Laboratory for PhyloinformaticsRIKEN Center for Biosystems Dynamics ResearchKobeJapan
- Molecular Life History LaboratoryNational Institute of GeneticsMishimaJapan
- Department of GeneticsSokendai (Graduate University for Advanced Studies)MishimaJapan
| |
Collapse
|
10
|
Keeling CI, Campbell EO, Batista PD, Shegelski VA, Trevoy SAL, Huber DPW, Janes JK, Sperling FAH. Chromosome-level genome assembly reveals genomic architecture of northern range expansion in the mountain pine beetle, Dendroctonus ponderosae Hopkins (Coleoptera: Curculionidae). Mol Ecol Resour 2021; 22:1149-1167. [PMID: 34637588 DOI: 10.1111/1755-0998.13528] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Revised: 10/04/2021] [Accepted: 10/05/2021] [Indexed: 12/22/2022]
Abstract
Genome sequencing methods and assembly tools have improved dramatically since the 2013 publication of draft genome assemblies for the mountain pine beetle, Dendroctonus ponderosae Hopkins (Coleoptera: Curculionidae). We conducted proximity ligation library sequencing and scaffolding to improve contiguity, and then used linkage mapping and recent bioinformatic tools for correction and further improvement. The new assemblies have dramatically improved contiguity and gaps compared to the originals: N50 values increased 26- to 36-fold, and the number of gaps were reduced by half. Ninety per cent of the content of the assemblies is now contained in 12 and 11 scaffolds for the female and male assemblies, respectively. Based on linkage mapping information, the 12 largest scaffolds in both assemblies represent all 11 autosomal chromosomes and the neo-X chromosome. These assemblies now have nearly chromosome-sized scaffolds and will be instrumental for studying genomic architecture, chromosome evolution, population genomics, functional genomics, and adaptation in this and other pest insects. We also identified regions in two chromosomes, including the ancestral-X portion of the neo-X chromosome, with elevated differentiation between northern and southern Canadian populations.
Collapse
Affiliation(s)
- Christopher I Keeling
- Laurentian Forestry Centre, Canadian Forest Service, Natural Resources Canada, Québec, QC, Canada.,Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, QC, Canada
| | - Erin O Campbell
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Philip D Batista
- Faculty of Environment, University of Northern British Columbia, Prince George, BC, Canada
| | - Victor A Shegelski
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Stephen A L Trevoy
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Dezene P W Huber
- Faculty of Environment, University of Northern British Columbia, Prince George, BC, Canada
| | - Jasmine K Janes
- Biology Department, Vancouver Island University, Nanaimo, BC, Canada.,School of Environmental and Rural Studies, University of New England, Armidale, NSW, Australia
| | - Felix A H Sperling
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
11
|
Tang M, He S, Gong X, Lü P, Taha RH, Chen K. High-Quality de novo Chromosome-Level Genome Assembly of a Single Bombyx mori With BmNPV Resistance by a Combination of PacBio Long-Read Sequencing, Illumina Short-Read Sequencing, and Hi-C Sequencing. Front Genet 2021; 12:718266. [PMID: 34603381 PMCID: PMC8481875 DOI: 10.3389/fgene.2021.718266] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 08/05/2021] [Indexed: 12/17/2022] Open
Abstract
The reference genomes of Bombyx mori (B. mori), Silkworm Knowledge-based database (SilkDB) and SilkBase, have served as the gold standard for nearly two decades. Their use has fundamentally shaped model organisms and accelerated relevant studies on lepidoptera. However, the current reference genomes of B. mori do not accurately represent the full set of genes for any single strain. As new genome-wide sequencing technologies have emerged and the cost of high-throughput sequencing technology has fallen, it is now possible for standard laboratories to perform full-genome assembly for specific strains. Here we present a high-quality de novo chromosome-level genome assembly of a single B. mori with nuclear polyhedrosis virus (BmNPV) resistance through the integration of PacBio long-read sequencing, Illumina short-read sequencing, and Hi-C sequencing. In addition, regular bioinformatics analyses, such as gene family, phylogenetic, and divergence analyses, were performed. The sample was from our unique B. mori species (NB), which has strong inborn resistance to BmNPV. Our genome assembly showed good collinearity with SilkDB and SilkBase and particular regions. To the best of our knowledge, this is the first genome assembly with BmNPV resistance, which should be a more accurate insect model for resistance studies.
Collapse
Affiliation(s)
- Min Tang
- School of Life Sciences, Jiangsu University, Zhenjiang, China
| | - Suqun He
- School of Life Sciences, Jiangsu University, Zhenjiang, China
| | - Xun Gong
- Institute of Clinical Pharmacology, Anhui Medical University, Hefei, China.,Department of Medical Rheumatology, Columbia University, New York, NY, United States
| | - Peng Lü
- School of Life Sciences, Jiangsu University, Zhenjiang, China
| | - Rehab H Taha
- Department of Sericulture, Plant Protection Research Institute, Agricultural Research Center, Giza, Egypt
| | - Keping Chen
- School of Life Sciences, Jiangsu University, Zhenjiang, China
| |
Collapse
|
12
|
Zheng X, Chen D, Chen B, Liang L, Huang Z, Fan W, Chen J, He W, Chen H, Huang L, Chen Y, Zhu J, Xue T. Insights into salvianolic acid B biosynthesis from chromosome-scale assembly of the Salvia bowleyana genome. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2021; 63:1309-1323. [PMID: 33634943 DOI: 10.1111/jipb.13085] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 02/26/2021] [Indexed: 05/21/2023]
Abstract
Salvia bowleyana is a traditional Chinese medicinal plant that is a source of nutritional supplements rich in salvianolic acid B and a potential experimental system for the exploration of salvianolic acid B biosynthesis in the Labiatae. Here, we report a high-quality chromosome-scale genome assembly of S. bowleyana covering 462.44 Mb, with a scaffold N50 value of 57.96 Mb and 44,044 annotated protein-coding genes. Evolutionary analysis revealed an estimated divergence time between S. bowleyana and its close relative S. miltiorrhiza of ~3.94 million years. We also observed evidence of a whole-genome duplication in the S. bowleyana genome. Transcriptome analysis showed that SbPAL1 (PHENYLALANINE AMMONIA-LYASE1) is highly expressed in roots relative to stem and leaves, paralleling the location of salvianolic acid B accumulation. The laccase gene family in S. bowleyana outnumbered their counterparts in both S. miltiorrhiza and Arabidopsis thaliana, suggesting that the gene family has undergone expansion in S. bowleyana. Several laccase genes were also highly expressed in roots, where their encoded proteins may catalyze the oxidative reaction from rosmarinic acid to salvianolic acid B. These findings provide an invaluable genomic resource for understanding salvianolic acid B biosynthesis and its regulation, and will be useful for exploring the evolution of the Labiatae.
Collapse
Affiliation(s)
- Xuehai Zheng
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Duo Chen
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Binghua Chen
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Limin Liang
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Zhen Huang
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Wenfang Fan
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Jiannan Chen
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Wenjin He
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Huibin Chen
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Luqiang Huang
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Youqiang Chen
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Jinmao Zhu
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| | - Ting Xue
- The Public Service Platform for Industrialization Development Technology of Marine Biological Medicine and Products of the State Oceanic Administration, Fujian Key Laboratory of Special Marine Bioresource Sustainable Utilization, Southern Institute of Oceanography, College of Life Sciences, Fujian Normal University, Fuzhou, 350117, China
| |
Collapse
|