1
|
Kelsang GA, Ni L, Zhao Z. Insights from the first chromosome-level genome assembly of the alpine gentian Gentiana straminea Maxim. DNA Res 2024; 31:dsae022. [PMID: 39017645 PMCID: PMC11375616 DOI: 10.1093/dnares/dsae022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 07/10/2024] [Accepted: 07/16/2024] [Indexed: 07/18/2024] Open
Abstract
Gentiana straminea Maxim. is a perennial herb and mainly distributed in the Qinghai-Tibetan Plateau. To adapt to the extreme environment, it has developed particular morphological, physiological, and genetic structures. Also, rich in iridoids, it is one of the original plants of traditional Chinese herb 'Qinjiao'. Herein, we present its first chromosome-level genome sequence assembly and compare it with the genomes of other Gentiana species to facilitate the analysis of genomic characteristics. The assembled genome size of G. straminea was 1.25 Gb, with a contig N50 of 7.5 Mb. A total of 96.08% of the genome sequences was anchored on 13 pseudochromosomes, with a scaffold N50 of 92.70 Mb. A total of 54,310 protein-coding genes were predicted, 80.25% of which were functionally annotated. Comparative genomic analyses indicated that G. straminea experienced two whole-genome duplication events after the γ whole-genome triplication with other eudicots, and it diverged from other Gentiana species at ~3.2 Mya. A total of 142 enzyme-coding genes related to iridoid biosynthesis were identified in its genome. Additionally, we identified differences in the number and expression patterns of iridoid biosynthetic pathway genes in G. straminea compared with two other Gentiana species by integrating whole-genome sequence and transcriptomic analyses.
Collapse
Affiliation(s)
- Gyab Ala Kelsang
- School of Pharmacy, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China
- Mentseekhang, Traditional Tibetan Hospital, Lhasa 850000, China
| | - Lianghong Ni
- School of Pharmacy, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China
| | - Zhili Zhao
- School of Pharmacy, Shanghai University of Traditional Chinese Medicine, Shanghai 201203, China
| |
Collapse
|
2
|
Wang D, Rastas P, Yi X, Löytynoja A, Kivikoski M, Feng X, Reid K, Merilä J. Improved assembly of the Pungitius pungitius reference genome. G3 (BETHESDA, MD.) 2024; 14:jkae126. [PMID: 38861393 PMCID: PMC11304971 DOI: 10.1093/g3journal/jkae126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 05/23/2024] [Accepted: 05/30/2024] [Indexed: 06/13/2024]
Abstract
The nine-spined stickleback (Pungitius pungitius) has been increasingly used as a model system in studies of local adaptation and sex chromosome evolution but its current reference genome assembly is far from perfect, lacking distinct sex chromosomes. We generated an improved assembly of the nine-spined stickleback reference genome (98.3% BUSCO completeness) with the aid of linked-read mapping. While the new assembly (v8) was of similar size as the earlier version (v7), we were able to assign 4.4 times more contigs to the linkage groups and improve the contiguity of the genome. Moreover, the new assembly contains a ∼22.8 Mb Y-linked scaffold (LG22) consisting mainly of previously assigned X-contigs, putative Y-contigs, putative centromere contigs, and highly repetitive elements. The male individual showed an even mapping depth on LG12 (pseudo X chromosome) and LG22 (Y-linked scaffold) in the segregating sites, suggesting near-pure X and Y representation in the v8 assembly. A total of 26,803 genes were annotated, and about 33% of the assembly was found to consist of repetitive elements. The high proportion of repetitive elements in LG22 (53.10%) suggests it can be difficult to assemble the complete sequence of the species' Y chromosome. Nevertheless, the new assembly is a significant improvement over the previous version and should provide a valuable resource for genomic studies of stickleback fishes.
Collapse
Affiliation(s)
- Dandan Wang
- Area of Ecology and Biodiversity, School of Biological Sciences, The University of Hong Kong, 999077, Hong Kong SAR
| | - Pasi Rastas
- Institute of Biotechnology, University of Helsinki, Helsinki FI-00014, Finland
| | - Xueling Yi
- Area of Ecology and Biodiversity, School of Biological Sciences, The University of Hong Kong, 999077, Hong Kong SAR
| | - Ari Löytynoja
- Institute of Biotechnology, University of Helsinki, Helsinki FI-00014, Finland
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki FI-00014, Finland
| | - Mikko Kivikoski
- Ecological Genetics Research Unit, Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki FI-00014, Finland
- Department of Computer Science, University of Helsinki, Helsinki FI-00014, Finland
| | - Xueyun Feng
- Institute of Biotechnology, University of Helsinki, Helsinki FI-00014, Finland
- Ecological Genetics Research Unit, Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki FI-00014, Finland
| | - Kerry Reid
- Area of Ecology and Biodiversity, School of Biological Sciences, The University of Hong Kong, 999077, Hong Kong SAR
| | - Juha Merilä
- Area of Ecology and Biodiversity, School of Biological Sciences, The University of Hong Kong, 999077, Hong Kong SAR
- Ecological Genetics Research Unit, Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki FI-00014, Finland
| |
Collapse
|
3
|
Anderson T, Wheeler TJ. An FPGA-based hardware accelerator supporting sensitive sequence homology filtering with profile hidden Markov models. BMC Bioinformatics 2024; 25:247. [PMID: 39075359 PMCID: PMC11285124 DOI: 10.1186/s12859-024-05879-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 07/23/2024] [Indexed: 07/31/2024] Open
Abstract
BACKGROUND Sequence alignment lies at the heart of genome sequence annotation. While the BLAST suite of alignment tools has long held an important role in alignment-based sequence database search, greater sensitivity is achieved through the use of profile hidden Markov models (pHMMs). Here, we describe an FPGA hardware accelerator, called HAVAC, that targets a key bottleneck step (SSV) in the analysis pipeline of the popular pHMM alignment tool, HMMER. RESULTS The HAVAC kernel calculates the SSV matrix at 1739 GCUPS on a ∼ $3000 Xilinx Alveo U50 FPGA accelerator card, ∼ 227× faster than the optimized SSV implementation in nhmmer. Accounting for PCI-e data transfer data processing, HAVAC is 65× faster than nhmmer's SSV with one thread and 35× faster than nhmmer with four threads, and uses ∼ 31% the energy of a traditional high end Intel CPU. CONCLUSIONS HAVAC demonstrates the potential offered by FPGA hardware accelerators to produce dramatic speed gains in sequence annotation and related bioinformatics applications. Because these computations are performed on a co-processor, the host CPU remains free to simultaneously compute other aspects of the analysis pipeline.
Collapse
Affiliation(s)
- Tim Anderson
- Department of Computer Science, University of Montana, Missoula, MT, USA
| | - Travis J Wheeler
- R. Ken Coit College of Pharmacy, University of Arizona, Tucson, AZ, USA.
| |
Collapse
|
4
|
Wu Y, Sha H, Luo X, Zou G, Liang H. Chromosome-level genome assembly of Plagiognathops microlepis based on PacBio HiFi and Hi-C sequencing. Sci Data 2024; 11:802. [PMID: 39030192 PMCID: PMC11271555 DOI: 10.1038/s41597-024-03645-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Accepted: 07/12/2024] [Indexed: 07/21/2024] Open
Abstract
Plagiognathops microlepis is an economic freshwater fish in the subfamily Xenocyprinae of Cyprinidae. It is widely distributed in the freshwater ecosystem of China, with moderate economic value and broad development prospects. However, the lack of genomic resources has limited our understanding on the genetic basis, phylogenetic status and adaptive evolution strategies of this fish. Here, we assembled a chromosome-level reference genome of P. microlepis by integrating Pacbio HiFi long-reads, Illumina short-reads and Hi-C sequencing data. The size of this genome is 1004.34 Mb with a contig N50 of 38.80 Mb. Using Hi-C sequencing data, 99.59% of the assembled sequences were further anchored to 24 chromosomes. A total of 578.91 Mb repeat sequences and 28,337 protein-coding genes were predicted in the current genome, of which, 26,929 genes were functionally annotated. This genome provides valuable information for investigating the phylogeny and evolutionary history of cyprinid fishes, as well as the genetic basis of adaptive strategies and special traits in P. microlepis.
Collapse
Affiliation(s)
- Yidi Wu
- Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Wuhan, 430223, China
| | - Hang Sha
- Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Wuhan, 430223, China
| | - Xiangzhong Luo
- Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Wuhan, 430223, China
| | - Guiwei Zou
- Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Wuhan, 430223, China
| | - Hongwei Liang
- Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Wuhan, 430223, China.
- Key Laboratory of Aquatic Genomics, Ministry of Agriculture and Rural Affairs, Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Wuhan, 430223, China.
| |
Collapse
|
5
|
Olson DR, Wheeler TJ. ULTRA-Effective Labeling of Repetitive Genomic Sequence. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.03.597269. [PMID: 38895435 PMCID: PMC11185745 DOI: 10.1101/2024.06.03.597269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
In the age of long read sequencing, genomics researchers now have access to accurate repetitive DNA sequence (including satellites) that, due to the limitations of short read sequencing, could previously be observed only as unmappable fragments. Tools that annotate repetitive sequence are now more important than ever, so that we can better understand newly uncovered repetitive sequences, and also so that we can mitigate errors in bioinformatic software caused by those repetitive sequences. To that end, we introduce the 1.0 release of our tool for identifying and annotating locally-repetitive sequence, ULTRA (ULTRA Locates Tandemly Repetitive Areas). ULTRA is fast enough to use as part of an efficient annotation pipeline, produces state-of-the-art reliable coverage of repetitive regions containing many mutations, and provides interpretable statistics and labels for repetitive regions. It released under an open license, and available for download at https://github.com/TravisWheelerLab/ULTRA.
Collapse
Affiliation(s)
- Daniel R. Olson
- Department of Computer Science, University of Montana, Missoula, MT, USA
| | - Travis J. Wheeler
- Department of Computer Science, University of Montana, Missoula, MT, USA
- R. Ken Coit College of Pharmacy, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
6
|
Scheidecker B, Poulain S, Sugimoto M, Arakawa H, Kim SH, Kawanishi T, Kato Y, Danoy M, Nishikawa M, Sakai Y. Mechanobiological stimulation in organ-on-a-chip systems reduces hepatic drug metabolic capacity in favor of regenerative specialization. Biotechnol Bioeng 2024; 121:1435-1452. [PMID: 38184801 DOI: 10.1002/bit.28653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 12/20/2023] [Accepted: 12/22/2023] [Indexed: 01/08/2024]
Abstract
Hepatic physiology depends on the liver's complex structural composition which among others, provides high oxygen supply rates, locally differential oxygen tension, endothelial paracrine signaling, as well as residual hemodynamic shear stress to resident hepatocytes. While functional improvements were shown by implementing these factors into hepatic culture systems, direct cause-effect relationships are often not well characterized-obfuscating their individual contribution in more complex microphysiological systems. By comparing increasingly complex hepatic in vitro culture systems that gradually implement these parameters, we investigate the influence of the cellular microenvironment to overall hepatic functionality in pharmacological applications. Here, hepatocytes were modulated in terms of oxygen tension and supplementation, endothelial coculture, and exposure to fluid shear stress delineated from oxygen influx. Results from transcriptomic and metabolomic evaluation indicate that particularly oxygen supply rates are critical to enhance cellular functionality-with cellular drug metabolism remaining comparable to physiological conditions after prolonged static culture. Endothelial signaling was found to be a major contributor to differential phenotype formation known as metabolic zonation, indicated by WNT pathway activity. Lastly, oxygen-delineated shear stress was identified to direct cellular fate towards increased hepatic plasticity and regenerative phenotypes at the cost of drug metabolic functionality - in line with regenerative effects observed in vivo. With these results, we provide a systematic evaluation of critical parameters and their impact in hepatic systems. Given their adherence to physiological effects in vivo, this highlights the importance of their implementation in biomimetic devices, such as organ-on-a-chip systems. Considering recent advances in basic liver biology, direct translation of physiological structures into in vitro models is a promising strategy to expand the capabilities of pharmacological models.
Collapse
Affiliation(s)
| | - Stéphane Poulain
- Institute of Industrial Science, University of Tokyo, Tokyo, Japan
| | - Masahiro Sugimoto
- Institute for Advanced Biosciences, Keio University, Yamagata, Japan
- Institute of Medical Science, Tokyo Medical University, Tokyo, Japan
| | - Hiroshi Arakawa
- Institute of Medical, Pharmaceutical and Health Sciences, Kanazawa University, Kanazawa, Japan
| | - Soo H Kim
- Institute of Industrial Science, University of Tokyo, Tokyo, Japan
| | - Takumi Kawanishi
- Institute of Medical, Pharmaceutical and Health Sciences, Kanazawa University, Kanazawa, Japan
| | - Yukio Kato
- Institute of Medical, Pharmaceutical and Health Sciences, Kanazawa University, Kanazawa, Japan
| | - Mathieu Danoy
- Department of Chemical System Engineering, University of Tokyo, Tokyo, Japan
| | - Masaki Nishikawa
- Department of Chemical System Engineering, University of Tokyo, Tokyo, Japan
| | - Yasuyuki Sakai
- Department of Chemical System Engineering, University of Tokyo, Tokyo, Japan
| |
Collapse
|
7
|
Chorlton SD. Ten common issues with reference sequence databases and how to mitigate them. FRONTIERS IN BIOINFORMATICS 2024; 4:1278228. [PMID: 38560517 PMCID: PMC10978663 DOI: 10.3389/fbinf.2024.1278228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 03/05/2024] [Indexed: 04/04/2024] Open
Abstract
Metagenomic sequencing has revolutionized our understanding of microbiology. While metagenomic tools and approaches have been extensively evaluated and benchmarked, far less attention has been given to the reference sequence database used in metagenomic classification. Issues with reference sequence databases are pervasive. Database contamination is the most recognized issue in the literature; however, it remains relatively unmitigated in most analyses. Other common issues with reference sequence databases include taxonomic errors, inappropriate inclusion and exclusion criteria, and sequence content errors. This review covers ten common issues with reference sequence databases and the potential downstream consequences of these issues. Mitigation measures are discussed for each issue, including bioinformatic tools and database curation strategies. Together, these strategies present a path towards more accurate, reproducible and translatable metagenomic sequencing.
Collapse
|
8
|
Song B, Buckler ES, Stitzer MC. New whole-genome alignment tools are needed for tapping into plant diversity. TRENDS IN PLANT SCIENCE 2024; 29:355-369. [PMID: 37749022 DOI: 10.1016/j.tplants.2023.08.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 07/19/2023] [Accepted: 08/23/2023] [Indexed: 09/27/2023]
Abstract
Genome alignment is one of the most foundational methods for genome sequence studies. With rapid advances in sequencing and assembly technologies, these newly assembled genomes present challenges for alignment tools to meet the increased complexity and scale. Plant genome alignment is technologically challenging because of frequent whole-genome duplications (WGDs) as well as chromosome rearrangements and fractionation, high nucleotide diversity, widespread structural variation, and high transposable element (TE) activity causing large proportions of repeat elements. We summarize classical pairwise and multiple genome alignment (MGA) methods, and highlight techniques that are widely used or are being developed by the plant research community. We also outline the remaining challenges for precise genome alignment and the interpretation of alignment results in plants.
Collapse
Affiliation(s)
- Baoxing Song
- National Key Laboratory of Wheat Improvement, Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agriculture Sciences in Weifang, Weifang, Shandong 261325, China; Key Laboratory of Maize Biology and Genetic Breeding in Arid Area of Northwest Region of the Ministry of Agriculture, College of Agronomy, Northwest A&F University, Yangling, Shaanxi 712100, China.
| | - Edward S Buckler
- Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; Section of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14853, USA; Agricultural Research Service, United States Department of Agriculture, Ithaca, NY 14853, USA
| | - Michelle C Stitzer
- Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
9
|
Feng X, Merilä J, Löytynoja A. Secondary Contact, Introgressive Hybridization, and Genome Stabilization in Sticklebacks. Mol Biol Evol 2024; 41:msae031. [PMID: 38366566 PMCID: PMC10903534 DOI: 10.1093/molbev/msae031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 12/20/2023] [Accepted: 02/09/2024] [Indexed: 02/18/2024] Open
Abstract
Advances in genomic studies have revealed that hybridization in nature is pervasive and raised questions about the dynamics of different genetic and evolutionary factors following the initial hybridization event. While recent research has proposed that the genomic outcomes of hybridization might be predictable to some extent, many uncertainties remain. With comprehensive whole-genome sequence data, we investigated the genetic introgression between 2 divergent lineages of 9-spined sticklebacks (Pungitius pungitius) in the Baltic Sea. We found that the intensity and direction of selection on the introgressed variation has varied across different genomic elements: while functionally important regions displayed reduced rates of introgression, promoter regions showed enrichment. Despite the general trend of negative selection, we identified specific genomic regions that were enriched for introgressed variants, and within these regions, we detected footprints of selection, indicating adaptive introgression. Geographically, we found the selection against the functional changes to be strongest in the vicinity of the secondary contact zone and weaken as a function of distance from the initial contact. Altogether, the results suggest that the stabilization of introgressed variation in the genomes is a complex, multistage process involving both negative and positive selection. In spite of the predominance of negative selection against introgressed variants, we also found evidence for adaptive introgression variants likely associated with adaptation to Baltic Sea environmental conditions.
Collapse
Affiliation(s)
- Xueyun Feng
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki 00014, Finland
- Institute of Biotechnology, University of Helsinki, Helsinki 00014, Finland
| | - Juha Merilä
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki 00014, Finland
- Area of Ecology and Biodiversity, The School of Biological Sciences, Kadoorie Biological Sciences Building, The University of Hong Kong, Hong Kong, Hong Kong SAR
| | - Ari Löytynoja
- Institute of Biotechnology, University of Helsinki, Helsinki 00014, Finland
| |
Collapse
|
10
|
Viner C, Ishak CA, Johnson J, Walker NJ, Shi H, Sjöberg-Herrera MK, Shen SY, Lardo SM, Adams DJ, Ferguson-Smith AC, De Carvalho DD, Hainer SJ, Bailey TL, Hoffman MM. Modeling methyl-sensitive transcription factor motifs with an expanded epigenetic alphabet. Genome Biol 2024; 25:11. [PMID: 38191487 PMCID: PMC10773111 DOI: 10.1186/s13059-023-03070-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 09/21/2023] [Indexed: 01/10/2024] Open
Abstract
BACKGROUND Transcription factors bind DNA in specific sequence contexts. In addition to distinguishing one nucleobase from another, some transcription factors can distinguish between unmodified and modified bases. Current models of transcription factor binding tend not to take DNA modifications into account, while the recent few that do often have limitations. This makes a comprehensive and accurate profiling of transcription factor affinities difficult. RESULTS Here, we develop methods to identify transcription factor binding sites in modified DNA. Our models expand the standard A/C/G/T DNA alphabet to include cytosine modifications. We develop Cytomod to create modified genomic sequences and we also enhance the MEME Suite, adding the capacity to handle custom alphabets. We adapt the well-established position weight matrix (PWM) model of transcription factor binding affinity to this expanded DNA alphabet. Using these methods, we identify modification-sensitive transcription factor binding motifs. We confirm established binding preferences, such as the preference of ZFP57 and C/EBPβ for methylated motifs and the preference of c-Myc for unmethylated E-box motifs. CONCLUSIONS Using known binding preferences to tune model parameters, we discover novel modified motifs for a wide array of transcription factors. Finally, we validate our binding preference predictions for OCT4 using cleavage under targets and release using nuclease (CUT&RUN) experiments across conventional, methylation-, and hydroxymethylation-enriched sequences. Our approach readily extends to other DNA modifications. As more genome-wide single-base resolution modification data becomes available, we expect that our method will yield insights into altered transcription factor binding affinities across many different modifications.
Collapse
Affiliation(s)
- Coby Viner
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
| | - Charles A Ishak
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Epigenetics and Molecular Carcinogenesis, University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - James Johnson
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Nicolas J Walker
- Department of Genetics, University of Cambridge, Cambridge, England
| | - Hui Shi
- Department of Genetics, University of Cambridge, Cambridge, England
| | - Marcela K Sjöberg-Herrera
- Wellcome Sanger Institute, Cambridge, England
- Faculty of Biological Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Shu Yi Shen
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
| | - Santana M Lardo
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | | | | | - Daniel D De Carvalho
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Sarah J Hainer
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - Timothy L Bailey
- Department of Pharmacology, University of Nevada, Reno, Reno, NV, USA
| | - Michael M Hoffman
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada.
| |
Collapse
|
11
|
Wattam AR, Bowers N, Brettin T, Conrad N, Cucinell C, Davis JJ, Dickerman AW, Dietrich EM, Kenyon RW, Machi D, Mao C, Nguyen M, Olson RD, Overbeek R, Parrello B, Pusch GD, Shukla M, Stevens RL, Vonstein V, Warren AS. Comparative Genomic Analysis of Bacterial Data in BV-BRC: An Example Exploring Antimicrobial Resistance. Methods Mol Biol 2024; 2802:547-571. [PMID: 38819571 DOI: 10.1007/978-1-0716-3838-5_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
As genomic and related data continue to expand, research biologists are often hampered by the computational hurdles required to analyze their data. The National Institute of Allergy and Infectious Diseases (NIAID) established the Bioinformatics Resource Centers (BRC) to assist researchers with their analysis of genome sequence and other omics-related data. Recently, the PAThosystems Resource Integration Center (PATRIC), the Influenza Research Database (IRD), and the Virus Pathogen Database and Analysis Resource (ViPR) BRCs merged to form the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) at https://www.bv-brc.org/ . The combined BV-BRC leverages the functionality of the original resources for bacterial and viral research communities with a unified data model, enhanced web-based visualization and analysis tools, and bioinformatics services. Here we demonstrate how antimicrobial resistance data can be analyzed in the new resource.
Collapse
Affiliation(s)
- Alice R Wattam
- Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA.
| | - Nicole Bowers
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Division of Data Science and Learning, Argonne National Laboratory, Argonne, IL, USA
| | - Thomas Brettin
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Computing, Environment and Life Sciences, Argonne National Laboratory, Argonne, IL, USA
| | - Neal Conrad
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Division of Data Science and Learning, Argonne National Laboratory, Argonne, IL, USA
| | - Clark Cucinell
- Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA
| | - James J Davis
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Division of Data Science and Learning, Argonne National Laboratory, Argonne, IL, USA
| | - Allan W Dickerman
- Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA
| | - Emily M Dietrich
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Division of Data Science and Learning, Argonne National Laboratory, Argonne, IL, USA
| | - Ronald W Kenyon
- Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA
| | - Dustin Machi
- Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA
| | - Chunhong Mao
- Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA
| | - Marcus Nguyen
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Division of Data Science and Learning, Argonne National Laboratory, Argonne, IL, USA
| | - Robert D Olson
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Division of Data Science and Learning, Argonne National Laboratory, Argonne, IL, USA
| | - Ross Overbeek
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Fellowship for Interpretation of Genomes, Burr Ridge, IL, USA
| | - Bruce Parrello
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Fellowship for Interpretation of Genomes, Burr Ridge, IL, USA
| | - Gordon D Pusch
- Fellowship for Interpretation of Genomes, Burr Ridge, IL, USA
| | - Maulik Shukla
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Division of Data Science and Learning, Argonne National Laboratory, Argonne, IL, USA
| | - Rick L Stevens
- Department of Computer Science, University of Chicago, Chicago, IL, USA
| | | | - Andrew S Warren
- Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
12
|
Rosing-Asvid A, Löytynoja A, Momigliano P, Hansen RG, Scharff-Olsen CH, Valtonen M, Kammonen J, Dietz R, Rigét FF, Ferguson SH, Lydersen C, Kovacs KM, Holland DM, Jernvall J, Auvinen P, Tange Olsen M. An evolutionarily distinct ringed seal in the Ilulissat Icefjord. Mol Ecol 2023; 32:5932-5943. [PMID: 37855154 DOI: 10.1111/mec.17163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 09/20/2023] [Accepted: 10/02/2023] [Indexed: 10/20/2023]
Abstract
The Earth's polar regions are low rates of inter- and intraspecific diversification. An extreme mammalian example is the Arctic ringed seal (Pusa hispida hispida), which is assumed to be panmictic across its circumpolar Arctic range. Yet, local Inuit communities in Greenland and Canada recognize several regional variants; a finding supported by scientific studies of body size variation. It is however unclear whether this phenotypic variation reflects plasticity, morphs or distinct ecotypes. Here, we combine genomic, biologging and survey data, to document the existence of a unique ringed seal ecotype in the Ilulissat Icefjord (locally 'Kangia'), Greenland; a UNESCO World Heritage site, which is home to the most productive marine-terminating glacier in the Arctic. Genomic analyses reveal a divergence of Kangia ringed seals from other Arctic ringed seals about 240 kya, followed by secondary contact since the Last Glacial Maximum. Despite ongoing gene flow, multiple genomic regions appear under strong selection in Kangia ringed seals, including candidate genes associated with pelage coloration, growth and osmoregulation, potentially explaining the Kangia seal's phenotypic and behavioural uniqueness. The description of 'hidden' diversity and adaptations in yet another Arctic species merits a reassessment of the evolutionary processes that have shaped Arctic diversity and the traditional view of this region as an evolutionary freezer. Our study highlights the value of indigenous knowledge in guiding science and calls for efforts to identify distinct populations or ecotypes to understand how these might respond differently to environmental change.
Collapse
Affiliation(s)
| | - Ari Löytynoja
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Paolo Momigliano
- Department of Biochemistry, Genetics, and Immunology, Universidade de Vigo, Vigo, Spain
- Area of Ecology and Biodiversity, School of Biological Sciences, The University of Hong Kong, Pok Fu Lam, Hong Kong
| | | | | | - Mia Valtonen
- Wildlife Ecology Group, Natural Resources Institute Finland, Helsinki, Finland
| | - Juhana Kammonen
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Rune Dietz
- Department of Ecoscience, Aarhus University, Roskilde, Denmark
| | | | | | | | - Kit M Kovacs
- Norwegian Polar Institute, Fram Centre, Tromsø, Norway
| | - David M Holland
- Mathematics and Atmosphere/Ocean Science, Courant Institute of Mathematical Sciences, New York University, New York City, New York, USA
| | - Jukka Jernvall
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Petri Auvinen
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Morten Tange Olsen
- Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
13
|
Yuan J, Wang G, Zhao L, Kitchener AC, Sun T, Chen W, Huang C, Wang C, Xu X, Wang J, Lu H, Xu L, Jiangzuo Q, Murphy WJ, Wu D, Li G. How genomic insights into the evolutionary history of clouded leopards inform their conservation. SCIENCE ADVANCES 2023; 9:eadh9143. [PMID: 37801506 PMCID: PMC10558132 DOI: 10.1126/sciadv.adh9143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 09/06/2023] [Indexed: 10/08/2023]
Abstract
Clouded leopards (Neofelis spp.), a morphologically and ecologically distinct lineage of big cats, are severely threatened by habitat loss and fragmentation, targeted hunting, and other human activities. The long-held poor understanding of their genetics and evolution has undermined the effectiveness of conservation actions. Here, we report a comprehensive investigation of the whole genomes, population genetics, and adaptive evolution of Neofelis. Our results indicate the genus Neofelis arose during the Pleistocene, coinciding with glacial-induced climate changes to the distributions of savannas and rainforests, and signatures of natural selection associated with genes functioning in tooth, pigmentation, and tail development, associated with clouded leopards' unique adaptations. Our study highlights high-altitude adaptation as the main factor driving nontaxonomic population differentiation in Neofelis nebulosa. Population declines and inbreeding have led to reduced genetic diversity and the accumulation of deleterious variation that likely affect reproduction of clouded leopards, highlighting the urgent need for effective conservation efforts.
Collapse
Affiliation(s)
- Jiaqing Yuan
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Guiqiang Wang
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Le Zhao
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
- QinLing-Bashan Mountains Bioresources Comprehensive Development C. I. C., School of Bioscience and Engineering, Shaanxi University of Technology, Hanzhong, China
| | - Andrew C. Kitchener
- Department of Natural Sciences, National Museums Scotland, Chambers Street, Edinburgh EH1 1JF, UK
- School of Geosciences, University of Edinburgh, Drummond Street, Edinburgh EH9 3PX, UK
| | - Ting Sun
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Wu Chen
- Guangzhou Zoo, Guangzhou Wildlife Research Center, Guangzhou, China
| | - Chen Huang
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Chen Wang
- Guangzhou Zoo, Guangzhou Wildlife Research Center, Guangzhou, China
| | - Xiao Xu
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Jinhong Wang
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Huimeng Lu
- School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
| | - Lulu Xu
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Qigao Jiangzuo
- Key Laboratory of Vertebrate Evolution and Human Origins of Chinese Academy of Sciences, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing, China
| | - William J. Murphy
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, USA
| | - Dongdong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Natural History Museum of Zoology Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| | - Gang Li
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
- Guangzhou Zoo, Guangzhou Wildlife Research Center, Guangzhou, China
| |
Collapse
|
14
|
Benson CW, Sheltra MR, Maughan PJ, Jellen EN, Robbins MD, Bushman BS, Patterson EL, Hall ND, Huff DR. Homoeologous evolution of the allotetraploid genome of Poa annua L. BMC Genomics 2023; 24:350. [PMID: 37365554 DOI: 10.1186/s12864-023-09456-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 06/15/2023] [Indexed: 06/28/2023] Open
Abstract
BACKGROUND Poa annua (annual bluegrass) is an allotetraploid turfgrass, an agronomically significant weed, and one of the most widely dispersed plant species on earth. Here, we report the chromosome-scale genome assemblies of P. annua's diploid progenitors, P. infirma and P. supina, and use multi-omic analyses spanning all three species to better understand P. annua's evolutionary novelty. RESULTS We find that the diploids diverged from their common ancestor 5.5 - 6.3 million years ago and hybridized to form P. annua ≤ 50,000 years ago. The diploid genomes are similar in chromosome structure and most notably distinguished by the divergent evolutionary histories of their transposable elements, leading to a 1.7 × difference in genome size. In allotetraploid P. annua, we find biased movement of retrotransposons from the larger (A) subgenome to the smaller (B) subgenome. We show that P. annua's B subgenome is preferentially accumulating genes and that its genes are more highly expressed. Whole-genome resequencing of several additional P. annua accessions revealed large-scale chromosomal rearrangements characterized by extensive TE-downsizing and evidence to support the Genome Balance Hypothesis. CONCLUSIONS The divergent evolutions of the diploid progenitors played a central role in conferring onto P. annua its remarkable phenotypic plasticity. We find that plant genes (guided by selection and drift) and transposable elements (mostly guided by host immunity) each respond to polyploidy in unique ways and that P. annua uses whole-genome duplication to purge highly parasitized heterochromatic sequences. The findings and genomic resources presented here will enable the development of homoeolog-specific markers for accelerated weed science and turfgrass breeding.
Collapse
Affiliation(s)
- Christopher W Benson
- Department of Plant Science, Pennsylvania State University, University Park, PA, USA.
- Intercollegiate Graduate Degree Program in Plant Biology, Pennsylvania State University, University Park, PA, USA.
| | - Matthew R Sheltra
- Department of Plant Science, Pennsylvania State University, University Park, PA, USA
- Intercollegiate Graduate Degree Program in Plant Biology, Pennsylvania State University, University Park, PA, USA
| | - Peter J Maughan
- Department of Plant and Wildlife Sciences, Brigham Young University, Logan, UT, USA
| | - Eric N Jellen
- Department of Plant and Wildlife Sciences, Brigham Young University, Logan, UT, USA
| | | | | | - Eric L Patterson
- Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, MI, USA
| | - Nathan D Hall
- Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, MI, USA
| | - David R Huff
- Department of Plant Science, Pennsylvania State University, University Park, PA, USA.
| |
Collapse
|
15
|
Suzuki M, Hashimoto Y, Hirabayashi A, Yahara K, Yoshida M, Fukano H, Hoshino Y, Shibayama K, Tomita H. Genomic Epidemiological Analysis of Antimicrobial-Resistant Bacteria with Nanopore Sequencing. Methods Mol Biol 2023; 2632:227-246. [PMID: 36781732 DOI: 10.1007/978-1-0716-2996-3_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
Abstract
Antimicrobial-resistant (AMR) bacterial infections caused by clinically important bacteria, including ESKAPE pathogens (Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter species) and mycobacteria (Mycobacterium tuberculosis and nontuberculous mycobacteria), have become a global public health threat. Their epidemic and pandemic clones often accumulate useful accessory genes in their genomes, such as AMR genes (ARGs) and virulence factor genes (VFGs). This process is facilitated by horizontal gene transfer among microbial communities via mobile genetic elements (MGEs), such as plasmids and phages. Nanopore long-read sequencing allows easy and inexpensive analysis of complex bacterial genome structures, although some aspects of sequencing data calculation and genome analysis methods are not systematically understood. Here we describe the latest and most recommended experimental and bioinformatics methods available for the construction of complete bacterial genomes from nanopore sequencing data and the detection and classification of genotypes of bacterial chromosomes, ARGs, VFGs, plasmids, and other MGEs based on their genomic sequences for genomic epidemiological analysis of AMR bacteria.
Collapse
Affiliation(s)
- Masato Suzuki
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Tokyo, Japan.
| | - Yusuke Hashimoto
- Department of Bacteriology, Gunma University Graduate School of Medicine, Maebashi, Japan
| | - Aki Hirabayashi
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Koji Yahara
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Mitsunori Yoshida
- Department of Mycobacteriology, Leprosy Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Hanako Fukano
- Department of Mycobacteriology, Leprosy Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Yoshihiko Hoshino
- Department of Mycobacteriology, Leprosy Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Keigo Shibayama
- Department of Bacteriology, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Haruyoshi Tomita
- Department of Bacteriology, Gunma University Graduate School of Medicine, Maebashi, Japan.,Laboratory of Bacterial Drug Resistance, Gunma University Graduate School of Medicine, Maebashi, Japan
| |
Collapse
|
16
|
Colin E, Duffourd Y, Chevarin M, Tisserant E, Verdez S, Paccaud J, Bruel AL, Tran Mau-Them F, Denommé-Pichon AS, Thevenon J, Safraou H, Besnard T, Goldenberg A, Cogné B, Isidor B, Delanne J, Sorlin A, Moutton S, Fradin M, Dubourg C, Gorce M, Bonneau D, El Chehadeh S, Debray FG, Doco-Fenzy M, Uguen K, Chatron N, Aral B, Marle N, Kuentz P, Boland A, Olaso R, Deleuze JF, Sanlaville D, Callier P, Philippe C, Thauvin-Robinet C, Faivre L, Vitobello A. Stepwise use of genomics and transcriptomics technologies increases diagnostic yield in Mendelian disorders. Front Cell Dev Biol 2023; 11:1021920. [PMID: 36926521 PMCID: PMC10011630 DOI: 10.3389/fcell.2023.1021920] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 01/30/2023] [Indexed: 03/08/2023] Open
Abstract
Purpose: Multi-omics offer worthwhile and increasingly accessible technologies to diagnostic laboratories seeking potential second-tier strategies to help patients with unresolved rare diseases, especially patients clinically diagnosed with a rare OMIM (Online Mendelian Inheritance in Man) disease. However, no consensus exists regarding the optimal diagnostic care pathway to adopt after negative results with standard approaches. Methods: In 15 unsolved individuals clinically diagnosed with recognizable OMIM diseases but with negative or inconclusive first-line genetic results, we explored the utility of a multi-step approach using several novel omics technologies to establish a molecular diagnosis. Inclusion criteria included a clinical autosomal recessive disease diagnosis and single heterozygous pathogenic variant in the gene of interest identified by first-line analysis (60%-9/15) or a clinical diagnosis of an X-linked recessive or autosomal dominant disease with no causative variant identified (40%-6/15). We performed a multi-step analysis involving short-read genome sequencing (srGS) and complementary approaches such as mRNA sequencing (mRNA-seq), long-read genome sequencing (lrG), or optical genome mapping (oGM) selected according to the outcome of the GS analysis. Results: SrGS alone or in combination with additional genomic and/or transcriptomic technologies allowed us to resolve 87% of individuals by identifying single nucleotide variants/indels missed by first-line targeted tests, identifying variants affecting transcription, or structural variants sometimes requiring lrGS or oGM for their characterization. Conclusion: Hypothesis-driven implementation of combined omics technologies is particularly effective in identifying molecular etiologies. In this study, we detail our experience of the implementation of genomics and transcriptomics technologies in a pilot cohort of previously investigated patients with a typical clinical diagnosis without molecular etiology.
Collapse
Affiliation(s)
- Estelle Colin
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Service de Génétique Médicale, CHU d'Angers, Angers, France
| | - Yannis Duffourd
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France
| | - Martin Chevarin
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, FHU-TRANSLAD, CHU Dijon Bourgogne, Dijon, France
| | - Emilie Tisserant
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France
| | - Simon Verdez
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France
| | - Julien Paccaud
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France
| | - Ange-Line Bruel
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, FHU-TRANSLAD, CHU Dijon Bourgogne, Dijon, France
| | - Frédéric Tran Mau-Them
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, FHU-TRANSLAD, CHU Dijon Bourgogne, Dijon, France
| | - Anne-Sophie Denommé-Pichon
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, FHU-TRANSLAD, CHU Dijon Bourgogne, Dijon, France
| | - Julien Thevenon
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France
| | - Hana Safraou
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, FHU-TRANSLAD, CHU Dijon Bourgogne, Dijon, France
| | - Thomas Besnard
- Service de Génétique Médicale, Nantes Université, CHU Nantes, Nantes, France.,CNRS, INSERM, L'institut du thorax, Nantes Université, CHU Nantes, Nantes, France
| | - Alice Goldenberg
- Department of Genetics and Reference Center for Developmental Disorders, Normandy Center for Genomic and Personalized Medicine, Rouen University Hospital, Rouen, France.,Normandie Univ, UNIROUEN, Inserm U1245, Rouen, France
| | - Benjamin Cogné
- Service de Génétique Médicale, Nantes Université, CHU Nantes, Nantes, France.,CNRS, INSERM, L'institut du thorax, Nantes Université, CHU Nantes, Nantes, France
| | - Bertrand Isidor
- Service de Génétique Médicale, CHU de Nantes, Nantes, France
| | - Julian Delanne
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Centre de Génétique et Centre de référence "Anomalies du Développement et Syndromes Malformatifs", Hôpital d'Enfants, Centre Hospitalier Universitaire de Dijon, Dijon, France
| | - Arthur Sorlin
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Centre de Génétique et Centre de référence "Anomalies du Développement et Syndromes Malformatifs", Hôpital d'Enfants, Centre Hospitalier Universitaire de Dijon, Dijon, France
| | - Sébastien Moutton
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Centre de Génétique et Centre de référence "Anomalies du Développement et Syndromes Malformatifs", Hôpital d'Enfants, Centre Hospitalier Universitaire de Dijon, Dijon, France
| | - Mélanie Fradin
- CHU Rennes, Service de Génétique Clinique, Centre de Référence Maladies Rares, CLAD-Ouest, Rennes, France
| | - Christèle Dubourg
- Service de Génétique Moléculaire et Génomique, CHU Rennes, Rennes, France.,Univ Rennes, CNRS, Institut de Genetique et Developpement de Rennes, UMR 6290, Rennes, France
| | - Magali Gorce
- Service de Génétique Médicale, CHU d'Angers, Angers, France
| | | | - Salima El Chehadeh
- Service de Génétique Médicale, Hôpital de Hautepierre, CHU Strasbourg, Strasbourg, France
| | | | - Martine Doco-Fenzy
- Medical School IFR53, EA3801, Université de Reims Champagne-Ardenne, Reims, France.,Service de Génétique, CHU Reims, Reims, France
| | - Kevin Uguen
- Department of Genetics and Reference Center for Developmental Disorders, Lyon University Hospital, Groupement Hospitalier Est, Hospices Civils de Lyon, Lyon, France.,CHU Brest, Inserm, Univ Brest, EFS, UMR 1078, GGB, Brest, France
| | - Nicolas Chatron
- Department of Genetics and Reference Center for Developmental Disorders, Lyon University Hospital, Groupement Hospitalier Est, Hospices Civils de Lyon, Lyon, France
| | - Bernard Aral
- Laboratoire de Génétique Chromosomique et Moléculaire, Pôle Biologie, CHU de Dijon, Dijon, France
| | - Nathalie Marle
- Laboratoire de Génétique Chromosomique et Moléculaire, Pôle Biologie, CHU de Dijon, Dijon, France
| | - Paul Kuentz
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Oncobiologie Génétique Bioinformatique, PCBio, Centre Hospitalier Universitaire de Besançon, Besançon, France
| | - Anne Boland
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
| | - Robert Olaso
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France.,LabEx GENMED (Medical Genomics), Dijon, France
| | - Jean-François Deleuze
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France.,LabEx GENMED (Medical Genomics), Dijon, France
| | - Damien Sanlaville
- Department of Genetics and Reference Center for Developmental Disorders, Lyon University Hospital, Groupement Hospitalier Est, Hospices Civils de Lyon, Lyon, France
| | - Patrick Callier
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Laboratoire de Génétique Chromosomique et Moléculaire, Pôle Biologie, CHU de Dijon, Dijon, France
| | - Christophe Philippe
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, FHU-TRANSLAD, CHU Dijon Bourgogne, Dijon, France
| | - Christel Thauvin-Robinet
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, FHU-TRANSLAD, CHU Dijon Bourgogne, Dijon, France.,Centre de Référence Maladies Rares "Déficiences Intellectuelles de Causes Rares", Centre de Génétique, FHU-TRANSLAD, CHU Dijon Bourgogne, Dijon, France
| | - Laurence Faivre
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Centre de Génétique et Centre de référence "Anomalies du Développement et Syndromes Malformatifs", Hôpital d'Enfants, Centre Hospitalier Universitaire de Dijon, Dijon, France
| | - Antonio Vitobello
- UFR Des Sciences de Santé, INSERM-Université de Bourgogne UMR1231 GAD "Génétique des Anomalies du Développement", FHUTRANSLAD, Dijon, France.,Unité Fonctionnelle Innovation en Diagnostic Génomique des Maladies Rares, FHU-TRANSLAD, CHU Dijon Bourgogne, Dijon, France
| |
Collapse
|
17
|
Han W, Liu L, Wang J, Wei H, Li Y, Zhang L, Guo Z, Li Y, Liu T, Zeng Q, Xing Q, Shu Y, Wang T, Yang Y, Zhang M, Li R, Yu J, Pu Z, Lv J, Lian S, Hu J, Hu X, Bao Z, Bao L, Zhang L, Wang S. Ancient homomorphy of molluscan sex chromosomes sustained by reversible sex-biased genes and sex determiner translocation. Nat Ecol Evol 2022; 6:1891-1906. [PMID: 36280781 DOI: 10.1038/s41559-022-01898-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 09/05/2022] [Indexed: 12/15/2022]
Abstract
Contrary to classic theory prediction, sex-chromosome homomorphy is prevalent in the animal kingdom but it is unclear how ancient homomorphic sex chromosomes avoid chromosome-scale degeneration. Molluscs constitute the second largest, Precambrian-originated animal phylum and have ancient, uncharacterized homomorphic sex chromosomes. Here, we profile eight genomes of the bivalve mollusc family of Pectinidae in a phylogenetic context and show 350 million years sex-chromosome homomorphy, which is the oldest known sex-chromosome homomorphy in the animal kingdom, far exceeding the ages of well-known heteromorphic sex chromosomes such as 130-200 million years in mammals, birds and flies. The long-term undifferentiation of molluscan sex chromosomes is potentially sustained by the unexpected intertwined regulation of reversible sex-biased genes, together with the lack of sexual dimorphism and occasional sex chromosome turnover. The pleiotropic constraint of regulation of reversible sex-biased genes is widely present in ancient homomorphic sex chromosomes and might be resolved in heteromorphic sex chromosomes through gene duplication followed by subfunctionalization. The evolutionary dynamics of sex chromosomes suggest a mechanism for 'inheritance' turnover of sex-determining genes that is mediated by translocation of a sex-determining enhancer. On the basis of these findings, we propose an evolutionary model for the long-term preservation of homomorphic sex chromosomes.
Collapse
Affiliation(s)
- Wentao Han
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Liangjie Liu
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Jing Wang
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Huilan Wei
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Yuli Li
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Lijing Zhang
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Zhenyi Guo
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Yajuan Li
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Tian Liu
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Qifan Zeng
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Key Laboratory of Tropical Aquatic Germplasm of Hainan Province, Sanya Oceanographic Institution, Ocean University of China, Sanya, China
| | - Qiang Xing
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Ya Shu
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Tong Wang
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Yaxin Yang
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Meiwei Zhang
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Ruojiao Li
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Jiachen Yu
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Zhongqi Pu
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Jia Lv
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Shanshan Lian
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Jingjie Hu
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Key Laboratory of Tropical Aquatic Germplasm of Hainan Province, Sanya Oceanographic Institution, Ocean University of China, Sanya, China
| | - Xiaoli Hu
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Zhenmin Bao
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China
- Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Key Laboratory of Tropical Aquatic Germplasm of Hainan Province, Sanya Oceanographic Institution, Ocean University of China, Sanya, China
| | - Lisui Bao
- Institute of Evolution & Marine Biodiversity, Ocean University of China, Qingdao, China.
| | - Lingling Zhang
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China.
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.
| | - Shi Wang
- Sars-Fang Centre & MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao, China.
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.
- Key Laboratory of Tropical Aquatic Germplasm of Hainan Province, Sanya Oceanographic Institution, Ocean University of China, Sanya, China.
| |
Collapse
|
18
|
Agarwal R, Gupta M, Antony A, Sen R, Raychoudhury R. In Vitro Studies Reveal that Pseudomonas, from Odontotermes obesus Colonies, can Function as a Defensive Mutualist as it Prevents the Weedy Fungus While Keeping the Crop Fungus Unaffected. MICROBIAL ECOLOGY 2022; 84:391-403. [PMID: 34495359 DOI: 10.1007/s00248-021-01798-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 06/14/2021] [Indexed: 06/13/2023]
Abstract
Insects that farm monocultures of fungi are canonical examples of nutritional symbiosis as well as independent evolution of agriculture in non-human animals. But just like in human agriculture, these fungal crops face constant threat of invasion by weeds which, if unchecked, take over the crop fungus. In fungus-growing termites, the crop fungus (Termitomyces) faces such challenges from the weedy fungus Pseudoxylaria. The mechanism by which Pseudoxylaria is suppressed is not known. However, evidence suggests that some bacterial secondary symbionts can serve as defensive mutualists by preventing the growth of Pseudoxylaria. However, such secondary symbionts must possess the dual, yet contrasting, capabilities of suppressing the weedy fungus while keeping the growth of the crop fungus unaffected. This study describes the isolation, identification, and culture-dependent estimation of the roles of several such putative defensive mutualists from the colonies of the wide-spread fungus-growing termite from India, Odontotermes obesus. From the 38 bacterial cultures tested, a strain of Pseudomonas showed significantly greater suppression of the weedy fungus than the crop fungus. Moreover, a 16S rRNA pan-microbiome survey, using the Nanopore platform, revealed Pseudomonas to be a part of the core microbiota of O. obesus. A meta-analysis of microbiota composition across different species of Odontotermes also confirms the widespread prevalence of Pseudomonas within this termite. These lines of evidence indicate that Pseudomonas could be playing the role of defensive mutualist within Odontotermes.
Collapse
Affiliation(s)
- Renuka Agarwal
- Department of Biological Sciences, Indian Institute of Science and Education Research Mohali (IISER Mohali), Knowledge City, Sector 81, Manauli, SAS Nagar, Punjab, PO 140306, India
| | - Manisha Gupta
- Department of Biological Sciences, Indian Institute of Science and Education Research Mohali (IISER Mohali), Knowledge City, Sector 81, Manauli, SAS Nagar, Punjab, PO 140306, India
| | - Abin Antony
- Department of Biological Sciences, Indian Institute of Science and Education Research Mohali (IISER Mohali), Knowledge City, Sector 81, Manauli, SAS Nagar, Punjab, PO 140306, India
| | - Ruchira Sen
- Sri Guru Gobind Singh College, Sector 26, Chandigarh, 160019, India
| | - Rhitoban Raychoudhury
- Department of Biological Sciences, Indian Institute of Science and Education Research Mohali (IISER Mohali), Knowledge City, Sector 81, Manauli, SAS Nagar, Punjab, PO 140306, India.
| |
Collapse
|
19
|
Miura H, Takeda M, Yamaguchi M, Ohtani Y, Endo G, Masuda Y, Ito K, Nagura Y, Iwashita K, Mitani T, Suzuki Y, Kobayashi Y, Koike S. Application of MinION Amplicon Sequencing to Buccal Swab Samples for Improving Resolution and Throughput of Rumen Microbiota Analysis. Front Microbiol 2022; 13:783058. [PMID: 35401463 PMCID: PMC8989143 DOI: 10.3389/fmicb.2022.783058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Accepted: 03/02/2022] [Indexed: 11/17/2022] Open
Abstract
The Illumina MiSeq platform has been widely used as a standard method for studying the rumen microbiota. However, the low resolution of taxonomic identification is the only disadvantage of MiSeq amplicon sequencing, as it targets a part of the 16S rRNA gene. In the present study, we performed three experiments to establish a high-resolution and high-throughput rumen microbial profiling approach using a combination of MinION platform and buccal swab sample, which is a proxy for rumen contents. In experiment 1, rumen contents and buccal swab samples were collected simultaneously from cannulated cattle (n = 6) and used for microbiota analysis using three different analytical workflows: amplicon sequencing of the V3–V4 region of the 16S rRNA gene using MiSeq and amplicon sequencing of near full-length 16S rRNA gene using MinION or PacBio Sequel II. All reads derived from the MinION and PacBio platforms were classified at the species-level. In experiment 2, rumen fluid samples were collected from beef cattle (n = 28) and used for 16S rRNA gene amplicon sequencing using the MinION platform to evaluate this sequencing platform for rumen microbiota analysis. We confirmed that the MinION platform allowed species-level taxa assignment for the predominant bacterial groups, which were previously identified at the family- and genus-level using the MiSeq platform. In experiment 3, buccal swab samples were collected from beef cattle (n = 30) and used for 16S rRNA gene amplicon sequencing using the MinION platform to validate the applicability of a combination of the MinION platform and buccal swab samples for rumen microbiota analysis. The distribution of predominant bacterial taxa in the buccal swab samples was similar to that in the rumen samples observed in experiment 2. Based on these results, we concluded that the combination of the MinION platform and buccal swab samples may be potentially applied for rumen microbial analysis in large-scale studies.
Collapse
Affiliation(s)
- Hiroto Miura
- Graduate School of Agriculture, Hokkaido University, Hokkaido, Japan
| | | | - Megumi Yamaguchi
- Graduate School of Agriculture, Hokkaido University, Hokkaido, Japan
| | | | - Go Endo
- Field Science Center for Northern Biosphere, Hokkaido University, Hokkaido, Japan
| | - Yasuhisa Masuda
- Field Science Center for Northern Biosphere, Hokkaido University, Hokkaido, Japan
| | - Kaede Ito
- Field Science Center for Northern Biosphere, Hokkaido University, Hokkaido, Japan
| | - Yoshio Nagura
- Field Science Center for Northern Biosphere, Hokkaido University, Hokkaido, Japan
| | | | - Tomohiro Mitani
- Field Science Center for Northern Biosphere, Hokkaido University, Hokkaido, Japan
| | - Yutaka Suzuki
- Graduate School of Agriculture, Hokkaido University, Hokkaido, Japan
| | - Yasuo Kobayashi
- Graduate School of Agriculture, Hokkaido University, Hokkaido, Japan
| | - Satoshi Koike
- Graduate School of Agriculture, Hokkaido University, Hokkaido, Japan
- *Correspondence: Satoshi Koike,
| |
Collapse
|
20
|
Peng K, Yin Y, Li Y, Qin S, Liu Y, Yang X, Wang Z, Li R. QitanTech Nanopore Long-Read Sequencing Enables Rapid Resolution of Complete Genomes of Multi-Drug Resistant Pathogens. Front Microbiol 2022; 13:778659. [PMID: 35401428 PMCID: PMC8985760 DOI: 10.3389/fmicb.2022.778659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 01/21/2022] [Indexed: 11/15/2022] Open
Abstract
Advancement of novel sequencing technologies facilitates modern life science and medicine unprecedentedly. Exploring complete genome sequences of bacteria by long-read sequencing technology is significant for microbial genomics research. However, third-generation long-read sequencing technologies are available with limited choices, which generate technological barrier to scientific research. Recently, a novel QitanTech nanopore long-read sequencing technology has emerged in China, but the potential application and performance were unexplored. Herein, we comprehensively evaluated the feasibility of the emerging sequencing technology in assembling complete genomes of MDR pathogens. The results showed that 500 Mbp QitanTech nanopore sequencing data could be generated within 8 h in one flow cell with the standard library preparation method. The mean read length, longest read length, and mean read-level accuracy of QitanTech sequencing data were 6,041 bp, 57,037 bp, and 81.50% (LAST)/81.40% (Minimap2), respectively. Two routine assembly strategies including long-read assembly and hybrid assembly enable the achievement of complete bacterial genomes. The accuracy of assembled draft bacterial genomes with QitanTech long-read data could be improved up to 99.9% dramatically by polishing using accurate short-read data. Furthermore, the assembled bacterial genomes cover accurate structures of complex resistance plasmids harboring critical resistance genes such as tet(X), tmexCD-toprJ, and blaVIM–2, even the complex fusion MDR plasmid generated from homologous recombination. In conclusion, QitanTech nanopore sequencing, as a nanopore long-read sequencing technology launched in China, could be a good option for investigation of complex bacterial genomes. More potential applications based on this novel platform warrant investigations.
Collapse
Affiliation(s)
- Kai Peng
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, College of Veterinary Medicine, Yangzhou University, Yangzhou, China
- Institute of Comparative Medicine, Yangzhou University, Yangzhou, China
| | - Yi Yin
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, College of Veterinary Medicine, Yangzhou University, Yangzhou, China
- Institute of Comparative Medicine, Yangzhou University, Yangzhou, China
| | - Yan Li
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, College of Veterinary Medicine, Yangzhou University, Yangzhou, China
- Institute of Comparative Medicine, Yangzhou University, Yangzhou, China
| | - Shangshang Qin
- Key Laboratory of Advanced Drug Preparation Technologies, School of Pharmaceutical Sciences, Zhengzhou University, Zhengzhou, China
| | - Yuan Liu
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, College of Veterinary Medicine, Yangzhou University, Yangzhou, China
- Institute of Comparative Medicine, Yangzhou University, Yangzhou, China
| | - Xiaorong Yang
- Center for Disease Control and Prevention of Sichuan Province, Chengdu, China
| | - Zhiqiang Wang
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, College of Veterinary Medicine, Yangzhou University, Yangzhou, China
- Institute of Comparative Medicine, Yangzhou University, Yangzhou, China
- *Correspondence: Zhiqiang Wang,
| | - Ruichao Li
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonoses, College of Veterinary Medicine, Yangzhou University, Yangzhou, China
- Institute of Comparative Medicine, Yangzhou University, Yangzhou, China
- Ruichao Li,
| |
Collapse
|
21
|
Pei T, Yan M, Li T, Li X, Yin Y, Cui M, Fang Y, Liu J, Kong Y, Xu P, Zhao Q. Characterization of UDP-glycosyltransferase family members reveals how major flavonoid glycoside accumulates in the roots of Scutellaria baicalensis. BMC Genomics 2022; 23:169. [PMID: 35232374 PMCID: PMC8888134 DOI: 10.1186/s12864-022-08391-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 02/15/2022] [Indexed: 01/01/2023] Open
Abstract
Background Flavonoid glycosides extracted from roots of Scutellaria baicalensis exhibit strong pharmaceutical antitumor, antioxidative, anti-inflammatory, and antiviral activities. UDP glycosyltransferase (UGT) family members are responsible for the transfer of a glycosyl moiety from UDP sugars to a wide range of acceptor flavonoids. Baicalin is the major flavonoid glycoside found in S. baicalensis roots, and its aglycone baicalein is synthesized from a specially evolved pathway that has been elucidated. However, it is necessary to carry out a genome-wide study of genes involved in 7-O-glucuronidation, the final biosynthesis step of baicalin, which might elucidate the relationship between the enzymes and the metabolic accumulation patterns in this medicinal plant. Results We reported the phylogenetic analysis, tissue-specific expression, biochemical characterization and evolutionary analysis of glucosyltransferases (SbUGTs) and glucuronosyltransferases (SbUGATs) genes based on the recently released genome of S. baicalensis. A total of 124 UGTs were identified, and over one third of them were highly expressed in roots. In vitro enzyme assays showed that 6 SbUGTs could use UDP-glucose as a sugar donor and convert baicalein to oroxin A (baicalein 7-O-glucoside), while 4 SbUGATs used only UDP-glucuronic acid as the sugar donor and catalyzed baicalein to baicalin. SbUGAT4 and SbUGT2 are the most highly expressed SbUGAT and SbUGT genes in root tissues, respectively. Kinetic measurements revealed that SbUGAT4 had a lower Km value and higher Vmax/Km ratio to baicalein than those of SbUGT2. Furthermore, tandem duplication events were detected in SbUGTs and SbUGATs. Conclusions This study demonstrated that glucosylation and glucuronidation are two major glycosylated decorations in the roots of S. baicalensis. Higher expression level and affinity to substrate of SbUGAT4, and expansion of this gene family contribute high accumulation of baicalin in the root of S. baicalensis. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08391-1.
Collapse
Affiliation(s)
- Tianlin Pei
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China.,National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Mengxiao Yan
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Tian Li
- Shanghai Center for Plant Stress Biology and Center of Excellence in Molecular Plant Sciences, Chinese Academy of Science, Shanghai, 201602, China
| | - Xiaoqiang Li
- Shanghai Center for Plant Stress Biology and Center of Excellence in Molecular Plant Sciences, Chinese Academy of Science, Shanghai, 201602, China
| | - Yijia Yin
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Mengying Cui
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Yumin Fang
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Jie Liu
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Yu Kong
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Ping Xu
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Qing Zhao
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China. .,National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China.
| |
Collapse
|
22
|
Bagal UR, Phan J, Welsh RM, Misas E, Wagner D, Gade L, Litvintseva AP, Cuomo CA, Chow NA. MycoSNP: A Portable Workflow for Performing Whole-Genome Sequencing Analysis of Candida auris. Methods Mol Biol 2022; 2517:215-228. [PMID: 35674957 DOI: 10.1007/978-1-0716-2417-3_17] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Candida auris is an urgent public health threat characterized by high drug-resistant rates and rapid spread in healthcare settings worldwide. As part of the C. auris response, molecular surveillance has helped public health officials track the global spread and investigate local outbreaks. Here, we describe whole-genome sequencing analysis methods used for routine C. auris molecular surveillance in the United States; methods include reference selection, reference preparation, quality assessment and control of sequencing reads, read alignment, and single-nucleotide polymorphism calling and filtration. We also describe the newly developed pipeline MycoSNP, a portable workflow for performing whole-genome sequencing analysis of fungal organisms including C. auris.
Collapse
Affiliation(s)
- Ujwal R Bagal
- Mycotic Diseases Branch, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - John Phan
- Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Rory M Welsh
- Mycotic Diseases Branch, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Elizabeth Misas
- Mycotic Diseases Branch, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | | | - Lalitha Gade
- Mycotic Diseases Branch, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | | | - Christina A Cuomo
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nancy A Chow
- Mycotic Diseases Branch, Centers for Disease Control and Prevention, Atlanta, GA, USA.
| |
Collapse
|
23
|
Jiang X, Song Q, Ye W, Chen ZJ. Concerted genomic and epigenomic changes accompany stabilization of Arabidopsis allopolyploids. Nat Ecol Evol 2021; 5:1382-1393. [PMID: 34413505 PMCID: PMC8484014 DOI: 10.1038/s41559-021-01523-y] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 06/24/2021] [Indexed: 02/06/2023]
Abstract
During evolution successful allopolyploids must overcome 'genome shock' between hybridizing species but the underlying process remains elusive. Here, we report concerted genomic and epigenomic changes in resynthesized and natural Arabidopsis suecica (TTAA) allotetraploids derived from Arabidopsis thaliana (TT) and Arabidopsis arenosa (AA). A. suecica shows conserved gene synteny and content with more gene family gain and loss in the A and T subgenomes than respective progenitors, although A. arenosa-derived subgenome has more structural variation and transposon distributions than A. thaliana-derived subgenome. These balanced genomic variations are accompanied by pervasive convergent and concerted changes in DNA methylation and gene expression among allotetraploids. The A subgenome is hypomethylated rapidly from F1 to resynthesized allotetraploids and convergently to the T-subgenome level in natural A. suecica, despite many other methylated loci being inherited from F1 to all allotetraploids. These changes in DNA methylation, including small RNAs, in allotetraploids may affect gene expression and phenotypic variation, including flowering, silencing of self-incompatibility and upregulation of meiosis- and mitosis-related genes. In conclusion, concerted genomic and epigenomic changes may improve stability and adaptation during polyploid evolution.
Collapse
Affiliation(s)
- Xinyu Jiang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Qingxin Song
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA
| | - Wenxue Ye
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Z Jeffrey Chen
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA.
| |
Collapse
|
24
|
Rapp JZ, Sullivan MB, Deming JW. Divergent Genomic Adaptations in the Microbiomes of Arctic Subzero Sea-Ice and Cryopeg Brines. Front Microbiol 2021; 12:701186. [PMID: 34367102 PMCID: PMC8339730 DOI: 10.3389/fmicb.2021.701186] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 06/29/2021] [Indexed: 11/16/2022] Open
Abstract
Subzero hypersaline brines are liquid microbial habitats within otherwise frozen environments, where concentrated dissolved salts prevent freezing. Such extreme conditions presumably require unique microbial adaptations, and possibly altered ecologies, but specific strategies remain largely unknown. Here we examined prokaryotic taxonomic and functional diversity in two seawater-derived subzero hypersaline brines: first-year sea ice, subject to seasonally fluctuating conditions; and ancient cryopeg, under relatively stable conditions geophysically isolated in permafrost. Overall, both taxonomic composition and functional potential were starkly different. Taxonomically, sea-ice brine communities (∼105 cells mL–1) had greater richness, more diversity and were dominated by bacterial genera, including Polaribacter, Paraglaciecola, Colwellia, and Glaciecola, whereas the more densely inhabited cryopeg brines (∼108 cells mL–1) lacked these genera and instead were dominated by Marinobacter. Functionally, however, sea ice encoded fewer accessory traits and lower average genomic copy numbers for shared traits, though DNA replication and repair were elevated; in contrast, microbes in cryopeg brines had greater genetic versatility with elevated abundances of accessory traits involved in sensing, responding to environmental cues, transport, mobile elements (transposases and plasmids), toxin-antitoxin systems, and type VI secretion systems. Together these genomic features suggest adaptations and capabilities of sea-ice communities manifesting at the community level through seasonal ecological succession, whereas the denser cryopeg communities appear adapted to intense bacterial competition, leaving fewer genera to dominate with brine-specific adaptations and social interactions that sacrifice some members for the benefit of others. Such cryopeg genomic traits provide insight into how long-term environmental stability may enable life to survive extreme conditions.
Collapse
Affiliation(s)
- Josephine Z Rapp
- School of Oceanography, University of Washington, Seattle, WA, United States
| | - Matthew B Sullivan
- Byrd Polar and Climate Research Center, Ohio State University, Columbus, OH, United States.,Department of Microbiology, Ohio State University, Columbus, OH, United States.,Department of Civil, Environmental and Geodetic Engineering, Ohio State University, Columbus, OH, United States.,Center of Microbiome Science, Ohio State University, Columbus, OH, United States
| | - Jody W Deming
- School of Oceanography, University of Washington, Seattle, WA, United States
| |
Collapse
|
25
|
Gu W, Zhou A, Wang L, Sun S, Cui X, Zhu D. SVLR: Genome Structural Variant Detection Using Long-Read Sequencing Data. J Comput Biol 2021; 28:774-788. [PMID: 33973820 DOI: 10.1089/cmb.2021.0048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Genome structural variants (SVs) have great impacts on human phenotype and diversity, and have been linked to numerous diseases. Long-read sequencing technologies arise to make it possible to find SVs of as long as 10,000 nucleotides. Thus, long read-based SV detection has been drawing attention of many recent research projects, and many tools have been developed for long reads to detect SVs recently. In this article, we present a new method, called SVLR, to detect SVs based on long-read sequencing data. Comparing with existing methods, SVLR can detect three new kinds of SVs: block replacements, block interchanges, and translocations. Although these new SVs are structurally more complicated, SVLR achieves accuracies that are comparable with those of the classic SVs. Moreover, for the classic SVs that can be detected by state-of-the-art methods (e.g., SVIM and Sniffles), our experiments demonstrate recall improvements of up to 38% without harming the precisions (i.e., >78%). We also point out three directions to further improve SV detection in the future. Source codes: https://github.com/GWYSDU/SVLR.
Collapse
Affiliation(s)
- Wenyan Gu
- School of Computer Science and Technology, Shandong University, Qindao, China
| | - Aizhong Zhou
- School of Computer Science and Technology, Shandong University, Qindao, China
| | - Lusheng Wang
- Department of Computer Science, City University of Hong Kong, Hong Kong, China
| | - Shiwei Sun
- Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Xuefeng Cui
- School of Computer Science and Technology, Shandong University, Qindao, China
| | - Daming Zhu
- School of Computer Science and Technology, Shandong University, Qindao, China
| |
Collapse
|
26
|
Liu Y, Zhang X, Han K, Li R, Xu G, Han Y, Cui F, Fan S, Seim I, Fan G, Li G, Wan S. Insights into amphicarpy from the compact genome of the legume Amphicarpaea edgeworthii. PLANT BIOTECHNOLOGY JOURNAL 2021; 19:952-965. [PMID: 33236503 PMCID: PMC8131047 DOI: 10.1111/pbi.13520] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 11/13/2020] [Accepted: 11/18/2020] [Indexed: 05/04/2023]
Abstract
Amphicarpy (seed heteromorphy) is a unique and fascinating reproductive strategy wherein a single plant produces both aerial and subterranean fruits. This strategy is believed to be an adaptation to life under stressful or uncertain environments. Here, we sequenced and de novo assembled a chromosome-level genome assembly of the legume Amphicarpaea edgeworthii Benth. The 299-Mb A. edgeworthii genome encodes 27 899 protein-coding genes and is the most compact sequenced legume genome reported until date. Its reduced genome size may be attributed to the reduced long-terminal repeat retrotransposon content, which stems from the unequal homologous recombination. Gene families related to immunity and stress resistance have been contracted in A. edgeworthii, which is consistent with the notion that the amphicarpic reproductive strategy may be a complementary mechanism for its weak environmental-adaptation ability. We demonstrated the 'ABCE' model for the differentiation of chasmogamous and cleistogamous flowers. In addition, the characteristics of aerial and subterranean seeds in hard-seededness were explored. Thus, we suggest that the A. edgeworthii genome, which is the first of an amphicarpic plant, offers significant insights into its unusual reproductive strategy that is a key resource towards comprehending the evolution of angiosperms.
Collapse
Affiliation(s)
- Yiyang Liu
- Bio‐technology Research CenterShandong Provincial Key Laboratory of Crop Genetic Improvement, Ecology and PhysiologyShandong Academy of Agricultural SciencesJi’nanChina
| | - Xuejie Zhang
- College of Life SciencesShandong Normal UniversityJi’nanChina
| | - Kai Han
- BGI‐QingdaoBGI‐ShenzhenQingdaoChina
| | - Rongchong Li
- Bio‐technology Research CenterShandong Provincial Key Laboratory of Crop Genetic Improvement, Ecology and PhysiologyShandong Academy of Agricultural SciencesJi’nanChina
| | - Guoxin Xu
- Shandong Rice Research InstituteShandong Academy of Agricultural SciencesJi’nanChina
| | - Yan Han
- College of Life SciencesShandong Normal UniversityJi’nanChina
| | - Feng Cui
- Bio‐technology Research CenterShandong Provincial Key Laboratory of Crop Genetic Improvement, Ecology and PhysiologyShandong Academy of Agricultural SciencesJi’nanChina
| | - Shoujin Fan
- College of Life SciencesShandong Normal UniversityJi’nanChina
| | - Inge Seim
- Integrative Biology LaboratoryCollege of Life SciencesNanjing Normal UniversityNanjingChina
| | - Guangyi Fan
- BGI‐QingdaoBGI‐ShenzhenQingdaoChina
- BGI‐ShenzhenShenzhenChina
- State Key Laboratory of Agricultural GenomicsBGI‐ShenzhenShenzhenChina
| | - Guowei Li
- Bio‐technology Research CenterShandong Provincial Key Laboratory of Crop Genetic Improvement, Ecology and PhysiologyShandong Academy of Agricultural SciencesJi’nanChina
| | - Shubo Wan
- Bio‐technology Research CenterShandong Provincial Key Laboratory of Crop Genetic Improvement, Ecology and PhysiologyShandong Academy of Agricultural SciencesJi’nanChina
| |
Collapse
|
27
|
Song YJ, Cho DH. Local Alignment of DNA Sequence Based on Deep Reinforcement Learning. IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY 2021; 2:170-178. [PMID: 35402982 PMCID: PMC8975175 DOI: 10.1109/ojemb.2021.3076156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 04/20/2021] [Accepted: 04/23/2021] [Indexed: 11/06/2022] Open
Abstract
Goal: Over the decades, there have been improvements in the sequence alignment algorithm, with significant advances in various aspects such as complexity and accuracy. However, human-defined algorithms have an explicit limitation in view of developmental completeness. This paper introduces a novel local alignment method to obtain optimal sequence alignment based on reinforcement learning. Methods: There is a DQNalign algorithm that learns and performs sequence alignment through deep reinforcement learning. This paper proposes a DQN x-drop algorithm that performs local alignment without human intervention by combining the x-drop algorithm with this DQNalign algorithm. The proposed algorithm performs local alignment by repeatedly observing the subsequences and selecting the next alignment direction until the x-drop algorithm terminates the DQNalign algorithm. This proposed algorithm has an advantage in view of linear computational complexity compared to conventional local alignment algorithms. Results: This paper compares alignment performance (coverage and identity) and complexity for a fair comparison between the proposed DQN x-drop algorithm and the conventional greedy x-drop algorithm. Firstly, we prove the proposed algorithm's superiority by comparing the two algorithms' computational complexity through numerical analysis. After that, we tested the alignment performance actual HEV and E.coli sequence datasets. The proposed method shows the comparable identity and coverage performance to the conventional alignment method while having linear complexity for the [Formula: see text] parameter. Conclusions: Through this study, it was possible to confirm the possibility of a new local alignment algorithm that minimizes computational complexity without human intervention.
Collapse
Affiliation(s)
- Yong-Joon Song
- School of Electrical EngineeringKorea Advanced Institute of Science and TechnologyDaejeon305-701South Korea
| | - Dong-Ho Cho
- School of Electrical EngineeringKorea Advanced Institute of Science and TechnologyDaejeon305-701South Korea
| |
Collapse
|
28
|
Pei T, Yan M, Kong Y, Fan H, Liu J, Cui M, Fang Y, Ge B, Yang J, Zhao Q. The genome of Tripterygium wilfordii and characterization of the celastrol biosynthesis pathway. GIGABYTE 2021; 2021:gigabyte14. [PMID: 36967728 PMCID: PMC10038137 DOI: 10.46471/gigabyte.14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Accepted: 02/25/2021] [Indexed: 11/09/2022] Open
Abstract
Tripterygium wilfordii is a vine from the Celastraceae family that is used in traditional Chinese medicine (TCM). The active ingredient, celastrol, is a friedelane-type pentacyclic triterpenoid with putative roles as an antitumor, immunosuppressive, and anti-obesity agent. Here, we report a reference genome assembly of T. wilfordii with high-quality annotation using a hybrid sequencing strategy. The total genome size obtained is 340.12 Mb, with a contig N50 value of 3.09 Mb. We successfully anchored 91.02% of sequences into 23 pseudochromosomes using high-throughput chromosome conformation capture (Hi–C) technology. The super-scaffold N50 value was 13.03 Mb. We also annotated 31,593 structural genes, with a repeat percentage of 44.31%. These data demonstrate that T. wilfordii diverged from Malpighiales species approximately 102.4 million years ago. By integrating genome, transcriptome and metabolite analyses, as well as in vivo and in vitro enzyme assays of two cytochrome P450 (CYP450) genes, TwCYP712K1 and TwCYP712K2, it is possible to investigate the second biosynthesis step of celastrol and demonstrate that this was derived from a common ancestor. These data provide insights and resources for further investigation of pathways related to celastrol, and valuable information to aid the conservation of resources, as well as understand the evolution of Celastrales.
Collapse
Affiliation(s)
- Tianlin Pei
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
- State Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
| | - Mengxiao Yan
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
| | - Yu Kong
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
| | - Hang Fan
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
- State Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
| | - Jie Liu
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
| | - Mengying Cui
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
| | - Yumin Fang
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
| | - Binjie Ge
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
| | - Jun Yang
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
- State Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
| | - Qing Zhao
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
- State Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
29
|
Million-year-old DNA sheds light on the genomic history of mammoths. Nature 2021; 591:265-269. [PMID: 33597750 PMCID: PMC7116897 DOI: 10.1038/s41586-021-03224-9] [Citation(s) in RCA: 117] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Accepted: 01/11/2021] [Indexed: 11/17/2022]
Abstract
Temporal genomic data hold great potential for studying evolutionary processes, including speciation. However, sampling across speciation events would in many cases require genomic time series that stretch well into the Early Pleistocene (>1 million years). Although theoretical models suggest that DNA should survive on this timescale1, the oldest genomic data recovered so far is from a 560-780 ka old horse specimen2. Here we report the recovery of genome-wide data from three Early and Middle Pleistocene mammoth specimens, two of which are more than one million years old. We find that two distinct mammoth lineages were present in eastern Siberia during the Early Pleistocene. One of these gave rise to the woolly mammoth, whereas the other represents a previously unrecognised lineage that was ancestral to the first mammoths to colonise North America. Our analyses reveal that the North American Columbian mammoth traces its ancestry to a Middle Pleistocene hybridisation between these two lineages, with roughly equal admixture proportions. Finally, we show that the majority of protein-coding changes associated with cold adaptation in woolly mammoths were present already a million years ago. These findings highlight the potential of deep time palaeogenomics to expand our understanding of speciation and long-term adaptive evolution.
Collapse
|
30
|
Comparative analysis of nuclear, chloroplast, and mitochondrial genomes of watermelon and melon provides evidence of gene transfer. Sci Rep 2021; 11:1595. [PMID: 33452307 PMCID: PMC7811005 DOI: 10.1038/s41598-020-80149-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 12/17/2020] [Indexed: 11/08/2022] Open
Abstract
During plant evolution, there is genetic communication between organelle and nuclear genomes. A comparative analysis was performed on the organelle and nuclear genomes of the watermelon and melon. In the watermelon, chloroplast-derived sequences accounted for 7.6% of the total length of the mitochondrial genome. In the melon, chloroplast-derived sequences accounted for approximately 2.73% of the total mitochondrial genome. In watermelon and melon, the chloroplast-derived small-fragment sequences are either a subset of large-fragment sequences or appeared multiple times in the mitochondrial genome, indicating that these fragments may have undergone multiple independent migration integrations or emerged in the mitochondrial genome after migration, replication, and reorganization. There was no evidence of migration from the mitochondria to chloroplast genome. A sequence with a total length of about 73 kb (47%) in the watermelon chloroplast genome was homologous to a sequence of about 313 kb in the nuclear genome. About 33% of sequences in the watermelon mitochondrial genome was homologous with a 260 kb sequence in the nuclear genome. A sequence with a total length of about 38 kb (25%) in the melon chloroplast genome was homologous with 461 sequences in the nuclear genome, with a total length of about 301 kb. A 3.4 Mb sequence in the nuclear genome was homologous with a melon mitochondrial sequence. These results indicate that, during the evolution of watermelon and melon, a large amount of genetic material was exchanged between the nuclear genome and the two organelle genomes in the cytoplasm.
Collapse
|
31
|
Ben-Assa N, Coyne MJ, Fomenkov A, Livny J, Robins WP, Muniesa M, Carey V, Carasso S, Gefen T, Jofre J, Roberts RJ, Comstock LE, Geva-Zatorsky N. Analysis of a phase-variable restriction modification system of the human gut symbiont Bacteroides fragilis. Nucleic Acids Res 2020; 48:11040-11053. [PMID: 33045731 PMCID: PMC7641763 DOI: 10.1093/nar/gkaa824] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Revised: 09/10/2020] [Accepted: 10/06/2020] [Indexed: 11/28/2022] Open
Abstract
The genomes of gut Bacteroidales contain numerous invertible regions, many of which contain promoters that dictate phase-variable synthesis of surface molecules such as polysaccharides, fimbriae, and outer surface proteins. Here, we characterize a different type of phase-variable system of Bacteroides fragilis, a Type I restriction modification system (R-M). We show that reversible DNA inversions within this R-M locus leads to the generation of eight specificity proteins with distinct recognition sites. In vitro grown bacteria have a different proportion of specificity gene combinations at the expression locus than bacteria isolated from the mammalian gut. By creating mutants, each able to produce only one specificity protein from this region, we identified the R-M recognition sites of four of these S-proteins using SMRT sequencing. Transcriptome analysis revealed that the locked specificity mutants, whether grown in vitro or isolated from the mammalian gut, have distinct transcriptional profiles, likely creating different phenotypes, one of which was confirmed. Genomic analyses of diverse strains of Bacteroidetes from both host-associated and environmental sources reveal the ubiquity of phase-variable R-M systems in this phylum.
Collapse
Affiliation(s)
- Nadav Ben-Assa
- Department of Cell Biology and Cancer Science, Rappaport Faculty of Medicine, Technion - Israel Institute of Technology, Technion Integrated Cancer Center (TICC), Haifa, 3525422 Israel
| | - Michael J Coyne
- Division of Infectious Diseases, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA
| | | | - Jonathan Livny
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - William P Robins
- Department of Microbiology, Harvard Medical School, Boston, 02115, MA, USA
| | - Maite Muniesa
- Department of Genetics, Microbiology and Statistics, School of Biology, University of Barcelona, Avda. Diagonal 643 08028 Barcelona Spain
| | - Vincent Carey
- Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Shaqed Carasso
- Department of Cell Biology and Cancer Science, Rappaport Faculty of Medicine, Technion - Israel Institute of Technology, Technion Integrated Cancer Center (TICC), Haifa, 3525422 Israel
| | - Tal Gefen
- Department of Cell Biology and Cancer Science, Rappaport Faculty of Medicine, Technion - Israel Institute of Technology, Technion Integrated Cancer Center (TICC), Haifa, 3525422 Israel
| | - Juan Jofre
- Department of Genetics, Microbiology and Statistics, School of Biology, University of Barcelona, Avda. Diagonal 643 08028 Barcelona Spain
| | | | - Laurie E Comstock
- Division of Infectious Diseases, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Naama Geva-Zatorsky
- Department of Cell Biology and Cancer Science, Rappaport Faculty of Medicine, Technion - Israel Institute of Technology, Technion Integrated Cancer Center (TICC), Haifa, 3525422 Israel.,Canadian Institute for Advanced Research (CIFAR) Azrieli Global Scholar, MaRS Centre, West Tower 661 University Ave., Suite 505 Toronto, ON M5G 1M1, Canada
| |
Collapse
|
32
|
Karyotypic Evolution of Sauropsid Vertebrates Illuminated by Optical and Physical Mapping of the Painted Turtle and Slider Turtle Genomes. Genes (Basel) 2020; 11:genes11080928. [PMID: 32806747 PMCID: PMC7464131 DOI: 10.3390/genes11080928] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 07/25/2020] [Accepted: 07/31/2020] [Indexed: 02/07/2023] Open
Abstract
Recent sequencing and software enhancements have advanced our understanding of the evolution of genomic structure and function, especially addressing novel evolutionary biology questions. Yet fragmentary turtle genome assemblies remain a challenge to fully decipher the genetic architecture of adaptive evolution. Here, we use optical mapping to improve the contiguity of the painted turtle (Chrysemys picta) genome assembly and use de novo fluorescent in situ hybridization (FISH) of bacterial artificial chromosome (BAC) clones, BAC-FISH, to physically map the genomes of the painted and slider turtles (Trachemys scripta elegans). Optical mapping increased C. picta's N50 by ~242% compared to the previous assembly. Physical mapping permitted anchoring ~45% of the genome assembly, spanning 5544 genes (including 20 genes related to the sex determination network of turtles and vertebrates). BAC-FISH data revealed assembly errors in C. picta and T. s. elegans assemblies, highlighting the importance of molecular cytogenetic data to complement bioinformatic approaches. We also compared C. picta's anchored scaffolds to the genomes of other chelonians, chicken, lizards, and snake. Results revealed a mostly one-to-one correspondence between chromosomes of painted and slider turtles, and high homology among large syntenic blocks shared with other turtles and sauropsids. Yet, numerous chromosomal rearrangements were also evident across chelonians, between turtles and squamates, and between avian and non-avian reptiles.
Collapse
|
33
|
Aubé J, Senin P, Bonin P, Pringault O, Jeziorski C, Bouchez O, Klopp C, Guyoneaud R, Goñi-Urriza M. Meta-omics Provides Insights into the Impact of Hydrocarbon Contamination on Microbial Mat Functioning. MICROBIAL ECOLOGY 2020; 80:286-295. [PMID: 32076743 DOI: 10.1007/s00248-020-01493-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Accepted: 02/04/2020] [Indexed: 06/10/2023]
Abstract
Photosynthetic microbial mats are stable, self-supported communities. Due to their coastal localization, these mats are frequently exposed to hydrocarbon contamination and are able to grow on it. To decipher how this contamination disturbs the functioning of microbial mats, we compared two mats: a contaminated mat exposed to chronic petroleum contamination and a reference mat. The taxonomic and metabolic structures of the mats in spring and fall were determined using metagenomic and metatranscriptomic approaches. Extremely high contamination disturbed the seasonal variations of the mat. ABC transporters, two-component systems, and type IV secretion system-related genes were overabundant in the contaminated mats. Xenobiotic degradation metabolism was minor in the metagenomes of both mats, and only the expression of genes involved in polycyclic aromatic hydrocarbon degradation was higher in the contaminated mat. Interestingly, the expression rates of genes involved in hydrocarbon activation decreased during the 1-year study period, concomitant with the decrease in easily degradable hydrocarbons, suggesting a transient effect of hydrocarbon contamination. Alteromonadales and Oceanospirillales hydrocarbonoclastic bacteria appeared to be key in hydrocarbon remediation in the contaminated mat. Overall, the contaminated microbial mat was able to cope with hydrocarbon contamination and displayed an adaptive functioning that modified seasonal behaviour.
Collapse
Affiliation(s)
- Johanne Aubé
- Environmental Microbiology, Universite de Pau et des Pays de l'Adour, E2S UPPA, CNRS, IPREM, Pau, France
- Laboratoire de Microbiologie des Environnements Extrêmes, UMR6197, IFREMER, CNRS, Université de Bretagne Occidentale, Plouzané, France
| | - Pavel Senin
- Environmental Microbiology, Universite de Pau et des Pays de l'Adour, E2S UPPA, CNRS, IPREM, Pau, France
- Plateforme Bioinformatique Genotoul, UR875 Biométrie et Intelligence Artificielle, INRA, Castanet-Tolosan, France
| | - Patricia Bonin
- Mediterranean Institute of Oceanography (MIO), Aix Marseille University, Université de Toulon, CNRS/INSU/IRD, UM 110, Marseille, France
| | - Olivier Pringault
- UMR 9190 MARBEC IRD-Ifremer-CNRS, Université de Montpellier, Place Eugène Bataillon, Montpellier, France
| | | | - Olivier Bouchez
- GeT-PlaGe, Genotoul, INRA Auzeville, Castanet-Tolosan, France
| | - Christophe Klopp
- Plateforme Bioinformatique Genotoul, UR875 Biométrie et Intelligence Artificielle, INRA, Castanet-Tolosan, France
| | - Rémy Guyoneaud
- Environmental Microbiology, Universite de Pau et des Pays de l'Adour, E2S UPPA, CNRS, IPREM, Pau, France
| | - Marisol Goñi-Urriza
- Environmental Microbiology, Universite de Pau et des Pays de l'Adour, E2S UPPA, CNRS, IPREM, Pau, France.
| |
Collapse
|
34
|
Asalone KC, Ryan KM, Yamadi M, Cohen AL, Farmer WG, George DJ, Joppert C, Kim K, Mughal MF, Said R, Toksoz-Exley M, Bisk E, Bracht JR. Regional sequence expansion or collapse in heterozygous genome assemblies. PLoS Comput Biol 2020; 16:e1008104. [PMID: 32735589 PMCID: PMC7423139 DOI: 10.1371/journal.pcbi.1008104] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Revised: 08/12/2020] [Accepted: 06/29/2020] [Indexed: 12/13/2022] Open
Abstract
High levels of heterozygosity present a unique genome assembly challenge and can adversely impact downstream analyses, yet is common in sequencing datasets obtained from non-model organisms. Here we show that by re-assembling a heterozygous dataset with variant parameters and different assembly algorithms, we are able to generate assemblies whose protein annotations are statistically enriched for specific gene ontology categories. While total assembly length was not significantly affected by assembly methodologies tested, the assemblies generated varied widely in fragmentation level and we show local assembly collapse or expansion underlying the enrichment or depletion of specific protein functional groups. We show that these statistically significant deviations in gene ontology groups can occur in seemingly high-quality assemblies, and result from difficult-to-detect local sequence expansion or contractions. Given the unpredictable interplay between assembly algorithm, parameter, and biological sequence data heterozygosity, we highlight the need for better measures of assembly quality than N50 value, including methods for assessing local expansion and collapse. In the genomic era, genomes must be reconstructed from fragments using computational methods, or assemblers. How do we know that a new genome assembly is correct? This is important because errors in assembly can lead to downstream problems in gene predictions and these inaccurate results can contaminate databases, affecting later comparative studies. A particular challenge occurs when a diploid organism inherits two highly divergent genome copies from its parents. While it is widely appreciated that this type of data is difficult for assemblers to handle properly, here we show that the process is prone to more errors than previously appreciated. Specifically, we document examples of regional expansion and collapse, affecting downstream gene prediction accuracy, but without changing the overall genome assembly size or other metrics of accuracy. Our results suggest that assembly evaluation methods should be altered to identify whether regional expansions and collapses are present in the genome assembly.
Collapse
Affiliation(s)
- Kathryn C. Asalone
- Biology Department, American University, Washington DC, United States of America
| | - Kara M. Ryan
- Biology Department, American University, Washington DC, United States of America
| | - Maryam Yamadi
- Biology Department, American University, Washington DC, United States of America
| | - Annastelle L. Cohen
- Biology Department, American University, Washington DC, United States of America
| | - William G. Farmer
- Biology Department, American University, Washington DC, United States of America
| | - Deborah J. George
- Biology Department, American University, Washington DC, United States of America
| | - Claudia Joppert
- Biology Department, American University, Washington DC, United States of America
| | - Kaitlyn Kim
- Biology Department, American University, Washington DC, United States of America
| | - Madeeha Froze Mughal
- Biology Department, American University, Washington DC, United States of America
| | - Rana Said
- Biology Department, American University, Washington DC, United States of America
| | - Metin Toksoz-Exley
- Mathematics and Statistics Department, American University, Washington DC, United States of America
| | - Evgeny Bisk
- Office of Information Technology, American University, Washington DC, United States of America
| | - John R. Bracht
- Biology Department, American University, Washington DC, United States of America
- * E-mail:
| |
Collapse
|
35
|
Ghelfi A, Shirasawa K, Hirakawa H, Isobe S. Hayai-Annotation Plants: an ultra-fast and comprehensive functional gene annotation system in plants. Bioinformatics 2020; 35:4427-4429. [PMID: 31093654 PMCID: PMC6821316 DOI: 10.1093/bioinformatics/btz380] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2018] [Revised: 04/25/2019] [Accepted: 04/30/2019] [Indexed: 11/15/2022] Open
Abstract
Summary Hayai-Annotation Plants is a browser-based interface for an ultra-fast and accurate functional gene annotation system for plant species using R. The pipeline combines the sequence-similarity searches, using USEARCH against UniProtKB (taxonomy Embryophyta), with a functional annotation step. Hayai-Annotation Plants provides five layers of annotation: i) protein name; ii) gene ontology terms consisting of its three main domains (Biological Process, Molecular Function and Cellular Component); iii) enzyme commission number; iv) protein existence level; and v) evidence type. It implements a new algorithm that gives priority to protein existence level to propagate GO and EC information and annotated Arabidopsis thaliana representative peptide sequences (Araport11) within 5 min at the PC level. Availability and implementation The software is implemented in R and runs on Macintosh and Linux systems. It is freely available at https://github.com/kdri-genomics/Hayai-Annotation-Plants under the GPLv3 license. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Andrea Ghelfi
- Laboratory of Plant Genetics and Genomics, Department of Frontier Research and Development, Kazusa DNA Research Institute, Kazusa-kamatari, Chiba, Japan
| | - Kenta Shirasawa
- Laboratory of Plant Genetics and Genomics, Department of Frontier Research and Development, Kazusa DNA Research Institute, Kazusa-kamatari, Chiba, Japan
| | - Hideki Hirakawa
- Laboratory of Plant Genetics and Genomics, Department of Frontier Research and Development, Kazusa DNA Research Institute, Kazusa-kamatari, Chiba, Japan
| | - Sachiko Isobe
- Laboratory of Plant Genetics and Genomics, Department of Frontier Research and Development, Kazusa DNA Research Institute, Kazusa-kamatari, Chiba, Japan
| |
Collapse
|
36
|
Yang W, Liu Y, Dong R, Liu J, Lang J, Yang J, Wang W, Li J, Meng B, Tian G. Accurate Detection of HPV Integration Sites in Cervical Cancer Samples Using the Nanopore MinION Sequencer Without Error Correction. Front Genet 2020; 11:660. [PMID: 32714374 PMCID: PMC7344299 DOI: 10.3389/fgene.2020.00660] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 05/29/2020] [Indexed: 12/14/2022] Open
Abstract
During the carcinogenesis of cervical cancer, the DNA of human papillomavirus (HPV) is frequently integrated into the human genome, which might be a biomarker for the early diagnosis of cervical cancer. Although the detection sensitivity of virus infection status increased significantly through the Illumina sequencing platform, there were still disadvantages remain for further improvement, including the detection accuracy and the complex integrated genome structure identification, etc. Nanopore sequencing has been proven to be a fast yet accurate technique of detecting pathogens in clinical samples with significant longer sequencing length. However, the identification of virus integration sites, especially HPV integration sites was seldom carried out by using nanopore platform. In this study, we evaluated the feasibility of identifying HPV integration sites by nanopore sequencer. Specifically, we re-sequenced the integration sites of a previously published sample by both nanopore and Illumina sequencing. After analyzing the results, three points of conclusions were drawn: first, 13 out of 19 previously published integration sites were found from all three datasets (i.e., nanopore, Illumina, and the published data), indicating a high overlap rate and comparability among the three platforms; second, our pipeline of nanopore and Illumina data identified 66 unique integration sites compared with previous published paper with 13 of them being verified by Sanger sequencing, indicating the higher integration sites detection sensitivity of our results compared with published data; third, we established a pipeline which could be used in HPV integration site detection by nanopore sequencing data without doing error correction analysis. In summary, a new nanopore data analysis method was tested and proved to be reliable in integration sites detection compared with methods of existing Illumina data analysis pipeline with less sequencing data required. It provides a solid evidence and tool to support the potential application of nanopore in virus status identification.
Collapse
Affiliation(s)
| | - Ying Liu
- Laboratory of Genetics, Key Laboratory of Carcinogenesis and Translational Research, Ministry of Education, Peking University Cancer Hospital and Institute, Beijing, China
| | - Ruyi Dong
- Geneis (Beijing) Co., Ltd., Beijing, China
| | - Jia Liu
- Geneis (Beijing) Co., Ltd., Beijing, China
| | | | | | | | - Jingjing Li
- The Precision Medicine Centre of Drum Tower Hospital, Medical School of Nanjing University, Nanjing, China
| | - Bo Meng
- Geneis (Beijing) Co., Ltd., Beijing, China
| | - Geng Tian
- Geneis (Beijing) Co., Ltd., Beijing, China
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| |
Collapse
|
37
|
Quan L, Dong R, Yang W, Chen L, Lang J, Liu J, Song Y, Ma S, Yang J, Wang W, Meng B, Tian G. Simultaneous detection and comprehensive analysis of HPV and microbiome status of a cervical liquid-based cytology sample using Nanopore MinION sequencing. Sci Rep 2019; 9:19337. [PMID: 31852945 PMCID: PMC6920169 DOI: 10.1038/s41598-019-55843-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Accepted: 11/21/2019] [Indexed: 02/06/2023] Open
Abstract
Human papillomavirus (HPV) is a major pathogen that causes cervical cancer and many other related diseases. HPV infection related cervical microbiome could be an induce factor of cervical cancer. However, it is uncommon to find a single test on the market that can simultaneously provide information on both HPV and the microbiome. Herein, a novel method was developed in this study to simultaneously detect HPV infection and microbiota composition promptly and accurately. It provides a new and simple way to detect vaginal pathogen situation and also provide valuable information for clinical diagnose. This approach combined multiplex PCR, which targeted both HPV16 E6E7 and full-length 16S rRNA, and Nanopore sequencing to generate enough information to understand the vagina condition of patients. One HPV positive liquid-based cytology (LBC) sample was sequenced and analyzed. After comparing with Illumina sequencing, the results from Nanopore showed a similar microbiome composition. An instant sequencing evaluation showed that 15 min sequencing is enough to identify the top 10 most abundant bacteria. Moreover, two HPV integration sites were identified and verified by Sanger sequencing. This approach has many potential applications in pathogen detection and can potentially aid in providing a more rapid clinical diagnosis.
Collapse
Affiliation(s)
- Lili Quan
- Department of Gynaecology and Obstetrics, Sanmenxia Central Hospital of Henan University of Science and Technology, Sanmenxia, 472000, Henan, China
| | - Ruyi Dong
- Geneis (Beijing) Co.Ltd, Beijing, 100102, China
| | | | - Lanyou Chen
- Geneis (Beijing) Co.Ltd, Beijing, 100102, China
| | - Jidong Lang
- Geneis (Beijing) Co.Ltd, Beijing, 100102, China
| | - Jia Liu
- Geneis (Beijing) Co.Ltd, Beijing, 100102, China
| | - Yu Song
- Department of Gynaecology and Obstetrics, Sanmenxia Central Hospital of Henan University of Science and Technology, Sanmenxia, 472000, Henan, China
| | - Shuiqing Ma
- Department of Gynaecology and Obstetrics, Peking Union Medical College Hospital, Beijing, 100730, China
| | | | - Weiwei Wang
- Geneis (Beijing) Co.Ltd, Beijing, 100102, China
| | - Bo Meng
- Geneis (Beijing) Co.Ltd, Beijing, 100102, China.
| | - Geng Tian
- Geneis (Beijing) Co.Ltd, Beijing, 100102, China.
| |
Collapse
|
38
|
Mattingsdal M, Jorde PE, Knutsen H, Jentoft S, Stenseth NC, Sodeland M, Robalo JI, Hansen MM, André C, Blanco Gonzalez E. Demographic history has shaped the strongly differentiated corkwing wrasse populations in Northern Europe. Mol Ecol 2019; 29:160-171. [DOI: 10.1111/mec.15310] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Revised: 11/06/2019] [Accepted: 11/13/2019] [Indexed: 12/11/2022]
Affiliation(s)
- Morten Mattingsdal
- Department of Natural Sciences Centre for Coastal Research University of Agder Kristiansand Norway
| | | | - Halvor Knutsen
- Department of Natural Sciences Centre for Coastal Research University of Agder Kristiansand Norway
- Institute of Marine Research Flødevigen Norway
| | - Sissel Jentoft
- Department of Biosciences Centre for Ecological and Evolutionary Synthesis University of Oslo Oslo Norway
| | - Nils Christian Stenseth
- Department of Natural Sciences Centre for Coastal Research University of Agder Kristiansand Norway
- Department of Biosciences Centre for Ecological and Evolutionary Synthesis University of Oslo Oslo Norway
| | - Marte Sodeland
- Department of Natural Sciences Centre for Coastal Research University of Agder Kristiansand Norway
| | - Joana I. Robalo
- Marine and Environmental Sciences Centre ISPA Instituto Universitário de Ciências Psicológicas, Sociais e da Vida Lisboa Portugal
| | | | - Carl André
- Department of Marine Sciences‐Tjärnö Göteborg University Strömstad Sweden
| | - Enrique Blanco Gonzalez
- Department of Natural Sciences Centre for Coastal Research University of Agder Kristiansand Norway
- Norwegian College of Fishery Science UiT The Arctic University of Norway Tromsø Norway
| |
Collapse
|
39
|
Hoang NV, Furtado A, Perlo V, Botha FC, Henry RJ. The Impact of cDNA Normalization on Long-Read Sequencing of a Complex Transcriptome. Front Genet 2019; 10:654. [PMID: 31396260 PMCID: PMC6664245 DOI: 10.3389/fgene.2019.00654] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Accepted: 06/20/2019] [Indexed: 11/13/2022] Open
Abstract
Normalization of cDNA is widely used to improve the coverage of rare transcripts in analysis of transcriptomes employing next-generation sequencing. Recently, long-read technology has been emerging as a powerful tool for sequencing and construction of transcriptomes, especially for complex genomes containing highly similar transcripts and transcript-spliced isoforms. Here, we analyzed the transcriptome of sugarcane, a highly polyploidy plant genome, by PacBio isoform sequencing (Iso-Seq) of two different cDNA library preparations, with and without a normalization step. The results demonstrated that, while the two libraries included many of the same transcripts, many longer transcripts were removed, and many new generally shorter transcripts were detected by normalization. For the same input cDNA and data yield, the normalized library recovered more total transcript isoforms and number of predicted gene families and orthologous groups, resulting in a higher representation for the sugarcane transcriptome, compared to the non-normalized library. The non-normalized library, on the other hand, included a wider transcript length range with more longer transcripts above ∼1.25 kb and more transcript isoforms per gene family and gene ontology terms per transcript. A large proportion of the unique transcripts comprising ∼52% of the normalized library were expressed at a lower level than the unique transcripts from the non-normalized library, across three tissue types tested including leaf, stalk, and root. About 83% of the total 5,348 predicted long noncoding transcripts was derived from the normalized library, of which ∼80% was derived from the lowly expressed fraction. Functional annotation of the unique transcripts suggested that each library enriched different functional transcript fractions. This demonstrated the complementation of the two approaches in obtaining a complete transcriptome of a complex genome at the sequencing depth used in this study.
Collapse
Affiliation(s)
- Nam V. Hoang
- College of Agriculture and Forestry, Hue University, Hue, Vietnam
| | - Agnelo Furtado
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, Australia
| | - Virginie Perlo
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, Australia
| | - Frederik C. Botha
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, Australia
- Sugar Research Australia, Indooroopilly, QLD, Australia
| | - Robert J. Henry
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, Australia
| |
Collapse
|
40
|
Turmel M, Lopes Dos Santos A, Otis C, Sergerie R, Lemieux C. Tracing the Evolution of the Plastome and Mitogenome in the Chloropicophyceae Uncovered Convergent tRNA Gene Losses and a Variant Plastid Genetic Code. Genome Biol Evol 2019; 11:1275-1292. [PMID: 30937436 PMCID: PMC6486808 DOI: 10.1093/gbe/evz074] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/30/2019] [Indexed: 01/06/2023] Open
Abstract
The tiny green algae belonging to the Chloropicophyceae play a key role in marine phytoplankton communities; this newly erected class of prasinophytes comprises two genera (Chloropicon and Chloroparvula) containing each several species. We sequenced the plastomes and mitogenomes of eight Chloropicon and five Chloroparvula species to better delineate the phylogenetic affinities of these taxa and to infer the suite of changes that their organelle genomes sustained during evolution. The relationships resolved in organelle-based phylogenomic trees were essentially congruent with previously reported rRNA trees, and similar evolutionary trends but distinct dynamics were identified for the plastome and mitogenome. Although the plastome sustained considerable changes in gene content and order at the time the two genera split, subsequently it remained stable and maintained a very small size. The mitogenome, however, was remodeled more gradually and showed more fluctuation in size, mainly as a result of expansions/contractions of intergenic regions. Remarkably, the plastome and mitogenome lost a common set of three tRNA genes, with the trnI(cau) and trnL(uaa) losses being accompanied with important variations in codon usage. Unexpectedly, despite the disappearance of trnI(cau) from the plastome in the Chloroparvula lineage, AUA codons (the codons recognized by this gene product) were detected in certain plastid genes. By comparing the sequences of plastid protein-coding genes from chloropicophycean and phylogenetically diverse chlorophyte algae with those of the corresponding predicted proteins, we discovered that the AUA codon was reassigned from isoleucine to methionine in Chloroparvula. This noncanonical genetic code has not previously been uncovered in plastids.
Collapse
Affiliation(s)
- Monique Turmel
- Département de biochimie, de microbiologie et de bio-informatique, Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec City, Québec, Canada
| | - Adriana Lopes Dos Santos
- Asian School of the Environment, Nanyang Technological University, Singapore.,CNRS, Sorbonne Université, UMR 7144 Station Biologique de Roscoff, Roscoff, France
| | - Christian Otis
- Département de biochimie, de microbiologie et de bio-informatique, Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec City, Québec, Canada
| | - Roxanne Sergerie
- Département de biochimie, de microbiologie et de bio-informatique, Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec City, Québec, Canada
| | - Claude Lemieux
- Département de biochimie, de microbiologie et de bio-informatique, Institut de Biologie Intégrative et des Systèmes, Université Laval, Québec City, Québec, Canada
| |
Collapse
|
41
|
Voorhuijzen-Harink MM, Hagelaar R, van Dijk JP, Prins TW, Kok EJ, Staats M. Toward on-site food authentication using nanopore sequencing. FOOD CHEMISTRY-X 2019; 2:100035. [PMID: 31432019 PMCID: PMC6694865 DOI: 10.1016/j.fochx.2019.100035] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/14/2019] [Revised: 04/25/2019] [Accepted: 05/23/2019] [Indexed: 11/03/2022]
Abstract
•MinION DNA metabarcoding is a promising tool for species identification in food.•MinION and Illumina MiSeq sequencing platforms perform equally accurate.•Species identification with MinION sequencing requires dedicated bioinformatics.
Collapse
Affiliation(s)
| | - Rico Hagelaar
- Princess Máxima Center for Pediatric Oncology, Netherlands
| | - Jeroen P van Dijk
- Wageningen University & Research - Wageningen Food Safety Research (WFSR), Netherlands
| | - Theo W Prins
- Wageningen University & Research - Wageningen Food Safety Research (WFSR), Netherlands
| | - Esther J Kok
- Wageningen University & Research - Wageningen Food Safety Research (WFSR), Netherlands
| | - Martijn Staats
- Wageningen University & Research - Wageningen Food Safety Research (WFSR), Netherlands
| |
Collapse
|
42
|
Abstract
Whole-genome alignment (WGA) is the prediction of evolutionary relationships at the nucleotide level between two or more genomes. It combines aspects of both colinear sequence alignment and gene orthology prediction and is typically more challenging to address than either of these tasks due to the size and complexity of whole genomes. Despite the difficulty of this problem, numerous methods have been developed for its solution because WGAs are valuable for genome-wide analyses such as phylogenetic inference, genome annotation, and function prediction. In this chapter, we discuss the meaning and significance of WGA and present an overview of the methods that address it. We also examine the problem of evaluating whole-genome aligners and offer a set of methodological challenges that need to be tackled in order to make most effective use of our rapidly growing databases of whole genomes.
Collapse
Affiliation(s)
- Colin N Dewey
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA.
| |
Collapse
|
43
|
Kirsch R, Seemann SE, Ruzzo WL, Cohen SM, Stadler PF, Gorodkin J. Identification and characterization of novel conserved RNA structures in Drosophila. BMC Genomics 2018; 19:899. [PMID: 30537930 PMCID: PMC6288889 DOI: 10.1186/s12864-018-5234-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Accepted: 11/08/2018] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes - and thus for conserved secondary structures - even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome. RESULTS To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ∼30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, most statistically significant enrichment of CRSs were observed in pupae, mainly in exons of untranslated regions, promotors, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein coding genes, which suggests that most intergenic CRSs are independent genetic units. CONCLUSIONS This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines.
Collapse
Affiliation(s)
- Rebecca Kirsch
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Department of Veterinary and Animal Science, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, Leipzig, D-04107 Germany
| | - Stefan E. Seemann
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Department of Veterinary and Animal Science, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
| | - Walter L. Ruzzo
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- School of Computer Science and Engineering, University of Washington, Box 352350, Seattle, 98195-2350 WA USA
- Department of Genome Sciences, University of Washington, Box 355065, Seattle, 98195-5065 WA USA
- Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Seattle, 98109-1024 WA USA
| | - Stephen M. Cohen
- Department of Cellular and Molecular Medicine, University of Copenhagen, Blegdamsvej 3, Copenhagen N, DK-2200 Denmark
| | - Peter F. Stadler
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, Leipzig, D-04107 Germany
- Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103 Germany
- Faculdad de Ciencias, Universidad Nacional de Colombia, Sede Bogotá, Ciudad Universitaria, Bogotá, COL-111321 D.C. Colombia
- Department of Theoretical Chemistry, University of Vienna, Währinger Straße 17, Vienna, A-1090 Austria
- Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM87501 USA
| | - Jan Gorodkin
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Department of Veterinary and Animal Science, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
| |
Collapse
|
44
|
Toomey MB, Marques CI, Andrade P, Araújo PM, Sabatino S, Gazda MA, Afonso S, Lopes RJ, Corbo JC, Carneiro M. A non-coding region near Follistatin controls head colour polymorphism in the Gouldian finch. Proc Biol Sci 2018; 285:20181788. [PMID: 30282656 PMCID: PMC6191701 DOI: 10.1098/rspb.2018.1788] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Accepted: 09/05/2018] [Indexed: 12/17/2022] Open
Abstract
Discrete colour morphs coexisting within a single population are common in nature. In a broad range of organisms, sympatric colour morphs often display major differences in other traits, including morphology, physiology or behaviour. Despite the repeated occurrence of this phenomenon, our understanding of the genetics that underlie multi-trait differences and the factors that promote the long-term maintenance of phenotypic variability within a freely interbreeding population are incomplete. Here, we investigated the genetic basis of red and black head colour in the Gouldian finch (Erythrura gouldiae), a classic polymorphic system in which naturally occurring colour morphs also display differences in aggressivity and reproductive success. We show that the candidate locus is a small (approx. 70 kb) non-coding region mapping to the Z chromosome near the Follistatin (FST) gene. Unlike recent findings in other systems where phenotypic morphs are explained by large inversions containing hundreds of genes (so-called supergenes), we did not identify any structural rearrangements between the two haplotypes using linked-read sequencing technology. Nucleotide divergence between the red and black alleles was high when compared to the remainder of the Z chromosome, consistent with their maintenance as balanced polymorphisms over several million years. Our results illustrate how pleiotropic phenotypes can arise from simple genetic variation, probably regulatory in nature.
Collapse
Affiliation(s)
- Matthew B Toomey
- Department of Pathology and Immunology, Washington University School of Medicine, St Louis, MO 63110, USA
| | - Cristiana I Marques
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, 4485-661 Vairão, Portugal
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, 4169-007 Porto, Portugal
| | - Pedro Andrade
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, 4485-661 Vairão, Portugal
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, 4169-007 Porto, Portugal
| | - Pedro M Araújo
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, 4485-661 Vairão, Portugal
- Centro de Ciências do Mar e do Ambiente, Departamento de Ciências da Vida, Universidade de Coimbra, 3004-517 Coimbra, Portugal
| | - Stephen Sabatino
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, 4485-661 Vairão, Portugal
| | - Małgorzata A Gazda
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, 4485-661 Vairão, Portugal
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, 4169-007 Porto, Portugal
| | - Sandra Afonso
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, 4485-661 Vairão, Portugal
| | - Ricardo J Lopes
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, 4485-661 Vairão, Portugal
| | - Joseph C Corbo
- Department of Pathology and Immunology, Washington University School of Medicine, St Louis, MO 63110, USA
| | - Miguel Carneiro
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, 4485-661 Vairão, Portugal
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, 4169-007 Porto, Portugal
| |
Collapse
|
45
|
Han R, Li Y, Gao X, Wang S. An accurate and rapid continuous wavelet dynamic time warping algorithm for end-to-end mapping in ultra-long nanopore sequencing. Bioinformatics 2018; 34:i722-i731. [DOI: 10.1093/bioinformatics/bty555] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Renmin Han
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| | - Yu Li
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| | - Xin Gao
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| | - Sheng Wang
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| |
Collapse
|
46
|
Olson D, Wheeler T. ULTRA: A Model Based Tool to Detect Tandem Repeats. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2018; 2018:37-46. [PMID: 31080962 DOI: 10.1145/3233547.3233604] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
In biological sequences, tandem repeats consist of tens to hundreds of residues of a repeated pattern, such as atgatgatgatgatg ('atg' repeated), often the result of replication slippage. Over time, these repeats decay so that the original sharp pattern of repetition is somewhat obscured, but even degenerate repeats pose a problem for sequence annotation: when two sequences both contain shared patterns of similar repetition, the result can be a false signal of sequence homology. We describe an implementation of a new hidden Markov model for detecting tandem repeats that shows substantially improved sensitivity to labeling decayed repetitive regions, presents low and reliable false annotation rates across a wide range of sequence composition, and produces scores that follow a stable distribution. On typical genomic sequence, the time and memory requirements of the resulting tool (ULTRA) are competitive with the most heavily used tool for repeat masking (TRF). ULTRA is released under an open source license and lays the groundwork for inclusion of the model in sequence alignment tools and annotation pipelines.
Collapse
|
47
|
Kukekova AV, Johnson JL, Xiang X, Feng S, Liu S, Rando HM, Kharlamova AV, Herbeck Y, Serdyukova NA, Xiong Z, Beklemischeva V, Koepfli KP, Gulevich RG, Vladimirova AV, Hekman JP, Perelman PL, Graphodatsky AS, O'Brien SJ, Wang X, Clark AG, Acland GM, Trut LN, Zhang G. Red fox genome assembly identifies genomic regions associated with tame and aggressive behaviours. Nat Ecol Evol 2018; 2:1479-1491. [PMID: 30082739 DOI: 10.1038/s41559-018-0611-6] [Citation(s) in RCA: 78] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Accepted: 06/18/2018] [Indexed: 12/30/2022]
Abstract
Strains of red fox (Vulpes vulpes) with markedly different behavioural phenotypes have been developed in the famous long-term selective breeding programme known as the Russian farm-fox experiment. Here we sequenced and assembled the red fox genome and re-sequenced a subset of foxes from the tame, aggressive and conventional farm-bred populations to identify genomic regions associated with the response to selection for behaviour. Analysis of the re-sequenced genomes identified 103 regions with either significantly decreased heterozygosity in one of the three populations or increased divergence between the populations. A strong positional candidate gene for tame behaviour was highlighted: SorCS1, which encodes the main trafficking protein for AMPA glutamate receptors and neurexins and suggests a role for synaptic plasticity in fox domestication. Other regions identified as likely to have been under selection in foxes include genes implicated in human neurological disorders, mouse behaviour and dog domestication. The fox represents a powerful model for the genetic analysis of affiliative and aggressive behaviours that can benefit genetic studies of behaviour in dogs and other mammals, including humans.
Collapse
Affiliation(s)
- Anna V Kukekova
- Animal Sciences Department, College of ACES, University of Illinois at Urbana, Champaign, IL, USA.
| | - Jennifer L Johnson
- Animal Sciences Department, College of ACES, University of Illinois at Urbana, Champaign, IL, USA
| | - Xueyan Xiang
- China National Genebank, BGI -Shenzhen, Shenzhen, China
| | - Shaohong Feng
- China National Genebank, BGI -Shenzhen, Shenzhen, China
| | - Shiping Liu
- China National Genebank, BGI -Shenzhen, Shenzhen, China
| | - Halie M Rando
- Animal Sciences Department, College of ACES, University of Illinois at Urbana, Champaign, IL, USA
| | - Anastasiya V Kharlamova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Yury Herbeck
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Natalya A Serdyukova
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Zijun Xiong
- China National Genebank, BGI -Shenzhen, Shenzhen, China.,State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| | - Violetta Beklemischeva
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Klaus-Peter Koepfli
- Smithsonian Conservation Biology Institute, National Zoological Park, Washington DC, USA.,Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, Saint Petersburg, Russia
| | - Rimma G Gulevich
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Anastasiya V Vladimirova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Jessica P Hekman
- Animal Sciences Department, College of ACES, University of Illinois at Urbana, Champaign, IL, USA.,The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Polina L Perelman
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia.,Novosibirsk State University, Novosibirsk, Russia
| | - Aleksander S Graphodatsky
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia.,Novosibirsk State University, Novosibirsk, Russia
| | - Stephen J O'Brien
- Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, Saint Petersburg, Russia.,Guy Harvey Oceanographic Center, Halmos College of Natural Sciences and Oceanography, Nova Southeastern University, Fort Lauderdale, FL, USA
| | - Xu Wang
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA.,Department of Pathobiology, Auburn University, Auburn, AL, USA
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Gregory M Acland
- Baker Institute for Animal Health, Cornell University, College of Veterinary Medicine, Ithaca, NY, USA
| | - Lyudmila N Trut
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Guojie Zhang
- China National Genebank, BGI -Shenzhen, Shenzhen, China. .,State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China. .,Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
48
|
Karamitros T, van Wilgenburg B, Wills M, Klenerman P, Magiorkinis G. Nanopore sequencing and full genome de novo assembly of human cytomegalovirus TB40/E reveals clonal diversity and structural variations. BMC Genomics 2018; 19:577. [PMID: 30068288 PMCID: PMC6090854 DOI: 10.1186/s12864-018-4949-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Accepted: 07/19/2018] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Human cytomegalovirus (HCMV) has a double-stranded DNA genome of approximately 235 Kbp that is structurally complex including extended GC-rich repeated regions. Genomic recombination events are frequent in HCMV cultures but have also been observed in vivo. Thus, the assembly of HCMV whole genomes from technologies producing shorter than 500 bp sequences is technically challenging. Here we improved the reconstruction of HCMV full genomes by means of a hybrid, de novo genome-assembly bioinformatics pipeline upon data generated from the recently released MinION MkI B sequencer from Oxford Nanopore Technologies. RESULTS The MinION run of the HCMV (strain TB40/E) library resulted in ~ 47,000 reads from a single R9 flowcell and in ~ 100× average read depth across the virus genome. We developed a novel, self-correcting bioinformatics algorithm to assemble the pooled HCMV genomes in three stages. In the first stage of the bioinformatics algorithm, long contigs (N50 = 21,892) of lower accuracy were reconstructed. In the second stage, short contigs (N50 = 5686) of higher accuracy were assembled, while in the final stage the high quality contigs served as template for the correction of the longer contigs resulting in a high-accuracy, full genome assembly (N50 = 41,056). We were able to reconstruct a single representative haplotype without employing any scaffolding steps. The majority (98.8%) of the genomic features from the reference strain were accurately annotated on this full genome construct. Our method also allowed the detection of multiple alternative sub-genomic fragments and non-canonical structures suggesting rearrangement events between the unique (UL /US) and the repeated (T/IRL/S) genomic regions. CONCLUSIONS Third generation high-throughput sequencing technologies can accurately reconstruct full-length HCMV genomes including their low-complexity and highly repetitive regions. Full-length HCMV genomes could prove crucial in understanding the genetic determinants and viral evolution underpinning drug resistance, virulence and pathogenesis.
Collapse
Affiliation(s)
- Timokratis Karamitros
- Department of Zoology, University of Oxford, Oxford, United Kingdom. .,Public Health Laboratories, Department of Microbiology, Hellenic Pasteur Institute, 127 Vas Sofias Ave, 11527, Athens, Greece.
| | - Bonnie van Wilgenburg
- Nuffield Department of Clinical Medicine, University of Oxford, Oxford, United Kingdom
| | - Mark Wills
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Paul Klenerman
- Nuffield Department of Clinical Medicine, University of Oxford, Oxford, United Kingdom.,NIHR Biomedical Research Centre, Oxford, United Kingdom
| | - Gkikas Magiorkinis
- Department of Zoology, University of Oxford, Oxford, United Kingdom. .,Department of Hygiene, Epidemiology and Medical Statistics, Medical School, National and Kapodistrian University of Athens, M. Asias 75 str., 11527, Athens, Greece.
| |
Collapse
|
49
|
Karamitros T, Magiorkinis G. Multiplexed Targeted Sequencing for Oxford Nanopore MinION: A Detailed Library Preparation Procedure. Methods Mol Biol 2018; 1712:43-51. [PMID: 29224067 DOI: 10.1007/978-1-4939-7514-3_4] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
MinION is a small form factor sequencer recently retailed by Oxford Nanopore technologies. This lighter-sized USB3.0-interfaced device uses innovative nanotechnology to generate extra-long reads from libraries prepared using only standard molecular biology lab equipment. The flexibility and the portability of the platform makes it ideal for point-of-interest and real-time surveillance applications. However, MinION's limited capacity is not enough for the study of specific targets within larger genomes. Apart from just PCR-amplifying regions of interest, the capture of long reads spanning the edges of known-unknown genomic regions is of great importance for structural studies, such as the identification of mobile elements' integrations sites, bridging over low complexity repetitive regions etc.In this study, using MinION-kit-included and commercially available reagents, we have developed an easy and versatile wet-lab procedure for the targeted enrichment of MinION libraries, capturing DNA fragments of interest before the ligation of the sensitive MinION sequencing-adapters. This method allows for simultaneous target-enrichment and barcode-multiplexing of up to 12 libraries, which can be loaded in the same sequencing run.
Collapse
Affiliation(s)
- Timokratis Karamitros
- Department of Zoology, University of Oxford, Oxford, OX1 3PS, UK. .,Department of Medical Microbiology, Hellenic Pasteur Institute, 127 Vas. Sofias Ave., 11527, Athens, Greece.
| | | |
Collapse
|
50
|
Integrated metatranscriptomics and metaproteomics for the characterization of bacterial microbiota in unfed Ixodes ricinus. Ticks Tick Borne Dis 2018; 9:1241-1251. [DOI: 10.1016/j.ttbdis.2018.04.020] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 04/28/2018] [Accepted: 04/29/2018] [Indexed: 12/12/2022]
|