Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Leinonen M, Salmela L. Optical map guided genome assembly. BMC Bioinformatics 2020;21:285. [PMID: 32631227 PMCID: PMC7336458 DOI: 10.1186/s12859-020-03623-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Accepted: 06/22/2020] [Indexed: 11/26/2022] Open

For:	Leinonen M, Salmela L. Optical map guided genome assembly. BMC Bioinformatics 2020;21:285. [PMID: 32631227 PMCID: PMC7336458 DOI: 10.1186/s12859-020-03623-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Accepted: 06/22/2020] [Indexed: 11/26/2022] Open

Number

Cited by Other Article(s)

Jackson DJ, Cerveau N, Posnien N. De novo assembly of transcriptomes and differential gene expression analysis using short-read data from emerging model organisms - a brief guide. Front Zool 2024;21:17. [PMID: 38902827 PMCID: PMC11188175 DOI: 10.1186/s12983-024-00538-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 06/12/2024] [Indexed: 06/22/2024] Open

Abstract

Many questions in biology benefit greatly from the use of a variety of model systems. High-throughput sequencing methods have been a triumph in the democratization of diverse model systems. They allow for the economical sequencing of an entire genome or transcriptome of interest, and with technical variations can even provide insight into genome organization and the expression and regulation of genes. The analysis and biological interpretation of such large datasets can present significant challenges that depend on the 'scientific status' of the model system. While high-quality genome and transcriptome references are readily available for well-established model systems, the establishment of such references for an emerging model system often requires extensive resources such as finances, expertise and computation capabilities. The de novo assembly of a transcriptome represents an excellent entry point for genetic and molecular studies in emerging model systems as it can efficiently assess gene content while also serving as a reference for differential gene expression studies. However, the process of de novo transcriptome assembly is non-trivial, and as a rule must be empirically optimized for every dataset. For the researcher working with an emerging model system, and with little to no experience with assembling and quantifying short-read data from the Illumina platform, these processes can be daunting. In this guide we outline the major challenges faced when establishing a reference transcriptome de novo and we provide advice on how to approach such an endeavor. We describe the major experimental and bioinformatic steps, provide some broad recommendations and cautions for the newcomer to de novo transcriptome assembly and differential gene expression analyses. Moreover, we provide an initial selection of tools that can assist in the journey from raw short-read data to assembled transcriptome and lists of differentially expressed genes.

Collapse

Young BD, Williamson OM, Kron NS, Andrade Rodriguez N, Isma LM, MacKnight NJ, Muller EM, Rosales SM, Sirotzke SM, Traylor-Knowles N, Williams SD, Studivan MS. Annotated genome and transcriptome of the endangered Caribbean mountainous star coral (Orbicella faveolata) using PacBio long-read sequencing. BMC Genomics 2024;25:226. [PMID: 38424480 PMCID: PMC10905781 DOI: 10.1186/s12864-024-10092-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Accepted: 02/05/2024] [Indexed: 03/02/2024] Open

Affiliation(s)

Benjamin D Young Cooperative Institute of Marine and Atmospheric Science, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA. Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL, USA.
Olivia M Williamson Department of Marine Biology and Ecology, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA
Nicholas S Kron Department of Marine Biology and Ecology, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA
Natalia Andrade Rodriguez Department of Marine Biology and Ecology, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA
Lys M Isma Department of Marine Biology and Ecology, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA
Nicholas J MacKnight Cooperative Institute of Marine and Atmospheric Science, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL, USA
Erinn M Muller Mote Marine Laboratory, Sarasota, FL, USA
Stephanie M Rosales Cooperative Institute of Marine and Atmospheric Science, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL, USA
Stephanie M Sirotzke Mote Marine Laboratory, Sarasota, FL, USA
Nikki Traylor-Knowles Department of Marine Biology and Ecology, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA
Sara D Williams Mote Marine Laboratory, Sarasota, FL, USA
Michael S Studivan Cooperative Institute of Marine and Atmospheric Science, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL, USA

Collapse

Bringloe TT, Parent GJ. Contrasting new and available reference genomes to highlight uncertainties in assemblies and areas for future improvement: an example with monodontid species. BMC Genomics 2023;24:693. [PMID: 37985969 PMCID: PMC10659057 DOI: 10.1186/s12864-023-09779-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 10/31/2023] [Indexed: 11/22/2023] Open

Abstract

BACKGROUND

Reference genomes provide a foundational framework for evolutionary investigations, ecological analysis, and conservation science, yet uncertainties in the assembly of reference genomes are difficult to assess, and by extension rarely quantified. Reference genomes for monodontid cetaceans span a wide spectrum of data types and analytical approaches, providing the context to derive broader insights related to discrepancies and regions of uncertainty in reference genome assembly. We generated three beluga (Delphinapterus leucas) and one narwhal (Monodon monoceros) reference genomes and contrasted these with published chromosomal scale assemblies for each species to quantify discrepancies associated with genome assemblies.

RESULTS

The new reference genomes achieved chromosomal scale assembly using a combination of PacBio long reads, Illumina short reads, and Hi-C scaffolding data. For beluga, we identified discrepancies in the order and orientation of contigs in 2.2-3.7% of the total genome depending on the pairwise comparison of references. In addition, unsupported higher order scaffolding was identified in published reference genomes. In contrast, we estimated 8.2% of the compared narwhal genomes featured discrepancies, with inversions being notably abundant (5.3%). Discrepancies were linked to repetitive elements in both species.

CONCLUSIONS

We provide several new reference genomes for beluga (Delphinapterus leucas), while highlighting potential avenues for improvements. In particular, additional layers of data providing information on ultra-long genomic distances are needed to resolve persistent errors in reference genome construction. The comparative analyses of monodontid reference genomes suggested that the three new reference genomes for beluga are more accurate compared to the currently published reference genome, but that the new narwhal genome is less accurate than one published. We also present a conceptual summary for improving the accuracy of reference genomes with relevance to end-user needs and how they relate to levels of assembly quality and uncertainty.

Collapse

Bajic M, Ravishankar S, Sheth M, Rowe LA, Pacheco MA, Patel DS, Batra D, Loparev V, Olsen C, Escalante AA, Vannberg F, Udhayakumar V, Barnwell JW, Talundzic E. The first complete genome of the simian malaria parasite Plasmodium brasilianum. Sci Rep 2022;12:19802. [PMID: 36396703 PMCID: PMC9671904 DOI: 10.1038/s41598-022-20706-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 09/16/2022] [Indexed: 11/18/2022] Open

Abstract

Naturally occurring human infections by zoonotic Plasmodium species have been documented for P. knowlesi, P. cynomolgi, P. simium, P. simiovale, P. inui, P. inui-like, P. coatneyi, and P. brasilianum. Accurate detection of each species is complicated by their morphological similarities with other Plasmodium species. PCR-based assays offer a solution but require prior knowledge of adequate genomic targets that can distinguish the species. While whole genomes have been published for P. knowlesi, P. cynomolgi, P. simium, and P. inui, no complete genome for P. brasilianum has been available. Previously, we reported a draft genome for P. brasilianum, and here we report the completed genome for P. brasilianum. The genome is 31.4 Mb in size and comprises 14 chromosomes, the mitochondrial genome, the apicoplast genome, and 29 unplaced contigs. The chromosomes consist of 98.4% nucleotide sites that are identical to the P. malariae genome, the closest evolutionarily related species hypothesized to be the same species as P. brasilianum, with 41,125 non-synonymous SNPs (0.0722% of genome) identified between the two genomes. Furthermore, P. brasilianum had 4864 (82.1%) genes that share 80% or higher sequence similarity with 4970 (75.5%) P. malariae genes. This was demonstrated by the nearly identical genomic organization and multiple sequence alignments for the merozoite surface proteins msp3 and msp7. We observed a distinction in the repeat lengths of the circumsporozoite protein (CSP) gene sequences between P. brasilianum and P. malariae. Our results demonstrate a 97.3% pairwise identity between the P. brasilianum and the P. malariae genomes. These findings highlight the phylogenetic proximity of these two species, suggesting that P. malariae and P. brasilianum are strains of the same species, but this could not be fully evaluated with only a single genomic sequence for each species.

Collapse

Affiliation(s)

Marko Bajic grid.422961.a0000 0001 0029 6188Association of Public Health Laboratories, Silver Spring, MD USA ,2grid.416738.f0000 0001 2163 0069Malaria Branch, Division of Parasitic Diseases and Malaria, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, GA USA
Shashidhar Ravishankar grid.270240.30000 0001 2180 1622Fred Hutchinson Cancer Research Center, Seattle, WA USA
Mili Sheth grid.416738.f0000 0001 2163 0069Biotechnology Core Facility Branch, Division of Scientific Resources, Centers for Disease Control and Prevention, Atlanta, GA USA
Lori A. Rowe grid.416738.f0000 0001 2163 0069Biotechnology Core Facility Branch, Division of Scientific Resources, Centers for Disease Control and Prevention, Atlanta, GA USA ,5grid.265219.b0000 0001 2217 8588Virus Characterization Isolation Production and Sequencing Core, Tulane National Primate Research Center, Covington, LA USA
M. Andreina Pacheco grid.264727.20000 0001 2248 3398Biology Department/Institute of Genomics and Evolutionary Medicine (iGEM), Temple University, Philadelphia, PA USA
Dhruviben S. Patel grid.416738.f0000 0001 2163 0069Malaria Branch, Division of Parasitic Diseases and Malaria, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, GA USA
Dhwani Batra grid.416738.f0000 0001 2163 0069Biotechnology Core Facility Branch, Division of Scientific Resources, Centers for Disease Control and Prevention, Atlanta, GA USA
Vladimir Loparev grid.416738.f0000 0001 2163 0069Biotechnology Core Facility Branch, Division of Scientific Resources, Centers for Disease Control and Prevention, Atlanta, GA USA
Christian Olsen grid.416738.f0000 0001 2163 0069Malaria Branch, Division of Parasitic Diseases and Malaria, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, GA USA
Ananias A. Escalante grid.264727.20000 0001 2248 3398Biology Department/Institute of Genomics and Evolutionary Medicine (iGEM), Temple University, Philadelphia, PA USA
Fredrik Vannberg grid.213917.f0000 0001 2097 4943Center for Integrative Genomics at Georgia Tech, Georgia Institute of Technology, Atlanta, GA USA
Venkatachalam Udhayakumar grid.416738.f0000 0001 2163 0069Malaria Branch, Division of Parasitic Diseases and Malaria, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, GA USA
John W. Barnwell grid.416738.f0000 0001 2163 0069Malaria Branch, Division of Parasitic Diseases and Malaria, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, GA USA
Eldin Talundzic grid.416738.f0000 0001 2163 0069Malaria Branch, Division of Parasitic Diseases and Malaria, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, GA USA

Collapse

Rayamajhi N, Cheng CHC, Catchen JM. Evaluating Illumina-, Nanopore-, and PacBio-based genome assembly strategies with the bald notothen, Trematomus borchgrevinki. G3 (BETHESDA, MD.) 2022;12:jkac192. [PMID: 35904764 PMCID: PMC9635638 DOI: 10.1093/g3journal/jkac192] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 07/18/2022] [Indexed: 11/16/2022]

Walve R, Salmela L. HGGA: hierarchical guided genome assembler. BMC Bioinformatics 2022;23:167. [PMID: 35525918 PMCID: PMC9077837 DOI: 10.1186/s12859-022-04701-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 04/25/2022] [Indexed: 11/10/2022] Open

Ferreira RCU, da Costa Lima Moraes A, Chiari L, Simeão RM, Vigna BBZ, de Souza AP. An Overview of the Genetics and Genomics of the Urochloa Species Most Commonly Used in Pastures. FRONTIERS IN PLANT SCIENCE 2021;12:770461. [PMID: 34966402 PMCID: PMC8710810 DOI: 10.3389/fpls.2021.770461] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 11/17/2021] [Indexed: 06/14/2023]

Mukherjee K, Dole-Muinos D, Ajayi A, Rossi M, Prosperi M, Boucher C. Finding Overlapping Rmaps via Clustering. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;PP:1-1. [PMID: 34890332 DOI: 10.1109/tcbb.2021.3132534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Raeisi Dehkordi S, Luebeck J, Bafna V. FaNDOM: Fast nested distance-based seeding of optical maps. PATTERNS (NEW YORK, N.Y.) 2021;2:100248. [PMID: 34027500 PMCID: PMC8134938 DOI: 10.1016/j.patter.2021.100248] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 03/08/2021] [Accepted: 04/01/2021] [Indexed: 12/25/2022]