Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hon T, Mars K, Young G, Tsai YC, Karalius JW, Landolin JM, Maurer N, Kudrna D, Hardigan MA, Steiner CC, Knapp SJ, Ware D, Shapiro B, Peluso P, Rank DR. Highly accurate long-read HiFi sequencing data for five complex genomes. Sci Data 2020;7:399. [PMID: 33203859 PMCID: PMC7673114 DOI: 10.1038/s41597-020-00743-4] [Citation(s) in RCA: 111] [Impact Index Per Article: 27.8] [Received: 05/12/2020] [Accepted: 10/27/2020] [Indexed: 02/06/2023] Open

Cited by Other Article(s)

Yu HJ, Byun YH, Park CK. Techniques for assessing telomere length: A methodological review. Comput Struct Biotechnol J 2024;23:1489-1498. [PMID: 38633384 PMCID: PMC11021795 DOI: 10.1016/j.csbj.2024.04.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/23/2023] [Revised: 04/04/2024] [Accepted: 04/05/2024] [Indexed: 04/19/2024] Open

Dobner J, Nguyen T, Pavez-Giani MG, Cyganek L, Distelmaier F, Krutmann J, Prigione A, Rossi A. mtDNA analysis using Mitopore. Mol Ther Methods Clin Dev 2024;32:101231. [PMID: 38572068 PMCID: PMC10988129 DOI: 10.1016/j.omtm.2024.101231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/14/2023] [Accepted: 03/08/2024] [Indexed: 04/05/2024]

LeMaster C, Schwendinger-Schreck C, Ge B, Cheung W, McLennan R, Johnston J, Pastinen T, Smail C. Mapping structural variants to rare disease genes using long-read whole genome sequencing and trait-relevant polygenic scores. medRxiv 2024:2024.03.15.24304216. [PMID: 38562793 PMCID: PMC10984062 DOI: 10.1101/2024.03.15.24304216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/04/2024]

Abstract

Recent studies have revealed the pervasive landscape of rare structural variants (rSVs) present in human genomes. rSVs can have extreme effects on the expression of proximal genes and, in a rare disease context, have been implicated in patient cases where no diagnostic single nucleotide variant (SNV) was found. Approaches for integrating rSVs to date have focused on targeted approaches in known Mendelian rare disease genes. This approach is intractable for rare diseases with many causal loci or patients with complex, multi-phenotype syndromes. We hypothesized that integrating trait-relevant polygenic scores (PGS) would provide a substantial reduction in the number of candidate disease genes in which to assess rSV effects. We further implemented a method for ranking PGS genes to define a set of core/key genes where a rSV has the potential to exert relatively larger effects on disease risk. Among a subset of patients enrolled in the Genomic Answers for Kids (GA4K) rare disease program (N=497), we used PacBio HiFi long-read whole genome sequencing (lrWGS) to identify rSVs intersecting genes in trait-relevant PGSs. Illustrating our approach in Autism (N=54 cases), we identified 22,019 deletions, 2,041 duplications, 87,826 insertions, and 214 inversions overlapping putative core/key PGS genes. Additionally, by integrating genomic constraint annotations from gnomAD, we observed that rare duplications overlapping putative core/key PGS genes were frequently in higher constraint regions compared to controls (P = 1 × 10⁻³). This difference was not observed in the lowest-ranked gene set (P = 0.15). Overall, our study provides a framework for the annotation of long-read rSVs from lrWGS data and prioritization of disease-linked genomic regions for downstream functional validation of rSV impacts. To enable reuse by other researchers, we have made SV allele frequencies and gene associations freely available.

Wang J, Xu Y, Peng Y, Wang Y, Kang Z, Zhao J. A fully haplotype-resolved and nearly gap-free genome assembly of wheat stripe rust fungus. Sci Data 2024;11:508. [PMID: 38755209 PMCID: PMC11099153 DOI: 10.1038/s41597-024-03361-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/18/2023] [Accepted: 05/10/2024] [Indexed: 05/18/2024] Open

Tang T, Liu Y, Zheng B, Li R, Zhang X, Liu Y. Integration of hybrid and self-correction method improves the quality of long-read sequencing data. Brief Funct Genomics 2024;23:249-255. [PMID: 37340778 DOI: 10.1093/bfgp/elad026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/16/2023] [Revised: 06/04/2023] [Accepted: 06/05/2023] [Indexed: 06/22/2023] Open

Wang YC, Mao Y, Fu HM, Wang J, Weng X, Liu ZH, Xu XW, Yan P, Fang F, Guo JS, Shen Y, Chen YP. New insights into functional divergence and adaptive evolution of uncultured bacteria in anammox community by complete genome-centric analysis. Sci Total Environ 2024;924:171530. [PMID: 38453092 DOI: 10.1016/j.scitotenv.2024.171530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/26/2023] [Revised: 11/13/2023] [Accepted: 03/04/2024] [Indexed: 03/09/2024]

Abstract

Anaerobic ammonium-oxidation (anammox) bacteria play a crucial role in global nitrogen cycling and wastewater nitrogen removal, but they share symbiotic relationships with various other microorganisms. Functional divergence and adaptive evolution of uncultured bacteria in anammox community remain underexplored. Although shotgun metagenomics based on short reads has been widely used in anammox research, metagenome-assembled genomes (MAGs) are often discontinuous and highly contaminated, which limits in-depth analyses of anammox communities. Here, for the first time, we performed Pacific Biosciences high-fidelity (HiFi) long-read sequencing on the anammox granule sludge sample from a lab-scale bioreactor, and obtained 30 accurate and complete metagenome-assembled genomes (cMAGs). These cMAGs were obtained by selecting high-quality circular contigs from initial assemblies of long reads generated by HiFi sequencing, eliminating the need for Illumina short reads, binning, and reassembly. One new anammox species affiliated with Candidatus Jettenia and three species affiliated with novel families were found in this anammox community. cMAG-centric analysis revealed functional divergence in general and nitrogen metabolism among the anammox community members, and they might adopt a cross-feeding strategy in organic matter, cofactors, and vitamins. Furthermore, we identified 63 mobile genetic elements (MGEs) and 50 putative horizontal gene transfer (HGT) events within these cMAGs. The results suggest that HGT events and MGEs related to phage and integration or excision, particularly transposons containing tnpA in anammox bacteria, might play important roles in the adaptive evolution of this anammox community. The cMAGs generated in the present study could be used to establish a comprehensive database for anammox bacteria and associated microorganisms. These findings highlight the advantages of HiFi sequencing for the studies of complex mixed cultures and advance the understanding of anammox communities.

Pacheco MA, Cepeda AS, Miller EA, Beckerman S, Oswald M, London E, Mateus-Pinilla NE, Escalante AA. A new long-read mitochondrial-genome protocol (PacBio HiFi) for haemosporidian parasites: a tool for population and biodiversity studies. Malar J 2024;23:134. [PMID: 38704592 PMCID: PMC11069185 DOI: 10.1186/s12936-024-04961-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/26/2024] [Accepted: 04/24/2024] [Indexed: 05/06/2024] Open

Abstract

BACKGROUND

Studies on haemosporidian diversity, including origin of human malaria parasites, malaria's zoonotic dynamic, and regional biodiversity patterns, have used target gene approaches. However, current methods have a trade-off between scalability and data quality. Here, a long-read Next-Generation Sequencing protocol using PacBio HiFi is presented. The data processing is supported by a pipeline that uses machine-learning for analysing the reads.

METHODS

A set of primers was designed to target approximately 6 kb, almost the entire length of the haemosporidian mitochondrial genome. Amplicons from different samples were multiplexed in an SMRTbell® library preparation. A pipeline (HmtG-PacBio Pipeline) to process the reads is also provided; it integrates multiple sequence alignments, a machine-learning algorithm that uses modified variational autoencoders, and a clustering method to identify the mitochondrial haplotypes/species in a sample. Although 192 specimens could be studied simultaneously, a pilot experiment with 15 specimens is presented, including in silico experiments where multiple data combinations were tested.

RESULTS

The primers amplified various haemosporidian parasite genomes and yielded high-quality mt genome sequences. This new protocol allowed the detection and characterization of mixed infections and co-infections in the samples. The machine-learning approach converged into reproducible haplotypes with a low error rate, averaging 0.2% per read (minimum of 0.03% and maximum of 0.46%). The minimum recommended coverage per haplotype is 30X based on the detected error rates. The pipeline facilitates inspecting the data, including a local blast against a file of provided mitochondrial sequences that the researcher can customize.

CONCLUSIONS

This is not a diagnostic approach but a high-throughput method to study haemosporidian sequence assemblages and perform genotyping by targeting the mitochondrial genome. Accordingly, the methodology allowed for examining specimens with multiple infections and co-infections of different haemosporidian parasites. The pipeline enables data quality assessment and comparison of the haplotypes obtained to those from previous studies. Although a single locus approach, whole mitochondrial data provide high-quality information to characterize species pools of haemosporidian parasites.

Schulz T, Medvedev P. ESKEMAP: exact sketch-based read mapping. Algorithms Mol Biol 2024;19:19. [PMID: 38704605 PMCID: PMC11069465 DOI: 10.1186/s13015-024-00261-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/01/2023] [Accepted: 03/19/2024] [Indexed: 05/06/2024] Open

Renoz F, Parisot N, Baa-Puyoulet P, Gerlin L, Fakhour S, Charles H, Hance T, Calevro F. PacBio Hi-Fi genome assembly of Sipha maydis, a model for the study of multipartite mutualism in insects. Sci Data 2024;11:450. [PMID: 38704391 PMCID: PMC11069519 DOI: 10.1038/s41597-024-03297-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/17/2023] [Accepted: 04/23/2024] [Indexed: 05/06/2024] Open

Bose E, Xiong S, Jones AN. Probing RNA structure and dynamics using nanopore and next generation sequencing. J Biol Chem 2024;300:107317. [PMID: 38677514 DOI: 10.1016/j.jbc.2024.107317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/03/2023] [Revised: 04/10/2024] [Accepted: 04/11/2024] [Indexed: 04/29/2024] Open

Yuan CU, Quah FX, Hemberg M. Single-cell and spatial transcriptomics: Bridging current technologies with long-read sequencing. Mol Aspects Med 2024;96:101255. [PMID: 38368637 DOI: 10.1016/j.mam.2024.101255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2023] [Revised: 01/30/2024] [Accepted: 02/07/2024] [Indexed: 02/20/2024]

Xie L, Gong X, Yang K, Huang Y, Zhang S, Shen L, Sun Y, Wu D, Ye C, Zhu QH, Fan L. Technology-enabled great leap in deciphering plant genomes. Nat Plants 2024;10:551-566. [PMID: 38509222 DOI: 10.1038/s41477-024-01655-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/22/2023] [Accepted: 02/20/2024] [Indexed: 03/22/2024]

Ermini L, Driguez P. The Application of Long-Read Sequencing to Cancer. Cancers (Basel) 2024;16:1275. [PMID: 38610953 PMCID: PMC11011098 DOI: 10.3390/cancers16071275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/23/2024] [Revised: 03/20/2024] [Accepted: 03/21/2024] [Indexed: 04/14/2024] Open

Filipović I, Marshall JM, Rašić G. Finding divergent sequences of homomorphic sex chromosomes via diploidized nanopore-based assembly from a single male. bioRxiv 2024:2024.02.29.582759. [PMID: 38464271 PMCID: PMC10925256 DOI: 10.1101/2024.02.29.582759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/12/2024]

Mo C, Wang H, Wei M, Zeng Q, Zhang X, Fei Z, Zhang Y, Kong Q. Complete genome assembly provides a high-quality skeleton for pan-NLRome construction in melon. Plant J 2024. [PMID: 38430487 DOI: 10.1111/tpj.16705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/10/2023] [Revised: 02/16/2024] [Accepted: 02/22/2024] [Indexed: 03/03/2024]

Carpinteyro-Ponce J, Machado CA. The Complex Landscape of Structural Divergence Between the Drosophila pseudoobscura and D. persimilis Genomes. Genome Biol Evol 2024;16:evae047. [PMID: 38482945 PMCID: PMC10980976 DOI: 10.1093/gbe/evae047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 03/07/2024] [Indexed: 04/01/2024] Open

Garg D, Patel N, Rawat A, Rosado AS. Cutting edge tools in the field of soil microbiology. Curr Res Microb Sci 2024;6:100226. [PMID: 38425506 PMCID: PMC10904168 DOI: 10.1016/j.crmicr.2024.100226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/02/2024] Open

Abstract

The study of the whole of the genetic material contained within the microbial populations found in a certain environment is made possible by metagenomics. This technique enables a thorough knowledge of the variety, function, and interactions of microbial communities that are notoriously difficult to research. Due to the limitations of conventional techniques such as culturing and PCR-based methodologies, soil microbiology is a particularly challenging field. Metagenomics has emerged as an effective technique for overcoming these obstacles and shedding light on the dynamic nature of the microbial communities in soil. This review focuses on the principle of metagenomics techniques, their potential applications and limitations in soil microbial diversity analysis. The effectiveness of target-based metagenomics in determining the function of individual genes and microorganisms in soil ecosystems is also highlighted. Targeted metagenomics, including high-throughput sequencing and stable-isotope probing, is essential for studying microbial taxa and genes in complex ecosystems. Shotgun metagenomics may reveal the diversity of soil bacteria, composition, and function impacted by land use and soil management. Sanger, Next Generation Sequencing, Illumina, and Ion Torrent sequencing revolutionise soil microbiome research. Oxford Nanopore Technology (ONT) and Pacific Biosciences (PacBio)'s third and fourth generation sequencing systems revolutionise long-read technology. GeoChip, clone libraries, metagenomics, and metabarcoding help comprehend soil microbial communities. The article indicates that metagenomics may improve environmental management and agriculture despite existing limitations. Metagenomics has revolutionised soil microbiology research by revealing the complete diversity, function, and interactions of microorganisms in soil. Metagenomics is anticipated to continue defining the future of soil microbiology research despite some limitations, such as the difficulty of locating the appropriate sequencing method for specific genes.

Packiaraj J, Thakur J. DNA satellite and chromatin organization at mouse centromeres and pericentromeres. Genome Biol 2024;25:52. [PMID: 38378611 PMCID: PMC10880262 DOI: 10.1186/s13059-024-03184-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/02/2023] [Accepted: 02/12/2024] [Indexed: 02/22/2024] Open

Abstract

BACKGROUND

RESULTS

Using recently available PacBio long-read sequencing data from the C57BL/6 strain, we find that contrary to the previous reports of their homogeneous nature, both centromeric minor satellites and pericentromeric major satellites exhibit a high degree of variation in sequence and organization within and between arrays. While most arrays are continuous, a significant fraction is interspersed with non-satellite sequences, including transposable elements. Using chromatin immunoprecipitation sequencing (ChIP-seq), we find that the occupancy of CENP-A and H3K9me3 chromatin at centromeric and pericentric regions, respectively, is associated with increased sequence enrichment and homogeneity at these regions. The transposable elements at centromeric regions are not part of functional centromeres as they lack significant CENP-A enrichment. Furthermore, both CENP-A and H3K9me3 nucleosomes occupy minor and major satellites spanning centromeric-pericentric junctions and a low yet significant amount of CENP-A spreads locally at centromere junctions on both pericentric and telocentric sides. Finally, while H3K9me3 nucleosomes display a well-phased organization on major satellite arrays, CENP-A nucleosomes on minor satellite arrays are poorly phased. Interestingly, the homogeneous class of major satellites also phase CENP-A and H3K27me3 nucleosomes, indicating that the nucleosome phasing is an inherent property of homogeneous major satellites.

CONCLUSIONS

Our findings reveal that mouse centromeres and pericentromeres display a high diversity in satellite sequence, organization, and chromatin structure.

Vancaester E, Blaxter ML. MarkerScan: Separation and assembly of cobionts sequenced alongside target species in biodiversity genomics projects. Wellcome Open Res 2024;9:33. [PMID: 38617467 PMCID: PMC11016177 DOI: 10.12688/wellcomeopenres.20730.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 12/18/2023] [Indexed: 04/16/2024] Open

Cook R, Brown N, Rihtman B, Michniewski S, Redgwell T, Clokie M, Stekel DJ, Chen Y, Scanlan DJ, Hobman JL, Nelson A, Jones MA, Smith D, Millard A. The long and short of it: benchmarking viromics using Illumina, Nanopore and PacBio sequencing technologies. Microb Genom 2024;10:001198. [PMID: 38376377 PMCID: PMC10926689 DOI: 10.1099/mgen.0.001198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/25/2023] [Accepted: 01/25/2024] [Indexed: 02/21/2024] Open

Abstract

Viral metagenomics has fuelled a rapid change in our understanding of global viral diversity and ecology. Long-read sequencing and hybrid assembly approaches that combine long- and short-read technologies are now being widely implemented in bacterial genomics and metagenomics. However, the use of long-read sequencing to investigate viral communities is still in its infancy. While Nanopore and PacBio technologies have been applied to viral metagenomics, it is not known to what extent different technologies will impact the reconstruction of the viral community. Thus, we constructed a mock bacteriophage community of previously sequenced phage genomes and sequenced them using Illumina, Nanopore and PacBio sequencing technologies and tested a number of different assembly approaches. When using a single sequencing technology, Illumina assemblies were the best at recovering phage genomes. Nanopore- and PacBio-only assemblies performed poorly in comparison to Illumina in both genome recovery and error rates, which both varied with the assembler used. The best Nanopore assembly had errors that manifested as SNPs and INDELs at frequencies 41 and 157 % higher than found in Illumina-only assemblies, respectively. The best PacBio assemblies had SNPs and INDELs at frequencies 12 and 78 % higher than found in Illumina-only assemblies, respectively. Despite high-read coverage, long-read-only assemblies recovered a maximum of one complete genome from any assembly, unless reads were down-sampled prior to assembly. Overall the best approach was assembly by a combination of Illumina and Nanopore reads, which reduced error rates to levels comparable with short-read-only assemblies. When using a single technology, Illumina only was the best approach. The differences in genome recovery and error rates between technology and assembler had downstream impacts on gene prediction, viral prediction, and subsequent estimates of diversity within a sample. These findings will provide a starting point for others in the choice of reads and assembly algorithms for the analysis of viromes.

Affiliation(s)

Ryan Cook: School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington Campus, College Road, Loughborough, Leicestershire, LE12 5RD, UK
Nathan Brown: Centre for Phage Research, Dept Genetics and Genome Biology, University of Leicester, University Road, Leicester, Leicestershire, LE1 7RH, UK
Branko Rihtman: School of Life Sciences, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, UK
Slawomir Michniewski: Warwick Medical School, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, UK
Tamsin Redgwell: COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Ledreborg Alle 34, 2820, Gentofte, Denmark
Martha Clokie: Centre for Phage Research, Dept Genetics and Genome Biology, University of Leicester, University Road, Leicester, Leicestershire, LE1 7RH, UK
Dov J. Stekel: School of Biosciences, University of Nottingham, Sutton Bonington Campus, College Road, Loughborough, Leicestershire, LE12 5RD, UK; Department of Mathematics and Applied Mathematics, University of Johannesburg, Rossmore 2029, South Africa
Yin Chen: School of Life Sciences, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, UK
David J. Scanlan: School of Life Sciences, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, UK
Jon L. Hobman: School of Biosciences, University of Nottingham, Sutton Bonington Campus, College Road, Loughborough, Leicestershire, LE12 5RD, UK
Andrew Nelson: Faculty of Health and Life Sciences, University of Northumbria, Newcastle upon Tyne, NE1 8ST, UK
Michael A. Jones: School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington Campus, College Road, Loughborough, Leicestershire, LE12 5RD, UK
Darren Smith: Faculty of Health and Life Sciences, University of Northumbria, Newcastle upon Tyne, NE1 8ST, UK
Andrew Millard: Centre for Phage Research, Dept Genetics and Genome Biology, University of Leicester, University Road, Leicester, Leicestershire, LE1 7RH, UK

Kim C, Pongpanich M, Porntaveetus T. Unraveling metagenomics through long-read sequencing: a comprehensive review. J Transl Med 2024;22:111. [PMID: 38282030 PMCID: PMC10823668 DOI: 10.1186/s12967-024-04917-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/15/2023] [Accepted: 01/21/2024] [Indexed: 01/30/2024] Open

Tournayre J, Polonais V, Wawrzyniak I, Akossi RF, Parisot N, Lerat E, Delbac F, Souvignet P, Reichstadt M, Peyretaillade E. MicroAnnot: A Dedicated Workflow for Accurate Microsporidian Genome Annotation. Int J Mol Sci 2024;25:880. [PMID: 38255958 PMCID: PMC10815200 DOI: 10.3390/ijms25020880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/27/2023] [Revised: 12/29/2023] [Accepted: 01/04/2024] [Indexed: 01/24/2024] Open

Benoit G, Raguideau S, James R, Phillippy AM, Chikhi R, Quince C. High-quality metagenome assembly from long accurate reads with metaMDBG. Nat Biotechnol 2024:10.1038/s41587-023-01983-6. [PMID: 38168989 DOI: 10.1038/s41587-023-01983-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/17/2023] [Accepted: 09/08/2023] [Indexed: 01/05/2024]

Salava H, Deák T, Czepe C, Maghuly F. Sample and Library Preparation for PacBio Long-Read Sequencing in Grapevine. Methods Mol Biol 2024;2787:183-197. [PMID: 38656490 DOI: 10.1007/978-1-0716-3778-4_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/26/2024]

Feldmeyer B, Bornberg-Bauer E, Dohmen E, Fouks B, Heckenhauer J, Huylmans AK, Jones ARC, Stolle E, Harrison MC. Comparative Evolutionary Genomics in Insects. Methods Mol Biol 2024;2802:473-514. [PMID: 38819569 DOI: 10.1007/978-1-0716-3838-5_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/01/2024]

Song H, Zou S, Huang Y, Jian C, Liu W, Tian L, Gong L, Chen Z, Sun Z, Wang Y. Salmonella Typhimurium with Eight Tandem Copies of bla_NDM-1 on a HI2 Plasmid. Microorganisms 2023;12:20. [PMID: 38257847 PMCID: PMC10819877 DOI: 10.3390/microorganisms12010020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/22/2023] [Revised: 12/09/2023] [Accepted: 12/15/2023] [Indexed: 01/24/2024] Open

Landi M, Shah T, Falquet L, Niazi A, Stavolone L, Bongcam-Rudloff E, Gisel A. Haplotype-resolved genome of heterozygous African cassava cultivar TMEB117 (Manihot esculenta). Sci Data 2023;10:887. [PMID: 38071206 PMCID: PMC10710486 DOI: 10.1038/s41597-023-02800-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 08/16/2023] [Accepted: 11/29/2023] [Indexed: 12/18/2023] Open

Yang Y, Wu Z, Wu Z, Li T, Shen Z, Zhou X, Wu X, Li G, Zhang Y. A near-complete assembly of asparagus bean provides insights into anthocyanin accumulation in pods. Plant Biotechnol J 2023;21:2473-2489. [PMID: 37558431 PMCID: PMC10651155 DOI: 10.1111/pbi.14142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2022] [Revised: 07/11/2023] [Accepted: 07/23/2023] [Indexed: 08/11/2023]

Ferrer A, Stephens ZD, Kocher JPA. Experimental and Computational Approaches to Measure Telomere Length: Recent Advances and Future Directions. Curr Hematol Malig Rep 2023;18:284-291. [PMID: 37947937 PMCID: PMC10709248 DOI: 10.1007/s11899-023-00717-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 10/30/2023] [Indexed: 11/12/2023]

Rodriguez Ruiz A, Van Dam AR. Metagenomic binning of PacBio HiFi data prior to assembly reveals a complete genome of Cosmopolites sordidus (Germar) (Coleopterea: Curculionidae, Dryophthorinae) the most damaging arthropod pest of bananas and plantains. PeerJ 2023;11:e16276. [PMID: 38025758 PMCID: PMC10676084 DOI: 10.7717/peerj.16276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/05/2022] [Accepted: 09/20/2023] [Indexed: 12/01/2023] Open

Zhang Y, Chu J, Cheng H, Li H. De novo reconstruction of satellite repeat units from sequence data. Genome Res 2023;33:gr.278005.123. [PMID: 37918962 PMCID: PMC10760446 DOI: 10.1101/gr.278005.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/19/2023] [Accepted: 10/18/2023] [Indexed: 11/04/2023]

Mao Y, Zeineldin M, Usmani M, Jutla A, Shisler JL, Whitaker RJ, Nguyen TH. Local and Environmental Reservoirs of Salmonella enterica After Hurricane Florence Flooding. GeoHealth 2023;7:e2023GH000877. [PMID: 37928215 PMCID: PMC10624599 DOI: 10.1029/2023gh000877] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2023] [Revised: 08/28/2023] [Accepted: 10/13/2023] [Indexed: 11/07/2023]

Denoyes B, Prohaska A, Petit J, Rothan C. Deciphering the genetic architecture of fruit color in strawberry. J Exp Bot 2023;74:6306-6320. [PMID: 37386925 PMCID: PMC10627153 DOI: 10.1093/jxb/erad245] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 03/31/2023] [Accepted: 06/28/2023] [Indexed: 07/01/2023]

Ding C, Zhang Z. Effective omics tools are still lacking for improvement of stress tolerance in polyploid crops. FRONTIERS IN PLANT SCIENCE 2023;14:1295528. [PMID: 38023865 PMCID: PMC10646182 DOI: 10.3389/fpls.2023.1295528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 10/18/2023] [Indexed: 12/01/2023]

Li J, Cullis C. Comparative Analysis of Tylosema esculentum Mitochondrial DNA Revealed Two Distinct Genome Structures. BIOLOGY 2023;12:1244. [PMID: 37759643 PMCID: PMC10525999 DOI: 10.3390/biology12091244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 09/11/2023] [Accepted: 09/13/2023] [Indexed: 09/29/2023]

Espinosa E, Bautista R, Fernandez I, Larrosa R, Zapata EL, Plata O. Comparing assembly strategies for third-generation sequencing technologies across different genomes. Genomics 2023;115:110700. [PMID: 37598732 DOI: 10.1016/j.ygeno.2023.110700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 08/07/2023] [Accepted: 08/16/2023] [Indexed: 08/22/2023]

Gorman Z, Chen J, de Leon AAP, Wallis CM. Comparison of assembly platforms for the assembly of the nuclear genome of Trichoderma harzianum strain PAR3. BMC Genomics 2023;24:454. [PMID: 37568116 PMCID: PMC10416523 DOI: 10.1186/s12864-023-09544-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 07/28/2023] [Indexed: 08/13/2023] Open

Abstract

BACKGROUND

Trichoderma is a diverse genus of fungi that includes several species that possess biotechnological and agricultural applications, including the biocontrol of pathogenic fungi and nematodes. The mitochondrial genome of a putative strain of Trichoderma harzianum called PAR3 was analyzed after isolation from the roots of Scarlet Royal grapevine scion grafted to Freedom rootstock, located in a grapevine vineyard in Parlier, CA, USA. Here, we report the sequencing, comparative assembly, and annotation of the nuclear genome of PAR3 and confirm its identification as a strain of T. harzianum. We subsequently compared the genes found in T. harzianum PAR3 to other known T. harzianum strains. Assembly of Illumina and/or Oxford Nanopore reads by the popular long-read assemblers, Flye and Canu, and the hybrid assemblers, SPAdes and MaSuRCA, was performed and the quality of the resulting assemblies was compared to ascertain which assembler generated the highest quality draft genome assembly.

RESULTS

MaSuRCA produced the most complete and high-fidelity assembly, yielding a nuclear genome of 40.7 Mb comprising 112 scaffolds. Subsequent annotation of this assembly produced 12,074 gene models and 210 tRNAs. This included 221 genes that did not have equivalent genes in other T. harzianum strains. Phylogenetic analysis of ITS, rpb2, and tef1a sequences from PAR3 and established Trichoderma spp. showed that all three sequences from PAR3 possessed more than 99% identity to those of Trichoderma harzianum, confirming that PAR3 is an isolate of Trichoderma harzianum. We also found that comparison of gene models between T. harzianum PAR3 and other T. harzianum strains resulted in the identification of significant differences in gene type and number, with 221 unique genes identified in the PAR3 strain.

CONCLUSIONS

This study gives insight into the efficacy of several popular assembly platforms for assembly of fungal nuclear genomes, and found that the hybrid assembler, MaSuRCA, was the most effective program for genome assembly. The annotated draft nuclear genome and the identification of genes not found in other T. harzianum strains could be used to investigate the potential applications of T. harzianum PAR3 for biocontrol of grapevine fungal canker pathogens and as a source of anti-microbial compounds.

Zhang C, Johnson NA, Hall N, Tian X, Yu Q, Patterson EL. Subtelomeric 5-enolpyruvylshikimate-3-phosphate synthase copy number variation confers glyphosate resistance in Eleusine indica. Nat Commun 2023;14:4865. [PMID: 37567866 PMCID: PMC10421919 DOI: 10.1038/s41467-023-40407-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 07/25/2023] [Indexed: 08/13/2023] Open

Jin X, Du H, Zhu C, Wan H, Liu F, Ruan J, Mower JP, Zhu A. Haplotype-resolved genomes of wild octoploid progenitors illuminate genomic diversifications from wild relatives to cultivated strawberry. NATURE PLANTS 2023;9:1252-1266. [PMID: 37537397 DOI: 10.1038/s41477-023-01473-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2022] [Accepted: 07/03/2023] [Indexed: 08/05/2023]

Huff M, Hulse-Kemp AM, Scheffler BE, Youngblood RC, Simpson SA, Babiker E, Staton M. Long-read, chromosome-scale assembly of Vitis rotundifolia cv. Carlos and its unique resistance to Xylella fastidiosa subsp. fastidiosa. BMC Genomics 2023;24:409. [PMID: 37474911 PMCID: PMC10357881 DOI: 10.1186/s12864-023-09514-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 07/13/2023] [Indexed: 07/22/2023] Open

Abstract

BACKGROUND

Muscadine grape (Vitis rotundifolia) is resistant to many of the pathogens that negatively impact the production of common grape (V. vinifera), including the bacterial pathogen Xylella fastidiosa subsp. fastidiosa (Xfsf), which causes Pierce's Disease (PD). Previous studies in common grape have indicated Xfsf delays host immune response with a complex O-chain antigen produced by the wzy gene. Muscadine cultivars range from tolerant to completely resistant to Xfsf, but the mechanism is unknown.

RESULTS

We assembled and annotated a new, long-read genome assembly for 'Carlos', a cultivar of muscadine that exhibits tolerance, to build upon the existing genetic resources available for muscadine. We used these resources to construct an initial pan-genome for three cultivars of muscadine and one cultivar of common grape. This pan-genome contains a total of 34,970 synteny-constrained entries containing genes of similar structure. Comparison of resistance gene content between the 'Carlos' and common grape genomes indicates an expansion of resistance (R) genes in 'Carlos.' We further identified genes involved in Xfsf response by transcriptome sequencing 'Carlos' plants inoculated with Xfsf. We observed 234 differentially expressed genes with functions related to lipid catabolism, oxidation-reduction signaling, and abscisic acid (ABA) signaling as well as seven R genes. Leveraging public data from previous experiments of common grape inoculated with Xfsf, we determined that most differentially expressed genes in the muscadine response were not found in common grape, and three of the R genes identified as differentially expressed in muscadine do not have an ortholog in the common grape genome.

CONCLUSIONS

Our results support the utility of a pan-genome approach to identify candidate genes for traits of interest, particularly disease resistance to Xfsf, within and between muscadine and common grape.

Packiaraj J, Thakur J. DNA satellite and chromatin organization at house mouse centromeres and pericentromeres. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.18.549612. [PMID: 37503200 PMCID: PMC10370071 DOI: 10.1101/2023.07.18.549612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]

Abstract

Centromeres are essential for faithful chromosome segregation during mitosis and meiosis. However, the organization of satellite DNA and chromatin at mouse centromeres and pericentromeres is poorly understood due to the challenges of sequencing and assembling repetitive genomic regions. Using recently available PacBio long-read sequencing data from the C57BL/6 strain and chromatin profiling, we found that, contrary to previous reports of their highly homogeneous nature, centromeric and pericentromeric satellites display varied sequences and organization. We found that both centromeric minor satellites and pericentromeric major satellites exhibited sequence variations within and between arrays. While most arrays are continuous, a significant fraction is interspersed with non-satellite sequences, including transposable elements. Additionally, we investigated CENP-A and H3K9me3 chromatin organization at centromeres and pericentromeres using chromatin immunoprecipitation sequencing (ChIP-seq). We found that the occupancy of CENP-A and H3K9me3 chromatin at centromeric and pericentric regions, respectively, is associated with increased sequence abundance and homogeneity at these regions. Furthermore, the transposable elements at centromeric regions are not part of functional centromeres as they lack CENP-A enrichment. Finally, we found that while H3K9me3 nucleosomes display a well-phased organization on major satellite arrays, CENP-A nucleosomes on minor satellite arrays lack phased organization. Interestingly, the homogeneous class of major satellites phase CENP-A and H3K27me3 nucleosomes as well, indicating that nucleosome phasing is an inherent property of homogeneous major satellites. Overall, our findings reveal that house mouse centromeres and pericentromeres, which were previously thought to be highly homogeneous, display significant diversity in satellite sequence, organization, and chromatin structure.

Gogoi A, Rossmann SL, Lysøe E, Stensvand A, Brurberg MB. Genome analysis of Phytophthora cactorum strains associated with crown- and leather-rot in strawberry. Front Microbiol 2023;14:1214924. [PMID: 37465018 PMCID: PMC10351607 DOI: 10.3389/fmicb.2023.1214924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Accepted: 06/12/2023] [Indexed: 07/20/2023] Open

Schmeing S, Robinson MD. Gapless provides combined scaffolding, gap filling, and assembly correction with long reads. Life Sci Alliance 2023;6:e202201471. [PMID: 37142439 PMCID: PMC10166144 DOI: 10.26508/lsa.202201471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 04/17/2023] [Accepted: 04/18/2023] [Indexed: 05/06/2023] Open

Shinde SS, Sharma A, Vijay N. Decoding the fibromelanosis locus complex chromosomal rearrangement of black-bone chicken: genetic differentiation, selective sweeps and protein-coding changes in Kadaknath chicken. Front Genet 2023;14:1180658. [PMID: 37424723 PMCID: PMC10325862 DOI: 10.3389/fgene.2023.1180658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 06/05/2023] [Indexed: 07/11/2023] Open

Pardo-Palacios FJ, Arzalluz-Luque A, Kondratova L, Salguero P, Mestre-Tomás J, Amorín R, Estevan-Morió E, Liu T, Nanni A, McIntyre L, Tseng E, Conesa A. SQANTI3: curation of long-read transcriptomes for accurate identification of known and novel isoforms. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.17.541248. [PMID: 37398077 PMCID: PMC10312485 DOI: 10.1101/2023.05.17.541248] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]

Zheng Y, Shang X. SVcnn: an accurate deep learning-based method for detecting structural variation based on long-read data. BMC Bioinformatics 2023;24:213. [PMID: 37221476 DOI: 10.1186/s12859-023-05324-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 05/06/2023] [Indexed: 05/25/2023] Open

Wong J, Coombe L, Nikolić V, Zhang E, Nip KM, Sidhu P, Warren RL, Birol I. Linear time complexity de novo long read genome assembly with GoldRush. Nat Commun 2023;14:2906. [PMID: 37217507 DOI: 10.1038/s41467-023-38716-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 05/11/2023] [Indexed: 05/24/2023] Open

Kucuk E, van der Sanden BPGH, O'Gorman L, Kwint M, Derks R, Wenger AM, Lambert C, Chakraborty S, Baybayan P, Rowell WJ, Brunner HG, Vissers LELM, Hoischen A, Gilissen C. Comprehensive de novo mutation discovery with HiFi long-read sequencing. Genome Med 2023;15:34. [PMID: 37158973 PMCID: PMC10169305 DOI: 10.1186/s13073-023-01183-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Accepted: 04/19/2023] [Indexed: 05/10/2023] Open

Abstract

BACKGROUND

Long-read sequencing (LRS) techniques have been very successful in identifying structural variants (SVs). However, the high error rate of LRS made the detection of small variants (substitutions and short indels < 20 bp) more challenging. The introduction of PacBio HiFi sequencing makes LRS also suited for detecting small variation. Here we evaluate the ability of HiFi reads to detect de novo mutations (DNMs) of all types, which are technically challenging variant types and a major cause of sporadic, severe, early-onset disease.

METHODS

We sequenced the genomes of eight parent-child trios using high coverage PacBio HiFi LRS (~30-fold coverage) and Illumina short-read sequencing (SRS) (~50-fold coverage). De novo substitutions, small indels, short tandem repeats (STRs) and SVs were called in both datasets and compared to each other to assess the accuracy of HiFi LRS. In addition, we determined the parent-of-origin of the small DNMs using phasing.

RESULTS

We identified a total of 672 and 859 de novo substitutions/indels, 28 and 126 de novo STRs, and 24 and 1 de novo SVs in LRS and SRS, respectively. For the small variants, there was 92% and 85% concordance between the platforms. For the STRs and SVs, the concordance was 3.6% and 0.8%, and 4% and 100%, respectively. We successfully validated 27/54 LRS-unique small variants, of which 11 (41%) were confirmed as true de novo events. For the SRS-unique small variants, we validated 42/133 DNMs and 8 (19%) were confirmed as true de novo events. Validation of 18 LRS-unique de novo STR calls confirmed none of the repeat expansions as true DNMs. Confirmation of the 23 LRS-unique SVs was possible for 19 candidate SVs, of which 10 (52.6%) were true de novo events. Furthermore, we were able to assign 96% of DNMs to their parental allele with LRS data, as opposed to just 20% with SRS data.

CONCLUSIONS

HiFi LRS can now produce the most comprehensive variant dataset obtainable by a single technology in a single laboratory, allowing accurate calling of substitutions, indels, STRs and SVs. The accuracy even allows sensitive calling of DNMs on all variant levels, and also allows for phasing, which helps to distinguish true positive from false positive DNMs.

Affiliation(s)

Erdi Kucuk Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
Bart P G H van der Sanden Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
Luke O'Gorman Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands
Michael Kwint Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands
Ronny Derks Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands
Aaron M Wenger Pacific Biosciences, Menlo Park, CA, USA
Christine Lambert Pacific Biosciences, Menlo Park, CA, USA
Shreyasee Chakraborty Pacific Biosciences, Menlo Park, CA, USA
Primo Baybayan Pacific Biosciences, Menlo Park, CA, USA
William J Rowell Pacific Biosciences, Menlo Park, CA, USA
Han G Brunner Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, The Netherlands GROW School for Oncology and Developmental Biology, Maastricht University Medical Center, Maastricht, The Netherlands
Lisenka E L M Vissers Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
Alexander Hoischen Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands. Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands. Department of Internal Medicine, Radboud University Medical Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, the Netherlands.
Christian Gilissen Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, The Netherlands. Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands.

Luo J, Guan T, Chen G, Yu Z, Zhai H, Yan C, Luo H. SLHSD: hybrid scaffolding method based on short and long reads. Brief Bioinform 2023;24:7152317. [PMID: 37141142 DOI: 10.1093/bib/bbad169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Revised: 01/08/2023] [Accepted: 04/12/2023] [Indexed: 05/05/2023] Open

De La Cerda GY, Landis JB, Eifler E, Hernandez AI, Li F, Zhang J, Tribble CM, Karimi N, Chan P, Givnish T, Strickler SR, Specht CD. Balancing read length and sequencing depth: Optimizing Nanopore long-read sequencing for monocots with an emphasis on the Liliales. APPLICATIONS IN PLANT SCIENCES 2023;11:e11524. [PMID: 37342170 PMCID: PMC10278932 DOI: 10.1002/aps3.11524] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 01/20/2023] [Accepted: 01/30/2023] [Indexed: 06/22/2023]