Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Garg S, Aach J, Li H, Sebenius I, Durbin R, Church G. A haplotype-aware de novo assembly of related individuals using pedigree sequence graph. Bioinformatics 2020;36:2385-2392. [PMID: 31860070 DOI: 10.1093/bioinformatics/btz942] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Revised: 11/23/2019] [Accepted: 12/18/2019] [Indexed: 01/11/2023] Open

For:	Garg S, Aach J, Li H, Sebenius I, Durbin R, Church G. A haplotype-aware de novo assembly of related individuals using pedigree sequence graph. Bioinformatics 2020;36:2385-2392. [PMID: 31860070 DOI: 10.1093/bioinformatics/btz942] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Revised: 11/23/2019] [Accepted: 12/18/2019] [Indexed: 01/11/2023] Open

Number

Cited by Other Article(s)

Wang S, Wang M, Chen L, Pan G, Wang Y, Li SC. SpecHLA enables full-resolution HLA typing from sequencing data. CELL REPORTS METHODS 2023;3:100589. [PMID: 37714157 PMCID: PMC10545945 DOI: 10.1016/j.crmeth.2023.100589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 06/20/2023] [Accepted: 08/21/2023] [Indexed: 09/17/2023]

Kong W, Wang Y, Zhang S, Yu J, Zhang X. Recent Advances in Assembly of Complex Plant Genomes. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023;21:427-439. [PMID: 37100237 PMCID: PMC10787022 DOI: 10.1016/j.gpb.2023.04.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Revised: 03/18/2023] [Accepted: 04/07/2023] [Indexed: 04/28/2023]

Olson ND, Wagner J, Dwarshuis N, Miga KH, Sedlazeck FJ, Salit M, Zook JM. Variant calling and benchmarking in an era of complete human genome sequences. Nat Rev Genet 2023:10.1038/s41576-023-00590-0. [PMID: 37059810 DOI: 10.1038/s41576-023-00590-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/22/2023] [Indexed: 04/16/2023]

Towards routine chromosome-scale haplotype-resolved reconstruction in cancer genomics. Nat Commun 2023;14:1358. [PMID: 36914638 PMCID: PMC10011606 DOI: 10.1038/s41467-023-36689-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Accepted: 02/10/2023] [Indexed: 03/16/2023] Open

Chan AP, Choi Y, Rangan A, Zhang G, Podder A, Berens M, Sharma S, Pirrotte P, Byron S, Duggan D, Schork NJ. Interrogating the Human Diplome: Computational Methods, Emerging Applications, and Challenges. Methods Mol Biol 2023;2590:1-30. [PMID: 36335489 DOI: 10.1007/978-1-0716-2819-5_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]

Jeon H, Bae J, Kim H, Kim MS. VPrimer: A Method of Designing and Updating Primer and Probe With High Variant Coverage for RNA Virus Detection. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023;20:775-784. [PMID: 34951850 DOI: 10.1109/tcbb.2021.3138145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Accessing the Variability of Multicopy Genes in Complex Genomes using Unassembled Next-Generation Sequencing Reads: The Case of Trypanosoma cruzi Multigene Families. mBio 2022;13:e0231922. [PMID: 36264102 PMCID: PMC9765020 DOI: 10.1128/mbio.02319-22] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

Repetitive elements cause assembly fragmentation in complex eukaryotic genomes, limiting the study of their variability. The genome of Trypanosoma cruzi, the parasite that causes Chagas disease, has a high repetitive content, including multigene families. Although many T. cruzi multigene families encode surface proteins that play pivotal roles in host-parasite interactions, their variability is currently underestimated, as their high repetitive content results in collapsed gene variants. To estimate sequence variability and copy number variation of multigene families, we developed a read-based approach that is independent of gene-specific read mapping and de novo assembly. This methodology was used to estimate the copy number and variability of MASP, TcMUC, and Trans-Sialidase (TS), the three largest T. cruzi multigene families, in 36 strains, including members of all six parasite discrete typing units (DTUs). We found that these three families present a specific pattern of variability and copy number among the distinct parasite DTUs. Inter-DTU hybrid strains presented a higher variability of these families, suggesting that maintaining a larger content of their members could be advantageous. In addition, in a chronic murine model and chronic Chagasic human patients, the immune response was focused on TS antigens, suggesting that targeting TS conserved sequences could be a potential avenue to improve diagnosis and vaccine design against Chagas disease. Finally, the proposed approach can be applied to study multicopy genes in any organism, opening new avenues to access sequence variability in complex genomes. IMPORTANCE Sequences that have several copies in a genome, such as multicopy-gene families, mobile elements, and microsatellites, are among the most challenging genomic segments to study. They are frequently underestimated in genome assemblies, hampering the correct assessment of these important players in genome evolution and adaptation. Here, we developed a new methodology to estimate variability and copy numbers of repetitive genomic regions and employed it to characterize the T. cruzi multigene families MASP, TcMUC, and transsialidase (TS), which are important virulence factors in this parasite. We showed that multigene families vary in sequence and content among the parasite's lineages, whereas hybrid strains have a higher sequence variability that could be advantageous to the parasite's survivability. By identifying conserved sequences within multigene families, we showed that the mammalian host immune response toward these multigene families is usually focused on the TS multigene family. These TS conserved and immunogenic peptides can be explored in future works as diagnostic targets or vaccine candidates for Chagas disease. Finally, this methodology can be easily applied to any organism of interest, which will aid in our understanding of complex genomic regions.

Collapse

Zhang T, Zhou J, Gao W, Jia Y, Wei Y, Wang G. Complex genome assembly based on long-read sequencing. Brief Bioinform 2022;23:6657663. [PMID: 35940845 DOI: 10.1093/bib/bbac305] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 06/20/2022] [Accepted: 07/06/2022] [Indexed: 11/12/2022] Open

Fruzangohar M, Timmins WA, Kravchuk O, Taylor J. HaploMaker: An improved algorithm for rapid haplotype assembly of genomic sequences. Gigascience 2022;11:giac038. [PMID: 35579550 PMCID: PMC9112781 DOI: 10.1093/gigascience/giac038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 01/17/2022] [Accepted: 03/24/2022] [Indexed: 11/13/2022] Open

Abstract

BACKGROUND

In diploid organisms, whole-genome haplotype assembly relies on the accurate identification and assignment of heterozygous single-nucleotide polymorphism alleles to the correct homologous chromosomes. This appropriate phasing of these alleles ensures that combinations of single-nucleotide polymorphisms on any chromosome, called haplotypes, can then be used in downstream genetic analysis approaches including determining their potential association with important phenotypic traits. A number of statistical algorithms and complementary computational software tools have been developed for whole-genome haplotype construction from genomic sequence data. However, many algorithms lack the ability to phase long haplotype blocks and simultaneously achieve a competitive accuracy.

RESULTS

In this research we present HaploMaker, a novel reference-based haplotype assembly algorithm capable of accurately and efficiently phasing long haplotypes using paired-end short reads and longer Pacific Biosciences reads from diploid genomic sequences. To achieve this we frame the problem as a directed acyclic graph with edges weighted on read evidence and use efficient path traversal and minimization techniques to optimally phase haplotypes. We compared the HaploMaker algorithm with 3 other common reference-based haplotype assembly tools using public haplotype data of human individuals from the Platinum Genome project. With short-read sequences, the HaploMaker algorithm maintained a competitively low switch error rate across all haplotype lengths and was superior in phasing longer genomic regions. For longer Pacific Biosciences reads, the phasing accuracy of HaploMaker remained competitive for all block lengths and generated substantially longer block lengths than the competing algorithms.

CONCLUSIONS

HaploMaker provides an improved haplotype assembly algorithm for diploid genomic sequences by accurately phasing longer haplotypes. The computationally efficient and portable nature of the Java implementation of the algorithm will ensure that it has maximal impact in reference-sequence-based haplotype assembly applications.

Collapse

Markello C, Huang C, Rodriguez A, Carroll A, Chang PC, Eizenga J, Markello T, Haussler D, Paten B. A complete pedigree-based graph workflow for rare candidate variant analysis. Genome Res 2022;32:893-903. [PMID: 35483961 PMCID: PMC9104704 DOI: 10.1101/gr.276387.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 03/24/2022] [Indexed: 11/24/2022]

Lin JH, Chen LC, Yu SC, Huang YT. LongPhase: an ultra-fast chromosome-scale phasing algorithm for small and large variants. Bioinformatics 2022;38:1816-1822. [PMID: 35104333 DOI: 10.1093/bioinformatics/btac058] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 01/04/2022] [Accepted: 01/26/2022] [Indexed: 02/03/2023] Open

Xie M, Yang L, Jiang C, Wu S, Luo C, Yang X, He L, Chen S, Deng T, Ye M, Yan J, Yang N. gcaPDA: a haplotype-resolved diploid assembler. BMC Bioinformatics 2022;23:68. [PMID: 35164674 PMCID: PMC8842951 DOI: 10.1186/s12859-022-04591-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 01/29/2022] [Indexed: 11/13/2022] Open

Huang X, Tatonetti N, LaRow K, Delgoffee B, Mayer J, Page D, Hebbring SJ. E-Pedigrees: a large-scale automatic family pedigree prediction application. Bioinformatics 2021;37:3966-3968. [PMID: 34086863 PMCID: PMC8570807 DOI: 10.1093/bioinformatics/btab419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 04/30/2021] [Accepted: 06/03/2021] [Indexed: 11/13/2022] Open

Neafsey DE, Taylor AR, MacInnis BL. Advances and opportunities in malaria population genomics. Nat Rev Genet 2021;22:502-517. [PMID: 33833443 PMCID: PMC8028584 DOI: 10.1038/s41576-021-00349-5] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/03/2021] [Indexed: 02/06/2023]

Garg S. Computational methods for chromosome-scale haplotype reconstruction. Genome Biol 2021;22:101. [PMID: 33845884 PMCID: PMC8040228 DOI: 10.1186/s13059-021-02328-9] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 03/25/2021] [Indexed: 12/13/2022] Open

Cao C, Greenberg M, Long Q. WgLink: reconstructing whole-genome viral haplotypes using L0+L1-regularization. Bioinformatics 2021;37:2744-2746. [PMID: 33532820 DOI: 10.1093/bioinformatics/btab076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 12/23/2020] [Accepted: 01/29/2021] [Indexed: 12/24/2022] Open

Holley G, Beyter D, Ingimundardottir H, Møller PL, Kristmundsdottir S, Eggertsson HP, Halldorsson BV. Ratatosk: hybrid error correction of long reads enables accurate variant calling and assembly. Genome Biol 2021;22:28. [PMID: 33419473 PMCID: PMC7792008 DOI: 10.1186/s13059-020-02244-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Accepted: 12/15/2020] [Indexed: 12/20/2022] Open

Garg S, Fungtammasan A, Carroll A, Chou M, Schmitt A, Zhou X, Mac S, Peluso P, Hatas E, Ghurye J, Maguire J, Mahmoud M, Cheng H, Heller D, Zook JM, Moemke T, Marschall T, Sedlazeck FJ, Aach J, Chin CS, Church GM, Li H. Chromosome-scale, haplotype-resolved assembly of human genomes. Nat Biotechnol 2020;39:309-312. [PMID: 33288905 PMCID: PMC7954703 DOI: 10.1038/s41587-020-0711-0] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Revised: 09/09/2020] [Accepted: 09/17/2020] [Indexed: 12/14/2022]

Affiliation(s)

Shilpa Garg Department of Genetics, Harvard Medical School, Boston, MA, USA. .,Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA. .,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
Arkarachai Fungtammasan DNAnexus, Mountain View, CA, USA
Andrew Carroll Google, Mountain View, CA, USA
Mike Chou Department of Genetics, Harvard Medical School, Boston, MA, USA
Anthony Schmitt Arima Genomics, San Diego, CA, USA
Xiang Zhou Arima Genomics, San Diego, CA, USA
Stephen Mac Arima Genomics, San Diego, CA, USA
Paul Peluso Pacific Biosciences, Menlo Park, CA, USA
Emily Hatas Pacific Biosciences, Menlo Park, CA, USA
Jay Ghurye Dovetail Genomics, Scotts Valley, CA, USA
Jared Maguire Dovetail Genomics, Scotts Valley, CA, USA
Medhat Mahmoud Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Haoyu Cheng Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
David Heller Max Planck Institute for Molecular Genetics, Berlin, Germany
Justin M Zook Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
Tobias Moemke Saarland University, Saarbrücken, Germany
Tobias Marschall Saarland University, Saarbrücken, Germany.,Max Planck Institute for Informatics, Saarbrücken, Germany
Fritz J Sedlazeck Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
John Aach Department of Genetics, Harvard Medical School, Boston, MA, USA
Chen-Shan Chin DNAnexus, Mountain View, CA, USA.
George M Church Department of Genetics, Harvard Medical School, Boston, MA, USA.
Heng Li Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA. .,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Collapse

Kolmogorov M, Bickhart DM, Behsaz B, Gurevich A, Rayko M, Shin SB, Kuhn K, Yuan J, Polevikov E, Smith TPL, Pevzner PA. metaFlye: scalable long-read metagenome assembly using repeat graphs. Nat Methods 2020;17:1103-1110. [PMID: 33020656 PMCID: PMC10699202 DOI: 10.1038/s41592-020-00971-x] [Citation(s) in RCA: 292] [Impact Index Per Article: 73.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 08/22/2020] [Accepted: 09/07/2020] [Indexed: 02/06/2023]