Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sun C, Medvedev P. Toward fast and accurate SNP genotyping from whole genome sequencing data for bedside diagnostics. Bioinformatics 2019;35:415-420. [PMID: 30032192 DOI: 10.1093/bioinformatics/bty641] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2017] [Accepted: 07/18/2018] [Indexed: 12/13/2022] Open

For:	Sun C, Medvedev P. Toward fast and accurate SNP genotyping from whole genome sequencing data for bedside diagnostics. Bioinformatics 2019;35:415-420. [PMID: 30032192 DOI: 10.1093/bioinformatics/bty641] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2017] [Accepted: 07/18/2018] [Indexed: 12/13/2022] Open

Number

Cited by Other Article(s)

Moeckel C, Mareboina M, Konnaris MA, Chan CS, Mouratidis I, Montgomery A, Chantzi N, Pavlopoulos GA, Georgakopoulos-Soares I. A survey of k-mer methods and applications in bioinformatics. Comput Struct Biotechnol J 2024;23:2289-2303. [PMID: 38840832 PMCID: PMC11152613 DOI: 10.1016/j.csbj.2024.05.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 05/14/2024] [Accepted: 05/15/2024] [Indexed: 06/07/2024] Open

Rossignolo E, Comin M. Enhanced Compression of k-Mer Sets with Counters via de Bruijn Graphs. J Comput Biol 2024;31:524-538. [PMID: 38820168 DOI: 10.1089/cmb.2024.0530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2024] Open

Schmidt S, Khan S, Alanko JN, Pibiri GE, Tomescu AI. Matchtigs: minimum plain text representation of k-mer sets. Genome Biol 2023;24:136. [PMID: 37296461 PMCID: PMC10251615 DOI: 10.1186/s13059-023-02968-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Accepted: 05/10/2023] [Indexed: 06/12/2023] Open

Grytten I, Dagestad Rand K, Sandve GK. KAGE: fast alignment-free graph-based genotyping of SNPs and short indels. Genome Biol 2022;23:209. [PMID: 36195962 PMCID: PMC9531401 DOI: 10.1186/s13059-022-02771-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 09/09/2022] [Indexed: 11/10/2022] Open

Santoro D, Pellegrina L, Comin M, Vandin F. SPRISS: approximating frequent k-mers by sampling reads, and applications. Bioinformatics 2022;38:3343-3350. [PMID: 35583271 PMCID: PMC9237683 DOI: 10.1093/bioinformatics/btac180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 02/25/2022] [Accepted: 05/16/2022] [Indexed: 11/29/2022] Open

Ebler J, Ebert P, Clarke WE, Rausch T, Audano PA, Houwaart T, Mao Y, Korbel JO, Eichler EE, Zody MC, Dilthey AT, Marschall T. Pangenome-based genome inference allows efficient and accurate genotyping across a wide spectrum of variant classes. Nat Genet 2022;54:518-525. [PMID: 35410384 PMCID: PMC9005351 DOI: 10.1038/s41588-022-01043-w] [Citation(s) in RCA: 121] [Impact Index Per Article: 40.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 03/03/2022] [Indexed: 12/30/2022]

Blanca A, Harris RS, Koslicki D, Medvedev P. The Statistics of k-mers from a Sequence Undergoing a Simple Mutation Process Without Spurious Matches. J Comput Biol 2022;29:155-168. [PMID: 35108101 PMCID: PMC11978275 DOI: 10.1089/cmb.2021.0431] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

Bernardini G, Denti L, Previtali M. Alignment-Free Genotyping of Known Variations with MALVA. Methods Mol Biol 2022;2493:247-256. [PMID: 35751819 DOI: 10.1007/978-1-0716-2293-3_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Jha RM, Zusman BE, Puccio AM, Okonkwo DO, Pease M, Desai SM, Leach M, Conley YP, Kochanek PM. Genetic Variants Associated With Intraparenchymal Hemorrhage Progression After Traumatic Brain Injury. JAMA Netw Open 2021;4:e2116839. [PMID: 34309670 PMCID: PMC8314141 DOI: 10.1001/jamanetworkopen.2021.16839] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Abstract

IMPORTANCE

Intracerebral hemorrhage progression is associated with unfavorable outcome after traumatic brain injury (TBI). No effective treatments are currently available. This secondary injury process reflects an extreme form of vasogenic edema and blood-brain barrier breakdown. The sulfonylurea receptor 1-transient receptor potential melastatin 4 (SUR1-TRPM4) cation channel is a key underlying mechanism. A phase 2 trial of SUR1-TRPM4 inhibition in contusional TBI is ongoing, and a phase 3 trial is being designed. Targeted identification of patients at increased risk for hemorrhage progression may inform prognostication, trial design (including patient selection), and ultimately treatment response.

OBJECTIVE

To determine whether ABCC8 (SUR1) and TRPM4 genetic variability are associated with intraparenchymal hemorrhage (IPH) progression after severe TBI, based on the putative involvement of the SUR1-TRPM4 channel in this pathophysiology.

DESIGN, SETTING, AND PARTICIPANTS

In this genetic association study, DNA was extracted from 416 patients with severe TBI prospectively enrolled from a level I trauma academic medical center from May 9, 2002, to August 8, 2014. Forty ABCC8 and TRPM4 single-nucleotide variants (SNVs) were genotyped (multiplex, unbiased). Data were analyzed from January 7, 2020, to May 3, 2021.

MAIN OUTCOMES AND MEASURES

Primary analyses addressed IPH progression at 6, 24, and 120 hours in patients without acute craniectomy (n = 321). Multivariable regressions and receiver operating characteristic curves assessed SNV and haplotype associations with progression. Spatial modeling and functional predictions were determined using standard software.

RESULTS

Of the 321 patients included in the analysis (mean [SD] age, 37.0 [16.3] years; 247 [76.9%] male), IPH progression occurred in 102. Four ABCC8 SNVs were associated with markedly increased odds of progression (rs2237982 [odds ratio (OR), 2.60-3.80; 95% CI, 1.14-5.90 to 1.80-8.02; P = .02 to P < .001], rs2283261 [OR, 3.37-4.77; 95% CI, 1.07-10.77 to 1.89-12.07; P = .04 to P = .001], rs3819521 [OR, 2.96-3.92; 95% CI, 1.13-7.75 to 1.42-10.87; P = .03 to P = .009], and rs8192695 [OR, 3.06-4.95; 95% CI, 1.02-9.12 to 1.67-14.68]; P = .03-.004). These are brain-specific expression quantitative trait loci (eQTL) associated with increased ABCC8 messenger RNA levels. Regulatory annotations revealed promoter and enhancer marks and strong and/or active brain-tissue transcription, directionally consistent with increased progression. Three SNVs (rs2283261, rs2237982, and rs3819521) in this cohort have been associated with intracranial hypertension. Four TRPM4 SNVs were associated with decreased IPH progression (rs3760666 [OR, 0.40-0.49; 95% CI, 0.19-0.86 to 0.27-0.89; P = .02 to P = .009], rs1477363 [OR, 0.40-0.43; 95% CI, 0.18-0.88 to 0.23-0.81; P = .02 to P = .006], rs10410857 [OR, 0.36-0.41; 95% CI, 0.20-0.67 to 0.20-0.85; P = .02 to P = .001], and rs909010 [OR, 0.27-0.40; 95% CI, 0.12-0.62 to 0.16-0.58; P = .002 to P < .001]). Significant SNVs in both genes cluster downstream, flanking exons encoding the receptor site and SUR1-TRPM4 binding interface. Adding genetic variation to clinical models improved receiver operating characteristic curve performance from 0.6959 to 0.8030 (P = .003).

CONCLUSIONS AND RELEVANCE

In this genetic association study, 8 ABCC8 and TRPM4 SNVs were associated with IPH progression. Spatial clustering, brain-specific eQTL, and regulatory annotations suggest biological plausibility. These findings may have important implications for neurocritical care risk stratification, patient selection, and precision medicine, including an upcoming phase 3 trial design for SUR1-TRPM4 inhibition in severe TBI.

Collapse

Rahman A, Chikhi R, Medvedev P. Disk compression of k-mer sets. Algorithms Mol Biol 2021;16:10. [PMID: 34154632 PMCID: PMC8218509 DOI: 10.1186/s13015-021-00192-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Accepted: 06/08/2021] [Indexed: 12/23/2022] Open

Khorsand P, Hormozdiari F. Nebula: ultra-efficient mapping-free structural variant genotyper. Nucleic Acids Res 2021;49:e47. [PMID: 33503255 PMCID: PMC8096284 DOI: 10.1093/nar/gkab025] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Revised: 01/03/2021] [Accepted: 01/11/2021] [Indexed: 11/24/2022] Open

Khorsand P, Denti L, Human Genome Structural Variant Consortium, Bonizzoni P, Chikhi R, Hormozdiari F. Comparative genome analysis using sample-specific string detection in accurate long reads. BIOINFORMATICS ADVANCES 2021;1:vbab005. [PMID: 36700094 PMCID: PMC9710709 DOI: 10.1093/bioadv/vbab005] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Břinda K, Baym M, Kucherov G. Simplitigs as an efficient and scalable representation of de Bruijn graphs. Genome Biol 2021;22:96. [PMID: 33823902 PMCID: PMC8025321 DOI: 10.1186/s13059-021-02297-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 02/10/2021] [Indexed: 12/30/2022] Open

Richmond PA, Kaye AM, Kounkou GJ, Av-Shalom TV, Wasserman WW. Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper. PLoS Comput Biol 2021;17:e1008815. [PMID: 33750951 PMCID: PMC8016220 DOI: 10.1371/journal.pcbi.1008815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 04/01/2021] [Accepted: 02/17/2021] [Indexed: 11/26/2022] Open

Abstract

Across the life sciences, processing next generation sequencing data commonly relies upon a computationally expensive process where reads are mapped onto a reference sequence. Prior to such processing, however, there is a vast amount of information that can be ascertained from the reads, potentially obviating the need for processing, or allowing optimized mapping approaches to be deployed. Here, we present a method termed FlexTyper which facilitates a “reverse mapping” approach in which high throughput sequence queries, in the form of k-mer searches, are run against indexed short-read datasets in order to extract useful information. This reverse mapping approach enables the rapid counting of target sequences of interest. We demonstrate FlexTyper’s utility for recovering depth of coverage, and accurate genotyping of SNP sites across the human genome. We show that genotyping unmapped reads can correctly inform a sample’s population, sex, and relatedness in a family setting. Detection of pathogen sequences within RNA-seq data was sensitive and accurate, performing comparably to existing methods, but with increased flexibility. We present two examples of ways in which this flexibility allows the analysis of genome features not well-represented in a linear reference. First, we analyze contigs from African genome sequencing studies, showing how they distribute across families from three distinct populations. Second, we show how gene-marking k-mers for the killer immune receptor locus allow allele detection in a region that is challenging for standard read mapping pipelines. The future adoption of the reverse mapping approach represented by FlexTyper will be enabled by more efficient methods for FM-index generation and biology-informed collections of reference queries. In the long-term, selection of population-specific references or weighting of edges in pan-population reference genome graphs will be possible using the FlexTyper approach. FlexTyper is available at https://github.com/wassermanlab/OpenFlexTyper.

In the past 15 years, next generation sequencing technology has revolutionized our capacity to process and analyze DNA sequencing data. From agriculture to medicine, this technology is enabling a deeper understanding of the blueprint of life. Next generation sequencing data is composed of short sequences of DNA, referred to as “reads”, which are often shorter than 200 base pairs making them many orders of magnitude smaller than the entirety of a human genome. Gaining insights from this data has typically leveraged a reference-guided mapping approach, where the reads are aligned to a reference genome and then post-processed to gain actionable information such as presence or absence of genomic sequence, or variation between the reference genome and the sequenced sample. Many experts in the field of genomics have concluded that selecting a single, linear reference genome for mapping reads against is limiting, and several current research endeavors are focused on exploring options for improved analysis methods to unlock the full utility of sequencing data. Among these improvements are the usage of sex-matched genomes, population-specific reference genomes, and emergent graph-based reference pan-genomes. However, advanced methods that use raw DNA sequencing data to inform the choice of reference genome and guide the alignment of reads to enriched reference genomes are needed. Here we develop a method termed FlexTyper, which creates a searchable index of the short read data and enables flexible, user-guided queries to provide valuable insights without the need for reference-guided mapping. We demonstrate the utility of our method by identifying sample ancestry and sex in human whole genome sequencing data, detecting viral pathogen reads in RNA-seq data, African-enriched genome regions absent from the global reference, and killer-cell immune receptor alleles that are complex to discern using standard read mapping. We anticipate early adoption of FlexTyper within analysis pipelines as a pre-mapping component, and further envision the bioinformatics and genomics community will leverage the tool for creative uses of sequence queries from unmapped data.

Collapse

Standage DS, Brown CT, Hormozdiari F. Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants. iScience 2019;18:28-36. [PMID: 31377530 PMCID: PMC6682328 DOI: 10.1016/j.isci.2019.07.032] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 06/24/2019] [Accepted: 07/19/2019] [Indexed: 01/05/2023] Open

Denti L, Previtali M, Bernardini G, Schönhuth A, Bonizzoni P. MALVA: Genotyping by Mapping-free ALlele Detection of Known VAriants. iScience 2019;18:20-27. [PMID: 31352182 PMCID: PMC6664100 DOI: 10.1016/j.isci.2019.07.011] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 06/05/2019] [Accepted: 07/08/2019] [Indexed: 12/30/2022] Open