1
|
Freestone J, Käll L, Noble WS, Keich U. How to Train a Postprocessor for Tandem Mass Spectrometry Proteomics Database Search While Maintaining Control of the False Discovery Rate. J Proteome Res 2025; 24:2266-2279. [PMID: 40163043 DOI: 10.1021/acs.jproteome.4c00742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Decoy-based methods are a popular choice for the statistical validation of peptide detection in tandem mass spectrometry and proteomics data. Such methods can achieve a substantial boost in statistical power when coupled with postprocessors such as Percolator that use auxiliary features to learn a better-discriminating scoring function. However, we recently showed that Percolator can struggle to control the false discovery rate (FDR) when reporting the list of discovered peptides. To address this problem, we introduce Percolator-RESET, which is an adaptation of our recently developed RESET meta-procedure to the peptide detection problem. Specifically, Percolator-RESET fuses Percolator's iterative SVM training procedure with RESET's general framework to provide valid false discovery rate control. Percolator-RESET operates in both a standard single-decoy mode and a two-decoy mode, with the latter requiring the generation of two decoys per target. We demonstrate that Percolator-RESET controls the FDR in both modes, both theoretically and empirically, while typically reporting only a marginally smaller number of discoveries than Percolator in the single-decoy mode. The two-decoy mode is marginally more powerful than both Percolator and the single-decoy mode and exhibits less variability than the latter.
Collapse
Affiliation(s)
- Jack Freestone
- School of Mathematics and Statistics F07, University of Sydney, New South Wales 2006, Australia
| | - Lukas Käll
- Science for Life Laboratory, KTH Royal Institute of Technology, Stockholm SE-100 44, Sweden
| | - William Stafford Noble
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, United States
| | - Uri Keich
- School of Mathematics and Statistics F07, University of Sydney, New South Wales 2006, Australia
| |
Collapse
|
2
|
Perez-Riverol Y, Bittremieux W, Noble WS, Martens L, Bilbao A, Lazear MR, Grüning B, Katz DS, MacCoss MJ, Dai C, Eng JK, Bouwmeester R, Shortreed MR, Audain E, Sachsenberg T, Van Goey J, Wallmann G, Wen B, Käll L, Fondrie WE. Open-Source and FAIR Research Software for Proteomics. J Proteome Res 2025; 24:2222-2234. [PMID: 40267229 PMCID: PMC12053954 DOI: 10.1021/acs.jproteome.4c01079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2024] [Revised: 03/14/2025] [Accepted: 04/11/2025] [Indexed: 04/25/2025]
Abstract
Scientific discovery relies on innovative software as much as experimental methods, especially in proteomics, where computational tools are essential for mass spectrometer setup, data analysis, and interpretation. Since the introduction of SEQUEST, proteomics software has grown into a complex ecosystem of algorithms, predictive models, and workflows, but the field faces challenges, including the increasing complexity of mass spectrometry data, limited reproducibility due to proprietary software, and difficulties integrating with other omics disciplines. Closed-source, platform-specific tools exacerbate these issues by restricting innovation, creating inefficiencies, and imposing hidden costs on the community. Open-source software (OSS), aligned with the FAIR Principles (Findable, Accessible, Interoperable, Reusable), offers a solution by promoting transparency, reproducibility, and community-driven development, which fosters collaboration and continuous improvement. In this manuscript, we explore the role of OSS in computational proteomics, its alignment with FAIR principles, and its potential to address challenges related to licensing, distribution, and standardization. Drawing on lessons from other omics fields, we present a vision for a future where OSS and FAIR principles underpin a transparent, accessible, and innovative proteomics community.
Collapse
Affiliation(s)
- Yasset Perez-Riverol
- European
Molecular Biology Laboratory, European Bioinformatics
Institute, Wellcome Genome
Campus, Cambridge CB10
1SD, U.K.
| | - Wout Bittremieux
- Department
of Computer Science, University of Antwerp, 2020 Antwerpen, Belgium
| | - William S. Noble
- Department
of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
| | - Lennart Martens
- VIB-UGent
Center for Medical Biotechnology, VIB, Ghent 9052, Belgium
- Department
of Biomolecular Medicine, Ghent University, Ghent 9052, Belgium
| | - Aivett Bilbao
- Environmental
Molecular Sciences Laboratory, Pacific Northwest
National Laboratory, Richland, Washington 99352, United States
- US
Department of Energy Agile BioFoundry, Emeryville, California 94608, United States
| | - Michael R. Lazear
- Belharra
Therapeutics, 3985 Sorrento
Valley Boulevard Suite C, San Diego, California 92121, United States
| | - Bjorn Grüning
- Bioinformatics
Group, Department of Computer Science, Albert-Ludwigs
University Freiburg, Freiburg 79110, Germany
| | - Daniel S. Katz
- National
Center for Supercomputing Applications & Siebel School of Computing
and Data Science & School of Information Sciences, University of Illinois Urbana−Champaign, Urbana, Illinois 61801, United States
| | - Michael J. MacCoss
- Department
of Genome Sciences, University of Washington, 3720 15th St. NE, Seattle, Washington 98195, United States
| | - Chengxin Dai
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Center for Protein Sciences (Beijing), Beijing
Institute of Life Omics, Beijing 102206, China
| | - Jimmy K. Eng
- Proteomics
Resource, University of Washington, Seattle, Washington 98195, United States
| | - Robbin Bouwmeester
- VIB-UGent
Center for Medical Biotechnology, VIB, Ghent 9052, Belgium
- Department
of Biomolecular Medicine, Ghent University, Ghent 9052, Belgium
| | - Michael R. Shortreed
- Department
of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Enrique Audain
- Institute
of Medical Genetics, University Medicine
Oldenburg, Carl von Ossietzky University, Oldenburg 26129, Germany
| | - Timo Sachsenberg
- Department
of Computer Science, Applied Bioinformatics, University of Tübingen, Tübingen 72076, Germany
| | | | - Georg Wallmann
- Proteomics
and Signal Transduction, Max Planck Institute
of Biochemistry, Martinsried 82152, Germany
| | - Bo Wen
- Department
of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
| | - Lukas Käll
- Science
for Life Laboratory, School of Engineering Sciences in Chemistry,
Biotechnology and Health, KTH Royal Institute
of Technology, Stockholm 17165, Sweden
| | | |
Collapse
|
3
|
Deutsch EW, Mendoza L, Moritz RL. Quetzal: Comprehensive Peptide Fragmentation Annotation and Visualization. J Proteome Res 2025; 24:2196-2204. [PMID: 40111914 DOI: 10.1021/acs.jproteome.5c00092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/22/2025]
Abstract
Proteomics data-dependent acquisition data sets collected with high-resolution mass-spectrometry (MS) can achieve very high-quality results, but nearly every analysis yields results that are thresholded at some accepted false discovery rate, meaning that a substantial number of results are incorrect. For study conclusions that rely on a small number of peptide-spectrum matches being correct, it is thus important to examine at least some crucial spectra to ensure that they are not one of the incorrect identifications. We present Quetzal, a peptide fragment ion spectrum annotation tool to assist researchers in annotating and examining such spectra to ensure that they correctly support study conclusions. We describe how Quetzal annotates spectra using the new Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) mzPAF standard for fragment ion peak annotation, including the Python-based code, a web-service end point that provides annotation services, and a web-based application for annotating spectra and producing publication-quality figures. We illustrate its functionality with several annotated spectra of varying complexity. Quetzal provides easily accessible functionality that can assist in the effort to ensure and demonstrate that crucial spectra support study conclusions. Quetzal is publicly available at https://proteomecentral.proteomexchange.org/quetzal/.
Collapse
Affiliation(s)
- Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Luis Mendoza
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Robert L Moritz
- Institute for Systems Biology, Seattle, Washington 98109, United States
| |
Collapse
|
4
|
Riffle M, Zelter A, Jaschob D, Hoopmann MR, Faivre DA, Moritz RL, Davis TN, MacCoss MJ, Isoherranen N. Limelight: An Open, Web-Based Tool for Visualizing, Sharing, and Analyzing Mass Spectrometry Data from DDA Pipelines. J Proteome Res 2025; 24:1895-1906. [PMID: 40036265 PMCID: PMC11977539 DOI: 10.1021/acs.jproteome.4c00968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2024] [Revised: 01/17/2025] [Accepted: 02/14/2025] [Indexed: 03/06/2025]
Abstract
Liquid chromatography-tandem mass spectrometry employing data-dependent acquisition (DDA) is a mature, widely used proteomics technique routinely applied to proteome profiling, protein-protein interaction studies, biomarker discovery, and protein modification analysis. Numerous tools exist for searching DDA data and myriad file formats are output as results. While some search and post processing tools include data visualization features to aid biological interpretation, they are often limited or tied to specific software pipelines. This restricts the accessibility, sharing and interpretation of data, and hinders comparison of results between different software pipelines. We developed Limelight, an easy-to-use, open-source, freely available tool that provides data sharing, analysis and visualization and is not tied to any specific software pipeline. Limelight is a data visualization tool specifically designed to provide access to the whole "data stack", from raw and annotated scan data to peptide-spectrum matches, quality control, peptides, proteins, and modifications. Limelight is designed from the ground up for sharing and collaboration and to support data from any DDA workflow. We provide tools to import data from many widely used open-mass and closed-mass search software workflows. Limelight helps maximize the utility of data by providing an easy-to-use interface for finding and interpreting data, all using the native scores from respective workflows.
Collapse
Affiliation(s)
- Michael Riffle
- Department
of Genome Sciences, Department of Biochemistry, and Department of Pharmaceutics, University of Washington, Seattle, Washington 98195, United States
| | - Alex Zelter
- Department
of Genome Sciences, Department of Biochemistry, and Department of Pharmaceutics, University of Washington, Seattle, Washington 98195, United States
| | - Daniel Jaschob
- Department
of Genome Sciences, Department of Biochemistry, and Department of Pharmaceutics, University of Washington, Seattle, Washington 98195, United States
| | | | - Danielle A. Faivre
- Department
of Genome Sciences, Department of Biochemistry, and Department of Pharmaceutics, University of Washington, Seattle, Washington 98195, United States
| | - Robert L. Moritz
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Trisha N. Davis
- Department
of Genome Sciences, Department of Biochemistry, and Department of Pharmaceutics, University of Washington, Seattle, Washington 98195, United States
| | - Michael J. MacCoss
- Department
of Genome Sciences, Department of Biochemistry, and Department of Pharmaceutics, University of Washington, Seattle, Washington 98195, United States
| | - Nina Isoherranen
- Department
of Genome Sciences, Department of Biochemistry, and Department of Pharmaceutics, University of Washington, Seattle, Washington 98195, United States
| |
Collapse
|
5
|
Madej D, Lam H. Query Mix-Max Method for FDR Estimation Supported by Entrapment Queries. J Proteome Res 2025; 24:1135-1147. [PMID: 39907052 PMCID: PMC11894652 DOI: 10.1021/acs.jproteome.4c00744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2024] [Revised: 01/24/2025] [Accepted: 01/28/2025] [Indexed: 02/06/2025]
Abstract
Estimating the false discovery rate (FDR) is one of the key steps in ensuring appropriate error control in the analysis of shotgun proteomics data. Traditional estimation methods typically rely on decoy sequence databases or spectral libraries, which may not always provide satisfactory results due to limitations of decoy construction methods. This study introduces the query mix-max (QMM) method, a decoy-free alternative for FDR estimation in proteomics. The QMM framework builds upon the existing mix-max procedure but replaces decoy matches with entrapment queries to estimate the number of false positive discoveries. Through simulations and real data set analyses, the QMM method was demonstrated to provide reasonably accurate FDR estimation across various scenarios, particularly when smaller sample-to-entrapment spectra ratios were achieved. The QMM method tends to be conservatively biased, particularly at higher FDR values, which can ensure stringent FDR control. While flexible, the protocol's effectiveness may vary depending on the evolutionary distance between the sample and entrapment organisms. It also requires a sufficient number of entrapment queries to provide stable FDR estimates, especially for low FDR values. Despite these limitations, the QMM method is a promising alternative as one of the first query-based FDR estimation approaches in shotgun proteomics.
Collapse
Affiliation(s)
- Dominik Madej
- Department of Chemical and
Biological Engineering, The Hong Kong University
of Science and Technology, Hong
Kong, China
| | - Henry Lam
- Department of Chemical and
Biological Engineering, The Hong Kong University
of Science and Technology, Hong
Kong, China
| |
Collapse
|
6
|
Chu F, Lin A. Detecting Human Contaminant Genetically Variant Peptides in Nonhuman Samples. J Proteome Res 2025; 24:579-588. [PMID: 39705712 DOI: 10.1021/acs.jproteome.4c00718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2024]
Abstract
During proteomics data analysis, experimental spectra are searched against a user-defined protein database consisting of proteins that are reasonably expected to be present in the sample. Typically, this database contains the proteome of the organism under study concatenated with expected contaminants, such as trypsin and human keratins. However, there are additional contaminants that are not commonly added to the database. In this study, we describe a new set of protein contaminants and provide evidence that they can be detected in mass spectrometry-based proteomics data. Specifically, we provide evidence that human genetically variant peptides (GVPs) can be detected in nonhuman samples. GVPs are peptides that contain single amino acid polymorphisms that result from nonsynonymous single nucleotide polymorphisms in protein-coding regions of DNA. We reanalyzed previously collected nonhuman data-dependent acquisition (DDA) and data-independent acquisition (DIA) data sets and detected between 0 and 135 GVPs per data set. In addition, we show that GVPs are unlikely to originate from nonhuman sources and that a subset of eight GVPs are commonly detected across data sets.
Collapse
Affiliation(s)
- Fanny Chu
- Chemical and Biological Signatures, Pacific Northwest National Laboratory, Seattle, Washington 98109, United States
| | - Andy Lin
- Chemical and Biological Signatures, Pacific Northwest National Laboratory, Seattle, Washington 98109, United States
| |
Collapse
|
7
|
Wen B, Freestone J, Riffle M, MacCoss MJ, Noble WS, Keich U. Assessment of false discovery rate control in tandem mass spectrometry analysis using entrapment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.06.01.596967. [PMID: 38895431 PMCID: PMC11185562 DOI: 10.1101/2024.06.01.596967] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
A pressing statistical challenge in the field of mass spectrometry proteomics is how to assess whether a given software tool provides accurate error control. Each software tool for searching such data uses its own internally implemented methodology for reporting and controlling the error. Many of these software tools are closed source, with incompletely documented methodology, and the strategies for validating the error are inconsistent across tools. In this work, we identify three different methods for validating false discovery rate (FDR) control in use in the field, one of which is invalid, one of which can only provide a lower bound rather than an upper bound, and one of which is valid but under-powered. The result is that the field has a very poor understanding of how well we are doing with respect to FDR control, particularly for the analysis of data-independent acquisition (DIA) data. We therefore propose a theoretical formulation of entrapment experiments that allows us to rigorously characterize the behavior of the various entrapment methods. We also propose a more powerful method for evaluating FDR control, and we employ that method, along with other existing techniques, to characterize a variety of popular search tools. We empirically validate our entrapment analysis in the fairly well-understood DDA setup before applying it in the DIA setup. We find that none of the DIA search tools consistently controls the FDR at the peptide level, and the tools struggle particularly with analysis of single cell datasets.
Collapse
Affiliation(s)
- Bo Wen
- Department of Genome Sciences, University of Washington
| | - Jack Freestone
- School of Mathematics and Statistics, University of Sydney
| | | | | | - William S. Noble
- Department of Genome Sciences, University of Washington
- Paul G. Allen School of Computer Science and Engineering, University of Washington
| | - Uri Keich
- School of Mathematics and Statistics, University of Sydney
| |
Collapse
|
8
|
Hu Y, Schnaubelt M, Chen L, Zhang B, Hoang T, Lih TM, Zhang Z, Zhang H. MS-PyCloud: A Cloud Computing-Based Pipeline for Proteomic and Glycoproteomic Data Analyses. Anal Chem 2024; 96:10145-10151. [PMID: 38869158 PMCID: PMC12038899 DOI: 10.1021/acs.analchem.3c01497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2024]
Abstract
Rapid development and wide adoption of mass spectrometry-based glycoproteomic technologies have empowered scientists to study proteins and protein glycosylation in complex samples on a large scale. This progress has also created unprecedented challenges for individual laboratories to store, manage, and analyze proteomic and glycoproteomic data, both in the cost for proprietary software and high-performance computing and in the long processing time that discourages on-the-fly changes of data processing settings required in explorative and discovery analysis. We developed an open-source, cloud computing-based pipeline, MS-PyCloud, with graphical user interface (GUI), for proteomic and glycoproteomic data analysis. The major components of this pipeline include data file integrity validation, MS/MS database search for spectral assignments to peptide sequences, false discovery rate estimation, protein inference, quantitation of global protein levels, and specific glycan-modified glycopeptides as well as other modification-specific peptides such as phosphorylation, acetylation, and ubiquitination. To ensure the transparency and reproducibility of data analysis, MS-PyCloud includes open-source software tools with comprehensive testing and versioning for spectrum assignments. Leveraging public cloud computing infrastructure via Amazon Web Services (AWS), MS-PyCloud scales seamlessly based on analysis demand to achieve fast and efficient performance. Application of the pipeline to the analysis of large-scale LC-MS/MS data sets demonstrated the effectiveness and high performance of MS-PyCloud. The software can be downloaded at https://github.com/huizhanglab-jhu/ms-pycloud.
Collapse
Affiliation(s)
| | | | - Li Chen
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Bai Zhang
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Trung Hoang
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - T. Mamie Lih
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Zhen Zhang
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Hui Zhang
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| |
Collapse
|
9
|
Bhimani K, Peresadina A, Vozniuk D, Kertész-Farkas A. Exact p-value calculation for XCorr scoring of high-resolution MS/MS data. Proteomics 2024; 24:e2300145. [PMID: 37726251 DOI: 10.1002/pmic.202300145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Revised: 08/25/2023] [Accepted: 08/28/2023] [Indexed: 09/21/2023]
Abstract
Exact p-value (XPV)-based methods for dot product-like score functions-such as the XCorr score implemented in Tide, SEQUEST, Comet or shared peak count-based scoring in MSGF+ and ASPV-provide a fairly good calibration for peptide-spectrum-match (PSM) scoring in database searching-based MS/MS spectrum data identification. Unfortunately, standard XPV methods, in practice, cannot handle high-resolution fragmentation data produced by state-of-the-art mass spectrometers because having smaller bins increases the number of fragment matches that are assigned to incorrect bins and scored improperly. In this article, we present an extension of the XPV method, called the high-resolution exact p-value (HR-XPV) method, which can be used to calibrate PSM scores of high-resolution MS/MS spectra obtained with dot product-like scoring such as the XCorr. The HR-XPV carries remainder masses throughout the fragmentation, allowing them to greatly increase the number of fragments that are properly assigned to the correct bin and, thus, taking advantage of high-resolution data. Using four mass spectrometry data sets, our experimental results demonstrate that HR-XPV produces well-calibrated scores, which in turn results in more trusted spectrum annotations at any false discovery rate level.
Collapse
Affiliation(s)
- Kishankumar Bhimani
- Laboratory on AI for Computational Biology, Faculty of Computer Science, HSE University, Moscow, Russian Federation
| | - Arina Peresadina
- Laboratory on AI for Computational Biology, Faculty of Computer Science, HSE University, Moscow, Russian Federation
| | - Dmitrii Vozniuk
- Laboratory on AI for Computational Biology, Faculty of Computer Science, HSE University, Moscow, Russian Federation
| | - Attila Kertész-Farkas
- Laboratory on AI for Computational Biology, Faculty of Computer Science, HSE University, Moscow, Russian Federation
| |
Collapse
|
10
|
Hoopmann MR, Shteynberg DD, Zelter A, Riffle M, Lyon AS, Agard DA, Luan Q, Nolen BJ, MacCoss MJ, Davis TN, Moritz RL. Improved Analysis of Cross-Linking Mass Spectrometry Data with Kojak 2.0, Advanced by Integration into the Trans-Proteomic Pipeline. J Proteome Res 2023; 22:647-655. [PMID: 36629399 PMCID: PMC10234491 DOI: 10.1021/acs.jproteome.2c00670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Fragmentation ion spectral analysis of chemically cross-linked proteins is an established technology in the proteomics research repertoire for determining protein interactions, spatial orientation, and structure. Here we present Kojak version 2.0, a major update to the original Kojak algorithm, which was developed to identify cross-linked peptides from fragment ion spectra using a database search approach. A substantially improved algorithm with updated scoring metrics, support for cleavable cross-linkers, and identification of cross-links between 15N-labeled homomultimers are among the newest features of Kojak 2.0 presented here. Kojak 2.0 is now integrated into the Trans-Proteomic Pipeline, enabling access to dozens of additional tools within that suite. In particular, the PeptideProphet and iProphet tools for validation of cross-links improve the sensitivity and accuracy of correct cross-link identifications at user-defined thresholds. These new features improve the versatility of the algorithm, enabling its use in a wider range of experimental designs and analysis pipelines. Kojak 2.0 remains open-source and multiplatform.
Collapse
Affiliation(s)
| | | | - Alex Zelter
- Department of Biochemistry, University of Washington, Seattle, WA, USA 98195
| | - Michael Riffle
- Department of Biochemistry, University of Washington, Seattle, WA, USA 98195
| | - Andrew S. Lyon
- Department of Biochemistry and Biophysics, University of California, San Francisco, CA, USA 94143
| | - David A. Agard
- Department of Biochemistry and Biophysics, University of California, San Francisco, CA, USA 94143
| | - Qing Luan
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA 97403
| | - Brad J. Nolen
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA 97403
| | - Michael J. MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA, USA 98195
| | - Trisha N. Davis
- Department of Biochemistry, University of Washington, Seattle, WA, USA 98195
| | | |
Collapse
|