1
|
Koehler Leman J, Lyskov S, Lewis SM, Adolf-Bryfogle J, Alford RF, Barlow K, Ben-Aharon Z, Farrell D, Fell J, Hansen WA, Harmalkar A, Jeliazkov J, Kuenze G, Krys JD, Ljubetič A, Loshbaugh AL, Maguire J, Moretti R, Mulligan VK, Nance ML, Nguyen PT, Ó Conchúir S, Roy Burman SS, Samanta R, Smith ST, Teets F, Tiemann JKS, Watkins A, Woods H, Yachnin BJ, Bahl CD, Bailey-Kellogg C, Baker D, Das R, DiMaio F, Khare SD, Kortemme T, Labonte JW, Lindorff-Larsen K, Meiler J, Schief W, Schueler-Furman O, Siegel JB, Stein A, Yarov-Yarovoy V, Kuhlman B, Leaver-Fay A, Gront D, Gray JJ, Bonneau R. Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks. Nat Commun 2021; 12:6947. [PMID: 34845212 PMCID: PMC8630030 DOI: 10.1038/s41467-021-27222-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 11/02/2021] [Indexed: 01/14/2023] Open
Abstract
Each year vast international resources are wasted on irreproducible research. The scientific community has been slow to adopt standard software engineering practices, despite the increases in high-dimensional data, complexities of workflows, and computational environments. Here we show how scientific software applications can be created in a reproducible manner when simple design goals for reproducibility are met. We describe the implementation of a test server framework and 40 scientific benchmarks, covering numerous applications in Rosetta bio-macromolecular modeling. High performance computing cluster integration allows these benchmarks to run continuously and automatically. Detailed protocol captures are useful for developers and users of Rosetta and other macromolecular modeling tools. The framework and design concepts presented here are valuable for developers and users of any type of scientific software and for the scientific community to create reproducible methods. Specific examples highlight the utility of this framework, and the comprehensive documentation illustrates the ease of adding new tests in a matter of hours.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA.
- Department of Biology, New York University, New York, NY, 10003, USA.
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Steven M Lewis
- Cyrus Biotechnology, 1201 Second Ave, Suite 900, Seattle, WA, 98101, USA
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
- IAVI Neutralizing Antibody Center, Scripps Research, La Jolla, CA, 92037, USA
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Kyle Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Ziv Ben-Aharon
- Department of Microbiology and Molecular Genetics, Hebrew University, Hadassah Medical School, POB 12272, Jerusalem, 91120, Israel
| | - Daniel Farrell
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Jason Fell
- Genome Center, University of California, Davis, CA, 95616, USA
- Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, 95616, USA
- Department of Chemistry, University of California, Davis, CA, 95616, USA
| | - William A Hansen
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Ameya Harmalkar
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Jeliazko Jeliazkov
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Institute for Drug Discovery, Medical School, Leipzig University, 04103, Leipzig, Germany
| | - Justyna D Krys
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland
| | - Ajasja Ljubetič
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Amanda L Loshbaugh
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
- Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA
| | - Morgan L Nance
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Phuong T Nguyen
- Department of Physiology and Membrane Biology, School of Medicine, University of California, Davis, CA, 95616, USA
| | - Shane Ó Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Rituparna Samanta
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Shannon T Smith
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, 37235, USA
| | - Frank Teets
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Johanna K S Tiemann
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Andrew Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Hope Woods
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, 37235, USA
| | - Brahm J Yachnin
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Christopher D Bahl
- Institute for Protein Innovation, Boston, MA, 02115, USA
- Division of Hematology/Oncology, Boston Children's Hospital, Boston, MA, 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, 02115, USA
| | | | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Sagar D Khare
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
- Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Jason W Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Institute for Drug Discovery, Medical School, Leipzig University, 04103, Leipzig, Germany
| | - William Schief
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
- IAVI Neutralizing Antibody Center, Scripps Research, La Jolla, CA, 92037, USA
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, Hebrew University, Hadassah Medical School, POB 12272, Jerusalem, 91120, Israel
| | - Justin B Siegel
- Genome Center, University of California, Davis, CA, 95616, USA
- Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, 95616, USA
- Department of Chemistry, University of California, Davis, CA, 95616, USA
| | - Amelie Stein
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Vladimir Yarov-Yarovoy
- Department of Physiology and Membrane Biology, School of Medicine, University of California, Davis, CA, 95616, USA
| | - Brian Kuhlman
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Andrew Leaver-Fay
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA.
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA.
- Department of Biology, New York University, New York, NY, 10003, USA.
- Department of Computer Science, New York University, New York, NY, 10003, USA.
| |
Collapse
|
2
|
Barlow KA, Ó Conchúir S, Thompson S, Suresh P, Lucas JE, Heinonen M, Kortemme T. Flex ddG: Rosetta Ensemble-Based Estimation of Changes in Protein-Protein Binding Affinity upon Mutation. J Phys Chem B 2018; 122:5389-5399. [PMID: 29401388 DOI: 10.1021/acs.jpcb.7b11367] [Citation(s) in RCA: 131] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Computationally modeling changes in binding free energies upon mutation (interface ΔΔ G) allows large-scale prediction and perturbation of protein-protein interactions. Additionally, methods that consider and sample relevant conformational plasticity should be able to achieve higher prediction accuracy over methods that do not. To test this hypothesis, we developed a method within the Rosetta macromolecular modeling suite (flex ddG) that samples conformational diversity using "backrub" to generate an ensemble of models and then applies torsion minimization, side chain repacking, and averaging across this ensemble to estimate interface ΔΔ G values. We tested our method on a curated benchmark set of 1240 mutants, and found the method outperformed existing methods that sampled conformational space to a lesser degree. We observed considerable improvements with flex ddG over existing methods on the subset of small side chain to large side chain mutations, as well as for multiple simultaneous non-alanine mutations, stabilizing mutations, and mutations in antibody-antigen interfaces. Finally, we applied a generalized additive model (GAM) approach to the Rosetta energy function; the resulting nonlinear reweighting model improved the agreement with experimentally determined interface ΔΔ G values but also highlighted the necessity of future energy function improvements.
Collapse
Affiliation(s)
- Kyle A Barlow
- Graduate Program in Bioinformatics , University of California San Francisco , San Francisco , California , United States of America
| | - Shane Ó Conchúir
- California Institute for Quantitative Biosciences , University of California San Francisco , San Francisco , California , United States of America.,Department of Bioengineering and Therapeutic Sciences , University of California San Francisco , San Francisco , California , United States of America
| | - Samuel Thompson
- Graduate Program in Biophysics , University of California San Francisco , San Francisco , California , United States of America
| | - Pooja Suresh
- Graduate Program in Biophysics , University of California San Francisco , San Francisco , California , United States of America
| | - James E Lucas
- Graduate Program in Bioengineering , University of California San Francisco , San Francisco , California , United States of America
| | - Markus Heinonen
- Department of Computer Science , Aalto University , Espoo , Finland.,Helsinki Institute for Information Technology (HIIT) , Helsinki , Finland
| | - Tanja Kortemme
- Graduate Program in Bioinformatics , University of California San Francisco , San Francisco , California , United States of America.,California Institute for Quantitative Biosciences , University of California San Francisco , San Francisco , California , United States of America.,Department of Bioengineering and Therapeutic Sciences , University of California San Francisco , San Francisco , California , United States of America.,Graduate Program in Biophysics , University of California San Francisco , San Francisco , California , United States of America.,Graduate Program in Bioengineering , University of California San Francisco , San Francisco , California , United States of America.,Chan Zuckerberg Biohub , San Francisco , California 94158 , United States
| |
Collapse
|
3
|
Mavor D, Barlow K, Thompson S, Barad BA, Bonny AR, Cario CL, Gaskins G, Liu Z, Deming L, Axen SD, Caceres E, Chen W, Cuesta A, Gate RE, Green EM, Hulce KR, Ji W, Kenner LR, Mensa B, Morinishi LS, Moss SM, Mravic M, Muir RK, Niekamp S, Nnadi CI, Palovcak E, Poss EM, Ross TD, Salcedo EC, See SK, Subramaniam M, Wong AW, Li J, Thorn KS, Conchúir SÓ, Roscoe BP, Chow ED, DeRisi JL, Kortemme T, Bolon DN, Fraser JS. Determination of ubiquitin fitness landscapes under different chemical stresses in a classroom setting. eLife 2016; 5. [PMID: 27111525 PMCID: PMC4862753 DOI: 10.7554/elife.15802] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2016] [Accepted: 04/06/2016] [Indexed: 12/31/2022] Open
Abstract
Ubiquitin is essential for eukaryotic life and varies in only 3 amino acid positions between yeast and humans. However, recent deep sequencing studies indicate that ubiquitin is highly tolerant to single mutations. We hypothesized that this tolerance would be reduced by chemically induced physiologic perturbations. To test this hypothesis, a class of first year UCSF graduate students employed deep mutational scanning to determine the fitness landscape of all possible single residue mutations in the presence of five different small molecule perturbations. These perturbations uncover 'shared sensitized positions' localized to areas around the hydrophobic patch and the C-terminus. In addition, we identified perturbation specific effects such as a sensitization of His68 in HU and a tolerance to mutation at Lys63 in DTT. Our data show how chemical stresses can reduce buffering effects in the ubiquitin proteasome system. Finally, this study demonstrates the potential of lab-based interdisciplinary graduate curriculum. DOI:http://dx.doi.org/10.7554/eLife.15802.001 The ability of an organism to grow and reproduce, that is, it’s “fitness”, is determined by how its genes interact with the environment. Yeast is a model organism in which researchers can control the exact mutations present in the yeast’s genes (its genotype) and the conditions in which the yeast cells live (their environment). This allows researchers to measure how a yeast cell’s genotype and environment affect its fitness. Ubiquitin is a protein that many organisms depend on to manage cell stress by acting as a tag that targets other proteins for degradation. Essential proteins such as ubiquitin often remain unchanged by mutation over long periods of time. As a result, these proteins evolve very slowly. Like all proteins, ubiquitin is built from a chain of amino acid molecules linked together, and the ubiquitin proteins of yeast and humans are made of almost identical sequences of amino acids. Although ubiquitin has barely changed its sequence over evolution, previous studies have shown that – under normal growth conditions in the laboratory – most amino acids in ubiquitin can be mutated without any loss of cell fitness. This led Mavor et al. to hypothesize that treating the yeast cells with chemicals that cause cell stress might lead to amino acids in ubiquitin becoming more sensitive to mutation. To test this idea, a class of graduate students at the University of California, San Francisco grew yeast cells with different ubiquitin mutations together, and with different chemicals that induce cell stress, and measured their growth rates. Sequencing the ubiquitin gene in the thousands of tested yeast cells revealed that three of the chemicals cause a shared set of amino acids in ubiquitin to become more sensitive to mutation. This result suggests that these amino acids are important for the stress response, possibly by altering the ability of yeast cells to target certain proteins for degradation. Conversely, another chemical causes yeast to become more tolerant to changes in the ubiquitin sequence. The experiments also link changes in particular amino acids in ubiquitin to specific stress responses. Mavor et al. show that many of ubquitin’s amino acids are sensitive to mutation under different stress conditions, while others can be mutated to form different amino acids without effecting fitness. By testing the effects of other chemicals, future experiments could further characterize how the yeast’s genotype and environment interact. DOI:http://dx.doi.org/10.7554/eLife.15802.002
Collapse
Affiliation(s)
- David Mavor
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Kyle Barlow
- Bioinformatics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Samuel Thompson
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Benjamin A Barad
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Alain R Bonny
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Clinton L Cario
- Bioinformatics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Garrett Gaskins
- Bioinformatics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Zairan Liu
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Laura Deming
- Howard Hughes Medical Institute, University of California, San Francisco, San Francisco, United States
| | - Seth D Axen
- Bioinformatics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Elena Caceres
- Bioinformatics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Weilin Chen
- Bioinformatics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Adolfo Cuesta
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Rachel E Gate
- Bioinformatics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Evan M Green
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Kaitlin R Hulce
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Weiyue Ji
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Lillian R Kenner
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Bruk Mensa
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Leanna S Morinishi
- Bioinformatics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Steven M Moss
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Marco Mravic
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Ryan K Muir
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Stefan Niekamp
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Chimno I Nnadi
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Eugene Palovcak
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Erin M Poss
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Tyler D Ross
- Biophysics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Eugenia C Salcedo
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Stephanie K See
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Meena Subramaniam
- Bioinformatics Graduate Group, University of California, San Francisco, San Francisco, United States
| | - Allison W Wong
- Chemistry and Chemical Biology Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Jennifer Li
- UCSF Science and Health Education Partnership, University of California, San Francisco, San Francisco, United States
| | - Kurt S Thorn
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, United States
| | - Shane Ó Conchúir
- Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biology, University of California, San Francisco, San Francisco, United States
| | - Benjamin P Roscoe
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - Eric D Chow
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, United States.,Center for Advanced Technology, University of California, San Francisco, San Francisco, United States
| | - Joseph L DeRisi
- Howard Hughes Medical Institute, University of California, San Francisco, San Francisco, United States.,Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, United States
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biology, University of California, San Francisco, San Francisco, United States
| | - Daniel N Bolon
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biology, University of California, San Francisco, San Francisco, United States
| |
Collapse
|
4
|
Ó Conchúir S, Barlow KA, Pache RA, Ollikainen N, Kundert K, O'Meara MJ, Smith CA, Kortemme T. A Web Resource for Standardized Benchmark Datasets, Metrics, and Rosetta Protocols for Macromolecular Modeling and Design. PLoS One 2015; 10:e0130433. [PMID: 26335248 PMCID: PMC4559433 DOI: 10.1371/journal.pone.0130433] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Accepted: 05/20/2015] [Indexed: 11/18/2022] Open
Abstract
The development and validation of computational macromolecular modeling and design methods depend on suitable benchmark datasets and informative metrics for comparing protocols. In addition, if a method is intended to be adopted broadly in diverse biological applications, there needs to be information on appropriate parameters for each protocol, as well as metrics describing the expected accuracy compared to experimental data. In certain disciplines, there exist established benchmarks and public resources where experts in a particular methodology are encouraged to supply their most efficient implementation of each particular benchmark. We aim to provide such a resource for protocols in macromolecular modeling and design. We present a freely accessible web resource (https://kortemmelab.ucsf.edu/benchmarks) to guide the development of protocols for protein modeling and design. The site provides benchmark datasets and metrics to compare the performance of a variety of modeling protocols using different computational sampling methods and energy functions, providing a “best practice” set of parameters for each method. Each benchmark has an associated downloadable benchmark capture archive containing the input files, analysis scripts, and tutorials for running the benchmark. The captures may be run with any suitable modeling method; we supply command lines for running the benchmarks using the Rosetta software suite. We have compiled initial benchmarks for the resource spanning three key areas: prediction of energetic effects of mutations, protein design, and protein structure prediction, each with associated state-of-the-art modeling protocols. With the help of the wider macromolecular modeling community, we hope to expand the variety of benchmarks included on the website and continue to evaluate new iterations of current methods as they become available.
Collapse
Affiliation(s)
- Shane Ó Conchúir
- California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
| | - Kyle A. Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
- * E-mail: (KAB); (TK)
| | - Roland A. Pache
- California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
| | - Noah Ollikainen
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
| | - Kale Kundert
- Graduate Program in Biophysics, University of California San Francisco, San Francisco, California, United States of America
| | - Matthew J. O'Meara
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, California, United States of America
| | - Colin A. Smith
- California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
| | - Tanja Kortemme
- California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
- Graduate Program in Biophysics, University of California San Francisco, San Francisco, California, United States of America
- * E-mail: (KAB); (TK)
| |
Collapse
|
5
|
Lyskov S, Chou FC, Conchúir SÓ, Der BS, Drew K, Kuroda D, Xu J, Weitzner BD, Renfrew PD, Sripakdeevong P, Borgo B, Havranek JJ, Kuhlman B, Kortemme T, Bonneau R, Gray JJ, Das R. Serverification of molecular modeling applications: the Rosetta Online Server that Includes Everyone (ROSIE). PLoS One 2013; 8:e63906. [PMID: 23717507 PMCID: PMC3661552 DOI: 10.1371/journal.pone.0063906] [Citation(s) in RCA: 261] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2013] [Accepted: 04/04/2013] [Indexed: 11/21/2022] Open
Abstract
The Rosetta molecular modeling software package provides experimentally tested and rapidly evolving tools for the 3D structure prediction and high-resolution design of proteins, nucleic acids, and a growing number of non-natural polymers. Despite its free availability to academic users and improving documentation, use of Rosetta has largely remained confined to developers and their immediate collaborators due to the code's difficulty of use, the requirement for large computational resources, and the unavailability of servers for most of the Rosetta applications. Here, we present a unified web framework for Rosetta applications called ROSIE (Rosetta Online Server that Includes Everyone). ROSIE provides (a) a common user interface for Rosetta protocols, (b) a stable application programming interface for developers to add additional protocols, (c) a flexible back-end to allow leveraging of computer cluster resources shared by RosettaCommons member institutions, and (d) centralized administration by the RosettaCommons to ensure continuous maintenance. This paper describes the ROSIE server infrastructure, a step-by-step 'serverification' protocol for use by Rosetta developers, and the deployment of the first nine ROSIE applications by six separate developer teams: Docking, RNA de novo, ERRASER, Antibody, Sequence Tolerance, Supercharge, Beta peptide design, NCBB design, and VIP redesign. As illustrated by the number and diversity of these applications, ROSIE offers a general and speedy paradigm for serverification of Rosetta applications that incurs negligible cost to developers and lowers barriers to Rosetta use for the broader biological community. ROSIE is available at http://rosie.rosettacommons.org.
Collapse
Affiliation(s)
- Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Fang-Chieh Chou
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California, United States of America
| | - Shane Ó. Conchúir
- California Institute for Quantitative Biomedical Research, University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
| | - Bryan S. Der
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Kevin Drew
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
| | - Daisuke Kuroda
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Jianqing Xu
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Brian D. Weitzner
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - P. Douglas Renfrew
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
| | - Parin Sripakdeevong
- Biophysics Program, Stanford University, Stanford, California, United States of America
| | - Benjamin Borgo
- Department of Genetics, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - James J. Havranek
- Department of Genetics, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Tanja Kortemme
- California Institute for Quantitative Biomedical Research, University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, California, United States of America
| | - Richard Bonneau
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
- Computer Science Department, Courant Institute of Mathematical Sciences, New York University, New York, New York, United States of America
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California, United States of America
- Department of Physics, Stanford University, Stanford, California, United States of America
| |
Collapse
|