1
|
Yen A, Zappala Z, Fine RS, Majarian TD, Sripakdeevong P, Altshuler D. Specificity of CRISPR-Cas9 Editing in Exagamglogene Autotemcel. N Engl J Med 2024; 390:1723-1725. [PMID: 38657268 DOI: 10.1056/nejmc2313119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
|
2
|
Boyle SM, Clark MJ, Alla R, Luo S, Church DM, Helman E, Sripakdeevong P, West J, Chen R. Abstract 533: Accurately identifying expressed somatic variants for neoantigen detection and immuno-oncology. Cancer Res 2016. [DOI: 10.1158/1538-7445.am2016-533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Accurate detection of somatic variants is a staple of both research and clinical cancer analysis, with applications ranging from detecting new common driver mutations in large patient cohorts to selecting therapeutic small molecule treatment courses for an individual patient. Recent research into neoantigens and immunotherapy has shown great promise as a precision therapeutic, and somatic variant detection by next-generation sequencing represents an ideal method of identifying candidate neoantigens. Somatic variant detection typically involves assaying the DNA for changes in gene sequences without assessing whether those variants are actually expressed in RNA. However, the expression of small variants is key because only expressed peptides will be displayed as neoantigens on the cell surface.
From a technical standpoint, detection of somatic variants in the RNA represents additional challenges above and beyond those of somatic detection in DNA. The widely varying expression levels of cancer genes, alternative splicing, and RNA editing are all features that make somatic variant calling in RNA uniquely challenging. However, accurately detecting variants directly from expressed transcripts is beneficial to neoantigen prediction, and therefore we sought to create and validate a method for somatic variant calling in RNA.
We have designed a highly accurate expression-based somatic variant detection pipeline utilizing extensive discovery and filtering methods to overcome the challenges inherent in RNA somatic variant calling. We validated our pipeline using a combination of well-characterized cell lines, commercially available reference standards, and real world FFPE patient samples. To our knowledge, this is the most extensive validation of its kind to date, representing over 29,158 small variants across 39 samples. In testing, we measured our detection method at >99% sensitivity and >99% PPV using a combination of gold set small variants and orthogonal validation. This method, in combination with our validated DNA somatic variant calling pipeline (>99% sensitivity and >99% PPV), enables precise detection of variant expression levels in a given sample, even at low allele frequency (5%).
After validating our RNA somatic variant calling method, we applied it to detect candidate neoantigens in patient tumor samples. We performed HLA typing for each sample using HLAssign software and predicted MHC presentation of the expressed somatic variants. In ongoing studies, we are validating our most promising putative neoantigens using orthogonal technologies and demonstrating our ability to detect the most promising clinically effective peptides for therapy.
Citation Format: Sean M. Boyle, Michael J. Clark, Ravi Alla, Shujun Luo, Deanna M. Church, Elena Helman, Parin Sripakdeevong, John West, Rich Chen. Accurately identifying expressed somatic variants for neoantigen detection and immuno-oncology. [abstract]. In: Proceedings of the 107th Annual Meeting of the American Association for Cancer Research; 2016 Apr 16-20; New Orleans, LA. Philadelphia (PA): AACR; Cancer Res 2016;76(14 Suppl):Abstract nr 533.
Collapse
|
3
|
Helman E, Clark MJ, Alla R, Boyle SM, Luo S, Virk S, Church D, Sripakdeevong P, Harris J, karbelashvili M, Haudenschild C, West J, Chen R. Abstract 3169: The benefits and burdens of assaying matched normal tissue when sequencing cancer genomes. Cancer Res 2016. [DOI: 10.1158/1538-7445.am2016-3169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Targeted sequencing assays are increasingly used to identify tumor mutations that guide therapeutic decisions. Interpretation of a cancer variant's origin and therapeutic impact poses analytical challenges. Recent studies have indicated that jointly analyzing a tumor with its matched normal can accurately discriminate between tumor-specific (somatic) and inherited (germline) mutations. Moreover, a NHGRI/NCI Clinical Sequencing Exploratory Research Consortium Tumor Working Group just released a set of guidelines recommending that laboratories performing cancer sequencing tests should include germline variants. However, procurement of a matched sample is often logistically impractical. In the absence of a matched normal, large databases and analytical techniques are currently used to identify cancer variants in tumor sequencing data. Whether the benefits outweigh the additional burden of sequencing the matched normal for accurate detection of cancer-relevant mutations remains an open question.
To compare tumor-only and tumor/normal analysis of cancer samples, we collected a set of >100 formalin-fixed (FFPE) and fresh frozen cancer samples of various tumor types, where matched normal blood or adjacent tissue was available. We performed augmented target enrichment sequencing (exome and large cancer gene panel) of both DNA and RNA. The data was analyzed using cancer bioinformatics pipelines that detect base substitutions, small insertions/deletions, copy number alterations, and gene fusions in both tumor-only and tumor/normal modes. Variants were annotated using described clinical actionability filtering strategies. Analysis of germline variants for secondary findings was performed.
We find that 67% of mutations detected in tumor-only mode are reclassified as germline variants when analyzed together with the matched normal sample. These include mutations in hereditary cancer predisposition genes, such as BRCA1, VHL, and other genes with ACMG guidelines that warrant germline variant classification and appropriate management. Clinically actionable mutations may be miscalled as somatic when a matched normal is not available; however, we find the definition of ‘actionable’ can greatly impact the results of this analysis. Finally, the use of newly available large datasets, such as ExAC, substantially decreases the number of miscalled somatic variants in the absence of a matched normal.
The effects of administering targeted therapies to patients with germline mutations in the relevant gene are largely unknown. Mutations of putative germline origin may be important for hereditary cancer knowledge and tumor treatment, and should be reported as such. For NGS-based cancer interpretation to guide clinical decisions in a practical and cost-effective manner, highly optimized tumor-only and tumor/normal analyses must be available with proper attention to germline consent, classification and education.
Citation Format: Elena Helman, Michael J. Clark, Ravi Alla, Sean M. Boyle, Shujun Luo, Selene Virk, Deanna Church, Parin Sripakdeevong, Jason Harris, Mirian karbelashvili, Christian Haudenschild, John West, Richard Chen. The benefits and burdens of assaying matched normal tissue when sequencing cancer genomes. [abstract]. In: Proceedings of the 107th Annual Meeting of the American Association for Cancer Research; 2016 Apr 16-20; New Orleans, LA. Philadelphia (PA): AACR; Cancer Res 2016;76(14 Suppl):Abstract nr 3169.
Collapse
|
4
|
Clark MJ, Boyle SM, Helman E, Luo S, Bartha G, Morra M, Patwardhan A, Haudenschild C, Karbelashvili M, Sripakdeevong P, Harris J, Church D, Chervitz S, West J, Chen R. Abstract 4744: Solving genomic assay trade-offs with an optimized, extended cancer gene panel for research and clinical applications. Cancer Res 2015. [DOI: 10.1158/1538-7445.am2015-4744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Choosing a genomic assay for cancer research is complicated by trade-offs. Cancer gene panels are a common choice, but they target mutational hotspots in a relatively small number of genes, often for cancers that are most commonly tested and that have common genetic etiologies. A common alternative is exome sequencing, which includes all the coding genes but, due to its larger genomic footprint, cannot reach the same depths as panels and therefore is less able to deal with low tumor purity and heterogeneity. Whole genome sequencing trades off very shallow depth and coverage over vast regions of uninterpretable genomic sequence in exchange for the identification of intergenic variants and structural variant breakpoints. All of these assays can be supplemented with RNA sequencing in order to capture gene fusions, allelic expression, splice isoforms, and gene expression. RNAseq comes with its own costs: the need to extract RNA from the same tissue, the need to perform a second assay, and the need to analyze a very different type of data from DNA sequencing.
The trade-offs generally come down to three major issues: depth of sequencing, specific genes targeted, and cost. To solve these, we designed an extended, optimized cancer gene panel facilitating high depth sequencing at low cost. We started by identifying a comprehensive list of over 1,300 cancer genes. These genes were chosen through exhaustive cancer gene database and literature curation, and include genes from all major cancer pathways and from the Cancer Gene Census. We then took this gene list and applied an augmented targeting design strategy that we have previously used to create an augmented exome enrichment platform which fills in gaps that standard technical exomes miss.
To validate the panel and analysis, we identified test samples including well-described cancer cell lines, cell line mixtures with engineered cancer variations, and formalin-fixed neoplastic tissues. We then performed a series of tests with these samples to measure the panel's small variant sensitivity and specificity, gauge its limits of detection, validate the detection of gene fusions, and demonstrate its ability to identify copy number alterations and loss of heterozygosity. In engineered cell lines, we detected 100% of small variants down to 5% allele frequency. We also mixed the cancer cell lines in various ratios and found similarly high sensitivity as well as very high specificity for small variant detection. We further compared our structural variation calls in the DNA and our fusion calls in the RNA with known data and found that we had very high concordance with known variations.
These studies demonstrate that an extended, augmented cancer gene panel strategy solves many genomic assay trade-offs and leads to high accuracy and variant yield for cancer research applications.
Citation Format: Michael J. Clark, Sean M. Boyle, Elena Helman, Shujun Luo, Gabor Bartha, Massimo Morra, Anil Patwardhan, Christian Haudenschild, Mirian Karbelashvili, Parin Sripakdeevong, Jason Harris, Deanna Church, Stephen Chervitz, John West, Richard Chen. Solving genomic assay trade-offs with an optimized, extended cancer gene panel for research and clinical applications. [abstract]. In: Proceedings of the 106th Annual Meeting of the American Association for Cancer Research; 2015 Apr 18-22; Philadelphia, PA. Philadelphia (PA): AACR; Cancer Res 2015;75(15 Suppl):Abstract nr 4744. doi:10.1158/1538-7445.AM2015-4744
Collapse
|
5
|
Boyle SM, Clark MJ, Helman E, Parker AS, Ho T, Luo S, Kirk S, Sripakdeevong P, Karbelashvili M, Church DM, Snyder M, West J, Chen R. Abstract 3899: Integrating RNA/DNA analysis with a comprehensive cancer panel to improve interpretations of stage four metastatic renal cell carcinoma. Cancer Res 2015. [DOI: 10.1158/1538-7445.am2015-3899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Next-generation sequencing is now commonly applied to measure genomic modifications, providing the opportunity for a more accurate view of cancer progression. That being said, several challenges including tumor purity, heterogeneity and the broad set of cancer associated cellular functions, increase the difficulty of interpreting cancer genomes and make analyzing them distinct from analysis of germline diseases. The ability to detect low allelic mutations in the broad set of cancer-associated genes is critically important. To address these issues, we developed an extended cancer panel, a targeted enrichment sequencing platform that includes over 1,300 cancer genes and 200 miRNAs that improves coverage in traditionally difficult to sequence NGS regions. In this study, we apply an integrated RNA/DNA analysis approach as well as our extended cancer panel to explore mutations in metastatic renal cell carcinoma (RCC) tissue samples before and after treatment.
Our goal with this study was to test the importance of integrating DNA and RNA analysis, which allows for detection of mutational events that cannot be observed by either method alone. With RNA analysis it is possible to detect gene fusion events, expression of small variants (SNVs/indels), and gene expression levels while DNA analysis allows for detection of loss of heterozygosity (LOH), copy number variants (CNVs), and small variants. By integrating both the RNA and DNA analysis, it is possible to identify unique variant classes, such as unexpressed variants, allele specific expression (ASE), effects of CNVs on gene expression levels, and how fusion gene integration sites affect CNVs. In our study, each variant class was interpreted pre- and post-treatment, allowing for optimal analysis of treatment effectiveness.
By applying both our RNA/DNA integration approach and our extended cancer panel to investigate metastatic RCC samples, we were able to classify many genomic alterations that drive cancer progression. For example, we found that anywhere from 5% to 30% of small variant mutations in driver genes called in the DNA reside in genes that are unexpressed in the RNA. Likewise, we noted multiple cases of ASE, where high profile variants observed in DNA were entirely absent in the RNA despite high expression levels. Strong correlations between CNVs observed in DNA and gene expression changes found in RNA were also detected. We also observed that about half of the RCC samples we tested were biallelic for VHL mutations, suggesting inherited predisposition combined with second hit somatic mutations, facilitating cancer progression.
This analysis demonstrates how combining an integrated RNA/DNA approach with a cancer focused augmented enrichment panel allows for detection of both low allelic representation variants and unique variant classes, both of which are critical for accurate interpretation of cancer samples.
Citation Format: Sean M. Boyle, Michael J. Clark, Elena Helman, Alexander S. Parker, Thai Ho, Shujun Luo, Scott Kirk, Parin Sripakdeevong, Mirian Karbelashvili, Deanna M. Church, Michael Snyder, John West, Rich Chen. Integrating RNA/DNA analysis with a comprehensive cancer panel to improve interpretations of stage four metastatic renal cell carcinoma. [abstract]. In: Proceedings of the 106th Annual Meeting of the American Association for Cancer Research; 2015 Apr 18-22; Philadelphia, PA. Philadelphia (PA): AACR; Cancer Res 2015;75(15 Suppl):Abstract nr 3899. doi:10.1158/1538-7445.AM2015-3899
Collapse
Affiliation(s)
| | | | | | | | - Thai Ho
- 3Mayo Clinic, Scottsdale, AZ
| | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Clark MJ, Helman E, Boyle S, Luo S, Church D, Harris J, Karbelashvili M, Chervitz S, Sripakdeevong P, Bartha G, Patwardhan AJ, West J, Chen R. The detection of clinically relevant cancer mutations using a high depth, augmented, comprehensive cancer gene panel. J Clin Oncol 2015. [DOI: 10.1200/jco.2015.33.15_suppl.e12547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
7
|
Sripakdeevong P, Cevec M, Chang AT, Erat MC, Ziegeler M, Zhao Q, Fox GE, Gao X, Kennedy SD, Kierzek R, Nikonowicz EP, Schwalbe H, Sigel RKO, Turner DH, Das R. Structure determination of noncanonical RNA motifs guided by ¹H NMR chemical shifts. Nat Methods 2014; 11:413-6. [PMID: 24584194 PMCID: PMC3985481 DOI: 10.1038/nmeth.2876] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Accepted: 01/06/2014] [Indexed: 12/31/2022]
Abstract
Structured non-coding RNAs underline fundamental cellular processes, but determining their 3D structures remains challenging. We demonstrate herein that integrating NMR 1H chemical shift data with Rosetta de novo modeling can consistently return high-resolution RNA structures. On a benchmark set of 23 noncanonical RNA motifs, including 11 blind targets, Chemical-Shift-ROSETTA for RNA (CS-ROSETTA-RNA) recovered the experimental structures with high accuracy (0.6 to 2.0 Å all-heavy-atom rmsd) in 18 cases.
Collapse
Affiliation(s)
| | - Mirko Cevec
- Center for Biomolecular Magnetic Resonance, Institute for Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe University Frankfurt, Frankfurt, Germany
| | - Andrew T Chang
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas, USA
| | - Michèle C Erat
- 1] Department of Biochemistry, University of Oxford, Oxford, UK. [2] Institute of Inorganic Chemistry, University of Zurich, Zurich, Switzerland
| | - Melanie Ziegeler
- Center for Biomolecular Magnetic Resonance, Institute for Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe University Frankfurt, Frankfurt, Germany
| | - Qin Zhao
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
| | - George E Fox
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, USA
| | - Xiaolian Gao
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, USA
| | - Scott D Kennedy
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, New York, USA
| | - Ryszard Kierzek
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Edward P Nikonowicz
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas, USA
| | - Harald Schwalbe
- Center for Biomolecular Magnetic Resonance, Institute for Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe University Frankfurt, Frankfurt, Germany
| | - Roland K O Sigel
- Institute of Inorganic Chemistry, University of Zurich, Zurich, Switzerland
| | - Douglas H Turner
- Department of Chemistry, University of Rochester, Rochester, New York, USA
| | - Rhiju Das
- 1] Biophysics Program, Stanford University, Stanford, California, USA. [2] Department of Biochemistry, Stanford University, Stanford, California, USA. [3] Department of Physics, Stanford University, Stanford, California, USA
| |
Collapse
|
8
|
Lyskov S, Chou FC, Conchúir SÓ, Der BS, Drew K, Kuroda D, Xu J, Weitzner BD, Renfrew PD, Sripakdeevong P, Borgo B, Havranek JJ, Kuhlman B, Kortemme T, Bonneau R, Gray JJ, Das R. Serverification of molecular modeling applications: the Rosetta Online Server that Includes Everyone (ROSIE). PLoS One 2013; 8:e63906. [PMID: 23717507 PMCID: PMC3661552 DOI: 10.1371/journal.pone.0063906] [Citation(s) in RCA: 261] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2013] [Accepted: 04/04/2013] [Indexed: 11/21/2022] Open
Abstract
The Rosetta molecular modeling software package provides experimentally tested and rapidly evolving tools for the 3D structure prediction and high-resolution design of proteins, nucleic acids, and a growing number of non-natural polymers. Despite its free availability to academic users and improving documentation, use of Rosetta has largely remained confined to developers and their immediate collaborators due to the code's difficulty of use, the requirement for large computational resources, and the unavailability of servers for most of the Rosetta applications. Here, we present a unified web framework for Rosetta applications called ROSIE (Rosetta Online Server that Includes Everyone). ROSIE provides (a) a common user interface for Rosetta protocols, (b) a stable application programming interface for developers to add additional protocols, (c) a flexible back-end to allow leveraging of computer cluster resources shared by RosettaCommons member institutions, and (d) centralized administration by the RosettaCommons to ensure continuous maintenance. This paper describes the ROSIE server infrastructure, a step-by-step 'serverification' protocol for use by Rosetta developers, and the deployment of the first nine ROSIE applications by six separate developer teams: Docking, RNA de novo, ERRASER, Antibody, Sequence Tolerance, Supercharge, Beta peptide design, NCBB design, and VIP redesign. As illustrated by the number and diversity of these applications, ROSIE offers a general and speedy paradigm for serverification of Rosetta applications that incurs negligible cost to developers and lowers barriers to Rosetta use for the broader biological community. ROSIE is available at http://rosie.rosettacommons.org.
Collapse
Affiliation(s)
- Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Fang-Chieh Chou
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California, United States of America
| | - Shane Ó. Conchúir
- California Institute for Quantitative Biomedical Research, University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
| | - Bryan S. Der
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Kevin Drew
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
| | - Daisuke Kuroda
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Jianqing Xu
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Brian D. Weitzner
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - P. Douglas Renfrew
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
| | - Parin Sripakdeevong
- Biophysics Program, Stanford University, Stanford, California, United States of America
| | - Benjamin Borgo
- Department of Genetics, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - James J. Havranek
- Department of Genetics, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Tanja Kortemme
- California Institute for Quantitative Biomedical Research, University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, California, United States of America
| | - Richard Bonneau
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
- Computer Science Department, Courant Institute of Mathematical Sciences, New York University, New York, New York, United States of America
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, Maryland, United States of America
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California, United States of America
- Department of Physics, Stanford University, Stanford, California, United States of America
| |
Collapse
|
9
|
Abstract
Three-dimensional RNA models fitted into crystallographic density maps exhibit pervasive conformational ambiguities, geometric errors, and steric clashes. To address these problems, we present Enumerative Real-space Refinement ASsisted by Electron density under Rosetta (ERRASER), coupled to PHENIX (Python-based Hierarchical Environment for Integrated Xtallography) diffraction-based refinement. On 24 datasets, ERRASER automatically corrects the majority of MolProbity-assessed errors, improves average Rfree factor, resolves functionally important discrepancies in non-canonical structure, and refines low-resolution models to better match higher resolution models.
Collapse
Affiliation(s)
- Fang-Chieh Chou
- Department of Biochemistry, Stanford University, Stanford, California, USA
| | | | | | | | | |
Collapse
|
10
|
Cruz JA, Blanchet MF, Boniecki M, Bujnicki JM, Chen SJ, Cao S, Das R, Ding F, Dokholyan NV, Flores SC, Huang L, Lavender CA, Lisi V, Major F, Mikolajczak K, Patel DJ, Philips A, Puton T, Santalucia J, Sijenyi F, Hermann T, Rother K, Rother M, Serganov A, Skorupski M, Soltysinski T, Sripakdeevong P, Tuszynska I, Weeks KM, Waldsich C, Wildauer M, Leontis NB, Westhof E. RNA-Puzzles: a CASP-like evaluation of RNA three-dimensional structure prediction. RNA 2012; 18:610-25. [PMID: 22361291 PMCID: PMC3312550 DOI: 10.1261/rna.031054.111] [Citation(s) in RCA: 156] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
We report the results of a first, collective, blind experiment in RNA three-dimensional (3D) structure prediction, encompassing three prediction puzzles. The goals are to assess the leading edge of RNA structure prediction techniques; compare existing methods and tools; and evaluate their relative strengths, weaknesses, and limitations in terms of sequence length and structural complexity. The results should give potential users insight into the suitability of available methods for different applications and facilitate efforts in the RNA structure prediction community in ongoing efforts to improve prediction tools. We also report the creation of an automated evaluation pipeline to facilitate the analysis of future RNA structure prediction exercises.
Collapse
Affiliation(s)
- José Almeida Cruz
- Architecture et Réactivité de l'ARN, Université de Strasbourg, IBMC-CNRS, F-67084 Strasbourg, France
| | - Marc-Frédérick Blanchet
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec H3C 3J7, Canada
| | - Michal Boniecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Janusz M. Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Shi-Jie Chen
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, Missouri 65211, USA
| | - Song Cao
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, Missouri 65211, USA
| | - Rhiju Das
- Department of Biochemistry
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Feng Ding
- Department of Biochemistry and Biophysics, University of North Carolina, School of Medicine, Chapel Hill, North Carolina 27599, USA
| | - Nikolay V. Dokholyan
- Department of Biochemistry and Biophysics, University of North Carolina, School of Medicine, Chapel Hill, North Carolina 27599, USA
| | - Samuel Coulbourn Flores
- Computational & Systems Biology Program, Institute for Cell and Molecular Biology, Uppsala University, 751 05 Uppsala, Sweden
| | - Lili Huang
- Structural Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York 10021, USA
| | - Christopher A. Lavender
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Véronique Lisi
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec H3C 3J7, Canada
| | - François Major
- Institute for Research in Immunology and Cancer (IRIC), Department of Computer Science and Operations Research, Université de Montréal, Montréal, Québec H3C 3J7, Canada
| | - Katarzyna Mikolajczak
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Dinshaw J. Patel
- Structural Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York 10021, USA
| | - Anna Philips
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Tomasz Puton
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - John Santalucia
- Department of Chemistry, Wayne State University, Detroit, Michigan 48202, USA
- DNA Software, Ann Arbor, Michigan 48104, USA
| | | | - Thomas Hermann
- Department of Chemistry and Biochemistry, University of California at San Diego, La Jolla, California 92093, USA
| | - Kristian Rother
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Magdalena Rother
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Alexander Serganov
- Structural Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York 10021, USA
| | - Marcin Skorupski
- Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, 61-614 Poznan, Poland
| | - Tomasz Soltysinski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Parin Sripakdeevong
- Department of Biochemistry
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Irina Tuszynska
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
| | - Kevin M. Weeks
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Christina Waldsich
- Max F. Perutz Laboratories, Department of Biochemistry, University of Vienna, Vienna 1030, Austria
| | - Michael Wildauer
- Max F. Perutz Laboratories, Department of Biochemistry, University of Vienna, Vienna 1030, Austria
| | - Neocles B. Leontis
- Department of Chemistry and Center for Biomolecular Sciences, Bowling Green State University, Bowling Green, Ohio 43403, USA
| | - Eric Westhof
- Architecture et Réactivité de l'ARN, Université de Strasbourg, IBMC-CNRS, F-67084 Strasbourg, France
- Corresponding author.E-mail .
| |
Collapse
|