1
|
Claussnitzer M, Parikh VN, Wagner AH, Arbesfeld JA, Bult CJ, Firth HV, Muffley LA, Nguyen Ba AN, Riehle K, Roth FP, Tabet D, Bolognesi B, Glazer AM, Rubin AF. Minimum information and guidelines for reporting a multiplexed assay of variant effect. Genome Biol 2024; 25:100. [PMID: 38641812 PMCID: PMC11027375 DOI: 10.1186/s13059-024-03223-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 03/25/2024] [Indexed: 04/21/2024] Open
Abstract
Multiplexed assays of variant effect (MAVEs) have emerged as a powerful approach for interrogating thousands of genetic variants in a single experiment. The flexibility and widespread adoption of these techniques across diverse disciplines have led to a heterogeneous mix of data formats and descriptions, which complicates the downstream use of the resulting datasets. To address these issues and promote reproducibility and reuse of MAVE data, we define a set of minimum information standards for MAVE data and metadata and outline a controlled vocabulary aligned with established biomedical ontologies for describing these experimental designs.
Collapse
Affiliation(s)
- Melina Claussnitzer
- The Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Cambridge, MA, 02142, USA
| | - Victoria N Parikh
- Stanford Center for Inherited Cardiovascular Disease, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Alex H Wagner
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, 43215, USA
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, 43210, USA
| | - Jeremy A Arbesfeld
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, 43215, USA
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, 43210, USA
| | - Carol J Bult
- The Jackson Laboratory, Bar Harbor, ME, 04609, USA
| | - Helen V Firth
- Wellcome Sanger Institute, Hinxton, Cambridge, UK
- Dept of Medical Genetics, Cambridge University Hospitals NHS Trust, Cambridge, UK
| | - Lara A Muffley
- Department of Genome Sciences, University of Washington, Seattle, WA, 98105, USA
| | - Alex N Nguyen Ba
- Department of Biology, University of Toronto at Mississauga, Mississauga, ON, Canada
| | - Kevin Riehle
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Frederick P Roth
- Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
| | - Daniel Tabet
- Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, Canada
| | - Benedetta Bolognesi
- Institute for Bioengineering of Catalunya (IBEC), The Barcelona Institute of Science and Technology, Barcelona, Spain.
| | - Andrew M Glazer
- Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
| | - Alan F Rubin
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia.
- Department of Medical Biology, University of Melbourne, Parkville, VIC, Australia.
| |
Collapse
|
2
|
Claussnitzer M, Parikh VN, Wagner AH, Arbesfeld JA, Bult CJ, Firth HV, Muffley LA, Nguyen Ba AN, Riehle K, Roth FP, Tabet D, Bolognesi B, Glazer AM, Rubin AF. Minimum information and guidelines for reporting a Multiplexed Assay of Variant Effect. ArXiv 2023:arXiv:2306.15113v1. [PMID: 37426450 PMCID: PMC10327236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
Multiplexed Assays of Variant Effect (MAVEs) have emerged as a powerful approach for interrogating thousands of genetic variants in a single experiment. The flexibility and widespread adoption of these techniques across diverse disciplines has led to a heterogeneous mix of data formats and descriptions, which complicates the downstream use of the resulting datasets. To address these issues and promote reproducibility and reuse of MAVE data, we define a set of minimum information standards for MAVE data and metadata and outline a controlled vocabulary aligned with established biomedical ontologies for describing these experimental designs.
Collapse
Affiliation(s)
- Melina Claussnitzer
- The Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Cambridge, MA 02142, USA
| | - Victoria N. Parikh
- Stanford Center for Inherited Cardiovascular Disease, Stanford University School of Medicine, Stanford, CA, USA 94305
| | - Alex H. Wagner
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, OH 43215, USA
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH 43210, USA
| | - Jeremy A. Arbesfeld
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, OH 43215, USA
| | | | - Helen V. Firth
- Wellcome Sanger Institute, Hinxton, Cambridge, UK
- Dept of Medical Genetics, Cambridge University Hospitals NHS Trust, Cambridge UK
| | - Lara A. Muffley
- Department of Genome Sciences, University of Washington, Seattle, WA 98105, USA
| | - Alex N. Nguyen Ba
- Department of Biology, University of Toronto at Mississauga, Mississauga, Ontario, Canada
| | - Kevin Riehle
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Frederick P. Roth
- Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
| | - Daniel Tabet
- Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
| | - Benedetta Bolognesi
- Institute for Bioengineering of Catalunya (IBEC), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | | | - Alan F. Rubin
- Bioinformatics Division, WEHI, Parkville, Victoria, Australia
- Department of Medical Biology, University of Melbourne, Parkville, Victoria, Australia
| |
Collapse
|
3
|
Goar W, Babb L, Chamala S, Cline M, Freimuth RR, Hart RK, Kuzma K, Lee J, Nelson T, Prlić A, Riehle K, Smith A, Stahl K, Yates AD, Rehm HL, Wagner AH. Development and application of a computable genotype model in the GA4GH Variation Representation Specification. Pac Symp Biocomput 2023; 28:383-394. [PMID: 36540993 PMCID: PMC9782714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
As the diversity of genomic variation data increases with our growing understanding of the role of variation in health and disease, it is critical to develop standards for precise inter-system exchange of these data for research and clinical applications. The Global Alliance for Genomics and Health (GA4GH) Variation Representation Specification (VRS) meets this need through a technical terminology and information model for disambiguating and concisely representing variation concepts. Here we discuss the recent Genotype model in VRS, which may be used to represent the allelic composition of a genetic locus. We demonstrate the use of the Genotype model and the constituent Haplotype model for the precise and interoperable representation of pharmacogenomic diplotypes, HGVS variants, and VCF records using VRS and discuss how this can be leveraged to enable interoperable exchange and search operations between assayed variation and genomic knowledgebases.
Collapse
Affiliation(s)
- Wesley Goar
- Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Wei CH, Allot A, Riehle K, Milosavljevic A, Lu Z. tmVar 3.0: an improved variant concept recognition and normalization tool. Bioinformatics 2022; 38:4449-4451. [PMID: 35904569 PMCID: PMC9477515 DOI: 10.1093/bioinformatics/btac537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 07/07/2022] [Accepted: 07/27/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Previous studies have shown that automated text-mining tools are becoming increasingly important for successfully unlocking variant information in scientific literature at large scale. Despite multiple attempts in the past, existing tools are still of limited recognition scope and precision. RESULT We propose tmVar 3.0: an improved variant recognition and normalization system. Compared to its predecessors, tmVar 3.0 recognizes a wider spectrum of variant-related entities (e.g. allele and copy number variants), and groups together different variant mentions belonging to the same genomic sequence position in an article for improved accuracy. Moreover, tmVar 3.0 provides advanced variant normalization options such as allele-specific identifiers from the ClinGen Allele Registry. tmVar 3.0 exhibits state-of-the-art performance with over 90% in F-measure for variant recognition and normalization, when evaluated on three independent benchmarking datasets. tmVar 3.0 as well as annotations for the entire PubMed and PMC datasets are freely available for download. AVAILABILITY AND IMPLEMENTATION https://github.com/ncbi/tmVar3.
Collapse
Affiliation(s)
- Chih-Hsuan Wei
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Alexis Allot
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| | - Kevin Riehle
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Zhiyong Lu
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD 20894, USA
| |
Collapse
|
5
|
Thistlethwaite LR, Li X, Burrage LC, Riehle K, Hacia JG, Braverman N, Wangler MF, Miller MJ, Elsea SH, Milosavljevic A. Clinical diagnosis of metabolic disorders using untargeted metabolomic profiling and disease-specific networks learned from profiling data. Sci Rep 2022; 12:6556. [PMID: 35449147 PMCID: PMC9023513 DOI: 10.1038/s41598-022-10415-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2021] [Accepted: 03/14/2022] [Indexed: 02/06/2023] Open
Abstract
Untargeted metabolomics is a global molecular profiling technology that can be used to screen for inborn errors of metabolism (IEMs). Metabolite perturbations are evaluated based on current knowledge of specific metabolic pathway deficiencies, a manual diagnostic process that is qualitative, has limited scalability, and is not equipped to learn from accumulating clinical data. Our purpose was to improve upon manual diagnosis of IEMs in the clinic by developing novel computational methods for analyzing untargeted metabolomics data. We employed CTD, an automated computational diagnostic method that "connects the dots" between metabolite perturbations observed in individual metabolomics profiling data and modules identified in disease-specific metabolite co-perturbation networks learned from prior profiling data. We also extended CTD to calculate distances between any two individuals (CTDncd) and between an individual and a disease state (CTDdm), to provide additional network-quantified predictors for use in diagnosis. We show that across 539 plasma samples, CTD-based network-quantified measures can reproduce accurate diagnosis of 16 different IEMs, including adenylosuccinase deficiency, argininemia, argininosuccinic aciduria, aromatic L-amino acid decarboxylase deficiency, cerebral creatine deficiency syndrome type 2, citrullinemia, cobalamin biosynthesis defect, GABA-transaminase deficiency, glutaric acidemia type 1, maple syrup urine disease, methylmalonic aciduria, ornithine transcarbamylase deficiency, phenylketonuria, propionic acidemia, rhizomelic chondrodysplasia punctata, and the Zellweger spectrum disorders. Our approach can be used to supplement information from biochemical pathways and has the potential to significantly enhance the interpretation of variants of uncertain significance uncovered by exome sequencing. CTD, CTDdm, and CTDncd can serve as an essential toolset for biological interpretation of untargeted metabolomics data that overcomes limitations associated with manual diagnosis to assist diagnosticians in clinical decision-making. By automating and quantifying the interpretation of perturbation patterns, CTD can improve the speed and confidence by which clinical laboratory directors make diagnostic and treatment decisions, while automatically improving performance with new case data.
Collapse
Affiliation(s)
- Lillian R Thistlethwaite
- Quantitative and Computational Biosciences Program, Baylor College of Medicine, One Baylor Plaza, 400D, Houston, TX, 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Xiqi Li
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Lindsay C Burrage
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Texas Children's Hospital, Houston, TX, USA
| | - Kevin Riehle
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Joseph G Hacia
- Department of Biochemistry and Molecular Medicine, Keck School of Medicine of the University of Southern California, Los Angeles, CA, USA
| | - Nancy Braverman
- Department of Pediatrics and Human Genetics, McGill University, Montreal, QC, Canada
| | - Michael F Wangler
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Texas Children's Hospital, Houston, TX, USA
- Jan and Dan Duncan Texas Children's Hospital Neurological Research Institute, Houston, TX, USA
| | - Marcus J Miller
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Sarah H Elsea
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Aleksandar Milosavljevic
- Quantitative and Computational Biosciences Program, Baylor College of Medicine, One Baylor Plaza, 400D, Houston, TX, 77030, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
6
|
Wagner AH, Babb L, Alterovitz G, Baudis M, Brush M, Cameron DL, Cline M, Griffith M, Griffith OL, Hunt SE, Kreda D, Lee JM, Li S, Lopez J, Moyer E, Nelson T, Patel RY, Riehle K, Robinson PN, Rynearson S, Schuilenburg H, Tsukanov K, Walsh B, Konopko M, Rehm HL, Yates AD, Freimuth RR, Hart RK. The GA4GH Variation Representation Specification: A computational framework for variation representation and federated identification. Cell Genom 2021; 1. [PMID: 35311178 PMCID: PMC8929418 DOI: 10.1016/j.xgen.2021.100027] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Maximizing the personal, public, research, and clinical value of genomic information will require the reliable exchange of genetic variation data. We report here the Variation Representation Specification (VRS, pronounced "verse"), an extensible framework for the computable representation of variation that complements contemporary human-readable and flat file standards for genomic variation representation. VRS provides semantically precise representations of variation and leverages this design to enable federated identification of biomolecular variation with globally consistent and unique computed identifiers. The VRS framework includes a terminology and information model, machine-readable schema, data sharing conventions, and a reference implementation, each of which is intended to be broadly useful and freely available for community use. VRS was developed by a partnership among national information resource providers, public initiatives, and diagnostic testing laboratories under the auspices of the Global Alliance for Genomics and Health (GA4GH).
Collapse
Affiliation(s)
- Alex H. Wagner
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, OH 43215, USA
- Corresponding author
| | - Lawrence Babb
- Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Corresponding author
| | - Gil Alterovitz
- Harvard Medical School, Boston, MA 02115, USA
- Department of Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA
| | - Michael Baudis
- University of Zurich and Swiss Institute of Bioinformatics, Zurich, Switzerland
| | - Matthew Brush
- Oregon Health & Science University, Portland, OR 97239, USA
| | - Daniel L. Cameron
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC, Australia
- Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia
| | - Melissa Cline
- UC Santa Cruz Genomics Institute, Santa Cruz, CA 95060, USA
| | - Malachi Griffith
- Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Obi L. Griffith
- Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Sarah E. Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - David Kreda
- Department of Biomedical Informatics, Harvard Medical School, Boston MA 02115, USA
| | - Jennifer M. Lee
- Essex Management LLC and National Cancer Institute, Rockville, MD 20850, USA
| | - Stephanie Li
- The Global Alliance for Genomics and Health, Toronto, ON, Canada
| | | | - Eric Moyer
- National Center for Biotechnology Information, National Library of Medicine National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | - Kevin Riehle
- Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Shawn Rynearson
- Utah Center for Genetic Discovery, University of Utah, Salt Lake City, UT 84112, USA
| | - Helen Schuilenburg
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kirill Tsukanov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Brian Walsh
- Oregon Health & Science University, Portland, OR 97239, USA
| | - Melissa Konopko
- The Global Alliance for Genomics and Health, Toronto, ON, Canada
| | - Heidi L. Rehm
- Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Cambridge, MA 02142, USA
| | - Andrew D. Yates
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Robert R. Freimuth
- Center for Individualized Medicine, Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN 55905, USA
| | - Reece K. Hart
- Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- MyOme, Inc., Menlo Park, CA 94070, USA
- Corresponding author
| |
Collapse
|
7
|
Chen YC, Faver JC, Ku AF, Miklossy G, Riehle K, Bohren KM, Ucisik MN, Matzuk MM, Yu Z, Simmons N. C-N Coupling of DNA-Conjugated (Hetero)aryl Bromides and Chlorides for DNA-Encoded Chemical Library Synthesis. Bioconjug Chem 2020; 31:770-780. [PMID: 32019312 PMCID: PMC7086399 DOI: 10.1021/acs.bioconjchem.9b00863] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
![]()
DNA-encoded
chemical library (DECL) screens are a rapid and economical
tool to identify chemical starting points for drug discovery. As a
robust transformation for drug discovery, palladium-catalyzed C–N
coupling is a valuable synthetic method for the construction of DECL
chemical matter; however, currently disclosed methods have only been
demonstrated on DNA-attached (hetero)aromatic iodide and bromide electrophiles.
We developed conditions utilizing an N-heterocyclic
carbene–palladium catalyst that extends this reaction to the
coupling of DNA-conjugated (hetero)aromatic chlorides with (hetero)aromatic
and select aliphatic amine nucleophiles. In addition, we evaluated
steric and electronic effects within this catalyst series, carried
out a large substrate scope study on two representative (hetero)aryl
bromides, and applied this newly developed method within the construction
of a 63 million-membered DECL.
Collapse
Affiliation(s)
- Ying-Chu Chen
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| | - John C Faver
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Angela F Ku
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Gabriella Miklossy
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Kevin Riehle
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Kurt M Bohren
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Melek N Ucisik
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Martin M Matzuk
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Zhifeng Yu
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Nicholas Simmons
- Center for Drug Discovery, Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, United States
| |
Collapse
|
8
|
Li JY, Miklossy G, Modukuri RK, Bohren KM, Yu Z, Palaniappan M, Faver JC, Riehle K, Matzuk MM, Simmons N. Palladium-Catalyzed Hydroxycarbonylation of (Hetero)aryl Halides for DNA-Encoded Chemical Library Synthesis. Bioconjug Chem 2019; 30:2209-2215. [PMID: 31329429 PMCID: PMC6706801 DOI: 10.1021/acs.bioconjchem.9b00447] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
![]()
A strategy
for DNA-compatible, palladium-catalyzed hydroxycarbonylation
of (hetero)aryl halides on DNA–chemical conjugates has been
developed. This method generally provided the corresponding carboxylic
acids in moderate to very good conversions for (hetero)aryl iodides
and bromides, and in poor to moderate conversions for (hetero)aryl
chlorides. These conditions were further validated by application
within a DNA-encoded chemical library synthesis and subsequent discovery
of enriched features from the library in selection experiments against
two protein targets.
Collapse
Affiliation(s)
- Jian-Yuan Li
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Gabriella Miklossy
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Ram K Modukuri
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Kurt M Bohren
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Zhifeng Yu
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Murugesan Palaniappan
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - John C Faver
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Kevin Riehle
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Martin M Matzuk
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Nicholas Simmons
- Center for Drug Discovery, Department of Pathology and Immunology , Baylor College of Medicine , Houston , Texas 77030 , United States
| |
Collapse
|
9
|
Abstract
![]()
A hypodiboric
acid system for the reduction of nitro groups on
DNA–chemical conjugates has been developed. This transformation
provided good to excellent yields of the reduced amine product for
a variety of functionalized aromatic, heterocyclic, and aliphatic
nitro compounds. DNA tolerance to reaction conditions, extension to
decigram scale reductions, successful use in a DNA-encoded chemical
library synthesis, and subsequent target selection are also described.
Collapse
Affiliation(s)
- Huang-Chi Du
- Center for Drug Discovery , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Nicholas Simmons
- Center for Drug Discovery , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - John C Faver
- Center for Drug Discovery , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Zhifeng Yu
- Center for Drug Discovery , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Murugesan Palaniappan
- Center for Drug Discovery , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Kevin Riehle
- Center for Drug Discovery , Baylor College of Medicine , Houston , Texas 77030 , United States
| | - Martin M Matzuk
- Center for Drug Discovery , Baylor College of Medicine , Houston , Texas 77030 , United States
| |
Collapse
|
10
|
Faver JC, Riehle K, Lancia DR, Milbank JBJ, Kollmann CS, Simmons N, Yu Z, Matzuk MM. Quantitative Comparison of Enrichment from DNA-Encoded Chemical Library Selections. ACS Comb Sci 2019; 21:75-82. [PMID: 30672692 PMCID: PMC6372980 DOI: 10.1021/acscombsci.8b00116] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
DNA-encoded
chemical libraries (DELs) provide a high-throughput
and cost-effective route for screening billions of unique molecules
for binding affinity for diverse protein targets. Identifying candidate
compounds from these libraries involves affinity selection, DNA sequencing,
and measuring enrichment in a sample pool of DNA barcodes. Successful
detection of potent binders is affected by many factors, including
selection parameters, chemical yields, library amplification, sequencing
depth, sequencing errors, library sizes, and the chosen enrichment
metric. To date, there has not been a clear consensus about how enrichment
from DEL selections should be measured or reported. We propose a normalized z-score enrichment metric using a binomial distribution
model that satisfies important criteria that are relevant for analysis
of DEL selection data. The introduced metric is robust with respect
to library diversity and sampling and allows for quantitative comparisons
of enrichment of n-synthons from parallel DEL selections.
These features enable a comparative enrichment analysis strategy that can
provide valuable information about hit compounds in early stage drug
discovery.
Collapse
Affiliation(s)
| | | | - David R. Lancia
- FORMA Therapeutics Inc., 500 Arsenal Street, Suite 100, Watertown, Massachusetts 02472, United States
| | - Jared B. J. Milbank
- FORMA Therapeutics Inc., 500 Arsenal Street, Suite 100, Watertown, Massachusetts 02472, United States
| | - Christopher S. Kollmann
- FORMA Therapeutics Inc., 500 Arsenal Street, Suite 100, Watertown, Massachusetts 02472, United States
| | | | | | | |
Collapse
|
11
|
Patel RY, Shah N, Jackson AR, Ghosh R, Pawliczek P, Paithankar S, Baker A, Riehle K, Chen H, Milosavljevic S, Bizon C, Rynearson S, Nelson T, Jarvik GP, Rehm HL, Harrison SM, Azzariti D, Powell B, Babb L, Plon SE, Milosavljevic A. ClinGen Pathogenicity Calculator: a configurable system for assessing pathogenicity of genetic variants. Genome Med 2017; 9:3. [PMID: 28081714 PMCID: PMC5228115 DOI: 10.1186/s13073-016-0391-z] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2016] [Accepted: 12/07/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The success of the clinical use of sequencing based tests (from single gene to genomes) depends on the accuracy and consistency of variant interpretation. Aiming to improve the interpretation process through practice guidelines, the American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) have published standards and guidelines for the interpretation of sequence variants. However, manual application of the guidelines is tedious and prone to human error. Web-based tools and software systems may not only address this problem but also document reasoning and supporting evidence, thus enabling transparency of evidence-based reasoning and resolution of discordant interpretations. RESULTS In this report, we describe the design, implementation, and initial testing of the Clinical Genome Resource (ClinGen) Pathogenicity Calculator, a configurable system and web service for the assessment of pathogenicity of Mendelian germline sequence variants. The system allows users to enter the applicable ACMG/AMP-style evidence tags for a specific allele with links to supporting data for each tag and generate guideline-based pathogenicity assessment for the allele. Through automation and comprehensive documentation of evidence codes, the system facilitates more accurate application of the ACMG/AMP guidelines, improves standardization in variant classification, and facilitates collaborative resolution of discordances. The rules of reasoning are configurable with gene-specific or disease-specific guideline variations (e.g. cardiomyopathy-specific frequency thresholds and functional assays). The software is modular, equipped with robust application program interfaces (APIs), and available under a free open source license and as a cloud-hosted web service, thus facilitating both stand-alone use and integration with existing variant curation and interpretation systems. The Pathogenicity Calculator is accessible at http://calculator.clinicalgenome.org . CONCLUSIONS By enabling evidence-based reasoning about the pathogenicity of genetic variants and by documenting supporting evidence, the Calculator contributes toward the creation of a knowledge commons and more accurate interpretation of sequence variants in research and clinical care.
Collapse
Affiliation(s)
- Ronak Y Patel
- Baylor College of Medicine, Houston, TX, 77030, USA.
| | - Neethu Shah
- Baylor College of Medicine, Houston, TX, 77030, USA
| | | | | | | | | | - Aaron Baker
- Baylor College of Medicine, Houston, TX, 77030, USA
| | - Kevin Riehle
- Baylor College of Medicine, Houston, TX, 77030, USA
| | - Hailin Chen
- Baylor College of Medicine, Houston, TX, 77030, USA
| | | | - Chris Bizon
- The Renaissance Computing Institute, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27517, USA
| | - Shawn Rynearson
- University of Utah Hospitals and Clinics, University of Utah, Salt Lake City, UT, 84112, USA
| | - Tristan Nelson
- Geisinger autism and developmental medicine, Lewisburg, PA, 17837, USA
| | - Gail P Jarvik
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, 98195, USA
| | - Heidi L Rehm
- Laboratory for Molecular Medicine, Partners HealthCare Personalized Medicine, Cambridge, MA, 02139, USA
- Brigham & Women's Hospital and Harvard Medical School, Boston, MA, 02115, USA
| | - Steven M Harrison
- Laboratory for Molecular Medicine, Partners HealthCare Personalized Medicine, Cambridge, MA, 02139, USA
| | - Danielle Azzariti
- Laboratory for Molecular Medicine, Partners HealthCare Personalized Medicine, Cambridge, MA, 02139, USA
| | - Bradford Powell
- Department of Genetics, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27514, USA
| | - Larry Babb
- GeneInsight, Sunquest Information System, Boston, MA, 02210, USA
| | | | | |
Collapse
|
12
|
Hollister EB, Riehle K, Luna RA, Weidler EM, Rubio-Gonzales M, Mistretta TA, Raza S, Doddapaneni HV, Metcalf GA, Muzny DM, Gibbs RA, Petrosino JF, Shulman RJ, Versalovic J. Structure and function of the healthy pre-adolescent pediatric gut microbiome. Microbiome 2015; 3:36. [PMID: 26306392 PMCID: PMC4550057 DOI: 10.1186/s40168-015-0101-x] [Citation(s) in RCA: 237] [Impact Index Per Article: 26.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2015] [Accepted: 08/12/2015] [Indexed: 05/20/2023]
Abstract
BACKGROUND The gut microbiome influences myriad host functions, including nutrient acquisition, immune modulation, brain development, and behavior. Although human gut microbiota are recognized to change as we age, information regarding the structure and function of the gut microbiome during childhood is limited. Using 16S rRNA gene and shotgun metagenomic sequencing, we characterized the structure, function, and variation of the healthy pediatric gut microbiome in a cohort of school-aged, pre-adolescent children (ages 7-12 years). We compared the healthy pediatric gut microbiome with that of healthy adults previously recruited from the same region (Houston, TX, USA). RESULTS Although healthy children and adults harbored similar numbers of taxa and functional genes, their composition and functional potential differed significantly. Children were enriched in Bifidobacterium spp., Faecalibacterium spp., and members of the Lachnospiraceae, while adults harbored greater abundances of Bacteroides spp. From a functional perspective, significant differences were detected with respect to the relative abundances of genes involved in vitamin synthesis, amino acid degradation, oxidative phosphorylation, and triggering mucosal inflammation. Children's gut communities were enriched in functions which may support ongoing development, while adult communities were enriched in functions associated with inflammation, obesity, and increased risk of adiposity. CONCLUSIONS Previous studies suggest that the human gut microbiome is relatively stable and adult-like after the first 1 to 3 years of life. Our results suggest that the healthy pediatric gut microbiome harbors compositional and functional qualities that differ from those of healthy adults and that the gut microbiome may undergo a more prolonged development than previously suspected.
Collapse
Affiliation(s)
- Emily B Hollister
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX, USA.
- Texas Children's Microbiome Center, Department of Pathology, Texas Children's Hospital, Houston, TX, USA.
| | - Kevin Riehle
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Bioinformatics Research Laboratory, Baylor College of Medicine, Houston, TX, USA
| | - Ruth Ann Luna
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX, USA
- Texas Children's Microbiome Center, Department of Pathology, Texas Children's Hospital, Houston, TX, USA
| | - Erica M Weidler
- Department of Pediatrics, Baylor College of Medicine, Houston, TX, USA
- Children's Nutrition Research Center, Houston, TX, USA
- Pediatric Gastroenterology, Hepatology, and Nutrition, Texas Children's Hospital, Houston, TX, USA
| | - Michelle Rubio-Gonzales
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX, USA
- Texas Children's Microbiome Center, Department of Pathology, Texas Children's Hospital, Houston, TX, USA
| | - Toni-Ann Mistretta
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX, USA
- Texas Children's Microbiome Center, Department of Pathology, Texas Children's Hospital, Houston, TX, USA
| | - Sabeen Raza
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX, USA
- Texas Children's Microbiome Center, Department of Pathology, Texas Children's Hospital, Houston, TX, USA
| | | | - Ginger A Metcalf
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Donna M Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Richard A Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Joseph F Petrosino
- Department of Molecular Virology & Microbiology, Baylor College of Medicine, Houston, TX, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Alkek Center for Metagenomics and Microbiome Research, Baylor College of Medicine, Houston, TX, USA
| | - Robert J Shulman
- Department of Pediatrics, Baylor College of Medicine, Houston, TX, USA
- Children's Nutrition Research Center, Houston, TX, USA
- Pediatric Gastroenterology, Hepatology, and Nutrition, Texas Children's Hospital, Houston, TX, USA
| | - James Versalovic
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX, USA
- Texas Children's Microbiome Center, Department of Pathology, Texas Children's Hospital, Houston, TX, USA
| |
Collapse
|
13
|
Samuels ML, Roth M, Srinivasan P, Riehle K, Brufsky A, Hartmaier R, Boone D, Harris RA, Wu CC, Oesterreich S, Lee A, Milosavljevic A. Abstract 800: Simultaneous methylation and mutational analysis of breast cancer genomes using droplet-based targeted sequencing. Cancer Res 2013. [DOI: 10.1158/1538-7445.am2013-800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Targeted sequencing using droplet-based PCR provides an established method to selectively and uniformly amplify thousands of genomic regions of interest for deep, cost-effective next generation sequencing. Recently, this approach has been adapted for the analysis of bisulfite-treated DNA, enabling both quantitative epigenomic analysis (strand-specific determination of cytosine methylation status with single-base pair resolution) and genomic mutation detection. Simultaneous collection of both methylation and genomic aberration information with a single method makes efficient use of precious clinical material and allows both researchers and clinicians to use this information to sub-type cancers.
Here we report on our sequencing results using a single breast cancer targeted panel for sequence specific analysis of both epigenomic methylation and genomic mutation in breast tumor samples. The Breast Cancer MethylSeq Panel includes PCR primer pairs designed to amplify 1000 targets, leaving room for an additional 3000 target loci of interest to be added by individual researchers in the future. This single panel provides information about both loci-specific CpG methylation and genomic mutations, enabling analysis of multiple content types including: A) cell subtype (CELL); B) genomic mutation (MUT); C) surrogate gene expression subtypes using promoter CpG islands or other methylation marks (EXP); D) copy-number variation (CNV); E) Loss of Heterozygosity (LOH); and F) methylation-mediated gene silencing and loss of imprinting (IMP). In addition, miRNA associated with breast cancer will be targeted for analysis (MIR).
The Breast Cancer MethylSeq Panel was tested and validated with a set of 64 bisulfite-converted DNA samples that included breast cancer cell lines, a range of ER positive and negative tumors and TCGA samples, and a set of 8 metastatic tumors from a single individual. Overall Illumina HiSeq sequencing success metrics, concordance with metadata on the various target classes (A-F above) using validated controls, and results from the patient samples will be presented.
The Breast Cancer MethylSeq Panel allows simultaneous analysis of epigenomic and genomic sequence alterations associated with breast tumor samples, enabling tumor sub-typing and correlation with clinical outcomes. This ‘core’ panel provides a unique (expandable) screening tool for the breast cancer research community, and a paradigm for efficient analysis of multiple endpoints from a single assay for other diseases.
Citation Format: Michael L. Samuels, Matt Roth, Preethi Srinivasan, Kevin Riehle, Adam Brufsky, Ryan Hartmaier, David Boone, R. Alan Harris, Chia-Chin Wu, Steffi Oesterreich, Adrian Lee, Aleksandar Milosavljevic. Simultaneous methylation and mutational analysis of breast cancer genomes using droplet-based targeted sequencing. [abstract]. In: Proceedings of the 104th Annual Meeting of the American Association for Cancer Research; 2013 Apr 6-10; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2013;73(8 Suppl):Abstract nr 800. doi:10.1158/1538-7445.AM2013-800
Collapse
Affiliation(s)
| | - Matt Roth
- 2Baylor College of Medicine, Houston, TX
| | | | | | - Adam Brufsky
- 3University of Pittsburgh Cancer Institute, Pittsburgh, PA
| | - Ryan Hartmaier
- 3University of Pittsburgh Cancer Institute, Pittsburgh, PA
| | - David Boone
- 3University of Pittsburgh Cancer Institute, Pittsburgh, PA
| | | | | | | | - Adrian Lee
- 3University of Pittsburgh Cancer Institute, Pittsburgh, PA
| | | |
Collapse
|
14
|
Aagaard K, Ganu R, Ma J, Racusin D, Arndt M, Riehle K, Petrosino J, Versalovic J. 8: Whole metagenomic shotgun sequencing reveals a vibrant placental microbiome harboring metabolic function. Am J Obstet Gynecol 2013. [DOI: 10.1016/j.ajog.2012.10.182] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
15
|
Oesterreich S, Kitchens C, Gavin P, Wu CC, Riehle K, Coarfa C, Edwards D, Schiff R, Milosavljevic A, Lee A. Abstract P6-05-12: Lack of Frequent Estrogen Receptor Mutation in Primary Breast Tumors. Cancer Res 2012. [DOI: 10.1158/0008-5472.sabcs12-p6-05-12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
The estrogen receptor alpha (ESR1) is a critical driver of breast tumorigenesis, and as such has been a target for therapy for many years. Early reports using Sanger sequencing and conformational assays reported that there were little if any mutations in ER-positive tumors, although a few somatic mutations have recently been described in tumors and cell lines. We set out to validate the authenticity of these reported somatic mutations by performing analysis of ER DNA sequence variants (DSVs) in 66 ER+ breast tumors, and 39 breast cancer cell lines. We utilized a combined approach of target capture sequencing of ESR1, and subsequent testing for novel and previously identified DSVs using an orthogonal mass spectrometry based sequencing approach. A DNA capture approach was designed to capture all exons, flanking splice sites, and the 3′ UTR of ESR1. DNA was captured and sequenced using SOLiD.
Using stringency with a cut-off with at least 2 variant reads, variants in at least 20% of the reads, and requiring evidence for the variants in both strands, we identified 4 previously reported SNPs (rs2077647; rs46432; rs1801132; rs2228480), and two previously reported somatic mutations H6Y and N532Y. We developed mass-spectrometry assays, including controls, for these DSVs, and in addition, we included 10 DSVs which had previously been reported as somatic mutations in breast (and other) cancer. Applying mass-spec based analysis to breast tumors and cell lines, we were able to confirm previously reported SNPs, and one previously reported mutations (H6Y). This mutation was found in one breast tumor and one cell line. None of the other reported somatic mutations could be confirmed in tumors or cell lines. Functional assays on the ESR1H6Y DSV failed to identify differences to wildtype receptor. The lack of a phenotype, and the infrequent occurrence of this DSV do not support a major driver role for ESR1H6Y.
This analysis suggests that ESR1 mutations are a rare event in ER+ primary breast cancer.
Citation Information: Cancer Res 2012;72(24 Suppl):Abstract nr P6-05-12.
Collapse
Affiliation(s)
- S Oesterreich
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| | - C Kitchens
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| | - P Gavin
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| | - C-C Wu
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| | - K Riehle
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| | - C Coarfa
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| | - D Edwards
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| | - R Schiff
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| | - A Milosavljevic
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| | - A Lee
- University of Pittsburgh Cancer Institure, Pittsburgh, PA; Baylor College of Medicine, Houston, TX; NSABP, Pittsburgh, PA
| |
Collapse
|
16
|
Riehle K, Coarfa C, Jackson A, Ma J, Tandon A, Paithankar S, Raghuraman S, Mistretta TA, Saulnier D, Raza S, Diaz MA, Shulman R, Aagaard K, Versalovic J, Milosavljevic A. The Genboree Microbiome Toolset and the analysis of 16S rRNA microbial sequences. BMC Bioinformatics 2012; 13 Suppl 13:S11. [PMID: 23320832 PMCID: PMC3426808 DOI: 10.1186/1471-2105-13-s13-s11] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND Microbial metagenomic analyses rely on an increasing number of publicly available tools. Installation, integration, and maintenance of the tools poses significant burden on many researchers and creates a barrier to adoption of microbiome analysis, particularly in translational settings. METHODS To address this need we have integrated a rich collection of microbiome analysis tools into the Genboree Microbiome Toolset and exposed them to the scientific community using the Software-as-a-Service model via the Genboree Workbench. The Genboree Microbiome Toolset provides an interactive environment for users at all bioinformatic experience levels in which to conduct microbiome analysis. The Toolset drives hypothesis generation by providing a wide range of analyses including alpha diversity and beta diversity, phylogenetic profiling, supervised machine learning, and feature selection. RESULTS We validate the Toolset in two studies of the gut microbiota, one involving obese and lean twins, and the other involving children suffering from the irritable bowel syndrome. CONCLUSIONS By lowering the barrier to performing a comprehensive set of microbiome analyses, the Toolset empowers investigators to translate high-volume sequencing data into valuable biomedical discoveries.
Collapse
Affiliation(s)
- Kevin Riehle
- Molecular & Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Cristian Coarfa
- Molecular & Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Andrew Jackson
- Molecular & Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Jun Ma
- Obstetrics and Gynecology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Arpit Tandon
- Molecular & Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sameer Paithankar
- Molecular & Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sriram Raghuraman
- Molecular & Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Toni-Ann Mistretta
- Pathology & Immunology, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Sabeen Raza
- Pathology & Immunology, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Robert Shulman
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Kjersti Aagaard
- Obstetrics and Gynecology, Baylor College of Medicine, Houston, TX 77030, USA
| | - James Versalovic
- Pathology & Immunology, Baylor College of Medicine, Houston, TX 77030, USA
| | | |
Collapse
|
17
|
Aagaard K, Riehle K, Ma J, Segata N, Mistretta TA, Coarfa C, Raza S, Rosenbaum S, Van den Veyver I, Milosavljevic A, Gevers D, Huttenhower C, Petrosino J, Versalovic J. A metagenomic approach to characterization of the vaginal microbiome signature in pregnancy. PLoS One 2012; 7:e36466. [PMID: 22719832 PMCID: PMC3374618 DOI: 10.1371/journal.pone.0036466] [Citation(s) in RCA: 433] [Impact Index Per Article: 36.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2011] [Accepted: 04/06/2012] [Indexed: 12/26/2022] Open
Abstract
While current major national research efforts (i.e., the NIH Human Microbiome Project) will enable comprehensive metagenomic characterization of the adult human microbiota, how and when these diverse microbial communities take up residence in the host and during reproductive life are unexplored at a population level. Because microbial abundance and diversity might differ in pregnancy, we sought to generate comparative metagenomic signatures across gestational age strata. DNA was isolated from the vagina (introitus, posterior fornix, midvagina) and the V5V3 region of bacterial 16S rRNA genes were sequenced (454FLX Titanium platform). Sixty-eight samples from 24 healthy gravidae (18 to 40 confirmed weeks) were compared with 301 non-pregnant controls (60 subjects). Generated sequence data were quality filtered, taxonomically binned, normalized, and organized by phylogeny and into operational taxonomic units (OTU); principal coordinates analysis (PCoA) of the resultant beta diversity measures were used for visualization and analysis in association with sample clinical metadata. Altogether, 1.4 gigabytes of data containing >2.5 million reads (averaging 6,837 sequences/sample of 493 nt in length) were generated for computational analyses. Although gravidae were not excluded by virtue of a posterior fornix pH >4.5 at the time of screening, unique vaginal microbiome signature encompassing several specific OTUs and higher-level clades was nevertheless observed and confirmed using a combination of phylogenetic, non-phylogenetic, supervised, and unsupervised approaches. Both overall diversity and richness were reduced in pregnancy, with dominance of Lactobacillus species (L. iners crispatus, jensenii and johnsonii, and the orders Lactobacillales (and Lactobacillaceae family), Clostridiales, Bacteroidales, and Actinomycetales. This intergroup comparison using rigorous standardized sampling protocols and analytical methodologies provides robust initial evidence that the vaginal microbial 16S rRNA gene catalogue uniquely differs in pregnancy, with variance of taxa across vaginal subsite and gestational age.
Collapse
Affiliation(s)
- Kjersti Aagaard
- Department of Obstetrics & Gynecology, Division of Maternal-Fetal Medicine, Baylor College of Medicine and Texas Children's Hospital, Houston, Texas, United States of America.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Ganu R, Ma J, Ayvaz T, Riehle K, Coarfa C, Schaible T, Milosavljevic A, Versalovic J, Petrosino J, Kellermayer R, Aagaard K. 435: Linking barkers hypothesis on the developmental origins of adult disease with the hygiene hypothesis: maternal methyl-donor supplementation (MDS) significantly alters the fetal liver microbiome. Am J Obstet Gynecol 2012. [DOI: 10.1016/j.ajog.2011.10.453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
19
|
Saulnier DM, Riehle K, Mistretta TA, Diaz MA, Mandal D, Raza S, Weidler EM, Qin X, Coarfa C, Milosavljevic A, Petrosino JF, Highlander S, Gibbs R, Lynch SV, Shulman RJ, Versalovic J. Gastrointestinal microbiome signatures of pediatric patients with irritable bowel syndrome. Gastroenterology 2011; 141:1782-91. [PMID: 21741921 PMCID: PMC3417828 DOI: 10.1053/j.gastro.2011.06.072] [Citation(s) in RCA: 472] [Impact Index Per Article: 36.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Revised: 06/15/2011] [Accepted: 06/24/2011] [Indexed: 12/14/2022]
Abstract
BACKGROUND & AIMS The intestinal microbiomes of healthy children and pediatric patients with irritable bowel syndrome (IBS) are not well defined. Studies in adults have indicated that the gastrointestinal microbiota could be involved in IBS. METHODS We analyzed 71 samples from 22 children with IBS (pediatric Rome III criteria) and 22 healthy children, ages 7-12 years, by 16S ribosomal RNA gene sequencing, with an average of 54,287 reads/stool sample (average 454 read length = 503 bases). Data were analyzed using phylogenetic-based clustering (Unifrac), or an operational taxonomic unit (OTU) approach using a supervised machine learning tool (randomForest). Most samples were also hybridized to a microarray that can detect 8741 bacterial taxa (16S rRNA PhyloChip). RESULTS Microbiomes associated with pediatric IBS were characterized by a significantly greater percentage of the class γ-proteobacteria (0.07% vs 0.89% of total bacteria, respectively; P < .05); 1 prominent component of this group was Haemophilus parainfluenzae. Differences highlighted by 454 sequencing were confirmed by high-resolution PhyloChip analysis. Using supervised learning techniques, we were able to classify different subtypes of IBS with a success rate of 98.5%, using limited sets of discriminant bacterial species. A novel Ruminococcus-like microbe was associated with IBS, indicating the potential utility of microbe discovery for gastrointestinal disorders. A greater frequency of pain correlated with an increased abundance of several bacterial taxa from the genus Alistipes. CONCLUSIONS Using 16S metagenomics by PhyloChip DNA hybridization and deep 454 pyrosequencing, we associated specific microbiome signatures with pediatric IBS. These findings indicate the important association between gastrointestinal microbes and IBS in children; these approaches might be used in diagnosis of functional bowel disorders in pediatric patients.
Collapse
Affiliation(s)
- Delphine M. Saulnier
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX,Department of Pathology, Texas Children's Hospital, Houston, TX,NIZO, Ede, The Netherlands
| | - Kevin Riehle
- Department of Molecular & Human Genetics, and Baylor College of Medicine, Houston, TX,Bioinformatics Research Laboratory, Baylor College of Medicine, Houston, TX
| | - Toni-Ann Mistretta
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX,Department of Pathology, Texas Children's Hospital, Houston, TX
| | - Maria-Alejandra Diaz
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX,Department of Pathology, Texas Children's Hospital, Houston, TX
| | - Debasmita Mandal
- Department of Medicine, University of California San Francisco, San Francisco, CA
| | - Sabeen Raza
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX,Department of Pathology, Texas Children's Hospital, Houston, TX
| | - Erica M. Weidler
- Department of Pediatrics, Baylor College of Medicine, Houston, TX,Children's Nutrition Research Center, Houston, TX
| | - Xiang Qin
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX
| | - Cristian Coarfa
- Department of Molecular & Human Genetics, and Baylor College of Medicine, Houston, TX,Bioinformatics Research Laboratory, Baylor College of Medicine, Houston, TX
| | - Aleksandar Milosavljevic
- Department of Molecular & Human Genetics, and Baylor College of Medicine, Houston, TX,Bioinformatics Research Laboratory, Baylor College of Medicine, Houston, TX
| | - Joseph F. Petrosino
- Department of Molecular Virology & Microbiology, Baylor College of Medicine, Houston, TX,Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX,Center for Metagenomics and Microbiome Research, Baylor College of Medicine, Houston, TX
| | - Sarah Highlander
- Department of Molecular Virology & Microbiology, Baylor College of Medicine, Houston, TX,Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX
| | - Richard Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX
| | - Susan V. Lynch
- Department of Medicine, University of California San Francisco, San Francisco, CA
| | - Robert J. Shulman
- Department of Pediatrics, Baylor College of Medicine, Houston, TX,Children's Nutrition Research Center, Houston, TX
| | - James Versalovic
- Department of Pathology & Immunology, Baylor College of Medicine, Houston, TX,Department of Molecular Virology & Microbiology, Baylor College of Medicine, Houston, TX,Department of Molecular & Human Genetics, and Baylor College of Medicine, Houston, TX,Department of Pediatrics, Baylor College of Medicine, Houston, TX,Department of Pathology, Texas Children's Hospital, Houston, TX
| |
Collapse
|
20
|
Aagaard K, Versalovic J, Petrosino J, Mistretta TA, Riehle K, Coarfa C, Raza S, Dowlin D, Rosenbaum S, Van den Veyver I, Milosavljevic A. 73: Metagenomic-based approach to a comprehensive characterization of the vaginal microbiome signature in pregnancy. Am J Obstet Gynecol 2011. [DOI: 10.1016/j.ajog.2010.10.087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|