1
|
Blau T, Chades I, Ong CS. Machine Learning for Biological Design. Methods Mol Biol 2024; 2760:319-344. [PMID: 38468097 DOI: 10.1007/978-1-0716-3658-9_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
We briefly present machine learning approaches for designing better biological experiments. These approaches build on machine learning predictors and provide additional tools to guide scientific discovery. There are two different kinds of objectives when designing better experiments: to improve the predictive model or to improve the experimental outcome. We survey five different approaches for adaptive experimental design that iteratively search the space of possible experiments while adapting to measured data. The approaches are Bayesian optimization, bandits, reinforcement learning, optimal experimental design, and active learning. These machine learning approaches have shown promise in various areas of biology, and we provide broad guidelines to the practitioner and links to further resources.
Collapse
Affiliation(s)
- Tom Blau
- CSIRO, Data61, Eveleigh, NSW, Australia
| | | | | |
Collapse
|
2
|
Vidal G, Vitalis C, Matúte T, Núñez I, Federici F, Rudge TJ. Genetic Network Design Automation with LOICA. Methods Mol Biol 2024; 2760:393-412. [PMID: 38468100 DOI: 10.1007/978-1-0716-3658-9_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
Genetic design automation (GDA) is the use of computer-aided design (CAD) in designing genetic networks. GDA tools are necessary to create more complex synthetic genetic networks in a high-throughput fashion. At the core of these tools is the abstraction of a hierarchy of standardized components. The components' input, output, and interactions must be captured and parametrized from relevant experimental data. Simulations of genetic networks should use those parameters and include the experimental context to be compared with the experimental results.This chapter introduces Logical Operators for Integrated Cell Algorithms (LOICA), a Python package used for designing, modeling, and characterizing genetic networks using a simple object-oriented design abstraction. LOICA represents different biological and experimental components as classes that interact to generate models. These models can be parametrized by direct connection to the Flapjack experimental data management platform to characterize abstracted components with experimental data. The models can be simulated using stochastic simulation algorithms or ordinary differential equations with varying noise levels. The simulated data can be managed and published using Flapjack alongside experimental data for comparison. LOICA genetic network designs can be represented as graphs and plotted as networks for visual inspection and serialized as Python objects or in the Synthetic Biology Open Language (SBOL) format for sharing and use in other designs.
Collapse
Affiliation(s)
- Gonzalo Vidal
- Interdisciplinary Computing and Complex Biosystems, School of Computing, Newcastle University, Newcastle upon Tyne, UK
| | - Carolus Vitalis
- Department of Electrical, Computer, and Energy Engineering, University of Colorado Boulder, Boulder, CO, USA
| | - Tamara Matúte
- ANID-Millennium Science Initiative Program Millennium Institute for Integrative Biology (iBio), Santiago, Chile
- FONDAP Center for Genome Regulation, Santiago, Chile
- Institute for Biological and Medical Engineering Schools of Engineering, Medicine and Biological Sciences Pontificia Universidad Catolica de Chile, Santiago, Chile
| | - Isaac Núñez
- ANID-Millennium Science Initiative Program Millennium Institute for Integrative Biology (iBio), Santiago, Chile
- FONDAP Center for Genome Regulation, Santiago, Chile
- Institute for Biological and Medical Engineering Schools of Engineering, Medicine and Biological Sciences Pontificia Universidad Catolica de Chile, Santiago, Chile
| | - Fernán Federici
- ANID-Millennium Science Initiative Program Millennium Institute for Integrative Biology (iBio), Santiago, Chile
- FONDAP Center for Genome Regulation, Santiago, Chile
- Institute for Biological and Medical Engineering Schools of Engineering, Medicine and Biological Sciences Pontificia Universidad Catolica de Chile, Santiago, Chile
| | - Timothy J Rudge
- Interdisciplinary Computing and Complex Biosystems, School of Computing, Newcastle University, Newcastle upon Tyne, UK.
| |
Collapse
|
3
|
Abstract
The Synthetic Biology Open Language version 3 (SBOL3) provides a data model for representation of synthetic biology information across multiple scales and throughout the design-build-test-learn workflow. To support practical use of this data model, we have developed pySBOL3, a Python library that allows programmers to create and edit SBOL3 documents. Here we describe this library and key engineering decisions in its design. The resulting implementation is a compact and maintainable core that provides both a familiar, pythonic interface for manipulating SBOL3 objects as well as mechanisms for building additional extensions and representations on this base.
Collapse
Affiliation(s)
- Tom Mitchell
- Raytheon BBN Technologies, Cambridge, Massachusetts 02138, United States
| | - Jacob Beal
- Raytheon BBN Technologies, Cambridge, Massachusetts 02138, United States
| | - Bryan Bartley
- Raytheon BBN Technologies, Cambridge, Massachusetts 02138, United States
| |
Collapse
|
4
|
Vidal G, Vidal-Céspedes C, Rudge TJ. LOICA: Integrating Models with Data for Genetic Network Design Automation. ACS Synth Biol 2022; 11:1984-1990. [PMID: 35507566 PMCID: PMC9127962 DOI: 10.1021/acssynbio.1c00603] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Indexed: 11/30/2022]
Abstract
Genetic design automation tools are necessary to expand the scale and complexity of possible synthetic genetic networks. These tools are enabled by abstraction of a hierarchy of standardized components and devices. Abstracted elements must be parametrized from data derived from relevant experiments, and these experiments must be related to the part composition of the abstract components. Here we present Logical Operators for Integrated Cell Algorithms (LOICA), a Python package for designing, modeling, and characterizing genetic networks based on a simple object-oriented design abstraction. LOICA uses classes to represent different biological and experimental components, which generate models through their interactions. These models can be parametrized by direct connection to data contained in Flapjack so that abstracted components of designs can characterize themselves. Models can be simulated using continuous or stochastic methods and the data published and managed using Flapjack. LOICA also outputs SBOL3 descriptions and generates graph representations of genetic network designs.
Collapse
Affiliation(s)
- Gonzalo Vidal
- Institute
for Biological and Medical Engineering, Schools of Engineering, Biology,
and Medicine, Pontificia Universidad Católica
de Chile, Santiago 7820244, Chile
- Interdisciplinary
Computing and Complex BioSystems (ICOS) Research Group, School of
Computing, Newcastle University, Newcastle upon Tyne NE1
7RU, U.K.
| | - Carlos Vidal-Céspedes
- Institute
for Biological and Medical Engineering, Schools of Engineering, Biology,
and Medicine, Pontificia Universidad Católica
de Chile, Santiago 7820244, Chile
| | - Timothy J. Rudge
- Interdisciplinary
Computing and Complex BioSystems (ICOS) Research Group, School of
Computing, Newcastle University, Newcastle upon Tyne NE1
7RU, U.K.
| |
Collapse
|
5
|
Standage-Beier K, Tekel SJ, Brafman DA, Wang X. Prime Editing Guide RNA Design Automation Using PINE-CONE. ACS Synth Biol 2021; 10:422-427. [PMID: 33464043 PMCID: PMC7901017 DOI: 10.1021/acssynbio.0c00445] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Indexed: 12/18/2022]
Abstract
CRISPR-based technologies are paramount in genome engineering and synthetic biology. Prime editing (PE) is a technology capable of installing genomic edits without double-stranded DNA breaks (DSBs) or donor DNA. Prime editing guide RNAs (pegRNAs) simultaneously encode both guide and edit template sequences. They are more design intensive than CRISPR single guide RNAs (sgRNAs). As such, application of PE technology is hindered by the limited throughput of manual pegRNA design. To that end, we designed a software tool, Prime Induced Nucleotide Engineering Creator of New Edits (PINE-CONE), that enables high-throughput automated design of pegRNAs and prime editing strategies. PINE-CONE translates edit coordinates and sequences into pegRNA designs, accessory guides, and oligonucleotides for facile cloning workflows. To demonstrate PINE-CONE's utility in studying disease-relevant genotypes, we rapidly design a library of pegRNAs targeting Alzheimer's Disease single nucleotide polymorphisms (SNPs). Overall, PINE-CONE will accelerate the application of PEs in synthetic biology and biomedical research.
Collapse
Affiliation(s)
- Kylie Standage-Beier
- School
of Biological and Health Systems Engineering, Molecular and Cellular Biology
Graduate Program, School for Engineering of Matter, Transport and Energy, Arizona State University, Tempe, Arizona 85287, United States
| | - Stefan J. Tekel
- School
of Biological and Health Systems Engineering, Molecular and Cellular Biology
Graduate Program, School for Engineering of Matter, Transport and Energy, Arizona State University, Tempe, Arizona 85287, United States
| | - David A. Brafman
- School
of Biological and Health Systems Engineering, Molecular and Cellular Biology
Graduate Program, School for Engineering of Matter, Transport and Energy, Arizona State University, Tempe, Arizona 85287, United States
| | - Xiao Wang
- School
of Biological and Health Systems Engineering, Molecular and Cellular Biology
Graduate Program, School for Engineering of Matter, Transport and Energy, Arizona State University, Tempe, Arizona 85287, United States
| |
Collapse
|
6
|
Yáñez Feliú G, Earle Gómez B, Codoceo Berrocal V, Muñoz Silva M, Nuñez IN, Matute TF, Arce Medina A, Vidal G, Vitalis C, Dahlin J, Federici F, Rudge TJ. Flapjack: Data Management and Analysis for Genetic Circuit Characterization. ACS Synth Biol 2021; 10:183-191. [PMID: 33382586 DOI: 10.1021/acssynbio.0c00554] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Characterization is fundamental to the design, build, test, learn (DBTL) cycle for engineering synthetic genetic circuits. Components must be described in such a way as to account for their behavior in a range of contexts. Measurements and associated metadata, including part composition, constitute the test phase of the DBTL cycle. These data may consist of measurements of thousands of circuits, measured in hundreds of conditions, in multiple assays potentially performed in different laboratories and using different techniques. In order to inform the learn phase this large volume of data must be filtered, collated, and analyzed. Characterization consists of using this data to parametrize models of component function in different contexts, and combining them to predict behaviors of novel circuits. Tools to store, organize, share, and analyze large volumes of measurement and metadata are therefore essential to linking the test phase to the build and learn phases, closing the loop of the DBTL cycle. Here we present such a system, implemented as a web app with a backend data registry and analysis engine. An interactive frontend provides powerful querying, plotting, and analysis tools, and we provide a REST API and Python package for full integration with external build and learn software. All measurements are associated with circuit part composition via SBOL (Synthetic Biology Open Language). We demonstrate our tool by characterizing a range of genetic components and circuits according to composition and context.
Collapse
Affiliation(s)
- Guillermo Yáñez Feliú
- Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
| | - Benjamín Earle Gómez
- Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
| | - Verner Codoceo Berrocal
- Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
| | - Macarena Muñoz Silva
- Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
| | - Isaac N Nuñez
- Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
- Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
| | - Tamara F Matute
- Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
- Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
| | - Anibal Arce Medina
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
- Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
| | - Gonzalo Vidal
- Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
| | - Carlos Vitalis
- Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
| | - Jonathan Dahlin
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark
| | - Fernán Federici
- Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
- FONDAP, Center for Genome Regulation, Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
| | - Timothy J Rudge
- Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
- Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
| |
Collapse
|
7
|
Storch M, Haines MC, Baldwin GS. DNA-BOT: a low-cost, automated DNA assembly platform for synthetic biology. Synth Biol (Oxf) 2020; 5:ysaa010. [PMID: 32995552 PMCID: PMC7476404 DOI: 10.1093/synbio/ysaa010] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Revised: 06/05/2020] [Accepted: 06/26/2020] [Indexed: 01/10/2023] Open
Abstract
Multi-part DNA assembly is the physical starting point for many projects in Synthetic and Molecular Biology. The ability to explore a genetic design space by building extensive libraries of DNA constructs is essential for creating programmed biological systems. With multiple DNA assembly methods and standards adopted in the Synthetic Biology community, automation of the DNA assembly process is now receiving serious attention. Automation will enable larger builds using less researcher time, while increasing the accessible design space. However, these benefits currently incur high costs for both equipment and consumables. Here, we address this limitation by introducing low-cost DNA assembly with BASIC on OpenTrons (DNA-BOT). For this purpose, we developed an open-source software package and demonstrated the performance of DNA-BOT by simultaneously assembling 88 constructs composed of 10 genetic parts, evaluating the promoter, ribosome binding site and gene order design space for a three-gene operon. All 88 constructs were assembled with high accuracy, at a consumables cost of $1.50–$5.50 per construct. This illustrates the efficiency, accuracy and affordability of DNA-BOT, making it accessible for most labs and democratizing automated DNA assembly.
Collapse
Affiliation(s)
- Marko Storch
- Department of Life Sciences, Imperial College London, London, SW7 2AZ, UK.,Imperial College Centre for Synthetic Biology, Imperial College London, London, SW7 2AZ, UK.,London Biofoundry, Imperial College Translation & Innovation Hub, London, W12 0BZ, UK
| | - Matthew C Haines
- Department of Life Sciences, Imperial College London, London, SW7 2AZ, UK.,Imperial College Centre for Synthetic Biology, Imperial College London, London, SW7 2AZ, UK
| | - Geoff S Baldwin
- Department of Life Sciences, Imperial College London, London, SW7 2AZ, UK.,Imperial College Centre for Synthetic Biology, Imperial College London, London, SW7 2AZ, UK
| |
Collapse
|
8
|
Crowther M, Grozinger L, Pocock M, Taylor CPD, McLaughlin JA, Mısırlı G, Bartley BA, Beal J, Goñi-Moreno A, Wipat A. ShortBOL: A Language for Scripting Designs for Engineered Biological Systems Using Synthetic Biology Open Language (SBOL). ACS Synth Biol 2020; 9:962-966. [PMID: 32129980 DOI: 10.1021/acssynbio.9b00470] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
The Synthetic Biology Open Language (SBOL) is an emerging synthetic biology data exchange standard, designed primarily for unambiguous and efficient machine-to-machine communication. However, manual editing of SBOL is generally difficult for nontrivial designs. Here, we describe ShortBOL, a lightweight SBOL scripting language that bridges the gap between manual editing, visual design tools, and direct programming. ShortBOL is a shorthand textual language developed to enable users to create SBOL designs quickly and easily, without requiring strong programming skills or visual design tools.
Collapse
Affiliation(s)
- Matthew Crowther
- School of Computing, Newcastle University, Urban Sciences Building, Science Square, Newcastle upon Tyne NE4 5TG, U.K
| | - Lewis Grozinger
- School of Computing, Newcastle University, Urban Sciences Building, Science Square, Newcastle upon Tyne NE4 5TG, U.K
| | - Matthew Pocock
- School of Computing, Newcastle University, Urban Sciences Building, Science Square, Newcastle upon Tyne NE4 5TG, U.K
| | - Christopher P. D. Taylor
- School of Computing, Newcastle University, Urban Sciences Building, Science Square, Newcastle upon Tyne NE4 5TG, U.K
| | - James A. McLaughlin
- School of Computing, Newcastle University, Urban Sciences Building, Science Square, Newcastle upon Tyne NE4 5TG, U.K
| | - Göksel Mısırlı
- School of Computing and Mathematics, Keele University, Keele, Newcastle ST5 5BG, U.K
| | - Bryan A. Bartley
- Raytheon BBN Technologies, Cambridge, Massachusetts 02138, United States
| | - Jacob Beal
- Raytheon BBN Technologies, Cambridge, Massachusetts 02138, United States
| | - Angel Goñi-Moreno
- School of Computing, Newcastle University, Urban Sciences Building, Science Square, Newcastle upon Tyne NE4 5TG, U.K
- Centro de Biotecnologı́a y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politénica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnologı́a Agraria y Alimentaria (INIA) Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain
| | - Anil Wipat
- School of Computing, Newcastle University, Urban Sciences Building, Science Square, Newcastle upon Tyne NE4 5TG, U.K
| |
Collapse
|
9
|
Martínez-García E, Goñi-Moreno A, Bartley B, McLaughlin J, Sánchez-Sampedro L, Pascual del Pozo H, Prieto Hernández C, Marletta AS, De Lucrezia D, Sánchez-Fernández G, Fraile S, de Lorenzo V. SEVA 3.0: an update of the Standard European Vector Architecture for enabling portability of genetic constructs among diverse bacterial hosts. Nucleic Acids Res 2020; 48:D1164-D1170. [PMID: 31740968 PMCID: PMC7018797 DOI: 10.1093/nar/gkz1024] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2019] [Revised: 10/17/2019] [Accepted: 10/18/2019] [Indexed: 11/29/2022] Open
Abstract
The Standard European Vector Architecture 3.0 database (SEVA-DB 3.0, http://seva.cnb.csic.es) is the update of the platform launched in 2013 both as a web-based resource and as a material repository of formatted genetic tools (mostly plasmids) for analysis, construction and deployment of complex bacterial phenotypes. The period between the first version of SEVA-DB and the present time has witnessed several technical, computational and conceptual advances in genetic/genomic engineering of prokaryotes that have enabled upgrading of the utilities of the updated database. Novelties include not only a more user-friendly web interface and many more plasmid vectors, but also new links of the plasmids to advanced bioinformatic tools. These provide an intuitive visualization of the constructs at stake and a range of virtual manipulations of DNA segments that were not possible before. Finally, the list of canonical SEVA plasmids is available in machine-readable SBOL (Synthetic Biology Open Language) format. This ensures interoperability with other platforms and affords simulations of their behaviour under different in vivo conditions. We argue that the SEVA-DB will remain a useful resource for extending Synthetic Biology approaches towards non-standard bacterial species as well as genetically programming new prokaryotic chassis for a suite of fundamental and biotechnological endeavours.
Collapse
Affiliation(s)
- Esteban Martínez-García
- Systems Biology Program, Centro Nacional de Biotecnología CSIC, Campus de la Universidad Autónoma de Madrid, 28049 Spain
| | | | | | | | | | | | | | | | | | | | - Sofía Fraile
- Systems Biology Program, Centro Nacional de Biotecnología CSIC, Campus de la Universidad Autónoma de Madrid, 28049 Spain
| | - Víctor de Lorenzo
- Systems Biology Program, Centro Nacional de Biotecnología CSIC, Campus de la Universidad Autónoma de Madrid, 28049 Spain
| |
Collapse
|
10
|
Mısırlı G, Taylor R, Goñi-Moreno A, McLaughlin JA, Myers C, Gennari JH, Lord P, Wipat A. SBOL-OWL: An Ontological Approach for Formal and Semantic Representation of Synthetic Biology Information. ACS Synth Biol 2019; 8:1498-1514. [PMID: 31059645 DOI: 10.1021/acssynbio.8b00532] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Standard representation of data is key for the reproducibility of designs in synthetic biology. The Synthetic Biology Open Language (SBOL) has already emerged as a data standard to represent information about genetic circuits, and it is based on capturing data using graphs. The language provides the syntax using a free text document that is accessible to humans only. This paper describes SBOL-OWL, an ontology for a machine understandable definition of SBOL. This ontology acts as a semantic layer for genetic circuit designs. As a result, computational tools can understand the meaning of design entities in addition to parsing structured SBOL data. SBOL-OWL not only describes how genetic circuits can be constructed computationally, it also facilitates the use of several existing Semantic Web tools for synthetic biology. This paper demonstrates some of these features, for example, to validate designs and check for inconsistencies. Through the use of SBOL-OWL, queries can be simplified and become more intuitive. Moreover, existing reasoners can be used to infer information about genetic circuit designs that cannot be directly retrieved using existing querying mechanisms. This ontological representation of the SBOL standard provides a new perspective to the verification, representation, and querying of information about genetic circuits and is important to incorporate complex design information via the integration of biological ontologies.
Collapse
Affiliation(s)
- Göksel Mısırlı
- School of Computing and Mathematics, Keele University, Keele, Staffordshire ST5 5BG, UK
| | - Renee Taylor
- School of Computing and Mathematics, Keele University, Keele, Staffordshire ST5 5BG, UK
| | - Angel Goñi-Moreno
- School of Computing, Newcastle University, Newcastle upon Tyne NE4 5TG, UK
| | | | - Chris Myers
- Department of Electrical and Computer Engineering, University of Utah, Salt Lake City, Utah 84112, United States
| | - John H. Gennari
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington 98195, United States
| | - Phillip Lord
- School of Computing, Newcastle University, Newcastle upon Tyne NE4 5TG, UK
| | - Anil Wipat
- School of Computing, Newcastle University, Newcastle upon Tyne NE4 5TG, UK
| |
Collapse
|
11
|
Roehner N, Bartley B, Beal J, McLaughlin J, Pocock M, Zhang M, Zundel Z, Myers CJ. Specifying Combinatorial Designs with the Synthetic Biology Open Language (SBOL). ACS Synth Biol 2019; 8:1519-1523. [PMID: 31260271 DOI: 10.1021/acssynbio.9b00092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
As improvements in DNA synthesis technology and assembly methods make combinatorial assembly of genetic constructs increasingly accessible, methods for representing genetic constructs likewise need to improve to handle the exponential growth of combinatorial design space. To this end, we present a community accepted extension of the SBOL data standard that allows for the efficient and flexible encoding of combinatorial designs. This extension includes data structures for representing genetic designs with "variable" components that can be implemented by choosing one of many linked designs for existing genetic parts or constructs. We demonstrate the representational power of the SBOL combinatorial design extension through case studies on metabolic pathway design and genetic circuit design, and we report the expansion of the SBOLDesigner software tool to support users in creating and modifying combinatorial designs in SBOL.
Collapse
Affiliation(s)
- Nicholas Roehner
- Raytheon BBN Technologies, Cambridge, Massachusetts 02138, United States
| | - Bryan Bartley
- Raytheon BBN Technologies, Cambridge, Massachusetts 02138, United States
| | - Jacob Beal
- Raytheon BBN Technologies, Cambridge, Massachusetts 02138, United States
| | | | - Matthew Pocock
- Turing Ate My Hamster, Ltd., Tyne and Wear, NE27 0RT, UK
| | - Michael Zhang
- University of Utah, Salt Lake City, Utah 84112, United States
| | - Zach Zundel
- University of Utah, Salt Lake City, Utah 84112, United States
| | - Chris J. Myers
- University of Utah, Salt Lake City, Utah 84112, United States
| |
Collapse
|