1
|
Aguilera LU, Weber LM, Ron E, King CR, Öcal K, Popinga A, Cook J, May MP, Raymond WS, Fox ZR, Forero-Quintero LS, Forman JR, David A, Munsky B. Methods in quantitative biology-from analysis of single-cell microscopy images to inference of predictive models for stochastic gene expression. Phys Biol 2025; 22:042001. [PMID: 40388970 DOI: 10.1088/1478-3975/adda85] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2025] [Accepted: 05/19/2025] [Indexed: 05/21/2025]
Abstract
The field of quantitative biology (q-bio) seeks to provide precise and testable explanations for observed biological phenomena by applying mathematical and computational methods. The central goals of q-bio are to (1) systematically propose quantitative hypotheses in the form of mathematical models, (2) demonstrate that these models faithfully capture a specific essence of a biological process, and (3) correctly forecast the dynamics of the process in new, and previously untested circumstances. Achieving these goals depends on accurate analysis and incorporating informative experimental data to constrain the set of potential mathematical representations. In this introductory tutorial, we provide an overview of the state of the field and introduce some of the computational methods most commonly used in q-bio. In particular, we examine experimental techniques in single-cell imaging, computational tools to process images and extract quantitative data, various mechanistic modeling approaches used to reproduce these quantitative data, and techniques for data-driven model inference and model-driven experiment design. All topics are presented in the context of additional online resources, including open-source Python notebooks and open-ended practice problems that comprise the technical content of the annual Undergraduate Quantitative Biology Summer School (UQ-Bio).
Collapse
Affiliation(s)
- Luis U Aguilera
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
- Department of Biochemistry and Molecular Genetics, University of Colorado-Anschutz Medical Campus, Aurora, CO 80045, United States of America
| | - Lisa M Weber
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
- School of Mathematics and Engineering, Front Range Community College, Fort Collins, CO 80526, United States of America
| | - Eric Ron
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
| | - Connor R King
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
- Cell and Molecular Biology Program, Colorado State University, Fort Collins, CO 80523, United States of America
| | - Kaan Öcal
- School of BioSciences, University of Melbourne, Parkville, Victoria 3010, Australia
| | - Alex Popinga
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
- School of Biological Sciences, University of Auckland, Auckland CBD, Auckland 1010, New Zealand
| | - Joshua Cook
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
| | - Michael P May
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
| | - William S Raymond
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
| | - Zachary R Fox
- Oak Ridge National Laboratory, Oak Ridge, TN 37830, United States of America
| | - Linda S Forero-Quintero
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
| | - Jack R Forman
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
| | - Alexandre David
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
| | - Brian Munsky
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO 80523, United States of America
| |
Collapse
|
2
|
Andreini C. Twenty years in metalloprotein bioinformatics: A short history of a long journey. J Inorg Biochem 2025; 266:112854. [PMID: 39961171 DOI: 10.1016/j.jinorgbio.2025.112854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2024] [Revised: 01/31/2025] [Accepted: 02/07/2025] [Indexed: 03/01/2025]
Abstract
The study of the structure and function of metalloproteins is a central subject of inorganic biochemistry. Starting from the 2000s, computational methods have flanked experimental research by exploiting the ever-increasing computing power and the huge amount of data produced by omics technologies. In this article, we retrace the major advancements that brought bioinformatics from being of minor relevance to being an essential tool for today's inorganic biochemists, focusing on the contributions coming from the Magnetic Resonance Center (CERM) of Florence, where we have been developing for twenty years methods and resources to investigate metalloproteins with computational approaches.
Collapse
Affiliation(s)
- Claudia Andreini
- Magnetic Resonance Center, University of Florence, 50019 Sesto Fiorentino, Italy; Department of Chemistry, University of Florence, 50019 Sesto Fiorentino, Italy.
| |
Collapse
|
3
|
Mougkogiannis P, Adamatzky A. On the Response of Proteinoid Ensembles to Fibonacci Sequences. ACS OMEGA 2025; 10:10401-10424. [PMID: 40124033 PMCID: PMC11923683 DOI: 10.1021/acsomega.4c10571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/20/2024] [Revised: 02/14/2025] [Accepted: 02/26/2025] [Indexed: 03/25/2025]
Abstract
This work investigates the integration of Fibonacci patterns and Golden Ratio principles into proteinoid-based systems, connecting fundamental mathematical concepts with contemporary biomimetic approaches. Proteinoids are thermal proteins that can self-assemble and have enzyme-like capabilities. They provide a distinct platform for biomimetic information processing. Our study examines the impact of integrating Fibonacci sequences and the Golden Ratio (ϕ = 1.618) into the design and synthesis of proteinoids on their structural organization and response characteristics. We developed two categories of stimuli: auditory signals generated using frequencies derived from the Fibonacci sequence, and electrical patterns that correspond to the proportions of the Golden Ratio. The proteinoid microsphere assemblies were subjected to these stimuli, and their electrical and structural responses were recorded and analyzed. The results indicate that proteinoid systems reveal unique reactivity to acoustic stimuli based on the Fibonacci sequence, exhibiting heightened sensitivity to particular combinations of frequencies and demonstrating nonlinear amplification effects. The proteinoid assemblies exhibited distinctive temporal dynamics and emergent oscillatory behaviors when exposed to voltage patterns inspired by the Golden Ratio, which were not detected with ordinary input signals. These findings provide opportunities for developing advanced bioinspired information transfer and security systems and might improve our understanding of information processing in early chemical systems.
Collapse
Affiliation(s)
| | - Andrew Adamatzky
- Unconventional Computing
Laboratory, University of the West of England, Bristol BS16 1QY, U.K.
| |
Collapse
|
4
|
Hemel IMGM, Steen C, Denil SLIJ, Ertaylan G, Kutmon M, Adriaens M, Gerards M. The unusual suspect: A novel role for intermediate filament proteins in mitochondrial morphology. Mitochondrion 2025; 81:102008. [PMID: 39909388 DOI: 10.1016/j.mito.2025.102008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2024] [Revised: 01/30/2025] [Accepted: 02/01/2025] [Indexed: 02/07/2025]
Abstract
Mitochondrial dynamics is crucial for cellular homeostasis. However, not all proteins involved are known. Using a protein-protein interaction (PPI) approach, we identified ITPRIPL2 for involvement in mitochondrial dynamics. ITPRIPL2 co-localizes with intermediate filament protein vimentin, supported by protein simulations. ITPRIPL2 knockdown reveals mitochondrial elongation, disrupts vimentin processing, intermediate filament formation, and alters vimentin-related pathways. Interestingly, vimentin knockdown also leads to mitochondrial elongation. These findings highlight ITPRIPL2 as vimentin-associated protein essential for intermediate filament structure and suggest a role for intermediate filaments in mitochondrial morphology. Our study demonstrates that PPI analysis is a powerful approach for identifying novel mitochondrial dynamics proteins.
Collapse
Affiliation(s)
- Irene M G M Hemel
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht 6229 EN the Netherlands
| | - Carlijn Steen
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht 6229 EN the Netherlands
| | - Simon L I J Denil
- Flemish Institute for Technological Research (VITO) 2400 Mol, Belgium
| | - Gökhan Ertaylan
- Flemish Institute for Technological Research (VITO) 2400 Mol, Belgium
| | - Martina Kutmon
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht 6229 EN the Netherlands
| | - Michiel Adriaens
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht 6229 EN the Netherlands
| | - Mike Gerards
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht 6229 EN the Netherlands.
| |
Collapse
|
5
|
Ferenc K, Rauluseviciute I, Hovan L, Kumar V, Kuijjer ML, Mathelier A. Improving bioinformatics software quality through teamwork. Bioinformatics 2024; 40:btae632. [PMID: 39436982 PMCID: PMC11537420 DOI: 10.1093/bioinformatics/btae632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Revised: 10/02/2024] [Accepted: 10/21/2024] [Indexed: 10/25/2024] Open
Abstract
SUMMARY Since high-throughput techniques became a staple in biological science laboratories, computational algorithms, and scientific software have boomed. However, the development of bioinformatics software usually lacks software development quality standards. The resulting software code is hard to test, reuse, and maintain. We believe that the root of inefficiency in implementing the best software development practices in academic settings is the individualistic approach, which has traditionally been the norm for recognizing scientific achievements and, by extension, for developing specialized software. Software development is a collective effort in most software-heavy endeavors. Indeed, the literature suggests teamwork directly impacts code quality through knowledge sharing, collective software development, and established coding standards. In our computational biology research groups, we sustainably involve all group members in learning, sharing, and discussing software development while maintaining the personal ownership of research projects and related software products. We found that group members involved in this endeavor improved their coding skills, became more efficient bioinformaticians, and obtained detailed knowledge about their peers' work, triggering new collaborative projects. We strongly advocate for improving software development culture within bioinformatics through collective effort in computational biology groups or institutes with three or more bioinformaticians. AVAILABILITY AND IMPLEMENTATION Additional information and guidance on how to get started is available at https://ferenckata.github.io/ImprovingSoftwareTogether.github.io/.
Collapse
Affiliation(s)
- Katalin Ferenc
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
| | - Ieva Rauluseviciute
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
| | - Ladislav Hovan
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
| | - Vipin Kumar
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
| | - Marieke L Kuijjer
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
| | - Anthony Mathelier
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo 0318, Norway
- Department of Medical Genetics, Institute of Clinical Medicine, Oslo University Hospital and University of Oslo, Oslo 0450, Norway
- Department of Pharmacy, University of Oslo, Oslo 0371, Norway
| |
Collapse
|
6
|
Gallagher K, Creswell R, Lambert B, Robinson M, Lok Lei C, Mirams GR, Gavaghan DJ. Ten simple rules for training scientists to make better software. PLoS Comput Biol 2024; 20:e1012410. [PMID: 39264985 PMCID: PMC11392269 DOI: 10.1371/journal.pcbi.1012410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/14/2024] Open
Affiliation(s)
- Kit Gallagher
- Doctoral Training Centre, University of Oxford, Oxford, United Kingdom
| | - Richard Creswell
- Department of Computer Science, University of Oxford, Oxford, United Kingdom
| | - Ben Lambert
- Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - Martin Robinson
- Department of Computer Science, University of Oxford, Oxford, United Kingdom
| | - Chon Lok Lei
- Department of Biomedical Sciences, Faculty of Health Sciences, University of Macau, Macau, China
| | - Gary R. Mirams
- Centre for Mathematical Medicine & Biology, School of Mathematical Sciences, University of Nottingham, Nottingham, United Kingdom
| | - David J. Gavaghan
- Doctoral Training Centre, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
7
|
Matuszyńska A, Ebenhöh O, Zurbriggen MD, Ducat DC, Axmann IM. A new era of synthetic biology-microbial community design. Synth Biol (Oxf) 2024; 9:ysae011. [PMID: 39086602 PMCID: PMC11290361 DOI: 10.1093/synbio/ysae011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 06/21/2024] [Accepted: 07/15/2024] [Indexed: 08/02/2024] Open
Abstract
Synthetic biology conceptualizes biological complexity as a network of biological parts, devices, and systems with predetermined functionalities and has had a revolutionary impact on fundamental and applied research. With the unprecedented ability to synthesize and transfer any DNA and RNA across organisms, the scope of synthetic biology is expanding and being recreated in previously unimaginable ways. The field has matured to a level where highly complex networks, such as artificial communities of synthetic organisms, can be constructed. In parallel, computational biology became an integral part of biological studies, with computational models aiding the unravelling of the escalating complexity and emerging properties of biological phenomena. However, there is still a vast untapped potential for the complete integration of modelling into the synthetic design process, presenting exciting opportunities for scientific advancements. Here, we first highlight the most recent advances in computer-aided design of microbial communities. Next, we propose that such a design can benefit from an organism-free modular modelling approach that places its emphasis on modules of organismal function towards the design of multispecies communities. We argue for a shift in perspective from single organism-centred approaches to emphasizing the functional contributions of organisms within the community. By assembling synthetic biological systems using modular computational models with mathematical descriptions of parts and circuits, we can tailor organisms to fulfil specific functional roles within the community. This approach aligns with synthetic biology strategies and presents exciting possibilities for the design of artificial communities. Graphical Abstract.
Collapse
Affiliation(s)
- Anna Matuszyńska
- Computational Life Science, Department of Biology, RWTH Aachen University, Aachen 52074, Germany
- Cluster of Excellence on Plant Sciences, CEPLAS, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
| | - Oliver Ebenhöh
- Cluster of Excellence on Plant Sciences, CEPLAS, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
- Institute of Quantitative and Theoretical Biology, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
| | - Matias D Zurbriggen
- Cluster of Excellence on Plant Sciences, CEPLAS, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
- Institute of Synthetic Biology, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
| | - Daniel C Ducat
- MSU-DOE Plant Research Laboratory, Michigan State University, East Lansing, MI 48824, United States
- Department of Biochemistry & Molecular Biology, Michigan State University, East Lansing, MI 48824, United States
- Institute for Synthetic Microbiology, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
| | - Ilka M Axmann
- Cluster of Excellence on Plant Sciences, CEPLAS, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
- Institute for Synthetic Microbiology, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
| |
Collapse
|
8
|
Ouzounis CA. Biology's transformation: from observation through experiment to computation. BIOINFORMATICS ADVANCES 2024; 4:vbae069. [PMID: 38799705 PMCID: PMC11127110 DOI: 10.1093/bioadv/vbae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 04/24/2024] [Accepted: 05/07/2024] [Indexed: 05/29/2024]
Abstract
Summary We explore the nuanced temporal and epistemological distinctions among natural sciences, particularly the contrasting treatment of time and the interplay between theory and experimentation. Physics, an exemplar of mature science, relies on theoretical models for predictability and simulations. In contrast, biology, traditionally experimental, is witnessing a computational surge, with data analytics and simulations reshaping its research paradigms. Despite these strides, a unified theoretical framework in biology remains elusive. We propose that contemporary global challenges might usher in a renewed emphasis, presenting an opportunity for the establishment of a novel theoretical underpinning for the life sciences. Availability and implementation https://github.com/ouzounis/CLS-emerges Data in Json format, Images in PNG format.
Collapse
Affiliation(s)
- Christos A Ouzounis
- Biological Computation & Computational Biology Group, Artificial Intelligence & Information Analysis Laboratory, School of Informatics, Faculty of Sciences, Aristotle University of Thessalonica, Thessalonica GR-54124, Greece
| |
Collapse
|
9
|
Wang X, Jenner AL, Salomone R, Warne DJ, Drovandi C. Calibration of agent based models for monophasic and biphasic tumour growth using approximate Bayesian computation. J Math Biol 2024; 88:28. [PMID: 38358410 PMCID: PMC10869399 DOI: 10.1007/s00285-024-02045-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 10/25/2023] [Accepted: 12/27/2023] [Indexed: 02/16/2024]
Abstract
Agent-based models (ABMs) are readily used to capture the stochasticity in tumour evolution; however, these models are often challenging to validate with experimental measurements due to model complexity. The Voronoi cell-based model (VCBM) is an off-lattice agent-based model that captures individual cell shapes using a Voronoi tessellation and mimics the evolution of cancer cell proliferation and movement. Evidence suggests tumours can exhibit biphasic growth in vivo. To account for this phenomena, we extend the VCBM to capture the existence of two distinct growth phases. Prior work primarily focused on point estimation for the parameters without consideration of estimating uncertainty. In this paper, approximate Bayesian computation is employed to calibrate the model to in vivo measurements of breast, ovarian and pancreatic cancer. Our approach involves estimating the distribution of parameters that govern cancer cell proliferation and recovering outputs that match the experimental data. Our results show that the VCBM, and its biphasic extension, provides insight into tumour growth and quantifies uncertainty in the switching time between the two phases of the biphasic growth model. We find this approach enables precise estimates for the time taken for a daughter cell to become a mature cell. This allows us to propose future refinements to the model to improve accuracy, whilst also making conclusions about the differences in cancer cell characteristics.
Collapse
Affiliation(s)
- Xiaoyu Wang
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia.
- Centre for Data Science, Queensland University of Technology, Brisbane, QLD, Australia.
| | - Adrianne L Jenner
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia
- Centre for Data Science, Queensland University of Technology, Brisbane, QLD, Australia
| | - Robert Salomone
- Centre for Data Science, Queensland University of Technology, Brisbane, QLD, Australia
- School of Computer Science, Queensland University of Technology, Brisbane, QLD, Australia
| | - David J Warne
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia
- Centre for Data Science, Queensland University of Technology, Brisbane, QLD, Australia
| | - Christopher Drovandi
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD, Australia
- Centre for Data Science, Queensland University of Technology, Brisbane, QLD, Australia
| |
Collapse
|
10
|
Berezin CT, Aguilera LU, Billerbeck S, Bourne PE, Densmore D, Freemont P, Gorochowski TE, Hernandez SI, Hillson NJ, King CR, Köpke M, Ma S, Miller KM, Moon TS, Moore JH, Munsky B, Myers CJ, Nicholas DA, Peccoud SJ, Zhou W, Peccoud J. Ten simple rules for managing laboratory information. PLoS Comput Biol 2023; 19:e1011652. [PMID: 38060459 PMCID: PMC10703290 DOI: 10.1371/journal.pcbi.1011652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2023] Open
Abstract
Information is the cornerstone of research, from experimental (meta)data and computational processes to complex inventories of reagents and equipment. These 10 simple rules discuss best practices for leveraging laboratory information management systems to transform this large information load into useful scientific findings.
Collapse
Affiliation(s)
- Casey-Tyler Berezin
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, Colorado, United States of America
| | - Luis U. Aguilera
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, Colorado, United States of America
| | - Sonja Billerbeck
- Molecular Microbiology Unit, Faculty of Science and Engineering, University of Groningen, Groningen, the Netherlands
| | - Philip E. Bourne
- School of Data Science, University of Virginia, Charlottesville, Virginia, United States of America
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
| | - Douglas Densmore
- College of Engineering, Boston University, Boston, Massachusetts, United States of America
| | - Paul Freemont
- Department of Infectious Disease, Imperial College, London, United Kingdom
| | - Thomas E. Gorochowski
- School of Biological Sciences, University of Bristol, Bristol, United Kingdom
- BrisEngBio, University of Bristol, Bristol, United Kingdom
| | - Sarah I. Hernandez
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, Colorado, United States of America
| | - Nathan J. Hillson
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
- US Department of Energy Agile BioFoundry, Emeryville, California, United States of America
- US Department of Energy Joint BioEnergy Institute, Emeryville, California, United States of America
| | - Connor R. King
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, Colorado, United States of America
| | - Michael Köpke
- LanzaTech, Skokie, Illinois, United States of America
| | - Shuyi Ma
- Center for Global Infectious Disease Research, Seattle Children’s Hospital, University of Washington Medicine, Seattle, Washington, United States of America
| | - Katie M. Miller
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, Colorado, United States of America
| | - Tae Seok Moon
- Department of Energy, Environmental & Chemical Engineering, Washington University in St. Louis, St. Louis, Missouri, United States of America
| | - Jason H. Moore
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, California, United States of America
| | - Brian Munsky
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, Colorado, United States of America
| | - Chris J. Myers
- Department of Electrical, Computer & Energy Engineering, University of Colorado Boulder, Boulder, Colorado, United States of America
| | - Dequina A. Nicholas
- Department of Molecular Biology & Biochemistry, University of California Irvine, Irvine, California, United States of America
| | - Samuel J. Peccoud
- Department of Electrical and Computer Engineering, Colorado State University, Fort Collins, Colorado, United States of America
| | - Wen Zhou
- Department of Statistics, Colorado State University, Fort Collins, Colorado, United States of America
| | - Jean Peccoud
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, Colorado, United States of America
| |
Collapse
|
11
|
Kowalski TW, Feira MF, Lord VO, Gomes JDA, Giudicelli GC, Fraga LR, Sanseverino MTV, Recamonde-Mendoza M, Schuler-Faccini L, Vianna FSL. A New Strategy for the Old Challenge of Thalidomide: Systems Biology Prioritization of Potential Immunomodulatory Drug (IMiD)-Targeted Transcription Factors. Int J Mol Sci 2023; 24:11515. [PMID: 37511270 PMCID: PMC10380514 DOI: 10.3390/ijms241411515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 07/06/2023] [Accepted: 07/08/2023] [Indexed: 07/30/2023] Open
Abstract
Several molecular mechanisms of thalidomide embryopathy (TE) have been investigated, from anti-angiogenesis to oxidative stress to cereblon binding. Recently, it was discovered that thalidomide and its analogs, named immunomodulatory drugs (IMiDs), induced the degradation of C2H2 transcription factors (TFs). This mechanism might impact the strict transcriptional regulation of the developing embryo. Hence, this study aims to evaluate the TFs altered by IMiDs, prioritizing the ones associated with embryogenesis through transcriptome and systems biology-allied analyses. This study comprises only the experimental data accessed through bioinformatics databases. First, proteins and genes reported in the literature as altered/affected by the IMiDs were annotated. A protein systems biology network was evaluated. TFs beta-catenin (CTNNB1) and SP1 play more central roles: beta-catenin is an essential protein in the network, while SP1 is a putative C2H2 candidate for IMiD-induced degradation. Separately, the differential expressions of the annotated genes were analyzed through 23 publicly available transcriptomes, presenting 8624 differentially expressed genes (2947 in two or more datasets). Seventeen C2H2 TFs were identified as related to embryonic development but not studied for IMiD exposure; these TFs are potential IMiDs degradation neosubstrates. This is the first study to suggest an integration of IMiD molecular mechanisms through C2H2 TF degradation.
Collapse
Affiliation(s)
- Thayne Woycinck Kowalski
- Graduate Program in Genetics and Molecular Biology, Genetics Department, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 91501-970, Brazil
- Teratogen Information System (SIAT), Medical Genetics Service, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- Laboratory of Genomic Medicine, Center of Experimental Research, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- Bioinformatics Core, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- Biomedical Sciences Course, Centro Universitário CESUCA, Cachoeirinha 94935-630, Brazil
| | - Mariléa Furtado Feira
- Graduate Program in Genetics and Molecular Biology, Genetics Department, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 91501-970, Brazil
- Laboratory of Genomic Medicine, Center of Experimental Research, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
| | - Vinícius Oliveira Lord
- Laboratory of Genomic Medicine, Center of Experimental Research, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- Biomedical Sciences Course, Centro Universitário CESUCA, Cachoeirinha 94935-630, Brazil
| | - Julia do Amaral Gomes
- Laboratory of Genomic Medicine, Center of Experimental Research, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
| | - Giovanna Câmara Giudicelli
- Graduate Program in Genetics and Molecular Biology, Genetics Department, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 91501-970, Brazil
- Bioinformatics Core, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
| | - Lucas Rosa Fraga
- Teratogen Information System (SIAT), Medical Genetics Service, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- Laboratory of Genomic Medicine, Center of Experimental Research, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- Post-Graduation Program in Medicine, Medical Sciences, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 90035-003, Brazil
- Department of Morphological Sciences, Institute of Health Sciences, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 90010-150, Brazil
| | - Maria Teresa Vieira Sanseverino
- Graduate Program in Genetics and Molecular Biology, Genetics Department, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 91501-970, Brazil
- Teratogen Information System (SIAT), Medical Genetics Service, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- School of Medicine, Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre 90619-900, Brazil
| | - Mariana Recamonde-Mendoza
- Bioinformatics Core, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- Post-Graduation Program in Computer Science, Institute of Informatics, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 91501-970, Brazil
| | - Lavinia Schuler-Faccini
- Graduate Program in Genetics and Molecular Biology, Genetics Department, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 91501-970, Brazil
- Teratogen Information System (SIAT), Medical Genetics Service, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
| | - Fernanda Sales Luiz Vianna
- Graduate Program in Genetics and Molecular Biology, Genetics Department, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 91501-970, Brazil
- Teratogen Information System (SIAT), Medical Genetics Service, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- Laboratory of Genomic Medicine, Center of Experimental Research, Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre 90035-903, Brazil
- Post-Graduation Program in Medicine, Medical Sciences, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre 90035-003, Brazil
| |
Collapse
|
12
|
Naha A, Antony S, Nath S, Sharma D, Mishra A, Biju DT, Madhavan A, Binod P, Varjani S, Sindhu R. A hypothetical model of multi-layered cost-effective wastewater treatment plant integrating microbial fuel cell and nanofiltration technology: A comprehensive review on wastewater treatment and sustainable remediation. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2023; 323:121274. [PMID: 36804140 DOI: 10.1016/j.envpol.2023.121274] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 02/05/2023] [Accepted: 02/10/2023] [Indexed: 06/18/2023]
Abstract
Wastewater management has emerged as an uprising concern that demands immediate attention from environmentalists worldwide. Indiscriminate and irrational release of industrial and poultry wastes, sewage, pharmaceuticals, mining, pesticides, fertilizers, dyes and radioactive wastes, contribute immensely to water pollution. This has led to the aggravation of critical health concerns as evident from the uprising trends of antimicrobial resistance, and the presence of xenobiotics and pollutant traces in humans and animals due to the process of biomagnification. Therefore, the development of reliable, affordable and sustainable technologies for the supply of fresh water is the need of the hour. Conventional wastewater treatment often involves physical, chemical, and biological processes to remove solids from the effluent, including colloids, organic matter, nutrients, and soluble pollutants (metals, organics). Synthetic biology has been explored in recent years, incorporating both biological and engineering concepts to refine existing wastewater treatment technologies. In addition to outlining the benefits and drawbacks of the current technologies, this review addresses novel wastewater treatment techniques, especially those using dedicated rational design and engineering of organisms and their constituent parts. Furthermore, the review hypothesizes designing a multi-bedded wastewater treatment plant that is highly cost-efficient, sustainable and requires easy installation and handling. The novel setup envisages removing all the major wastewater pollutants, providing water fit for household, irrigation and storage purposes.
Collapse
Affiliation(s)
- Aniket Naha
- Pushpagiri Research Centre, Pushpagiri Institute of Medical Sciences and Research Centre, Thriuvalla-689 101, Kerala, India
| | - Sherly Antony
- Department of Microbiology, Pushpagiri Institute of Medical Sciences and Research Centre, Thiruvalla-689 101, Kerala, India
| | - Soumitra Nath
- Department of Biotechnology, Gurucharan College, Silchar-788004, India
| | - Dhrubjyoti Sharma
- Biological Engineering, Indian Institute of Technology, Gandhinagar, Palaj, Gandhinagar, 382 355 India
| | - Anamika Mishra
- Department of Biotechnology, Vellore Institute of Technology, Vellore, 632 014, India
| | - Devika T Biju
- Department of Biomedical Science, University of Salford, England, M5 4WT, United Kingdom
| | - Aravind Madhavan
- School of Biotechnology, Amrita Vishwa Vidyapeetham, Amritapuri, Kollam-690525, Kerala, India
| | - Parameswaran Binod
- Microbial Processes and Technology Division, CSIR-National Institute for Interdisciplinary Science and Technology (CSIR-NIIST), Thiruvananthapuram 695 019, Kerala, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad - 201 002, India
| | - Sunita Varjani
- Gujarat Pollution Control Board, Gandhinagar, Gujarat 382 010, India
| | - Raveendran Sindhu
- Department of Food Technology, T K M Institute of Technology, Kollam-691 505, Kerala, India.
| |
Collapse
|
13
|
Sharma P, Dahiya S, Kaur P, Kapil A. Computational biology: Role and scope in taming antimicrobial resistance. Indian J Med Microbiol 2023; 41:33-38. [PMID: 36870746 DOI: 10.1016/j.ijmmb.2022.12.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 12/11/2022] [Accepted: 12/16/2022] [Indexed: 01/05/2023]
Abstract
BACKGROUND Infectious diseases pose many challenges due to increasing threat of antimicrobial resistance, which necessitates continuous research to develop novel strategies for development of new molecules with antibacterial activity. In the era of computational biology there are tools and techniques available to address and solve the disease management issues in the field of clinical microbiology. The sequencing techniques, structural biology and machine learning can be implemented collectively to tackle infectious diseases e.g. for the diagnosis, epidemiological typing, pathotyping, antimicrobial resistance detection as well as the discovery of novel drugs and vaccine biomarkers. OBJECTIVES The present review is a narrative review representing a comprehensive literature-based assessment regarding the use of whole genome sequencing, structural biology and machine learning for the diagnosis, molecular typing and antibacterial drug discovery. CONTENT Here, we seek to present an overview of molecular and structural basis of resistance to antibiotics, while focusing on the recent bioinformatics approaches in whole genome sequencing and structural biology. The application of next generation sequencing in management of bacterial infections focusing on investigation of microbial population diversity, genotypic resistance testing and scope for the identification of targets for novel drug and vaccine candidates, has been addressed along with the use of structural biophysics and artificial intelligence.
Collapse
Affiliation(s)
- Priyanka Sharma
- Department of Biophysics, All India Institute of Medical Sciences, Ansari Nagar, New Delhi, 110029, India.
| | - Sushila Dahiya
- Department of Microbiology, All India Institute of Medical Sciences, Ansari Nagar, New Delhi, 110029, India.
| | - Punit Kaur
- Department of Biophysics, All India Institute of Medical Sciences, Ansari Nagar, New Delhi, 110029, India.
| | - Arti Kapil
- Department of Microbiology, All India Institute of Medical Sciences, Ansari Nagar, New Delhi, 110029, India.
| |
Collapse
|
14
|
Abstract
This chapter outlines the myriad applications of machine learning (ML) in synthetic biology, specifically in engineering cell and protein activity, and metabolic pathways. Though by no means comprehensive, the chapter highlights several prominent computational tools applied in the field and their potential use cases. The examples detailed reinforce how ML algorithms can enhance synthetic biology research by providing data-driven insights into the behavior of living systems, even without detailed knowledge of their underlying mechanisms. By doing so, ML promises to increase the efficiency of research projects by modeling hypotheses in silico that can then be tested through experiments. While challenges related to training dataset generation and computational costs remain, ongoing improvements in ML tools are paving the way for smarter and more streamlined synthetic biology workflows that can be readily employed to address grand challenges across manufacturing, medicine, engineering, agriculture, and beyond.
Collapse
Affiliation(s)
- Brendan Fu-Long Sieow
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- NUS Graduate School for Integrative Sciences and Engineering Programme, National University of Singapore, Singapore, Singapore
| | - Ryan De Sotto
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Zhi Ren Darren Seet
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - In Young Hwang
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Matthew Wook Chang
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore.
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
| |
Collapse
|
15
|
Rademaker DT, Xue LC, ‘t Hoen PAC, Vriend G. Entropy and Variability: A Second Opinion by Deep Learning. Biomolecules 2022; 12:biom12121740. [PMID: 36551168 PMCID: PMC9775329 DOI: 10.3390/biom12121740] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 11/13/2022] [Accepted: 11/19/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Analysis of the distribution of amino acid types found at equivalent positions in multiple sequence alignments has found applications in human genetics, protein engineering, drug design, protein structure prediction, and many other fields. These analyses tend to revolve around measures of the distribution of the twenty amino acid types found at evolutionary equivalent positions: the columns in multiple sequence alignments. Commonly used measures are variability, average hydrophobicity, or Shannon entropy. One of these techniques, called entropy-variability analysis, as the name already suggests, reduces the distribution of observed residue types in one column to two numbers: the Shannon entropy and the variability as defined by the number of residue types observed. RESULTS We applied a deep learning, unsupervised feature extraction method to analyse the multiple sequence alignments of all human proteins. An auto-encoder neural architecture was trained on 27,835 multiple sequence alignments for human proteins to obtain the two features that best describe the seven million variability patterns. These two unsupervised learned features strongly resemble entropy and variability, indicating that these are the projections that retain most information when reducing the dimensionality of the information hidden in columns in multiple sequence alignments.
Collapse
Affiliation(s)
- Daniel T. Rademaker
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, 260 Nijmegen, The Netherlands
| | - Li C. Xue
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, 260 Nijmegen, The Netherlands
| | - Peter A. C. ‘t Hoen
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, 260 Nijmegen, The Netherlands
| | - Gert Vriend
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, 260 Nijmegen, The Netherlands
- Baco Institute for Protein Science (BIPS), Mindoro 5201, Philippines
- Correspondence:
| |
Collapse
|
16
|
Cordier BA, Sawaya NPD, Guerreschi GG, McWeeney SK. Biology and medicine in the landscape of quantum advantages. J R Soc Interface 2022; 19:20220541. [PMID: 36448288 PMCID: PMC9709576 DOI: 10.1098/rsif.2022.0541] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 11/04/2022] [Indexed: 12/03/2022] Open
Abstract
Quantum computing holds substantial potential for applications in biology and medicine, spanning from the simulation of biomolecules to machine learning methods for subtyping cancers on the basis of clinical features. This potential is encapsulated by the concept of a quantum advantage, which is contingent on a reduction in the consumption of a computational resource, such as time, space or data. Here, we distill the concept of a quantum advantage into a simple framework to aid researchers in biology and medicine pursuing the development of quantum applications. We then apply this framework to a wide variety of computational problems relevant to these domains in an effort to (i) assess the potential of practical advantages in specific application areas and (ii) identify gaps that may be addressed with novel quantum approaches. In doing so, we provide an extensive survey of the intersection of biology and medicine with the current landscape of quantum algorithms and their potential advantages. While we endeavour to identify specific computational problems that may admit practical advantages throughout this work, the rapid pace of change in the fields of quantum computing, classical algorithms and biological research implies that this intersection will remain highly dynamic for the foreseeable future.
Collapse
Affiliation(s)
- Benjamin A. Cordier
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR 97202, USA
| | | | | | - Shannon K. McWeeney
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR 97202, USA
- Knight Cancer Institute, Oregon Health and Science University, Portland, OR 97202, USA
- Oregon Clinical and Translational Research Institute, Oregon Health and Science University, Portland, OR 97202, USA
| |
Collapse
|
17
|
Topçu A, Kılıç S, Özgür E, Türkmen D, Denizli A. Inspirations of Biomimetic Affinity Ligands: A Review. ACS OMEGA 2022; 7:32897-32907. [PMID: 36157742 PMCID: PMC9494661 DOI: 10.1021/acsomega.2c03530] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 08/29/2022] [Indexed: 06/16/2023]
Abstract
Affinity chromatography is a well-known method dependent on molecular recognition and is used to purify biomolecules by mimicking the specific interactions between the biomolecules and their substrates. Enzyme substrates, cofactors, antigens, and inhibitors are generally utilized as bioligands in affinity chromatography. However, their cost, instability, and leakage problems are the main drawbacks of these bioligands. Biomimetic affinity ligands can recognize their target molecules with high selectivity. Their cost-effectiveness and chemical and biological stabilities make these antibody analogs favorable candidates for affinity chromatography applications. Biomimetics applies to nature and aims to develop nanodevices, processes, and nanomaterials. Today, biomimetics provides a design approach to the biomimetic affinity ligands with the aid of computational methods, rational design, and other approaches to meet the requirements of the bioligands and improve the downstream process. This review highlighted the recent trends in designing biomimetic affinity ligands and summarized their binding interactions with the target molecules with computational approaches.
Collapse
Affiliation(s)
- Aykut
Arif Topçu
- Medical
Laboratory Program, Vocational School of Health Service, Aksaray University, 68100 Aksaray, Turkey
| | - Seçkin Kılıç
- Department
of Chemistry, Hacettepe University, 06230 Ankara, Turkey
| | - Erdoğan Özgür
- Department
of Chemistry, Hacettepe University, 06230 Ankara, Turkey
| | - Deniz Türkmen
- Department
of Chemistry, Hacettepe University, 06230 Ankara, Turkey
| | - Adil Denizli
- Department
of Chemistry, Hacettepe University, 06230 Ankara, Turkey
| |
Collapse
|
18
|
Cohen‐Sasson O, Tur‐Sinai O. Facilitating open science without sacrificing
IP
rights. EMBO Rep 2022; 23:e55841. [PMID: 35972193 PMCID: PMC9442283 DOI: 10.15252/embr.202255841] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 08/02/2022] [Indexed: 11/09/2022] Open
Affiliation(s)
- Or Cohen‐Sasson
- Zvi Meitar Center for Advanced Legal Studies Faculty of Law Tel Aviv University Tel Aviv Israel
| | | |
Collapse
|
19
|
Caughman AM, Weigel EG. Biology Students' Math and Computer Science Task Values Are Closely Linked. CBE LIFE SCIENCES EDUCATION 2022; 21:ar43. [PMID: 35759628 PMCID: PMC9582834 DOI: 10.1187/cbe.21-07-0180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 04/22/2022] [Accepted: 04/27/2022] [Indexed: 06/15/2023]
Abstract
Quantitative and computational skills are required of 21st-century biologists. While biology student abilities and attitudes toward math have been studied extensively, less is known about corresponding attitudes toward computer science (CS). It is important to understand how students perceive math and CS subjects and whether those perceptions are linked or operate contradictorily to determine instructional best practices. This study 1) determined biology students' perceptions of math and CS in biological contexts, 2) measured the linkage of those perceptions, and 3) examined additional factors affecting attitudes. Students (N = 272) were surveyed using the original and a CS-adapted version of the Math-Biology Values Instrument to determine interest, perceived utility, and perceived costs toward math and CS in biological contexts. Mixed-effects models were used to determine correlations between task values and investigate effects of exposure to topics and demographic factors. Math and CS values exhibited positive correlations, but utility and cost were more negative for CS, possibly due to less exposure to CS before college, and overall attitudes were influenced by CS background and gender. Given these findings, we make educational recommendations for CS and math exposure early, often, and embedded in the biology curriculum.
Collapse
Affiliation(s)
- Alicia M. Caughman
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332
| | - Emily G. Weigel
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332
| |
Collapse
|
20
|
Filazzola A, Lortie CJ. A call for clean code to effectively communicate science. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.13961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Affiliation(s)
- Alessandro Filazzola
- Apex Resource Management Solutions Ottawa ON Canada
- Centre for Urban Environments University of Toronto Mississauga Mississauga ON Canada
| | - CJ Lortie
- Department of Biology York University Toronto ON Canada
- The National Center for Ecological Analysis and Synthesis UCSB Santa Barbara CA USA
| |
Collapse
|
21
|
Alser M, Lindegger J, Firtina C, Almadhoun N, Mao H, Singh G, Gomez-Luna J, Mutlu O. From molecules to genomic variations: Accelerating genome analysis via intelligent algorithms and architectures. Comput Struct Biotechnol J 2022; 20:4579-4599. [PMID: 36090814 PMCID: PMC9436709 DOI: 10.1016/j.csbj.2022.08.019] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Revised: 08/08/2022] [Accepted: 08/08/2022] [Indexed: 02/01/2023] Open
Abstract
We now need more than ever to make genome analysis more intelligent. We need to read, analyze, and interpret our genomes not only quickly, but also accurately and efficiently enough to scale the analysis to population level. There currently exist major computational bottlenecks and inefficiencies throughout the entire genome analysis pipeline, because state-of-the-art genome sequencing technologies are still not able to read a genome in its entirety. We describe the ongoing journey in significantly improving the performance, accuracy, and efficiency of genome analysis using intelligent algorithms and hardware architectures. We explain state-of-the-art algorithmic methods and hardware-based acceleration approaches for each step of the genome analysis pipeline and provide experimental evaluations. Algorithmic approaches exploit the structure of the genome as well as the structure of the underlying hardware. Hardware-based acceleration approaches exploit specialized microarchitectures or various execution paradigms (e.g., processing inside or near memory) along with algorithmic changes, leading to new hardware/software co-designed systems. We conclude with a foreshadowing of future challenges, benefits, and research directions triggered by the development of both very low cost yet highly error prone new sequencing technologies and specialized hardware chips for genomics. We hope that these efforts and the challenges we discuss provide a foundation for future work in making genome analysis more intelligent.
Collapse
Affiliation(s)
| | | | - Can Firtina
- ETH Zurich, Gloriastrasse 35, 8092 Zürich, Switzerland
| | | | - Haiyu Mao
- ETH Zurich, Gloriastrasse 35, 8092 Zürich, Switzerland
| | | | | | - Onur Mutlu
- ETH Zurich, Gloriastrasse 35, 8092 Zürich, Switzerland
| |
Collapse
|
22
|
Ren P, Yang X, Wang T, Hou Y, Zhang Z. Proteome-wide prediction and analysis of the Cryptosporidium parvum protein-protein interaction network through integrative methods. Comput Struct Biotechnol J 2022; 20:2322-2331. [PMID: 35615014 PMCID: PMC9120227 DOI: 10.1016/j.csbj.2022.05.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 05/08/2022] [Accepted: 05/09/2022] [Indexed: 11/03/2022] Open
Abstract
By combining a sequence embedding technique (i.e., Doc2Vec) and a di-peptide composition representation to convert protein sequences into feature vectors, we proposed an RF classifier trained on the Plasmodium falciparum dataset for predicting Cryptosporidium parvum PPIs. A high-confidence Cryptosporidium parvum PPI network was identified by conjoining interolog mapping, domain-domain interaction-based inference, and the RF classifier. Some detected hub proteins and functional modules provided clues for an in-depth biological understanding of Cryptosporidium parvum.
As one of the most studied Apicomplexan parasite Cryptosporidium, Cryptosporidium parvum (C. parvum) causes worldwide serious diarrhea disease cryptosporidiosis, which can be deadly to immunodeficiency individuals, newly born children, and animals. Proteome-wide identification of protein–protein interactions (PPIs) has proven valuable in the systematic understanding of the genome-phenome relationship. However, the PPIs of C. parvum are largely unknown because of the limited experimental studies carried out. Therefore, we took full advantage of three bioinformatics methods, i.e., interolog mapping (IM), domain-domain interaction (DDI)-based inference, and machine learning (ML) method, to jointly predict PPIs of C. parvum. Due to the lack of experimental PPIs of C. parvum, we used the PPI data of Plasmodium falciparum (P. falciparum), which owned the largest number of PPIs in Apicomplexa, to train an ML model to infer C. parvum PPIs. We utilized consistent results of these three methods as the predicted high-confidence PPI network, which contains 4,578 PPIs covering 554 proteins. To further explore the biological significance of the constructed PPI network, we also conducted essential network and protein functional analysis, mainly focusing on hub proteins and functional modules. We anticipate the constructed PPI network can become an important data resource to accelerate the functional genomics studies of C. parvum as well as offer new hints to the target discovery in developing drugs/vaccines.
Collapse
|
23
|
Selvaggio G, Cristellon S, Marchetti L. A Novel Hybrid Logic-ODE Modeling Approach to Overcome Knowledge Gaps. Front Mol Biosci 2022; 8:760077. [PMID: 34988115 PMCID: PMC8721169 DOI: 10.3389/fmolb.2021.760077] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 11/09/2021] [Indexed: 11/13/2022] Open
Abstract
Mathematical modeling allows using different formalisms to describe, investigate, and understand biological processes. However, despite the advent of high-throughput experimental techniques, quantitative information is still a challenge when looking for data to calibrate model parameters. Furthermore, quantitative formalisms must cope with stiffness and tractability problems, more so if used to describe multicellular systems. On the other hand, qualitative models may lack the proper granularity to describe the underlying kinetic processes. We propose a hybrid modeling approach that integrates ordinary differential equations and logical formalism to describe distinct biological layers and their communication. We focused on a multicellular system as a case study by applying the hybrid formalism to the well-known Delta-Notch signaling pathway. We used a differential equation model to describe the intracellular pathways while the cell-cell interactions were defined by logic rules. The hybrid approach herein employed allows us to combine the pros of different modeling techniques by overcoming the lack of quantitative information with a qualitative description that discretizes activation and inhibition processes, thus avoiding complexity.
Collapse
Affiliation(s)
- Gianluca Selvaggio
- Piazza Manifattura, Fondazione The Microsoft Research-University of Trento Centre for Computational and Systems Biology (COSBI), Rovereto, Italy
| | - Serena Cristellon
- Piazza Manifattura, Fondazione The Microsoft Research-University of Trento Centre for Computational and Systems Biology (COSBI), Rovereto, Italy.,Department of Mathematics, University of Trento, Trento, Italy
| | - Luca Marchetti
- Piazza Manifattura, Fondazione The Microsoft Research-University of Trento Centre for Computational and Systems Biology (COSBI), Rovereto, Italy.,Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, Trento, Italy
| |
Collapse
|
24
|
Sharma D, Sharma A, Singh B, Kumar S, Verma S. Neglected scrub typhus: An updated review with a focus on omics technologies. ASIAN PAC J TROP MED 2022. [DOI: 10.4103/1995-7645.364003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
|
25
|
Helmy M, Agrawal R, Ali J, Soudy M, Bui TT, Selvarajoo K. GeneCloudOmics: A Data Analytic Cloud Platform for High-Throughput Gene Expression Analysis. FRONTIERS IN BIOINFORMATICS 2021; 1:693836. [PMID: 36303746 PMCID: PMC9581002 DOI: 10.3389/fbinf.2021.693836] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 10/14/2021] [Indexed: 11/18/2022] Open
Abstract
Gene expression profiling techniques, such as DNA microarray and RNA-Sequencing, have provided significant impact on our understanding of biological systems. They contribute to almost all aspects of biomedical research, including studying developmental biology, host-parasite relationships, disease progression and drug effects. However, the high-throughput data generations present challenges for many wet experimentalists to analyze and take full advantage of such rich and complex data. Here we present GeneCloudOmics, an easy-to-use web server for high-throughput gene expression analysis that extends the functionality of our previous ABioTrans with several new tools, including protein datasets analysis, and a web interface. GeneCloudOmics allows both microarray and RNA-Seq data analysis with a comprehensive range of data analytics tools in one package that no other current standalone software or web-based tool can do. In total, GeneCloudOmics provides the user access to 23 different data analytical and bioinformatics tasks including reads normalization, scatter plots, linear/non-linear correlations, PCA, clustering (hierarchical, k-means, t-SNE, SOM), differential expression analyses, pathway enrichments, evolutionary analyses, pathological analyses, and protein-protein interaction (PPI) identifications. Furthermore, GeneCloudOmics allows the direct import of gene expression data from the NCBI Gene Expression Omnibus database. The user can perform all tasks rapidly through an intuitive graphical user interface that overcomes the hassle of coding, installing tools/packages/libraries and dealing with operating systems compatibility and version issues, complications that make data analysis tasks challenging for biologists. Thus, GeneCloudOmics is a one-stop open-source tool for gene expression data analysis and visualization. It is freely available at http://combio-sifbi.org/GeneCloudOmics.
Collapse
Affiliation(s)
- Mohamed Helmy
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- Department of Computer Science, Lakehead University, Thunder Bay, ON, Canada
| | - Rahul Agrawal
- Department of Geology and Geophysics, Indian Institute of Technology (IIT) Kharagpur, Kharagpur, India
| | - Javed Ali
- Department of Geology and Geophysics, Indian Institute of Technology (IIT) Kharagpur, Kharagpur, India
| | - Mohamed Soudy
- Proteomics and Metabolomics Unit, Children Cancer Hospital (CCHE-57357), Cairo, Egypt
| | - Thuy Tien Bui
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Kumar Selvarajoo
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- Singapore Institute of Food and Biotechnology Innovation (SIFBI), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore (NUS), Singapore, Singapore
- *Correspondence: Kumar Selvarajoo,
| |
Collapse
|
26
|
Liu L, Gao H, Zaikin A, Chen S. Unraveling Aβ-Mediated Multi-Pathway Calcium Dynamics in Astrocytes: Implications for Alzheimer's Disease Treatment From Simulations. Front Physiol 2021; 12:767892. [PMID: 34777023 PMCID: PMC8581622 DOI: 10.3389/fphys.2021.767892] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 10/08/2021] [Indexed: 02/02/2023] Open
Abstract
The accumulation of amyloid β peptide (Aβ) in the brain is hypothesized to be the major factor driving Alzheimer's disease (AD) pathogenesis. Mounting evidence suggests that astrocytes are the primary target of Aβ neurotoxicity. Aβ is known to interfere with multiple calcium fluxes, thus disrupting the calcium homeostasis regulation of astrocytes, which are likely to produce calcium oscillations. Ca2+ dyshomeostasis has been observed to precede the appearance of clinical symptoms of AD; however, it is experimentally very difficult to investigate the interactions of many mechanisms. Given that Ca2+ disruption is ubiquitously involved in AD progression, it is likely that focusing on Ca2+ dysregulation may serve as a potential therapeutic approach to preventing or treating AD, while current hypotheses concerning AD have so far failed to yield curable therapies. For this purpose, we derive and investigate a concise mathematical model for Aβ-mediated multi-pathway astrocytic intracellular Ca2+ dynamics. This model accounts for how Aβ affects various fluxes contributions through voltage-gated calcium channels, Aβ-formed channels and ryanodine receptors. Bifurcation analysis of Aβ level, which reflected the corresponding progression of the disease, revealed that Aβ significantly induced the increasing [Ca2+] i and frequency of calcium oscillations. The influence of inositol 1,4,5-trisphosphate production (IP3) is also investigated in the presence of Aβ as well as the impact of changes in resting membrane potential. In turn, the Ca2+ flux can be considerably changed by exerting specific interventions, such as ion channel blockers or receptor antagonists. By doing so, a "combination therapy" targeting multiple pathways simultaneously has finally been demonstrated to be more effective. This study helps to better understand the effect of Aβ, and our findings provide new insight into the treatment of AD.
Collapse
Affiliation(s)
- Langzhou Liu
- Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China.,MoE Key Laboratory for Biomedical Photonics, School of Engineering Sciences, Huazhong University of Science and Technology, Wuhan, China
| | - Huayi Gao
- Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China.,MoE Key Laboratory for Biomedical Photonics, School of Engineering Sciences, Huazhong University of Science and Technology, Wuhan, China
| | - Alexey Zaikin
- Institute of Information Technologies, Mathematics and Mechanics, Lobachevsky State University of Nizhny Novgorod, Nizhny Novgorod, Russia.,Institute for Women's Health and Department of Mathematics, University College London, London, United Kingdom.,World-Class Research Center "Digital Biodesign and Personalized Healthcare", Sechenov First Moscow State Medical University, Moscow, Russia
| | - Shangbin Chen
- Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan, China.,MoE Key Laboratory for Biomedical Photonics, School of Engineering Sciences, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
27
|
Way GP, Greene CS, Carninci P, Carvalho BS, de Hoon M, Finley SD, Gosline SJC, Lȇ Cao KA, Lee JSH, Marchionni L, Robine N, Sindi SS, Theis FJ, Yang JYH, Carpenter AE, Fertig EJ. A field guide to cultivating computational biology. PLoS Biol 2021; 19:e3001419. [PMID: 34618807 PMCID: PMC8525744 DOI: 10.1371/journal.pbio.3001419] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Revised: 10/19/2021] [Indexed: 11/18/2022] Open
Abstract
Evolving in sync with the computation revolution over the past 30 years, computational biology has emerged as a mature scientific field. While the field has made major contributions toward improving scientific knowledge and human health, individual computational biology practitioners at various institutions often languish in career development. As optimistic biologists passionate about the future of our field, we propose solutions for both eager and reluctant individual scientists, institutions, publishers, funding agencies, and educators to fully embrace computational biology. We believe that in order to pave the way for the next generation of discoveries, we need to improve recognition for computational biologists and better align pathways of career success with pathways of scientific progress. With 10 outlined steps, we call on all adjacent fields to move away from the traditional individual, single-discipline investigator research model and embrace multidisciplinary, data-driven, team science.
Collapse
Affiliation(s)
- Gregory P. Way
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- Center for Health AI, University of Colorado School of Medicine, Aurora, Colorado, United States of America
| | - Casey S. Greene
- Center for Health AI, University of Colorado School of Medicine, Aurora, Colorado, United States of America
| | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences Yokohama, Kanagawa, Japan
- Human Technopole, Milan, Italy
| | - Benilton S. Carvalho
- Department of Statistics, Institute of Mathematics, Statistics and Scientific Computing, University of Campinas, Campinas, Brazil
| | - Michiel de Hoon
- RIKEN Center for Integrative Medical Sciences Yokohama, Kanagawa, Japan
| | - Stacey D. Finley
- Department of Biomedical Engineering, Quantitative and Computational Biology, and Chemical Engineering & Materials Science, University of Southern California, Los Angeles, California, United States of America
| | - Sara J. C. Gosline
- Pacific Northwest National Laboratory, Seattle, Washington, United States of America
| | - Kim-Anh Lȇ Cao
- Melbourne Integrative Genomics, School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia
| | - Jerry S. H. Lee
- Ellison Institute and Departments of Medicine/Oncology, Chemical Engineering, and Material Sciences, University of Southern California, Los Angeles, California, United States of America
| | - Luigi Marchionni
- Department of Pathology and Laboratory Medicine, Weill-Cornell Medicine, New York, New York, United States of America
| | - Nicolas Robine
- Computational Biology Lab, New York Genome Center, New York, New York, United States of America
| | - Suzanne S. Sindi
- Department of Applied Mathematics, University of California Merced, Merced, California, United States of America
| | - Fabian J. Theis
- Institute of Computational Biology, Helmholtz Center Munich and Department of Mathematics, Technical University of Munich, Munich, Germany
| | - Jean Y. H. Yang
- Charles Perkins Centre and School of Mathematics and Statistics, The University of Sydney, Australia
| | - Anne E. Carpenter
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Elana J. Fertig
- Convergence Institute, Departments of Oncology, Biomedical Engineering, and Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, Maryland, United States of America
| |
Collapse
|
28
|
Padhi AK, Rath SL, Tripathi T. Accelerating COVID-19 Research Using Molecular Dynamics Simulation. J Phys Chem B 2021; 125:9078-9091. [PMID: 34319118 PMCID: PMC8340580 DOI: 10.1021/acs.jpcb.1c04556] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 07/12/2021] [Indexed: 12/14/2022]
Abstract
The COVID-19 pandemic has emerged as a global medico-socio-economic disaster. Given the lack of effective therapeutics against SARS-CoV-2, scientists are racing to disseminate suggestions for rapidly deployable therapeutic options, including drug repurposing and repositioning strategies. Molecular dynamics (MD) simulations have provided the opportunity to make rational scientific breakthroughs in a time of crisis. Advancements in these technologies in recent years have become an indispensable tool for scientists studying protein structure, function, dynamics, interactions, and drug discovery. Integrating the structural data obtained from high-resolution methods with MD simulations has helped in comprehending the process of infection and pathogenesis, as well as the SARS-CoV-2 maturation in host cells, in a short duration of time. It has also guided us to identify and prioritize drug targets and new chemical entities, and to repurpose drugs. Here, we discuss how MD simulation has been explored by the scientific community to accelerate and guide translational research on SARS-CoV-2 in the past year. We have also considered future research directions for researchers, where MD simulations can help fill the existing gaps in COVID-19 research.
Collapse
Affiliation(s)
- Aditya K. Padhi
- Laboratory for Structural Bioinformatics, Center for
Biosystems Dynamics Research, RIKEN, 1-7-22 Suehiro, Tsurumi,
Yokohama, Kanagawa 230-0045, Japan
| | - Soumya Lipsa Rath
- Department of Biotechnology, National
Institute of Technology, Warangal, Telangana 506004,
India
| | - Timir Tripathi
- Molecular and Structural Biophysics Laboratory,
Department of Biochemistry, North-Eastern Hill University,
Shillong 793022, India
| |
Collapse
|
29
|
Lortie CJ. The early bird gets the return: The benefits of publishing your data sooner. Ecol Evol 2021; 11:10736-10740. [PMID: 34429876 PMCID: PMC8366834 DOI: 10.1002/ece3.7853] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 06/08/2021] [Accepted: 06/09/2021] [Indexed: 12/02/2022] Open
Abstract
The benefits of publishing your data sooner versus later in Ecology and Evolution.
Collapse
|
30
|
Chen Q, Leaman R, Allot A, Luo L, Wei CH, Yan S, Lu Z. Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing. Annu Rev Biomed Data Sci 2021; 4:313-339. [PMID: 34465169 DOI: 10.1146/annurev-biodatasci-021821-061045] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The COVID-19 (coronavirus disease 2019) pandemic has had a significant impact on society, both because of the serious health effects of COVID-19 and because of public health measures implemented to slow its spread. Many of these difficulties are fundamentally information needs; attempts to address these needs have caused an information overload for both researchers and the public. Natural language processing (NLP)-the branch of artificial intelligence that interprets human language-can be applied to address many of the information needs made urgent by the COVID-19 pandemic. This review surveys approximately 150 NLP studies and more than 50 systems and datasets addressing the COVID-19 pandemic. We detail work on four core NLP tasks: information retrieval, named entity recognition, literature-based discovery, and question answering. We also describe work that directly addresses aspects of the pandemic through four additional tasks: topic modeling, sentiment and emotion analysis, caseload forecasting, and misinformation detection. We conclude by discussing observable trends and remaining challenges.
Collapse
Affiliation(s)
- Qingyu Chen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA;
| | - Robert Leaman
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA;
| | - Alexis Allot
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA;
| | - Ling Luo
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA;
| | - Chih-Hsuan Wei
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA;
| | - Shankai Yan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA;
| | - Zhiyong Lu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA;
| |
Collapse
|
31
|
Lydic R, Baghdoyan HA. Prefrontal Cortex Metabolome Is Modified by Opioids, Anesthesia, and Sleep. Physiology (Bethesda) 2021; 36:203-219. [PMID: 34159803 DOI: 10.1152/physiol.00043.2020] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Obtundation of wakefulness caused by opioids and loss of wakefulness caused by anesthetics and sleep significantly alter concentrations of molecules comprising the prefrontal cortex (PFC) metabolome. Quantifying state-selective changes in the PFC metabolome is essential for advancing functional metabolomics. Diverse functions of the PFC suggest the PFC metabolome as a potential therapeutic entry point for countermeasures to state-selective autonomic dysfunction.
Collapse
Affiliation(s)
- Ralph Lydic
- Psychology, University of Tennessee, Knoxville, Tennessee.,Oak Ridge National Laboratory, Oak Ridge, Tennessee
| | - Helen A Baghdoyan
- Psychology, University of Tennessee, Knoxville, Tennessee.,Oak Ridge National Laboratory, Oak Ridge, Tennessee
| |
Collapse
|
32
|
Schweizer RM, Saarman N, Ramstad KM, Forester BR, Kelley JL, Hand BK, Malison RL, Ackiss AS, Watsa M, Nelson TC, Beja-Pereira A, Waples RS, Funk WC, Luikart G. Big Data in Conservation Genomics: Boosting Skills, Hedging Bets, and Staying Current in the Field. J Hered 2021; 112:313-327. [PMID: 33860294 DOI: 10.1093/jhered/esab019] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 04/13/2021] [Indexed: 02/07/2023] Open
Abstract
A current challenge in the fields of evolutionary, ecological, and conservation genomics is balancing production of large-scale datasets with additional training often required to handle such datasets. Thus, there is an increasing need for conservation geneticists to continually learn and train to stay up-to-date through avenues such as symposia, meetings, and workshops. The ConGen meeting is a near-annual workshop that strives to guide participants in understanding population genetics principles, study design, data processing, analysis, interpretation, and applications to real-world conservation issues. Each year of ConGen gathers a diverse set of instructors, students, and resulting lectures, hands-on sessions, and discussions. Here, we summarize key lessons learned from the 2019 meeting and more recent updates to the field with a focus on big data in conservation genomics. First, we highlight classical and contemporary issues in study design that are especially relevant to working with big datasets, including the intricacies of data filtering. We next emphasize the importance of building analytical skills and simulating data, and how these skills have applications within and outside of conservation genetics careers. We also highlight recent technological advances and novel applications to conservation of wild populations. Finally, we provide data and recommendations to support ongoing efforts by ConGen organizers and instructors-and beyond-to increase participation of underrepresented minorities in conservation and eco-evolutionary sciences. The future success of conservation genetics requires both continual training in handling big data and a diverse group of people and approaches to tackle key issues, including the global biodiversity-loss crisis.
Collapse
Affiliation(s)
- Rena M Schweizer
- Division of Biological Sciences, University of Montana, Missoula, MT
| | - Norah Saarman
- Department of Biology, Utah State University, Logan, UT
| | - Kristina M Ramstad
- Department of Biology and Geology, University of South Carolina Aiken, Aiken, SC
| | | | - Joanna L Kelley
- School of Biological Sciences, Washington State University, Pullman, WA
| | - Brian K Hand
- Division of Biological Sciences, University of Montana, Missoula, MT.,Flathead Lake Biological Station, University of Montana, Polson, MT
| | - Rachel L Malison
- Flathead Lake Biological Station, University of Montana, Polson, MT
| | - Amanda S Ackiss
- Wisconsin Cooperative Fishery Research Unit, University of Wisconsin Stevens Point, Stevens Point, WI
| | | | | | - Albano Beja-Pereira
- Centro de Investigação em Biodiversidade e Recursos Genéticos (CIBIO-UP), InBIO, Universidade do Porto, Vairão, Portugal.,DGAOT, Faculty of Sciences, University of Porto, Porto, Portugal.,Sustainable Agrifood Production Research Centre (GreenUPorto), Faculty of Sciences, University of Porto, Porto, Portugal
| | - Robin S Waples
- Northwest Fisheries Science Center, NOAA Fisheries, Seattle, WA
| | - W Chris Funk
- Department of Biology, Graduate Degree Program in Ecology, Colorado State University, Fort Collins, CO
| | - Gordon Luikart
- Division of Biological Sciences, University of Montana, Missoula, MT.,Flathead Lake Biological Station, University of Montana, Polson, MT
| |
Collapse
|
33
|
Abstract
Cell imaging has entered the 'Big Data' era. New technologies in light microscopy and molecular biology have led to an explosion in high-content, dynamic and multidimensional imaging data. Similar to the 'omics' fields two decades ago, our current ability to process, visualize, integrate and mine this new generation of cell imaging data is becoming a critical bottleneck in advancing cell biology. Computation, traditionally used to quantitatively test specific hypotheses, must now also enable iterative hypothesis generation and testing by deciphering hidden biologically meaningful patterns in complex, dynamic or high-dimensional cell image data. Data science is uniquely positioned to aid in this process. In this Perspective, we survey the rapidly expanding new field of data science in cell imaging. Specifically, we highlight how data science tools are used within current image analysis pipelines, propose a computation-first approach to derive new hypotheses from cell image data, identify challenges and describe the next frontiers where we believe data science will make an impact. We also outline steps to ensure broad access to these powerful tools - democratizing infrastructure availability, developing sensitive, robust and usable tools, and promoting interdisciplinary training to both familiarize biologists with data science and expose data scientists to cell imaging.
Collapse
Affiliation(s)
- Meghan K Driscoll
- Department of Bioinformatics, UT Southwestern Medical Center, Dallas, TX 75390, USA
| | - Assaf Zaritsky
- Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva 84105, Israel
| |
Collapse
|
34
|
Abstract
The way in which computer code is perceived and used in biological research has been a source of some controversy and confusion, and has resulted in sub-optimal outcomes related to reproducibility, scalability and productivity. We suggest that the confusion is due in part to a misunderstanding of the function of code when applied to the life sciences. Code has many roles, and in this paper we present a three-dimensional taxonomy to classify those roles and map them specifically to the life sciences. We identify a "sweet spot" in the taxonomy-a convergence where bioinformaticians should concentrate their efforts in order to derive the most value from the time they spend using code. We suggest the use of the "inverse Conway maneuver" to shape a research team so as to allow dedicated software engineers to interface with researchers working in this "sweet spot." We conclude that in order to address current issues in the use of software in life science research such as reproducibility and scalability, the field must reevaluate its relationship with software engineering, and adapt its research structures to overcome current issues in bioinformatics such as reproducibility, scalability and productivity.
Collapse
Affiliation(s)
- Brendan Lawlor
- Department of Computer Science, Munster Technological University, Cork, Ireland
| | - Roy D Sleator
- Department of Biological Sciences, Munster Technological University, Cork, Ireland
| |
Collapse
|
35
|
Richter CF, Lortie CJ, Kelly TL, Filazzola A, Nunes KA, Sarkar R. Online but not remote: Adapting field-based ecology laboratories for online learning. Ecol Evol 2021; 11:3616-3624. [PMID: 33898014 PMCID: PMC8057323 DOI: 10.1002/ece3.7008] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 09/14/2020] [Accepted: 10/16/2020] [Indexed: 11/06/2022] Open
Abstract
Teaching ecology effectively and experientially has become more challenging for at least two reasons today. Most experiences of our students are urban, and we now face the near immediate and continuing need to deliver courses (either partially or wholly) online because of COVID-19. Therefore, providing a learning experience that connects students to their environment within an ecological framework remains crucial and perhaps therapeutic to mental health. Here, we describe how prior to the pandemic we adapted our field-based laboratories to include data collection, analysis, and interpretation, along with the development of a citizen-science approach for online delivery. This design is simple to implement, does not require extensive work, and maintains the veracity of original learning outcomes. Collaboration online following field data collection in ecology courses within the context of cities offers further options to adapt to student experience levels, resource availability, and accessibility, as well as bringing instructors and students together to build an open well-curated data set that can be used in ecology courses where no laboratories are available. Finally, it promotes an open collaboration among ecology instructors that can drive lasting conversations about ecology curriculum.
Collapse
Affiliation(s)
| | | | | | | | - Krystal A. Nunes
- Department of BiologyUniversity of Toronto MississaugaMississaugaONCanada
| | - Raani Sarkar
- Department of BiologyUniversity of Toronto MississaugaMississaugaONCanada
| |
Collapse
|
36
|
Thessen AE, Bogdan P, Patterson DJ, Casey TM, Hinojo-Hinojo C, de Lange O, Haendel MA. From Reductionism to Reintegration: Solving society's most pressing problems requires building bridges between data types across the life sciences. PLoS Biol 2021; 19:e3001129. [PMID: 33770077 PMCID: PMC7997011 DOI: 10.1371/journal.pbio.3001129] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Decades of reductionist approaches in biology have achieved spectacular progress, but the proliferation of subdisciplines, each with its own technical and social practices regarding data, impedes the growth of the multidisciplinary and interdisciplinary approaches now needed to address pressing societal challenges. Data integration is key to a reintegrated biology able to address global issues such as climate change, biodiversity loss, and sustainable ecosystem management. We identify major challenges to data integration and present a vision for a "Data as a Service"-oriented architecture to promote reuse of data for discovery. The proposed architecture includes standards development, new tools and services, and strategies for career-development and sustainability.
Collapse
Affiliation(s)
- Anne E. Thessen
- Department of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, United States of America
- * E-mail:
| | - Paul Bogdan
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, United States of America
| | | | - Theresa M. Casey
- Department of Animal Sciences, Purdue University, West Lafayette, Indiana, United States of America
| | - César Hinojo-Hinojo
- Department of Earth System Science, University of California, Irvine, California, United States of America
| | - Orlando de Lange
- Department of Electrical Engineering, University of Washington, Seattle, Washington, United States of America
| | - Melissa A. Haendel
- Department of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, United States of America
| |
Collapse
|
37
|
Affiliation(s)
- Philip E. Bourne
- University of Virginia, Charlottesville, Virginia, United States of America
- * E-mail:
| |
Collapse
|
38
|
Jafari M, Guan Y, Wedge DC, Ansari-Pour N. Re-evaluating experimental validation in the Big Data Era: a conceptual argument. Genome Biol 2021; 22:71. [PMID: 33627141 PMCID: PMC7903713 DOI: 10.1186/s13059-021-02292-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
| | - Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - David C Wedge
- Manchester Cancer Research Centre, The University of Manchester, Manchester, M20 4GJ, UK.
| | - Naser Ansari-Pour
- Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LF, UK.
| |
Collapse
|
39
|
Chitoiu L, Dobranici A, Gherghiceanu M, Dinescu S, Costache M. Multi-Omics Data Integration in Extracellular Vesicle Biology-Utopia or Future Reality? Int J Mol Sci 2020; 21:ijms21228550. [PMID: 33202771 PMCID: PMC7697477 DOI: 10.3390/ijms21228550] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 11/10/2020] [Accepted: 11/11/2020] [Indexed: 12/15/2022] Open
Abstract
Extracellular vesicles (EVs) are membranous structures derived from the endosomal system or generated by plasma membrane shedding. Due to their composition of DNA, RNA, proteins, and lipids, EVs have garnered a lot of attention as an essential mechanism of cell-to-cell communication, with various implications in physiological and pathological processes. EVs are not only a highly heterogeneous population by means of size and biogenesis, but they are also a source of diverse, functionally rich biomolecules. Recent advances in high-throughput processing of biological samples have facilitated the development of databases comprised of characteristic genomic, transcriptomic, proteomic, metabolomic, and lipidomic profiles for EV cargo. Despite the in-depth approach used to map functional molecules in EV-mediated cellular cross-talk, few integrative methods have been applied to analyze the molecular interplay in these targeted delivery systems. New perspectives arise from the field of systems biology, where accounting for heterogeneity may lead to finding patterns in an apparently random pool of data. In this review, we map the biological and methodological causes of heterogeneity in EV multi-omics data and present current applications or possible statistical methods for integrating such data while keeping track of the current bottlenecks in the field.
Collapse
Affiliation(s)
- Leona Chitoiu
- Ultrastructural Pathology and Bioimaging Laboratory, ‘Victor Babeș’ National Institute of Pathology, Bucharest 050096, Romania; (L.C.); (M.G.)
| | - Alexandra Dobranici
- Department of Biochemistry and Molecular Biology, University of Bucharest, Bucharest 050095, Romania; (A.D.); (M.C.)
| | - Mihaela Gherghiceanu
- Ultrastructural Pathology and Bioimaging Laboratory, ‘Victor Babeș’ National Institute of Pathology, Bucharest 050096, Romania; (L.C.); (M.G.)
- Department of Cellular, Molecular Biology and Histology, ‘Carol Davila’ University of Medicine and Pharmacy, Bucharest 050474, Romania
| | - Sorina Dinescu
- Department of Biochemistry and Molecular Biology, University of Bucharest, Bucharest 050095, Romania; (A.D.); (M.C.)
- Research Institute of the University of Bucharest, University of Bucharest, Bucharest 050663, Romania
- Correspondence:
| | - Marieta Costache
- Department of Biochemistry and Molecular Biology, University of Bucharest, Bucharest 050095, Romania; (A.D.); (M.C.)
- Research Institute of the University of Bucharest, University of Bucharest, Bucharest 050663, Romania
| |
Collapse
|
40
|
Lortie CJ. Online conferences for better learning. Ecol Evol 2020; 10:12442-12449. [PMID: 33250984 PMCID: PMC7679531 DOI: 10.1002/ece3.6923] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 09/21/2020] [Accepted: 09/30/2020] [Indexed: 02/01/2023] Open
Abstract
Conferences provide an invaluable set of opportunities for professional development. Online, virtual, and distributed conferences do not necessarily mean less opportunity for growth and innovation in science but varied and novel options for communicating the scientific process. Open science and many existing tools are in place in the practice of contemporary ecology and evolution to provide latitude for a much broader scope of sharing and thus learning from conferences. A brief overview of the science supporting online conferences and a highlight of some of the open science concepts in ecology and evolution are provided here to enable better learning through better planning for online conferences.
Collapse
|
41
|
Li B, Cao Y, Westhof E, Miao Z. Advances in RNA 3D Structure Modeling Using Experimental Data. Front Genet 2020; 11:574485. [PMID: 33193680 PMCID: PMC7649352 DOI: 10.3389/fgene.2020.574485] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 09/02/2020] [Indexed: 12/26/2022] Open
Abstract
RNA is a unique bio-macromolecule that can both record genetic information and perform biological functions in a variety of molecular processes, including transcription, splicing, translation, and even regulating protein function. RNAs adopt specific three-dimensional conformations to enable their functions. Experimental determination of high-resolution RNA structures using x-ray crystallography is both laborious and demands expertise, thus, hindering our comprehension of RNA structural biology. The computational modeling of RNA structure was a milestone in the birth of bioinformatics. Although computational modeling has been greatly improved over the last decade showing many successful cases, the accuracy of such computational modeling is not only length-dependent but also varies according to the complexity of the structure. To increase credibility, various experimental data were integrated into computational modeling. In this review, we summarize the experiments that can be integrated into RNA structure modeling as well as the computational methods based on these experimental data. We also demonstrate how computational modeling can help the experimental determination of RNA structure. We highlight the recent advances in computational modeling which can offer reliable structure models using high-throughput experimental data.
Collapse
Affiliation(s)
- Bing Li
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Yang Cao
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, Strasbourg, France
| | - Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence, Department of Anesthesiology, Shanghai Fourth People’s Hospital Affiliated to Tongji University School of Medicine, Shanghai, China
- Newcastle Fibrosis Research Group, Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
| |
Collapse
|
42
|
Leaman R, Wei CH, Allot A, Lu Z. Ten tips for a text-mining-ready article: How to improve automated discoverability and interpretability. PLoS Biol 2020; 18:e3000716. [PMID: 32479517 PMCID: PMC7289435 DOI: 10.1371/journal.pbio.3000716] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Revised: 06/11/2020] [Indexed: 12/22/2022] Open
Abstract
Data-driven research in biomedical science requires structured, computable data. Increasingly, these data are created with support from automated text mining. Text-mining tools have rapidly matured: although not perfect, they now frequently provide outstanding results. We describe 10 straightforward writing tips—and a web tool, PubReCheck—guiding authors to help address the most common cases that remain difficult for text-mining tools. We anticipate these guides will help authors’ work be found more readily and used more widely, ultimately increasing the impact of their work and the overall benefit to both authors and readers. PubReCheck is available at http://www.ncbi.nlm.nih.gov/research/pubrecheck. Your published research is already being processed with automated tools, and text mining will become more common; this Community Page article describes how you can help these tools process your work more accurately, including a web tool, PubReCheck.
Collapse
Affiliation(s)
- Robert Leaman
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Chih-Hsuan Wei
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Alexis Allot
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, Maryland, United States of America
| | - Zhiyong Lu
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, Maryland, United States of America
- * E-mail:
| |
Collapse
|
43
|
Lortie CJ, Braun J, Filazzola A, Miguel F. A checklist for choosing between R packages in ecology and evolution. Ecol Evol 2020; 10:1098-1105. [PMID: 32076500 PMCID: PMC7029065 DOI: 10.1002/ece3.5970] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Revised: 11/18/2019] [Accepted: 12/05/2019] [Indexed: 11/12/2022] Open
Abstract
The open source and free programming language R is a phenomenal mechanism to address a multiplicity of challenges in ecology and evolution. It is also a complex ecosystem because of the diversity of solutions available to the analyst.Packages for R enhance and specialize the capacity to explore both niche data/experiments and more common needs. However, the paradox of choice or how we select between many seemingly similar options can be overwhelming and lead to different potential outcomes.There is extensive choice in ecology and evolution between packages for both fundamental statistics and for more specialized domain-level analyses.Here, we provide a checklist to inform these decisions based on the principles of resilience, need, and integration with scientific workflows for evidence.It is important to explore choices in any analytical coding environment-not just R-for solutions to challenges in ecology and evolution, and document this process because it advances reproducible science, promotes a deeper understand of the scientific evidence, and ensures that the outcomes are correct, representative, and robust.
Collapse
Affiliation(s)
- Christopher J. Lortie
- Department of BiologyYork UniversityTorontoONCanada
- The National Center for Ecological Analysis and SynthesisUCSBSanta BarbaraCAUSA
| | - Jenna Braun
- Department of BiologyYork UniversityTorontoONCanada
| | | | - Florencia Miguel
- National Scientific and Technical Research CouncilCONICETBuenos AiresArgentina
| |
Collapse
|
44
|
Brito JJ, Mosqueiro T, Rotman J, Xue V, Chapski DJ, la Hoz JD, Matias P, Martin LS, Zelikovsky A, Pellegrini M, Mangul S. Telescope: an interactive tool for managing large-scale analysis from mobile devices. Gigascience 2020; 9:giz163. [PMID: 31972019 PMCID: PMC6977584 DOI: 10.1093/gigascience/giz163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Revised: 11/26/2019] [Accepted: 12/19/2019] [Indexed: 11/27/2022] Open
Abstract
BACKGROUND In today's world of big data, computational analysis has become a key driver of biomedical research. High-performance computational facilities are capable of processing considerable volumes of data, yet often lack an easy-to-use interface to guide the user in supervising and adjusting bioinformatics analysis via a tablet or smartphone. RESULTS To address this gap we proposed Telescope, a novel tool that interfaces with high-performance computational clusters to deliver an intuitive user interface for controlling and monitoring bioinformatics analyses in real-time. By leveraging last generation technology now ubiquitous to most researchers (such as smartphones), Telescope delivers a friendly user experience and manages conectivity and encryption under the hood. CONCLUSIONS Telescope helps to mitigate the digital divide between wet and computational laboratories in contemporary biology. By delivering convenience and ease of use through a user experience not relying on expertise with computational clusters, Telescope can help researchers close the feedback loop between bioinformatics and experimental work with minimal impact on the performance of computational tools. Telescope is freely available at https://github.com/Mangul-Lab-USC/telescope.
Collapse
Affiliation(s)
- Jaqueline J Brito
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, 1985 Zonal Avenue, Los Angeles, CA 90089-9121, USA
| | - Thiago Mosqueiro
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, 611 Charles E. Young Drive East, Los Angeles, CA 90095, USA
| | - Jeremy Rotman
- Department of Computer Science, University of California, Los Angeles, 404 Westwood Plaza, Los Angeles, CA 90095, USA
| | - Victor Xue
- Department of Computer Science, University of California, Los Angeles, 404 Westwood Plaza, Los Angeles, CA 90095, USA
| | - Douglas J Chapski
- Department of Anesthesiology, David Geffen School of Medicine at UCLA, 650 Charles E. Young Drive, Los Angeles, CA 90095, USA
| | - Juan De la Hoz
- Center for Neurobehavioral Genetics, University of California Los Angeles, 695 Charles E Young Dr S, Los Angeles, CA 90095, USA
| | - Paulo Matias
- Department of Computer Science, Federal University of São Carlos, km 325 Rod. Washington Luis, São Carlos, SP 13565–905, Brazil
| | - Lana S Martin
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, 1985 Zonal Avenue, Los Angeles, CA 90089-9121, USA
| | - Alex Zelikovsky
- Department of Computer Science, Georgia State University, 1 Park Place, Atlanta, GA 30303, USA
- The Laboratory of Bioinformatics, I.M. Sechenov First Moscow State Medical University, Moscow 119991, Russia
| | - Matteo Pellegrini
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, 611 Charles E. Young Drive East, Los Angeles, CA 90095, USA
| | - Serghei Mangul
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, 1985 Zonal Avenue, Los Angeles, CA 90089-9121, USA
| |
Collapse
|
45
|
Mangul S, Mosqueiro T, Abdill RJ, Duong D, Mitchell K, Sarwal V, Hill B, Brito J, Littman RJ, Statz B, Lam AKM, Dayama G, Grieneisen L, Martin LS, Flint J, Eskin E, Blekhman R. Challenges and recommendations to improve the installability and archival stability of omics computational tools. PLoS Biol 2019; 17:e3000333. [PMID: 31220077 PMCID: PMC6605654 DOI: 10.1371/journal.pbio.3000333] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Revised: 07/02/2019] [Indexed: 01/07/2023] Open
Abstract
Developing new software tools for analysis of large-scale biological data is a key component of advancing modern biomedical research. Scientific reproduction of published findings requires running computational tools on data generated by such studies, yet little attention is presently allocated to the installability and archival stability of computational software tools. Scientific journals require data and code sharing, but none currently require authors to guarantee the continuing functionality of newly published tools. We have estimated the archival stability of computational biology software tools by performing an empirical analysis of the internet presence for 36,702 omics software resources published from 2005 to 2017. We found that almost 28% of all resources are currently not accessible through uniform resource locators (URLs) published in the paper they first appeared in. Among the 98 software tools selected for our installability test, 51% were deemed "easy to install," and 28% of the tools failed to be installed at all because of problems in the implementation. Moreover, for papers introducing new software, we found that the number of citations significantly increased when authors provided an easy installation process. We propose for incorporation into journal policy several practical solutions for increasing the widespread installability and archival stability of published bioinformatics software.
Collapse
Affiliation(s)
- Serghei Mangul
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, California, United States of America
| | - Thiago Mosqueiro
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, California, United States of America
| | - Richard J. Abdill
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Dat Duong
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
| | - Keith Mitchell
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
| | - Varuni Sarwal
- Indian Institute of Technology Delhi, Hauz Khas, New Delhi, India
| | - Brian Hill
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
| | - Jaqueline Brito
- Institute of Mathematics and Computer Science, University of São Paulo, São Paulo, Brazil
| | - Russell Jared Littman
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
| | - Benjamin Statz
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
| | - Angela Ka-Mei Lam
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
| | - Gargi Dayama
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Laura Grieneisen
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Lana S. Martin
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, California, United States of America
| | - Jonathan Flint
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California, United States of America
| | - Eleazar Eskin
- Department of Computer Science, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Human Genetics, University of California Los Angeles, Los Angeles, California, United States of America
| | - Ran Blekhman
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, Minnesota, United States of America
- Department of Ecology, Evolution, and Behavior, University of Minnesota, Minnesota, United States of America
| |
Collapse
|
46
|
Lemos A, Lynce I, Monteiro PT. Repairing Boolean logical models from time-series data using Answer Set Programming. Algorithms Mol Biol 2019; 14:9. [PMID: 30962813 PMCID: PMC6434824 DOI: 10.1186/s13015-019-0145-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2017] [Accepted: 03/14/2019] [Indexed: 11/20/2022] Open
Abstract
Background Boolean models of biological signalling-regulatory networks are increasingly used to formally describe and understand complex biological processes. These models may become inconsistent as new data become available and need to be repaired. In the past, the focus has been shed on the inference of (classes of) models given an interaction network and time-series data sets. However, repair of existing models against new data is still in its infancy, where the process is still manually performed and therefore slow and prone to errors. Results In this work, we propose a method with an associated tool to suggest repairs over inconsistent Boolean models, based on a set of atomic repair operations. Answer Set Programming is used to encode the minimal repair problem as a combinatorial optimization problem. In particular, given an inconsistent model, the tool provides the minimal repairs that render the model capable of generating dynamics coherent with a (set of) time-series data set(s), considering either a synchronous or an asynchronous updating scheme. Conclusions The method was validated using known biological models from different species, as well as synthetic models obtained from randomly generated networks. We discuss the method’s limitations regarding each of the updating schemes and the considered minimization algorithm. Electronic supplementary material The online version of this article (10.1186/s13015-019-0145-8) contains supplementary material, which is available to authorized users.
Collapse
|
47
|
Mangul S, Martin LS, Eskin E, Blekhman R. Improving the usability and archival stability of bioinformatics software. Genome Biol 2019; 20:47. [PMID: 30813962 PMCID: PMC6391762 DOI: 10.1186/s13059-019-1649-8] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Implementation of bioinformatics software involves numerous unique challenges; a rigorous standardized approach is needed to examine software tools prior to their publication.
Collapse
Affiliation(s)
- Serghei Mangul
- Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA, 90095, USA. .,Institute for Quantitative and Computational Biosciences, University of California Los Angeles, 611 Charles E. Young Drive East, Los Angeles, CA, 90095, USA.
| | - Lana S Martin
- Institute for Quantitative and Computational Biosciences, University of California Los Angeles, 611 Charles E. Young Drive East, Los Angeles, CA, 90095, USA
| | - Eleazar Eskin
- Department of Computer Science, University of California Los Angeles, 580 Portola Plaza, Los Angeles, CA, 90095, USA.,Department of Human Genetics, University of California Los Angeles, 695 Charles E. Young Drive South, Los Angeles, CA, 90095, USA
| | - Ran Blekhman
- Department of Genetics, Cell Biology and Development, University of Minnesota, 321 Church St SE, Minneapolis, MN, 55455, USA.,Department of Ecology, Evolution, and Behavior, University of Minnesota, 100 Ecology Building, 1987 Upper Buford Cir, Falcon Heights, MN, 55108, USA
| |
Collapse
|
48
|
Guerfali FZ, Laouini D, Boudabous A, Tekaia F. Designing and running an advanced Bioinformatics and genome analyses course in Tunisia. PLoS Comput Biol 2019; 15:e1006373. [PMID: 30689625 PMCID: PMC6349305 DOI: 10.1371/journal.pcbi.1006373] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Genome data, with underlying new knowledge, are accumulating at exponential rate thanks to ever-improving sequencing technologies and the parallel development of dedicated efficient Bioinformatics methods and tools. Advanced Education in Bioinformatics and Genome Analyses is to a large extent not accessible to students in developing countries where endeavors to set up Bioinformatics courses concern most often only basic levels. Here, we report a pioneering pilot experience concerning the design and implementation, from scratch, of a three-months advanced and extensive course in Bioinformatics and Genome Analyses in the Institut Pasteur de Tunis. Most significantly the outcome of the course was upgrading the participants’ skills in Bioinformatics and Genome Analyses to recognized international standards. Here we detail the different steps involved in the implementation of this course as well as the topics covered in the program. The description of this pilot experience might be helpful for the implementation of other similar educational projects, notably in developing countries, aiming to go beyond basics and providing young researchers with high-level skills.
Collapse
Affiliation(s)
- Fatma Z. Guerfali
- Université Tunis El Manar, Tunis, Tunisia
- Institut Pasteur de Tunis, LR11IPT02, Laboratory of Transmission, Control and Immunobiology of Infections (LTCII), Tunis-Belvédère, Tunisia
| | - Dhafer Laouini
- Université Tunis El Manar, Tunis, Tunisia
- Institut Pasteur de Tunis, LR11IPT02, Laboratory of Transmission, Control and Immunobiology of Infections (LTCII), Tunis-Belvédère, Tunisia
| | - Abdellatif Boudabous
- Université Tunis El Manar, Faculté des Sciences de Tunis, Laboratoire Microorganisme et Biomolécules Actives, Campus Universitaire Farhat Heched, El Manar, Tunis, Tunisia
| | - Fredj Tekaia
- Institut Pasteur Paris, 28 rue du Dr Roux, 75724 Paris cedex 15, France
- * E-mail:
| |
Collapse
|
49
|
In-Silico Selection of Aptamer: A Review on the Revolutionary Approach to Understand the Aptamer Design and Interaction Through Computational Chemistry. ACTA ACUST UNITED AC 2019. [DOI: 10.1016/j.matpr.2019.11.185] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
50
|
Zaritsky A. Sharing and reusing cell image data. Mol Biol Cell 2018; 29:1274-1280. [PMID: 29851565 PMCID: PMC5994892 DOI: 10.1091/mbc.e17-10-0606] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Revised: 04/02/2018] [Accepted: 04/06/2018] [Indexed: 01/19/2023] Open
Abstract
The rapid growth in content and complexity of cell image data creates an opportunity for synergy between experimental and computational scientists. Sharing microscopy data enables computational scientists to develop algorithms and tools for data analysis, integration, and mining. These tools can be applied by experimentalists to promote hypothesis-generation and discovery. We are now at the dawn of this revolution: infrastructure is being developed for data standardization, deposition, sharing, and analysis; some journals and funding agencies mandate data deposition; data journals publish high-content microscopy data sets; quantification becomes standard in scientific publications; new analytic tools are being developed and dispatched to the community; and huge data sets are being generated by individual labs and philanthropic initiatives. In this Perspective, I reflect on sharing and reusing cell image data and the opportunities that will come along with it.
Collapse
|