1
|
Greener JG. Reversible molecular simulation for training classical and machine-learning force fields. Proc Natl Acad Sci U S A 2025; 122:e2426058122. [PMID: 40434635 DOI: 10.1073/pnas.2426058122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2024] [Accepted: 04/22/2025] [Indexed: 05/29/2025] Open
Abstract
The next generation of force fields for molecular dynamics will be developed using a wealth of data. Training systematically with experimental data remains a challenge, however, especially for machine-learning potentials. Differentiable molecular simulation calculates gradients of observables with respect to parameters through molecular dynamics trajectories. Here, we improve this approach by explicitly calculating gradients using a reverse-time simulation with effectively constant memory cost and a computation count similar to the forward simulation. The method is applied to learn all-atom water and gas diffusion models with different functional forms and to train a machine-learning potential for diamond from scratch. Comparison to ensemble reweighting indicates that reversible simulation can provide more accurate gradients and train to match time-dependent observables.
Collapse
Affiliation(s)
- Joe G Greener
- Medical Research Council Laboratory of Molecular Biology, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
2
|
Müller‐Bötticher N, Tiesmeyer S, Eils R, Ishaque N. Sainsc: A Computational Tool for Segmentation-Free Analysis of In Situ Capture Data. SMALL METHODS 2025; 9:e2401123. [PMID: 39533496 PMCID: PMC12103232 DOI: 10.1002/smtd.202401123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Revised: 09/23/2024] [Indexed: 11/16/2024]
Abstract
Spatially resolved transcriptomics (SRT) has become the method of choice for characterising the complexity of biomedical tissue samples. Until recently, scientists were restricted to SRT methods that can profile a limited set of target genes at high spatial resolution or transcriptome-wide but at a low spatial resolution. Through recent developments, there are now methods that offer both subcellular spatial resolution and full transcriptome coverage. However, utilising these new methods' high spatial resolution and gene resolution remains elusive due to several factors, including low detection efficiency and high computational costs. Here, we present Sainsc (Segmentation-free analysis of in situ capture data), which combines a cell-segmentation-free approach with efficient data processing of transcriptome-wide nanometre-resolution spatial data. Sainsc can generate cell-type maps with accurate cell-type assignment at the nanometre scale, together with corresponding maps of the assignment scores that facilitate interpretation of the local confidence of cell-type assignment. We demonstrate its utility and accuracy for different tissues and technologies. Compared to other methods, Sainsc requires lower computational resources and has scalable performance, enabling interactive data exploration. Sainsc is compatible with common data analysis frameworks and is available as open-source software in multiple programming languages.
Collapse
Affiliation(s)
- Niklas Müller‐Bötticher
- Center of Digital HealthBerlin Institute of Health at Charité – Universitätsmedizin BerlinCharitéplatz 110117BerlinGermany
- Department of Mathematics and Computer ScienceFreie Universität BerlinArnimallee 1414195BerlinGermany
| | - Sebastian Tiesmeyer
- Center of Digital HealthBerlin Institute of Health at Charité – Universitätsmedizin BerlinCharitéplatz 110117BerlinGermany
- Department of Mathematics and Computer ScienceFreie Universität BerlinArnimallee 1414195BerlinGermany
| | - Roland Eils
- Center of Digital HealthBerlin Institute of Health at Charité – Universitätsmedizin BerlinCharitéplatz 110117BerlinGermany
- Department of Mathematics and Computer ScienceFreie Universität BerlinArnimallee 1414195BerlinGermany
- Health Data Science UnitHeidelberg University Hospital and BioQuantUniversity of HeidelbergIm Neuenheimer Feld 26769120HeidelbergGermany
| | - Naveed Ishaque
- Center of Digital HealthBerlin Institute of Health at Charité – Universitätsmedizin BerlinCharitéplatz 110117BerlinGermany
| |
Collapse
|
3
|
Qiao L, Khalilimeybodi A, Linden-Santangeli NJ, Rangamani P. The Evolution of Systems Biology and Systems Medicine: From Mechanistic Models to Uncertainty Quantification. Annu Rev Biomed Eng 2025; 27:425-447. [PMID: 39971380 DOI: 10.1146/annurev-bioeng-102723-065309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Understanding interaction mechanisms within cells, tissues, and organisms is crucial for driving developments across biology and medicine. Mathematical modeling is an essential tool for simulating such biological systems. Building on experiments, mechanistic models are widely used to describe small-scale intracellular networks. The development of sequencing techniques and computational tools has recently enabled multiscale models. Combining such larger scale network modeling with mechanistic modeling provides us with an opportunity to reveal previously unknown disease mechanisms and pharmacological interventions. Here, we review systems biology models from mechanistic models to multiscale models that integrate multiple layers of cellular networks and discuss how they can be used to shed light on disease states and even wellness-related states. Additionally, we introduce several methods that increase the certainty and accuracy of model predictions. Thus, combining mechanistic models with emerging mathematical and computational techniques can provide us with increasingly powerful tools to understand disease states and inspire drug discoveries.
Collapse
Affiliation(s)
- Lingxia Qiao
- Department of Pharmacology, University of California San Diego, La Jolla, California, USA;
- Department of Mechanical and Aerospace Engineering, University of California San Diego, La Jolla, California, USA
| | - Ali Khalilimeybodi
- Department of Mechanical and Aerospace Engineering, University of California San Diego, La Jolla, California, USA
| | | | - Padmini Rangamani
- Department of Pharmacology, University of California San Diego, La Jolla, California, USA;
- Department of Mechanical and Aerospace Engineering, University of California San Diego, La Jolla, California, USA
| |
Collapse
|
4
|
Mdletshe S, Wang A. Enhancing medical imaging education: integrating computing technologies, digital image processing and artificial intelligence. J Med Radiat Sci 2025; 72:148-155. [PMID: 39508409 PMCID: PMC11909706 DOI: 10.1002/jmrs.837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Accepted: 10/18/2024] [Indexed: 11/15/2024] Open
Abstract
The rapid advancement of technology has brought significant changes to various fields, including medical imaging (MI). This discussion paper explores the integration of computing technologies (e.g. Python and MATLAB), digital image processing (e.g. image enhancement, segmentation and three-dimensional reconstruction) and artificial intelligence (AI) into the undergraduate MI curriculum. By examining current educational practices, gaps and limitations that hinder the development of future-ready MI professionals are identified. A comprehensive curriculum framework is proposed, incorporating essential computational skills, advanced image processing techniques and state-of-the-art AI tools, such as large language models like ChatGPT. The proposed curriculum framework aims to improve the quality of MI education significantly and better equip students for future professional practice and challenges while enhancing diagnostic accuracy, improving workflow efficiency and preparing students for the evolving demands of the MI field.
Collapse
Affiliation(s)
- Sibusiso Mdletshe
- Department of Anatomy and Medical Imaging, Faculty of Medical and Health SciencesThe University of AucklandAucklandNew Zealand
| | - Alan Wang
- Auckland Bioengineering InstituteThe University of AucklandAucklandNew Zealand
- Medical Imaging Research centre, Faculty of Medical and Health SciencesThe University of AucklandAucklandNew Zealand
- Centre for Co‐Created Ageing ResearchThe University of AucklandAucklandNew Zealand
- Centre for Brain ResearchThe University of AucklandAucklandNew Zealand
| |
Collapse
|
5
|
Blechschmidt J, Cabral JS. MetaRange.jl: A Dynamic and Metabolic Species Range Model for Plant Species. Ecol Evol 2025; 15:e70773. [PMID: 39803195 PMCID: PMC11724151 DOI: 10.1002/ece3.70773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 10/18/2024] [Accepted: 10/25/2024] [Indexed: 01/16/2025] Open
Abstract
Process-based models for range dynamics are urgently needed due to increasing intensity of human-induced biodiversity change. Despite a few existing models that focus on demographic processes, their use remains limited compared to the widespread application of correlative approaches. This slow adoption is largely due to the challenges in calibrating biological parameters and the high computational demands for large-scale applications. Moreover, increasing the number of simulated processes (i.e., mechanistic complexity) may further exacerbate those reasons of delay. Therefore, balancing mechanistic complexity and computational effectiveness of process-based models is a key area for improvement. A promising research direction is to expand demographically explicit metapopulation models by integrating metabolic constraints. We translated and expanded a previously developed R metapopulation model to Julia language and published it as a Julia module. The model integrates species-specific parameters such as preferred environmental conditions, biomass and dispersal ability with demographic rates (e.g., reproductive and mortality rates) derived from local temperature and biomass via the metabolic theory of ecology. We provide a simple application example for the model in which we illustrate a typical use case by predicting the future occurrence of Orchis militaris in Bavaria under different climate change scenarios. Our results show that climate change reduces habitat suitability overall, but some regions like the Franconian Forest and the Alps see increased suitability and abundance, confirming their role as refugia. Simulating metapopulation dynamics reveals that local population dynamics and dispersal are crucial for accurate predictions. For instance, increasing dispersal distance reduces overall abundance loss but also lessens population growth in refugia. This highlights the importance of measuring traits like dispersal ability to improve climate change forecasts.
Collapse
Affiliation(s)
- Jana Blechschmidt
- Center for Computational and Theoretical BiologyJulius‐Maximilians‐Universität WürzburgWürzburgGermany
| | | |
Collapse
|
6
|
Kreger J, Mooney JA, Shibata D, MacLean AL. Developmental hematopoietic stem cell variation explains clonal hematopoiesis later in life. Nat Commun 2024; 15:10268. [PMID: 39592593 PMCID: PMC11599844 DOI: 10.1038/s41467-024-54711-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Accepted: 11/17/2024] [Indexed: 11/28/2024] Open
Abstract
Clonal hematopoiesis becomes increasingly common with age, but its cause is enigmatic because driver mutations are often absent. Serial observations infer weak selection indicating variants are acquired much earlier in life with unexplained initial growth spurts. Here we use fluctuating CpG methylation as a lineage marker to track stem cell clonal dynamics of hematopoiesis. We show, via the shared prenatal circulation of monozygotic twins, that weak selection conferred by stem cell variation created before birth can reliably yield clonal hematopoiesis later in life. Theory indicates weak selection will lead to dominance given enough time and large enough population sizes. Human hematopoiesis satisfies both these conditions. Stochastic loss of weakly selected variants is naturally prevented by the expansion of stem cell lineages during development. The dominance of stem cell clones created before birth is supported by blood fluctuating CpG methylation patterns that exhibit low correlation between unrelated individuals but are highly correlated between many elderly monozygotic twins. Therefore, clonal hematopoiesis driven by weak selection in later life appears to reflect variation created before birth.
Collapse
Affiliation(s)
- Jesse Kreger
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Jazlyn A Mooney
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Darryl Shibata
- Department of Pathology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Adam L MacLean
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
7
|
Datseris G, Zelko JS. Physiological signal analysis and open science using the Julia language and associated software. FRONTIERS IN NETWORK PHYSIOLOGY 2024; 4:1478280. [PMID: 39569020 PMCID: PMC11577965 DOI: 10.3389/fnetp.2024.1478280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/09/2024] [Accepted: 10/08/2024] [Indexed: 11/22/2024]
Abstract
In this mini review, we propose the use of the Julia programming language and its software as a strong candidate for reproducible, efficient, and sustainable physiological signal analysis. First, we highlight available software and Julia communities that provide top-of-the-class algorithms for all aspects of physiological signal processing despite the language's relatively young age. Julia can significantly accelerate both research and software development due to its high-level interactive language and high-performance code generation. It is also particularly suited for open and reproducible science. Openness is supported and welcomed because the overwhelming majority of Julia software programs are open source and developed openly on public platforms, primarily through individual contributions. Such an environment increases the likelihood that an individual not (originally) associated with a software program would still be willing to contribute their code, further promoting code sharing and reuse. On the other hand, Julia's exceptionally strong package manager and surrounding ecosystem make it easy to create self-contained, reproducible projects that can be instantly installed and run, irrespective of processor architecture or operating system.
Collapse
Affiliation(s)
- George Datseris
- Department of Mathematics and Statistics, University of Exeter, Exeter, United Kingdom
| | - Jacob S Zelko
- Department of Mathematics, Northeastern University, Boston, MA, United States
- OHDSI Center, Roux Institute, Northeastern University, Portland, ME, United States
| |
Collapse
|
8
|
Xu J, Wiley HS, Sauro HM. Generating synthetic signaling networks for in silico modeling studies. J Theor Biol 2024; 593:111901. [PMID: 39004118 PMCID: PMC11309880 DOI: 10.1016/j.jtbi.2024.111901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 06/07/2024] [Accepted: 07/08/2024] [Indexed: 07/16/2024]
Abstract
Predictive models of signaling pathways have proven to be difficult to develop. Traditional approaches to developing mechanistic models rely on collecting experimental data and fitting a single model to that data. This approach works for simple systems but has proven unreliable for complex systems such as biological signaling networks. Thus, there is a need to develop new approaches to create predictive mechanistic models of complex systems. To meet this need, we developed a method for generating artificial signaling networks that were reasonably realistic and thus could be treated as ground truth models. These synthetic models could then be used to generate synthetic data for developing and testing algorithms designed to recover the underlying network topology and associated parameters. We defined the reaction degree and reaction distance to measure the topology of reaction networks, especially to consider enzymes. To determine whether our generated signaling networks displayed meaningful behavior, we compared them with signaling networks from the BioModels Database. This comparison indicated that our generated signaling networks had high topological similarities with BioModels signaling networks with respect to the reaction degree and distance distributions. In addition, our synthetic signaling networks had similar behavioral dynamics with respect to both steady states and oscillations, suggesting that our method generated synthetic signaling networks comparable with BioModels and thus could be useful for building network evaluation tools.
Collapse
Affiliation(s)
- Jin Xu
- Department of Bioengineering, University of Washington, Seattle 98195, WA, USA.
| | - H Steven Wiley
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland 99352, WA, USA
| | - Herbert M Sauro
- Department of Bioengineering, University of Washington, Seattle 98195, WA, USA
| |
Collapse
|
9
|
Stumpf MPH. Hypergraph animals. Phys Rev E 2024; 110:044125. [PMID: 39562946 DOI: 10.1103/physreve.110.044125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Accepted: 09/20/2024] [Indexed: 11/21/2024]
Abstract
Here we introduce simple structures for the analysis of complex hypergraphs, hypergraph animals. These structures are designed to describe the local node neighborhoods of nodes in hypergraphs. We establish their relationships to lattice animals and network motifs, and we develop their combinatorial properties for sparse and uncorrelated hypergraphs. We make use of the tight link of hypergraph animals to partition numbers, which opens up a vast mathematical framework for the analysis of hypergraph animals. We then study their abundances in random hypergraphs. Two transferable insights result from this analysis: (i) it establishes the importance of high-cardinality edges in ensembles of random hypergraphs that are inspired by the classical Erdös-Renyí random graphs; and (ii) there is a close connection between degree and hyperedge cardinality in random hypergraphs that shapes animal abundances and spectra profoundly. Both findings imply that hypergraph animals can have the potential to affect information flow and processing in complex systems. Our analysis also suggests that we need to spend more effort on investigating and developing suitable conditional ensembles of random hypergraphs that can capture real-world structures and their complex dependency structures.
Collapse
|
10
|
Pal S, Bhattacharya M, Dash S, Lee SS, Chakraborty C. A next-generation dynamic programming language Julia: Its features and applications in biological science. J Adv Res 2024; 64:143-154. [PMID: 37992995 PMCID: PMC11464422 DOI: 10.1016/j.jare.2023.11.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 11/07/2023] [Accepted: 11/14/2023] [Indexed: 11/24/2023] Open
Abstract
BACKGROUND The advent of Julia as a sophisticated and dynamic programming language in 2012 represented a significant milestone in computational programming, mathematical analysis, and statistical modeling. Having reached its stable release in version 1.9.0 on May 7, 2023, Julia has developed into a powerful and versatile instrument. Despite its potential and widespread adoption across various scientific and technical domains, there exists a noticeable knowledge gap in comprehending its utilization within biological sciences. THE AIM OF REVIEW This comprehensive review aims to address this particular knowledge gap and offer a thorough examination of Julia's fundamental characteristics and its applications in biology. KEY SCIENTIFIC CONCEPTS OF THE REVIEW The review focuses on a research gap in the biological science. The review aims to equip researchers with knowledge and tools to utilize Julia's capabilities in biological science effectively and to demonstrate the gap. It paves the way for innovative solutions and discoveries in this rapidly evolving field. It encompasses an analysis of Julia's characteristics, packages, and performance compared to the other programming languages in this field. The initial part of this review discusses the key features of Julia, such as its dynamic and interactive nature, fast processing speed, ease of expression manipulation, user-friendly syntax, code readability, strong support for multiple dispatch, and advanced type system. It also explores Julia's capabilities in data analysis, visualization, machine learning, and algorithms, making it suitable for scientific applications. The next section emphasizes the importance of using Julia in biological research, highlighting its seamless integration with biological studies for data analysis, and computational biology. It also compares Julia with other programming languages commonly used in biological research through benchmarking and performance analysis. Additionally, it provides insights into future directions and potential challenges in Julia's applications in biology.
Collapse
Affiliation(s)
- Soumen Pal
- School of Mechanical Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India
| | - Manojit Bhattacharya
- Department of Zoology, Fakir Mohan University, Vyasa Vihar, Balasore 756020, Odisha, India
| | - Snehasish Dash
- School of Mechanical Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India
| | - Sang-Soo Lee
- Institute for Skeletal Aging & Orthopedic Surgery, Hallym University-Chuncheon Sacred Heart Hospital, Chuncheon, Gangwon-Do 24252, Republic of Korea.
| | - Chiranjib Chakraborty
- Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata, West Bengal 700126, India.
| |
Collapse
|
11
|
Yan J, Zeng Q, Wang X. RankCompV3: a differential expression analysis algorithm based on relative expression orderings and applications in single-cell RNA transcriptomics. BMC Bioinformatics 2024; 25:259. [PMID: 39112940 PMCID: PMC11304794 DOI: 10.1186/s12859-024-05889-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 07/30/2024] [Indexed: 08/11/2024] Open
Abstract
BACKGROUND Effective identification of differentially expressed genes (DEGs) has been challenging for single-cell RNA sequencing (scRNA-seq) profiles. Many existing algorithms have high false positive rates (FPRs) and often fail to identify weak biological signals. RESULTS We present a novel method for identifying DEGs in scRNA-seq data called RankCompV3. It is based on the comparison of relative expression orderings (REOs) of gene pairs which are determined by comparing the expression levels of a pair of genes in a set of single-cell profiles. The numbers of genes with consistently higher or lower expression levels than the gene of interest are counted in two groups in comparison, respectively, and the result is tabulated in a 3 × 3 contingency table which is tested by McCullagh's method to determine if the gene is dysregulated. In both simulated and real scRNA-seq data, RankCompV3 tightly controlled the FPR and demonstrated high accuracy, outperforming 11 other common single-cell DEG detection algorithms. Analysis with either regular single-cell or synthetic pseudo-bulk profiles produced highly concordant DEGs with the ground-truth. In addition, RankCompV3 demonstrates higher sensitivity to weak biological signals than other methods. The algorithm was implemented using Julia and can be called in R. The source code is available at https://github.com/pathint/RankCompV3.jl . CONCLUSIONS The REOs-based algorithm is a valuable tool for analyzing single-cell RNA profiles and identifying DEGs with high accuracy and sensitivity.
Collapse
Affiliation(s)
- Jing Yan
- Department of Bioinformatics, Fujian Key Laboratory of Medical Bioinformatics, School of Medical Technology and Engineering, Fujian Medical University, Fuzhou, 350122, China
| | - Qiuhong Zeng
- Department of Bioinformatics, Fujian Key Laboratory of Medical Bioinformatics, School of Medical Technology and Engineering, Fujian Medical University, Fuzhou, 350122, China
| | - Xianlong Wang
- Department of Bioinformatics, Fujian Key Laboratory of Medical Bioinformatics, School of Medical Technology and Engineering, Fujian Medical University, Fuzhou, 350122, China.
- The Second Affiliated Hospital, Fujian Medical University, Quanzhou, 362000, China.
| |
Collapse
|
12
|
Coppard V, Szep G, Georgieva Z, Howlett SK, Jarvis LB, Rainbow DB, Suchanek O, Needham EJ, Mousa HS, Menon DK, Feyertag F, Mahbubani KT, Saeb-Parsy K, Jones JL. FlowAtlas: an interactive tool for high-dimensional immunophenotyping analysis bridging FlowJo with computational tools in Julia. Front Immunol 2024; 15:1425488. [PMID: 39086484 PMCID: PMC11288863 DOI: 10.3389/fimmu.2024.1425488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 07/02/2024] [Indexed: 08/02/2024] Open
Abstract
As the dimensionality, throughput and complexity of cytometry data increases, so does the demand for user-friendly, interactive analysis tools that leverage high-performance machine learning frameworks. Here we introduce FlowAtlas: an interactive web application that enables dimensionality reduction of cytometry data without down-sampling and that is compatible with datasets stained with non-identical panels. FlowAtlas bridges the user-friendly environment of FlowJo and computational tools in Julia developed by the scientific machine learning community, eliminating the need for coding and bioinformatics expertise. New population discovery and detection of rare populations in FlowAtlas is intuitive and rapid. We demonstrate the capabilities of FlowAtlas using a human multi-tissue, multi-donor immune cell dataset, highlighting key immunological findings. FlowAtlas is available at https://github.com/gszep/FlowAtlas.jl.git.
Collapse
Affiliation(s)
- Valerie Coppard
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
| | - Grisha Szep
- Randall Centre for Cell & Molecular Biophysics, King’s College London, London, United Kingdom
| | - Zoya Georgieva
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
| | - Sarah K. Howlett
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
| | - Lorna B. Jarvis
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
| | - Daniel B. Rainbow
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
| | - Ondrej Suchanek
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Edward J. Needham
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
| | - Hani S. Mousa
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
| | - David K. Menon
- Department of Anaesthesia, University of Cambridge, Cambridge, United Kingdom
| | | | - Krishnaa T. Mahbubani
- Department of Surgery, University of Cambridge, Cambridge, United Kingdom
- Collaborative Biorepository for Translational Medicine (CBTM), Cambridge NIHR Biomedical Research Centre, Cambridge, United Kingdom
| | - Kourosh Saeb-Parsy
- Department of Surgery, University of Cambridge, Cambridge, United Kingdom
- Collaborative Biorepository for Translational Medicine (CBTM), Cambridge NIHR Biomedical Research Centre, Cambridge, United Kingdom
| | - Joanne L. Jones
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
13
|
Greener JG. Differentiable simulation to develop molecular dynamics force fields for disordered proteins. Chem Sci 2024; 15:4897-4909. [PMID: 38550690 PMCID: PMC10966991 DOI: 10.1039/d3sc05230c] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 02/08/2024] [Indexed: 11/11/2024] Open
Abstract
Implicit solvent force fields are computationally efficient but can be unsuitable for running molecular dynamics on disordered proteins. Here I improve the a99SB-disp force field and the GBNeck2 implicit solvent model to better describe disordered proteins. Differentiable molecular simulations with 5 ns trajectories are used to jointly optimise 108 parameters to better match explicit solvent trajectories. Simulations with the improved force field better reproduce the radius of gyration and secondary structure content seen in experiments, whilst showing slightly degraded performance on folded proteins and protein complexes. The force field, called GB99dms, reproduces the results of a small molecule binding study and improves agreement with experiment for the aggregation of amyloid peptides. GB99dms, which can be used in OpenMM, is available at https://github.com/greener-group/GB99dms. This work is the first to show that gradients can be obtained directly from nanosecond-length differentiable simulations of biomolecules and highlights the effectiveness of this approach to training whole force fields to match desired properties.
Collapse
Affiliation(s)
- Joe G Greener
- Medical Research Council Laboratory of Molecular Biology Cambridge CB2 0QH UK
| |
Collapse
|
14
|
Kreger J, Mooney JA, Shibata D, MacLean AL. Developmental hematopoietic stem cell variation explains clonal hematopoiesis later in life. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.02.583106. [PMID: 38496542 PMCID: PMC10942294 DOI: 10.1101/2024.03.02.583106] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Clonal hematopoiesis becomes increasingly common with age, but its cause is enigmatic because driver mutations are often absent. Serial observations infer weak selection indicating variants are acquired much earlier in life with unexplained initial growth spurts. Here we use fluctuating CpG methylation as a lineage marker to track stem cell clonal dynamics of hematopoiesis. We show, via the shared prenatal circulation of monozygotic twins, that weak selection conferred by stem cell variation created before birth can reliably yield clonal hematopoiesis later in life. Theory indicates weak selection will lead to dominance given enough time and large enough population sizes. Human hematopoiesis satisfies both these conditions. Stochastic loss of weakly selected variants is naturally prevented by the expansion of stem cell lineages during development. The dominance of stem cell clones created before birth is supported by blood fluctuating CpG methylation patterns that exhibit low correlation between unrelated individuals but are highly correlated between many elderly monozygotic twins. Therefore, clonal hematopoiesis driven by weak selection in later life appears to reflect variation created before birth.
Collapse
Affiliation(s)
- Jesse Kreger
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Jazlyn A. Mooney
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Darryl Shibata
- Department of Pathology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Adam L. MacLean
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
15
|
Lang PF, Jain A, Rackauckas C. SBMLToolkit.jl: a Julia package for importing SBML into the SciML ecosystem. J Integr Bioinform 2024; 21:jib-2024-0003. [PMID: 38801698 PMCID: PMC11294517 DOI: 10.1515/jib-2024-0003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 03/21/2024] [Indexed: 05/29/2024] Open
Abstract
Julia is a general purpose programming language that was designed for simplifying and accelerating numerical analysis and computational science. In particular the Scientific Machine Learning (SciML) ecosystem of Julia packages includes frameworks for high-performance symbolic-numeric computations. It allows users to automatically enhance high-level descriptions of their models with symbolic preprocessing and automatic sparsification and parallelization of computations. This enables performant solution of differential equations, efficient parameter estimation and methodologies for automated model discovery with neural differential equations and sparse identification of nonlinear dynamics. To give the systems biology community easy access to SciML, we developed SBMLToolkit.jl. SBMLToolkit.jl imports dynamic SBML models into the SciML ecosystem to accelerate model simulation and fitting of kinetic parameters. By providing computational systems biologists with easy access to the open-source Julia ecosystevnm, we hope to catalyze the development of further Julia tools in this domain and the growth of the Julia bioscience community. SBMLToolkit.jl is freely available under the MIT license. The source code is available at https://github.com/SciML/SBMLToolkit.jl.
Collapse
Affiliation(s)
| | | | - Christopher Rackauckas
- JuliaHub, Boston, USA
- Computer Science and Artificial Intelligence Laboratory (CSAIL), Massachusetts Institute of Technology, Boston, USA
| |
Collapse
|
16
|
Kozłowska E, Swierniak A. Mathematical Model of Intrinsic Drug Resistance in Lung Cancer. Int J Mol Sci 2023; 24:15801. [PMID: 37958784 PMCID: PMC10650033 DOI: 10.3390/ijms242115801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 09/25/2023] [Accepted: 10/28/2023] [Indexed: 11/15/2023] Open
Abstract
Drug resistance is a bottleneck in cancer treatment. Commonly, a molecular treatment for cancer leads to the emergence of drug resistance in the long term. Thus, some drugs, despite their initial excellent response, are withdrawn from the market. Lung cancer is one of the most mutated cancers, leading to dozens of targeted therapeutics available against it. Here, we have developed a mechanistic mathematical model describing sensitization to nine groups of targeted therapeutics and the emergence of intrinsic drug resistance. As we focus only on intrinsic drug resistance, we perform the computer simulations of the model only until clinical diagnosis. We have utilized, for model calibration, the whole-exome sequencing data combined with clinical information from over 1000 non-small-cell lung cancer patients. Next, the model has been applied to find an answer to the following questions: When does intrinsic drug resistance emerge? And how long does it take for early-stage lung cancer to grow to an advanced stage? The results show that drug resistance is inevitable at diagnosis but not always detectable and that the time interval between early and advanced-stage tumors depends on the selection advantage of cancer cells.
Collapse
Affiliation(s)
| | - Andrzej Swierniak
- Department of Systems Biology and Engineering, Silesian University of Technology, 44100 Gliwie, Poland;
| |
Collapse
|
17
|
Forrest J, Rajagopal V, Stumpf MPH, Pan M. BondGraphs.jl: composable energy-based modelling in systems biology. Bioinformatics 2023; 39:btad578. [PMID: 37725363 PMCID: PMC10551222 DOI: 10.1093/bioinformatics/btad578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 09/04/2023] [Accepted: 09/15/2023] [Indexed: 09/21/2023] Open
Abstract
SUMMARY BondGraphs.jl is a Julia implementation of bond graphs. Bond graphs provide a modelling framework that describes energy flow through a physical system and by construction enforce thermodynamic constraints. The framework is widely used in engineering and has recently been shown to be a powerful approach for modelling biology. Models are mutable, hierarchical, multiscale, and multiphysics, and BondGraphs.jl is compatible with the Julia modelling ecosystem. AVAILABILITY AND IMPLEMENTATION BondGraphs.jl is freely available under the MIT license. Source code and documentation can be found at https://github.com/jedforrest/BondGraphs.jl.
Collapse
Affiliation(s)
- Joshua Forrest
- School of BioSciences, University of Melbourne, Parkville, Australia
- School of Mathematics and Statistics, University of Melbourne, Parkville, Australia
- Department of Biomedical Engineering, University of Melbourne, Parkville, Australia
| | - Vijay Rajagopal
- Department of Biomedical Engineering, University of Melbourne, Parkville, Australia
- Baker Department of Cardiometabolic Health, University of Melbourne, Parkville, Australia
- The Graeme Clark Institute, University of Melbourne, Parkville, Australia
| | - Michael P H Stumpf
- School of BioSciences, University of Melbourne, Parkville, Australia
- School of Mathematics and Statistics, University of Melbourne, Parkville, Australia
- Melbourne Integrative Genomics, University of Melbourne, Parkville, Australia
- ARC Centre of Excellence for the Mathematical Analysis of Cellular Systems, University of Melbourne, Parkville, Australia
| | - Michael Pan
- School of Mathematics and Statistics, University of Melbourne, Parkville, Australia
| |
Collapse
|