1
|
Anderson LN, Hoyt CT, Zucker JD, McNaughton AD, Teuton JR, Karis K, Arokium-Christian NN, Warley JT, Stromberg ZR, Gyori BM, Kumar N. Computational tools and data integration to accelerate vaccine development: challenges, opportunities, and future directions. Front Immunol 2025; 16:1502484. [PMID: 40124369 PMCID: PMC11925797 DOI: 10.3389/fimmu.2025.1502484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Accepted: 01/23/2025] [Indexed: 03/25/2025] Open
Abstract
The development of effective vaccines is crucial for combating current and emerging pathogens. Despite significant advances in the field of vaccine development there remain numerous challenges including the lack of standardized data reporting and curation practices, making it difficult to determine correlates of protection from experimental and clinical studies. Significant gaps in data and knowledge integration can hinder vaccine development which relies on a comprehensive understanding of the interplay between pathogens and the host immune system. In this review, we explore the current landscape of vaccine development, highlighting the computational challenges, limitations, and opportunities associated with integrating diverse data types for leveraging artificial intelligence (AI) and machine learning (ML) techniques in vaccine design. We discuss the role of natural language processing, semantic integration, and causal inference in extracting valuable insights from published literature and unstructured data sources, as well as the computational modeling of immune responses. Furthermore, we highlight specific challenges associated with uncertainty quantification in vaccine development and emphasize the importance of establishing standardized data formats and ontologies to facilitate the integration and analysis of heterogeneous data. Through data harmonization and integration, the development of safe and effective vaccines can be accelerated to improve public health outcomes. Looking to the future, we highlight the need for collaborative efforts among researchers, data scientists, and public health experts to realize the full potential of AI-assisted vaccine design and streamline the vaccine development process.
Collapse
Affiliation(s)
| | - Charles Tapley Hoyt
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, United States
| | - Jeremy D. Zucker
- Pacific Northwest National Laboratory (DOE), Richland, WA, United States
| | | | - Jeremy R. Teuton
- Pacific Northwest National Laboratory (DOE), Richland, WA, United States
| | - Klas Karis
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, United States
| | | | - Jackson T. Warley
- Pacific Northwest National Laboratory (DOE), Richland, WA, United States
| | | | - Benjamin M. Gyori
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, United States
- Department of Bioengineering, College of Engineering, Northeastern University, Boston, MA, United States
| | - Neeraj Kumar
- Pacific Northwest National Laboratory (DOE), Richland, WA, United States
| |
Collapse
|
2
|
Li T, Li M, Wu Y, Li Y. Visualization Methods for DNA Sequences: A Review and Prospects. Biomolecules 2024; 14:1447. [PMID: 39595624 PMCID: PMC11592258 DOI: 10.3390/biom14111447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2024] [Revised: 11/08/2024] [Accepted: 11/12/2024] [Indexed: 11/28/2024] Open
Abstract
The efficient analysis and interpretation of biological sequence data remain major challenges in bioinformatics. Graphical representation, as an emerging and effective visualization technique, offers a more intuitive method for analyzing DNA sequences. However, many visualization approaches are dispersed across research databases, requiring urgent organization, integration, and analysis. Additionally, no single visualization method excels in all aspects. To advance these methods, knowledge graphs and advanced machine learning techniques have become key areas of exploration. This paper reviews the current 2D and 3D DNA sequence visualization methods and proposes a new research direction focused on constructing knowledge graphs for biological sequence visualization, explaining the relevant theories, techniques, and models involved. Additionally, we summarize machine learning techniques applicable to sequence visualization, such as graph embedding methods and the use of convolutional neural networks (CNNs) for processing graphical representations. These machine learning techniques and knowledge graphs aim to provide valuable insights into computational biology, bioinformatics, genomic computing, and evolutionary analysis. The study serves as an important reference for improving intelligent search systems, enriching knowledge bases, and enhancing query systems related to biological sequence visualization, offering a comprehensive framework for future research.
Collapse
Affiliation(s)
- Tan Li
- School of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, China; (T.L.); (Y.L.)
| | - Mengshan Li
- School of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, China; (T.L.); (Y.L.)
| | - Yan Wu
- School of Mathematics and Computer Science, Gannan Normal University, Ganzhou 341000, China;
| | - Yelin Li
- School of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, China; (T.L.); (Y.L.)
| |
Collapse
|
3
|
Roth GA, Vora B, Kim C, Wu M, Kuruvilla D. Prevalence and utility of pharmacokinetic data in preclinical studies of mRNA cancer vaccines. Clin Transl Sci 2023; 16:1554-1558. [PMID: 37452560 PMCID: PMC10499403 DOI: 10.1111/cts.13586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 06/13/2023] [Accepted: 06/14/2023] [Indexed: 07/18/2023] Open
Abstract
In this brief report, we provide insights into current practices in preclinical messenger RNA (mRNA) cancer vaccine characterization. To enable a more automated and thorough survey of mRNA cancer vaccine data in the literature, we implemented natural language processing to mine abstracts from PubMed followed by annotation of the selected articles. Through this analysis we identified a gap in the literature wherein pharmacokinetic (PK) characterization is not reported in mRNA cancer vaccine-focused articles. As a result, the field is unable to evaluate and discuss cross-study PK and pharmacodynamic (PD) relationships nor the translation of these relationships from preclinical species to humans. As the field of mRNA cancer vaccines is rapidly evolving, there is value in expanding our understanding of preclinical PK/PD relationships and how they relate to PK/PD in humans.
Collapse
Affiliation(s)
- Gillie A. Roth
- Preclinical and Translational PKPDGenentech, Inc.South San FranciscoCaliforniaUSA
| | - Bianca Vora
- Clinical PharmacologyGenentech, Inc.South San FranciscoCaliforniaUSA
| | - Chloe Kim
- Computational SciencesGenentech, Inc.South San FranciscoCaliforniaUSA
| | - Michael Wu
- Computational SciencesGenentech, Inc.South San FranciscoCaliforniaUSA
| | - Denison Kuruvilla
- Clinical PharmacologyGenentech, Inc.South San FranciscoCaliforniaUSA
| |
Collapse
|
4
|
Huffman A, Ong E, Hur J, D’Mello A, Tettelin H, He Y. COVID-19 vaccine design using reverse and structural vaccinology, ontology-based literature mining and machine learning. Brief Bioinform 2022; 23:bbac190. [PMID: 35649389 PMCID: PMC9294427 DOI: 10.1093/bib/bbac190] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Revised: 04/13/2022] [Accepted: 04/26/2022] [Indexed: 12/11/2022] Open
Abstract
Rational vaccine design, especially vaccine antigen identification and optimization, is critical to successful and efficient vaccine development against various infectious diseases including coronavirus disease 2019 (COVID-19). In general, computational vaccine design includes three major stages: (i) identification and annotation of experimentally verified gold standard protective antigens through literature mining, (ii) rational vaccine design using reverse vaccinology (RV) and structural vaccinology (SV) and (iii) post-licensure vaccine success and adverse event surveillance and its usage for vaccine design. Protegen is a database of experimentally verified protective antigens, which can be used as gold standard data for rational vaccine design. RV predicts protective antigen targets primarily from genome sequence analysis. SV refines antigens through structural engineering. Recently, RV and SV approaches, with the support of various machine learning methods, have been applied to COVID-19 vaccine design. The analysis of post-licensure vaccine adverse event report data also provides valuable results in terms of vaccine safety and how vaccines should be used or paused. Ontology standardizes and incorporates heterogeneous data and knowledge in a human- and computer-interpretable manner, further supporting machine learning and vaccine design. Future directions on rational vaccine design are discussed.
Collapse
Affiliation(s)
- Anthony Huffman
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Edison Ong
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Junguk Hur
- Department of Biomedical Sciences, University of North Dakota School of Medicine and Health Sciences, Grand Forks, North Dakota 58202, USA
| | - Adonis D’Mello
- Department of Microbiology and Immunology, Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Hervé Tettelin
- Department of Microbiology and Immunology, Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Yongqun He
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
5
|
Daniel S, Kis Z, Kontoravdi C, Shah N. Quality by Design for enabling RNA platform production processes. Trends Biotechnol 2022; 40:1213-1228. [DOI: 10.1016/j.tibtech.2022.03.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 03/18/2022] [Accepted: 03/28/2022] [Indexed: 12/26/2022]
|
6
|
Selvaggio G, Leonardelli L, Lofano G, Fresnay S, Parolo S, Medini D, Siena E, Marchetti L. A quantitative systems pharmacology approach to support mRNA vaccine development and optimization. CPT Pharmacometrics Syst Pharmacol 2021; 10:1448-1451. [PMID: 34672423 PMCID: PMC8674002 DOI: 10.1002/psp4.12721] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 09/13/2021] [Accepted: 09/20/2021] [Indexed: 12/27/2022] Open
Affiliation(s)
- Gianluca Selvaggio
- Fondazione The Microsoft Research – University of Trento Centre for Computational and Systems Biology (COSBI)RoveretoItaly
| | - Lorena Leonardelli
- Fondazione The Microsoft Research – University of Trento Centre for Computational and Systems Biology (COSBI)RoveretoItaly
| | | | | | - Silivia Parolo
- Fondazione The Microsoft Research – University of Trento Centre for Computational and Systems Biology (COSBI)RoveretoItaly
| | | | | | - Luca Marchetti
- Fondazione The Microsoft Research – University of Trento Centre for Computational and Systems Biology (COSBI)RoveretoItaly
- Department of Cellular, Computational and Integrative Biology (CIBIO)University of TrentoTrentoItaly
| |
Collapse
|