1
|
Kruta J, Carapito R, Trendelenburg M, Martin T, Rizzi M, Voll RE, Cavalli A, Natali E, Meier P, Stawiski M, Mosbacher J, Mollet A, Santoro A, Capri M, Giampieri E, Schkommodau E, Miho E. Machine learning for precision diagnostics of autoimmunity. Sci Rep 2024; 14:27848. [PMID: 39537649 PMCID: PMC11561187 DOI: 10.1038/s41598-024-76093-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 10/10/2024] [Indexed: 11/16/2024] Open
Abstract
Early and accurate diagnosis is crucial to prevent disease development and define therapeutic strategies. Due to predominantly unspecific symptoms, diagnosis of autoimmune diseases (AID) is notoriously challenging. Clinical decision support systems (CDSS) are a promising method with the potential to enhance and expedite precise diagnostics by physicians. However, due to the difficulties of integrating and encoding multi-omics data with clinical values, as well as a lack of standardization, such systems are often limited to certain data types. Accordingly, even sophisticated data models fall short when making accurate disease diagnoses and presenting data analyses in a user-friendly form. Therefore, the integration of various data types is not only an opportunity but also a competitive advantage for research and industry. We have developed an integration pipeline to enable the use of machine learning for patient classification based on multi-omics data in combination with clinical values and laboratory results. The application of our framework resulted in up to 96% prediction accuracy of autoimmune diseases with machine learning models. Our results deliver insights into autoimmune disease research and have the potential to be adapted for applications across disease conditions.
Collapse
Affiliation(s)
- Jan Kruta
- School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Hofackerstrasse 30, Muttenz, 4132, Switzerland
| | - Raphael Carapito
- Laboratoire d'ImmunoRhumatologie Moléculaire, plateforme GENOMAX, Faculté de Médecine, Fédération de Médecine Translationnelle de Strasbourg (FMTS), Institut Thématique Interdisciplinaire TRANSPLANTEX NG, INSERM UMR_S 1109, Fédération Hospitalo-Universitaire OMICARE, Université de Strasbourg, 4 rue Kirschleger, Strasbourg, 67085, France
- Service d'Immunologie Biologique, Pôle de Biologie, Plateau Technique de Biologie, Nouvel Hôpital Civil, 1 place de l'Hôpital, Strasbourg, 67091, France
| | - Marten Trendelenburg
- Division of Internal Medicine, University Hospital Basel, Basel, 4031, Switzerland
| | - Thierry Martin
- Laboratoire d'ImmunoRhumatologie Moléculaire, plateforme GENOMAX, Faculté de Médecine, Fédération de Médecine Translationnelle de Strasbourg (FMTS), Institut Thématique Interdisciplinaire TRANSPLANTEX NG, INSERM UMR_S 1109, Fédération Hospitalo-Universitaire OMICARE, Université de Strasbourg, 4 rue Kirschleger, Strasbourg, 67085, France
| | - Marta Rizzi
- Department of Rheumatology and Clinical Immunology, Medical Center, University of Freiburg, 79106, Freiburg, Germany
| | - Reinhard E Voll
- Department of Rheumatology and Clinical Immunology, Medical Center, University of Freiburg, 79106, Freiburg, Germany
| | - Andrea Cavalli
- FaBiT Department of Pharmacy and Biotechnology, Università di Bologna, Bologna, 40126, Italy
| | - Eriberto Natali
- School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Hofackerstrasse 30, Muttenz, 4132, Switzerland
| | - Patrick Meier
- School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Hofackerstrasse 30, Muttenz, 4132, Switzerland
| | - Marc Stawiski
- School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Hofackerstrasse 30, Muttenz, 4132, Switzerland
| | - Johannes Mosbacher
- School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Hofackerstrasse 30, Muttenz, 4132, Switzerland
| | - Annette Mollet
- Institute of Pharmaceutical Medicine, University of Basel, Basel, 4056, Switzerland
| | - Aurelia Santoro
- Department of Experimental, Diagnostic and Specialty Medicine (DIMES), University of Bologna, Bologna, 40126, Italy
| | - Miriam Capri
- Department of Experimental, Diagnostic and Specialty Medicine (DIMES), University of Bologna, Bologna, 40126, Italy
| | - Enrico Giampieri
- Department of Experimental, Diagnostic and Specialty Medicine (DIMES), University of Bologna, Bologna, 40126, Italy
| | - Erik Schkommodau
- School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Hofackerstrasse 30, Muttenz, 4132, Switzerland
| | - Enkelejda Miho
- School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Hofackerstrasse 30, Muttenz, 4132, Switzerland.
- SIB Swiss Institute of Bioinformatics, Lausanne, 1015, Switzerland.
- aiNET GmbH, Lichtstrasse 35, Basel, 4056, Switzerland.
| |
Collapse
|
2
|
Dascalu S, Sealy JE, Sadeyen JR, Flammer PG, Fiddaman S, Preston SG, Dixon RJ, Bonsall MB, Smith AL, Iqbal M. Immunisation of chickens with inactivated and/or infectious H9N2 avian influenza virus leads to differential immune B-cell repertoire development. Front Immunol 2024; 15:1461678. [PMID: 39534604 PMCID: PMC11555566 DOI: 10.3389/fimmu.2024.1461678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Accepted: 09/30/2024] [Indexed: 11/16/2024] Open
Abstract
Avian influenza viruses (AIVs) are a major economic burden to the poultry industry and pose serious zoonotic risks, with human infections being reported every year. To date, the vaccination of birds remains the most important method for the prevention and control of AIV outbreaks. Most national vaccination strategies against AIV infection use whole virus-inactivated vaccines, which predominantly trigger a systemic antibody-mediated immune response. There are currently no studies that have examined the antibody repertoire of birds that were infected with and/or vaccinated against AIV. To this end, we evaluate the changes in the H9N2-specific IgM and IgY repertoires in chickens subjected to vaccination(s) and/or infectious challenge. We show that a large proportion of the IgM and IgY clones were shared across multiple individuals, and these public clonal responses are dependent on both the immunisation status of the birds and the specific tissue that was examined. Furthermore, the analysis revealed specific clonal expansions that are restricted to particular H9N2 immunisation regimes. These results indicate that both the nature and number of immunisations are important drivers of the antibody responses and repertoire profiles in chickens following H9N2 antigenic stimulation. We discuss how the repertoire biology of avian B-cell responses may affect the success of AIV vaccination in chickens, in particular the implications of public versus private clonal selection.
Collapse
Affiliation(s)
- Stefan Dascalu
- Department of Biology, University of Oxford, Oxford, United Kingdom
- Avian Influenza and Newcastle Disease Research Group, The Pirbright Institute, Pirbright, United Kingdom
| | - Joshua E. Sealy
- Avian Influenza and Newcastle Disease Research Group, The Pirbright Institute, Pirbright, United Kingdom
| | - Jean-Remy Sadeyen
- Avian Influenza and Newcastle Disease Research Group, The Pirbright Institute, Pirbright, United Kingdom
| | | | - Steven Fiddaman
- Department of Biology, University of Oxford, Oxford, United Kingdom
| | - Stephen G. Preston
- Department of Biology, University of Oxford, Oxford, United Kingdom
- UCL School of Pharmacy, University College London, London, United Kingdom
| | - Robert J. Dixon
- Department of Biology, University of Oxford, Oxford, United Kingdom
| | | | - Adrian L. Smith
- Department of Biology, University of Oxford, Oxford, United Kingdom
| | - Munir Iqbal
- Avian Influenza and Newcastle Disease Research Group, The Pirbright Institute, Pirbright, United Kingdom
| |
Collapse
|
3
|
Zia A, Orozco A, Fang ISY, Tang AM, Mendoza Viruega AS, Dong S, Leung LYT, Devraj VM, Oludada OE, Ehrhardt GRA. High throughput long-read sequencing of circulating lymphocytes of the evolutionarily distant sea lamprey reveals diversity and common elements of the variable lymphocyte receptor B (VLRB) repertoire. Front Immunol 2024; 15:1427075. [PMID: 39170622 PMCID: PMC11335541 DOI: 10.3389/fimmu.2024.1427075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Accepted: 07/22/2024] [Indexed: 08/23/2024] Open
Abstract
The leucine-rich repeat-based variable lymphocyte receptor B (VLRB) antibody system of jawless vertebrates is capable of generating an antibody repertoire equal to or exceeding the diversity of antibody repertoires of jawed vertebrates. Unlike immunoglobulin-based immune repertoires, the VLRB repertoire diversity is characterized by variable lengths of VLRB encoding transcripts, rendering conventional immunoreceptor repertoire sequencing approaches unsuitable for VLRB repertoire sequencing. Here we demonstrate that long-read single-molecule real-time (SMRT) sequencing (PacBio) approaches permit the efficient large-scale assessment of the VLRB repertoire. We present a computational pipeline for sequence data processing and provide the first repertoire-based analysis of VLRB protein characteristics including properties of its subunits and regions of diversity within each structural leucine-rich repeat subunit. Our study provides a template to explore changes in the VLRB repertoire during immune responses and to establish large scale VLRB repertoire databases for computational approaches aimed at isolating monoclonal VLRB reagents for biomedical research and clinical applications.
Collapse
Affiliation(s)
| | - Ariel Orozco
- Department of Immunology, University of Toronto, Toronto, ON, Canada
| | - Irene S. Y. Fang
- Department of Immunology, University of Toronto, Toronto, ON, Canada
| | - Aspen M. Tang
- Department of Immunology, University of Toronto, Toronto, ON, Canada
| | | | - Shilan Dong
- Department of Immunology, University of Toronto, Toronto, ON, Canada
| | | | - Vijaya M. Devraj
- Department of Immunology, University of Toronto, Toronto, ON, Canada
| | | | | |
Collapse
|
4
|
Natsrita P, Charoenkwan P, Shoombuatong W, Mahalapbutr P, Faksri K, Chareonsudjai S, Rungrotmongkol T, Pipattanaboon C. Machine-learning-assisted high-throughput identification of potent and stable neutralizing antibodies against all four dengue virus serotypes. Sci Rep 2024; 14:17165. [PMID: 39060292 PMCID: PMC11282219 DOI: 10.1038/s41598-024-67487-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 07/11/2024] [Indexed: 07/28/2024] Open
Abstract
Several computational methods have been developed to identify neutralizing antibodies (NAbs) covering four dengue virus serotypes (DENV-1 to DENV-4); however, limitations of the dataset and the resulting performance remain. Here, we developed a new computational framework to predict potent and stable NAbs against DENV-1 to DENV-4 using only antibody (CDR-H3) and epitope sequences as input. Specifically, our proposed computational framework employed sequence-based ML and molecular dynamic simulation (MD) methods to achieve more accurate identification. First, we built a novel dataset (n = 1108) by compiling the interactions of CDR-H3 and epitope sequences with the half maximum inhibitory concentration (IC50) values, which represent neutralizing activities. Second, we achieved an accurately predictive ML model that showed high AUC values of 0.879 and 0.885 by tenfold cross-validation and independent tests, respectively. Finally, our computational framework could be applied to filter approximately 2.5 million unseen antibodies into two final candidates that showed strong and stable binding to all four serotypes. In addition, the most potent and stable candidate (1B3B9_V21) was evaluated for its development potential as a therapeutic agent by molecular docking and MD simulations. This study provides an antibody computational approach to facilitate the high-throughput identification of NAbs and accelerate the development of therapeutic antibodies.
Collapse
Affiliation(s)
- Piyatida Natsrita
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, 40002, Thailand
| | - Phasit Charoenkwan
- Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai, 50200, Thailand
| | - Watshara Shoombuatong
- Center for Research Innovation and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, 10700, Thailand
| | - Panupong Mahalapbutr
- Department of Biochemistry, Faculty of Medicine, Khon Kaen University, Khon Kaen, 40002, Thailand
| | - Kiatichai Faksri
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, 40002, Thailand
- Research and Diagnostic Center for Emerging Infectious Diseases, Khon Kaen University, Khon Kaen, 40002, Thailand
| | - Sorujsiri Chareonsudjai
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, 40002, Thailand
| | - Thanyada Rungrotmongkol
- Center of Excellent in Biocatalyst and Sustainable Biotechnology, Department of Biochemistry, Faculty of Science, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Chonlatip Pipattanaboon
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, 40002, Thailand.
| |
Collapse
|
5
|
Servius L, Pigoli D, Ng J, Fraternali F. Predicting class switch recombination in B-cells from antibody repertoire data. Biom J 2024; 66:e2300171. [PMID: 38785212 DOI: 10.1002/bimj.202300171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 03/01/2024] [Accepted: 03/07/2024] [Indexed: 05/25/2024]
Abstract
Statistical and machine learning methods have proved useful in many areas of immunology. In this paper, we address for the first time the problem of predicting the occurrence of class switch recombination (CSR) in B-cells, a problem of interest in understanding antibody response under immunological challenges. We propose a framework to analyze antibody repertoire data, based on clonal (CG) group representation in a way that allows us to predict CSR events using CG level features as input. We assess and compare the performance of several predicting models (logistic regression, LASSO logistic regression, random forest, and support vector machine) in carrying out this task. The proposed approach can obtain an unweighted average recall of71 % $71\%$ with models based on variable region descriptors and measures of CG diversity during an immune challenge and, most notably, before an immune challenge.
Collapse
Affiliation(s)
- Lutecia Servius
- Department of Mathematics, King's College London, London, UK
| | - Davide Pigoli
- Department of Mathematics, King's College London, London, UK
| | - Joseph Ng
- Institute of Structural and Molecular Biology, University College London, London, UK
| | - Franca Fraternali
- Institute of Structural and Molecular Biology, University College London, London, UK
| |
Collapse
|
6
|
Natali EN, Horst A, Meier P, Greiff V, Nuvolone M, Babrak LM, Fink K, Miho E. The dengue-specific immune response and antibody identification with machine learning. NPJ Vaccines 2024; 9:16. [PMID: 38245547 PMCID: PMC10799860 DOI: 10.1038/s41541-023-00788-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 12/07/2023] [Indexed: 01/22/2024] Open
Abstract
Dengue virus poses a serious threat to global health and there is no specific therapeutic for it. Broadly neutralizing antibodies recognizing all serotypes may be an effective treatment. High-throughput adaptive immune receptor repertoire sequencing (AIRR-seq) and bioinformatic analysis enable in-depth understanding of the B-cell immune response. Here, we investigate the dengue antibody response with these technologies and apply machine learning to identify rare and underrepresented broadly neutralizing antibody sequences. Dengue immunization elicited the following signatures on the antibody repertoire: (i) an increase of CDR3 and germline gene diversity; (ii) a change in the antibody repertoire architecture by eliciting power-law network distributions and CDR3 enrichment in polar amino acids; (iii) an increase in the expression of JNK/Fos transcription factors and ribosomal proteins. Furthermore, we demonstrate the applicability of computational methods and machine learning to AIRR-seq datasets for neutralizing antibody candidate sequence identification. Antibody expression and functional assays have validated the obtained results.
Collapse
Affiliation(s)
- Eriberto Noel Natali
- FHNW University of Applied Sciences and Arts Northwestern Switzerland, School of Life Sciences, Muttenz, Switzerland
| | - Alexander Horst
- FHNW University of Applied Sciences and Arts Northwestern Switzerland, School of Life Sciences, Muttenz, Switzerland
| | - Patrick Meier
- FHNW University of Applied Sciences and Arts Northwestern Switzerland, School of Life Sciences, Muttenz, Switzerland
| | - Victor Greiff
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Oslo, Norway
| | - Mario Nuvolone
- Department of Molecular Medicine, University of Pavia, Pavia, Italy
| | - Lmar Marie Babrak
- FHNW University of Applied Sciences and Arts Northwestern Switzerland, School of Life Sciences, Muttenz, Switzerland
| | | | - Enkelejda Miho
- FHNW University of Applied Sciences and Arts Northwestern Switzerland, School of Life Sciences, Muttenz, Switzerland.
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.
- aiNET GmbH, Basel, Switzerland.
| |
Collapse
|
7
|
Weber CR, Rubio T, Wang L, Zhang W, Robert PA, Akbar R, Snapkov I, Wu J, Kuijjer ML, Tarazona S, Conesa A, Sandve GK, Liu X, Reddy ST, Greiff V. Reference-based comparison of adaptive immune receptor repertoires. CELL REPORTS METHODS 2022; 2:100269. [PMID: 36046619 PMCID: PMC9421535 DOI: 10.1016/j.crmeth.2022.100269] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 04/01/2022] [Accepted: 07/19/2022] [Indexed: 11/26/2022]
Abstract
B and T cell receptor (immune) repertoires can represent an individual's immune history. While current repertoire analysis methods aim to discriminate between health and disease states, they are typically based on only a limited number of parameters. Here, we introduce immuneREF: a quantitative multidimensional measure of adaptive immune repertoire (and transcriptome) similarity that allows interpretation of immune repertoire variation by relying on both repertoire features and cross-referencing of simulated and experimental datasets. To quantify immune repertoire similarity landscapes across health and disease, we applied immuneREF to >2,400 datasets from individuals with varying immune states (healthy, [autoimmune] disease, and infection). We discovered, in contrast to the current paradigm, that blood-derived immune repertoires of healthy and diseased individuals are highly similar for certain immune states, suggesting that repertoire changes to immune perturbations are less pronounced than previously thought. In conclusion, immuneREF enables the population-wide study of adaptive immune response similarity across immune states.
Collapse
Affiliation(s)
- Cédric R. Weber
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Teresa Rubio
- Laboratory of Neurobiology, Centro Investigación Príncipe Felipe, Valencia, Spain
| | - Longlong Wang
- BGI-Shenzhen, Shenzhen, China
- BGI-Education Center, University of Chinese Academy of Sciences, Shenzhen, China
| | - Wei Zhang
- BGI-Shenzhen, Shenzhen, China
- Department of Computer Science, City University of Hong Kong, Hong Kong, China
| | - Philippe A. Robert
- Department of Immunology and Oslo University Hospital, University of Oslo, Oslo, Norway
| | - Rahmad Akbar
- Department of Immunology and Oslo University Hospital, University of Oslo, Oslo, Norway
| | - Igor Snapkov
- Department of Immunology and Oslo University Hospital, University of Oslo, Oslo, Norway
| | | | - Marieke L. Kuijjer
- Centre for Molecular Medicine Norway, University of Oslo, Oslo, Norway
- Department of Pathology, Leiden University Medical Center, Leiden, the Netherlands
- Leiden Center for Computational Oncology, Leiden University Medical Center, Leiden, the Netherlands
| | - Sonia Tarazona
- Departamento de Estadística e Investigación Operativa Aplicadas y Calidad, Universitat Politècnica de València, Valencia, Spain
| | - Ana Conesa
- Institute for Integrative Systems Biology, Spanish National Research Council, Valencia, Spain
| | - Geir K. Sandve
- Department of Informatics, University of Oslo, Oslo, Norway
| | - Xiao Liu
- BGI-Shenzhen, Shenzhen, China
- Shenzhen International Graduate School, Tsinghua University, Shenzhen, China
| | - Sai T. Reddy
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Victor Greiff
- Department of Immunology and Oslo University Hospital, University of Oslo, Oslo, Norway
| |
Collapse
|
8
|
Lim YW, Adler AS, Johnson DS. Predicting antibody binders and generating synthetic antibodies using deep learning. MAbs 2022; 14:2069075. [PMID: 35482911 PMCID: PMC9067455 DOI: 10.1080/19420862.2022.2069075] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 04/19/2022] [Indexed: 11/30/2022] Open
Abstract
The antibody drug field has continually sought improvements to methods for candidate discovery and engineering. Historically, most such methods have been laboratory-based, but informatics methods have recently started to make an impact. Deep learning, a subfield of machine learning, is rapidly gaining prominence in the biomedical research. Recent advances in microfluidics technologies and next-generation sequencing have not only revolutionized therapeutic antibody discovery, but also contributed to a vast amount of antibody repertoire sequencing data, providing opportunities for deep learning-based applications. Previously, we used microfluidics, yeast display, and deep sequencing to generate a panel of binder and non-binder antibody sequences to the cancer immunotherapy targets PD-1 and CTLA-4. Here we encoded the antibody light and heavy chain complementarity-determining regions (CDR3s) into antibody images, then built and trained convolutional neural network models to classify binders and non-binders. To improve model interpretability, we performed in silico mutagenesis to identify CDR3 residues that were important for binder classification. We further built generative deep learning models using generative adversarial network models to produce synthetic antibodies against PD-1 and CTLA-4. Our models generated variable length CDR3 sequences that resemble real sequences. Overall, our study demonstrates that deep learning methods can be leveraged to mine and learn patterns in antibody sequences, offering insights into antibody engineering, optimization, and discovery.
Collapse
Affiliation(s)
- Yoong Wearn Lim
- GigaGen Inc. (A Grifols Company), South San Francisco, CA, USA
| | - Adam S. Adler
- GigaGen Inc. (A Grifols Company), South San Francisco, CA, USA
| | | |
Collapse
|
9
|
Akbar R, Robert PA, Weber CR, Widrich M, Frank R, Pavlović M, Scheffer L, Chernigovskaya M, Snapkov I, Slabodkin A, Mehta BB, Miho E, Lund-Johansen F, Andersen JT, Hochreiter S, Hobæk Haff I, Klambauer G, Sandve GK, Greiff V. In silico proof of principle of machine learning-based antibody design at unconstrained scale. MAbs 2022; 14:2031482. [PMID: 35377271 PMCID: PMC8986205 DOI: 10.1080/19420862.2022.2031482] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 01/17/2022] [Indexed: 12/15/2022] Open
Abstract
Generative machine learning (ML) has been postulated to become a major driver in the computational design of antigen-specific monoclonal antibodies (mAb). However, efforts to confirm this hypothesis have been hindered by the infeasibility of testing arbitrarily large numbers of antibody sequences for their most critical design parameters: paratope, epitope, affinity, and developability. To address this challenge, we leveraged a lattice-based antibody-antigen binding simulation framework, which incorporates a wide range of physiological antibody-binding parameters. The simulation framework enables the computation of synthetic antibody-antigen 3D-structures, and it functions as an oracle for unrestricted prospective evaluation and benchmarking of antibody design parameters of ML-generated antibody sequences. We found that a deep generative model, trained exclusively on antibody sequence (one dimensional: 1D) data can be used to design conformational (three dimensional: 3D) epitope-specific antibodies, matching, or exceeding the training dataset in affinity and developability parameter value variety. Furthermore, we established a lower threshold of sequence diversity necessary for high-accuracy generative antibody ML and demonstrated that this lower threshold also holds on experimental real-world data. Finally, we show that transfer learning enables the generation of high-affinity antibody sequences from low-N training data. Our work establishes a priori feasibility and the theoretical foundation of high-throughput ML-based mAb design.
Collapse
Affiliation(s)
- Rahmad Akbar
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
| | - Philippe A. Robert
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
| | - Cédric R. Weber
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Michael Widrich
- Ellis Unit Linz and Lit Ai Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
| | - Robert Frank
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
| | | | | | - Maria Chernigovskaya
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
| | - Igor Snapkov
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
| | - Andrei Slabodkin
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
| | - Brij Bhushan Mehta
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
| | - Enkelejda Miho
- Institute of Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland
| | - Fridtjof Lund-Johansen
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
| | - Jan Terje Andersen
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
- Institute of Clinical Medicine, Department of Pharmacology, University of Oslo, Oslo, Norway
| | - Sepp Hochreiter
- Ellis Unit Linz and Lit Ai Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
- Institute of Advanced Research in Artificial Intelligence (IARAI), Austria
| | | | - Günter Klambauer
- Ellis Unit Linz and Lit Ai Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
| | | | - Victor Greiff
- Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway
| |
Collapse
|