1
|
Ferruz N, Heinzinger M, Akdel M, Goncearenco A, Naef L, Dallago C. From sequence to function through structure: Deep learning for protein design. Comput Struct Biotechnol J 2022; 21:238-250. [PMID: 36544476 PMCID: PMC9755234 DOI: 10.1016/j.csbj.2022.11.014] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 11/05/2022] [Accepted: 11/05/2022] [Indexed: 11/20/2022] Open
Abstract
The process of designing biomolecules, in particular proteins, is witnessing a rapid change in available tooling and approaches, moving from design through physicochemical force fields, to producing plausible, complex sequences fast via end-to-end differentiable statistical models. To achieve conditional and controllable protein design, researchers at the interface of artificial intelligence and biology leverage advances in natural language processing (NLP) and computer vision techniques, coupled with advances in computing hardware to learn patterns from growing biological databases, curated annotations thereof, or both. Once learned, these patterns can be used to provide novel insights into mechanistic biology and the design of biomolecules. However, navigating and understanding the practical applications for the many recent protein design tools is complex. To facilitate this, we 1) document recent advances in deep learning (DL) assisted protein design from the last three years, 2) present a practical pipeline that allows to go from de novo-generated sequences to their predicted properties and web-powered visualization within minutes, and 3) leverage it to suggest a generated protein sequence which might be used to engineer a biosynthetic gene cluster to produce a molecular glue-like compound. Lastly, we discuss challenges and highlight opportunities for the protein design field.
Collapse
Key Words
- ADMM, Alternating Direction Method of Multipliers
- CNN, Convolutional Neural Network
- DL, Deep learning
- Deep learning
- Drug discovery
- FNN, fully-connected neural network
- GAN, Generative Adversarial Network
- GCN, Graph Convolutional Network
- GNN, Graph Neural Network
- GO, Gene Ontology
- GVP, Geometric Vector Perceptron
- LSTM, Long-Short Term Memory
- MLP, Multilayer Perceptron
- MSA, Multiple Sequence Alignment
- NLP, Natural Language Processing
- NSR, Natural Sequence Recovery
- Protein design
- Protein language models
- Protein prediction
- VAE, Variational Autoencoder
- pLM, protein Language Model
Collapse
Affiliation(s)
- Noelia Ferruz
- Institute of Informatics and Applications, University of Girona, Girona, Spain
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Michael Heinzinger
- Department of Informatics, Bioinformatics & Computational Biology, Technische Universität München, 85748 Garching, Germany
| | - Mehmet Akdel
- VantAI, 151 W 42nd Street, New York, NY 10036, United States
| | | | - Luca Naef
- VantAI, 151 W 42nd Street, New York, NY 10036, United States
| | - Christian Dallago
- Department of Informatics, Bioinformatics & Computational Biology, Technische Universität München, 85748 Garching, Germany
- VantAI, 151 W 42nd Street, New York, NY 10036, United States
- NVIDIA DE GmbH, Einsteinstraße 172, 81677 München, Germany
| |
Collapse
|
2
|
Lu L, Qin J, Chen J, Yu N, Miyano S, Deng Z, Li C. Recent computational drug repositioning strategies against SARS-CoV-2. Comput Struct Biotechnol J 2022; 20:5713-5728. [PMID: 36277237 PMCID: PMC9575573 DOI: 10.1016/j.csbj.2022.10.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 10/12/2022] [Accepted: 10/12/2022] [Indexed: 11/08/2022] Open
Abstract
We performed a comprehensive review of computational drug repositioning methods applied to COVID-19 based on differing data types including sequence data, expression data, structure data and interaction data. We found that graph theory and neural network were the most used strategies for drug repositioning in the case of COVID-19. Integrating different levels of data may improve the success rate for drug repositioning.
Since COVID-19 emerged in 2019, significant levels of suffering and disruption have been caused on a global scale. Although vaccines have become widely used, the virus has shown its potential for evading immunities or acquiring other novel characteristics. Whether current drug treatments are still effective for people infected with Omicron remains unclear. Due to the long development cycles and high expense requirements of de novo drug development, many researchers have turned to consider drug repositioning in the search to find effective treatments for COVID-19. Here, we review such drug repositioning and combination efforts towards providing better handling. For potential drugs under consideration, aspects of both structure and function require attention, with specific categories of sequence, expression, structure, and interaction, the key parameters for investigation. For different data types, we show the corresponding differing drug repositioning methods that have been exploited. As incorporating drug combinations can increase therapeutic efficacy and reduce toxicity, we also review computational strategies to reveal drug combination potential. Taken together, we found that graph theory and neural network were the most used strategy with high potential towards drug repositioning for COVID-19. Integrating different levels of data may further improve the success rate of drug repositioning.
Collapse
Affiliation(s)
- Lu Lu
- Department of Human Genetics, Department of Ultrasound, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China,Zhejiang Provincial Key Laboratory of Genetic & Developmental Disorders, Zhejiang University School of Medicine, Hangzhou, China
| | - Jiale Qin
- Department of Human Genetics, Department of Ultrasound, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China,Zhejiang Provincial Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Hangzhou, China
| | - Jiandong Chen
- Department of Human Genetics, Department of Ultrasound, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China,School of Public Health, Undergraduate School of Zhejiang University, Hangzhou, China
| | - Na Yu
- Department of Human Genetics, Department of Ultrasound, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Satoru Miyano
- M&D Data Science Center, Tokyo Medical and Dental University, Tokyo, Japan
| | - Zhenzhong Deng
- Xinhua Hospital, School of Medicine, Shanghai Jiaotong University, Shanghai, China,Corresponding authors at: Department of Human Genetics, Department of Ultrasound, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China (C. Li).
| | - Chen Li
- Department of Human Genetics, Department of Ultrasound, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China,Zhejiang Provincial Key Laboratory of Genetic & Developmental Disorders, Zhejiang University School of Medicine, Hangzhou, China,Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, China,Corresponding authors at: Department of Human Genetics, Department of Ultrasound, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China (C. Li).
| |
Collapse
|