Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Abdin O, Nim S, Wen H, Kim PM. PepNN: a deep attention model for the identification of peptide binding sites. Commun Biol 2022;5:503. [PMID: 35618814 PMCID: PMC9135736 DOI: 10.1038/s42003-022-03445-2] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 05/03/2022] [Indexed: 11/09/2022] Open

For:	Abdin O, Nim S, Wen H, Kim PM. PepNN: a deep attention model for the identification of peptide binding sites. Commun Biol 2022;5:503. [PMID: 35618814 PMCID: PMC9135736 DOI: 10.1038/s42003-022-03445-2] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 05/03/2022] [Indexed: 11/09/2022] Open

Number

Cited by Other Article(s)

Asim MN, Asif T, Hassan F, Dengel A. Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models. Database (Oxford) 2025;2025:baaf027. [PMID: 40448683 DOI: 10.1093/database/baaf027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Revised: 02/06/2025] [Accepted: 03/26/2025] [Indexed: 06/02/2025]

Abstract

Protein sequence analysis examines the order of amino acids within protein sequences to unlock diverse types of a wealth of knowledge about biological processes and genetic disorders. It helps in forecasting disease susceptibility by finding unique protein signatures, or biomarkers that are linked to particular disease states. Protein Sequence analysis through wet-lab experiments is expensive, time-consuming and error prone. To facilitate large-scale proteomics sequence analysis, the biological community is striving for utilizing AI competence for transitioning from wet-lab to computer aided applications. However, Proteomics and AI are two distinct fields and development of AI-driven protein sequence analysis applications requires knowledge of both domains. To bridge the gap between both fields, various review articles have been written. However, these articles focus revolves around few individual tasks or specific applications rather than providing a comprehensive overview about wide tasks and applications. Following the need of a comprehensive literature that presents a holistic view of wide array of tasks and applications, contributions of this manuscript are manifold: It bridges the gap between Proteomics and AI fields by presenting a comprehensive array of AI-driven applications for 63 distinct protein sequence analysis tasks. It equips AI researchers by facilitating biological foundations of 63 protein sequence analysis tasks. It enhances development of AI-driven protein sequence analysis applications by providing comprehensive details of 68 protein databases. It presents a rich data landscape, encompassing 627 benchmark datasets of 63 diverse protein sequence analysis tasks. It highlights the utilization of 25 unique word embedding methods and 13 language models in AI-driven protein sequence analysis applications. It accelerates the development of AI-driven applications by facilitating current state-of-the-art performances across 63 protein sequence analysis tasks.

Collapse

Gao L, Zhang Y, Ge F, Li S, Guo Y, Song J, Yu DJ. Structure-Directed Pan-Specific T-Cell Receptor-Peptide-Major Histocompatibility Complex Interaction Prediction. J Chem Inf Model 2025;65:4674-4686. [PMID: 40297927 DOI: 10.1021/acs.jcim.5c00055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/30/2025]

Xiong S, Cai J, Shi H, Cui F, Zhang Z, Wei L. UMPPI: Unveiling Multilevel Protein-Peptide Interaction Prediction via Language Models. J Chem Inf Model 2025;65:3789-3799. [PMID: 40077987 DOI: 10.1021/acs.jcim.4c02365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/14/2025]

Ünlü A, Ulusoy E, Yiğit MG, Darcan M, Doğan T. Protein language models for predicting drug-target interactions: Novel approaches, emerging methods, and future directions. Curr Opin Struct Biol 2025;91:103017. [PMID: 39985946 DOI: 10.1016/j.sbi.2025.103017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2024] [Revised: 01/28/2025] [Accepted: 01/29/2025] [Indexed: 02/24/2025]

Malhotra Y, John J, Yadav D, Sharma D, Vanshika, Rawal K, Mishra V, Chaturvedi N. Advancements in protein structure prediction: A comparative overview of AlphaFold and its derivatives. Comput Biol Med 2025;188:109842. [PMID: 39970826 DOI: 10.1016/j.compbiomed.2025.109842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2024] [Revised: 02/07/2025] [Accepted: 02/10/2025] [Indexed: 02/21/2025]

Wang Y, Han S, Wang Y, Liang Q, Luo W. Artificial Intelligence Technology Assists Enzyme Prediction and Rational Design. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2025;73:7065-7073. [PMID: 40066931 DOI: 10.1021/acs.jafc.4c13201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/27/2025]

Khan S, Noor S, Awan HH, Iqbal S, AlQahtani SA, Dilshad N, Ahmad N. Deep-ProBind: binding protein prediction with transformer-based deep learning model. BMC Bioinformatics 2025;26:88. [PMID: 40121399 PMCID: PMC11929993 DOI: 10.1186/s12859-025-06101-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2024] [Accepted: 03/04/2025] [Indexed: 03/25/2025] Open

Abstract

Binding proteins play a crucial role in biological systems by selectively interacting with specific molecules, such as DNA, RNA, or peptides, to regulate various cellular processes. Their ability to recognize and bind target molecules with high specificity makes them essential for signal transduction, transport, and enzymatic activity. Traditional experimental methods for identifying protein-binding peptides are costly and time-consuming. Current sequence-based approaches often struggle with accuracy, focusing too narrowly on proximal sequence features and ignoring structural data. This study presents Deep-ProBind, a powerful prediction model designed to classify protein binding sites by integrating sequence and structural information. The proposed model employs a transformer and evolutionary-based attention mechanism, i.e., Bidirectional Encoder Representations from Transformers (BERT) and Pseudo position specific scoring matrix -Discrete Wavelet Transform (PsePSSM -DWT) approach to encode peptides. The SHapley Additive exPlanations (SHAP) algorithm selects the optimal hybrid features, and a Deep Neural Network (DNN) is then used as the classification algorithm to predict protein-binding peptides. The performance of the proposed model was evaluated in comparison with traditional Machine Learning (ML) algorithms and existing models. Experimental results demonstrate that Deep-ProBind achieved 92.67% accuracy with tenfold cross-validation on benchmark datasets and 93.62% accuracy on independent samples. The Deep-ProBind outperforms existing models by 3.57% on training data and 1.52% on independent tests. These results demonstrate Deep-ProBind's reliability and effectiveness, making it a valuable tool for researchers and a potential resource in pharmacological studies, where peptide binding plays a critical role in therapeutic development.

Collapse

Tang S, Zhang Y, Tong A, Chatterjee P. Gumbel-Softmax Flow Matching with Straight-Through Guidance for Controllable Biological Sequence Generation. ARXIV 2025:arXiv:2503.17361v1. [PMID: 40166737 PMCID: PMC11957225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]

Shi C, Liu F, Su X, Yang Z, Wang Y, Xie S, Xie S, Sun Q, Chen Y, Sang L, Tan M, Zhu L, Lei K, Li J, Yang J, Gao Z, Yu M, Wang X, Wang J, Chen J, Zhuo W, Fang Z, Liu J, Yan Q, Neculai D, Sun Q, Shao J, Lin W, Liu W, Chen J, Wang L, Liu Y, Li X, Zhou T, Lin A. Comprehensive discovery and functional characterization of the noncanonical proteome. Cell Res 2025;35:186-204. [PMID: 39794466 PMCID: PMC11909191 DOI: 10.1038/s41422-024-01059-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2024] [Accepted: 11/14/2024] [Indexed: 01/13/2025] Open

Abstract

The systematic identification and functional characterization of noncanonical translation products, such as novel peptides, will facilitate the understanding of the human genome and provide new insights into cell biology. Here, we constructed a high-coverage peptide sequencing reference library with 11,668,944 open reading frames and employed an ultrafiltration tandem mass spectrometry assay to identify novel peptides. Through these methods, we discovered 8945 previously unannotated peptides from normal gastric tissues, gastric cancer tissues and cell lines, nearly half of which were derived from noncoding RNAs. Moreover, our CRISPR screening revealed that 1161 peptides are involved in tumor cell proliferation. The presence and physiological function of a subset of these peptides, selected based on screening scores, amino acid length, and various indicators, were verified through Flag-knockin and multiple other methods. To further characterize the potential regulatory mechanisms involved, we constructed a framework based on artificial intelligence structure prediction and peptide‒protein interaction network analysis for the top 100 candidates and revealed that these cancer-related peptides have diverse subcellular locations and participate in organelle-specific processes. Further investigation verified the interacting partners of pep1-nc-OLMALINC, pep5-nc-TRHDE-AS1, pep-nc-ZNF436-AS1 and pep2-nc-AC027045.3, and the functions of these peptides in mitochondrial complex assembly, energy metabolism, and cholesterol metabolism, respectively. We showed that pep5-nc-TRHDE-AS1 and pep2-nc-AC027045.3 had substantial impacts on tumor growth in xenograft models. Furthermore, the dysregulation of these four peptides is closely correlated with clinical prognosis. Taken together, our study provides a comprehensive characterization of the noncanonical proteome, and highlights critical roles of these previously unannotated peptides in cancer biology.

Collapse

Affiliation(s)

Chengyu Shi The Center for RNA Medicine, International Institutes of Medicine, International School of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, Yiwu, Zhejiang, China MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Fangzhou Liu The Center for RNA Medicine, International Institutes of Medicine, International School of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, Yiwu, Zhejiang, China MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Xinwan Su The Center for RNA Medicine, International Institutes of Medicine, International School of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, Yiwu, Zhejiang, China MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Zuozhen Yang MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Ying Wang MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Shanshan Xie Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Department of Cell Biology and Program in Molecular Cell Biology, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China Department of Gastroenterology, the Second Affiliated Hospital, School of Medicine and Institute of Gastroenterology, Zhejiang University, Hangzhou, Zhejiang, China
Shaofang Xie Key Laboratory of Structural Biology of Zhejiang Province, Westlake Laboratory of Life Sciences and Biomedicine, Westlake University, Hangzhou, Zhejiang, China
Qiang Sun The Center for RNA Medicine, International Institutes of Medicine, International School of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, Yiwu, Zhejiang, China
Yu Chen MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Lingjie Sang MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Manman Tan MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Linyu Zhu MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Kai Lei MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Junhong Li MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Jiecheng Yang MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Zerui Gao MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Meng Yu MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Xinyi Wang MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Junfeng Wang MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China
Jing Chen Department of Gastrointestinal Surgery, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
Wei Zhuo Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Department of Cell Biology and Program in Molecular Cell Biology, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China Department of Gastroenterology, the Second Affiliated Hospital, School of Medicine and Institute of Gastroenterology, Zhejiang University, Hangzhou, Zhejiang, China
Zhaoyuan Fang Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Haining, Zhejiang, China The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
Jian Liu Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Haining, Zhejiang, China Hangzhou Cancer Hospital, Hangzhou, Zhejiang, China
Qingfeng Yan MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China
Dante Neculai The Center for RNA Medicine, International Institutes of Medicine, International School of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, Yiwu, Zhejiang, China
Qiming Sun The Center for RNA Medicine, International Institutes of Medicine, International School of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, Yiwu, Zhejiang, China
Jianzhong Shao MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China
Weiqiang Lin Department of Nephrology, Center for Regeneration and Aging Medicine, The Fourth Affiliated Hospital of School of Medicine and International School of Medicine, International Institutes of Medicine, Zhejiang University, Yiwu, Zhejiang, China
Wei Liu The Center for RNA Medicine, International Institutes of Medicine, International School of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, Yiwu, Zhejiang, China
Jian Chen Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China Department of Gastrointestinal Surgery, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
Liangjing Wang Department of Gastroenterology, the Second Affiliated Hospital, School of Medicine and Institute of Gastroenterology, Zhejiang University, Hangzhou, Zhejiang, China
Yang Liu Institute of Immunology, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
Xu Li Key Laboratory of Structural Biology of Zhejiang Province, Westlake Laboratory of Life Sciences and Biomedicine, Westlake University, Hangzhou, Zhejiang, China
Tianhua Zhou The Center for RNA Medicine, International Institutes of Medicine, International School of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, Yiwu, Zhejiang, China. Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China. Department of Cell Biology and Program in Molecular Cell Biology, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China. Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
Aifu Lin The Center for RNA Medicine, International Institutes of Medicine, International School of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, Yiwu, Zhejiang, China. MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, China. Cancer Center, Zhejiang University, Hangzhou, Zhejiang, China. Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, Hangzhou, Zhejiang, China. Future Health Laboratory, Innovation Center of Yangtze River Delta, Zhejiang University, Jiashan, Zhejiang, China. Key Laboratory for Cell and Gene Engineering of Zhejiang Province, Hangzhou, Zhejiang, China.

Collapse

Sequeira A, Rocha M, Lousa D. Machine and deep learning to predict viral fusion peptides. Comput Struct Biotechnol J 2025;27:692-704. [PMID: 40083606 PMCID: PMC11903910 DOI: 10.1016/j.csbj.2025.02.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2024] [Revised: 02/10/2025] [Accepted: 02/17/2025] [Indexed: 03/16/2025] Open

Zhai S, Liu T, Lin S, Li D, Liu H, Yao X, Hou T. Artificial intelligence in peptide-based drug design. Drug Discov Today 2025;30:104300. [PMID: 39842504 DOI: 10.1016/j.drudis.2025.104300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2024] [Revised: 01/14/2025] [Accepted: 01/15/2025] [Indexed: 01/24/2025]

Bhat S, Palepu K, Hong L, Mao J, Ye T, Iyer R, Zhao L, Chen T, Vincoff S, Watson R, Wang TZ, Srijay D, Kavirayuni VS, Kholina K, Goel S, Vure P, Deshpande AJ, Soderling SH, DeLisa MP, Chatterjee P. De novo design of peptide binders to conformationally diverse targets with contrastive language modeling. SCIENCE ADVANCES 2025;11:eadr8638. [PMID: 39841846 PMCID: PMC11753435 DOI: 10.1126/sciadv.adr8638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2024] [Accepted: 12/20/2024] [Indexed: 01/24/2025]

Affiliation(s)

Suhaas Bhat Department of Biomedical Engineering, Duke University, Durham, NC, USA
Kalyan Palepu Department of Biomedical Engineering, Duke University, Durham, NC, USA
Lauren Hong Department of Biomedical Engineering, Duke University, Durham, NC, USA
Joey Mao Department of Cell Biology, Duke University, Durham, NC, USA
Tianzheng Ye Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA
Rema Iyer Cancer Genome and Epigenetics Program, Sanford Burnham Prebys Institute, San Diego, CA, USA
Lin Zhao Department of Biomedical Engineering, Duke University, Durham, NC, USA
Tianlai Chen Department of Biomedical Engineering, Duke University, Durham, NC, USA
Sophia Vincoff Department of Biomedical Engineering, Duke University, Durham, NC, USA
Rio Watson Department of Biomedical Engineering, Duke University, Durham, NC, USA
Tian Z. Wang Department of Biomedical Engineering, Duke University, Durham, NC, USA
Divya Srijay Department of Biomedical Engineering, Duke University, Durham, NC, USA
Venkata Srikar Kavirayuni Department of Biomedical Engineering, Duke University, Durham, NC, USA
Kseniia Kholina Department of Biomedical Engineering, Duke University, Durham, NC, USA
Shrey Goel Department of Biomedical Engineering, Duke University, Durham, NC, USA
Pranay Vure Department of Biomedical Engineering, Duke University, Durham, NC, USA
Aniruddha J. Deshpande Cancer Genome and Epigenetics Program, Sanford Burnham Prebys Institute, San Diego, CA, USA
Scott H. Soderling Department of Cell Biology, Duke University, Durham, NC, USA
Matthew P. DeLisa Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA Cornell Institute of Biotechnology, Cornell University, Ithaca, NY, USA
Pranam Chatterjee Department of Biomedical Engineering, Duke University, Durham, NC, USA Department of Computer Science, Duke University, Durham, NC, USA Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, USA

Collapse

Guan C, Fernandes FC, Franco OL, de la Fuente-Nunez C. Leveraging large language models for peptide antibiotic design. CELL REPORTS. PHYSICAL SCIENCE 2025;6:102359. [PMID: 39949833 PMCID: PMC11823563 DOI: 10.1016/j.xcrp.2024.102359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/16/2025]

Affiliation(s)

Changge Guan Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, USA Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA These authors contributed equally
Fabiano C. Fernandes Centro de Análises Proteômicas e Bioquímicas, Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, Brazil Departamento de Ciência da Computação, Instituto Federal de Brasília, Campus Taguatinga, Brasília, Brazil These authors contributed equally
Octavio L. Franco Centro de Análises Proteômicas e Bioquímicas, Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, Brazil S-Inova Biotech, Programa de Pós-Graduação em Biotecnologia, Universidade Católica Dom Bosco, Campo Grande, Brazil
Cesar de la Fuente-Nunez Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, USA Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA

Collapse

Gagoski D, Rube HT, Rastogi C, Melo LAN, Li X, Voleti R, Shah NH, Bussemaker HJ. Accurate sequence-to-affinity models for SH2 domains from multi-round peptide binding assays coupled with free-energy regression. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.12.23.630085. [PMID: 39764007 PMCID: PMC11703206 DOI: 10.1101/2024.12.23.630085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/12/2025]

Sun X, Wu Z, Su J, Li C. GraphPBSP: Protein binding site prediction based on Graph Attention Network and pre-trained model ProstT5. Int J Biol Macromol 2024;282:136933. [PMID: 39471921 DOI: 10.1016/j.ijbiomac.2024.136933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Revised: 10/21/2024] [Accepted: 10/24/2024] [Indexed: 11/01/2024]

Huang J, Li W, Xiao B, Zhao C, Zheng H, Li Y, Wang J. PepCA: Unveiling protein-peptide interaction sites with a multi-input neural network model. iScience 2024;27:110850. [PMID: 39391726 PMCID: PMC11465048 DOI: 10.1016/j.isci.2024.110850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 06/13/2024] [Accepted: 08/27/2024] [Indexed: 10/12/2024] Open

Shafiee S, Fathi A, Taherzadeh G. DP-site: A dual deep learning-based method for protein-peptide interaction site prediction. Methods 2024;229:17-29. [PMID: 38871095 DOI: 10.1016/j.ymeth.2024.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/22/2024] [Accepted: 06/01/2024] [Indexed: 06/15/2024] Open

Abstract

BACKGROUND

Protein-peptide interaction prediction is an important topic for several applications including various biological processes, understanding drug discovery, protein function abnormal cellular behaviors, and treating diseases. Over the years, studies have shown that experimental methods have improved the identification of this bio-molecular interaction. However, predicting protein-peptide interactions using these methods is laborious, time-consuming, dependent on third-party tools, and costly.

METHOD

To address these previous drawbacks, this study introduces a computational framework called DP-Site. The proposed framework concentrates on using a compound of a dual pipeline along with a combination predictor. A deep convolutional neural network for feature extraction and classification is embedded in pipeline 1. In addition, pipeline 2 includes a deep long-short-term memory-based and a random forest classifier for feature extraction and classification. In this investigation, the evolutionary, structure-based, sequence-based, and physicochemical information of proteins is utilized for identifying protein-peptide interaction at the residue level.

RESULTS

The proposed method is evaluated on both the ten-fold cross-validation and independent test sets. The robust and consistent results between cross-validation and independent test sets confirm the ability of the proposed method to predict peptide binding residues in proteins. Moreover, experimental findings demonstrate that DP-Site has significantly outperformed other state-of-the-art sequence-based and structure-based methods. The proposed method achieves a remarkable balance between a specificity of 0.799 and a sensitivity of 0.770, along with the best f-measure of 0.661 and the highest precision of 0.580 using an independent test set.

CONCLUSIONS

The outcome of various experiments confirms the proficiency of the proposed method and outperforms state-of-the-art sequence-based and structure-based methods in terms of the mentioned criteria. DP-Site can be accessed at https://github.com/shafiee 95/shima.shafiee.DP-Site.

Collapse

Sun X, Wu Z, Su J, Li C. A deep attention model for wide-genome protein-peptide binding affinity prediction at a sequence level. Int J Biol Macromol 2024;276:133811. [PMID: 38996881 DOI: 10.1016/j.ijbiomac.2024.133811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Revised: 07/09/2024] [Accepted: 07/09/2024] [Indexed: 07/14/2024]

Yadalam PK, Ramadoss R, Anegundi RV. HyperAttention and Linformer-Based β-catenin Sequence Prediction For Bone Formation. Cureus 2024;16:e68849. [PMID: 39376879 PMCID: PMC11456985 DOI: 10.7759/cureus.68849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2024] [Accepted: 09/07/2024] [Indexed: 10/09/2024] Open

Chen T, Dumas M, Watson R, Vincoff S, Peng C, Zhao L, Hong L, Pertsemlidis S, Shaepers-Cheu M, Wang TZ, Srijay D, Monticello C, Vure P, Pulugurta R, Kholina K, Goel S, DeLisa MP, Truant R, Aguilar HC, Chatterjee P. PepMLM: Target Sequence-Conditioned Generation of Therapeutic Peptide Binders via Span Masked Language Modeling. ARXIV 2024:arXiv:2310.03842v3. [PMID: 37873004 PMCID: PMC10593082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]

Affiliation(s)

Tianlai Chen Department of Biomedical Engineering, Duke University
Madeleine Dumas Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University Department of Microbiology, College of Agriculture and Life Sciences, Cornell University
Rio Watson Department of Biomedical Engineering, Duke University
Sophia Vincoff Department of Biomedical Engineering, Duke University
Christina Peng Department of Biochemistry and Biomedical Sciences, McMaster University
Lin Zhao Department of Biomedical Engineering, Duke University
Lauren Hong Department of Biomedical Engineering, Duke University
Sarah Pertsemlidis Department of Biomedical Engineering, Duke University
Mayumi Shaepers-Cheu Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University
Tian Zi Wang Department of Biomedical Engineering, Duke University
Divya Srijay Department of Biomedical Engineering, Duke University
Connor Monticello Department of Biochemistry and Biomedical Sciences, McMaster University
Pranay Vure Department of Biomedical Engineering, Duke University
Rishab Pulugurta Department of Biomedical Engineering, Duke University
Kseniia Kholina Department of Biomedical Engineering, Duke University
Shrey Goel Department of Biomedical Engineering, Duke University
Matthew P. DeLisa Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA Robert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA Cornell Institute of Biotechnology, Cornell University, Ithaca, NY, USA
Ray Truant Department of Biochemistry and Biomedical Sciences, McMaster University
Hector C. Aguilar Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University
Pranam Chatterjee Department of Biomedical Engineering, Duke University Department of Computer Science, Duke University Department of Biostatistics and Bioinformatics, Duke University

Collapse

Wu J, Wang Y, Cai W, Chen D, Peng X, Dong H, Li J, Liu H, Shi S, Tang S, Li Z, Sui H, Wang Y, Wu C, Zhang Y, Fu X, Yin Y. Ribosomal translation of fluorinated non-canonical amino acids for de novo biologically active fluorinated macrocyclic peptides. Chem Sci 2024:d4sc04061a. [PMID: 39129776 PMCID: PMC11310889 DOI: 10.1039/d4sc04061a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Accepted: 07/25/2024] [Indexed: 08/13/2024] Open

Chen T, Zhang Y, Chatterjee P. moPPIt: De Novo Generation of Motif-Specific Binders with Protein Language Models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.31.606098. [PMID: 39131360 PMCID: PMC11312608 DOI: 10.1101/2024.07.31.606098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]

Yuan Q, Tian C, Song Y, Ou P, Zhu M, Zhao H, Yang Y. GPSFun: geometry-aware protein sequence function predictions with language models. Nucleic Acids Res 2024;52:W248-W255. [PMID: 38738636 PMCID: PMC11223820 DOI: 10.1093/nar/gkae381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 04/22/2024] [Accepted: 04/26/2024] [Indexed: 05/14/2024] Open

Zhu C, Zhang C, Shang T, Zhang C, Zhai S, Cao L, Xu Z, Su Z, Song Y, Su A, Li C, Duan H. GAPS: a geometric attention-based network for peptide binding site identification by the transfer learning approach. Brief Bioinform 2024;25:bbae297. [PMID: 38990514 PMCID: PMC11238429 DOI: 10.1093/bib/bbae297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Revised: 04/28/2024] [Accepted: 06/07/2024] [Indexed: 07/12/2024] Open

Abstract

Protein-peptide interactions (PPepIs) are vital to understanding cellular functions, which can facilitate the design of novel drugs. As an essential component in forming a PPepI, protein-peptide binding sites are the basis for understanding the mechanisms involved in PPepIs. Therefore, accurately identifying protein-peptide binding sites becomes a critical task. The traditional experimental methods for researching these binding sites are labor-intensive and time-consuming, and some computational tools have been invented to supplement it. However, these computational tools have limitations in generality or accuracy due to the need for ligand information, complex feature construction, or their reliance on modeling based on amino acid residues. To deal with the drawbacks of these computational algorithms, we describe a geometric attention-based network for peptide binding site identification (GAPS) in this work. The proposed model utilizes geometric feature engineering to construct atom representations and incorporates multiple attention mechanisms to update relevant biological features. In addition, the transfer learning strategy is implemented for leveraging the protein-protein binding sites information to enhance the protein-peptide binding sites recognition capability, taking into account the common structure and biological bias between proteins and peptides. Consequently, GAPS demonstrates the state-of-the-art performance and excellent robustness in this task. Moreover, our model exhibits exceptional performance across several extended experiments including predicting the apo protein-peptide, protein-cyclic peptide and the AlphaFold-predicted protein-peptide binding sites. These results confirm that the GAPS model is a powerful, versatile, stable method suitable for diverse binding site predictions.

Collapse

Yin S, Mi X, Shukla D. Leveraging machine learning models for peptide-protein interaction prediction. RSC Chem Biol 2024;5:401-417. [PMID: 38725911 PMCID: PMC11078210 DOI: 10.1039/d3cb00208j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 02/07/2024] [Indexed: 05/12/2024] Open

Yuan Q, Tian C, Yang Y. Genome-scale annotation of protein binding sites via language model and geometric deep learning. eLife 2024;13:RP93695. [PMID: 38630609 PMCID: PMC11023698 DOI: 10.7554/elife.93695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024] Open

Jia P, Zhang F, Wu C, Li M. A comprehensive review of protein-centric predictors for biomolecular interactions: from proteins to nucleic acids and beyond. Brief Bioinform 2024;25:bbae162. [PMID: 38739759 PMCID: PMC11089422 DOI: 10.1093/bib/bbae162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Revised: 02/17/2024] [Accepted: 03/31/2024] [Indexed: 05/16/2024] Open

Yin S, Mi X, Shukla D. Leveraging Machine Learning Models for Peptide-Protein Interaction Prediction. ARXIV 2024:arXiv:2310.18249v2. [PMID: 37961736 PMCID: PMC10635286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]

Vottero P, Olivetti EC, D'Agostino LC, Di Grazia L, Vezzetti E, Aminpour M, Tuszynski JA, Marcolin F. Understanding the contagiousness of Covid-19 strains: A geometric approach. J Mol Graph Model 2024;126:108670. [PMID: 37984193 DOI: 10.1016/j.jmgm.2023.108670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 11/06/2023] [Accepted: 11/07/2023] [Indexed: 11/22/2023]

Abstract

Protein-protein interaction occurs on surface patches with some degree of complementary geometric and chemical features. Building on this understanding, this study endeavors to characterize the spike protein of the SARS-CoV-2 virus at the morphological and geometrical levels in its Alpha, Delta, and Omicron variants. In particular, the affinity between different SARS-CoV-2 spike proteins and the ACE2 receptor present on the membrane of the human respiratory system cells is investigated. To achieve an adequate degree of geometrical accuracy, the 3D depth maps of the proteins in exam are filtered by developing an ad-hoc convolutional filter with a kernel implemented as a sphere of varying radius, simulating a ball rolling on the surface (similar to the 'rolling ball' filter). This ball ideally models a hypothetical molecule that could interface with the protein and is inspired by the geometric approach to macromolecule-ligand interactions proposed by Kuntz et al. in 1982. The aim is to mitigate the imperfections and to obtain a smoother surface that could be studied from a geometrical perspective for binding purposes. A set of geometric descriptors, borrowed from the 3D face analysis context is then mapped point-by-point onto protein depth maps. Following a feature extraction phase inspired by Histogram of Oriented Gradients and Local Binary Patterns, the final histogram features are used as input for a Support Vector Machine classifier to automatically classify the proteins according to their surface affinity, where a similarity in shape is observed between ACE2 and the spike protein of the SARS-CoV-2 Omicron variant. Finally, Root Mean Square Error analysis is used to quantify the geometrical affinity between the ACE2 receptor and the respective Receptor Binding Domains of the three SARS-CoV-2 variants, culminating in a geometrical explanation for the higher contagiousness of Omicron relative to the other variants under study.

Collapse

Zhang Z, Verburgt J, Kagaya Y, Christoffer C, Kihara D. Improved Peptide Docking with Privileged Knowledge Distillation using Deep Learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.01.569671. [PMID: 38106114 PMCID: PMC10723353 DOI: 10.1101/2023.12.01.569671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]

Chandra A, Sharma A, Dehzangi I, Tsunoda T, Sattar A. PepCNN deep learning tool for predicting peptide binding residues in proteins using sequence, structural, and language model features. Sci Rep 2023;13:20882. [PMID: 38016996 PMCID: PMC10684570 DOI: 10.1038/s41598-023-47624-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 11/16/2023] [Indexed: 11/30/2023] Open

Liu Y, Tian B. Protein-DNA binding sites prediction based on pre-trained protein language model and contrastive learning. Brief Bioinform 2023;25:bbad488. [PMID: 38171929 PMCID: PMC10782905 DOI: 10.1093/bib/bbad488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 09/28/2023] [Accepted: 11/30/2023] [Indexed: 01/05/2024] Open

Song Y, Yuan Q, Zhao H, Yang Y. Accurately identifying nucleic-acid-binding sites through geometric graph learning on language model predicted structures. Brief Bioinform 2023;24:bbad360. [PMID: 37824738 DOI: 10.1093/bib/bbad360] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Revised: 09/18/2023] [Accepted: 09/18/2023] [Indexed: 10/14/2023] Open

Abstract

The interactions between nucleic acids and proteins are important in diverse biological processes. The high-quality prediction of nucleic-acid-binding sites continues to pose a significant challenge. Presently, the predictive efficacy of sequence-based methods is constrained by their exclusive consideration of sequence context information, whereas structure-based methods are unsuitable for proteins lacking known tertiary structures. Though protein structures predicted by AlphaFold2 could be used, the extensive computing requirement of AlphaFold2 hinders its use for genome-wide applications. Based on the recent breakthrough of ESMFold for fast prediction of protein structures, we have developed GLMSite, which accurately identifies DNA- and RNA-binding sites using geometric graph learning on ESMFold predicted structures. Here, the predicted protein structures are employed to construct protein structural graph with residues as nodes and spatially neighboring residue pairs for edges. The node representations are further enhanced through the pre-trained language model ProtTrans. The network was trained using a geometric vector perceptron, and the geometric embeddings were subsequently fed into a common network to acquire common binding characteristics. Finally, these characteristics were input into two fully connected layers to predict binding sites with DNA and RNA, respectively. Through comprehensive tests on DNA/RNA benchmark datasets, GLMSite was shown to surpass the latest sequence-based methods and be comparable with structure-based methods. Moreover, the prediction was shown useful for inferring nucleic-acid-binding proteins, demonstrating its potential for protein function discovery. The datasets, codes, and trained models are available at https://github.com/biomed-AI/nucleic-acid-binding.

Collapse

Ghoreyshi ZS, George JT. Quantitative approaches for decoding the specificity of the human T cell repertoire. Front Immunol 2023;14:1228873. [PMID: 37781387 PMCID: PMC10539903 DOI: 10.3389/fimmu.2023.1228873] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 08/17/2023] [Indexed: 10/03/2023] Open

McFee M, Kim PM. GDockScore: a graph-based protein-protein docking scoring function. BIOINFORMATICS ADVANCES 2023;3:vbad072. [PMID: 37359726 PMCID: PMC10290236 DOI: 10.1093/bioadv/vbad072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 05/30/2023] [Accepted: 06/10/2023] [Indexed: 06/28/2023]

Peng X, Lei Y, Feng P, Jia L, Ma J, Zhao D, Zeng J. Characterizing the interaction conformation between T-cell receptors and epitopes with deep learning. NAT MACH INTELL 2023. [DOI: 10.1038/s42256-023-00634-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]

Wang X, Ding Z, Wang R, Lin X. Deepro-Glu: combination of convolutional neural network and Bi-LSTM models using ProtBert and handcrafted features to identify lysine glutarylation sites. Brief Bioinform 2023;24:6991122. [PMID: 36653898 DOI: 10.1093/bib/bbac631] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2022] [Revised: 12/11/2022] [Accepted: 12/28/2022] [Indexed: 01/20/2023] Open

Rogers JR, Nikolényi G, AlQuraishi M. Growing ecosystem of deep learning methods for modeling protein-protein interactions. Protein Eng Des Sel 2023;36:gzad023. [PMID: 38102755 DOI: 10.1093/protein/gzad023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 12/06/2023] [Accepted: 12/07/2023] [Indexed: 12/17/2023] Open

Cable J, Saphire EO, Hayday AC, Wiltshire TD, Mousa JJ, Humphreys DP, Breij ECW, Bruhns P, Broketa M, Furuya G, Hauser BM, Mahévas M, Carfi A, Cantaert T, Kwong PD, Tripathi P, Davis JH, Brewis N, Keyt BA, Fennemann FL, Dussupt V, Sivasubramanian A, Kim PM, Rawi R, Richardson E, Leventhal D, Wolters RM, Geuijen CAW, Sleeman MA, Pengo N, Donnellan FR. Antibodies as drugs-a Keystone Symposia report. Ann N Y Acad Sci 2023;1519:153-166. [PMID: 36382536 PMCID: PMC10103175 DOI: 10.1111/nyas.14915] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Affiliation(s)

Jennifer Cable PhD Science Writer, New York, New York, USA
Erica Ollmann Saphire Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, California, USA.,Department of Medicine, University of California San Diego, La Jolla, California, USA
Adrian C Hayday Peter Gorer Department of Immunobiology, King's College London, London, UK.,Cancer Research UK Cancer Immunotherapy Accelerator, London, UK.,Immunosurveillance Laboratory, The Francis Crick Institute, London, UK
Timothy D Wiltshire Mayo Clinic, Rochester, Minnesota, USA
Jarrod J Mousa Department of Infectious Diseases and Center for Vaccines and Immunology, College of Veterinary Medicine, Athens, Georgia, USA.,Department of Biochemistry and Molecular Biology, Franklin College of Arts and Sciences, University of Georgia, Athens, Georgia, USA.,Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, Tennessee, USA
David P Humphreys Protein Sciences, UCB Pharma, Berkshire, UK
Esther C W Breij Translational Research and Precision Medicine, Genmab BV, Utrecht, the Netherlands
Pierre Bruhns Institut Pasteur, Université de Paris, Unit of Antibodies in Therapy and Pathology, Paris, France
Matteo Broketa Institut Pasteur, Université de Paris, Unit of Antibodies in Therapy and Pathology, Paris, France
Genta Furuya Department of Preventive Medicine and Department of Pathology, Graduate School of Medicine, University of Tokyo, Tokyo, Japan
Blake M Hauser Ragon Institute of MGH, MIT, and Harvard, Cambridge, Massachusetts, USA
Matthieu Mahévas Service de Médecine Interne, Centre de Référence des Cytopénies Auto-immunes de l'adulte, Centre Hospitalier Universitaire Henri-Mondor, Assistance Publique-Hôpitaux de Paris, Université Paris-Est Créteil, Créteil, France
Andrea Carfi Moderna Inc., Cambridge, Massachusetts, USA.,Department of Pathology, Miller School of Medicine, University of Miami, Miami, Florida, USA
Tineke Cantaert Immunology Unit, Institut Pasteur du Cambodge, The Pasteur Network, Phnom Penh, Cambodia
Peter D Kwong Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland, USA
Prabhanshu Tripathi Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland, USA
Jonathan H Davis Invenra, Inc, Madison, Wisconsin, USA
Neil Brewis F-star Therapeutics Ltd., Cambridge, UK
Bruce A Keyt IGM Biosciences, Inc., Mountainview, California, USA
Felix L Fennemann Lava Therapeutics, Utrecht, the Netherlands
Vincent Dussupt Emerging Infectious Diseases Branch, U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, Maryland, USA.,Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, Maryland, USA
Arvind Sivasubramanian Computational Biology, Adimab, Palo Alto, California, USA
Philip M Kim Department of Molecular Genetics, Donnelly Centre for Cellular and Biomolecular Research, Toronto, Ontario, Canada.,Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
Reda Rawi Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland, USA
Eve Richardson Department of Statistics, University of Oxford, Oxford, UK
Daniel Leventhal Generate Biomedicines, Somerville, Massachusetts, USA
Rachael M Wolters Department of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee, USA
Cecile A W Geuijen Merus N.V., Utrecht, the Netherlands
Matthew A Sleeman Regeneron Pharmaceuticals, Tarrytown, New York, USA
Niccolo Pengo Mabylon AG, Schlieren, Switzerland
Francesca Rose Donnellan University of Oxford, Oxford, UK

Collapse

Durairaj J, de Ridder D, van Dijk AD. Beyond sequence: Structure-based machine learning. Comput Struct Biotechnol J 2022;21:630-643. [PMID: 36659927 PMCID: PMC9826903 DOI: 10.1016/j.csbj.2022.12.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/21/2022] [Accepted: 12/21/2022] [Indexed: 12/31/2022] Open

AmiA and AliA peptide ligands are secreted by Klebsiella pneumoniae and inhibit growth of Streptococcus pneumoniae. Sci Rep 2022;12:22268. [PMID: 36564446 PMCID: PMC9789142 DOI: 10.1038/s41598-022-26838-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 12/21/2022] [Indexed: 12/24/2022] Open