Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Jing X, Dong Q, Hong D, Lu R. Amino Acid Encoding Methods for Protein Sequences: A Comprehensive Review and Assessment. IEEE/ACM Trans Comput Biol Bioinform 2020;17:1918-1931. [PMID: 30998480 DOI: 10.1109/tcbb.2019.2911677] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]

For:	Jing X, Dong Q, Hong D, Lu R. Amino Acid Encoding Methods for Protein Sequences: A Comprehensive Review and Assessment. IEEE/ACM Trans Comput Biol Bioinform 2020;17:1918-1931. [PMID: 30998480 DOI: 10.1109/tcbb.2019.2911677] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]

Number

Cited by Other Article(s)

García Sánchez N, Ugarte Carro E, Prieto-Santamaría L, Rodríguez-González A. Protein sequence analysis in the context of drug repurposing. BMC Med Inform Decis Mak 2024;24:122. [PMID: 38741115 DOI: 10.1186/s12911-024-02531-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 05/08/2024] [Indexed: 05/16/2024] Open

Chen S, Li M, Semenov I. MFA-DTI: Drug-target interaction prediction based on multi-feature fusion adopted framework. Methods 2024;224:79-92. [PMID: 38430967 DOI: 10.1016/j.ymeth.2024.02.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2023] [Revised: 02/16/2024] [Accepted: 02/23/2024] [Indexed: 03/05/2024] Open

Erten M. MehNet: a vigesimal-based model by amino acid melting points generates unique ID numbers for protein sequences. J Biomol Struct Dyn 2024:1-7. [PMID: 38230442 DOI: 10.1080/07391102.2024.2302937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 01/02/2024] [Indexed: 01/18/2024]

Yang Z, Wang Y, Ni X, Yang S. DeepDRP: Prediction of intrinsically disordered regions based on integrated view deep learning architecture from transformer-enhanced and protein information. Int J Biol Macromol 2023;253:127390. [PMID: 37827403 DOI: 10.1016/j.ijbiomac.2023.127390] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Revised: 09/20/2023] [Accepted: 10/09/2023] [Indexed: 10/14/2023]

Alquran H, Al Fahoum A, Zyout A, Abu Qasmieh I. A comprehensive framework for advanced protein classification and function prediction using synergistic approaches: Integrating bispectral analysis, machine learning, and deep learning. PLoS One 2023;18:e0295805. [PMID: 38096313 PMCID: PMC10721063 DOI: 10.1371/journal.pone.0295805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Accepted: 11/29/2023] [Indexed: 12/17/2023] Open

Abstract

Proteins are fundamental components of diverse cellular systems and play crucial roles in a variety of disease processes. Consequently, it is crucial to comprehend their structure, function, and intricate interconnections. Classifying proteins into families or groups with comparable structural and functional characteristics is a crucial aspect of this comprehension. This classification is crucial for evolutionary research, predicting protein function, and identifying potential therapeutic targets. Sequence alignment and structure-based alignment are frequently ineffective techniques for identifying protein families.This study addresses the need for a more efficient and accurate technique for feature extraction and protein classification. The research proposes a novel method that integrates bispectrum characteristics, deep learning techniques, and machine learning algorithms to overcome the limitations of conventional methods. The proposed method uses numbers to represent protein sequences, utilizes bispectrum analysis, uses different topologies for convolutional neural networks to pull out features, and chooses robust features to classify protein families. The goal is to outperform existing methods for identifying protein families, thereby enhancing classification metrics. The materials consist of numerous protein datasets, whereas the methods incorporate bispectrum characteristics and deep learning strategies. The results of this study demonstrate that the proposed method for identifying protein families is superior to conventional approaches. Significantly enhanced quality metrics demonstrated the efficacy of the combined bispectrum and deep learning approaches. These findings have the potential to advance the field of protein biology and facilitate pharmaceutical innovation. In conclusion, this study presents a novel method that employs bispectrum characteristics and deep learning techniques to improve the precision and efficiency of protein family identification. The demonstrated advancements in classification metrics demonstrate this method's applicability to numerous scientific disciplines. This furthers our understanding of protein function and its implications for disease and treatment.

Collapse

Garber ME, Frank V, Kazakov AE, Incha MR, Nava AA, Zhang H, Valencia LE, Keasling JD, Rajeev L, Mukhopadhyay A. REC protein family expansion by the emergence of a new signaling pathway. mBio 2023;14:e0262223. [PMID: 37991384 PMCID: PMC10746176 DOI: 10.1128/mbio.02622-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 10/20/2023] [Indexed: 11/23/2023] Open

Affiliation(s)

Megan E. Garber Department of Comparative Biochemistry, University of California, Berkeley, California, USA Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Vered Frank Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Alexey E. Kazakov Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Matthew R. Incha Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA Department of Plant and Microbial Biology, University of California, Berkeley, California, USA
Alberto A. Nava Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California, USA
Hanqiao Zhang Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA Department of Bioengineering, University of California, Berkeley, California, USA
Luis E. Valencia Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA Department of Bioengineering, University of California, Berkeley, California, USA
Jay D. Keasling Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA Department of Plant and Microbial Biology, University of California, Berkeley, California, USA Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California, USA Department of Bioengineering, University of California, Berkeley, California, USA Center for Biosustainability, Danish Technical University, Lyngby, Denmark Center for Synthetic Biochemistry, Shenzhen Institutes for Advanced Technologies, Shenzhen, China
Lara Rajeev Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Aindrila Mukhopadhyay Department of Comparative Biochemistry, University of California, Berkeley, California, USA Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA

Collapse

Shaukat MA, Nguyen TT, Hsu EB, Yang S, Bhatti A. Comparative study of encoded and alignment-based methods for virus taxonomy classification. Sci Rep 2023;13:18662. [PMID: 37907535 PMCID: PMC10618506 DOI: 10.1038/s41598-023-45461-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 10/19/2023] [Indexed: 11/02/2023] Open

Savojardo C, Martelli PL, Casadio R. Finding functional motifs in protein sequences with deep learning and natural language models. Curr Opin Struct Biol 2023;81:102641. [PMID: 37385080 DOI: 10.1016/j.sbi.2023.102641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 04/17/2023] [Accepted: 05/24/2023] [Indexed: 07/01/2023]

Hsueh HT, Chou RT, Rai U, Liyanage W, Kim YC, Appell MB, Pejavar J, Leo KT, Davison C, Kolodziejski P, Mozzer A, Kwon H, Sista M, Anders NM, Hemingway A, Rompicharla SVK, Edwards M, Pitha I, Hanes J, Cummings MP, Ensign LM. Machine learning-driven multifunctional peptide engineering for sustained ocular drug delivery. Nat Commun 2023;14:2509. [PMID: 37130851 PMCID: PMC10154330 DOI: 10.1038/s41467-023-38056-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 04/12/2023] [Indexed: 05/04/2023] Open

Affiliation(s)

Henry T Hsueh Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Renee Ti Chou Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA
Usha Rai Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Wathsala Liyanage Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Yoo Chun Kim Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Matthew B Appell Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA
Jahnavi Pejavar Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Kirby T Leo Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
Charlotte Davison Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Patricia Kolodziejski Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Ann Mozzer Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
HyeYoung Kwon Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Maanasa Sista Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH, USA
Nicole M Anders The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
Avelina Hemingway The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
Sri Vishnu Kiran Rompicharla Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Malia Edwards Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Ian Pitha Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Justin Hanes Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
Michael P Cummings Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA.
Laura M Ensign Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA. Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA. Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA. Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA. Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA. The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA.

Collapse

Wang X, Ding Z, Wang R, Lin X. Deepro-Glu: combination of convolutional neural network and Bi-LSTM models using ProtBert and handcrafted features to identify lysine glutarylation sites. Brief Bioinform 2023;24:6991122. [PMID: 36653898 DOI: 10.1093/bib/bbac631] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2022] [Revised: 12/11/2022] [Accepted: 12/28/2022] [Indexed: 01/20/2023] Open

Milchevskiy YV, Milchevskaya VY, Kravatsky YV. Method to Generate Complex Predictive Features for Machine Learning-Based Prediction of the Local Structure and Functions of Proteins. Mol Biol 2023. [DOI: 10.1134/s0026893323010089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023]

Yue ZX, Yan TC, Xu HQ, Liu YH, Hong YF, Chen GX, Xie T, Tao L. A systematic review on the state-of-the-art strategies for protein representation. Comput Biol Med 2023;152:106440. [PMID: 36543002 DOI: 10.1016/j.compbiomed.2022.106440] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 12/08/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022]

A Novel Capsule Network with Attention Routing to Identify Prokaryote Phosphorylation Sites. Biomolecules 2022;12:biom12121854. [PMID: 36551282 PMCID: PMC9775645 DOI: 10.3390/biom12121854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 12/07/2022] [Accepted: 12/09/2022] [Indexed: 12/14/2022] Open

Abstract

By denaturing proteins and promoting the formation of multiprotein complexes, protein phosphorylation has important effects on the activity of protein functional molecules and cell signaling. The regulation of protein phosphorylation allows microbes to respond rapidly and reversibly to specific environmental stimuli or niches, which is closely related to the molecular mechanisms of bacterial drug resistance. Accurate prediction of phosphorylation sites (p-site) of prokaryotes can contribute to addressing bacterial resistance and providing new perspectives for developing novel antibacterial drugs. Most existing studies focus on human phosphorylation sites, while tools targeting phosphorylation site identification of prokaryotic proteins are still relatively scarce. This study designs a capsule network-based prediction technique for p-site in prokaryotes. To address the poor scalability and unreliability of dynamic routing processes in the output space of capsule networks, a more reliable way is introduced to learn the consistency between capsules. We incorporate a self-attention mechanism into the routing algorithm to capture the global information of the capsule, reducing the computational effort while enriching the representation capability of the capsule. Aiming at the weak robustness of the model, EcapsP improves the prediction accuracy and stability by introducing shortcuts and unconditional reconfiguration. In addition, the study compares and analyzes the prediction performance based on word vectors, physicochemical properties, and mixing characteristics in predicting serine (Ser/S), threonine (Thr/T), and tyrosine (Tyr/Y) p-site. The comprehensive experimental results show that the accuracy of the developed technique is close to 70% for the identification of the three phosphorylation sites in prokaryotes. Importantly, in side-by-side comparisons with other state-of-the-art predictors, our method improves the Matthews correlation coefficient (MCC) by approximately 7%. The results demonstrate the superiority of EcapsP in terms of high performance and reliability.

Collapse

Ismi DP, Pulungan R, Afiahayati. Deep learning for protein secondary structure prediction: Pre and post-AlphaFold. Comput Struct Biotechnol J 2022;20:6271-6286. [PMID: 36420164 PMCID: PMC9678802 DOI: 10.1016/j.csbj.2022.11.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 11/05/2022] [Accepted: 11/05/2022] [Indexed: 11/13/2022] Open

Cao L, Zhang Q, Song H, Lin K, Pang E. DeepASmRNA: Reference-free prediction of alternative splicing events with a scalable and interpretable deep learning model. iScience 2022;25:105345. [PMID: 36325068 PMCID: PMC9619290 DOI: 10.1016/j.isci.2022.105345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 08/20/2022] [Accepted: 10/11/2022] [Indexed: 11/30/2022] Open

Li W, Yang L, Qiu Y, Yuan Y, Li X, Meng Z. FFP: joint Fast Fourier transform and fractal dimension in amino acid property-aware phylogenetic analysis. BMC Bioinformatics 2022;23:347. [PMID: 35986255 PMCID: PMC9392226 DOI: 10.1186/s12859-022-04889-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 08/11/2022] [Indexed: 11/10/2022] Open

Bhattacharya D, Kleeblatt DC, Statt A, Reinhart WF. Predicting aggregate morphology of sequence-defined macromolecules with recurrent neural networks. SOFT MATTER 2022;18:5037-5051. [PMID: 35748651 DOI: 10.1039/d2sm00452f] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Ma R, Li S, Li W, Yao L, Huang HD, Lee TY. KinasePhos 3.0: Redesign and Expansion of the Prediction on Kinase-specific Phosphorylation Sites. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022:S1672-0229(22)00081-X. [PMID: 35781048 PMCID: PMC10373160 DOI: 10.1016/j.gpb.2022.06.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Revised: 05/30/2022] [Accepted: 06/27/2022] [Indexed: 06/04/2023]

Feng G, Yao H, Li C, Liu R, Huang R, Fan X, Ge R, Miao Q. ME-ACP: Multi-view neural networks with ensemble model for identification of anticancer peptides. Comput Biol Med 2022;145:105459. [DOI: 10.1016/j.compbiomed.2022.105459] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Revised: 03/22/2022] [Accepted: 03/24/2022] [Indexed: 12/26/2022]

Predicting protein intrinsically disordered regions by applying natural language processing practices. Soft comput 2022. [DOI: 10.1007/s00500-022-07085-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Jiang M, Zhang R, Xia Y, Jia G, Yin Y, Wang P, Wu J, Ge R. i2APP: A Two-Step Machine Learning Framework For Antiparasitic Peptides Identification. Front Genet 2022;13:884589. [PMID: 35571057 PMCID: PMC9091563 DOI: 10.3389/fgene.2022.884589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2022] [Accepted: 04/11/2022] [Indexed: 11/18/2022] Open

Li W, Yang L, Meng Z, Qiu Y, Wang PSP, Li X. Phylogenetic Analysis: A Novel Method of Protein Sequence Similarity Analysis. INT J PATTERN RECOGN 2022. [DOI: 10.1142/s0218001422580071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Recognition of Protein Network for Bioinformatics Knowledge Analysis Using Support Vector Machine. BIOMED RESEARCH INTERNATIONAL 2022;2022:2273648. [PMID: 35502337 PMCID: PMC9056223 DOI: 10.1155/2022/2273648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 03/22/2022] [Accepted: 03/29/2022] [Indexed: 11/18/2022]

Villegas-Morcillo A, Gomez AM, Sanchez V. An analysis of protein language model embeddings for fold prediction. Brief Bioinform 2022;23:6571527. [PMID: 35443054 DOI: 10.1093/bib/bbac142] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 03/21/2022] [Accepted: 03/28/2022] [Indexed: 11/13/2022] Open

Ryu J, Komoto Y, Ohshiro T, Taniguchi M. Single-Molecule Classification of Aspartic Acid and Leucine by Molecular Recognition through Hydrogen Bonding and Time-Series Analysis. Chem Asian J 2022;17:e202200179. [PMID: 35445555 DOI: 10.1002/asia.202200179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 04/13/2022] [Indexed: 11/06/2022]

Ma Q, Zou K, Zhang Z, Yang F. GLTM: A Global-Local Attention LSTM Model to Locate Dimer Motif of Single-Pass Membrane Proteins. Front Genet 2022;13:854571. [PMID: 35368690 PMCID: PMC8965067 DOI: 10.3389/fgene.2022.854571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 02/14/2022] [Indexed: 11/13/2022] Open

Abstract

Single-pass membrane proteins, which constitute up to 50% of all transmembrane proteins, are typically active in significant conformational changes, such as a dimer or other oligomers, which is essential for understanding the function of transmembrane proteins. Finding the key motifs of oligomers through experimental observation is a routine method used in the field to infer the potential conformations of other members of the transmembrane protein family. However, approaches based on experimental observation need to consume a lot of time and manpower costs; moreover, they are hard to reveal the potential motifs. A proposed approach is to build an accurate and efficient transmembrane protein oligomer prediction model to screen the key motifs. In this paper, an attention-based Global-Local structure LSTM model named GLTM is proposed to predict dimers and screen potential dimer motifs. Different from traditional motifs screening based on highly conserved sequence search frame, a self-attention mechanism has been employed in GLTM to locate the highest dimerization score of subsequence fragments and has been proven to locate most known dimer motifs well. The proposed GLTM can reach 97.5% accuracy on the benchmark dataset collected from Membranome2.0. The three characteristics of GLTM can be summarized as follows: First, the original sequence fragment was converted to a set of subsequences which having the similar length of known motifs, and this additional step can greatly enhance the capability of capturing motif pattern; Second, to solve the problem of sample imbalance, a novel data enhancement approach combining improved one-hot encoding with random subsequence windows has been proposed to improve the generalization capability of GLTM; Third, position penalization has been taken into account, which makes a self-attention mechanism focused on special TM fragments. The experimental results in this paper fully demonstrated that the proposed GLTM has a broad application perspective on the location of potential oligomer motifs, and is helpful for preliminary and rapid research on the conformational change of mutants.

Collapse

Zhang Z, Wang L. Using Chou's 5-steps rule to identify N⁶-methyladenine sites by ensemble learning combined with multiple feature extraction methods. J Biomol Struct Dyn 2022;40:796-806. [PMID: 32948102 DOI: 10.1080/07391102.2020.1821778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Dinu A, Apetrei C. A Review of Sensors and Biosensors Modified with Conducting Polymers and Molecularly Imprinted Polymers Used in Electrochemical Detection of Amino Acids: Phenylalanine, Tyrosine, and Tryptophan. Int J Mol Sci 2022;23:1218. [PMID: 35163145 PMCID: PMC8835779 DOI: 10.3390/ijms23031218] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 01/18/2022] [Accepted: 01/20/2022] [Indexed: 02/07/2023] Open

Ko CW, Huh J, Park JW. Deep learning program to predict protein functions based on sequence information. MethodsX 2022;9:101622. [PMID: 35111575 PMCID: PMC8790617 DOI: 10.1016/j.mex.2022.101622] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Accepted: 01/11/2022] [Indexed: 01/11/2023] Open

Abstract

•

A new deep learning program to predict protein functions in silico.

•

Requirement of nothing more than the protein sequence information.

•

A sequence segmentation to improve the efficiency of prediction.

•

Prediction of the clinical impact of mutations or polymorphisms.

Deep learning technologies have been adopted to predict the functions of newly identified proteins in silico. However, most current models are not suitable for poorly characterized proteins because they require diverse information on target proteins. We designed a binary classification deep learning program requiring only sequence information. This program was named ‘FUTUSA’ (function teller using sequence alone). It applied sequence segmentation during the sequence feature extraction process, by a convolution neural network, to train the regional sequence patterns and their relationship. This segmentation process improved the predictive performance by 49% than the full-length process. Compared with a baseline method, our approach achieved higher performance in predicting oxidoreductase activity. In addition, FUTUSA also showed dramatic performance in predicting acetyltransferase and demethylase activities. Next, we tested the possibility that FUTUSA can predict the functional consequence of point mutation. After trained for monooxygenase activity, FUTUSA successfully predicted the impact of point mutations on phenylalanine hydroxylase, which is responsible for an inherited metabolic disease PKU. This deep-learning program can be used as the first-step tool for characterizing newly identified or poorly studied proteins.•

We proposed new deep learning program to predict protein functions in silico that requires nothing more than the protein sequence information.

•

Due to application of sequence segmentation, the efficiency of prediction is improved.

•

This method makes prediction of the clinical impact of mutations or polymorphisms possible.

Collapse

Ovek D, Abali Z, Zeylan ME, Keskin O, Gursoy A, Tuncbag N. Artificial intelligence based methods for hot spot prediction. Curr Opin Struct Biol 2021;72:209-218. [PMID: 34954608 DOI: 10.1016/j.sbi.2021.11.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 10/07/2021] [Accepted: 11/08/2021] [Indexed: 11/29/2022]

Agrawal S, Sisodia DS, Nagwani NK. Augmented sequence features and subcellular localization for functional characterization of unknown protein sequences. Med Biol Eng Comput 2021;59:2297-2310. [PMID: 34545514 DOI: 10.1007/s11517-021-02436-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Accepted: 08/29/2021] [Indexed: 11/24/2022]

Anteghini M, Martins dos Santos V, Saccenti E. In-Pero: Exploiting Deep Learning Embeddings of Protein Sequences to Predict the Localisation of Peroxisomal Proteins. Int J Mol Sci 2021;22:6409. [PMID: 34203866 PMCID: PMC8232616 DOI: 10.3390/ijms22126409] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 05/31/2021] [Accepted: 06/09/2021] [Indexed: 01/28/2023] Open

Alakus TB, Turkoglu I. A Novel Protein Mapping Method for Predicting the Protein Interactions in COVID-19 Disease by Deep Learning. Interdiscip Sci 2021;13:44-60. [PMID: 33433784 PMCID: PMC7801232 DOI: 10.1007/s12539-020-00405-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 11/23/2020] [Accepted: 11/28/2020] [Indexed: 12/11/2022]

Abstract

The new type of corona virus (SARS-COV-2) emerging in Wuhan, China has spread rapidly to the world and has become a pandemic. In addition to having a significant impact on daily life, it also shows its effect in different areas, including public health and economy. Currently, there is no vaccine or antiviral drug available to prevent the COVID-19 disease. Therefore, determination of protein interactions of new types of corona virus is vital in clinical studies, drug therapy, identification of preclinical compounds and protein functions. Protein–protein interactions are important to examine protein functions and pathways involved in various biological processes and to determine the cause and progression of diseases. Various high-throughput experimental methods have been used to identify protein–protein interactions in organisms, yet, there is still a huge gap in specifying all possible protein interactions in an organism. In addition, since the experimental methods used include cloning, labeling, affinity purification mass spectrometry, the processes take a long time. Determining these interactions with artificial intelligence-based methods rather than experimental approaches may help to identify protein functions faster. Thus, protein–protein interaction prediction using deep-learning algorithms has been employed in conjunction with experimental method to explore new protein interactions. However, to predict protein interactions with artificial intelligence techniques, protein sequences need to be mapped. There are various types and numbers of protein-mapping methods in the literature. In this study, we wanted to contribute to the literature by proposing a novel protein-mapping method based on the AVL tree. The proposed method was inspired by the fast search performance on the dictionary structure of AVL tree and was used to verify the protein interactions between SARS-COV-2 virus and human. First, protein sequences were mapped by both the proposed method and various protein-mapping methods. Then, the mapped protein sequences were normalized and classified by bidirectional recurrent neural networks. The performance of the proposed method was evaluated with accuracy, f1-score, precision, recall, and AUC scores. Our results indicated that our mapping method predicts the protein interactions between SARS-COV-2 virus proteins and human proteins at an accuracy of 97.76%, precision of 97.60%, recall of 98.33%, f1-score of 79.42%, and with AUC 89% in average.

Collapse

Lopez-Del Rio A, Martin M, Perera-Lluna A, Saidi R. Effect of sequence padding on the performance of deep learning models in archaeal protein functional prediction. Sci Rep 2020;10:14634. [PMID: 32884053 PMCID: PMC7471694 DOI: 10.1038/s41598-020-71450-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 08/06/2020] [Indexed: 11/08/2022] Open

Ge R, Feng G, Jing X, Zhang R, Wang P, Wu Q. EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides. Front Genet 2020;11:760. [PMID: 32903636 PMCID: PMC7438906 DOI: 10.3389/fgene.2020.00760] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Accepted: 06/26/2020] [Indexed: 12/13/2022] Open

Xie Z, Deng X, Shu K. Prediction of Protein-Protein Interaction Sites Using Convolutional Neural Network and Improved Data Sets. Int J Mol Sci 2020;21:E467. [PMID: 31940793 PMCID: PMC7013409 DOI: 10.3390/ijms21020467] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2019] [Revised: 12/23/2019] [Accepted: 01/08/2020] [Indexed: 12/20/2022] Open