1
|
Zhang F, Li J, Wen Z, Fang C. FusPB-ESM2: Fusion model of ProtBERT and ESM-2 for cell-penetrating peptide prediction. Comput Biol Chem 2024; 111:108098. [PMID: 38820799 DOI: 10.1016/j.compbiolchem.2024.108098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 04/18/2024] [Accepted: 05/16/2024] [Indexed: 06/02/2024]
Abstract
Cell-penetrating peptides have attracted much attention for their ability to break through cell membrane barriers, which can improve drug bioavailability, reduce side effects, and promote the development of gene therapy. Traditional wet-lab prediction methods are time-consuming and costly, and computational methods provide a short-time and low-cost alternative. Still, the accuracy and reliability need to be further improved. To solve this problem, this study proposes a feature fusion-based prediction model, where the protein pre-trained language models ProtBERT and ESM-2 are used as feature extractors, and the extracted features from both are fused to obtain a more comprehensive and effective feature representation, which is then predicted by linear mapping. Validated by many experiments on public datasets, the method has an AUC value as high as 0.983 and shows high accuracy and reliability in cell-penetrating peptide prediction.
Collapse
Affiliation(s)
- Fan Zhang
- Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Jinfeng Li
- Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Zhenguo Wen
- Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Chun Fang
- Beijing Institute of Petrochemical Technology, Beijing 102617, China.
| |
Collapse
|
2
|
Zou H. iHBPs-VWDC: variable-length window-based dynamic connectivity approach for identifying hormone-binding proteins. J Biomol Struct Dyn 2023:1-10. [PMID: 37978902 DOI: 10.1080/07391102.2023.2283150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 11/08/2023] [Indexed: 11/19/2023]
Abstract
Hormone-binding proteins (HBPs) are soluble carrier proteins that play a vital role in the growth and development of living organisms. Identifying HBPs accurately is crucial for understanding their functions. However, traditional wet lab experimental methods are labor intensive and cost ineffective. Therefore, there is a need for computational methods to efficiently identify HBPs. In this study, a machine learning method based on support vector machine (SVM) was proposed for the accurate and efficient identification of HBPs. The encoding of protein sequences involved using fifty different physicochemical (PC) properties. A variable-length window-based dynamic connectivity method was applied to capture the connection information between two different PC properties through two distinct strategies. The canonical correlation analysis algorithm was then used to fuse features obtained from these approaches. Feature selection was performed using the F-score approach to choose the most discriminative features. Finally, these selected features were fed into the SVM to discriminate between HBPs and non-HBPs. The proposed method achieved high classification accuracies of 99.19%, 96.77%, and 94.57% on the main dataset and two independent datasets, respectively, as demonstrated in the jackknife test. Comparative results showed that our proposed method outperforms existing approaches on the same datasets, indicating its potential as a useful tool for identifying HBPs. The Matlab codes and datasets used in the current study are freely available at https://figshare.com/articles/online_resource/iHBPs-VWDC/23559834.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Hongliang Zou
- School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China
- Jiangxi Engineering Research Center of Unattended Perception System and Artificial Intelligence Technology, Jiangxi Science and Technology Normal University, Nanchang, China
| |
Collapse
|
3
|
Lim C, Lee S, Shin Y, Cho S, Park C, Shin Y, Song EC, Kim WK, Ham C, Kim SB, Kwon YS, Oh KT. Development and application of novel peptide-formulated nanoparticles for treatment of atopic dermatitis. J Mater Chem B 2023; 11:10131-10146. [PMID: 37830254 DOI: 10.1039/d3tb01202f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2023]
Abstract
Atopic dermatitis is a chronic inflammatory skin condition that is characterized by skin inflammation, itching, and redness. Although various treatments can alleviate symptoms, they often come with side effects, highlighting the need for new treatments. Here, we discovered a new peptide-based therapy using the intra-dermal delivery technology (IDDT) platform developed by Remedi Co., Ltd (REMEDI). The platform screens and identifies peptides derived from proteins in the human body that possess cell-penetrating peptide (CPP) properties. We screened over 1000-peptides and identified several derived from the Speckled protein (SP) family that have excellent CPP properties and have anti-inflammatory effects. We assessed these peptides for their potential as a treatment for atopic dermatitis. Among them, the RMSP1 peptide showed the most potent anti-inflammatory effects by inhibiting the nuclear factor kappa-light-chain-enhancer of activated B cells (NF-κB) and signal transducer and activator of transcription 3 (STAT3) signaling pathways while possessing CPP properties. To further improve efficacy and stability, we developed a palmitoylated version called Pal-RMSP1. Formulation studies using liposomes (Pal-RMSP1 LP) and micelles (Pal-RMSP1 DP) demonstrated improved anti-inflammatory effects in vitro and enhanced therapeutic effects in vivo. Our study indicates that nano-formulated Pal-RMSP1 could have the potential to become a new treatment option for atopic dermatitis.
Collapse
Affiliation(s)
- Chaemin Lim
- College of Pharmacy, Chung-Ang University, Seoul 06974, Republic of Korea.
- College of Pharmacy, CHA University, 335 Pangyo-ro, Bundang-gu, Seongnam-si, 13488 Gyeonggi-do, Republic of Korea
| | - Subin Lee
- College of Pharmacy, Chung-Ang University, Seoul 06974, Republic of Korea.
- Department of Global Innovative Drugs, The Graduate School of Chung-Ang University, Seoul 06974, Republic of Korea
| | - Yuseon Shin
- College of Pharmacy, Chung-Ang University, Seoul 06974, Republic of Korea.
- Department of Global Innovative Drugs, The Graduate School of Chung-Ang University, Seoul 06974, Republic of Korea
| | - Seongmin Cho
- Remedi Co., Ltd. Research Center, Songdo 21990, Republic of Korea
| | - Chanho Park
- Remedi Co., Ltd. Research Center, Songdo 21990, Republic of Korea
| | - Yungyeong Shin
- Remedi Co., Ltd. Research Center, Songdo 21990, Republic of Korea
| | - Ee Chan Song
- Remedi Co., Ltd. Research Center, Songdo 21990, Republic of Korea
| | - Wan Ki Kim
- Remedi Co., Ltd. Research Center, Songdo 21990, Republic of Korea
| | - Cheolmin Ham
- Rare Isotope Science Project, Institute for Basic Science, Daejeon 34000, Republic of Korea
| | - Sang Bum Kim
- College of Pharmacy, Sahmyook University, Seoul 01795, Republic of Korea
| | - Yong-Su Kwon
- Department of Ophthalmology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Kyung Taek Oh
- College of Pharmacy, Chung-Ang University, Seoul 06974, Republic of Korea.
- Department of Global Innovative Drugs, The Graduate School of Chung-Ang University, Seoul 06974, Republic of Korea
| |
Collapse
|
4
|
Guerrero-Vázquez K, Del Rio G, Brizuela CA. Cell-penetrating peptides predictors: A comparative analysis of methods and datasets. Mol Inform 2023; 42:e202300104. [PMID: 37672879 DOI: 10.1002/minf.202300104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Revised: 07/24/2023] [Accepted: 08/18/2023] [Indexed: 09/08/2023]
Abstract
Cell-Penetrating Peptides (CPP) are emerging as an alternative to small-molecule drugs to expand the range of biomolecules that can be targeted for therapeutic purposes. Due to the importance of identifying and designing new CPP, a great variety of predictors have been developed to achieve these goals. To establish a ranking for these predictors, a couple of recent studies compared their performances on specific datasets, yet their conclusions cannot determine if the ranking obtained is due to the model, the set of descriptors or the datasets used to test the predictors. We present a systematic study of the influence of the peptide sequence's similarity of the datasets on the predictors' performance. The analysis reveals that the datasets used for training have a stronger influence on the predictors performance than the model or descriptors employed. We show that datasets with low sequence similarity between the positive and negative examples can be easily separated, and the tested classifiers showed good performance on them. On the other hand, a dataset with high sequence similarity between CPP and non-CPP will be a hard dataset, and it should be the one to be used for assessing the performance of new predictors.
Collapse
Affiliation(s)
- Karen Guerrero-Vázquez
- Department of Computer Science, CICESE Research Center, Ensenada, 22860, Mexico
- Current address: School of Mathematics & Statistical Sciences, University of Galway, Galway, H91 TK33, Ireland
| | - Gabriel Del Rio
- Department of Biochemistry and Structural Biology, Instituto de Fisiologia Celular, UNAM, Mexico City, 04510, Mexico
| | - Carlos A Brizuela
- Department of Computer Science, CICESE Research Center, Ensenada, 22860, Mexico
| |
Collapse
|
5
|
Hazra P, Vadnere S, Mishra S, Halder S, Mandal S, Ghosh P. Review on Uric Acid Recognition by MOFs with a Future in Machine Learning. ACS APPLIED MATERIALS & INTERFACES 2023. [PMID: 37905918 DOI: 10.1021/acsami.3c11210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Uric acid (UA) is produced from purine metabolism and serves as a prevalent biomarker for multiple diseases including cancer. Hyperuricemia or hypouricemia can cause multiple dysfunctions throughout the biological processes. Consequently, there is a pressing need for monitoring UA concentration in body fluid. While clinical methods are known, the availability of a point-of-care testing (PoCT) kit remains conspicuously absent. In the case of electrochemical recognition of UA, the oxidation potential of ascorbic acid closely aligns with that of UA and thus it hinders the detection process, which eventually may result in false positive signals. Several chemosensors are known in the field of supramolecular chemistry, and metal-organic frameworks (MOFs) are one of the best-performing contenders due to their robustness, stability, and versatile structures. In this review, we tried to unbox the up-to-date development of UA sensing by MOFs. We delve into the state of UA recognition by MOFs, exploring both electrochemical and fluorometric pathways and drawing comparisons with structurally similar probes like covalent organic frameworks (COFs) to understand/establish the advantages of MOFs specifically in UA sensing. In the absence of a PoCT kit, we have provided the conceptual outlook for designing a PoCT device termed a "Urimeter" via electrochemical operation. For the first time, we have proposed different methods of how UA sensing can be tied up with artificial intelligence and machine learning (AI-ML).
Collapse
Affiliation(s)
- Poimanti Hazra
- School of Electronics Engineering (SENSE), Vellore Institute of Technology, Chennai Campus, Chennai 600127, Tamil Nadu, India
| | - Srushti Vadnere
- School of Electronics Engineering (SENSE), Vellore Institute of Technology, Chennai Campus, Chennai 600127, Tamil Nadu, India
| | - Saswat Mishra
- School of Electronics Engineering (SENSE), Vellore Institute of Technology, Chennai Campus, Chennai 600127, Tamil Nadu, India
| | - Shibashis Halder
- Department of Chemistry, Tej Narayan Banaili College, Bhagalpur 812007, Bihar, India
| | - Shaswati Mandal
- Schulich Faculty of Chemistry, Technion-Israel Institute of Technology, Haifa 3200003, Israel
| | - Pritam Ghosh
- Chemistry Division, School of Advanced Sciences (SAS), Vellore Institute of Technology, Chennai Campus, Chennai 600127, Tamil Nadu, India
| |
Collapse
|
6
|
In Silico Screening and Optimization of Cell-Penetrating Peptides Using Deep Learning Methods. Biomolecules 2023; 13:biom13030522. [PMID: 36979457 PMCID: PMC10046020 DOI: 10.3390/biom13030522] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 03/03/2023] [Accepted: 03/09/2023] [Indexed: 03/16/2023] Open
Abstract
Cell-penetrating peptides (CPPs) have great potential to deliver bioactive agents into cells. Although there have been many recent advances in CPP-related research, it is still important to develop more efficient CPPs. The development of CPPs by in silico methods is a very useful addition to experimental methods, but in many cases it can lead to a large number of false-positive results. In this study, we developed a deep-learning-based CPP prediction method, AiCPP, to develop novel CPPs. AiCPP uses a large number of peptide sequences derived from human-reference proteins as a negative set to reduce false-positive predictions and adopts a method to learn small-length peptide sequence motifs that may have CPP tendencies. Using AiCPP, we found that short peptide sequences derived from amyloid precursor proteins are efficient new CPPs, and experimentally confirmed that these CPP sequences can be further optimized.
Collapse
|
7
|
Zhang X, Wei L, Ye X, Zhang K, Teng S, Li Z, Jin J, Kim MJ, Sakurai T, Cui L, Manavalan B, Wei L. SiameseCPP: a sequence-based Siamese network to predict cell-penetrating peptides by contrastive learning. Brief Bioinform 2023; 24:6958502. [PMID: 36562719 DOI: 10.1093/bib/bbac545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 11/07/2022] [Accepted: 11/10/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Cell-penetrating peptides (CPPs) have received considerable attention as a means of transporting pharmacologically active molecules into living cells without damaging the cell membrane, and thus hold great promise as future therapeutics. Recently, several machine learning-based algorithms have been proposed for predicting CPPs. However, most existing predictive methods do not consider the agreement (disagreement) between similar (dissimilar) CPPs and depend heavily on expert knowledge-based handcrafted features. RESULTS In this study, we present SiameseCPP, a novel deep learning framework for automated CPPs prediction. SiameseCPP learns discriminative representations of CPPs based on a well-pretrained model and a Siamese neural network consisting of a transformer and gated recurrent units. Contrastive learning is used for the first time to build a CPP predictive model. Comprehensive experiments demonstrate that our proposed SiameseCPP is superior to existing baseline models for predicting CPPs. Moreover, SiameseCPP also achieves good performance on other functional peptide datasets, exhibiting satisfactory generalization ability.
Collapse
Affiliation(s)
- Xin Zhang
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| | - Lesong Wei
- Department of Computer Science, University of Tsukuba, Tsukuba 3058577, Japan
| | - Xiucai Ye
- Department of Computer Science, University of Tsukuba, Tsukuba 3058577, Japan
| | - Kai Zhang
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| | - Saisai Teng
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| | - Zhongshen Li
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| | - Junru Jin
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| | - Min Jae Kim
- Department of integrative Biotechnology, College of Biotechnology & Bioengineering, Sungkyunkwan University, Seobu-ro, Jangan-gu, Suwon-si, Gyeonggi-do 16419, Republic of Korea
| | - Tetsuya Sakurai
- Department of Computer Science, University of Tsukuba, Tsukuba 3058577, Japan
| | - Lizhen Cui
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| | - Balachandran Manavalan
- Department of integrative Biotechnology, College of Biotechnology & Bioengineering, Sungkyunkwan University, Seobu-ro, Jangan-gu, Suwon-si, Gyeonggi-do 16419, Republic of Korea
| | - Leyi Wei
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| |
Collapse
|
8
|
Antimicrobial peptides with cell-penetrating activity as prophylactic and treatment drugs. Biosci Rep 2022; 42:231731. [PMID: 36052730 PMCID: PMC9508529 DOI: 10.1042/bsr20221789] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 08/31/2022] [Accepted: 09/01/2022] [Indexed: 01/18/2023] Open
Abstract
Health is fundamental for the development of individuals and evolution of species. In that sense, for human societies is relevant to understand how the human body has developed molecular strategies to maintain health. In the present review, we summarize diverse evidence that support the role of peptides in this endeavor. Of particular interest to the present review are antimicrobial peptides (AMP) and cell-penetrating peptides (CPP). Different experimental evidence indicates that AMP/CPP are able to regulate autophagy, which in turn regulates the immune system response. AMP also assists in the establishment of the microbiota, which in turn is critical for different behavioral and health aspects of humans. Thus, AMP and CPP are multifunctional peptides that regulate two aspects of our bodies that are fundamental to our health: autophagy and microbiota. While it is now clear the multifunctional nature of these peptides, we are still in the early stages of the development of computational strategies aimed to assist experimentalists in identifying selective multifunctional AMP/CPP to control nonhealthy conditions. For instance, both AMP and CPP are computationally characterized as amphipatic and cationic, yet none of these features are relevant to differentiate these peptides from non-AMP or non-CPP. The present review aims to highlight current knowledge that may facilitate the development of AMP’s design tools for preventing or treating illness.
Collapse
|
9
|
Arif M, Kabir M, Ahmed S, Khan A, Ge F, Khelifi A, Yu DJ. DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2749-2759. [PMID: 34347603 DOI: 10.1109/tcbb.2021.3102133] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Cell-penetrating peptides (CPPs) are special peptides capable of carrying a variety of bioactive molecules, such as genetic materials, short interfering RNAs and nanoparticles, into cells. Recently, research on CPP has gained substantial interest from researchers, and the biological mechanisms of CPPS have been assessed in the context of safe drug delivery agents and therapeutic applications. Correct identification and synthesis of CPPs using traditional biochemical methods is an extremely slow, expensive and laborious task particularly due to the large volume of unannotated peptide sequences accumulating in the World Bank repository. Hence, a powerful bioinformatics predictor that rapidly identifies CPPs with a high recognition rate is urgently needed. To date, numerous computational methods have been developed for CPP prediction. However, the available machine-learning (ML) tools are unable to distinguish both the CPPs and their uptake efficiencies. This study aimed to develop a two-layer deep learning framework named DeepCPPred to identify both CPPs in the first phase and peptide uptake efficiency in the second phase. The DeepCPPred predictor first uses four types of descriptors that cover evolutionary, energy estimation, reduced sequence and amino-acid contact information. Then, the extracted features are optimized through the elastic net algorithm and fed into a cascade deep forest algorithm to build the final CPP model. The proposed method achieved 99.45 percent overall accuracy with the CPP924 benchmark dataset in the first layer and 95.43 percent accuracy in the second layer with the CPPSite3 dataset using a 5-fold cross-validation test. Thus, our proposed bioinformatics tool surpassed all the existing state-of-the-art sequence-based CPP approaches.
Collapse
|
10
|
Zhang X, Yang Y, Shen YW, Zhang KR, Jiang ZK, Ma LT, Ding C, Wang BY, Meng Y, Liu H. Diagnostic accuracy and potential covariates of artificial intelligence for diagnosing orthopedic fractures: a systematic literature review and meta-analysis. Eur Radiol 2022; 32:7196-7216. [PMID: 35754091 DOI: 10.1007/s00330-022-08956-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 05/07/2022] [Accepted: 06/08/2022] [Indexed: 02/05/2023]
Abstract
OBJECTIVES To systematically quantify the diagnostic accuracy and identify potential covariates affecting the performance of artificial intelligence (AI) in diagnosing orthopedic fractures. METHODS PubMed, Embase, Web of Science, and Cochrane Library were systematically searched for studies on AI applications in diagnosing orthopedic fractures from inception to September 29, 2021. Pooled sensitivity and specificity and the area under the receiver operating characteristic curves (AUC) were obtained. This study was registered in the PROSPERO database prior to initiation (CRD 42021254618). RESULTS Thirty-nine were eligible for quantitative analysis. The overall pooled AUC, sensitivity, and specificity were 0.96 (95% CI 0.94-0.98), 90% (95% CI 87-92%), and 92% (95% CI 90-94%), respectively. In subgroup analyses, multicenter designed studies yielded higher sensitivity (92% vs. 88%) and specificity (94% vs. 91%) than single-center studies. AI demonstrated higher sensitivity with transfer learning (with vs. without: 92% vs. 87%) or data augmentation (with vs. without: 92% vs. 87%), compared to those without. Utilizing plain X-rays as input images for AI achieved results comparable to CT (AUC 0.96 vs. 0.96). Moreover, AI achieved comparable results to humans (AUC 0.97 vs. 0.97) and better results than non-expert human readers (AUC 0.98 vs. 0.96; sensitivity 95% vs. 88%). CONCLUSIONS AI demonstrated high accuracy in diagnosing orthopedic fractures from medical images. Larger-scale studies with higher design quality are needed to validate our findings. KEY POINTS • Multicenter study design, application of transfer learning, and data augmentation are closely related to improving the performance of artificial intelligence models in diagnosing orthopedic fractures. • Utilizing plain X-rays as input images for AI to diagnose fractures achieved results comparable to CT (AUC 0.96 vs. 0.96). • AI achieved comparable results to humans (AUC 0.97 vs. 0.97) but was superior to non-expert human readers (AUC 0.98 vs. 0.96, sensitivity 95% vs. 88%) in diagnosing fractures.
Collapse
Affiliation(s)
- Xiang Zhang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Yi Yang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Yi-Wei Shen
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Ke-Rui Zhang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Ze-Kun Jiang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610000, China
| | - Li-Tai Ma
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Chen Ding
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Bei-Yu Wang
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Yang Meng
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China
| | - Hao Liu
- Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, No. 37 Guo Xue Rd, Chengdu, 610041, China.
| |
Collapse
|
11
|
A deep learning model (FociRad) for automated detection of γ-H2AX foci and radiation dose estimation. Sci Rep 2022; 12:5527. [PMID: 35365702 PMCID: PMC8975967 DOI: 10.1038/s41598-022-09180-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 03/18/2022] [Indexed: 11/08/2022] Open
Abstract
DNA double-strand breaks (DSBs) are the most lethal form of damage to cells from irradiation. γ-H2AX (phosphorylated form of H2AX histone variant) has become one of the most reliable and sensitive biomarkers of DNA DSBs. However, the γ-H2AX foci assay still has limitations in the time consumed for manual scoring and possible variability between scorers. This study proposed a novel automated foci scoring method using a deep convolutional neural network based on a You-Only-Look-Once (YOLO) algorithm to quantify γ-H2AX foci in peripheral blood samples. FociRad, a two-stage deep learning approach, consisted of mononuclear cell (MNC) and γ-H2AX foci detections. Whole blood samples were irradiated with X-rays from a 6 MV linear accelerator at 1, 2, 4 or 6 Gy. Images were captured using confocal microscopy. Then, dose-response calibration curves were established and implemented with unseen dataset. The results of the FociRad model were comparable with manual scoring. MNC detection yielded 96.6% accuracy, 96.7% sensitivity and 96.5% specificity. γ-H2AX foci detection showed very good F1 scores (> 0.9). Implementation of calibration curve in the range of 0-4 Gy gave mean absolute difference of estimated doses less than 1 Gy compared to actual doses. In addition, the evaluation times of FociRad were very short (< 0.5 min per 100 images), while the time for manual scoring increased with the number of foci. In conclusion, FociRad was the first automated foci scoring method to use a YOLO algorithm with high detection performance and fast evaluation time, which opens the door for large-scale applications in radiation triage.
Collapse
|
12
|
Hooshmand SE, Sabet MJ, Hasanzadeh A, Mousavi SMK, Moghadam NH, Hooshmand SA, Rabiee N, Liu Y, Hamblin MR, Karimi M. Histidine‐enhanced gene delivery systems: The state of the art. J Gene Med 2022; 24:e3415. [DOI: 10.1002/jgm.3415] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 01/26/2022] [Accepted: 01/29/2022] [Indexed: 11/08/2022] Open
Affiliation(s)
- Seyyed Emad Hooshmand
- Cellular and Molecular Research Center Iran University of Medical Sciences Tehran Iran
- Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine Iran University of Medical Sciences Tehran Iran
| | - Makkieh Jahanpeimay Sabet
- Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine Iran University of Medical Sciences Tehran Iran
| | - Akbar Hasanzadeh
- Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine Iran University of Medical Sciences Tehran Iran
| | - Seyede Mahtab Kamrani Mousavi
- Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine Iran University of Medical Sciences Tehran Iran
| | - Niloofar Haeri Moghadam
- Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine Iran University of Medical Sciences Tehran Iran
| | - Seyed Aghil Hooshmand
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics University of Tehran Tehran Iran
| | - Navid Rabiee
- Department of Physics Sharif University of Technology Tehran Iran
- School of Engineering Macquarie University Sydney New South Wales Australia
| | - Yong Liu
- Institute of Functional Nano & Soft Materials (FUNSOM) Soochow University Suzhou Jiangsu China
| | - Michael R. Hamblin
- Laser Research Centre, Faculty of Health Science University of Johannesburg South Africa
| | - Mahdi Karimi
- Cellular and Molecular Research Center Iran University of Medical Sciences Tehran Iran
- Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine Iran University of Medical Sciences Tehran Iran
- Oncopathology Research Center Iran University of Medical Sciences Tehran Iran
- Research Center for Science and Technology in Medicine Tehran University of Medical Sciences Tehran Iran
| |
Collapse
|
13
|
Wan H, Zhang J, Ding Y, Wang H, Tian G. Immunoglobulin Classification Based on FC* and GC* Features. Front Genet 2022; 12:827161. [PMID: 35140745 PMCID: PMC8819591 DOI: 10.3389/fgene.2021.827161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 12/22/2021] [Indexed: 11/13/2022] Open
Abstract
Immunoglobulins have a pivotal role in disease regulation. Therefore, it is vital to accurately identify immunoglobulins to develop new drugs and research related diseases. Compared with utilizing high-dimension features to identify immunoglobulins, this research aimed to examine a method to classify immunoglobulins and non-immunoglobulins using two features, FC* and GC*. Classification of 228 samples (109 immunoglobulin samples and 119 non-immunoglobulin samples) revealed that the overall accuracy was 80.7% in 10-fold cross-validation using the J48 classifier implemented in Weka software. The FC* feature identified in this study was found in the immunoglobulin subtype domain, which demonstrated that this extracted feature could represent functional and structural properties of immunoglobulins for forecasting.
Collapse
Affiliation(s)
- Hao Wan
- Institute of Advanced Cross-field Science, College of Life Science, Qingdao University, Qingdao, China
| | - Jina Zhang
- Geneis (Beijing) Co., Ltd., Beijing, China
| | - Yijie Ding
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| | - Hetian Wang
- Beidahuang Industry Group General Hospital, Harbin, China
- *Correspondence: Hetian Wang, ; Geng Tian,
| | - Geng Tian
- Geneis (Beijing) Co., Ltd., Beijing, China
- *Correspondence: Hetian Wang, ; Geng Tian,
| |
Collapse
|
14
|
López-Vidal EM, Schissel CK, Mohapatra S, Bellovoda K, Wu CL, Wood JA, Malmberg AB, Loas A, Gómez-Bombarelli R, Pentelute BL. Deep Learning Enables Discovery of a Short Nuclear Targeting Peptide for Efficient Delivery of Antisense Oligomers. JACS AU 2021; 1:2009-2020. [PMID: 34841414 PMCID: PMC8611673 DOI: 10.1021/jacsau.1c00327] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Indexed: 06/01/2023]
Abstract
Therapeutic macromolecules such as proteins and oligonucleotides can be highly efficacious but are often limited to extracellular targets due to the cell's impermeable membrane. Cell-penetrating peptides (CPPs) are able to deliver such macromolecules into cells, but limited structure-activity relationships and inconsistent literature reports make it difficult to design effective CPPs for a given cargo. For example, polyarginine motifs are common in CPPs, promoting cell uptake at the expense of systemic toxicity. Machine learning may be able to address this challenge by bridging gaps between experimental data in order to discern sequence-activity relationships that evade our intuition. Our earlier data set and deep learning model led to the design of miniproteins (>40 amino acids) for antisense delivery. Here, we leveraged and expanded our model with data augmentation in the short CPP sequence space of the data set to extrapolate and discover short, low-arginine-content CPPs that would be easier to synthesize and amenable to rapid conjugation to desired cargo, and with minimal in vivo toxicity. The lead predicted peptide, termed P6, is as active as a polyarginine CPP for the delivery of an antisense oligomer, while having only one arginine side chain and 18 total residues. We determined the pentalysine motif and the C-terminal cysteine of P6 to be the main drivers of activity. The antisense conjugate was able to enhance corrective splicing in an animal model to produce functional eGFP in heart tissue in vivo while remaining nontoxic up to a dose of 60 mg/kg. In addition, P6 was able to deliver an enzyme to the cytosol of cells. Our findings suggest that, given a data set of long CPPs, we can discover by extrapolation short, active sequences that deliver antisense oligomers.
Collapse
Affiliation(s)
- Eva M. López-Vidal
- Department
of Chemistry, Massachusetts Institute of
Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Carly K. Schissel
- Department
of Chemistry, Massachusetts Institute of
Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Somesh Mohapatra
- Department
of Materials Science and Engineering, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Kamela Bellovoda
- Sarepta
Therapeutics, 215 First Street, Cambridge, Massachusetts 02142, United States
| | - Chia-Ling Wu
- Sarepta
Therapeutics, 215 First Street, Cambridge, Massachusetts 02142, United States
| | - Jenna A. Wood
- Sarepta
Therapeutics, 215 First Street, Cambridge, Massachusetts 02142, United States
| | - Annika B. Malmberg
- Sarepta
Therapeutics, 215 First Street, Cambridge, Massachusetts 02142, United States
| | - Andrei Loas
- Department
of Chemistry, Massachusetts Institute of
Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Rafael Gómez-Bombarelli
- Department
of Materials Science and Engineering, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Bradley L. Pentelute
- Department
of Chemistry, Massachusetts Institute of
Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- The
Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, Massachusetts 02142, United States
- Center
for Environmental Health Sciences, Massachusetts
Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- Broad Institute
of MIT and Harvard, 415
Main Street, Cambridge, Massachusetts 02142, United States
| |
Collapse
|
15
|
Zou H, Yang F, Yin Z. Identifying N7-methylguanosine sites by integrating multiple features. Biopolymers 2021; 113:e23480. [PMID: 34709657 DOI: 10.1002/bip.23480] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 10/12/2021] [Accepted: 10/14/2021] [Indexed: 11/10/2022]
Abstract
Recent studies reported that N7-methylguanosine (m7G) plays a vital role in gene expression regulation. As a consequence, determining the distribution of m7G is a crucial step towards further understanding its biological functions. Although biological experimental approaches are capable of accurately locating m7G sites, they are labor-intensive, costly, and time-consuming. Therefore, it is necessary to develop more effective and robust computational methods to replace, or at least complement current experimental methods. In this study, we developed a novel sequence-based computational tool to identify RNA m7G sites. In this model, 22 kinds of dinucleotide physicochemical (PC) properties were employed to encode the RNA sequence. Three types of descriptors, including auto-covariance, cross-covariance, and discrete wavelet transform were adopted to extract effective features from the PC matrix. The least absolute shrinkage and selection operator (LASSO) algorithm was utilized to reduce the influence of irrelevant or redundant features. Finally, these selected features were fed into a support vector machine (SVM) for distinguishing m7G from non-m7G sites. The proposed method significantly outperforms existing predictors across all evaluation metrics. It indicates that the approach is effective in identifying RNA m7G sites.
Collapse
Affiliation(s)
- Hongliang Zou
- School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China
| | - Fan Yang
- School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China
| | - Zhijian Yin
- School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China
| |
Collapse
|
16
|
Schissel CK, Mohapatra S, Wolfe JM, Fadzen CM, Bellovoda K, Wu CL, Wood JA, Malmberg AB, Loas A, Gómez-Bombarelli R, Pentelute BL. Deep learning to design nuclear-targeting abiotic miniproteins. Nat Chem 2021; 13:992-1000. [PMID: 34373596 PMCID: PMC8819921 DOI: 10.1038/s41557-021-00766-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Accepted: 07/05/2021] [Indexed: 02/08/2023]
Abstract
There are more amino acid permutations within a 40-residue sequence than atoms on Earth. This vast chemical search space hinders the use of human learning to design functional polymers. Here we show how machine learning enables the de novo design of abiotic nuclear-targeting miniproteins to traffic antisense oligomers to the nucleus of cells. We combined high-throughput experimentation with a directed evolution-inspired deep-learning approach in which the molecular structures of natural and unnatural residues are represented as topological fingerprints. The model is able to predict activities beyond the training dataset, and simultaneously deciphers and visualizes sequence-activity predictions. The predicted miniproteins, termed 'Mach', reach an average mass of 10 kDa, are more effective than any previously known variant in cells and can also deliver proteins into the cytosol. The Mach miniproteins are non-toxic and efficiently deliver antisense cargo in mice. These results demonstrate that deep learning can decipher design principles to generate highly active biomolecules that are unlikely to be discovered by empirical approaches.
Collapse
Affiliation(s)
- Carly K. Schissel
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Somesh Mohapatra
- Massachusetts Institute of Technology, Department of Materials Science and Engineering, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Justin M. Wolfe
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Colin M. Fadzen
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Kamela Bellovoda
- Sarepta Therapeutics, 215 First Street, Cambridge, MA 02142, USA
| | - Chia-Ling Wu
- Sarepta Therapeutics, 215 First Street, Cambridge, MA 02142, USA
| | - Jenna A. Wood
- Sarepta Therapeutics, 215 First Street, Cambridge, MA 02142, USA
| | | | - Andrei Loas
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Rafael Gómez-Bombarelli
- Massachusetts Institute of Technology, Department of Materials Science and Engineering, 77 Massachusetts Avenue, Cambridge, MA 02139, USA,Correspondence to: ,
| | - Bradley L. Pentelute
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA,The Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02142, USA,Center for Environmental Health Sciences, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA,Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA,Correspondence to: ,
| |
Collapse
|
17
|
Xue Y, Ye X, Wei L, Zhang X, Sakurai T, Wei L. Better Performance with Transformer: CPPFormer in precise prediction of cell-Penetrating Peptides. Curr Med Chem 2021; 29:881-893. [PMID: 34544332 DOI: 10.2174/0929867328666210920103140] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 07/28/2021] [Accepted: 08/07/2021] [Indexed: 11/22/2022]
Abstract
With its superior performance, the Transformer model, which is based on the 'Encoder-Decoder' paradigm, has become the mainstream in natural language processing. On the other hand, bioinformatics has embraced machine learning and made great progress in drug design and protein property prediction. Cell-penetrating peptides (CPPs) are one kind of permeable protein that is convenient as a kind of 'postman' in drug penetration tasks. However, a small number of CPPs have been discovered by research, let alone practical applications in drug permeability. Therefore, correctly identifying the CPPs has opened up a new way to take macromolecules into cells without other potentially harmful materials in the drug. Most of the previous work only uses trivial machine learning techniques and hand-crafted features to construct a simple classifier. In CPPFormer, we learn from the idea of implementing the attention structure of Transformer, rebuilding the network based on the characteristics of CPPs according to its short length, and using an automatic feature extractor with a few manual engineered features to co-direct the predicted results. Compared to all previous methods and other classic text classification models, the empirical result has shown that our proposed deep model-based method has achieved the best performance of 92.16% accuracy in the CPP924 dataset and has passed various index tests.
Collapse
Affiliation(s)
- Yuyang Xue
- Department of Computer Science, University of Tsukuba, Tsukuba. Japan
| | - Xiucai Ye
- Department of Computer Science, University of Tsukuba, Tsukuba. Japan
| | - Lesong Wei
- Department of Computer Science, University of Tsukuba, Tsukuba. Japan
| | - Xin Zhang
- School of Software, Shandong University, Jinan. China
| | - Tetsuya Sakurai
- Department of Computer Science, University of Tsukuba, Tsukuba. Japan
| | - Leyi Wei
- School of Software, Shandong University, Jinan. China
| |
Collapse
|
18
|
Su R, Hu J, Zou Q, Manavalan B, Wei L. Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools. Brief Bioinform 2021; 21:408-420. [PMID: 30649170 DOI: 10.1093/bib/bby124] [Citation(s) in RCA: 103] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Revised: 11/30/2018] [Accepted: 11/30/2018] [Indexed: 12/16/2022] Open
Abstract
Cell-penetrating peptides (CPPs) facilitate the delivery of therapeutically relevant molecules, including DNA, proteins and oligonucleotides, into cells both in vitro and in vivo. This unique ability explores the possibility of CPPs as therapeutic delivery and its potential applications in clinical therapy. Over the last few decades, a number of machine learning (ML)-based prediction tools have been developed, and some of them are freely available as web portals. However, the predictions produced by various tools are difficult to quantify and compare. In particular, there is no systematic comparison of the web-based prediction tools in performance, especially in practical applications. In this work, we provide a comprehensive review on the biological importance of CPPs, CPP database and existing ML-based methods for CPP prediction. To evaluate current prediction tools, we conducted a comparative study and analyzed a total of 12 models from 6 publicly available CPP prediction tools on 2 benchmark validation sets of CPPs and non-CPPs. Our benchmarking results demonstrated that a model from the KELM-CPPpred, namely KELM-hybrid-AAC, showed a significant improvement in overall performance, when compared to the other 11 prediction models. Moreover, through a length-dependency analysis, we find that existing prediction tools tend to more accurately predict CPPs and non-CPPs with the length of 20-25 residues long than peptides in other length ranges.
Collapse
Affiliation(s)
- Ran Su
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Jie Hu
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | | | - Leyi Wei
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| |
Collapse
|
19
|
Zhou J, Li Y, Huang W, Shi W, Qian H. Source and exploration of the peptides used to construct peptide-drug conjugates. Eur J Med Chem 2021; 224:113712. [PMID: 34303870 DOI: 10.1016/j.ejmech.2021.113712] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 07/12/2021] [Accepted: 07/17/2021] [Indexed: 12/16/2022]
Abstract
Peptide-drug conjugates (PDCs) are a class of novel molecules widely designed and synthesized for delivering payload drugs. The peptide part plays a vital role in the whole molecule, because they determine the ability of the molecules to penetrate the membrane and target to the specific targets. Here, we introduce the source of different kinds of cell-penetrating peptides (CPPs) and cell-targeting peptides (CTPs) that have been used or could be used in constructing PDCs as well as their latest application in delivering drugs. What's more, the approaches of developing CPPs and CTPs and the techniques to discover novel peptides are focused on and summarized in the review. This review aims to help relevant researchers fast understand the research status of peptides in PDCs and carry forward the process of novel peptides discovery.
Collapse
Affiliation(s)
- Jiaqi Zhou
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China
| | - Yuanyuan Li
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China
| | - Wenlong Huang
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China; Jiangsu Key Laboratory of Drug Discovery for Metabolic Disease, China Pharmaceutical University, 24 Tongjiaxiang, Nanjing, 210009, PR China
| | - Wei Shi
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China.
| | - Hai Qian
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China; Jiangsu Key Laboratory of Drug Discovery for Metabolic Disease, China Pharmaceutical University, 24 Tongjiaxiang, Nanjing, 210009, PR China.
| |
Collapse
|
20
|
K-Nearest Neighbor and Random Forest-Based Prediction of Putative Tyrosinase Inhibitory Peptides of Abalone Haliotis diversicolor. Molecules 2021; 26:molecules26123671. [PMID: 34208619 PMCID: PMC8234169 DOI: 10.3390/molecules26123671] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 06/06/2021] [Accepted: 06/15/2021] [Indexed: 12/28/2022] Open
Abstract
Skin pigment disorders are common cosmetic and medical problems. Many known compounds inhibit the key melanin-producing enzyme, tyrosinase, but their use is limited due to side effects. Natural-derived peptides also display tyrosinase inhibition. Abalone is a good source of peptides, and the abalone proteins have been used widely in pharmaceutical and cosmetic products, but not for melanin inhibition. This study aimed to predict putative tyrosinase inhibitory peptides (TIPs) from abalone, Haliotis diversicolor, using k-nearest neighbor (kNN) and random forest (RF) algorithms. The kNN and RF predictors were trained and tested against 133 peptides with known anti-tyrosinase properties with 97% and 99% accuracy. The kNN predictor suggested 1075 putative TIPs and six TIPs from the RF predictor. Two helical peptides were predicted by both methods and showed possible interaction with the predicted structure of mushroom tyrosinase, similar to those of the known TIPs. These two peptides had arginine and aromatic amino acids, which were common to the known TIPs, suggesting non-competitive inhibition on the tyrosinase. Therefore, the first version of the TIP predictors could suggest a reasonable number of the TIP candidates for further experiments. More experimental data will be important for improving the performance of these predictors, and they can be extended to discover more TIPs from other organisms. The confirmation of TIPs in abalone will be a new commercial opportunity for abalone farmers and industry.
Collapse
|
21
|
Spänig S, Mohsen S, Hattab G, Hauschild AC, Heider D. A large-scale comparative study on peptide encodings for biomedical classification. NAR Genom Bioinform 2021; 3:lqab039. [PMID: 34046590 PMCID: PMC8140742 DOI: 10.1093/nargab/lqab039] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Revised: 04/13/2021] [Accepted: 04/26/2021] [Indexed: 01/19/2023] Open
Abstract
Owing to the great variety of distinct peptide encodings, working on a biomedical classification task at hand is challenging. Researchers have to determine encodings capable to represent underlying patterns as numerical input for the subsequent machine learning. A general guideline is lacking in the literature, thus, we present here the first large-scale comprehensive study to investigate the performance of a wide range of encodings on multiple datasets from different biomedical domains. For the sake of completeness, we added additional sequence- and structure-based encodings. In particular, we collected 50 biomedical datasets and defined a fixed parameter space for 48 encoding groups, leading to a total of 397 700 encoded datasets. Our results demonstrate that none of the encodings are superior for all biomedical domains. Nevertheless, some encodings often outperform others, thus reducing the initial encoding selection substantially. Our work offers researchers to objectively compare novel encodings to the state of the art. Our findings pave the way for a more sophisticated encoding optimization, for example, as part of automated machine learning pipelines. The work presented here is implemented as a large-scale, end-to-end workflow designed for easy reproducibility and extensibility. All standardized datasets and results are available for download to comply with FAIR standards.
Collapse
Affiliation(s)
- Sebastian Spänig
- Data Science in Biomedicine, Department of Mathematics and Computer Science, University of Marburg, Hans-Meerwein-Str. 6, D-35032 Marburg, Germany
| | - Siba Mohsen
- Data Science in Biomedicine, Department of Mathematics and Computer Science, University of Marburg, Hans-Meerwein-Str. 6, D-35032 Marburg, Germany
| | - Georges Hattab
- Data Science in Biomedicine, Department of Mathematics and Computer Science, University of Marburg, Hans-Meerwein-Str. 6, D-35032 Marburg, Germany
| | - Anne-Christin Hauschild
- Data Science in Biomedicine, Department of Mathematics and Computer Science, University of Marburg, Hans-Meerwein-Str. 6, D-35032 Marburg, Germany
| | - Dominik Heider
- To whom correspondence should be addressed. Tel: +49 6421 2821579;
| |
Collapse
|
22
|
Moritsugu K, Takeuchi K, Kamiya N, Higo J, Yasumatsu I, Fukunishi Y, Fukuda I. Flexibility and Cell Permeability of Cyclic Ras-Inhibitor Peptides Revealed by the Coupled Nosé-Hoover Equation. J Chem Inf Model 2021; 61:1921-1930. [PMID: 33835817 DOI: 10.1021/acs.jcim.0c01427] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Quantifying the cell permeability of cyclic peptides is crucial for their rational drug design. However, the reasons remain unclear why a minor chemical modification, such as the difference between Ras inhibitors cyclorasin 9A5 and 9A54, can substantially change a peptide's permeability. To address this question, we performed enhanced sampling simulations of these two 11-mer peptides using the coupled Nosé-Hoover equation (cNH) we recently developed. The present cNH simulations realized temperature fluctuations over a wide range (240-600 K) in a dynamic manner, allowing structural samplings that were well validated by nuclear Overhauser effect measurements. The derived structural ensembles were comprehensively analyzed by all-atom structural clustering, mapping the derived clusters onto principal components (PCs) that characterize the cyclic structure, and calculating cluster-dependent geometric and chemical properties. The planar-open conformation was dominant in aqueous solvent, owing to inclusion of the Trp side chain in the main-chain ring, while the compact-closed conformation, which favors cell permeation due to its compactness and high polarity, was also accessible. Conformation-dependent cell permeability was observed in one of the derived PCs, demonstrating that decreased cell permeability in 9A54 is due to the high free energy barrier separating the two conformations. The origin of the change in free energy surface was determined to be loss of flexibility in the modified residues 2-3, resulting from the increased bulkiness of their side chains. The derived molecular mechanism of cell permeability highlights the significance of complete structural dynamics surveys for accelerating drug development with cyclic peptides.
Collapse
Affiliation(s)
- Kei Moritsugu
- Graduate School of Medical Life Science, Yokohama City University, 1-7-29 Suehirocho, Tsurumi, Yokohama, Kanagawa 230-0045, Japan
| | - Koh Takeuchi
- Cellular and Molecular Biotechnology Research Institute, National Institute of Advanced Industrial Science and Technology, 2-3-26 Aomi, Koto-ku, Tokyo 135-0064, Japan
| | - Narutoshi Kamiya
- Graduate School of Simulation Studies, University of Hyogo, 7-1-28 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.,Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Junichi Higo
- Graduate School of Simulation Studies, University of Hyogo, 7-1-28 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.,Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Isao Yasumatsu
- Structure-Based Drug Design Group, Organic Synthesis Department, Daiichi Sankyo RD Novare Co., Ltd., 1-2-58 Hiromachi, Shinagawa-ku, Tokyo 140-8710, Japan
| | - Yoshifumi Fukunishi
- Cellular and Molecular Biotechnology Research Institute, National Institute of Advanced Industrial Science and Technology, 2-3-26 Aomi, Koto-ku, Tokyo 135-0064, Japan
| | - Ikuo Fukuda
- Graduate School of Simulation Studies, University of Hyogo, 7-1-28 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.,Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| |
Collapse
|
23
|
Predicting cell-penetrating peptides using machine learning algorithms and navigating in their chemical space. Sci Rep 2021; 11:7628. [PMID: 33828175 PMCID: PMC8027643 DOI: 10.1038/s41598-021-87134-w] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 03/24/2021] [Indexed: 02/01/2023] Open
Abstract
Cell-penetrating peptides (CPPs) are naturally able to cross the lipid bilayer membrane that protects cells. These peptides share common structural and physicochemical properties and show different pharmaceutical applications, among which drug delivery is the most important. Due to their ability to cross the membranes by pulling high-molecular-weight polar molecules, they are termed Trojan horses. In this study, we proposed a machine learning (ML)-based framework named BChemRF-CPPred (beyond chemical rules-based framework for CPP prediction) that uses an artificial neural network, a support vector machine, and a Gaussian process classifier to differentiate CPPs from non-CPPs, using structure- and sequence-based descriptors extracted from PDB and FASTA formats. The performance of our algorithm was evaluated by tenfold cross-validation and compared with those of previously reported prediction tools using an independent dataset. The BChemRF-CPPred satisfactorily identified CPP-like structures using natural and synthetic modified peptide libraries and also obtained better performance than those of previously reported ML-based algorithms, reaching the independent test accuracy of 90.66% (AUC = 0.9365) for PDB, and an accuracy of 86.5% (AUC = 0.9216) for FASTA input. Moreover, our analyses of the CPP chemical space demonstrated that these peptides break some molecular rules related to the prediction of permeability of therapeutic molecules in cell membranes. This is the first comprehensive analysis to predict synthetic and natural CPP structures and to evaluate their chemical space using an ML-based framework. Our algorithm is freely available for academic use at http://comptools.linc.ufpa.br/BChemRF-CPPred .
Collapse
|
24
|
Narayanan H, Dingfelder F, Butté A, Lorenzen N, Sokolov M, Arosio P. Machine Learning for Biologics: Opportunities for Protein Engineering, Developability, and Formulation. Trends Pharmacol Sci 2021; 42:151-165. [DOI: 10.1016/j.tips.2020.12.004] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Revised: 12/10/2020] [Accepted: 12/16/2020] [Indexed: 12/19/2022]
|
25
|
Porosk L, Gaidutšik I, Langel Ü. Approaches for the discovery of new cell-penetrating peptides. Expert Opin Drug Discov 2020; 16:553-565. [PMID: 33874824 DOI: 10.1080/17460441.2021.1851187] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
Abstract
Introduction: The capability of cell-penetrating peptides (CPP), also known as protein transduction domains (PTD), to enter into cells possibly with an attached cargo, makes their application as delivery vectors or as direct therapeutics compelling. They are generally biocompatible, nontoxic, and easy to synthesize and modify. Three decades after the discovery of the first CPPs, ~2,000 CPP sequences have been identified, and many more predicted. Nevertheless, the field has a strong commitment to authenticate new, more efficient, and specific CPPs.Areas covered: Although a scattering of CPPs have been found by chance, various systematic approaches have been developed and refined over the years to directly aid the identification and depiction of new peptide-based delivery vectors or therapeutics. Here, the authors give an overview of CPPs, and review various approaches of discovering new ones. An emphasis is placed on in silico methods, as these have advanced rapidly in recent years.Expert opinion: Although there are many known CPPs, there is a need to find more efficient and specific CPPs. Several approaches are used to identify such sequences. The success of these approaches depends on the advancement of others and the successful prediction of CPP sequences relies on experimental data.
Collapse
Affiliation(s)
- Ly Porosk
- Institute of Technology, University of Tartu, Tartu, Estonia
| | - Ilja Gaidutšik
- Institute of Technology, University of Tartu, Tartu, Estonia
| | - Ülo Langel
- Department Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| |
Collapse
|
26
|
Cppsite 2.0: An Available Database of Experimentally Validated Cell-Penetrating Peptides Predicting their Secondary and Tertiary Structures. J Mol Biol 2020; 433:166703. [PMID: 33186582 DOI: 10.1016/j.jmb.2020.11.002] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 11/04/2020] [Accepted: 11/05/2020] [Indexed: 12/11/2022]
Abstract
One of the biggest barriers in drug and vaccine development is to find an effective delivery system. Cell-penetrating peptides (CPPs) play a crucial role for delivery of biological cargoes and pass them through the membranes. Several databases have been developed for therapeutic peptides as potential drug candidates and delivery vehicles. A rapid growth has occurred in many patents and research articles on CPPs as therapeutic peptides. To save time and cost in laboratories, prediction and design of CPPs before in vitro/in vivo experiments using computational methods and online web servers are rational. Various online web servers which provide prediction of CPPs including CellPPD, CPPpred, CPPred-RF and MLCPP, and also different curated databases that present validated information of CPPs such as CPPsite 2.0 have been developed up to now. Two methods including CellPPD and CPPpred were applied to predict and design potent CPPs. CPPsite 2.0 is a user-friendly updated database that provides various information about CPPs and contains 1855 entries. This database provides comprehensive information on experimentally tested CPPs and prediction of their secondary and tertiary structures to realize their structure-function relationship. Furthermore, each entry presents information of a CPP including chirality, origin, nature of peptide, sub-cellular localization, uptake mechanism and efficiency, amino acid composition, hydrophobicity, and physicochemical properties. One of main goals of CPPsite 2.0 database is to provide the latest datasets of CPPs for analysis and development of CPP prediction methods. CPPsite 2.0 is freely available at https://webs.iiitd.edu.in/raghava/cppsite.
Collapse
|
27
|
Li J, Ma X, Li X, Gu J. PPAI: a web server for predicting protein-aptamer interactions. BMC Bioinformatics 2020; 21:236. [PMID: 32517696 PMCID: PMC7285591 DOI: 10.1186/s12859-020-03574-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 05/28/2020] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND The interactions between proteins and aptamers are prevalent in organisms and play an important role in various life activities. Thanks to the rapid accumulation of protein-aptamer interaction data, it is necessary and feasible to construct an accurate and effective computational model to predict aptamers binding to certain interested proteins and protein-aptamer interactions, which is beneficial for understanding mechanisms of protein-aptamer interactions and improving aptamer-based therapies. RESULTS In this study, a novel web server named PPAI is developed to predict aptamers and protein-aptamer interactions with key sequence features of proteins/aptamers and a machine learning framework integrated adaboost and random forest. A new method for extracting several key sequence features of both proteins and aptamers is presented, where the features for proteins are extracted from amino acid composition, pseudo-amino acid composition, grouped amino acid composition, C/T/D composition and sequence-order-coupling number, while the features for aptamers are extracted from nucleotide composition, pseudo-nucleotide composition (PseKNC) and normalized Moreau-Broto autocorrelation coefficient. On the basis of these feature sets and balanced the samples with SMOTE algorithm, we validate the performance of PPAI by the independent test set. The results demonstrate that the Area Under Curve (AUC) is 0.907 for prediction of aptamer, while the AUC reaches 0.871 for prediction of protein-aptamer interactions. CONCLUSION These results indicate that PPAI can query aptamers and proteins, predict aptamers and predict protein-aptamer interactions in batch mode precisely and efficiently, which would be a novel bioinformatics tool for the research of protein-aptamer interactions. PPAI web-server is freely available at http://39.96.85.9/PPAI.
Collapse
Affiliation(s)
- Jianwei Li
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China. .,Tianjin Key Laboratory of Bioelectromagnetic Technology and Intelligent Health, Hebei University of Technology, Tianjin, China.
| | - Xiaoyu Ma
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
| | - Xichuan Li
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin, China
| | - Junhua Gu
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
| |
Collapse
|
28
|
|
29
|
Feger G, Angelov B, Angelova A. Prediction of Amphiphilic Cell-Penetrating Peptide Building Blocks from Protein-Derived Amino Acid Sequences for Engineering of Drug Delivery Nanoassemblies. J Phys Chem B 2020; 124:4069-4078. [PMID: 32337991 DOI: 10.1021/acs.jpcb.0c01618] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Amphiphilic molecules, forming self-assembled nanoarchitectures, are typically composed of hydrophobic and hydrophilic domains. Peptide amphiphiles can be designed from two, three, or four building blocks imparting novel structural and functional properties and affinities for interaction with cellular membranes or intracellular organelles. Here we present a combined numerical approach to design amphiphilic peptide scaffolds that are derived from the human nuclear Ki-67 protein. Ki-67 acts, like a biosurfactant, as a steric and electrostatic charge barrier against the collapse of mitotic chromosomes. The proposed predictive design of new Ki-67 protein-derived amphiphilic amino acid sequences exploits the computational outcomes of a set of web-accessible predictors, which are based on machine learning methods. The ensemble of such artificial intelligence algorithms, involving support vector machine (SVM), random forest (RF) classifiers, and neural networks (NN), enables the nanoengineering of a broad range of innovative peptide materials for therapeutic delivery in various applications. Amphiphilic cell-penetrating peptides (CPP), derived from natural protein sequences, may spontaneously form self-assembled nanocarriers characterized by enhanced cellular uptake. Thanks to their inherent low immunogenicity, they may enable the safe delivery of therapeutic molecules across the biological barriers.
Collapse
Affiliation(s)
- Guillaume Feger
- Université Paris-Saclay, CNRS, Institut Galien Paris-Saclay UMR8612, F-92296 Châtenay-Malabry, France
| | - Borislav Angelov
- Institute of Physics, ELI Beamlines, Academy of Sciences of the Czech Republic, Na Slovance 2, CZ-18221 Prague, Czech Republic
| | - Angelina Angelova
- Université Paris-Saclay, CNRS, Institut Galien Paris-Saclay UMR8612, F-92296 Châtenay-Malabry, France
| |
Collapse
|
30
|
Wang S, Cao Z, Li M, Yue Y. G-DipC: An Improved Feature Representation Method for Short Sequences to Predict the Type of Cargo in Cell-Penetrating Peptides. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:739-747. [PMID: 31352350 DOI: 10.1109/tcbb.2019.2930993] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Cell-penetrating peptides (CPPs) are functional short peptides with high carrying capacity. CPP sequences with targeting functions for the highly efficient delivery of drugs to target cells. In this paper, which is focused on the prediction of the cargo category of CPPs, a biocomputational model is constructed to efficiently distinguish the category of cargo carried by CPPs as macromolecular carriers among the seven known deliverable cargo categories. Based on dipeptide composition (DipC), an improved feature representation method, general dipeptide composition (G-DipC) is proposed for short peptide sequences and can effectively increase the abundance of features represented. Then linear discriminant analysis (LDA) is applied to mine some important low-dimensional features of G-DipC and a predictive model is built with the XGBoost algorithm. Experimental results with five-fold cross validation show that G-DipC improves accuracy by 25 and 5 percent compared with amino acid composition (AAC) and DipC, respectively. G-DipC is even found to be better than tripeptide composition (TipC). Thus, the proposed model provides a novel resource for the study of cell-penetrating peptides, and the improved dipeptide composition G-DipC can be widely adapted to determine the feature representation of other biological sequences.
Collapse
|
31
|
Gomes Dos Reis L, Traini D. Advances in the use of cell penetrating peptides for respiratory drug delivery. Expert Opin Drug Deliv 2020; 17:647-664. [PMID: 32138567 DOI: 10.1080/17425247.2020.1739646] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Introduction: Respiratory diseases are leading causes of death in the world, still inhalation therapies are the largest fail in drug development. There is an evident need to develop new therapies. Biomolecules represent apotential therapeutic agent in this regard, however their translation to the clinic is hindered by the lack of tools to efficiently deliver molecules. Cell penetrating peptides (CPPs) have arisen as apotential strategy for intracellular delivery that could theoretically enable the translation of new therapies.Areas covered: In this review, the use of CPPs as astrategy to deliver different molecules (cargoes) to treat lung-relateddiseases will be the focus. Abrief description of these molecules and the innovative methods in designing new CPPs is presented. The delivery of different cargoes (proteins, peptides, poorly soluble drugs and nucleic acids) using CPPs is discussed, focusing on benefits to treat different respiratory diseases like inflammatory disorders, cystic fibrosis and lung cancer.Expert opinion: The advantages of using CPPs to deliver biomolecules and poorly soluble drugs to the lungs is evident. This field has advanced in the past few years toward targeted intracellular delivery, although further studies are needed to fully understand its potential and limitations in vitro and in vivo.
Collapse
Affiliation(s)
- Larissa Gomes Dos Reis
- Respiratory Technology, Woolcock Institute of Medical Research and Discipline of Pharmacology, Faculty of Medicine and Health, The University of Sydney, Sydney, Australia
| | - Daniela Traini
- Respiratory Technology, Woolcock Institute of Medical Research and Discipline of Pharmacology, Faculty of Medicine and Health, The University of Sydney, Sydney, Australia
| |
Collapse
|
32
|
Arif M, Ahmad S, Ali F, Fang G, Li M, Yu DJ. TargetCPP: accurate prediction of cell-penetrating peptides from optimized multi-scale features using gradient boost decision tree. J Comput Aided Mol Des 2020; 34:841-856. [PMID: 32180124 DOI: 10.1007/s10822-020-00307-z] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Accepted: 03/09/2020] [Indexed: 02/08/2023]
Abstract
Cell-penetrating peptides (CPPs) are short length permeable proteins have emerged as drugs delivery tool of therapeutic agents including genetic materials and macromolecules into cells. Recently, CPP has become a hotspot avenue for life science research and paved a new way of disease treatment without harmful impact on cell viability due to nontoxic characteristic. Therefore, the correct identification of CPPs will provide hints for medical applications. Considering the shortcomings of traditional experimental CPPs identification, it is urgently needed to design intelligent predictor for accurate identification of CPPs for the large scale uncharacterized sequences. We develop a novel computational method, called TargetCPP, to discriminate CPPs from Non-CPPs with improved accuracy. In TargetCPP, first the peptide sequences are formulated with four distinct encoding methods i.e., composite protein sequence representation, composition transition and distribution, split amino acid composition, and information theory features. These dominant feature vectors were fused and applied intelligent minimum redundancy and maximum relevancy feature selection method to choose an optimal subset of features. Finally, the predictive model is learned through different classification algorithms on the optimized features. Among these classifiers, gradient boost decision tree algorithm achieved excellent performance throughout the experiments. Notably, the TargetCPP tool attained high prediction Accuracy of 93.54% and 88.28% using jackknife and independent test, respectively. Empirical outcomes prove the superiority and potency of proposed bioinformatics method over state-of-the-art methods. It is highly anticipated that the outcomes of this study will provide a strong background for large scale prediction of CPPs and instructive guidance in clinical therapy and medical applications.
Collapse
Affiliation(s)
- Muhammad Arif
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Saeed Ahmad
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Farman Ali
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Ge Fang
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Min Li
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China.
| |
Collapse
|
33
|
Fu X, Cai L, Zeng X, Zou Q. StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency. Bioinformatics 2020; 36:3028-3034. [DOI: 10.1093/bioinformatics/btaa131] [Citation(s) in RCA: 74] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2019] [Revised: 02/19/2020] [Accepted: 02/25/2020] [Indexed: 12/19/2022] Open
Abstract
Abstract
Motivation
Cell-penetrating peptides (CPPs) are a vehicle for transporting into living cells pharmacologically active molecules, such as short interfering RNAs, nanoparticles, plasmid DNAs and small peptides, thus offering great potential as future therapeutics. Existing experimental techniques for identifying CPPs are time-consuming and expensive. Thus, the prediction of CPPs from peptide sequences by using computational methods can be useful to annotate and guide the experimental process quickly. Many machine learning-based methods have recently emerged for identifying CPPs. Although considerable progress has been made, existing methods still have low feature representation capabilities, thereby limiting further performance improvements.
Results
We propose a method called StackCPPred, which proposes three feature methods on the basis of the pairwise energy content of the residue as follows: RECM-composition, PseRECM and RECM–DWT. These features are used to train stacking-based machine learning methods to effectively predict CPPs. On the basis of the CPP924 and CPPsite3 datasets with jackknife validation, StackDPPred achieved 94.5% and 78.3% accuracy, which was 2.9% and 5.8% higher than the state-of-the-art CPP predictors, respectively. StackCPPred can be a powerful tool for predicting CPPs and their uptake efficiency, facilitating hypothesis-driven experimental design and accelerating their applications in clinical therapy.
Availability and implementation
Source code and data can be downloaded from https://github.com/Excelsior511/StackCPPred.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiangzheng Fu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China
| | - Lijun Cai
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China
| | - Xiangxiang Zeng
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China
| |
Collapse
|
34
|
Basith S, Manavalan B, Hwan Shin T, Lee G. Machine intelligence in peptide therapeutics: A next‐generation tool for rapid disease screening. Med Res Rev 2020; 40:1276-1314. [DOI: 10.1002/med.21658] [Citation(s) in RCA: 139] [Impact Index Per Article: 34.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Revised: 11/26/2019] [Accepted: 12/16/2019] [Indexed: 12/12/2022]
Affiliation(s)
- Shaherin Basith
- Department of PhysiologyAjou University School of MedicineSuwon Republic of Korea
| | | | - Tae Hwan Shin
- Department of PhysiologyAjou University School of MedicineSuwon Republic of Korea
| | - Gwang Lee
- Department of PhysiologyAjou University School of MedicineSuwon Republic of Korea
| |
Collapse
|
35
|
Grasso G, Mercuri S, Danani A, Deriu MA. Biofunctionalization of Silica Nanoparticles with Cell-Penetrating Peptides: Adsorption Mechanism and Binding Energy Estimation. J Phys Chem B 2019; 123:10622-10630. [DOI: 10.1021/acs.jpcb.9b08106] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Affiliation(s)
- Gianvito Grasso
- Istituto Dalle Molle di studi sull’Intelligenza Artificiale (IDSIA), Scuola Universitaria Professionale della Svizzera italiana (SUPSI), Università della Svizzera italiana (USI), Centro Galleria 2, Manno, CH-6928, Switzerland
| | - Stefano Mercuri
- PolitoBIOMed Lab, Department of Mechanical and Aerospace Engineering, Politecnico di Torino, Corso Duca degli Abruzzi 24, IT-10128, Torino, Italy
| | - Andrea Danani
- Istituto Dalle Molle di studi sull’Intelligenza Artificiale (IDSIA), Scuola Universitaria Professionale della Svizzera italiana (SUPSI), Università della Svizzera italiana (USI), Centro Galleria 2, Manno, CH-6928, Switzerland
| | - Marco A. Deriu
- PolitoBIOMed Lab, Department of Mechanical and Aerospace Engineering, Politecnico di Torino, Corso Duca degli Abruzzi 24, IT-10128, Torino, Italy
| |
Collapse
|
36
|
Ferreira A, Lapa R, Vale N. Combination of Gemcitabine with Cell-Penetrating Peptides: A Pharmacokinetic Approach Using In Silico Tools. Biomolecules 2019; 9:biom9110693. [PMID: 31690028 PMCID: PMC6921036 DOI: 10.3390/biom9110693] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 10/07/2019] [Accepted: 11/01/2019] [Indexed: 02/06/2023] Open
Abstract
Gemcitabine is an anticancer drug used to treat a wide range of solid tumors and is a first line treatment for pancreatic cancer. Our group has previously developed novel conjugates of gemcitabine with cell-penetrating peptides (CPP), and here we report some preliminary data regarding the pharmacokinetics of gemcitabine, two gemcitabine-CPP conjugates and respective CPP gathered from GastroPlus™, and analyze these results considering our previous evaluation of gemcitabine release and conjugates’ bioactivity. Additionally, seeking to shed some light on the relation between the penetration ability of CPP and their physicochemical properties, chemical descriptors for the 20 natural amino acids were calculated, a new principal property scale (z-scale) was created and CPP prediction models were developed, establishing quantitative structure-activity relationships (QSAR). The z-scores of the peptides conjugated with gemcitabine are presented and analyzed with the aforementioned data.
Collapse
Affiliation(s)
- Abigail Ferreira
- Laboratory of Pharmacology, Department of Drug Sciences, Faculty of Pharmacy, University of Porto, Rua de Jorge Viterbo Ferreira, 228, 4050-313 Porto, Portugal.
- LAQV/REQUIMTE, Laboratory of Applied Chemistry, Department of Chemical Sciences, Faculty of Pharmacy, University of Porto, Rua de Jorge Viterbo Ferreira, 228, 4050-313 Porto, Portugal.
| | - Rui Lapa
- LAQV/REQUIMTE, Laboratory of Applied Chemistry, Department of Chemical Sciences, Faculty of Pharmacy, University of Porto, Rua de Jorge Viterbo Ferreira, 228, 4050-313 Porto, Portugal.
| | - Nuno Vale
- Laboratory of Pharmacology, Department of Drug Sciences, Faculty of Pharmacy, University of Porto, Rua de Jorge Viterbo Ferreira, 228, 4050-313 Porto, Portugal.
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Rua Júlio Amaral de Carvalho, 45, 4200-135 Porto, Portugal.
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Rua Alfredo Allen, 208, 4200-135 Porto, Portugal.
- Department of Molecular Pathology and Immunology, Abel Salazar Biomedical Sciences Institute (ICBAS), University of Porto, Rua de Jorge Viterbo Ferreira, 228, 4050-313 Porto, Portugal.
| |
Collapse
|
37
|
Adhikari S, Leissa JA, Karlsson AJ. Beyond function: Engineering improved peptides for therapeutic applications. AIChE J 2019. [DOI: 10.1002/aic.16776] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Sayanee Adhikari
- Department of Chemical and Biomolecular Engineering University of Maryland College Park Maryland
| | - Jesse A. Leissa
- Department of Chemical and Biomolecular Engineering University of Maryland College Park Maryland
| | - Amy J. Karlsson
- Department of Chemical and Biomolecular Engineering University of Maryland College Park Maryland
- Fischell Department of Bioengineering University of Maryland College Park Maryland
| |
Collapse
|
38
|
Polyhistidine facilitates direct membrane translocation of cell-penetrating peptides into cells. Sci Rep 2019; 9:9398. [PMID: 31253836 PMCID: PMC6599048 DOI: 10.1038/s41598-019-45830-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2018] [Accepted: 06/14/2019] [Indexed: 12/27/2022] Open
Abstract
The bovine lactoferricin L6 (RRWQWR) has been previously identified as a novel cell-penetrating peptide (CPP) that is able to efficiently internalize into human cells. L6 interacts with quantum dots (QDs) noncovalently to generate stable L6/QD complexes that enter cells by endocytosis. In this study, we demonstrate a modified L6 (HL6; CHHHHHRRWQWRHHHHHC), in which short polyhistidine peptides are introduced into both flanks of L6, has enhanced cell-penetrating ability in human bronchoalveolar carcinoma A549 cells. The mechanism of cellular uptake of HL6/QD complexes is primarily direct membrane translocation rather than endocytosis. Dimethyl sulfoxide (DMSO), but not pyrenebutyrate (PB), ethanol, oleic acid, or 1,2-benzisothiazol-3(2 H)-one (BIT), slightly enhances HL6-mediated protein transduction efficiency. Neither HL6 nor HL6/QD complexes are cytotoxic to A549 or HeLa cells. These results indicate that HL6 could be a more efficient drug carrier than L6 for biomedical as well as biotechnological applications, and that the function of polyhistidine peptides is critical to CPP-mediated protein transduction.
Collapse
|
39
|
Wei HH, Yang W, Tang H, Lin H. The Development of Machine Learning Methods in Cell-Penetrating Peptides Identification: A Brief Review. Curr Drug Metab 2019; 20:217-223. [DOI: 10.2174/1389200219666181010114750] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Revised: 05/21/2018] [Accepted: 08/02/2018] [Indexed: 11/22/2022]
Abstract
Background:Cell-penetrating Peptides (CPPs) are important short peptides that facilitate cellular intake or uptake of various molecules. CPPs can transport drug molecules through the plasma membrane and send these molecules to different cellular organelles. Thus, CPP identification and related mechanisms have been extensively explored. In order to reveal the penetration mechanisms of a large number of CPPs, it is necessary to develop convenient and fast methods for CPPs identification.Methods:Biochemical experiments can provide precise details for accurately identifying CPP, but these methods are expensive and laborious. To overcome these disadvantages, several computational methods have been developed to identify CPPs. We have performed review on the development of machine learning methods in CPP identification. This review provides an insight into CPP identification.Results:We summarized the machine learning-based CPP identification methods and compared the construction strategies of 11 different computational methods. Furthermore, we pointed out the limitations and difficulties in predicting CPPs.Conclusion:In this review, the last studies on CPP identification using machine learning method were reported. We also discussed the future development direction of CPP recognition with computational methods.
Collapse
Affiliation(s)
- Huan-Huan Wei
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Wuritu Yang
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hua Tang
- Department of Pathophysiology, Southwest Medical University, Luzhou, China
| | - Hao Lin
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
40
|
Deprey K, Becker L, Kritzer J, Plückthun A. Trapped! A Critical Evaluation of Methods for Measuring Total Cellular Uptake versus Cytosolic Localization. Bioconjug Chem 2019; 30:1006-1027. [PMID: 30882208 PMCID: PMC6527423 DOI: 10.1021/acs.bioconjchem.9b00112] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Biomolecules have many properties that make them promising for intracellular therapeutic applications, but delivery remains a key challenge because large biomolecules cannot easily enter the cytosol. Furthermore, quantification of total intracellular versus cytosolic concentrations remains demanding, and the determination of delivery efficiency is thus not straightforward. In this review, we discuss strategies for delivering biomolecules into the cytosol and briefly summarize the mechanisms of uptake for these systems. We then describe commonly used methods to measure total cellular uptake and, more selectively, cytosolic localization, and discuss the major advantages and drawbacks of each method. We critically evaluate methods of measuring "cell penetration" that do not adequately distinguish total cellular uptake and cytosolic localization, which often lead to inaccurate interpretations of a molecule's cytosolic localization. Finally, we summarize the properties and components of each method, including the main caveats of each, to allow for informed decisions about method selection for specific applications. When applied correctly and interpreted carefully, methods for quantifying cytosolic localization offer valuable insight into the bioactivity of biomolecules and potentially the prospects for their eventual development into therapeutics.
Collapse
Affiliation(s)
- Kirsten Deprey
- Department of Chemistry, Tufts University, 62 Talbot Avenue, Medford, Massachusetts 02155, United States
| | - Lukas Becker
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Joshua Kritzer
- Department of Chemistry, Tufts University, 62 Talbot Avenue, Medford, Massachusetts 02155, United States
| | - Andreas Plückthun
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| |
Collapse
|
41
|
Yu JF, Qu A, Tang HC, Wang FH, Wang CL, Wang HM, Wang JH, Zhu HQ. A novel numerical model for protein sequences analysis based on spherical coordinates and multiple physicochemical properties of amino acids. Biopolymers 2019; 110:e23282. [PMID: 30977898 DOI: 10.1002/bip.23282] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Revised: 03/28/2019] [Accepted: 03/28/2019] [Indexed: 01/25/2023]
Abstract
How to characterize short protein sequences to make an effective connection to their functions is an unsolved problem. Here we propose to map the physicochemical properties of each amino acid onto unit spheres so that each protein sequence can be represented quantitatively. We demonstrate the usefulness of this representation by applying it to the prediction of cell penetrating peptides. We show that its combination with traditional composition features yields the best performance across different datasets, among several methods compared. For the convenience of users, a web server has been established for automatic calculations of the proposed features at http://biophy.dzu.edu.cn/SNumD/.
Collapse
Affiliation(s)
- Jia-Feng Yu
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China.,Department of Biomedical Engineering, College of Engineering, and Centre for Quantitative Biology, Peking University, Beijing, China
| | - Ang Qu
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Hu-Cheng Tang
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Fang-Hua Wang
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Chun-Ling Wang
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Hong-Mei Wang
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Ji-Hua Wang
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Huai-Qiu Zhu
- Department of Biomedical Engineering, College of Engineering, and Centre for Quantitative Biology, Peking University, Beijing, China
| |
Collapse
|
42
|
|
43
|
NeuroPIpred: a tool to predict, design and scan insect neuropeptides. Sci Rep 2019; 9:5129. [PMID: 30914676 PMCID: PMC6435694 DOI: 10.1038/s41598-019-41538-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Accepted: 03/05/2019] [Indexed: 12/15/2022] Open
Abstract
Insect neuropeptides and their associated receptors have been one of the potential targets for the pest control. The present study describes in silico models developed using natural and modified insect neuropeptides for predicting and designing new neuropeptides. Amino acid composition analysis revealed the preference of residues C, D, E, F, G, N, S, and Y in insect neuropeptides The positional residue preference analysis show that in natural neuropeptides residues like A, N, F, D, P, S, and I are preferred at N terminus and residues like L, R, P, F, N, and G are preferred at C terminus. Prediction models were developed using input features like amino acid and dipeptide composition, binary profiles and implementing different machine learning techniques. Dipeptide composition based SVM model performed best among all the models. In case of NeuroPIpred_DS1, model achieved an accuracy of 86.50% accuracy and 0.73 MCC on training dataset and 83.71% accuracy and 0.67 MCC on validation dataset whereas in case of NeuroPIpred_DS2, model achieved 97.47% accuracy and 0.95 MCC on training dataset and 97.93% accuracy and 0.96 MCC on validation dataset. In order to assist researchers, we created standalone and user friendly web server NeuroPIpred, available at (https://webs.iiitd.edu.in/raghava/neuropipred.)
Collapse
|
44
|
Zahiri J, Khorsand B, Yousefi AA, Kargar M, Shirali Hossein Zade R, Mahdevar G. AntAngioCOOL: computational detection of anti-angiogenic peptides. J Transl Med 2019; 17:71. [PMID: 30832671 PMCID: PMC6399940 DOI: 10.1186/s12967-019-1813-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 02/21/2019] [Indexed: 01/01/2023] Open
Abstract
Background Angiogenesis inhibition research is a cutting edge area in angiogenesis-dependent disease therapy, especially in cancer therapy. Recently, studies on anti-angiogenic peptides have provided promising results in the field of cancer treatment. Methods A non-redundant dataset of 135 anti-angiogenic peptides (positive instances) and 135 non anti-angiogenic peptides (negative instances) was used in this study. Also, 20% of each class were selected to construct an independent test dataset (see Additional files 1, 2). We proposed an effective machine learning based R package (AntAngioCOOL) to predict anti-angiogenic peptides. We have examined more than 200 different classifiers to build an efficient predictor. Also, more than 17,000 features were extracted to encode the peptides. Results Finally, more than 2000 informative features were selected to train the classifiers for detecting anti-angiogenic peptides. AntAngioCOOL includes three different models that can be selected by the user for different purposes; it is the most sensitive, most specific and most accurate. According to the obtained results AntAngioCOOL can effectively suggest anti-angiogenic peptides; this tool achieved sensitivity of 88%, specificity of 77% and accuracy of 75% on the independent test set. AntAngioCOOL can be accessed at https://cran.r-project.org/. Conclusions Only 2% of the extracted descriptors were used to build the predictor models. The results revealed that physico-chemical profile is the most important feature type in predicting anti-angiogenic peptides. Also, atomic profile and PseAAC are the other important features. Electronic supplementary material The online version of this article (10.1186/s12967-019-1813-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Javad Zahiri
- Bioinformatics and Computational Omics. Lab (BioCOOL), Department of Biophysics, Faculty of Biological Sciences, Tarbiat Modares University (TMU), Tehran, Iran.
| | - Babak Khorsand
- Computer Engineering Department, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Ali Akbar Yousefi
- Department of Computer Engineering, Faculty of Engineering, University of Science and Culture, Tehran, Iran
| | - Mohammadjavad Kargar
- Department of Computer Engineering, Faculty of Engineering, University of Science and Culture, Tehran, Iran
| | | | - Ghasem Mahdevar
- Department of Mathematics, Faculty of Sciences, University of Isfahan, Isfahan, Iran
| |
Collapse
|
45
|
Krause T, Röckendorf N, El-Sourani N, Ramaker K, Henkel M, Hauke S, Borschbach M, Frey A. Breeding Cell Penetrating Peptides: Optimization of Cellular Uptake by a Function-Driven Evolutionary Process. Bioconjug Chem 2018; 29:4020-4029. [PMID: 30380293 DOI: 10.1021/acs.bioconjchem.8b00583] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
In nature, building block-based biopolymers can adapt to functional and environmental demands by recombination and mutation of the monomer sequence. We present here an analogous, artificial evolutionary optimization process which we have applied to improve the functionality of cell-penetrating peptide molecules. The "evolution" consisted of repeated rounds of in silico peptide sequence alterations using a genetic algorithm followed by in vitro peptide synthesis, experimental analysis, and ranking according to their "fitness" (i.e., their ability to carry the cargo carboxyfluorescein into cultured cells). The genetic algorithm-based optimization method was customized and adapted from former successful applications in the lab to realize an early convergence and a minimum number of in vitro and in silico processing steps by configured settings derived from empirical in silico simulation. We started out with 20 "lead peptides" which we had previously identified as top performers regarding their ability to enter cultured cells. Ten breeding rounds comprising 240 peptides each yielded a peptide population of which the top 10 candidates displayed a 6-fold (median values) increase in its cell-penetration capability compared with the top 10 lead peptides, and two consensus sequences emerged which represent local fitness optima. In addition, the cell-penetrating potential could be proven independently of the carboxyfluorescein cargo in an alternative setting. Our results demonstrate that we have established a powerful optimization technology that can be used to further improve peptides with known functionality and adapt them to specific applications.
Collapse
Affiliation(s)
- Thorsten Krause
- Department of Mucosal Immunology and Diagnostics , Priority Area Asthma and Allergy, Research Center Borstel , 23845 Borstel , Germany ; Member of Leibniz Health Technologies
| | - Niels Röckendorf
- Department of Mucosal Immunology and Diagnostics , Priority Area Asthma and Allergy, Research Center Borstel , 23845 Borstel , Germany ; Member of Leibniz Health Technologies
| | - Nail El-Sourani
- Faculty of Computer Science , FHDW University of Applied Sciences , 51465 Bergisch Gladbach , Germany
| | - Katrin Ramaker
- Department of Mucosal Immunology and Diagnostics , Priority Area Asthma and Allergy, Research Center Borstel , 23845 Borstel , Germany ; Member of Leibniz Health Technologies
| | - Maik Henkel
- Department of Mucosal Immunology and Diagnostics , Priority Area Asthma and Allergy, Research Center Borstel , 23845 Borstel , Germany ; Member of Leibniz Health Technologies
| | - Sascha Hauke
- Faculty of Computer Science , FHDW University of Applied Sciences , 51465 Bergisch Gladbach , Germany
| | - Markus Borschbach
- Faculty of Computer Science , FHDW University of Applied Sciences , 51465 Bergisch Gladbach , Germany
| | - Andreas Frey
- Department of Mucosal Immunology and Diagnostics , Priority Area Asthma and Allergy, Research Center Borstel , 23845 Borstel , Germany ; Member of Leibniz Health Technologies
| |
Collapse
|
46
|
Qiang X, Zhou C, Ye X, Du PF, Su R, Wei L. CPPred-FL: a sequence-based predictor for large-scale identification of cell-penetrating peptides by feature representation learning. Brief Bioinform 2018; 21:11-23. [PMID: 30239616 DOI: 10.1093/bib/bby091] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Revised: 08/13/2018] [Accepted: 08/22/2018] [Indexed: 11/14/2022] Open
Abstract
Cell-penetrating peptides (CPPs) have been shown to be a transport vehicle for delivering cargoes into live cells, offering great potential as future therapeutics. It is essential to identify CPPs for better understanding of their functional mechanisms. Machine learning-based methods have recently emerged as a main approach for computational identification of CPPs. However, one of the main challenges and difficulties is to propose an effective feature representation model that sufficiently exploits the inner difference and relevance between CPPs and non-CPPs, in order to improve the predictive performance. In this paper, we have developed CPPred-FL, a powerful bioinformatics tool for fast, accurate and large-scale identification of CPPs. In our predictor, we introduce a new feature representation learning scheme that enables one to learn feature representations from totally 45 well-trained random forest models with multiple feature descriptors from different perspectives, such as compositional information, position-specific information and physicochemical properties, etc. We integrate class and probabilistic information into our feature representations. To improve the feature representation ability, we further remove redundant and irrelevant features by feature space optimization. Benchmarking experiments showed that CPPred-FL, using 19 informative features only, is able to achieve better performance than the state-of-the-art predictors. We anticipate that CPPred-FL will be a powerful tool for large-scale identification of CPPs, facilitating the characterization of their functional mechanisms and accelerating their applications in clinical therapy.
Collapse
Affiliation(s)
- Xiaoli Qiang
- Institute of Computing Science and Technology, Guangzhou University, Guangzhou, China
| | - Chen Zhou
- School of Computer Science and Technology, Tianjin University, Tianjin, China
| | - Xiucai Ye
- Department of Computer Science, University of Tsukuba, Tsukuba, Japan
| | - Pu-Feng Du
- School of Software, Tianjin University, Tianjin, China
| | - Ran Su
- School of Software, Tianjin University, Tianjin, China
| | - Leyi Wei
- School of Computer Science and Technology, Tianjin University, Tianjin, China
| |
Collapse
|
47
|
Peraro L, Kritzer JA. Emerging Methods and Design Principles for Cell-Penetrant Peptides. Angew Chem Int Ed Engl 2018; 57:11868-11881. [PMID: 29740917 PMCID: PMC7184558 DOI: 10.1002/anie.201801361] [Citation(s) in RCA: 96] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Revised: 04/24/2018] [Indexed: 12/12/2022]
Abstract
Biomolecules such as antibodies, proteins, and peptides are important tools for chemical biology and leads for drug development. They have been used to inhibit a variety of extracellular proteins, but accessing intracellular proteins has been much more challenging. In this review, we discuss diverse chemical approaches that have yielded cell-penetrant peptides and identify three distinct strategies: masking backbone amides, guanidinium group patterning, and amphipathic patterning. We summarize a growing number of large data sets, which are starting to reveal more specific design guidelines for each strategy. We also discuss advantages and disadvantages of current methods for quantifying cell penetration. Finally, we provide an overview of best-odds approaches for applying these new methods and design principles to optimize cytosolic penetration for a given bioactive peptide.
Collapse
Affiliation(s)
- Leila Peraro
- Department of Chemistry, Tufts University, Medford, Massachusetts, 02155, USA
| | - Joshua A Kritzer
- Department of Chemistry, Tufts University, Medford, Massachusetts, 02155, USA
| |
Collapse
|
48
|
Peraro L, Kritzer JA. Neue Methoden und Designprinzipien für zellgängige Peptide. Angew Chem Int Ed Engl 2018. [DOI: 10.1002/ange.201801361] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Leila Peraro
- Department of Chemistry Tufts University Medford Massachusetts 02155 USA
| | - Joshua A. Kritzer
- Department of Chemistry Tufts University Medford Massachusetts 02155 USA
| |
Collapse
|
49
|
|
50
|
Pandey P, Patel V, George NV, Mallajosyula SS. KELM-CPPpred: Kernel Extreme Learning Machine Based Prediction Model for Cell-Penetrating Peptides. J Proteome Res 2018; 17:3214-3222. [DOI: 10.1021/acs.jproteome.8b00322] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Poonam Pandey
- Department of Biological Engineering, Indian Institute of Technology Gandhinagar, Ahmedabad, Gujarat 382355, India
| | - Vinal Patel
- Department of Electrical Engineering, Indian Institute of Technology Gandhinagar, Ahmedabad, Gujarat 382355, India
| | - Nithin V. George
- Department of Electrical Engineering, Indian Institute of Technology Gandhinagar, Ahmedabad, Gujarat 382355, India
| | - Sairam S. Mallajosyula
- Department of Chemistry, Indian Institute of Technology Gandhinagar, Ahmedabad, Gujarat 382355, India
| |
Collapse
|