1
|
Huang J, Peng H, Yang D. Research advances in protein lysine 2-hydroxyisobutyrylation: From mechanistic regulation to disease relevance. J Cell Physiol 2024; 239:e31435. [PMID: 39351825 DOI: 10.1002/jcp.31435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 08/26/2024] [Accepted: 09/03/2024] [Indexed: 12/18/2024]
Abstract
Histone lysine 2-hydroxyisobutyrylation (Khib) was identified as a novel posttranslational modification in 2014. Significant progress has been made in understanding its roles in reproduction, development, and disease. Although 2-hydroxyisobutyrylation shares some overlapping modification sites and regulatory factors with other lysine residue modifications, its unique structure suggests distinct functions. This review summarizes the latest advancements in Khib, including its regulatory mechanisms, roles in mammalian physiological processes, and its relationship with diseases. This provides direction for further research on Khib and offers new perspectives for developing treatment strategies for related diseases.
Collapse
Affiliation(s)
- Jinglei Huang
- School of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, People's Republic of China
| | - Hui Peng
- School of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, People's Republic of China
| | - Diqi Yang
- School of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, People's Republic of China
| |
Collapse
|
2
|
Schofield LC, Dialpuri JS, Murshudov GN, Agirre J. Post-translational modifications in the Protein Data Bank. Acta Crystallogr D Struct Biol 2024; 80:647-660. [PMID: 39207896 PMCID: PMC11394121 DOI: 10.1107/s2059798324007794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Accepted: 08/07/2024] [Indexed: 09/04/2024] Open
Abstract
Proteins frequently undergo covalent modification at the post-translational level, which involves the covalent attachment of chemical groups onto amino acids. This can entail the singular or multiple addition of small groups, such as phosphorylation; long-chain modifications, such as glycosylation; small proteins, such as ubiquitination; as well as the interconversion of chemical groups, such as the formation of pyroglutamic acid. These post-translational modifications (PTMs) are essential for the normal functioning of cells, as they can alter the physicochemical properties of amino acids and therefore influence enzymatic activity, protein localization, protein-protein interactions and protein stability. Despite their inherent importance, accurately depicting PTMs in experimental studies of protein structures often poses a challenge. This review highlights the role of PTMs in protein structures, as well as the prevalence of PTMs in the Protein Data Bank, directing the reader to accurately built examples suitable for use as a modelling reference.
Collapse
Affiliation(s)
- Lucy C Schofield
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, United Kingdom
| | - Jordan S Dialpuri
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, United Kingdom
| | - Garib N Murshudov
- MRC Laboratory of Molecular Biology, University of Cambridge, Cambridge, United Kingdom
| | - Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, United Kingdom
| |
Collapse
|
3
|
Shaon MSH, Karim T, Sultan MF, Ali MM, Ahmed K, Hasan MZ, Moustafa A, Bui FM, Al-Zahrani FA. AMP-RNNpro: a two-stage approach for identification of antimicrobials using probabilistic features. Sci Rep 2024; 14:12892. [PMID: 38839785 PMCID: PMC11153637 DOI: 10.1038/s41598-024-63461-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Accepted: 05/29/2024] [Indexed: 06/07/2024] Open
Abstract
Antimicrobials are molecules that prevent the formation of microorganisms such as bacteria, viruses, fungi, and parasites. The necessity to detect antimicrobial peptides (AMPs) using machine learning and deep learning arises from the need for efficiency to accelerate the discovery of AMPs, and contribute to developing effective antimicrobial therapies, especially in the face of increasing antibiotic resistance. This study introduced AMP-RNNpro based on Recurrent Neural Network (RNN), an innovative model for detecting AMPs, which was designed with eight feature encoding methods that are selected according to four criteria: amino acid compositional, grouped amino acid compositional, autocorrelation, and pseudo-amino acid compositional to represent the protein sequences for efficient identification of AMPs. In our framework, two-stage predictions have been conducted. Initially, this study analyzed 33 models on these feature extractions. Then, we selected the best six models from these models using rigorous performance metrics. In the second stage, probabilistic features have been generated from the selected six models in each feature encoding and they are aggregated to be fed into our final meta-model called AMP-RNNpro. This study also introduced 20 features with SHAP, which are crucial in the drug development fields, where we discover AAC, ASDC, and CKSAAGP features are highly impactful for detection and drug discovery. Our proposed framework, AMP-RNNpro excels in the identification of novel Amps with 97.15% accuracy, 96.48% sensitivity, and 97.87% specificity. We built a user-friendly website for demonstrating the accurate prediction of AMPs based on the proposed approach which can be accessed at http://13.126.159.30/ .
Collapse
Affiliation(s)
- Md Shazzad Hossain Shaon
- Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh
| | - Tasmin Karim
- Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh
| | - Md Fahim Sultan
- Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh
| | - Md Mamun Ali
- Health Informatics Research Lab, Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh
- Division of Biomedical Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N 5A9, Canada
- Department of Software Engineering, Daffodil International University, Daffodil Smart City (DSC), Birulia, Savar, Dhaka, 1216, Bangladesh
| | - Kawsar Ahmed
- Health Informatics Research Lab, Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh.
- Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N 5A9, Canada.
- Group of Bio-photomatiχ, Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail, 1902, Bangladesh.
| | - Md Zahid Hasan
- Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh
- Health Informatics Research Lab, Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh
| | - Ahmed Moustafa
- Department of Human Anatomy and Physiology, The Faculty of Health Sciences, University of Johannesburg, Johannesburg, South Africa
- School of Psychology, Centre for Data Analytics, Bond University, Gold Coast, QLD, Australia
| | - Francis M Bui
- Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N 5A9, Canada
| | | |
Collapse
|
4
|
Jiang H, Shang S, Sha Y, Zhang L, He N, Li L. EdeepSADPr: an extensive deep-learning architecture for prediction of the in situ crosstalks of serine phosphorylation and ADP-ribosylation. Front Cell Dev Biol 2023; 11:1149535. [PMID: 37187615 PMCID: PMC10175571 DOI: 10.3389/fcell.2023.1149535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2023] [Accepted: 04/17/2023] [Indexed: 05/17/2023] Open
Abstract
The in situ post-translational modification (PTM) crosstalk refers to the interactions between different types of PTMs that occur on the same residue site of a protein. The crosstalk sites generally have different characteristics from those with the single PTM type. Studies targeting the latter's features have been widely conducted, while studies on the former's characteristics are rare. For example, the characteristics of serine phosphorylation (pS) and serine ADP-ribosylation (SADPr) have been investigated, whereas those of their in situ crosstalks (pSADPr) are unknown. In this study, we collected 3,250 human pSADPr, 7,520 SADPr, 151,227 pS and 80,096 unmodified serine sites and explored the features of the pSADPr sites. We found that the characteristics of pSADPr sites are more similar to those of SADPr compared to pS or unmodified serine sites. Moreover, the crosstalk sites are likely to be phosphorylated by some kinase families (e.g., AGC, CAMK, STE and TKL) rather than others (e.g., CK1 and CMGC). Additionally, we constructed three classifiers to predict pSADPr sites from the pS dataset, the SADPr dataset and the protein sequences separately. We built and evaluated five deep-learning classifiers in ten-fold cross-validation and independent test datasets. We also used the classifiers as base classifiers to develop a few stacking-based ensemble classifiers to improve performance. The best classifiers had the AUC values of 0.700, 0.914 and 0.954 for recognizing pSADPr sites from the SADPr, pS and unmodified serine sites, respectively. The lowest prediction accuracy was achieved by separating pSADPr and SADPr sites, which is consistent with the observation that pSADPr's characteristics are more similar to those of SADPr than the rest. Finally, we developed an online tool for extensively predicting human pSADPr sites based on the CNNOH classifier, dubbed EdeepSADPr. It is freely available through http://edeepsadpr.bioinfogo.org/. We expect our investigation will promote a comprehensive understanding of crosstalks.
Collapse
Affiliation(s)
- Haoqiang Jiang
- College of Basic Medicine, Qingdao University, Qingdao, China
- Sino Genomics Technology Co., Ltd., Qingdao, China
| | - Shipeng Shang
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Yutong Sha
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Lin Zhang
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Ningning He
- College of Basic Medicine, Qingdao University, Qingdao, China
| | - Lei Li
- College of Basic Medicine, Qingdao University, Qingdao, China
- Faculty of Biomedical and Rehabilitation Engineering, University of Health and Rehabilitation Sciences, Qingdao, China
- *Correspondence: Lei Li,
| |
Collapse
|
5
|
Zhao J, Jiang H, Zou G, Lin Q, Wang Q, Liu J, Ma L. CNNArginineMe: A CNN structure for training models for predicting arginine methylation sites based on the One-Hot encoding of peptide sequence. Front Genet 2022; 13:1036862. [PMID: 36324513 PMCID: PMC9618650 DOI: 10.3389/fgene.2022.1036862] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 10/04/2022] [Indexed: 11/30/2022] Open
Abstract
Protein arginine methylation (PRme), as one post-translational modification, plays a critical role in numerous cellular processes and regulates critical cellular functions. Though several in silico models for predicting PRme sites have been reported, new models may be required to develop due to the significant increase of identified PRme sites. In this study, we constructed multiple machine-learning and deep-learning models. The deep-learning model CNN combined with the One-Hot coding showed the best performance, dubbed CNNArginineMe. CNNArginineMe performed best in AUC scoring metrics in comparisons with several reported predictors. Additionally, we employed CNNArginineMe to predict arginine methylation proteome and performed functional analysis. The arginine methylated proteome is significantly enriched in the amyotrophic lateral sclerosis (ALS) pathway. CNNArginineMe is freely available at https://github.com/guoyangzou/CNNArginineMe.
Collapse
Affiliation(s)
- Jiaojiao Zhao
- Cancer Institute of the Affiliated Hospital of Qingdao University and Qingdao Cancer Institute, Qingdao University, Qingdao, China
- School of Basic Medicine, Qingdao University, Qingdao, China
| | - Haoqiang Jiang
- School of Basic Medicine, Qingdao University, Qingdao, China
| | - Guoyang Zou
- School of Basic Medicine, Qingdao University, Qingdao, China
| | - Qian Lin
- Cancer Institute of the Affiliated Hospital of Qingdao University and Qingdao Cancer Institute, Qingdao University, Qingdao, China
| | - Qiang Wang
- Oncology Department, Shandong Second Provincial General Hospital, Jinan, China
| | - Jia Liu
- Department of Pharmacology, School of Pharmacy, Qingdao University, Qingdao, China
| | - Leina Ma
- Cancer Institute of the Affiliated Hospital of Qingdao University and Qingdao Cancer Institute, Qingdao University, Qingdao, China
- *Correspondence: Leina Ma,
| |
Collapse
|
6
|
Zhu Y, Liu Y, Chen Y, Li L. ResSUMO: A Deep Learning Architecture Based on Residual Structure for Prediction of Lysine SUMOylation Sites. Cells 2022; 11:2646. [PMID: 36078053 PMCID: PMC9454673 DOI: 10.3390/cells11172646] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 08/18/2022] [Accepted: 08/22/2022] [Indexed: 12/26/2022] Open
Abstract
Lysine SUMOylation plays an essential role in various biological functions. Several approaches integrating various algorithms have been developed for predicting SUMOylation sites based on a limited dataset. Recently, the number of identified SUMOylation sites has significantly increased due to investigation at the proteomics scale. We collected modification data and found the reported approaches had poor performance using our collected data. Therefore, it is essential to explore the characteristics of this modification and construct prediction models with improved performance based on an enlarged dataset. In this study, we constructed and compared 16 classifiers by integrating four different algorithms and four encoding features selected from 11 sequence-based or physicochemical features. We found that the convolution neural network (CNN) model integrated with residue structure, dubbed ResSUMO, performed favorably when compared with the traditional machine learning and CNN models in both cross-validation and independent tests. The area under the receiver operating characteristic (ROC) curve for ResSUMO was around 0.80, superior to that of the reported predictors. We also found that increasing the depth of neural networks in the CNN models did not improve prediction performance due to the degradation problem, but the residual structure could be included to optimize the neural networks and improve performance. This indicates that residual neural networks have the potential to be broadly applied in the prediction of other types of modification sites with great effectiveness and robustness. Furthermore, the online ResSUMO service is freely accessible.
Collapse
Affiliation(s)
- Yafei Zhu
- College of Computer Science and Technology, Qingdao University, Qingdao 266071, China
| | - Yuhai Liu
- Dawning International Information Industry, Co., Ltd., Qingdao 266101, China
| | - Yu Chen
- College of Computer Science and Technology, Qingdao University, Qingdao 266071, China
| | - Lei Li
- College of Computer Science and Technology, Qingdao University, Qingdao 266071, China
- Faculty of Biomedical and Rehabilitation Engineering, University of Health and Rehabilitation Sciences, Qingdao 266001, China
| |
Collapse
|
7
|
Bourdillon AT, Shah HP, Cohen O, Hajek MA, Mehra S. Novel Machine Learning Model to Predict Interval of Oral Cancer Recurrence for Surveillance Stratification. Laryngoscope 2022. [DOI: 10.1002/lary.30351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 07/29/2022] [Accepted: 08/01/2022] [Indexed: 12/24/2022]
Affiliation(s)
| | - Hemali P. Shah
- Yale University School of Medicine New Haven Connecticut U.S.A
| | - Oded Cohen
- Division of Otolaryngology–Head and Neck Surgery, Department of Surgery Yale University School of Medicine New Haven Connecticut U.S.A
| | - Michael A. Hajek
- Division of Otolaryngology–Head and Neck Surgery, Department of Surgery Yale University School of Medicine New Haven Connecticut U.S.A
| | - Saral Mehra
- Division of Otolaryngology–Head and Neck Surgery, Department of Surgery Yale University School of Medicine New Haven Connecticut U.S.A
- Yale Cancer Center New Haven Connecticut U.S.A
| |
Collapse
|
8
|
Mini-review: Recent advances in post-translational modification site prediction based on deep learning. Comput Struct Biotechnol J 2022; 20:3522-3532. [PMID: 35860402 PMCID: PMC9284371 DOI: 10.1016/j.csbj.2022.06.045] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 06/21/2022] [Accepted: 06/21/2022] [Indexed: 11/23/2022] Open
Abstract
Post-translational modifications (PTMs) are closely linked to numerous diseases, playing a significant role in regulating protein structures, activities, and functions. Therefore, the identification of PTMs is crucial for understanding the mechanisms of cell biology and diseases therapy. Compared to traditional machine learning methods, the deep learning approaches for PTM prediction provide accurate and rapid screening, guiding the downstream wet experiments to leverage the screened information for focused studies. In this paper, we reviewed the recent works in deep learning to identify phosphorylation, acetylation, ubiquitination, and other PTM types. In addition, we summarized PTM databases and discussed future directions with critical insights.
Collapse
Key Words
- AAindex, Amino acid index
- ATP, Adenosine triphosphate
- AUC, Area under curve
- Ac, Acetylation
- BE, Binary encoding
- BLOSUM, Blocks substitution matrix
- Bi-LSTM, Bidirectional LSTM
- CKSAAP, Composition of k-spaced amino acid Pairs
- CNN, Convolutional neural network
- CNNOH, CNN with the one-hot encoding
- CNNWE, CNN with the word-embedding encoding
- CNNrgb, CNN red green blue
- CV, Cross-validation
- DC-CNN, Densely connected convolutional neural network
- DL, Deep learning
- DNNs, Deep neural networks
- Deep learning
- E. coli, Escherichia coli
- EBGW, Encoding based on grouped weight
- EGAAC, Enhanced grouped amino acids content
- IG, Information gain
- K, Lysine
- KNN, k nearest neighbor
- LASSO, Least absolute shrinkage and selection operator
- LSTM, Long short-term memory
- LSTMWE, LSTM with the word-embedding encoding
- M.musculus, Mus musculus
- MDC, Modular densely connected convolutional networks
- MDCAN, Multilane dense convolutional attention network
- ML, Machine learning
- MLP, Multilayer perceptron
- MMI, Multivariate mutual information
- Machine learning
- Mass spectrometry
- NMBroto, Normalized Moreau-Broto autocorrelation
- P, Proline
- PSP, PhosphoSitePlus
- PSSM, Position-specific scoring matrix
- PTM, Post-translational modifications
- Ph, Phosphorylation
- Post-translational modification
- Prediction
- PseAAC, Pseudo-amino acid composition
- R, Arginine
- RF, Random forest
- RNN, Recurrent neural network
- ROC, Receiver operating characteristic
- S, Serine
- S. typhimurium, Salmonella typhimurium
- S.cerevisiae, Saccharomyces cerevisiae
- SE, Squeeze and excitation
- SEV, Split to Equal Validation
- ST, Source and target
- SUMO, Small ubiquitin-like modifier
- SVM, Support vector machines
- T, Threonine
- Ub, Ubiquitination
- Y, Tyrosine
- ZSL, Zero-shot learning
Collapse
|
9
|
Development of an experiment-split method for benchmarking the generalization of a PTM site predictor: Lysine methylome as an example. PLoS Comput Biol 2021; 17:e1009682. [PMID: 34879076 PMCID: PMC8687584 DOI: 10.1371/journal.pcbi.1009682] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Revised: 12/20/2021] [Accepted: 11/25/2021] [Indexed: 11/19/2022] Open
Abstract
Many computational classifiers have been developed to predict different types of post-translational modification sites. Their performances are measured using cross-validation or independent test, in which experimental data from different sources are mixed and randomly split into training and test sets. However, the self-reported performances of most classifiers based on this measure are generally higher than their performances in the application of new experimental data. It suggests that the cross-validation method overestimates the generalization ability of a classifier. Here, we proposed a generalization estimate method, dubbed experiment-split test, where the experimental sources for the training set are different from those for the test set that simulate the data derived from a new experiment. We took the prediction of lysine methylome (Kme) as an example and developed a deep learning-based Kme site predictor (called DeepKme) with outstanding performance. We assessed the experiment-split test by comparing it with the cross-validation method. We found that the performance measured using the experiment-split test is lower than that measured in terms of cross-validation. As the test data of the experiment-split method were derived from an independent experimental source, this method could reflect the generalization of the predictor. Therefore, we believe that the experiment-split method can be applied to benchmark the practical performance of a given PTM model. DeepKme is free accessible via https://github.com/guoyangzou/DeepKme. The performance of a model for predicting post-translational modification sites is commonly evaluated using the cross-validation method, where the data derived from different experimental sources are mixed and randomly separated into the training dataset and validation dataset. However, the performance measured through cross-validation is generally higher than the performance in the application of new experimental data, indicating that the cross-validation method overestimates the generalization of a model. In this study, we proposed a generalization estimate method, dubbed experiment-split test, where the experimental sources for the training set are different from those for the test set that simulate the data derived from a new experiment. We took the prediction of lysine methylome as an example and developed a deep learning-based Kme site predictor DeepKme with outstanding performance. We found that the performance measured by the experiment-split method is lower than that measured in terms of cross-validation. As the test data of the experiment-split method were derived from an independent experimental source, this method could reflect the generalization of the prediction model. Therefore, the experiment-split method can be applied to benchmark the practical prediction performance.
Collapse
|
10
|
Lv H, Zhang Y, Wang JS, Yuan SS, Sun ZJ, Dao FY, Guan ZX, Lin H, Deng KJ. iRice-MS: An integrated XGBoost model for detecting multitype post-translational modification sites in rice. Brief Bioinform 2021; 23:6447435. [PMID: 34864888 DOI: 10.1093/bib/bbab486] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 10/05/2021] [Accepted: 10/23/2021] [Indexed: 12/13/2022] Open
Abstract
Post-translational modification (PTM) refers to the covalent and enzymatic modification of proteins after protein biosynthesis, which orchestrates a variety of biological processes. Detecting PTM sites in proteome scale is one of the key steps to in-depth understanding their regulation mechanisms. In this study, we presented an integrated method based on eXtreme Gradient Boosting (XGBoost), called iRice-MS, to identify 2-hydroxyisobutyrylation, crotonylation, malonylation, ubiquitination, succinylation and acetylation in rice. For each PTM-specific model, we adopted eight feature encoding schemes, including sequence-based features, physicochemical property-based features and spatial mapping information-based features. The optimal feature set was identified from each encoding, and their respective models were established. Extensive experimental results show that iRice-MS always display excellent performance on 5-fold cross-validation and independent dataset test. In addition, our novel approach provides the superiority to other existing tools in terms of AUC value. Based on the proposed model, a web server named iRice-MS was established and is freely accessible at http://lin-group.cn/server/iRice-MS.
Collapse
Affiliation(s)
- Hao Lv
- Center for Informational Biology at University of Electronic Science and Technology of China, China
| | - Yang Zhang
- Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, China
| | - Jia-Shu Wang
- Center for Informational Biology at University of Electronic Science and Technology of China, China
| | - Shi-Shi Yuan
- Center for Informational Biology at University of Electronic Science and Technology of China, China
| | - Zi-Jie Sun
- Center for Informational Biology at University of Electronic Science and Technology of China, China
| | - Fu-Ying Dao
- Center for Informational Biology at University of Electronic Science and Technology of China, China
| | - Zheng-Xing Guan
- Center for Informational Biology at University of Electronic Science and Technology of China, China
| | - Hao Lin
- Center for Informational Biology at University of Electronic Science and Technology of China, China
| | - Ke-Jun Deng
- Center for Informational Biology at University of Electronic Science and Technology of China, China
| |
Collapse
|
11
|
Qi T, Li J, Wang H, Han X, Li J, Du J. Global analysis of protein lysine 2-hydroxyisobutyrylation (K hib) profiles in Chinese herb rhubarb (Dahuang). BMC Genomics 2021; 22:542. [PMID: 34266380 PMCID: PMC8283887 DOI: 10.1186/s12864-021-07847-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 06/21/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Lysine 2-hydroxyisobutyrylation (Khib) is a newly discovered protein posttranslational modification (PTM) and is involved in the broad-spectrum regulation of cellular processes that are found in both prokaryotic and eukaryotic cells, including in plants. The Chinese herb rhubarb (Dahuang) is one of the most widely used traditional Chinese medicines in clinical applications. To better understand the physiological activities and mechanism of treating diseases with the herb, it is necessary to conduct intensive research on rhubarb. However, Khib modification has not been reported thus far in rhubarb. RESULTS In this study, we performed the first global analysis of Khib-modified proteins in rhubarb by using sensitive affinity enrichment combined with high-accuracy HPLC-MS/MS tandem spectrometry. A total of 4333 overlapping Khib modification peptides matched on 1525 Khib-containing proteins were identified in three independent tests. Bioinformatics analysis showed that these Khib-containing proteins are involved in a wide range of cellular processes, particularly in protein biosynthesis and central carbon metabolism and are distributed mainly in chloroplasts, cytoplasm, nucleus and mitochondria. In addition, the amino acid sequence motif analysis showed that a negatively charged side chain residue (E), a positively charged residue (K), and an uncharged residue with the smallest side chain (G) were strongly preferred around the Khib site, and a total of 13 Khib modification motifs were identified. These identified motifs can be classified into three motif patterns, and some motif patterns are unique to rhubarb and have not been identified in other plants to date. CONCLUSIONS A total of 4333 Khib-modified peptides on 1525 proteins were identified. The Khib-modified proteins are mainly distributed in the chloroplast, cytoplasm, nucleus and mitochondria, and involved in a wide range of cellular processes. Moreover, three types of amino acid sequence motif patterns, including EKhib/KhibE, GKhib and k.kkk….Khib….kkkkk, were extracted from a total of 13 Khib-modified peptides. This study provides comprehensive Khib-proteome resource of rhubarb. The findings from the study contribute to a better understanding of the physiological roles of Khib modification, and the Khib proteome data will facilitate further investigations of the roles and mechanisms of Khib modification in rhubarb.
Collapse
Affiliation(s)
- Tong Qi
- Shandong Provincial Key Laboratory of Dryland Farming Technology, College of Agronomy,, Qingdao Agricultural University, Shandong, 266109, Qingdao, China
| | - Jinping Li
- International Cooperation Department of Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
| | - Huifang Wang
- Shandong Provincial Key Laboratory of Dryland Farming Technology, College of Agronomy,, Qingdao Agricultural University, Shandong, 266109, Qingdao, China
| | - Xiaofan Han
- Shandong Provincial Key Laboratory of Dryland Farming Technology, College of Agronomy,, Qingdao Agricultural University, Shandong, 266109, Qingdao, China
| | - Junrong Li
- Bathurst Future Agri-Tech Institute of Qingdao Agricultural University, Qingdao, China
| | - Jinzhe Du
- Shandong Provincial Key Laboratory of Dryland Farming Technology, College of Agronomy,, Qingdao Agricultural University, Shandong, 266109, Qingdao, China.
| |
Collapse
|