1
|
Arend M, Paulitz E, Hsieh YE, Nikoloski Z. Scaling metabolic model reconstruction up to the pan-genome level: A systematic review and prospective applications to photosynthetic organisms. Metab Eng 2025; 90:67-77. [PMID: 40081464 DOI: 10.1016/j.ymben.2025.02.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Revised: 02/11/2025] [Accepted: 02/25/2025] [Indexed: 03/16/2025]
Abstract
Advances in genomics technologies have generated large data sets that provide tremendous insights into the genetic diversity of taxonomic groups. However, it remains challenging to pinpoint the effect of genetic diversity on different traits without performing resource-intensive phenotyping experiments. Pan-genome-scale metabolic models (panGEMs) extend traditional genome-scale metabolic models by considering the entire reaction repertoire that enables the prediction and comparison of metabolic capabilities within a taxonomic group. Here, we systematically review the state-of-the-art methodologies for constructing panGEMs, focusing on used tools, databases, experimental datasets, and orthology relationships. We highlight the unique advantages of panGEMs compared to single-species GEMs in predicting metabolic phenotypes and in guiding the experimental validation of genome annotations. In addition, we emphasize the disparity between the available (pan-)genomic data on photosynthetic organisms and their under-representation in current (pan)GEMs. Finally, we propose a perspective for tackling the reconstruction of panGEMs for photosynthetic eukaryotes that can help advance our understanding of the metabolic diversity in this taxonomic group.
Collapse
Affiliation(s)
- Marius Arend
- Bioinformatics Department, Institute of Biochemistry and Biology, University of Potsdam, 14476 Potsdam, Germany; Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam, Germany; Bioinformatics and Mathematical Modeling Department, Center of Plant Systems Biology and Biotechnology, 4000 Plovdiv, Bulgaria
| | - Emilian Paulitz
- Bioinformatics Department, Institute of Biochemistry and Biology, University of Potsdam, 14476 Potsdam, Germany; Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam, Germany
| | - Yunli Eric Hsieh
- Bioinformatics Department, Institute of Biochemistry and Biology, University of Potsdam, 14476 Potsdam, Germany; Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam, Germany; School of BioSciences, The University of Melbourne, Parkville, 3010 VIC, Australia
| | - Zoran Nikoloski
- Bioinformatics Department, Institute of Biochemistry and Biology, University of Potsdam, 14476 Potsdam, Germany; Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam, Germany; Bioinformatics and Mathematical Modeling Department, Center of Plant Systems Biology and Biotechnology, 4000 Plovdiv, Bulgaria.
| |
Collapse
|
2
|
Singh P, Pandey B, Purwar S, Nair RM, Pratap A. Genome-wide identification and characterization of Subtilisin-like Serine protease encoding genes in Vigna radiata L. Wilczek. Sci Rep 2025; 15:13284. [PMID: 40246940 PMCID: PMC12006535 DOI: 10.1038/s41598-025-95331-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Accepted: 03/20/2025] [Indexed: 04/19/2025] Open
Abstract
Subtilisin-like serine proteases (SBTs) are serine proteolytic enzymes that play various roles in plant growth, function and stress responses. Vigna glabrescens, a wild relative of mungbean known to be a potential donor of photo- and thermoperiod insensitivity, was characterized for thermotolerance through reproductive biology and gene expression profiling. Whole-genome sequencing of this species has not yet been performed; hence, genome-wide analysis of this species has not been explored. In the present study, a systematic analysis of SBT-encoding genes in the V. radiata (Vradi_SBT) genome was conducted, with a focus on their response during flower development under different temperature regimes, such as optimum temperature, heat and cold stresses, in Vigna radiata and a wild relative, Vigna glabrescens. Thirty-eight Vradi_SBT genes were identified in the V. radiata genome and were further grouped into five distinct subgroups. The key domain of the SBT peptidase, "peptidase_S8_53," was found in all 38 Vradi_SBT proteins, while 28 of them contained the "peptidase_S8" domain. Additionally, 30 of these proteins contained a maximum of 10 motifs. A total of 22 orthologous genes were identified in Arabidopsis thaliana, whereas paralogous gene pairs were detected as tandemly duplicated genes with V. unguiculata. Cis-acting element analysis revealed that these genes presented more stress-responsive promoter sequences than the other promoters. Furthermore, Vradi_SBT-1.9 was found significantly upregulated under both high- and low-temperature stresses. This study provides insights into SBT-encoding genes and their possible role in flower development and thermotolerance in Vigna species.
Collapse
Affiliation(s)
- Poornima Singh
- Department of Biotechnology, School of Life Sciences, Mahatma Gandhi Central University, Motihari, 845 401, India
| | - Brijesh Pandey
- Department of Biotechnology, School of Life Sciences, Mahatma Gandhi Central University, Motihari, 845 401, India
| | - Shalini Purwar
- Department of Basic and Social Sciences, Banda University of Agriculture and Technology, Banda, 210 001, India.
| | | | - Aditya Pratap
- Project Coordination Unit, All India Coordinated Research Project on Kharif Pulses, ICAR-Indian Institute of Pulses Research, Kanpur, 208 024, India.
| |
Collapse
|
3
|
Gillani M, Pollastri G. Protein subcellular localization prediction tools. Comput Struct Biotechnol J 2024; 23:1796-1807. [PMID: 38707539 PMCID: PMC11066471 DOI: 10.1016/j.csbj.2024.04.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 04/11/2024] [Accepted: 04/11/2024] [Indexed: 05/07/2024] Open
Abstract
Protein subcellular localization prediction is of great significance in bioinformatics and biological research. Most of the proteins do not have experimentally determined localization information, computational prediction methods and tools have been acting as an active research area for more than two decades now. Knowledge of the subcellular location of a protein provides valuable information about its functionalities, the functioning of the cell, and other possible interactions with proteins. Fast, reliable, and accurate predictors provides platforms to harness the abundance of sequence data to predict subcellular locations accordingly. During the last decade, there has been a considerable amount of research effort aimed at developing subcellular localization predictors. This paper reviews recent subcellular localization prediction tools in the Eukaryotic, Prokaryotic, and Virus-based categories followed by a detailed analysis. Each predictor is discussed based on its main features, strengths, weaknesses, algorithms used, prediction techniques, and analysis. This review is supported by prediction tools taxonomies that highlight their rele- vant area and examples for uncomplicated categorization and ease of understandability. These taxonomies help users find suitable tools according to their needs. Furthermore, recent research gaps and challenges are discussed to cover areas that need the utmost attention. This survey provides an in-depth analysis of the most recent prediction tools to facilitate readers and can be considered a quick guide for researchers to identify and explore the recent literature advancements.
Collapse
Affiliation(s)
- Maryam Gillani
- School of Computer Science, University College Dublin (UCD), Dublin, D04 V1W8, Ireland
| | - Gianluca Pollastri
- School of Computer Science, University College Dublin (UCD), Dublin, D04 V1W8, Ireland
| |
Collapse
|
4
|
Hu G, Moon J, Hayashi T. Protein Classes Predicted by Molecular Surface Chemical Features: Machine Learning-Assisted Classification of Cytosol and Secreted Proteins. J Phys Chem B 2024; 128:8423-8436. [PMID: 39185763 PMCID: PMC11382266 DOI: 10.1021/acs.jpcb.4c02461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Chemical structures of protein surfaces govern intermolecular interaction, and protein functions include specific molecular recognition, transport, self-assembly, etc. Therefore, the relationship between the chemical structure and protein functions provides insights into the understanding of the mechanism underlying protein functions and developments of new biomaterials. In this study, we analyze protein surface features, including surface amino acid populations and secondary structure ratios, instead of entire sequences as input for the classifier, intending to provide deeper insights into the determination of protein classes (cytosol or secreted). We employed a random forest-based classifier for the prediction of protein locations. Our training and testing data sets consisting of secreted and cytosol proteins were constructed using filtered information from UniProt and 3D structures from AlphaFold. The classifier achieved a testing accuracy of 93.9% with a feature importance ranking and quantitative boundary values for the top three features. We discuss the significance of these features quantitatively and the hidden rules to determine the protein classes (cytosol or secreted).
Collapse
Affiliation(s)
- Guanghao Hu
- Department of Materials Science and Engineering, School of Materials Science and Chemical Technology, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama-shi, Kanagawa-ken 226-8502, Japan
| | - Jooa Moon
- Department of Materials Science and Engineering, School of Materials Science and Chemical Technology, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama-shi, Kanagawa-ken 226-8502, Japan
| | - Tomohiro Hayashi
- Department of Materials Science and Engineering, School of Materials Science and Chemical Technology, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama-shi, Kanagawa-ken 226-8502, Japan
- The Institute for Solid State Physics, The University of Tokyo, 5-1-5, Kashiwanoha, Kashiwa, Chiba 277-0882, Japan
| |
Collapse
|
5
|
Thagun C, Odahara M, Kodama Y, Numata K. Identification of a highly efficient chloroplast-targeting peptide for plastid engineering. PLoS Biol 2024; 22:e3002785. [PMID: 39298532 PMCID: PMC11444414 DOI: 10.1371/journal.pbio.3002785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 10/01/2024] [Accepted: 08/03/2024] [Indexed: 09/22/2024] Open
Abstract
Plastids are pivotal target organelles for comprehensively enhancing photosynthetic and metabolic traits in plants via plastid engineering. Plastidial proteins predominantly originate in the nucleus and must traverse membrane-bound multiprotein translocons to access these organelles. This import process is meticulously regulated by chloroplast-targeting peptides (cTPs). Whereas many cTPs have been employed to guide recombinantly expressed functional proteins to chloroplasts, there is a critical need for more efficient cTPs. Here, we performed a comprehensive exploration and comparative assessment of an advanced suite of cTPs exhibiting superior targeting capabilities. We employed a multifaceted approach encompassing computational prediction, in planta expression, fluorescence tracking, and in vitro chloroplast import studies to identify and analyze 88 cTPs associated with Arabidopsis thaliana mutants with phenotypes linked to chloroplast function. These polypeptides exhibited distinct abilities to transport green fluorescent protein (GFP) to various compartments within leaf cells, particularly chloroplasts. A highly efficient cTP derived from Arabidopsis plastid ribosomal protein L35 (At2g24090) displayed remarkable effectiveness in chloroplast localization. This cTP facilitated the activities of chloroplast-targeted RNA-processing proteins and metabolic enzymes within plastids. This cTP could serve as an ideal transit peptide for precisely targeting biomolecules to plastids, leading to advancements in plastid engineering.
Collapse
Affiliation(s)
- Chonprakun Thagun
- Department of Material Chemistry, Graduate School of Engineering, Kyoto University, Kyoto-Daigaku-Katsura, Kyoto, Japan
- Center for Bioscience Research and Education, Utsunomiya University, Tochigi, Japan
| | - Masaki Odahara
- Biomacromolecules Research Team, RIKEN Center for Sustainable Resource Science, Saitama, Japan
| | - Yutaka Kodama
- Center for Bioscience Research and Education, Utsunomiya University, Tochigi, Japan
- Biomacromolecules Research Team, RIKEN Center for Sustainable Resource Science, Saitama, Japan
| | - Keiji Numata
- Department of Material Chemistry, Graduate School of Engineering, Kyoto University, Kyoto-Daigaku-Katsura, Kyoto, Japan
- Biomacromolecules Research Team, RIKEN Center for Sustainable Resource Science, Saitama, Japan
| |
Collapse
|
6
|
Kant R, Khan MS, Chopra M, Saluja D. Artificial intelligence-driven reverse vaccinology for Neisseria gonorrhoeae vaccine: Prioritizing epitope-based candidates. Front Mol Biosci 2024; 11:1442158. [PMID: 39193221 PMCID: PMC11347834 DOI: 10.3389/fmolb.2024.1442158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2024] [Accepted: 07/04/2024] [Indexed: 08/29/2024] Open
Abstract
Neisseria gonorrhoeae is the causative agent of the sexually transmitted disease gonorrhea. The increasing prevalence of this disease worldwide, the rise of antibiotic-resistant strains, and the difficulties in treatment necessitate the development of a vaccine, highlighting the significance of preventative measures to control and eradicate the infection. Currently, there is no widely available vaccine, partly due to the bacterium's ability to evade natural immunity and the limited research investment in gonorrhea compared to other diseases. To identify distinct vaccine candidates, we chose to focus on the uncharacterized, hypothetical proteins (HPs) as our initial approach. Using the in silico method, we first carried out a comprehensive assessment of hypothetical proteins of Neisseria gonorrhoeae, encompassing assessments of physicochemical properties, cellular localization, secretary pathways, transmembrane regions, antigenicity, toxicity, and prediction of B-cell and T-cell epitopes, among other analyses. Detailed analysis of all HPs resulted in the functional annotation of twenty proteins with a great degree of confidence. Further, using the immuno-informatics approach, the prediction pipeline identified one CD8+ restricted T-cell epitope, seven linear B-cell epitopes, and seven conformational B-cell epitopes as putative epitope-based peptide vaccine candidates which certainly require further validation in laboratory settings. The study accentuates the promise of functional annotation and immuno-informatics in the systematic design of epitope-based peptide vaccines targeting Neisseria gonorrhoeae.
Collapse
Affiliation(s)
- Ravi Kant
- Medical Biotechnology Laboratory, Dr. B. R. Ambedkar Center for Biomedical Research, University of Delhi, Delhi, India
- Delhi School of Public Health, Institute of Eminence (IoE), University of Delhi, Delhi, India
| | - Mohd. Shoaib Khan
- Laboratory of Molecular Modeling and Anticancer Drug Development, Dr. B. R. Ambedkar Center for Biomedical Research, University of Delhi, Delhi, India
| | - Madhu Chopra
- Laboratory of Molecular Modeling and Anticancer Drug Development, Dr. B. R. Ambedkar Center for Biomedical Research, University of Delhi, Delhi, India
| | - Daman Saluja
- Medical Biotechnology Laboratory, Dr. B. R. Ambedkar Center for Biomedical Research, University of Delhi, Delhi, India
- Delhi School of Public Health, Institute of Eminence (IoE), University of Delhi, Delhi, India
| |
Collapse
|
7
|
Chen J, Fang M, Li Y, Ding H, Zhang X, Jiang X, Zhang J, Zhang C, Lu Z, Luo M. Cell surface protein-protein interaction profiling for biological network analysis and novel target discovery. LIFE MEDICINE 2024; 3:lnae031. [PMID: 39872863 PMCID: PMC11749001 DOI: 10.1093/lifemedi/lnae031] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Accepted: 08/28/2024] [Indexed: 01/30/2025]
Abstract
The secretome is composed of cell surface membrane proteins and extracellular secreted proteins that are synthesized via secretory machinery, accounting for approximately one-third of human protein-encoding genes and playing central roles in cellular communication with the external environment. Secretome protein-protein interactions (SPPIs) mediate cell proliferation, apoptosis, and differentiation, as well as stimulus- or cell-specific responses that regulate a diverse range of biological processes. Aberrant SPPIs are associated with diseases including cancer, immune disorders, and illness caused by infectious pathogens. Identifying the receptor/ligand for a secretome protein or pathogen can be a challenging task, and many SPPIs remain obscure, with a large number of orphan receptors and ligands, as well as viruses with unknown host receptors, populating the SPPI network. In addition, proteins with known receptors/ligands may also interact with alternative uncharacterized partners and exert context-dependent effects. In the past few decades, multiple varied approaches have been developed to identify SPPIs, and these methods have broad applications in both basic and translational research. Here, we review and discuss the technologies for SPPI profiling and the application of these technologies in identifying novel targets for immunotherapy and anti-infectious agents.
Collapse
Affiliation(s)
- Jiaojiao Chen
- Institute of Pediatrics, Children’s Hospital of Fudan University, and Shanghai Key Laboratory of Medical Epigenetics, International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Maoxin Fang
- Institute of Pediatrics, Children’s Hospital of Fudan University, and Shanghai Key Laboratory of Medical Epigenetics, International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Yuwei Li
- Institute of Pediatrics, Children’s Hospital of Fudan University, and Shanghai Key Laboratory of Medical Epigenetics, International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Haodong Ding
- Institute of Pediatrics, Children’s Hospital of Fudan University, and Shanghai Key Laboratory of Medical Epigenetics, International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Xinyu Zhang
- Institute of Pediatrics, Children’s Hospital of Fudan University, and Shanghai Key Laboratory of Medical Epigenetics, International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Xiaoyi Jiang
- Institute of Pediatrics, Children’s Hospital of Fudan University, and Shanghai Key Laboratory of Medical Epigenetics, International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Jinlan Zhang
- The Fifth People’s Hospital of Shanghai, Fudan University, Shanghai 200240, China
| | - Chengcheng Zhang
- Department of Physiology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
| | - Zhigang Lu
- The Fifth People’s Hospital of Shanghai, Fudan University, Shanghai 200240, China
- Shanghai Institute of Infectious Diseases and Biosecurity, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Min Luo
- Institute of Pediatrics, Children’s Hospital of Fudan University, and Shanghai Key Laboratory of Medical Epigenetics, International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| |
Collapse
|
8
|
Gillani M, Pollastri G. SCLpred-ECL: Subcellular Localization Prediction by Deep N-to-1 Convolutional Neural Networks. Int J Mol Sci 2024; 25:5440. [PMID: 38791479 PMCID: PMC11121631 DOI: 10.3390/ijms25105440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 05/09/2024] [Accepted: 05/11/2024] [Indexed: 05/26/2024] Open
Abstract
The subcellular location of a protein provides valuable insights to bioinformaticians in terms of drug designs and discovery, genomics, and various other aspects of medical research. Experimental methods for protein subcellular localization determination are time-consuming and expensive, whereas computational methods, if accurate, would represent a much more efficient alternative. This article introduces an ab initio protein subcellular localization predictor based on an ensemble of Deep N-to-1 Convolutional Neural Networks. Our predictor is trained and tested on strict redundancy-reduced datasets and achieves 63% accuracy for the diverse number of classes. This predictor is a step towards bridging the gap between a protein sequence and the protein's function. It can potentially provide information about protein-protein interaction to facilitate drug design and processes like vaccine production that are essential to disease prevention.
Collapse
Affiliation(s)
- Maryam Gillani
- School of Computer Science, University College Dublin (UCD), D04 V1W8 Dublin, Ireland;
| | | |
Collapse
|
9
|
Wen Y, Zhao Z, Cheng L, Zhou S, An M, Zhao J, Dong S, Yuan X, Yin M. Genome-wide identification and expression profiling of the ABI5 gene family in foxtail millet (Setaria italica). BMC PLANT BIOLOGY 2024; 24:164. [PMID: 38431546 PMCID: PMC10908088 DOI: 10.1186/s12870-024-04865-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2023] [Accepted: 02/26/2024] [Indexed: 03/05/2024]
Abstract
BACKGROUND ABA Insensitive 5 (ABI5) is a basic leucine zipper transcription factor that crucially influences plant growth, development, and stress response. However, there is minimal research on the ABI5 family in foxtail millet. RESULTS In this study, 16 ABI5 genes were identified in foxtail millet, and their sequence composition, gene structures, cis-acting elements, chromosome positions, and gene replication events were analyzed. To more thoroughly evaluate the developmental mechanisms of the SiABI5 family during evolution, we selected three dicotyledons (S. lycopersicum, A. thaliana, F. tataricum) and three (Z. mays, O. sativa, S. bicolor) specific representative monocotyledons associated with foxtail millet for comparative homology mapping. The results showed that foxtail millet ABI5 genes had the best homology with maize. A promoter sequence analysis showed that the SiABI5s contain numerous cis-acting elements related to hormone and stress responses, indicating that the regulation of SiABI5 expression was complex. The expression responses of 16 genes in different tissues, seed germination, and ear development were analyzed. A total of six representative genes were targeted from five subfamilies to characterize their gene expression responses to four different abiotic stresses. Overexpression of SiABI5.12 confers tolerance to osmotic stress in transgenic Arabidopsis thaliana, which demonstrated the function of SiABI5 responded to abiotic stress. CONCLUSIONS In summary, our research results comprehensively characterized the SiABI5 family and can provide a valuable reference for demonstrating the role of SiABI5s in regulating abiotic stress responses in foxtail millet.
Collapse
Affiliation(s)
- Yinyuan Wen
- College of Agronomy, Shanxi Agricultural University, Taigu, 030801, China
| | - Zeya Zhao
- College of Agronomy, Shanxi Agricultural University, Taigu, 030801, China
| | - Liuna Cheng
- College of Agronomy, Shanxi Agricultural University, Taigu, 030801, China
| | - Shixue Zhou
- College of Agronomy, Shanxi Agricultural University, Taigu, 030801, China
| | - Mengyao An
- College of Agronomy, Shanxi Agricultural University, Taigu, 030801, China
| | - Juan Zhao
- College of Agronomy, Shanxi Agricultural University, Taigu, 030801, China
| | - Shuqi Dong
- College of Agronomy, Shanxi Agricultural University, Taigu, 030801, China
| | - Xiangyang Yuan
- College of Agronomy, Shanxi Agricultural University, Taigu, 030801, China.
| | - Meiqiang Yin
- College of Agronomy, Shanxi Agricultural University, Taigu, 030801, China.
| |
Collapse
|
10
|
Mackon E, Guo Y, Jeazet Dongho Epse Mackon GC, Ma Y, Yao Y, Luo D, Dai X, Zhao N, Lu Y, Jandan TH, Liu P. OsGSTU34, a Bz2-like anthocyanin-related glutathione transferase transporter, is essential for rice (Oryza sativa L.) organs coloration. PHYTOCHEMISTRY 2024; 217:113896. [PMID: 37866445 DOI: 10.1016/j.phytochem.2023.113896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 10/12/2023] [Accepted: 10/12/2023] [Indexed: 10/24/2023]
Abstract
Anthocyanins are a flavonoid compound known as one of the most important chromogenic substances. They play several functions, including health promotion and sustaining plants during adverse conditions. They are synthesized at the endoplasmic reticulum and sequestered in the vacuole. In this work, we generated knock-out lines of OsGSTU34, a glutathione transporter's tau gene family, with no transgene line and off-target through CRISPR/Cas9 mutagenesis and highlighted the loss of pigmentation in rice flowers, leaves, stems, shoots, and caryopsis. The anthocyanin quantification in the wild-type BLWT and mutant line BLG34-8 caryopsis showed that cyanidin-3-O-glucoside (C3G) and peonidin-3-O-glucoside (P3G) were almost undetectable in the mutant line. A tandem mass tag (TMT) labeling proteomic analysis was conducted to elucidate the proteomic changes in the BLWT and BLG34-8. The result revealed that 1175 proteins were altered, including 408 that were down-regulated and 767 that were upregulated. The accumulation of the OsGSTU34-related protein (Q8L576), along with several anthocyanin-related proteins, was down-regulated. The enrichment analysis showed that the down-regulated proteins were enriched in different pathways, among which the phenylpropanoid biosynthesis pathway, flavonoid biosynthesis metabolites, and anthocyanin biosynthesis pathway. Protein interaction network prediction revealed that glutathione-S-transferase (Q8L576) was connected to the proteins involved in the flavonoid and anthocyanin biosynthesis pathways, such as flavanone 3-dioxygenase 1 (Q7XM21), leucoanthocyanidin dioxygenase 1 (Q93VC3), 4-coumarate-CoA ligase 2 (Q42982), phenylalanine ammonia-lyase (P14717), chalcone synthase 1 (Q2R3A1), and 4-coumarate-CoA ligase 5 (Q6ZAC1). However, the expression of the most important anthocyanin biosynthesis gene was not altered, suggesting that only the transport mechanism was affected. Our findings highlight new insight into the anthocyanin pigmentation in black rice and provide new perspectives for future research.
Collapse
Affiliation(s)
- Enerand Mackon
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Life Science and Technology, Guangxi University PR China.
| | - Yongqiang Guo
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, Guangxi University, Nanning 530005, PR China.
| | | | - Yafei Ma
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, Guangxi University, Nanning 530005, PR China.
| | - Yuhang Yao
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, Guangxi University, Nanning 530005, PR China.
| | - Dengjie Luo
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Life Science and Technology, Guangxi University PR China.
| | - Xianggui Dai
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, Guangxi University, Nanning 530005, PR China.
| | - Neng Zhao
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, Guangxi University, Nanning 530005, PR China.
| | - Ying Lu
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, Guangxi University, Nanning 530005, PR China.
| | - Tahir Hussain Jandan
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, Guangxi University, Nanning 530005, PR China.
| | - Piqing Liu
- State Key Laboratory of Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, Guangxi University, Nanning 530005, PR China.
| |
Collapse
|
11
|
Caoili SEC. B-Cell Epitope Prediction for Antipeptide Paratopes with the HAPTIC2/HEPTAD User Toolkit (HUT). Methods Mol Biol 2024; 2821:9-32. [PMID: 38997477 DOI: 10.1007/978-1-0716-3914-6_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/14/2024]
Abstract
B-cell epitope prediction is key to developing peptide-based vaccines and immunodiagnostics along with antibodies for prophylactic, therapeutic and/or diagnostic use. This entails estimating paratope binding affinity for variable-length peptidic sequences subject to constraints on both paratope accessibility and antigen conformational flexibility, as described herein for the HAPTIC2/HEPTAD User Toolkit (HUT). HUT comprises the Heuristic Affinity Prediction Tool for Immune Complexes 2 (HAPTIC2), the HAPTIC2-like Epitope Prediction Tool for Antigen with Disulfide (HEPTAD) and the HAPTIC2/HEPTAD Input Preprocessor (HIP). HIP enables tagging of residues (e.g., in hydrophobic blobs, ordered regions and glycosylation motifs) for exclusion from downstream analyses by HAPTIC2 and HEPTAD. HAPTIC2 estimates paratope binding affinity for disulfide-free disordered peptidic antigens (by analogy between flexible-ligand docking and protein folding), from terms attributed to compaction (in view of sequence length, charge and temperature-dependent polyproline-II helical propensity), collapse (disfavored by residue bulkiness) and contact (with glycine and proline regarded as polar residues that hydrogen bond with paratopes). HEPTAD analyzes antigen sequences that each contain two cysteine residues for which the impact of disulfide pairing is estimated as a correction to the free-energy penalty of compaction. All of HUT is freely accessible online ( https://freeshell.de/~badong/hut.htm ).
Collapse
Affiliation(s)
- Salvador Eugenio C Caoili
- Biomedical Innovations Research for Translational Health Science (BIRTHS) Laboratory, Department of Biochemistry and Molecular Biology, College of Medicine, University of the Philippines Manila, Ermita, Manila, Philippines.
| |
Collapse
|
12
|
Khwaja E, Song YS, Agarunov A, Huang B. CELL-E 2: Translating Proteins to Pictures and Back with a Bidirectional Text-to-Image Transformer. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 2023; 36:4899-4914. [PMID: 39021511 PMCID: PMC11254339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
We present CELL-E 2, a novel bidirectional transformer that can generate images depicting protein subcellular localization from the amino acid sequences (and vice versa). Protein localization is a challenging problem that requires integrating sequence and image information, which most existing methods ignore. CELL-E 2 extends the work of CELL-E, not only capturing the spatial complexity of protein localization and produce probability estimates of localization atop a nucleus image, but also being able to generate sequences from images, enabling de novo protein design. We train and finetune CELL-E 2 on two large-scale datasets of human proteins. We also demonstrate how to use CELL-E 2 to create hundreds of novel nuclear localization signals (NLS). Results and interactive demos are featured at https://bohuanglab.github.io/CELL-E_2/.
Collapse
Affiliation(s)
- Emaad Khwaja
- UC Berkeley - UCSF Joint Bioengineering Graduate Program
- Computer Science Division, UC Berkeley, CA 94720
| | - Yun S Song
- Department of Statistics, UC Berkeley, CA 94720
- Computer Science Division, UC Berkeley, CA 94720
| | - Aaron Agarunov
- Department of Pathology, Memorial Sloan Kettering Cancer Center, 10065
| | - Bo Huang
- Department of Pharmaceutical Chemistry, UCSF, San Francisco, CA 94143
- Department of Biochemistry and Biophysics, UCSF, San Francisco, CA 94143
- Chan Zuckerberg Biohub - San Francisco, San Francisco, CA 94158
| |
Collapse
|
13
|
Wang B, Zhang X, Xu C, Han X, Wang Y, Situ C, Li Y, Guo X. DeepSP: A Deep Learning Framework for Spatial Proteomics. J Proteome Res 2023. [PMID: 37314414 DOI: 10.1021/acs.jproteome.2c00394] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The study of protein subcellular localization (PSL) is a fundamental step toward understanding the mechanism of protein function. The recent development of mass spectrometry (MS)-based spatial proteomics to quantify the distribution of proteins across subcellular fractions provides us a high-throughput approach to predict unknown PSLs based on known PSLs. However, the accuracy of PSL annotations in spatial proteomics is limited by the performance of existing PSL predictors based on traditional machine learning algorithms. In this study, we present a novel deep learning framework named DeepSP for PSL prediction of an MS-based spatial proteomics data set. DeepSP constructs the new feature map of a difference matrix by capturing detailed changes between different subcellular fractions of protein occupancy profiles and uses the convolutional block attention module to improve the prediction performance of PSL. DeepSP achieved significant improvement in accuracy and robustness for PSL prediction in independent test sets and unknown PSL prediction compared to current state-of-the-art machine learning predictors. As an efficient and robust framework for PSL prediction, DeepSP is expected to facilitate spatial proteomics studies and contributes to the elucidation of protein functions and the regulation of biological processes.
Collapse
Affiliation(s)
- Bing Wang
- Department of Histology and Embryology, State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
- School of Medicine, Southeast University, Nanjing 210009, China
| | - Xiangzheng Zhang
- Department of Histology and Embryology, State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
| | - Chen Xu
- Department of Histology and Embryology, State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
| | - Xudong Han
- Department of Histology and Embryology, State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
- School of Medicine, Southeast University, Nanjing 210009, China
| | - Yue Wang
- Department of Histology and Embryology, State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
| | - Chenghao Situ
- Department of Histology and Embryology, State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
| | - Yan Li
- Department of Clinical Laboratory, Sir Run Run Hospital, Nanjing Medical University, Nanjing 211100, China
| | - Xuejiang Guo
- Department of Histology and Embryology, State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing 211166, China
| |
Collapse
|
14
|
Jiang Y, Jiang L, Akhil CS, Wang D, Zhang Z, Zhang W, Xu D. MULocDeep web service for protein localization prediction and visualization at subcellular and suborganellar levels. Nucleic Acids Res 2023:7161528. [PMID: 37178004 DOI: 10.1093/nar/gkad374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Revised: 04/12/2023] [Accepted: 04/27/2023] [Indexed: 05/15/2023] Open
Abstract
Predicting protein localization and understanding its mechanisms are critical in biology and pathology. In this context, we propose a new web application of MULocDeep with improved performance, result interpretation, and visualization. By transferring the original model into species-specific models, MULocDeep achieved competitive prediction performance at the subcellular level against other state-of-the-art methods. It uniquely provides a comprehensive localization prediction at the suborganellar level. Besides prediction, our web service quantifies the contribution of single amino acids to localization for individual proteins; for a group of proteins, common motifs or potential targeting-related regions can be derived. Furthermore, the visualizations of targeting mechanism analyses can be downloaded for publication-ready figures. The MULocDeep web service is available at https://www.mu-loc.org/.
Collapse
Affiliation(s)
- Yuexu Jiang
- Department of Electrical Engineer and Computer Science, Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, USA
| | - Lei Jiang
- Department of Electrical Engineer and Computer Science, Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, USA
| | - Chopparapu Sai Akhil
- Department of Electrical Engineer and Computer Science, Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, USA
| | - Duolin Wang
- Department of Electrical Engineer and Computer Science, Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, USA
| | - Ziyang Zhang
- Department of Electrical Engineer and Computer Science, Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, USA
| | - Weinan Zhang
- Department of Electrical Engineer and Computer Science, Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, USA
| | - Dong Xu
- Department of Electrical Engineer and Computer Science, Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, USA
| |
Collapse
|
15
|
Chen X, Pan X, Ji T, Yu S, Sun Y. Fusion classification of stroke patients' biosignals by weighted cross-validation-based feature selection (W-CVFS) method. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
16
|
Rozov SM, Deineko EV. Increasing the Efficiency of the Accumulation of Recombinant Proteins in Plant Cells: The Role of Transport Signal Peptides. PLANTS (BASEL, SWITZERLAND) 2022; 11:2561. [PMID: 36235427 PMCID: PMC9572730 DOI: 10.3390/plants11192561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 09/23/2022] [Accepted: 09/26/2022] [Indexed: 06/16/2023]
Abstract
The problem with increasing the yield of recombinant proteins is resolvable using different approaches, including the transport of a target protein to cell compartments with a low protease activity. In the cell, protein targeting involves short-signal peptide sequences recognized by intracellular protein transport systems. The main systems of the protein transport across membranes of the endoplasmic reticulum and endosymbiotic organelles are reviewed here, as are the major types and structure of the signal sequences targeting proteins to the endoplasmic reticulum and its derivatives, to plastids, and to mitochondria. The role of protein targeting to certain cell organelles depending on specific features of recombinant proteins and the effect of this targeting on the protein yield are discussed, in addition to the main directions of the search for signal sequences based on their primary structure. This knowledge makes it possible not only to predict a protein localization in the cell but also to reveal the most efficient sequences with potential biotechnological utility.
Collapse
|
17
|
Nakai K, Wei L. Recent Advances in the Prediction of Subcellular Localization of Proteins and Related Topics. FRONTIERS IN BIOINFORMATICS 2022; 2:910531. [PMID: 36304291 PMCID: PMC9580943 DOI: 10.3389/fbinf.2022.910531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 04/25/2022] [Indexed: 11/13/2022] Open
Abstract
Prediction of subcellular localization of proteins from their amino acid sequences has a long history in bioinformatics and is still actively developing, incorporating the latest advances in machine learning and proteomics. Notably, deep learning-based methods for natural language processing have made great contributions. Here, we review recent advances in the field as well as its related fields, such as subcellular proteomics and the prediction/recognition of subcellular localization from image data.
Collapse
Affiliation(s)
- Kenta Nakai
- Institute of Medical Science, The University of Tokyo, Minato-Ku, Japan
| | - Leyi Wei
- School of Software, Shandong University, Jinan, China
| |
Collapse
|
18
|
Zhuang H, Fan X, Ji D, Wang Y, Fan J, Li M, Ni D, Lu S, Li X, Chai Z. Elucidation of the conformational dynamics and assembly of Argonaute-RNA complexes by distinct yet coordinated actions of the supplementary microRNA. Comput Struct Biotechnol J 2022; 20:1352-1365. [PMID: 35356544 PMCID: PMC8933676 DOI: 10.1016/j.csbj.2022.03.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 03/04/2022] [Accepted: 03/04/2022] [Indexed: 02/07/2023] Open
Abstract
Argonaute (AGO) proteins, the core of RNA-induced silencing complex, are guided by microRNAs (miRNAs) to recognize target RNA for repression. The miRNA-target RNA recognition forms initially through pairing at the seed region while the additional supplementary pairing can enhance target recognition and compensate for seed mismatch. The extension of miRNA lengths can strengthen the target affinity when pairing both in the seed and supplementary regions. However, the mechanism underlying the effect of the supplementary pairing on the conformational dynamics and the assembly of AGO-RNA complex remains poorly understood. To address this, we performed large-scale molecular dynamics simulations of AGO-RNA complexes with different pairing patterns and miRNA lengths. The results reveal that the additional supplementary pairing can not only strengthen the interaction between miRNA and target RNA, but also induce the increased plasticity of the PAZ domain and enhance the domain connectivity among the PAZ, PIWI, N domains of the AGO protein. The strong community network between these domains tightens the mouth of the supplementary chamber of AGO protein, which prevents the escape of target RNA from the complex and shields it from solvent water attack. Importantly, the inner stronger matching pairs between the miRNA and target RNA can compensate for weaker mismatches at the edge of supplementary region. These findings provide guidance for the design of miRNA mimics and anti-miRNAs for both clinical and experimental use and open the way for further engineering of AGO proteins as a new tool in the field of gene regulation.
Collapse
Affiliation(s)
- Haiming Zhuang
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Xiaohua Fan
- Department of Anesthesiology, Changhai Hospital, Naval Medical University, Shanghai 200433, China
| | - Dong Ji
- Department of Anesthesiology, Changhai Hospital, Naval Medical University, Shanghai 200433, China
| | - Yuanhao Wang
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Jigang Fan
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Mingyu Li
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Duan Ni
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Shaoyong Lu
- Department of Pathophysiology, Cell Differentiation and Apoptosis of Chinese Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Xiaolong Li
- Department of Orthopedics, Changhai Hospital, Naval Medical University, Shanghai 200433, China
| | - Zongtao Chai
- Department of Hepatic Surgery VI, Eastern Hepatobiliary Surgery Hospital, Navy Medical University, Shanghai 200438, China
- Department of Hepatic Surgery, Shanghai Geriatric Center, Shanghai 201104, China
| |
Collapse
|