1
|
Liang X, Qin S, Wei G, Guo X, Wei S, Wei F, Liang Y. Comprehensive analysis of the NAC transcription factor gene family in Sophora tonkinensis Gagnep. BMC PLANT BIOLOGY 2025; 25:530. [PMID: 40281421 PMCID: PMC12023634 DOI: 10.1186/s12870-025-06564-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Accepted: 04/15/2025] [Indexed: 04/29/2025]
Abstract
BACKGROUND Sophora tonkinensis Gagnep. has long been utilized in the treatment of anti-inflammatory and pain-relieving, with its principal active compounds being alkaloids and flavonoids. NAC transcription factors, a large family of plant-specific regulators, play pivotal roles in growth, development, stress responses, and secondary metabolism. However, comprehensive genome-wide characterization of S. tonkinensis NAC gene family (StNAC) remains unexplored. RESULTS This study identified 85 NAC proteins from the S. tonkinensis genome database. Phylogenetic analysis revealed that StNAC proteins were categorized into 15 subgroups based on their homology with Arabidopsis thaliana NAC proteins. Gene structure analysis demonstrated a variation in intron numbers ranging from 1 to 7, with a majority of StNAC genes containing 2-3 introns. Chromosomal distribution analysis indicated an uneven spread of StNAC genes across 9 chromosomes, with the highest number of StNAC genes on Chr3. Detection of 4 tandem duplicates and 32 segmental duplicates revealed that segmental duplication primarily drive StNAC genes amplification. Prediction of cis-regulatory elements suggested the involvement of StNAC genes in growth, stress responses, and hormone regulation. Gene expression analysis showed substantial variability expression of StNAC genes across different tissues. Notably, eight StNAC genes were identified as significantly associated alkaloid and flavonoid levels. qRT-PCR validation indicated that five genes were highly expressed in tissues, corroborating transcriptome data. CONCLUSION These findings offer valuable insights for further functional characterization of NAC genes and their potential roles in alkaloid and flavonoid biosynthesis in S. tonkinensis.
Collapse
Affiliation(s)
- Ximei Liang
- Guangxi Key Laboratory of Medicinal Resources Protection and Genetic Improvement, National Center for Traditional Chinese Medicine (TCM) Inheritance and Innovation, Guangxi Botanical Garden of Medicinal Plants, No. 189 Changgang Road, Xingning District, Nanning, 530023, People's Republic of China
- National Engineering Research Center for Southwest Endangered Medicinal Materials Resources Development, Guangxi Botanical Garden of Medicinal Plants, Nanning, China
- College of pharmacy, Guangxi University of Chinese Medicine, Nanning, China
| | - Shuangshuang Qin
- Guangxi Key Laboratory of Medicinal Resources Protection and Genetic Improvement, National Center for Traditional Chinese Medicine (TCM) Inheritance and Innovation, Guangxi Botanical Garden of Medicinal Plants, No. 189 Changgang Road, Xingning District, Nanning, 530023, People's Republic of China
- National Engineering Research Center for Southwest Endangered Medicinal Materials Resources Development, Guangxi Botanical Garden of Medicinal Plants, Nanning, China
| | - Guili Wei
- Guangxi Key Laboratory of Medicinal Resources Protection and Genetic Improvement, National Center for Traditional Chinese Medicine (TCM) Inheritance and Innovation, Guangxi Botanical Garden of Medicinal Plants, No. 189 Changgang Road, Xingning District, Nanning, 530023, People's Republic of China
- National Engineering Research Center for Southwest Endangered Medicinal Materials Resources Development, Guangxi Botanical Garden of Medicinal Plants, Nanning, China
| | - Xiaoyun Guo
- Guangxi Key Laboratory of Medicinal Resources Protection and Genetic Improvement, National Center for Traditional Chinese Medicine (TCM) Inheritance and Innovation, Guangxi Botanical Garden of Medicinal Plants, No. 189 Changgang Road, Xingning District, Nanning, 530023, People's Republic of China
- National Engineering Research Center for Southwest Endangered Medicinal Materials Resources Development, Guangxi Botanical Garden of Medicinal Plants, Nanning, China
| | - Shugen Wei
- Guangxi Key Laboratory of Medicinal Resources Protection and Genetic Improvement, National Center for Traditional Chinese Medicine (TCM) Inheritance and Innovation, Guangxi Botanical Garden of Medicinal Plants, No. 189 Changgang Road, Xingning District, Nanning, 530023, People's Republic of China.
- National Engineering Research Center for Southwest Endangered Medicinal Materials Resources Development, Guangxi Botanical Garden of Medicinal Plants, Nanning, China.
- College of pharmacy, Guangxi University of Chinese Medicine, Nanning, China.
- Guangxi Key Laboratory of High Quality Formation and Application of Genuine Medicinal Materials / Guangxi Traditional Chinese Medicine Breeding Technology Innovation Center, Guangxi Botanical Garden of Medicinal Plants, No. 189 Changgang Road, Xingning District, Nanning, 530023, People's Republic of China.
| | - Fan Wei
- Guangxi Key Laboratory of Medicinal Resources Protection and Genetic Improvement, National Center for Traditional Chinese Medicine (TCM) Inheritance and Innovation, Guangxi Botanical Garden of Medicinal Plants, No. 189 Changgang Road, Xingning District, Nanning, 530023, People's Republic of China.
- National Engineering Research Center for Southwest Endangered Medicinal Materials Resources Development, Guangxi Botanical Garden of Medicinal Plants, Nanning, China.
| | - Ying Liang
- Guangxi Key Laboratory of Medicinal Resources Protection and Genetic Improvement, National Center for Traditional Chinese Medicine (TCM) Inheritance and Innovation, Guangxi Botanical Garden of Medicinal Plants, No. 189 Changgang Road, Xingning District, Nanning, 530023, People's Republic of China.
- National Engineering Research Center for Southwest Endangered Medicinal Materials Resources Development, Guangxi Botanical Garden of Medicinal Plants, Nanning, China.
| |
Collapse
|
2
|
Zhang C, Zhu X, Peterson N, Wang J, Wan S. A Comprehensive Review on RNA Subcellular Localization Prediction. ARXIV 2025:arXiv:2504.17162v1. [PMID: 40313658 PMCID: PMC12045386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/03/2025]
Abstract
The subcellular localization of RNAs, including long non-coding RNAs (lncRNAs), messenger RNAs (mRNAs), microRNAs (miRNAs) and other smaller RNAs, plays a critical role in determining their biological functions. For instance, lncRNAs are predominantly associated with chromatin and act as regulators of gene transcription and chromatin structure, while mRNAs are distributed across the nucleus and cytoplasm, facilitating the transport of genetic information for protein synthesis. Understanding RNA localization sheds light on processes like gene expression regulation with spatial and temporal precision. However, traditional wet lab methods for determining RNA localization, such as in situ hybridization, are often time-consuming, resource-demanding, and costly. To overcome these challenges, computational methods leveraging artificial intelligence (AI) and machine learning (ML) have emerged as powerful alternatives, enabling large-scale prediction of RNA subcellular localization. This paper provides a comprehensive review of the latest advancements in AI-based approaches for RNA subcellular localization prediction, covering various RNA types and focusing on sequence-based, image-based, and hybrid methodologies that combine both data types. We highlight the potential of these methods to accelerate RNA research, uncover molecular pathways, and guide targeted disease treatments. Furthermore, we critically discuss the challenges in AI/ML approaches for RNA subcellular localization, such as data scarcity and lack of benchmarks, and opportunities to address them. This review aims to serve as a valuable resource for researchers seeking to develop innovative solutions in the field of RNA subcellular localization and beyond.
Collapse
Affiliation(s)
- Cece Zhang
- Department of Cell & Systems Biology, University of Toronto, ON, Canada
| | - Xuehuan Zhu
- School of Engineering, University of California, Los Angeles, CA, United States
| | - Nick Peterson
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| | - Jieqiong Wang
- Department of Neurological Sciences, University of Nebraska Medical Center, Omaha, NE, United States
| | - Shibiao Wan
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| |
Collapse
|
3
|
Gillani M, Pollastri G. Protein subcellular localization prediction tools. Comput Struct Biotechnol J 2024; 23:1796-1807. [PMID: 38707539 PMCID: PMC11066471 DOI: 10.1016/j.csbj.2024.04.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 04/11/2024] [Accepted: 04/11/2024] [Indexed: 05/07/2024] Open
Abstract
Protein subcellular localization prediction is of great significance in bioinformatics and biological research. Most of the proteins do not have experimentally determined localization information, computational prediction methods and tools have been acting as an active research area for more than two decades now. Knowledge of the subcellular location of a protein provides valuable information about its functionalities, the functioning of the cell, and other possible interactions with proteins. Fast, reliable, and accurate predictors provides platforms to harness the abundance of sequence data to predict subcellular locations accordingly. During the last decade, there has been a considerable amount of research effort aimed at developing subcellular localization predictors. This paper reviews recent subcellular localization prediction tools in the Eukaryotic, Prokaryotic, and Virus-based categories followed by a detailed analysis. Each predictor is discussed based on its main features, strengths, weaknesses, algorithms used, prediction techniques, and analysis. This review is supported by prediction tools taxonomies that highlight their rele- vant area and examples for uncomplicated categorization and ease of understandability. These taxonomies help users find suitable tools according to their needs. Furthermore, recent research gaps and challenges are discussed to cover areas that need the utmost attention. This survey provides an in-depth analysis of the most recent prediction tools to facilitate readers and can be considered a quick guide for researchers to identify and explore the recent literature advancements.
Collapse
Affiliation(s)
- Maryam Gillani
- School of Computer Science, University College Dublin (UCD), Dublin, D04 V1W8, Ireland
| | - Gianluca Pollastri
- School of Computer Science, University College Dublin (UCD), Dublin, D04 V1W8, Ireland
| |
Collapse
|
4
|
Gillani M, Pollastri G. SCLpred-ECL: Subcellular Localization Prediction by Deep N-to-1 Convolutional Neural Networks. Int J Mol Sci 2024; 25:5440. [PMID: 38791479 PMCID: PMC11121631 DOI: 10.3390/ijms25105440] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 05/09/2024] [Accepted: 05/11/2024] [Indexed: 05/26/2024] Open
Abstract
The subcellular location of a protein provides valuable insights to bioinformaticians in terms of drug designs and discovery, genomics, and various other aspects of medical research. Experimental methods for protein subcellular localization determination are time-consuming and expensive, whereas computational methods, if accurate, would represent a much more efficient alternative. This article introduces an ab initio protein subcellular localization predictor based on an ensemble of Deep N-to-1 Convolutional Neural Networks. Our predictor is trained and tested on strict redundancy-reduced datasets and achieves 63% accuracy for the diverse number of classes. This predictor is a step towards bridging the gap between a protein sequence and the protein's function. It can potentially provide information about protein-protein interaction to facilitate drug design and processes like vaccine production that are essential to disease prevention.
Collapse
Affiliation(s)
- Maryam Gillani
- School of Computer Science, University College Dublin (UCD), D04 V1W8 Dublin, Ireland;
| | | |
Collapse
|
5
|
Gao Y, Qu D, Zhou M, Tang R, Ye J, Li X, Wang Y. Rhizobial-induced phosphatase GmPP2C61A positively regulates soybean nodulation. PHYSIOLOGIA PLANTARUM 2024; 176:e14341. [PMID: 38741264 DOI: 10.1111/ppl.14341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 04/23/2024] [Accepted: 04/25/2024] [Indexed: 05/16/2024]
Abstract
Symbiotic nitrogen fixation (SNF) is crucial for legumes, providing them with the nitrogen necessary for plant growth and development. Nodulation is the first step in the establishment of SNF. However, the determinant genes in soybean nodulation and the understanding of the underlying molecular mechanisms governing nodulation are still limited. Herein, we identified a phosphatase, GmPP2C61A, which was specifically induced by rhizobia inoculation. Using transgenic hairy roots harboring GmPP2C61A::GUS, we showed that GmPP2C61A was mainly induced in epidermal cells following rhizobia inoculation. Functional analysis revealed that knockdown or knock-out of GmPP2C61A significantly reduced the number of nodules, while overexpression of GmPP2C61A promoted nodule formation. Additionally, GmPP2C61A protein was mainly localized in the cytoplasm and exhibited conserved phosphatase activity in vitro. Our findings suggest that phosphatase GmPP2C61A serves as a critical regulator in soybean nodulation, highlighting its potential significance in enhancing symbiotic nitrogen fixation.
Collapse
Affiliation(s)
- Yongkang Gao
- State Key Laboratory of Agricultural Microbiology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China
| | - Dejie Qu
- State Key Laboratory of Agricultural Microbiology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China
| | - Miaomiao Zhou
- State Key Laboratory of Agricultural Microbiology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China
| | - Ruiheng Tang
- State Key Laboratory of Agricultural Microbiology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China
| | - Junjie Ye
- State Key Laboratory of Agricultural Microbiology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China
| | - Xia Li
- State Key Laboratory of Agricultural Microbiology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China
| | - Youning Wang
- State Key Laboratory of Agricultural Microbiology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production, College of Agronomy, Northwest A&F University Yangling, Shaanxi Province, P.R. China
| |
Collapse
|
6
|
Liu M, Li Z, Kang Y, Lv J, Jin Z, Mu S, Yue H, Li L, Chen P, Li Y. A mutation in CsGME encoding GDP-mannose 3,5-epimerase results in little and wrinkled leaf in cucumber. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:114. [PMID: 38678513 DOI: 10.1007/s00122-024-04600-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 03/13/2024] [Indexed: 05/01/2024]
Abstract
KEY MESSAGE Map-based cloning revealed that a mutation in a highly conserved amino acid of the CsGME gene encoding GDP-mannose 3,5-epimerase, causes the phenotype of little and wrinkled leaves in cucumbers. Leaf size is a critical determinant of plant architecture in cucumbers, yet only a few genes associated with this trait have been mapped or cloned. Here, we identified and characterized a mutant with little and wrinkled leaves, named lwl-1. Genetic analysis revealed that the phenotype of the lwl-1 was controlled by a single recessive gene. Through map-based cloning, the lwl-1 locus was narrowed down to a 12.22-kb region exclusively containing one fully annotated gene CsGME (CsaV3_2G004170). CsGME encodes GDP-mannose 3,5-epimerase, which is involved in the synthesis of ascorbic acid (ASA) and one of the components of pectin, RG-II. Whole-length sequencing of the 12.22 kb DNA fragment revealed the presence of only a non-synonymous mutation located in the sixth exon of CsGME in lwl-1, resulting in an amino acid alteration from Pro363 to Leu363. This mutation was unique among 118 inbred lines from cucumber natural populations. CsGME expression significantly reduced in various organs of lwl-1, accompanied by a significant decrease in ASA and pectin content in leaves. Both CsGME and Csgme proteins were localized to the cytoplasm. The mutant phenotype exhibited partial recovery after the application of exogenous boric acid. Silencing CsGME in cucumber through VIGS confirmed its role as the causal gene for lwl-1. Transcriptome profiling revealed that CsGME greatly affected the expression of genes related to the cell division process and cell plate formation. This study represents the first report to characterize and clone the CsGME in cucumber, indicating its crucial role in regulating leaf size and development.
Collapse
Affiliation(s)
- Mengying Liu
- College of Horticulture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Zhaowei Li
- College of Life Science, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Yunfeng Kang
- College of Life Science, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Jinzhao Lv
- College of Horticulture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Zhuoshuai Jin
- College of Horticulture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Siyu Mu
- College of Life Science, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Hongzhong Yue
- Vegetable Research Institute, Gansu Academy of Agricultural Sciences, Lanzhou, 730070, Gansu, China
| | - Lixia Li
- College of Horticulture, Shanxi Agricultural University, Taigu, 030801, China
| | - Peng Chen
- College of Life Science, Northwest A&F University, Yangling, 712100, Shaanxi, China.
| | - Yuhong Li
- College of Horticulture, Northwest A&F University, Yangling, 712100, Shaanxi, China.
| |
Collapse
|
7
|
Demko V, Belova T, Messerer M, Hvidsten TR, Perroud PF, Ako AE, Johansen W, Mayer KFX, Olsen OA, Lang D. Regulation of developmental gatekeeping and cell fate transition by the calpain protease DEK1 in Physcomitrium patens. Commun Biol 2024; 7:261. [PMID: 38438476 PMCID: PMC10912778 DOI: 10.1038/s42003-024-05933-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 02/19/2024] [Indexed: 03/06/2024] Open
Abstract
Calpains are cysteine proteases that control cell fate transitions whose loss of function causes severe, pleiotropic phenotypes in eukaryotes. Although mainly considered as modulatory proteases, human calpain targets are directed to the N-end rule degradation pathway. Several such targets are transcription factors, hinting at a gene-regulatory role. Here, we analyze the gene-regulatory networks of the moss Physcomitrium patens and characterize the regulons that are misregulated in mutants of the calpain DEFECTIVE KERNEL1 (DEK1). Predicted cleavage patterns of the regulatory hierarchies in five DEK1-controlled subnetworks are consistent with a pleiotropic and regulatory role during cell fate transitions targeting multiple functions. Network structure suggests DEK1-gated sequential transitions between cell fates in 2D-to-3D development. Our method combines comprehensive phenotyping, transcriptomics and data science to dissect phenotypic traits, and our model explains the protease function as a switch gatekeeping cell fate transitions potentially also beyond plant development.
Collapse
Affiliation(s)
- Viktor Demko
- Department of Plant Sciences, Norwegian University of Life Sciences, P.O. Box 5003, NO-1432, Ås, Norway
- Department of Plant Physiology, Faculty of Natural Sciences, Comenius University in Bratislava, Ilkovicova 6, 84104, Bratislava, Slovakia
- Plant Science and Biodiversity Center, Slovak Academy of Sciences, Dubravska cesta 9, 84104, Bratislava, Slovakia
| | - Tatiana Belova
- Department of Plant Sciences, Norwegian University of Life Sciences, P.O. Box 5003, NO-1432, Ås, Norway
- Centre for Molecular Medicine Norway, University of Oslo, Oslo, Norway
| | - Maxim Messerer
- Plant Genome and Systems Biology, Helmholtz Center Munich-Research Center for Environmental Health, 85764, Neuherberg, Germany
| | - Torgeir R Hvidsten
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | - Pierre-François Perroud
- Institut Jean-Pierre Bourgin, INRAE, AgroParisTech, Université Paris-Saclay, 78000, Versailles, France
| | - Ako Eugene Ako
- Department of Biotechnology, Inland Norway University of Applied Sciences, Holsetgata 31, 2318, Hamar, Norway
- School of Animal, Rural and Environmental Sciences, Nottingham Trent University, Brackenhurst Campus, Southwell, Nottinghamshire, NG25 0QF, UK
| | - Wenche Johansen
- Department of Biotechnology, Inland Norway University of Applied Sciences, Holsetgata 31, 2318, Hamar, Norway
| | - Klaus F X Mayer
- Plant Genome and Systems Biology, Helmholtz Center Munich-Research Center for Environmental Health, 85764, Neuherberg, Germany
- School of Life Sciences, Technical University Munich, 85354, Freising, Germany
| | - Odd-Arne Olsen
- Department of Plant Sciences, Norwegian University of Life Sciences, P.O. Box 5003, NO-1432, Ås, Norway
| | - Daniel Lang
- Plant Genome and Systems Biology, Helmholtz Center Munich-Research Center for Environmental Health, 85764, Neuherberg, Germany.
- Bundeswehr Institute of Microbiology, Microbial Genomics and Bioforensics, 80937, Munich, Germany.
| |
Collapse
|
8
|
Graffam D, Cutlan M, Storm AR, Hulse-Kemp AM, Stoeckman AK. Gossypium hirsutum gene of unknown function Gohir.A02G161000 encodes a potential transmembrane Root UVB Sensitive 4 Protein with a putative protein-protein interaction interface. MICROPUBLICATION BIOLOGY 2024; 2024:10.17912/micropub.biology.000869. [PMID: 38495582 PMCID: PMC10943365 DOI: 10.17912/micropub.biology.000869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 02/05/2024] [Accepted: 02/27/2024] [Indexed: 03/19/2024]
Abstract
A gene of unknown function, Gohir.A02G161000.1, identified in Gossypium hirsutum was studied using computational sequence and structure bioinformatics tools. The associated protein GhRUS4-A0A1U8JPV7 (UniProt A0A1U8JPV7) is predicted to be a plastid-localized, transmembrane root UVB-sensitive 4 (RUS4) protein with a newly identified potential dimerization surface. Evidence from homology and sequence conservation suggest involvement in auxin transport and pollen maturation.
Collapse
Affiliation(s)
| | - Marissa Cutlan
- Chemistry Department, Bethel University, Saint Paul, MN USA
| | - Amanda R Storm
- Department of Biology, Western Carolina University, Cullowhee, NC USA
| | - Amanda M Hulse-Kemp
- Genomics and Bioinformatics Research Unit, The Agricultural Research Service of U.S. Department of Agriculture, Raleigh, NC USA
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC USA
| | | |
Collapse
|
9
|
Nielsen H. Protein Sorting Prediction. Methods Mol Biol 2024; 2715:27-63. [PMID: 37930519 DOI: 10.1007/978-1-0716-3445-5_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2023]
Abstract
Many computational methods are available for predicting protein sorting in bacteria. When comparing them, it is important to know that they can be grouped into three fundamentally different approaches: signal-based, global property-based, and homology-based prediction. In this chapter, the strengths and drawbacks of each of these approaches are described through many examples of methods that predict secretion, integration into membranes, or subcellular locations in general. The aim of this chapter is to provide a user-level introduction to the field with a minimum of computational theory.
Collapse
Affiliation(s)
- Henrik Nielsen
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark.
| |
Collapse
|
10
|
Di Donfrancesco A, Berlingieri C, Giacomello M, Frascarelli C, Magalhaes Rebelo AP, Bindoff LA, Reeval S, Renbaum P, Santorelli FM, Massaro G, Viscomi C, Zeviani M, Ghezzi D, Bottani E, Brunetti D. PPAR-gamma agonist pioglitazone recovers mitochondrial quality control in fibroblasts from PITRM1-deficient patients. Front Pharmacol 2023; 14:1220620. [PMID: 37576821 PMCID: PMC10415619 DOI: 10.3389/fphar.2023.1220620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 07/10/2023] [Indexed: 08/15/2023] Open
Abstract
Introduction: Biallelic variants in PITRM1 are associated with a slowly progressive syndrome characterized by intellectual disability, spinocerebellar ataxia, cognitive decline and psychosis. The pitrilysin metallopeptidase 1 (PITRM1) is a mitochondrial matrix enzyme, which digests diverse oligopeptides, including the mitochondrial targeting sequences (MTS) that are cleaved from proteins imported across the inner mitochondrial membrane by the mitochondrial processing peptidase (MPP). Mitochondrial peptidases also play a role in the maturation of Frataxin, the protein affected in Friedreich's ataxia. Recent studies in yeast indicated that the mitochondrial matrix protease Ste23, which is a homologue of the human insulin-degrading enzyme (IDE), cooperates with Cym1 (homologue of PITRM1) to ensure the proper functioning of the preprotein processing machinery. In humans, IDE could be upregulated by Peroxisome Proliferator-Activated Receptor Gamma (PPARG) agonists. Methods: We investigated preprotein processing, mitochondrial membrane potential and MTS degradation in control and patients' fibroblasts, and we evaluated the pharmacological effect of the PPARG agonist Pioglitazone on mitochondrial proteostasis. Results: We discovered that PITRM1 dysfunction results in the accumulation of MTS, leading to the disruption and dissipation of the mitochondrial membrane potential. This triggers a feedback inhibition of MPP activity, consequently impairing the processing and maturation of Frataxin. Furthermore, we found that the pharmacological stimulation of PPARG by Pioglitazone upregulates IDE and also PITRM1 protein levels restoring the presequence processing machinery and improving Frataxin maturation and mitochondrial function. Discussion: Our findings provide mechanistic insights and suggest a potential pharmacological strategy for this rare neurodegenerative mitochondrial disease.
Collapse
Affiliation(s)
- Alessia Di Donfrancesco
- Unità di Genetica Medica e Neurogenetica, Fondazione IRCCS Istituto Neurologico Carlo Besta, Milan, Italy
| | - Christian Berlingieri
- Unità di Genetica Medica e Neurogenetica, Fondazione IRCCS Istituto Neurologico Carlo Besta, Milan, Italy
| | - Marta Giacomello
- Department of Biology, University of Padova, Padova, Italy
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Chiara Frascarelli
- Unità di Genetica Medica e Neurogenetica, Fondazione IRCCS Istituto Neurologico Carlo Besta, Milan, Italy
| | | | | | - Segel Reeval
- Shaare Zedek Medical Center, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Paul Renbaum
- Shaare Zedek Medical Center, The Hebrew University of Jerusalem, Jerusalem, Israel
| | | | - Giulia Massaro
- UCL School of Pharmacy, University College London, London, United Kingdom
| | - Carlo Viscomi
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Massimo Zeviani
- Department of Neurosciences, University of Padova, Padova, Italy
| | - Daniele Ghezzi
- Unità di Genetica Medica e Neurogenetica, Fondazione IRCCS Istituto Neurologico Carlo Besta, Milan, Italy
- Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy
| | - Emanuela Bottani
- Department of Diagnostic and Public Health, Section of Pharmacology, University of Verona, Verona, Italy
| | - Dario Brunetti
- Unità di Genetica Medica e Neurogenetica, Fondazione IRCCS Istituto Neurologico Carlo Besta, Milan, Italy
- Department of Medical Biotechnology and Translational Medicine, University of Milan, Milan, Italy
| |
Collapse
|
11
|
Villalobos-Alva J, Ochoa-Toledo L, Villalobos-Alva MJ, Aliseda A, Pérez-Escamirosa F, Altamirano-Bustamante NF, Ochoa-Fernández F, Zamora-Solís R, Villalobos-Alva S, Revilla-Monsalve C, Kemper-Valverde N, Altamirano-Bustamante MM. Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field. Front Bioeng Biotechnol 2022; 10:788300. [PMID: 35875501 PMCID: PMC9301016 DOI: 10.3389/fbioe.2022.788300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2021] [Accepted: 05/25/2022] [Indexed: 11/23/2022] Open
Abstract
Proteins are some of the most fascinating and challenging molecules in the universe, and they pose a big challenge for artificial intelligence. The implementation of machine learning/AI in protein science gives rise to a world of knowledge adventures in the workhorse of the cell and proteome homeostasis, which are essential for making life possible. This opens up epistemic horizons thanks to a coupling of human tacit-explicit knowledge with machine learning power, the benefits of which are already tangible, such as important advances in protein structure prediction. Moreover, the driving force behind the protein processes of self-organization, adjustment, and fitness requires a space corresponding to gigabytes of life data in its order of magnitude. There are many tasks such as novel protein design, protein folding pathways, and synthetic metabolic routes, as well as protein-aggregation mechanisms, pathogenesis of protein misfolding and disease, and proteostasis networks that are currently unexplored or unrevealed. In this systematic review and biochemical meta-analysis, we aim to contribute to bridging the gap between what we call binomial artificial intelligence (AI) and protein science (PS), a growing research enterprise with exciting and promising biotechnological and biomedical applications. We undertake our task by exploring "the state of the art" in AI and machine learning (ML) applications to protein science in the scientific literature to address some critical research questions in this domain, including What kind of tasks are already explored by ML approaches to protein sciences? What are the most common ML algorithms and databases used? What is the situational diagnostic of the AI-PS inter-field? What do ML processing steps have in common? We also formulate novel questions such as Is it possible to discover what the rules of protein evolution are with the binomial AI-PS? How do protein folding pathways evolve? What are the rules that dictate the folds? What are the minimal nuclear protein structures? How do protein aggregates form and why do they exhibit different toxicities? What are the structural properties of amyloid proteins? How can we design an effective proteostasis network to deal with misfolded proteins? We are a cross-functional group of scientists from several academic disciplines, and we have conducted the systematic review using a variant of the PICO and PRISMA approaches. The search was carried out in four databases (PubMed, Bireme, OVID, and EBSCO Web of Science), resulting in 144 research articles. After three rounds of quality screening, 93 articles were finally selected for further analysis. A summary of our findings is as follows: regarding AI applications, there are mainly four types: 1) genomics, 2) protein structure and function, 3) protein design and evolution, and 4) drug design. In terms of the ML algorithms and databases used, supervised learning was the most common approach (85%). As for the databases used for the ML models, PDB and UniprotKB/Swissprot were the most common ones (21 and 8%, respectively). Moreover, we identified that approximately 63% of the articles organized their results into three steps, which we labeled pre-process, process, and post-process. A few studies combined data from several databases or created their own databases after the pre-process. Our main finding is that, as of today, there are no research road maps serving as guides to address gaps in our knowledge of the AI-PS binomial. All research efforts to collect, integrate multidimensional data features, and then analyze and validate them are, so far, uncoordinated and scattered throughout the scientific literature without a clear epistemic goal or connection between the studies. Therefore, our main contribution to the scientific literature is to offer a road map to help solve problems in drug design, protein structures, design, and function prediction while also presenting the "state of the art" on research in the AI-PS binomial until February 2021. Thus, we pave the way toward future advances in the synthetic redesign of novel proteins and protein networks and artificial metabolic pathways, learning lessons from nature for the welfare of humankind. Many of the novel proteins and metabolic pathways are currently non-existent in nature, nor are they used in the chemical industry or biomedical field.
Collapse
Affiliation(s)
- Jalil Villalobos-Alva
- Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
| | - Luis Ochoa-Toledo
- Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | - Mario Javier Villalobos-Alva
- Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
| | - Atocha Aliseda
- Instituto de Investigaciones Filosóficas, Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | - Fernando Pérez-Escamirosa
- Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | | | - Francine Ochoa-Fernández
- Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
| | - Ricardo Zamora-Solís
- Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
| | - Sebastián Villalobos-Alva
- Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
| | - Cristina Revilla-Monsalve
- Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
| | - Nicolás Kemper-Valverde
- Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | - Myriam M. Altamirano-Bustamante
- Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
| |
Collapse
|
12
|
Thumuluri V, Almagro Armenteros JJ, Johansen A, Nielsen H, Winther O. DeepLoc 2.0: multi-label subcellular localization prediction using protein language models. Nucleic Acids Res 2022; 50:W228-W234. [PMID: 35489069 PMCID: PMC9252801 DOI: 10.1093/nar/gkac278] [Citation(s) in RCA: 292] [Impact Index Per Article: 97.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Revised: 04/07/2022] [Accepted: 04/19/2022] [Indexed: 12/19/2022] Open
Abstract
The prediction of protein subcellular localization is of great relevance for proteomics research. Here, we propose an update to the popular tool DeepLoc with multi-localization prediction and improvements in both performance and interpretability. For training and validation, we curate eukaryotic and human multi-location protein datasets with stringent homology partitioning and enriched with sorting signal information compiled from the literature. We achieve state-of-the-art performance in DeepLoc 2.0 by using a pre-trained protein language model. It has the further advantage that it uses sequence input rather than relying on slower protein profiles. We provide two means of better interpretability: an attention output along the sequence and highly accurate prediction of nine different types of protein sorting signals. We find that the attention output correlates well with the position of sorting signals. The webserver is available at services.healthtech.dtu.dk/service.php?DeepLoc-2.0.
Collapse
Affiliation(s)
| | - José Juan Almagro Armenteros
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen 2200, Denmark
- Department of Genetics, Stanford University School of Medicine, Stanford 94305, CA, USA
| | - Alexander Rosenberg Johansen
- Department of Computer Science, Stanford University, Stanford 94305, CA, USA
- Department of Genetics, Stanford University School of Medicine, Stanford 94305, CA, USA
| | - Henrik Nielsen
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby 2800, Denmark
| | - Ole Winther
- Center for Genomic Medicine, Rigshospitalet (Copenhagen University Hospital), Copenhagen 2100, Denmark
- Department of Biology, Bioinformatics Centre, University of Copenhagen, Copenhagen 2200, Denmark
- Section for Cognitive Systems, Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kongens Lyngby 2800, Denmark
| |
Collapse
|
13
|
Figueiredo L, Santos RB, Figueiredo A. The grapevine aspartic protease gene family: characterization and expression modulation in response to Plasmopara viticola. JOURNAL OF PLANT RESEARCH 2022; 135:501-515. [PMID: 35426578 DOI: 10.1007/s10265-022-01390-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Accepted: 03/28/2022] [Indexed: 06/14/2023]
Abstract
Grapevine aspartic proteases gene family is characterized and five VviAPs appear to be involved in grapevine defense against downy mildew. Grapevine (Vitis vinifera L.) is one of the most important crops worldwide. However, it is highly susceptible to the downy mildew disease caused by Plasmopara viticola (Berk. & Curt.) Berl. & De Toni. To minimize the use of fungicides used to control P. viticola, it is essential to gain a deeper comprehension on this pathosystem and proteases have gained particular interest in the past decade. Proteases were shown to actively participate in plant-pathogen interactions, not only in the processes that lead to plant cell death, stress responses and protein processing/degradation but also as components of the recognition and signalling pathways. The aim of this study was to identify and characterize the aspartic proteases (APs) involvement in grapevine defense against P. viticola. A genome-wide search and bioinformatics characterization of the V. vinifera AP gene family was conducted and a total of 81 APs proteins, coded by 65 genes, were found. VviAPs proteins can be divided into three categories, similar to those previously described for other plants. Twelve APs coding genes were selected, and expression analysis was conducted at several time-points after inoculation in both compatible and incompatible interactions. Five grapevine APs may be involved in grapevine tolerance against P. viticola. Our findings provide an overall understanding of the VviAPs gene family and establish better groundwork to further describe the roles of VviAPs in defense against P. viticola.
Collapse
Affiliation(s)
- Laura Figueiredo
- BioISI - Instituto de Biosistemas e Ciências Integrativas, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisboa, Portugal
| | - Rita B Santos
- BioISI - Instituto de Biosistemas e Ciências Integrativas, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisboa, Portugal.
| | - Andreia Figueiredo
- BioISI - Instituto de Biosistemas e Ciências Integrativas, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016, Lisboa, Portugal
| |
Collapse
|
14
|
Guo Y, Chen F, Luo J, Qiao M, Zeng W, Li J, Xu W. The DUF288 domain containing proteins GhSTLs participate in cotton fiber cellulose synthesis and impact on fiber elongation. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2022; 316:111168. [PMID: 35151452 DOI: 10.1016/j.plantsci.2021.111168] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Revised: 12/13/2021] [Accepted: 12/21/2021] [Indexed: 06/14/2023]
Abstract
Cotton is one of the most important economic crops in the world, with over 90 % cellulose in the mature fiber. However, the cellulose synthesis mechanism in cotton fibers is poorly understood. Here, we identified four DUF288 domain containing proteins, which we designated GhSTL1-4. These four GhSTL genes are highly expressed in 6 days post anthesis (dpa) and 20 dpa cotton fibers. They are localized to the Golgi apparatus, and can rescue the growth defects in primary cell wall (PCW) and secondary cell wall (SCW) of cellulose synthesis of the Arabidopsis stl1stl2 double mutant at varying degrees. Silencing of GhSTLs resulted in reduced cellulose content and shorter fibers. In addition, split-ubiquitin membrane yeast two-hybrid analysis showed that GhSTL1 and GhSTL4 can interact with PCW-related GhCesA6-1/6-3 and SCW-associated GhCesA7-1/7-2. GhSTL3 can interact with SCW-related GhCesA4-3. These interactions are further confirmed by firefly luciferase complementation imaging assay. Together, we demonstrate that GhSTLs can selectively interact with both the PCW and SCW-associated GhCesAs and impact on cellulose synthesis and fiber development. Our findings provide insights into the mechanism underlying cellulose biosynthesis in cotton fibers, and offer potential candidate genes to coordinate PCW and SCW cellulose synthesis of cotton fibers for developing elite cotton varieties with enhanced fiber quality.
Collapse
Affiliation(s)
- Yanjun Guo
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan 430079, China
| | - Feng Chen
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan 430079, China
| | - Jingwen Luo
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan 430079, China
| | - Mengfei Qiao
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan 430079, China
| | - Wei Zeng
- Sino-Australia Plant Cell Wall Research Centre, State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Hangzhou 311300, China
| | - Juan Li
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan 430079, China
| | - Wenliang Xu
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan 430079, China.
| |
Collapse
|
15
|
Norero NS, Rey Burusco MF, D’Ippólito S, Décima Oneto CA, Massa GA, Castellote MA, Feingold SE, Guevara MG. Genome-Wide Analyses of Aspartic Proteases on Potato Genome ( Solanum tuberosum): Generating New Tools to Improve the Resistance of Plants to Abiotic Stress. PLANTS (BASEL, SWITZERLAND) 2022; 11:plants11040544. [PMID: 35214878 PMCID: PMC8875628 DOI: 10.3390/plants11040544] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Revised: 12/04/2021] [Accepted: 01/06/2022] [Indexed: 05/11/2023]
Abstract
Aspartic proteases are proteolytic enzymes widely distributed in living organisms and viruses. Although they have been extensively studied in many plant species, they are poorly described in potatoes. The present study aimed to identify and characterize S. tuberosum aspartic proteases. Gene structure, chromosome and protein domain organization, phylogeny, and subcellular predicted localization were analyzed and integrated with RNAseq data from different tissues, organs, and conditions focused on abiotic stress. Sixty-two aspartic protease genes were retrieved from the potato genome, distributed in 12 chromosomes. A high number of intronless genes and segmental and tandem duplications were detected. Phylogenetic analysis revealed eight StAP groups, named from StAPI to StAPVIII, that were differentiated into typical (StAPI), nucellin-like (StAPIIIa), and atypical aspartic proteases (StAPII, StAPIIIb to StAPVIII). RNAseq data analyses showed that gene expression was consistent with the presence of cis-acting regulatory elements on StAP promoter regions related to water deficit. The study presents the first identification and characterization of 62 aspartic protease genes and proteins on the potato genome and provides the baseline material for functional gene determinations and potato breeding programs, including gene editing mediated by CRISPR.
Collapse
Affiliation(s)
- Natalia Sigrid Norero
- Laboratory of Agrobiotechnology IPADS (INTA—CONICET), Balcarce B7620, Argentina; (N.S.N.); (M.F.R.B.); (C.A.D.O.); (G.A.M.); (M.A.C.); (S.E.F.)
| | - María Florencia Rey Burusco
- Laboratory of Agrobiotechnology IPADS (INTA—CONICET), Balcarce B7620, Argentina; (N.S.N.); (M.F.R.B.); (C.A.D.O.); (G.A.M.); (M.A.C.); (S.E.F.)
- Faculty of Agricultural Sciences, University National of Mar del Plata, Balcarce B7620, Argentina
| | - Sebastián D’Ippólito
- Institute of Biological Research, University of Mar del Plata (IIB-UNMdP), Mar del Plata B7600, Argentina;
- National Scientific and Technical Research Council, Argentina (CONICET), Buenos Aires C1499, Argentina
| | - Cecilia Andrea Décima Oneto
- Laboratory of Agrobiotechnology IPADS (INTA—CONICET), Balcarce B7620, Argentina; (N.S.N.); (M.F.R.B.); (C.A.D.O.); (G.A.M.); (M.A.C.); (S.E.F.)
| | - Gabriela Alejandra Massa
- Laboratory of Agrobiotechnology IPADS (INTA—CONICET), Balcarce B7620, Argentina; (N.S.N.); (M.F.R.B.); (C.A.D.O.); (G.A.M.); (M.A.C.); (S.E.F.)
- Faculty of Agricultural Sciences, University National of Mar del Plata, Balcarce B7620, Argentina
| | - Martín Alfredo Castellote
- Laboratory of Agrobiotechnology IPADS (INTA—CONICET), Balcarce B7620, Argentina; (N.S.N.); (M.F.R.B.); (C.A.D.O.); (G.A.M.); (M.A.C.); (S.E.F.)
| | - Sergio Enrique Feingold
- Laboratory of Agrobiotechnology IPADS (INTA—CONICET), Balcarce B7620, Argentina; (N.S.N.); (M.F.R.B.); (C.A.D.O.); (G.A.M.); (M.A.C.); (S.E.F.)
| | - María Gabriela Guevara
- Institute of Biological Research, University of Mar del Plata (IIB-UNMdP), Mar del Plata B7600, Argentina;
- National Scientific and Technical Research Council, Argentina (CONICET), Buenos Aires C1499, Argentina
- Correspondence: or
| |
Collapse
|
16
|
Dash A, Ghag SB. Genome-wide in silico characterization and stress induced expression analysis of BcL-2 associated athanogene (BAG) family in Musa spp. Sci Rep 2022; 12:625. [PMID: 35022483 PMCID: PMC8755836 DOI: 10.1038/s41598-021-04707-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 12/21/2021] [Indexed: 11/23/2022] Open
Abstract
Programmed cell death (PCD) is a genetically controlled process for the selective removal of damaged cells. Though understanding about plant PCD has improved over years, the mechanisms are yet to be fully deciphered. Among the several molecular players of PCD in plants, B cell lymphoma 2 (Bcl-2)-associated athanogene (BAG) family of co-chaperones are evolutionary conserved and regulate cell death, growth and development. In this study, we performed a genome-wide in silico analysis of the MusaBAG gene family in a globally important fruit crop banana. Thirteen MusaBAG genes were identified, out of which MusaBAG1, 7 and 8 genes were found to have multiple copies. MusaBAG genes were distributed on seven out of 11 chromosomes in banana. Except for one paralog of MusaBAG8 all the other 12 proteins have characteristic BAG domain. MusaBAG1, 2 and 4 have an additional ubiquitin-like domain whereas MusaBAG5-8 have a calmodulin binding motif. Most of the MusaBAG proteins were predicted to be localized in the nucleus and mitochondria or chloroplast. The in silico cis-regulatory element analysis suggested regulation associated with photoperiodic control, abiotic and biotic stress. The phylogenetic analysis revealed 2 major clusters. Digital gene expression analysis and quantitative real-time RT-PCR depicted the differential expression pattern of MusaBAG genes under abiotic and biotic stress conditions. Further studies are warranted to uncover the role of each of these proteins in growth, PCD and stress responses so as to explore them as candidate genes for engineering transgenic banana plants with improved agronomic traits.
Collapse
Affiliation(s)
- Ashutosh Dash
- School of Biological Sciences, UM-DAE Centre for Excellence in Basic Sciences, University of Mumbai Campus, Kalina, Santacruz (East), Mumbai, 400 098, India
| | - Siddhesh B Ghag
- School of Biological Sciences, UM-DAE Centre for Excellence in Basic Sciences, University of Mumbai Campus, Kalina, Santacruz (East), Mumbai, 400 098, India.
| |
Collapse
|
17
|
Kataria R, Kaundal R. Deciphering the Crosstalk Mechanisms of Wheat-Stem Rust Pathosystem: Genome-Scale Prediction Unravels Novel Host Targets. FRONTIERS IN PLANT SCIENCE 2022; 13:895480. [PMID: 35800602 PMCID: PMC9253690 DOI: 10.3389/fpls.2022.895480] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Accepted: 05/31/2022] [Indexed: 05/04/2023]
Abstract
Triticum aestivum (wheat), a major staple food grain, is affected by various biotic stresses. Among these, fungal diseases cause about 15-20% of yield loss, worldwide. In this study, we performed a comparative analysis of protein-protein interactions between two Puccinia graminis races (Pgt 21-0 and Pgt Ug99) that cause stem (black) rust in wheat. The available molecular techniques to study the host-pathogen interaction mechanisms are expensive and labor-intensive. We implemented two computational approaches (interolog and domain-based) for the prediction of PPIs and performed various functional analysis to determine the significant differences between the two pathogen races. The analysis revealed that T. aestivum-Pgt 21-0 and T. aestivum-Pgt Ug99 interactomes consisted of ∼90M and ∼56M putative PPIs, respectively. In the predicted PPIs, we identified 115 Pgt 21-0 and 34 Pgt Ug99 potential effectors that were highly involved in pathogen virulence and development. Functional enrichment analysis of the host proteins revealed significant GO terms and KEGG pathways such as O-methyltransferase activity (GO:0008171), regulation of signal transduction (GO:0009966), lignin metabolic process (GO:0009808), plastid envelope (GO:0009526), plant-pathogen interaction pathway (ko04626), and MAPK pathway (ko04016) that are actively involved in plant defense and immune signaling against the biotic stresses. Subcellular localization analysis anticipated the host plastid as a primary target for pathogen attack. The highly connected host hubs in the protein interaction network belonged to protein kinase domain including Ser/Thr protein kinase, MAPK, and cyclin-dependent kinase. We also identified 5,577 transcription factors in the interactions, associated with plant defense during biotic stress conditions. Additionally, novel host targets that are resistant to stem rust disease were also identified. The present study elucidates the functional differences between Pgt 21-0 and Pgt Ug99, thus providing the researchers with strain-specific information for further experimental validation of the interactions, and the development of durable, disease-resistant crop lines.
Collapse
Affiliation(s)
- Raghav Kataria
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
| | - Rakesh Kaundal
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
- Bioinformatics Facility, Center for Integrated BioSystems, Utah State University, Logan, UT, United States
- Department of Computer Science, College of Science, Utah State University, Logan, UT, United States
- *Correspondence: Rakesh Kaundal,
| |
Collapse
|
18
|
Hu L, Liu P, Jin Z, Sun J, Weng Y, Chen P, Du S, Wei A, Li Y. A mutation in CsHY2 encoding a phytochromobilin (PΦB) synthase leads to an elongated hypocotyl 1(elh1) phenotype in cucumber (Cucumis sativus L.). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:2639-2652. [PMID: 34091695 DOI: 10.1007/s00122-021-03849-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 04/29/2021] [Indexed: 06/12/2023]
Abstract
The elongated hypocotyl1 (elh1) mutant in cucumber is due to a mutation in CsHY2, which is a homolog of the Arabidopsis HY2 encoding the phytochromobilin (PΦB) synthase for phytochrome biosynthesis Hypocotyl length is a critical determinant in establishing high quality seedlings for successful cucumber production, but knowledge on the molecular regulation of hypocotyl growth in cucumber is very limited. Here, we reported identification and characterization of a cucumber elongated hypocotyl 1 (elh1) mutant. We found that the longer hypocotyl in elh1 was due to longitudinal growth of hypocotyl cells. With fine mapping, the elh1 locus was delimited to a 20.9-kb region containing three annotated genes; only one polymorphism was identified in this region between two parental lines, which was a non-synonymous SNP (G28153633A) in the third exon of CsHY2 (CsGy1G030000) that encodes a phytochromobilin (PΦB) synthase. Uniqueness of the mutant allele at CsHY2 was verified in natural cucumber populations. Ectopic expression of CsHY2 in Arabidopsis hy2-1 long-hypocotyl mutant led to reduced hypocotyl length. The PΦB protein was targeted to the chloroplast. The expression levels of CsHY2 and five phytochrome genes CsPHYA1, CsPHYA2, CsPHYB, CsPHYC and CsPHYE were all significantly down-regulated while several cell elongation related genes were up-regulated in elh1 mutant compared to wild-type cucumber, which are correlated with dynamic hypocotyl elongation in the mutant. RNA-seq analysis in the WT and mutant revealed differentially expressed genes involved in porphyrin and chlorophyll metabolisms, cell elongation and plant hormone signal transduction pathways. This is the first report to characterize and clone the CsHY2 gene in cucumber. This work reveals the important of CsHY2 in regulating hypocotyl length and extends our understanding of the roles of CsHY2 in cucumber.
Collapse
Affiliation(s)
- Liangliang Hu
- College of Horticulture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Peng Liu
- College of Horticulture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Zhuoshuai Jin
- College of Horticulture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Jing Sun
- College of Horticulture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Yiqun Weng
- Horticulture Department, USDA-ARS Vegetable Crops Research Unit, University of Wisconsin, Madison, WI, 53706, USA
| | - Peng Chen
- College of Life Science, Northwest A & F University, Yangling, 712100, Shaanxi,, China
| | - Shengli Du
- Tianjin Vegetable Research Center, Tianjin, 300192, China
- National Key Laboratory of Vegetable Germplasm Innovation, Tianjin, 300192, China
| | - Aimin Wei
- Tianjin Vegetable Research Center, Tianjin, 300192, China.
- National Key Laboratory of Vegetable Germplasm Innovation, Tianjin, 300192, China.
| | - Yuhong Li
- College of Horticulture, Northwest A&F University, Yangling, 712100, Shaanxi, China.
| |
Collapse
|
19
|
Ye Z, Yang W, Yang Y, Ouyang D. Interpretable machine learning methods for in vitro pharmaceutical formulation development. FOOD FRONTIERS 2021. [DOI: 10.1002/fft2.78] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Affiliation(s)
- Zhuyifan Ye
- State Key Laboratory of Quality Research in Chinese Medicine Institute of Chinese Medical Sciences (ICMS) University of Macau Macau China
| | - Wenmian Yang
- State Key Laboratory of Internet of Things for Smart City University of Macau Macau China
| | - Yilong Yang
- School of Software Beihang University Beijing China
| | - Defang Ouyang
- State Key Laboratory of Quality Research in Chinese Medicine Institute of Chinese Medical Sciences (ICMS) University of Macau Macau China
| |
Collapse
|
20
|
Hafeez A, Gě Q, Zhāng Q, Lǐ J, Gōng J, Liú R, Shí Y, Shāng H, Liú À, Iqbal MS, Dèng X, Razzaq A, Ali M, Yuán Y, Gǒng W. Multi-responses of O-methyltransferase genes to salt stress and fiber development of Gossypium species. BMC PLANT BIOLOGY 2021; 21:37. [PMID: 33430775 PMCID: PMC7798291 DOI: 10.1186/s12870-020-02786-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 12/07/2020] [Indexed: 06/12/2023]
Abstract
BACKGROUND O-methyltransferases (OMTs) are an important group of enzymes that catalyze the transfer of a methyl group from S-adenosyl-L-methionine to their acceptor substrates. OMTs are divided into several groups according to their structural features. In Gossypium species, they are involved in phenolics and flavonoid pathways. Phenolics defend the cellulose fiber from dreadful external conditions of biotic and abiotic stresses, promoting strength and growth of plant cell wall. RESULTS An OMT gene family, containing a total of 192 members, has been identified and characterized in three main Gossypium species, G. hirsutum, G. arboreum and G. raimondii. Cis-regulatory elements analysis suggested important roles of OMT genes in growth, development, and defense against stresses. Transcriptome data of different fiber developmental stages in Chromosome Substitution Segment Lines (CSSLs), Recombination Inbred Lines (RILs) with excellent fiber quality, and standard genetic cotton cultivar TM-1 demonstrate that up-regulation of OMT genes at different fiber developmental stages, and abiotic stress treatments have some significant correlations with fiber quality formation, and with salt stress response. Quantitative RT-PCR results revealed that GhOMT10_Dt and GhOMT70_At genes had a specific expression in response to salt stress while GhOMT49_At, GhOMT49_Dt, and GhOMT48_At in fiber elongation and secondary cell wall stages. CONCLUSIONS Our results indicate that O-methyltransferase genes have multi-responses to salt stress and fiber development in Gossypium species and that they may contribute to salt tolerance or fiber quality formation in Gossypium.
Collapse
Affiliation(s)
- Abdul Hafeez
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
- Sindh Agriculture University Tandojam, Hyderabad, Sindh, 70060, Pakistan
| | - Qún Gě
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Qí Zhāng
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Jùnwén Lǐ
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Jǔwǔ Gōng
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Ruìxián Liú
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Yùzhēn Shí
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Hǎihóng Shāng
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Àiyīng Liú
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Muhammad S Iqbal
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Xiǎoyīng Dèng
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Abdul Razzaq
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China
| | - Muharam Ali
- Sindh Agriculture University Tandojam, Hyderabad, Sindh, 70060, Pakistan.
| | - Yǒulù Yuán
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China.
| | - Wànkuí Gǒng
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, Henan, China.
| |
Collapse
|
21
|
Abstract
The elucidation of the subcellular localization of proteins is very important in order to deeply understand their functions. In fact, proteins activities are strictly correlated to the cellular compartment and microenvironment in which they are present.In recent years, several effective and reliable proteomics techniques and computational methods have been developed and implemented in order to identify the proteins subcellular localization. This process is often time-consuming and expensive, but the recent technological and bioinformatics progress allowed the development of more accurate and simple workflows to determine the localization, interactions, and functions of proteins.In the following chapter, a brief introduction on the importance of knowing subcellular localization of proteins will be presented. Then, sample preparation protocols, proteomic methods, data analysis strategies, and software for the prediction of proteins localization will be presented and discussed. Finally, the more recent and advanced spatial proteomics techniques will be shown.
Collapse
Affiliation(s)
- Elettra Barberis
- Department of Translational Medicine, University of Piemonte Orientale, Novara, Italy
- Center for Translational Research on Autoimmune and Allergic Diseases, CAAD, University of Piemonte Orientale, Novara, Italy
| | - Emilio Marengo
- Department of Sciences and Technological Innovation, University of Piemonte Orientale, Alessandria, Italy
- Center for Translational Research on Autoimmune and Allergic Diseases, CAAD, University of Piemonte Orientale, Novara, Italy
| | - Marcello Manfredi
- Department of Translational Medicine, University of Piemonte Orientale, Novara, Italy.
- Center for Translational Research on Autoimmune and Allergic Diseases, CAAD, University of Piemonte Orientale, Novara, Italy.
| |
Collapse
|
22
|
Muggia L, Ametrano CG, Sterflinger K, Tesei D. An Overview of Genomics, Phylogenomics and Proteomics Approaches in Ascomycota. Life (Basel) 2020; 10:E356. [PMID: 33348904 PMCID: PMC7765829 DOI: 10.3390/life10120356] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Revised: 12/10/2020] [Accepted: 12/12/2020] [Indexed: 12/26/2022] Open
Abstract
Fungi are among the most successful eukaryotes on Earth: they have evolved strategies to survive in the most diverse environments and stressful conditions and have been selected and exploited for multiple aims by humans. The characteristic features intrinsic of Fungi have required evolutionary changes and adaptations at deep molecular levels. Omics approaches, nowadays including genomics, metagenomics, phylogenomics, transcriptomics, metabolomics, and proteomics have enormously advanced the way to understand fungal diversity at diverse taxonomic levels, under changeable conditions and in still under-investigated environments. These approaches can be applied both on environmental communities and on individual organisms, either in nature or in axenic culture and have led the traditional morphology-based fungal systematic to increasingly implement molecular-based approaches. The advent of next-generation sequencing technologies was key to boost advances in fungal genomics and proteomics research. Much effort has also been directed towards the development of methodologies for optimal genomic DNA and protein extraction and separation. To date, the amount of proteomics investigations in Ascomycetes exceeds those carried out in any other fungal group. This is primarily due to the preponderance of their involvement in plant and animal diseases and multiple industrial applications, and therefore the need to understand the biological basis of the infectious process to develop mechanisms for biologic control, as well as to detect key proteins with roles in stress survival. Here we chose to present an overview as much comprehensive as possible of the major advances, mainly of the past decade, in the fields of genomics (including phylogenomics) and proteomics of Ascomycota, focusing particularly on those reporting on opportunistic pathogenic, extremophilic, polyextremotolerant and lichenized fungi. We also present a review of the mostly used genome sequencing technologies and methods for DNA sequence and protein analyses applied so far for fungi.
Collapse
Affiliation(s)
- Lucia Muggia
- Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
| | - Claudio G. Ametrano
- Grainger Bioinformatics Center, Department of Science and Education, The Field Museum, Chicago, IL 60605, USA;
| | - Katja Sterflinger
- Academy of Fine Arts Vienna, Institute of Natual Sciences and Technology in the Arts, 1090 Vienna, Austria;
| | - Donatella Tesei
- Department of Biotechnology, University of Natural Resources and Life Sciences, 1190 Vienna, Austria;
| |
Collapse
|
23
|
Imai K, Nakai K. Tools for the Recognition of Sorting Signals and the Prediction of Subcellular Localization of Proteins From Their Amino Acid Sequences. Front Genet 2020; 11:607812. [PMID: 33324450 PMCID: PMC7723863 DOI: 10.3389/fgene.2020.607812] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 11/03/2020] [Indexed: 12/13/2022] Open
Abstract
At the time of translation, nascent proteins are thought to be sorted into their final subcellular localization sites, based on the part of their amino acid sequences (i.e., sorting or targeting signals). Thus, it is interesting to computationally recognize these signals from the amino acid sequences of any given proteins and to predict their final subcellular localization with such information, supplemented with additional information (e.g., k-mer frequency). This field has a long history and many prediction tools have been released. Even in this era of proteomic atlas at the single-cell level, researchers continue to develop new algorithms, aiming at accessing the impact of disease-causing mutations/cell type-specific alternative splicing, for example. In this article, we overview the entire field and discuss its future direction.
Collapse
Affiliation(s)
- Kenichiro Imai
- Cellular and Molecular Biotechnology Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
| | - Kenta Nakai
- The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
24
|
Giordana L, Nowicki C. Two phylogenetically divergent isocitrate dehydrogenases are encoded in Leishmania parasites. Molecular and functional characterization of Leishmania mexicana isoenzymes with specificity towards NAD + and NADP .. Mol Biochem Parasitol 2020; 240:111320. [PMID: 32980452 DOI: 10.1016/j.molbiopara.2020.111320] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 08/26/2020] [Accepted: 08/27/2020] [Indexed: 10/23/2022]
Abstract
Leishmania parasites are of great relevance to public health because they are the causative agents of various long-term and health-threatening diseases in humans. Dependent on the manifestation, drugs either require difficult and lengthy administration, are toxic, expensive, not very effective or have lost efficacy due to the resistance developed by these pathogens against clinical treatments. The intermediary metabolism of Leishmania parasites is characterized by several unusual features, among which whether the Krebs cycle operates in a cyclic and/or in a non-cyclic mode is included. Our survey of the genomes of Leishmania species and monoxenous parasites such as those of the genera Crithidia and Leptomonas (http://www.tritrypdb.org) revealed that two genes encoding putative isocitrate dehydrogenases (IDHs) -with distantly related sequences- are strictly conserved among these parasites. Thus, in this study, we aimed to functionally characterize the two leishmanial IDH isoenzymes, for which we selected the genes LmxM10.0290 (Lmex_IDH-90) and LmxM32.2550 (Lmex_IDH-50) from L. mexicana. Phylogenetic analysis showed that Lmex_IDH-50 clustered with members of Subfamily I, which contains mainly archaeal and bacterial IDHs, and that Lmex_IDH-90 was a close relative of eukaryotic enzymes comprised within Subfamily II IDHs. 3-D homology modeling predicted that both IDHs exhibited the typical folding motifs recognized as canonical for prokaryotic and eukaryotic counterparts, respectively. Both IDH isoforms displayed dual subcellular localization, in the cytosol and the mitochondrion. Kinetic studies showed that Lmex_IDH-50 exclusively catalyzed the reduction of NAD+, while Lmex_IDH-90 solely used NADP+ as coenzyme. Besides, Lmex_IDH-50 differed from Lmex_IDH-90 by exhibiting a nearly 20-fold lower apparent Km value towards isocitrate (2.0 μM vs 43 μM). Our findings showed, for the first time, that the genus Leishmania differentiates not only from other trypanosomatids such as Trypanosoma cruzi and Trypanosoma brucei, but also from most living organisms, by exhibiting two functional homo-dimeric IDHs, highly specific towards NAD+ and NADP+, respectively. It is tempting to argue that any or both types of IDHs might be directly or indirectly linked to the Krebs cycle and/or to the de novo synthesis of glutamate. Our results about the biochemical and structural features of leishmanial IDHs show the relevance of deepening our knowledge of the metabolic processes in these pathogenic parasites to potentially identify new therapeutic targets.
Collapse
Affiliation(s)
- Lucila Giordana
- Universidad de Buenos Aires, Facultad de Farmacia y Bioquímica, Instituto de Química y Fisicoquímica Biológica (IQUIFIB-CONICET), Junín 956, C1113AAD, Buenos Aires, Argentina
| | - Cristina Nowicki
- Universidad de Buenos Aires, Facultad de Farmacia y Bioquímica, Instituto de Química y Fisicoquímica Biológica (IQUIFIB-CONICET), Junín 956, C1113AAD, Buenos Aires, Argentina.
| |
Collapse
|
25
|
Marconi G, Aiello D, Kindiger B, Storchi L, Marrone A, Reale L, Terzaroli N, Albertini E. The Role of APOSTART in Switching between Sexuality and Apomixis in Poa pratensis. Genes (Basel) 2020; 11:genes11080941. [PMID: 32824095 PMCID: PMC7464379 DOI: 10.3390/genes11080941] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 08/11/2020] [Accepted: 08/11/2020] [Indexed: 12/20/2022] Open
Abstract
The production of seeds without sex is considered the holy grail of plant biology. The transfer of apomixis to various crop species has the potential to transform plant breeding, since it will allow new varieties to retain valuable traits thorough asexual reproduction. Therefore, a greater molecular understanding of apomixis is fundamental. In a previous work we identified a gene, namely APOSTART, that seemed to be involved in this asexual mode of reproduction, which is very common in Poa pratensis L., and here we present a detailed work aimed at clarifying its role in apomixis. In situ hybridization showed that PpAPOSTART is expressed in reproductive tissues from pre-meiosis to embryo development. Interestingly, it is expressed early in few nucellar cells of apomictic individuals possibly switching from a somatic to a reproductive cell as in aposporic apomixis. Moreover, out of 13 APOSTART members, we identified one, APOSTART_6, as specifically expressed in flower tissue. APOSTART_6 also exhibited delayed expression in apomictic genotypes when compared with sexual types. Most importantly, the SCAR (Sequence Characterized Amplified Region) derived from the APOSTART_6 sequence completely co-segregated with apomixis.
Collapse
Affiliation(s)
- Gianpiero Marconi
- Dipartimento di Scienze Agrarie, Alimentari e Ambientali, Università degli Studi di Perugia, Borgo XX Giugno 74, 06121 Perugia, Italy; (G.M.); (D.A.); (L.R.); (N.T.)
| | - Domenico Aiello
- Dipartimento di Scienze Agrarie, Alimentari e Ambientali, Università degli Studi di Perugia, Borgo XX Giugno 74, 06121 Perugia, Italy; (G.M.); (D.A.); (L.R.); (N.T.)
| | - Bryan Kindiger
- USDA-ARS, Grazinglands Research Laboratory, 7207 West Cheyenne St., El Reno, OK 73036, USA;
| | - Loriano Storchi
- Dipartimento di Farmacia, Università G. d’Annunzio, via dei Vestini 31, 66100 Chieti, Italy; (L.S.); (A.M.)
- Molecular Discovery Limited, Elstree WD6 3FG, UK
| | - Alessandro Marrone
- Dipartimento di Farmacia, Università G. d’Annunzio, via dei Vestini 31, 66100 Chieti, Italy; (L.S.); (A.M.)
| | - Lara Reale
- Dipartimento di Scienze Agrarie, Alimentari e Ambientali, Università degli Studi di Perugia, Borgo XX Giugno 74, 06121 Perugia, Italy; (G.M.); (D.A.); (L.R.); (N.T.)
| | - Niccolò Terzaroli
- Dipartimento di Scienze Agrarie, Alimentari e Ambientali, Università degli Studi di Perugia, Borgo XX Giugno 74, 06121 Perugia, Italy; (G.M.); (D.A.); (L.R.); (N.T.)
| | - Emidio Albertini
- Dipartimento di Scienze Agrarie, Alimentari e Ambientali, Università degli Studi di Perugia, Borgo XX Giugno 74, 06121 Perugia, Italy; (G.M.); (D.A.); (L.R.); (N.T.)
- Correspondence:
| |
Collapse
|
26
|
Lamelas L, Valledor L, Escandón M, Pinto G, Cañal MJ, Meijón M. Integrative analysis of the nuclear proteome in Pinus radiata reveals thermopriming coupled to epigenetic regulation. JOURNAL OF EXPERIMENTAL BOTANY 2020; 71:2040-2057. [PMID: 31781741 PMCID: PMC7094079 DOI: 10.1093/jxb/erz524] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Accepted: 11/27/2019] [Indexed: 05/19/2023]
Abstract
Despite it being an important issue in the context of climate change, for most plant species it is not currently known how abiotic stresses affect nuclear proteomes and mediate memory effects. This study examines how Pinus radiata nuclei respond, adapt, 'remember', and 'learn' from heat stress. Seedlings were heat-stressed at 45 °C for 10 d and then allowed to recover. Nuclear proteins were isolated and quantified by nLC-MS/MS, the dynamics of tissue DNA methylation were examined, and the potential acquired memory was analysed in recovered plants. In an additional experiment, the expression of key gene genes was also quantified. Specific nuclear heat-responsive proteins were identified, and their biological roles were evaluated using a systems biology approach. In addition to heat-shock proteins, several clusters involved in regulation processes were discovered, such as epigenomic-driven gene regulation, some transcription factors, and a variety of RNA-associated functions. Nuclei exhibited differential proteome profiles across the phases of the experiment, with histone H2A and methyl cycle enzymes in particular being accumulated in the recovery step. A thermopriming effect was possibly linked to H2A abundance and over-accumulation of spliceosome elements in recovered P. radiata plants. The results suggest that epigenetic mechanisms play a key role in heat-stress tolerance and priming mechanisms.
Collapse
Affiliation(s)
- Laura Lamelas
- Plant Physiology, Department of Organisms and Systems Biology, Faculty of Biology and Biotechnology Institute of Asturias, University of Oviedo, Oviedo, Asturias, Spain
| | - Luis Valledor
- Plant Physiology, Department of Organisms and Systems Biology, Faculty of Biology and Biotechnology Institute of Asturias, University of Oviedo, Oviedo, Asturias, Spain
| | - Mónica Escandón
- Department of Biology and CESAM, University of Aveiro, Aveiro, Portugal
| | - Gloria Pinto
- Department of Biology and CESAM, University of Aveiro, Aveiro, Portugal
| | - María Jesús Cañal
- Plant Physiology, Department of Organisms and Systems Biology, Faculty of Biology and Biotechnology Institute of Asturias, University of Oviedo, Oviedo, Asturias, Spain
| | - Mónica Meijón
- Plant Physiology, Department of Organisms and Systems Biology, Faculty of Biology and Biotechnology Institute of Asturias, University of Oviedo, Oviedo, Asturias, Spain
| |
Collapse
|
27
|
Miao YY, Zhao W, Li GP, Gao Y, Du PF. Predicting Endoplasmic Reticulum Resident Proteins Using Auto-Cross Covariance Transformation With a U-Shaped Residue Weight-Transfer Function. Front Genet 2020; 10:1231. [PMID: 31921288 PMCID: PMC6932965 DOI: 10.3389/fgene.2019.01231] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 11/06/2019] [Indexed: 11/13/2022] Open
Abstract
Background: The endoplasmic reticulum (ER) is an important organelle in eukaryotic cells. It is involved in many important biological processes, such as cell metabolism, protein synthesis, and post-translational modification. The proteins that reside within the ER are called ER-resident proteins. These proteins are closely related to the biological functions of the ER. The difference between the ER-resident proteins and other non-resident proteins should be carefully studied. Methods: We developed a support vector machine (SVM)-based method. We developed a U-shaped weight-transfer function and used it, along with the positional-specific physiochemical properties (PSPCP), to integrate together sequence order information, signaling peptides information, and evolutionary information. Result: Our method achieved over 86% accuracy in a jackknife test. We also achieved roughly 86% sensitivity and 67% specificity in an independent dataset test. Our method is capable of identifying ER-resident proteins.
Collapse
Affiliation(s)
- Yang-Yang Miao
- College of Intelligence and Computing, Tianjin University, Tianjin, China.,School of Chemical Engineering, Tianjin University, Tianjin, China
| | - Wei Zhao
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Guang-Ping Li
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Yang Gao
- School of Medicine, Nankai University, Tianjin, China
| | - Pu-Feng Du
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| |
Collapse
|
28
|
Javed F, Hayat M. Predicting subcellular localization of multi-label proteins by incorporating the sequence features into Chou's PseAAC. Genomics 2019; 111:1325-1332. [DOI: 10.1016/j.ygeno.2018.09.004] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2018] [Accepted: 09/04/2018] [Indexed: 12/13/2022]
|
29
|
Nielsen H, Petsalaki EI, Zhao L, Stühler K. Predicting eukaryotic protein secretion without signals. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2019; 1867:140174. [DOI: 10.1016/j.bbapap.2018.11.011] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2018] [Revised: 10/30/2018] [Accepted: 11/29/2018] [Indexed: 10/27/2022]
|
30
|
Novel redox-active enzymes for ligninolytic applications revealed from multiomics analyses of Peniophora sp. CBMAI 1063, a laccase hyper-producer strain. Sci Rep 2019; 9:17564. [PMID: 31772294 PMCID: PMC6879535 DOI: 10.1038/s41598-019-53608-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Accepted: 11/04/2019] [Indexed: 11/08/2022] Open
Abstract
The repertoire of redox-active enzymes produced by the marine fungus Peniophora sp. CBMAI 1063, a laccase hyper-producer strain, was characterized by omics analyses. The genome revealed 309 Carbohydrate-Active Enzymes (CAZymes) genes, including 48 predicted genes related to the modification and degradation of lignin, whith 303 being transcribed under cultivation in optimized saline conditions for laccase production. The secretome confirmed that the fungus can produce a versatile ligninolytic enzyme cocktail. It secretes 56 CAZymes, including 11 oxidative enzymes classified as members of auxiliary activity families (AAs), comprising two laccases, Pnh_Lac1 and Pnh_Lac2, the first is the major secretory protein of the fungi. The Pnh_Lac1-mediator system was able to promote the depolymerization of lignin fragments and polymeric lignin removal from pretreated sugarcane bagasse, confirming viability of this fungus enzymatic system for lignocellulose-based bioproducts applications.
Collapse
|
31
|
Han GS, Yu ZG. ML-rRBF-ECOC: A Multi-Label Learning Classifier for Predicting Protein Subcellular Localization with Both Single and Multiple Sites. CURR PROTEOMICS 2019. [DOI: 10.2174/1570164616666190103143945] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
The subcellular localization of a protein is closely related with its functions
and interactions. More and more evidences show that proteins may simultaneously exist at, or move
between, two or more different subcellular localizations. Therefore, predicting protein subcellular localization
is an important but challenging problem.
Observation:
Most of the existing methods for predicting protein subcellular localization assume that a
protein locates at a single site. Although a few methods have been proposed to deal with proteins with
multiple sites, correlations between subcellular localization are not efficiently taken into account. In
this paper, we propose an integrated method for predicting protein subcellular localizations with both
single site and multiple sites.
Methods:
Firstly, we extend the Multi-Label Radial Basis Function (ML-RBF) method to the regularized
version, and augment the first layer of ML-RBF to take local correlations between subcellular localization
into account. Secondly, we embed the modified ML-RBF into a multi-label Error-Correcting
Output Codes (ECOC) method in order to further consider the subcellular localization dependency. We
name our method ML-rRBF-ECOC. Finally, the performance of ML-rRBF-ECOC is evaluated on
three benchmark datasets.
Results:
The results demonstrate that ML-rRBF-ECOC has highly competitive performance to the related
multi-label learning method and some state-of-the-art methods for predicting protein subcellular
localizations with multiple sites. Considering dependency between subcellular localizations can contribute
to the improvement of prediction performance.
Conclusion:
This also indicates that correlations between different subcellular localizations really exist.
Our method at least plays a complementary role to existing methods for predicting protein subcellular
localizations with multiple sites.
Collapse
Affiliation(s)
- Guo-Sheng Han
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Hunan 411105, China
| | - Zu-Guo Yu
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Hunan 411105, China
| |
Collapse
|
32
|
Abstract
Background:
Revealing the subcellular location of a newly discovered protein can
bring insight into their function and guide research at the cellular level. The experimental methods
currently used to identify the protein subcellular locations are both time-consuming and expensive.
Thus, it is highly desired to develop computational methods for efficiently and effectively identifying
the protein subcellular locations. Especially, the rapidly increasing number of protein sequences
entering the genome databases has called for the development of automated analysis methods.
Methods:
In this review, we will describe the recent advances in predicting the protein subcellular
locations with machine learning from the following aspects: i) Protein subcellular location benchmark
dataset construction, ii) Protein feature representation and feature descriptors, iii) Common
machine learning algorithms, iv) Cross-validation test methods and assessment metrics, v) Web
servers.
Result & Conclusion:
Concomitant with a large number of protein sequences generated by highthroughput
technologies, four future directions for predicting protein subcellular locations with
machine learning should be paid attention. One direction is the selection of novel and effective features
(e.g., statistics, physical-chemical, evolutional) from the sequences and structures of proteins.
Another is the feature fusion strategy. The third is the design of a powerful predictor and the fourth
one is the protein multiple location sites prediction.
Collapse
Affiliation(s)
- Ting-He Zhang
- School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Shao-Wu Zhang
- School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
| |
Collapse
|
33
|
Abstract
Ever since the signal hypothesis was proposed in 1971, the exact nature of signal peptides has been a focus point of research. The prediction of signal peptides and protein subcellular location from amino acid sequences has been an important problem in bioinformatics since the dawn of this research field, involving many statistical and machine learning technologies. In this review, we provide a historical account of how position-weight matrices, artificial neural networks, hidden Markov models, support vector machines and, lately, deep learning techniques have been used in the attempts to predict where proteins go. Because the secretory pathway was the first one to be studied both experimentally and through bioinformatics, our main focus is on the historical development of prediction methods for signal peptides that target proteins for secretion; prediction methods to identify targeting signals for other cellular compartments are treated in less detail.
Collapse
Affiliation(s)
- Henrik Nielsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs. Lyngby, Denmark.
| | - Konstantinos D Tsirigos
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Søren Brunak
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs. Lyngby, Denmark
- Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Gunnar von Heijne
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
- Science for Life Laboratory, Stockholm University, Solna, Sweden
| |
Collapse
|
34
|
Mahmoodi S, Nezafat N. In SilicoDesigning a Novel Multi-epitope DNA Vaccine against Anti-apoptotic Proteins in Tumor Cells. CURR PROTEOMICS 2019. [DOI: 10.2174/1570164616666181127142214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:Cancer therapy has been known as one of the most important challenges in the world. Various therapeutic methods such as cancer immunotherapy are used to eradicate tumor cells. Vaccines have an important role among different cancer immunotherapeutic approaches. In the field of vaccine production, bioinformatics approach is considered as a useful tool to design multi-epitope cancer vaccines, mainly for selecting immunodominant Cytotoxic T Lymphocytes (CTL) and Helper T Lymphocytes (HTL) epitopes.Objective:Generally, to design efficient multi-epitope cancer vaccines, Tumor-Specific Antigens (TSA) are targeted. In the context of DNA-based cancer vaccines, they contain genes that code tumor antigens and are delivered to host by different methods.Methods:In this study, the anti-apoptotic proteins (BCL2, BCL-X, survivin) that are over-expressed in different tumor cells were selected for CTL and HTL epitopes prediction through different servers such as RANKPEP, CTLpred, and BCPREDS.Results:Three regions from BCL2 and one region from BCL-X were selected as CTL epitopes and two segments from survivin were defined as HTL epitopes. In addition, β-defensin was used as a proper adjuvant to enhance vaccine efficacy. The aforesaid segments were joined together by appropriate linkers, and some important properties of designed vaccine such as antigenicity, allergenicity and physicochemical characteristics were determined by various bioinformatics servers.Conclusion:Based on the bioinformatics results, the physicochemical and immunological features showed that the designed vaccine construct can be used as an efficient cancer vaccine after its efficacy was confirmed by in vitro and in vivo immunological assays.
Collapse
Affiliation(s)
- Shirin Mahmoodi
- Department of Medical Biotechnology, School of Medicine, Fasa University of Medical Sciences, Fasa, Iran
| | - Navid Nezafat
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| |
Collapse
|
35
|
Bhattacharya M, Jurkovitz C, Shatkay H. Chronic Kidney Disease stratification using office visit records: Handling data imbalance via hierarchical meta-classification. BMC Med Inform Decis Mak 2018; 18:125. [PMID: 30537962 PMCID: PMC6290512 DOI: 10.1186/s12911-018-0675-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Background Chronic Kidney Disease (CKD) is one of several conditions that affect a growing percentage of the US population; the disease is accompanied by multiple co-morbidities, and is hard to diagnose in-and-of itself. In its advanced forms it carries severe outcomes and can lead to death. It is thus important to detect the disease as early as possible, which can help devise effective intervention and treatment plan. Here we investigate ways to utilize information available in electronic health records (EHRs) from regular office visits of more than 13,000 patients, in order to distinguish among several stages of the disease. While clinical data stored in EHRs provide valuable information for risk-stratification, one of the major challenges in using them arises from data imbalance. That is, records associated with a more severe condition are typically under-represented compared to those associated with a milder manifestation of the disease. To address imbalance, we propose and develop a sampling-based ensemble approach, hierarchical meta-classification, aiming to stratify CKD patients into severity stages, using simple quantitative non-text features gathered from standard office visit records. Methods The proposed hierarchical meta-classification method frames the multiclass classification task as a hierarchy of two subtasks. The first is binary classification, separating records associated with the majority class from those associated with all minority classes combined, using meta-classification. The second subtask separates the records assigned to the combined minority classes into the individual constituent classes. Results The proposed method identifies a significant proportion of patients suffering from the more advanced stages of the condition, while also correctly identifying most of the less severe cases, maintaining high sensitivity, specificity and F-measure (≥ 93%). Our results show that the high level of performance attained by our method is preserved even when the size of the training set is significantly reduced, demonstrating the stability and generalizability of our approach. Conclusion We present a new approach to perform classification while addressing data imbalance, which is inherent in the biomedical domain. Our model effectively identifies severity stages of CKD patients, using information readily available in office visit records within the realistic context of high data imbalance.
Collapse
Affiliation(s)
- Moumita Bhattacharya
- Computational Biomedicine Lab, Computer and Information Sciences, University of Delaware, Newark, DE, USA.
| | | | - Hagit Shatkay
- Computational Biomedicine Lab, Computer and Information Sciences, University of Delaware, Newark, DE, USA.,Center for Bioinformatics and Computational Biology, Delaware Biotechnology Inst, University of Delaware, Newark, DE, USA
| |
Collapse
|
36
|
Juárez-Colunga S, López-González C, Morales-Elías NC, Massange-Sánchez JA, Trachsel S, Tiessen A. Genome-wide analysis of the invertase gene family from maize. PLANT MOLECULAR BIOLOGY 2018; 97:385-406. [PMID: 29948658 DOI: 10.1007/s11103-018-0746-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/28/2018] [Accepted: 06/04/2018] [Indexed: 05/14/2023]
Abstract
The recent release of the maize genome (AGPv4) contains annotation errors of invertase genes and therefore the enzymes are bestly curated manually at the protein level in a comprehensible fashion The synthesis, transport and degradation of sucrose are determining factors for biomass allocation and yield of crop plants. Invertase (INV) is a key enzyme of carbon metabolism in both source and sink tissues. Current releases of the maize genome correctly annotates only two vacuolar invertases (ivr1 and ivr2) and four cell wall invertases (incw1, incw2 (mn1), incw3, and incw4). Our comprehensive survey identified 21 INV isogenes for which we propose a standard nomenclature grouped phylogenetically by amino acid similarity: three vacuolar (INVVR), eight cell wall (INVCW), and ten alkaline/neutral (INVAN) isogenes which form separate dendogram branches due to distinct molecular features. The acidic enzymes were curated for the presence of the DPN tripeptide which is coded by one of the smallest exons reported in plants. Particular attention was placed on the molecular role of INV in vascular tissues such as the nodes, internodes, leaf sheath, husk leaves and roots. We report the expression profile of most members of the maize INV family in nine tissues in two developmental stages, R1 and R3. INVCW7, INVVR2, INVAN8, INVAN9, INVAN10, and INVAN3 displayed the highest absolute expressions in most tissues. INVVR3, INVCW5, INVCW8, and INVAN1 showed low mRNA levels. Expressions of most INVs were repressed from stage R1 to R3, except for INVCW7 which increased significantly in all tissues after flowering. The mRNA levels of INVCW7 in the vegetative stem correlated with a higher transport rate of assimilates from leaves to the cob which led to starch accumulation and growth of the female reproductive organs.
Collapse
Affiliation(s)
- Sheila Juárez-Colunga
- Departamento de Ingeniería Genética, Unidad Irapuato, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV), Km 9.6 Libramiento Norte, Irapuato, C.P. 36824, Guanajuato, Mexico
| | - Cristal López-González
- Departamento de Ingeniería Genética, Unidad Irapuato, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV), Km 9.6 Libramiento Norte, Irapuato, C.P. 36824, Guanajuato, Mexico
| | - Norma Cecilia Morales-Elías
- Departamento de Ingeniería Genética, Unidad Irapuato, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV), Km 9.6 Libramiento Norte, Irapuato, C.P. 36824, Guanajuato, Mexico
| | - Julio Armando Massange-Sánchez
- Departamento de Ingeniería Genética, Unidad Irapuato, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV), Km 9.6 Libramiento Norte, Irapuato, C.P. 36824, Guanajuato, Mexico
- KWS Group, Grimsehlstrasse 31, 37574, Einbeck, Germany
| | - Samuel Trachsel
- Global Maize Program, Centro Internacional de Mejoramiento de Maíz y Trigo (CIMMYT), Km 45 Carretera Mexico-Veracruz, El Batán, 56130, Texcoco, State Of Mexico, Mexico
- Department of Genetics and Biotechnology, Aarhus University, Forsøgsvej 1, 4200, Slagelse, Denmark
| | - Axel Tiessen
- Departamento de Ingeniería Genética, Unidad Irapuato, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV), Km 9.6 Libramiento Norte, Irapuato, C.P. 36824, Guanajuato, Mexico.
| |
Collapse
|
37
|
Yu B, Li S, Qiu W, Wang M, Du J, Zhang Y, Chen X. Prediction of subcellular location of apoptosis proteins by incorporating PsePSSM and DCCA coefficient based on LFDA dimensionality reduction. BMC Genomics 2018; 19:478. [PMID: 29914358 PMCID: PMC6006758 DOI: 10.1186/s12864-018-4849-9] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2017] [Accepted: 06/01/2018] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND Apoptosis is associated with some human diseases, including cancer, autoimmune disease, neurodegenerative disease and ischemic damage, etc. Apoptosis proteins subcellular localization information is very important for understanding the mechanism of programmed cell death and the development of drugs. Therefore, the prediction of subcellular localization of apoptosis protein is still a challenging task. RESULTS In this paper, we propose a novel method for predicting apoptosis protein subcellular localization, called PsePSSM-DCCA-LFDA. Firstly, the protein sequences are extracted by combining pseudo-position specific scoring matrix (PsePSSM) and detrended cross-correlation analysis coefficient (DCCA coefficient), then the extracted feature information is reduced dimensionality by LFDA (local Fisher discriminant analysis). Finally, the optimal feature vectors are input to the SVM classifier to predict subcellular location of the apoptosis proteins. The overall prediction accuracy of 99.7, 99.6 and 100% are achieved respectively on the three benchmark datasets by the most rigorous jackknife test, which is better than other state-of-the-art methods. CONCLUSION The experimental results indicate that our method can significantly improve the prediction accuracy of subcellular localization of apoptosis proteins, which is quite high to be able to become a promising tool for further proteomics studies. The source code and all datasets are available at https://github.com/QUST-BSBRC/PsePSSM-DCCA-LFDA/ .
Collapse
Affiliation(s)
- Bin Yu
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao, 266061, China. .,Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao, 266061, China. .,School of Life Sciences, University of Science and Technology of China, Hefei, 230027, China.
| | - Shan Li
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao, 266061, China.,Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Wenying Qiu
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao, 266061, China.,Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Minghui Wang
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao, 266061, China.,Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Junwei Du
- College of Information Science and Technology, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Yusen Zhang
- School of Mathematics and Statistics, Shandong University at Weihai, Weihai, 264209, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 21116, China
| |
Collapse
|
38
|
de Gouvêa PF, Bernardi AV, Gerolamo LE, de Souza Santos E, Riaño-Pachón DM, Uyemura SA, Dinamarco TM. Transcriptome and secretome analysis of Aspergillus fumigatus in the presence of sugarcane bagasse. BMC Genomics 2018; 19:232. [PMID: 29614953 PMCID: PMC5883313 DOI: 10.1186/s12864-018-4627-8] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Accepted: 03/27/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Sugarcane bagasse has been proposed as a lignocellulosic residue for second-generation ethanol (2G) produced by breaking down biomass into fermentable sugars. The enzymatic cocktails for biomass degradation are mostly produced by fungi, but low cost and high efficiency can consolidate 2G technologies. A. fumigatus plays an important role in plant biomass degradation capabilities and recycling. To gain more insight into the divergence in gene expression during steam-exploded bagasse (SEB) breakdown, this study profiled the transcriptome of A. fumigatus by RNA sequencing to compare transcriptional profiles of A. fumigatus grown on media containing SEB or fructose as the sole carbon source. Secretome analysis was also performed using SDS-PAGE and LC-MS/MS. RESULTS The maximum activities of cellulases (0.032 U mL-1), endo-1,4-β--xylanase (10.82 U mL-1) and endo-1,3-β glucanases (0.77 U mL-1) showed that functional CAZymes (carbohydrate-active enzymes) were secreted in the SEB culture conditions. Correlations between transcriptome and secretome data identified several CAZymes in A. fumigatus. Particular attention was given to CAZymes related to lignocellulose degradation and sugar transporters. Genes encoding glycoside hydrolase classes commonly expressed during the breakdown of cellulose, such as GH-5, 6, 7, 43, 45, and hemicellulose, such as GH-2, 10, 11, 30, 43, were found to be highly expressed in SEB conditions. Lytic polysaccharide monooxygenases (LPMO) classified as auxiliary activity families AA9 (GH61), CE (1, 4, 8, 15, 16), PL (1, 3, 4, 20) and GT (1, 2, 4, 8, 20, 35, 48) were also differentially expressed in this condition. Similarly, the most important enzymes related to biomass degradation, including endoxylanases, xyloglucanases, β-xylosidases, LPMOs, α-arabinofuranosidases, cellobiohydrolases, endoglucanases and β-glucosidases, were also identified in the secretome. CONCLUSIONS This is the first report of a transcriptome and secretome experiment of Aspergillus fumigatus in the degradation of pretreated sugarcane bagasse. The results suggest that this strain employs important strategies for this complex degradation process. It was possible to identify a set of genes and proteins that might be applied in several biotechnology fields. This knowledge can be exploited for the improvement of 2G ethanol production by the rational design of enzymatic cocktails.
Collapse
Affiliation(s)
- Paula Fagundes de Gouvêa
- Faculty of Philosophy, Sciences and Literature of Ribeirão Preto, Chemistry Department, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
| | - Aline Vianna Bernardi
- Faculty of Philosophy, Sciences and Literature of Ribeirão Preto, Chemistry Department, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
| | - Luis Eduardo Gerolamo
- Faculty of Philosophy, Sciences and Literature of Ribeirão Preto, Chemistry Department, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
| | - Emerson de Souza Santos
- Faculty of Pharmaceutical Science, Department of Clinical, Toxicological and Bromatological Analysis, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
| | - Diego Mauricio Riaño-Pachón
- Brazilian Bioethanol Science and Technology Laboratory, Campinas, São Paulo, Brazil
- Current address: Laboratory of Regulatory Systems Biology, Department of Biochemistry, Institute of Chemistry, University of São Paulo, São Paulo, Brazil
| | - Sergio Akira Uyemura
- Faculty of Pharmaceutical Science, Department of Clinical, Toxicological and Bromatological Analysis, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
| | - Taisa Magnani Dinamarco
- Faculty of Philosophy, Sciences and Literature of Ribeirão Preto, Chemistry Department, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
| |
Collapse
|
39
|
Singh A, Tyagi C, Nath O, Singh IK. Helicoverpa-inducible Thioredoxin h from Cicer arietinum: structural modeling and potential targets. Int J Biol Macromol 2018; 109:231-243. [DOI: 10.1016/j.ijbiomac.2017.12.079] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Revised: 12/09/2017] [Accepted: 12/12/2017] [Indexed: 12/31/2022]
|
40
|
Abstract
Many computational methods are available for predicting protein sorting in bacteria. When comparing them, it is important to know that they can be grouped into three fundamentally different approaches: signal-based, global-property-based and homology-based prediction. In this chapter, the strengths and drawbacks of each of these approaches is described through many examples of methods that predict secretion, integration into membranes, or subcellular locations in general. The aim of this chapter is to provide a user-level introduction to the field with a minimum of computational theory.
Collapse
Affiliation(s)
- Henrik Nielsen
- Technical University of Denmark, Kemitorvet, Building 208, DK-2800, Kgs. Lyngby, Denmark.
| |
Collapse
|
41
|
Li M, Li W, Wu FX, Pan Y, Wang J. Identifying essential proteins based on sub-network partition and prioritization by integrating subcellular localization information. J Theor Biol 2018; 447:65-73. [PMID: 29571709 DOI: 10.1016/j.jtbi.2018.03.029] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2017] [Revised: 03/19/2018] [Accepted: 03/20/2018] [Indexed: 01/07/2023]
Abstract
Essential proteins are important participants in various life activities and play a vital role in the survival and reproduction of living organisms. Identification of essential proteins from protein-protein interaction (PPI) networks has great significance to facilitate the study of human complex diseases, the design of drugs and the development of bioinformatics and computational science. Studies have shown that highly connected proteins in a PPI network tend to be essential. A series of computational methods have been proposed to identify essential proteins by analyzing topological structures of PPI networks. However, the high noise in the PPI data can degrade the accuracy of essential protein prediction. Moreover, proteins must be located in the appropriate subcellular localization to perform their functions, and only when the proteins are located in the same subcellular localization, it is possible that they can interact with each other. In this paper, we propose a new network-based essential protein discovery method based on sub-network partition and prioritization by integrating subcellular localization information, named SPP. The proposed method SPP was tested on two different yeast PPI networks obtained from DIP database and BioGRID database. The experimental results show that SPP can effectively reduce the effect of false positives in PPI networks and predict essential proteins more accurately compared with other existing computational methods DC, BC, CC, SC, EC, IC, NC.
Collapse
Affiliation(s)
- Min Li
- School of Information Science and Engineering, Central South University, Changsha 410083, China.
| | - Wenkai Li
- School of Information Science and Engineering, Central South University, Changsha 410083, China.
| | - Fang-Xiang Wu
- Division of Biomedical Engineering and Department of Mechanical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada.
| | - Yi Pan
- Department of Computer Science, Georgia State University, Atlanta, GA 30302-4110, USA.
| | - Jianxin Wang
- School of Information Science and Engineering, Central South University, Changsha 410083, China.
| |
Collapse
|
42
|
Complex regulation of the TaMyc1 gene expression in wheat grain synthesizing anthocyanin pigments. Mol Biol Rep 2018; 45:327-334. [PMID: 29556921 DOI: 10.1007/s11033-018-4165-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2017] [Accepted: 03/11/2018] [Indexed: 01/08/2023]
Abstract
The wheat TaMyc1 gene encodes for transcriptional factor (TF) with bHLH domain. The gene is expressed in purple wheat grains and activates transcription of the anthocyanin biosynthesis structural genes. To reveal the features of TaMyc1 regulation in wheat pericarp transcription start sites (TSS) were identified by 5' RACE mean and translation efficiency was predicted by in silico methods. Three alternative transcript variants of TaMyc1 differing by 5' leader sequence only were identified in purple pericarp. The three transcripts are generated from distinct TATA boxes and thereby are differed by TSS. Two transcripts (TaMyc1a, -b) have identical initiation AUG codons that lead to the TaMYC1 regulatory protein with bHLH domain. However because of different stability of secondary structures predicted in 5' leader the two transcripts might be translated with different efficiency. The third transcript is assumed to be not effectively translated. qRT-PCR and colonies counting were applied to assess contribution each of the transcripts to total TaMyc1 gene transcription level. TaMyc1c has the lowest contribution (ca. 16%), whereas the others two transcripts contribute equally (ca. 42%) to total TaMyc1 expression level. The role of the tree mRNA isoforms transcribed in one tissue is discussed.
Collapse
|
43
|
Ren S, Yang M, Yue Y, Ge F, Li Y, Guo X, Zhang J, Zhang F, Nie X, Wang S. Lysine Succinylation Contributes to Aflatoxin Production and Pathogenicity in Aspergillus flavus. Mol Cell Proteomics 2018; 17:457-471. [PMID: 29298838 PMCID: PMC5836371 DOI: 10.1074/mcp.ra117.000393] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Revised: 12/11/2017] [Indexed: 12/27/2022] Open
Abstract
Aspergillus flavus (A. flavus) is a ubiquitous saprophytic and pathogenic fungus that produces the aflatoxin carcinogen, and A. flavus can have tremendous economic and health impacts worldwide. Increasing evidence demonstrates that lysine succinylation plays an important regulatory role in metabolic processes in both bacterial and human cells. However, little is known about the extent and function of lysine succinylation in A. flavus. Here, we performed a global succinylome analysis of A. flavus using high accuracy nano-LC-MS/MS in combination with the enrichment of succinylated peptides from digested cell lysates and subsequent peptide identification. In total, 985 succinylation sites on 349 succinylated proteins were identified in this pathogen. Bioinformatics analysis revealed that the succinylated proteins were involved in various biological processes and were particularly enriched in the aflatoxin biosynthesis process. Site-specific mutagenesis and biochemical studies showed that lysine succinylation on the norsolorinic acid reductase NorA (AflE), a key enzyme in aflatoxins biosynthesis, can affect the production of sclerotia and aflatoxins biosynthesis in A. flavus. Together, our findings reveal widespread roles for lysine succinylation in regulating metabolism and aflatoxins biosynthesis in A. flavus. Our data provide a rich resource for functional analyses of lysine succinylation and facilitate the dissection of metabolic networks in this pathogen.
Collapse
Affiliation(s)
- Silin Ren
- From the ‡Key Laboratory of Pathogenic Fungi and Mycotoxins of Fujian Province, and School of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Mingkun Yang
- §Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Yuewei Yue
- From the ‡Key Laboratory of Pathogenic Fungi and Mycotoxins of Fujian Province, and School of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Feng Ge
- §Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Yu Li
- From the ‡Key Laboratory of Pathogenic Fungi and Mycotoxins of Fujian Province, and School of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Xiaodong Guo
- From the ‡Key Laboratory of Pathogenic Fungi and Mycotoxins of Fujian Province, and School of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Jia Zhang
- §Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Feng Zhang
- From the ‡Key Laboratory of Pathogenic Fungi and Mycotoxins of Fujian Province, and School of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Xinyi Nie
- From the ‡Key Laboratory of Pathogenic Fungi and Mycotoxins of Fujian Province, and School of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Shihua Wang
- From the ‡Key Laboratory of Pathogenic Fungi and Mycotoxins of Fujian Province, and School of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China;
| |
Collapse
|
44
|
Polysaccharide associated protein (PSAP) from the green microalga Botryococcus braunii is a unique extracellular matrix hydroxyproline-rich glycoprotein. ALGAL RES 2018. [DOI: 10.1016/j.algal.2017.11.018] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
45
|
Zuma B, Dana MB, Wang D. Prolonged Expression of a Putative Invertase Inhibitor in Micropylar Endosperm Suppressed Embryo Growth in Arabidopsis. FRONTIERS IN PLANT SCIENCE 2018; 9:61. [PMID: 29441087 PMCID: PMC5797552 DOI: 10.3389/fpls.2018.00061] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Accepted: 01/12/2018] [Indexed: 05/21/2023]
Abstract
Proper seed development requires coordinated growth among the three genetically distinct components, the embryo, the endosperm, and the seed coat. In Arabidopsis, embryo growth rate accelerates after endosperm cellularization, which requires a chromatin-remodeling complex, the FIS2-Polycomb Repressive Complex 2 (PRC2). After cellularization, the endosperm ceases to grow and is eventually absorbed by the embryo. This sequential growth pattern displayed by the endosperm and the embryo suggests a possibility that the supply of sugar might be shifted from the endosperm to the embryo upon endosperm cellularization. Since invertases and invertase inhibitors play an important role in sugar partition, we investigated their expression pattern during early stages of seed development in Arabidopsis. Two putative invertase inhibitors (InvINH1 and InvINH2) were identified as being preferentially expressed in the micropylar endosperm that surrounds the embryo. After endosperm cellularization, InvINH1 and InvINH2 were down-regulated in a FIS2-dependent manner. We hypothesized that FIS2-PRC2 complex either directly or indirectly represses InvINH1 and InvINH2 to increase invertase activity around the embryo, making more hexose available to support the accelerated embryo growth after endosperm cellularization. In support of our hypothesis, embryo growth was delayed in transgenic lines that ectopically expressed InvINH1 in the cellularized endosperm. Our data suggested a novel mechanism for the FIS2-PRC2 complex to control embryo growth rate via the regulation of invertase activity in the endosperm.
Collapse
|
46
|
Roth C, Lüdke D, Klenke M, Quathamer A, Valerius O, Braus GH, Wiermer M. The truncated NLR protein TIR-NBS13 is a MOS6/IMPORTIN-α3 interaction partner required for plant immunity. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 92:808-821. [PMID: 28901644 DOI: 10.1111/tpj.13717] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Revised: 08/21/2017] [Accepted: 09/04/2017] [Indexed: 05/28/2023]
Abstract
Importin-α proteins mediate the translocation of nuclear localization signal (NLS)-containing proteins from the cytoplasm into the nucleus through nuclear pore complexes (NPCs). Genetically, Arabidopsis IMPORTIN-α3/MOS6 (MODIFIER OF SNC1, 6) is required for basal plant immunity and constitutive disease resistance activated in autoimmune mutant snc1 (suppressor of npr1-1, constitutive 1), suggesting that MOS6 plays a role in the nuclear import of proteins involved in plant defense signaling. Here, we sought to identify and characterize defense-regulatory cargo proteins and interaction partners of MOS6. We conducted both in silico database analyses and affinity purification of functional epitope-tagged MOS6 from pathogen-challenged stable transgenic plants coupled with mass spectrometry. We show that among the 13 candidate MOS6 interactors we selected for further functional characterization, the TIR-NBS-type protein TN13 is required for resistance against Pseudomonas syringae pv. tomato (Pst) DC3000 lacking the type-III effector proteins AvrPto and AvrPtoB. When expressed transiently in N. benthamiana leaves, TN13 co-immunoprecipitates with MOS6, but not with its closest homolog IMPORTIN-α6, and localizes to the endoplasmic reticulum (ER), consistent with a predicted N-terminal transmembrane domain in TN13. Our work uncovered the truncated NLR protein TN13 as a component of plant innate immunity that selectively binds to MOS6/IMPORTIN-α3 in planta. We speculate that the release of TN13 from the ER membrane in response to pathogen stimulus, and its subsequent nuclear translocation, is important for plant defense signal transduction.
Collapse
Affiliation(s)
- Charlotte Roth
- RG Molecular Biology of Plant-Microbe Interactions, Albrecht-von-Haller-Institute for Plant Sciences, University of Goettingen, Julia-Lermontowa-Weg 3, 37077, Goettingen, Germany
| | - Daniel Lüdke
- RG Molecular Biology of Plant-Microbe Interactions, Albrecht-von-Haller-Institute for Plant Sciences, University of Goettingen, Julia-Lermontowa-Weg 3, 37077, Goettingen, Germany
| | - Melanie Klenke
- RG Molecular Biology of Plant-Microbe Interactions, Albrecht-von-Haller-Institute for Plant Sciences, University of Goettingen, Julia-Lermontowa-Weg 3, 37077, Goettingen, Germany
| | - Annalena Quathamer
- RG Molecular Biology of Plant-Microbe Interactions, Albrecht-von-Haller-Institute for Plant Sciences, University of Goettingen, Julia-Lermontowa-Weg 3, 37077, Goettingen, Germany
| | - Oliver Valerius
- Department of Molecular Microbiology and Genetics, Institute for Microbiology and Genetics, University of Goettingen, Grisebachstrasse 8, 37077, Goettingen, Germany
| | - Gerhard H Braus
- Department of Molecular Microbiology and Genetics, Institute for Microbiology and Genetics, University of Goettingen, Grisebachstrasse 8, 37077, Goettingen, Germany
| | - Marcel Wiermer
- RG Molecular Biology of Plant-Microbe Interactions, Albrecht-von-Haller-Institute for Plant Sciences, University of Goettingen, Julia-Lermontowa-Weg 3, 37077, Goettingen, Germany
| |
Collapse
|
47
|
Jiang X, Ringwald M, Blake J, Shatkay H. Effective biomedical document classification for identifying publications relevant to the mouse Gene Expression Database (GXD). DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2017; 2017:3084695. [PMID: 28365740 PMCID: PMC5467553 DOI: 10.1093/database/bax017] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2016] [Accepted: 02/13/2017] [Indexed: 12/16/2022]
Abstract
The Gene Expression Database (GXD) is a comprehensive online database within the Mouse Genome Informatics resource, aiming to provide available information about endogenous gene expression during mouse development. The information stems primarily from many thousands of biomedical publications that database curators must go through and read. Given the very large number of biomedical papers published each year, automatic document classification plays an important role in biomedical research. Specifically, an effective and efficient document classifier is needed for supporting the GXD annotation workflow. We present here an effective yet relatively simple classification scheme, which uses readily available tools while employing feature selection, aiming to assist curators in identifying publications relevant to GXD. We examine the performance of our method over a large manually curated dataset, consisting of more than 25 000 PubMed abstracts, of which about half are curated as relevant to GXD while the other half as irrelevant to GXD. In addition to text from title-and-abstract, we also consider image captions, an important information source that we integrate into our method. We apply a captions-based classifier to a subset of about 3300 documents, for which the full text of the curated articles is available. The results demonstrate that our proposed approach is robust and effectively addresses the GXD document classification. Moreover, using information obtained from image captions clearly improves performance, compared to title and abstract alone, affirming the utility of image captions as a substantial evidence source for automatically determining the relevance of biomedical publications to a specific subject area. Database URL:www.informatics.jax.org
Collapse
Affiliation(s)
- Xiangying Jiang
- Department of Computer and Information Sciences, University of Delaware, 101 Smith Hall, Newark, DE, USA
| | - Martin Ringwald
- Department of Computer and Information Sciences, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME, USA
| | - Judith Blake
- Department of Computer and Information Sciences, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME, USA
| | - Hagit Shatkay
- Department of Computer and Information Sciences, University of Delaware, 101 Smith Hall, Newark, DE, USA
| |
Collapse
|
48
|
A single Danio rerio hars gene encodes both cytoplasmic and mitochondrial histidyl-tRNA synthetases. PLoS One 2017; 12:e0185317. [PMID: 28934368 PMCID: PMC5608375 DOI: 10.1371/journal.pone.0185317] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2017] [Accepted: 09/11/2017] [Indexed: 12/16/2022] Open
Abstract
Histidyl tRNA Synthetase (HARS) is a member of the aminoacyl tRNA synthetase (ARS) family of enzymes. This family of 20 enzymes is responsible for attaching specific amino acids to their cognate tRNA molecules, a critical step in protein synthesis. However, recent work highlighting a growing number of associations between ARS genes and diverse human diseases raises the possibility of new and unexpected functions in this ancient enzyme family. For example, mutations in HARS have been linked to two different neurological disorders, Usher Syndrome Type IIIB and Charcot Marie Tooth peripheral neuropathy. These connections raise the possibility of previously undiscovered roles for HARS in metazoan development, with alterations in these functions leading to complex diseases. In an attempt to establish Danio rerio as a model for studying HARS functions in human disease, we characterized the Danio rerio hars gene and compared it to that of human HARS. Using a combination of bioinformatics, molecular biology, and cellular approaches, we found that while the human genome encodes separate genes for cytoplasmic and mitochondrial HARS protein, the Danio rerio genome encodes a single hars gene which undergoes alternative splicing to produce the respective cytoplasmic and mitochondrial versions of Hars. Nevertheless, while the HARS genes of humans and Danio differ significantly at the genomic level, we found that they are still highly conserved at the amino acid level, underscoring the potential utility of Danio rerio as a model organism for investigating HARS function and its link to human diseases in vivo.
Collapse
|
49
|
Prakasam G, Singh RK, Iqbal MA, Saini SK, Tiku AB, Bamezai RNK. Pyruvate kinase M knockdown-induced signaling via AMP-activated protein kinase promotes mitochondrial biogenesis, autophagy, and cancer cell survival. J Biol Chem 2017; 292:15561-15576. [PMID: 28778925 PMCID: PMC5602412 DOI: 10.1074/jbc.m117.791343] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2017] [Revised: 08/01/2017] [Indexed: 11/06/2022] Open
Abstract
Preferential expression of the low-activity (dimeric) M2 isoform of pyruvate kinase (PK) over its constitutively active splice variant M1 isoform is considered critical for aerobic glycolysis in cancer cells. However, our results reported here indicate co-expression of PKM1 and PKM2 and their possible physical interaction in cancer cells. We show that knockdown of either PKM1 or PKM2 differentially affects net PK activity, viability, and cellular ATP levels of the lung carcinoma cell lines H1299 and A549. The stable knockdown of PK isoforms in A549 cells significantly reduced the cellular ATP level, whereas in H1299 cells the level of ATP was unaltered. Interestingly, the PKM1/2 knockdown in H1299 cells activated AMP-activated protein kinase (AMPK) signaling and stimulated mitochondrial biogenesis and autophagy to maintain energy homeostasis. In contrast, knocking down either of the PKM isoforms in A549 cells lacking LKB1, a serine/threonine protein kinase upstream of AMPK, failed to activate AMPK and sustain energy homeostasis and resulted in apoptosis. Moreover, in a similar genetic background of silenced PKM1 or PKM2, the knocking down of AMPKα1/2 catalytic subunit in H1299 cells induced apoptosis. Our findings help explain why previous targeting of PKM2 in cancer cells to control tumor growth has not met with the expected success. We suggest that this lack of success is because of AMPK-mediated energy metabolism rewiring, protecting cancer cell viability. On the basis of our observations, we propose an alternative therapeutic strategy of silencing either of the PKM isoforms along with AMPK in tumors.
Collapse
Affiliation(s)
| | - Rajnish Kumar Singh
- From the School of Life Sciences and
- Department of Microbiology and Tumor Virology Program of the Abramson Cancer Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, and
| | - Mohammad Askandar Iqbal
- From the School of Life Sciences and
- Department of Biotechnology, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi 110025, India
| | | | - Ashu Bhan Tiku
- Radiation and Cancer Therapeutics Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | | |
Collapse
|
50
|
Orentas RJ, Sindiri S, Duris C, Wen X, He J, Wei JS, Jarzembowski J, Khan J. Paired Expression Analysis of Tumor Cell Surface Antigens. Front Oncol 2017; 7:173. [PMID: 28871274 PMCID: PMC5566986 DOI: 10.3389/fonc.2017.00173] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Accepted: 07/31/2017] [Indexed: 01/15/2023] Open
Abstract
Adoptive immunotherapy with antibody-based therapy or with T cells transduced to express chimeric antigen receptors (CARs) is useful to the extent that the cell surface membrane protein being targeted is not expressed on normal tissues. The most successful CAR-based (anti-CD19) or antibody-based therapy (anti-CD20) in hematologic malignancies has the side effect of eliminating the normal B cell compartment. Targeting solid tumors may not provide a similar expendable marker. Beyond antibody to Her2/NEU and EGFR, very few antibody-based and no CAR-based therapies have seen broad clinical application for solid tumors. To expand the way in which the surfaceome of solid tumors can be analyzed, we created an algorithm that defines the pairwise relative overexpression of surface antigens. This enables the development of specific immunotherapies that require the expression of two discrete antigens on the surface of the tumor target. This dyad analysis was facilitated by employing the Hotelling’s T-squared test (Hotelling–Lawley multivariate analysis of variance) for two independent variables in comparison to a third constant entity (i.e., gene expression levels in normal tissues). We also present a unique consensus scoring mechanism for identifying transcripts that encode cell surface proteins. The unique application of our bioinformatics processing pipeline and statistical tools allowed us to compare the expression of two membrane protein targets as a pair, and to propose a new strategy based on implementing immunotherapies that require both antigens to be expressed on the tumor cell surface to trigger therapeutic effector mechanisms. Specifically, we found that, for MYCN amplified neuroblastoma, pairwise expression of ACVR2B or anaplastic lymphoma kinase (ALK) with GFRA3, GFRA2, Cadherin 24, or with one another provided the strongest hits. For MYCN, non-amplified stage 4 neuroblastoma, neurotrophic tyrosine kinase 1, or ALK paired with GFRA2, GFRA3, SSK1, GPR173, or with one another provided the most promising paired-hits. We propose that targeting these markers together would increase the specificity and thereby the safety of CAR-based therapy for neuroblastoma.
Collapse
Affiliation(s)
- Rimas J Orentas
- Lentigen Technology, Inc., a Miltenyi Biotec Company, Gaithersburg, MD, United States
| | - Sivasish Sindiri
- Genetics Branch, National Cancer Institute, Center for Cancer Research, National Institutes of Health, Bethesda, MD, United States
| | - Christine Duris
- Department of Pathology, Medical College of Wisconsin, Milwaukee, WI, United States
| | - Xinyu Wen
- Genetics Branch, National Cancer Institute, Center for Cancer Research, National Institutes of Health, Bethesda, MD, United States
| | - Jianbin He
- Genetics Branch, National Cancer Institute, Center for Cancer Research, National Institutes of Health, Bethesda, MD, United States
| | - Jun S Wei
- Genetics Branch, National Cancer Institute, Center for Cancer Research, National Institutes of Health, Bethesda, MD, United States
| | - Jason Jarzembowski
- Department of Pathology, Medical College of Wisconsin, Milwaukee, WI, United States
| | - Javed Khan
- Genetics Branch, National Cancer Institute, Center for Cancer Research, National Institutes of Health, Bethesda, MD, United States
| |
Collapse
|