1
|
Zencir S, Dilg D, Bruzzone M, Stutz F, Soudet J, Shore D, Albert B. A two-step regulatory mechanism dynamically controls histone H3 acetylation by SAGA complex at growth-related promoters. Nucleic Acids Res 2025; 53:gkaf276. [PMID: 40207626 PMCID: PMC11983098 DOI: 10.1093/nar/gkaf276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 03/03/2025] [Accepted: 03/31/2025] [Indexed: 04/11/2025] Open
Abstract
Acetylation of histone H3 at residue K9 (H3K9ac) is a dynamically regulated mark associated with transcriptionally active promoters in eukaryotes. However, our understanding of the relationship between H3K9ac and gene expression remains mostly correlative. In this study, we identify a large suite of growth-related (GR) genes in yeast that undergo a particularly strong down-regulation of both transcription and promoter-associated H3K9ac upon stress, and delineate the roles of transcriptional activators (TAs), repressors, SAGA (Spt-Ada-Gcn5 acetyltransferase) histone acetyltransferase, and RNA-polymerase II in this response. We demonstrate that H3K9 acetylation states are orchestrated by a two-step mechanism driven by the dynamic binding of transcriptional repressors (TRs) and activators, that is independent of transcription. In response to stress, promoter release of TAs at GR genes is a prerequisite for rapid reduction of H3K9ac, whereas binding of TRs is required to establish a hypo-acetylated, strongly repressed state.
Collapse
Affiliation(s)
- Sevil Zencir
- Department of Molecular and Cellular Biology, Université de Genève, 1211, Geneva, Switzerland
| | - Daniel Dilg
- Department of Molecular and Cellular Biology, Université de Genève, 1211, Geneva, Switzerland
| | - Maria Jessica Bruzzone
- Department of Molecular and Cellular Biology, Université de Genève, 1211, Geneva, Switzerland
| | - Françoise Stutz
- Department of Molecular and Cellular Biology, Université de Genève, 1211, Geneva, Switzerland
| | - Julien Soudet
- Department of Molecular and Cellular Biology, Université de Genève, 1211, Geneva, Switzerland
| | - David Shore
- Department of Molecular and Cellular Biology, Université de Genève, 1211, Geneva, Switzerland
| | - Benjamin Albert
- Department of Molecular and Cellular Biology, Université de Genève, 1211, Geneva, Switzerland
| |
Collapse
|
2
|
Wanniarachchi DV, Viswakula S, Wickramasuriya AM. The evaluation of transcription factor binding site prediction tools in human and Arabidopsis genomes. BMC Bioinformatics 2024; 25:371. [PMID: 39623329 PMCID: PMC11613939 DOI: 10.1186/s12859-024-05995-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2024] [Accepted: 11/21/2024] [Indexed: 12/06/2024] Open
Abstract
BACKGROUND The precise prediction of transcription factor binding sites (TFBSs) is pivotal for unraveling the gene regulatory networks underlying biological processes. While numerous tools have emerged for in silico TFBS prediction in recent years, the evolving landscape of computational biology necessitates thorough assessments of tool performance to ensure accuracy and reliability. Only a limited number of studies have been conducted to evaluate the performance of TFBS prediction tools comprehensively. Thus, the present study focused on assessing twelve widely used TFBS prediction tools and four de novo motif discovery tools using a benchmark dataset comprising real, generic, Markov, and negative sequences. TFBSs of Arabidopsis thaliana and Homo sapiens genomes downloaded from the JASPAR database were implanted in these sequences and the performance of tools was evaluated using several statistical parameters at different overlap percentages between the lengths of known and predicted binding sites. RESULTS Overall, the Multiple Cluster Alignment and Search Tool (MCAST) emerged as the best TFBS prediction tool, followed by Find Individual Motif Occurrences (FIMO) and MOtif Occurrence Detection Suite (MOODS). In addition, MotEvo and Dinucleotide Weight Tensor Toolbox (DWT-toolbox) demonstrated the highest sensitivity in identifying TFBSs at 90% and 80% overlap. Further, MCAST and DWT-toolbox managed to demonstrate the highest sensitivity across all three data types real, generic, and Markov. Among the de novo motif discovery tools, the Multiple Em for Motif Elicitation (MEME) emerged as the best performer. An analysis of the promoter regions of genes involved in the anthocyanin biosynthesis pathway in plants and the pentose phosphate pathway in humans, using the three best-performing tools, revealed considerable variation among the top 20 motifs identified by these tools. CONCLUSION The findings of this study lay a robust groundwork for selecting optimal TFBS prediction tools for future research. Given the variability observed in tool performance, employing multiple tools for identifying TFBSs in a set of sequences is highly recommended. In addition, further studies are recommended to develop an integrated toolbox that incorporates TFBS prediction or motif discovery tools, aiming to streamline result precision and accuracy.
Collapse
Affiliation(s)
- Dinithi V Wanniarachchi
- Department of Plant Sciences, Faculty of Science, University of Colombo, Colombo 03, Sri Lanka
| | - Sameera Viswakula
- Department of Statistics, Faculty of Science, University of Colombo, Colombo 03, Sri Lanka
| | | |
Collapse
|
3
|
Bakulin A, Teyssier NB, Kampmann M, Khoroshkin M, Goodarzi H. pyPAGE: A framework for Addressing biases in gene-set enrichment analysis-A case study on Alzheimer's disease. PLoS Comput Biol 2024; 20:e1012346. [PMID: 39236079 PMCID: PMC11421795 DOI: 10.1371/journal.pcbi.1012346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 09/24/2024] [Accepted: 07/22/2024] [Indexed: 09/07/2024] Open
Abstract
Inferring the driving regulatory programs from comparative analysis of gene expression data is a cornerstone of systems biology. Many computational frameworks were developed to address this problem, including our iPAGE (information-theoretic Pathway Analysis of Gene Expression) toolset that uses information theory to detect non-random patterns of expression associated with given pathways or regulons. Our recent observations, however, indicate that existing approaches are susceptible to the technical biases that are inherent to most real world annotations. To address this, we have extended our information-theoretic framework to account for specific biases and artifacts in biological networks using the concept of conditional information. To showcase pyPAGE, we performed a comprehensive analysis of regulatory perturbations that underlie the molecular etiology of Alzheimer's disease (AD). pyPAGE successfully recapitulated several known AD-associated gene expression programs. We also discovered several additional regulons whose differential activity is significantly associated with AD. We further explored how these regulators relate to pathological processes in AD through cell-type specific analysis of single cell and spatial gene expression datasets. Our findings showcase the utility of pyPAGE as a precise and reliable biomarker discovery in complex diseases such as Alzheimer's disease.
Collapse
Affiliation(s)
- Artemy Bakulin
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Noam B. Teyssier
- Institute for Neurodegenerative Diseases, University of California San Francisco, California, United States of America
| | - Martin Kampmann
- Institute for Neurodegenerative Diseases, University of California San Francisco, California, United States of America
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
| | - Matvei Khoroshkin
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
- Department of Urology, University of California San Francisco, San Francisco, California, United States of America
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, California, United States of America
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, California, United States of America
| | - Hani Goodarzi
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
- Department of Urology, University of California San Francisco, San Francisco, California, United States of America
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, California, United States of America
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, California, United States of America
- Arc Institute, Palo Alto, California, United States of America
| |
Collapse
|
4
|
Tognon M, Giugno R, Pinello L. A survey on algorithms to characterize transcription factor binding sites. Brief Bioinform 2023; 24:bbad156. [PMID: 37099664 PMCID: PMC10422928 DOI: 10.1093/bib/bbad156] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 03/27/2023] [Accepted: 04/01/2023] [Indexed: 04/28/2023] Open
Abstract
Transcription factors (TFs) are key regulatory proteins that control the transcriptional rate of cells by binding short DNA sequences called transcription factor binding sites (TFBS) or motifs. Identifying and characterizing TFBS is fundamental to understanding the regulatory mechanisms governing the transcriptional state of cells. During the last decades, several experimental methods have been developed to recover DNA sequences containing TFBS. In parallel, computational methods have been proposed to discover and identify TFBS motifs based on these DNA sequences. This is one of the most widely investigated problems in bioinformatics and is referred to as the motif discovery problem. In this manuscript, we review classical and novel experimental and computational methods developed to discover and characterize TFBS motifs in DNA sequences, highlighting their advantages and drawbacks. We also discuss open challenges and future perspectives that could fill the remaining gaps in the field.
Collapse
Affiliation(s)
- Manuel Tognon
- Computer Science Department, University of Verona, Verona, Italy
- Molecular Pathology Unit, Center for Computational and Integrative Biology and Center for Cancer Research, Massachusetts General Hospital, Charlestown, Massachusetts, United States of America
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Rosalba Giugno
- Computer Science Department, University of Verona, Verona, Italy
| | - Luca Pinello
- Molecular Pathology Unit, Center for Computational and Integrative Biology and Center for Cancer Research, Massachusetts General Hospital, Charlestown, Massachusetts, United States of America
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- Department of Pathology, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
5
|
Comparative Research: Regulatory Mechanisms of Ribosomal Gene Transcription in Saccharomyces cerevisiae and Schizosaccharomyces pombe. Biomolecules 2023; 13:biom13020288. [PMID: 36830657 PMCID: PMC9952952 DOI: 10.3390/biom13020288] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 01/31/2023] [Accepted: 02/01/2023] [Indexed: 02/05/2023] Open
Abstract
Restricting ribosome biosynthesis and assembly in response to nutrient starvation is a universal phenomenon that enables cells to survive with limited intracellular resources. When cells experience starvation, nutrient signaling pathways, such as the target of rapamycin (TOR) and protein kinase A (PKA), become quiescent, leading to several transcription factors and histone modification enzymes cooperatively and rapidly repressing ribosomal genes. Fission yeast has factors for heterochromatin formation similar to mammalian cells, such as H3K9 methyltransferase and HP1 protein, which are absent in budding yeast. However, limited studies on heterochromatinization in ribosomal genes have been conducted on fission yeast. Herein, we shed light on and compare the regulatory mechanisms of ribosomal gene transcription in two species with the latest insights.
Collapse
|
6
|
Nayarisseri A, Bhrdwaj A, Khan A, Sharma K, Shaheen U, Selvaraj C, Khan MA, Abhirami R, Pravin MA, Shri GR, Raje D, Singh SK. Promoter–motif extraction from co-regulated genes and their relevance to co-expression using E. coli as a model. Brief Funct Genomics 2023; 22:204-216. [PMID: 37053503 DOI: 10.1093/bfgp/elac043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 08/29/2022] [Accepted: 10/26/2022] [Indexed: 02/04/2023] Open
Abstract
Abstract
Gene expression varies due to the intrinsic stochasticity of transcription or as a reaction to external perturbations that generate cellular mutations. Co-regulation, co-expression and functional similarity of substances have been employed for indoctrinating the process of the transcriptional paradigm. The difficult process of analysing complicated proteomes and biological switches has been made easier by technical improvements, and microarray technology has flourished as a viable platform. Therefore, this research enables Microarray to cluster genes that are co-expressed and co-regulated into specific segments. Copious search algorithms have been employed to ascertain diacritic motifs or a combination of motifs that are performing regular expression, and their relevant information corresponding to the gene patterns is also documented. The associated genes co-expression and relevant cis-elements are further explored by engaging Escherichia coli as a model organism. Various clustering algorithms have also been used to generate classes of genes with similar expression profiles. A promoter database ‘EcoPromDB’ has been developed by referring RegulonDB database; this promoter database is freely available at www.ecopromdb.eminentbio.com and is divided into two sub-groups, depending upon the results of co-expression and co-regulation analyses.
Collapse
Affiliation(s)
- Anuraj Nayarisseri
- Eminent Biosciences In silico Research Laboratory, , 91, Sector-A, Mahalakshmi Nagar, Indore, 452010, Madhya Pradesh , India
- LeGene Biosciences Pvt Ltd Bioinformatics Research Laboratory, , 91, Sector-A, Mahalakshmi Nagar, Indore, 452010, Madhya Pradesh , India
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
| | - Anushka Bhrdwaj
- Eminent Biosciences In silico Research Laboratory, , 91, Sector-A, Mahalakshmi Nagar, Indore, 452010, Madhya Pradesh , India
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
| | - Arshiya Khan
- Eminent Biosciences In silico Research Laboratory, , 91, Sector-A, Mahalakshmi Nagar, Indore, 452010, Madhya Pradesh , India
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
| | - Khushboo Sharma
- Eminent Biosciences In silico Research Laboratory, , 91, Sector-A, Mahalakshmi Nagar, Indore, 452010, Madhya Pradesh , India
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
| | - Uzma Shaheen
- Eminent Biosciences In silico Research Laboratory, , 91, Sector-A, Mahalakshmi Nagar, Indore, 452010, Madhya Pradesh , India
| | - Chandrabose Selvaraj
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
| | - Mohammad Aqueel Khan
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
| | - Rajaram Abhirami
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
| | - Muthuraja Arun Pravin
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
| | - Gurunathan Rubha Shri
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
| | - Dhanjay Raje
- Eminent Biosciences In silico Research Laboratory, , 91, Sector-A, Mahalakshmi Nagar, Indore, 452010, Madhya Pradesh , India
| | - Sanjeev Kumar Singh
- Alagappa University Computer Aided Drug Designing and Molecular Modeling Lab, Department of Bioinformatics, , Karaikudi, 630003, Tamil Nadu , India
- Department of Data Sciences, Centre of Biomedical Research , SGPGIMS Campus, Raebareli Rd, Lucknow 226014, India
| |
Collapse
|
7
|
Gouesbet G. Deciphering Macromolecular Interactions Involved in Abiotic Stress Signaling: A Review of Bioinformatics Analysis. Methods Mol Biol 2023; 2642:257-294. [PMID: 36944884 DOI: 10.1007/978-1-0716-3044-0_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023]
Abstract
Plant functioning and responses to abiotic stresses largely involve regulations at the transcriptomic level via complex interactions of signal molecules, signaling cascades, and regulators. Nevertheless, all the signaling networks involved in responses to abiotic stresses have not yet been fully established. The in-depth analysis of transcriptomes in stressed plants has become a relevant state-of-the-art methodology to study these regulations and signaling pathways that allow plants to cope with or attempt to survive abiotic stresses. The plant science and molecular biology community has developed databases about genes, proteins, protein-protein interactions, protein-DNA interactions and ontologies, which are valuable sources of knowledge for deciphering such regulatory and signaling networks. The use of these data and the development of bioinformatics tools help to make sense of transcriptomic data in specific contexts, such as that of abiotic stress signaling, using functional biological approaches. The aim of this chapter is to present and assess some of the essential online tools and resources that will allow novices in bioinformatics to decipher transcriptomic data in order to characterize the cellular processes and functions involved in abiotic stress responses and signaling. The analysis of case studies further describes how these tools can be used to conceive signaling networks on the basis of transcriptomic data. In these case studies, particular attention was paid to the characterization of abiotic stress responses and signaling related to chemical and xenobiotic stressors.
Collapse
Affiliation(s)
- Gwenola Gouesbet
- University of Rennes, CNRS, ECOBIO [(Ecosystèmes, Biodiversité, Evolution)] - UMR 6553, Rennes, France.
| |
Collapse
|
8
|
Pan S, Li Z, Wang Y, Liang L, Liu F, Qiao Y, Liu D, Liu Z. A Comprehensive Weighted Gene Co-expression Network Analysis Uncovers Potential Targets in Diabetic Kidney Disease. J Transl Int Med 2022; 10:359-368. [PMID: 36860636 PMCID: PMC9969566 DOI: 10.2478/jtim-2022-0053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Background and Objectives Diabetic kidney disease (DKD) is one of the most common microvascular complications of diabetes. It has always been difficult to explore novel biomarkers and therapeutic targets of DKD. We aimed to identify new biomarkers and further explore their functions in DKD. Methods The weighted gene co-expression network analysis (WGCNA) method was used to analyze the expression profile data of DKD, obtain key modules related to the clinical traits of DKD, and perform gene enrichment analysis. Quantitative real-time polymerase chain reaction (qRT-PCR) was used to verify the mRNA expression of the hub genes in DKD. Spearman's correlation coefficients were used to determine the relationship between gene expression and clinical indicators. Results Fifteen gene modules were obtained via WGCNA analysis, among which the green module had the most significant correlation with DKD. Gene enrichment analysis revealed that the genes in this module were mainly involved in sugar and lipid metabolism, regulation of small guanosine triphosphatase (GTPase) mediated signal transduction, G protein-coupled receptor signaling pathway, peroxisome proliferator-activated receptor (PPAR) molecular signaling pathway, Rho protein signal transduction, and oxidoreductase activity. The qRT-PCR results showed that the relative expression of nuclear pore complex-interacting protein family member A2 (NPIPA2) and ankyrin repeat domain 36 (ANKRD36) was notably increased in DKD compared to the control. NPIPA2 was positively correlated with the urine albumin/creatinine ratio (ACR) and serum creatinine (Scr) but negatively correlated with albumin (ALB) and hemoglobin (Hb) levels. ANKRD36 was positively correlated with the triglyceride (TG) level and white blood cell (WBC) count. Conclusion NPIPA2 expression is closely related to the disease condition of DKD, whereas ANKRD36 may be involved in the progression of DKD through lipid metabolism and inflammation, providing an experimental basis to further explore the pathogenesis of DKD.
Collapse
Affiliation(s)
- Shaokang Pan
- Department of TCM-Integrated Department of Nephrology, the First Affiliated Hospital of Zhengzhou University; Research Institute of Nephrology, Zhengzhou University; Research Center for Kidney Disease, Henan Province; Key Laboratory of Precision Diagnosis and Treatment for Chronic Kidney Disease in Henan Province; Core Unit of National Clinical Medical Research Center of Kidney Disease, Zhengzhou 450052, Henan Province, China
| | - Zhengyong Li
- Department of TCM-Integrated Department of Nephrology, the First Affiliated Hospital of Zhengzhou University; Research Institute of Nephrology, Zhengzhou University; Research Center for Kidney Disease, Henan Province; Key Laboratory of Precision Diagnosis and Treatment for Chronic Kidney Disease in Henan Province; Core Unit of National Clinical Medical Research Center of Kidney Disease, Zhengzhou 450052, Henan Province, China
| | - Yixue Wang
- Department of TCM-Integrated Department of Nephrology, the First Affiliated Hospital of Zhengzhou University; Research Institute of Nephrology, Zhengzhou University; Research Center for Kidney Disease, Henan Province; Key Laboratory of Precision Diagnosis and Treatment for Chronic Kidney Disease in Henan Province; Core Unit of National Clinical Medical Research Center of Kidney Disease, Zhengzhou 450052, Henan Province, China
| | - Lulu Liang
- Department of TCM-Integrated Department of Nephrology, the First Affiliated Hospital of Zhengzhou University; Research Institute of Nephrology, Zhengzhou University; Research Center for Kidney Disease, Henan Province; Key Laboratory of Precision Diagnosis and Treatment for Chronic Kidney Disease in Henan Province; Core Unit of National Clinical Medical Research Center of Kidney Disease, Zhengzhou 450052, Henan Province, China
| | - Fengxun Liu
- Department of TCM-Integrated Department of Nephrology, the First Affiliated Hospital of Zhengzhou University; Research Institute of Nephrology, Zhengzhou University; Research Center for Kidney Disease, Henan Province; Key Laboratory of Precision Diagnosis and Treatment for Chronic Kidney Disease in Henan Province; Core Unit of National Clinical Medical Research Center of Kidney Disease, Zhengzhou 450052, Henan Province, China
| | - Yingjin Qiao
- Department of TCM-Integrated Department of Nephrology, the First Affiliated Hospital of Zhengzhou University; Research Institute of Nephrology, Zhengzhou University; Research Center for Kidney Disease, Henan Province; Key Laboratory of Precision Diagnosis and Treatment for Chronic Kidney Disease in Henan Province; Core Unit of National Clinical Medical Research Center of Kidney Disease, Zhengzhou 450052, Henan Province, China
| | - Dongwei Liu
- Department of TCM-Integrated Department of Nephrology, the First Affiliated Hospital of Zhengzhou University; Research Institute of Nephrology, Zhengzhou University; Research Center for Kidney Disease, Henan Province; Key Laboratory of Precision Diagnosis and Treatment for Chronic Kidney Disease in Henan Province; Core Unit of National Clinical Medical Research Center of Kidney Disease, Zhengzhou 450052, Henan Province, China
| | - Zhangsuo Liu
- Department of TCM-Integrated Department of Nephrology, the First Affiliated Hospital of Zhengzhou University; Research Institute of Nephrology, Zhengzhou University; Research Center for Kidney Disease, Henan Province; Key Laboratory of Precision Diagnosis and Treatment for Chronic Kidney Disease in Henan Province; Core Unit of National Clinical Medical Research Center of Kidney Disease, Zhengzhou 450052, Henan Province, China
| |
Collapse
|
9
|
A comprehensive weighted gene co-expression network analysis uncovers potential targets in diabetic kidney disease. J Transl Int Med 2022. [DOI: 10.2478/jtim-2022-0058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Abstract
Background and Objectives
Diabetic kidney disease (DKD) is one of the most common microvascular complications of diabetes. It has always been difficult to explore novel biomarkers and therapeutic targets of DKD. We aimed to identify new biomarkers and further explore their functions in DKD.
Methods
The weighted gene co-expression network analysis (WGCNA) method was used to analyze the expression profile data of DKD, obtain key modules related to the clinical traits of DKD, and perform gene enrichment analysis. Quantitative real-time polymerase chain reaction (qRT-PCR) was used to verify the mRNA expression of the hub genes in DKD. Spearman’s correlation coefficients were used to determine the relationship between gene expression and clinical indicators.
Results
Fifteen gene modules were obtained via WGCNA analysis, among which the green module had the most significant correlation with DKD. Gene enrichment analysis revealed that the genes in this module were mainly involved in sugar and lipid metabolism, regulation of small guanosine triphosphatase (GTPase) mediated signal transduction, G protein-coupled receptor signaling pathway, peroxisome proliferator-activated receptor (PPAR) molecular signaling pathway, Rho protein signal transduction, and oxidoreductase activity. The qRT-PCR results showed that the relative expression of nuclear pore complex-interacting protein family member A2 (NPIPA2) and ankyrin repeat domain 36 (ANKRD36) was notably increased in DKD compared to the control. NPIPA2 was positively correlated with the urine albumin/creatinine ratio (ACR) and serum creatinine (Scr) but negatively correlated with albumin (ALB) and hemoglobin (Hb) levels. ANKRD36 was positively correlated with the triglyceride (TG) level and white blood cell (WBC) count.
Conclusion
NPIPA2 expression is closely related to the disease condition of DKD, whereas ANKRD36 may be involved in the progression of DKD through lipid metabolism and inflammation, providing an experimental basis to further explore the pathogenesis of DKD.
Collapse
|
10
|
Qin H, Wu H, Shen K, Liu Y, Li M, Wang H, Qiao Z, Mu Z. Fermented Minor Grain Foods: Classification, Functional Components, and Probiotic Potential. Foods 2022; 11:3155. [PMID: 37430904 PMCID: PMC9601907 DOI: 10.3390/foods11203155] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 10/01/2022] [Accepted: 10/04/2022] [Indexed: 08/05/2023] Open
Abstract
Fermented minor grain (MG) foods often have unique nutritional value and functional characteristics, which are important for developing dietary culture worldwide. As a kind of special raw material in fermented food, minor grains have special functional components, such as trace elements, dietary fiber, and polyphenols. Fermented MG foods have excellent nutrients, phytochemicals, and bioactive compounds and are consumed as a rich source of probiotic microbes. Thus, the purpose of this review is to introduce the latest progress in research related to the fermentation products of MGs. Specific discussion is focused on the classification of fermented MG foods and their nutritional and health implications, including studies of microbial diversity, functional components, and probiotic potential. Furthermore, this review discusses how mixed fermentation of grain mixtures is a better method for developing new functional foods to increase the nutritional value of meals based on cereals and legumes in terms of dietary protein and micronutrients.
Collapse
Affiliation(s)
- Huibin Qin
- Center for Agricultural Genetic Resources Research, Shanxi Agricultural University, Key Laboratory of Crop Gene Resources and Germplasm Enhancement on Loess Plateau, Ministry of Agriculture, Shanxi Key Laboratory of Genetic Resources and Genetic Improvement of Minor Crops, Taiyuan 030031, China
| | - Houbin Wu
- Shennong Technology Group Co., Ltd., Jinzhong 030801, China
| | - Ke Shen
- Shennong Technology Group Co., Ltd., Jinzhong 030801, China
| | - Yilin Liu
- Shennong Technology Group Co., Ltd., Jinzhong 030801, China
| | - Meng Li
- Center for Agricultural Genetic Resources Research, Shanxi Agricultural University, Key Laboratory of Crop Gene Resources and Germplasm Enhancement on Loess Plateau, Ministry of Agriculture, Shanxi Key Laboratory of Genetic Resources and Genetic Improvement of Minor Crops, Taiyuan 030031, China
| | - Haigang Wang
- Center for Agricultural Genetic Resources Research, Shanxi Agricultural University, Key Laboratory of Crop Gene Resources and Germplasm Enhancement on Loess Plateau, Ministry of Agriculture, Shanxi Key Laboratory of Genetic Resources and Genetic Improvement of Minor Crops, Taiyuan 030031, China
| | - Zhijun Qiao
- Center for Agricultural Genetic Resources Research, Shanxi Agricultural University, Key Laboratory of Crop Gene Resources and Germplasm Enhancement on Loess Plateau, Ministry of Agriculture, Shanxi Key Laboratory of Genetic Resources and Genetic Improvement of Minor Crops, Taiyuan 030031, China
| | - Zhixin Mu
- Center for Agricultural Genetic Resources Research, Shanxi Agricultural University, Key Laboratory of Crop Gene Resources and Germplasm Enhancement on Loess Plateau, Ministry of Agriculture, Shanxi Key Laboratory of Genetic Resources and Genetic Improvement of Minor Crops, Taiyuan 030031, China
| |
Collapse
|
11
|
Cao L, Liu P, Chen J, Deng L. Prediction of Transcription Factor Binding Sites Using a Combined Deep Learning Approach. Front Oncol 2022; 12:893520. [PMID: 35719916 PMCID: PMC9204005 DOI: 10.3389/fonc.2022.893520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 04/11/2022] [Indexed: 12/04/2022] Open
Abstract
In the process of regulating gene expression and evolution, such as DNA replication and mRNA transcription, the binding of transcription factors (TFs) to TF binding sites (TFBS) plays a vital role. Precisely modeling the specificity of genes and searching for TFBS are helpful to explore the mechanism of cell expression. In recent years, computational and deep learning methods searching for TFBS have become an active field of research. However, existing methods generally cannot meet high performance and interpretability simultaneously. Here, we develop an accurate and interpretable attention-based hybrid approach, DeepARC, that combines a convolutional neural network (CNN) and recurrent neural network (RNN) to predict TFBS. DeepARC employs a positional embedding method to extract the hidden embedding from DNA sequences, including the positional information from OneHot encoding and the distributed embedding from DNA2Vec. DeepARC feeds the positional embedding of the DNA sequence into a CNN-BiLSTM-Attention-based framework to complete the task of finding the motif. Taking advantage of the attention mechanism, DeepARC can gain greater access to valuable information about the motif and bring interpretability to the work of searching for motifs through the attention weight graph. Moreover, DeepARC achieves promising performances with an average area under the receiver operating characteristic curve (AUC) score of 0.908 on five cell lines (A549, GM12878, Hep-G2, H1-hESC, and Hela) in the benchmark dataset. We also compare the positional embedding with OneHot and DNA2Vec and gain a competitive advantage.
Collapse
Affiliation(s)
- Linan Cao
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Pei Liu
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Jialong Chen
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Lei Deng
- School of Computer Science and Engineering, Central South University, Changsha, China
| |
Collapse
|
12
|
Huang T, Xiao H, Tian Q, He Z, Yuan C, Lin Z, Gao X, Yao M. Identification of upstream transcription factor binding sites in orthologous genes using mixed Student’s t-test statistics. PLoS Comput Biol 2022; 18:e1009773. [PMID: 35671296 PMCID: PMC9205514 DOI: 10.1371/journal.pcbi.1009773] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 06/17/2022] [Accepted: 04/30/2022] [Indexed: 11/18/2022] Open
Abstract
Background Transcription factor (TF) regulates the transcription of DNA to messenger RNA by binding to upstream sequence motifs. Identifying the locations of known motifs in whole genomes is computationally intensive. Methodology/Principal findings This study presents a computational tool, named “Grit”, for screening TF-binding sites (TFBS) by coordinating transcription factors to their promoter sequences in orthologous genes. This tool employs a newly developed mixed Student’s t-test statistical method that detects high-scoring binding sites utilizing conservation information among species. The program performs sequence scanning at a rate of 3.2 Mbp/s on a quad-core Amazon server and has been benchmarked by the well-established ChIP-Seq datasets, putting Grit amongst the top-ranked TFBS predictors. It significantly outperforms the well-known transcription factor motif scanning tools, Pscan (4.8%) and FIMO (17.8%), in analyzing well-documented ChIP-Atlas human genome Chip-Seq datasets. Significance Grit is a good alternative to current available motif scanning tools. Locating transcription factor-binding (TF-binding) site in the genome and identification their function is fundamental in understanding various biological processes. Improve the performance of the prediction tools is important because accurate TF-binding site prediction can save cost and time for wet-lab experiments. Also, genome wide TF-binding site prediction can provide new insights for transcriptome regulation in system biology perspective. This study developed a new TF-binding site prediction tool based on mixed Student’s t-test statistical method. The tool is amongst the top-ranked TF-binding site predictors, as such, it can help the researchers in TF-binding site identification and transcriptional regulation mechanism interpretation of genes.
Collapse
Affiliation(s)
- Tinghua Huang
- College of Animal Science, Yangtze University, Jingzhou, China
| | - Hong Xiao
- College of Animal Science, Yangtze University, Jingzhou, China
| | - Qi Tian
- College of Animal Science, Yangtze University, Jingzhou, China
| | - Zhen He
- College of Animal Science, Yangtze University, Jingzhou, China
| | - Cheng Yuan
- College of Animal Science, Yangtze University, Jingzhou, China
| | - Zezhao Lin
- College of Animal Science, Yangtze University, Jingzhou, China
| | - Xuejun Gao
- College of Animal Science, Yangtze University, Jingzhou, China
- * E-mail: (XG); (MY)
| | - Min Yao
- College of Animal Science, Yangtze University, Jingzhou, China
- * E-mail: (XG); (MY)
| |
Collapse
|
13
|
Kim C, Wang X, Kültz D. Prediction and Experimental Validation of a New Salinity-Responsive Cis-Regulatory Element (CRE) in a Tilapia Cell Line. Life (Basel) 2022; 12:787. [PMID: 35743818 PMCID: PMC9225295 DOI: 10.3390/life12060787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Revised: 05/12/2022] [Accepted: 05/16/2022] [Indexed: 11/16/2022] Open
Abstract
Transcriptional regulation is a major mechanism by which organisms integrate gene x environment interactions. It can be achieved by coordinated interplay between cis-regulatory elements (CREs) and transcription factors (TFs). Euryhaline tilapia (Oreochromis mossambicus) tolerate a wide range of salinity and thus are an appropriate model to examine transcriptional regulatory mechanisms during salinity stress in fish. Quantitative proteomics in combination with the transcription inhibitor actinomycin D revealed 19 proteins that are transcriptionally upregulated by hyperosmolality in tilapia brain (OmB) cells. We searched the extended proximal promoter up to intron1 of each corresponding gene for common motifs using motif discovery tools. The top-ranked motif identified (STREME1) represents a binding site for the Forkhead box TF L1 (FoxL1). STREME1 function during hyperosmolality was experimentally validated by choosing two of the 19 genes, chloride intracellular channel 2 (clic2) and uridine phosphorylase 1 (upp1), that are enriched in STREME1 in their extended promoters. Transcriptional induction of these genes during hyperosmolality requires STREME1, as evidenced by motif mutagenesis. We conclude that STREME1 represents a new functional CRE that contributes to gene x environment interactions during salinity stress in tilapia. Moreover, our results indicate that FoxL1 family TFs are contribute to hyperosmotic induction of genes in euryhaline fish.
Collapse
Affiliation(s)
- Chanhee Kim
- Stress-Induced Evolution Laboratory, Department of Animal Sciences, University of California, Davis, CA 95616, USA;
| | - Xiaodan Wang
- Laboratory of Aquaculture Nutrition and Environmental Health, School of Life Sciences, East China Normal University, Shanghai 200241, China;
| | - Dietmar Kültz
- Stress-Induced Evolution Laboratory, Department of Animal Sciences, University of California, Davis, CA 95616, USA;
| |
Collapse
|
14
|
Deepika D, Poddar N, Kumar S, Singh A. Molecular Characterization Reveals the Involvement of Calcium Dependent Protein Kinases in Abiotic Stress Signaling and Development in Chickpea ( Cicer arietinum). FRONTIERS IN PLANT SCIENCE 2022; 13:831265. [PMID: 35498712 PMCID: PMC9039462 DOI: 10.3389/fpls.2022.831265] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Accepted: 03/11/2022] [Indexed: 06/14/2023]
Abstract
Calcium-dependent protein kinases (CDPKs) are a major group of calcium (Ca2+) sensors in plants. CDPKs play a dual function of "Ca2+ sensor and responder." These sensors decode the "Ca2+ signatures" generated in response to adverse growth conditions such as drought, salinity, and cold and developmental processes. However, knowledge of the CDPK family in the legume crop chickpea is missing. Here, we have identified a total of 22 CDPK genes in the chickpea genome. The phylogenetic analysis of the chickpea CDPK family with other plants revealed their evolutionary conservation. Protein homology modeling described the three-dimensional structure of chickpea CDPKs. Defined arrangements of α-helix, β-strands, and transmembrane-helix represent important structures like kinase domain, inhibitory junction domain, N and C-lobes of EF-hand motifs. Subcellular localization analysis revealed that CaCDPK proteins are localized mainly at the cytoplasm and in the nucleus. Most of the CaCDPK promoters had abiotic stress and development-related cis-regulatory elements, suggesting the functional role of CaCDPKs in abiotic stress and development-related signaling. RNA sequencing (RNA-seq) expression analysis indicated the role of the CaCDPK family in various developmental stages, including vegetative, reproductive development, senescence stages, and during seed stages of early embryogenesis, late embryogenesis, mid and late seed maturity. The real-time quantitative PCR (qRT-PCR) analysis revealed that several CaCDPK genes are specifically as well as commonly induced by drought, salt, and Abscisic acid (ABA). Overall, these findings indicate that the CDPK family is probably involved in abiotic stress responses and development in chickpeas. This study provides crucial information on the CDPK family that will be utilized in generating abiotic stress-tolerant and high-yielding chickpea varieties.
Collapse
Affiliation(s)
- Deepika Deepika
- Stress Signaling Lab, National Institute of Plant Genome Research, New Delhi, India
| | - Nikita Poddar
- Bioinformatics Lab, National Institute of Plant Genome Research, New Delhi, India
| | - Shailesh Kumar
- Bioinformatics Lab, National Institute of Plant Genome Research, New Delhi, India
| | - Amarjeet Singh
- Stress Signaling Lab, National Institute of Plant Genome Research, New Delhi, India
| |
Collapse
|
15
|
Mohanty S, Pattnaik PK, Al-Absi AA, Kang DK. A Review on Planted ( l, d) Motif Discovery Algorithms for Medical Diagnose. SENSORS (BASEL, SWITZERLAND) 2022; 22:1204. [PMID: 35161949 PMCID: PMC8838483 DOI: 10.3390/s22031204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 01/19/2022] [Accepted: 01/31/2022] [Indexed: 11/16/2022]
Abstract
Personalized diagnosis of chronic disease requires capturing the continual pattern across the biological sequence. This repeating pattern in medical science is called "Motif". Motifs are the short, recurring patterns of biological sequences that are supposed signify some health disorder. They identify the binding sites for transcription factors that modulate and synchronize the gene expression. These motifs are important for the analysis and interpretation of various health issues like human disease, gene function, drug design, patient's conditions, etc. Searching for these patterns is an important step in unraveling the mechanisms of gene expression properly diagnose and treat chronic disease. Thus, motif identification has a vital role in healthcare studies and attracts many researchers. Numerous approaches have been characterized for the motif discovery process. This article attempts to review and analyze fifty-four of the most frequently found motif discovery processes/algorithms from different approaches and summarizes the discussion with their strengths and weaknesses.
Collapse
Affiliation(s)
- Satarupa Mohanty
- School of Computer Engineering, KIIT Deemed to be University, Bhubaneswar 751024, India; (S.M.); (P.K.P.)
| | - Prasant Kumar Pattnaik
- School of Computer Engineering, KIIT Deemed to be University, Bhubaneswar 751024, India; (S.M.); (P.K.P.)
| | | | - Dae-Ki Kang
- Department of Computer & Information Engineering, Dongseo University, 47 Jurye-ro, Sasang-gu, Busan 47011, Korea
| |
Collapse
|
16
|
Theepalakshmi P, Reddy US. Freezing firefly algorithm for efficient planted (ℓ, d) motif search. Med Biol Eng Comput 2022; 60:511-530. [PMID: 35020123 DOI: 10.1007/s11517-021-02468-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Accepted: 11/06/2021] [Indexed: 10/19/2022]
Abstract
The detection of inimitable patterns (motif) occurring in a set of biological sequences could elevate new biological discoveries. Its application in recognition of transcription factors and their binding sites have demonstrated the necessity to attain knowledge of gene function, human diseases, and drug design. The literature identifies (ℓ, d) motif search as the widely studied problem in PMS (Planted Motif Search). This paper proposes an efficient optimization algorithm named "Freezing FireFly (FFF)" to solve (ℓ, d) motif search problem. The new strategy freezing such as local and global was added to increase the performance of the basic Firefly algorithm. It freezes the best possible out coming positions even in the lesser brighter one. The performance of the proposed algorithm is experienced on simulated and real datasets. The experimental results show that the proposed algorithm resolves the instance (50, 21) within 1.47 min in the simulated dataset. For real (such as ChIP-seq (Chromatin Immunoprecipitation)) and synthetic datasets, the proposed algorithm runs much faster in comparison to existing state-of-the-art optimization algorithms, including Samselect, TraverStringRef, PMS8, qPMS9, AlignACE, FMGA, and GSGA.
Collapse
Affiliation(s)
- P Theepalakshmi
- Department of Computer Applications, National Institute of Technology, Tiruchirappalli, Tamilnadu, India.
| | - U Srinivasulu Reddy
- Machine Learning and Data Analytics Lab, Center of Excellence in Artificial Intelligence, Department of Computer Applications, National Institute of Technology, Tiruchirappalli, Tamilnadu, India
| |
Collapse
|
17
|
Interaction of TOR and PKA Signaling in S. cerevisiae. Biomolecules 2022; 12:biom12020210. [PMID: 35204711 PMCID: PMC8961621 DOI: 10.3390/biom12020210] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 01/22/2022] [Accepted: 01/25/2022] [Indexed: 01/13/2023] Open
Abstract
TOR and PKA signaling are the major growth-regulatory nutrient-sensing pathways in S. cerevisiae. A number of experimental findings demonstrated a close relationship between these pathways: Both are responsive to glucose availability. Both regulate ribosome production on the transcriptional level and repress autophagy and the cellular stress response. Sch9, a major downstream effector of TORC1 presumably shares its kinase consensus motif with PKA, and genetic rescue and synthetic defects between PKA and Sch9 have been known for a long time. Further, studies in the first decade of this century have suggested direct regulation of PKA by TORC1. Nonetheless, the contribution of a potential direct cross-talk vs. potential sharing of targets between the pathways has still not been completely resolved. What is more, other findings have in contrast highlighted an antagonistic relationship between the two pathways. In this review, I explore the association between TOR and PKA signaling, mainly by focusing on proteins that are commonly referred to as shared TOR and PKA targets. Most of these proteins are transcription factors which to a large part explain the major transcriptional responses elicited by TOR and PKA upon nutrient shifts. I examine the evidence that these proteins are indeed direct targets of both pathways and which aspects of their regulation are targeted by TOR and PKA. I further explore if they are phosphorylated on shared sites by PKA and Sch9 or when experimental findings point towards regulation via the PP2ASit4/PP2A branch downstream of TORC1. Finally, I critically review data suggesting direct cross-talk between the pathways and its potential mechanism.
Collapse
|
18
|
Transcriptional control of ribosome biogenesis in yeast: links to growth and stress signals. Biochem Soc Trans 2021; 49:1589-1599. [PMID: 34240738 PMCID: PMC8421047 DOI: 10.1042/bst20201136] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 06/14/2021] [Accepted: 06/18/2021] [Indexed: 12/15/2022]
Abstract
Ribosome biogenesis requires prodigious transcriptional output in rapidly growing yeast cells and is highly regulated in response to both growth and stress signals. This minireview focuses on recent developments in our understanding of this regulatory process, with an emphasis on the 138 ribosomal protein genes (RPGs) themselves and a group of >200 ribosome biogenesis (RiBi) genes whose products contribute to assembly but are not part of the ribosome. Expression of most RPGs depends upon Rap1, a pioneer transcription factor (TF) required for the binding of a pair of RPG-specific TFs called Fhl1 and Ifh1. RPG expression is correlated with Ifh1 promoter binding, whereas Rap1 and Fhl1 remain promoter-associated upon stress-induced down regulation. A TF called Sfp1 has also been implicated in RPG regulation, though recent work reveals that its primary function is in activation of RiBi and other growth-related genes. Sfp1 plays an important regulatory role at a small number of RPGs where Rap1–Fhl1–Ifh1 action is subsidiary or non-existent. In addition, nearly half of all RPGs are bound by Hmo1, which either stabilizes or re-configures Fhl1–Ifh1 binding. Recent studies identified the proline rotamase Fpr1, known primarily for its role in rapamycin-mediated inhibition of the TORC1 kinase, as an additional TF at RPG promoters. Fpr1 also affects Fhl1–Ifh1 binding, either independently or in cooperation with Hmo1. Finally, a major recent development was the discovery of a protein homeostasis mechanism driven by unassembled ribosomal proteins, referred to as the Ribosome Assembly Stress Response (RASTR), that controls RPG transcription through the reversible condensation of Ifh1.
Collapse
|
19
|
Robinson H, Ruelcke JE, Lewis A, Bond CS, Fox AH, Bharti V, Wani S, Cloonan N, Lai A, Margolin D, Li L, Salomon C, Richards RS, Farrell A, Gardiner RA, Parton RG, Cristino AS, Hill MM. Caveolin-1-driven membrane remodelling regulates hnRNPK-mediated exosomal microRNA sorting in cancer. Clin Transl Med 2021; 11:e381. [PMID: 33931969 PMCID: PMC8031663 DOI: 10.1002/ctm2.381] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 03/15/2021] [Accepted: 03/17/2021] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Caveolae proteins play diverse roles in cancer development and progression. In prostate cancer, non-caveolar caveolin-1 (CAV1) promotes metastasis, while CAVIN1 attenuates CAV1-induced metastasis. Here, we unveil a novel mechanism linking CAV1 to selective loading of exosomes with metastasis-promoting microRNAs. RESULTS We identify hnRNPK as a CAV1-regulated microRNA binding protein. In the absence of CAVIN1, non-caveolar CAV1 drives localisation of hnRPNK to multi-vesicular bodies (MVBs), recruiting AsUGnA motif-containing miRNAs and causing their release within exosomes. This process is dependent on the lipid environment of membranes as shown by cholesterol depletion using methyl-β-cyclodextrin or by treatment with n-3 polyunsaturated fatty acids. Consistent with a role in bone metastasis, knockdown of hnRNPK in prostate cancer PC3 cells abolished the ability of PC3 extracellular vesicles (EV) to induce osteoclastogenesis, and biofluid EV hnRNPK is elevated in metastatic prostate and colorectal cancer. CONCLUSIONS Taken together, these results support a novel pan-cancer mechanism for CAV1-driven exosomal release of hnRNPK and associated miRNA in metastasis, which is modulated by the membrane lipid environment.
Collapse
Affiliation(s)
- Harley Robinson
- The University of Queensland Diamantina InstituteThe University of QueenslandWoolloongabbaQueenslandAustralia
- QIMR Berghofer Medical Research InstituteBrisbaneQueenslandAustralia
| | - Jayde E. Ruelcke
- The University of Queensland Diamantina InstituteThe University of QueenslandWoolloongabbaQueenslandAustralia
| | - Amanda Lewis
- School of Molecular SciencesThe University of Western AustraliaCrawleyWAAustralia
| | - Charles S. Bond
- School of Molecular SciencesThe University of Western AustraliaCrawleyWAAustralia
| | - Archa H. Fox
- School of Molecular SciencesThe University of Western AustraliaCrawleyWAAustralia
- School of Human SciencesThe University of Western AustraliaCrawleyWAAustralia
- The Harry Perkins Institute of Medical ResearchQEII Medical CentreNedlandsWAAustralia
| | - Vandhana Bharti
- QIMR Berghofer Medical Research InstituteBrisbaneQueenslandAustralia
| | - Shivangi Wani
- QIMR Berghofer Medical Research InstituteBrisbaneQueenslandAustralia
| | - Nicole Cloonan
- QIMR Berghofer Medical Research InstituteBrisbaneQueenslandAustralia
| | - Andrew Lai
- University of Queensland Centre for Clinical Research, Royal Brisbane and Women's HospitalThe University of QueenslandBrisbaneQueenslandAustralia
| | - David Margolin
- Maternal‐Fetal Medicine, Department of Obstetrics and GynecologyOchsner Clinic FoundationNew OrleansUSA
| | - Li Li
- Maternal‐Fetal Medicine, Department of Obstetrics and GynecologyOchsner Clinic FoundationNew OrleansUSA
| | - Carlos Salomon
- University of Queensland Centre for Clinical Research, Royal Brisbane and Women's HospitalThe University of QueenslandBrisbaneQueenslandAustralia
- Maternal‐Fetal Medicine, Department of Obstetrics and GynecologyOchsner Clinic FoundationNew OrleansUSA
- Department of Clinical Biochemistry and Immunology, Faculty of PharmacyUniversity of ConcepciónConcepciónChile
| | - Renée S. Richards
- QIMR Berghofer Medical Research InstituteBrisbaneQueenslandAustralia
| | - Aine Farrell
- University of Queensland Centre for Clinical Research, Royal Brisbane and Women's HospitalThe University of QueenslandBrisbaneQueenslandAustralia
| | - Robert A. Gardiner
- University of Queensland Centre for Clinical Research, Royal Brisbane and Women's HospitalThe University of QueenslandBrisbaneQueenslandAustralia
| | - Robert G. Parton
- Institute for Molecular BioscienceThe University of QueenslandSt LuciaQueenslandAustralia
- Centre for Microscopy and MicroanalysisThe University of QueenslandSt LuciaQueenslandAustralia
| | - Alexandre S. Cristino
- The University of Queensland Diamantina InstituteThe University of QueenslandWoolloongabbaQueenslandAustralia
- Griffith Institute for Drug DiscoveryGriffith UniversityBrisbaneQueenslandAustralia
| | - Michelle M. Hill
- The University of Queensland Diamantina InstituteThe University of QueenslandWoolloongabbaQueenslandAustralia
- QIMR Berghofer Medical Research InstituteBrisbaneQueenslandAustralia
| |
Collapse
|
20
|
Zhang Q, Chen F, Wu S, Liang H. A simple yet powerful test for assessing goodness-of-fit of high-dimensional linear models. Stat Med 2021; 40:3153-3166. [PMID: 33792070 DOI: 10.1002/sim.8968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 01/16/2021] [Accepted: 03/13/2021] [Indexed: 11/06/2022]
Abstract
We evaluate the validity of a projection-based test checking linear models when the number of covariates tends to infinity, and analyze two gene expression datasets. We show that the test is still consistent and derive the asymptotic distributions under the null and alternative hypotheses. The asymptotic properties are almost the same as those when the number of covariates is fixed as long as p/n → 0 with additional mild assumptions. The test dramatically gains dimension reduction, and its numerical performance is remarkable.
Collapse
Affiliation(s)
- Qi Zhang
- School of Mathematics and Statistics, Qingdao University, Shandong, China
| | - Feifei Chen
- Center for Statistics and Data Science, Beijing Normal University, Zhuhai, China
| | - Shunyao Wu
- College of Computer Science and Technology, Qingdao University, Shandong, China
| | - Hua Liang
- Department of Statistics, George Washington University, Washington, District of Columbia, USA
| |
Collapse
|
21
|
Avsec Ž, Weilert M, Shrikumar A, Krueger S, Alexandari A, Dalal K, Fropf R, McAnany C, Gagneur J, Kundaje A, Zeitlinger J. Base-resolution models of transcription-factor binding reveal soft motif syntax. Nat Genet 2021; 53:354-366. [PMID: 33603233 PMCID: PMC8812996 DOI: 10.1038/s41588-021-00782-6] [Citation(s) in RCA: 317] [Impact Index Per Article: 79.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2020] [Accepted: 01/07/2021] [Indexed: 01/30/2023]
Abstract
The arrangement (syntax) of transcription factor (TF) binding motifs is an important part of the cis-regulatory code, yet remains elusive. We introduce a deep learning model, BPNet, that uses DNA sequence to predict base-resolution chromatin immunoprecipitation (ChIP)-nexus binding profiles of pluripotency TFs. We develop interpretation tools to learn predictive motif representations and identify soft syntax rules for cooperative TF binding interactions. Strikingly, Nanog preferentially binds with helical periodicity, and TFs often cooperate in a directional manner, which we validate using clustered regularly interspaced short palindromic repeat (CRISPR)-induced point mutations. Our model represents a powerful general approach to uncover the motifs and syntax of cis-regulatory sequences in genomics data.
Collapse
Affiliation(s)
- Žiga Avsec
- Department of Informatics, Technical University of Munich, Garching, Germany,Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, Munich, Germany,Currently at DeepMind, London, UK
| | - Melanie Weilert
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Avanti Shrikumar
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Sabrina Krueger
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Amr Alexandari
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Khyati Dalal
- Stowers Institute for Medical Research, Kansas City, MO, USA,The University of Kansas Medical Center, Kansas City, KS, USA
| | - Robin Fropf
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Charles McAnany
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Julien Gagneur
- Department of Informatics, Technical University of Munich, Garching, Germany
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA, USA,Department of Genetics, Stanford University, Stanford, CA, USA,correspondence: ,
| | - Julia Zeitlinger
- Stowers Institute for Medical Research, Kansas City, MO, USA,The University of Kansas Medical Center, Kansas City, KS, USA,correspondence: ,
| |
Collapse
|
22
|
Kanofsky K, Rusche J, Eilert L, Machens F, Hehl R. Unusual DNA-binding properties of the Arabidopsis thaliana WRKY50 transcription factor at target gene promoters. PLANT CELL REPORTS 2021; 40:69-83. [PMID: 33006643 PMCID: PMC7811519 DOI: 10.1007/s00299-020-02611-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 09/21/2020] [Indexed: 05/29/2023]
Abstract
WRKY50 from A. thaliana requires WT-boxes at target gene promoters for activation and binding. Based on the genome-wide prediction of WRKY50 target genes and the similarity of a WRKY50 binding site to WT-boxes in microbe-associated molecular pattern (MAMP)-responsive cis-regulatory modules (CRM), four WT-box containing CRMs from the promoter region of three WRKY50 target genes were investigated for their interaction with WRKY50. These target genes are DJ1E, WRKY30 and ATBBE4. Two of the four CRMs, one from DJ1E and one from WRKY30, were able to activate reporter gene expression in the presence of WRKY50. Activation requires the WT-boxes GGACTTTT, GGACTTTG from DJ1E and GGACTTTC from WRKY30. WRKY50 does not activate a second CRM from WRKY30 and the CRM from ATBBE4, both containing the WT-box TGACTTTT. In vitro gel-shift assays demonstrate WT-box-specific binding of the WRKY50 DNA-binding domain to all four CRMs. This work shows a high flexibility of WRKY50 binding site recognition beyond the classic W-box TTGACC/T.
Collapse
Affiliation(s)
- Konstantin Kanofsky
- Institut für Genetik, Technische Universität Braunschweig, Spielmannstr. 7, 38106, Braunschweig, Germany
| | - Jendrik Rusche
- Institut für Genetik, Technische Universität Braunschweig, Spielmannstr. 7, 38106, Braunschweig, Germany
| | - Lea Eilert
- Institut für Genetik, Technische Universität Braunschweig, Spielmannstr. 7, 38106, Braunschweig, Germany
| | - Fabian Machens
- Institut für Genetik, Technische Universität Braunschweig, Spielmannstr. 7, 38106, Braunschweig, Germany
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Potsdam Science Park, Am Mühlenberg 1, Golm, 14476, Potsdam, Germany
| | - Reinhard Hehl
- Institut für Genetik, Technische Universität Braunschweig, Spielmannstr. 7, 38106, Braunschweig, Germany.
| |
Collapse
|
23
|
Negrini F, O’Grady K, Hyvönen M, Folta KM, Baraldi E. Genomic structure and transcript analysis of the Rapid Alkalinization Factor (RALF) gene family during host-pathogen crosstalk in Fragaria vesca and Fragaria x ananassa strawberry. PLoS One 2020; 15:e0226448. [PMID: 32214345 PMCID: PMC7098601 DOI: 10.1371/journal.pone.0226448] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Accepted: 03/01/2020] [Indexed: 11/19/2022] Open
Abstract
Rapid Alkalinization Factors (RALFs) are cysteine-rich peptides ubiquitous within plant kingdom. They play multiple roles as hormonal signals in diverse processes, including root elongation, cell growth, pollen tube development, and fertilization. Their involvement in host-pathogen crosstalk as negative regulators of immunity in Arabidopsis has also been recognized. In addition, peptides homologous to RALF are secreted by different fungal pathogens as effectors during early stages of infection. Previous studies have identified nine RALF genes in the diploid strawberry (Fragaria vesca) genome. This work describes the genomic organization of the RALF gene families in commercial octoploid strawberry (Fragaria × ananassa) and the re-annotated genome of F. vesca, and then compares findings with orthologs in Arabidopsis thaliana. We reveal the presence of 15 RALF genes in F. vesca genotype Hawaii 4 and 50 in Fragaria x ananassa cv. Camarosa, showing a non-homogenous localization of genes among the different Fragaria x ananassa subgenomes. Expression analysis of Fragaria x ananassa RALF genes upon infection with Colletotrichum acutatum or Botrytis cinerea showed that FanRALF3-1 was the only fruit RALF gene upregulated after fungal infection. In silico analysis was used to identify distinct pathogen inducible elements upstream of the FanRALF3-1 gene. Agroinfiltration of strawberry fruit with deletion constructs of the FanRALF3-1 promoter identified a 5' region required for FanRALF3-1 expression in fruit, but failed to identify a region responsible for fungal induced expression.
Collapse
Affiliation(s)
- Francesca Negrini
- Laboratory of Plant Pathology and Biotechnology, DISTAL, University of Bologna, Bologna Italy
- Horticultural Sciences Department, University of Florida, Gainesville, Florida, United States of America
| | - Kevin O’Grady
- Horticultural Sciences Department, University of Florida, Gainesville, Florida, United States of America
| | - Marko Hyvönen
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Kevin M. Folta
- Horticultural Sciences Department, University of Florida, Gainesville, Florida, United States of America
| | - Elena Baraldi
- Laboratory of Plant Pathology and Biotechnology, DISTAL, University of Bologna, Bologna Italy
| |
Collapse
|
24
|
Li Y, Liu Y, Juedes D, Drews F, Bunescu R, Welch L. Set cover-based methods for motif selection. Bioinformatics 2020; 36:1044-1051. [PMID: 31665223 PMCID: PMC7703758 DOI: 10.1093/bioinformatics/btz697] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2018] [Revised: 08/13/2019] [Accepted: 09/13/2019] [Indexed: 11/14/2022] Open
Abstract
Motivation De novo motif discovery algorithms find statistically over-represented sequence motifs that may function as transcription factor binding sites. Current methods often report large numbers of motifs, making it difficult to perform further analyses and experimental validation. The motif selection problem seeks to identify a minimal set of putative regulatory motifs that characterize sequences of interest (e.g. ChIP-Seq binding regions). Results In this study, the motif selection problem is mapped to variants of the set cover problem that are solved via tabu search and by relaxed integer linear programing (RILP). The algorithms are employed to analyze 349 ChIP-Seq experiments from the ENCODE project, yielding a small number of high-quality motifs that represent putative binding sites of primary factors and cofactors. Specifically, when compared with the motifs reported by Kheradpour and Kellis, the set cover-based algorithms produced motif sets covering 35% more peaks for 11 TFs and identified 4 more putative cofactors for 6 TFs. Moreover, a systematic evaluation using nested cross-validation revealed that the RILP algorithm selected fewer motifs and was able to cover 6% more peaks and 3% fewer background regions, which reduced the error rate by 7%. Availability and implementation The source code of the algorithms and all the datasets are available at https://github.com/YichaoOU/Set_cover_tools. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yichao Li
- Department of Electrical Engineering and Computer Science, Ohio University, Athens, OH 45701, USA
| | - Yating Liu
- Department of Electrical Engineering and Computer Science, Ohio University, Athens, OH 45701, USA
| | - David Juedes
- Department of Electrical Engineering and Computer Science, Ohio University, Athens, OH 45701, USA
| | - Frank Drews
- Department of Electrical Engineering and Computer Science, Ohio University, Athens, OH 45701, USA
| | - Razvan Bunescu
- Department of Electrical Engineering and Computer Science, Ohio University, Athens, OH 45701, USA
| | - Lonnie Welch
- Department of Electrical Engineering and Computer Science, Ohio University, Athens, OH 45701, USA
| |
Collapse
|
25
|
Wang Z, He W, Tang J, Guo F. Identification of Highest-Affinity Binding Sites of Yeast Transcription Factor Families. J Chem Inf Model 2020; 60:1876-1883. [DOI: 10.1021/acs.jcim.9b01012] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Zongyu Wang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
| | - Wenying He
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
| | - Jijun Tang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, P. R. China
- Department of Computer Science and Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| | - Fei Guo
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
| |
Collapse
|
26
|
Hashim FA, Houssein EH, Hussain K, Mabrouk MS, Al-Atabany W. A modified Henry gas solubility optimization for solving motif discovery problem. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04611-0] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
27
|
Kunkel J, Luo X, Capaldi AP. Integrated TORC1 and PKA signaling control the temporal activation of glucose-induced gene expression in yeast. Nat Commun 2019; 10:3558. [PMID: 31395866 PMCID: PMC6687784 DOI: 10.1038/s41467-019-11540-y] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Accepted: 07/19/2019] [Indexed: 01/04/2023] Open
Abstract
The growth rate of a yeast cell is controlled by the target of rapamycin kinase complex I (TORC1) and cAMP-dependent protein kinase (PKA) pathways. To determine how TORC1 and PKA cooperate to regulate cell growth, we performed temporal analysis of gene expression in yeast switched from a non-fermentable substrate, to glucose, in the presence and absence of TORC1 and PKA inhibitors. Quantitative analysis of these data reveals that PKA drives the expression of key cell growth genes during transitions into, and out of, the rapid growth state in glucose, while TORC1 is important for the steady-state expression of the same genes. This circuit design may enable yeast to set an exact growth rate based on the abundance of internal metabolites such as amino acids, via TORC1, but also adapt rapidly to changes in external nutrients, such as glucose, via PKA.
Collapse
Affiliation(s)
- Joseph Kunkel
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ, 85721-0206, USA
| | - Xiangxia Luo
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ, 85721-0206, USA
| | - Andrew P Capaldi
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ, 85721-0206, USA.
| |
Collapse
|
28
|
Parnell EJ, Stillman DJ. Multiple Negative Regulators Restrict Recruitment of the SWI/SNF Chromatin Remodeler to the HO Promoter in Saccharomyces cerevisiae. Genetics 2019; 212:1181-1204. [PMID: 31167839 PMCID: PMC6707452 DOI: 10.1534/genetics.119.302359] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Accepted: 05/30/2019] [Indexed: 01/22/2023] Open
Abstract
Activation of the Saccharomyces cerevisiae HO promoter is highly regulated, requiring the ordered recruitment of activators and coactivators and allowing production of only a few transcripts in mother cells within a short cell cycle window. We conducted genetic screens to identify the negative regulators of HO expression necessary to limit HO transcription. Known repressors of HO (Ash1 and Rpd3) were identified, as well as several additional chromatin-associated factors including the Hda1 histone deacetylase, the Isw2 chromatin remodeler, and the corepressor Tup1 We also identified clusters of HO promoter mutations that suggested roles for the Dot6/Tod6 (PAC site) and Ume6 repression pathways. We used ChIP assays with synchronized cells to validate the involvement of these factors and map the association of Ash1, Dot6, and Ume6 with the HO promoter to a brief window in the cell cycle between binding of the initial activating transcription factor and initiation of transcription. We found that Ash1 and Ume6 each recruit the Rpd3 histone deacetylase to HO, and their effects are additive. In contrast, Rpd3 was not recruited significantly to the PAC site, suggesting this site has a distinct mechanism for repression. Increases in HO expression and SWI/SNF recruitment were all additive upon loss of Ash1, Ume6, and PAC site factors, indicating the convergence of independent pathways for repression. Our results demonstrate that multiple protein complexes are important for limiting the spread of SWI/SNF-mediated nucleosome eviction across the HO promoter, suggesting that regulation requires a delicate balance of activities that promote and repress transcription.
Collapse
Affiliation(s)
- Emily J Parnell
- Department of Pathology, University of Utah Health Sciences Center, Salt Lake City, Utah 84112
| | - David J Stillman
- Department of Pathology, University of Utah Health Sciences Center, Salt Lake City, Utah 84112
| |
Collapse
|
29
|
Agrawal A, Sambare SV, Narlikar L, Siddharthan R. THiCweed: fast, sensitive detection of sequence features by clustering big datasets. Nucleic Acids Res 2019; 46:e29. [PMID: 29267972 PMCID: PMC5861420 DOI: 10.1093/nar/gkx1251] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 12/01/2017] [Indexed: 11/19/2022] Open
Abstract
We present THiCweed, a new approach to analyzing transcription factor binding data from high-throughput chromatin immunoprecipitation-sequencing (ChIP-seq) experiments. THiCweed clusters bound regions based on sequence similarity using a divisive hierarchical clustering approach based on sequence similarity within sliding windows, while exploring both strands. ThiCweed is specially geared toward data containing mixtures of motifs, which present a challenge to traditional motif-finders. Our implementation is significantly faster than standard motif-finding programs, able to process 30 000 peaks in 1–2 h, on a single CPU core of a desktop computer. On synthetic data containing mixtures of motifs it is as accurate or more accurate than all other tested programs. THiCweed performs best with large ‘window’ sizes (≥50 bp), much longer than typical binding sites (7–15 bp). On real data it successfully recovers literature motifs, but also uncovers complex sequence characteristics in flanking DNA, variant motifs and secondary motifs even when they occur in <5% of the input, all of which appear biologically relevant. We also find recurring sequence patterns across diverse ChIP-seq datasets, possibly related to chromatin architecture and looping. THiCweed thus goes beyond traditional motif finding to give new insights into genomic transcription factor-binding complexity.
Collapse
Affiliation(s)
- Ankit Agrawal
- Computational Biology Group, The Institute of Mathematical Sciences (HBNI), Chennai 600113, Tamil Nadu, India
| | - Snehal V Sambare
- Computational Biology Group, The Institute of Mathematical Sciences (HBNI), Chennai 600113, Tamil Nadu, India
| | - Leelavati Narlikar
- Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory, Pune 411008, Maharashtra, India
| | - Rahul Siddharthan
- Computational Biology Group, The Institute of Mathematical Sciences (HBNI), Chennai 600113, Tamil Nadu, India
| |
Collapse
|
30
|
Deyell M, Ameta S, Nghe P. Large scale control and programming of gene expression using CRISPR. Semin Cell Dev Biol 2019; 96:124-132. [PMID: 31181342 DOI: 10.1016/j.semcdb.2019.05.013] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2018] [Accepted: 05/14/2019] [Indexed: 12/22/2022]
Abstract
The control of gene expression in cells and organisms allows to unveil gene to function relationships and to reprogram biological responses. Several systems, such as Zinc fingers, TALE (Transcription activator-like effectors), and siRNAs (small-interfering RNAs), have been exploited to achieve this. However, recent advances in Clustered Regularly Interspaced Short Palindromic Repeats and Cas9 (CRISPR-Cas9) have overshadowed them due to high specificity, compatibility with many different organisms, and design flexibility. In this review we summarize state-of-the art for CRISPR-Cas9 technology for large scale gene perturbation studies, including single gene and multiple genes knock-out, knock-down, knock-up libraries, and their associated screening assays. We feature in particular the combination of these methods with single-cell transcriptomics approaches. Finally, we highlight the application of CRISPR-Cas9 systems in building synthetic circuits that can be interfaced with gene networks to control cellular states.
Collapse
Affiliation(s)
- Matthew Deyell
- Laboratoire de Biochimie, CNRS UMR8231, Chimie Biologie Innovation, PSL Research University, ESPCI Paris, 10 Rue Vauquelin, 75005, Paris, France
| | - Sandeep Ameta
- Laboratoire de Biochimie, CNRS UMR8231, Chimie Biologie Innovation, PSL Research University, ESPCI Paris, 10 Rue Vauquelin, 75005, Paris, France
| | - Philippe Nghe
- Laboratoire de Biochimie, CNRS UMR8231, Chimie Biologie Innovation, PSL Research University, ESPCI Paris, 10 Rue Vauquelin, 75005, Paris, France.
| |
Collapse
|
31
|
Shaik Ibrahim SH, Chellasamy R, Pugalendhi G. Optimal Estimation of Binding Preference of Protein in PBM Data using Clustering based Modified Jaya Optimization. J Bioinform Comput Biol 2019. [DOI: 10.1142/s0219720019500161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
32
|
Albert B, Tomassetti S, Gloor Y, Dilg D, Mattarocci S, Kubik S, Hafner L, Shore D. Sfp1 regulates transcriptional networks driving cell growth and division through multiple promoter-binding modes. Genes Dev 2019; 33:288-293. [PMID: 30804227 PMCID: PMC6411004 DOI: 10.1101/gad.322040.118] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 12/17/2018] [Indexed: 12/19/2022]
Abstract
In this study, Albert et al. investigated the mechanisms by which the yeast Sfp1 protein coordinates both cell division and growth. They demonstrate that Sfp1 directly controls genes required for ribosome production and many other growth-promoting processes. The yeast Sfp1 protein regulates both cell division and growth but how it coordinates these processes is poorly understood. We demonstrate that Sfp1 directly controls genes required for ribosome production and many other growth-promoting processes. Remarkably, the complete set of Sfp1 target genes is revealed only by a combination of ChIP (chromatin immunoprecipitation) and ChEC (chromatin endogenous cleavage) methods, which uncover two promoter binding modes, one requiring a cofactor and the other a DNA-recognition motif. Glucose-regulated Sfp1 binding at cell cycle “START” genes suggests that Sfp1 controls cell size by coordinating expression of genes implicated in mass accumulation and cell division.
Collapse
Affiliation(s)
- Benjamin Albert
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Susanna Tomassetti
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Yvonne Gloor
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Daniel Dilg
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Stefano Mattarocci
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Slawomir Kubik
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - Lukas Hafner
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| | - David Shore
- Department of Molecular Biology and Institute of Genetics and Genomics of Geneva (iGE3), 1211 Geneva 4, Switzerland
| |
Collapse
|
33
|
Hashim FA, Mabrouk MS, Atabany WA. Comparative Analysis of DNA Motif Discovery Algorithms: A Systemic Review. CURRENT CANCER THERAPY REVIEWS 2019. [DOI: 10.2174/1573394714666180417161728] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
Bioinformatics is an interdisciplinary field that combines biology and information
technology to study how to deal with the biological data. The DNA motif discovery
problem is the main challenge of genome biology and its importance is directly proportional to increasing
sequencing technologies which produce large amounts of data. DNA motif is a repeated
portion of DNA sequences of major biological interest with important structural and functional
features. Motif discovery plays a vital role in the antibody-biomarker identification which is useful
for diagnosis of disease and to identify Transcription Factor Binding Sites (TFBSs) that help in
learning the mechanisms for regulation of gene expression. Recently, scientists discovered that the
TFs have a mutation rate five times higher than the flanking sequences, so motif discovery also
has a crucial role in cancer discovery.
Methods:
Over the past decades, many attempts use different algorithms to design fast and accurate
motif discovery tools. These algorithms are generally classified into consensus or probabilistic
approach.
Results:
Many of DNA motif discovery algorithms are time-consuming and easily trapped in a local
optimum.
Conclusion:
Nature-inspired algorithms and many of combinatorial algorithms are recently proposed
to overcome the problems of consensus and probabilistic approaches. This paper presents a
general classification of motif discovery algorithms with new sub-categories. It also presents a
summary comparison between them.
Collapse
Affiliation(s)
- Fatma A. Hashim
- Department of Biomedical Engineering, Helwan University, Helwan, Egypt
| | - Mai S. Mabrouk
- Department of Biomedical Engineering, Misr University for Science and Technology (MUST), Cairo, Egypt
| | | |
Collapse
|
34
|
Dempster-Shafer Theory for the Prediction of Auxin-Response Elements (AuxREs) in Plant Genomes. BIOMED RESEARCH INTERNATIONAL 2018; 2018:3837060. [PMID: 30515394 PMCID: PMC6236769 DOI: 10.1155/2018/3837060] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Accepted: 10/15/2018] [Indexed: 11/17/2022]
Abstract
Auxin is a major regulator of plant growth and development; its action involves transcriptional activation. The identification of Auxin-response element (AuxRE) is one of the most important issues to understand the Auxin regulation of gene expression. Over the past few years, a large number of motif identification tools have been developed. Despite these considerable efforts provided by computational biologists, building reliable models to predict regulatory elements has still been a difficult challenge. In this context, we propose in this work a data fusion approach for the prediction of AuxRE. Our method is based on the combined use of Dempster-Shafer evidence theory and fuzzy theory. To evaluate our model, we have scanning the DORNRÖSCHEN promoter by our model. All proven AuxRE present in the promoter has been detected. At the 0.9 threshold we have no false positive. The comparison of the results of our model and some previous motifs finding tools shows that our model can predict AuxRE more successfully than the other tools and produce less false positive. The comparison of the results before and after combination shows the importance of Dempster-Shafer combination in the decrease of false positive and to improve the reliability of prediction. For an overall evaluation we have chosen to present the performance of our approach in comparison with other methods. In fact, the results indicated that the data fusion method has the highest degree of sensitivity (Sn) and Positive Predictive Value (PPV).
Collapse
|
35
|
Zhang H, Zhu L, Huang DS. DiscMLA: An Efficient Discriminative Motif Learning Algorithm over High-Throughput Datasets. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1810-1820. [PMID: 27164602 DOI: 10.1109/tcbb.2016.2561930] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The transcription factors (TFs) can activate or suppress gene expression by binding to specific sites, hence are crucial regulatory elements for transcription. Recently, series of discriminative motif finders have been tailored to offering promising strategy for harnessing the power of large quantities of accumulated high-throughput experimental data. However, in order to achieve high speed, these algorithms have to sacrifice accuracy by employing simplified statistical models during the searching process. In this paper, we propose a novel approach named Discriminative Motif Learning via AUC (DiscMLA) to discover motifs on high-throughput datasets. Unlike previous approaches, DiscMLA tries to optimize with a more comprehensive criterion (AUC) during motifs searching. In addition, based on an experimental observation of motif identification on large-scale datasets, some novel procedures are designed to accelerate DiscMLA. The experimental results on 52 real-world datasets demonstrate that our approach substantially outperforms previous methods on discriminative motif learning problems. DiscMLA' stability, discriminability, and validity will help to exploit high-throughput datasets and answer many fundamental biological questions.
Collapse
|
36
|
Lee YT, Fang YY, Sun YW, Hsu HC, Weng SM, Tseng TL, Lin TH, Shieh JC. THR1 mediates GCN4 and CDC4 to link morphogenesis with nutrient sensing and the stress response in Candida albicans. Int J Mol Med 2018; 42:3193-3208. [PMID: 30320368 PMCID: PMC6202100 DOI: 10.3892/ijmm.2018.3930] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2018] [Accepted: 10/08/2018] [Indexed: 12/31/2022] Open
Abstract
Candida albicans (C. albicans) CDC4 (CaCDC4), encoding the F-box protein for the substrate specificity of the Skp1-cullin-F-box E3 ubiquitin ligase complex, suppresses the yeast-to-filament transition in C. albicans. In our previous study, Thr1 was identified as a CaCdc4-associated protein using affinity purification. THR1 encodes a homoserine kinase, which is involved in the threonine biosynthesis pathway. The present study generated a strain with repressible CaCDC4 expression and continuous THR1 expression. Colony and cell morphology analyses, as well as immunoblotting, revealed that the Thr1 protein was detectable under conditions in which the expression of CaCDC4 was repressed and that the filaments resulting from the repressed expression of CaCDC4 were suppressed by the constitutive expression of THR1 in C. albicans. Additionally, by using the CaSAT1-flipper method, the present study produced null mutants of THR1, GCN4, and CaCDC4. The phenotypic consequences were evaluated by growth curves, spotting assays, microscopic analysis, reverse transcription-polymerase chain reaction and XTT-based biofilm formation ability. The results revealed that fewer cells lacking THR1 entered the stationary phase but had no apparent morphological alteration. It was observed that the expression of THR1 was upregulated concurrently with GCN4 during nutrient depletion and that cells lacking GCN4 rescued the lethality of cells in the absence of THR1 in conditions accumulating homoserine in the threonine biosynthesis pathway. Of note, it was found that cells with either CaCDC4 or THR1 loss were sensitive to oxidative stress and osmotic stress, with those with THR1 loss being more sensitive. In addition, it was observed that cells with loss of either CaCDC4 or THR1 exhibited the ability to increase biofilm formation, with those lacking CaCDC4 exhibiting a greater extent of enhancement. It was concluded that CaCDC4 is important in the coordination of morphogenesis, nutrient sensing, and the stress response through THR1 in C. albicans.
Collapse
Affiliation(s)
- Yuan-Ti Lee
- Institute of Medicine and School of Medicine, Chung Shan Medical University, Taichung City 40201, Taiwan, R.O.C
| | - Yi-Ya Fang
- Department of Biomedical Sciences, Chung Shan Medical University, Taichung City 40201, Taiwan, R.O.C
| | - Yu Wen Sun
- Department of Biomedical Sciences, Chung Shan Medical University, Taichung City 40201, Taiwan, R.O.C
| | - Hsiao-Chi Hsu
- Department of Biomedical Sciences, Chung Shan Medical University, Taichung City 40201, Taiwan, R.O.C
| | - Shan-Mei Weng
- Department of Biomedical Sciences, Chung Shan Medical University, Taichung City 40201, Taiwan, R.O.C
| | - Tzu-Ling Tseng
- Department of Biomedical Sciences, Chung Shan Medical University, Taichung City 40201, Taiwan, R.O.C
| | - Ting-Hui Lin
- Department of Biomedical Sciences, Chung Shan Medical University, Taichung City 40201, Taiwan, R.O.C
| | - Jia-Ching Shieh
- Department of Biomedical Sciences, Chung Shan Medical University, Taichung City 40201, Taiwan, R.O.C
| |
Collapse
|
37
|
Lee NK, Li X, Wang D. A comprehensive survey on genetic algorithms for DNA motif prediction. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.07.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
38
|
Bruzzone MJ, Grünberg S, Kubik S, Zentner GE, Shore D. Distinct patterns of histone acetyltransferase and Mediator deployment at yeast protein-coding genes. Genes Dev 2018; 32:1252-1265. [PMID: 30108132 PMCID: PMC6120713 DOI: 10.1101/gad.312173.118] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Accepted: 06/19/2018] [Indexed: 02/07/2023]
Abstract
Here, Bruzzone et al. explore the relative contributions of the transcriptional coactivators Mediator and two histone acetyltransferase (HAT) complexes, NuA4 and SAGA, to RNA polymerase II association at specific genes and gene classes by rapid nuclear depletion of key complex subunits. They show that the NuA4 HAT Esa1 differentially affects certain groups of genes, whereas the SAGA HAT Gcn5 has a weaker but more uniform effect, and their findings suggest that at least three distinct combinations of coactivator deployment are used to generate moderate or high transcription levels. The transcriptional coactivators Mediator and two histone acetyltransferase (HAT) complexes, NuA4 and SAGA, play global roles in transcriptional activation. Here we explore the relative contributions of these factors to RNA polymerase II association at specific genes and gene classes by rapid nuclear depletion of key complex subunits. We show that the NuA4 HAT Esa1 differentially affects certain groups of genes, whereas the SAGA HAT Gcn5 has a weaker but more uniform effect. Relative dependence on Esa1 and Tra1, a shared component of NuA4 and SAGA, distinguishes two large groups of coregulated growth-promoting genes. In contrast, we show that the activity of Mediator is particularly important at a separate, small set of highly transcribed TATA-box-containing genes. Our analysis indicates that at least three distinct combinations of coactivator deployment are used to generate moderate or high transcription levels and suggests that each may be associated with distinct forms of regulation.
Collapse
Affiliation(s)
- Maria Jessica Bruzzone
- Department of Molecular Biology, Institute of Genetics and Genomics in Geneva, 1211 Geneva 4, Switzerland
| | - Sebastian Grünberg
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | - Slawomir Kubik
- Department of Molecular Biology, Institute of Genetics and Genomics in Geneva, 1211 Geneva 4, Switzerland
| | - Gabriel E Zentner
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - David Shore
- Department of Molecular Biology, Institute of Genetics and Genomics in Geneva, 1211 Geneva 4, Switzerland
| |
Collapse
|
39
|
Lee NK, Azizan FL, Wong YS, Omar N. DeepFinder: An integration of feature-based and deep learning approach for DNA motif discovery. BIOTECHNOL BIOTEC EQ 2018. [DOI: 10.1080/13102818.2018.1438209] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
Affiliation(s)
- Nung Kion Lee
- Department of Cognitive Sciences, Faculty of Cognitive Sciences and Human Development, Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| | - Farah Liyana Azizan
- Centre For Pre-University Studies, Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| | - Yu Shiong Wong
- Department of Cognitive Sciences, Faculty of Cognitive Sciences and Human Development, Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| | - Norshafarina Omar
- Department of Cognitive Sciences, Faculty of Cognitive Sciences and Human Development, Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| |
Collapse
|
40
|
Vishnevsky OV, Bocharnikov AV, Kolchanov NA. Argo_CUDA: Exhaustive GPU based approach for motif discovery in large DNA datasets. J Bioinform Comput Biol 2017; 16:1740012. [PMID: 29281953 DOI: 10.1142/s0219720017400121] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The development of chromatin immunoprecipitation sequencing (ChIP-seq) technology has revolutionized the genetic analysis of the basic mechanisms underlying transcription regulation and led to accumulation of information about a huge amount of DNA sequences. There are a lot of web services which are currently available for de novo motif discovery in datasets containing information about DNA/protein binding. An enormous motif diversity makes their finding challenging. In order to avoid the difficulties, researchers use different stochastic approaches. Unfortunately, the efficiency of the motif discovery programs dramatically declines with the query set size increase. This leads to the fact that only a fraction of top "peak" ChIP-Seq segments can be analyzed or the area of analysis should be narrowed. Thus, the motif discovery in massive datasets remains a challenging issue. Argo_Compute Unified Device Architecture (CUDA) web service is designed to process the massive DNA data. It is a program for the detection of degenerate oligonucleotide motifs of fixed length written in 15-letter IUPAC code. Argo_CUDA is a full-exhaustive approach based on the high-performance GPU technologies. Compared with the existing motif discovery web services, Argo_CUDA shows good prediction quality on simulated sets. The analysis of ChIP-Seq sequences revealed the motifs which correspond to known transcription factor binding sites.
Collapse
Affiliation(s)
- Oleg V Vishnevsky
- * Institute of Cytology and Genetics SB RAS, Lavrentieva Ave., 10, Novosibirsk 630090, Russia.,† Novosibirsk State University, Pirogova, 10, Novosibirsk 630090, Russia
| | | | - Nikolay A Kolchanov
- * Institute of Cytology and Genetics SB RAS, Lavrentieva Ave., 10, Novosibirsk 630090, Russia.,† Novosibirsk State University, Pirogova, 10, Novosibirsk 630090, Russia
| |
Collapse
|
41
|
Pei C, Wang SL, Fang J, Zhang W. GSMC: Combining Parallel Gibbs Sampling with Maximal Cliques for Hunting DNA Motif. J Comput Biol 2017; 24:1243-1253. [PMID: 29116820 DOI: 10.1089/cmb.2017.0100] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Regulatory elements are responsible for regulating gene transcription. Therefore, identification of these elements is a tremendous challenge in the field of gene expression. Transcription factors (TFs) play a key role in gene regulation by binding to target promoter sequences. A set of conserved sequence patterns with a highly similar structure that is bound by a TF is called a motif. Motif discovery has been a difficult problem over the past decades. Meanwhile, it is a foundation stone in meeting this challenge. Recent advances in obtaining genomic sequences and high-throughput gene expression analysis techniques have enabled the rapid development of computational methods for motif discovery. As a result, a large number of motif-finding algorithms aiming at various motif models have sprung up in the past few years. However, most of them are not suitable for analysis of the large data sets generated by next-generation sequencing. To better handle large-scale ChIP-Seq data and achieve better performance in computational time and motif detection accuracy, we propose an excellent motif-finding algorithm known as GSMC (Combining Parallel Gibbs Sampling with Maximal Cliques for hunting DNA Motif). The GSMC algorithm consists of two steps. First, we employ the commonly used Gibbs sampling to generating initial motifs. Second, we utilize maximal cliques to cluster motifs according to Similarity with Position Information Contents (SPIC). Consequently, we raise the detection accuracy in a great degree, in the meantime holding comparative computation efficiency. In addition, we can find much more credible cofactor interacting motifs.
Collapse
Affiliation(s)
- Chao Pei
- 1 College of Computer Science and Electronics Engineering, Hunan University , Changsha, China
| | - Shu-Lin Wang
- 1 College of Computer Science and Electronics Engineering, Hunan University , Changsha, China
| | - Jianwen Fang
- 2 Biometric Research Program, Division of Cancer Treatment and Diagnosis, National Cancer Institute , Rockville, MD 20850
| | - Wei Zhang
- 1 College of Computer Science and Electronics Engineering, Hunan University , Changsha, China
| |
Collapse
|
42
|
Song J, Bjarnason J, Surette MG. The identification of functional motifs in temporal gene expression analysis. Evol Bioinform Online 2017. [DOI: 10.1177/117693430500100008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR) and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur) binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.
Collapse
Affiliation(s)
- Jiuzhou Song
- Department of Animal and Avian Sciences, and University of Maryland, Maryland 20742, USA
| | - Jaime Bjarnason
- Department of Microbiology and Infectious Diseases, and Department of Biochemistry and Molecular Biology, Health Sciences Centre, University of Calgary, Calgary, AB, Canada, T2N 4N1
| | - Michael G. Surette
- Department of Microbiology and Infectious Diseases, and Department of Biochemistry and Molecular Biology, Health Sciences Centre, University of Calgary, Calgary, AB, Canada, T2N 4N1
| |
Collapse
|
43
|
Gómez-Herreros F, Margaritis T, Rodríguez-Galán O, Pelechano V, Begley V, Millán-Zambrano G, Morillo-Huesca M, Muñoz-Centeno MC, Pérez-Ortín JE, de la Cruz J, Holstege FCP, Chávez S. The ribosome assembly gene network is controlled by the feedback regulation of transcription elongation. Nucleic Acids Res 2017. [PMID: 28637236 PMCID: PMC5737610 DOI: 10.1093/nar/gkx529] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Ribosome assembly requires the concerted expression of hundreds of genes, which are transcribed by all three nuclear RNA polymerases. Transcription elongation involves dynamic interactions between RNA polymerases and chromatin. We performed a synthetic lethal screening in Saccharomyces cerevisiae with a conditional allele of SPT6, which encodes one of the factors that facilitates this process. Some of these synthetic mutants corresponded to factors that facilitate pre-rRNA processing and ribosome biogenesis. We found that the in vivo depletion of one of these factors, Arb1, activated transcription elongation in the set of genes involved directly in ribosome assembly. Under these depletion conditions, Spt6 was physically targeted to the up-regulated genes, where it helped maintain their chromatin integrity and the synthesis of properly stable mRNAs. The mRNA profiles of a large set of ribosome biogenesis mutants confirmed the existence of a feedback regulatory network among ribosome assembly genes. The transcriptional response in this network depended on both the specific malfunction and the role of the regulated gene. In accordance with our screening, Spt6 positively contributed to the optimal operation of this global network. On the whole, this work uncovers a feedback control of ribosome biogenesis by fine-tuning transcription elongation in ribosome assembly factor-coding genes.
Collapse
Affiliation(s)
- Fernando Gómez-Herreros
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC-Universidad de Sevilla, and Departamento de Genética, Universidad de Sevilla, 41013 Seville, Spain
| | - Thanasis Margaritis
- Molecular Cancer Research, University Medical Center Utrecht, & Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
| | - Olga Rodríguez-Galán
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC-Universidad de Sevilla, and Departamento de Genética, Universidad de Sevilla, 41013 Seville, Spain
| | - Vicent Pelechano
- Departamento de Bioquímica y Biología Molecular and ERI Biotecmed. Facultad de Biológicas, Universitat de València. Burjassot, Spain.,SciLifeLab, Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, 171 65 Solna, Sweden
| | - Victoria Begley
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC-Universidad de Sevilla, and Departamento de Genética, Universidad de Sevilla, 41013 Seville, Spain
| | - Gonzalo Millán-Zambrano
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC-Universidad de Sevilla, and Departamento de Genética, Universidad de Sevilla, 41013 Seville, Spain
| | - Macarena Morillo-Huesca
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC-Universidad de Sevilla, and Departamento de Genética, Universidad de Sevilla, 41013 Seville, Spain
| | - Mari Cruz Muñoz-Centeno
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC-Universidad de Sevilla, and Departamento de Genética, Universidad de Sevilla, 41013 Seville, Spain
| | - José E Pérez-Ortín
- Departamento de Bioquímica y Biología Molecular and ERI Biotecmed. Facultad de Biológicas, Universitat de València. Burjassot, Spain
| | - Jesús de la Cruz
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC-Universidad de Sevilla, and Departamento de Genética, Universidad de Sevilla, 41013 Seville, Spain
| | - Frank C P Holstege
- Molecular Cancer Research, University Medical Center Utrecht, & Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
| | - Sebastián Chávez
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC-Universidad de Sevilla, and Departamento de Genética, Universidad de Sevilla, 41013 Seville, Spain
| |
Collapse
|
44
|
de la Cruz J, Gómez-Herreros F, Rodríguez-Galán O, Begley V, de la Cruz Muñoz-Centeno M, Chávez S. Feedback regulation of ribosome assembly. Curr Genet 2017; 64:393-404. [PMID: 29022131 DOI: 10.1007/s00294-017-0764-x] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Revised: 10/06/2017] [Accepted: 10/07/2017] [Indexed: 12/12/2022]
Abstract
Ribosome biogenesis is a crucial process for growth and constitutes the major consumer of cellular resources. This pathway is subjected to very stringent regulation to ensure correct ribosome manufacture with a wide variety of environmental and metabolic changes, and intracellular insults. Here we summarise our current knowledge on the regulation of ribosome biogenesis in Saccharomyces cerevisiae by particularly focusing on the feedback mechanisms that maintain ribosome homeostasis. Ribosome biogenesis in yeast is controlled mainly at the level of the production of both pre-rRNAs and ribosomal proteins through the transcriptional and post-transcriptional control of the TORC1 and protein kinase A signalling pathways. Pre-rRNA processing can occur before or after the 35S pre-rRNA transcript is completed; the switch between these two alternatives is regulated by growth conditions. The expression of both ribosomal proteins and the large family of transacting factors involved in ribosome biogenesis is co-regulated. Recently, it has been shown that the synthesis of rRNA and ribosomal proteins, but not of trans-factors, is coupled. Thus the so-called CURI complex sequesters specific transcription factor Ifh1 to repress ribosomal protein genes when rRNA transcription is impaired. We recently found that an analogue system should operate to control the expression of transacting factor genes in response to actual ribosome assembly performance. Regulation of ribosome biogenesis manages situations of imbalanced ribosome production or misassembled ribosomal precursors and subunits, which have been closely linked to distinct human diseases.
Collapse
Affiliation(s)
- Jesús de la Cruz
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC, Universidad de Sevilla, Seville, Spain. .,Departamento de Genética, Universidad de Sevilla, Seville, Spain.
| | - Fernando Gómez-Herreros
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC, Universidad de Sevilla, Seville, Spain.,Departamento de Genética, Universidad de Sevilla, Seville, Spain
| | - Olga Rodríguez-Galán
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC, Universidad de Sevilla, Seville, Spain.,Departamento de Genética, Universidad de Sevilla, Seville, Spain
| | - Victoria Begley
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC, Universidad de Sevilla, Seville, Spain.,Departamento de Genética, Universidad de Sevilla, Seville, Spain
| | - María de la Cruz Muñoz-Centeno
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC, Universidad de Sevilla, Seville, Spain.,Departamento de Genética, Universidad de Sevilla, Seville, Spain
| | - Sebastián Chávez
- Instituto de Biomedicina de Sevilla (IBiS), Hospital Virgen del Rocío-CSIC, Universidad de Sevilla, Seville, Spain. .,Departamento de Genética, Universidad de Sevilla, Seville, Spain.
| |
Collapse
|
45
|
Predicting DNA Motifs by Using Multi-Objective Hybrid Adaptive Biogeography-Based Optimization. INFORMATION 2017. [DOI: 10.3390/info8040115] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
46
|
Bosio MC, Fermi B, Spagnoli G, Levati E, Rubbi L, Ferrari R, Pellegrini M, Dieci G. Abf1 and other general regulatory factors control ribosome biogenesis gene expression in budding yeast. Nucleic Acids Res 2017; 45:4493-4506. [PMID: 28158860 PMCID: PMC5416754 DOI: 10.1093/nar/gkx058] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2016] [Accepted: 01/25/2017] [Indexed: 01/21/2023] Open
Abstract
Ribosome biogenesis in Saccharomyces cerevisiae involves a regulon of >200 genes (Ribi genes) coordinately regulated in response to nutrient availability and cellular growth rate. Two cis-acting elements called PAC and RRPE are known to mediate Ribi gene repression in response to nutritional downshift. Here, we show that most Ribi gene promoters also contain binding sites for one or more General Regulatory Factors (GRFs), most frequently Abf1 and Reb1, and that these factors are enriched in vivo at Ribi promoters. Abf1/Reb1/Tbf1 promoter association was required for full Ribi gene expression in rich medium and for its modulation in response to glucose starvation, characterized by a rapid drop followed by slow recovery. Such a response did not entail changes in Abf1 occupancy, but it was paralleled by a quick increase, followed by slow decrease, in Rpd3L histone deacetylase occupancy. Remarkably, Abf1 site disruption also abolished Rpd3L complex recruitment in response to starvation. Extensive mutational analysis of the DBP7 promoter revealed a complex interplay of Tbf1 sites, PAC and RRPE in the transcriptional regulation of this Ribi gene. Our observations point to GRFs as new multifaceted players in Ribi gene regulation both during exponential growth and under repressive conditions.
Collapse
Affiliation(s)
- Maria Cristina Bosio
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parco Area delle Scienze 23/A, 43124 Parma, Italy
| | - Beatrice Fermi
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parco Area delle Scienze 23/A, 43124 Parma, Italy
| | - Gloria Spagnoli
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parco Area delle Scienze 23/A, 43124 Parma, Italy
| | - Elisabetta Levati
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parco Area delle Scienze 23/A, 43124 Parma, Italy
| | - Ludmilla Rubbi
- Department of Molecular, Cell and Developmental Biology, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Roberto Ferrari
- Department of Molecular, Cell and Developmental Biology, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Matteo Pellegrini
- Department of Molecular, Cell and Developmental Biology, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Giorgio Dieci
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parco Area delle Scienze 23/A, 43124 Parma, Italy
| |
Collapse
|
47
|
Cis and trans determinants of epigenetic silencing by Polycomb repressive complex 2 in Arabidopsis. Nat Genet 2017; 49:1546-1552. [DOI: 10.1038/ng.3937] [Citation(s) in RCA: 150] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2017] [Accepted: 07/21/2017] [Indexed: 12/12/2022]
|
48
|
Fu H, Zhang X. Noncoding Variants Functional Prioritization Methods Based on Predicted Regulatory Factor Binding Sites. Curr Genomics 2017; 18:322-331. [PMID: 29081688 PMCID: PMC5635616 DOI: 10.2174/1389202918666170228143619] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Revised: 10/16/2016] [Accepted: 11/02/2016] [Indexed: 12/31/2022] Open
Abstract
BACKGROUNDS With the advent of the post genomic era, the research for the genetic mechanism of the diseases has found to be increasingly depended on the studies of the genes, the gene-networks and gene-protein interaction networks. To explore gene expression and regulation, the researchers have carried out many studies on transcription factors and their binding sites (TFBSs). Based on the large amount of transcription factor binding sites predicting values in the deep learning models, further computation and analysis have been done to reveal the relationship between the gene mutation and the occurrence of the disease. It has been demonstrated that based on the deep learning methods, the performances of the prediction for the functions of the noncoding variants are outperforming than those of the conventional methods. The research on the prediction for functions of Single Nucleotide Polymorphisms (SNPs) is expected to uncover the mechanism of the gene mutation affection on traits and diseases of human beings. RESULTS We reviewed the conventional TFBSs identification methods from different perspectives. As for the deep learning methods to predict the TFBSs, we discussed the related problems, such as the raw data preprocessing, the structure design of the deep convolution neural network (CNN) and the model performance measure et al. And then we summarized the techniques that usually used in finding out the functional noncoding variants from de novo sequence. CONCLUSION Along with the rapid development of the high-throughout assays, more and more sample data and chromatin features would be conducive to improve the prediction accuracy of the deep convolution neural network for TFBSs identification. Meanwhile, getting more insights into the deep CNN framework itself has been proved useful for both the promotion on model performance and the development for more suitable design to sample data. Based on the feature values predicted by the deep CNN model, the prioritization model for functional noncoding variants would contribute to reveal the affection of gene mutation on the diseases.
Collapse
Affiliation(s)
- Haoyue Fu
- College of Sciences, Northeastern University, Shenyang, China
| | - LianpingYang
- College of Sciences, Northeastern University, Shenyang, China
- University of Southern California, Dept. Biol. Sci., Program Mol & Computat Biol, USA
| | - Xiangde Zhang
- College of Sciences, Northeastern University, Shenyang, China
| |
Collapse
|
49
|
Abstract
The judicious choice of promoter to drive gene expression remains one of the most important considerations for synthetic biology applications. Constitutive promoter sequences isolated from nature are often used in laboratory settings or small-scale commercial production streams, but unconventional microbial chassis for new synthetic biology applications require well-characterized, robust and orthogonal promoters. This review provides an overview of the opportunities and challenges for synthetic promoter discovery and design, including molecular methodologies, such as saturation mutagenesis of flanking regions and mutagenesis by error-prone PCR, as well as the less familiar use of computational and statistical analyses for de novo promoter design.
Collapse
|
50
|
Bosio MC, Fermi B, Dieci G. Transcriptional control of yeast ribosome biogenesis: A multifaceted role for general regulatory factors. Transcription 2017; 8:254-260. [PMID: 28448767 DOI: 10.1080/21541264.2017.1317378] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
In Saccharomyces cerevisiae, a group of more than 200 co-regulated genes (Ribi genes) is involved in ribosome biogenesis. This regulon has recently been shown to rely on a small set of transcriptional regulators (mainly Abf1, but also Reb1, Tbf1 and Rap1) previously referred to as general regulatory factors (GRFs) because of their widespread binding and action at many promoters and other specialized genomic regions. Intriguingly, Abf1 binding to Ribi genes is differentially modulated in response to distinct nutrition signaling pathways. Such a dynamic promoter association has the potential to orchestrate both activation and repression of Ribi genes in synergy with neighboring regulatory sites and through the functional interplay of histone acetyltransferases and deacetylases.
Collapse
Affiliation(s)
- Maria Cristina Bosio
- a Department of Chemistry , Life Sciences and Environmental Sustainability, University of Parma , Parma , Italy
| | - Beatrice Fermi
- a Department of Chemistry , Life Sciences and Environmental Sustainability, University of Parma , Parma , Italy
| | - Giorgio Dieci
- a Department of Chemistry , Life Sciences and Environmental Sustainability, University of Parma , Parma , Italy
| |
Collapse
|