1
|
Wu K, Dhillon N, Bajor A, Abrahamsson S, Kamakaka RT. Yeast heterochromatin stably silences only weak regulatory elements by altering burst duration. Cell Rep 2024; 43:113983. [PMID: 38517895 PMCID: PMC11141299 DOI: 10.1016/j.celrep.2024.113983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 12/25/2023] [Accepted: 03/06/2024] [Indexed: 03/24/2024] Open
Abstract
Transcriptional silencing in Saccharomyces cerevisiae involves the generation of a chromatin state that stably represses transcription. Using multiple reporter assays, a diverse set of upstream activating sequence enhancers and core promoters were investigated for their susceptibility to silencing. We show that heterochromatin stably silences only weak and stress-induced regulatory elements but is unable to stably repress housekeeping gene regulatory elements, and the partial repression of these elements did not result in bistable expression states. Permutation analysis of enhancers and promoters indicates that both elements are targets of repression. Chromatin remodelers help specific regulatory elements to resist repression, most probably by altering nucleosome mobility and changing transcription burst duration. The strong enhancers/promoters can be repressed if silencer-bound Sir1 is increased. Together, our data suggest that the heterochromatic locus has been optimized to stably silence the weak mating-type gene regulatory elements but not strong housekeeping gene regulatory sequences.
Collapse
Affiliation(s)
- Kenneth Wu
- Department of MCD Biology, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA
| | - Namrita Dhillon
- Department of MCD Biology, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA
| | - Antone Bajor
- Electrical Engineering Department, Baskin School of Engineering, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA
| | - Sara Abrahamsson
- Electrical Engineering Department, Baskin School of Engineering, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA
| | - Rohinton T Kamakaka
- Department of MCD Biology, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA.
| |
Collapse
|
2
|
Wu K, Dhillon N, Bajor A, Abrahamson S, Kamakaka RT. Yeast Heterochromatin Only Stably Silences Weak Regulatory Elements by Altering Burst Duration. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.05.561072. [PMID: 37873261 PMCID: PMC10592971 DOI: 10.1101/2023.10.05.561072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
The interplay between nucleosomes and transcription factors leads to programs of gene expression. Transcriptional silencing involves the generation of a chromatin state that represses transcription and is faithfully propagated through DNA replication and cell division. Using multiple reporter assays, including directly visualizing transcription in single cells, we investigated a diverse set of UAS enhancers and core promoters for their susceptibility to heterochromatic gene silencing. These results show that heterochromatin only stably silences weak and stress induced regulatory elements but is unable to stably repress housekeeping gene regulatory elements and the partial repression did not result in bistable expression states. Permutation analysis of different UAS enhancers and core promoters indicate that both elements function together to determine the susceptibility of regulatory sequences to repression. Specific histone modifiers and chromatin remodellers function in an enhancer specific manner to aid these elements to resist repression suggesting that Sir proteins likely function in part by reducing nucleosome mobility. We also show that the strong housekeeping regulatory elements can be repressed if silencer bound Sir1 is increased, suggesting that Sir1 is a limiting component in silencing. Together, our data suggest that the heterochromatic locus has been optimized to stably silence the weak mating type gene regulatory elements but not strong housekeeping gene regulatory sequences which could help explain why these genes are often found at the boundaries of silenced domains.
Collapse
Affiliation(s)
- Kenneth Wu
- Department of MCD Biology, 1156 High Street, University of California, Santa Cruz, CA 95064 USA
| | - Namrita Dhillon
- Department of MCD Biology, 1156 High Street, University of California, Santa Cruz, CA 95064 USA
| | - Antone Bajor
- Electrical Engineering Department, Baskin School of Engineering, 1156 High Street, University of California, Santa Cruz, CA 95064 USA
| | - Sara Abrahamson
- Electrical Engineering Department, Baskin School of Engineering, 1156 High Street, University of California, Santa Cruz, CA 95064 USA
| | - Rohinton T. Kamakaka
- Department of MCD Biology, 1156 High Street, University of California, Santa Cruz, CA 95064 USA
| |
Collapse
|
3
|
The Genetic Mechanisms Underlying the Concerted Expression of the yellow and tan Genes in Complex Patterns on the Abdomen and Wings of Drosophila guttifera. Genes (Basel) 2023; 14:genes14020304. [PMID: 36833231 PMCID: PMC9957387 DOI: 10.3390/genes14020304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 01/12/2023] [Accepted: 01/21/2023] [Indexed: 01/26/2023] Open
Abstract
How complex morphological patterns form is an intriguing question in developmental biology. However, the mechanisms that generate complex patterns remain largely unknown. Here, we sought to identify the genetic mechanisms that regulate the tan (t) gene in a multi-spotted pigmentation pattern on the abdomen and wings of Drosophila guttifera. Previously, we showed that yellow (y) gene expression completely prefigures the abdominal and wing pigment patterns of this species. In the current study, we demonstrate that the t gene is co-expressed with the y gene in nearly identical patterns, both transcripts foreshadowing the adult abdominal and wing melanin spot patterns. We identified cis-regulatory modules (CRMs) of t, one of which drives reporter expression in six longitudinal rows of spots on the developing pupal abdomen, while the second CRM activates the reporter gene in a spotted wing pattern. Comparing the abdominal spot CRMs of y and t, we found a similar composition of putative transcription factor binding sites that are thought to regulate the complex expression patterns of both terminal pigmentation genes y and t. In contrast, the y and t wing spots appear to be regulated by distinct upstream factors. Our results suggest that the D. guttifera abdominal and wing melanin spot patterns have been established through the co-regulation of y and t, shedding light on how complex morphological traits may be regulated through the parallel coordination of downstream target genes.
Collapse
|
4
|
Chatfield-Reed K, Marno Jones K, Shah F, Chua G. Genetic-interaction screens uncover novel biological roles and regulators of transcription factors in fission yeast. G3 GENES|GENOMES|GENETICS 2022; 12:6655692. [PMID: 35924983 PMCID: PMC9434175 DOI: 10.1093/g3journal/jkac194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 07/20/2022] [Indexed: 12/05/2022]
Abstract
In Schizosaccharomyces pombe, systematic analyses of single transcription factor deletion or overexpression strains have made substantial advances in determining the biological roles and target genes of transcription factors, yet these characteristics are still relatively unknown for over a quarter of them. Moreover, the comprehensive list of proteins that regulate transcription factors remains incomplete. To further characterize Schizosaccharomyces pombe transcription factors, we performed synthetic sick/lethality and synthetic dosage lethality screens by synthetic genetic array. Examination of 2,672 transcription factor double deletion strains revealed a sick/lethality interaction frequency of 1.72%. Phenotypic analysis of these sick/lethality strains revealed potential cell cycle roles for several poorly characterized transcription factors, including SPBC56F2.05, SPCC320.03, and SPAC3C7.04. In addition, we examined synthetic dosage lethality interactions between 14 transcription factors and a miniarray of 279 deletion strains, observing a synthetic dosage lethality frequency of 4.99%, which consisted of known and novel transcription factor regulators. The miniarray contained deletions of genes that encode primarily posttranslational-modifying enzymes to identify putative upstream regulators of the transcription factor query strains. We discovered that ubiquitin ligase Ubr1 and its E2/E3-interacting protein, Mub1, degrade the glucose-responsive transcriptional repressor Scr1. Loss of ubr1+ or mub1+ increased Scr1 protein expression, which resulted in enhanced repression of flocculation through Scr1. The synthetic dosage lethality screen also captured interactions between Scr1 and 2 of its known repressors, Sds23 and Amk2, each affecting flocculation through Scr1 by influencing its nuclear localization. Our study demonstrates that sick/lethality and synthetic dosage lethality screens can be effective in uncovering novel functions and regulators of Schizosaccharomyces pombe transcription factors.
Collapse
Affiliation(s)
- Kate Chatfield-Reed
- Department of Biological Sciences, University of Calgary , Calgary, Alberta T2N 1N4, Canada
| | - Kurtis Marno Jones
- Department of Biological Sciences, University of Calgary , Calgary, Alberta T2N 1N4, Canada
| | - Farah Shah
- Department of Biological Sciences, University of Calgary , Calgary, Alberta T2N 1N4, Canada
| | | |
Collapse
|
5
|
Kuang Z, Ji Z, Boeke JD, Ji H. Dynamic motif occupancy (DynaMO) analysis identifies transcription factors and their binding sites driving dynamic biological processes. Nucleic Acids Res 2019; 46:e2. [PMID: 29325176 PMCID: PMC5758894 DOI: 10.1093/nar/gkx905] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2016] [Accepted: 09/26/2017] [Indexed: 01/02/2023] Open
Abstract
Biological processes are usually associated with genome-wide remodeling of transcription driven by transcription factors (TFs). Identifying key TFs and their spatiotemporal binding patterns are indispensable to understanding how dynamic processes are programmed. However, most methods are designed to predict TF binding sites only. We present a computational method, dynamic motif occupancy analysis (DynaMO), to infer important TFs and their spatiotemporal binding activities in dynamic biological processes using chromatin profiling data from multiple biological conditions such as time-course histone modification ChIP-seq data. In the first step, DynaMO predicts TF binding sites with a random forests approach. Next and uniquely, DynaMO infers dynamic TF binding activities at predicted binding sites using their local chromatin profiles from multiple biological conditions. Another landmark of DynaMO is to identify key TFs in a dynamic process using a clustering and enrichment analysis of dynamic TF binding patterns. Application of DynaMO to the yeast ultradian cycle, mouse circadian clock and human neural differentiation exhibits its accuracy and versatility. We anticipate DynaMO will be generally useful for elucidating transcriptional programs in dynamic processes.
Collapse
Affiliation(s)
- Zheng Kuang
- Institute for Systems Genetics, NYU Langone Medical Center, New York City, NY 10016, USA.,Department of Biochemistry and Molecular Pharmacology, NYU Langone Medical Center, New York City, NY 10016, USA.,Department of Biostatistics, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Zhicheng Ji
- Department of Biostatistics, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Jef D Boeke
- Institute for Systems Genetics, NYU Langone Medical Center, New York City, NY 10016, USA.,Department of Biochemistry and Molecular Pharmacology, NYU Langone Medical Center, New York City, NY 10016, USA
| | - Hongkai Ji
- Department of Biostatistics, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD 21205, USA
| |
Collapse
|
6
|
Inference and interrogation of a coregulatory network in the context of lipid accumulation in Yarrowia lipolytica. NPJ Syst Biol Appl 2017; 3:21. [PMID: 28955503 PMCID: PMC5554221 DOI: 10.1038/s41540-017-0024-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Revised: 07/07/2017] [Accepted: 07/13/2017] [Indexed: 12/14/2022] Open
Abstract
Complex phenotypes, such as lipid accumulation, result from cooperativity between regulators and the integration of multiscale information. However, the elucidation of such regulatory programs by experimental approaches may be challenging, particularly in context-specific conditions. In particular, we know very little about the regulators of lipid accumulation in the oleaginous yeast of industrial interest Yarrowia lipolytica. This lack of knowledge limits the development of this yeast as an industrial platform, due to the time-consuming and costly laboratory efforts required to design strains with the desired phenotypes. In this study, we aimed to identify context-specific regulators and mechanisms, to guide explorations of the regulation of lipid accumulation in Y. lipolytica. Using gene regulatory network inference, and considering the expression of 6539 genes over 26 time points from GSE35447 for biolipid production and a list of 151 transcription factors, we reconstructed a gene regulatory network comprising 111 transcription factors, 4451 target genes and 17048 regulatory interactions (YL-GRN-1) supported by evidence of protein-protein interactions. This study, based on network interrogation and wet laboratory validation (a) highlights the relevance of our proposed measure, the transcription factors influence, for identifying phases corresponding to changes in physiological state without prior knowledge (b) suggests new potential regulators and drivers of lipid accumulation and
Collapse
|
7
|
Bolotin-Fukuhara M. Thirty years of the HAP2/3/4/5 complex. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2017; 1860:543-559. [DOI: 10.1016/j.bbagrm.2016.10.011] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2016] [Revised: 10/24/2016] [Accepted: 10/25/2016] [Indexed: 01/22/2023]
|
8
|
Österlund T, Bordel S, Nielsen J. Controllability analysis of transcriptional regulatory networks reveals circular control patterns among transcription factors. Integr Biol (Camb) 2015; 7:560-8. [PMID: 25855217 DOI: 10.1039/c4ib00247d] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Transcriptional regulation is the most committed type of regulation in living cells where transcription factors (TFs) control the expression of their target genes and TF expression is controlled by other TFs forming complex transcriptional regulatory networks that can be highly interconnected. Here we analyze the topology and organization of nine transcriptional regulatory networks for E. coli, yeast, mouse and human, and we evaluate how the structure of these networks influences two of their key properties, namely controllability and stability. We calculate the controllability for each network as a measure of the organization and interconnectivity of the network. We find that the number of driver nodes nD needed to control the whole network is 64% of the TFs in the E. coli transcriptional regulatory network in contrast to only 17% for the yeast network, 4% for the mouse network and 8% for the human network. The high controllability (low number of drivers needed to control the system) in yeast, mouse and human is due to the presence of internal loops in their regulatory networks where the TFs regulate each other in a circular fashion. We refer to these internal loops as circular control motifs (CCM). The E. coli transcriptional regulatory network, which does not have any CCMs, shows a hierarchical structure of the transcriptional regulatory network in contrast to the eukaryal networks. The presence of CCMs also has influence on the stability of these networks, as the presence of cycles can be associated with potential unstable steady-states where even small changes in binding affinities can cause dramatic rearrangements of the state of the network.
Collapse
Affiliation(s)
- Tobias Österlund
- Novo Nordisk Foundation Center for Biosustainability, Department of Chemical and Biological Engineering, Chalmers University of Technology, SE-41296 Göteborg, Sweden.
| | | | | |
Collapse
|
9
|
Mahdevar G, Nowzari-Dalini A, Sadeghi M. Inferring gene correlation networks from transcription factor binding sites. Genes Genet Syst 2014; 88:301-9. [PMID: 24694393 DOI: 10.1266/ggs.88.301] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Gene expression is a highly regulated biological process that is fundamental to the existence of phenotypes of any living organism. The regulatory relations are usually modeled as a network; simply, every gene is modeled as a node and relations are shown as edges between two related genes. This paper presents a novel method for inferring correlation networks, networks constructed by connecting co-expressed genes, through predicting co-expression level from genes promoter's sequences. According to the results, this method works well on biological data and its outcome is comparable to the methods that use microarray as input. The method is written in C++ language and is available upon request from the corresponding author.
Collapse
Affiliation(s)
- Ghasem Mahdevar
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran
| | | | | |
Collapse
|
10
|
Liu G, Marras A, Nielsen J. The future of genome-scale modeling of yeast through integration of a transcriptional regulatory network. QUANTITATIVE BIOLOGY 2014. [DOI: 10.1007/s40484-014-0027-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
11
|
Abstract
The term “transcriptional network” refers to the mechanism(s) that underlies coordinated expression of genes, typically involving transcription factors (TFs) binding to the promoters of multiple genes, and individual genes controlled by multiple TFs. A multitude of studies in the last two decades have aimed to map and characterize transcriptional networks in the yeast Saccharomyces cerevisiae. We review the methodologies and accomplishments of these studies, as well as challenges we now face. For most yeast TFs, data have been collected on their sequence preferences, in vivo promoter occupancy, and gene expression profiles in deletion mutants. These systematic studies have led to the identification of new regulators of numerous cellular functions and shed light on the overall organization of yeast gene regulation. However, many yeast TFs appear to be inactive under standard laboratory growth conditions, and many of the available data were collected using techniques that have since been improved. Perhaps as a consequence, comprehensive and accurate mapping among TF sequence preferences, promoter binding, and gene expression remains an open challenge. We propose that the time is ripe for renewed systematic efforts toward a complete mapping of yeast transcriptional regulatory mechanisms.
Collapse
|
12
|
Costanzo MC, Engel SR, Wong ED, Lloyd P, Karra K, Chan ET, Weng S, Paskov KM, Roe GR, Binkley G, Hitz BC, Cherry JM. Saccharomyces genome database provides new regulation data. Nucleic Acids Res 2013; 42:D717-25. [PMID: 24265222 PMCID: PMC3965049 DOI: 10.1093/nar/gkt1158] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the community resource for genomic, gene and protein information about the budding yeast Saccharomyces cerevisiae, containing a variety of functional information about each yeast gene and gene product. We have recently added regulatory information to SGD and present it on a new tabbed section of the Locus Summary entitled 'Regulation'. We are compiling transcriptional regulator-target gene relationships, which are curated from the literature at SGD or imported, with permission, from the YEASTRACT database. For nearly every S. cerevisiae gene, the Regulation page displays a table of annotations showing the regulators of that gene, and a graphical visualization of its regulatory network. For genes whose products act as transcription factors, the Regulation page also shows a table of their target genes, accompanied by a Gene Ontology enrichment analysis of the biological processes in which those genes participate. We additionally synthesize information from the literature for each transcription factor in a free-text Regulation Summary, and provide other information relevant to its regulatory function, such as DNA binding site motifs and protein domains. All of the regulation data are available for querying, analysis and download via YeastMine, the InterMine-based data warehouse system in use at SGD.
Collapse
Affiliation(s)
- Maria C Costanzo
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Systematic genetic analysis of transcription factors to map the fission yeast transcription-regulatory network. Biochem Soc Trans 2013; 41:1696-700. [DOI: 10.1042/bst20130224] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Mapping transcriptional-regulatory networks requires the identification of target genes, binding specificities and signalling pathways of transcription factors. However, the characterization of each transcription factor sufficiently for deciphering such networks remains laborious. The recent availability of overexpression and deletion strains for almost all of the transcription factor genes in the fission yeast Schizosaccharomyces pombe provides a valuable resource to better investigate transcription factors using systematic genetics. In the present paper, I review and discuss the utility of these strain collections combined with transcriptome profiling and genome-wide chromatin immunoprecipitation to identify the target genes of transcription factors.
Collapse
|
14
|
Abstract
In Schizosaccharomyces pombe, over 90% of transcription factor genes are nonessential. Moreover, the majority do not exhibit significant growth defects under optimal conditions when deleted, complicating their functional characterization and target gene identification. Here, we systematically overexpressed 99 transcription factor genes with the nmt1 promoter and found that 64 transcription factor genes exhibited reduced fitness when ectopically expressed. Cell cycle defects were also often observed. We further investigated three uncharacterized transcription factor genes (toe1(+)-toe3(+)) that displayed cell elongation when overexpressed. Ectopic expression of toe1(+) resulted in a G1 delay while toe2(+) and toe3(+) overexpression produced an accumulation of septated cells with abnormalities in septum formation and nuclear segregation, respectively. Transcriptome profiling and ChIP-chip analysis of the transcription factor overexpression strains indicated that Toe1 activates target genes of the pyrimidine-salvage pathway, while Toe3 regulates target genes involved in polyamine synthesis. We also found that ectopic expression of the putative target genes SPBC3H7.05c, and dad5(+) and SPAC11D3.06 could recapitulate the cell cycle phenotypes of toe2(+) and toe3(+) overexpression, respectively. Furthermore, single deletions of the putative target genes urg2(+) and SPAC1399.04c, and SPBC3H7.05c, SPACUNK4.15, and rds1(+), could suppress the phenotypes of toe1(+) and toe2(+) overexpression, respectively. This study implicates new transcription factors and metabolism genes in cell cycle regulation and demonstrates the potential of systematic overexpression analysis to elucidate the function and target genes of transcription factors in S. pombe.
Collapse
|
15
|
Reimand J, Bader GD. Systematic analysis of somatic mutations in phosphorylation signaling predicts novel cancer drivers. Mol Syst Biol 2013; 9:637. [PMID: 23340843 PMCID: PMC3564258 DOI: 10.1038/msb.2012.68] [Citation(s) in RCA: 210] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2012] [Accepted: 12/06/2012] [Indexed: 12/20/2022] Open
Abstract
Large-scale cancer genome sequencing has uncovered thousands of gene mutations, but distinguishing tumor driver genes from functionally neutral passenger mutations is a major challenge. We analyzed 800 cancer genomes of eight types to find single-nucleotide variants (SNVs) that precisely target phosphorylation machinery, important in cancer development and drug targeting. Assuming that cancer-related biological systems involve unexpectedly frequent mutations, we used novel algorithms to identify genes with significant phosphorylation-associated SNVs (pSNVs), phospho-mutated pathways, kinase networks, drug targets, and clinically correlated signaling modules. We highlight increased survival of patients with TP53 pSNVs, hierarchically organized cancer kinase modules, a novel pSNV in EGFR, and an immune-related network of pSNVs that correlates with prolonged survival in ovarian cancer. Our findings include multiple actionable cancer gene candidates (FLNB, GRM1, POU2F1), protein complexes (HCF1, ASF1), and kinases (PRKCZ). This study demonstrates new ways of interpreting cancer genomes and presents new leads for cancer research.
Collapse
Affiliation(s)
- Jüri Reimand
- The Donnelly Centre, University of Toronto, Toronto, Canada
| | - Gary D Bader
- The Donnelly Centre, University of Toronto, Toronto, Canada
| |
Collapse
|
16
|
Functional analysis with a barcoder yeast gene overexpression system. G3-GENES GENOMES GENETICS 2012; 2:1279-89. [PMID: 23050238 PMCID: PMC3464120 DOI: 10.1534/g3.112.003400] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/13/2012] [Accepted: 08/22/2012] [Indexed: 01/26/2023]
Abstract
Systematic analysis of gene overexpression phenotypes provides an insight into gene function, enzyme targets, and biological pathways. Here, we describe a novel functional genomics platform that enables a highly parallel and systematic assessment of overexpression phenotypes in pooled cultures. First, we constructed a genome-level collection of ~5100 yeast barcoder strains, each of which carries a unique barcode, enabling pooled fitness assays with a barcode microarray or sequencing readout. Second, we constructed a yeast open reading frame (ORF) galactose-induced overexpression array by generating a genome-wide set of yeast transformants, each of which carries an individual plasmid-born and sequence-verified ORF derived from the Saccharomyces cerevisiae full-length EXpression-ready (FLEX) collection. We combined these collections genetically using synthetic genetic array methodology, generating ~5100 strains, each of which is barcoded and overexpresses a specific ORF, a set we termed “barFLEX.” Additional synthetic genetic array allows the barFLEX collection to be moved into different genetic backgrounds. As a proof-of-principle, we describe the properties of the barFLEX overexpression collection and its application in synthetic dosage lethality studies under different environmental conditions.
Collapse
|
17
|
Mahdevar G, Sadeghi M, Nowzari-Dalini A. Transcription factor binding sites detection by using alignment-based approach. J Theor Biol 2012; 304:96-102. [PMID: 22504445 DOI: 10.1016/j.jtbi.2012.03.039] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2011] [Revised: 03/27/2012] [Accepted: 03/29/2012] [Indexed: 11/25/2022]
Abstract
Gene expression is the main cause for the existence of various phenotypes. Through this procedure, the information stored in DNA rises to the phenotype. Essentially, gene expression is dependent upon the successful binding of transcription factors (TFs) - a specific type of proteins - to explicit positions in its upstream, TF binding sites (TFBSs). Unfortunately, finding these TFBSs is costly and laborious; therefore, discovering TFBSs computationally is a significant problem that many researches endeavor to solve. In this paper, a new TFBS discovery method is presented by considering known biological facts about TFBSs. The input to this method includes sequences with arbitrary lengths and the output comprises positions that tend to be TFBS. Through the application of previous methods along with a method that focuses on biological and simulated datasets, it is shown that this method achieves higher accuracy in discovering TFBSs.
Collapse
Affiliation(s)
- Ghasem Mahdevar
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.
| | | | | |
Collapse
|
18
|
Titsias MK, Honkela A, Lawrence ND, Rattray M. Identifying targets of multiple co-regulating transcription factors from expression time-series by Bayesian model comparison. BMC SYSTEMS BIOLOGY 2012; 6:53. [PMID: 22647244 PMCID: PMC3527261 DOI: 10.1186/1752-0509-6-53] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/20/2011] [Accepted: 05/30/2012] [Indexed: 02/02/2023]
Abstract
BACKGROUND Complete transcriptional regulatory network inference is a huge challenge because of the complexity of the network and sparsity of available data. One approach to make it more manageable is to focus on the inference of context-specific networks involving a few interacting transcription factors (TFs) and all of their target genes. RESULTS We present a computational framework for Bayesian statistical inference of target genes of multiple interacting TFs from high-throughput gene expression time-series data. We use ordinary differential equation models that describe transcription of target genes taking into account combinatorial regulation. The method consists of a training and a prediction phase. During the training phase we infer the unobserved TF protein concentrations on a subnetwork of approximately known regulatory structure. During the prediction phase we apply Bayesian model selection on a genome-wide scale and score all alternative regulatory structures for each target gene. We use our methodology to identify targets of five TFs regulating Drosophila melanogaster mesoderm development. We find that confident predicted links between TFs and targets are significantly enriched for supporting ChIP-chip binding events and annotated TF-gene interations. Our method statistically significantly outperforms existing alternatives. CONCLUSIONS Our results show that it is possible to infer regulatory links between multiple interacting TFs and their target genes even from a single relatively short time series and in presence of unmodelled confounders and unreliable prior knowledge on training network connectivity. Introducing data from several different experimental perturbations significantly increases the accuracy.
Collapse
Affiliation(s)
- Michalis K Titsias
- The Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK.
| | | | | | | |
Collapse
|
19
|
Wu M, Chan C. Learning transcriptional regulation on a genome scale: a theoretical analysis based on gene expression data. Brief Bioinform 2012; 13:150-61. [PMID: 21622543 PMCID: PMC3294238 DOI: 10.1093/bib/bbr029] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2011] [Revised: 04/23/2011] [Indexed: 12/17/2022] Open
Abstract
The recent advent of high-throughput microarray data has enabled the global analysis of the transcriptome, driving the development and application of computational approaches to study transcriptional regulation on the genome scale, by reconstructing in silico the regulatory interactions of the gene network. Although there are many in-depth reviews of such 'reverse-engineering' methodologies, most have focused on the practical aspect of data mining, and few on the biological problem and the biological relevance of the methodology. Therefore, in this review, from a biological perspective, we used a set of yeast microarray data as a working example, to evaluate the fundamental assumptions implicit in associating transcription factor (TF)-target gene expression levels and estimating TFs' activity, and further explore cooperative models. Finally we confirm that the detailed transcription mechanism is overly-complex for expression data alone to reveal, nevertheless, future network reconstruction studies could benefit from the incorporation of context-specific information, the modeling of multiple layers of regulation (e.g. micro-RNA), or the development of approaches for context-dependent analysis, to uncover the mechanisms of gene regulation.
Collapse
Affiliation(s)
- Ming Wu
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | | |
Collapse
|
20
|
McCord RP, Zhou VW, Yuh T, Bulyk ML. Distant cis-regulatory elements in human skeletal muscle differentiation. Genomics 2011; 98:401-11. [PMID: 21907276 DOI: 10.1016/j.ygeno.2011.08.003] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2011] [Accepted: 08/17/2011] [Indexed: 10/17/2022]
Abstract
Identifying gene regulatory elements and their target genes in human cells remains a significant challenge. Despite increasing evidence of physical interactions between distant regulatory elements and gene promoters in mammalian cells, many studies consider only promoter-proximal regulatory regions. We identify putative cis-regulatory modules (CRMs) in human skeletal muscle differentiation by combining myogenic TF binding data before and after differentiation with histone modification data in myoblasts. CRMs that are distant (>20 kb) from muscle gene promoters are common and are more likely than proximal promoter regions to show differentiation-specific changes in myogenic TF binding. We find that two of these distant CRMs, known to activate transcription in differentiating myoblasts, interact physically with gene promoters (PDLIM3 and ACTA1) during differentiation. Our results highlight the importance of considering distal CRMs in investigations of mammalian gene regulation and support the hypothesis that distant CRM-promoter looping contacts are a general mechanism of gene regulation.
Collapse
Affiliation(s)
- Rachel Patton McCord
- Division of Genetics, Department of Medicine, Brigham & Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | | | | | | |
Collapse
|
21
|
Li R, Ackerman WE, Summerfield TL, Yu L, Gulati P, Zhang J, Huang K, Romero R, Kniss DA. Inflammatory gene regulatory networks in amnion cells following cytokine stimulation: translational systems approach to modeling human parturition. PLoS One 2011; 6:e20560. [PMID: 21655103 PMCID: PMC3107214 DOI: 10.1371/journal.pone.0020560] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Accepted: 05/05/2011] [Indexed: 11/18/2022] Open
Abstract
A majority of the studies examining the molecular regulation of human labor have been conducted using single gene approaches. While the technology to produce multi-dimensional datasets is readily available, the means for facile analysis of such data are limited. The objective of this study was to develop a systems approach to infer regulatory mechanisms governing global gene expression in cytokine-challenged cells in vitro, and to apply these methods to predict gene regulatory networks (GRNs) in intrauterine tissues during term parturition. To this end, microarray analysis was applied to human amnion mesenchymal cells (AMCs) stimulated with interleukin-1β, and differentially expressed transcripts were subjected to hierarchical clustering, temporal expression profiling, and motif enrichment analysis, from which a GRN was constructed. These methods were then applied to fetal membrane specimens collected in the absence or presence of spontaneous term labor. Analysis of cytokine-responsive genes in AMCs revealed a sterile immune response signature, with promoters enriched in response elements for several inflammation-associated transcription factors. In comparison to the fetal membrane dataset, there were 34 genes commonly upregulated, many of which were part of an acute inflammation gene expression signature. Binding motifs for nuclear factor-κB were prominent in the gene interaction and regulatory networks for both datasets; however, we found little evidence to support the utilization of pathogen-associated molecular pattern (PAMP) signaling. The tissue specimens were also enriched for transcripts governed by hypoxia-inducible factor. The approach presented here provides an uncomplicated means to infer global relationships among gene clusters involved in cellular responses to labor-associated signals.
Collapse
Affiliation(s)
- Ruth Li
- Division of Maternal-Fetal Medicine and Laboratory of Perinatal Research,
Department of Obstetrics and Gynecology, The Ohio State University, Columbus,
Ohio, United States of America
| | - William E. Ackerman
- Division of Maternal-Fetal Medicine and Laboratory of Perinatal Research,
Department of Obstetrics and Gynecology, The Ohio State University, Columbus,
Ohio, United States of America
| | - Taryn L. Summerfield
- Division of Maternal-Fetal Medicine and Laboratory of Perinatal Research,
Department of Obstetrics and Gynecology, The Ohio State University, Columbus,
Ohio, United States of America
| | - Lianbo Yu
- Center for Biostatistics, The Ohio State University, Columbus, Ohio,
United States of America
| | - Parul Gulati
- Center for Biostatistics, The Ohio State University, Columbus, Ohio,
United States of America
| | - Jie Zhang
- Department of Biomedical Informatics, The Ohio State University,
Columbus, Ohio, United States of America
| | - Kun Huang
- Department of Biomedical Informatics, The Ohio State University,
Columbus, Ohio, United States of America
| | - Roberto Romero
- Perinatology Research Branch, Intramural Division, Eunice Kennedy Shriver
National Institute of Child Health and Human Development, National Institutes of
Health, Department of Health and Human Services, Bethesda, Maryland, United
States of America
- Hutzel Women's Hospital, Detroit, Michigan, United States of
America
| | - Douglas A. Kniss
- Division of Maternal-Fetal Medicine and Laboratory of Perinatal Research,
Department of Obstetrics and Gynecology, The Ohio State University, Columbus,
Ohio, United States of America
- Department of Biomedical Engineering, The Ohio State University,
Columbus, Ohio, United States of America
- * E-mail:
| |
Collapse
|
22
|
Josephides C, Moses AM. Modeling the evolution of a classic genetic switch. BMC SYSTEMS BIOLOGY 2011; 5:24. [PMID: 21294912 PMCID: PMC3048525 DOI: 10.1186/1752-0509-5-24] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2010] [Accepted: 02/05/2011] [Indexed: 11/10/2022]
Abstract
Background The regulatory network underlying the yeast galactose-use pathway has emerged as a model system for the study of regulatory network evolution. Evidence has recently been provided for adaptive evolution in this network following a whole genome duplication event. An ancestral gene encoding a bi-functional galactokinase and co-inducer protein molecule has become subfunctionalized as paralogous genes (GAL1 and GAL3) in Saccharomyces cerevisiae, with most fitness gains being attributable to changes in cis-regulatory elements. However, the quantitative functional implications of the evolutionary changes in this regulatory network remain unexplored. Results We develop a modeling framework to examine the evolution of the GAL regulatory network. This enables us to translate molecular changes in the regulatory network to changes in quantitative network function. We computationally reconstruct an inferred ancestral version of the network and trace the evolutionary paths in the lineage leading to S. cerevisiae. We explore the evolutionary landscape of possible regulatory networks and find that the operation of intermediate networks leading to S. cerevisiae differs substantially depending on the order in which evolutionary changes accumulate; in particular, we systematically explore evolutionary paths and find that some network features cannot be optimized simultaneously. Conclusions We find that a computational modeling approach can be used to analyze the evolution of a well-studied regulatory network. Our results are consistent with several experimental studies of the evolutionary of the GAL regulatory network, including increased fitness in Saccharomyces due to duplication and adaptive regulatory divergence. The conceptual and computational tools that we have developed may be applicable in further studies of regulatory network evolution.
Collapse
Affiliation(s)
- Christos Josephides
- Department of Cell & Systems Biology, University of Toronto, 25 Willcocks Street, Toronto, ON M5 S 3B2, Canada
| | | |
Collapse
|
23
|
Chowdhury S, Lloyd-Price J, Smolander OP, Baici WCV, Hughes TR, Yli-Harja O, Chua G, Ribeiro AS. Information propagation within the Genetic Network of Saccharomyces cerevisiae. BMC SYSTEMS BIOLOGY 2010; 4:143. [PMID: 20977725 PMCID: PMC2975643 DOI: 10.1186/1752-0509-4-143] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/06/2010] [Accepted: 10/26/2010] [Indexed: 12/21/2022]
Abstract
Background A gene network's capacity to process information, so as to bind past events to future actions, depends on its structure and logic. From previous and new microarray measurements in Saccharomyces cerevisiae following gene deletions and overexpressions, we identify a core gene regulatory network (GRN) of functional interactions between 328 genes and the transfer functions of each gene. Inferred connections are verified by gene enrichment. Results We find that this core network has a generalized clustering coefficient that is much higher than chance. The inferred Boolean transfer functions have a mean p-bias of 0.41, and thus similar amounts of activation and repression interactions. However, the distribution of p-biases differs significantly from what is expected by chance that, along with the high mean connectivity, is found to cause the core GRN of S. cerevisiae's to have an overall sensitivity similar to critical Boolean networks. In agreement, we find that the amount of information propagated between nodes in finite time series is much higher in the inferred core GRN of S. cerevisiae than what is expected by chance. Conclusions We suggest that S. cerevisiae is likely to have evolved a core GRN with enhanced information propagation among its genes.
Collapse
Affiliation(s)
- Sharif Chowdhury
- Laboratory of Biosystem Dynamics, Computational Systems Biology Research Group, Tampere University of Technology, Tampere, Finland
| | | | | | | | | | | | | | | |
Collapse
|
24
|
Stark C, Su TC, Breitkreutz A, Lourenco P, Dahabieh M, Breitkreutz BJ, Tyers M, Sadowski I. PhosphoGRID: a database of experimentally verified in vivo protein phosphorylation sites from the budding yeast Saccharomyces cerevisiae. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2010; 2010:bap026. [PMID: 20428315 PMCID: PMC2860897 DOI: 10.1093/database/bap026] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Revised: 12/07/2009] [Accepted: 12/23/2009] [Indexed: 12/17/2022]
Abstract
Protein phosphorylation plays a central role in cellular regulation. Recent proteomics strategies for identifying phosphopeptides have been developed using the model organism Saccharomyces cerevisiae, and consequently, when combined with studies of individual gene products, the number of reported specific phosphorylation sites for this organism has expanded enormously. In order to systematically document and integrate these various data types, we have developed a database of experimentally verified in vivo phosphorylation sites curated from the S. cerevisiae primary literature. PhosphoGRID (www.phosphogrid.org) records the positions of over 5000 specific phosphorylated residues on 1495 gene products. Nearly 900 phosphorylated residues are reported from detailed studies of individual proteins; these in vivo phosphorylation sites are documented by a hierarchy of experimental evidence codes. Where available for specific sites, we have also noted the relevant protein kinases and/or phosphatases, the specific condition(s) under which phosphorylation occurs, and the effect(s) that phosphorylation has on protein function. The unique features of PhosphoGRID that assign both function and specific physiological conditions to each phosphorylated residue will provide a valuable benchmark for proteome-level studies and will facilitate bioinformatic analysis of cellular signal transduction networks. Database URL: http://phosphogrid.org/
Collapse
Affiliation(s)
- Chris Stark
- Centre for Systems Biology, Samuel Lunenfeld Research Institute, 600 University Avenue, Toronto, Ontario M5G 1X5, Canada
| | | | | | | | | | | | | | | |
Collapse
|
25
|
Abstract
Biological systems can be modeled as networks of interacting components across multiple scales. A central problem in computational systems biology is to identify those critical components and the rules that define their interactions and give rise to the emergent behavior of a host response. In this paper we will discuss two fundamental problems related to the construction of transcription factor networks and the identification of networks of functional modules describing disease progression. We focus on inflammation as a key physiological response of clinical and translational importance.
Collapse
Affiliation(s)
- P.T. Foteinou
- Biomedical Engineering Department, Rutgers University, 599 Taylor Road Piscataway, NJ 08854
| | - E. Yang
- Biomedical Engineering Department, Rutgers University, 599 Taylor Road Piscataway, NJ 08854
| | - I. P. Androulakis
- Biomedical Engineering Department, Rutgers University, 599 Taylor Road Piscataway, NJ 08854
- Chemical & Biochemical Engineering Department, Rutgers University, 98 Brett Road, Piscataway, NJ 08854
| |
Collapse
|
26
|
Abstract
One central problem in biology is to understand how gene expression is regulated under different conditions. Microarray gene expression data and other high throughput data have made it possible to dissect transcriptional regulatory networks at the genomics level. Owing to the very large number of genes that need to be studied, the relatively small number of data sets available, the noise in the data and the different natures of the distinct data types, network inference presents great challenges. In this article, we review statistical and computational methods that have been developed in the last decade in response to genomics data for inferring transcriptional regulatory networks.
Collapse
Affiliation(s)
- Ning Sun
- Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT 06520, USA.
| | | |
Collapse
|
27
|
Marco A, Konikoff C, Karr TL, Kumar S. Relationship between gene co-expression and sharing of transcription factor binding sites in Drosophila melanogaster. Bioinformatics 2009; 25:2473-7. [PMID: 19633094 PMCID: PMC2752616 DOI: 10.1093/bioinformatics/btp462] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2009] [Revised: 07/03/2009] [Accepted: 07/22/2009] [Indexed: 01/16/2023] Open
Abstract
MOTIVATION In functional genomics, it is frequently useful to correlate expression levels of genes to identify transcription factor binding sites (TFBS) via the presence of common sequence motifs. The underlying assumption is that co-expressed genes are more likely to contain shared TFBS and, thus, TFBS can be identified computationally. Indeed, gene pairs with a very high expression correlation show a significant excess of shared binding sites in yeast. We have tested this assumption in a more complex organism, Drosophila melanogaster, by using experimentally determined TFBS and microarray expression data. We have also examined the reverse relationship between the expression correlation and the extent of TFBS sharing. RESULTS Pairs of genes with shared TFBS show, on average, a higher degree of co-expression than those with no common TFBS in Drosophila. However, the reverse does not hold true: gene pairs with high expression correlations do not share significantly larger numbers of TFBS. Exception to this observation exists when comparing expression of genes from the earliest stages of embryonic development. Interestingly, semantic similarity between gene annotations (Biological Process) is much better associated with TFBS sharing, as compared to the expression correlation. We discuss these results in light of reverse engineering approaches to computationally predict regulatory sequences by using comparative genomics.
Collapse
Affiliation(s)
- Antonio Marco
- Center for Evolutionary Functional Genomics, The Biodesign Institute, Arizona State University, Tempe, AZ 85287-5301, USA.
| | | | | | | |
Collapse
|
28
|
Computational approaches to the integration of gene expression, ChIP-chip and sequence data in the inference of gene regulatory networks. Semin Cell Dev Biol 2009; 20:863-8. [PMID: 19682595 DOI: 10.1016/j.semcdb.2009.08.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2009] [Accepted: 08/03/2009] [Indexed: 01/12/2023]
Abstract
A major challenge in systems biology is the ability to model complex regulatory interactions, such as gene regulatory networks, and a number of computational approaches have been developed over recent years to address this challenge. This paper reviews a number of these approaches, with a focus on probabilistic graphical models and the integration of diverse data sets, such as gene expression and transcription factor binding site location and activity.
Collapse
|
29
|
Janky R, Helden JV, Babu MM. Investigating transcriptional regulation: from analysis of complex networks to discovery of cis-regulatory elements. Methods 2009; 48:277-86. [PMID: 19450688 DOI: 10.1016/j.ymeth.2009.04.022] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2009] [Revised: 04/17/2009] [Accepted: 04/18/2009] [Indexed: 10/20/2022] Open
Abstract
Regulation of gene expression at the transcriptional level is a fundamental mechanism that is well conserved in all cellular systems. Due to advances in large-scale experimental analyses, we now have a wealth of information on gene regulation such as mRNA expression level across multiple conditions, genome-wide location data of transcription factors and data on transcription factor binding sites. This knowledge can be used to reconstruct transcriptional regulatory networks. Such networks are usually represented as directed graphs where regulatory interactions are depicted as directed edges from the transcription factor nodes to the target gene nodes. This abstract representation allows us to apply graph theory to study transcriptional regulation at global and local levels, to predict regulatory motifs and regulatory modules such as regulons and to compare the regulatory network of different genomes. Here we review some of the available computational methodologies for studying transcriptional regulatory networks as well as their interpretation.
Collapse
Affiliation(s)
- Rekin's Janky
- Structural Studies Division, Medical Research Council - Laboratory of Molecular Biology, Hills Road, Cambridge CB20QH, United Kingdom.
| | | | | |
Collapse
|
30
|
Chua G. Identification of transcription factor targets by phenotypic activation and microarray expression profiling in yeast. Methods Mol Biol 2009; 548:19-35. [PMID: 19521817 DOI: 10.1007/978-1-59745-540-4_2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
A major obstacle to identify physiological transcriptional targets is that the conditions that induce the majority of yeast transcription factors (TFs) are unknown. Microarray analyses of deletion mutants indicate that most TFs are inactive under standard growth conditions. To overcome this, we screened an ordered array of yeast open reading frames (ORFs) to identify TFs that confer reduced fitness upon overexpression, suggesting that overexpression results in an activated state (phenotypic activation). Approximately one-third of all yeast TFs exhibited this phenotype. Here, we describe in detail our methodology to characterize these TF overexpression strains including microarray expression profiling, data analysis, and motif searching. Our analyses show that in many cases, the differentially regulated genes correspond to physiological functions and known targets of well-characterized TFs. The expected binding sites of several TFs were also identified in the promoters of these genes. Moreover, novel DNA-binding sequences and putative targets were identified for less-characterized TFs. These results demonstrate that phenotypic activation is an effective approach to rapidly characterize TFs on a large scale, which should also be feasible in other organisms.
Collapse
Affiliation(s)
- Gordon Chua
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada T2N 1N4.
| |
Collapse
|
31
|
Hillenmeyer ME, Fung E, Wildenhain J, Pierce SE, Hoon S, Lee W, Proctor M, St.Onge RP, Tyers M, Koller D, Altman RB, Davis RW, Nislow C, Giaever G. The chemical genomic portrait of yeast: uncovering a phenotype for all genes. Science 2008; 320:362-5. [PMID: 18420932 PMCID: PMC2794835 DOI: 10.1126/science.1150021] [Citation(s) in RCA: 748] [Impact Index Per Article: 44.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Genetics aims to understand the relation between genotype and phenotype. However, because complete deletion of most yeast genes ( approximately 80%) has no obvious phenotypic consequence in rich medium, it is difficult to study their functions. To uncover phenotypes for this nonessential fraction of the genome, we performed 1144 chemical genomic assays on the yeast whole-genome heterozygous and homozygous deletion collections and quantified the growth fitness of each deletion strain in the presence of chemical or environmental stress conditions. We found that 97% of gene deletions exhibited a measurable growth phenotype, suggesting that nearly all genes are essential for optimal growth in at least one condition.
Collapse
Affiliation(s)
- Maureen E. Hillenmeyer
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA 94304, USA
- Program in Biomedical Informatics, StanfordUniversity,PaloAlto,CA94305,USA
| | - Eula Fung
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA 94304, USA
| | - Jan Wildenhain
- SamuelLunenfeld Research Institute, Mount Sinai Hospital, University of Toronto, Toronto, Ontario M5G1X5, Canada
| | - Sarah E. Pierce
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA 94304, USA
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Shawn Hoon
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA 94304, USA
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - William Lee
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA 94304, USA
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Michael Proctor
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA 94304, USA
| | - Robert P. St.Onge
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA 94304, USA
| | - Mike Tyers
- SamuelLunenfeld Research Institute, Mount Sinai Hospital, University of Toronto, Toronto, Ontario M5G1X5, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S1A8, Canada
| | - Daphne Koller
- Department of Computer Science, Stanford University, Palo Alto, CA 94305, USA
| | - Russ B. Altman
- Program in Biomedical Informatics, StanfordUniversity,PaloAlto,CA94305,USA
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Ronald W. Davis
- Program in Biomedical Informatics, StanfordUniversity,PaloAlto,CA94305,USA
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Corey Nislow
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S1A8, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5S3E1, Canada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S3E1, Canada
| | - Guri Giaever
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S1A8, Canada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S3E1, Canada
- Department of Pharmaceutical Sciences, University of Toronto, Toronto, Ontario M5S3M2, Canada
| |
Collapse
|
32
|
Toenjes M, Schueler M, Hammer S, Pape UJ, Fischer JJ, Berger F, Vingron M, Sperling S. Prediction of cardiac transcription networks based on molecular data and complex clinical phenotypes. MOLECULAR BIOSYSTEMS 2008; 4:589-98. [PMID: 18493657 DOI: 10.1039/b800207j] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
We present an integrative approach combining sophisticated techniques to construct cardiac gene regulatory networks based on correlated gene expression and optimized prediction of transcription factor binding sites. We analyze transcription levels of a comprehensive set of 42 genes in biopsies derived from hearts of a cohort of 190 patients as well as healthy individuals. To precisely describe the variety of heart malformations observed in the patients, we delineate a detailed phenotype ontology that allows description of observed clinical characteristics as well as the definition of informative meta-phenotypes. Based on the expression data obtained by real-time PCR we identify specific disease associated transcription profiles by applying linear models. Furthermore, genes that show highly correlated expression patterns are depicted. By predicting binding sites on promoter settings optimized using a cardiac specific chromatin immunoprecipitation data set, we reveal regulatory dependencies. Several of the found interactions have been previously described in literature, demonstrating that the approach is a versatile tool to predict regulatory networks.
Collapse
Affiliation(s)
- Martje Toenjes
- Department of Vertebrate Genomics, Max Planck Institute for Molecular Genetics, Ihnestr. 73, 14195 Berlin, Germany
| | | | | | | | | | | | | | | |
Collapse
|
33
|
Kiesel J, Miller C, Abu-Amer Y, Aurora R. Systems level analysis of osteoclastogenesis reveals intrinsic and extrinsic regulatory interactions. Dev Dyn 2007; 236:2181-97. [PMID: 17584858 DOI: 10.1002/dvdy.21206] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Osteoclasts are bone-resorbing cells derived from the myeloid lineage that play a central role in bone remodeling and inflammatory bone erosion diseases. The receptor activator of NF-kappaB ligand (RANKL) produced by osteoblasts and activated immune cells initiates the development of osteoclasts in the bone marrow. Using time series gene expression data, the intrinsic processes and the extrinsic factors that control osteoclastogenesis were identified. The gene expression profiles display distinct commitment and differentiation phases. Analysis of the time course revealed several mechanistic details, including the complex role of cholesterol in osteoclast development. Epistatic interactions and the coordination between cellular processes that regulate development were inferred from the coexpression network. The coexpression network indicated that osteoclasts induce angiogenesis and recruit T-cells to the site of osteoclastogenesis early in the commitment phase. The resulting model provides an essential framework for a better understanding of the epigenetic program of osteoclastogenesis.
Collapse
Affiliation(s)
- Jennifer Kiesel
- Department of Molecular Microbiology and Immunology, Saint Louis University School of Medicine, Saint Louis, Missouri, USA
| | | | | | | |
Collapse
|
34
|
Alazard R, Mourey L, Ebel C, Konarev PV, Petoukhov MV, Svergun DI, Erard M. Fine-tuning of intrinsic N-Oct-3 POU domain allostery by regulatory DNA targets. Nucleic Acids Res 2007; 35:4420-32. [PMID: 17576670 PMCID: PMC1935007 DOI: 10.1093/nar/gkm453] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The 'POU' (acronym of Pit-1, Oct-1, Unc-86) family of transcription factors share a common DNA-binding domain of approximately 160 residues, comprising so-called 'POUs' and 'POUh' sub-domains connected by a flexible linker. The importance of POU proteins as developmental regulators and tumor-promoting agents is due to linker flexibility, which allows them to adapt to a considerable variety of DNA targets. However, because of this flexibility, it has not been possible to determine the Oct-1/Pit-1 linker structure in crystallographic POU/DNA complexes. We have previously shown that the neuronal POU protein N-Oct-3 linker contains a structured region. Here, we have used a combination of hydrodynamic methods, DNA footprinting experiments, molecular modeling and small angle X-ray scattering to (i) structurally interpret the N-Oct-3-binding site within the HLA DRalpha gene promoter and deduce from this a novel POU domain allosteric conformation and (ii) analyze the molecular mechanisms involved in conformational transitions. We conclude that there might exist a continuum running from free to 'pre-bound' N-Oct-3 POU conformations and that regulatory DNA regions likely select pre-existing conformers, in addition to molding the appropriate DBD structure. Finally, we suggest that a specific pair of glycine residues in the linker might act as a major conformational switch.
Collapse
Affiliation(s)
- Robert Alazard
- Institut de Pharmacologie et de Biologie Structurale, 205 Route de Narbonne, 31077 Toulouse, Institut de Biologie Structurale, UMR 5075 CEA-CNRS-UJF, 41 rue Jules Horowitz, 38027 Grenoble, France and European Molecular Biology Laboratory, Hamburg Outstation, EMBL c/o DESY, D-22603 Hamburg, Germany and Institute of Crystallography, Russian Academy of Sciences, Leninsky pr. 59, 117333 Moscow, Russia
| | - Lionel Mourey
- Institut de Pharmacologie et de Biologie Structurale, 205 Route de Narbonne, 31077 Toulouse, Institut de Biologie Structurale, UMR 5075 CEA-CNRS-UJF, 41 rue Jules Horowitz, 38027 Grenoble, France and European Molecular Biology Laboratory, Hamburg Outstation, EMBL c/o DESY, D-22603 Hamburg, Germany and Institute of Crystallography, Russian Academy of Sciences, Leninsky pr. 59, 117333 Moscow, Russia
| | - Christine Ebel
- Institut de Pharmacologie et de Biologie Structurale, 205 Route de Narbonne, 31077 Toulouse, Institut de Biologie Structurale, UMR 5075 CEA-CNRS-UJF, 41 rue Jules Horowitz, 38027 Grenoble, France and European Molecular Biology Laboratory, Hamburg Outstation, EMBL c/o DESY, D-22603 Hamburg, Germany and Institute of Crystallography, Russian Academy of Sciences, Leninsky pr. 59, 117333 Moscow, Russia
| | - Peter V. Konarev
- Institut de Pharmacologie et de Biologie Structurale, 205 Route de Narbonne, 31077 Toulouse, Institut de Biologie Structurale, UMR 5075 CEA-CNRS-UJF, 41 rue Jules Horowitz, 38027 Grenoble, France and European Molecular Biology Laboratory, Hamburg Outstation, EMBL c/o DESY, D-22603 Hamburg, Germany and Institute of Crystallography, Russian Academy of Sciences, Leninsky pr. 59, 117333 Moscow, Russia
| | - Maxim V. Petoukhov
- Institut de Pharmacologie et de Biologie Structurale, 205 Route de Narbonne, 31077 Toulouse, Institut de Biologie Structurale, UMR 5075 CEA-CNRS-UJF, 41 rue Jules Horowitz, 38027 Grenoble, France and European Molecular Biology Laboratory, Hamburg Outstation, EMBL c/o DESY, D-22603 Hamburg, Germany and Institute of Crystallography, Russian Academy of Sciences, Leninsky pr. 59, 117333 Moscow, Russia
| | - Dmitri I. Svergun
- Institut de Pharmacologie et de Biologie Structurale, 205 Route de Narbonne, 31077 Toulouse, Institut de Biologie Structurale, UMR 5075 CEA-CNRS-UJF, 41 rue Jules Horowitz, 38027 Grenoble, France and European Molecular Biology Laboratory, Hamburg Outstation, EMBL c/o DESY, D-22603 Hamburg, Germany and Institute of Crystallography, Russian Academy of Sciences, Leninsky pr. 59, 117333 Moscow, Russia
| | - Monique Erard
- Institut de Pharmacologie et de Biologie Structurale, 205 Route de Narbonne, 31077 Toulouse, Institut de Biologie Structurale, UMR 5075 CEA-CNRS-UJF, 41 rue Jules Horowitz, 38027 Grenoble, France and European Molecular Biology Laboratory, Hamburg Outstation, EMBL c/o DESY, D-22603 Hamburg, Germany and Institute of Crystallography, Russian Academy of Sciences, Leninsky pr. 59, 117333 Moscow, Russia
- *To whom correspondence should be addressed. +33 (0) 562175496+33 (0) 562175994
| |
Collapse
|
35
|
Walker E, Ohishi M, Davey RE, Zhang W, Cassar PA, Tanaka TS, Der SD, Morris Q, Hughes TR, Zandstra PW, Stanford WL. Prediction and Testing of Novel Transcriptional Networks Regulating Embryonic Stem Cell Self-Renewal and Commitment. Cell Stem Cell 2007; 1:71-86. [PMID: 18371337 DOI: 10.1016/j.stem.2007.04.002] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2006] [Revised: 03/16/2007] [Accepted: 04/19/2007] [Indexed: 01/07/2023]
Affiliation(s)
- Emily Walker
- Institute of Biomaterials and Biomedical Engineering, University of Toronto, 164 College Street, Toronto, ON, M5S 3G9, Canada
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Wang J. A new framework for identifying combinatorial regulation of transcription factors: a case study of the yeast cell cycle. J Biomed Inform 2007; 40:707-25. [PMID: 17418646 DOI: 10.1016/j.jbi.2007.02.003] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2006] [Revised: 12/23/2006] [Accepted: 02/27/2007] [Indexed: 01/24/2023]
Abstract
By integrating heterogeneous functional genomic datasets, we have developed a new framework for detecting combinatorial control of gene expression, which includes estimating transcription factor activities using a singular value decomposition method and reducing high-dimensional input gene space by considering genomic properties of gene clusters. The prediction of cooperative gene regulation is accomplished by either Gaussian Graphical Models or Pairwise Mixed Graphical Models. The proposed framework was tested on yeast cell cycle datasets: (1) 54 known yeast cell cycle genes with 9 cell cycle regulators and (2) 676 putative yeast cell cycle genes with 9 cell cycle regulators. The new framework gave promising results on inferring TF-TF and TF-gene interactions. It also revealed several interesting mechanisms such as negatively correlated protein-protein interactions and low affinity protein-DNA interactions that may be important during the yeast cell cycle. The new framework may easily be extended to study other higher eukaryotes.
Collapse
Affiliation(s)
- Junbai Wang
- Department of Biological Sciences, Columbia University, 1212, Amsterdam Avenue, MC 2442, New York, NY 10027, USA.
| |
Collapse
|
37
|
Abstract
MOTIVATION Global gene expression measurements as obtained, for example, in microarray experiments can provide important clues to the underlying transcriptional control mechanisms and network structure of a biological cell. In the absence of a detailed understanding of this gene regulation, current attempts at classification of expression data rely on clustering and pattern recognition techniques employing ad-hoc similarity criteria. To improve this situation, a better understanding of the expected relationships between expression profiles of genes associated by biological function is required. RESULTS It is shown that perturbation expansions familiar from biological systems theory make precise predictions for the types of relationships to be expected for expression profiles of biologically associated genes, even if the underlying biological factors responsible for this association are not known. Classification criteria are derived, most of which are not usually employed in clustering algorithms. The approach is illustrated by using the AtGenExpress Arabidopsis thaliana developmental expression map.
Collapse
Affiliation(s)
- Andreas W Schreiber
- Australian Centre for Plant Functional Genomics, Hartley Grove, PMB 1 Waite Campus, The University of Adelaide Glen Osmond 5064, Australia.
| | | |
Collapse
|
38
|
Moorman C, Sun LV, Wang J, de Wit E, Talhout W, Ward LD, Greil F, Lu XJ, White KP, Bussemaker HJ, van Steensel B. Hotspots of transcription factor colocalization in the genome of Drosophila melanogaster. Proc Natl Acad Sci U S A 2006; 103:12027-32. [PMID: 16880385 PMCID: PMC1567692 DOI: 10.1073/pnas.0605003103] [Citation(s) in RCA: 166] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2006] [Indexed: 11/18/2022] Open
Abstract
Regulation of gene expression is a highly complex process that requires the concerted action of many proteins, including sequence-specific transcription factors, cofactors, and chromatin proteins. In higher eukaryotes, the interplay between these proteins and their interactions with the genome still is poorly understood. We systematically mapped the in vivo binding sites of seven transcription factors with diverse physiological functions, five cofactors, and two heterochromatin proteins at approximately 1-kb resolution in a 2.9 Mb region of the Drosophila melanogaster genome. Surprisingly, all tested transcription factors and cofactors show strongly overlapping localization patterns, and the genome contains many "hotspots" that are targeted by all of these proteins. Several control experiments show that the strong overlap is not an artifact of the techniques used. Colocalization hotspots are 1-5 kb in size, spaced on average by approximately 50 kb, and preferentially located in regions of active transcription. We provide evidence that protein-protein interactions play a role in the hotspot association of some transcription factors. Colocalization hotspots constitute a previously uncharacterized type of feature in the genome of Drosophila, and our results provide insights into the general targeting mechanisms of transcription regulators in a higher eukaryote.
Collapse
Affiliation(s)
- Celine Moorman
- *Department of Molecular Biology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| | - Ling V. Sun
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06520; and
| | - Junbai Wang
- Department of Biological Sciences and Center for Computational Biology and Bioinformatics, Columbia University, New York, NY 10027
| | - Elzo de Wit
- *Department of Molecular Biology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| | - Wendy Talhout
- *Department of Molecular Biology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| | - Lucas D. Ward
- Department of Biological Sciences and Center for Computational Biology and Bioinformatics, Columbia University, New York, NY 10027
| | - Frauke Greil
- *Department of Molecular Biology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| | - Xiang-Jun Lu
- Department of Biological Sciences and Center for Computational Biology and Bioinformatics, Columbia University, New York, NY 10027
| | - Kevin P. White
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06520; and
| | - Harmen J. Bussemaker
- Department of Biological Sciences and Center for Computational Biology and Bioinformatics, Columbia University, New York, NY 10027
| | - Bas van Steensel
- *Department of Molecular Biology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| |
Collapse
|
39
|
Chua G, Morris QD, Sopko R, Robinson MD, Ryan O, Chan ET, Frey BJ, Andrews BJ, Boone C, Hughes TR. Identifying transcription factor functions and targets by phenotypic activation. Proc Natl Acad Sci U S A 2006; 103:12045-50. [PMID: 16880382 PMCID: PMC1567694 DOI: 10.1073/pnas.0605140103] [Citation(s) in RCA: 147] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Mapping transcriptional regulatory networks is difficult because many transcription factors (TFs) are activated only under specific conditions. We describe a generic strategy for identifying genes and pathways induced by individual TFs that does not require knowledge of their normal activation cues. Microarray analysis of 55 yeast TFs that caused a growth phenotype when overexpressed showed that the majority caused increased transcript levels of genes in specific physiological categories, suggesting a mechanism for growth inhibition. Induced genes typically included established targets and genes with consensus promoter motifs, if known, indicating that these data are useful for identifying potential new target genes and binding sites. We identified the sequence 5'-TCACGCAA as a binding sequence for Hms1p, a TF that positively regulates pseudohyphal growth and previously had no known motif. The general strategy outlined here presents a straightforward approach to discovery of TF activities and mapping targets that could be adapted to any organism with transgenic technology.
Collapse
Affiliation(s)
- Gordon Chua
- *Banting and Best Department of Medical Research, and Departments of
| | - Quaid D. Morris
- *Banting and Best Department of Medical Research, and Departments of
- Computer Science
- Electrical and Computer Engineering, and
| | - Richelle Sopko
- Medical Genetics and Microbiology, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 1A8
| | - Mark D. Robinson
- *Banting and Best Department of Medical Research, and Departments of
- Electrical and Computer Engineering, and
| | - Owen Ryan
- Medical Genetics and Microbiology, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 1A8
| | - Esther T. Chan
- Medical Genetics and Microbiology, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 1A8
| | - Brendan J. Frey
- *Banting and Best Department of Medical Research, and Departments of
- Electrical and Computer Engineering, and
| | - Brenda J. Andrews
- *Banting and Best Department of Medical Research, and Departments of
- Medical Genetics and Microbiology, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 1A8
| | - Charles Boone
- *Banting and Best Department of Medical Research, and Departments of
- Medical Genetics and Microbiology, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 1A8
| | - Timothy R. Hughes
- *Banting and Best Department of Medical Research, and Departments of
- Medical Genetics and Microbiology, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 1A8
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
40
|
Colot HV, Park G, Turner GE, Ringelberg C, Crew CM, Litvinkova L, Weiss RL, Borkovich KA, Dunlap JC. A high-throughput gene knockout procedure for Neurospora reveals functions for multiple transcription factors. Proc Natl Acad Sci U S A 2006; 103:10352-10357. [PMID: 16801547 PMCID: PMC1482798 DOI: 10.1073/pnas.0601456103] [Citation(s) in RCA: 931] [Impact Index Per Article: 49.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The low rate of homologous recombination exhibited by wild-type strains of filamentous fungi has hindered development of high-throughput gene knockout procedures for this group of organisms. In this study, we describe a method for rapidly creating knockout mutants in which we make use of yeast recombinational cloning, Neurospora mutant strains deficient in nonhomologous end-joining DNA repair, custom-written software tools, and robotics. To illustrate our approach, we have created strains bearing deletions of 103 Neurospora genes encoding transcription factors. Characterization of strains during growth and both asexual and sexual development revealed phenotypes for 43% of the deletion mutants, with more than half of these strains possessing multiple defects. Overall, the methodology, which achieves high-throughput gene disruption at an efficiency >90% in this filamentous fungus, promises to be applicable to other eukaryotic organisms that have a low frequency of homologous recombination.
Collapse
Affiliation(s)
- Hildur V Colot
- *Department of Genetics, Dartmouth Medical School, HB7400, Hanover, NH 03755
| | - Gyungsoon Park
- Department of Plant Pathology, University of California, Riverside, CA 92521; and
| | - Gloria E Turner
- Department of Chemistry and Biochemistry, 405 Hilgard Avenue, University of California, Los Angeles, CA 90095
| | - Carol Ringelberg
- *Department of Genetics, Dartmouth Medical School, HB7400, Hanover, NH 03755
| | - Christopher M Crew
- Department of Plant Pathology, University of California, Riverside, CA 92521; and
| | - Liubov Litvinkova
- Department of Plant Pathology, University of California, Riverside, CA 92521; and
| | - Richard L Weiss
- Department of Chemistry and Biochemistry, 405 Hilgard Avenue, University of California, Los Angeles, CA 90095
| | | | - Jay C Dunlap
- *Department of Genetics, Dartmouth Medical School, HB7400, Hanover, NH 03755;
| |
Collapse
|
41
|
Reguly T, Breitkreutz A, Boucher L, Breitkreutz BJ, Hon GC, Myers CL, Parsons A, Friesen H, Oughtred R, Tong A, Stark C, Ho Y, Botstein D, Andrews B, Boone C, Troyanskya OG, Ideker T, Dolinski K, Batada NN, Tyers M. Comprehensive curation and analysis of global interaction networks in Saccharomyces cerevisiae. J Biol 2006; 5:11. [PMID: 16762047 PMCID: PMC1561585 DOI: 10.1186/jbiol36] [Citation(s) in RCA: 224] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2005] [Revised: 03/17/2006] [Accepted: 03/30/2006] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND The study of complex biological networks and prediction of gene function has been enabled by high-throughput (HTP) methods for detection of genetic and protein interactions. Sparse coverage in HTP datasets may, however, distort network properties and confound predictions. Although a vast number of well substantiated interactions are recorded in the scientific literature, these data have not yet been distilled into networks that enable system-level inference. RESULTS We describe here a comprehensive database of genetic and protein interactions, and associated experimental evidence, for the budding yeast Saccharomyces cerevisiae, as manually curated from over 31,793 abstracts and online publications. This literature-curated (LC) dataset contains 33,311 interactions, on the order of all extant HTP datasets combined. Surprisingly, HTP protein-interaction datasets currently achieve only around 14% coverage of the interactions in the literature. The LC network nevertheless shares attributes with HTP networks, including scale-free connectivity and correlations between interactions, abundance, localization, and expression. We find that essential genes or proteins are enriched for interactions with other essential genes or proteins, suggesting that the global network may be functionally unified. This interconnectivity is supported by a substantial overlap of protein and genetic interactions in the LC dataset. We show that the LC dataset considerably improves the predictive power of network-analysis approaches. The full LC dataset is available at the BioGRID (http://www.thebiogrid.org) and SGD (http://www.yeastgenome.org/) databases. CONCLUSION Comprehensive datasets of biological interactions derived from the primary literature provide critical benchmarks for HTP methods, augment functional prediction, and reveal system-level attributes of biological networks.
Collapse
Affiliation(s)
- Teresa Reguly
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto ON M5G 1X5, Canada
| | - Ashton Breitkreutz
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto ON M5G 1X5, Canada
| | - Lorrie Boucher
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto ON M5G 1X5, Canada
- Department of Medical Genetics and Microbiology, University of Toronto, Toronto ON M5S 1A8, Canada
| | - Bobby-Joe Breitkreutz
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto ON M5G 1X5, Canada
| | - Gary C Hon
- Department of Bioengineering, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0412, USA
| | - Chad L Myers
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Washington Road, Princeton, NJ 08544, USA
- Department of Computer Science, Princeton University, NJ 08544, USA
| | - Ainslie Parsons
- Department of Medical Genetics and Microbiology, University of Toronto, Toronto ON M5S 1A8, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto ON M5G 1L6, Canada
| | - Helena Friesen
- Banting and Best Department of Medical Research, University of Toronto, Toronto ON M5G 1L6, Canada
| | - Rose Oughtred
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Washington Road, Princeton, NJ 08544, USA
| | - Amy Tong
- Department of Medical Genetics and Microbiology, University of Toronto, Toronto ON M5S 1A8, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto ON M5G 1L6, Canada
| | - Chris Stark
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto ON M5G 1X5, Canada
| | - Yuen Ho
- Banting and Best Department of Medical Research, University of Toronto, Toronto ON M5G 1L6, Canada
| | - David Botstein
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Washington Road, Princeton, NJ 08544, USA
| | - Brenda Andrews
- Department of Medical Genetics and Microbiology, University of Toronto, Toronto ON M5S 1A8, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto ON M5G 1L6, Canada
| | - Charles Boone
- Department of Medical Genetics and Microbiology, University of Toronto, Toronto ON M5S 1A8, Canada
- Banting and Best Department of Medical Research, University of Toronto, Toronto ON M5G 1L6, Canada
| | - Olga G Troyanskya
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Washington Road, Princeton, NJ 08544, USA
- Department of Computer Science, Princeton University, NJ 08544, USA
| | - Trey Ideker
- Department of Bioengineering, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0412, USA
| | - Kara Dolinski
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Washington Road, Princeton, NJ 08544, USA
| | - Nizar N Batada
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto ON M5G 1X5, Canada
| | - Mike Tyers
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto ON M5G 1X5, Canada
- Department of Medical Genetics and Microbiology, University of Toronto, Toronto ON M5S 1A8, Canada
| |
Collapse
|
42
|
Boshoff HIM, Manjunatha UH. The impact of genomics on discovering drugs against infectious diseases. Microbes Infect 2006; 8:1654-61. [PMID: 16690340 DOI: 10.1016/j.micinf.2005.11.018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2005] [Accepted: 11/30/2005] [Indexed: 01/30/2023]
Abstract
Genomics is accelerating the progress in data generation and interpretation in the global analyses of components of cells, including the spectrum of lipids, RNA, metabolites, proteins, mutational phenotypes or DNA methylation sites. Integration of the knowledge generated by these diverse strategies is predicted to have a tremendous impact on approaches to rational drug discovery against infectious diseases.
Collapse
Affiliation(s)
- Helena I M Boshoff
- National Institutes of Health, Tuberculosis Research Section, LIG/NIAID/NIH, Twinbrook II, Room 239, 12441 Parklawn Drive, Rockville, MD 20852, USA.
| | | |
Collapse
|
43
|
Dolinski K, Botstein D. Changing perspectives in yeast research nearly a decade after the genome sequence. Genome Res 2006; 15:1611-9. [PMID: 16339358 DOI: 10.1101/gr.3727505] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Research with budding yeast (Saccharomyces cerevisiae) has been transformed by the publication, nearly a decade ago, of the entire genome DNA sequence. The introduction of this first eukaryotic genomic sequence changed the yeast research environment significantly, not just because of dramatic progress in technical means but also because the sequence made accessible a new class of scientific questions. A central goal of yeast research remains the determination of the biological role of every sequence feature in the yeast genome. The most remarkable change has been the shift in perspective from focus on individual genes and functionalities to a more global view of how the cellular networks and systems interact and function together to produce the highly evolved organism we see today.
Collapse
Affiliation(s)
- Kara Dolinski
- Lewis-Sigler Institute for Integrative Genomics, Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544 USA
| | | |
Collapse
|
44
|
Abstract
The cell division cycle is one of the most intensively studied biological processes, yet, in spite of great effort, many questions remain as to how the cell cycle is controlled by cyclin-dependent kinases and other critical regulators. Recent functional genomic and proteomic approaches have yielded new insights into almost all aspects of cell cycle control, including transcriptional circuits, DNA replication, sister chromatid separation and regulation by environmental signals. Perhaps most notably, systematic analysis has begin to reveal meta-level connections between previously distinct sub-processes. As the interconnections between these huge datasets are beyond intuition, mathematical representation and automated analysis of functional genomic data is an urgent mandate.
Collapse
Affiliation(s)
- Mike Tyers
- Samuel Lunenfeld Research Institute, Toronto, Canada M5G 1X5.
| |
Collapse
|