1
|
Behdarvandy M, Karimian M, Atlasi MA, Azami Tameh A. Heat shock protein 27 as a neuroprotective biomarker and a suitable target for stem cell therapy and pharmacotherapy in ischemic stroke. Cell Biol Int 2019; 44:356-367. [PMID: 31502740 DOI: 10.1002/cbin.11237] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 09/08/2019] [Indexed: 12/12/2022]
Abstract
Ischemic stroke is a major common cause of death and long-term disability worldwide. Several pathophysiological events including excitotoxicity, oxidative/nitrative stress, inflammation, and apoptosis are involved in ischemic injuries. Recently, the molecular mechanisms involved in cerebral ischemia through a focus on a member of small heat shock proteins family, Hsp27, has been developed. Notably, following exposure to ischemia, Hsp27 expression in the brain could be increased rather than the normal condition and it may play an important role in neuroprotection after ischemic stroke. The neuroprotection effects of Hsp27 may arise from its anti-oxidant, anti-inflammatory, anti-apoptotic, and chaperonic properties. Moreover, some therapeutic strategies such as stem cell therapy and pharmacotherapy have been developed with Hsp27 targeting. In this review, we describe the function and structure of Hsp27 and its possible role in neuroprotection after ischemic stroke. Finally, we present current studies in stroke therapy, which focused on Hsp27 targeting.
Collapse
Affiliation(s)
- Marjan Behdarvandy
- Anatomical Sciences Research Center, Kashan University of Medical Sciences, Qotb-e Ravandi Blvd., 8715988141, Kashan, Iran
| | - Mohammad Karimian
- Anatomical Sciences Research Center, Kashan University of Medical Sciences, Qotb-e Ravandi Blvd., 8715988141, Kashan, Iran
| | - Mohammad Ali Atlasi
- Anatomical Sciences Research Center, Kashan University of Medical Sciences, Qotb-e Ravandi Blvd., 8715988141, Kashan, Iran
| | - Abolfazl Azami Tameh
- Anatomical Sciences Research Center, Kashan University of Medical Sciences, Qotb-e Ravandi Blvd., 8715988141, Kashan, Iran
| |
Collapse
|
2
|
Sanz-Soler R, Sanz L, Calvete JJ. Distribution ofRPTLNGenes Across Reptilia: Hypothesized Role for RPTLN in the Evolution of SVMPs. Integr Comp Biol 2016; 56:989-1003. [DOI: 10.1093/icb/icw031] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
|
3
|
Ling KH, Brautigan PJ, Moore S, Fraser R, Cheah PS, Raison JM, Babic M, Lee YK, Daish T, Mattiske DM, Mann JR, Adelson DL, Thomas PQ, Hahn CN, Scott HS. Derivation of an endogenous small RNA from double-stranded Sox4 sense and natural antisense transcripts in the mouse brain. Genomics 2016; 107:88-99. [DOI: 10.1016/j.ygeno.2016.01.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2015] [Revised: 01/18/2016] [Accepted: 01/20/2016] [Indexed: 11/28/2022]
|
4
|
Characterization of human pseudogene-derived non-coding RNAs for functional potential. PLoS One 2014; 9:e93972. [PMID: 24699680 PMCID: PMC3974860 DOI: 10.1371/journal.pone.0093972] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2014] [Accepted: 03/10/2014] [Indexed: 11/19/2022] Open
Abstract
Thousands of pseudogenes exist in the human genome and many are transcribed, but their functional potential remains elusive and understudied. To explore these issues systematically, we first developed a computational pipeline to identify transcribed pseudogenes from RNA-Seq data. Applying the pipeline to datasets from 16 distinct normal human tissues identified ∼ 3,000 pseudogenes that could produce non-coding RNAs in a manner of low abundance but high tissue specificity under normal physiological conditions. Cross-tissue comparison revealed that the transcriptional profiles of pseudogenes and their parent genes showed mostly positive correlations, suggesting that pseudogene transcription could have a positive effect on the expression of their parent genes, perhaps by functioning as competing endogenous RNAs (ceRNAs), as previously suggested and demonstrated with the PTEN pseudogene, PTENP1. Our analysis of the ENCODE project data also found many transcriptionally active pseudogenes in the GM12878 and K562 cell lines; moreover, it showed that many human pseudogenes produced small RNAs (sRNAs) and some pseudogene-derived sRNAs, especially those from antisense strands, exhibited evidence of interfering with gene expression. Further integrated analysis of transcriptomics and epigenomics data, however, demonstrated that trimethylation of histone 3 at lysine 9 (H3K9me3), a posttranslational modification typically associated with gene repression and heterochromatin, was enriched at many transcribed pseudogenes in a transcription-level dependent manner in the two cell lines. The H3K9me3 enrichment was more prominent in pseudogenes that produced sRNAs at pseudogene loci and their adjacent regions, an observation further supported by the co-enrichment of SETDB1 (a H3K9 methyltransferase), suggesting that pseudogene sRNAs may have a role in regional chromatin repression. Taken together, our comprehensive and systematic characterization of pseudogene transcription uncovers a complex picture of how pseudogene ncRNAs could influence gene and pseudogene expression, at both epigenetic and post-transcriptional levels.
Collapse
|
5
|
Durán E, Djebali S, González S, Flores O, Mercader JM, Guigó R, Torrents D, Soler-López M, Orozco M. Unravelling the hidden DNA structural/physical code provides novel insights on promoter location. Nucleic Acids Res 2013; 41:7220-30. [PMID: 23761436 PMCID: PMC3753636 DOI: 10.1093/nar/gkt511] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Although protein recognition of DNA motifs in promoter regions has been traditionally considered as a critical regulatory element in transcription, the location of promoters, and in particular transcription start sites (TSSs), still remains a challenge. Here we perform a comprehensive analysis of putative core promoter sequences relative to non-annotated predicted TSSs along the human genome, which were defined by distinct DNA physical properties implemented in our ProStar computational algorithm. A representative sampling of predicted regions was subjected to extensive experimental validation and analyses. Interestingly, the vast majority proved to be transcriptionally active despite the lack of specific sequence motifs, indicating that physical signaling is indeed able to detect promoter activity beyond conventional TSS prediction methods. Furthermore, highly active regions displayed typical chromatin features associated to promoters of housekeeping genes. Our results enable to redefine the promoter signatures and analyze the diversity, evolutionary conservation and dynamic regulation of human core promoters at large-scale. Moreover, the present study strongly supports the hypothesis of an ancient regulatory mechanism encoded by the intrinsic physical properties of the DNA that may contribute to the complexity of transcription regulation in the human genome.
Collapse
Affiliation(s)
- Elisa Durán
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona 08028, Spain, Joint IRB-BSC Research Program on Computational Biology, Barcelona 08028, Spain, Bioinformatics and Genomics Group, Center for Genomic Regulation and Universitat Pompeu Fabra, Barcelona 08003, Spain, Barcelona Supercomputing Center, Barcelona 08034, Spain and Department of Biochemistry and Molecular Biology, University of Barcelona, Barcelona 08028, Spain
| | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Conley AB, Jordan IK. Epigenetic regulation of human cis-natural antisense transcripts. Nucleic Acids Res 2012; 40:1438-45. [PMID: 22371288 PMCID: PMC3287164 DOI: 10.1093/nar/gkr1010] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Mammalian genomes encode numerous cis-natural antisense transcripts (cis-NATs). The extent to which these cis-NATs are actively regulated and ultimately functionally relevant, as opposed to transcriptional noise, remains a matter of debate. To address this issue, we analyzed the chromatin environment and RNA Pol II binding properties of human cis-NAT promoters genome-wide. Cap analysis of gene expression data were used to identify thousands of cis-NAT promoters, and profiles of nine histone modifications and RNA Pol II binding for these promoters in ENCODE cell types were analyzed using chromatin immunoprecipitation followed by sequencing (ChIP-seq) data. Active cis-NAT promoters are enriched with activating histone modifications and occupied by RNA Pol II, whereas weak cis-NAT promoters are depleted for both activating modifications and RNA Pol II. The enrichment levels of activating histone modifications and RNA Pol II binding show peaks centered around cis-NAT transcriptional start sites, and the levels of activating histone modifications at cis-NAT promoters are positively correlated with cis-NAT expression levels. Cis-NAT promoters also show highly tissue-specific patterns of expression. These results suggest that human cis-NATs are actively transcribed by the RNA Pol II and that their expression is epigenetically regulated, prerequisites for a functional potential for many of these non-coding RNAs.
Collapse
Affiliation(s)
- Andrew B Conley
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | | |
Collapse
|
7
|
A user's guide to the encyclopedia of DNA elements (ENCODE). PLoS Biol 2011; 9:e1001046. [PMID: 21526222 PMCID: PMC3079585 DOI: 10.1371/journal.pbio.1001046] [Citation(s) in RCA: 1068] [Impact Index Per Article: 82.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2010] [Accepted: 03/10/2011] [Indexed: 12/18/2022] Open
Abstract
The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome.
Collapse
|
8
|
Computational methods for epigenetic analysis: the protocol of computational analysis for modified methylation-specific digital karyotyping based on massively parallel sequencing. Methods Mol Biol 2011; 791:313-28. [PMID: 21913089 DOI: 10.1007/978-1-61779-316-5_23] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Massively parallel sequencing technology opens new possibilities for epigenetic research. Many methods have been developed based on the new sequencing platforms, allowing an ultra-deep mapping of epigenetic variants in a fast and cost-effective way. However, handling millions of short reads produced by these sequencing platforms is a huge challenge for many laboratories. Thus, there is a need for the development of accurate and fast computational tools for epigenetic studies in the new era of genomic sequencing.Modified methylation-specific digital karyotyping (MMSDK) is an improved method for genome-wide DNA methylation profiling based on the combination of traditional MSDK and Illumina/Solexa sequencing. Here, we introduce our computational tools used in the MMSDK analysis process from the experimental design to statistical analysis. We have developed a mapping process based on the in silico simulation of combined enzyme cutting and tag extraction of the reference genome. Subsequently, the 20-21 nucleotides (nt) long tags obtained by sequencing are mapped to the simulated library using an open source software Mapping and Assembly with Qualities. Our computational methods include trimming, annotation, normalization, and counting the reads to obtain digital DNA methylation profiles. We present the complete protocol and discuss some important issues that should be considered by readers, such as handling of repeat sequences, SNPs, and normalization. The core part of this protocol (mapping and annotation of tags) is suitable for any tag profiling-based methods, and it could also be modified to analyze results from other types of epigenetic studies based on massively parallel sequencing.
Collapse
|
9
|
Mercer TR, Wilhelm D, Dinger ME, Soldà G, Korbie DJ, Glazov EA, Truong V, Schwenke M, Simons C, Matthaei KI, Saint R, Koopman P, Mattick JS. Expression of distinct RNAs from 3' untranslated regions. Nucleic Acids Res 2010; 39:2393-403. [PMID: 21075793 PMCID: PMC3064787 DOI: 10.1093/nar/gkq1158] [Citation(s) in RCA: 160] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The 3′ untranslated regions (3′UTRs) of eukaryotic genes regulate mRNA stability, localization and translation. Here, we present evidence that large numbers of 3′UTRs in human, mouse and fly are also expressed separately from the associated protein-coding sequences to which they are normally linked, likely by post-transcriptional cleavage. Analysis of CAGE (capped analysis of gene expression), SAGE (serial analysis of gene expression) and cDNA libraries, as well as microarray expression profiles, demonstrate that the independent expression of 3′UTRs is a regulated and conserved genome-wide phenomenon. We characterize the expression of several 3′UTR-derived RNAs (uaRNAs) in detail in mouse embryos, showing by in situ hybridization that these transcripts are expressed in a cell- and subcellular-specific manner. Our results suggest that 3′UTR sequences can function not only in cis to regulate protein expression, but also intrinsically and independently in trans, likely as noncoding RNAs, a conclusion supported by a number of previous genetic studies. Our findings suggest novel functions for 3′UTRs, as well as caution in the use of 3′UTR sequence probes to analyze gene expression.
Collapse
Affiliation(s)
- Tim R Mercer
- Institute for Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Abstract
Transcriptional regulation of gene expression plays a significant role in establishing the diversity of human cell types and biological functions from a common set of genes. The components of regulatory control in the human genome include cis-acting elements that act across immense genomic distances to influence the spatial and temporal distribution of gene expression. Here we review the established categories of distant-acting regulatory elements, discussing the classical and contemporary evidence of their regulatory potential and clinical importance. Current efforts to identify regulatory sequences throughout the genome and elucidate their biological significance depend heavily on advances in sequence conservation-based analyses and on increasingly large-scale efforts applying transgenic technologies in model organisms. We discuss the advantages and limitations of sequence conservation as a predictor of regulatory function and present complementary emerging technologies now being applied to annotate regulatory elements in vertebrate genomes.
Collapse
Affiliation(s)
- James P Noonan
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA.
| | | |
Collapse
|
11
|
Hancks DC, Kazazian H. SVA retrotransposons: Evolution and genetic instability. Semin Cancer Biol 2010; 20:234-45. [PMID: 20416380 PMCID: PMC2945828 DOI: 10.1016/j.semcancer.2010.04.001] [Citation(s) in RCA: 119] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2010] [Revised: 04/01/2010] [Accepted: 04/14/2010] [Indexed: 01/21/2023]
Abstract
SINE-VNTR-Alus (SVA) are non-autonomous hominid specific retrotransposons that are associated with disease in humans. SVAs are evolutionarily young and presumably mobilized by the LINE-1 reverse transcriptase in trans. SVAs are currently active and may impact the host through a variety of mechanisms including insertional mutagenesis, exon shuffling, alternative splicing, and the generation of differentially methylated regions (DMR). Here we review SVA biology, including SVA insertions associated with known diseases. Further, we discuss a model describing the initial formation of SVA and the mechanisms by which SVA may impact the host.
Collapse
Affiliation(s)
- Dustin C. Hancks
- Department of Genetics, The University of Pennsylvania School of Medicine
| | - Haig Kazazian
- Department of Genetics, The University of Pennsylvania School of Medicine
| |
Collapse
|
12
|
Warnatz HJ, Querfurth R, Guerasimova A, Cheng X, Haas SA, Hufton AL, Manke T, Vanhecke D, Nietfeld W, Vingron M, Janitz M, Lehrach H, Yaspo ML. Functional analysis and identification of cis-regulatory elements of human chromosome 21 gene promoters. Nucleic Acids Res 2010; 38:6112-23. [PMID: 20494980 PMCID: PMC2952857 DOI: 10.1093/nar/gkq402] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Given the inherent limitations of in silico studies relying solely on DNA sequence analysis, the functional characterization of mammalian promoters and associated cis-regulatory elements requires experimental support, which demands cloning and analysis of putative promoter regions. Focusing on human chromosome 21, we cloned 182 gene promoters of 2500 bp in length and conducted reporter gene assays on transfected-cell arrays. We found 56 promoters that were active in HEK293 cells, while another 49 promoters could be activated by treatment of cells with Trichostatin A or depletion of serum. We observed high correlations between promoter activities and endogenous transcript levels, RNA polymerase II occupancy, CpG islands and core promoter elements. Truncation of a subset of 62 promoters to ∼500 bp revealed that truncation rarely resulted in loss of activity, but rather in loss of responses to external stimuli, suggesting the presence of cis-regulatory response elements within distal promoter regions. In these regions, we found a strong enrichment of transcription factor binding sites that could potentially activate gene expression in the presence of stimuli. This study illustrates the modular functional architecture of chromosome 21 promoters and helps to reveal the complex mechanisms governing transcriptional regulation.
Collapse
Affiliation(s)
- Hans-Jörg Warnatz
- Department for Vertebrate Genomics, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Lim SJ, Tan TW, Tong JC. Computational Epigenetics: the new scientific paradigm. Bioinformation 2010; 4:331-7. [PMID: 20978607 PMCID: PMC2957762 DOI: 10.6026/97320630004331] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2009] [Revised: 01/13/2010] [Accepted: 01/21/2010] [Indexed: 12/25/2022] Open
Abstract
Epigenetics has recently emerged as a critical field for studying how non-gene factors can influence the traits and functions of an organism. At the core of this new wave of research is the use of computational tools that play critical roles not only in directing the selection of key experiments, but also in formulating new testable hypotheses through detailed analysis of complex genomic information that is not achievable using traditional approaches alone. Epigenomics, which combines traditional genomics with computer science, mathematics, chemistry, biochemistry and proteomics for the large-scale analysis of heritable changes in phenotype, gene function or gene expression that are not dependent on gene sequence, offers new opportunities to further our understanding of transcriptional regulation, nuclear organization, development and disease. This article examines existing computational strategies for the study of epigenetic factors. The most important databases and bioinformatic tools in this rapidly growing field have been reviewed.
Collapse
Affiliation(s)
- Shen Jean Lim
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 8 Medical Drive, Singapore 117597
| | | | | |
Collapse
|
14
|
Hartzell DD, Trinklein ND, Mendez J, Murphy N, Aldred SF, Wood K, Urh M. A functional analysis of the CREB signaling pathway using HaloCHIP-chip and high throughput reporter assays. BMC Genomics 2009; 10:497. [PMID: 19860899 PMCID: PMC2774331 DOI: 10.1186/1471-2164-10-497] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2009] [Accepted: 10/27/2009] [Indexed: 01/30/2023] Open
Abstract
Background Regulation of gene expression is essential for normal development and cellular growth. Transcriptional events are tightly controlled both spatially and temporally by specific DNA-protein interactions. In this study we finely map the genome-wide targets of the CREB protein across all known and predicted human promoters, and characterize the functional consequences of a subset of these binding events using high-throughput reporter assays. To measure CREB binding, we used HaloCHIP, an antibody-free alternative to the ChIP method that utilizes the HaloTag fusion protein, and also high-throughput promoter-luciferase reporter assays, which provide rapid and quantitative screening of promoters for transcriptional activation or repression in living cells. Results In analysis of CREB genome-wide binding events using a comprehensive DNA microarray of human promoters, we observe for the first time that CREB has a strong preference for binding at bidirectional promoters and unlike unidirectional promoters, these binding events often occur downstream of transcription start sites. Comparison between HaloCHIP-chip and ChIP-chip data reveal this to be true for both methodologies, indicating it is not a bias of the technology chosen. Transcriptional data obtained from promoter-luciferase reporter arrays also show an unprecedented, high level of activation of CREB-bound promoters in the presence of the co-activator protein TORC1. Conclusion These data suggest for the first time that TORC1 provides directional information when CREB is bound at bidirectional promoters and possible pausing of the CREB protein after initial transcriptional activation. Also, this combined approach demonstrates the ability to more broadly characterize CREB protein-DNA interactions wherein not only DNA binding sites are discovered, but also the potential of the promoter sequence to respond to CREB is evaluated.
Collapse
Affiliation(s)
- Danette D Hartzell
- SwitchGear Genomics 1455 Adams Drive, Suite 1317, Menlo Park, CA 94025, USA.
| | | | | | | | | | | | | |
Collapse
|
15
|
Rasmussen MH, Wang B, Wabl M, Nielsen AL, Pedersen FS. Activation of alternative Jdp2 promoters and functional protein isoforms in T-cell lymphomas by retroviral insertion mutagenesis. Nucleic Acids Res 2009; 37:4657-71. [PMID: 19502497 PMCID: PMC2724284 DOI: 10.1093/nar/gkp469] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Retroviral insertional mutagenesis has been instrumental for the identification of genes important in cancer development. The molecular mechanisms involved in retroviral-mediated activation of proto-oncogenes influence the distribution of insertions within specific regions during tumorigenesis and hence may point to novel gene structures. From a retroviral tagging screen on tumors of 1767 SL3-3 MLV-infected BALB/c mice, intron 2 of the AP-1 repressor Jdp2 locus was found frequently targeted by proviruses resulting in upregulation of non-canonical RNA subspecies. We identified several promoter regions within 1000 bp upstream of exon 3 that allowed for the production of Jdp2 protein isoforms lacking the histone acetylase inhibitory domain INHAT present in canonical Jdp2. The novel Jdp2 isoforms localized to the nucleus and over-expression in murine fibroblast cells induced cell death similar to canonic Jdp2. When expressed in the context of oncogenic NRAS both full length Jdp2 and the shorter isoforms increased anchorage-independent growth. Our results demonstrate a biological function of Jdp2 lacking the INHAT domain and suggest a post-genomic application for the use of retroviral tagging data in identifying new gene products with a potential role in tumorigenesis.
Collapse
|
16
|
Deblois G, Giguère V. Nuclear receptor location analyses in mammalian genomes: from gene regulation to regulatory networks. Mol Endocrinol 2008; 22:1999-2011. [PMID: 18292239 PMCID: PMC5419453 DOI: 10.1210/me.2007-0546] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2007] [Accepted: 02/14/2008] [Indexed: 02/07/2023] Open
Abstract
Rapid progress in mapping nuclear receptor binding sites, referred to as "location analysis," has recently been achieved through the use of chromatin immunoprecipitation approaches. Location analysis can be performed on a single locus or cover a complete genome, and the resulting datasets can be probed to identify direct target genes and/or investigate the molecular mechanisms by which nuclear receptors control gene expression. In addition, when coupled with other genetic and functional genomics investigative methods, location analysis has proven to be a powerful tool with which to identify novel biological functions of nuclear receptors and build transcriptional regulatory networks. Thus, the knowledge gained from several recent chromatin immunoprecipitation-based studies has challenged basic concepts of nuclear receptor action, offered new insights into gene-regulatory mechanisms, and led to the identification of nuclear receptor-controlled biological functions.
Collapse
|
17
|
Flockerzi A, Ruggieri A, Frank O, Sauter M, Maldener E, Kopper B, Wullich B, Seifarth W, Müller-Lantzsch N, Leib-Mösch C, Meese E, Mayer J. Expression patterns of transcribed human endogenous retrovirus HERV-K(HML-2) loci in human tissues and the need for a HERV Transcriptome Project. BMC Genomics 2008; 9:354. [PMID: 18664271 PMCID: PMC2525661 DOI: 10.1186/1471-2164-9-354] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2008] [Accepted: 07/29/2008] [Indexed: 12/14/2022] Open
Abstract
Background A significant proportion of the human genome is comprised of human endogenous retroviruses (HERVs). HERV transcripts are found in every human tissue. Expression of proviruses of the HERV-K(HML-2) family has been associated with development of human tumors, in particular germ cell tumors (GCT). Very little is known about transcriptional activity of individual HML-2 loci in human tissues, though. Results By employing private nucleotide differences between loci, we assigned ~1500 HML-2 cDNAs to individual HML-2 loci, identifying, in total, 23 transcriptionally active HML-2 proviruses. Several loci are active in various human tissue types. Transcription levels of some HML-2 loci appear higher than those of other loci. Several HML-2 Rec-encoding loci are expressed in GCT and non-GCT tissues. A provirus on chromosome 22q11.21 appears strongly upregulated in pathologic GCT tissues and may explain high HML-2 Gag protein levels in GCTs. Presence of Gag and Env antibodies in GCT patients is not correlated with activation of individual loci. HML-2 proviruses previously reported capable of forming an infectious HML-2 variant are transcriptionally active in germ cell tissue. Our study furthermore shows that Expressed Sequence Tag (EST) data are insufficient to describe transcriptional activity of HML-2 and other HERV loci in tissues of interest. Conclusion Our, to date, largest-scale study reveals in greater detail expression patterns of individual HML-2 loci in human tissues of clinical interest. Moreover, large-scale, specialized studies are indicated to better comprehend transcriptional activity and regulation of HERVs. We thus emphasize the need for a specialized HERV Transcriptome Project.
Collapse
Affiliation(s)
- Aline Flockerzi
- Department of Human Genetics, Medical Faculty, University of Saarland, Homburg, Germany.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Petesch SJ, Lis JT. Rapid, transcription-independent loss of nucleosomes over a large chromatin domain at Hsp70 loci. Cell 2008; 134:74-84. [PMID: 18614012 DOI: 10.1016/j.cell.2008.05.029] [Citation(s) in RCA: 260] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2008] [Revised: 04/13/2008] [Accepted: 05/19/2008] [Indexed: 11/26/2022]
Abstract
To efficiently transcribe genes, RNA Polymerase II (Pol II) must overcome barriers imposed by nucleosomes and higher-order chromatin structure. Many genes, including Drosophila melanogaster Hsp70, undergo changes in chromatin structure upon activation. To characterize these changes, we mapped the nucleosome landscape of Hsp70 after an instantaneous heat shock at high spatial and temporal resolution. Surprisingly, we find an initial disruption of nucleosomes across the entire gene within 30 s after activation, faster than the rate of Pol II transcription, followed by a second further disruption within 2 min. This initial change occurs independently of Pol II transcription. Furthermore, the rapid loss of nucleosomes extends beyond Hsp70 and halts at the scs and scs' insulating elements. An RNAi screen of 28 transcription and chromatin-related factors reveals that depletion of heat shock factor, GAGA Factor, or Poly(ADP)-Ribose Polymerase or its activity abolishes the loss of nucleosomes upon Hsp70 activation.
Collapse
Affiliation(s)
- Steven J Petesch
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | | |
Collapse
|
19
|
Petesch SJ, Lis JT. Rapid, transcription-independent loss of nucleosomes over a large chromatin domain at Hsp70 loci. Cell 2008. [PMID: 18614012 DOI: 10.1016/j.cell2008.05.029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/24/2023]
Abstract
To efficiently transcribe genes, RNA Polymerase II (Pol II) must overcome barriers imposed by nucleosomes and higher-order chromatin structure. Many genes, including Drosophila melanogaster Hsp70, undergo changes in chromatin structure upon activation. To characterize these changes, we mapped the nucleosome landscape of Hsp70 after an instantaneous heat shock at high spatial and temporal resolution. Surprisingly, we find an initial disruption of nucleosomes across the entire gene within 30 s after activation, faster than the rate of Pol II transcription, followed by a second further disruption within 2 min. This initial change occurs independently of Pol II transcription. Furthermore, the rapid loss of nucleosomes extends beyond Hsp70 and halts at the scs and scs' insulating elements. An RNAi screen of 28 transcription and chromatin-related factors reveals that depletion of heat shock factor, GAGA Factor, or Poly(ADP)-Ribose Polymerase or its activity abolishes the loss of nucleosomes upon Hsp70 activation.
Collapse
Affiliation(s)
- Steven J Petesch
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | | |
Collapse
|
20
|
Lu J, Luo L. Prediction for human transcription start site using diversity measure with quadratic discriminant. Bioinformation 2008; 2:316-21. [PMID: 18478087 PMCID: PMC2374378 DOI: 10.6026/97320630002316] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2008] [Revised: 03/17/2008] [Accepted: 04/15/2008] [Indexed: 11/23/2022] Open
Abstract
The accurate identification of promoter regions and transcription start sites is a challenge to the construction of human transcription regulation networks. Thus, an efficient prediction method based on theoretical formulation is necessary for this purpose. We used the method of increment diversity with quadratic discriminant analysis (IDQD) to predict transcription start sites (TSS). The method produced sensitivity and positive predictive value of more than 65% with positives to negatives ratio of 1:58. The performance evaluation using Receiver Operator Characteristics (ROC) showed an auROC (area under ROC) of greater than 96%. The evaluation by Precision Recall Curves (PRC) showed an auPRC (area under PRC) of about 26% for positives to negatives ratio of 1:679 and about 64% for positives to negatives ratio of 1:113. The results documented in this approach are either better or comparable to other known methods.
Collapse
Affiliation(s)
- Jun Lu
- Laboratory of Theoretical Biophysics, Faculty of Science and Technology, Inner Mongolia University, Hohhot 010021, P.R.China.
| | | |
Collapse
|
21
|
Abstract
Despite much interest in regulatory evolution, how promoters have evolved remains poorly studied, mainly owing to paucity of data on promoter regions. Using a new set of high-quality experimentally determined core promoters of the human genome, we conducted a comparative analysis of 2,492 human and rhesus macaque promoters and their neighboring nearly neutral regions. We found that the core promoters have an average rate of nucleotide substitution substantially higher than that at 4-fold degenerate sites and only slightly lower than that for the assumed neutral controls of neighboring noncoding regions, suggesting that core promoters are subject to very weak selective constraints. Interestingly, we identified 24 core promoters (at false discovery rate = 50%) that have evolved at an accelerated rate compared with the neutral controls, suggesting that they may have undergone positive selection. The inferred positively selected genes show strong bias in molecular function. We also used population genetic approaches to examine the evolution of core promoters in human populations and found evidence of positive selection at some loci. Taken together, our results suggest that positive selection has played a substantial role in the evolution of transcriptional regulation in primates.
Collapse
Affiliation(s)
- Han Liang
- Department of Ecology and Evolution, University of Chicago, USA
| | | | | |
Collapse
|
22
|
Wu JQ, Du J, Rozowsky J, Zhang Z, Urban AE, Euskirchen G, Weissman S, Gerstein M, Snyder M. Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome. Genome Biol 2008; 9:R3. [PMID: 18173853 PMCID: PMC2395237 DOI: 10.1186/gb-2008-9-1-r3] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2007] [Revised: 12/06/2007] [Accepted: 01/03/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Recent studies of the mammalian transcriptome have revealed a large number of additional transcribed regions and extraordinary complexity in transcript diversity. However, there is still much uncertainty regarding precisely what portion of the genome is transcribed, the exact structures of these novel transcripts, and the levels of the transcripts produced. RESULTS We have interrogated the transcribed loci in 420 selected ENCyclopedia Of DNA Elements (ENCODE) regions using rapid amplification of cDNA ends (RACE) sequencing. We analyzed annotated known gene regions, but primarily we focused on novel transcriptionally active regions (TARs), which were previously identified by high-density oligonucleotide tiling arrays and on random regions that were not believed to be transcribed. We found RACE sequencing to be very sensitive and were able to detect low levels of transcripts in specific cell types that were not detectable by microarrays. We also observed many instances of sense-antisense transcripts; further analysis suggests that many of the antisense transcripts (but not all) may be artifacts generated from the reverse transcription reaction. Our results show that the majority of the novel TARs analyzed (60%) are connected to other novel TARs or known exons. Of previously unannotated random regions, 17% were shown to produce overlapping transcripts. Furthermore, it is estimated that 9% of the novel transcripts encode proteins. CONCLUSION We conclude that RACE sequencing is an efficient, sensitive, and highly accurate method for characterization of the transcriptome of specific cell/tissue types. Using this method, it appears that much of the genome is represented in polyA+ RNA. Moreover, a fraction of the novel RNAs can encode protein and are likely to be functional.
Collapse
Affiliation(s)
- Jia Qian Wu
- Molecular, Cellular and Developmental Biology Department, KBT918, Yale University, New Haven, Connecticut 06511, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Abstract
Epigenetic research aims to understand heritable gene regulation that is not directly encoded in the DNA sequence. Epigenetic mechanisms such as DNA methylation and histone modifications modulate the packaging of the DNA in the nucleus and thereby influence gene expression. Patterns of epigenetic information are faithfully propagated over multiple cell divisions, which makes epigenetic regulation a key mechanism for cellular differentiation and cell fate decisions. In addition, incomplete erasure of epigenetic information can lead to complex patterns of non-Mendelian inheritance. Stochastic and environment-induced epigenetic defects are known to play a major role in cancer and ageing, and they may also contribute to mental disorders and autoimmune diseases. Recent technical advances such as ChIP-on-chip and ChIP-seq have started to convert epigenetic research into a high-throughput endeavor, to which bioinformatics is expected to make significant contributions. Here, we review pioneering computational studies that have contributed to epigenetic research. In addition, we give a brief introduction into epigenetics-targeted at bioinformaticians who are new to the field-and we outline future challenges in computational epigenetics.
Collapse
Affiliation(s)
- Christoph Bock
- Max-Planck-Institut für Informatik, Saarbrücken, Germany.
| | | |
Collapse
|
24
|
Trinklein ND, Karaöz U, Wu J, Halees A, Force Aldred S, Collins PJ, Zheng D, Zhang ZD, Gerstein MB, Snyder M, Myers RM, Weng Z. Integrated analysis of experimental data sets reveals many novel promoters in 1% of the human genome. Genome Res 2007; 17:720-31. [PMID: 17567992 PMCID: PMC1891333 DOI: 10.1101/gr.5716607] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The regulation of transcriptional initiation in the human genome is a critical component of global gene regulation, but a complete catalog of human promoters currently does not exist. In order to identify regulatory regions, we developed four computational methods to integrate 129 sets of ENCODE-wide chromatin immunoprecipitation data. They collectively predicted 1393 regions. Roughly 47% of the regions were unique to one method, as each method makes different assumptions about the data. Overall, predicted regions tend to localize to highly conserved, DNase I hypersensitive, and actively transcribed regions in the genome. Interestingly, a significant portion of the regions overlaps with annotated 3'-UTRs, suggesting that some of them might regulate anti-sense transcription. The majority of the predicted regions are >2 kb away from the 5'-ends of previously annotated human cDNAs and hence are novel. These novel regions may regulate unannotated transcripts or may represent new alternative transcription start sites of known genes. We tested 163 such regions for promoter activity in four cell lines using transient transfection assays, and 25% of them showed transcriptional activity above background in at least one cell line. We also performed 5'-RACE experiments on 62 novel regions, and 76% of the regions were associated with the 5'-ends of at least two RACE products. Our results suggest that there are at least 35% more functional promoters in the human genome than currently annotated.
Collapse
Affiliation(s)
- Nathan D. Trinklein
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Ulaş Karaöz
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
| | - Jiaqian Wu
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA
| | - Anason Halees
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
| | - Shelley Force Aldred
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Patrick J. Collins
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Deyou Zheng
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Zhengdong D. Zhang
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Mark B. Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Michael Snyder
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Richard M. Myers
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
- Corresponding authors.E-mail ; fax (617) 353-6766.E-mail ; fax (650) 725-9689
| | - Zhiping Weng
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
- Biomedical Engineering Department, Boston University, Boston, Massachusetts 02215, USA
- Corresponding authors.E-mail ; fax (617) 353-6766.E-mail ; fax (650) 725-9689
| |
Collapse
|
25
|
Zhang ZD, Paccanaro A, Fu Y, Weissman S, Weng Z, Chang J, Snyder M, Gerstein MB. Statistical analysis of the genomic distribution and correlation of regulatory elements in the ENCODE regions. Genome Res 2007; 17:787-97. [PMID: 17567997 PMCID: PMC1891338 DOI: 10.1101/gr.5573107] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The comprehensive inventory of functional elements in 44 human genomic regions carried out by the ENCODE Project Consortium enables for the first time a global analysis of the genomic distribution of transcriptional regulatory elements. In this study we developed an intuitive and yet powerful approach to analyze the distribution of regulatory elements found in many different ChIP-chip experiments on a 10 approximately 100-kb scale. First, we focus on the overall chromosomal distribution of regulatory elements in the ENCODE regions and show that it is highly nonuniform. We demonstrate, in fact, that regulatory elements are associated with the location of known genes. Further examination on a local, single-gene scale shows an enrichment of regulatory elements near both transcription start and end sites. Our results indicate that overall these elements are clustered into regulatory rich "islands" and poor "deserts." Next, we examine how consistent the nonuniform distribution is between different transcription factors. We perform on all the factors a multivariate analysis in the framework of a biplot, which enhances biological signals in the experiments. This groups transcription factors into sequence-specific and sequence-nonspecific clusters. Moreover, with experimental variation carefully controlled, detailed correlations show that the distribution of sites was generally reproducible for a specific factor between different laboratories and microarray platforms. Data sets associated with histone modifications have particularly strong correlations. Finally, we show how the correlations between factors change when only regulatory elements far from the transcription start sites are considered.
Collapse
Affiliation(s)
- Zhengdong D. Zhang
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Alberto Paccanaro
- Department of Computer Science Royal Holloway, University of London, Egham Hill, TW20 0EX, United Kingdom
| | - Yutao Fu
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
| | - Sherman Weissman
- Department of Genetics, Yale University, New Haven, Connecticut 06510, USA
| | - Zhiping Weng
- Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
- Biomedical Engineering Department, Boston University, Boston, Massachusetts 02215, USA
| | - Joseph Chang
- Department of Statistics, Yale University, New Haven, Connecticut 06520, USA
| | - Michael Snyder
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA
| | - Mark B. Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
- Program in Computational Biology and Bioinformatics Yale University, New Haven, Connecticut 06520, USA
- Corresponding author.E-mail ; fax (360) 838-7861
| |
Collapse
|
26
|
Affiliation(s)
- George M Weinstock
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA.
| |
Collapse
|
27
|
Zheng D, Frankish A, Baertsch R, Kapranov P, Reymond A, Choo SW, Lu Y, Denoeud F, Antonarakis SE, Snyder M, Ruan Y, Wei CL, Gingeras TR, Guigó R, Harrow J, Gerstein MB. Pseudogenes in the ENCODE regions: consensus annotation, analysis of transcription, and evolution. Genome Res 2007; 17:839-51. [PMID: 17568002 PMCID: PMC1891343 DOI: 10.1101/gr.5586307] [Citation(s) in RCA: 131] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Arising from either retrotransposition or genomic duplication of functional genes, pseudogenes are "genomic fossils" valuable for exploring the dynamics and evolution of genes and genomes. Pseudogene identification is an important problem in computational genomics, and is also critical for obtaining an accurate picture of a genome's structure and function. However, no consensus computational scheme for defining and detecting pseudogenes has been developed thus far. As part of the ENCyclopedia Of DNA Elements (ENCODE) project, we have compared several distinct pseudogene annotation strategies and found that different approaches and parameters often resulted in rather distinct sets of pseudogenes. We subsequently developed a consensus approach for annotating pseudogenes (derived from protein coding genes) in the ENCODE regions, resulting in 201 pseudogenes, two-thirds of which originated from retrotransposition. A survey of orthologs for these pseudogenes in 28 vertebrate genomes showed that a significant fraction ( approximately 80%) of the processed pseudogenes are primate-specific sequences, highlighting the increasing retrotransposition activity in primates. Analysis of sequence conservation and variation also demonstrated that most pseudogenes evolve neutrally, and processed pseudogenes appear to have lost their coding potential immediately or soon after their emergence. In order to explore the functional implication of pseudogene prevalence, we have extensively examined the transcriptional activity of the ENCODE pseudogenes. We performed systematic series of pseudogene-specific RACE analyses. These, together with complementary evidence derived from tiling microarrays and high throughput sequencing, demonstrated that at least a fifth of the 201 pseudogenes are transcribed in one or more cell lines or tissues.
Collapse
Affiliation(s)
- Deyou Zheng
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
- Corresponding authors.E-mail ; fax (360) 838-7861.E-mail ; fax (360) 838-7861
| | - Adam Frankish
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1HH, United Kingdom
| | - Robert Baertsch
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, California 95064, USA
| | | | - Alexandre Reymond
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland
| | - Siew Woh Choo
- Genome Institute of Singapore, Singapore 138672, Singapore
| | - Yontao Lu
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, California 95064, USA
| | - France Denoeud
- Grup de Recerca en Informática Biomèdica, Institut Municipal d’Investigació Mèdica/Universitat Pompeu Fabra, Passeig Marítim de la Barceloneta, 37-49, 08003, Barcelona, Catalonia, Spain
| | - Stylianos E. Antonarakis
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland
| | - Michael Snyder
- Molecular, Cellular & Developmental Biology Department, Yale University, New Haven, Connecticut 06520, USA
| | - Yijun Ruan
- Genome Institute of Singapore, Singapore 138672, Singapore
| | - Chia-Lin Wei
- Genome Institute of Singapore, Singapore 138672, Singapore
| | | | - Roderic Guigó
- Grup de Recerca en Informática Biomèdica, Institut Municipal d’Investigació Mèdica/Universitat Pompeu Fabra, Passeig Marítim de la Barceloneta, 37-49, 08003, Barcelona, Catalonia, Spain
- Center for Genomic Regulation, Passeig Marítim de la Barceloneta, 37-49, 08003, Barcelona, Catalonia, Spain
| | - Jennifer Harrow
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1HH, United Kingdom
| | - Mark B. Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
- Department of Computer Science, Yale University, New Haven, Connecticut 06520, USA
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
- Corresponding authors.E-mail ; fax (360) 838-7861.E-mail ; fax (360) 838-7861
| |
Collapse
|
28
|
Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, Kuehn MS, Taylor CM, Neph S, Koch CM, Asthana S, Malhotra A, Adzhubei I, Greenbaum JA, Andrews RM, Flicek P, Boyle PJ, Cao H, Carter NP, Clelland GK, Davis S, Day N, Dhami P, Dillon SC, Dorschner MO, Fiegler H, Giresi PG, Goldy J, Hawrylycz M, Haydock A, Humbert R, James KD, Johnson BE, Johnson EM, Frum TT, Rosenzweig ER, Karnani N, Lee K, Lefebvre GC, Navas PA, Neri F, Parker SCJ, Sabo PJ, Sandstrom R, Shafer A, Vetrie D, Weaver M, Wilcox S, Yu M, Collins FS, Dekker J, Lieb JD, Tullius TD, Crawford GE, Sunyaev S, Noble WS, Dunham I, Denoeud F, Reymond A, Kapranov P, Rozowsky J, Zheng D, Castelo R, Frankish A, Harrow J, Ghosh S, Sandelin A, Hofacker IL, Baertsch R, Keefe D, Dike S, Cheng J, Hirsch HA, Sekinger EA, Lagarde J, Abril JF, Shahab A, Flamm C, Fried C, Hackermüller J, Hertel J, Lindemeyer M, Missal K, Tanzer A, Washietl S, Korbel J, Emanuelsson O, Pedersen JS, Holroyd N, Taylor R, Swarbreck D, Matthews N, Dickson MC, Thomas DJ, Weirauch MT, Gilbert J, Drenkow J, Bell I, Zhao X, Srinivasan KG, Sung WK, Ooi HS, Chiu KP, Foissac S, Alioto T, Brent M, Pachter L, Tress ML, Valencia A, Choo SW, Choo CY, Ucla C, Manzano C, Wyss C, Cheung E, Clark TG, Brown JB, Ganesh M, Patel S, Tammana H, Chrast J, Henrichsen CN, Kai C, Kawai J, Nagalakshmi U, Wu J, Lian Z, Lian J, Newburger P, Zhang X, Bickel P, Mattick JS, Carninci P, Hayashizaki Y, Weissman S, Hubbard T, Myers RM, Rogers J, Stadler PF, Lowe TM, Wei CL, Ruan Y, Struhl K, Gerstein M, Antonarakis SE, Fu Y, Green ED, Karaöz U, Siepel A, Taylor J, Liefer LA, Wetterstrand KA, Good PJ, Feingold EA, Guyer MS, Cooper GM, Asimenos G, Dewey CN, Hou M, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Huang H, Zhang NR, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Seringhaus M, Church D, Rosenbloom K, Kent WJ, Stone EA, Batzoglou S, Goldman N, Hardison RC, Haussler D, Miller W, Sidow A, Trinklein ND, Zhang ZD, Barrera L, Stuart R, King DC, Ameur A, Enroth S, Bieda MC, Kim J, Bhinge AA, Jiang N, Liu J, Yao F, Vega VB, Lee CWH, Ng P, Shahab A, Yang A, Moqtaderi Z, Zhu Z, Xu X, Squazzo S, Oberley MJ, Inman D, Singer MA, Richmond TA, Munn KJ, Rada-Iglesias A, Wallerman O, Komorowski J, Fowler JC, Couttet P, Bruce AW, Dovey OM, Ellis PD, Langford CF, Nix DA, Euskirchen G, Hartman S, Urban AE, Kraus P, Van Calcar S, Heintzman N, Kim TH, Wang K, Qu C, Hon G, Luna R, Glass CK, Rosenfeld MG, Aldred SF, Cooper SJ, Halees A, Lin JM, Shulha HP, Zhang X, Xu M, Haidar JNS, Yu Y, Ruan Y, Iyer VR, Green RD, Wadelius C, Farnham PJ, Ren B, Harte RA, Hinrichs AS, Trumbower H, Clawson H, Hillman-Jackson J, Zweig AS, Smith K, Thakkapallayil A, Barber G, Kuhn RM, Karolchik D, Armengol L, Bird CP, de Bakker PIW, Kern AD, Lopez-Bigas N, Martin JD, Stranger BE, Woodroffe A, Davydov E, Dimas A, Eyras E, Hallgrímsdóttir IB, Huppert J, Zody MC, Abecasis GR, Estivill X, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VVB, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Koriabine M, Nefedov M, Osoegawa K, Yoshinaga Y, Zhu B, de Jong PJ. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 2007; 447:799-816. [PMID: 17571346 PMCID: PMC2212820 DOI: 10.1038/nature05874] [Citation(s) in RCA: 3798] [Impact Index Per Article: 223.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
Collapse
|