701
Chang CW, Cheng WC, Chen CR, Shu WY, Tsai ML, Huang CL, Hsu IC. Identification of human housekeeping genes and tissue-selective genes by microarray meta-analysis. PLoS One 2011; 6:e22859. [PMID: 21818400 PMCID: PMC3144958 DOI: 10.1371/journal.pone.0022859] [Citation(s) in RCA: 104] [Impact Index Per Article: 7.4] [Received: 04/20/2011] [Accepted: 06/29/2011] [Indexed: 01/26/2023] Open
Abstract
Background Categorizing protein-encoding transcriptomes of normal tissues into housekeeping genes and tissue-selective genes is a fundamental step toward studies of genetic functions and genetic associations to tissue-specific diseases. Previous studies have been mainly based on a few data sets with limited samples in each tissue, which restrained the representativeness of their identified genes, and resulted in low consensus among them. Results This study compiled 1,431 samples in 43 normal human tissues from 104 microarray data sets. We developed a new method to improve gene expression assessment, and showed that more than ten samples are needed to robustly identify the protein-encoding transcriptome of a tissue. We identified 2,064 housekeeping genes and 2,293 tissue-selective genes, and analyzed gene lists by functional enrichment analysis. The housekeeping genes are mainly involved in fundamental cellular functions, and the tissue-selective genes are strikingly related to functions and diseases corresponding to tissue-origin. We also compared agreements and related functions among our housekeeping genes and those of previous studies, and pointed out some reasons for the low consensuses. Conclusions The results indicate that sufficient samples have improved the identification of protein-encoding transcriptome of a tissue. Comprehensive meta-analysis has proved the high quality of our identified HK and TS genes. These results could offer a useful resource for future research on functional and genomic features of HK and TS genes.
Affiliation(s)
- Cheng-Wei Chang
- Department of Biomedical Engineering and Environmental Sciences, National Tsing Hua University, Hsinchu, Taiwan
- Wei-Chung Cheng
- Department of Biomedical Engineering and Environmental Sciences, National Tsing Hua University, Hsinchu, Taiwan
- Chaang-Ray Chen
- Department of Biomedical Engineering and Environmental Sciences, National Tsing Hua University, Hsinchu, Taiwan
- Wun-Yi Shu
- Institute of Statistics, National Tsing Hua University, Hsinchu, Taiwan
- Min-Lung Tsai
- Institute of Athletics, National Taiwan Sport University, Taichung, Taiwan
- Ching-Lung Huang
- Department of Biomedical Engineering and Environmental Sciences, National Tsing Hua University, Hsinchu, Taiwan
- Ian C. Hsu
- Department of Biomedical Engineering and Environmental Sciences, National Tsing Hua University, Hsinchu, Taiwan
702
Baker DA, Nolan T, Fischer B, Pinder A, Crisanti A, Russell S. A comprehensive gene expression atlas of sex- and tissue-specificity in the malaria vector, Anopheles gambiae. BMC Genomics 2011; 12:296. [PMID: 21649883 PMCID: PMC3129592 DOI: 10.1186/1471-2164-12-296] [Citation(s) in RCA: 145] [Impact Index Per Article: 10.4] [Received: 01/31/2011] [Accepted: 06/07/2011] [Indexed: 02/07/2023] Open
Abstract
Background The mosquito, Anopheles gambiae, is the primary vector of human malaria, a disease responsible for millions of deaths each year. To improve strategies for controlling transmission of the causative parasite, Plasmodium falciparum, we require a thorough understanding of the developmental mechanisms, physiological processes and evolutionary pressures affecting life-history traits in the mosquito. Identifying genes expressed in particular tissues or involved in specific biological processes is an essential part of this process. Results In this study, we present transcription profiles for ~82% of annotated Anopheles genes in dissected adult male and female tissues. The sensitivity afforded by examining dissected tissues found gene activity in an additional 20% of the genome that is undetected when using whole-animal samples. The somatic and reproductive tissues we examined each displayed patterns of sexually dimorphic and tissue-specific expression. By comparing expression profiles with Drosophila melanogaster we also assessed which genes are well conserved within the Diptera versus those that are more recently evolved. Conclusions Our expression atlas and associated publicly available database, the MozAtlas (http://www.tissue-atlas.org), provide information on the relative strength and specificity of gene expression in several somatic and reproductive tissues, isolated from a single strain grown under uniform conditions. The data will serve as a reference for other mosquito researchers by providing a simple method for identifying where genes are expressed in the adult. In addition, our resource will provide insights into the evolutionary diversity associated with gene expression levels among species.
Affiliation(s)
- Dean A Baker
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB1 3QA, UK.
703
Santure AW, Gratten J, Mossman JA, Sheldon BC, Slate J. Characterisation of the transcriptome of a wild great tit Parus major population by next generation sequencing. BMC Genomics 2011; 12:283. [PMID: 21635727 PMCID: PMC3125266 DOI: 10.1186/1471-2164-12-283] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.1] [Received: 12/21/2010] [Accepted: 06/02/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The recent development of next generation sequencing technologies has made it possible to generate very large amounts of sequence data in species with little or no genome information. Combined with the large phenotypic databases available for wild and non-model species, these data will provide an unprecedented opportunity to "genomicise" ecological model organisms and establish the genetic basis of quantitative traits in natural populations. RESULTS This paper describes the sequencing, de novo assembly and analysis from the transcriptome of eight tissues of ten wild great tits. Approximately 4.6 million sequences and 1.4 billion bases of DNA were generated and assembled into 95,979 contigs, one third of which aligned with known Taeniopygia guttata (zebra finch) and Gallus gallus (chicken) transcripts. The majority (78%) of the remaining contigs aligned within or very close to regions of the zebra finch genome containing known genes, suggesting that they represented precursor mRNA rather than untranscribed genomic DNA. More than 35,000 single nucleotide polymorphisms and 10,000 microsatellite repeats were identified. Eleven percent of contigs were expressed in every tissue, while twenty one percent of contigs were expressed in only one tissue. The function of those contigs with strong evidence for tissue specific expression and contigs expressed in every tissue was inferred from the gene ontology (GO) terms associated with these contigs; heart and pancreas had the highest number of highly tissue specific GO terms (21.4% and 28.5% respectively). CONCLUSIONS In summary, the transcriptomic data generated in this study will contribute towards efforts to assemble and annotate the great tit genome, as well as providing the markers required to perform gene mapping studies in wild populations.
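The two breadth figures quoted in this abstract (contigs expressed in every tissue versus contigs expressed in only one) can be computed from a simple presence table. The following is a minimal sketch, assuming a hypothetical mapping from contig identifier to the set of tissues in which it was detected; the function name, contig names, and toy data are illustrative and are not taken from the study.

```python
def breadth_summary(expressed_in):
    """Return (fraction expressed in every tissue, fraction expressed in one tissue).

    expressed_in: dict mapping a contig id to the set of tissues where it
    was detected (a hypothetical input format, assumed for illustration).
    """
    if not expressed_in:
        return 0.0, 0.0
    # The tissue panel is taken to be every tissue seen in the table.
    tissues = set().union(*expressed_in.values())
    n_contigs = len(expressed_in)
    in_all = sum(1 for t in expressed_in.values() if len(t) == len(tissues))
    in_one = sum(1 for t in expressed_in.values() if len(t) == 1)
    return in_all / n_contigs, in_one / n_contigs


# Toy data: four contigs across three tissues.
toy = {
    "contig_a": {"heart", "liver", "brain"},
    "contig_b": {"heart"},
    "contig_c": {"liver", "brain"},
    "contig_d": {"brain"},
}
frac_all, frac_one = breadth_summary(toy)
```

In real data, what counts as "detected" depends on a read-count or signal threshold, which this sketch leaves to the caller.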
Affiliation(s)
- Anna W Santure
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, S10 2TN, UK
- Jake Gratten
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, S10 2TN, UK
- Jim A Mossman
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, S10 2TN, UK
- Ben C Sheldon
- Edward Grey Institute, Department of Zoology, University of Oxford, Oxford, OX1 3PS, UK
- Jon Slate
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, S10 2TN, UK
704
Fischer JJ, Dalhoff C, Schrey AK, Graebner OY, Michaelis S, Andrich K, Glinski M, Kroll F, Sefkow M, Dreger M, Koester H. Dasatinib, imatinib and staurosporine capture compounds - Complementary tools for the profiling of kinases by Capture Compound Mass Spectrometry (CCMS). J Proteomics 2011; 75:160-8. [PMID: 21664307 DOI: 10.1016/j.jprot.2011.05.035] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Received: 03/16/2011] [Revised: 05/13/2011] [Accepted: 05/20/2011] [Indexed: 01/08/2023]
Abstract
Capture Compound Mass Spectrometry (CCMS) is a platform technology for the functional isolation of subproteomes. Here we report the synthesis of two new kinase Capture Compounds (CCs) based on the tyrosine-kinase specific inhibitors dasatinib and imatinib and compare their interaction profiles to that of our previously reported staurosporine-CCs. CCs are tri-functional molecules: they comprise a selectivity function (e.g. the small molecule or drug of interest) which interacts with target proteins, a photo-activatable reactivity function to covalently trap the interacting proteins, and a sorting function to isolate the CC-protein conjugates from complex biological samples for protein identification by liquid chromatography/mass spectrometry (LC-MS/MS). We present data of CCMS experiments from human HepG2 cells and compare the profiles of the kinases isolated with dasatinib, imatinib and staurosporine CC, respectively. Dasatinib and imatinib have a more selective kinase binding profile than staurosporine. Moreover, the new CCs allow isolation and identification of additional kinases, complementing the staurosporine CC. The family of kinase CCs will be a valuable tool for the proteomic profiling of this important protein class. Besides sets of expected kinases we identified additional specific interactors; these off-targets may be of relevance in view of the pharmacological profile of dasatinib and imatinib.
705
Greiner M, Kreutzer B, Jung V, Grobholz R, Hasenfus A, Stöhr RF, Tornillo L, Dudek J, Stöckle M, Unteregger G, Kamradt J, Wullich B, Zimmermann R. Silencing of the SEC62 gene inhibits migratory and invasive potential of various tumor cells. Int J Cancer 2011; 128:2284-95. [PMID: 20669223 DOI: 10.1002/ijc.25580] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Indexed: 11/09/2022]
Abstract
Sec62 is part of the protein translocation apparatus in the membrane of the endoplasmic reticulum (ER). In yeast, Sec62 participates in the post-translational translocation of proteins into the ER, but its function in mammals remains elusive. Previously we described the amplification and over-expression of the SEC62 gene in prostate cancer cell lines and the protein has been described as a potential target gene in prostate cancer. In the current study we show that in the tumor tissue of prostate cancer patients Sec62 protein levels are elevated compared with tumor-free tissue derived from the same patients or from prostates of control group patients and that the higher Sec62 protein content correlates with an increasing de-differentiation of the cells. Therefore, up-regulation of Sec62 protein content indeed is a phenomenon associated with prostate cancer progression. Analysis of a multi-tissue tumor array showed that in addition to prostate cancer, overproduction of Sec62 is observed in various other tumors, most significantly in tumors of the lung and the thyroid. To examine the tumor-related functions of Sec62, we silenced the SEC62 gene in the prostate cancer cell-line PC3 as well as in a set of other tumor cell-lines with two different siRNAs. In general, after silencing of SEC62 the cell migration and the invasive potential of the cells was blocked or at least dramatically reduced while cell viability was hardly affected. Thus, the SEC62 gene may indeed be considered as a target gene in the therapy of various tumors.
Affiliation(s)
- Markus Greiner
- Department of Medical Biochemistry and Molecular Biology, Saarland University, Homburg/Saar, Germany.
706
Bukurova YA, Nikitina IG, Khankin SL, Krasnov GS, Lisitsyn NA, Karpov VL, Beresten SF. Search for protein markers for serum diagnostics of tumors by analysis of microRNA expression profiles. Mol Biol 2011. [DOI: 10.1134/s0026893311020038] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 01/06/2023]
707
Podder S, Ghosh TC. Insights into the molecular correlates modulating functional compensation between monogenic and polygenic disease gene duplicates in human. Genomics 2011; 97:200-4. [PMID: 21281709 DOI: 10.1016/j.ygeno.2011.01.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Received: 10/29/2010] [Revised: 01/06/2011] [Accepted: 01/16/2011] [Indexed: 01/18/2023]
708
Weber CC, Hurst LD. Support for multiple classes of local expression clusters in Drosophila melanogaster, but no evidence for gene order conservation. Genome Biol 2011; 12:R23. [PMID: 21414197 PMCID: PMC3129673 DOI: 10.1186/gb-2011-12-3-r23] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Received: 01/14/2011] [Revised: 03/04/2011] [Accepted: 03/17/2011] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Gene order in eukaryotic genomes is not random, with genes with similar expression profiles tending to cluster. In yeasts, the model taxon for gene order analysis, such syntenic clusters of non-homologous genes tend to be conserved over evolutionary time. Whether similar clusters show gene order conservation in other lineages is, however, undecided. Here, we examine this issue in Drosophila melanogaster using high-resolution chromosome rearrangement data. RESULTS We show that D. melanogaster has at least three classes of expression clusters: first, as observed in mammals, large clusters of functionally unrelated housekeeping genes; second, small clusters of functionally related highly co-expressed genes; and finally, as previously defined by Spellman and Rubin, larger domains of co-expressed but functionally unrelated genes. The latter are, however, not independent of the small co-expression clusters and likely reflect a methodological artifact. While the small co-expression and housekeeping/essential gene clusters resemble those observed in yeast, in contrast to yeast, we see no evidence that any of the three cluster types are preserved as synteny blocks. If anything, adjacent co-expressed genes are more likely to become rearranged than expected. Again in contrast to yeast, in D. melanogaster, gene pairs with short intergene distance or in divergent orientations tend to have higher rearrangement rates. These findings are consistent with co-expression being partly due to shared chromatin environment. CONCLUSIONS We conclude that, while similar in terms of cluster types, gene order evolution has strikingly different patterns in yeasts and in D. melanogaster, although recombination is associated with gene order rearrangement in both.
Affiliation(s)
- Claudia C Weber
- Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
709
Chen SCC, Chuang TJ, Li WH. The relationships among microRNA regulation, intrinsically disordered regions, and other indicators of protein evolutionary rate. Mol Biol Evol 2011; 28:2513-20. [PMID: 21398349 DOI: 10.1093/molbev/msr068] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Indexed: 12/21/2022] Open
Abstract
Many indicators of protein evolutionary rate have been proposed, but some of them are interrelated. The purpose of this study is to disentangle their correlations. We assess the strength of each indicator by controlling for the other indicators under study. We find that the number of microRNA (miRNA) types that regulate a gene is the strongest rate indicator (a negative correlation), followed by disorder content (the percentage of disordered regions in a protein, a positive correlation); the strength of disorder content as a rate indicator is substantially increased after controlling for the number of miRNA types. By dividing proteins into lowly and highly intrinsically disordered proteins (L-IDPs and H-IDPs), we find that proteins interacting with more H-IDPs tend to evolve more slowly, which largely explains the previous observation of a negative correlation between the number of protein-protein interactions and evolutionary rate. Moreover, all of the indicators examined here, except for the number of miRNA types, have different strengths in L-IDPs and in H-IDPs. Finally, the number of phosphorylation sites is weakly correlated with the number of miRNA types, and its strength as a rate indicator is substantially reduced when other indicators are considered. Our study reveals the relative strength of each rate indicator and increases our understanding of protein evolution.
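Assessing one indicator "by controlling for the other indicators" is commonly done with partial correlation. The following is a minimal sketch of a first-order partial correlation (one control variable) built from Pearson coefficients. It is an illustration of the general idea, not the authors' actual procedure, and the variable names and numbers are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_corr(x, y, z):
    """Correlation between x and y after removing the linear effect of z."""
    r_xy, r_xz, r_yz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical per-gene values, invented for illustration only.
evolutionary_rate = [6, 5, 5, 3, 2, 1]
disorder_content = [2, 1, 4, 3, 6, 5]
mirna_types = [1, 2, 3, 4, 5, 6]

raw = pearson(evolutionary_rate, disorder_content)
controlled = partial_corr(evolutionary_rate, disorder_content, mirna_types)
```

Comparing `raw` with `controlled` shows how much of an apparent association is carried by the control variable. With several controls, the usual approach is to regress both variables on all controls and correlate the residuals.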
Affiliation(s)
- Sean Chun-Chang Chen
- Institute of BioMedical Informatics, National Yang-Ming University, Taipei, Taiwan
710
Yang L, Gaut BS. Factors that contribute to variation in evolutionary rate among Arabidopsis genes. Mol Biol Evol 2011; 28:2359-69. [PMID: 21389272 DOI: 10.1093/molbev/msr058] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.9] [Indexed: 12/23/2022] Open
Abstract
Surprisingly, few studies have described evolutionary rate variation among plant nuclear genes, with little investigation of the causes of rate variation. Here, we describe evolutionary rates for 11,492 ortholog pairs between Arabidopsis thaliana and A. lyrata and investigate possible contributors to rate variation among these genes. Rates of evolution at synonymous sites vary along chromosomes, suggesting that mutation rates vary on genomic scales, perhaps as a function of recombination rate. Rates of evolution at nonsynonymous sites correlate most strongly with expression patterns, but they also vary as to whether a gene is duplicated and retained after a whole-genome duplication (WGD) event. WGD genes evolve more slowly, on average, than nonduplicated genes and non-WGD duplicates. We hypothesize that levels and patterns of expression are not only the major determinants that explain nonsynonymous rate variation among genes but also a critical determinant of gene retention after duplication.
Affiliation(s)
- Liang Yang
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, USA
711
Wong A. Testing the effects of mating system variation on rates of molecular evolution in primates. Evolution 2011; 64:2779-85. [PMID: 20497215 DOI: 10.1111/j.1558-5646.2010.01038.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Indexed: 11/30/2022]
Abstract
Post-copulatory sexual selection has been proposed to drive the rapid evolution of reproductive proteins, and, more recently, to increase genome-wide mutation rates. Comparisons of rates of molecular evolution between lineages with different levels of female multiple mating represent a promising, but under-utilized, approach for testing the effects of sperm competition on sequence evolution. Here, I use comparisons between primate species with divergent mating systems to examine the effects of sperm competition on reproductive protein evolution, as well as on sex-averaged mutation rates. Rates of nonsynonymous substitution are higher for testis-specific genes along the chimpanzee lineage in comparison to the human lineage, consistent with expectations. However, the data reported here do not allow firm conclusions concerning the effects of mating system on genome-wide mutation rates, with different results obtained from different species pairs. Ultimately, comparative studies encompassing a range of mating systems and other life history traits will be required to make broad generalizations concerning the genomic effects of sperm competition.
Affiliation(s)
- Alex Wong
- Department of Biology, University of Ottawa, 30 Marie-Curie, Ottawa, Ontario, Canada.
712
Jjingo D, Huda A, Gundapuneni M, Mariño-Ramírez L, Jordan IK. Effect of the transposable element environment of human genes on gene length and expression. Genome Biol Evol 2011; 3:259-71. [PMID: 21362639 PMCID: PMC3070429 DOI: 10.1093/gbe/evr015] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Accepted: 02/22/2011] [Indexed: 11/17/2022] Open
Abstract
Independent lines of investigation have documented effects of both transposable elements (TEs) and gene length (GL) on gene expression. However, TE gene fractions are highly correlated with GL, suggesting that they cannot be considered independently. We evaluated the TE environment of human genes and GL jointly in an attempt to tease apart their relative effects. TE gene fractions and GL were compared with the overall level of gene expression and the breadth of expression across tissues. GL is strongly correlated with overall expression level but weakly correlated with the breadth of expression, confirming the selection hypothesis that attributes the compactness of highly expressed genes to selection for economy of transcription. However, TE gene fractions overall, and for the L1 family in particular, show stronger anticorrelations with expression level than GL, indicating that GL may not be the most important target of selection for transcriptional economy. These results suggest a specific mechanism, removal of TEs, by which highly expressed genes are selectively tuned for efficiency. MIR elements are the only family of TEs with gene fractions that show a positive correlation with tissue-specific expression, suggesting that they may provide regulatory sequences that help to control human gene expression. Consistent with this notion, MIR fractions are relatively enriched close to transcription start sites and associated with coexpression in specific sets of related tissues. Our results confirm the overall relevance of the TE environment to gene expression and point to distinct mechanisms by which different TE families may contribute to gene regulation.
Affiliation(s)
- Daudi Jjingo
- School of Biology, Georgia Institute of Technology
- Ahsan Huda
- School of Biology, Georgia Institute of Technology
- Madhumati Gundapuneni
- School of Biology, Georgia Institute of Technology
- Institute for Systems Biology, Seattle, Washington
- Leonardo Mariño-Ramírez
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
- PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia
- I. King Jordan
- School of Biology, Georgia Institute of Technology
- PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia
713
Abstract
CpG islands mark CpG-enriched regions in otherwise CpG-depleted vertebrate genomes. While the regulatory importance of CpG islands is widely accepted, it is little appreciated that CpG islands vary greatly in lengths. For example, CpG islands in the human genome vary ∼30-fold in their lengths. Here we report findings suggesting that the lengths of CpG islands have functional consequences. Specifically, we show that promoters associated with long CpG islands (long-CGI promoters) are distinct from other promoters. First, long-CGI promoters are uniquely associated with genes with an intermediate level of gene expression breadths. Notably, intermediate expression breadths require the most complex mode of gene regulation, from the standpoint of information content. Second, long-CGI promoters encode more RNA polymerase II (Polr2a) binding sites than other promoters. Third, the actual binding patterns of Polr2a occur in a more tissue-specific manner in long-CGI promoters compared to other CGI promoters. Moreover, long-CGI promoters contain the largest numbers of experimentally characterized transcription start sites compared to other promoters, and the types of transcription start sites in them are biased toward tissue-specific patterns of gene expression. Finally, long-CGI promoters are preferentially associated with genes involved in development and regulation. Together, these findings indicate that functionally relevant variations of CpG islands exist. By investigating consequences of certain CpG island traits, we can gain additional insights into the mechanism and evolution of regulatory complexity of gene expression.
714
Meisel RP. Towards a more nuanced understanding of the relationship between sex-biased gene expression and rates of protein-coding sequence evolution. Mol Biol Evol 2011; 28:1893-900. [PMID: 21239389 DOI: 10.1093/molbev/msr010] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.4] [Indexed: 01/26/2023] Open
Abstract
Genes that are differentially expressed between the sexes (sex-biased genes) are among the fastest evolving genes in animal genomes. The majority of sex-biased expression is attributable to genes that are primarily expressed in sex-limited reproductive tissues, and these reproductive genes are often rapidly evolving because of intra- and intersexual selection pressures. Additionally, studies of multiple taxa have revealed that genes with sex-biased expression are also expressed in a limited number of tissues. This is worth noting because narrowly expressed genes are known to evolve faster than broadly expressed genes. Therefore, it is not clear whether sex-biased genes are rapidly evolving because they have sexually dimorphic expression, because they are expressed in sex-limited reproductive tissues, or because they are narrowly expressed. To determine the extent to which other confounding variables can explain the rapid evolution of sex-biased genes, I analyzed the rates of evolution of sex-biased genes in Drosophila melanogaster and Mus musculus in light of tissue-specific measures of expression. I find that genes with sex-biased expression in somatic tissues shared by both sexes are often evolving faster than non-sex-biased genes, but this is best explained by the narrow expression profiles of sex-biased genes. Sex-biased genes in sex-limited tissues in D. melanogaster, however, evolve faster than other narrowly expressed genes. Therefore, the rapid evolution of sex-biased genes is limited only to those genes primarily expressed in sex-limited reproductive tissues.
Affiliation(s)
- Richard P Meisel
- Department of Molecular Biology and Genetics, Cornell University, USA.
715
HepatoNet1: a comprehensive metabolic reconstruction of the human hepatocyte for the analysis of liver physiology. Mol Syst Biol 2010; 6:411. [PMID: 20823849 PMCID: PMC2964118 DOI: 10.1038/msb.2010.62] [Citation(s) in RCA: 205] [Impact Index Per Article: 13.7] [Received: 12/30/2009] [Accepted: 07/08/2010] [Indexed: 02/08/2023] Open
Abstract
We present HepatoNet1, the first reconstruction of a comprehensive metabolic network of the human hepatocyte that is shown to accomplish a large canon of known metabolic liver functions. The network comprises 777 metabolites in six intracellular and two extracellular compartments and 2539 reactions, including 1466 transport reactions. It is based on the manual evaluation of >1500 original scientific research publications to warrant a high-quality evidence-based model. The final network is the result of an iterative process of data compilation and rigorous computational testing of network functionality by means of constraint-based modeling techniques. Taking the hepatic detoxification of ammonia as an example, we show how the availability of nutrients and oxygen may modulate the interplay of various metabolic pathways to allow an efficient response of the liver to perturbations of the homeostasis of blood compounds.
716
Jerby L, Shlomi T, Ruppin E. Computational reconstruction of tissue-specific metabolic models: application to human liver metabolism. Mol Syst Biol 2010; 6:401. [PMID: 20823844 PMCID: PMC2964116 DOI: 10.1038/msb.2010.56] [Citation(s) in RCA: 279] [Impact Index Per Article: 18.6] [Received: 01/18/2010] [Accepted: 06/25/2010] [Indexed: 12/18/2022] Open
Abstract
The first computational approach for the rapid generation of genome-scale tissue-specific models from a generic species model. A genome scale model of human liver metabolism, which is comprehensively tested and validated using cross-validation and the ability to carry out complex hepatic metabolic functions. The model's flux predictions are shown to correlate with flux measurements across a variety of hormonal and dietary conditions, and are successfully used to predict biomarker changes in genetic metabolic disorders, both with higher accuracy than the generic human model.
The study of normal human metabolism and its alterations is central to the understanding and treatment of a variety of human diseases, including diabetes, metabolic syndrome, neurodegenerative disorders, and cancer. A promising systems biology approach for studying human metabolism is through the development and analysis of large-scale stoichiometric network models of human metabolism. The reconstruction of these network models has followed two main paths: the former being the reconstruction of generic (non-tissue specific) models, characterizing the complete metabolic potential of human cells, based mostly on genomic data to trace enzyme-coding genes (Duarte et al, 2007; Ma et al, 2007), and the latter is the reconstruction of cell type- and tissue-specific models (Wiback and Palsson, 2002; Chatziioannou et al, 2003; Vo et al, 2004), based on a similar methodology to that described above, with the extra complexity of manual curation of literature evidence for the cell/system specificity of metabolic enzymes and pathways. On this background, we present in this study, to the best of our knowledge, the first computational approach for a rapid generation of genome-scale tissue-specific models. The method relies on integrating the previously reconstructed generic human models with a variety of high-throughput molecular ‘omics' data, including transcriptomic, proteomic, metabolomic, and phenotypic data, as well as literature-based knowledge, characterizing the tissue in hand (Figure 1). Hence, it can be readily used to quite rapidly build and use a large array of human tissue-specific models. The resulting model satisfies stoichiometric, mass-balance, and thermodynamic constraints. 
It serves as a functional metabolic network that can then be used to explore the metabolic state of a tissue under various genetic and physiological conditions, simulating enzymatic inhibition or drug applications through standard constraint-based modeling methods, without requiring additional context-specific molecular data. We applied this approach to build a genome-scale model of liver metabolism, which was then comprehensively tested and validated. The model is shown to simulate complex hepatic metabolic functions and to depict the pathological alterations caused by urea cycle deficiencies. The liver model was applied to predict measured intracellular metabolic fluxes given measured metabolite uptake and secretion rates under different hepatic metabolic conditions. The predictions were tested using a comprehensive set of flux measurements performed by Chan et al (2003), showing that the liver model obtained more accurate predictions than the original, generic human model (an overall prediction accuracy of 0.67 versus 0.46). Furthermore, it was applied to identify metabolic biomarkers for liver inborn errors of metabolism, again outperforming the predictions generated by the generic human model (accuracy of 0.67 versus 0.59). From a biotechnological standpoint, the liver model generated here can serve as a basis for future studies aiming to optimize the functioning of bioartificial liver devices. Applying the method to rapidly construct metabolic models of other human tissues may lead to further clinical insights, e.g., concerning means for metabolic salvage of ischemic heart and brain tissues.
Finally, the application of the new method is not limited to human modeling; it can be used to generate tissue models for any multi-tissue organism for which a generic model exists, such as Mus musculus (Quek and Nielsen, 2008; Sheikh et al, 2005) and the model plant Arabidopsis thaliana (Poolman et al, 2009).
The computational study of human metabolism has been advanced with the advent of the first generic (non-tissue specific) stoichiometric model of human metabolism. In this study, we present a new algorithm for rapid reconstruction of tissue-specific genome-scale models of human metabolism. The algorithm generates a tissue-specific model from the generic human model by integrating a variety of tissue-specific molecular data sources, including literature-based knowledge, transcriptomic, proteomic, metabolomic and phenotypic data. Applying the algorithm, we constructed the first genome-scale stoichiometric model of hepatic metabolism. The model is verified using standard cross-validation procedures, and through its ability to carry out hepatic metabolic functions. The model's flux predictions correlate with flux measurements across a variety of hormonal and dietary conditions, and improve upon the predictive performance obtained using the original, generic human model (prediction accuracy of 0.67 versus 0.46). Finally, the model better predicts biomarker changes in genetic metabolic disorders than the generic human model (accuracy of 0.67 versus 0.59). The approach presented can be used to construct other human tissue-specific models, and be applied to other organisms.
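The mass-balance constraint mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' reconstruction algorithm; it only shows the steady-state condition S·v = 0 used in standard constraint-based modeling, on a hypothetical three-reaction toy network (the matrix, the flux values and the function names are assumptions made for illustration).

```python
from typing import List

def mass_balance_residual(S: List[List[float]], v: List[float]) -> List[float]:
    """Return S·v, the net production rate of each metabolite (one entry per row of S)."""
    return [sum(s_ij * v_j for s_ij, v_j in zip(row, v)) for row in S]

def is_steady_state(S: List[List[float]], v: List[float], tol: float = 1e-9) -> bool:
    """True if every metabolite is produced and consumed at the same rate."""
    return all(abs(r) <= tol for r in mass_balance_residual(S, v))

# Hypothetical toy network (not from the paper):
#   R1: uptake -> A,   R2: A -> B,   R3: B -> secretion
# Rows are metabolites (A, B); columns are reactions (R1, R2, R3).
S_toy = [
    [1.0, -1.0, 0.0],   # A is produced by R1 and consumed by R2
    [0.0, 1.0, -1.0],   # B is produced by R2 and consumed by R3
]

print(is_steady_state(S_toy, [2.0, 2.0, 2.0]))  # balanced flux distribution
print(is_steady_state(S_toy, [2.0, 1.0, 1.0]))  # A accumulates, so not balanced
```

In practice a genome-scale model adds flux bounds and an optimization objective on top of this condition; dedicated constraint-based modeling packages handle those steps.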
Affiliation(s)
- Livnat Jerby
- The Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.
|
717
|
Bone morphogenetic proteins: a critical review. Cell Signal 2010; 23:609-20. [PMID: 20959140 DOI: 10.1016/j.cellsig.2010.10.003] [Citation(s) in RCA: 512] [Impact Index Per Article: 34.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2010] [Revised: 09/14/2010] [Accepted: 10/01/2010] [Indexed: 12/14/2022]
Abstract
Bone Morphogenetic Proteins (BMPs) are potent growth factors belonging to the Transforming Growth Factor Beta superfamily. To date over 20 members have been identified in humans with varying functions during processes such as embryogenesis, skeletal formation, hematopoiesis and neurogenesis. Though their functions have been identified, less is known regarding levels of regulation at the extracellular matrix, membrane surface, and receptor activation. Further, current models of activation lack the integration of these regulatory mechanisms. This review focuses on the different levels of regulation, ranging from the release of BMPs into the extracellular components to receptor activation for different BMPs. It also highlights areas in which research is lacking or contradictory.
|
718
|
Whitaker HC, Kote-Jarai Z, Ross-Adams H, Warren AY, Burge J, George A, Bancroft E, Jhavar S, Leongamornlert D, Tymrakiewicz M, Saunders E, Page E, Mitra A, Mitchell G, Lindeman GJ, Evans DG, Blanco I, Mercer C, Rubinstein WS, Clowes V, Douglas F, Hodgson S, Walker L, Donaldson A, Izatt L, Dorkins H, Male A, Tucker K, Stapleton A, Lam J, Kirk J, Lilja H, Easton D, IMPACT Study Steering Committee, IMPACT Study Collaborators, UK GPCS Collaborators, Cooper C, Eeles R, Neal DE. The rs10993994 risk allele for prostate cancer results in clinically relevant changes in microseminoprotein-beta expression in tissue and urine. PLoS One 2010; 5:e13363. [PMID: 20967219 PMCID: PMC2954177 DOI: 10.1371/journal.pone.0013363] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2010] [Accepted: 09/01/2010] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Microseminoprotein-beta (MSMB) regulates apoptosis, and genome-wide association studies have linked the rs10993994 single nucleotide polymorphism in the MSMB promoter to an increased risk of developing prostate cancer. The promoter location of the risk allele, and its ability to reduce promoter activity, suggested that the rs10993994 risk allele could result in lowered MSMB in benign tissue leading to increased prostate cancer risk. METHODOLOGY/PRINCIPAL FINDINGS MSMB expression in benign and malignant prostate tissue was examined using immunohistochemistry and compared with the rs10993994 genotype. Urinary MSMB concentrations were determined by ELISA and correlated with urinary PSA, the presence or absence of cancer, rs10993994 genotype and age of onset. MSMB levels in prostate tissue and urine were greatly reduced with tumourigenesis. Urinary MSMB was better than urinary PSA at differentiating men with prostate cancer at all Gleason grades. The high risk allele was associated with heterogeneity of MSMB staining and loss of MSMB in both tissue and urine in benign prostate. CONCLUSIONS These data show that some high risk alleles discovered using genome-wide association studies produce phenotypic effects with potential clinical utility. We provide the first link between a low penetrance polymorphism for prostate cancer and a potential test in human tissue and bodily fluids. There is potential to develop tissue and urinary MSMB as a biomarker of prostate cancer risk, diagnosis and disease monitoring.
Affiliation(s)
- Hayley C Whitaker
- Uro-Oncology Research Group, CRUK Cambridge Research Institute, Cambridge, United Kingdom.
|
719
|
Weber CC, Hurst LD. Intronic AT skew is a defendable proxy for germline transcription but does not predict crossing-over or protein evolution rates in Drosophila melanogaster. J Mol Evol 2010; 71:415-26. [PMID: 20938653 DOI: 10.1007/s00239-010-9395-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2010] [Accepted: 09/17/2010] [Indexed: 01/28/2023]
Abstract
Recent evidence suggests that germline transcription may affect both protein evolutionary rates, possibly mediated by repair processes, and recombination rates, possibly mediated by chromatin and epigenetic modification. Here, we test these propositions in Drosophila melanogaster. The challenge for such analyses is to provide defendable measures of germline gene expression. Intronic AT skew is a good candidate measure as it is thought to be a consequence, at least in part, of transcription-coupled repair. Prior evidence suggests that intronic AT skew in D. melanogaster is not affected by proximity to intron extremities and differs between transcribed DNA and flanking sequence. We now also establish that intronic AT skew is a defendable proxy for germline expression as (a) it is more similar than expected by chance between introns of the same gene (which is not accounted for by physical proximity), (b) is correlated with male germline expression, and (c) is more pronounced in broadly expressed genes. Furthermore, (d) a trend for intronic skew to differ between 3' and 5' ends of genes is particular to broadly expressed genes. Finally, (e) controlling for physical distance, introns of proximate genes are most different in skew if they have different tissue specificity. We find that intronic AT skew, employed as a proxy for germline transcription, correlates neither with recombination rates nor with the rate of protein evolution. We conclude that there is no prima facie evidence that germline expression modulates recombination rates or monotonically affects protein evolution rates in D. melanogaster.
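As a small illustration of the measure discussed in this abstract, the sketch below computes AT skew for a single sequence. It assumes the commonly used definition (A − T)/(A + T); the abstract does not state the exact formula the authors used, and the function name and the example sequence are hypothetical.

```python
def at_skew(sequence: str) -> float:
    """Return (A - T) / (A + T) for a DNA sequence; bases other than A and T are ignored."""
    seq = sequence.upper()
    a = seq.count("A")
    t = seq.count("T")
    if a + t == 0:
        raise ValueError("sequence contains no A or T bases")
    return (a - t) / (a + t)

# Hypothetical intron fragment, for illustration only.
example_intron = "GTAAGTATTTTCTTTAG"
print(at_skew(example_intron))
```

A negative value indicates an excess of T over A on the strand supplied, so the strand orientation of the input matters when comparing introns.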
Affiliation(s)
- Claudia C Weber
- Department of Biology and Biochemistry, University of Bath, Bath, UK
|
720
|
Ekblom R, French L, Slate J, Burke T. Evolutionary analysis and expression profiling of zebra finch immune genes. Genome Biol Evol 2010; 2:781-90. [PMID: 20884724 PMCID: PMC2975445 DOI: 10.1093/gbe/evq061] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/27/2010] [Indexed: 01/11/2023] Open
Abstract
Genes of the immune system are generally considered to evolve rapidly due to host-parasite coevolution. They are therefore of great interest in evolutionary biology and molecular ecology. In this study, we manually annotated 144 avian immune genes from the zebra finch (Taeniopygia guttata) genome and conducted evolutionary analyses of these by comparing them with their orthologs in the chicken (Gallus gallus). Genes classified as immune receptors showed elevated dN/dS ratios compared with other classes of immune genes. Immune genes in general also appear to be evolving more rapidly than other genes, as inferred from a higher dN/dS ratio compared with the rest of the genome. Furthermore, ten genes (of 27) for which sequence data were available from at least three bird species showed evidence of positive selection acting on specific codons. From transcriptome data of eight different tissues, we found evidence for expression of 106 of the studied immune genes, with primary expression of most of these in bursa, blood, and spleen. These immune-related genes showed a more tissue-specific expression pattern than other genes in the zebra finch genome. Several of the avian immune genes investigated here provide strong candidates for in-depth studies of molecular adaptation in birds.
Affiliation(s)
- Robert Ekblom
- University of Sheffield, Department of Animal and Plant Sciences, Sheffield, UK.
|
721
|
Shu XO, Long J, Cai Q, Qi L, Xiang YB, Cho YS, Tai ES, Li X, Lin X, Chow WH, Go MJ, Seielstad M, Bao W, Li H, Cornelis MC, Yu K, Wen W, Shi J, Han BG, Sim XL, Liu L, Qi Q, Kim HL, Ng DPK, Lee JY, Kim YJ, Li C, Gao YT, Zheng W, Hu FB. Identification of new genetic risk variants for type 2 diabetes. PLoS Genet 2010; 6:e1001127. [PMID: 20862305 PMCID: PMC2940731 DOI: 10.1371/journal.pgen.1001127] [Citation(s) in RCA: 160] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2010] [Accepted: 08/17/2010] [Indexed: 12/19/2022] Open
Abstract
Although more than 20 genetic susceptibility loci have been reported for type 2 diabetes (T2D), most reported variants have small to moderate effects and account for only a small proportion of the heritability of T2D, suggesting that the majority of inter-person genetic variation in this disease remains to be determined. We conducted a multistage, genome-wide association study (GWAS) within the Asian Consortium of Diabetes to search for T2D susceptibility markers. From 590,887 SNPs genotyped in 1,019 T2D cases and 1,710 controls selected from Chinese women in Shanghai, we selected the top 2,100 SNPs that were not in linkage disequilibrium (r² < 0.2) with known T2D loci for in silico replication in three T2D GWAS conducted among European Americans, Koreans, and Singapore Chinese. The 5 most promising SNPs were genotyped in an independent set of 1,645 cases and 1,649 controls from Shanghai, and 4 of them were further genotyped in 1,487 cases and 3,316 controls from 2 additional Chinese studies. Consistent associations across all studies were found for rs1359790 (13q31.1), rs10906115 (10p13), and rs1436955 (15q22.2) with P-values (per allele OR, 95%CI) of 6.49×10⁻⁹ (1.15, 1.10–1.20), 1.45×10⁻⁸ (1.13, 1.08–1.18), and 7.14×10⁻⁷ (1.13, 1.08–1.19), respectively, in combined analyses of 9,794 cases and 14,615 controls. Our study provides strong evidence for a novel T2D susceptibility locus at 13q31.1 and the presence of new independent risk variants near regions (10p13 and 15q22.2) reported by previous GWAS. Type 2 diabetes, a complex disease affecting more than a billion people worldwide, is believed to be caused by both environmental and genetic factors. Although some studies have shown that certain genes may make some people more susceptible to type 2 diabetes than others, the genes reported to date have only a small effect and account for a small proportion of type 2 diabetes cases.
Furthermore, few of these studies have been conducted in Asian populations, although Asians are known to be more susceptible to insulin resistance than people living in Western countries, and incidence of type 2 diabetes has been increasing alarmingly in Asian countries. We conducted a multi-stage study involving 9,794 type 2 diabetes cases and 14,615 controls, predominantly Asians, to discover genes related to susceptibility to type 2 diabetes. We identified 3 genetic regions that are related to increased risk of type 2 diabetes.
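To make the "per allele OR, 95%CI" notation in this abstract concrete, here is a minimal sketch that computes an allelic odds ratio with a Woolf (log-based) confidence interval from a 2×2 table of allele counts. This is a simplification under stated assumptions: the study itself most likely estimated odds ratios with regression models and meta-analysis, and the counts and function name below are hypothetical, not data from the paper.

```python
import math
from typing import Tuple

def allelic_odds_ratio(case_risk: int, case_other: int,
                       control_risk: int, control_other: int,
                       z: float = 1.96) -> Tuple[float, float, float]:
    """Return (odds ratio, lower bound, upper bound) using Woolf's method.

    All four allele counts must be positive; z = 1.96 gives an approximate 95% interval.
    """
    odds_ratio = (case_risk * control_other) / (case_other * control_risk)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(1 / case_risk + 1 / case_other
                          + 1 / control_risk + 1 / control_other)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical allele counts, for illustration only.
or_, lo, hi = allelic_odds_ratio(case_risk=1200, case_other=800,
                                 control_risk=1100, control_other=900)
print(f"OR = {or_:.2f}, 95% CI {lo:.2f} to {hi:.2f}")
```

An interval that excludes 1 corresponds to an association that is nominally significant at the 5% level, which is far weaker than the genome-wide thresholds reported in the abstract.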
Affiliation(s)
- Xiao Ou Shu
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, Tennessee, USA.
|
722
|
Hasinoff BB, Patel D. The lack of target specificity of small molecule anticancer kinase inhibitors is correlated with their ability to damage myocytes in vitro. Toxicol Appl Pharmacol 2010; 249:132-9. [PMID: 20832415 DOI: 10.1016/j.taap.2010.08.026] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2010] [Revised: 08/24/2010] [Accepted: 08/26/2010] [Indexed: 10/19/2022]
Abstract
Many new targeted small molecule anticancer kinase inhibitors are actively being developed. However, the clinical use of some kinase inhibitors has been shown to result in cardiotoxicity. In most cases the mechanisms by which they exert their cardiotoxicity are not well understood. We have used large scale profiling data on 8 FDA-approved tyrosine kinase inhibitors and 10 other kinase inhibitors to a panel of 317 kinases in order to correlate binding constants and kinase inhibitor binding selectivity scores with kinase inhibitor-induced damage to neonatal rat cardiac myocytes. The 18 kinase inhibitors that were the subject of this study were: canertinib, dasatinib, dovitinib, erlotinib, flavopiridol, gefitinib, imatinib, lapatinib, midostaurin, motesanib, pazopanib, sorafenib, staurosporine, sunitinib, tandutinib, tozasertib, vandetanib and vatalanib. The combined tyrosine kinase and serine-threonine kinase selectivity scores were highly correlated with the myocyte-damaging effects of the kinase inhibitors. This result suggests that myocyte damage was due to a lack of target selectivity to binding of both tyrosine kinases and serine-threonine kinases, and was not due to binding to either group specifically. Finally, the strength of kinase inhibitor binding for 290 kinases was examined for correlations with myocyte damage. Kinase inhibitor binding was significantly correlated with myocyte damage for 12 kinases. Thus, myocyte damage may be multifactorial in nature with the inhibition of a number of kinases involved in producing kinase inhibitor-induced myocyte damage.
Affiliation(s)
- Brian B Hasinoff
- Faculty of Pharmacy, Apotex Centre, University of Manitoba, 750 McDermot Avenue, Winnipeg, Manitoba R3E0T5, Canada.
|
723
|
Meurs KM, Mauceli E, Lahmers S, Acland GM, White SN, Lindblad-Toh K. Genome-wide association identifies a deletion in the 3' untranslated region of striatin in a canine model of arrhythmogenic right ventricular cardiomyopathy. Hum Genet 2010; 128:315-24. [PMID: 20596727 PMCID: PMC2962869 DOI: 10.1007/s00439-010-0855-y] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2010] [Accepted: 06/18/2010] [Indexed: 02/08/2023]
Abstract
Arrhythmogenic right ventricular cardiomyopathy (ARVC) is a familial cardiac disease characterized by ventricular arrhythmias and sudden cardiac death. It is most frequently inherited as an autosomal dominant trait with incomplete and age-related penetrance and variable clinical expression. The human disease is most commonly associated with a causative mutation in one of several genes encoding desmosomal proteins. We have previously described a spontaneous canine model of ARVC in the boxer dog. We phenotyped adult boxer dogs for ARVC by performing physical examination, echocardiogram and ambulatory electrocardiogram. Genome-wide association using the canine 50k SNP array identified several regions of association, of which the strongest resided on chromosome 17. Fine mapping and direct DNA sequencing identified an 8-bp deletion in the 3' untranslated region (UTR) of the Striatin gene on chromosome 17 in association with ARVC in the boxer dog. Evaluation of the secondary structure of the 3' UTR demonstrated that the deletion affects a stem loop structure of the mRNA and expression analysis identified a reduction in Striatin mRNA. Dogs that were homozygous for the deletion had a more severe form of disease based on a significantly higher number of ventricular premature complexes. Immunofluorescence studies localized Striatin to the intercalated disc region of the cardiac myocyte and co-localized it to three desmosomal proteins, Plakophilin-2, Plakoglobin and Desmoplakin, all involved in the pathogenesis of ARVC in human beings. We suggest that Striatin may serve as a novel candidate gene for human ARVC.
Affiliation(s)
- Kathryn M Meurs
- Washington State University College of Veterinary Medicine, Pullman, USA.
|
724
|
Gorni C, Garino C, Iacuaniello S, Castiglioni B, Stella A, Restelli GL, Pagnacco G, Mariani P. Transcriptome analysis to identify differential gene expression affecting meat quality in heavy Italian pigs. Anim Genet 2010; 42:161-71. [PMID: 20726855 DOI: 10.1111/j.1365-2052.2010.02098.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Suppressive subtractive hybridization (SSH) was used to analyse the muscle transcriptome and identify genes affecting meat quality within an Italian pig population of Large White and Landrace purebred individuals. Seven phenotypes were recorded at slaughter: dorsal fat thickness, ham fat thickness, ham fat coverage, muscle compactness, marbling, meat colour and colour uniformity. Two subtractive libraries were created from longissimus dorsi tissue of selected pigs with extreme phenotypes for meat quality. Eighty-four differentially expressed ESTs were identified, which showed homology to expressed pig sequences and/or to genomic pig sequences produced within the pig genome project. Sixty-eight sequences were mapped on the pig genome, and most of these sequences co-localized with the same chromosomal positions as QTLs that have been previously identified for meat quality. Thirty sequences, including eight matching known genes previously related to muscle metabolic pathways, were selected to statistically validate their differential expression. Association analysis and t-test results indicated that 28 ESTs of the 30 analysed were associated with phenotypes investigated here and have significant differential expression levels (P≤ 0.05) between the two tails of the phenotypic distribution.
Affiliation(s)
- C Gorni
- Parco Tecnologico Padano, 26900 Lodi, Italy. Dipartimento di Scienze e Tecnologie Veterinarie per la Sicurezza Alimentare, Università Degli Studi di Milano, 20134 Milano, Italy. Istituto di Biologia e Biotecnologia Agraria, CNR, 20133 Milano, Italy
|
725
|
Yang TL, Guo Y, Zhang LS, Tian Q, Yan H, Papasian CJ, Recker RR, Deng HW. Runs of homozygosity identify a recessive locus 12q21.31 for human adult height. J Clin Endocrinol Metab 2010; 95:3777-82. [PMID: 20466785 PMCID: PMC2913044 DOI: 10.1210/jc.2009-1715] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
BACKGROUND Runs of homozygosity (ROHs) have recently been proposed to have potential recessive significance for complex traits. Human adult height is a classic complex trait with heritability estimated up to 90%, and recessive loci that contribute to adult height variation have been identified. METHODS Using the Affymetrix 500K array set, we performed a genome-wide ROHs analysis to identify genetic loci for adult height in a discovery sample including 998 unrelated Caucasian subjects from the Midwest United States. For the significant ROHs identified, we replicated these findings in a family-based sample of 8385 Caucasian subjects from the Framingham Heart Study (FHS). RESULTS Our results revealed one ROH, located in 12q21.31, that had a strong association with adult height variation both in the discovery (P = 6.69×10⁻⁶) and replication samples (P = 5.40×10⁻⁵). We further validated the presence of this ROH using the HapMap sample. CONCLUSION Our findings open a new avenue for identifying loci with recessive contributions to adult height variation. Further molecular and functional studies are needed to explore and clarify the potential mechanism.
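The idea of a run of homozygosity can be sketched in a few lines. The example below is a simplified illustration, not the authors' procedure: it scans an ordered list of diploid genotype calls and reports stretches of consecutive homozygous SNPs of at least a chosen length. Real ROH analyses typically also consider physical length, SNP density and tolerance for missing or heterozygous calls; the function name, the threshold and the genotype data are all hypothetical.

```python
from typing import List, Tuple

def find_roh(genotypes: List[str], min_len: int) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) index pairs of homozygous runs with at least min_len SNPs.

    Each genotype is a two-character string such as "AA" (homozygous) or "AG" (heterozygous).
    """
    runs = []
    start = None  # index where the current homozygous run began, if any
    for i, g in enumerate(genotypes):
        homozygous = len(g) == 2 and g[0] == g[1]
        if homozygous:
            if start is None:
                start = i
        else:
            # A heterozygous (or malformed) call ends the current run.
            if start is not None and i - start >= min_len:
                runs.append((start, i - 1))
            start = None
    # Close a run that extends to the last SNP.
    if start is not None and len(genotypes) - start >= min_len:
        runs.append((start, len(genotypes) - 1))
    return runs

# Hypothetical genotype calls along one chromosome, for illustration only.
calls = ["AA", "AB", "GG", "CC", "TT", "AG", "CC"]
print(find_roh(calls, min_len=3))
```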
Affiliation(s)
- Tie-Lin Yang
- Key Laboratory of Biomedical Information Engineering, Ministry of Education and Institute of Molecular Genetics, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
|
726
|
Planas J, Serrat JM. Gene promoter evolution targets the center of the human protein interaction network. PLoS One 2010; 5:e11476. [PMID: 20628608 PMCID: PMC2900212 DOI: 10.1371/journal.pone.0011476] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2010] [Accepted: 06/11/2010] [Indexed: 02/06/2023] Open
Abstract
Assessing the contribution of promoters and coding sequences to gene evolution is an important step toward discovering the major genetic determinants of human evolution. Many specific examples have revealed the evolutionary importance of cis-regulatory regions. However, the relative contribution of regulatory and coding regions to the evolutionary process and whether systemic factors differentially influence their evolution remains unclear. To address these questions, we carried out an analysis at the genome scale to identify signatures of positive selection in human proximal promoters. Next, we examined whether genes with positively selected promoters (Prom+ genes) show systemic differences with respect to a set of genes with positively selected protein-coding regions (Cod+ genes). We found that the number of genes in each set was not significantly different (8.1% and 8.5%, respectively). Furthermore, a functional analysis showed that, in both cases, positive selection affects almost all biological processes and only a few genes of each group are located in enriched categories, indicating that promoters and coding regions are not evolutionarily specialized with respect to gene function. On the other hand, we show that the topology of the human protein network has a different influence on the molecular evolution of proximal promoters and coding regions. Notably, Prom+ genes have an unexpectedly high centrality when compared with a reference distribution (P = 0.008, for Eigenvalue centrality). Moreover, the frequency of Prom+ genes increases from the periphery to the center of the protein network (P = 0.02, for the logistic regression coefficient). This means that gene centrality does not constrain the evolution of proximal promoters, unlike the case with coding regions, and further indicates that the evolution of proximal promoters is more efficient in the center of the protein network than in the periphery. 
These results show that proximal promoters have had a systemic contribution to human evolution by increasing the participation of central genes in the evolutionary process.
Affiliation(s)
- Jordi Planas
- Bioinformatics and Medical Statistics Group, Department of Systems Biology, Universitat de Vic, Vic, Spain
- Josep M. Serrat
- Bioinformatics and Medical Statistics Group, Department of Systems Biology, Universitat de Vic, Vic, Spain
|
727
|
Schlessinger A, Matsson P, Shima JE, Pieper U, Yee SW, Kelly L, Apeltsin L, Stroud RM, Ferrin TE, Giacomini KM, Sali A. Comparison of human solute carriers. Protein Sci 2010; 19:412-28. [PMID: 20052679 DOI: 10.1002/pro.320] [Citation(s) in RCA: 85] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Solute carriers are eukaryotic membrane proteins that control the uptake and efflux of solutes, including essential cellular compounds, environmental toxins, and therapeutic drugs. Solute carriers can share similar structural features despite weak sequence similarities. Identification of sequence relationships among solute carriers is needed to enhance our ability to model individual carriers and to elucidate the molecular mechanisms of their substrate specificity and transport. Here, we describe a comprehensive comparison of solute carriers. We link the proteins using sensitive profile-profile alignments and two classification approaches, including similarity networks. The clusters are analyzed in view of substrate type, transport mode, organism conservation, and tissue specificity. Solute carrier families with similar substrates generally cluster together, despite exhibiting relatively weak sequence similarities. In contrast, some families cluster together with no apparent reason, revealing unexplored relationships. We demonstrate computationally and experimentally the functional overlap between representative members of these families. Finally, we identify four putative solute carriers in the human genome. The solute carriers include a biomedically important group of membrane proteins that is diverse in sequence and structure. The proposed classification of solute carriers, combined with experiment, reveals new relationships among the individual families and identifies new solute carriers. The classification scheme will inform future attempts directed at modeling the structures of the solute carriers, a prerequisite for describing the substrate specificities of the individual families.
Affiliation(s)
- Avner Schlessinger
- Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biosciences, University of California, San Francisco, California.
|
728
|
Wu X, Qi X. Genes encoding hub and bottleneck enzymes of the Arabidopsis metabolic network preferentially retain homeologs through whole genome duplication. BMC Evol Biol 2010; 10:145. [PMID: 20478072 PMCID: PMC2880986 DOI: 10.1186/1471-2148-10-145] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2010] [Accepted: 05/18/2010] [Indexed: 11/10/2022] Open
Abstract
Background Whole genome duplication (WGD) occurs widely in angiosperm evolution. It raises the intriguing question of how interacting networks of genes cope with this dramatic evolutionary event. Results In a study of the Arabidopsis metabolic network, we assigned each enzyme (node) topological centrality measures (in-degree, out-degree and betweenness) to quantify its centrality in the network. The Arabidopsis metabolic network is highly modular and separated into 11 interconnected modules, which correspond well to the functional metabolic pathways. The enzymes with higher in-out degree and betweenness (defined as hub and bottleneck enzymes, respectively) tend to be more conserved and preferentially retain homeologs after WGD. Moreover, the simultaneous retention of homeologs encoding enzymes which catalyze consecutive steps in a pathway is highly favored and easily achieved, and enzyme-enzyme interactions contribute to the retention of one-third of WGD enzymes. Conclusions Our analyses indicate that the hub and bottleneck enzymes of the metabolic network obtain great benefits from WGD, and that this event grants clear evolutionary advantages in adaptation to different environments.
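The three centrality measures named in this abstract can be sketched on a toy directed graph. The code below is a generic illustration, assuming an unweighted directed network stored as adjacency lists with every node present as a key and no duplicate edges; it uses Brandes' algorithm for unnormalized betweenness. The graph and the function names are hypothetical and are not the authors' network or pipeline.

```python
from collections import deque
from typing import Dict, List, Tuple

Graph = Dict[str, List[str]]

def degree_centralities(graph: Graph) -> Tuple[Dict[str, int], Dict[str, int]]:
    """Return (in_degree, out_degree) for every node of a directed graph."""
    out_deg = {n: len(nbrs) for n, nbrs in graph.items()}
    in_deg = {n: 0 for n in graph}
    for nbrs in graph.values():
        for m in nbrs:
            in_deg[m] += 1
    return in_deg, out_deg

def betweenness(graph: Graph) -> Dict[str, float]:
    """Unnormalized shortest-path betweenness (Brandes' algorithm, unweighted, directed)."""
    score = {n: 0.0 for n in graph}
    for s in graph:
        order = []                                # nodes in order of non-decreasing distance
        preds = {n: [] for n in graph}            # predecessors on shortest paths from s
        sigma = {n: 0 for n in graph}             # number of shortest paths from s
        dist = {n: -1 for n in graph}
        sigma[s], dist[s] = 1, 0
        queue = deque([s])
        while queue:                              # breadth-first search from s
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {n: 0.0 for n in graph}
        while order:                              # accumulate dependencies, farthest first
            w = order.pop()
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    return score

# Hypothetical four-enzyme linear pathway, for illustration only.
toy = {"E1": ["E2"], "E2": ["E3"], "E3": ["E4"], "E4": []}
print(degree_centralities(toy))
print(betweenness(toy))
```

In this toy pathway the two interior nodes carry all the shortest paths between other nodes, which is the sense in which a high-betweenness enzyme acts as a bottleneck.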
Affiliation(s)
- Xudong Wu
- the Institute of Botany, Chinese Academy of Sciences, Beijing, China
|
729
|
Rao YS, Wang ZF, Chai XW, Wu GZ, Zhou M, Nie QH, Zhang XQ. Selection for the compactness of highly expressed genes in Gallus gallus. Biol Direct 2010; 5:35. [PMID: 20465857 PMCID: PMC2883972 DOI: 10.1186/1745-6150-5-35] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2009] [Accepted: 05/14/2010] [Indexed: 11/10/2022] Open
Abstract
Background Coding sequence (CDS) length, gene size, and intron length vary within a genome and among genomes. Previous studies in diverse organisms, including human, D. melanogaster, C. elegans, S. cerevisiae, and Arabidopsis thaliana, indicated that expression level is negatively related to gene size, CDS length, and intron length. Different models, such as the selection for economy model, the genomic design model, and mutational bias hypotheses, have been proposed to explain this observation. Which model best explains the observation remains unsettled. The chicken (Gallus gallus) is an important model organism that bridges the evolutionary gap between mammals and other vertebrates. Because, like D. melanogaster, the chicken has a larger effective population size, selection on the chicken genome is expected to be more effective in increasing protein synthesis efficiency. Therefore, in this study the chicken was used as a model organism to elucidate the interaction between gene features and expression pattern under selection pressure. Results Based on different technologies, we gathered expression data for nuclear protein-coding, single-splicing genes from the Gallus gallus genome and compared them with gene parameters. We found that gene size, CDS length, first intron length, average intron length, and total intron length are significantly negatively correlated with expression level and expression breadth. Tissue specificity is positively correlated with the first intron length but negatively correlated with the average intron length, and is not correlated with CDS length or protein domain number. Comparison analyses showed that ubiquitously expressed genes and narrowly expressed genes with similar expression levels do not differ in compactness. Our data provided evidence that the genomic design model cannot, at least in part, explain our observations.
We grouped all somatic-tissue-specific genes (n = 1105) and compared the first intron length and the average intron length between highly expressed genes (top 5% of expressed genes) and weakly expressed genes (bottom 5% of expressed genes). We found that the first intron length and the average intron length in highly expressed genes do not differ from those in weakly expressed genes. We also compared ubiquitously expressed genes and narrowly expressed somatic genes with similar expression levels. Our data demonstrated that ubiquitously expressed genes are less compact than narrowly expressed genes with similar expression levels. These observations cannot be explained by mutational bias hypotheses either. We also found that the significant trend between gene compactness and expression level was not affected by local mutational biases. We argue that the selection for economy model is the most likely explanation for the relationship between gene expression and gene characteristics in the chicken genome. Conclusion Natural selection appears to favor the compactness of highly expressed genes in the chicken genome. This observation can be explained by the selection for economy model. Reviewers This article was reviewed by Dr. Gavin Huttley, Dr. Liran Carmel (nominated by Dr. Eugene V. Koonin) and Dr. Araxi Urrutia (nominated by Dr. Laurence D. Hurst).
Affiliation(s)
- You S Rao
- Department of Biological Technology, Jiangxi Educational Institute, Nanchang, Jiangxi, China
|
730
|
Metta M, Schlötterer C. Non-random genomic integration - an intrinsic property of retrogenes in Drosophila? BMC Evol Biol 2010; 10:114. [PMID: 20426838 PMCID: PMC2879276 DOI: 10.1186/1471-2148-10-114] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Received: 09/07/2009] [Accepted: 04/28/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The Drosophila X-chromosome shows a significant underrepresentation of genes with male-biased gene expression (demasculinization). This trend is matched by retrogenes, which typically have a male biased gene expression pattern and show a significant movement bias from X-chromosomes to autosomes. It is currently assumed that these patterns are best explained by selection, either mediated by male meiotic sex chromosome inactivation (MSCI) or sexually antagonistic forces. We scrutinized the evolutionary dynamics of retroposition by focusing on retrogenes for which the parental copy has degenerated. RESULTS Consistent with a functional substitution of the degenerated gene by the retrogene, patterns of sequence evolution and gene expression were similar between retroposed and parental genes. Like previous studies, our set of retrogenes showed a significant movement off the X-chromosome. In contrast to data sets where retroposition caused gene duplication, the genes in our study showed primarily female-biased or unbiased gene expression. CONCLUSIONS Based on our results, the biased transposition pattern cannot be explained by MSCI and probably not by sexual antagonism. Rather, we propose that the movement away from the X-chromosome represents a general property of retroposition in Drosophila.
Affiliation(s)
- Muralidhar Metta
- Institut für Populationsgenetik, Vetmeduni vienna, Veterinärplatz 1, 1210 Wien, Austria
- Unit of Animal Genomics, GIGA-R, B34 +1, University of Liège, 4000 Liège, Belgium
- Christian Schlötterer
- Institut für Populationsgenetik, Vetmeduni vienna, Veterinärplatz 1, 1210 Wien, Austria
|
731
|
Hellgren O, Ekblom R. Evolution of a cluster of innate immune genes (beta-defensins) along the ancestral lines of chicken and zebra finch. Immunome Res 2010; 6:3. [PMID: 20359324 PMCID: PMC3161384 DOI: 10.1186/1745-7580-6-3] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Received: 11/10/2009] [Accepted: 04/01/2010] [Indexed: 12/24/2022] Open
Abstract
Background Avian β-defensins (AvBDs) represent a group of innate immune genes with broad antimicrobial activity. Within the chicken genome, previous work identified 14 AvBDs in a cluster on chromosome three. The release of a second bird genome, the zebra finch, allows us to study the comparative evolutionary history of these gene clusters between two species that shared a common ancestor about 100 million years ago. Results A phylogenetic analysis of the β-defensin gene clusters in the chicken and the zebra finch identified several cases of gene duplication and gene loss along their ancestral lines. In the zebra finch genome a cluster of 22 AvBD genes was identified, all located within 125 Kbp on chromosome three. Ten of the 22 genes were found to be highly conserved with orthologous genes in the chicken genome. The remaining 12 genes were all located within a cluster of 58 Kbp and are suggested to be a result of recent gene duplication events that occurred after the galliformes-passeriformes split (G-P split). Within the chicken genome, AvBD6 was found to be a duplication of AvBD7, whereas the gene AvBD14 seems to have been lost along the ancestral line of the zebra finch. The duplicated β-defensin genes have had a significantly higher accumulation of non-synonymous over synonymous substitutions compared to the genes that have not undergone duplication since the G-P split. The expression patterns of avian β-defensin genes seem to be well conserved between chicken and zebra finch. Conclusion The genomic comparisons of the β-defensin gene clusters of the chicken and zebra finch illuminate the evolutionary history of this gene complex. Along their ancestral lines, several gene duplication events have occurred in the passerine line after the galliformes-passeriformes split, giving rise to 12 novel genes, compared to a single duplication event in the galliformes line.
After the duplication events, the duplicated genes have been subject to a relaxed selection pressure compared to the non-duplicated genes, thus supporting models of evolution by gene duplication.
Affiliation(s)
- Olof Hellgren
- Edward Grey Institute, Department of Zoology, South Parks Road, Oxford, OX1 3PS, UK.
|
732
|
Ekblom R, Balakrishnan CN, Burke T, Slate J. Digital gene expression analysis of the zebra finch genome. BMC Genomics 2010; 11:219. [PMID: 20359325 PMCID: PMC2996964 DOI: 10.1186/1471-2164-11-219] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Received: 11/09/2009] [Accepted: 04/01/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In order to understand patterns of adaptation and molecular evolution it is important to quantify both variation in gene expression and nucleotide sequence divergence. Gene expression profiling in non-model organisms has recently been facilitated by the advent of massively parallel sequencing technology. Here we investigate tissue-specific gene expression patterns in the zebra finch (Taeniopygia guttata) with special emphasis on the genes of the major histocompatibility complex (MHC). RESULTS Almost 2 million 454-sequencing reads from cDNA of six different tissues were assembled and analysed. A total of 11,793 zebra finch transcripts were represented in this EST data, indicating a transcriptome coverage of about 65%. There was a positive correlation between the tissue specificity of gene expression and the non-synonymous to synonymous nucleotide substitution ratio of genes, suggesting that genes with a specialised function are evolving at a higher rate (or with less constraint) than genes with a more general function. In line with this, there was also a negative correlation between overall expression levels and expression specificity of contigs. We found evidence for expression of 10 different genes related to the MHC. MHC genes showed relatively tissue-specific expression levels and were in general primarily expressed in spleen. Several MHC genes, including MHC class I, also showed expression in brain. Furthermore, for all genes with highest levels of expression in spleen there was an overrepresentation of several gene ontology terms related to immune function. CONCLUSIONS Our study highlights the usefulness of next-generation sequence data for quantifying gene expression in the genome as a whole as well as in specific candidate genes. Overall, the data show predicted patterns of gene expression profiles and molecular evolution in the zebra finch genome. Expression of MHC genes, in particular, corresponds well with expression patterns in other vertebrates.
Affiliation(s)
- Robert Ekblom
- Department of Animal and Plant Sciences, University of Sheffield, Alfred Denny Building, Western Bank, Sheffield S10 2TN, UK
- Department of Population Biology and Conservation Biology, Uppsala University, Norbyvägen 18 D, SE-75236 Uppsala, Sweden
- Christopher N Balakrishnan
- Institute for Genomic Biology and Department of Molecular & Cellular Biology, University of Illinois, 1206 West Gregory Drive MC-195, Urbana, IL 61801, USA
- Terry Burke
- Department of Animal and Plant Sciences, University of Sheffield, Alfred Denny Building, Western Bank, Sheffield S10 2TN, UK
- Jon Slate
- Department of Animal and Plant Sciences, University of Sheffield, Alfred Denny Building, Western Bank, Sheffield S10 2TN, UK
|
733
|
Deep RNA sequencing at single base-pair resolution reveals high complexity of the rice transcriptome. Genome Res 2010; 20:646-54. [PMID: 20305017 DOI: 10.1101/gr.100677.109] [Citation(s) in RCA: 370] [Impact Index Per Article: 24.7] [Indexed: 01/08/2023]
Abstract
Understanding the dynamics of the eukaryotic transcriptome is essential for studying the complexity of transcriptional regulation and its impact on phenotype. However, comprehensive studies of transcriptomes at single base resolution are rare, even for modern organisms, and lacking for rice. Here, we present the first transcriptome atlas for eight organs of cultivated rice. Using high-throughput paired-end RNA-seq, we unambiguously detected transcripts expressed at an extremely low level, as well as a substantial number of novel transcripts, exons, and untranslated regions. An analysis of alternative splicing in the rice transcriptome revealed that alternative cis-splicing occurred in approximately 33% of all rice genes, far more than previously reported. In addition, we identified 234 putative chimeric transcripts that seem to be produced by trans-splicing, indicating that transcript fusion events are more common than expected. In-depth analysis revealed a multitude of fusion transcripts that might be by-products of alternative splicing. Validation and chimeric transcript structural analysis provided evidence that some of these transcripts are likely to be functional in the cell. Taken together, our data provide extensive evidence that transcriptional regulation in rice is vastly more complex than previously believed.
|
734
|
Vinogradov AE. Human transcriptome nexuses: basic-eukaryotic and metazoan. Genomics 2010; 95:345-54. [PMID: 20298777 DOI: 10.1016/j.ygeno.2010.03.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Received: 09/15/2009] [Revised: 03/01/2010] [Accepted: 03/08/2010] [Indexed: 01/10/2023]
Abstract
Using a new approach, I analysed the human transcriptome coexpression network and revealed two large-scale nexuses. Besides gene coexpression, each nexus is characterized by a combination of gene evolutionary origin, function and among-tissues expression breadth. The first nexus contains mostly genes of pre-metazoan origin, which are widely expressed and have cell-centred functions. The second nexus is enriched in genes of metazoan origin, which are expressed more narrowly and have organism-centred functions. The revealed nexuses are supported by asymmetry in the distribution of transcription factor targets between them. Within the metazoan nexus, there is a subnexus that is more pronounced in the nervous tissues and is enriched in gene regulatory complexity. It mostly contains genes related to the nervous system, cell communication and multicellular organism processes and development. The revealed nexuses indicate a dichotomy in transcriptional regulation and can provide a framework for further functional genomics studies.
|
735
|
Stelzer G, Inger A, Olender T, Iny-Stein T, Dalah I, Harel A, Safran M, Lancet D. GeneDecks: paralog hunting and gene-set distillation with GeneCards annotation. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2010; 13:477-87. [PMID: 20001862 DOI: 10.1089/omi.2009.0069] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.1] [Indexed: 11/12/2022]
Abstract
Sophisticated genomic navigation strongly benefits from a capacity to establish a similarity metric among genes. GeneDecks is a novel analysis tool that provides such a metric by highlighting shared descriptors between pairs of genes, based on the rich annotation within the GeneCards compendium of human genes. The current implementation addresses information about pathways, protein domains, Gene Ontology (GO) terms, mouse phenotypes, mRNA expression patterns, disorders, drug relationships, and sequence-based paralogy. GeneDecks has two modes: (1) Paralog Hunter, which seeks functional paralogs based on combinatorial similarity of attributes; and (2) Set Distiller, which ranks descriptors by their degree of sharing within a given gene set. GeneDecks enables the elucidation of unsuspected putative functional paralogs, and a refined scrutiny of various gene-sets (e.g., from high-throughput experiments) for discovering relevant biological patterns.
Affiliation(s)
- Gil Stelzer
- Departments of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel.
|
736
|
Chen FC, Chen CJ, Li WH, Chuang TJ. Gene family size conservation is a good indicator of evolutionary rates. Mol Biol Evol 2010; 27:1750-8. [PMID: 20194423 PMCID: PMC2908708 DOI: 10.1093/molbev/msq055] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Indexed: 01/05/2023] Open
Abstract
The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human–chimpanzee–macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.
Affiliation(s)
- Feng-Chi Chen
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Miaoli County, Taiwan
|
737
|
Gene expression profiles identify inflammatory signatures in dendritic cells. PLoS One 2010; 5:e9404. [PMID: 20195376 PMCID: PMC2827557 DOI: 10.1371/journal.pone.0009404] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Received: 09/25/2009] [Accepted: 02/04/2010] [Indexed: 12/12/2022] Open
Abstract
Dendritic cells (DCs) constitute a heterogeneous group of antigen-presenting leukocytes important in the activation of both innate and adaptive immunity. We studied the gene expression patterns of DCs incubated with reagents inducing their activation or inhibition. Total RNA was isolated from DCs and gene expression profiling was performed with oligonucleotide microarrays. Using a supervised learning algorithm based on Random Forest, we generated a molecular signature of inflammation from a training set of 77 samples. We then validated this molecular signature in a testing set of 38 samples. Supervised analysis identified a set of 44 genes that distinguished very accurately between inflammatory and non-inflammatory samples. The diagnostic performance of the signature genes was assessed against an independent set of samples by qRT-PCR. Our findings suggest that the gene expression signature of DCs can provide a molecular classification for use in the selection of anti-inflammatory or adjuvant molecules with specific effects on DC activity.
|
738
|
Whitaker HC, Warren AY, Eeles R, Kote-Jarai Z, Neal DE. The potential value of microseminoprotein-beta as a prostate cancer biomarker and therapeutic target. Prostate 2010; 70:333-40. [PMID: 19790236 DOI: 10.1002/pros.21059] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Indexed: 12/27/2022]
Abstract
BACKGROUND Recent genome-wide association studies have shown an association of a SNP two base pairs upstream of the 5' UTR of the microseminoprotein-beta (MSMB) gene with an increased risk of developing prostate cancer, re-igniting interest in its protein product, MSMB. METHODS As one of the most abundant prostatic proteins, MSMB can be reliably detected in tissue and serum. RESULTS It has been consistently shown that MSMB expression is high in normal and benign prostate tissue and lowered or lost in prostate cancer, suggesting that it might be a useful tissue biomarker for prostate cancer diagnosis and that its levels in serum may be useful as a marker for prognosis. Members of the cysteine-rich secretory protein family and laminin receptors have been shown to bind MSMB at the cell surface and in serum, thereby regulating apoptosis. Thus, in the benign prostate, MSMB regulates cell growth, but when MSMB is lost during tumourigenesis, cells are able to grow in a more uncontrolled manner. Both full-length MSMB and a short peptide comprising amino acids 31-45 have been tested for potential therapeutic benefit in mouse models and humans. CONCLUSIONS MSMB has potential as a biomarker of prostate cancer development, progression and recurrence and potentially as a target for therapeutic intervention.
Affiliation(s)
- Hayley C Whitaker
- Uro-Oncology Research Group, CRUK Cambridge Research Institute, Li Ka Shing Centre, Robinson Way, Cambridge, UK.
|
739
|
Chatterjee A, Dutta S, Mukherjee S, Mukherjee N, Chandra S, Mukherjee A, Sinha S, Panda CK, Chaudhuri K, Mukhopadyay K. Differential allelic distribution of V-ets erythroblastosis virus E26 oncogene homolog2 (ETS2) functional polymorphisms in different group of patients. Gene Expr 2010; 15:61-73. [PMID: 21526717 PMCID: PMC6043833 DOI: 10.3727/105221611x12973615737541] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/24/2022]
Abstract
V-ets erythroblastosis virus E26 oncogene homolog2 (ETS2), located at chromosome 21 and overexpressed in Down's syndrome (DS), has known cancer regulatory functions. Because leukemia is of common occurrence in DS subjects while solid tumors are rare, we have explored the role of ETS2 functional genetic polymorphisms in this differential oncological development. In silico methods were used for identifying deleterious SNPs, tagged SNPs, and linkage disequilibrium followed by genotyping of 14 SNPs in Indo-Caucasoid individuals (N=668). Significantly different allelic frequencies for rs457705, rs1051420, and rs1051425 were observed in Indian controls (N=149) compared to other ethnic groups. A heterozygous "T" insertion, between chromosomal contig positions 40195541 and 40195542, was observed in DS subjects and their parents. rs461155 showed significant allelic and genotypic association in breast and oral cancer patients. Significantly higher occurrence of G-C haplotype (rs461155-rs1051425) was also observed in these patients compared to DS and leukemic patients. This is the first report on this type of allelic discrimination pattern of ETS2 under different disease conditions. From the data obtained it may be proposed that allelic discrimination of deleterious SNPs in ETS2 may play a regulatory role in the differential development of malignancy in DS subjects.
Affiliation(s)
- Arpita Chatterjee
- Manovikas Biomedical Research and Diagnostic Centre, MRIH, Kolkata, India
|
740
|
Tamasi V, Juvan P, Beer M, Rozman D, Meyer UA. Transcriptional activation of PPARalpha by phenobarbital in the absence of CAR and PXR. Mol Pharm 2009; 6:1573-81. [PMID: 19708687 DOI: 10.1021/mp9001552] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Indexed: 11/28/2022]
Abstract
The nuclear receptors CAR (constitutive androstane receptor) and PXR (pregnane X receptor) mediate the effects of phenobarbital on gene transcription. To investigate the relative contribution of these nuclear receptors to the expression of specific genes we studied the effect of phenobarbital in livers of wild type, CAR(-/-), PXR(-/-) and CAR/PXR(-/-) knockout mice. Spotted Steroltalk v1 cDNA arrays were applied containing probes for genes involved in drug metabolism, sterol biosynthesis, steroid synthesis/transport and heme synthesis. In the absence of CAR and PXR, phenobarbital unexpectedly induced mRNAs of several nuclear receptors, including PPARalpha and its target genes Cyp4a10 and Cyp4a14. Interestingly, in primary cultures of hepatocytes isolated from CAR/PXR(-/-) knockout mice, phenobarbital increased HNF-4alpha levels. In further experiments in these hepatocyte cultures we provide evidence that phenobarbital directly induces transcription of the PPARalpha gene via its HNF-4alpha response element, and indirectly by lack of inhibitory crosstalk of AMPK, CAR and PXR with HNF-4alpha. Our results provide further insight into CAR and PXR-independent effects of phenobarbital and the crosstalk between different nuclear receptor signaling pathways.
Affiliation(s)
- Viola Tamasi
- Genome Scale Biology, Biozentrum, University of Basel, Basel, Switzerland.
|
741
|
Mitra S, Alnabulsi A, Secombes CJ, Bird S. Identification and characterization of the transcription factors involved in T-cell development, t-bet, stat6 and foxp3, within the zebrafish, Danio rerio. FEBS J 2009; 277:128-47. [PMID: 19961539 DOI: 10.1111/j.1742-4658.2009.07460.x] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.2] [Indexed: 01/27/2023]
Abstract
The discovery of cytokines expressed by T-helper 1 (Th1), Th2, Th17 and T-regulatory (T(reg)) cells has prompted speculation that these types of responses may exist in fish, arising early in vertebrate evolution. In this investigation, we cloned three zebrafish transcription factors, T-box expressed in T cells (t-bet), signal transducer and activator of transcription 6 (stat6) and fork-head box p3 (foxp3), in which two transcripts are present, that are important in the development of a number of these cell types. They were found within the zebrafish genome, using a synteny approach in the case of t-bet and foxp3. Multiple alignments of zebrafish t-bet, stat6 and foxp3 amino acids with known vertebrate homologues revealed regions of high conservation, subsequently identified to be protein domains important in the functioning of these transcription factors. The gene organizations of zebrafish t-bet and foxp3 were identical to those of the human genes, with the second foxp3 transcript lacking exons 5, 6, 7 and 8. Zebrafish stat6 (21 exons and 20 introns) was slightly different from the human gene, which contained 22 exons and 21 introns. Immunostimulation of zebrafish head kidney and spleen cells with phytohaemagglutinin, lipopolysaccharide or Poly I:C, showed a correlation between the expression of t-bet, stat6 and foxp3 with other genes involved in Th and T(reg) responses using quantitative PCR. These transcription factors, together with many of the cytokines that are expressed by different T-cell subtypes, will aid future investigations into the Th and T(reg) cell types that exist in teleosts.
Affiliation(s)
- Suman Mitra
- Scottish Fish Immunology Research Centre, School of Biological Sciences, University of Aberdeen, Aberdeen, UK
|
742
|
Exploring the Differences in Evolutionary Rates between Monogenic and Polygenic Disease Genes in Human. Mol Biol Evol 2009; 27:934-41. [DOI: 10.1093/molbev/msp297] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Indexed: 12/24/2022] Open
|
743
|
Demeure O, Duby C, Desert C, Assaf S, Hazard D, Guillou H, Lagarrigue S. Liver X receptor α regulates fatty acid synthase expression in chicken. Poult Sci 2009; 88:2628-35. [DOI: 10.3382/ps.2009-00258] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Indexed: 11/20/2022] Open
|
744
|
Santagata S, Maire CL, Idbaih A, Geffers L, Correll M, Holton K, Quackenbush J, Ligon KL. CRX is a diagnostic marker of retinal and pineal lineage tumors. PLoS One 2009; 4:e7932. [PMID: 19936203 PMCID: PMC2775954 DOI: 10.1371/journal.pone.0007932] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Received: 04/24/2009] [Accepted: 10/05/2009] [Indexed: 11/19/2022] Open
Abstract
Background CRX is a homeobox transcription factor whose expression and function are critical to maintain retinal and pineal lineage cells and their progenitors. To determine the biologic and diagnostic potential of CRX in human tumors of the retina and pineal, we examined its expression in multiple settings. Methodology/Principal Findings Using in situ hybridization and immunohistochemistry we show that Crx RNA and protein expression are exquisitely lineage restricted to retinal and pineal cells during normal mouse and human development. Gene expression profiling analysis of a wide range of human cancers and cancer cell lines also supports that CRX RNA is highly lineage restricted in cancer. Immunohistochemical analysis of 22 retinoblastomas and 13 pineal parenchymal tumors demonstrated strong expression of CRX in over 95% of these tumors. Importantly, CRX was not detected in the majority of tumors considered in the differential diagnosis of pineal region tumors (n = 78). The notable exception was medulloblastoma, 40% of which exhibited CRX expression in a heterogeneous pattern readily distinguished from that seen in retino-pineal tumors. Conclusions/Significance These findings describe new potential roles for CRX in human cancers and highlight the general utility of lineage-restricted transcription factors in cancer biology. They also identify CRX as a sensitive and specific clinical marker and a potential lineage-dependent therapeutic target in retinoblastoma and pineoblastoma.
Affiliation(s)
- Sandro Santagata
- Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts, United States of America
- Department of Pathology, Children's Hospital Boston, Boston, Massachusetts, United States of America
- Department of Pathology, Harvard Medical School, Boston, Massachusetts, United States of America
- Cecile L. Maire
- Department of Pathology, Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Medical Oncology and Center for Molecular Oncologic Pathology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Ahmed Idbaih
- Department of Pathology, Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Medical Oncology and Center for Molecular Oncologic Pathology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Lars Geffers
- Department of Genes and Behavior, Max-Planck-Institute of Biophysical Chemistry, Goettingen, Germany
- Mick Correll
- Center for Cancer Computational Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Kristina Holton
- Center for Cancer Computational Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- John Quackenbush
- Center for Cancer Computational Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Keith L. Ligon
- Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts, United States of America
- Department of Pathology, Children's Hospital Boston, Boston, Massachusetts, United States of America
- Department of Pathology, Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Medical Oncology and Center for Molecular Oncologic Pathology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
|
745
|
Salameh MA, Robinson JL, Navaneetham D, Sinha D, Madden BJ, Walsh PN, Radisky ES. The amyloid precursor protein/protease nexin 2 Kunitz inhibitor domain is a highly specific substrate of mesotrypsin. J Biol Chem 2009; 285:1939-49. [PMID: 19920152 DOI: 10.1074/jbc.m109.057216] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Indexed: 12/12/2022] Open
Abstract
The amyloid precursor protein (APP) is a ubiquitously expressed transmembrane adhesion protein and the progenitor of amyloid-beta peptides. The major splice isoforms of APP expressed by most tissues contain a Kunitz protease inhibitor domain; secreted APP containing this domain is also known as protease nexin 2 and potently inhibits serine proteases, including trypsin and coagulation factors. The atypical human trypsin isoform mesotrypsin is resistant to inhibition by most protein protease inhibitors and cleaves some inhibitors at a substantially accelerated rate. Here, in a proteomic screen to identify potential physiological substrates of mesotrypsin, we find that APP/protease nexin 2 is selectively cleaved by mesotrypsin within the Kunitz protease inhibitor domain. In studies employing the recombinant Kunitz domain of APP (APPI), we show that mesotrypsin cleaves selectively at the Arg(15)-Ala(16) reactive site bond, with kinetic constants approaching those of other proteases toward highly specific protein substrates. Finally, we show that cleavage of APPI compromises its inhibition of other serine proteases, including cationic trypsin and factor XIa, by 2 orders of magnitude. Because APP/protease nexin 2 and mesotrypsin are coexpressed in a number of tissues, we suggest that processing by mesotrypsin may ablate the protease inhibitory function of APP/protease nexin 2 in vivo and may also modulate other activities of APP/protease nexin 2 that involve the Kunitz domain.
Affiliation(s)
- Moh'd A Salameh
- Department of Cancer Biology, Mayo Clinic Cancer Center, Jacksonville, Florida 32224, USA
|
746
|
Amaral PP, Neyt C, Wilkins SJ, Askarian-Amiri ME, Sunkin SM, Perkins AC, Mattick JS. Complex architecture and regulated expression of the Sox2ot locus during vertebrate development. RNA (NEW YORK, N.Y.) 2009; 15:2013-2027. [PMID: 19767420 PMCID: PMC2764477 DOI: 10.1261/rna.1705309] [Citation(s) in RCA: 173] [Impact Index Per Article: 10.8] [Received: 04/24/2009] [Accepted: 08/18/2009] [Indexed: 05/28/2023]
Abstract
The Sox2 gene is a key regulator of pluripotency embedded within an intron of a long noncoding RNA (ncRNA), termed Sox2 overlapping transcript (Sox2ot), which is transcribed in the same orientation. However, this ncRNA remains uncharacterized. Here we show that Sox2ot has multiple transcription start sites associated with genomic features that indicate regulated expression, including highly conserved elements (HCEs) and chromatin marks characteristic of gene promoters. To identify biological processes in which Sox2ot may be involved, we analyzed its expression in several developmental systems, compared to expression of Sox2. We show that Sox2ot is a stable transcript expressed in mouse embryonic stem cells, which, like Sox2, is down-regulated upon induction of embryoid body (EB) differentiation. However, in contrast to Sox2, Sox2ot is up-regulated during EB mesoderm-lineage differentiation. In adult mouse, Sox2ot isoforms were detected in tissues where Sox2 is expressed, as well as in different tissues, supporting independent regulation of expression of the ncRNA. Sox2dot, an isoform of Sox2ot transcribed from a distal HCE located >500 kb upstream of Sox2, was detected exclusively in the mouse brain, with enrichment in regions of adult neurogenesis. In addition, Sox2ot isoforms are transcribed from HCEs upstream of Sox2 in other vertebrates, including in several regions of the human brain. We also show that Sox2ot is dynamically regulated during chicken and zebrafish embryogenesis, consistently associated with central nervous system structures. These observations provide insight into the structure and regulation of the Sox2ot gene, and suggest conserved roles for Sox2ot orthologs during vertebrate development.
Affiliation(s)
- Paulo P Amaral
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, The University of Queensland, St Lucia, QLD 4072, Australia
|
747
|
Wang Y, Azuma Y, Friedman DB, Coffey RJ, Neufeld KL. Novel association of APC with intermediate filaments identified using a new versatile APC antibody. BMC Cell Biol 2009; 10:75. [PMID: 19845967 PMCID: PMC2774295 DOI: 10.1186/1471-2121-10-75] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Received: 02/10/2009] [Accepted: 10/21/2009] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND As a key player in suppression of colon tumorigenesis, Adenomatous Polyposis Coli (APC) has been widely studied to determine its cellular functions. However, inconsistencies of commercially available APC antibodies have limited the exploration of APC function. APC is implicated in spindle formation by direct interactions with tubulin and microtubule-binding protein EB1. APC also interacts with the actin cytoskeleton to regulate cell polarity. Until now, interaction of APC with the third cytoskeletal element, intermediate filaments, has remained unexamined. RESULTS We generated an APC antibody (APC-M2 pAb) raised against the 15 amino acid repeat region, and verified its reliability in applications including immunoprecipitation, immunoblotting, and immunofluorescence in cultured cells and tissue. Utilizing this APC-M2 pAb, we immunoprecipitated endogenous APC and its binding proteins from colon epithelial cells expressing wild-type APC. Using Liquid Chromatography Tandem Mass Spectrometry (LC-MS/MS), we identified 42 proteins in complex with APC, including beta-catenin and intermediate filament (IF) proteins lamin B1 and keratin 81. Association of lamin B1 with APC in cultured cells and human colonic tissue was verified by co-immunoprecipitation and colocalization. APC also colocalized with keratins and remained associated with IF proteins throughout a sequential extraction procedure. CONCLUSION We introduce a versatile APC antibody that is useful for cell/tissue immunostaining, immunoblotting and immunoprecipitation. We also present evidence for interactions between APC and IFs, independent of actin filaments and microtubules. Our results suggest that APC associates with all three major components of the cytoskeleton, thus expanding potential roles for APC in the regulation of cytoskeletal integrity.
Affiliation(s)
- Yang Wang
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS, USA
- Yoshiaki Azuma
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS, USA
- David B Friedman
- Mass Spectrometry Research Center, Vanderbilt University Medical Center, Nashville, TN, USA
- Robert J Coffey
- Departments of Cell and Developmental Biology and Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
- Kristi L Neufeld
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS, USA
|
748
|
Seo M, Lee WH, Suk K. Identification of novel cell migration-promoting genes by a functional genetic screen. FASEB J 2009; 24:464-78. [PMID: 19812375 DOI: 10.1096/fj.09-137562] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Indexed: 12/20/2022]
Abstract
Here we describe the identification of novel cell migration-promoting genes based on an unbiased functional genetic screen in cultured cells. After the introduction of the retroviral mouse brain cDNA library into NIH3T3 mouse fibroblast cells, migration-promoted cells were selected by a 3-dimensional migration assay using cell culture inserts. After 5 rounds of enrichment, cDNAs were retrieved from the cells with a selected phenotype. Cell migration-promoting activity was confirmed by independent migration assays for the retrieved cDNAs, among which further investigation was focused on coiled-coil-helix-coiled-coil-helix domain-containing protein 2 (chchd2). Whereas overexpression of chchd2 promoted cell migration, knockdown of endogenous chchd2 expression reduced cell migration. Chchd2-induced cell migration was associated with augmented formation of actin stress fibers and focal adhesion, which was mediated through Akt, RhoA/ROCK, and Jnk pathways. CHCHD2 protein directly interacted with hyaluronic acid-binding protein 1 (HABP1) that possessed migration-suppressing activity. Intracellular localization and further functional studies suggested that CHCHD2 and HABP1 may mutually regulate each other to balance cell migration. Thus, chchd2 is a novel cell migration determinant identified by an in vitro functional genetic selection strategy. The selection method can also be useful for the isolation of genes that give other phenotypes of interest.
Affiliation(s)
- Minchul Seo
- Department of Pharmacology, School of Medicine, Brain Science and Engineering Institute, Kyungpook National University, Daegu, Korea
|
749
|
Numata K, Osada Y, Okada Y, Saito R, Hiraiwa N, Nakaoka H, Yamamoto N, Watanabe K, Okubo K, Kohama C, Kanai A, Abe K, Kiyosawa H. Identification of novel endogenous antisense transcripts by DNA microarray analysis targeting complementary strand of annotated genes. BMC Genomics 2009; 10:392. [PMID: 19698135 PMCID: PMC2741491 DOI: 10.1186/1471-2164-10-392] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Received: 02/27/2009] [Accepted: 08/22/2009] [Indexed: 11/26/2022] Open
Abstract
Background Recent transcriptomic analyses in mammals have uncovered the widespread occurrence of endogenous antisense transcripts, termed natural antisense transcripts (NATs). NATs are transcribed from the opposite strand of the gene locus and are thought to control sense gene expression, but the mechanism of such regulation is as yet unknown. Although several thousand potential sense-antisense pairs have been identified in mammals, examples of functionally characterized NATs remain limited. To identify NAT candidates suitable for further functional analyses, we performed DNA microarray-based NAT screening using mouse adult normal tissues and mammary tumors to target not only the sense orientation but also the complementary strand of the annotated genes. Results First, we designed microarray probes to target the complementary strand of genes for which an antisense counterpart had been identified only in human public cDNA sources, but not in the mouse. We observed a prominent expression signal from 66.1% of 635 target genes, and 58 genes of these showed tissue-specific expression. Expression analyses of selected examples (Acaa1b and Aard) confirmed their dynamic transcription in vivo. Although interspecies conservation of NAT expression was previously investigated by the presence of cDNA sources in both species, our results suggest that there are more examples of human-mouse conserved NATs that could not be identified by cDNA sources. We also designed probes to target the complementary strand of well-characterized genes, including oncogenes, and compared the expression of these genes between mammary cancerous tissues and non-pathological tissues. We found that antisense expression of 95 genes of 404 well-annotated genes was markedly altered in tumor tissue compared with that in normal tissue and that 19 of these genes also exhibited changes in sense gene expression. 
These results highlight the importance of NAT expression in the regulation of cellular events and in pathological conditions. Conclusion Our microarray platform targeting the complementary strand of annotated genes successfully identified novel NATs that could not be identified by publicly available cDNA data, and as such could not be detected by the usual "sense-targeting" microarray approach. Differentially expressed NATs monitored by this platform may provide candidates for investigations of gene function. An advantage of our microarray platform is that it can be applied to any genes and target samples of interest.
Affiliation(s)
- Koji Numata
- Technology and Development Team for Mammalian Cellular Dynamics, BioResource Center (BRC), RIKEN Tsukuba Institute, Ibaraki 305-0074, Japan.
|
750
|
Kim EY, Kim SY, Ashlock D, Nam D. MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering. BMC Bioinformatics 2009; 10:260. [PMID: 19698124 PMCID: PMC2743671 DOI: 10.1186/1471-2105-10-260] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Received: 02/14/2009] [Accepted: 08/22/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Uncovering subtypes of disease from microarray samples has important clinical implications such as survival time and sensitivity of individual patients to specific therapies. Unsupervised clustering methods have been used to classify this type of data. However, most existing methods focus on clusters with compact shapes and do not reflect the geometric complexity of the high dimensional microarray clusters, which limits their performance. RESULTS We present a cluster-number-based ensemble clustering algorithm, called MULTI-K, for microarray sample classification, which demonstrates remarkable accuracy. The method amalgamates multiple k-means runs by varying the number of clusters and identifies clusters that manifest the most robust co-memberships of elements. In addition to the original algorithm, we newly devised the entropy-plot to control the separation of singletons or small clusters. MULTI-K, unlike the simple k-means or other widely used methods, was able to capture clusters with complex and high-dimensional structures accurately. MULTI-K outperformed other methods including a recently developed ensemble clustering algorithm in tests with five simulated and eight real gene-expression data sets. CONCLUSION The geometric complexity of clusters should be taken into account for accurate classification of microarray data, and ensemble clustering applied to the number of clusters tackles the problem very well. The C++ code and the data sets tested are available from the authors.
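The core idea described in this abstract, running k-means for several cluster numbers and keeping the co-memberships that recur, can be illustrated with a minimal Python sketch. This is not the authors' C++ implementation: the function names, the fixed 0.5 threshold, and the connected-components grouping are assumptions made for illustration, and the entropy-plot step is not reproduced.

```python
import random
from itertools import combinations


def kmeans(points, k, rng, max_iters=100):
    """Plain Lloyd's k-means on tuples of floats; returns one label per point."""
    centers = rng.sample(points, k)
    labels = None
    for _ in range(max_iters):
        new_labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an empty cluster keeps its previous center
                centers[c] = tuple(sum(xs) / len(members) for xs in zip(*members))
    return labels


def co_membership(points, k_values, runs_per_k=5, seed=0):
    """Fraction of k-means runs (over all k in k_values) in which each pair co-clusters."""
    rng = random.Random(seed)
    n = len(points)
    together = [[0] * n for _ in range(n)]
    runs = 0
    for k in k_values:
        for _ in range(runs_per_k):
            labels = kmeans(points, k, rng)
            runs += 1
            for i, j in combinations(range(n), 2):
                if labels[i] == labels[j]:
                    together[i][j] += 1
                    together[j][i] += 1
    return [[count / runs for count in row] for row in together]


def robust_groups(freq, threshold=0.5):
    """Assumed grouping rule: connected components of pairs with freq >= threshold."""
    n = len(freq)
    seen, groups = set(), []
    for start in range(n):
        if start in seen:
            continue
        seen.add(start)
        stack, group = [start], []
        while stack:
            i = stack.pop()
            group.append(i)
            for j in range(n):
                if j not in seen and freq[i][j] >= threshold:
                    seen.add(j)
                    stack.append(j)
        groups.append(sorted(group))
    return groups


# Toy data (hypothetical): two well-separated groups of three points each.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
freq = co_membership(points, k_values=[2, 3])
print(robust_groups(freq))
```

Varying k means that pairs split apart only at larger k still accumulate a high co-membership frequency, which is what lets the ensemble recover groups that a single k-means run with a poorly chosen k would fragment.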
Affiliation(s)
- Eun-Youn Kim
- National Institute for Mathematical Sciences (NIMS), Yuseong, Daejeon 305-340, Republic of Korea.
|