1
|
Leveraging Comparative Genomics to Identify and Functionally Characterize Genes Associated with Sperm Phenotypes in Python bivittatus (Burmese Python). GENETICS RESEARCH INTERNATIONAL 2016; 2016:7505268. [PMID: 27200191 PMCID: PMC4855019 DOI: 10.1155/2016/7505268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2015] [Revised: 01/30/2016] [Accepted: 02/18/2016] [Indexed: 11/22/2022]
Abstract
Comparative genomics approaches provide a means of leveraging functional genomics information from a highly annotated model organism's genome (such as the mouse genome) in order to make physiological inferences about the role of genes and proteins in a less characterized organism's genome (such as the Burmese python). We employed a comparative genomics approach to produce the functional annotation of Python bivittatus genes encoding proteins associated with sperm phenotypes. We identify 129 gene-phenotype relationships in the python which are implicated in 10 specific sperm phenotypes. Results obtained through our systematic analysis identified subsets of python genes exhibiting associations with gene ontology annotation terms. Functional annotation data was represented in a semantic scatter plot. Together, these newly annotated Python bivittatus genome resources provide a high resolution framework from which the biology relating to reptile spermatogenesis, fertility, and reproduction can be further investigated. Applications of our research include (1) production of genetic diagnostics for assessing fertility in domestic and wild reptiles; (2) enhanced assisted reproduction technology for endangered and captive reptiles; and (3) novel molecular targets for biotechnology-based approaches aimed at reducing fertility and reproduction of invasive reptiles. Additional enhancements to reptile genomic resources will further enhance their value.
Collapse
|
2
|
Abstract
The goal of case-control association studies is to find genetic variants in the human genome that influence common traits. The Human Genome and HapMap projects have added fresh impetus to this goal by cataloguing the raw genetic data behind human DNA variation. Studies that associate these genetic variants with phenotype improve both molecular diagnostics and drug discovery and offer clinicians important opportunities to improve care of patients. In this review I focus on case-control studies, which are the most widely used design and expected to be the most powerful. I also address the problem of case-control non-replication, which is widespread despite enormous effort and use of resources. Important causes of non-replication include inadequate statistical power to detect small and moderate effects, phenotype heterogeneity, population stratification, publication bias, and multiple comparison testing.
Collapse
Affiliation(s)
- Daniel G Healy
- Institute of Neurology, Queen Square hospital, Lambert palace road, London, UK.
| |
Collapse
|
3
|
Boguski MS, Jones AR. Neurogenomics: at the intersection of neurobiology and genome sciences. Nat Neurosci 2004; 7:429-33. [PMID: 15114353 DOI: 10.1038/nn1232] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Neurogenomics is the study of how the genome as a whole contributes to the evolution, development, structure and function of the nervous system. It includes investigations of how genome products (transcriptomes and proteomes) vary in time and space. Neurogenomics differs markedly from the application of genome sciences to other systems, particularly in the spatial category, because anatomy and connectivity are paramount to our understanding of function in the nervous system. We focus here on some of the influences of genomics and its associated technologies on neuroscience. We discuss comparative genomics, gene expression atlases of the brain, network genetics and applications to behavioral phenotypes, and consider the culture, organization and funding of genome-scale projects.
Collapse
Affiliation(s)
- Mark S Boguski
- Allen Institute for Brain Science, 551 N. 34th Street, Seattle, Washington 98103, USA.
| | - Allan R Jones
- Allen Institute for Brain Science, 551 N. 34th Street, Seattle, Washington 98103, USA
| |
Collapse
|
4
|
|
5
|
|
6
|
Abstract
The advent of whole-genome data resources--not only sequence but also other genome-scale data collections such as gene expression, protein interaction, and genetic variation--is having two marked, complementary effects on the relatively new discipline of bioinformatics. First, the veritable flood of data is creating a need and demand for new tools for dealing adequately with the deluge, and, second, the unprecedented extent, diversity, and impending completeness of the data sets are creating opportunities for new approaches to discovery based on computational methods.
Collapse
Affiliation(s)
- D B Searls
- Bioinformatics Department, SmithKline Beecham Pharmaceuticals, King of Prussia, Pennsylvania 19406, USA.
| |
Collapse
|
7
|
Stanchi F, Bertocco E, Toppo S, Dioguardi R, Simionati B, Cannata N, Zimbello R, Lanfranchi G, Valle G. Characterization of 16 novel human genes showing high similarity to yeast sequences. Yeast 2001; 18:69-80. [PMID: 11124703 DOI: 10.1002/1097-0061(200101)18:1<69::aid-yea647>3.0.co;2-h] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The entire set of open reading frames (ORFs) of Saccharomyces cerevisiae has been used to perform systematic similarity searches against nucleic acid and protein databases: with the aim of identifying interesting homologies between yeast and mammalian genes. Many similarities were detected: mostly with known genes. However: several yeast ORFs were only found to match human partial sequence tags: indicating the presence of human transcripts still uncharacterized that have a homologous counterpart in yeast. About 30 such transcripts were further studied and named HUSSY (human sequence similar to yeast). The 16 most interesting are presented in this paper along with their sequencing and mapping data. As expected: most of these genes seem to be involved in basic metabolic and cellular functions (lipoic acid biosynthesis: ribulose-5-phosphate-3-epimerase: glycosyl transferase: beta-transducin: serine-threonine-kinase: ABC proteins: cation transporters). Genes related to RNA maturation were also found (homologues to DIM1: ROK1-RNA-elicase and NFS1). Furthermore: five novel human genes were detected (HUSSY-03: HUSSY-22: HUSSY-23: HUSSY-27: HUSSY-29) that appear to be homologous to yeast genes whose function is still undetermined. More information on this work can be obtained at the website http://grup.bio.unipd.it/hussy
Collapse
Affiliation(s)
- F Stanchi
- CRIBI Biotechnology Centre, Università di Padova, via G. Colombo 3, Padova 35121, Italy
| | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Mendell JT, Medghalchi SM, Lake RG, Noensie EN, Dietz HC. Novel Upf2p orthologues suggest a functional link between translation initiation and nonsense surveillance complexes. Mol Cell Biol 2000; 20:8944-57. [PMID: 11073994 PMCID: PMC86549 DOI: 10.1128/mcb.20.23.8944-8957.2000] [Citation(s) in RCA: 126] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Transcripts harboring premature signals for translation termination are recognized and rapidly degraded by eukaryotic cells through a pathway known as nonsense-mediated mRNA decay (NMD). In addition to protecting cells by preventing the translation of potentially deleterious truncated peptides, studies have suggested that NMD plays a broader role in the regulation of the steady-state levels of physiologic transcripts. In Saccharomyces cerevisiae, three trans-acting factors (Upf1p to Upf3p) are required for NMD. Orthologues of Upf1p have been identified in numerous species, showing that the NMD machinery, at least in part, is conserved through evolution. In this study, we demonstrate additional functional conservation of the NMD pathway through the identification of Upf2p homologues in Schizosaccharomyces pombe and humans (rent2). Disruption of S. pombe UPF2 established that this gene is required for NMD in fission yeast. rent2 was demonstrated to interact directly with rent1, a known trans-effector of NMD in mammalian cells. Additionally, fragments of rent2 were shown to possess nuclear targeting activity, although the native protein localizes to the cytoplasmic compartment. Finally, novel functional domains of Upf2p and rent2 with homology to eukaryotic initiation factor 4G (eIF4G) and other translational regulatory proteins were identified. Directed mutations within these so-called eIF4G homology (4GH) domains were sufficient to abolish the function of S. pombe Upf2p. Furthermore, using the two-hybrid system, we obtained evidence for direct interaction between rent2 and human eIF4AI and Sui1, both components of the translation initiation complex. Based on these findings, a novel model in which Upf2p and rent2 effects decreased translation and accelerated decay of nonsense transcripts through competitive interactions with eIF4G-binding partners is proposed.
Collapse
Affiliation(s)
- J T Mendell
- Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | | | | | | | |
Collapse
|
9
|
O'Carroll D, Scherthan H, Peters AH, Opravil S, Haynes AR, Laible G, Rea S, Schmid M, Lebersorger A, Jerratsch M, Sattler L, Mattei MG, Denny P, Brown SD, Schweizer D, Jenuwein T. Isolation and characterization of Suv39h2, a second histone H3 methyltransferase gene that displays testis-specific expression. Mol Cell Biol 2000; 20:9423-33. [PMID: 11094092 PMCID: PMC102198 DOI: 10.1128/mcb.20.24.9423-9433.2000] [Citation(s) in RCA: 222] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Higher-order chromatin has been implicated in epigenetic gene control and in the functional organization of chromosomes. We have recently discovered mouse (Suv39h1) and human (SUV39H1) histone H3 lysine 9-selective methyltransferases (Suv39h HMTases) and shown that they modulate chromatin dynamics in somatic cells. We describe here the isolation, chromosomal assignment, and characterization of a second murine gene, Suv39h2. Like Suv39h1, Suv39h2 encodes an H3 HMTase that shares 59% identity with Suv39h1 but which differs by the presence of a highly basic N terminus. Using fluorescent in situ hybridization and haplotype analysis, the Suv39h2 locus was mapped to the subcentromeric region of mouse chromosome 2, whereas the Suv39h1 locus resides at the tip of the mouse X chromosome. Notably, although both Suv39h loci display overlapping expression profiles during mouse embryogenesis, Suv39h2 transcripts remain specifically expressed in adult testes. Immunolocalization of Suv39h2 protein during spermatogenesis indicates enriched distribution at the heterochromatin from the leptotene to the round spermatid stage. Moreover, Suv39h2 specifically accumulates with chromatin of the sex chromosomes (XY body) which undergo transcriptional silencing during the first meiotic prophase. These data are consistent with redundant enzymatic roles for Suv39h1 and Suv39h2 during mouse development and suggest an additional function of the Suv39h2 HMTase in organizing meiotic heterochromatin that may even impart an epigenetic imprint to the male germ line.
Collapse
Affiliation(s)
- D O'Carroll
- Research Institute of Molecular Pathology at The Vienna Biocenter, University of Vienna, A-1030 Vienna, Austria
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Boyer JL, Ketner G. Genetic analysis of a potential zinc-binding domain of the adenovirus E4 34k protein. J Biol Chem 2000; 275:14969-78. [PMID: 10747932 DOI: 10.1074/jbc.m000566200] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
E4 34k, the product of adenovirus early region 4 (E4) open reading frame 6, modulates viral late gene expression, viral DNA replication, apoptosis, double strand break repair, and transformation through multiple interactions with components in infected and transformed cells. Conservation of several cysteine and histidine residues among E4 34k sequences from a variety of adenovirus serotypes suggests the presence of a zinc binding domain important for function. Consistent with the hypothesis that E4 34k is a zinc metalloprotein, zinc binding by baculovirus-expressed E4 34k protein was demonstrated in a zinc blotting assay. To investigate the relationship between the potential zinc-binding region and E4 34k function, a series of mutant genes containing single amino acid substitutions at each of the conserved cysteine and histidine residues in E4 34k were constructed. The mutant proteins were examined for the ability to complement the late protein synthetic defect of an E4 deletion mutant, to physically interact with the viral E1b 55-kDa protein (E1b 55k) and cellular p53 protein, to relocalize E1b 55k, and to destabilize the p53 protein. These analyses identified a subset of cysteine and histidine residues required for stimulation of late gene expression, physical interaction with E1b 55k, and p53 destabilization. These data suggest that a zinc-binding domain participates in the formation of the E4 34k-E1b 55k physical complex and that the complex is required in late gene expression and for p53 destabilization.
Collapse
Affiliation(s)
- J L Boyer
- Department of Molecular Microbiology and Immunology, Johns Hopkins University School of Public Health, Baltimore, Maryland 21205, USA
| | | |
Collapse
|
11
|
Du Y, Tsai S, Keller JR, Williams SC. Identification of an interleukin-3-regulated aldoketo reductase gene in myeloid cells which may function in autocrine regulation of myelopoiesis. J Biol Chem 2000; 275:6724-32. [PMID: 10702227 DOI: 10.1074/jbc.275.10.6724] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The EML hematopoietic progenitor cell line is a model system for studying molecular events regulating myeloid commitment and terminal differentiation. We used representational difference analysis to identify genes that are expressed differentially during myeloid differentiation of EML cells. One gene (named mAKRa) encoded a novel member of the aldoketo reductase (AKR) superfamily of cytosolic NAD(P)(H)-dependent oxidoreductases. mAKRa mRNA was detected in murine hematopoietic tissues including bone marrow, spleen, and thymus. In myeloid cell lines, mAKRa was expressed at highest levels in cells representative of promyelocytes. mAKRa mRNA levels increased rapidly in response to interleukin-3 over the first 24 h of EML cell differentiation when the cells undergo lineage commitment and extensive proliferation. mAKRa mRNA levels decreased later in the differentiation process particularly when the EML cells were cultured with granulocyte/macrophage colony-stimulating factor and retinoic acid to induce terminal granulocytic maturation. mAKRa mRNA levels decreased during retinoic acid-induced terminal granulocytic differentiation of the MPRO promyelocyte cell line. AKRs act as molecular switches by catalyzing the interconversion or inactivation of bioactive molecules including steroids and prostaglandins. We propose that mAKRa may catalyze the production or catabolism of autocrine factors that promote the proliferation and/or lineage commitment of early myeloid progenitors.
Collapse
Affiliation(s)
- Y Du
- Department of Cell Biology and Biochemistry, Texas Tech University Health Sciences Center, Lubbock, Texas 79430, USA
| | | | | | | |
Collapse
|
12
|
Law DJ, Du M, Law GL, Merchant JL. ZBP-99 defines a conserved family of transcription factors and regulates ornithine decarboxylase gene expression. Biochem Biophys Res Commun 1999; 262:113-20. [PMID: 10448078 DOI: 10.1006/bbrc.1999.1180] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Among transcription factors that regulate ornithine decarboxylase (ODC) gene expression are those that interact with GC-rich promoters, including Sp1 and ZBP-89. Sp1 functions as a transactivator and ZBP-89 as a transrepressor of both the ODC and gastrin promoters. This study reports the cloning and characterization of a second member of the ZBP family that also binds GC boxes. ZBP-99 contains four Krüppel-type zinc fingers that collectively share 91% amino acid sequence similarity and 79% sequence identity with those found in ZBP-89. In addition, there are highly conserved amino acid sequences in the carboxy-terminal segments of the two genes. In spite of their structural similarities, the two proteins are encoded at distinct loci, ZBP-89 on chromosome 3q21 and ZBP-99 on 1q32.1. The predicted open reading frame of ZBP-99 cDNA encodes a 99-kDa protein. Electrophoretic mobility shift assays showed that ZBP-99 protein specifically binds to the GC-rich promoter elements of gastrin and ODC genes. Northern blot analysis showed that a major ZBP-99 transcript of 5.6 kb is expressed ubiquitously at low levels, with elevated expression levels in placenta and in adult kidney, liver, and lymphocytes. Cotransfection of AGS gastric adenocarcinoma and HT-29 colon adenocarcinoma cells with a ZBP-99 expression construct and with an ODC reporter construct show that ZBP-99 repressed basal expression in the two cell lines by 80 and 60%, respectively. Collectively, the data suggest that ZBP-99 binds GC-rich promoters and may complement the activities mediated by ZBP-89.
Collapse
Affiliation(s)
- D J Law
- Department of Internal Medicine, Department of Physiology, University of Michigan, Michigan, Ann Arbor 48109-0650, USA
| | | | | | | |
Collapse
|
13
|
Aagaard L, Laible G, Selenko P, Schmid M, Dorn R, Schotta G, Kuhfittig S, Wolf A, Lebersorger A, Singh PB, Reuter G, Jenuwein T. Functional mammalian homologues of the Drosophila PEV-modifier Su(var)3-9 encode centromere-associated proteins which complex with the heterochromatin component M31. EMBO J 1999; 18:1923-38. [PMID: 10202156 PMCID: PMC1171278 DOI: 10.1093/emboj/18.7.1923] [Citation(s) in RCA: 337] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The chromo and SET domains are conserved sequence motifs present in chromosomal proteins that function in epigenetic control of gene expression, presumably by modulating higher order chromatin. Based on sequence information from the SET domain, we have isolated human (SUV39H1) and mouse (Suv39h1) homologues of the dominant Drosophila modifier of position-effect-variegation (PEV) Su(var)3-9. Mammalian homologues contain, in addition to the SET domain, the characteristic chromo domain, a combination that is also preserved in the Schizosaccharyomyces pombe silencing factor clr4. Chromatin-dependent gene regulation is demonstrated by the potential of human SUV39H1 to increase repression of the pericentromeric white marker gene in transgenic flies. Immunodetection of endogenous Suv39h1/SUV39H1 proteins in a variety of mammalian cell lines reveals enriched distribution at heterochromatic foci during interphase and centromere-specific localization during metaphase. In addition, Suv39h1/SUV39H1 proteins associate with M31, currently the only other characterized mammalian SU(VAR) homologue. These data indicate the existence of a mammalian SU(VAR) complex and define Suv39h1/SUV39H1 as novel components of mammalian higher order chromatin.
Collapse
Affiliation(s)
- L Aagaard
- Research Institute of Molecular Pathology (IMP), The Vienna Biocenter, Dr. Bohrgasse 7, A-1030 Vienna, Austria
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Abstract
The identification of proteins involved in the early phases of cell death has relied primarily on the modular organization of shared sequences and structural motifs of previously identified proteins in the apoptotic machinery. This property has facilitated the isolation of proteins that interact with each other through structural domains using yeast two-hybrid cloning. Likewise, the conservation in primary sequence of the various shared domains has promoted the use of polymerase chain reaction and database search strategies to isolate additional family members. Here, we discuss the use of database search strategies in the isolation of novel death proteins, as well as how similar strategies may be extended to discover additional, novel cell death proteins.
Collapse
Affiliation(s)
- R Gururajan
- Department of Tumor Cell Biology, St. Jude Children's Research Hospital, 332 North Lauderdale Street, Memphis, Tennessee 38105, USA.
| | | |
Collapse
|
15
|
Hendrich B, Bird A. Identification and characterization of a family of mammalian methyl-CpG binding proteins. Mol Cell Biol 1998; 18:6538-47. [PMID: 9774669 PMCID: PMC109239 DOI: 10.1128/mcb.18.11.6538] [Citation(s) in RCA: 951] [Impact Index Per Article: 36.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/1998] [Accepted: 08/20/1998] [Indexed: 11/20/2022] Open
Abstract
Methylation at the DNA sequence 5'-CpG is required for mouse development. MeCP2 and MBD1 (formerly PCM1) are two known proteins that bind specifically to methylated DNA via a related amino acid motif and that can repress transcription. We describe here three novel human and mouse proteins (MBD2, MBD3, and MBD4) that contain the methyl-CpG binding domain. MBD2 and MBD4 bind specifically to methylated DNA in vitro. Expression of MBD2 and MBD4 tagged with green fluorescent protein in mouse cells shows that both proteins colocalize with foci of heavily methylated satellite DNA. Localization is disrupted in cells that have greatly reduced levels of CpG methylation. MBD3 does not bind methylated DNA in vivo or in vitro. MBD1, MBD2, MBD3, and MBD4 are expressed in somatic tissues, but MBD1 and MBD2 expression is reduced or absent in embryonic stem cells which are known to be deficient in MeCP1 activity. The data demonstrate that MBD2 and MBD4 bind specifically to methyl-CpG in vitro and in vivo and are therefore likely to be mediators of the biological consequences of the methylation signal.
Collapse
Affiliation(s)
- B Hendrich
- Institute of Cell and Molecular Biology, University of Edinburgh, Edinburgh EH9 3JR, Scotland.
| | | |
Collapse
|
16
|
Affiliation(s)
- R Waterston
- Genome Sequencing Center, Washington University School of Medicine, St. Louis, MO 63108, USA.
| | | |
Collapse
|
17
|
Affiliation(s)
- M Walhout
- Massachusetts General Hospital, Cancer Cener, Charlestown, MA, USA
| | | | | | | | | |
Collapse
|
18
|
|
19
|
Taglicht D, Michaelis S. Saccharomyces cerevisiae ABC proteins and their relevance to human health and disease. Methods Enzymol 1998; 292:130-62. [PMID: 9711551 DOI: 10.1016/s0076-6879(98)92012-2] [Citation(s) in RCA: 78] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- D Taglicht
- Sigma Israel Chemicals Limited, Jerusalem, Israel
| | | |
Collapse
|
20
|
Tabtiang RK, Herskowitz I. Nuclear proteins Nut1p and Nut2p cooperate to negatively regulate a Swi4p-dependent lacZ reporter gene in Saccharomyces cerevisiae. Mol Cell Biol 1998; 18:4707-18. [PMID: 9671481 PMCID: PMC109057 DOI: 10.1128/mcb.18.8.4707] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/1998] [Accepted: 05/14/1998] [Indexed: 02/08/2023] Open
Abstract
The URS2 region of the Saccharomyces cerevisiae HO upstream region contains 10 binding sites for the Swi4p/Swi6p transcription factor and confers Swi4p dependence for transcription. Using a hybrid promoter, UASGAL (upstream activation sequence of GAL1)-URS2R, in which the GAL1-10 regulatory region is fused to the proximal 360 bp of URS2, we isolated mutants in which Swi4p is no longer required for transcription. Mutations of SIN4, ROX3, SRB8, SRB9, SRB10, SRB11, and two novel genes, NUT1 and NUT2, relieve the requirement of Swi4p for expression of this reporter. We found that NUT1 (open reading frame [ORF] YGL151w) is a nonessential gene, that NUT2 (ORF YPR168w) is essential, and that both Nut1p and Nut2p encode nuclear proteins. Deletion of NUT1 causes a constitutive, Swi4p-independent phenotype only in combination with the nut2-1 allele or an allele of CCR4. In contrast, inactivation of a temperature-sensitive allele of NUT2, nut2-ts70, alone causes constitutivity. nut1Delta nut2-1 cells and sin4Delta cells exhibit Swi4p-independent expression of an ho-lacZ reporter but not of an intact ho gene. Likewise, a pPHO5-lacZ construct is constitutively expressed in nut1 nut2 mutants relative to their wild-type counterparts. These results suggest that Nut1p, Nut2p, Sin4p, and Ccr4p define a group of proteins that negatively regulate transcription in a subtle manner which is revealed by artificial reporter genes.
Collapse
Affiliation(s)
- R K Tabtiang
- Program in Biochemistry and Molecular Biology, Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, California 94143-0448, USA
| | | |
Collapse
|
21
|
Pittman DL, Weinberg LR, Schimenti JC. Identification, characterization, and genetic mapping of Rad51d, a new mouse and human RAD51/RecA-related gene. Genomics 1998; 49:103-11. [PMID: 9570954 DOI: 10.1006/geno.1998.5226] [Citation(s) in RCA: 108] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Homologous DNA recombination occurs in all organisms and is important for repair of DNA damage during mitosis. One of the critical genes for DNA repair and meiotic recombination in yeast is RAD51, and homologs of RAD51 have been identified in several species, including mouse and human. Here we describe a new RAD51-related mammalian gene, named Rad51d, identified by searching the EST database with the yeast RAD55 and human RAD51B/REC2 genes. A full-length 1.5-kb mouse cDNA clone that encodes a predicted 329-amino-acid protein was isolated. Rad51d mRNA was present in every mouse tissue examined. Four different transcript sizes were detected, one of which was specific to testis. Human cDNA clones that predicted 71% amino acid identity to the mouse protein were also isolated. Interestingly, the sequences of these human clones and of RT-PCR-derived products provided evidence for alternative splicing. These mRNAs are predicted to encode proteins that are truncated relative to the mouse and lack the ATP-binding motif characteristic of RecA-related proteins. Using an interspecific backcross mapping panel, Rad51d was mapped to mouse Chromosome 11, 48.5 cM from the centromere. By radiation hybrid mapping, the human ortholog RAD51D was mapped to chromosome 17q11, which is a region syntenic to mouse Chromosome 11. Due to its expression pattern and sequence similarity to other RAD51 family members, it is likely that Rad51d is part of a complex of proteins required for DNA repair and meiotic recombination.
Collapse
Affiliation(s)
- D L Pittman
- Jackson Laboratory, Bar Harbor, Maine 04609, USA
| | | | | |
Collapse
|
22
|
Bailey LC, Searls DB, Overton GC. Analysis of EST-driven gene annotation in human genomic sequence. Genome Res 1998; 8:362-76. [PMID: 9548972 DOI: 10.1101/gr.8.4.362] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
We have performed a systematic analysis of gene identification in genomic sequence by similarity search against expressed sequence tags (ESTs) to assess the suitability of this method for automated annotation of the human genome. A BLAST-based strategy was constructed to examine the potential of this approach, and was applied to test sets containing all human genomic sequences longer than 5 kb in public databases, plus 300 kb of exhaustively characterized benchmark sequence. At high stringency, 70%-90% of all annotated genes are detected by near-identity to EST sequence; >95% of ESTs aligning with well-annotated sequences overlap a gene. These ESTs provide immediate access to the corresponding cDNA clones for follow-up laboratory verification and subsequent biologic analysis. At lower stringency, up to 97% of annotated genes were identified by similarity to ESTs. The apparent false-positive rate rose to 55% of ESTs among all sequences and 20% among benchmark sequences at the lowest stringency, indicating that many genes in public database entries are unannotated. Approximately half of the alignments span multiple exons, and thus aid in the construction of gene predictions and elucidation of alternative splicing. In addition, ESTs from multiple cDNA libraries frequently cluster over genes, providing a starting point for crude expression profiles. Clone IDs may be used to form EST pairs, and particularly to extend models by associating alignments of lower stringency with high-quality alignments. These results demonstrate that EST similarity search is a practical general-purpose annotation technique that complements pattern recognition methods as a tool for gene characterization.
Collapse
Affiliation(s)
- L C Bailey
- Computational Biology and Informatics Laboratory, Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA.
| | | | | |
Collapse
|
23
|
Benton D. Integrated access to genomic and other bioinformation: an essential ingredient of the drug discovery process. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 1998; 8:121-155. [PMID: 9522473 DOI: 10.1080/10629369808039138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Due to the high rate of data production and the need of researchers to have rapid access to new data, public databases have become the major medium through which genome mapping and sequencing data as well as macromolecular structural data are published. There are now more than 250 databases of biomolecular, structural, genetic, or phenotypic data, many of which are doubling in size annually. These databases, many of which were created and are maintained by experimentalists for their own research use, provide valuable collections of organized, validated data. However, the very number and diversity of databases now make efficient data resource discovery as important as effective data resource use. Existing autonomous biological databases contain related data which are more valuable when interconnected than when isolated. Political and scientific realities dictate that these databases will be built by different teams, in different locations, for different purposes, and using different data models and supporting DBMSs. As a consequence, connecting the related data they contain is not straightforward. Experience with existing biological databases indicates that it is possible to form useful queries across these databases, but that doing so usually requires expertise in the semantic structure of each source database. Advancing to the next level of integration among biological information resources poses significant technical and sociological challenges.
Collapse
Affiliation(s)
- D Benton
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892-6050, USA
| |
Collapse
|
24
|
Aral B, Benelli C, Ait-Ghezala G, Amessou M, Fouque F, Maunoury C, Créau N, Kamoun P, Marsac C. Mutations in PDX1, the human lipoyl-containing component X of the pyruvate dehydrogenase-complex gene on chromosome 11p1, in congenital lactic acidosis. Am J Hum Genet 1997; 61:1318-26. [PMID: 9399911 PMCID: PMC1716072 DOI: 10.1086/301653] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
We have identified and sequenced a cDNA that encodes an apparent human orthologue of a yeast protein-X component (ScPDX1) of pyruvate dehydrogenase multienzyme complexes. The new human cDNA that has been referred to as "HsPDX1" cDNA was cloned by use of the "database cloning" strategy and had a 1,506-bp open reading frame. The amino acid sequence of the protein encoded by the cDNA was 20% identical with that encoded by the yeast PDX1 gene and 40% identical with that encoded by the lipoate acetyltransferase component of the pyruvate dehydrogenase and included a lipoyl-bearing domain that is conserved in some dehydrogenase enzyme complexes. Northern blot analysis demonstrated that the major HsPDX1 mRNA was 2.5 kb in length and was expressed mainly in human skeletal and cardiac muscles but was also present, at low levels, in other tissues. FISH analysis performed with a P1-derived artificial chromosome (PAC)-containing HsPDX1 gene sublocalized the gene to 11p1.3. Molecular investigation of PDX1 deficiency in four patients with neonatal lactic acidemias revealed mutations 78del85 and 965del59 in a homozygous state, and one other patient had no PDX1 mRNA expression.
Collapse
MESH Headings
- Acidosis, Lactic/congenital
- Acidosis, Lactic/genetics
- Amino Acid Sequence
- Chromosome Mapping
- Chromosomes, Human, Pair 11/genetics
- Cloning, Molecular
- DNA Mutational Analysis
- DNA, Complementary/genetics
- Female
- Genes
- Humans
- In Situ Hybridization, Fluorescence
- Infant, Newborn
- Male
- Molecular Sequence Data
- Muscle Proteins/biosynthesis
- Muscle Proteins/genetics
- Organ Specificity
- Peptides/chemical synthesis
- Peptides/chemistry
- Peptides/genetics
- Polymerase Chain Reaction
- Protein Structure, Secondary
- Pyruvate Dehydrogenase Complex/chemical synthesis
- Pyruvate Dehydrogenase Complex/chemistry
- Pyruvate Dehydrogenase Complex/genetics
- RNA, Messenger/biosynthesis
- RNA, Messenger/genetics
- Sequence Alignment
- Sequence Deletion
- Sequence Homology, Amino Acid
Collapse
Affiliation(s)
- B Aral
- Laboratoire de Biochimie Médicale B, CNRS, URA1335, Institut Necker, Hôpital des Enfants Malades, Paris, France.
| | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Zweiger G, Scott RW. From expressed sequence tags to 'epigenomics': an understanding of disease processes. Curr Opin Biotechnol 1997; 8:684-7. [PMID: 9425656 DOI: 10.1016/s0958-1669(97)80119-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Expressed sequence tags (ESTs) are at the forefront of technological change that is sweeping the biomedical research community. ESTs provide a high throughput means for identifying gene transcripts and monitoring complex gene expression patterns. EST-based technologies coupled with sophisticated computer analysis tools enable the informational content and output of the genome to be accessed and evaluated on a scale immensely larger than previously possible. EST-based technologies are being used to understand disease processes and to find better disease treatments, and will allow biology to move from single gene to multigene, or even more complex epigenetic, explanations for disease.
Collapse
Affiliation(s)
- G Zweiger
- Incyte Pharmaceuticals, Palo Alto, CA 94304, USA.
| | | |
Collapse
|
26
|
Curtiss J, Heilig JS. Arrowhead encodes a LIM homeodomain protein that distinguishes subsets of Drosophila imaginal cells. Dev Biol 1997; 190:129-41. [PMID: 9331336 DOI: 10.1006/dbio.1997.8659] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The Arrowhead gene encodes a LIM-homeodomain transcription factor required for establishment of a subset of imaginal tissues: the abdominal histoblasts and the salivary gland imaginal rings. Consistent with its role in development, during embryogenesis Arrowhead is expressed in each abdominal segment and in the labial segment. Late in embryonic development, expression is refined to the abdominal histoblasts and salivary gland imaginal ring cells themselves. When ectopically expressed in imaginal disc cells, Arrowhead causes programmed cell death and loss of corresponding adult structures. Therefore, Arrowhead expression is required for development of one set of imaginal cells and is incompatible with development of another, emphasizing the specificity of Arrowhead and the sensitivity of different target cells to its expression. Loss-of-function mutations in Arrowhead affect conserved or invariant amino acids in the LIM- and homeo-domains demonstrating the importance of these residues in LIM homeodomain protein activity.
Collapse
Affiliation(s)
- J Curtiss
- Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder 80309, USA
| | | |
Collapse
|
27
|
Abstract
A sequence similarity search has been carried out against the complete Saccharomyces cerevisiae genome to identify the yeast homologues of human disease-associated genes. Using the BLAST algorithm (Basic Local Alignment Search Tool), it was found that 52 out of the 170 disease genes identified without reference to chromosomal map position and 22 of the 80 (27.5%) positionally cloned genes match yeast genes with a P-value of <e(-40). The percentage of the disease genes identified by positional cloning which bear homology to yeast is similar to that of a random collection of human cDNAs. The biochemical and physiological functions of the large majority of these human genes remain poorly understood and, even though a strict conservation of function cannot safely be assessed from structural homology analysis without the support of experimental and three-dimensional data, functional analogies can often be established between the human and yeast genes.
Collapse
Affiliation(s)
- F Foury
- Unité de Biochimie Physiologique, Université Catholique de Louvain, Louvain-la-Neuve, Belgium.
| |
Collapse
|
28
|
Abstract
Five intersubspecific backcrosses and an intercross were used to establish a sex-averaged recombinational map spanning 56 cM across most of mouse Chromosome 16 (Chr 16). A total of 123 markers were ordered using an interval mapping approach to identify 425 recombination sites in a collection of 1154 meioses from 1155 progeny generated in the six crosses. The markers include the 10 "classic" Chr 16 reference markers, 26 additional genes or transcripts including two phenotypic markers (Pit1dw and Kcnj6wv), and 87 simple sequence length polymorphisms (SSLPs). One set of monozygotic twins was detected among the 304 meioses mapped to highest resolution. The reference markers and SSLPs allow the map to be well integrated with existing maps of Chr 16. The average distance between crossover sites is less than 500 kb for most chromosomes, making this collection of recombinant chromosomes useful as a binning and ordering resource for YAC-based physical map assembly on Chr 16.
Collapse
Affiliation(s)
- R H Reeves
- Department of Physiology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA.
| | | | | | | |
Collapse
|
29
|
Cross SH, Meehan RR, Nan X, Bird A. A component of the transcriptional repressor MeCP1 shares a motif with DNA methyltransferase and HRX proteins. Nat Genet 1997; 16:256-9. [PMID: 9207790 DOI: 10.1038/ng0797-256] [Citation(s) in RCA: 194] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Methylation of cytosines within the sequence CpG is essential for mouse development and has been linked to transcriptional suppression in vertebrate systems. Methyl-CpG binding proteins (MeCPs) 1 and 2 bind preferentially to methylated DNA and can inhibit transcription. The gene for MeCP2 has been cloned and a methyl-CpG binding domain (MBD) within it has been defined. A search of DNA sequence databases with the MBD sequence identified a human cDNA with potential to encode an MBD-like region. Sequencing of the complete cDNA revealed that the open reading frame also encodes two cysteine-rich domains that are found in animal DNA methyltransferases (DNMTs) and in the mammalian HRX protein (also known as MLL and All-1). HRX is related to Drosophila trithorax. The protein, known as Protein Containing MBD (PCM1), was expressed in bacteria and shown to bind specifically to methylated DNA. PCM1 also repressed transcription in vitro in a methylation-dependent manner. A polyclonal antibody raised against the protein was able to 'supershift' the native MeCP11 complex from HeLa cells, indicating that PCM1 is a component of mammalian MeCP1.
Collapse
Affiliation(s)
- S H Cross
- Institute of Cell and Molecular Biology, Edinburgh University U.K. and
| | | | | | | |
Collapse
|
30
|
Sen-Gupta M, Güldener U, Beinhauer J, Fiedler T, Hegemann JH. Sequence analysis of the 33 kb long region between ORC5 and SUI1 from the left arm of chromosome XIV from Saccharomyces cerevisiae. Yeast 1997; 13:849-60. [PMID: 9234673 DOI: 10.1002/(sici)1097-0061(199707)13:9<849::aid-yea106>3.0.co;2-n] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
We have determined the nucleotide sequence of a chromosomal region of 33,016 bp located on the left arm of chromosome XIV from budding yeast between the ORC5 and the SUI1 gene. Subsequent sequence analysis revealed the presence of 18 non-overlapping open reading frames (ORFs) including eight previously identified and sequenced genes (ORC5, ATX1, SIP3, NRD1, RAD50, MPA43, RPA49 and SUI1). Three other ORFs (YNL256w, YNL255c and YNL247w) code for putative proteins with significant homology to proteins from other organisms, while 4 ORFs exhibit only weak homology to known proteins. Three ORFs have no homology with sequences in the databases.
Collapse
Affiliation(s)
- M Sen-Gupta
- Institut für Mikrobiologie und Molekularbiologie, Justus-Liebig-Universitat Giessen, Germany
| | | | | | | | | |
Collapse
|
31
|
Pollet N, Boccaccio C, Dhorne-Pollet S, Driancourt C, Raynaud N, Auffray C, Hadchouel M, Meunier-Rotival M. Construction of an integrated physical and gene map of human chromosome 20p12 providing candidate genes for Alagille syndrome. Genomics 1997; 42:489-98. [PMID: 9205123 DOI: 10.1006/geno.1997.4676] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Physical mapping and localization of eSTS markers were used to generate an integrated physical and gene map covering a approximately 10-Mb region of human chromosome 20p12 containing the Alagille syndrome (AGS) locus. Seventy-four STSs, 28 of which were derived from cDNA sequences, mapped with an average resolution of 135 kb. The 28 eSTS markers define 20 genes. Six known genes, namely CHGB, BMP2, PLCB1, PLCB4, SNAP, and HJ1, were precisely mapped. Among the genes identified, one maps in the smallest region of overlap of the deletions associated with AGS and could therefore be regarded as a candidate gene for Alagille syndrome.
Collapse
Affiliation(s)
- N Pollet
- INSERM U347 affiliée au CNRS, Le Kremlin-Bicêtre, France
| | | | | | | | | | | | | | | |
Collapse
|
32
|
Mushegian AR, Bassett DE, Boguski MS, Bork P, Koonin EV. Positionally cloned human disease genes: patterns of evolutionary conservation and functional motifs. Proc Natl Acad Sci U S A 1997; 94:5831-6. [PMID: 9159160 PMCID: PMC20866 DOI: 10.1073/pnas.94.11.5831] [Citation(s) in RCA: 177] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Positional cloning has already produced the sequences of more than 70 human genes associated with specific diseases. In addition to their medical importance, these genes are of interest as a set of human genes isolated solely on the basis of the phenotypic effect of the respective mutations. We analyzed the protein sequences encoded by the positionally cloned disease genes using an iterative strategy combining several sensitive computer methods. Comparisons to complete sequence databases and to separate databases of nematode, yeast, and bacterial proteins showed that for most of the disease gene products, statistically significant sequence similarities are detectable in each of the model organisms. Only the nematode genome encodes apparent orthologs with conserved domain architecture for the majority of the disease genes. In yeast and bacterial homologs, domain organization is typically not conserved, and sequence similarity is limited to individual domains. Generally, human genes complement mutations only in orthologous yeast genes. Most of the positionally cloned genes encode large proteins with several globular and nonglobular domains, the functions of some or all of which are not known. We detected conserved domains and motifs not described previously in a number of proteins encoded by disease genes and predicted functions for some of them. These predictions include an ATP-binding domain in the product of hereditary nonpolyposis colon cancer gene (a MutL homolog), which is conserved in the HS90 family of chaperone proteins, type II DNA topoisomerases, and histidine kinases, and a nuclease domain homologous to bacterial RNase D and the 3'-5' exonuclease domain of DNA polymerase I in the Werner syndrome gene product.
Collapse
Affiliation(s)
- A R Mushegian
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | | | |
Collapse
|
33
|
Holloway AJ, Della NG, Fletcher CF, Largespada DA, Copeland NG, Jenkins NA, Bowtell DD. Chromosomal mapping of five highly conserved murine homologues of the Drosophila RING finger gene seven-in-absentia. Genomics 1997; 41:160-8. [PMID: 9143490 DOI: 10.1006/geno.1997.4642] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Seven-in-absentia (sina) is epistatic to all other known genes in the sevenless-ras signaling pathway, which mediates R7 photoreceptor formation in the Drosophila eye. The murine genome contains several closely related sina homologues (Siah1A-D, Siah2) that are also likely to participate in ras signaling. As part of a genetic and biochemical analysis of the mammalian Siah genes, we have used gene-specific probes to map the chromosomal positions of each family member. Here we report their chromosomal positions in relation to a number of known mouse mutations and also describe an analysis of the human Siah genes. By comparing the complexity of the Siah genes in these two mammalian species we have gained further insight into which members of this murine multigene family are likely to be functional.
Collapse
Affiliation(s)
- A J Holloway
- Trescowthick Research Laboratories, Peter MacCallum Cancer Institute, Melbourne, Victoria, Australia
| | | | | | | | | | | | | |
Collapse
|
34
|
Purdue PE, Zhang JW, Skoneczny M, Lazarow PB. Rhizomelic chondrodysplasia punctata is caused by deficiency of human PEX7, a homologue of the yeast PTS2 receptor. Nat Genet 1997; 15:381-4. [PMID: 9090383 DOI: 10.1038/ng0497-381] [Citation(s) in RCA: 216] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The rhizomelic form of chondrodysplasia punctata (RCDP) is an autosomal recessive disease of peroxisome biogenesis characterized by deficiencies in several peroxisomal proteins, including the peroxisomal enzymes of plasmalogen biosynthesis and peroxisomal 3-ketoacyl thiolase. In cultured fibroblasts from patients with this disorder, both the peroxisomal targeting and proteolytic removal of the amino-terminal type 2 peroxisomal targeting sequence (PTS2) of thiolase are defective, whereas the biogenesis of proteins targeted by carboxyterminal type 1 peroxisomal targeting sequences (PTS1) is unimpaired. We have previously isolated a Saccharomyces cerevisiae peroxisomal biogenesis mutant, pex7 (formerly peb1/pas7), which demonstrates a striking similarity to the cellular phenotype of RCDP fibroblasts in that PTS1 targeting is functional, but the peroxisomal packaging of PTS2 targeted thiolase is lacking. Complementation of this mutant has led to the identification of the protein ScPex7p, a PTS2 receptor. In this paper we report cloning of the human orthologue of ScPEX7, and demonstrate that this is the defective gene in RCDP. We show that expression of human PEX7 in RCDP cells rescues PTS2 targeting and restores some activity of dihydroxyacetone phosphate acyltransferase (DHAP-AT), a peroxisomal enzyme of plasmalogen biosynthesis, and we identify the mutations responsible for loss of function of PEX7 in a compound heterozygote RCDP patient. These results imply that several peroxisomal proteins are targeted by PTS2 signals and that the various biochemical and clinical defects in RCDP result from a defect in the receptor for this class of PTS.
Collapse
Affiliation(s)
- P E Purdue
- Department of Cell Biology and Anatomy, Mount Sinai School of Medicine, New York, NY 10029-6574, USA
| | | | | | | |
Collapse
|
35
|
Bassett DE, Boguski MS, Spencer F, Reeves R, Kim S, Weaver T, Hieter P. Genome cross-referencing and XREFdb: implications for the identification and analysis of genes mutated in human disease. Nat Genet 1997; 15:339-44. [PMID: 9090377 DOI: 10.1038/ng0497-339] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Comparative genomics approaches and multi-organismal biology are valuable tools for genetic analysis. Cross-species connections between genes mutated in human disease states and homologues in model organisms can be particularly powerful, as model-organism gene function data and experimental approaches can shed light on the molecular mechanisms defective in the disease. We describe a project that is systematically identifying novel expressed sequence tag (EST) sequences that are highly related to genes in model organisms and mapping them to positions on the mouse and human maps. This process effectively cross-references model organism genes with mapped mammalian phenotypes, facilitating the identification of genes mutated in human disease states via the positional candidate approach. A public database, XREFdb (http:@www.ncbi.nlm.nih.gov/XREFdb/), disseminates similarity search, mapping and mammalian phenotype information and increases the rate at which these cross-species connections are established.
Collapse
Affiliation(s)
- D E Bassett
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205-2185, USA
| | | | | | | | | | | | | |
Collapse
|
36
|
Applequist SE, Selg M, Raman C, Jäck HM. Cloning and characterization of HUPF1, a human homolog of the Saccharomyces cerevisiae nonsense mRNA-reducing UPF1 protein. Nucleic Acids Res 1997; 25:814-21. [PMID: 9064659 PMCID: PMC146496 DOI: 10.1093/nar/25.4.814] [Citation(s) in RCA: 120] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Levels of most nonsense mRNAs are normally reduced in prokaryotes and eukaryotes when compared with that of corresponding functional mRNAs. Genes encoding polypeptides that selectively reduce levels of nonsense mRNA have so far only been identified in simple eukaryotes. We have now cloned a human cDNA whose deduced amino acid sequence shows the highest degree of homology to that of UPF1, a bona fide Saccharomyces cerevisiae group I RNA helicase required for accelerated degradation of nonsense mRNA. Based on the total sequence of the shorter yeast UPF1 protein, the overall identity between the human protein and UPF1 is 51%. Besides NTPase and other RNA helicase consensus motifs, UPF1 and its human homolog also share similar putative zinc finger motifs that are absent in other group I RNA helicases. Northern blot analysis with the human cDNA probe revealed two transcripts in several human cell lines. Further, antibodies raised against a synthetic peptide of the human polypeptide detected a single 130 kDa polypeptide on Western blots from human and mouse cells. Finally, immunofluorescence and Western blot analyses revealed that the human and mouse polypeptides, like yeast UPF1, are expressed in the cytoplasm, but not in the nucleus. We have thus identified the first mammalian homolog of yeast UPF1, a protein that regulates levels of nonsense mRNA, and we tentatively name this protein human HUPF1 (for human homolog of UPF1).
Collapse
Affiliation(s)
- S E Applequist
- Department of Microbiology and Immunology, Stritch School of Medicine, Loyola University of Chicago, Maywood, IL 60153, USA
| | | | | | | |
Collapse
|
37
|
Affiliation(s)
- A Guffanti
- Telethon Institute of Genetics and Medicine (Tigem), Milano, Italy.
| | | | | | | | | |
Collapse
|
38
|
Fujimura-Kamada K, Nouvet FJ, Michaelis S. A novel membrane-associated metalloprotease, Ste24p, is required for the first step of NH2-terminal processing of the yeast a-factor precursor. J Cell Biol 1997; 136:271-85. [PMID: 9015299 PMCID: PMC2134828 DOI: 10.1083/jcb.136.2.271] [Citation(s) in RCA: 133] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/1996] [Revised: 11/08/1996] [Indexed: 02/03/2023] Open
Abstract
Many secreted bioactive signaling molecules, including the yeast mating pheromones a-factor and alpha-factor, are initially synthesized as precursors requiring multiple intracellular processing enzymes to generate their mature forms. To identify new gene products involved in the biogenesis of a-factor in Saccharomyces cerevisiae, we carried out a screen for MA Ta-specific, mating-defective mutants. We have identified a new mutant, ste24, in addition to previously known sterile mutants. During its biogenesis in a wild-type strain, the a-factor precursor undergoes a series of COOH-terminal CAAX modifications, two sequential NH2-terminal cleavage events, and export from the cell. Identification of the a-factor biosynthetic intermediate that accumulates in the ste24 mutant revealed that STE24 is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The STE24 gene product contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for STE24 activity, since STE24 fails to function when conserved residues within this motif are mutated. The identification of Ste24p homologues in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe, Haemophilus influenzae, and Homo sapiens, indicates that Ste24p has been highly conserved throughout evolution. Ste24p and the proteins related to it define a new subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases.
Collapse
Affiliation(s)
- K Fujimura-Kamada
- Department of Cell Biology and Anatomy, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | | | |
Collapse
|
39
|
Ross-Macdonald P, Sheehan A, Roeder GS, Snyder M. A multipurpose transposon system for analyzing protein production, localization, and function in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A 1997; 94:190-5. [PMID: 8990184 PMCID: PMC19279 DOI: 10.1073/pnas.94.1.190] [Citation(s) in RCA: 134] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Analysis of the function of a particular gene product typically involves determining the expression profile of the gene, the subcellular location of the protein, and the phenotype of a null strain lacking the protein. Conditional alleles of the gene are often created as an additional tool. We have developed a multifunctional, transposon-based system that simultaneously generates constructs for all the above analyses and is suitable for mutagenesis of any given Saccharomyces cerevisiae gene. Depending on the transposon used, the yeast gene is fused to a coding region for beta-galactosidase or green fluorescent protein. Gene expression can therefore be monitored by chemical or fluorescence assays. The transposons create insertion mutations in the target gene, allowing phenotypic analysis. The transposon can be reduced by cre-lox site-specific recombination to a smaller element that leaves an epitope tag inserted in the encoded protein. In addition to its utility for a variety of immunodetection purposes, the epitope tag element also has the potential to create conditional alleles of the target gene. We demonstrate these features of the transposons by mutagenesis of the SPA2, ARP100, SER1, and BDF1 genes.
Collapse
Affiliation(s)
- P Ross-Macdonald
- Department of Biology, Yale University, New Haven, CT 06520-8103, USA
| | | | | | | |
Collapse
|
40
|
Bassett DE, Basrai MA, Connelly C, Hyland KM, Kitagawa K, Mayer ML, Morrow DM, Page AM, Resto VA, Skibbens RV, Hieter P. Exploiting the complete yeast genome sequence. Curr Opin Genet Dev 1996; 6:763-6. [PMID: 8994849 DOI: 10.1016/s0959-437x(96)80033-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
The completion of the genome sequence of the budding yeast Saccharomyces cerevisiae marks the dawn of an exciting new era in eukaryotic biology that will bring with it a new understanding of yeast, other model organisms, and human beings. This body of sequence data benefits yeast researchers by obviating the need for piecemeal sequencing of genes, and allows researchers working with other organisms to tap into experimental advantages inherent in the yeast system and learn from functionally characterized yeast gene products which are their proteins of interest. In addition, the yeast post-genome sequence era is serving as a testing ground for powerful new technologies, and proven experimental approaches are being applied for the first time in a comprehensive fashion on a complete eukaryotic gene repertoire.
Collapse
Affiliation(s)
- D E Bassett
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205-2185, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Perlick HA, Medghalchi SM, Spencer FA, Kendzior RJ, Dietz HC. Mammalian orthologues of a yeast regulator of nonsense transcript stability. Proc Natl Acad Sci U S A 1996; 93:10928-32. [PMID: 8855285 PMCID: PMC38260 DOI: 10.1073/pnas.93.20.10928] [Citation(s) in RCA: 109] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
All eukaryotes that have been studied to date possess the ability to detect and degrade transcripts that contain a premature signal for the termination of translation. This process of nonsense-mediated RNA decay has been most comprehensively studied in the yeast Saccharomyces cerevisiae where at least three trans-acting factors (Upf1p through Upf3P) are required. We have cloned cDNAs encoding human and murine homologues of Upf1p, termed rent1 (regulator of nonsense transcripts). Rent1 is the first identified mammalian protein that contains all of the putative functional elements in Upf1p including zinc finger-like and NTPase domains, as well as all motifs common to members of helicase superfamily I. Moreover, expression of a chimeric protein, N and C termini of Upf1p, complements the Upf1p-deficient phenotype in yeast. Thus, despite apparent differences between yeast and mammalian nonsense-mediated RNA decay, these data suggest that the two pathways use functionally related machinery.
Collapse
Affiliation(s)
- H A Perlick
- Department of Pediatrics, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | | | | | | | | |
Collapse
|
42
|
Reifsnyder C, Lowell J, Clarke A, Pillus L. Yeast SAS silencing genes and human genes associated with AML and HIV-1 Tat interactions are homologous with acetyltransferases. Nat Genet 1996; 14:42-9. [PMID: 8782818 DOI: 10.1038/ng0996-42] [Citation(s) in RCA: 223] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Silencing is an epigenetic form of transcriptional regulation whereby genes are heritably, but not necessarily permanently, inactivated. We have identified the Saccharomyces cerevisiae genes SAS2 and SAS3 through a screen for enhancers of sir1 epigenetic silencing defects. SAS2, SAS3 and a Schizosaccharomyces pombe homologue are closely related to several human genes, including one associated with acute myeloid leukaemia arising from the recurrent translocation t(8;16)(p11;p13) and one implicated in HIV-1 Tat interactions. All of these genes encode proteins with an atypical zinc finger and well-conserved similarities to acetyltransferases. Sequence similarities and yeast mutant phenotypes suggest that SAS-like genes function in transcriptional regulation and cell-cycle exit and reveal novel connections between transcriptional silencing and human disease.
Collapse
MESH Headings
- Acetyltransferases/genetics
- Acute Disease
- Amino Acid Sequence
- Base Sequence
- Cloning, Molecular
- DNA, Fungal
- Enhancer Elements, Genetic
- Fungal Proteins/genetics
- Fungal Proteins/metabolism
- Gene Expression Regulation, Fungal
- Gene Products, tat/genetics
- HIV-1/genetics
- Humans
- Leukemia, Myeloid/genetics
- Molecular Sequence Data
- Mutation
- Phenotype
- Resting Phase, Cell Cycle
- Saccharomyces cerevisiae/enzymology
- Saccharomyces cerevisiae/genetics
- Schizosaccharomyces/genetics
- Sequence Homology, Amino Acid
- Silent Information Regulator Proteins, Saccharomyces cerevisiae
- Trans-Activators/genetics
- Zinc Fingers/genetics
- tat Gene Products, Human Immunodeficiency Virus
Collapse
Affiliation(s)
- C Reifsnyder
- Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder 80309, USA
| | | | | | | |
Collapse
|
43
|
Makałowski W, Zhang J, Boguski MS. Comparative analysis of 1196 orthologous mouse and human full-length mRNA and protein sequences. Genome Res 1996; 6:846-57. [PMID: 8889551 DOI: 10.1101/gr.6.9.846] [Citation(s) in RCA: 175] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
A large set of mRNA and encoded protein sequences, from orthologous murine and human genes, was compiled to analyze statistical, biological, and evolutionary properties of coding and noncoding transcribed sequences. Protein sequence conservation varied between 36% and 100% identity, with an average value of 85%. The average degree of nucleotide sequence identity for the corresponding coding sequences was also approximately 85%, whereas 5' and 3' untranslated regions (UTRs) were less conserved, with aligned identities of 67% and 69%, respectively. For some mouse and human genes, nucleotide sequences are more highly conserved than the encoded protein sequences. A subset of 32 sequences, consisting of only mouse/human protein pairs for which the human sequence represents a positionally cloned disease gene, had properties very similar to the larger data set, suggesting that our data are representative of the genome as a whole. With respect to sequence conservation, two interesting outliers are the breast cancer (BRCAI) gene product and the testis-determining factor (SRY), both of which display among the lowest degrees of sequence identity. The occurrence of both introns and repetitive elements (e.g., Alu, Bl) in 5' and 3' UTRs was also studied. These results provide one benchmark for the "comparative genomics" of mice and humans, with practical implications for the cross-referencing of transcript maps. Also, they should prove useful in estimating the additional sampling diversity provided by mouse EST sequencing projects designed to complement the existing human cDNA collection.
Collapse
Affiliation(s)
- W Makałowski
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.
| | | | | |
Collapse
|
44
|
Abstract
The materials of bioinformatics are biological data, and its methods are derived from a wide variety of computational techniques. Recent years have seen an explosive growth in biological data, and the development of novel computational methods. These methods have become essential to research progress in structural biology, genomics, structure-based drug design and molecular evolution. The development and maintenance of a robust infrastructure of biological data is of equal importance if biotechnology is to take maximum advantage of research advances in a wide variety of fields. While bioinformatics has already made important contributions, it faces significant challenges as it matures.
Collapse
Affiliation(s)
- D Benton
- National Center for Human Genome Research, National Institutes of Health, Bethesda, MD 20892-6050, USA. benton@extra,nchgr.nih.gov
| |
Collapse
|
45
|
Affiliation(s)
- M Johnston
- Department of Genetics, Washington University Medical School, St Louis, MO 63110, USA.
| |
Collapse
|
46
|
|
47
|
Affiliation(s)
- S Walsh
- Sanger Centre, Hinxton Hall, Hinxton, Cambridge, UK.
| | | |
Collapse
|
48
|
Rep M, Grivell LA. MBA1 encodes a mitochondrial membrane-associated protein required for biogenesis of the respiratory chain. FEBS Lett 1996; 388:185-8. [PMID: 8690083 DOI: 10.1016/0014-5793(96)00543-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
The yeast MBA 1 gene (Multi-copy Bypass of AFG3) is one of three genes whose overexpression suppresses afg3-null and rca1-null mutations. Bypass of AFG3 and RCA1, whose products are essential for assembly of mitochondrial inner membrane enzyme complexes, suggests a related role for MBA1. The predicted translation product is a 30 kDa hydrophilic protein with a putative mitochondrial targeting sequence and no homology to any sequence in protein or EST databases. Gene disruption leads to a partial respiratory growth defect, which is more pronounced at temperatures above 30 degrees C. Concomitantly, amounts of cytochromes b and aa3 are reduced. A C-terminal c-myc-tagged MBA1 gene product is functional and is found associated with the mitochondrial inner membrane, from which it can he extracted by carbonate, but not by high salt. These observations give further support to a role of MBA1 in assembly of the respiratory chain.
Collapse
Affiliation(s)
- M Rep
- Section for Molecular Biology, Department of Molecular Cell Biology, University of Amsterdam, The Netherlands
| | | |
Collapse
|
49
|
Hartzog GA, Basrai MA, Ricupero-Hovasse SL, Hieter P, Winston F. Identification and analysis of a functional human homolog of the SPT4 gene of Saccharomyces cerevisiae. Mol Cell Biol 1996; 16:2848-56. [PMID: 8649394 PMCID: PMC231277 DOI: 10.1128/mcb.16.6.2848] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Spt4p is a nonhistone protein of Saccharomyces cerevisiae that is believed to be required for normal chromatin structure and transcription. In this work we describe the isolation and analysis of a human gene, SUPT4H, that encodes a predicted protein 42% identical to Spt4p. When expressed in S. cerevisiae, SUPT4H complemented all spt4 mutant phenotypes. In human cells SUPT4H encodes a nuclear protein that is expressed in all tissues tested. In addition, hybridization analyses suggest that an SUPT4H-related gene is also present in mice. SUPT4H was localized to human chromosome 17 by PCR analysis of a human-rodent somatic cell hybrid panel. Thus, like other proteins that are components of or control the structure of chromatin, Spt4p appears to be conserved from S. cerevisiae to mammals.
Collapse
Affiliation(s)
- G A Hartzog
- Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
| | | | | | | | | |
Collapse
|
50
|
Ma XJ, Lu Q, Grunstein M. A search for proteins that interact genetically with histone H3 and H4 amino termini uncovers novel regulators of the Swe1 kinase in Saccharomyces cerevisiae. Genes Dev 1996; 10:1327-40. [PMID: 8647431 DOI: 10.1101/gad.10.11.1327] [Citation(s) in RCA: 146] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
In a genetic screen for second-site mutations that are lethal in combination with a deletion of the amino terminus of histone H3, we have uncovered three new gene products that regulate the Saccharomyces cerevisiae Swe1 kinase. The Swe1 protein kinase phosphorylates tyrosine residue 19 of Cdc28 and inhibits its activity. One histone synthetic-lethal gene, HSL1, encodes a putative protein kinase that has high sequence and functional homology to fission yeast cdr1/nim1, an inhibitory kinase of wee1. Another gene, HSL7, is a novel negative regulator of Swe1 function. Sequences similar to Hsl7 exist in Caenorhabditis elegans and humans. In addition, we have isolated a dosage-dependent suppressor, OSS1, of hsl1 and hsl7. OSS1 is important for the transcriptional repression of SWE1 and CLN2 in G2. Mutations in HSL1 and HSL7 therefore cause hyperactivity of the Swe1 kinase, which in turn decreases mitotic Cdc28 kinase activity. Moreover, HSL5 is identical to CDC28, further suggesting that it is the decreased Cdc28 kinase activity in these hsl mutants that causes lethality in the histone mutant background. Because neither HSL1 nor HSL7 is essential in yeast, and histone transcription is unaffected by the hsl5/cdc28 mutation, it is unlikely that synthetic lethality results from reduced transcription of HSL1 and HSL7 caused by histone mutations, or from reduced histone transcription when Cdc28 kinase activity is compromised. We suggest that these cell cycle regulators function in a pathway upstream of both histones H3 and H4, thereby modulating histone function in the cell cycle.
Collapse
Affiliation(s)
- X J Ma
- Department of Biological Chemistry, University of California at Los Angeles School of Medicine, California 90095, USA
| | | | | |
Collapse
|