1
|
Reid I, O’Toole N, Zabaneh O, Nourzadeh R, Dahdouli M, Abdellateef M, Gordon PMK, Soh J, Butler G, Sensen CW, Tsang A. SnowyOwl: accurate prediction of fungal genes by using RNA-Seq and homology information to select among ab initio models. BMC Bioinformatics 2014; 15:229. [PMID: 24980894 PMCID: PMC4084796 DOI: 10.1186/1471-2105-15-229] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Accepted: 06/17/2014] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Locating the protein-coding genes in novel genomes is essential to understanding and exploiting the genomic information but it is still difficult to accurately predict all the genes. The recent availability of detailed information about transcript structure from high-throughput sequencing of messenger RNA (RNA-Seq) delineates many expressed genes and promises increased accuracy in gene prediction. Computational gene predictors have been intensively developed for and tested in well-studied animal genomes. Hundreds of fungal genomes are now or will soon be sequenced. The differences of fungal genomes from animal genomes and the phylogenetic sparsity of well-studied fungi call for gene-prediction tools tailored to them. RESULTS SnowyOwl is a new gene prediction pipeline that uses RNA-Seq data to train and provide hints for the generation of Hidden Markov Model (HMM)-based gene predictions and to evaluate the resulting models. The pipeline has been developed and streamlined by comparing its predictions to manually curated gene models in three fungal genomes and validated against the high-quality gene annotation of Neurospora crassa; SnowyOwl predicted N. crassa genes with 83% sensitivity and 65% specificity. SnowyOwl gains sensitivity by repeatedly running the HMM gene predictor Augustus with varied input parameters and selectivity by choosing the models with best homology to known proteins and best agreement with the RNA-Seq data. CONCLUSIONS SnowyOwl efficiently uses RNA-Seq data to produce accurate gene models in both well-studied and novel fungal genomes. The source code for the SnowyOwl pipeline (in Python) and a web interface (in PHP) is freely available from http://sourceforge.net/projects/snowyowl/.
Collapse
Affiliation(s)
- Ian Reid
- Centre for Structural and Functional Genomics, Concordia University, 7141 Sherbrooke St. W, Montreal, QC H4B 1R6, Canada
| | - Nicholas O’Toole
- Centre for Structural and Functional Genomics, Concordia University, 7141 Sherbrooke St. W, Montreal, QC H4B 1R6, Canada
| | - Omar Zabaneh
- Faculty of Medicine, Visual Genomics Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada
| | - Reza Nourzadeh
- Faculty of Medicine, Visual Genomics Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada
| | - Mahmoud Dahdouli
- Faculty of Medicine, Visual Genomics Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada
| | - Mostafa Abdellateef
- Faculty of Medicine, Visual Genomics Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada
| | - Paul MK Gordon
- Faculty of Medicine, Visual Genomics Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada
| | - Jung Soh
- Faculty of Medicine, Visual Genomics Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada
| | - Gregory Butler
- Centre for Structural and Functional Genomics, Concordia University, 7141 Sherbrooke St. W, Montreal, QC H4B 1R6, Canada
| | - Christoph W Sensen
- Faculty of Medicine, Visual Genomics Centre, University of Calgary, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada
| | - Adrian Tsang
- Centre for Structural and Functional Genomics, Concordia University, 7141 Sherbrooke St. W, Montreal, QC H4B 1R6, Canada
| |
Collapse
|
2
|
Yin PY, Shyu SJ, Yang SR, Chang YC. Reinforcement Learning for Improving Gene Identification Accuracy by Combination of Gene-Finding Programs. INTERNATIONAL JOURNAL OF APPLIED METAHEURISTIC COMPUTING 2012. [DOI: 10.4018/jamc.2012010104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Due to the explosive and growing size of the genome database, the discovery of gene has become one of the most computationally intensive tasks in bioinformatics. Many such systems have been developed to find genes; however, there is still some room to improve the prediction accuracy. This paper proposes a reinforcement learning model for a combination of gene predictions from existing gene-finding programs. The model learns the optimal policy for accepting the best predictions. The fitness of a policy is reinforced if the selected prediction at a nucleotide site correctly corresponds to the true annotation. The model searches for the optimal policy which maximizes the expected prediction accuracy over all nucleotide sites in the sequences. The experimental results demonstrate that the proposed model yields higher prediction accuracy than that obtained by the single best program.
Collapse
|
3
|
Batarseh A, Barlow KD, Martinez-Arguelles DB, Papadopoulos V. Functional characterization of the human translocator protein (18kDa) gene promoter in human breast cancer cell lines. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2011; 1819:38-56. [PMID: 21958735 DOI: 10.1016/j.bbagrm.2011.09.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2011] [Revised: 09/11/2011] [Accepted: 09/12/2011] [Indexed: 10/17/2022]
Abstract
The translocator protein (18kDa; TSPO) is a mitochondrial drug- and cholesterol-binding protein that has been implicated in several processes, including steroidogenesis, cell proliferation, and apoptosis. Expression of the human TSPO gene is elevated in several cancers. To understand the molecular mechanisms that regulate TSPO expression in human breast cancer cells, the TSPO promoter was identified, cloned, and functionally characterized in poor-in-TSPO hormone-dependent, non-aggressive MCF-7 cells and rich-in-TSPO hormone-independent, aggressive, and metastatic MDA-MB-231 breast cancer cells. RNA ligase-mediated 5'-rapid amplification of cDNA ends analysis indicated transcription initiated at multiple sites downstream of a GC-rich promoter that lacks functional TATA and CCAAT boxes. Deletion analysis indicated that the region from -121 to +66, which contains five putative regulatory sites known as GC boxes, was sufficient to induce reporter activity up to 24-fold in MCF-7 and nearly 120-fold in MDA-MB-231 cells. Electrophoretic mobility shift and chromatin immunoprecipitation assays indicated that Sp1, Sp3 and Sp4 bind to these GC boxes in vitro and to the endogenous TSPO promoter. Silencing of Sp1, Sp3 and Sp4 gene expression reduced TSPO levels. In addition, TSPO expression was epigenetically regulated at one or more of the identified GC boxes. Disruption of the sequence downstream of the main start site of TSPO differentially regulated TSPO promoter activity in MCF-7 and MDA-MB-231 cells, indicating that essential elements contribute to its differential expression in these cells. Taken together, these experiments constitute the first in-depth functional analysis of the human TSPO gene promoter and its transcriptional regulation.
Collapse
Affiliation(s)
- Amani Batarseh
- The Research Institute of the McGill University Health Centre and the Department of Medicine, McGill University, Montreal, Quebec, Canada H3G 1A4.
| | | | | | | |
Collapse
|
4
|
Genome wide analysis and comparative docking studies of new diaryl furan derivatives against human cyclooxygenase-2, lipoxygenase, thromboxane synthase and prostacyclin synthase enzymes involved in inflammatory pathway. J Mol Graph Model 2009; 28:313-29. [DOI: 10.1016/j.jmgm.2009.08.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2009] [Revised: 08/19/2009] [Accepted: 08/20/2009] [Indexed: 11/21/2022]
|
5
|
Okamura K, Wintle RF, Scherer SW. Characterization of the differentially methylated region of the Impact gene that exhibits Glires-specific imprinting. Genome Biol 2008; 9:R160. [PMID: 19014519 PMCID: PMC2614492 DOI: 10.1186/gb-2008-9-11-r160] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2008] [Accepted: 11/13/2008] [Indexed: 11/29/2022] Open
Abstract
Comparative genomic analysis of the Impact locus, which is imprinted in Glires but not in other mammals, reveals features required for genomic imprinting. Background Imprinted genes are exclusively expressed from one of the two parental alleles in a parent-of-origin-specific manner. In mammals, nearly 100 genes are documented to be imprinted. To understand the mechanism behind this gene regulation and to identify novel imprinted genes, common features of DNA sequences have been analyzed; however, the general features required for genomic imprinting have not yet been identified, possibly due to variability in underlying molecular mechanisms from locus to locus. Results We performed a thorough comparative genomic analysis of a single locus, Impact, which is imprinted only in Glires (rodents and lagomorphs). The fact that Glires and primates diverged from each other as recent as 70 million years ago makes comparisons between imprinted and non-imprinted orthologues relatively reliable. In species from the Glires clade, Impact bears a differentially methylated region, whereby the maternal allele is hypermethylated. Analysis of this region demonstrated that imprinting was not associated with the presence of direct tandem repeats nor with CpG dinucleotide density. In contrast, a CpG periodicity of 8 bp was observed in this region in species of the Glires clade compared to those of carnivores, artiodactyls, and primates. Conclusions We show that tandem repeats are dispensable, establishment of the differentially methylated region does not rely on G+C content and CpG density, and the CpG periodicity of 8 bp is meaningful to the imprinting. This interval has recently been reported to be optimal for de novo methylation by the Dnmt3a-Dnmt3L complex, suggesting its importance in the establishment of imprinting in Impact and other genes.
Collapse
Affiliation(s)
- Kohji Okamura
- The Centre for Applied Genomics, Program in Genetics and Genome Biology, The Hospital for Sick Children, MaRS Centre TMDT, 101 College Street, Toronto, Ontario M5G 1L7, Canada.
| | | | | |
Collapse
|
6
|
Satou Y, Mineta K, Ogasawara M, Sasakura Y, Shoguchi E, Ueno K, Yamada L, Matsumoto J, Wasserscheid J, Dewar K, Wiley GB, Macmil SL, Roe BA, Zeller RW, Hastings KEM, Lemaire P, Lindquist E, Endo T, Hotta K, Inaba K. Improved genome assembly and evidence-based global gene model set for the chordate Ciona intestinalis: new insight into intron and operon populations. Genome Biol 2008; 9:R152. [PMID: 18854010 PMCID: PMC2760879 DOI: 10.1186/gb-2008-9-10-r152] [Citation(s) in RCA: 178] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2008] [Revised: 10/06/2008] [Accepted: 10/14/2008] [Indexed: 11/10/2022] Open
Abstract
An improved assembly of the Ciona intestinalis genome reveals that it contains non-canonical introns and that about 20% of Ciona genes reside in operons. Background The draft genome sequence of the ascidian Ciona intestinalis, along with associated gene models, has been a valuable research resource. However, recently accumulated expressed sequence tag (EST)/cDNA data have revealed numerous inconsistencies with the gene models due in part to intrinsic limitations in gene prediction programs and in part to the fragmented nature of the assembly. Results We have prepared a less-fragmented assembly on the basis of scaffold-joining guided by paired-end EST and bacterial artificial chromosome (BAC) sequences, and BAC chromosomal in situ hybridization data. The new assembly (115.2 Mb) is similar in length to the initial assembly (116.7 Mb) but contains 1,272 (approximately 50%) fewer scaffolds. The largest scaffold in the new assembly incorporates 95 initial-assembly scaffolds. In conjunction with the new assembly, we have prepared a greatly improved global gene model set strictly correlated with the extensive currently available EST data. The total gene number (15,254) is similar to that of the initial set (15,582), but the new set includes 3,330 models at genomic sites where none were present in the initial set, and 1,779 models that represent fusions of multiple previously incomplete models. In approximately half, 5'-ends were precisely mapped using 5'-full-length ESTs, an important refinement even in otherwise unchanged models. Conclusion Using these new resources, we identify a population of non-canonical (non-GT-AG) introns and also find that approximately 20% of Ciona genes reside in operons and that operons contain a high proportion of single-exon genes. Thus, the present dataset provides an opportunity to analyze the Ciona genome much more precisely than ever.
Collapse
Affiliation(s)
- Yutaka Satou
- Department of Zoology, Graduate School of Science, Kyoto University, Sakyo, Kyoto, Japan.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
An artificial neural network method for combining gene prediction based on equitable weights. Neurocomputing 2008. [DOI: 10.1016/j.neucom.2007.07.019] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
8
|
Thys M, Van Den Bogaert K, Iliadou V, Vanderstraeten K, Dieltjens N, Schrauwen I, Chen W, Eleftheriades N, Grigoriadou M, Pauw RJ, Cremers CRWJ, Smith RJH, Petersen MB, Van Camp G. A seventh locus for otosclerosis, OTSC7, maps to chromosome 6q13–16.1. Eur J Hum Genet 2007; 15:362-8. [PMID: 17213839 DOI: 10.1038/sj.ejhg.5201761] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Otosclerosis is a common form of hearing impairment among white adults with a prevalence of 0.3-0.4%. It is caused by abnormal bone homeostasis of the otic capsule that compromises free motion of the stapes in the oval window. Otosclerosis is in most patients a multifactorial disease, caused by both genetic and environmental factors. In some cases, the disease is inherited as a monogenic autosomal dominant trait, sometimes with reduced penetrance. However, families large enough for genetic linkage studies are extremely rare. To date, five loci (OTSC1-5) have been reported, but none of the responsible genes have been cloned yet. An additional locus, OTSC6, has been reported to the HUGO nomenclature committee but the relevant linkage study has not been published. In this study, a genome-wide linkage study was performed in a large Greek pedigree segregating autosomal dominant otosclerosis. A seventh locus, OTSC7, was localized on chromosome 6q13-16.1 with a multipoint LOD score of 7.5 in the 13.47 cM region defined by markers D6S1036 (centromeric) and D6S300 (telomeric). Linkage analysis of this new locus in 13 smaller Belgian and Dutch families has identified one family from The Netherlands in which allele segregation suggests linkage to this region. The overlap between the critical regions of these two families is a 1.06 Mb interval between the genetic markers D6S1036 (centromeric) and D6S406 (telomeric) on chromosome 6q13.
Collapse
Affiliation(s)
- Melissa Thys
- Department of Medical Genetics, University of Antwerp, Universiteitsplein 1, Antwerp, Belgium
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, Schein J, Sterck L, Aerts A, Bhalerao RR, Bhalerao RP, Blaudez D, Boerjan W, Brun A, Brunner A, Busov V, Campbell M, Carlson J, Chalot M, Chapman J, Chen GL, Cooper D, Coutinho PM, Couturier J, Covert S, Cronk Q, Cunningham R, Davis J, Degroeve S, Déjardin A, Depamphilis C, Detter J, Dirks B, Dubchak I, Duplessis S, Ehlting J, Ellis B, Gendler K, Goodstein D, Gribskov M, Grimwood J, Groover A, Gunter L, Hamberger B, Heinze B, Helariutta Y, Henrissat B, Holligan D, Holt R, Huang W, Islam-Faridi N, Jones S, Jones-Rhoades M, Jorgensen R, Joshi C, Kangasjärvi J, Karlsson J, Kelleher C, Kirkpatrick R, Kirst M, Kohler A, Kalluri U, Larimer F, Leebens-Mack J, Leplé JC, Locascio P, Lou Y, Lucas S, Martin F, Montanini B, Napoli C, Nelson DR, Nelson C, Nieminen K, Nilsson O, Pereda V, Peter G, Philippe R, Pilate G, Poliakov A, Razumovskaya J, Richardson P, Rinaldi C, Ritland K, Rouzé P, Ryaboy D, Schmutz J, Schrader J, Segerman B, Shin H, Siddiqui A, Sterky F, Terry A, Tsai CJ, Uberbacher E, Unneberg P, Vahala J, Wall K, Wessler S, Yang G, Yin T, Douglas C, Marra M, Sandberg G, Van de Peer Y, Rokhsar D. The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 2006; 313:1596-604. [PMID: 16973872 DOI: 10.1126/science.1128691] [Citation(s) in RCA: 2575] [Impact Index Per Article: 143.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
We report the draft genome of the black cottonwood tree, Populus trichocarpa. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs of duplicated genes from that event survived in the Populus genome. A second, older duplication event is indistinguishably coincident with the divergence of the Populus and Arabidopsis lineages. Nucleotide substitution, tandem gene duplication, and gross chromosomal rearrangement appear to proceed substantially more slowly in Populus than in Arabidopsis. Populus has more protein-coding genes than Arabidopsis, ranging on average from 1.4 to 1.6 putative Populus homologs for each Arabidopsis gene. However, the relative frequency of protein domains in the two genomes is similar. Overrepresented exceptions in Populus include genes associated with lignocellulosic wall biosynthesis, meristem development, disease resistance, and metabolite transport.
Collapse
Affiliation(s)
- G A Tuskan
- Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Klemsdal SS, Clarke JL, Hoell IA, Eijsink VGH, Brurberg MB. Molecular cloning, characterization, and expression studies of a novel chitinase gene (ech30) from the mycoparasite Trichoderma atroviride strain P1. FEMS Microbiol Lett 2006; 256:282-9. [PMID: 16499618 DOI: 10.1111/j.1574-6968.2006.00132.x] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
We describe the cloning and characterization of a single copy gene from Trichoderma atroviride P1 encoding a novel 30 kDa chitinase, Ech30. Ech30 is a family 18 chitinase showing low sequence similarity to other Trichoderma chitinases. Real-time quantitative RT-PCR studies revealed that expression of the ech30 gene was induced by the presence of Botrytis cinerea in plate confrontation assays, but hardly by chitin in liquid cultures. Studies of Ech30 purified from an Escherichia coli strain overexpressing the ech30 gene devoid of the leader sequence and a predicted intron, showed that the gene encodes an active chitinase, which, as expected for family 18 chitinases, is inhibited by allosamidin.
Collapse
Affiliation(s)
- Sonja S Klemsdal
- Department of Plant Pathology, Plant Protection Centre, The Norwegian Crop Research Institute, Hoegskoleveien, Norway.
| | | | | | | | | |
Collapse
|
11
|
Juszczynski P, Kutok JL, Li C, Mitra J, Aguiar RCT, Shipp MA. BAL1 and BBAP are regulated by a gamma interferon-responsive bidirectional promoter and are overexpressed in diffuse large B-cell lymphomas with a prominent inflammatory infiltrate. Mol Cell Biol 2006; 26:5348-59. [PMID: 16809771 PMCID: PMC1592708 DOI: 10.1128/mcb.02351-05] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
BAL1 is a transcription modulator that is overexpressed in chemoresistant, diffuse large B-cell lymphomas (DLBCLs). BAL1 complexes with a recently described DELTEX family member termed BBAP. Herein, we characterized BAL1 and BBAP expression in primary DLBCL subtypes defined by their comprehensive transcriptional profiles. BAL1 and BBAP were most abundant in lymphomas with a brisk host inflammatory response, designated host response (HR) tumors. Although these DLBCLs include significant numbers of tumor-infiltrating lymphocytes and interdigitating dendritic cells, BAL1 and BBAP were expressed primarily by malignant B cells, prompting speculation that the genes might be induced by host-derived inflammatory mediators such as gamma interferon (IFN-gamma). In fact, IFN-gamma induced BAL1 and BBAP expression in DLBCL cell lines; doxycycline-induced BAL1 also increased the expression of multiple IFN-stimulated genes, directly implicating BAL1 in an IFN signaling pathway. We show that BAL1 and BBAP are located on chromosome 3q21 in a head-to-head orientation and are regulated by a IFN-gamma-responsive bidirectional promoter. BBAP regulates the subcellular localization of BAL1 by a dynamic shuttling mechanism, highlighting the functional requirement for coordinated BBAP and BAL1 expression. IFN-gamma-induced BAL1/BBAP expression contributes to the molecular signature of HR DLBCLs and highlights the interplay between the inflammatory infiltrate and malignant B cells in these tumors.
Collapse
MESH Headings
- Base Sequence
- Cell Line, Tumor
- Chromosomes, Human, Pair 3/genetics
- Gene Expression Regulation, Neoplastic/drug effects
- Humans
- Interferon-gamma/metabolism
- Interferon-gamma/pharmacology
- Janus Kinase 2
- Lymphoma, B-Cell/genetics
- Lymphoma, B-Cell/immunology
- Lymphoma, B-Cell/pathology
- Lymphoma, Large B-Cell, Diffuse/genetics
- Lymphoma, Large B-Cell, Diffuse/immunology
- Lymphoma, Large B-Cell, Diffuse/pathology
- Mutation
- Neoplasm Proteins/genetics
- Poly(ADP-ribose) Polymerases
- Promoter Regions, Genetic
- Protein-Tyrosine Kinases/metabolism
- Proto-Oncogene Proteins/metabolism
- RNA Interference
- RNA, Small Interfering/genetics
- Recombinant Proteins
- Subcellular Fractions/metabolism
- Ubiquitin-Protein Ligases/antagonists & inhibitors
- Ubiquitin-Protein Ligases/genetics
Collapse
|
12
|
Vanden Wymelenberg A, Minges P, Sabat G, Martinez D, Aerts A, Salamov A, Grigoriev I, Shapiro H, Putnam N, Belinky P, Dosoretz C, Gaskell J, Kersten P, Cullen D. Computational analysis of the Phanerochaete chrysosporium v2.0 genome database and mass spectrometry identification of peptides in ligninolytic cultures reveal complex mixtures of secreted proteins. Fungal Genet Biol 2006; 43:343-56. [PMID: 16524749 DOI: 10.1016/j.fgb.2006.01.003] [Citation(s) in RCA: 114] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2005] [Revised: 12/30/2005] [Accepted: 01/11/2006] [Indexed: 11/28/2022]
Abstract
The white-rot basidiomycete Phanerochaete chrysosporium employs extracellular enzymes to completely degrade the major polymers of wood: cellulose, hemicellulose, and lignin. Analysis of a total of 10,048 v2.1 gene models predicts 769 secreted proteins, a substantial increase over the 268 models identified in the earlier database (v1.0). Within the v2.1 'computational secretome,' 43% showed no significant similarity to known proteins, but were structurally related to other hypothetical protein sequences. In contrast, 53% showed significant similarity to known protein sequences including 87 models assigned to 33 glycoside hydrolase families and 52 sequences distributed among 13 peptidase families. When grown under standard ligninolytic conditions, peptides corresponding to 11 peptidase genes were identified in culture filtrates by mass spectrometry (LS-MS/MS). Five peptidases were members of a large family of aspartyl proteases, many of which were localized to gene clusters. Consistent with a role in dephosphorylation of lignin peroxidase, a mannose-6-phosphatase (M6Pase) was also identified in carbon-starved cultures. Beyond proteases and M6Pase, 28 specific gene products were identified including several representatives of gene families. These included 4 lignin peroxidases, 3 lipases, 2 carboxylesterases, and 8 glycosyl hydrolases. The results underscore the rich genetic diversity and complexity of P. chrysosporium's extracellular enzyme systems.
Collapse
|
13
|
Churbanov A, Pauley M, Quest D, Ali H. A method of precise mRNA/DNA homology-based gene structure prediction. BMC Bioinformatics 2005; 6:261. [PMID: 16242044 PMCID: PMC1274302 DOI: 10.1186/1471-2105-6-261] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2005] [Accepted: 10/21/2005] [Indexed: 11/29/2022] Open
Abstract
Background Accurate and automatic gene finding and structural prediction is a common problem in bioinformatics, and applications need to be capable of handling non-canonical splice sites, micro-exons and partial gene structure predictions that span across several genomic clones. Results We present a mRNA/DNA homology based gene structure prediction tool, GIGOgene. We use a new affine gap penalty splice-enhanced global alignment algorithm running in linear memory for a high quality annotation of splice sites. Our tool includes a novel algorithm to assemble partial gene structure predictions using interval graphs. GIGOgene exhibited a sensitivity of 99.08% and a specificity of 99.98% on the Genie learning set, and demonstrated a higher quality of gene structural prediction when compared to Sim4, est2genome, Spidey, Galahad and BLAT, including when genes contained micro-exons and non-canonical splice sites. GIGOgene showed an acceptable loss of prediction quality when confronted with a noisy Genie learning set simulating ESTs. Conclusion GIGOgene shows a higher quality of gene structure prediction for mRNA/DNA spliced alignment when compared to other available tools.
Collapse
Affiliation(s)
- Alexander Churbanov
- Department of Computer Science, College of Information Science and Technology, University of Nebraska at Omaha, Omaha, NE 68182-0116, USA
| | - Mark Pauley
- Department of Computer Science, College of Information Science and Technology, University of Nebraska at Omaha, Omaha, NE 68182-0116, USA
| | - Daniel Quest
- Department of Computer Science, College of Information Science and Technology, University of Nebraska at Omaha, Omaha, NE 68182-0116, USA
| | - Hesham Ali
- Department of Computer Science, College of Information Science and Technology, University of Nebraska at Omaha, Omaha, NE 68182-0116, USA
| |
Collapse
|
14
|
Wymelenberg AV, Sabat G, Martinez D, Rajangam AS, Teeri TT, Gaskell J, Kersten PJ, Cullen D. The Phanerochaete chrysosporium secretome: Database predictions and initial mass spectrometry peptide identifications in cellulose-grown medium. J Biotechnol 2005; 118:17-34. [PMID: 15888348 DOI: 10.1016/j.jbiotec.2005.03.010] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2004] [Revised: 03/03/2005] [Accepted: 03/26/2005] [Indexed: 11/28/2022]
Abstract
The white rot basidiomycete, Phanerochaete chrysosporium, employs an array of extracellular enzymes to completely degrade the major polymers of wood: cellulose, hemicellulose and lignin. Towards the identification of participating enzymes, 268 likely secreted proteins were predicted using SignalP and TargetP algorithms. To assess the reliability of secretome predictions and to evaluate the usefulness of the current database, we performed shotgun LC-MS/MS on cultures grown on standard cellulose-containing medium. A total of 182 unique peptide sequences were matched to 50 specific genes, of which 24 were among the secretome subset. Underscoring the rich genetic diversity of P. chrysosporium, identifications included 32 glycosyl hydrolases. Functionally interconnected enzyme groups were recognized. For example, the multiple endoglucanases and processive exocellobiohydrolases observed quite probably attack cellulose in a synergistic manner. In addition, a hemicellulolytic system included endoxylanases, alpha-galactosidase, acetyl xylan esterase, and alpha-l-arabinofuranosidase. Glucose and cellobiose metabolism likely involves cellobiose dehydrogenase, glucose oxidase, and various inverting glycoside hydrolases, all perhaps enhanced by an epimerase. To evaluate the completeness of the current database, mass spectroscopy analysis was performed on a larger and more inclusive dataset containing all possible ORFs. This allowed identification of a previously undetected hypothetical protein and a putative acid phosphatase. The expression of several genes was supported by RT-PCR amplification of their cDNAs.
Collapse
|
15
|
Turton JPG, Mehta A, Raza J, Woods KS, Tiulpakov A, Cassar J, Chong K, Thomas PQ, Eunice M, Ammini AC, Bouloux PM, Starzyk J, Hindmarsh PC, Dattani MT. Mutations within the transcription factor PROP1 are rare in a cohort of patients with sporadic combined pituitary hormone deficiency (CPHD). Clin Endocrinol (Oxf) 2005; 63:10-8. [PMID: 15963055 DOI: 10.1111/j.1365-2265.2005.02291.x] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
OBJECTIVE Mutations within the pituitary-specific paired-like homeobox gene PROP1 have been described in 50-100% of patients with familial combined pituitary hormone deficiency (CPHD). We screened a cohort of sporadic (n = 189) and familial (n = 44) patients with hypopituitarism (153 CPHD and 80 isolated hormone deficiencies) for mutations within the coding sequence of PROP1. DESIGN AND PATIENTS Patients with congenital hypopituitarism were recruited from the London Centre for Paediatric Endocrinology as well as several national and international centres. The pituitary phenotype ranged from isolated growth hormone deficiency (IGHD) to panhypopituitarism. Clinical data, including endocrine and neuro-radiological studies were obtained from patient records, and DNA was collected and screened for mutations within PROP1 using PCR and single-stranded conformation polymorphism (SSCP) analysis. Positive results on SSCP were sequenced directly. RESULTS The prevalence of PROP1 mutations in unselected sporadic cases of hypopituitarism was lower (1.1%) than in familial cases (29.5%). PROP1 mutations can be associated with a highly variable phenotype, and both pituitary hypoplasia and pituitary hyperplasia. We describe the waxing and waning of a pituitary mass over 20 months in association with a PROP1 mutation that is predicted to lead to complete loss of function. Additionally, we have identified a possible founder mutation in CPHD patients from the Indian subcontinent. CONCLUSIONS PROP1 mutations are rare in sporadic cases of CPHD, although the prevalence rises if there is a positive family history or if the patients are carefully selected with respect to the endocrine and neuroradiological phenotype. There is considerable phenotypic variability in families with the same mutation, indicating the role of other genetic or environmental factors on phenotypic expression. Finally, the pituitary enlargement that is observed in patients with PROP1 mutations can wax and wane in size before eventual involution.
Collapse
Affiliation(s)
- James P G Turton
- Biochemistry, Endocrinology and Metabolism Unit, University College London, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Foissac S, Schiex T. Integrating alternative splicing detection into gene prediction. BMC Bioinformatics 2005; 6:25. [PMID: 15705189 PMCID: PMC550657 DOI: 10.1186/1471-2105-6-25] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2004] [Accepted: 02/10/2005] [Indexed: 11/24/2022] Open
Abstract
Background Alternative splicing (AS) is now considered as a major actor in transcriptome/proteome diversity and it cannot be neglected in the annotation process of a new genome. Despite considerable progresses in term of accuracy in computational gene prediction, the ability to reliably predict AS variants when there is local experimental evidence of it remains an open challenge for gene finders. Results We have used a new integrative approach that allows to incorporate AS detection into ab initio gene prediction. This method relies on the analysis of genomically aligned transcript sequences (ESTs and/or cDNAs), and has been implemented in the dynamic programming algorithm of the graph-based gene finder EuGÈNE. Given a genomic sequence and a set of aligned transcripts, this new version identifies the set of transcripts carrying evidence of alternative splicing events, and provides, in addition to the classical optimal gene prediction, alternative optimal predictions (among those which are consistent with the AS events detected). This allows for multiple annotations of a single gene in a way such that each predicted variant is supported by a transcript evidence (but not necessarily with a full-length coverage). Conclusions This automatic combination of experimental data analysis and ab initio gene finding offers an ideal integration of alternatively spliced gene prediction inside a single annotation pipeline.
Collapse
MESH Headings
- Algorithms
- Alternative Splicing
- Arabidopsis/genetics
- Codon
- Computer Graphics
- DNA, Complementary/metabolism
- Databases, Genetic
- Databases, Nucleic Acid
- Databases, Protein
- Exons
- Expressed Sequence Tags
- Gene Expression Profiling
- Genes, Plant
- Genome
- Genome, Human
- Genomics
- Humans
- Introns
- Models, Genetic
- Proteomics/methods
- RNA Splice Sites
- Sequence Alignment
- Sequence Analysis, Protein
- Sequence Analysis, RNA
- Software
- Transcription, Genetic
- User-Computer Interface
Collapse
Affiliation(s)
- Sylvain Foissac
- Unité de Biométrie et Intelligence Artificielle, INRA, 31326 Castanet Tolosan, France
| | - Thomas Schiex
- Unité de Biométrie et Intelligence Artificielle, INRA, 31326 Castanet Tolosan, France
| |
Collapse
|
17
|
Yao H, Guo L, Fu Y, Borsuk LA, Wen TJ, Skibbe DS, Cui X, Scheffler BE, Cao J, Emrich SJ, Ashlock DA, Schnable PS. Evaluation of five ab initio gene prediction programs for the discovery of maize genes. PLANT MOLECULAR BIOLOGY 2005; 57:445-60. [PMID: 15830133 DOI: 10.1007/s11103-005-0271-1] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2004] [Accepted: 01/06/2005] [Indexed: 05/20/2023]
Abstract
Five ab initio programs (FGENESH, GeneMark.hmm, GENSCAN, GlimmerR and Grail) were evaluated for their accuracy in predicting maize genes. Two of these programs, GeneMark.hmm and GENSCAN had been trained for maize; FGENESH had been trained for monocots (including maize), and the others had been trained for rice or Arabidopsis. Initial evaluations were conducted using eight maize genes (gl8a, pdc2, pdc3, rf2c, rf2d, rf2e1, rth1, and rth3) of which the sequences were not released to the public prior to conducting this evaluation. The significant advantage of this data set for this evaluation is that these genes could not have been included in the training sets of the prediction programs. FGENESH yielded the most accurate and GeneMark.hmm the second most accurate predictions. The five programs were used in conjunction with RT-PCR to identify and establish the structures of two new genes in the a1-sh2 interval of the maize genome. FGENESH, GeneMark.hmm and GENSCAN were tested on a larger data set consisting of maize assembled genomic islands (MAGIs) that had been aligned to ESTs. FGENESH, GeneMark.hmm and GENSCAN correctly predicted gene models in 773, 625, and 371 MAGIs, respectively, out of the 1353 MAGIs that comprise data set 2.
Collapse
Affiliation(s)
- Hong Yao
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, 50011-3650, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Perincheri S, Dingle RWC, Peterson ML, Spear BT. Hereditary persistence of alpha-fetoprotein and H19 expression in liver of BALB/cJ mice is due to a retrovirus insertion in the Zhx2 gene. Proc Natl Acad Sci U S A 2005; 102:396-401. [PMID: 15626755 PMCID: PMC544306 DOI: 10.1073/pnas.0408555102] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2004] [Indexed: 11/18/2022] Open
Abstract
The alpha-fetoprotein (AFP) and H19 genes are transcribed at high levels in the mammalian fetal liver but are rapidly repressed postnatally. This repression in the liver is controlled, at least in part, by the Afr1 gene. Afr1 was defined >25 years ago when BALB/cJ mice were found to have 5- to 20-fold higher adult serum AFP levels compared with all other mouse strains; subsequent studies showed that this elevation was due to higher Afp expression in the liver. H19, which has become a model for genomic imprinting, was identified initially in a screen for Afr1-regulated genes. The BALB/cJ allele (Afr1(b)) is recessive to the wild-type allele (Afr1(a)), consistent with the idea that Afr1 functions as a repressor. By high-resolution mapping, we identified a gene that maps to the Afr1 interval on chromosome 15 and encodes a putative zinc fingers and homeoboxes (ZHX) protein. In BALB/cJ mice, this gene contains a murine endogenous retrovirus within its first intron and produces predominantly an aberrant transcript that no longer encodes a functional protein. Liver-specific overexpression of a Zhx2 transgene restores wild-type H19 repression on a BALB/cJ background, confirming that this gene is responsible for hereditary persistence of Afp and H19 in the livers of BALB/cJ mice. Thus we have identified a genetically defined transcription factor that is involved in developmental gene silencing in mammals. We present a model to explain the liver-specific phenotype in BALB/cJ mice, even though Afr1 is a ubiquitously expressed gene.
Collapse
Affiliation(s)
- Sudhir Perincheri
- Department of Microbiology, Immunology, and Molecular Genetics and Markey Cancer Center, University of Kentucky College of Medicine, Lexington, KY 40536
| | | | | | | |
Collapse
|
19
|
Miska KB, Wright AM, Lundgren R, Sasaki-McClees R, Osterman A, Gale JM, Miller RD. Analysis of a marsupial MHC region containing two recently duplicated class I loci. Mamm Genome 2004; 15:851-64. [PMID: 15520888 DOI: 10.1007/s00335-004-2224-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2002] [Accepted: 05/24/2004] [Indexed: 11/26/2022]
Abstract
A 37-kb cosmid containing two complete major histocompatibility complex (MHC) class I alpha chain loci from the opossum Monodelphis domestica was isolated, fully sequenced, and characterized. This sequence represents the largest contiguous genomic sequence reported for the MHC region of a nonplacental mammal. Based on particular conserved amino acid residues, and limited expression analyses, the two MHC-I loci, designated ModoUB and ModoUC, appear to encode functional MHC-I molecules. The two coding regions are 98% identical at the nucleotide level; however, their promoter regions differ significantly. Two CpG islands present in the cosmid sequence correspond to the two coding regions. Twelve microsatellites and six retroelements were also present in the cosmid. The retroelements share highest sequence homology to the CORE-SINE family of retroelements. Due to high sequence identity, it is very likely that ModoUB and ModoUC loci are products of recent gene duplication that occurred less than 4 million years ago.
Collapse
Affiliation(s)
- Katarzyna B Miska
- Department of Biology, The University of New Mexico, Albuquerque, New Mexico 87131, USA
| | | | | | | | | | | | | |
Collapse
|
20
|
Schadt EE, Edwards SW, GuhaThakurta D, Holder D, Ying L, Svetnik V, Leonardson A, Hart KW, Russell A, Li G, Cavet G, Castle J, McDonagh P, Kan Z, Chen R, Kasarskis A, Margarint M, Caceres RM, Johnson JM, Armour CD, Garrett-Engele PW, Tsinoremas NF, Shoemaker DD. A comprehensive transcript index of the human genome generated using microarrays and computational approaches. Genome Biol 2004; 5:R73. [PMID: 15461792 PMCID: PMC545593 DOI: 10.1186/gb-2004-5-10-r73] [Citation(s) in RCA: 82] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2004] [Revised: 07/07/2004] [Accepted: 08/16/2004] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Computational and microarray-based experimental approaches were used to generate a comprehensive transcript index for the human genome. Oligonucleotide probes designed from approximately 50,000 known and predicted transcript sequences from the human genome were used to survey transcription from a diverse set of 60 tissues and cell lines using ink-jet microarrays. Further, expression activity over at least six conditions was more generally assessed using genomic tiling arrays consisting of probes tiled through a repeat-masked version of the genomic sequence making up chromosomes 20 and 22. RESULTS The combination of microarray data with extensive genome annotations resulted in a set of 28,456 experimentally supported transcripts. This set of high-confidence transcripts represents the first experimentally driven annotation of the human genome. In addition, the results from genomic tiling suggest that a large amount of transcription exists outside of annotated regions of the genome and serves as an example of how this activity could be measured on a genome-wide scale. CONCLUSIONS These data represent one of the most comprehensive assessments of transcriptional activity in the human genome and provide an atlas of human gene expression over a unique set of gene predictions. Before the annotation of the human genome is considered complete, however, the previously unannotated transcriptional activity throughout the genome must be fully characterized.
Collapse
Affiliation(s)
- Eric E Schadt
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Stephen W Edwards
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | | | - Dan Holder
- Merck Research Laboratories, W42-213 Sumneytown Pike, POB 4, Westpoint, PA 19846, USA
| | - Lisa Ying
- Merck Research Laboratories, W42-213 Sumneytown Pike, POB 4, Westpoint, PA 19846, USA
| | - Vladimir Svetnik
- Merck Research Laboratories, W42-213 Sumneytown Pike, POB 4, Westpoint, PA 19846, USA
| | - Amy Leonardson
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Kyle W Hart
- Rally Scientific, 41 Fayette Street, Suite 1, Watertown, MA 02472, USA
| | - Archie Russell
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Guoya Li
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Guy Cavet
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - John Castle
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Paul McDonagh
- Amgen Inc, 1201 Amgen Court W, Seattle, WA 98119, USA
| | - Zhengyan Kan
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Ronghua Chen
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Andrew Kasarskis
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Mihai Margarint
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Ramon M Caceres
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Jason M Johnson
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | | | | | | | - Daniel D Shoemaker
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| |
Collapse
|
21
|
Martinez D, Larrondo LF, Putnam N, Gelpke MDS, Huang K, Chapman J, Helfenbein KG, Ramaiya P, Detter JC, Larimer F, Coutinho PM, Henrissat B, Berka R, Cullen D, Rokhsar D. Genome sequence of the lignocellulose degrading fungus Phanerochaete chrysosporium strain RP78. Nat Biotechnol 2004; 22:695-700. [PMID: 15122302 DOI: 10.1038/nbt967] [Citation(s) in RCA: 561] [Impact Index Per Article: 28.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2003] [Accepted: 03/16/2004] [Indexed: 11/08/2022]
Abstract
White rot fungi efficiently degrade lignin, a complex aromatic polymer in wood that is among the most abundant natural materials on earth. These fungi use extracellular oxidative enzymes that are also able to transform related aromatic compounds found in explosive contaminants, pesticides and toxic waste. We have sequenced the 30-million base-pair genome of Phanerochaete chrysosporium strain RP78 using a whole genome shotgun approach. The P. chrysosporium genome reveals an impressive array of genes encoding secreted oxidases, peroxidases and hydrolytic enzymes that cooperate in wood decay. Analysis of the genome data will enhance our understanding of lignocellulose degradation, a pivotal process in the global carbon cycle, and provide a framework for further development of bioprocesses for biomass utilization, organopollutant degradation and fiber bleaching. This genome provides a high quality draft sequence of a basidiomycete, a major fungal phylum that includes important plant and animal pathogens.
Collapse
Affiliation(s)
- Diego Martinez
- US DoE Joint Genome Institute, Walnut Creek, California 94598, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Xu Z, Liang L, Wang H, Li T, Zhao M. HCRP1, a novel gene that is downregulated in hepatocellular carcinoma, encodes a growth-inhibitory protein. Biochem Biophys Res Commun 2004; 311:1057-66. [PMID: 14623289 DOI: 10.1016/j.bbrc.2003.10.109] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
One of the most frequent allelic deletions in hepatocellular carcinoma (HCC) has been found at chromosome 8p21-23. We reported here the identification and characterization of a novel gene for a hepatocellular carcinoma related protein 1 (HCRP1) localized at 8p22, which was isolated by positional candidate cloning. The expression of the gene for HCRP1 was most abundant in normal human liver tissue and significantly reduced or undetected in HCC tissues. The analysis of subcellular distribution showed that HCRP1 diffused in the cytoplasm with a significant fraction accumulated in the nuclei. After introduction of the sense and antisense cDNA of HCRP1 into HCC cell line SMMC-7721, we observed that the overexpression of HCRP1 significantly inhibited both anchorage-dependent and anchorage-independent cell growth in vitro. Using the transgenic short hairpin RNA (shRNA) to knock down the expression of HCRP1 gene in the other HCC cell line BEL-7404 resulted in the cell growth greatly enhanced. Moreover, reduction of the HCRP1 gene expression could also elevate the invasive ability of BEL-7404 cells. Our results strongly suggest that HCRP1 might be a growth inhibitory protein and associated with decreasing the invasion of HCC cells.
Collapse
Affiliation(s)
- Zhenhua Xu
- State Key Laboratory of Molecular Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, 200031, Shanghai, China
| | | | | | | | | |
Collapse
|
23
|
Malva C, Varricchio P, Falabella P, La Scaleia R, Graziani F, Pennacchio F. Physiological and molecular interaction in the host-parasitoid system Heliothis virescens-Toxoneuron nigriceps: current status and future perspectives. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2004; 34:177-183. [PMID: 14871614 DOI: 10.1016/j.ibmb.2003.09.008] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2002] [Revised: 09/24/2003] [Accepted: 09/24/2003] [Indexed: 05/24/2023]
Abstract
Toxoneuron nigriceps (Viereck) (Hymenoptera, Braconidae) is an endophagous parasitoid of the tobacco budworm Heliothis virescens (F.) (Lepidoptera, Noctuidae). Parasitized H. virescens larvae are developmentally arrested and show a complex array of pathological symptoms ranging from the suppression of the immune response to an alteration of ecdysone biosynthesis and metabolism. Most of these pathological syndromes are induced by the polydnavirus associated with T. nigriceps (TnBV). An overview of our recent research work on this system is described herein. The mechanisms involved in the disruption of the host hormonal balance have been further investigated, allowing to better define the physiological model previously proposed. A functional genomic approach has been undertaken to identify TnBV genes expressed in the host and to assess their role in the major parasitoid-induced pathologies. Some TnBV genes cloned so far are novel and do not show any similarity with genes already available in genomic databases, while others code for proteins having conserved domains, such as aspartic proteases and tyrosine phosphatases. Sequencing of the entire TnBV genome is in progress and will considerably contribute to the understanding of the molecular bases of parasitoid-induced host alterations.
Collapse
Affiliation(s)
- C Malva
- Istituto di Genetica e Biofisica, via P. Castellino 111, Napoli, Italy.
| | | | | | | | | | | |
Collapse
|
24
|
Affiliation(s)
- Jeferson Gross
- Universidade Federal do Rio Grande do Sul, Brazil; Universidade Federal do Rio Grande do Sul, Brazil
| | | | - Arthur Germano Fett-Neto
- Universidade Federal do Rio Grande do Sul, Brazil; Universidade Federal do Rio Grande do Sul, Brazil
| | - Janette Palma Fett
- Universidade Federal do Rio Grande do Sul, Brazil; Universidade Federal do Rio Grande do Sul, Brazil
| |
Collapse
|
25
|
Mesak FM, Osada N, Hashimoto K, Liu QY, Ng CE. Molecular cloning, genomic characterization and over-expression of a novel gene, XRRA1, identified from human colorectal cancer cell HCT116Clone2_XRR and macaque testis. BMC Genomics 2003; 4:32. [PMID: 12908878 PMCID: PMC194569 DOI: 10.1186/1471-2164-4-32] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2003] [Accepted: 08/09/2003] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND As part of our investigation into the genetic basis of tumor cell radioresponse, we have isolated several clones with a wide range of responses to X-radiation (XR) from an unirradiated human colorectal tumor cell line, HCT116. Using human cDNA microarrays, we recently identified a novel gene that was down-regulated by two-fold in an XR-resistant cell clone, HCT116Clone2_XRR. We have named this gene as X-ray radiation resistance associated 1 (XRRA1) (GenBank BK000541). Here, we present the first report on the molecular cloning, genomic characterization and over-expression of the XRRA1 gene. RESULTS We found that XRRA1 was expressed predominantly in testis of both human and macaque. cDNA microarray analysis showed three-fold higher expression of XRRA1 in macaque testis relative to other tissues. We further cloned the macaque XRRA1 cDNA (GenBank AB072776) and a human XRRA1 splice variant from HCT116Clone2_XRR (GenBank AY163836). In silico analysis revealed the full-length human XRRA1, mouse, rat and bovine Xrra1 cDNAs. The XRRA1 gene comprises 11 exons and spans 64 kb on chromosome 11q13.3. Human and macaque cDNAs share 96% homology. Human XRRA1 cDNA is 1987 nt long and encodes a protein of 559 aa. XRRA1 protein is highly conserved in human, macaque, mouse, rat, pig, and bovine. GFP-XRRA1 fusion protein was detected in both the nucleus and cytoplasm of HCT116 clones and COS-7 cells. Interestingly, we found evidence that COS-7 cells which over-expressed XRRA1 lacked Ku86 (Ku80, XRCC5), a non-homologous end joining (NHEJ) DNA repair molecule, in the nucleus. RT-PCR analysis showed differential expression of XRRA1 after XR in HCT116 clones manifesting significantly different XR responses. Further, we found that XRRA1 was expressed in most tumor cell types. Surprisingly, mouse Xrra1 was detected in mouse embryonic stem cells R1. CONCLUSIONS Both XRRA1 cDNA and protein are highly conserved among mammals, suggesting that XRRA1 may have similar functions. Our results also suggest that the genetic modulation of XRRA1 may affect the XR responses of HCT116 clones and that XRRA1 may have a role in the response of human tumor and normal cells to XR. XRRA1 might be correlated with cancer development and might also be an early expressed gene.
Collapse
Affiliation(s)
- Felix M Mesak
- Centre for Cancer Therapeutics, Ottawa Regional Cancer Centre, 503 Smyth Rd., Ottawa, ON, K1H 1C4, Canada
- Faculty of Medicine, University of Ottawa, 451 Smyth Rd., Ottawa, ON, K1H 8M5, Canada
| | - Naoki Osada
- Division of Genetic Resources, National Institute of Infectious Disease, Toyama 1-23-1, Shinjuku-ku, Tokyo 162-8640, Japan
- Current address: Department of Ecology and Evolution, University of Chicago, 1101 E57th Str., Chicago, IL 60637, USA
| | - Katsuyuki Hashimoto
- Division of Genetic Resources, National Institute of Infectious Disease, Toyama 1-23-1, Shinjuku-ku, Tokyo 162-8640, Japan
| | - Qing Y Liu
- Institute for Biological Sciences, National Research Council of Canada, 1200 Montreal Rd, M-54, Ottawa, ON, K1A 0R6, Canada
| | - Cheng E Ng
- Centre for Cancer Therapeutics, Ottawa Regional Cancer Centre, 503 Smyth Rd., Ottawa, ON, K1H 1C4, Canada
- Faculty of Medicine, University of Ottawa, 451 Smyth Rd., Ottawa, ON, K1H 8M5, Canada
| |
Collapse
|
26
|
Abstract
The availability of genome sequences is revolutionizing the field of microbiology. Genetic methods are being modified to facilitate rapid analysis at a genome-wide level and are blossoming for human pathogens that were previously considered intractable. This revolution coincided with a growing concern about the emergence of microbial drug resistance, compelling the pharmaceutical industry to search for new antimicrobial agents. The availability of the new technologies, combined with many genetic strategies, has changed the way that researchers approach antibacterial drug discovery.
Collapse
Affiliation(s)
- Lynn Miesel
- Department of Antimicrobial Therapy, Schering-Plough Research Institute, 2015 Galloping Hill Road, Kenilworth, New Jersey 07033-0530, USA.
| | | | | |
Collapse
|
27
|
Kazmierski ST, Antin PB, Witt CC, Huebner N, McElhinny AS, Labeit S, Gregorio CC. The complete mouse nebulin gene sequence and the identification of cardiac nebulin. J Mol Biol 2003; 328:835-46. [PMID: 12729758 DOI: 10.1016/s0022-2836(03)00348-6] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Nebulin is a giant (M(r) 750-850kDa), modular sarcomeric protein proposed to regulate the assembly, and to specify the precise lengths of actin (thin) filaments in vertebrate skeletal muscles. Nebulin's potential role as a molecular template is based on its structural and biochemical properties. Its central approximately 700kDa portion associates with actin along the entire length of the thin filament, its N-terminal region extends to thin filament pointed ends, and approximately 80kDa of its C-terminal region integrates within the Z-line lattice. Here, we determined the exon/intron organization of the entire mouse nebulin gene, which contains 165 exons in a 202kb segment. We identified 16 novel exons, 15 of which encode nebulin-repeat motifs (12 from its central region and 3 from its Z-line region). One novel exon shares high sequence homology to the 20 residue repeats of the tight-junction protein, ZO-1. RT-PCR analyses revealed that all 16 novel exons are expressed in mouse skeletal muscle. Surprisingly, we also amplified mRNA transcripts from mouse and human heart cDNA using primers designed along the entire length of nebulin. The expression of cardiac-specific nebulin transcripts was confirmed by in situ hybridization in fetal rat cardiomyocytes and in embryonic Xenopus laevis (frog) heart. On the protein level, antibodies specific for skeletal muscle nebulin's N and C-terminal regions stained isolated rat cardiac myofibrils at the pointed and barbed ends of thin filaments, respectively. These data indicate a conserved molecular layout of the nebulin filament systems in both cardiac and skeletal myofibrils. We propose that thin filament length regulation in cardiac and skeletal muscles may share conserved nebulin-based mechanisms, and that nebulin isoform diversity may contribute to thin filament length differences in cardiac and skeletal muscle.
Collapse
Affiliation(s)
- Steven T Kazmierski
- Department of Cell Biology and Anatomy, The University of Arizona, Tucson, AZ 85724, USA
| | | | | | | | | | | | | |
Collapse
|
28
|
Galaviz-Hernandez C, Stagg C, de Ridder G, Tanaka TS, Ko MSH, Schlessinger D, Nagaraja R. Plac8 and Plac9, novel placental-enriched genes identified through microarray analysis. Gene 2003; 309:81-9. [PMID: 12758124 DOI: 10.1016/s0378-1119(03)00508-0] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Microarray expression profiling of a collection of 15,000 mouse genes with placental and embryonic RNAs revealed candidates for placental-enriched genes, three of which we have confirmed and further characterized. One, Plac1, strongly expressed in all trophoblast-derived cells in the placenta, has been described earlier (Genomics 68 (2000) 305). Here we report that of the other two, Plac8 expression is restricted to the spongiotrophoblast layer during development, whereas Plac9 is weakly expressed though highly enriched in placenta. For both, cDNAs with complete open reading frames were recovered and exon-intron structures inferred from comparisons of mouse cDNA and genomic sequence. The predicted proteins (112 and 108 amino acids) both contain putative signal peptides, with a coiled-coil segment of mPLAC9 as the only other detected motif. Genomic sequence comparisons reveal that in addition to an apparent pseudogene on chromosome 1, Plac8 is expressed at mouse cytoband 5e3. It is tightly conserved in human in a syntenically equivalent ortholog at 4q21.23. Plac9 is present in a single copy on chromosome 14, with a syntenically equivalent human ortholog at 10q22.3. Putative promoter regions up to 10 kb 5' of the transcription units for Plac1, Plac8, and Plac9 contain sites for widely-expressed transcription factors which, by analogy to other instances, may be sufficient to explain placental enrichment.
Collapse
Affiliation(s)
- Carlos Galaviz-Hernandez
- Laboratory of Genetics, National Institute on Aging, NIH, Suite 3000, 333 Cassell Drive, Baltimore, MD 21224, USA
| | | | | | | | | | | | | |
Collapse
|
29
|
Zacchi P, Sblattero D, Florian F, Marzari R, Bradbury ARM. Selecting open reading frames from DNA. Genome Res 2003; 13:980-90. [PMID: 12727911 PMCID: PMC430925 DOI: 10.1101/gr.861503] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We describe a method to select DNA encoding functional open reading frames (ORFs) from noncoding DNA within the context of a specific vector. Phage display has been used as an example, but any system requiring DNA encoding protein fragments, for example, the yeast two-hybrid system, could be used. By cloning DNA fragments upstream of a fusion gene, consisting of the beta-lactamase gene flanked by lox recombination sites, which is, in turn, upstream of gene 3 from fd phage, only those clones containing DNA fragments encoding ORFs confer ampicillin resistance and survive. After selection, the beta-lactamase gene can be removed by Cre recombinase, leaving a standard phage display vector with ORFs fused to gene 3. This vector has been tested on a plasmid containing tissue transglutaminase. All surviving clones analyzed by sequencing were found to contain ORFs, of which 83% were localized to known genes, and at least 80% produced immunologically detectable polypeptides. Use of a specific anti-tTG monoclonal antibody allowed the identification of clones containing the correct epitope. This approach could be applicable to the efficient selection of random ORFs representing the coding potential of whole organisms, and their subsequent downstream use in a number of different systems.
Collapse
|
30
|
Rattanadechakul W, Webb BA. Characterization of Campoletis sonorensis ichnovirus unique segment B and excision locus structure. JOURNAL OF INSECT PHYSIOLOGY 2003; 49:523-532. [PMID: 12770631 DOI: 10.1016/s0022-1910(03)00053-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Polydnaviruses (PDVs) are segmented, symbiotic, double-stranded DNA viruses that are vertically transmitted as proviruses within the genomes of some parasitoid Hymenoptera. The PDV associated with the ichneumonid wasp Campoletis sonorensis (CsIV) consists of 24 non-redundant DNA segments varying in size from approximately 6 to 20 kbp. CsIV segment B, one of the smallest genome segments, was sequenced and the excision sites of the proviral segment were characterized. The segment B sequence was 83.2% non-coding with only two open reading frames (ORFs). Some non-coding sequences have similarities to database sequences and were likely pseudogenic, but most were unrelated to known nucleic acid or predicted protein sequences. One ORF, BHv0.9, encodes a member of the rep gene family and was expressed only in parasitized insects while transcription of the other ORF could not be detected. Previously, a third region of the segment was shown to hybridize to 0.6 and 1.2 kb poly A+ RNAs from female wasps during virus replication (Theilmann and Summers, 1988) but this region did not have an identifiable ORF in the determined sequence. In contrast to CsIV segment W, segment B had little repetitive sequence. The segment B proviral integration locus contains a 59 bp direct imperfect repeat. Further analyses of this integration locus demonstrated that segment B was excised from wasp genomic DNA with flanking sequences at the integration site rejoined after segment excision. The segment B "excision locus" retained one of the two copies of the 59 bp repeat sequence with the other repeat present in the excised segment. The data indicate that Ichnovirus segments have distinctive characteristics possibly reflecting functional co-evolution between the wasp and individual types of polydnavirus segments.
Collapse
|
31
|
Fernandez PC, Frank SR, Wang L, Schroeder M, Liu S, Greene J, Cocito A, Amati B. Genomic targets of the human c-Myc protein. Genes Dev 2003; 17:1115-29. [PMID: 12695333 PMCID: PMC196049 DOI: 10.1101/gad.1067003] [Citation(s) in RCA: 734] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The transcription factor Myc is induced by mitogenic signals and regulates downstream cellular responses. If overexpressed, Myc promotes malignant transformation. Myc modulates expression of diverse genes in experimental systems, but few are proven direct targets. Here, we present a large-scale screen for genomic Myc-binding sites in live human cells. We used bioinformatics to select consensus DNA elements (CACGTG or E-boxes) situated in the 5' regulatory region of genes and measured Myc binding to those sequences in vivo by quantitative chromatin immunoprecipitation. Strikingly, most promoter-associated E-boxes showed selective recovery with Myc, unlike non-E-box promoters or E-boxes in bulk genomic DNA. Promoter E-boxes were distributed in two groups bound by Myc at distinct frequencies. The high-affinity group included an estimated 11% of all cellular loci, was highly conserved among different cells, and was bound independently of Myc expression levels. Overexpressed Myc associated at increased frequency with low-affinity targets and, at extreme levels, also with other sequences, suggesting that some binding was not sequence-specific. The strongest DNA-sequence parameter defining high-affinity targets was the location of E-boxes within CpG islands, correlating with an open, preacetylated state of chromatin. Myc further enhanced histone acetylation, with or without accompanying induction of mRNA expression. Our findings point to a high regulatory and biological diversity among Myc-target genes.
Collapse
|
32
|
Liang L, Zhao M, Xu Z, Yokoyama KK, Li T. Molecular cloning and characterization of CIDE-3, a novel member of the cell-death-inducing DNA-fragmentation-factor (DFF45)-like effector family. Biochem J 2003; 370:195-203. [PMID: 12429024 PMCID: PMC1223158 DOI: 10.1042/bj20020656] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2002] [Revised: 11/08/2002] [Accepted: 11/12/2002] [Indexed: 11/17/2022]
Abstract
DNA fragmentation is one of the critical steps in apoptosis, which is induced by DNA fragmentation factor (DFF). DFF is composed of two subunits, a 40 kDa caspase-activated nuclease (DFF40) and a 45 kDa inhibitor (DFF45). Recently a novel family of cell-death-inducing DFF45-like effectors (CIDEs) has been identified. Among CIDEs, two from human (CIDE-A and CIDE-B) and three from mouse (CIDE-A, CIDE-B and FSP27) have been reported. In this study human CIDE-3, a novel member of CIDEs, was identified upon sequence analysis of a previously unidentified cDNA that encoded a protein of 238 amino acids. It was shown to be a human homologue of mouse FSP27, and shared homology with the CIDE-N and CIDE-C domains of CIDEs. Apoptosis-inducing activity was clearly shown by DNA-fragmentation assay of the nuclear DNA of CIDE-3 transfected 293T cells. The expression pattern of CIDE-3 was different from that of CIDE-B. As shown by Northern-blot analysis, CIDE-3 was expressed mainly in human small intestine, heart, colon and stomach, while CIDE-B showed strong expression in liver and small intestine and at a lower level in colon, kidney and spleen. Green-fluorescent-protein-tagged CIDE-3 was revealed in some cytosolic corpuscles. Alternative splicing of the CIDE-3 gene was also identified by reverse transcription PCR, revealing that two transcripts, CIDE-3 and CIDE-3alpha, were present in HepG2 and A375 cells. CIDE-3 comprised a full-length open reading frame with 238 amino acids; in CIDE-3alpha exon 3 was deleted and it encoded a protein of 164 amino acids. Interestingly the CIDE-3alpha isoform still kept the apoptosis-inducing activity and showed the same pattern of subcellular localization as CIDE-3. Consistent with its chromosome localization at 3p25, a region associated with high frequency loss of heterozygosity in many tumours, CIDE-3 may play an important role in prevention of tumorigenesis.
Collapse
Affiliation(s)
- Liang Liang
- State Key Laboratory of Molecular Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai 200031, China
| | | | | | | | | |
Collapse
|
33
|
Rinn JL, Euskirchen G, Bertone P, Martone R, Luscombe NM, Hartman S, Harrison PM, Nelson FK, Miller P, Gerstein M, Weissman S, Snyder M. The transcriptional activity of human Chromosome 22. Genes Dev 2003; 17:529-40. [PMID: 12600945 PMCID: PMC195998 DOI: 10.1101/gad.1055203] [Citation(s) in RCA: 237] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2002] [Accepted: 12/24/2002] [Indexed: 01/09/2023]
Abstract
A DNA microarray representing nearly all of the unique sequences of human Chromosome 22 was constructed and used to measure global-transcriptional activity in placental poly(A)(+) RNA. We found that many of the known, related and predicted genes are expressed. More importantly, our study reveals twice as many transcribed bases as have been reported previously. Many of the newly discovered expressed fragments were verified by RNA blot analysis and a novel technique called differential hybridization mapping (DHM). Interestingly, a significant fraction of these novel fragments are expressed antisense to previously annotated introns. The coding potential of these novel expressed regions is supported by their sequence conservation in the mouse genome. This study has greatly increased our understanding of the biological information encoded on a human chromosome. To facilitate the dissemination of these results to the scientific community, we have developed a comprehensive Web resource to present the findings of this study and other features of human Chromosome 22 at http://array.mbb.yale.edu/chr22.
Collapse
Affiliation(s)
- John L Rinn
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Elnitski L, Hardison RC, Li J, Yang S, Kolbe D, Eswara P, O'Connor MJ, Schwartz S, Miller W, Chiaromonte F. Distinguishing regulatory DNA from neutral sites. Genome Res 2003; 13:64-72. [PMID: 12529307 PMCID: PMC430974 DOI: 10.1101/gr.817703] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
We explore several computational approaches to analyzing interspecies genomic sequence alignments, aiming to distinguish regulatory regions from neutrally evolving DNA. Human-mouse genomic alignments were collected for three sets of human regions: (1) experimentally defined gene regulatory regions, (2) well-characterized exons (coding sequences, as a positive control), and (3) interspersed repeats thought to have inserted before the human-mouse split (a good model for neutrally evolving DNA). Models that potentially could distinguish functional noncoding sequences from neutral DNA were evaluated on these three data sets, as well as bulk genome alignments. Our analyses show that discrimination based on frequencies of individual nucleotide pairs or gaps (i.e., of possible alignment columns) is only partially successful. In contrast, scoring procedures that include the alignment context, based on frequencies of short runs of alignment columns, dramatically improve separation between regulatory and neutral features. Such scoring functions should aid in the identification of putative regulatory regions throughout the human genome.
Collapse
Affiliation(s)
- Laura Elnitski
- Departments of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Abstract
To maximize the value of genome sequences they need to be integrated with other types of biological data and with each other. The entire collection of data then needs to be made available in a way that is easy to view and mine for complex relationships. The recently determined vertebrate genome sequences of human and mouse are so large that building the infrastructure to manage these datasets is a major challenge. This article reviews the database systems and tools for analysis that have so far been developed to address this.
Collapse
Affiliation(s)
- Ewan Birney
- European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, United Kingdom.
| | | | | |
Collapse
|
36
|
Ryan EJ, Marshall AJ, Magaletti D, Floyd H, Draves KE, Olson NE, Clark EA. Dendritic cell-associated lectin-1: a novel dendritic cell-associated, C-type lectin-like molecule enhances T cell secretion of IL-4. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2002; 169:5638-48. [PMID: 12421943 DOI: 10.4049/jimmunol.169.10.5638] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
We have characterized dendritic cell (DC)-associated lectin-1 (DCAL-1), a novel, type II, transmembrane, C-type lectin-like protein. DCAL-1 has restricted expression in hemopoietic cells, in particular, DCs and B cells, but T cells and monocytes do not express it. The DCAL-1 locus is within a cluster of C-type lectin-like loci on human chromosome 12p12-13 just 3' to the CD69 locus. The consensus sequence of the DCAL-1 gene was confirmed by RACE-PCR; however, based on sequence alignment with genomic DNA and with various human expressed sequence tags, we predict that DCAL-1 has two splice variants. C-type lectins share a common sequence motif of 14 invariable and 18 highly conserved aa residues known as the carbohydrate recognition domain. DCAL-1, however, is missing three of the cysteine residues required to form the standard carbohydrate recognition domain. DCAL-1 mRNA and protein expression are increased upon the differentiation of monocytes to CD1a(+) DCs. B cells also express high levels of DCAL-1 on their cell surface. Using a DCAL-1 fusion protein we identified a population of CD4(+) CD45RA(+) T cells that express DCAL-1 ligand. Coincubation with soluble DCAL-1 enhanced the proliferation of CD4(+) T cells in response to CD3 ligation and significantly increased IL-4 secretion. In contrast, coincubation with soluble DC-specific ICAM-3-grabbing nonintegrin (CD209) fusion protein as a control had no effect on CD4(+) T cell proliferation or IL-4 and IFN-gamma secretion. Therefore, the function of DCAL-1 on DCs and B cells may act as a T cell costimulatory molecule, which skews CD4(+) T cells toward a Th2 response by enhancing their secretion of IL-4.
Collapse
MESH Headings
- Adjuvants, Immunologic/genetics
- Adjuvants, Immunologic/isolation & purification
- Adjuvants, Immunologic/metabolism
- Adjuvants, Immunologic/physiology
- Amino Acid Sequence
- Base Sequence
- Blood Cells/immunology
- Blood Cells/metabolism
- Cells, Cultured
- Chromosome Mapping
- Dendritic Cells/immunology
- Dendritic Cells/metabolism
- Gene Expression Regulation/immunology
- Humans
- Interleukin-4/metabolism
- Lectins, C-Type/genetics
- Lectins, C-Type/isolation & purification
- Lectins, C-Type/metabolism
- Lectins, C-Type/physiology
- Leukocytes, Mononuclear/immunology
- Leukocytes, Mononuclear/metabolism
- Ligands
- Lymphocyte Activation/immunology
- Lymphoid Tissue/immunology
- Lymphoid Tissue/metabolism
- Membrane Proteins/genetics
- Membrane Proteins/isolation & purification
- Membrane Proteins/metabolism
- Membrane Proteins/physiology
- Molecular Sequence Data
- Palatine Tonsil/immunology
- Palatine Tonsil/metabolism
- Protein Binding/immunology
- Protein Structure, Tertiary/genetics
- T-Lymphocytes/immunology
- T-Lymphocytes/metabolism
- Th2 Cells/immunology
Collapse
Affiliation(s)
- Elizabeth J Ryan
- Regional Primate Research Center, Box 357330, University of Washington, Seattle, WA 98195, USA.
| | | | | | | | | | | | | |
Collapse
|
37
|
Mathé C, Sagot MF, Schiex T, Rouzé P. Current methods of gene prediction, their strengths and weaknesses. Nucleic Acids Res 2002; 30:4103-17. [PMID: 12364589 PMCID: PMC140543 DOI: 10.1093/nar/gkf543] [Citation(s) in RCA: 209] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2002] [Revised: 08/07/2002] [Accepted: 08/07/2002] [Indexed: 11/14/2022] Open
Abstract
While the genomes of many organisms have been sequenced over the last few years, transforming such raw sequence data into knowledge remains a hard task. A great number of prediction programs have been developed that try to address one part of this problem, which consists of locating the genes along a genome. This paper reviews the existing approaches to predicting genes in eukaryotic genomes and underlines their intrinsic advantages and limitations. The main mathematical models and computational algorithms adopted are also briefly described and the resulting software classified according to both the method and the type of evidence used. Finally, the several difficulties and pitfalls encountered by the programs are detailed, showing that improvements are needed and that new directions must be considered.
Collapse
Affiliation(s)
- Catherine Mathé
- Institut de Pharmacologie et Biologie Structurale, UMR 5089, 205 route de Narbonne, F-31077 Toulouse Cedex, France.
| | | | | | | |
Collapse
|
38
|
Hilbricht T, Salamini F, Bartels D. CpR18, a novel SAP-domain plant transcription factor, binds to a promoter region necessary for ABA mediated expression of the CDeT27-45 gene from the resurrection plant Craterostigma plantagineum Hochst. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2002; 31:293-303. [PMID: 12164809 DOI: 10.1046/j.1365-313x.2002.01357.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
CDeT27-45 is a lea-like gene from the resurrection plant Craterostigma plantagineum (Scrophulariaceae) which is strongly expressed in vegetative tissues in response to dehydration or treatment with abscisic acid (ABA). Expression of the gene is correlated with the acquisition of desiccation tolerance. Nuclear proteins bind to a 29-bp cis-regulatory region of the promoter which is essential for transcriptional activation of the CDeT27-45 gene by ABA. Using a yeast one-hybrid screen, the cDNA clone CpR18 was isolated, which encodes a protein with specific binding activity for the cis-regulatory element in the CDeT27-45 promoter. The protein contains an acidic region, a SAP-domain, a zinc finger of the C3H-type, and two motifs which are conserved in proteins from several plant species. One of the conserved regions is rich in basic residues and is predicted to form a helix-loop-helix structure. The R18 gene shows high similarities to genomic sequences and ESTs from other plant species. The tissue-specific expression pattern of the rare R18 mRNA and the distribution of nuclear protein binding activity for the CDeT27-45 promoter fragment are compared. The R18 protein is indeed localized in the nucleus, and activates transcription of CDeT27-45 promoter-GUS fusion constructs in tobacco protoplasts. DNA blot analysis and isolation of genomic clones reveal that two copies of R18 are present in the C. plantagineum genome.
Collapse
Affiliation(s)
- Tobias Hilbricht
- Max-Planck-Institut für Züchtungsforschung, Carl-von-Linné-Weg 10, D-50829 Köln, Germany
| | | | | |
Collapse
|
39
|
Abstract
As more of the human genome draft sequence is finished, and genomes from other organisms begin to be sequenced, the demand for accurate and reliable genome annotation will increase significantly. To facilitate this industrial-scale genome annotation, automated bioinformatics solutions are increasingly required. As a result, automatic genome annotation systems have become more important in gene discovery within recent years. The design of such large-scale bioinformatics systems is an evolving and dynamic field, based on central cores of bioinformatics software tools and relational databases. Not only must these systems efficiently manage and integrate large volumes of genomic data, but they must also deliver accurate gene predictions and effectively distribute annotation data to the biosciences community.
Collapse
Affiliation(s)
- Alistair G Rust
- European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton CB101SD, Cambridge, UK
| | | | | |
Collapse
|
40
|
Abstract
The advent of whole-genome data resources--not only sequence but also other genome-scale data collections such as gene expression, protein interaction, and genetic variation--is having two marked, complementary effects on the relatively new discipline of bioinformatics. First, the veritable flood of data is creating a need and demand for new tools for dealing adequately with the deluge, and, second, the unprecedented extent, diversity, and impending completeness of the data sets are creating opportunities for new approaches to discovery based on computational methods.
Collapse
Affiliation(s)
- D B Searls
- Bioinformatics Department, SmithKline Beecham Pharmaceuticals, King of Prussia, Pennsylvania 19406, USA.
| |
Collapse
|
41
|
Aggarwal G, Ramaswamy R. Ab initio gene identification: prokaryote genome annotation with GeneScan and GLIMMER. J Biosci 2002; 27:7-14. [PMID: 11927773 DOI: 10.1007/bf02703679] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
We compare the annotation of three complete genomes using the ab initio methods of gene identification GeneScan and GLIMMER. The annotation given in GenBank, the standard against which these are compared, has been made using GeneMark. We find a number of novel genes which are predicted by both methods used here, as well as a number of genes that are predicted by GeneMark, but are not identified by either of the nonconsensus methods that we have used. The three organisms studied here are all prokaryotic species with fairly compact genomes. The Fourier measure forms the basis for an efficient non-consensus method for gene prediction, and the algorithm GeneScan exploits this measure. We have bench-marked this program as well as GLIMMER using 3 complete prokaryotic genomes. An effort has also been made to study the limitations of these techniques for complete genome analysis. GeneScan and GLIMMER are of comparable accuracy insofar as gene-identification is concerned, with sensitivities and specificities typically greater than 0.9. The number of false predictions (both positive and negative) is higher for GeneScan as compared to GLIMMER, but in a significant number of cases, similar results are provided by the two techniques. This suggests that there could be some as-yet unidentified additional genes in these three genomes, and also that some of the putative identifications made hitherto might require re-evaluation. All these cases are discussed in detail.
Collapse
Affiliation(s)
- Gautam Aggarwal
- School of Physical Sciences, Jawaharlal Nehru University, New Delhi 110 067, India
| | | |
Collapse
|
42
|
Stoilov P, Castren E, Stamm S. Analysis of the human TrkB gene genomic organization reveals novel TrkB isoforms, unusual gene length, and splicing mechanism. Biochem Biophys Res Commun 2002; 290:1054-65. [PMID: 11798182 DOI: 10.1006/bbrc.2001.6301] [Citation(s) in RCA: 129] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We determined the gene structure of the human TrkB gene. The gene is unusually large and spans at least 590 kbp. It contains 24 exons. Using alternative promoters, splicing, and polyadenylation sites, the gene can create at least 100 isoforms, that can encode 10 proteins. RT-PCR and Northern blot analysis reveals that only three major protein isoforms are generated by the gene: the full length receptor, an isoform lacking the tyrosine kinase domain, and a novel isoform lacking the tyrosine kinase domain but containing a Shc binding site. This novel isoform, TrkB-T-Shc is generated by the use of a new alternative exon 19. It is expressed only in brain. TrkB-T-Shc protein is located in the plasma membrane. Coimmunoprecipitation experiments show that TrkB-T-Shc is not phosphorylated by the full length receptor, indicating that it could be a negative regulator of TrkB signaling in the brain.
Collapse
Affiliation(s)
- Peter Stoilov
- Friedrich-Alexander-University Erlangen, Institute for Biochemistry, Fahrstrasse 17, Erlangen, 91054, Germany
| | | | | |
Collapse
|
43
|
Sonnenburg S, Rätsch G, Jagota A, Müller KR. New Methods for Splice Site Recognition. ARTIFICIAL NEURAL NETWORKS — ICANN 2002 2002. [DOI: 10.1007/3-540-46084-5_54] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
44
|
Wyder S, Tschannen A, Hochuli A, Gruber A, Saladin V, Zumbach S, Lanzrein B. Characterization of Chelonus inanitus polydnavirus segments: sequences and analysis, excision site and demonstration of clustering. J Gen Virol 2002; 83:247-256. [PMID: 11752722 DOI: 10.1099/0022-1317-83-1-247] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Polydnaviruses (genera Ichnovirus and Bracovirus) have a segmented genome of circular double-stranded DNA molecules, replicate in the ovary of parasitic wasps and are essential for successful parasitism of the host. Here we show the first detailed analysis of various segments of a bracovirus, the Chelonus inanitus virus (CiV). Four segments were sequenced and two of them, CiV12 and CiV14, were found to be closely related while CiV14.5 and CiV16.8 were unrelated. CiV12, CiV14.5 and CiV16.8 are unique while CiV14 occurs also nested in another larger segment. All four segments are predicted to contain genes and predictions could be substantiated in most cases. Comparison with databases revealed no significant similarities at either the nucleotide or amino acid level. Inverted repeats with identities between 77% and 92% and lengths between 26 bp and 100 bp were found on all segments outside of predicted genes. Hybridization experiments indicate that CiV12 and CiV14 are both flanked by other virus segments, suggesting that proviral CiV segments are clustered in the genome of the wasp. The integration/excision site of CiV14 was analysed and compared to that of CiV12. On both termini of proviral CiV12 and CiV14 as well as in the excised circular molecule and the rejoined DNA a very similar repeat of 14 bp was found. A model to illustrate where the terminal repeats might recombine to yield the circular molecule is presented. Excision of CiV12 and CiV14 is restricted to the female and sets in at a very specific time-point in pupal-adult development.
Collapse
Affiliation(s)
- Stefan Wyder
- Institute of Cell Biology, University of Berne, Baltzerstrasse 4, CH-3012 Bern, Switzerland1
| | - Adrian Tschannen
- Institute of Cell Biology, University of Berne, Baltzerstrasse 4, CH-3012 Bern, Switzerland1
| | - Anita Hochuli
- Institute of Cell Biology, University of Berne, Baltzerstrasse 4, CH-3012 Bern, Switzerland1
| | - Andreas Gruber
- Institute of Cell Biology, University of Berne, Baltzerstrasse 4, CH-3012 Bern, Switzerland1
| | - Verena Saladin
- Institute of Cell Biology, University of Berne, Baltzerstrasse 4, CH-3012 Bern, Switzerland1
| | - Sonja Zumbach
- Institute of Cell Biology, University of Berne, Baltzerstrasse 4, CH-3012 Bern, Switzerland1
| | - Beatrice Lanzrein
- Institute of Cell Biology, University of Berne, Baltzerstrasse 4, CH-3012 Bern, Switzerland1
| |
Collapse
|
45
|
Zheng X, Chung D, Takayama TK, Majerus EM, Sadler JE, Fujikawa K. Structure of von Willebrand factor-cleaving protease (ADAMTS13), a metalloprotease involved in thrombotic thrombocytopenic purpura. J Biol Chem 2001; 276:41059-63. [PMID: 11557746 DOI: 10.1074/jbc.c100515200] [Citation(s) in RCA: 565] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Thrombotic thrombocytopenic purpura is associated with acquired or congenital deficiency of a plasma von Willebrand factor-cleaving protease (VWFCP). Based on partial amino acid sequence, VWFCP was identified recently as a new member of the ADAMTS family of metalloproteases and designated ADAMTS13. The 4.6-kilobase pair cDNA sequence for VWFCP has now been determined. By Northern blotting, full-length VWFCP mRNA was detected only in liver. VWFCP consists of 1427 amino acid residues and has a signal peptide, a short propeptide terminating in the sequence RQRR, a reprolysin-like metalloprotease domain, a disintegrin-like domain, a thrombospondin-1 repeat, a Cys-rich domain, an ADAMTS spacer, seven additional thrombospondin-1 repeats, and two CUB domains. VWFCP apparently is made as a zymogen that requires proteolytic activation, possibly by furin intracellularly. Sites for Zn(2+) and Ca(2+) ions are conserved in the protease domain. The Cys-rich domain contains an RGDS sequence that could mediate integrin-dependent binding to platelets or other cells. Alternative splicing gives rise to at least seven potential variants that truncate the protein at different positions after the protease domain. Alternative splicing may have functional significance, producing proteins with distinct abilities to interact with cofactors, connective tissue, platelets, and von Willebrand factor.
Collapse
Affiliation(s)
- X Zheng
- Department of Pathology, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | | | | | | | | | | |
Collapse
|
46
|
Yeh RF, Lim LP, Burge CB. Computational inference of homologous gene structures in the human genome. Genome Res 2001; 11:803-16. [PMID: 11337476 PMCID: PMC311055 DOI: 10.1101/gr.175701] [Citation(s) in RCA: 264] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
With the human genome sequence approaching completion, a major challenge is to identify the locations and encoded protein sequences of all human genes. To address this problem we have developed a new gene identification algorithm, GenomeScan, which combines exon-intron and splice signal models with similarity to known protein sequences in an integrated model. Extensive testing shows that GenomeScan can accurately identify the exon-intron structures of genes in finished or draft human genome sequence with a low rate of false-positives. Application of GenomeScan to 2.7 billion bases of human genomic DNA identified at least 20,000-25,000 human genes out of an estimated 30,000-40,000 present in the genome. The results show an accurate and efficient automated approach for identifying genes in higher eukaryotic genomes and provide a first-level annotation of the draft human genome.
Collapse
Affiliation(s)
- R F Yeh
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | | | | |
Collapse
|
47
|
Harrington JJ, Sherf B, Rundlett S, Jackson PD, Perry R, Cain S, Leventhal C, Thornton M, Ramachandran R, Whittington J, Lerner L, Costanzo D, McElligott K, Boozer S, Mays R, Smith E, Veloso N, Klika A, Hess J, Cothren K, Lo K, Offenbacher J, Danzig J, Ducar M. Creation of genome-wide protein expression libraries using random activation of gene expression. Nat Biotechnol 2001; 19:440-5. [PMID: 11329013 DOI: 10.1038/88107] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Here we report the use of random activation of gene expression (RAGE) to create genome-wide protein expression libraries. RAGE libraries containing only 5 x 10(6) individual clones were found to express every gene tested, including genes that are normally silent in the parent cell line. Furthermore, endogenous genes were activated at similar frequencies and expressed at similar levels within RAGE libraries created from multiple human cell lines, demonstrating that RAGE libraries are inherently normalized. Pools of RAGE clones were used to isolate 19,547 human gene clusters, approximately 53% of which were novel when tested against public databases of expressed sequence tag (EST) and complementary DNA (cDNA). Isolation of individual clones confirmed that the activated endogenous genes can be expressed at high levels to produce biologically active proteins. The properties of RAGE libraries and RAGE expression clones are well suited for a number of biotechnological applications including gene discovery, protein characterization, drug development, and protein manufacturing.
Collapse
Affiliation(s)
- J J Harrington
- Athersys, Inc., 3201 Carnegie Ave., Cleveland, OH 44115, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Abstract
Since the structure of the DNA molecule was identified half a century ago, the complete genome sequence has been determined for 37 prokaryotes and several eukaryotes. With the exponential growth of genetic information, bioinformatics has attempted to predict gene locations and functions in cyberspace prior to experimental confirmation at the bench.
Collapse
Affiliation(s)
- Y Cho
- Stanford Genome Technology Center, 855 California Avenue, Palo Alto, CA 94304-1103, USA.
| | | |
Collapse
|
49
|
Reiter JL, Threadgill DW, Eley GD, Strunk KE, Danielsen AJ, Sinclair CS, Pearsall RS, Green PJ, Yee D, Lampland AL, Balasubramaniam S, Crossley TD, Magnuson TR, James CD, Maihle NJ. Comparative genomic sequence analysis and isolation of human and mouse alternative EGFR transcripts encoding truncated receptor isoforms. Genomics 2001; 71:1-20. [PMID: 11161793 DOI: 10.1006/geno.2000.6341] [Citation(s) in RCA: 82] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
This study presents the annotated genomic sequence and exon-intron organization of the human and mouse epidermal growth factor receptor (EGFR) genes located on chromosomes 7p11.2 and 11, respectively. We report that the EGFR gene spans nearly 200 kb and that the full-length 170-kDa EGFR is encoded by 28 exons. In addition, we have identified two human and two mouse alternative EGFR transcripts of 2.4-3.0 kb using both computational and experimental methods. The human 3.0-kb and mouse 2.8-kb EGFR mRNAs are predominantly expressed in placenta and liver, respectively, and both transcripts encode 110-kDa truncated receptor isoforms containing only the extracellular ligand-binding domain. We also have demonstrated that the aberrant 2.8-kb EGFR transcript produced by the human A431 carcinoma cell line is generated by splicing to a recombinant 3'-terminal exon located in EGFR intron 16, which apparently was formed as a result of a chromosomal translocation. Finally, we have shown that the human, mouse, rat, and chicken 1.8- to 3.0-kb alternative EGFR transcripts are generated by distinct splicing mechanisms and that each of these mRNAs contains unique 3' sequences that are not evolutionarily conserved. The presence of truncated receptor isoforms in diverse species suggests that these proteins may have important functional roles in regulating EGFR activity.
Collapse
MESH Headings
- Alternative Splicing
- Amino Acid Sequence
- Animals
- Base Sequence
- Chickens
- Chromosomes, Human, Pair 7
- Cloning, Molecular
- DNA, Complementary/metabolism
- ErbB Receptors/biosynthesis
- ErbB Receptors/chemistry
- ErbB Receptors/genetics
- Evolution, Molecular
- Exons
- Expressed Sequence Tags
- Gene Library
- Genome
- Humans
- Introns
- Ligands
- Liver/metabolism
- Mice
- Models, Genetic
- Molecular Sequence Data
- Placenta/metabolism
- Protein Isoforms
- Protein Structure, Tertiary
- RNA, Messenger/metabolism
- Rats
- Reverse Transcriptase Polymerase Chain Reaction
- Sequence Analysis, DNA
- Sequence Homology, Nucleic Acid
- Tissue Distribution
- Transfection
- Translocation, Genetic
- Tumor Cells, Cultured
Collapse
Affiliation(s)
- J L Reiter
- Tumor Biology Program, Mayo Clinic, Rochester, Minnesota 55905, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Webb KJ, Skøt L, Nicholson MN, Jørgensen B, Mizen S. Mesorhizobium loti increases root-specific expression of a calcium-binding protein homologue identified by promoter tagging in Lotus japonicus. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2000; 13:606-16. [PMID: 10830260 DOI: 10.1094/mpmi.2000.13.6.606] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
A promoter tagging program in the legume Lotus japonicus was initiated to identify plant genes involved in the nitrogen-fixing symbiosis between legumes and rhizobia. Seven transformed plant lines expressing the promoterless reporter gene uidA (beta-glucuronidase; GUS) specifically in roots and/or nodules were identified. Four of these expressed GUS in the roots only after inoculation with nodule-forming Mesorhizobium loti. In one line (T90), GUS activity was found in the root epidermis, including root hairs. During seedling growth, GUS expression gradually became focused in developing nodules and disappeared from root tissue. No GUS activity was detected when a non-nodulating mutant of M. loti was used to inoculate the plants. The T-DNA insertion in this plant line was located 1.3 kb upstream of a putative coding sequence with strong homology to calcium-binding proteins. Four motifs were identified, which were very similar to the "EF hands" in calmodulin-related proteins, each binding one Ca2+. We have named the gene LjCbp1 (calcium-binding protein). Northern (RNA) analyses showed that this gene is expressed specifically in roots of L. japonicus. Expression was reduced in roots inoculated with non-nodulating M. loti mutants and in progeny homozygous for the T-DNA insertion, suggesting a link between the T-DNA insertion and this gene.
Collapse
Affiliation(s)
- K J Webb
- Institute of Grassland and Environmental Research, Plas Gogerddan, Ceredigion, Wales, UK
| | | | | | | | | |
Collapse
|