1
|
Commisso M, Guarino F, Marchi L, Muto A, Piro A, Degola F. Bryo-Activities: A Review on How Bryophytes Are Contributing to the Arsenal of Natural Bioactive Compounds against Fungi. PLANTS (BASEL, SWITZERLAND) 2021; 10:203. [PMID: 33494524 PMCID: PMC7911284 DOI: 10.3390/plants10020203] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 01/12/2021] [Accepted: 01/15/2021] [Indexed: 01/05/2023]
Abstract
Usually regarded as less evolved than their more recently diverged vascular sisters, which currently dominate vegetation landscape, bryophytes seem having nothing to envy to the defensive arsenal of other plants, since they had acquired a suite of chemical traits that allowed them to adapt and persist on land. In fact, these closest modern relatives of the ancestors to the earliest terrestrial plants proved to be marvelous chemists, as they traditionally were a popular remedy among tribal people all over the world, that exploit their pharmacological properties to cure the most different diseases. The phytochemistry of bryophytes exhibits a stunning assortment of biologically active compounds such as lipids, proteins, steroids, organic acids, alcohols, aliphatic and aromatic compounds, polyphenols, terpenoids, acetogenins and phenylquinones, thus it is not surprising that substances obtained from various species belonging to such ancestral plants are widely employed as antitumor, antipyretic, insecticidal and antimicrobial. This review explores in particular the antifungal potential of the three Bryophyta divisions-mosses (Musci), hornworts (Anthocerotae) and liverworts (Hepaticae)-to be used as a sources of interesting bioactive constituents for both pharmaceutical and agricultural areas, providing an updated overview of the latest relevant insights.
Collapse
Affiliation(s)
- Mauro Commisso
- Department of Biotechnology, University of Verona, Cà Vignal 1, Strada Le Grazie 15, 37134 Verona (VR), Italy;
| | - Francesco Guarino
- Department of Chemistry and Biology, University of Salerno, Via Giovanni Paolo II 132, 84084 Fisciano (SA), Italy;
| | - Laura Marchi
- Department of Medicine and Surgery, Respiratory Disease and Lung Function Unit, University of Parma, Via Gramsci 14, 43125 Parma (PR), Italy;
| | - Antonella Muto
- Department of Biology, Ecology and Earth Sciences, University of Calabria, Via Ponte P. Bucci 6b, Arcavacata di Rende, 87036 Cosenza (CS), Italy;
| | - Amalia Piro
- Laboratory of Plant Biology and Plant Proteomics (Lab.Bio.Pro.Ve), Department of Chemistry and Chemical Technologies, University of Calabria, Ponte P. Bucci 12 C, Arcavacata di Rende, 87036 Cosenza (CS), Italy;
| | - Francesca Degola
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parco delle Scienze 11/A, 43124 Parma (PR), Italy
| |
Collapse
|
2
|
Bian Q, Chen J, Qiu W, Peng C, Song M, Sun X, Liu Y, Ding F, Chen J, Zhang L. Four targeted genes for predicting the prognosis of colorectal cancer: A bioinformatics analysis case. Oncol Lett 2019; 18:5043-5054. [PMID: 31612015 PMCID: PMC6781647 DOI: 10.3892/ol.2019.10866] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2019] [Accepted: 08/13/2019] [Indexed: 12/12/2022] Open
Abstract
The molecular mechanisms underlying the development and progression of colorectal cancer (CRC) have not been clarified. The purpose of the present study was to identify key genes that may serve as novel therapeutic targets or prognostic predictors in patients with CRC using bioinformatics analysis. Four gene expression datasets were downloaded from the Gene Expression Omnibus database, which revealed 19 upregulated and 34 downregulated differentially expressed genes (DEGs). The downregulated DEGs were significantly enriched in eight pathways according to Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis. A protein-protein interaction network was constructed with 52 DEGs and 458 edges. Ten key genes were identified according to the degree value, betweenness centrality and closeness centrality. Survival analysis revealed that low expression of four of the ten genes, carcinoembryonic antigen related cell adhesion molecule 7 (CEACAM7), solute carrier family 4 member 4 (SLC4A4), glucagon (GCG) and chloride channel accessory 1 (CLCA1) genes, were associated with unfavorable prognosis in CRC. Furthermore, gene set enrichment analysis revealed that two pathways were significantly enriched in the CEACAM7 low-expression group. Thus, CEACAM7, SLC4A4, GCG and CLCA1 may be prognostic markers or therapeutic targets of CRC. Low CEACAM7 expression may be associated with the activation of glycosaminoglycan biosynthesis-chondroitin sulfate and extracellular matrix receptor interaction pathways and may affect the prognosis of CRC.
Collapse
Affiliation(s)
- Qinglai Bian
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, P.R. China
| | - Jiaxu Chen
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, P.R. China.,Formula-Pattern Research Center, School of Traditional Chinese Medicine, Jinan University, Guangzhou, Guangdong 510632, P.R. China
| | - Wenqi Qiu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, P.R. China
| | - Chenxi Peng
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, P.R. China
| | - Meifang Song
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, P.R. China
| | - Xuebin Sun
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, P.R. China
| | - Yueyun Liu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, P.R. China
| | - Fengmin Ding
- School of Basic Medical Science, Hubei University of Chinese Medicine, Wuhan, Hubei 430065, P.R. China
| | - Jianbei Chen
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, P.R. China
| | - Liqing Zhang
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24060, USA
| |
Collapse
|
3
|
Lamond AI, Mann M. Cell biology and the genome projects a concerted strategy for characterizing multiprotein complexes by using mass spectrometry. Trends Cell Biol 2012; 7:139-42. [PMID: 17708925 DOI: 10.1016/s0962-8924(97)01031-3] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
New developments in biological mass spectrometry and the growth of sequence databases are revolutionizing protein analysis. As recent results demonstrate, the high sensitivity and high throughput of this approach allow the rapid characterization of gel-separated proteins and the identification of cognate cDNA and expressed-sequence tag (EST) clones. We advocate here exploiting this new technology for the systematic characterization of multiprotein complexes. The analysis of proteins as components of specific complexes provides an immediate link to biological function and is a powerful method for deciphering the functions of open reading frames uncovered in the genome-sequencing projects.
Collapse
Affiliation(s)
- A I Lamond
- the Dept of Biochemistry, University of Dundee, Dundee, UK DD1 4HN
| | | |
Collapse
|
4
|
Abstract
The science of genomes: only within the past few decades have scientists progressed from the analysis of a single or a small number of genes at once to the investigation of thousands of genes, going from the study of the units of inheritance to the investigation of the whole genome of an organism. The science of the genomes, or "genomics," initially dedicated to the determination of DNA sequences (the nucleotide order on a given fragment of DNA), has promptly expanded toward a more functional level--studying the expression profiles and the roles of both genes and proteins. The aim of the chapter is to review some basic assumptions and definitions that are the fabric of genomics, and to elucidate key concepts and approaches on which genomics rely.
Collapse
Affiliation(s)
- Luca Del Giacco
- Division of Functional and Reproductive Biology, Department of Biology, University of Milan, Milan-MI, Italy.
| | | |
Collapse
|
5
|
Xing XB, Li QR, Sun H, Fu X, Zhan F, Huang X, Li J, Chen CL, Shyr Y, Zeng R, Li YX, Xie L. The discovery of novel protein-coding features in mouse genome based on mass spectrometry data. Genomics 2011; 98:343-51. [PMID: 21840390 DOI: 10.1016/j.ygeno.2011.07.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2011] [Revised: 07/18/2011] [Accepted: 07/23/2011] [Indexed: 01/07/2023]
Abstract
Identifying protein-coding genes in eukaryotic genomes remains a challenge in post-genome era due to the complex gene models. We applied a proteogenomics strategy to detect un-annotated protein-coding regions in mouse genome. High-accuracy tandem mass spectrometry (MS/MS) data from diverse mouse samples were generated by LTQ-Orbitrap mass spectrometer in house. Two searchable diagnostic proteomic datasets were constructed, one with all possible encoding exon junctions, and the other with all putative encoding exons, for the discovery of novel exon splicing events and novel uninterrupted protein-coding regions. Altogether 29,586 unique peptides were identified. Aligning backwards to the mouse genome, the translation of 4471 annotated genes was validated by the known peptides; and 172 genic events were defined in mouse genome by the novel peptides. The approach in the current work can provide substantial evidences for eukaryote genome annotation in encoding genes.
Collapse
Affiliation(s)
- Xiao-Bin Xing
- Shanghai Center for Bioinformation Technology, PR China
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Abstract
Due to alternative splicing events in eukaryotic species, the identification of mRNA isoforms (or splicing variants) is a difficult problem. Traditional experimental methods for this purpose are time consuming and cost ineffective. The emerging RNA-Seq technology provides a possible effective method to address this problem. Although the advantages of RNA-Seq over traditional methods in transcriptome analysis have been confirmed by many studies, the inference of isoforms from millions of short sequence reads (e.g., Illumina/Solexa reads) has remained computationally challenging. In this work, we propose a method to calculate the expression levels of isoforms and infer isoforms from short RNA-Seq reads using exon-intron boundary, transcription start site (TSS) and poly-A site (PAS) information. We first formulate the relationship among exons, isoforms, and single-end reads as a convex quadratic program, and then use an efficient algorithm (called IsoInfer) to search for isoforms. IsoInfer can calculate the expression levels of isoforms accurately if all the isoforms are known and infer novel isoforms from scratch. Our experimental tests on known mouse isoforms with both simulated expression levels and reads demonstrate that IsoInfer is able to calculate the expression levels of isoforms with an accuracy comparable to the state-of-the-art statistical method and a 60 times faster speed. Moreover, our tests on both simulated and real reads show that it achieves a good precision and sensitivity in inferring isoforms when given accurate exon-intron boundary, TSS, and PAS information, especially for isoforms whose expression levels are significantly high. The software is publicly available for free at http://www.cs.ucr.edu/∼jianxing/IsoInfer.html.
Collapse
Affiliation(s)
- Jianxing Feng
- School of Life Sciences and Technology, Tongji University, China.
| | | | | |
Collapse
|
7
|
Feng J, Li W, Jiang T. Inference of Isoforms from Short Sequence Reads. LECTURE NOTES IN COMPUTER SCIENCE 2010. [DOI: 10.1007/978-3-642-12683-3_10] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
8
|
Abstract
Originally established in the beginning of the 1990s as a direct route to gene finding, expressed sequence tags (ESTs) still lend themselves as a means to analyze gene expression in almost all human tissues. The type of questions that can be addressed using public EST libraries ranges from tissue-specific gene profiling to the comparison between tissues in diseased and healthy states. Thanks to a multitude of web-based online bioinformatics resources, mining in EST libraries is not restricted to experts in the field of data analysis, but can readily be performed by the medical or life scientist. In this chapter, a couple of cases studies are presented that guide the scientist to the most useful online resources so that they can conduct their own research.
Collapse
Affiliation(s)
- Armin O Schmitt
- Institute for Animal Sciences, Humboldt-Universität zu Berlin, Berlin, Germany
| |
Collapse
|
9
|
Morozova O, Hirst M, Marra MA. Applications of new sequencing technologies for transcriptome analysis. Annu Rev Genomics Hum Genet 2009; 10:135-51. [PMID: 19715439 DOI: 10.1146/annurev-genom-082908-145957] [Citation(s) in RCA: 348] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Transcriptome analysis has been a key area of biological inquiry for decades. Over the years, research in the field has progressed from candidate gene-based detection of RNAs using Northern blotting to high-throughput expression profiling driven by the advent of microarrays. Next-generation sequencing technologies have revolutionized transcriptomics by providing opportunities for multidimensional examinations of cellular transcriptomes in which high-throughput expression data are obtained at a single-base resolution.
Collapse
Affiliation(s)
- Olena Morozova
- BC Cancer Agency, Genome Sciences Center, Vancouver, BC V5Z 4S6, Canada.
| | | | | |
Collapse
|
10
|
Scheibye-Alsing K, Hoffmann S, Frankel A, Jensen P, Stadler PF, Mang Y, Tommerup N, Gilchrist MJ, Nygård AB, Cirera S, Jørgensen CB, Fredholm M, Gorodkin J. Sequence assembly. Comput Biol Chem 2008; 33:121-36. [PMID: 19152793 DOI: 10.1016/j.compbiolchem.2008.11.003] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2008] [Revised: 11/28/2008] [Accepted: 11/28/2008] [Indexed: 01/20/2023]
Abstract
Despite the rapidly increasing number of sequenced and re-sequenced genomes, many issues regarding the computational assembly of large-scale sequencing data have remain unresolved. Computational assembly is crucial in large genome projects as well for the evolving high-throughput technologies and plays an important role in processing the information generated by these methods. Here, we provide a comprehensive overview of the current publicly available sequence assembly programs. We describe the basic principles of computational assembly along with the main concerns, such as repetitive sequences in genomic DNA, highly expressed genes and alternative transcripts in EST sequences. We summarize existing comparisons of different assemblers and provide a detailed descriptions and directions for download of assembly programs at: http://genome.ku.dk/resources/assembly/methods.html.
Collapse
Affiliation(s)
- K Scheibye-Alsing
- Division of Genetics and Bioinformatics, IBHV, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Abstract
GenBank(R) is a comprehensive database of publicly available DNA sequences for more than 205,000 named organisms and for more than 60,000 within the embryophyta, obtained through submissions from individual laboratories and batch submissions from large-scale sequencing projects. Daily data exchange with the European Molecular Biology Laboratory (EMBL) in Europe and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the National Center for Biotechnology Information (NCBI) retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases with taxonomy, genome, mapping, protein structure, and domain information and the biomedical journal literature through PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available through FTP. GenBank usage scenarios ranging from local analyses of the data available through FTP to online analyses supported by the NCBI Web-based tools are discussed. To access GenBank and its related retrieval and analysis services, go to the NCBI Homepage at http://www.ncbi.nlm.nih.gov.
Collapse
Affiliation(s)
- David Wheeler
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
12
|
Gertz EM, Yu YK, Agarwala R, Schäffer AA, Altschul SF. Composition-based statistics and translated nucleotide searches: improving the TBLASTN module of BLAST. BMC Biol 2006; 4:41. [PMID: 17156431 PMCID: PMC1779365 DOI: 10.1186/1741-7007-4-41] [Citation(s) in RCA: 388] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2006] [Accepted: 12/07/2006] [Indexed: 11/29/2022] Open
Abstract
Background TBLASTN is a mode of operation for BLAST that aligns protein sequences to a nucleotide database translated in all six frames. We present the first description of the modern implementation of TBLASTN, focusing on new techniques that were used to implement composition-based statistics for translated nucleotide searches. Composition-based statistics use the composition of the sequences being aligned to generate more accurate E-values, which allows for a more accurate distinction between true and false matches. Until recently, composition-based statistics were available only for protein-protein searches. They are now available as a command line option for recent versions of TBLASTN and as an option for TBLASTN on the NCBI BLAST web server. Results We evaluate the statistical and retrieval accuracy of the E-values reported by a baseline version of TBLASTN and by two variants that use different types of composition-based statistics. To test the statistical accuracy of TBLASTN, we ran 1000 searches using scrambled proteins from the mouse genome and a database of human chromosomes. To test retrieval accuracy, we modernize and adapt to translated searches a test set previously used to evaluate the retrieval accuracy of protein-protein searches. We show that composition-based statistics greatly improve the statistical accuracy of TBLASTN, at a small cost to the retrieval accuracy. Conclusion TBLASTN is widely used, as it is common to wish to compare proteins to chromosomes or to libraries of mRNAs. Composition-based statistics improve the statistical accuracy, and therefore the reliability, of TBLASTN results. The algorithms used by TBLASTN are not widely known, and some of the most important are reported here. The data used to test TBLASTN are available for download and may be useful in other studies of translated search algorithms.
Collapse
Affiliation(s)
- E Michael Gertz
- National Center for Biotechnology Information, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, USA
| | - Yi-Kuo Yu
- National Center for Biotechnology Information, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, USA
| | - Richa Agarwala
- National Center for Biotechnology Information, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, USA
| | - Alejandro A Schäffer
- National Center for Biotechnology Information, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, USA
| | - Stephen F Altschul
- National Center for Biotechnology Information, National Institutes of Health, Department of Health and Human Services, Bethesda, MD, USA
| |
Collapse
|
13
|
Yang RZ, Lee MJ, Hu H, Pray J, Wu HB, Hansen BC, Shuldiner AR, Fried SK, McLenithan JC, Gong DW. Identification of omentin as a novel depot-specific adipokine in human adipose tissue: possible role in modulating insulin action. Am J Physiol Endocrinol Metab 2006; 290:E1253-61. [PMID: 16531507 DOI: 10.1152/ajpendo.00572.2004] [Citation(s) in RCA: 616] [Impact Index Per Article: 32.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Central (visceral) obesity is more closely associated with insulin resistance, type 2 diabetes, and cardiovascular disease than is peripheral [subcutaneous (sc)] obesity, but the underlying mechanism for this pathophysiological difference is largely unknown. To understand the molecular basis of this difference, we sequenced 10,437 expressed sequence tags (ESTs) from a human omental fat cDNA library and discovered a novel visceral fat depot-specific secretory protein, which we have named omentin. Omentin ESTs were more abundant than many known adipose genes, such as perilipin, adiponectin, and leptin in the cDNA library. Protein sequence analysis indicated that omentin mRNA encodes a peptide of 313 amino acids, containing a secretory signal sequence and a fibrinogen-related domain. Northern analysis demonstrated that omentin mRNA was predominantly expressed in visceral adipose tissue and was barely detectable in sc fat depots in humans and rhesus monkeys. Quantative real-time PCR showed that omentin mRNA was expressed in stromal vascular cells, but not fat cells, isolated from omental adipose tissue, with >150-fold less in sc cell fractions. Accordingly, omentin protein was secreted into the culture medium of omental, but not sc, fat explants. Omentin was detectable in human serum by Western blot analysis. Addition of recombinant omentin in vitro did not affect basal but enhanced insulin-stimulated glucose uptake in both sc (47%, n = 9, P = 0.003) and omental (approximately 30%, n = 3, P < 0.05) human adipocytes. Omentin increased Akt phosphorylation in the absence and presence of insulin. In conclusion, omentin is a new adipokine that is expressed in omental adipose tissue in humans and may regulate insulin action.
Collapse
Affiliation(s)
- Rong-Ze Yang
- Division of Endocrinology, Diabetes, and Nutrition, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Holste D, Huo G, Tung V, Burge CB. HOLLYWOOD: a comparative relational database of alternative splicing. Nucleic Acids Res 2006; 34:D56-62. [PMID: 16381932 PMCID: PMC1347411 DOI: 10.1093/nar/gkj048] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2005] [Revised: 09/26/2005] [Accepted: 10/04/2005] [Indexed: 01/05/2023] Open
Abstract
RNA splicing is an essential step in gene expression, and is often variable, giving rise to multiple alternatively spliced mRNA and protein isoforms from a single gene locus. The design of effective databases to support experimental and computational investigations of alternative splicing (AS) is a significant challenge. In an effort to integrate accurate exon and splice site annotation with current knowledge about splicing regulatory elements and predicted AS events, and to link information about the splicing of orthologous genes in different species, we have developed the Hollywood system. This database was built upon genomic annotation of splicing patterns of known genes derived from spliced alignment of complementary DNAs (cDNAs) and expressed sequence tags, and links features such as splice site sequence and strength, exonic splicing enhancers and silencers, conserved and non-conserved patterns of splicing, and cDNA library information for inferred alternative exons. Hollywood was implemented as a relational database and currently contains comprehensive information for human and mouse. It is accompanied by a web query tool that allows searches for sets of exons with specific splicing characteristics or splicing regulatory element composition, or gives a graphical or sequence-level summary of splicing patterns for a specific gene. A streamlined graphical representation of gene splicing patterns is provided, and these patterns can alternatively be layered onto existing information in the UCSC Genome Browser. The database is accessible at http://hollywood.mit.edu.
Collapse
Affiliation(s)
- Dirk Holste
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02319, USA.
| | | | | | | |
Collapse
|
15
|
Fernandes JMO, Mackenzie MG, Elgar G, Suzuki Y, Watabe S, Kinghorn JR, Johnston IA. A genomic approach to reveal novel genes associated with myotube formation in the model teleost,Takifugu rubripes. Physiol Genomics 2005; 22:327-38. [PMID: 15928209 DOI: 10.1152/physiolgenomics.00087.2005] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Little is known about the transcriptional networks that regulate myotube production in vertebrates. In the present study, we have used a genomic approach to discover novel genes associated with myotube formation in fast muscle of the tiger puffer fish, Takifugu rubripes. The number of fast muscle fibers per myotome increased until 1.2 kg body mass, and subsequent growth was by fiber hypertrophy alone. Forward and reverse subtracted cDNA libraries were prepared from a 180-g (myotube +) and a 3.4-kg (myotube −) fish, and 1,452 expressed sequence tags (ESTs) were obtained. After these ESTs were grouped into nonredundant clusters and housekeeping and structural genes were eliminated, 57 genes were selected and quantitative PCR was used to investigate their expression levels in different tissues from independent groups of myotube(−) and myotube(+) fish acclimated to the same environmental conditions and diet. Eleven novel genes were found to be consistently differentially expressed, but only four showed appropriate tissue-specific expression. These four genes were upregulated 5–25 times in fast muscle of myotube(−) relative to myotube(+) growth stages, while their expression remained unchanged in the other tissues studied. The novel genes identified, which are also present in other vertebrate genomes, may play a role in inhibiting myotube formation in vertebrate muscle.
Collapse
Affiliation(s)
- Jorge M O Fernandes
- Gatty Marine Laboratory, School of Biology, University of St. Andrews, St. Andrews, Fife, United Kingdom
| | | | | | | | | | | | | |
Collapse
|
16
|
Elizabeth Cha I, Rouchka EC. Comparison of Current BLAST Software on Nucleotide Sequences. PROCEEDINGS. IPDPS (CONFERENCE) 2005; 19:8. [PMID: 21243090 PMCID: PMC3021256 DOI: 10.1109/ipdps.2005.145] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
The computational power needed for searching exponentially growing databases, such as GenBank, has increased dramatically. Three different implementations of the most widely used sequence alignment tool, known as BLAST (Basic Local Alignment Search Tool), are studied for their efficiency on nucleotide-nucleotide comparisons. The performance of these implementations are evaluated using target databases and query sequences of varying lengths and number of entries constructed from human genomic and EST sequences. In general, WU BLAST was found to be most efficient when the database and query composition are unknown. NCBI BLAST appears to work best when the database contains a small number of sequences, while mpiBLAST shows the power of database distribution when the number of bases per target database is large. The optimal number of compute nodes in mpiBLAST varies depending upon the database, yet in the cases studied, remains surprisingly low.
Collapse
Affiliation(s)
- I. Elizabeth Cha
- University of Louisville Department of Computer Engineering and Computer Science, Louisville, KY 40292,
| | - Eric C. Rouchka
- University of Louisville Department of Computer Engineering and Computer Science, Louisville, KY 40292,
| |
Collapse
|
17
|
Nair KS, Jaleel A, Asmann YW, Short KR, Raghavakaimal S. Proteomic research: potential opportunities for clinical and physiological investigators. Am J Physiol Endocrinol Metab 2004; 286:E863-74. [PMID: 15140753 DOI: 10.1152/ajpendo.00370.2003] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Proteomics is the comprehensive and systematic study of proteins, which are functional molecules. Although proteins are products of gene expression, there are more proteins than genes due to the posttranslational modifications of proteins, making the study of proteins difficult. Protein expression is tissue specific, and its function is modulated by variety of factors, including other proteins, phosphates, sulfates, carbohydrates, and lipids, as well as other metabolites. Because of the dynamic nature of protein expression and posttranslational modifications, identification and quantification of proteins alone are not sufficient to understand functional changes. Emerging technologies will allow investigators to perform a combination of metabolic labeling and identification as well as quantification and measurement of the synthesis rates of a large number of proteins in a tissue. This offers the opportunity to better understand the regulation of tissue functions. Rapid advances in mass spectrometry, protein purification techniques, isotope labeling of proteins, and bioinformatics are likely to improve our understanding of physiological states and altered functions in diseased states. Such mechanistic information will improve the ability to perform early diagnosis of tumors and other diseases and develop prognostic indexes and novel therapies.
Collapse
Affiliation(s)
- K Sreekumaran Nair
- Mayo Clinic School of Medicine, Endocrinology Research Unit, Joseph 5-194, 200 First St. SW, Rochester, MN 55905, USA.
| | | | | | | | | |
Collapse
|
18
|
Ota T, Suzuki Y, Nishikawa T, Otsuki T, Sugiyama T, Irie R, Wakamatsu A, Hayashi K, Sato H, Nagai K, Kimura K, Makita H, Sekine M, Obayashi M, Nishi T, Shibahara T, Tanaka T, Ishii S, Yamamoto JI, Saito K, Kawai Y, Isono Y, Nakamura Y, Nagahari K, Murakami K, Yasuda T, Iwayanagi T, Wagatsuma M, Shiratori A, Sudo H, Hosoiri T, Kaku Y, Kodaira H, Kondo H, Sugawara M, Takahashi M, Kanda K, Yokoi T, Furuya T, Kikkawa E, Omura Y, Abe K, Kamihara K, Katsuta N, Sato K, Tanikawa M, Yamazaki M, Ninomiya K, Ishibashi T, Yamashita H, Murakawa K, Fujimori K, Tanai H, Kimata M, Watanabe M, Hiraoka S, Chiba Y, Ishida S, Ono Y, Takiguchi S, Watanabe S, Yosida M, Hotuta T, Kusano J, Kanehori K, Takahashi-Fujii A, Hara H, Tanase TO, Nomura Y, Togiya S, Komai F, Hara R, Takeuchi K, Arita M, Imose N, Musashino K, Yuuki H, Oshima A, Sasaki N, Aotsuka S, Yoshikawa Y, Matsunawa H, Ichihara T, Shiohata N, Sano S, Moriya S, Momiyama H, Satoh N, Takami S, Terashima Y, Suzuki O, Nakagawa S, Senoh A, Mizoguchi H, Goto Y, Shimizu F, Wakebe H, Hishigaki H, Watanabe T, Sugiyama A, et alOta T, Suzuki Y, Nishikawa T, Otsuki T, Sugiyama T, Irie R, Wakamatsu A, Hayashi K, Sato H, Nagai K, Kimura K, Makita H, Sekine M, Obayashi M, Nishi T, Shibahara T, Tanaka T, Ishii S, Yamamoto JI, Saito K, Kawai Y, Isono Y, Nakamura Y, Nagahari K, Murakami K, Yasuda T, Iwayanagi T, Wagatsuma M, Shiratori A, Sudo H, Hosoiri T, Kaku Y, Kodaira H, Kondo H, Sugawara M, Takahashi M, Kanda K, Yokoi T, Furuya T, Kikkawa E, Omura Y, Abe K, Kamihara K, Katsuta N, Sato K, Tanikawa M, Yamazaki M, Ninomiya K, Ishibashi T, Yamashita H, Murakawa K, Fujimori K, Tanai H, Kimata M, Watanabe M, Hiraoka S, Chiba Y, Ishida S, Ono Y, Takiguchi S, Watanabe S, Yosida M, Hotuta T, Kusano J, Kanehori K, Takahashi-Fujii A, Hara H, Tanase TO, Nomura Y, Togiya S, Komai F, Hara R, Takeuchi K, Arita M, Imose N, Musashino K, Yuuki H, Oshima A, Sasaki N, Aotsuka S, Yoshikawa Y, Matsunawa H, Ichihara T, Shiohata N, Sano S, Moriya S, Momiyama H, Satoh N, Takami S, Terashima Y, Suzuki O, Nakagawa S, Senoh A, Mizoguchi H, Goto Y, Shimizu F, Wakebe H, Hishigaki H, Watanabe T, Sugiyama A, Takemoto M, Kawakami B, Yamazaki M, Watanabe K, Kumagai A, Itakura S, Fukuzumi Y, Fujimori Y, Komiyama M, Tashiro H, Tanigami A, Fujiwara T, Ono T, Yamada K, Fujii Y, Ozaki K, Hirao M, Ohmori Y, Kawabata A, Hikiji T, Kobatake N, Inagaki H, Ikema Y, Okamoto S, Okitani R, Kawakami T, Noguchi S, Itoh T, Shigeta K, Senba T, Matsumura K, Nakajima Y, Mizuno T, Morinaga M, Sasaki M, Togashi T, Oyama M, Hata H, Watanabe M, Komatsu T, Mizushima-Sugano J, Satoh T, Shirai Y, Takahashi Y, Nakagawa K, Okumura K, Nagase T, Nomura N, Kikuchi H, Masuho Y, Yamashita R, Nakai K, Yada T, Nakamura Y, Ohara O, Isogai T, Sugano S. Complete sequencing and characterization of 21,243 full-length human cDNAs. Nat Genet 2003; 36:40-5. [PMID: 14702039 DOI: 10.1038/ng1285] [Show More Authors] [Citation(s) in RCA: 699] [Impact Index Per Article: 31.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2003] [Accepted: 12/01/2003] [Indexed: 02/06/2023]
Abstract
As a base for human transcriptome and functional genomics, we created the "full-length long Japan" (FLJ) collection of sequenced human cDNAs. We determined the entire sequence of 21,243 selected clones and found that 14,490 cDNAs (10,897 clusters) were unique to the FLJ collection. About half of them (5,416) seemed to be protein-coding. Of those, 1,999 clusters had not been predicted by computational methods. The distribution of GC content of nonpredicted cDNAs had a peak at approximately 58% compared with a peak at approximately 42%for predicted cDNAs. Thus, there seems to be a slight bias against GC-rich transcripts in current gene prediction procedures. The rest of the cDNAs unique to the FLJ collection (5,481) contained no obvious open reading frames (ORFs) and thus are candidate noncoding RNAs. About one-fourth of them (1,378) showed a clear pattern of splicing. The distribution of GC content of noncoding cDNAs was narrow and had a peak at approximately 42%, relatively low compared with that of protein-coding cDNAs.
Collapse
Affiliation(s)
- Toshio Ota
- Helix Research Institute, 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Vannahme C, Gösling S, Paulsson M, Maurer P, Hartmann U. Characterization of SMOC-2, a modular extracellular calcium-binding protein. Biochem J 2003; 373:805-14. [PMID: 12741954 PMCID: PMC1223551 DOI: 10.1042/bj20030532] [Citation(s) in RCA: 96] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2003] [Accepted: 05/13/2003] [Indexed: 01/11/2023]
Abstract
We have isolated the novel gene SMOC-2, which encodes a secreted modular protein containing an EF-hand calcium-binding domain homologous to that in BM-40. It further consists of two thyroglobulin-like domains, a follistatin-like domain and a novel domain found only in the homologous SMOC-1. Phylogenetic analysis of the calcium-binding domain sequences showed that SMOC-1 and -2 form a separate group within the BM-40 family. The human and mouse SMOC-2 sequences are coded for by genes consisting of 13 exons located on chromosomes 6 and 17, respectively. Analysis of recombinantly expressed protein showed that SMOC-2 is a glycoprotein with a calcium-dependent conformation. Results from Northern blots and reverse transcription PCR revealed a widespread expression in many tissues.
Collapse
Affiliation(s)
- Christian Vannahme
- Center for Biochemistry, Medical Faculty, University of Cologne, Joseph-Stelzmann Strasse 52, D-50931 Cologne, Germany
| | | | | | | | | |
Collapse
|
20
|
Becker M, Nothwang HG, Friauf E. Differential expression pattern of chloride transporters NCC, NKCC2, KCC1, KCC3, KCC4, and AE3 in the developing rat auditory brainstem. Cell Tissue Res 2003; 312:155-65. [PMID: 12712325 DOI: 10.1007/s00441-003-0713-5] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2002] [Accepted: 02/19/2003] [Indexed: 11/30/2022]
Abstract
During development of inhibitory synapses, the action of the two neurotransmitters GABA and glycine shifts from depolarizing to hyperpolarizing. The shift is due to an age-dependent regulation of the intracellular free chloride concentration ([Cl(-)](i)) in postsynaptic neurons. A model system to study this maturation process is a glycinergic projection in the mammalian auditory brainstem. It is formed in the superior olivary complex (SOC) by neurons of the medial nucleus of the trapezoid body, whose axons terminate in the lateral superior olive (LSO). LSO neurons of perinatal rats and mice are depolarized upon glycine application, whereas older cells (>postnatal day (P) 8) are hyperpolarized. Here we examined the expression of six secondary active chloride transporter genes ( NCC, NKCC2, KCC1, KCC3, KCC4, and AE3) in the rat SOC to unravel the molecular mechanisms underlying this change. RT-PCR analysis demonstrated brainstem expression of KCC1, KCC3, KCC4, and AE3, but not of NCC and NKCC2. RNA in situ hybridization showed that only AE3 is highly expressed both at P3 (high [Cl(-)](i)) and P12 (low [Cl(-)](i)) in LSO neurons. KCC1 and KCC4 are weakly expressed in LSO neurons at P3 and P12, respectively. This study completes the expression analysis of all known chloride transporters sensitive to loop diuretic drugs in the SOC and demonstrates differences in the maturation between hippocampal and brainstem inhibitory synapses.
Collapse
Affiliation(s)
- Michael Becker
- Abteilung Tierphysiologie, Universität Kaiserslautern, Kaiserslautern, Germany
| | | | | |
Collapse
|
21
|
Abstract
Success in proteomics depends upon careful study design and high-quality biological samples. Advanced information technologies, and also an ability to use existing knowledge to the full, will be crucial in making sense of the data. Despite its genome-scale potential, proteome analysis is at a much earlier stage of development than genomics and gene expression (microarray) studies. Fundamental issues involving biological variability, pre-analytic factors and analytical reproducibility remain to be resolved. Consequently, the analysis of proteomics data is currently informal and relies heavily on expert opinion. Databases and software tools developed for the analysis of molecular sequences and microarrays are helpful, but are limited owing to the unique attributes of proteomics data and differing research goals.
Collapse
Affiliation(s)
- Mark S Boguski
- Human Biology Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, PO Box 19024, Seattle, Washington 98109, USA.
| | | |
Collapse
|
22
|
Abstract
A successful date collection visualization should satisfy a set of many requirements: unification of diverse data formats, support for serendipity research, support of hierarchical structures, algorithmizability, vast information density, Internet-readiness, and other. Recently, virtual reality has made significant progress in engineering, architectural design, entertainment and communication. We experiment with the possibility of using the immersive abstract three-dimensional visualizations of the metabolic networks. We present the trial Metabolic Network Visualizer software, which produces graphical representation of a metabolic network as a VRML world from a formal description written in a simple SGML-type scripting language.
Collapse
|
23
|
Sorek R, Safer HM. A novel algorithm for computational identification of contaminated EST libraries. Nucleic Acids Res 2003; 31:1067-74. [PMID: 12560505 PMCID: PMC149192 DOI: 10.1093/nar/gkg170] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A key goal of the Human Genome Project was to understand the complete set of human proteins, the proteome. Since the genome sequence by itself is not sufficient for predicting new genes and alternative splicing events that lead to new proteins, expressed sequence tags (ESTs) are used as the primary tool for these purposes. The high prevalence of artifacts in dbEST, however, often leads to invalid predictions. Here we describe a novel method for recognizing genomic DNA contamination and other artifacts that cannot be identified using current EST cleaning techniques. Our method uses the alignment of the entire set of ESTs to the human genome to identify highly contaminated EST libraries. We discovered 53 highly contaminated libraries and a subset of 24 766 ESTs from these libraries that probably represent contamination with genomic DNA, pre-mRNA, and ESTs that span non-canonical introns. Although this is only a small fraction of the entire EST dataset, each contaminating sequence could create a spurious transcript prediction. Indeed, in the clustering and assembly tool that we used, these sequences would have caused incorrect inference of 9575 new splice variants and 6370 new genes. Conclusions based on EST analysis, including prediction of alternative splicing, should be re-evaluated in light of these results. Our method, along with the identified set of contaminated sequences, will be essential for applications that depend on large EST datasets.
Collapse
Affiliation(s)
- Rotem Sorek
- Compugen Ltd, 72 Pinchas Rosen Street, Tel Aviv 69512, Israel.
| | | |
Collapse
|
24
|
Vannahme C, Smyth N, Miosge N, Gösling S, Frie C, Paulsson M, Maurer P, Hartmann U. Characterization of SMOC-1, a novel modular calcium-binding protein in basement membranes. J Biol Chem 2002; 277:37977-86. [PMID: 12130637 DOI: 10.1074/jbc.m203830200] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
We have isolated the novel gene SMOC-1 that encodes a secreted modular protein containing an EF-hand calcium-binding domain homologous to that in BM-40. It further consists of two thyroglobulin-like domains, a follistatin-like domain and a novel domain. Recombinant expression in human cells showed that SMOC-1 is a glycoprotein with a calcium-dependent conformation. Results from Northern blots, reverse transcriptase-PCR, and immunoblots revealed a widespread expression in many tissues. Immunofluorescence studies with an antiserum directed against recombinant human SMOC-1 demonstrated a basement membrane localization of the protein and additionally its presence in other extracellular matrices. Immunogold electron microscopy confirmed the localization of SMOC-1 within basement membranes in kidney and skeletal muscle as well as its expression in the zona pellucida surrounding the oocyte.
Collapse
Affiliation(s)
- Christian Vannahme
- Institute for Biochemistry, Medical Faculty, University of Cologne, Joseph-Stelzmann Strasse 52, D-50931 Cologne, Germany
| | | | | | | | | | | | | | | |
Collapse
|
25
|
Abstract
The advent of whole-genome data resources--not only sequence but also other genome-scale data collections such as gene expression, protein interaction, and genetic variation--is having two marked, complementary effects on the relatively new discipline of bioinformatics. First, the veritable flood of data is creating a need and demand for new tools for dealing adequately with the deluge, and, second, the unprecedented extent, diversity, and impending completeness of the data sets are creating opportunities for new approaches to discovery based on computational methods.
Collapse
Affiliation(s)
- D B Searls
- Bioinformatics Department, SmithKline Beecham Pharmaceuticals, King of Prussia, Pennsylvania 19406, USA.
| |
Collapse
|
26
|
Cohen AM, Rumpel K, Coombs GH, Wastling JM. Characterisation of global protein expression by two-dimensional electrophoresis and mass spectrometry: proteomics of Toxoplasma gondii. Int J Parasitol 2002; 32:39-51. [PMID: 11796121 DOI: 10.1016/s0020-7519(01)00308-3] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The development of tools for the analysis of global gene expression is vital for the optimal exploitation of the data on parasite genomes that are now being generated in abundance. Recent advances in two-dimensional electrophoresis (2-DE), mass spectrometry and bioinformatics have greatly enhanced the possibilities for mapping and characterisation of protein populations. We have employed these developments in a proteomics approach for the analysis of proteins expressed in the tachyzoite stage of Toxoplasma gondii. Over 1000 polypeptides were reproducibly separated by high-resolution 2-DE using the pH ranges 4-7 and 6-11. Further separations using narrow range gels suggest that at least 3000-4000 polypeptides should be resolvable by 2-DE using multiple single pH unit gels. Mass spectrometry was used to characterise a variety of protein spots on the 2-DE gels. Peptide mass fingerprints, acquired by matrix-assisted laser desorption/ionisation-(MALDI) mass spectrometry, enabled unambiguous protein identifications to be made where full gene sequence information was available. However, interpretation of peptide mass fingerprint data using the T. gondii expressed sequence tag (EST) database was less reliable. Peptide fragmentation data, acquired by post-source decay mass spectrometry, proved a more successful strategy for the putative identification of proteins using the T. gondii EST database and protein databases from other organisms. In some instances, several protein spots appeared to be encoded by the same gene, indicating that post-translational modification and/or alternative splicing events may be a common feature of functional gene expression in T. gondii. The data demonstrate that proteomic analyses are now viable for T. gondii and other protozoa for which there are good EST databases, even in the absence of complete genome sequence. Moreover, proteomics is of great value in interpreting and annotating EST databases.
Collapse
Affiliation(s)
- A M Cohen
- Division of Infection & Immunity, Joseph Black Building, Institute of Biomedical and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK
| | | | | | | |
Collapse
|
27
|
Tetlow N, Liu D, Board P. Polymorphism of human Alpha class glutathione transferases. PHARMACOGENETICS 2001; 11:609-17. [PMID: 11668220 DOI: 10.1097/00008571-200110000-00007] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
The recognition of the importance and utility of single nucleotide polymorphisms has generated an interest in the development of new strategies for their identification. Analysis of the Expressed Sequence Tag (EST) database can provide a rapid and efficient means of identifying polymorphisms. Screening of the Alpha class glutathione transferases (GSTs) in the EST database identified 10 putative polymorphisms in the coding region of the GSTA1 and GSTA2 genes, six of which were subsequently verified by sequence analysis. Polymerase chain reaction/restriction fragment length polymorphism analysis revealed the existence of three variants, a silent base substitution, K125K (G365A) in GSTA1, and T112S and E210A in GSTA2, in European Australian, African and Chinese populations. The variant isoforms of GSTA2 were expressed in Escherichia coli, purified, and enzymatically characterized. Modelling of the two GSTA2 polymorphisms into a three-dimensional structure of GSTA2, and characterization of their enzymatic properties, has shown that the structure and function of the wild-type GSTA2-2 isoenzyme is not significantly altered by these polymorphisms. This report demonstrates that analysis of the EST database provides a rapid and efficient means of identifying variant proteins.
Collapse
Affiliation(s)
- N Tetlow
- Molecular Genetics Group, Division of Molecular Medicine, John Curtin School of Medical Research, Australian National University, Canberra, Australia
| | | | | |
Collapse
|
28
|
Becker B, Feja N, Melkonian M. Analysis of expressed sequence tags (ESTs) from the scaly green flagellate Scherffelia dubia Pascher emend. Melkonian et Preisig. Protist 2001; 152:139-47. [PMID: 11545437 DOI: 10.1078/1434-4610-00052] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Partial sequencing of cDNA libraries to generate expressed sequence tags (ESTs) is an effective means of gene discovery, generation of molecular markers and characterization of transcription patterns. We have constructed an EST-database of the scaly green flagellate Scherffelia dubia (Chlorophyta) containing 361 sequences. cDNAs were obtained from interphase cells and from cells regenerating flagella. Analysis of the ESTs identified 138 EST-groups with significant similarity to known sequences. 134 EST-groups showed no significant similarity to any sequences in the databases. Most of the ESTs with similarity to known proteins are associated with typical interphase cell functions of a photosynthetic plant cell: assimilation of nutrients and biosynthesis of proteins. Others are related to the activation of the secretory pathway or the biogenesis of scales (e.g. kdo-synthase). Comparison of S. dubia ESTs with the genome of Arabidopsis thaliana and the EST database of Chlamydomonas reinhardtii revealed that S. dubia ESTs with similarity to known proteins were more similar to sequences in C. reinhardtii than to those of A. thaliana. Additionally, ESTs for guanylyl cyclase and cGMP phosphodiesterase are present in the two flagellates, but so far these gene products have not been found in embryophytes.
Collapse
Affiliation(s)
- B Becker
- Botanisches Institut, Universitat zu Köln, Germany.
| | | | | |
Collapse
|
29
|
Abstract
In this postgenomic era, it is no longer necessary to argue the need for automated methods for sequence annotation. Many researchers have designed tools for analyzing DNA sequences, but running multiple tools and interpreting the results can be tedious and confusing. In the last few years, many analysis workbenches have been developed to help streamline the process of sequence annotation. Genotator, developed in 1996, is still a popular choice owing to its ease of use and its configurability. This article will review annotating sequence data using the Genotator.
Collapse
Affiliation(s)
- N L Harris
- Electron Microscopy Unit, Imperial Cancer Research Fund, London, WC2A 3PX, UK
| |
Collapse
|
30
|
Valverde P. Cloning, expression, and mapping of hWW45, a novel human WW domain-containing gene. Biochem Biophys Res Commun 2000; 276:990-8. [PMID: 11027580 DOI: 10.1006/bbrc.2000.3582] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
WW domain-containing proteins are found in all eukaryotes and play an important role in the regulation of a wide variety of cellular functions such as protein degradation, transcription, and RNA splicing. The cloning and characterization of a novel human WW domain-containing gene, hWW45, which encodes a protein of approximately 45 kDa consisting of 2 WW domains and a coiled-coil region is reported here. The murine homologue cDNA, mWW45, displays a different 3'-untranslated region and predicts a protein identity of 93% to hWW45. Northern blot and RT-PCR analysis demonstrated that both mWW45 and hWW45 transcripts are ubiquitously expressed in adult tissues. The mouse embryonic expression is first seen at 7 days post coitum in Northern blot analysis of whole embryos. Chromosomal localization by radiation hybrid mapping revealed that hWW45 is localized at chromosome 14, 10.31cR from the marker D14S269.
Collapse
Affiliation(s)
- P Valverde
- Harvard/Forsyth Department of Oral Biology, Forsyth Institute, 140 Fenway, Boston, Massachusetts, 02115, USA.
| |
Collapse
|
31
|
Schaid DJ, Buetow K, Weeks DE, Wijsman E, Guo SW, Ott J, Dahl C. Discovery of cancer susceptibility genes: study designs, analytic approaches, and trends in technology. J Natl Cancer Inst Monogr 2000:1-16. [PMID: 10854480 DOI: 10.1093/oxfordjournals.jncimonographs.a024219] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Determining the genetic causes of cancers has immense public health benefits, ranging from prevention to earlier detection and treatment of disease. Although a number of cancer susceptibility genes have been successfully identified, design and analytic issues remain that challenge the current paradigm of gene discovery. Some examples are the definition and measurement of cancer phenotype, the use of intermediate end points, the choice of sample (e.g., affected relative pairs versus large extended pedigrees), the choice of analytic method [e.g., parametric logarithm of the odds (LOD) score method versus model-free methods], and the influence of gene-environment interaction on linkage analysis. Furthermore, association methods, based on either the traditional case-control study design or family-based controls, are popular choices to evaluate candidate genes or screen for linkage disequilibrium. Finally, the study design and analytic methods for gene discovery are determined to some extent by what genomic technology is feasible within the laboratory. Many of the main issues related to gene discovery, as well as trends in genomic technology that will impact on gene discovery, are discussed from the perspective of their strengths and weaknesses, pointing to areas in need of further work.
Collapse
Affiliation(s)
- D J Schaid
- Department of Health Sciences Research and Medical Genetics, Mayo Clinic/Mayo Foundation, Rochester, MN 55905, USA.
| | | | | | | | | | | | | |
Collapse
|
32
|
Sunyaev S, Hanke J, Brett D, Aydin A, Zastrow I, Lathe W, Bork P, Reich J. Individual variation in protein-coding sequences of human genome. ADVANCES IN PROTEIN CHEMISTRY 2000; 54:409-37. [PMID: 10829234 DOI: 10.1016/s0065-3233(00)54012-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Affiliation(s)
- S Sunyaev
- Max-Delbrück-Centrum of Molecular Medicine, Berlin-Buch, Germany
| | | | | | | | | | | | | | | |
Collapse
|
33
|
O'Brien KP, Tapia-Páez I, Ståhle-Bäckdahl M, Kedra D, Dumanski JP. Characterization of five novel human genes in the 11q13-q22 region. Biochem Biophys Res Commun 2000; 273:90-4. [PMID: 10873569 DOI: 10.1006/bbrc.2000.2910] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The redundancy of sequences in dbEST has approached a level where contiguous cDNA sequences of genes can be assembled, without the need to physically handle the clones from which the ESTs are derived. This is termed EST based in silico gene cloning. With the availability of sequence chromatogram files for a subset of ESTs, the quality of EST sequences can be ascertained accurately and used in contig assembly. In this report, we performed a study using this approach and isolated five novel human genes, C11orf1-C11orf5, in the 11q13-q22 region. The full open reading frames of these genes were determined by comparison with their orthologs, of which four mouse orthologs were isolated (c11orf1, c11orf2, c11orf3 and c11orf5). These genes were then analyzed using several proteomics tools. Both C11orf1 and C11orf2 are nuclear proteins with no other distinguishing features. C11orf3 is a cytoplasmic protein containing an ATP/GTP binding site, a signal peptide located in the N-terminus and a similarity to the C. elegans protein "Probable ARP 2/3 complex 20kD subunit." C11orf4 is a peptide which displays four putative transmembrane domains and is predicted to have a cytoplasmic localization. It contains signal peptides at the N- and C-termini. C11orf5 is a putative nuclear protein displaying a central coiled coil domain. Here, we propose that this purely EST-based cloning approach can be used by modestly sized laboratories to rapidly and accurately characterize and map a significant number of human genes without the need of further sequencing.
Collapse
Affiliation(s)
- K P O'Brien
- Department of Medicine, Karolinska Hospital, Stockholm, S-171 76, Sweden
| | | | | | | | | |
Collapse
|
34
|
Sadée W, Graul RC, Lee AY. Classification of membrane transporters. PHARMACEUTICAL BIOTECHNOLOGY 2000; 12:29-58. [PMID: 10742971 DOI: 10.1007/0-306-46812-3_2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Affiliation(s)
- W Sadée
- Department of Biopharmaceutical Sciences, School of Pharmacy, University of California San Francisco 94143-0446, USA
| | | | | |
Collapse
|
35
|
Shevchenko A, Loboda A, Shevchenko A, Ens W, Standing KG. MALDI quadrupole time-of-flight mass spectrometry: a powerful tool for proteomic research. Anal Chem 2000; 72:2132-41. [PMID: 10815976 DOI: 10.1021/ac9913659] [Citation(s) in RCA: 190] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
A MALDI QqTOF mass spectrometer has been used to identify proteins separated by one-dimensional or two-dimensional gel electrophoresis at the femtomole level. The high mass resolution and the high mass accuracy of this instrument in both MS and MS/MS modes allow identification of a protein either by peptide mass fingerprinting of the protein digest or from tandem mass spectra acquired by collision-induced dissociation of individual peptide precursors. A peptide mass map of the digest and tandem mass spectra of multiple peptide precursor ions can be acquired from the same sample in the course of a single experiment. Database searching and acquisition of MS and MS/MS spectra can be combined in an interactive fashion, increasing the information value of the analytical data. The approach has demonstrated its usefulness in the comprehensive characterization of protein in-gel digests, in the dissection of complex protein mixtures, and in sequencing of a low molecular weight integral membrane protein. Proteins can be identified in all types of sequence databases, including an EST database. Thus, MALDI QqTOF mass spectrometry promises to have remarkable potential for advancing proteomic research.
Collapse
Affiliation(s)
- A Shevchenko
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.
| | | | | | | | | |
Collapse
|
36
|
Wang SM, Fears SC, Zhang L, Chen JJ, Rowley JD. Screening poly(dA/dT)- cDNAs for gene identification. Proc Natl Acad Sci U S A 2000; 97:4162-7. [PMID: 10760283 PMCID: PMC18183 DOI: 10.1073/pnas.97.8.4162] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Many genes expressed in the human genome have not been identified despite intensive efforts. We observed that the presence of long poly(dA/dT) sequences in the 3' end of cDNA templates contributes significantly to this problem, because the hybrids formed randomly between poly(dA) and poly(dT) sequences of unrelated cDNA templates lead to loss of many templates in the normalization/subtraction reactions. The low abundant copies, which account for the majority of the expressed genes, are affected in particular by this phenomenon. We have developed a strategy called screening poly(dA/dT)(-) cDNAs for gene identification to overcome this obstacle. Applying this strategy can significantly enhance the efficiency of genome-wide gene identification and should have an impact on many functional genomic studies in the postgenome era.
Collapse
Affiliation(s)
- S M Wang
- Section of Hematology and Oncology, University of Chicago Medical Center, 5841 South Maryland Avenue, MC 2115, Chicago, IL 60637-1470, USA.
| | | | | | | | | |
Collapse
|
37
|
Affiliation(s)
- K J Martin
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02115, USA
| | | |
Collapse
|
38
|
Schölzel S, Zimmermann W, Schwarzkopf G, Grunert F, Rogaczewski B, Thompson J. Carcinoembryonic antigen family members CEACAM6 and CEACAM7 are differentially expressed in normal tissues and oppositely deregulated in hyperplastic colorectal polyps and early adenomas. THE AMERICAN JOURNAL OF PATHOLOGY 2000; 156:595-605. [PMID: 10666389 PMCID: PMC1850034 DOI: 10.1016/s0002-9440(10)64764-5] [Citation(s) in RCA: 150] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Four members of the carcinoembryonic antigen (CEA) family, CEA, CEACAM1 (BGP), CEACAM6 (NCA-50/90), and CEACAM7 (CGM2), are coexpressed in normal colorectal epithelia but are deregulated in colorectal cancers, where they could play a role in tumorigenesis. As a basis for functional studies, their expression patterns in normal tissues first need to be clarified. This is well documented for CEACAM1 and CEA but not for CEACAM6 or CEACAM7. We have now carried out immunohistochemical expression studies on 35 different organs, using CEACAM6-specific (9A6) and CEACAM7-specific (BAC2) monoclonal antibodies. CEACAM7 was only found on the apical surface of highly differentiated epithelial cells in the colorectal mucosa and on isolated ductal epithelial cells within the pancreas. CEACAM6 was expressed in granulocytes and epithelia from various organs. CEACAM6 and CEACAM7 expression correlated with apoptosis at the table region of the normal colon, and both were absent from highly proliferating cells at the base of colonic crypts. CEACAM6 revealed a broader expression zone in proliferating cells in hyperplastic polyps and adenomas compared with normal mucosa, whereas CEACAM7 was completely absent. Down-regulation of CEACAM7 and up-regulation of CEACAM6 expression in hyperplastic polyps and early adenomas represent some of the earliest observable molecular events leading to colorectal tumors.
Collapse
Affiliation(s)
- S Schölzel
- Institutes of Molecular Medicine, University of Freiburg, Department of Surgery, University Hospital, Freiburg, Germany
| | | | | | | | | | | |
Collapse
|
39
|
Yang JY, Spanaus KS, Widmer U. Cloning, characterization and genomic organization of LCC-1 (scya16), a novel human CC chemokine expressed in liver. Cytokine 2000; 12:101-9. [PMID: 10671294 DOI: 10.1006/cyto.1999.0548] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
By homology search of expressed sequence tags (EST) in GenBank a novel member of the CC chemokine family was identified. The full-length sequence of this liver-specific CC chemokine (LCC-1) predicted a mature protein of 97 amino acids with 31-48% identity to other CC chemokines. There was a characteristic amino acid C-term extension when aligned with other chemokines. Northern blot analysis from a panel of human tissues revealed that LCC-1 mRNA expression is restricted to adult and fetal liver. Different polyadenylation results in two mRNA species of 1.5 kb and 0.5 kb in size. LCC-1 is constitutively expressed in human HepG2 hepatoma cells and is induced by hypoxic exposure. The promoter region of the LCC-1 gene contains potential HIF-1 binding sites. The EST for LCC-1 has been previously mapped to the CC chemokine cluster on human chromosome 17q11.2. The organization of the LCC-1 gene (scya16) into three exons interrupted by two introns is identical to that found for other members of the CC chemokine family.
Collapse
MESH Headings
- Adult
- Amino Acid Sequence
- Base Sequence
- Binding Sites/genetics
- Carcinoma, Hepatocellular/genetics
- Carcinoma, Hepatocellular/immunology
- Cell Hypoxia/genetics
- Cell Hypoxia/immunology
- Chemokines, CC/genetics
- Chromosomes, Human, Pair 17/genetics
- Cloning, Molecular
- DNA/genetics
- DNA-Binding Proteins/metabolism
- Expressed Sequence Tags
- Gene Expression
- Humans
- Hypoxia-Inducible Factor 1
- Hypoxia-Inducible Factor 1, alpha Subunit
- Liver/immunology
- Liver Neoplasms/genetics
- Liver Neoplasms/immunology
- Molecular Sequence Data
- Molecular Weight
- Nuclear Proteins/metabolism
- Promoter Regions, Genetic
- RNA, Messenger/chemistry
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- RNA, Neoplasm/genetics
- RNA, Neoplasm/metabolism
- Sequence Homology, Amino Acid
- Transcription Factors
Collapse
Affiliation(s)
- J Y Yang
- Division of Clinical Immunology, Department of Medicine, University Hospital, Zürich, CH-8091, Switzerland
| | | | | |
Collapse
|
40
|
Ploger R, Zhang J, Bassett D, Reeves R, Hieter P, Boguski M, Spencer F. XREFdb: cross-referencing the genetics and genes of mammals and model organisms. Nucleic Acids Res 2000; 28:120-2. [PMID: 10592198 PMCID: PMC102470 DOI: 10.1093/nar/28.1.120] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
XREFdb supports the investigation of protein function in the context of information available through work in multiple organisms. In addition to facilitating the association of functional data among known genes from multiple organisms, XREFdb has developed strategies that provide access to information associated with as-yet unstudied genes. The database organizes protein similarity and genetic map positional information from diverse sources in the public domain to facilitate investigator evaluation of potential functional significance. XREFdb is found at URL www.ncbi.nlm.nih.gov/XREFdb
Collapse
Affiliation(s)
- R Ploger
- Institute for Genetic Medicine, Johns Hopkins University School of Medicine, Rutland Avenue, Baltimore, MD 21205, USA
| | | | | | | | | | | | | |
Collapse
|
41
|
Abstract
Annotation of large-scale gene sequence data will benefit from comprehensive and consistent application of well-documented, standard analysis methods and from progressive and vigilant efforts to ensure quality and utility and to keep the annotation up to date. However, it is imperative to learn how to apply information derived from functional genomics and proteomics technologies to conceptualize and explain the behaviors of biological systems. Quantitative and dynamical models of systems behaviors will supersede the limited and static forms of single-gene annotation that are now the norm. Molecular biological epistemology will increasingly encompass both teleological and causal explanations.
Collapse
Affiliation(s)
- M S Boguski
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| |
Collapse
|
42
|
Urményi TP, Bonaldo MF, Soares MB, Rondinelli E. Construction of a normalized cDNA library for the Trypanosoma cruzi genome project. J Eukaryot Microbiol 1999; 46:542-4. [PMID: 10519221 DOI: 10.1111/j.1550-7408.1999.tb06072.x] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Sequencing of the Trypanosoma cruzi genome is underway. Expressed sequence tags, obtained from cDNA libraries, facilitate mapping and gene discovery. The efficiency of large-scale generation of such tags is increased when using normalized cDNA libraries, where the frequency of individual clones is brought within a narrow range. Repetitive sequencing of abundant clones is therefore minimized. We constructed a normalized cDNA library from epimastigotes of clone CL Brener, and the efficiency of normalization of representative clones was assessed and shown to be adequate. The normalized cDNA library has been distributed to several groups and large-scale sequencing is currently in progress.
Collapse
Affiliation(s)
- T P Urményi
- Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Brazil
| | | | | | | |
Collapse
|
43
|
Vannahme C, Schübel S, Herud M, Gösling S, Hülsmann H, Paulsson M, Hartmann U, Maurer P. Molecular cloning of testican-2: defining a novel calcium-binding proteoglycan family expressed in brain. J Neurochem 1999; 73:12-20. [PMID: 10386950 DOI: 10.1046/j.1471-4159.1999.0730012.x] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
We have screened a human cDNA library using an expressed sequence tag related to the BM-40/secreted protein, acidic and rich in cysteine (SPARC)/osteonectin family of proteins and isolated a novel cDNA. It encodes a protein precursor of 424 amino acids that consists of a signal peptide, a follistatin-like domain, a Ca2+-binding domain, a thyroglobulin-like domain, and a C-terminal region with two putative glycosaminoglycan attachment sites. The protein is homologous to testican-1 and was termed testican-2. Testican-1 is a proteoglycan originally isolated from human seminal plasma that is also expressed in brain. Northern blot hybridization of testican-2 showed a 6.1-kb mRNA expressed mainly in CNS but also found in lung and testis. A widespread expression in multiple neuronal cell types in olfactory bulb, cerebral cortex, thalamus, hippocampus, cerebellum, and medulla was detected by in situ hybridization. A recombinant fragment consisting of the Ca2+-binding EF-hand domain and the thyroglobulin-like domain of testican-2 showed a reversible Ca2+-dependent conformational change in circular dichroism studies. Testican-1 and -2 form a novel Ca2+-binding proteoglycan family built of modular domains with the potential to participate in diverse steps of neurogenesis.
Collapse
Affiliation(s)
- C Vannahme
- Institute for Biochemistry II, Medical Faculty, University of Cologne, Germany
| | | | | | | | | | | | | | | |
Collapse
|
44
|
Jordan BR. 'Genomics': buzzword or reality? J Biomed Sci 1999; 6:145-50. [PMID: 10343163 DOI: 10.1007/bf02255898] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
'Genomics' has become a widely used term, covering a range of approaches that make use of the newly acquired wealth of genome data (both on man and on a number of model organisms) to gain new insights and accelerate research. This review attempts to present a clear and balanced view of developments in this field, to describe the four major approaches that contribute to genomics (bioinformatics, genetic analysis of extended populations, large-scale expression studies, functional approaches), and to indicate applications in basic and pharmaceutical research.
Collapse
Affiliation(s)
- B R Jordan
- TAGC Group, Institut de Cancérologie et d'Immunologie de Marseille, Centre d'Immunologie INSERM/CNRS de Marseille-Luminy, Marseille, France.
| |
Collapse
|
45
|
Walkey CJ, Shields DJ, Vance DE. Identification of three novel cDNAs for human phosphatidylethanolamine N-methyltransferase and localization of the human gene on chromosome 17p11.2. BIOCHIMICA ET BIOPHYSICA ACTA 1999; 1436:405-12. [PMID: 9989271 DOI: 10.1016/s0005-2760(98)00147-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Phosphatidylethanolamine is converted to phosphatidylcholine in mammalian liver by the enzyme phosphatidylethanolamine N-methyltransferase (PEMT). A form of the enzyme (PEMT2) has been isolated from rat liver, the cDNA cloned and expressed and the murine gene has been characterized and disrupted. Several lines of evidence suggested that PEMT2 might have a role in hepatocyte proliferation and liver cancer. Hence, we decided to investigate the human form of the enzyme. Unexpectedly, we cloned and expressed three novel human cDNAs encoding PEMT2. These forms differ from each other in the 5'-region with the point of divergence being 15 nucleotides upstream of the putative translation initiation codon. The remainder of the three cDNAs was identical. Expression of the coding region of the cDNAs in McArdle rat hepatoma cells resulted in three stable cell lines that showed a 27- to 115-fold elevation of PEMT activity compared to vector-transfected control cell lines. Screening of somatic cell hybrid panels, radiation hybrid panel mapping and fluorescent in situ hybridization mapping localized the human gene for PEMT2 to chromosome 17p11.2. The identification of three different human cDNAs for PEMT2 suggests that understanding the function of PEMT2 will be more complicated than anticipated.
Collapse
Affiliation(s)
- C J Walkey
- Lipid and Lipoprotein Research Group, University of Alberta, Edmonton, Canada
| | | | | |
Collapse
|
46
|
Tonetti M, Sturla L, Bisso A, Zanardi D, Benatti U, De Flora A. The metabolism of 6-deoxyhexoses in bacterial and animal cells. Biochimie 1998; 80:923-31. [PMID: 9893952 DOI: 10.1016/s0300-9084(00)88889-6] [Citation(s) in RCA: 48] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
L-fucose and L-rhamnose are two 6-deoxyhexoses naturally occurring in several complex carbohydrates. In prokaryotes both of them are found in polysaccharides of the cell wall, while in animals only L-fucose has been described, which mainly participates to the structure of glycoconjugates, either in the cell membrane or secreted in biological fluids, such as ABH blood groups and Lewis system antigens. L-fucose and L-rhamnose are synthesized by two de novo biosynthetic pathways starting from GDP-D-mannose and dTDP-D-glucose, respectively, which share several common features. The first step for both pathways is a dehydration reaction catalyzed by specific nucleotide-sugar dehydratases. This leads to the formation of unstable 4-keto-6-deoxy intermediates, which undergo a subsequent epimerization reaction responsible for the change from D- to L-conformation, and then a NADPH-dependent reduction of the 4-keto group, with the consequent formation of either GDP-L-fucose or dTDP-L-rhamnose. These compounds are then the substrates of specific glycosyltransferases which are responsible for insertion of either L-fucose or L-rhamnose in the corresponding glycoconjugates. The enzyme involved in the first step of GDP-L-fucose biosynthesis in E. coli, i.e., GDP-D-mannose 4,6 dehydratase, has been recently expressed as recombinant protein and characterized in our laboratory. We have also cloned and fully characterized a human protein, formerly named FX, and an E. coli protein, WcaG, which display both the epimerase and the reductase activities, thus indicating that only two enzymes are required for GDP-L-fucose production. Fucosylated complex glycoconjugates at the cell surface can then be recognized by specific counter-receptors in interacting cells, these mechanisms initiating important processes including inflammation and metastasis. The second pathway starting from dTDP-D-glucose leads to the synthesis of antibiotic glycosides or, alternatively, to the production of dTDP-L-rhamnose. While several sets of data are available on the first enzyme of the pathway, i.e., dTDP-D-glucose dehydratase, the enzymes involved in the following steps still need to be identified and characterized.
Collapse
Affiliation(s)
- M Tonetti
- Institute of Biochemistry, University of Genova, Italy
| | | | | | | | | | | |
Collapse
|
47
|
Wang SM, Rowley JD. A strategy for genome-wide gene analysis: integrated procedure for gene identification. Proc Natl Acad Sci U S A 1998; 95:11909-14. [PMID: 9751764 PMCID: PMC21739 DOI: 10.1073/pnas.95.20.11909] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We have developed a technique called the Integrated Procedure for Gene Identification that modifies and integrates parts from several existing techniques to increase the efficiency for genome-wide gene identification. The procedure has the following features: (i) Only the 3' portion of the expressed templates is used to ensure a match to 3' expressed sequence tag (EST) sequences; (ii) the 3' portion of the cDNA is poly dA/poly dT minus, which maintains complete representation of the expressed copies, particularly the rare copies, which otherwise would be lost heavily because of random poly dA/poly dT hybridization in the subtraction reaction; (iii) redundancy is decreased substantially by the subtraction reaction to reduce the effort for sequencing analysis; (iv) the nonsubtracted templates that largely contain the rare copies are amplified selectively with suppression PCR and are sequenced directly or through serial analysis of gene expression (SAGE); and (v) the identified sequences are matched to databases to determine whether they are cloned genes, ESTs, or novel sequences. Using this procedure in a model system, we showed that the redundant copies were largely removed, and the rates of EST matches and the novel sequence identification were significantly increased. Most of the plasmids containing the matched EST are readily available from the IMAGE consortium. This technique can be used to index genome-wide expressed genes and to identify differentially expressed genes in different cells. Compared with the existing techniques, this procedure is relatively efficient, simple, less expensive, and labor intensive. It is especially useful for standard molecular laboratories to perform genome-wide studies.
Collapse
Affiliation(s)
- S M Wang
- Section of Hematology and Oncology, University of Chicago Medical Center, 5841 South Maryland Avenue, MC 2115, Chicago, IL 60637-1470, USA
| | | |
Collapse
|
48
|
Kas K, Voz ML, Hensen K, Meyen E, Van de Ven WJ. Transcriptional activation capacity of the novel PLAG family of zinc finger proteins. J Biol Chem 1998; 273:23026-32. [PMID: 9722527 DOI: 10.1074/jbc.273.36.23026] [Citation(s) in RCA: 120] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
We have isolated and characterized two novel cDNAs encoding C2H2 zinc finger proteins showing high sequence homology to PLAG1, a protein ectopically activated by promoter swapping or promoter substitution in pleomorphic adenomas with chromosomal abnormalities at chromosome 8q12. PLAG1 and the two new PLAG1 family members (PLAGL1 and PLAGL2) constitute a novel subfamily of zinc finger proteins that recognize DNA and/or RNA. To examine the potential of the three human proteins to modulate transcription, we constructed several PLAG/GAL4 DNA binding domain fusion proteins and measured their ability to activate transcription of a reporter gene construct in different mammalian cell lines and in yeast. Although the carboxyl-terminal part of PLAGL1 shows strong overall transcriptional activity in mesenchymal (COS-1) and epithelial cells (293), both PLAG1 and PLAGL2 transactivate in mesenchymal cells only if depleted from a repressing region. This effect is less profound in epithelial cells. These data suggest that the activation in pleomorphic adenomas of PLAG1 most likely results in uncontrolled activation of downstream target genes.
Collapse
Affiliation(s)
- K Kas
- Laboratory for Molecular Oncology, Center for Human Genetics, University of Leuven and Flanders Interuniversity Institute for Biotechnology, Herestraat 49, B-3000 Leuven, Belgium.
| | | | | | | | | |
Collapse
|
49
|
Bond HM, Bonelli P, Mesuraca M, Agosti V, Masone C, Cuomo C, Nisticò A, Tassone P, Tuccillo F, Cecco L, Iacopino L, Barbieri V, Cerra M, Costanzo FS, Morrone G, Venuta S. Identification by differential display of transcripts regulated during hematopoietic differentiation. Stem Cells 1998; 16:136-43. [PMID: 9554038 DOI: 10.1002/stem.160136] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The polymerase chain reaction-based differential display method (DDRT-PCR) was used to identify mRNAs differentially expressed during the maturation of human CD34+ progenitor cells stimulated to differentiate in vitro towards granulomonocytic or erythroid lineages with a mixture of hemopoietins (kit ligand + interleukin 3 + GM-CSF in the absence or presence of erythropoietin, respectively). Three cDNA transcripts (B32, B41, and B56) display differential expression during cytokine-induced maturation of CD34+ cells. These clones have no homology with already-described sequences. Primer extension cofirmed the presence of the corresponding mRNA. The levels of mRNA corresponding to B32 are enhanced in the later phases of the granulomonocytic as well as in the erythroid differentiation of CD34+ cells. The mRNA identified by B41 was induced by a late stage in only granulomonocytic differentiation of CD34+ cells. The mRNA corresponding to B56 was instead present in nonstimulated CD34+ cells, declined in the early stages of differentiation, and reappeared at later stages in cells treated with both combinations of cytokines. Expression of these genes was detected in a number of acute myelogenous leukemias, as well as in some leukemic cell lines. B32 and B41 were downregulated in KG-1 cells induced to differentiate towards the monocytic lineage, whereas the levels of B56 were unchanged. In K562 cells, clones B41 and B56 were downregulated only in the late phases of PMA-induced megakaryocytic differentiation and during erythroid differentiation. B32 was rapidly downregulated when K562 cells were induced to differentiate towards either megakaryocytic or erythroid phenotypes. These transcripts represent novel hematopoietic cDNAs that should prove of value for the study of human blood cells and their disorders.
Collapse
Affiliation(s)
- H M Bond
- Department of Experimental Oncology, National Cancer Institute, Fondazione G. Pascale, Napoli, Italy
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Fessing MY, Belkov VM, Krynetski EY, Evans WE. Molecular cloning and functional characterization of the cDNA encoding the murine thiopurine S-methyltransferase (TPMT). FEBS Lett 1998; 424:143-5. [PMID: 9539138 DOI: 10.1016/s0014-5793(98)00159-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Thiopurine S-methyltransferase (TPMT) is a cytosolic enzyme that catalyzes S-methylation of aromatic and heterocyclic sulfhydryl compounds, including anticancer and immunosuppressive thiopurines. Here we report the isolation and functional characterization of the murine TPMT cDNA. The screening of expressed sequence tags database led to isolation of a murine cDNA clone containing an uninterrupted ORF encoding the protein with an amino acid sequence that is 82% similar and 78% identical to the human TPMT. The expression product of the murine cDNA in rabbit reticulocyte and wheat germ lysate coupled transcription-translation systems showed TPMT enzymatic activity. We conclude that the isolated cDNA clone represents the murine TPMT cDNA.
Collapse
Affiliation(s)
- M Y Fessing
- Department of Pharmaceutical Sciences, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | | | | | | |
Collapse
|