1
|
Chen Y, Sheng G, Wang G. CapsNet-TIS: Predicting translation initiation site based on multi-feature fusion and improved capsule network. Gene 2024; 924:148598. [PMID: 38782224 DOI: 10.1016/j.gene.2024.148598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 04/22/2024] [Accepted: 05/20/2024] [Indexed: 05/25/2024]
Abstract
Genes are the basic units of protein synthesis in organisms, and accurately identifying the translation initiation site (TIS) of genes is crucial for understanding the regulation, transcription, and translation processes of genes. However, the existing models cannot adequately extract the feature information in TIS sequences, and they also inadequately capture the complex hierarchical relationships among features. Therefore, a novel predictor named CapsNet-TIS is proposed in this paper. CapsNet-TIS first fully extracts the TIS sequence information using four encoding methods, including One-hot encoding, physical structure property (PSP) encoding, nucleotide chemical property (NCP) encoding, and nucleotide density (ND) encoding. Next, multi-scale convolutional neural networks are used to perform feature fusion of the encoded features to enhance the comprehensiveness of the feature representation. Finally, the fused features are classified using capsule network as the main network of the classification model to capture the complex hierarchical relationships among the features. Moreover, we improve the capsule network by introducing residual block, channel attention, and BiLSTM to enhance the model's feature extraction and sequence data modeling capabilities. In this paper, the performance of CapsNet-TIS is evaluated using TIS datasets from four species: human, mouse, bovine, and fruit fly, and the effectiveness of each part is demonstrated by performing ablation experiments. By comparing the experimental results with models proposed by other researchers, the results demonstrate the superior performance of CapsNet-TIS.
Collapse
Affiliation(s)
- Yu Chen
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China.
| | - Guojun Sheng
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
| | - Gang Wang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
| |
Collapse
|
2
|
Odogwu NM, Hagen C, Nelson TJ. Transcriptome studies of congenital heart diseases: identifying current gaps and therapeutic frontiers. Front Genet 2023; 14:1278747. [PMID: 38152655 PMCID: PMC10751320 DOI: 10.3389/fgene.2023.1278747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 11/16/2023] [Indexed: 12/29/2023] Open
Abstract
Congenital heart disease (CHD) are genetically complex and comprise a wide range of structural defects that often predispose to - early heart failure, a common cause of neonatal morbidity and mortality. Transcriptome studies of CHD in human pediatric patients indicated a broad spectrum of diverse molecular signatures across various types of CHD. In order to advance research on congenital heart diseases (CHDs), we conducted a detailed review of transcriptome studies on this topic. Our analysis identified gaps in the literature, with a particular focus on the cardiac transcriptome signatures found in various biological specimens across different types of CHDs. In addition to translational studies involving human subjects, we also examined transcriptomic analyses of CHDs in a range of model systems, including iPSCs and animal models. We concluded that RNA-seq technology has revolutionized medical research and many of the discoveries from CHD transcriptome studies draw attention to biological pathways that concurrently open the door to a better understanding of cardiac development and related therapeutic avenue. While some crucial impediments to perfectly studying CHDs in this context remain obtaining pediatric cardiac tissue samples, phenotypic variation, and the lack of anatomical/spatial context with model systems. Combining model systems, RNA-seq technology, and integrating algorithms for analyzing transcriptomic data at both single-cell and high throughput spatial resolution is expected to continue uncovering unique biological pathways that are perturbed in CHDs, thus facilitating the development of novel therapy for congenital heart disease.
Collapse
Affiliation(s)
- Nkechi Martina Odogwu
- Program for Hypoplastic Left Heart Syndrome, Mayo Clinic, Rochester, MN, United States
| | - Clinton Hagen
- Program for Hypoplastic Left Heart Syndrome, Mayo Clinic, Rochester, MN, United States
| | - Timothy J. Nelson
- Program for Hypoplastic Left Heart Syndrome, Mayo Clinic, Rochester, MN, United States
- Center for Regenerative Medicine, Mayo Clinic, Rochester, MN, United States
- Division of General Internal Medicine, Mayo Clinic, Rochester, MN, United States
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, United States
- Division of Pediatric Cardiology, Department of Pediatric and Adolescent Medicine, Mayo Clinic, Rochester, MN, United States
| |
Collapse
|
3
|
Abstract
Within the next decade, the genomes of 1.8 million eukaryotic species will be sequenced. Identifying genes in these sequences is essential to understand the biology of the species. This is challenging due to the transcriptional complexity of eukaryotic genomes, which encode hundreds of thousands of transcripts of multiple types. Among these, a small set of protein-coding mRNAs play a disproportionately large role in defining phenotypes. Due to their sequence conservation, orthology can be established, making it possible to define the universal catalog of eukaryotic protein-coding genes. This catalog should substantially contribute to uncovering the genomic events underlying the emergence of eukaryotic phenotypes. This piece briefly reviews the basics of protein-coding gene prediction, discusses challenges in finalizing annotation of the human genome, and proposes strategies for producing annotations across the eukaryotic Tree of Life. This lays the groundwork for obtaining the catalog of all genes-the Earth's code of life.
Collapse
Affiliation(s)
- Roderic Guigó
- Bioinformatics and Genomics, Center for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology (BIST), Dr. Aiguader 88, 08003 Barcelona, Catalonia
- Universitat Pompeu Fabra (UPF), Barcelona, Catalonia
| |
Collapse
|
4
|
Ruozi G, Bortolotti F, Mura A, Tomczyk M, Falcione A, Martinelli V, Vodret S, Braga L, Dal Ferro M, Cannatà A, Zentilin L, Sinagra G, Zacchigna S, Giacca M. Cardioprotective factors against myocardial infarction selected in vivo from an AAV secretome library. Sci Transl Med 2022; 14:eabo0699. [PMID: 36044596 DOI: 10.1126/scitranslmed.abo0699] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Therapies for patients with myocardial infarction and heart failure are urgently needed, in light of the breadth of these conditions and lack of curative treatments. To systematically identify previously unidentified cardioactive biologicals in an unbiased manner in vivo, we developed cardiac FunSel, a method for the systematic, functional selection of effective factors using a library of 1198 barcoded adeno-associated virus (AAV) vectors encoding for the mouse secretome. By pooled vector injection into the heart, this library was screened to functionally select for factors that confer cardioprotection against myocardial infarction. After two rounds of iterative selection in mice, cardiac FunSel identified three proteins [chordin-like 1 (Chrdl1), family with sequence similarity 3 member C (Fam3c), and Fam3b] that preserve cardiomyocyte viability, sustain cardiac function, and prevent pathological remodeling. In particular, Chrdl1 exerted its protective activity by binding and inhibiting extracellular bone morphogenetic protein 4 (BMP4), which resulted in protection against cardiomyocyte death and induction of autophagy in cardiomyocytes after myocardial infarction. Chrdl1 also inhibited fibrosis and maladaptive cardiac remodeling by binding transforming growth factor-β (TGF-β) and preventing cardiac fibroblast differentiation into myofibroblasts. Production of secreted and circulating Chrdl1, Fam3c, and Fam3b from the liver also protected the heart from myocardial infarction, thus supporting the use of the three proteins as recombinant factors. Together, these findings disclose a powerful method for the in vivo, unbiased selection of tissue-protective factors and describe potential cardiac therapeutics.
Collapse
Affiliation(s)
- Giulia Ruozi
- Molecular Medicine Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy
| | - Francesca Bortolotti
- Molecular Medicine Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy.,Cardiovascular Department, ASUGI, 34149 Trieste, Italy
| | - Antonio Mura
- Molecular Medicine Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy
| | - Mateusz Tomczyk
- Molecular Medicine Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy.,British Heart Foundation Centre of Research Excellence, School of Cardiovascular Medicine and Sciences, King's College London, London SE5 9NU, UK
| | - Antonella Falcione
- Molecular Medicine Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy
| | - Valentina Martinelli
- Molecular Medicine Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy
| | - Simone Vodret
- Cardiovascular Biology Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy
| | - Luca Braga
- Molecular Medicine Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy.,British Heart Foundation Centre of Research Excellence, School of Cardiovascular Medicine and Sciences, King's College London, London SE5 9NU, UK
| | | | - Antonio Cannatà
- Cardiovascular Department, ASUGI, 34149 Trieste, Italy.,British Heart Foundation Centre of Research Excellence, School of Cardiovascular Medicine and Sciences, King's College London, London SE5 9NU, UK
| | - Lorena Zentilin
- Molecular Medicine Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy
| | - Gianfranco Sinagra
- Cardiovascular Department, ASUGI, 34149 Trieste, Italy.,Department of Medical, Surgical and Health Sciences, University of Trieste, 34149 Trieste, Italy
| | - Serena Zacchigna
- Cardiovascular Biology Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy.,Department of Medical, Surgical and Health Sciences, University of Trieste, 34149 Trieste, Italy
| | - Mauro Giacca
- Molecular Medicine Laboratory, International Centre for Genetic Engineering and Biotechnology (ICGEB), 34139 Trieste, Italy.,British Heart Foundation Centre of Research Excellence, School of Cardiovascular Medicine and Sciences, King's College London, London SE5 9NU, UK.,Department of Medical, Surgical and Health Sciences, University of Trieste, 34149 Trieste, Italy
| |
Collapse
|
5
|
Renalase: a novel regulator of cardiometabolic and renal diseases. Hypertens Res 2022; 45:1582-1598. [PMID: 35941358 PMCID: PMC9358379 DOI: 10.1038/s41440-022-00986-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Revised: 04/26/2022] [Accepted: 06/05/2022] [Indexed: 11/13/2022]
Abstract
Renalase is a ~38 kDa flavin-adenine dinucleotide (FAD) domain-containing protein that can function as a cytokine and an anomerase. It is emerging as a novel regulator of cardiometabolic diseases. Expressed mainly in the kidneys, renalase has been reported to have a hypotensive effect and may control blood pressure through regulation of sympathetic tone. Furthermore, genetic variations in the renalase gene, such as a functional missense polymorphism (Glu37Asp), have implications in the cardiovascular and renal systems and can potentially increase the risk of cardiometabolic disorders. Research on the physiological functions and biochemical actions of renalase over the years has indicated a role for renalase as one of the key proteins involved in various disease states, such as diabetes, impaired lipid metabolism, and cancer. Recent studies have identified three transcription factors (viz., Sp1, STAT3, and ZBP89) as key positive regulators in modulating the expression of the human renalase gene. Moreover, renalase is under the post-transcriptional regulation of two microRNAs (viz., miR-29b, and miR-146a), which downregulate renalase expression. While renalase supplementation may be useful for treating hypertension, inhibition of renalase signaling may be beneficial to patients with cancerous tumors. However, more incisive investigations are required to unravel the potential therapeutic applications of renalase. Based on the literature pertaining to the function and physiology of renalase, this review attempts to consolidate and comprehend the role of renalase in regulating cardiometabolic and renal disorders. ![]()
Collapse
|
6
|
Guo Y, Zhou D, Cao J, Nie R, Ruan X, Liu Y. Gated residual neural networks with self-normalization for translation initiation site recognition. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107783] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
7
|
Stieglitz JT, Potts KA, Van Deventer JA. Broadening the Toolkit for Quantitatively Evaluating Noncanonical Amino Acid Incorporation in Yeast. ACS Synth Biol 2021; 10:3094-3104. [PMID: 34730946 DOI: 10.1021/acssynbio.1c00370] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Genetic code expansion is a powerful approach for advancing critical fields such as biological therapeutic discovery. However, the machinery for genetically encoding noncanonical amino acids (ncAAs) is only available in limited plasmid formats, constraining potential applications. In extreme cases, the introduction of two separate plasmids, one containing an orthogonal translation system (OTS) to facilitate ncAA incorporation and a second for expressing a ncAA-containing protein of interest, is not possible due to a lack of the available selection markers. One strategy to circumvent this challenge is to express the OTS and protein of interest from a single vector. For what we believe is the first time in yeast, we describe here several sets of single plasmid systems (SPSs) for performing genetic code manipulation and compare the ncAA incorporation capabilities of these plasmids against the capabilities of previously described dual plasmid systems (DPSs). For both dual fluorescent protein reporters and yeast display reporters tested with multiple OTSs and ncAAs, measured ncAA incorporation efficiencies with SPSs were determined to be equal to efficiencies determined with DPSs. Click chemistry on yeast cells displaying ncAA-containing proteins was also shown to be feasible in both formats, although differences in reactivity between formats suggest the need for caution when using such approaches. Additionally, we investigated whether these reporters would support the separation of yeast strains known to exhibit distinct ncAA incorporation efficiencies. Model sorts conducted with mixtures of two strains transformed with the same SPS or DPS both led to the enrichment of a strain known to support a higher efficiency ncAA incorporation, suggesting that these reporters will be suitable for conducting screens for strains exhibiting enhanced ncAA incorporation efficiencies. Overall, our results confirm that SPSs are well behaved in yeast and provide a convenient alternative to DPSs. SPSs are expected to be invaluable for conducting high-throughput investigations of the effects of genetic or genomic changes on ncAA incorporation efficiency and, more fundamentally, the eukaryotic translation apparatus.
Collapse
Affiliation(s)
- Jessica T. Stieglitz
- Chemical and Biological Engineering Department, Tufts University, Medford, Massachusetts 02155, United States
| | - Kelly A. Potts
- Chemical and Biological Engineering Department, Tufts University, Medford, Massachusetts 02155, United States
| | - James A. Van Deventer
- Chemical and Biological Engineering Department, Tufts University, Medford, Massachusetts 02155, United States
- Biomedical Engineering Department, Tufts University, Medford, Massachusetts 02155, United States
| |
Collapse
|
8
|
Guo Y, Zhou D, Li W, Cao J, Nie R, Xiong L, Ruan X. Identifying polyadenylation signals with biological embedding via self-attentive gated convolutional highway networks. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107133] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
9
|
Gupta A, Bansal M. RNA-mediated translation regulation in viral genomes: computational advances in the recognition of sequences and structures. Brief Bioinform 2020; 21:1151-1163. [PMID: 31204430 PMCID: PMC7109810 DOI: 10.1093/bib/bbz054] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 03/24/2019] [Accepted: 04/15/2019] [Indexed: 12/30/2022] Open
Abstract
RNA structures are widely distributed across all life forms. The global conformation of these structures is defined by a variety of constituent structural units such as helices, hairpin loops, kissing-loop motifs and pseudoknots, which often behave in a modular way. Their ubiquitous distribution is associated with a variety of functions in biological processes. The location of these structures in the genomes of RNA viruses is often coordinated with specific processes in the viral life cycle, where the presence of the structure acts as a checkpoint for deciding the eventual fate of the process. These structures have been found to adopt complex conformations and exert their effects by interacting with ribosomes, multiple host translation factors and small RNA molecules like miRNA. A number of such RNA structures have also been shown to regulate translation in viruses at the level of initiation, elongation or termination. The role of various computational studies in the preliminary identification of such sequences and/or structures and subsequent functional analysis has not been fully appreciated. This review aims to summarize the processes in which viral RNA structures have been found to play an active role in translational regulation, their global conformational features and the bioinformatics/computational tools available for the identification and prediction of these structures.
Collapse
Affiliation(s)
- Asmita Gupta
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Manju Bansal
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| |
Collapse
|
10
|
Siew SM, Cunningham SC, Zhu E, Tay SS, Venuti E, Bolitho C, Alexander IE. Prevention of Cholestatic Liver Disease and Reduced Tumorigenicity in a Murine Model of PFIC Type 3 Using Hybrid AAV-piggyBac Gene Therapy. Hepatology 2019; 70:2047-2061. [PMID: 31099022 DOI: 10.1002/hep.30773] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/13/2019] [Accepted: 05/06/2019] [Indexed: 12/14/2022]
Abstract
Recombinant adeno-associated viral (rAAV) vectors are highly promising vehicles for liver-targeted gene transfer, with therapeutic efficacy demonstrated in preclinical models and clinical trials. Progressive familial intrahepatic cholestasis type 3 (PFIC3), an inherited juvenile-onset, cholestatic liver disease caused by homozygous mutation of the ABCB4 gene, may be a promising candidate for rAAV-mediated liver-targeted gene therapy. The Abcb4-/- mice model of PFIC3, with juvenile mice developing progressive cholestatic liver injury due to impaired biliary phosphatidylcholine excretion, resulted in cirrhosis and liver malignancy. Using a conventional rAAV strategy, we observed markedly blunted rAAV transduction in adult Abcb4-/- mice with established liver disease, but not in disease-free, wild-type adults or in homozygous juveniles prior to liver disease onset. However, delivery of predominantly nonintegrating rAAV vectors to juvenile mice results in loss of persistent transgene expression due to hepatocyte proliferation in the growing liver. Conclusion: A hybrid vector system, combining the high transduction efficiency of rAAV with piggyBac transposase-mediated somatic integration, was developed to facilitate stable human ABCB4 expression in vivo and to correct juvenile-onset chronic liver disease in a murine model of PFIC3. A single dose of hybrid vector at birth led to life-long restoration of bile composition, prevention of biliary cirrhosis, and a substantial reduction in tumorigenesis. This powerful hybrid rAAV-piggyBac transposon vector strategy has the capacity to mediate lifelong phenotype correction and reduce the tumorigenicity of progressive familial intrahepatic cholestasis type 3 and, with further refinement, the potential for human clinical translation.
Collapse
Affiliation(s)
- Susan M Siew
- Department of Gastroenterology and James Fairfax Institute of Pediatric Nutrition, Sydney Children's Hospitals Network, Westmead, Australia
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Westmead, Australia
| | - Sharon C Cunningham
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Westmead, Australia
| | - Erhua Zhu
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Westmead, Australia
| | - Szun S Tay
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Westmead, Australia
| | - Elena Venuti
- Department of Gastroenterology and James Fairfax Institute of Pediatric Nutrition, Sydney Children's Hospitals Network, Westmead, Australia
| | - Christine Bolitho
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Westmead, Australia
| | - Ian E Alexander
- Gene Therapy Research Unit, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney and Sydney Children's Hospitals Network, Westmead, Australia
- Discipline of Child and Adolescent Health, Sydney Medical School, Faculty of Medicine and Health, The University of Sydney, Westmead, Australia
| |
Collapse
|
11
|
Marín de Evsikova C, Raplee ID, Lockhart J, Jaimes G, Evsikov AV. The Transcriptomic Toolbox: Resources for Interpreting Large Gene Expression Data within a Precision Medicine Context for Metabolic Disease Atherosclerosis. J Pers Med 2019; 9:E21. [PMID: 31032818 PMCID: PMC6617151 DOI: 10.3390/jpm9020021] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2019] [Revised: 04/20/2019] [Accepted: 04/25/2019] [Indexed: 11/16/2022] Open
Abstract
As one of the most widespread metabolic diseases, atherosclerosis affects nearly everyone as they age; arteries gradually narrow from plaque accumulation over time reducing oxygenated blood flow to central and periphery causing heart disease, stroke, kidney problems, and even pulmonary disease. Personalized medicine promises to bring treatments based on individual genome sequencing that precisely target the molecular pathways underlying atherosclerosis and its symptoms, but to date only a few genotypes have been identified. A promising alternative to this genetic approach is the identification of pathways altered in atherosclerosis by transcriptome analysis of atherosclerotic tissues to target specific aspects of disease. Transcriptomics is a potentially useful tool for both diagnostics and discovery science, exposing novel cellular and molecular mechanisms in clinical and translational models, and depending on experimental design to identify and test novel therapeutics. The cost and time required for transcriptome analysis has been greatly reduced by the development of next generation sequencing. The goal of this resource article is to provide background and a guide to appropriate technologies and downstream analyses in transcriptomics experiments generating ever-increasing amounts of gene expression data.
Collapse
Affiliation(s)
- Caralina Marín de Evsikova
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
- Epigenetics & Functional Genomics Laboratories, Department of Research and Development, Bay Pines Veteran Administration Healthcare System, Bay Pines, FL 33744, USA.
| | - Isaac D Raplee
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
| | - John Lockhart
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
| | - Gilberto Jaimes
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
| | - Alexei V Evsikov
- Epigenetics & Functional Genomics Laboratories, Department of Research and Development, Bay Pines Veteran Administration Healthcare System, Bay Pines, FL 33744, USA.
| |
Collapse
|
12
|
Huang R, Grishagin I, Wang Y, Zhao T, Greene J, Obenauer JC, Ngan D, Nguyen DT, Guha R, Jadhav A, Southall N, Simeonov A, Austin CP. The NCATS BioPlanet - An Integrated Platform for Exploring the Universe of Cellular Signaling Pathways for Toxicology, Systems Biology, and Chemical Genomics. Front Pharmacol 2019; 10:445. [PMID: 31133849 PMCID: PMC6524730 DOI: 10.3389/fphar.2019.00445] [Citation(s) in RCA: 135] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Accepted: 04/08/2019] [Indexed: 12/16/2022] Open
Abstract
Chemical genomics aims to comprehensively define, and ultimately predict, the effects of small molecule compounds on biological systems. Chemical activity profiling approaches must consider chemical effects on all pathways operative in mammalian cells. To enable a strategic and maximally efficient chemical profiling of pathway space, we have created the NCATS BioPlanet, a comprehensive integrated pathway resource that incorporates the universe of 1,658 human pathways sourced from publicly available, manually curated sources, which have been subjected to thorough redundancy and consistency cross-evaluation. BioPlanet supports interactive browsing, retrieval, and analysis of pathways, exploration of pathway connections, and pathway search by gene targets, category, and availability of corresponding bioactivity assay, as well as visualization of pathways on a 3-dimensional globe, in which the distance between any two pathways is proportional to their degree of gene component overlap. Using this resource, we propose a strategy to identify a minimal set of 362 biological assays that can interrogate the universe of human pathways. The NCATS BioPlanet is a public resource, which will be continually expanded and updated, for systems biology, toxicology, and chemical genomics, available at http://tripod.nih.gov/bioplanet/.
Collapse
Affiliation(s)
- Ruili Huang
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | | | - Yuhong Wang
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Tongan Zhao
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Jon Greene
- Rancho BioSciences, San Diego, CA, United States
| | | | - Deborah Ngan
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Dac-Trung Nguyen
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Rajarshi Guha
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Ajit Jadhav
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Noel Southall
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Anton Simeonov
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| | - Christopher P Austin
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD, United States
| |
Collapse
|
13
|
Kalkatawi M, Magana-Mora A, Jankovic B, Bajic VB. DeepGSR: an optimized deep-learning structure for the recognition of genomic signals and regions. Bioinformatics 2019; 35:1125-1132. [PMID: 30184052 PMCID: PMC6449759 DOI: 10.1093/bioinformatics/bty752] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Revised: 07/15/2018] [Accepted: 08/31/2018] [Indexed: 01/05/2023] Open
Abstract
MOTIVATION Recognition of different genomic signals and regions (GSRs) in DNA is crucial for understanding genome organization, gene regulation, and gene function, which in turn generate better genome and gene annotations. Although many methods have been developed to recognize GSRs, their pure computational identification remains challenging. Moreover, various GSRs usually require a specialized set of features for developing robust recognition models. Recently, deep-learning (DL) methods have been shown to generate more accurate prediction models than 'shallow' methods without the need to develop specialized features for the problems in question. Here, we explore the potential use of DL for the recognition of GSRs. RESULTS We developed DeepGSR, an optimized DL architecture for the prediction of different types of GSRs. The performance of the DeepGSR structure is evaluated on the recognition of polyadenylation signals (PAS) and translation initiation sites (TIS) of different organisms: human, mouse, bovine and fruit fly. The results show that DeepGSR outperformed the state-of-the-art methods, reducing the classification error rate of the PAS and TIS prediction in the human genome by up to 29% and 86%, respectively. Moreover, the cross-organisms and genome-wide analyses we performed, confirmed the robustness of DeepGSR and provided new insights into the conservation of examined GSRs across species. AVAILABILITY AND IMPLEMENTATION DeepGSR is implemented in Python using Keras API; it is available as open-source software and can be obtained at https://doi.org/10.5281/zenodo.1117159. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Manal Kalkatawi
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
- Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Arturo Magana-Mora
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
- Drilling Technology Team, EXPEC-ARC, Saudi Aramco, Dhahran, Saudi Arabia
| | - Boris Jankovic
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Vladimir B Bajic
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| |
Collapse
|
14
|
Edwards A, Morgan M, Al Chawaf A, Andrusiak K, Charney R, Cynader Z, ElDessouki A, Lee Y, Moeser A, Stern S, Zuercher WJ. A trust approach for sharing research reagents. Sci Transl Med 2018; 9:9/392/eaai9055. [PMID: 28566431 DOI: 10.1126/scitranslmed.aai9055] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 01/05/2017] [Indexed: 11/02/2022]
Abstract
The core feature of trusts-holding property for the benefit of others-is well suited to constructing a research community that treats reagents as public goods.
Collapse
Affiliation(s)
- Aled Edwards
- Structural Genomics Consortium, University of Toronto, Toronto, Ontario M5G 1L7, Canada.
| | - Max Morgan
- Faculty of Law, University of Toronto, Toronto, Ontario M5S 2C5, Canada.,Grand Challenges Canada, Toronto, Ontario M5G 1L7, Canada
| | - Arij Al Chawaf
- Structural Genomics Consortium, University of Toronto, Toronto, Ontario M5G 1L7, Canada
| | - Kerry Andrusiak
- Faculty of Law, University of Toronto, Toronto, Ontario M5S 2C5, Canada
| | - Rachel Charney
- Faculty of Law, University of Toronto, Toronto, Ontario M5S 2C5, Canada
| | - Zarya Cynader
- Faculty of Law, University of Toronto, Toronto, Ontario M5S 2C5, Canada.,Gilbert's LLP, Toronto-Dominion Centre, Toronto, Ontario M5K 1K2, Canada
| | - Ahmed ElDessouki
- Faculty of Law, University of Toronto, Toronto, Ontario M5S 2C5, Canada
| | - Yunjeong Lee
- Faculty of Law, University of Toronto, Toronto, Ontario M5S 2C5, Canada
| | - Andrew Moeser
- Faculty of Law, University of Toronto, Toronto, Ontario M5S 2C5, Canada.,Gilbert's LLP, Toronto-Dominion Centre, Toronto, Ontario M5K 1K2, Canada
| | - Simon Stern
- Faculty of Law, University of Toronto, Toronto, Ontario M5S 2C5, Canada
| | - William J Zuercher
- Structural Genomics Consortium, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
15
|
Suzuki Y, Ogasawara T, Tanaka Y, Takeda H, Sawasaki T, Mogi M, Liu S, Maeyama K. Functional G-Protein-Coupled Receptor (GPCR) Synthesis: The Pharmacological Analysis of Human Histamine H1 Receptor (HRH1) Synthesized by a Wheat Germ Cell-Free Protein Synthesis System Combined with Asolectin Glycerosomes. Front Pharmacol 2018; 9:38. [PMID: 29467651 PMCID: PMC5808195 DOI: 10.3389/fphar.2018.00038] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Accepted: 01/12/2018] [Indexed: 11/13/2022] Open
Abstract
G-protein-coupled receptors (GPCRs) are membrane proteins distributed on the cell surface, and they may be potential drug targets. However, synthesizing GPCRs in vitro can be challenging. Recently, some cell-free protein synthesis systems have been shown to produce a large amount of membrane protein combined with chemical chaperones that include liposomes and glycerol. Liposomes containing high concentrations of glycerol are known as glycerosomes, which are used in new drug delivery systems. Glycerosomes have greater morphological stability than liposomes. Proteoglycerosomes are defined as glycerosomes that contain membrane proteins. Human histamine H1 receptor (HRH1) is one of the most studied GPCRs. In this study, we synthesized wild-type HRH1 (WT-HRH1) proteoglycerosomes and D107A-HRH1, (in which Asp107 was replaced by Ala) in a wheat germ cell-free protein synthesis system combined with asolectin glycerosomes. The mutant HRH1 has been reported to have low affinity for the H1 antagonist. In this study, the amount of synthesized WT-HRH1 in one synthesis reaction was 434 ± 66.6 μg (7.75 ± 1.19 × 103pmol). The specific binding of [3H]pyrilamine to the WT-HRH1 proteoglycerosomes became saturated as the concentration of the radioligand increased. The dissociation constant (Kd) and maximum density (Bmax) of the synthesized WT-HRH1 were 9.76 ± 1.25 nM and 21.4 ± 0.936 pmol/mg protein, respectively. However, specific binding to D107A-HRH1 was reduced compared with WT-HRH1 and the binding did not become saturated. The findings of this study highlight that HRH1 synthesized using a wheat germ cell-free protein synthesis system combined with glycerosomes has the ability to bind to H1 antagonists.
Collapse
Affiliation(s)
- Yasuyuki Suzuki
- Department of Pharmacology, Ehime University Graduate School of Medicine, Toon, Japan
| | | | - Yuki Tanaka
- Advanced Research Support Center, Division of Analytical Bio-Medicine, Ehime University, Toon, Japan
| | | | | | - Masaki Mogi
- Department of Pharmacology, Ehime University Graduate School of Medicine, Toon, Japan
| | - Shuang Liu
- Department of Pharmacology, Ehime University Graduate School of Medicine, Toon, Japan
| | - Kazutaka Maeyama
- Department of Pharmacology, Ehime University Graduate School of Medicine, Toon, Japan
| |
Collapse
|
16
|
Identification of targets of tumor suppressor microRNA-34a using a reporter library system. Proc Natl Acad Sci U S A 2017; 114:3927-3932. [PMID: 28356515 DOI: 10.1073/pnas.1620019114] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
miRNAs play critical roles in various biological processes by targeting specific mRNAs. Current approaches to identifying miRNA targets are insufficient for elucidation of a miRNA regulatory network. Here, we created a cell-based screening system using a luciferase reporter library composed of 4,891 full-length cDNAs, each of which was integrated into the 3' UTR of a luciferase gene. Using this reporter library system, we conducted a screening for targets of miR-34a, a tumor-suppressor miRNA. We identified both previously characterized and previously uncharacterized targets. miR-34a overexpression in MDA-MB-231 breast cancer cells repressed the expression of these previously unrecognized targets. Among these targets, GFRA3 is crucial for MDA-MB-231 cell growth, and its expression correlated with the overall survival of patients with breast cancer. Furthermore, GFRA3 was found to be directly regulated by miR-34a via its coding region. These data show that this system is useful for elucidating miRNA functions and networks.
Collapse
|
17
|
The ligand binding ability of dopamine D1 receptors synthesized using a wheat germ cell-free protein synthesis system with liposomes. Eur J Pharmacol 2014; 745:117-22. [PMID: 25446930 DOI: 10.1016/j.ejphar.2014.10.011] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Revised: 10/06/2014] [Accepted: 10/08/2014] [Indexed: 01/15/2023]
Abstract
G-protein coupled receptors (GPCRs) share a common seven-transmembrane topology and mediate cellular responses to a variety of extracellular signals. However, structural and functional approaches to GPCRs have often been limited by the difficulty of producing a sufficient amount of receptor protein using conventional expression systems. We synthesized human dopamine D1 receptors using a wheat cell-free protein synthesis system with liposomes and then analyzed their receptor binding ability. We determined the specific binding of [(3)H]SCH23390 to the synthesized receptors generated from a cell-free protein synthesis system or rat striatal membranes. From Scatchard plot analysis, the dissociation constant (Kd) and the maximum density (Bmax) of the synthesized receptors were 6.61±0.06 nM and 1.85±0.05 pmol/mg protein, respectively. The same analysis for rat striatal membrane gave a Kd of 2.67±0.05 nM and Bmax of 0.70±0.10 pmol/mg protein. Using a competition binding assay, Ki values of antagonists, SCH23390, LE300 and SKF83566, for the synthetic receptors were the same as those for rat striatal membranes, but Ki values of agonists, A68930, SKF38393 and dopamine, were 5-17 fold higher than those for rat striatal membranes. These results suggest that the dopamine D1 receptors synthesized in liposomes have a functional binding capacity. The different patterns of binding of antagonists and agonists to the synthetic receptors and rat striatal membranes indicate that G proteins are involved in agonist binding to dopamine D1 receptors. The cell-free protein synthesis method with liposomes will be invaluable for the functional analysis of GPCRs.
Collapse
|
18
|
Eshraghi A, Dixon SD, Tamilselvam B, Kim EJK, Gargi A, Kulik JC, Damoiseaux R, Blanke SR, Bradley KA. Cytolethal distending toxins require components of the ER-associated degradation pathway for host cell entry. PLoS Pathog 2014; 10:e1004295. [PMID: 25078082 PMCID: PMC4117610 DOI: 10.1371/journal.ppat.1004295] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2013] [Accepted: 06/23/2014] [Indexed: 11/18/2022] Open
Abstract
Intracellular acting protein exotoxins produced by bacteria and plants are important molecular determinants that drive numerous human diseases. A subset of these toxins, the cytolethal distending toxins (CDTs), are encoded by several Gram-negative pathogens and have been proposed to enhance virulence by allowing evasion of the immune system. CDTs are trafficked in a retrograde manner from the cell surface through the Golgi apparatus and into the endoplasmic reticulum (ER) before ultimately reaching the host cell nucleus. However, the mechanism by which CDTs exit the ER is not known. Here we show that three central components of the host ER associated degradation (ERAD) machinery, Derlin-2 (Derl2), the E3 ubiquitin-protein ligase Hrd1, and the AAA ATPase p97, are required for intoxication by some CDTs. Complementation of Derl2-deficient cells with Derl2:Derl1 chimeras identified two previously uncharacterized functional domains in Derl2, the N-terminal 88 amino acids and the second ER-luminal loop, as required for intoxication by the CDT encoded by Haemophilus ducreyi (Hd-CDT). In contrast, two motifs required for Derlin-dependent retrotranslocation of ERAD substrates, a conserved WR motif and an SHP box that mediates interaction with the AAA ATPase p97, were found to be dispensable for Hd-CDT intoxication. Interestingly, this previously undescribed mechanism is shared with the plant toxin ricin. These data reveal a requirement for multiple components of the ERAD pathway for CDT intoxication and provide insight into a Derl2-dependent pathway exploited by retrograde trafficking toxins.
Collapse
Affiliation(s)
- Aria Eshraghi
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Shandee D. Dixon
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Batcha Tamilselvam
- Department of Microbiology, Institute for Genomic Biology, University of Illinois, Urbana, Urbana, Illinois, United States of America
| | - Emily Jin-Kyung Kim
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Amandeep Gargi
- Department of Microbiology, Institute for Genomic Biology, University of Illinois, Urbana, Urbana, Illinois, United States of America
| | - Julia C. Kulik
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Robert Damoiseaux
- California NanoSystems Institute, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Steven R. Blanke
- Department of Microbiology, Institute for Genomic Biology, University of Illinois, Urbana, Urbana, Illinois, United States of America
| | - Kenneth A. Bradley
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, Los Angeles, California, United States of America
- California NanoSystems Institute, University of California, Los Angeles, Los Angeles, California, United States of America
- * E-mail:
| |
Collapse
|
19
|
Yu X, Bian X, Throop A, Song L, Moral LD, Park J, Seiler C, Fiacco M, Steel J, Hunter P, Saul J, Wang J, Qiu J, Pipas JM, LaBaer J. Exploration of panviral proteome: high-throughput cloning and functional implications in virus-host interactions. Am J Cancer Res 2014; 4:808-22. [PMID: 24955142 PMCID: PMC4063979 DOI: 10.7150/thno.8255] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2013] [Accepted: 04/27/2014] [Indexed: 12/24/2022] Open
Abstract
Throughout the long history of virus-host co-evolution, viruses have developed delicate strategies to facilitate their invasion and replication of their genome, while silencing the host immune responses through various mechanisms. The systematic characterization of viral protein-host interactions would yield invaluable information in the understanding of viral invasion/evasion, diagnosis and therapeutic treatment of a viral infection, and mechanisms of host biology. With more than 2,000 viral genomes sequenced, only a small percent of them are well investigated. The access of these viral open reading frames (ORFs) in a flexible cloning format would greatly facilitate both in vitro and in vivo virus-host interaction studies. However, the overall progress of viral ORF cloning has been slow. To facilitate viral studies, we are releasing the initiation of our panviral proteome collection of 2,035 ORF clones from 830 viral genes in the Gateway® recombinational cloning system. Here, we demonstrate several uses of our viral collection including highly efficient production of viral proteins using human cell-free expression system in vitro, global identification of host targets for rubella virus using Nucleic Acid Programmable Protein Arrays (NAPPA) containing 10,000 unique human proteins, and detection of host serological responses using micro-fluidic multiplexed immunoassays. The studies presented here begin to elucidate host-viral protein interactions with our systemic utilization of viral ORFs, high-throughput cloning, and proteomic technologies. These valuable plasmid resources will be available to the research community to enable continued viral functional studies.
Collapse
|
20
|
Nuclear receptor NR4A1 promotes breast cancer invasion and metastasis by activating TGF-β signalling. Nat Commun 2014; 5:3388. [PMID: 24584437 DOI: 10.1038/ncomms4388] [Citation(s) in RCA: 129] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2013] [Accepted: 02/05/2014] [Indexed: 01/19/2023] Open
Abstract
In advanced cancers, the TGF-β pathway acts as an oncogenic factor and is considered to be a therapeutic target. Here using a genome-wide cDNA screen, we identify nuclear receptor NR4A1 as a strong activator of TGF-β signalling. NR4A1 promotes TGF-β/SMAD signalling by facilitating AXIN2-RNF12/ARKADIA-induced SMAD7 degradation. NR4A1 interacts with SMAD7 and AXIN2, and potently and directly induces AXIN2 expression. Whereas loss of NR4A1 inhibits TGF-β-induced epithelial-to-mesenchymal transition and metastasis, slight NR4A1 ectopic expression stimulates metastasis in a TGF-β-dependent manner. Importantly, inflammatory cytokines potently induce NR4A1 expression, and potentiate TGF-β-mediated breast cancer cell migration, invasion and metastasis in vitro and in vivo. Notably, NR4A1 expression is elevated in breast cancer patients with high immune infiltration and its expression weakly correlates with phosphorylated SMAD2 levels, and is an indicator of poor prognosis. Our results uncover inflammation-induced NR4A1 as an important determinant for hyperactivation of pro-oncogenic TGF-β signalling in breast cancer.
Collapse
|
21
|
Im H, Snyder M. Preparation of recombinant protein spotted arrays for proteome-wide identification of kinase targets. ACTA ACUST UNITED AC 2013; Chapter 27:Unit 27.4. [PMID: 23546622 DOI: 10.1002/0471140864.ps2704s72] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Protein microarrays allow unique approaches for interrogating global protein interaction networks. Protein arrays can be divided into two categories: antibody arrays and functional protein arrays. Antibody arrays consist of various antibodies and are appropriate for profiling protein abundance and modifications. Functional full-length protein arrays employ full-length proteins with various post-translational modifications. A key advantage of the latter is rapid parallel processing of large number of proteins for studying highly controlled biochemical activities, protein-protein interactions, protein-nucleic acid interactions, and protein-small molecule interactions. This unit presents a protocol for constructing functional yeast protein microarrays for global kinase substrate identification. This approach enables the rapid determination of protein interaction networks in yeast on a proteome-wide level. The same methodology can be readily applied to higher eukaryotic systems with careful consideration of overexpression strategy.
Collapse
Affiliation(s)
- Hogune Im
- Department of Genetics, Stanford University, Stanford, California, USA
| | | |
Collapse
|
22
|
Hemantha HP, Narendra N, Sureshbabu VV. Total chemical synthesis of polypeptides and proteins: chemistry of ligation techniques and beyond. Tetrahedron 2012. [DOI: 10.1016/j.tet.2012.08.059] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
23
|
Gene expression profiling reveals distinct cocaine-responsive genes in human fetal CNS cell types. J Addict Med 2012; 3:218-26. [PMID: 20948987 DOI: 10.1097/adm.0b013e318199d863] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
OBJECTIVES Prenatal exposure to cocaine causes cytoarchitectural alterations in the developing neocortex. Previously, we reported that cocaine inhibits neural progenitor cell proliferation through oxidative endoplasmic reticulum stress and consequent down-regulation of cyclin A, whereas cyclin A expression was increased in astrocytes. In the present study, cell type-specific responses to cocaine were further explored. METHODS Gene expression profiles were examined in five types of cells obtained from the human fetal cerebral cortex at 20 weeks gestation. Cells were treated with 100 µM cocaine in vitro for 24 hr, followed by gene expression analysis using a human neural/stem cell/drug abuse-focused cDNA array, with verification by quantitative real-time RT-PCR. RESULTS Cocaine influenced transcription of distinct categories of genes in a cell type-specific manner. Cocaine down-regulated cytoskeleton-related genes including ezrin, γ2 actin, α3d tubulin and α8 tubulin in neural and/or A2B5+ progenitor cells. In contrast, cocaine modulated immune and cell death-related genes in microglia and astrocytes. In microglia, cocaine up-regulated the immunoregulatory and pro-apoptotic genes IL-1β and BAX. In astrocytes, cocaine down-regulated the immune response gene glucocorticoid receptor and up-regulated the anti-apoptotic genes 14-3-3 ε and HVEM. Therefore, cell types comprising the developing neocortex show differential responses to cocaine. CONCLUSIONS These data suggest that cocaine causes cytoskeletal abnormalities leading to disturbances in neural differentiation and migration in progenitor cells, while altering immune and apoptotic responses in glia. Understanding the mechanisms of cocaine's effects on human CNS cells may help in the development of therapeutic strategies to prevent or ameliorate cocaine-induced impairments in fetal brain development.
Collapse
|
24
|
USP4 is regulated by AKT phosphorylation and directly deubiquitylates TGF-β type I receptor. Nat Cell Biol 2012; 14:717-26. [PMID: 22706160 DOI: 10.1038/ncb2522] [Citation(s) in RCA: 246] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2012] [Accepted: 05/11/2012] [Indexed: 12/13/2022]
Abstract
The stability and membrane localization of the transforming growth factor-β (TGF-β) type I receptor (TβRI) determines the levels of TGF-β signalling. TβRI is targeted for ubiquitylation-mediated degradation by the SMAD7-SMURF2 complex. Here we performed a genome-wide gain-of-function screen and identified ubiquitin-specific protease (USP) 4 as a strong inducer of TGF-β signalling. USP4 was found to directly interact with TβRI and act as a deubiquitylating enzyme, thereby controlling TβRI levels at the plasma membrane. Depletion of USP4 mitigates TGF-β-induced epithelial to mesenchymal transition and metastasis. Importantly, AKT (also known as protein kinase B), which has been associated with poor prognosis in breast cancer, directly associates with and phosphorylates USP4. AKT-mediated phosphorylation relocates nuclear USP4 to the cytoplasm and membrane and is required for maintaining its protein stability. Moreover, AKT-induced breast cancer cell migration was inhibited by USP4 depletion and TβRI kinase inhibition. Our results uncover USP4 as an important determinant for crosstalk between TGF-β and AKT signalling pathways.
Collapse
|
25
|
Bray JE. Target selection for structural genomics based on combining fold recognition and crystallisation prediction methods: application to the human proteome. ACTA ACUST UNITED AC 2012; 13:37-46. [PMID: 22354707 DOI: 10.1007/s10969-012-9130-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2011] [Accepted: 02/07/2012] [Indexed: 11/29/2022]
Abstract
The objective of this study is to automatically identify regions of the human proteome that are suitable for 3D structure determination by X-ray crystallography and to annotate them according to their likelihood to produce diffraction quality crystals. The results provide a powerful tool for structural genomics laboratories who wish to select human proteins based on the statistical likelihood of crystallisation success. Combining fold recognition and crystallisation prediction algorithms enables the efficient calculation of the crystallisability of the entire human proteome. This novel study estimates that there are approximately 40,000 crystallisable regions in the human proteome. Currently, only 15% of these regions (approx. 6,000 sequences) have been solved to at least 95% sequence identity. The remaining unsolved regions have been categorised into 5 crystallisation classes and an integral membrane protein (IMP) class, based on established structure prediction, crystallisation prediction and transmembrane (TM) helix prediction algorithms. Approximately 750 unsolved regions (2% of the proteome) have been identified as having a PDB fold representative (template) and an 'optimal' likelihood of crystallisation. At the other end of the spectrum, more than 10,500 non-IMP regions with a PDB template are classified as 'very difficult' to crystallise (26%) and almost 2,500 regions (6%) were predicted to contain at least 3 TM helices. The 3D-SPECS (3D Structural Proteomics Explorer with Crystallisation Scores) website contains crystallisation predictions for the entire human proteome and can be found at http://www.bioinformaticsplus.org/3dspecs.
Collapse
Affiliation(s)
- James E Bray
- Structural Genomics Consortium, University of Oxford, Old Road Campus Research Building, Roosevelt Drive, Oxford, OX3 7DQ, UK.
| |
Collapse
|
26
|
Tarry M, Skaar K, Heijne GV, Draheim RR, Högbom M. Production of human tetraspanin proteins in Escherichia coli. Protein Expr Purif 2012; 82:373-9. [PMID: 22381464 DOI: 10.1016/j.pep.2012.02.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2011] [Revised: 02/01/2012] [Accepted: 02/08/2012] [Indexed: 12/25/2022]
Abstract
Tetraspanins are found in multicellular eukaryotes and are generally thought to act as scaffolding proteins, localizing multiple proteins to a specific region of the cell membrane. Activities for tetraspanins have been identified in several fundamental processes such as motility, cell adhesion, proliferation and viral entry. Tetraspanins are also key players in cancer development and progression. However, structural and biochemical information on tetraspanins is decidely limited, due in no small part to the difficulties associated with expressing eukaryotic membrane proteins. In this study, we have used GFP fusions of a library of human tetraspanin proteins to identify growth conditions for expression in Escherichia coli. Three tetraspanin-GFP proteins could be produced at high enough levels to allow subsequent purification, paving the way for future structural and biochemical studies.
Collapse
Affiliation(s)
- Michael Tarry
- Stockholm Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, The Arrhenius Laboratories for Natural Sciences, SE-10691 Stockholm, Sweden
| | | | | | | | | |
Collapse
|
27
|
Begley DA, Krupke DM, Neuhauser SB, Richardson JE, Bult CJ, Eppig JT, Sundberg JP. The Mouse Tumor Biology Database (MTB): a central electronic resource for locating and integrating mouse tumor pathology data. Vet Pathol 2011; 49:218-23. [PMID: 21282667 DOI: 10.1177/0300985810395726] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The Mouse Tumor Biology Database (MTB) is designed to provide an electronic data storage, search, and analysis system for information on mouse models of human cancer. The MTB includes data on tumor frequency and latency, strain, germ line, and somatic genetics, pathologic notations, and photomicrographs. The MTB collects data from the primary literature, other public databases, and direct submissions from the scientific community. The MTB is a community resource that provides integrated access to mouse tumor data from different scientific research areas and facilitates integration of molecular, genetic, and pathologic data. Current status of MTB, search capabilities, data types, and future enhancements are described in this article.
Collapse
Affiliation(s)
- D A Begley
- The Jackson Laboratory, 600 Main St, Bar Harbor, ME 04609-1500, USA
| | | | | | | | | | | | | |
Collapse
|
28
|
Abstract
Site-specific modification of glycoproteins has wide application in both biochemical and biophysical studies. This method describes the conjugation of synthetic molecules to the N-terminus of a glycoprotein fragment, viz., human immunoglobulin G subclass 1 fragment crystallizable (IgG1 Fc), by native chemical ligation. The glycosylated IgG1 Fc is expressed in a glycosylation-deficient yeast strain. The N-terminal cysteine is generated by the endogenous yeast protease Kex2 in the yeast secretory pathway. The N-terminal cysteine is then conjugated with a biotin thioester to produce a biotinylated, glycosylated IgG1 Fc using native chemical ligation.
Collapse
Affiliation(s)
| | - Thomas J. Tolbert
- To whom correspondence should be addressed: Department of Chemistry, Indiana University, Bloomington, IN 47405
| |
Collapse
|
29
|
Lu X, Shapiro JA, Ting CT, Li Y, Li C, Xu J, Huang H, Cheng YJ, Greenberg AJ, Li SH, Wu ML, Shen Y, Wu CI. Genome-wide misexpression of X-linked versus autosomal genes associated with hybrid male sterility. Genome Res 2010; 20:1097-102. [PMID: 20511493 DOI: 10.1101/gr.076620.108] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Postmating reproductive isolation is often manifested as hybrid male sterility, for which X-linked genes are overrepresented (the so-called large X effect). In contrast, X-linked genes are significantly under-represented among testis-expressing genes. This seeming contradiction may be germane to the X:autosome imbalance hypothesis on hybrid sterility, in which the X-linked effect is mediated mainly through the misexpression of autosomal genes. In this study, we compared gene expression in fertile and sterile males in the hybrids between two Drosophila species. These hybrid males differ only in a small region of the X chromosome containing the Ods-site homeobox (OdsH) (also known as Odysseus) locus of hybrid sterility. Of genes expressed in the testis, autosomal genes were, indeed, more likely to be misexpressed than X-linked genes under the sterilizing action of OdsH. Since this mechanism of X:autosome interaction is only associated with spermatogenesis, a connection between X:autosome imbalance and the high rate of hybrid male sterility seems plausible.
Collapse
Affiliation(s)
- Xuemei Lu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, People's Republic of China
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Sequencing, analysis, and annotation of expressed sequence tags for Camelus dromedarius. PLoS One 2010; 5:e10720. [PMID: 20502665 PMCID: PMC2873428 DOI: 10.1371/journal.pone.0010720] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2009] [Accepted: 04/29/2010] [Indexed: 11/19/2022] Open
Abstract
Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and ∼40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism.
Collapse
|
31
|
Mangan ME, Williams JM, Kuhn RM, Lathe WC. The UCSC genome browser: what every molecular biologist should know. ACTA ACUST UNITED AC 2009; Chapter 19:Unit19.9. [PMID: 19816931 DOI: 10.1002/0471142727.mb1909s88] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Electronic data resources can enable molecular biologists to query and display many useful features that make benchwork more efficient and drive new discoveries. The UCSC Genome Browser provides a wealth of data and tools that advance one's understanding of genomic context for many species, enable detailed understanding of data, and provide the ability to interrogate regions of interest. Researchers can also supplement the standard display with their own data to query and share with others. Effective use of these resources has become crucial to biological research today, and this unit describes some practical applications of the UCSC Genome Browser.
Collapse
|
32
|
Bell AW, Deutsch EW, Au CE, Kearney RE, Beavis R, Sechi S, Nilsson T, Bergeron JJM. A HUPO test sample study reveals common problems in mass spectrometry-based proteomics. Nat Methods 2009; 6:423-30. [PMID: 19448641 PMCID: PMC2785450 DOI: 10.1038/nmeth.1333] [Citation(s) in RCA: 256] [Impact Index Per Article: 17.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2008] [Accepted: 04/03/2009] [Indexed: 12/11/2022]
Abstract
We carried out a test sample study to try to identify errors leading to irreproducibility, including incompleteness of peptide sampling, in LC-MS-based proteomics. We distributed a test sample consisting of an equimolar mix of 20 highly purified recombinant human proteins, to 27 laboratories for identification. Each protein contained one or more unique tryptic peptides of 1250 Da to also test for ion selection and sampling in the mass spectrometer. Of the 27 labs, initially only 7 labs reported all 20 proteins correctly, and only 1 lab reported all the tryptic peptides of 1250 Da. Nevertheless, a subsequent centralized analysis of the raw data revealed that all 20 proteins and most of the 1250 Da peptides had in fact been detected by all 27 labs. The centralized analysis allowed us to determine sources of problems encountered in the study, which include missed identifications (false negatives), environmental contamination, database matching, and curation of protein identifications. Improved search engines and databases are likely to increase the fidelity of mass spectrometry-based proteomics.
Collapse
Affiliation(s)
- Alexander W Bell
- Department of Anatomy and Cell Biology, McGill University, Montreal, Canada
| | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Evsikov AV, Marín de Evsikova C. Gene expression during the oocyte-to-embryo transition in mammals. Mol Reprod Dev 2009; 76:805-18. [PMID: 19363788 DOI: 10.1002/mrd.21038] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
The seminal question in modern developmental biology is the origins of new life arising from the unification of sperm and egg. The roots of this question begin from 19th to 20th century embryologists studying fertilization and embryogenesis. Although the revolution of molecular biology has yielded significant insight into the complexity of this process, the overall orchestration of genes, molecules, and cells is still not fully formed. Early mammalian development, specifically the oocyte-to-embryo transition, is essentially under "maternal command" from factors deposited in the cytoplasm during oocyte growth, independent of de novo transcription from the nascent embryo. Many of the advances in understanding this developmental period occurred in tandem with application of new methods and techniques from molecular biology, from protein electrophoresis to sequencing and assemblies of whole genomes. From this bed of knowledge, it appears that precise control of mRNA translation is a key regulator coordinating the molecular and cellular events occurring during oocyte-to-embryo transition. Notably, oocyte transcriptomes share, yet retain some uniqueness, common genetic motifs among all chordates. The common genetic motifs typically define fundamental processes critical for cellular maintenance, whereas the unique genetic features may be a source of variation and a substrate for sexual selection, genetic drift, or gene flow. One purpose for this complex interplay among genes, proteins, and cells may allow for evolution to transform and act upon the underlying processes, at molecular, structural and organismal levels, to increase diversity, which is the ultimate goal of sexual reproduction.
Collapse
|
34
|
Temple G, Gerhard DS, Rasooly R, Feingold EA, Good PJ, Robinson C, Mandich A, Derge JG, Lewis J, Shoaf D, Collins FS, Jang W, Wagner L, Shenmen CM, Misquitta L, Schaefer CF, Buetow KH, Bonner TI, Yankie L, Ward M, Phan L, Astashyn A, Brown G, Farrell C, Hart J, Landrum M, Maidak BL, Murphy M, Murphy T, Rajput B, Riddick L, Webb D, Weber J, Wu W, Pruitt KD, Maglott D, Siepel A, Brejova B, Diekhans M, Harte R, Baertsch R, Kent J, Haussler D, Brent M, Langton L, Comstock CLG, Stevens M, Wei C, van Baren MJ, Salehi-Ashtiani K, Murray RR, Ghamsari L, Mello E, Lin C, Pennacchio C, Schreiber K, Shapiro N, Marsh A, Pardes E, Moore T, Lebeau A, Muratet M, Simmons B, Kloske D, Sieja S, Hudson J, Sethupathy P, Brownstein M, Bhat N, Lazar J, Jacob H, Gruber CE, Smith MR, McPherson J, Garcia AM, Gunaratne PH, Wu J, Muzny D, Gibbs RA, Young AC, Bouffard GG, Blakesley RW, Mullikin J, Green ED, Dickson MC, Rodriguez AC, Grimwood J, Schmutz J, Myers RM, Hirst M, Zeng T, Tse K, Moksa M, Deng M, Ma K, Mah D, Pang J, Taylor G, Chuah E, Deng A, Fichter K, Go A, Lee S, Wang J, Griffith M, Morin R, Moore RA, Mayo M, Munro S, Wagner S, Jones SJM, Holt RA, Marra MA, Lu S, Yang S, Hartigan J, Graf M, Wagner R, Letovksy S, Pulido JC, Robison K, Esposito D, Hartley J, Wall VE, Hopkins RF, Ohara O, Wiemann S. The completion of the Mammalian Gene Collection (MGC). Genome Res 2009; 19:2324-33. [PMID: 19767417 DOI: 10.1101/gr.095976.109] [Citation(s) in RCA: 103] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Since its start, the Mammalian Gene Collection (MGC) has sought to provide at least one full-protein-coding sequence cDNA clone for every human and mouse gene with a RefSeq transcript, and at least 6200 rat genes. The MGC cloning effort initially relied on random expressed sequence tag screening of cDNA libraries. Here, we summarize our recent progress using directed RT-PCR cloning and DNA synthesis. The MGC now contains clones with the entire protein-coding sequence for 92% of human and 89% of mouse genes with curated RefSeq (NM-accession) transcripts, and for 97% of human and 96% of mouse genes with curated RefSeq transcripts that have one or more PubMed publications, in addition to clones for more than 6300 rat genes. These high-quality MGC clones and their sequences are accessible without restriction to researchers worldwide.
Collapse
|
35
|
Pinheiro DG, Galante PAF, de Souza SJ, Zago MA, Silva WA. A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling. BMC Bioinformatics 2009; 10:170. [PMID: 19500384 PMCID: PMC2701951 DOI: 10.1186/1471-2105-10-170] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2008] [Accepted: 06/06/2009] [Indexed: 12/01/2022] Open
Abstract
Background High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. Results This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. Conclusion These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at . S3T source code and datasets can also be downloaded from the aforementioned website.
Collapse
Affiliation(s)
- Daniel G Pinheiro
- Departamento de Genética, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, SP, Brazil.
| | | | | | | | | |
Collapse
|
36
|
Zhang Z, Xin D, Wang P, Zhou L, Hu L, Kong X, Hurst LD. Noisy splicing, more than expression regulation, explains why some exons are subject to nonsense-mediated mRNA decay. BMC Biol 2009; 7:23. [PMID: 19442261 PMCID: PMC2697156 DOI: 10.1186/1741-7007-7-23] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2009] [Accepted: 05/14/2009] [Indexed: 01/23/2023] Open
Abstract
Background Nonsense-mediated decay is a mechanism that degrades mRNAs with a premature termination codon. That some exons have premature termination codons at fixation is paradoxical: why make a transcript if it is only to be destroyed? One model supposes that splicing is inherently noisy and spurious transcripts are common. The evolution of a premature termination codon in a regularly made unwanted transcript can be a means to prevent costly translation. Alternatively, nonsense-mediated decay can be regulated under certain conditions so the presence of a premature termination codon can be a means to up-regulate transcripts needed when nonsense-mediated decay is suppressed. Results To resolve this issue we examined the properties of putative nonsense-mediated decay targets in humans and mice. We started with a well-annotated set of protein coding genes and found that 2 to 4% of genes are probably subject to nonsense-mediated decay, and that the premature termination codon reflects neither rare mutations nor sequencing artefacts. Several lines of evidence suggested that the noisy splicing model has considerable relevance: 1) exons that are uniquely found in nonsense-mediated decay transcripts (nonsense-mediated decay-specific exons) tend to be newly created; 2) have low-inclusion level; 3) tend not to be a multiple of three long; 4) belong to genes with multiple splice isoforms more often than expected; and 5) these genes are not obviously enriched for any functional class nor conserved as nonsense-mediated decay candidates in other species. However, nonsense-mediated decay-specific exons for which distant orthologous exons can be found tend to have been under purifying selection, consistent with the regulation model. Conclusion We conclude that for recently evolved exons the noisy splicing model is the better explanation of their properties, while for ancient exons the nonsense-mediated decay regulated gene expression is a viable explanation.
Collapse
Affiliation(s)
- Zhenguo Zhang
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS) & Shanghai Jiao Tong University School of Medicine (SJTUSM), Shanghai, PR China.
| | | | | | | | | | | | | |
Collapse
|
37
|
Jain S, Zhang X, Khandelwal PJ, Saunders AJ, Cummings BS, Oelkers P. Characterization of human lysophospholipid acyltransferase 3. J Lipid Res 2009; 50:1563-70. [PMID: 19351971 DOI: 10.1194/jlr.m800398-jlr200] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Esterifying lysophospholipids may serve a variety of functions, including phospholipid remodeling and limiting the abundance of bioactive lipids. Recently, a yeast enzyme, Lpt1p, that esterifies an array of lysophospholipids was identified. Described here is the characterization of a human homolog of LPT1 that we have called lysophosphatidylcholine acyltransferase 3 (LPCAT3). Expression of LPCAT3 in Sf9 insect cells conferred robust esterification of lysophosphatidylcholine in vitro. Kinetic analysis found apparent cooperativity with a saturated acyl-CoA having the lowest K0.5 (5 microM), a monounsaturated acyl-CoA having the highest apparent Vmax (759 nmol/min/mg), and two polyunsaturated acyl-CoAs showing intermediate values. Lysophosphatidylethanolamine and lysophosphatidylserine were also utilized as substrates. Electrospray ionization mass spectrometric analysis of phospholipids in Sf9 cells expressing LPCAT3 showed a relative increase in phosphatidylcholine containing saturated acyl chains and a decrease in phosphatidylcholine containing unsaturated acyl chains. Targeted reduction of LPCAT3 expression in HEK293 cells had essentially an opposite effect, resulting in decreased abundance of saturated phospholipid species and more unsaturated species. Reduced LPCAT3 expression resulted in more apoptosis and distinctly fewer lamellipodia, suggesting a necessary role for lysophospholipid esterification in maintaining cellular function and structure.
Collapse
Affiliation(s)
- Shilpa Jain
- Department of Biology, Drexel University, Philadelphia, PA 19104, USA
| | | | | | | | | | | |
Collapse
|
38
|
Freeman RM, Wu M, Cordonnier-Pratt MM, Pratt LH, Gruber CE, Smith M, Lander ES, Stange-Thomann N, Lowe CJ, Gerhart J, Kirschner M. cDNA sequences for transcription factors and signaling proteins of the hemichordate Saccoglossus kowalevskii: efficacy of the expressed sequence tag (EST) approach for evolutionary and developmental studies of a new organism. THE BIOLOGICAL BULLETIN 2008; 214:284-302. [PMID: 18574105 DOI: 10.2307/25470670] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
We describe a collection of expressed sequence tags (ESTs) for Saccoglossus kowalevskii, a direct-developing hemichordate valuable for evolutionary comparisons with chordates. The 202,175 ESTs represent 163,633 arrayed clones carrying cDNAs prepared from embryonic libraries, and they assemble into 13,677 continuous sequences (contigs), leaving 10,896 singletons (excluding mitochondrial sequences). Of the contigs, 53% had significant matches when BLAST was used to query the NCBI databases (< or = 10(-10)), as did 51% of the singletons. Contigs most frequently matched sequences from amphioxus (29%), chordates (67%), and deuterostomes (87%). From the clone array, we isolated 400 full-length sequences for transcription factors and signaling proteins of use for evolutionary and developmental studies. The set includes sequences for fox, pax, tbx, hox, and other homeobox-containing factors, and for ligands and receptors of the TGFbeta, Wnt, Hh, Delta/Notch, and RTK pathways. At least 80% of key sequences have been obtained, when judged against gene lists of model organisms. The median length of these cDNAs is 2.3 kb, including 1.05 kb of 3' untranslated region (UTR). Only 30% are entirely matched by single contigs assembled from ESTs. We conclude that an EST collection based on 150,000 clones is a rich source of sequences for molecular developmental work, and that the EST approach is an efficient way to initiate comparative studies of a new organism.
Collapse
Affiliation(s)
- R M Freeman
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Abstract
The prediction of the binding free energy between a ligand and a protein is an important component in the virtual screening and lead optimization of ligands for drug discovery. To determine the quality of current binding free energy estimation programs, we examined FlexX, X-Score, AutoDock, and BLEEP for their performance in binding free energy prediction in various situations including cocrystallized complex structures, cross docking of ligands to their non-cocrystallized receptors, docking of thermally unfolded receptor decoys to their ligands, and complex structures with "randomized" ligand decoys. In no case was there a satisfactory correlation between the experimental and estimated binding free energies over all the datasets tested. Meanwhile, a strong correlation between ligand molecular weight-binding affinity correlation and experimental predicted binding affinity correlation was found. Sometimes the programs also correctly ranked ligands' binding affinities even though native interactions between the ligands and their receptors were essentially lost because of receptor deformation or ligand randomization, and the programs could not decisively discriminate randomized ligand decoys from their native ligands; this suggested that the tested programs miss important components for the accurate capture of specific ligand binding interactions.
Collapse
Affiliation(s)
- Ryangguk Kim
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 250 14th Street, Atlanta, Georgia 30318, USA
| | | |
Collapse
|
40
|
Hamilton BS, Brede Y, Tolbert TJ. Expression and characterization of human glycosylated interleukin-1 receptor antagonist in Pichia pastoris. Protein Expr Purif 2008; 59:64-8. [DOI: 10.1016/j.pep.2008.01.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2007] [Revised: 01/07/2008] [Accepted: 01/10/2008] [Indexed: 01/14/2023]
|
41
|
Construction and characterization of a normalized yeast two-hybrid library derived from a human protein-coding clone collection. Biotechniques 2008; 44:265-73. [PMID: 18330356 DOI: 10.2144/000112674] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The nuclear yeast two-hybrid (Y2H) system is the most widely used technology for detecting interactions between proteins. A common approach is to screen specific test proteins (baits) against large compilations of randomly cloned proteins (prey libraries). For eukaryotic organisms, libraries have traditionally been generated using messenger RNA (mRNA) extracted from various tissues and cells. Here we present a library construction strategy made possible by ongoing public efforts to establish collections of full-length protein encoding clones. Our approach generates libraries that are essentially normalized and contain both randomly fragmented as well as full-length inserts. We refer to this type of protein-coding clone-derived library as random and full-length (RAFL) Y2H library. The library described here is based on clones from the Mammalian Gene Collection, but our strategy is compatible with the use of any protein-coding clone collection from any organism in any vector and does not require inserts to be devoid of untranslated regions. We tested our prototype human RAFL library against a set of baits that had previously been searched against multiple cDNA libraries. These Y2H searches yielded a combination of novel as well as expected interactions, indicating that the RAFL library constitutes a valuable complement to Y2H cDNA libraries.
Collapse
|
42
|
Desir GV. Renalase deficiency in chronic kidney disease, and its contribution to hypertension and cardiovascular disease. Curr Opin Nephrol Hypertens 2008; 17:181-5. [DOI: 10.1097/mnh.0b013e3282f521ba] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
43
|
Fullwood MJ, Tan JJS, Ng PWP, Chiu KP, Liu J, Wei CL, Ruan Y. The use of multiple displacement amplification to amplify complex DNA libraries. Nucleic Acids Res 2008; 36:e32. [PMID: 18285362 PMCID: PMC2275127 DOI: 10.1093/nar/gkn074] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2007] [Revised: 01/17/2008] [Accepted: 02/05/2008] [Indexed: 11/29/2022] Open
Abstract
Complex libraries for genomic DNA and cDNA sequencing analyses are typically amplified using bacterial propagation. To reduce biases, large numbers of colonies are plated and scraped from solid-surface agar. This process is time consuming, tedious and limits scaling up. At the same time, multiple displacement amplification (MDA) has been recently developed as a method for in vitro amplification of DNA. However, MDA has no selection function for the removal of ligation multimers. We developed a novel method of briefly introducing ligation reactions into bacteria to select single insert DNA clones followed by MDA to amplify. We applied these methods to a Gene Identification Signatures with Paired-End diTags (GIS-PET) library, which is a complex transcriptome library created by pairing short tags from the 5' and 3' ends of cDNA fragments together, and demonstrated that this selection and amplification strategy is unbiased and efficient.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Yijun Ruan
- Genome Institute of Singapore, Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome #02-01, Singapore 138672
| |
Collapse
|
44
|
Hirst M, Delaney A, Rogers SA, Schnerch A, Persaud DR, O'Connor MD, Zeng T, Moksa M, Fichter K, Mah D, Go A, Morin RD, Baross A, Zhao Y, Khattra J, Prabhu AL, Pandoh P, McDonald H, Asano J, Dhalla N, Ma K, Lee S, Ally A, Chahal N, Menzies S, Siddiqui A, Holt R, Jones S, Gerhard DS, Thomson JA, Eaves CJ, Marra MA. LongSAGE profiling of nine human embryonic stem cell lines. Genome Biol 2008; 8:R113. [PMID: 17570852 PMCID: PMC2394759 DOI: 10.1186/gb-2007-8-6-r113] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2006] [Revised: 04/23/2007] [Accepted: 06/14/2007] [Indexed: 12/20/2022] Open
Abstract
To facilitate discovery of novel human embryonic stem cell (ESC) transcripts, we generated 2.5 million LongSAGE tags from 9 human ESC lines. Analysis of this data revealed that ESCs express proportionately more RNA binding proteins compared with terminally differentiated cells, and identified novel ESC transcripts, at least one of which may represent a marker of the pluripotent state.
Collapse
Affiliation(s)
- Martin Hirst
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Allen Delaney
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Sean A Rogers
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Angelique Schnerch
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Deryck R Persaud
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Michael D O'Connor
- Terry Fox Laboratory, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Thomas Zeng
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Michelle Moksa
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Keith Fichter
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Diana Mah
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Anne Go
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Ryan D Morin
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Agnes Baross
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Yongjun Zhao
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Jaswinder Khattra
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Anna-Liisa Prabhu
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Pawan Pandoh
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Helen McDonald
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Jennifer Asano
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Noreen Dhalla
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Kevin Ma
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Stephanie Lee
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Adrian Ally
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Neil Chahal
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Stephanie Menzies
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Asim Siddiqui
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Robert Holt
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Steven Jones
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Daniela S Gerhard
- National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - James A Thomson
- Wisconsin National Primate Research Centre and Department of Anatomy, School of Medicine, University of Wisconsin, Madison, Wisconsin 53715, USA
| | - Connie J Eaves
- Terry Fox Laboratory, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| | - Marco A Marra
- Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada, V5Z 1L3
| |
Collapse
|
45
|
Faria-Campos AC, Moratelli FS, Mendes IK, Ortolani PL, Oliveira GC, Campos SVA, Ortega JM, Franco GR. Production of full-length cDNA sequences by sequencing and analysis of expressed sequence tags from Schistosoma mansoni. Mem Inst Oswaldo Cruz 2008; 101 Suppl 1:161-5. [PMID: 17308765 DOI: 10.1590/s0074-02762006000900026] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2006] [Accepted: 06/26/2006] [Indexed: 11/21/2022] Open
Abstract
The number of sequences generated by genome projects has increased exponentially, but gene characterization has not followed at the same rate. Sequencing and analysis of full-length cDNAs is an important step in gene characterization that has been used nowadays by several research groups. In this work, we have selected Schistosoma mansoni clones for full-length sequencing, using an algorithm that investigates the presence of the initial methionine in the parasite sequence based on the positions of alignment start between two sequences. BLAST searches to produce such alignments have been performed using parasite expressed sequence tags produced by Minas Gerais Genome Network against sequences from the database Eukaryotic Cluster of Orthologous Groups (KOG). This procedure has allowed the selection of clones representing 398 proteins which have not been deposited as S. mansoni complete CDS in any public database. Dedicated sequencing of 96 of such clones with reads from both 5' and 3' ends has been performed. These reads have been assembled using PHRAP, resulting in the production of 33 full-length sequences that represent novel S. mansoni proteins. These results shall contribute to construct a more complete view of the biology of this important parasite.
Collapse
Affiliation(s)
- Alessandra C Faria-Campos
- Departamento de Bioquímica e Imunologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Av. Antônio Carlos 6627, 31270-901 Belo Horzonte, MG, Brazil
| | | | | | | | | | | | | | | |
Collapse
|
46
|
Abstract
The rat genome project and the resources that it has generated are transforming the translation of rat biology to human medicine. The rat genome was sequenced to a high quality “draft,” the structure and location of the genes were predicted, and a global assessment was published (Gibbs RA et al., Nature 428: 493–521, 2004). Since that time, researchers have made use of the genome sequence and annotations and related resources. We take this opportunity to review the currently available rat genome resources and to discuss the progress and future plans for the rat genome.
Collapse
Affiliation(s)
- K. C. Worley
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
| | - G. M. Weinstock
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
| | - R. A. Gibbs
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas
| |
Collapse
|
47
|
Abstract
The sequencing of the human genome and ensuing wave of data generation have brought new light upon the extent and importance of alternative splicing as an RNA regulatory mechanism. Alternative splicing could potentially explain the complexity of protein repertoire during evolution, and defects in the splicing mechanism are responsible for diseases as complex as cancer. Among the challenges that rise in light of these discoveries are cataloguing splice variation in the human and other eukaryotic genomes, and identifying and characterizing the splicing regulatory elements that control their expression. Bioinformatics efforts tackling these two questions are just at the beginning. This article is a survey of these methods.
Collapse
Affiliation(s)
- Liliana Florea
- Department of Computer Science, George Washington University, Academic Center-Rm 714, Washington DC 20052, USA.
| |
Collapse
|
48
|
Sobrado P, Goren MA, James D, Amundson CK, Fox BG. A Protein Structure Initiative approach to expression, purification, and in situ delivery of human cytochrome b5 to membrane vesicles. Protein Expr Purif 2007; 58:229-41. [PMID: 18226920 DOI: 10.1016/j.pep.2007.11.018] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2007] [Revised: 11/26/2007] [Accepted: 11/30/2007] [Indexed: 11/18/2022]
Abstract
A specialized vector backbone from the Protein Structure Initiative was used to express full-length human cytochrome b5 as a C-terminal fusion to His8-maltose binding protein in Escherichia coli. The fusion protein could be completely cleaved by tobacco etch virus protease, and a yield of approximately 18 mg of purified full-length human cytochrome b5 per liter of culture medium was obtained (2.3mg per g of wet weight bacterial cells). In situ proteolysis of the fusion protein in the presence of chemically defined synthetic liposomes allowed facile spontaneous delivery of the functional peripheral membrane protein into a defined membrane environment without prior exposure to detergents or other lipids. The utility of this approach as a delivery method for production and incorporation of monotopic (peripheral) membrane proteins is discussed.
Collapse
Affiliation(s)
- Pablo Sobrado
- Department of Biochemistry, College of Agricultural and Life Sciences, University of Wisconsin, Room 141B, 433 Babcock Drive, Madison, WI 53706, USA
| | | | | | | | | |
Collapse
|
49
|
Siepel A, Diekhans M, Brejová B, Langton L, Stevens M, Comstock CLG, Davis C, Ewing B, Oommen S, Lau C, Yu HC, Li J, Roe BA, Green P, Gerhard DS, Temple G, Haussler D, Brent MR. Targeted discovery of novel human exons by comparative genomics. Genes Dev 2007; 17:1763-73. [PMID: 17989246 PMCID: PMC2099585 DOI: 10.1101/gr.7128207] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2007] [Accepted: 10/15/2007] [Indexed: 01/20/2023]
Abstract
A complete and accurate set of human protein-coding gene annotations is perhaps the single most important resource for genomic research after the human-genome sequence itself, yet the major gene catalogs remain incomplete and imperfect. Here we describe a genome-wide effort, carried out as part of the Mammalian Gene Collection (MGC) project, to identify human genes not yet in the gene catalogs. Our approach was to produce gene predictions by algorithms that rely on comparative sequence data but do not require direct cDNA evidence, then to test predicted novel genes by RT-PCR. We have identified 734 novel gene fragments (NGFs) containing 2188 exons with, at most, weak prior cDNA support. These NGFs correspond to an estimated 563 distinct genes, of which >160 are completely absent from the major gene catalogs, while hundreds of others represent significant extensions of known genes. The NGFs appear to be predominantly protein-coding genes rather than noncoding RNAs, unlike novel transcribed sequences identified by technologies such as tiling arrays and CAGE. They tend to be expressed at low levels and in a tissue-specific manner, and they are enriched for roles in motor activity, cell adhesion, connective tissue, and central nervous system development. Our results demonstrate that many important genes and gene fragments have been missed by traditional approaches to gene discovery but can be identified by their evolutionary signatures using comparative sequence data. However, they suggest that hundreds-not thousands-of protein-coding genes are completely missing from the current gene catalogs.
Collapse
Affiliation(s)
- Adam Siepel
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Aturaliya RN, Fink JL, Davis MJ, Teasdale MS, Hanson KA, Miranda KC, Forrest ARR, Grimmond SM, Suzuki H, Kanamori M, Kai C, Kawai J, Carninci P, Hayashizaki Y, Teasdale RD. Subcellular localization of mammalian type II membrane proteins. Traffic 2007; 7:613-25. [PMID: 16643283 DOI: 10.1111/j.1600-0854.2006.00407.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Application of a computational membrane organization prediction pipeline, MemO, identified putative type II membrane proteins as proteins predicted to encode a single alpha-helical transmembrane domain (TMD) and no signal peptides. MemO was applied to RIKEN's mouse isoform protein set to identify 1436 non-overlapping genomic regions or transcriptional units (TUs), which encode exclusively type II membrane proteins. Proteins with overlapping predicted InterPro and TMDs were reviewed to discard false positive predictions resulting in a dataset comprised of 1831 transcripts in 1408 TUs. This dataset was used to develop a systematic protocol to document subcellular localization of type II membrane proteins. This approach combines mining of published literature to identify subcellular localization data and a high-throughput, polymerase chain reaction (PCR)-based approach to experimentally characterize subcellular localization. These approaches have provided localization data for 244 and 169 proteins. Type II membrane proteins are localized to all major organelle compartments; however, some biases were observed towards the early secretory pathway and punctate structures. Collectively, this study reports the subcellular localization of 26% of the defined dataset. All reported localization data are presented in the LOCATE database (http://www.locate.imb.uq.edu.au).
Collapse
Affiliation(s)
- Rajith N Aturaliya
- Institute for Molecular Bioscience and ARC Centre in Bioinformatics, University of Queensland, St. Lucia, Queensland 4072, Australia
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|