1
|
Veluri R, Pollin G, Wagenknecht JB, Urrutia R, Zimmermann MT. An Integrative Multitiered Computational Analysis for Better Understanding the Structure and Function of 85 Miniproteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.31.635936. [PMID: 39974886 PMCID: PMC11838408 DOI: 10.1101/2025.01.31.635936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Background Miniproteins, defined as polypeptides containing fewer than 50 amino acids, have recently elicited significant interest due to an emerging understanding of their diverse roles in fundamental biological processes. In addition, miniprotein dysregulation underlies human diseases and is a significant focus for biotechnology and drug development. Notably, the human genome project revealed the existence of many novel miniproteins, most of which remain uncharacterized. This study reports an approach for analyzing and scoring previously uncharacterized miniproteins by integrating knowledge from classic sequence-based bioinformatics, computational biophysics, and system biology annotations. Our results demonstrate that these approaches provide novel information on the structure-function relationship of these molecules with a particular focus on their biomedical relevance. Methods We identified 85 human miniproteins using a simple multi-tier approach. First, we performed a sequence-based analysis of these proteins using several algorithms to identify regions of structural and functional importance. Protein-protein interactions and gene ontology annotations were used to analyze miniprotein function. Then, we predicted miniprotein three-dimensional structures using AI-based methods and peptide modeling to determine their relative yields for these understudied polymers. Subsequently, we used several computational biophysics methods and structure-based calculations to annotate and evaluate results from both algorithms. Results We find several relations between predicted structure and functional properties to assign these proteins to several groups with similar properties. Sequence-based analysis leads us to identify motifs and residues that link structure-to-function for most of these proteins. We suggest novel miniprotein functions, such as thymosin beta proteins regulating the shelterin complex through TERF1 and POT1 interactions, FAM86JP and FAM66E participating in endocytic processes, and BAGE1 influencing chromatin remodeling through interaction with nuclear proteins. Further, known functions of miniproteins, such as STRIT1, STMP1, and SLN, were supported. Finally, structure-based scoring led us to build 3D models that provided complementary information to ontologies. We identify that structural propensity is not strictly dependent on polymer length. In fact, in this dataset, peptide-based algorithms may have advantages over AI-based algorithms for certain groups of miniproteins. Conclusion This analytic approach and resulting identification and annotation of miniproteins adds much to what is currently known about miniproteins. Our determination of novel properties of miniproteins bears significant mechanistic and biomedical relevance. We propose novel functions of miniproteins, which expands our understanding of their potential roles in cellular processes. And, we practically identify which sequence and structure-based tools provide the most information, aiding future studies of miniproteins.
Collapse
Affiliation(s)
- Reethika Veluri
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- School of Medicine, Saint Louis University, Saint Louis, Missouri, USA
| | - Gareth Pollin
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Jessica B. Wagenknecht
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Raul Urrutia
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Michael T. Zimmermann
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Data Science Institute, Medical College of Wisconsin, Milwaukee, WI, USA
| |
Collapse
|
2
|
Hiraiwa S, Takeshita S, Terano T, Hayashi R, Suzuki K, Tajiri R, Kojima T. Unveiling the cell dynamics during the final shape formation of the tarsus in Drosophila adult leg by live imaging. Dev Genes Evol 2024; 234:117-133. [PMID: 38977431 PMCID: PMC11611951 DOI: 10.1007/s00427-024-00719-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 07/01/2024] [Indexed: 07/10/2024]
Abstract
Organisms display a remarkable diversity in their shapes. Although substantial progress has been made in unraveling the mechanisms that govern cell fate determination during development, the mechanisms by which fate-determined cells give rise to the final shapes of organisms remain largely unknown. This study describes in detail the process of the final shape formation of the tarsus, which is near the distal tip of the adult leg, during the pupal stage in Drosophila melanogaster. Days-long live imaging revealed unexpectedly complicated cellular dynamics. The epithelial cells transiently form the intriguing structure, which we named the Parthenon-like structure. The basal surface of the epithelial cells and localization of the basement membrane protein initially show a mesh-like structure and rapidly shrink into the membranous structure during the formation and disappearance of the Parthenon-like structure. Furthermore, macrophage-like cells are observed moving around actively in the Parthenon-like structure and engulfing epithelial cells. The findings in this research are expected to significantly contribute to our understanding of the mechanisms involved in shaping the final structure of the adult tarsus.
Collapse
Affiliation(s)
- Shotaro Hiraiwa
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Biosciences Building 501, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8562, Japan
| | - Shumpei Takeshita
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Biosciences Building 501, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8562, Japan
| | - Tensho Terano
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Biosciences Building 501, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8562, Japan
| | - Ryuhei Hayashi
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Biosciences Building 501, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8562, Japan
| | - Koyo Suzuki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Biosciences Building 501, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8562, Japan
| | - Reiko Tajiri
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Biosciences Building 501, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8562, Japan
- Present address: Laboratory for Extracellular Morphogenesis, Graduate School of Science, Chiba University, 1-33 Yayoi-cho, Inage-ku, Chiba-shi, Chiba, 263-8522, Japan
| | - Tetsuya Kojima
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Biosciences Building 501, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8562, Japan.
| |
Collapse
|
3
|
Mapalo MA, Game M, Smith FW, Ortega-Hernández J. Expression of distal limb patterning genes in Hypsibius exemplaris indicate regionalization and suggest distal identity of tardigrade legs. EvoDevo 2024; 15:15. [PMID: 39538290 PMCID: PMC11562647 DOI: 10.1186/s13227-024-00235-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Accepted: 10/21/2024] [Indexed: 11/16/2024] Open
Abstract
BACKGROUND Panarthropods, a major group of invertebrate animals comprised of arthropods, onychophorans, and tardigrades, are the only limb-bearing members of Ecdysozoa. The complexity and versatility of panarthropod paired limbs has prompted great interest in their development to better understand the formation of these structures and the genes involved in this process. However, studies of limb patterning and development are overwhelmingly focused on arthropods, followed by select work on onychophorans but almost entirely lacking for tardigrades. This model organism bias is inherently limited and precludes a comparative analysis of how panarthropod legs originated, have evolved, and the likely limb patterning genes present in the earliest panarthropod ancestors. In this study, we investigated tardigrade homologs of seven arthropod distal limb patterning genes (apterous, aristaless, BarH1, clawless, Lim1, rotund, and spineless) to better characterize tardigrade limb development in a comparative context. RESULTS We detected homologs of all seven genes in the eutardigrade Hypsibius exemplaris and heterotardigrade Echiniscoides cf. sigismundi suggesting their conservation in both tardigrade lineages. Hybridization chain reaction experiments in H. exemplaris reveal a regionalized expression pattern for the genes aristaless, BarH1, clawless, rotund and spineless. CONCLUSION The observed regionalized expression of the distal limb patterning genes in H. exemplaris might reflect the external morphological features of tardigrade legs, such as the distal claws, sensory organs in the proximal region, and specific muscle attachment sites. The comparison between the expression of these limb patterning genes in H. exemplaris relative to other panarthropods suggests their conserved role in the last common panarthropod ancestor, such as establishing the distal limb end and the distribution of sensory structures. Our results support the hypothesis that tardigrade legs are homologous to the distal region of other panarthropod limbs, as suggested by previous work on the expression of leg gap genes in H. exemplaris.
Collapse
Affiliation(s)
- Marc A Mapalo
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Mandy Game
- Department of Ecology & Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Frank W Smith
- Biology Department, University of North Florida, Jacksonville, FL, USA.
| | - Javier Ortega-Hernández
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
4
|
Zhao L, Svetec N, Begun DJ. De Novo Genes. Annu Rev Genet 2024; 58:211-232. [PMID: 39088850 PMCID: PMC12051474 DOI: 10.1146/annurev-genet-111523-102413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/03/2024]
Abstract
Although the majority of annotated new genes in a given genome appear to have arisen from duplication-related mechanisms, recent studies have shown that genes can also originate de novo from ancestrally nongenic sequences. Investigating de novo-originated genes offers rich opportunities to understand the origin and functions of new genes, their regulatory mechanisms, and the associated evolutionary processes. Such studies have uncovered unexpected and intriguing facets of gene origination, offering novel perspectives on the complexity of the genome and gene evolution. In this review, we provide an overview of the research progress in this field, highlight recent advancements, identify key technical and conceptual challenges, and underscore critical questions that remain to be addressed.
Collapse
Affiliation(s)
- Li Zhao
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, USA; ,
| | - Nicolas Svetec
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, USA; ,
| | - David J Begun
- Department of Evolution and Ecology, University of California, Davis, California, USA;
| |
Collapse
|
5
|
Chanut-Delalande H, Zanet J. Small ORFs, Big Insights: Drosophila as a Model to Unraveling Microprotein Functions. Cells 2024; 13:1645. [PMID: 39404408 PMCID: PMC11475943 DOI: 10.3390/cells13191645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2024] [Revised: 09/27/2024] [Accepted: 10/02/2024] [Indexed: 10/19/2024] Open
Abstract
Recently developed experimental and computational approaches to identify putative coding small ORFs (smORFs) in genomes have revealed thousands of smORFs localized within coding and non-coding RNAs. They can be translated into smORF peptides or microproteins, which are defined as less than 100 amino acids in length. The identification of such a large number of potential biological regulators represents a major challenge, notably for elucidating the in vivo functions of these microproteins. Since the emergence of this field, Drosophila has proved to be a valuable model for studying the biological functions of microproteins in vivo. In this review, we outline how the smORF field emerged and the nomenclature used in this domain. We summarize the technical challenges associated with identifying putative coding smORFs in the genome and the relevant translated microproteins. Finally, recent findings on one of the best studied smORF peptides, Pri, and other microproteins studied so far in Drosophila are described. These studies highlight the diverse roles that microproteins can fulfil in the regulation of various molecular targets involved in distinct cellular processes during animal development and physiology. Given the recent emergence of the microprotein field and the associated discoveries, the microproteome represents an exquisite source of potentially bioactive molecules, whose in vivo biological functions can be explored in the Drosophila model.
Collapse
Affiliation(s)
| | - Jennifer Zanet
- Unité de Biologie Moléculaire, Cellulaire et du Développement (MCD), UMR 5077, Centre de Biologie Intégrative (CBI), CNRS, UPS, Université de Toulouse, 31062 Toulouse, France;
| |
Collapse
|
6
|
Whited AM, Jungreis I, Allen J, Cleveland CL, Mudge JM, Kellis M, Rinn JL, Hough LE. Biophysical characterization of high-confidence, small human proteins. BIOPHYSICAL REPORTS 2024; 4:100167. [PMID: 38909903 PMCID: PMC11305224 DOI: 10.1016/j.bpr.2024.100167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 04/09/2024] [Accepted: 06/20/2024] [Indexed: 06/25/2024]
Abstract
Significant efforts have been made to characterize the biophysical properties of proteins. Small proteins have received less attention because their annotation has historically been less reliable. However, recent improvements in sequencing, proteomics, and bioinformatics techniques have led to the high-confidence annotation of small open reading frames (smORFs) that encode for functional proteins, producing smORF-encoded proteins (SEPs). SEPs have been found to perform critical functions in several species, including humans. While significant efforts have been made to annotate SEPs, less attention has been given to the biophysical properties of these proteins. We characterized the distributions of predicted and curated biophysical properties, including sequence composition, structure, localization, function, and disease association of a conservative list of previously identified human SEPs. We found significant differences between SEPs and both larger proteins and control sets. In addition, we provide an example of how our characterization of biophysical properties can contribute to distinguishing protein-coding smORFs from noncoding ones in otherwise ambiguous cases.
Collapse
Affiliation(s)
- A M Whited
- BioFrontiers Institute, University of Colorado, Boulder, Colorado
| | - Irwin Jungreis
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts; MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts
| | - Jeffre Allen
- BioFrontiers Institute, University of Colorado, Boulder, Colorado; Department of Biochemistry, University of Colorado Boulder, Boulder, Colorado
| | | | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Manolis Kellis
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts; MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts
| | - John L Rinn
- BioFrontiers Institute, University of Colorado, Boulder, Colorado; Department of Biochemistry, University of Colorado Boulder, Boulder, Colorado
| | - Loren E Hough
- BioFrontiers Institute, University of Colorado, Boulder, Colorado; Department of Physics, University of Colorado Boulder, Boulder, Colorado.
| |
Collapse
|
7
|
Gallois M, Menoret D, Marques-Prieto S, Montigny A, Valenti P, Moussian B, Plaza S, Payre F, Chanut-Delalande H. Pri peptides temporally coordinate transcriptional programs during epidermal differentiation. SCIENCE ADVANCES 2024; 10:eadg8816. [PMID: 38335295 PMCID: PMC10857433 DOI: 10.1126/sciadv.adg8816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 01/09/2024] [Indexed: 02/12/2024]
Abstract
To achieve a highly differentiated state, cells undergo multiple transcriptional processes whose coordination and timing are not well understood. In Drosophila embryonic epidermal cells, polished-rice (Pri) smORF peptides act as temporal mediators of ecdysone to activate a transcriptional program leading to cell shape remodeling. Here, we show that the ecdysone/Pri axis concomitantly represses the transcription of a large subset of cuticle genes to ensure proper differentiation of the insect exoskeleton. The repression relies on the transcription factor Ken and persists for several days throughout early larval stages, during which a soft cuticle allows larval crawling. The onset of these cuticle genes normally awaits the end of larval stages when the rigid pupal case assembles, and their premature expression triggers abnormal sclerotization of the larval cuticle. These results uncovered a temporal switch to set up distinct structures of cuticles adapted to the animal lifestyle and which might be involved in the evolutionary history of insects.
Collapse
Affiliation(s)
- Maylis Gallois
- Molecular Cellular and Developmental Biology Unit (MCD), Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, Toulouse, France
| | - Delphine Menoret
- Molecular Cellular and Developmental Biology Unit (MCD), Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, Toulouse, France
| | - Simon Marques-Prieto
- Molecular Cellular and Developmental Biology Unit (MCD), Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, Toulouse, France
| | - Audrey Montigny
- Molecular Cellular and Developmental Biology Unit (MCD), Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, Toulouse, France
| | - Philippe Valenti
- Molecular Cellular and Developmental Biology Unit (MCD), Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, Toulouse, France
| | - Bernard Moussian
- Université Côte d'Azur, INRAE, CNRS, Institut Sophia Agrobiotech, Sophia Antipolis, France
| | - Serge Plaza
- Laboratoire de Recherche en Sciences Végétales, CNRS/UPS/INPT, Auzeville-Tolosane, France
| | - François Payre
- Molecular Cellular and Developmental Biology Unit (MCD), Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, Toulouse, France
| | - Hélène Chanut-Delalande
- Molecular Cellular and Developmental Biology Unit (MCD), Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, Toulouse, France
| |
Collapse
|
8
|
Markus D, Pelletier A, Boube M, Port F, Boutros M, Payre F, Obermayer B, Zanet J. The pleiotropic functions of Pri smORF peptides synchronize leg development regulators. PLoS Genet 2023; 19:e1011004. [PMID: 37903161 PMCID: PMC10635573 DOI: 10.1371/journal.pgen.1011004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 11/09/2023] [Accepted: 10/03/2023] [Indexed: 11/01/2023] Open
Abstract
The last decade witnesses the emergence of the abundant family of smORF peptides, encoded by small ORF (<100 codons), whose biological functions remain largely unexplored. Bioinformatic analyses here identify hundreds of putative smORF peptides expressed in Drosophila imaginal leg discs. Thanks to a functional screen in leg, we found smORF peptides involved in morphogenesis, including the pioneer smORF peptides Pri. Since we identified its target Ubr3 in the epidermis and pri was known to control leg development through poorly understood mechanisms, we investigated the role of Ubr3 in mediating pri function in leg. We found that pri plays several roles during leg development both in patterning and in cell survival. During larval stage, pri activates independently of Ubr3 tarsal transcriptional programs and Notch and EGFR signaling pathways, whereas at larval pupal transition, Pri peptides cooperate with Ubr3 to insure cell survival and leg morphogenesis. Our results highlight Ubr3 dependent and independent functions of Pri peptides and their pleiotropy. Moreover, we reveal that the smORF peptide family is a reservoir of overlooked developmental regulators, displaying distinct molecular functions and orchestrating leg development.
Collapse
Affiliation(s)
- Damien Markus
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Aurore Pelletier
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Muriel Boube
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Fillip Port
- Division Signaling and Functional Genomics, German Cancer Research Center (DKFZ) and Heidelberg University, Heidelberg, Germany
| | - Michael Boutros
- Division Signaling and Functional Genomics, German Cancer Research Center (DKFZ) and Heidelberg University, Heidelberg, Germany
| | - François Payre
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Benedikt Obermayer
- Core Unit Bioinformatics (CUBI), Berlin Institute of Health at Charité Universitätsmedizin-Berlin, Berlin, Germany
| | - Jennifer Zanet
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| |
Collapse
|
9
|
Translation and natural selection of micropeptides from long non-canonical RNAs. Nat Commun 2022; 13:6515. [PMID: 36316320 PMCID: PMC9622821 DOI: 10.1038/s41467-022-34094-y] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 10/13/2022] [Indexed: 12/25/2022] Open
Abstract
Long noncoding RNAs (lncRNAs) are transcripts longer than 200 nucleotides but lacking canonical coding sequences. Apparently unable to produce peptides, lncRNA function seems to rely only on RNA expression, sequence and structure. Here, we exhaustively detect in-vivo translation of small open reading frames (small ORFs) within lncRNAs using Ribosomal profiling during Drosophila melanogaster embryogenesis. We show that around 30% of lncRNAs contain small ORFs engaged by ribosomes, leading to regulated translation of 100 to 300 micropeptides. We identify lncRNA features that favour translation, such as cistronicity, Kozak sequences, and conservation. For the latter, we develop a bioinformatics pipeline to detect small ORF homologues, and reveal evidence of natural selection favouring the conservation of micropeptide sequence and function across evolution. Our results expand the repertoire of lncRNA biochemical functions, and suggest that lncRNAs give rise to novel coding genes throughout evolution. Since most lncRNAs contain small ORFs with as yet unknown translation potential, we propose to rename them "long non-canonical RNAs".
Collapse
|
10
|
Dib A, Zanet J, Mancheno-Ferris A, Gallois M, Markus D, Valenti P, Marques-Prieto S, Plaza S, Kageyama Y, Chanut-Delalande H, Payre F. Pri smORF Peptides Are Wide Mediators of Ecdysone Signaling, Contributing to Shape Spatiotemporal Responses. Front Genet 2021; 12:714152. [PMID: 34527021 DOI: 10.3389/fgene.2021.714152] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Accepted: 07/28/2021] [Indexed: 11/13/2022] Open
Abstract
There is growing evidence that peptides encoded by small open-reading frames (sORF or smORF) can fulfill various cellular functions and define a novel class regulatory molecules. To which extend transcripts encoding only smORF peptides compare with canonical protein-coding genes, yet remain poorly understood. In particular, little is known on whether and how smORF-encoding RNAs might need tightly regulated expression within a given tissue, at a given time during development. We addressed these questions through the analysis of Drosophila polished rice (pri, a.k.a. tarsal less or mille pattes), which encodes four smORF peptides (11-32 amino acids in length) required at several stages of development. Previous work has shown that the expression of pri during epidermal development is regulated in the response to ecdysone, the major steroid hormone in insects. Here, we show that pri transcription is strongly upregulated by ecdysone across a large panel of cell types, suggesting that pri is a core component of ecdysone response. Although pri is produced as an intron-less short transcript (1.5 kb), genetic assays reveal that the developmental functions of pri require an unexpectedly large array of enhancers (spanning over 50 kb), driving a variety of spatiotemporal patterns of pri expression across developing tissues. Furthermore, we found that separate pri enhancers are directly activated by the ecdysone nuclear receptor (EcR) and display distinct regulatory modes between developmental tissues and/or stages. Alike major developmental genes, the expression of pri in a given tissue often involves several enhancers driving apparently redundant (or shadow) expression, while individual pri enhancers can harbor pleiotropic functions across tissues. Taken together, these data reveal the broad role of Pri smORF peptides in ecdysone signaling and show that the cis-regulatory architecture of the pri gene contributes to shape distinct spatial and temporal patterns of ecdysone response throughout development.
Collapse
Affiliation(s)
- Azza Dib
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Jennifer Zanet
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Alexandra Mancheno-Ferris
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Maylis Gallois
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Damien Markus
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Philippe Valenti
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Simon Marques-Prieto
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Serge Plaza
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Yuji Kageyama
- Department of Biology, Graduate School of Science, Kobe University, Kobe, Japan.,Biosignal Research Center, Kobe University, Kobe, Japan
| | - Hélène Chanut-Delalande
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - François Payre
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| |
Collapse
|
11
|
Douka K, Birds I, Wang D, Kosteletos A, Clayton S, Byford A, Vasconcelos EJR, O'Connell MJ, Deuchars J, Whitehouse A, Aspden JL. Cytoplasmic long noncoding RNAs are differentially regulated and translated during human neuronal differentiation. RNA (NEW YORK, N.Y.) 2021; 27:1082-1101. [PMID: 34193551 PMCID: PMC8370745 DOI: 10.1261/rna.078782.121] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 06/22/2021] [Indexed: 06/10/2023]
Abstract
The expression of long noncoding RNAs is highly enriched in the human nervous system. However, the function of neuronal lncRNAs in the cytoplasm and their potential translation remains poorly understood. Here we performed Poly-Ribo-Seq to understand the interaction of lncRNAs with the translation machinery and the functional consequences during neuronal differentiation of human SH-SY5Y cells. We discovered 237 cytoplasmic lncRNAs up-regulated during early neuronal differentiation, 58%-70% of which are associated with polysome translation complexes. Among these polysome-associated lncRNAs, we find 45 small ORFs to be actively translated, 17 specifically upon differentiation. Fifteen of 45 of the translated lncRNA-smORFs exhibit sequence conservation within Hominidea, suggesting they are under strong selective constraint in this clade. The profiling of publicly available data sets revealed that 8/45 of the translated lncRNAs are dynamically expressed during human brain development, and 22/45 are associated with cancers of the central nervous system. One translated lncRNA we discovered is LINC01116, which is induced upon differentiation and contains an 87 codon smORF exhibiting increased ribosome profiling signal upon differentiation. The resulting LINC01116 peptide localizes to neurites. Knockdown of LINC01116 results in a significant reduction of neurite length in differentiated cells, indicating it contributes to neuronal differentiation. Our findings indicate cytoplasmic lncRNAs interact with translation complexes, are a noncanonical source of novel peptides, and contribute to neuronal function and disease. Specifically, we demonstrate a novel functional role for LINC01116 during human neuronal differentiation.
Collapse
Affiliation(s)
- Katerina Douka
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
- LeedsOmics, University of Leeds, Leeds LS2 9JT, United Kingdom
| | - Isabel Birds
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
- LeedsOmics, University of Leeds, Leeds LS2 9JT, United Kingdom
| | - Dapeng Wang
- LeedsOmics, University of Leeds, Leeds LS2 9JT, United Kingdom
| | - Andreas Kosteletos
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
- LeedsOmics, University of Leeds, Leeds LS2 9JT, United Kingdom
| | - Sophie Clayton
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
- School of Biomedical Sciences, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
| | - Abigail Byford
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
- School of Biomedical Sciences, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
| | | | - Mary J O'Connell
- School of Life Sciences, Faculty of Medicine and Health Sciences, The University of Nottingham, Nottingham NG7 2UH, United Kingdom
| | - Jim Deuchars
- LeedsOmics, University of Leeds, Leeds LS2 9JT, United Kingdom
- School of Biomedical Sciences, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
| | - Adrian Whitehouse
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
- LeedsOmics, University of Leeds, Leeds LS2 9JT, United Kingdom
- Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds LS2 9JT, United Kingdom
| | - Julie L Aspden
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
- LeedsOmics, University of Leeds, Leeds LS2 9JT, United Kingdom
| |
Collapse
|
12
|
Guerra-Almeida D, Tschoeke DA, da-Fonseca RN. Understanding small ORF diversity through a comprehensive transcription feature classification. DNA Res 2021; 28:6317669. [PMID: 34240112 PMCID: PMC8435553 DOI: 10.1093/dnares/dsab007] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Indexed: 11/13/2022] Open
Abstract
Small open reading frames (small ORFs/sORFs/smORFs) are potentially coding sequences smaller than 100 codons that have historically been considered junk DNA by gene prediction software and in annotation screening; however, the advent of next-generation sequencing has contributed to the deeper investigation of junk DNA regions and their transcription products, resulting in the emergence of smORFs as a new focus of interest in systems biology. Several smORF peptides were recently reported in noncanonical mRNAs as new players in numerous biological contexts; however, their relevance is still overlooked in coding potential analysis. Hence, this review proposes a smORF classification based on transcriptional features, discussing the most promising approaches to investigate smORFs based on their different characteristics. First, smORFs were divided into nonexpressed (intergenic) and expressed (genic) smORFs. Second, genic smORFs were classified as smORFs located in noncoding RNAs (ncRNAs) or canonical mRNAs. Finally, smORFs in ncRNAs were further subdivided into sequences located in small or long RNAs, whereas smORFs located in canonical mRNAs were subdivided into several specific classes depending on their localization along the gene. We hope that this review provides new insights into large-scale annotations and reinforces the role of smORFs as essential components of a hidden coding DNA world.
Collapse
Affiliation(s)
- Diego Guerra-Almeida
- Institute of Biodiversity and Sustainability, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Diogo Antonio Tschoeke
- Alberto Luiz Coimbra Institute of Graduate Studies and Engineering Research (COPPE), Biomedical Engineering Program, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Rodrigo Nunes- da-Fonseca
- Institute of Biodiversity and Sustainability, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil.,National Institute of Science and Technology in Molecular Entomology, Rio de Janeiro, Brazil
| |
Collapse
|
13
|
Taira Y, Wada H, Hayashi S, Kageyama Y. polished rice mediates ecdysone-dependent control of Drosophila embryonic organogenesis. Genes Cells 2021; 26:269-281. [PMID: 33621395 DOI: 10.1111/gtc.12841] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Revised: 02/05/2021] [Accepted: 02/05/2021] [Indexed: 01/08/2023]
Abstract
In many animals, progression of developmental stages is temporally controlled by steroid hormones. In Drosophila, the level of ecdysone titer oscillates and developmental stage transitions, such as larval molting and metamorphosis, are induced at each of ecdysone peaks. Ecdysone titer also peaks at the stage of mid-embryogenesis and the embryonic ecdysone is necessary for morphogenesis of several organs, although the regulatory mechanisms of embryonic organogenesis dependent on ecdysone signaling are still open questions. In this study, we find that absence or interruption of embryonic ecdysone signaling caused multiple defects in the tracheal system, including decrease in luminal protein deposition, uneven dilation of the dorsal trunk and loss of terminal branches. We also reveal that an ecdysone-inducible gene polished rice (pri) is essential for tip cell fate decision in dorsal branches. As over-expression of pri can restore the defects caused by disturbance of ecdysone biosynthesis, pri functions as one of the major mediators of embryonic ecdysone signal in tracheogenesis. These results demonstrate that ecdysone and its downstream target pri play essential roles in tracheal development by modulating cell fate decision.
Collapse
Affiliation(s)
- Yuki Taira
- Department of Biology, Graduate School of Science, Kobe University, Kobe, Japan
| | - Housei Wada
- Laboratory for Morphogenetic Signaling, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| | - Shigeo Hayashi
- Department of Biology, Graduate School of Science, Kobe University, Kobe, Japan.,Laboratory for Morphogenetic Signaling, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| | - Yuji Kageyama
- Department of Biology, Graduate School of Science, Kobe University, Kobe, Japan.,Biosignal Research Center, Kobe University, Kobe, Japan
| |
Collapse
|
14
|
Schlesinger D, Elsässer SJ. Revisiting sORFs: overcoming challenges to identify and characterize functional microproteins. FEBS J 2021; 289:53-74. [PMID: 33595896 DOI: 10.1111/febs.15769] [Citation(s) in RCA: 69] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 01/17/2021] [Accepted: 02/15/2021] [Indexed: 02/07/2023]
Abstract
Short ORFs (sORFs), that is, occurrences of a start and stop codon within 100 codons or less, can be found in organisms of all domains of life, outnumbering annotated protein-coding ORFs by orders of magnitude. Even though functional proteins smaller than 100 amino acids are known, the coding potential of sORFs has often been overlooked, as it is not trivial to predict and test for functionality within the large number of sORFs. Recent advances in ribosome profiling and mass spectrometry approaches, together with refined bioinformatic predictions, have enabled a huge leap forward in this field and identified thousands of likely coding sORFs. A relatively low number of small proteins or microproteins produced from these sORFs have been characterized so far on the molecular, structural, and/or mechanistic level. These however display versatile and, in some cases, essential cellular functions, allowing for the exciting possibility that many more, previously unknown small proteins might be encoded in the genome, waiting to be discovered. This review will give an overview of the steadily growing microprotein field, focusing on eukaryotic small proteins. We will discuss emerging themes in the molecular action of microproteins, as well as advances and challenges in microprotein identification and characterization.
Collapse
Affiliation(s)
- Dörte Schlesinger
- Science for Life Laboratory, Division of Genome Biology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden.,Ming Wai Lau Centre for Reparative Medicine, Stockholm node, Karolinska Institutet, Stockholm, Sweden
| | - Simon J Elsässer
- Science for Life Laboratory, Division of Genome Biology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden.,Ming Wai Lau Centre for Reparative Medicine, Stockholm node, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
15
|
Guerra-Almeida D, Nunes-da-Fonseca R. Small Open Reading Frames: How Important Are They for Molecular Evolution? Front Genet 2020; 11:574737. [PMID: 33193682 PMCID: PMC7606980 DOI: 10.3389/fgene.2020.574737] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2020] [Accepted: 08/25/2020] [Indexed: 11/13/2022] Open
Affiliation(s)
- Diego Guerra-Almeida
- Institute of Biodiversity and Sustainability, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Rodrigo Nunes-da-Fonseca
- Institute of Biodiversity and Sustainability, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil.,National Institute of Science and Technology in Molecular Entomology, Rio de Janeiro, Brazil
| |
Collapse
|
16
|
Role of Notch Signaling in Leg Development in Drosophila melanogaster. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2020; 1218:103-127. [PMID: 32060874 DOI: 10.1007/978-3-030-34436-8_7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Notch pathway plays diverse and fundamental roles during animal development. One of the most relevant, which arises directly from its unique mode of activation, is the specification of cell fates and tissue boundaries. The development of the leg of Drosophila melanogaster is a fine example of this Notch function, as it is required to specify the fate of the cells that will eventually form the leg joints, the flexible structures that separate the different segments of the adult leg. Notch activity is accurately activated and maintained at the distal end of each segment in response to the proximo-distal patterning gene network of the developing leg. Region-specific downstream targets of Notch in turn regulate the formation of the different types of joints. We discuss recent findings that shed light on the molecular and cellular mechanisms that are ultimately governed by Notch to achieve epithelial fold and joint morphogenesis. Finally, we briefly summarize the role that Notch plays in inducing the nonautonomous growth of the leg. Overall, this book chapter aims to highlight leg development as a useful model to study how patterning information is translated into specific cell behaviors that shape the final form of an adult organ.
Collapse
|
17
|
Reisacher C, Arbibe L. Not lost in host translation: The new roles of long noncoding RNAs in infectious diseases. Cell Microbiol 2019; 21:e13119. [PMID: 31634981 DOI: 10.1111/cmi.13119] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Revised: 09/10/2019] [Accepted: 09/17/2019] [Indexed: 12/20/2022]
Abstract
Long non-coding RNAs (lncRNAs) play a central role in the regulation of gene expression. Although they were initially described as mRNA-like transcripts not encoding proteins, global approaches such as ribosome profiling have shown that they frequently associate with ribosomes, opening the possibility that lncRNAs are a source of cryptic translation events with functional roles. Recent studies have shed more light on small ORFs borne by lncRNAs and encoding short peptides potentially involved in infectious immunity. This review outlines the main strategies used to determine the coding potential of lncRNAs and discusses our emerging understanding of the implication of the encoded peptides in infectious diseases.
Collapse
Affiliation(s)
- Caroline Reisacher
- Department of Immunology, Infectiology and Hematology, Institut Necker-Enfants Malades (INEM), INSERM U1151, CNRS UMR 8253, Université Paris Descartes, Paris, France
| | - Laurence Arbibe
- Department of Immunology, Infectiology and Hematology, Institut Necker-Enfants Malades (INEM), INSERM U1151, CNRS UMR 8253, Université Paris Descartes, Paris, France
| |
Collapse
|
18
|
Ray S, Rosenberg MI, Chanut-Delalande H, Decaras A, Schwertner B, Toubiana W, Auman T, Schnellhammer I, Teuscher M, Valenti P, Khila A, Klingler M, Payre F. The mlpt/Ubr3/Svb module comprises an ancient developmental switch for embryonic patterning. eLife 2019; 8:e39748. [PMID: 30896406 PMCID: PMC6428570 DOI: 10.7554/elife.39748] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2018] [Accepted: 03/07/2019] [Indexed: 12/30/2022] Open
Abstract
Small open reading frames (smORFs) encoding 'micropeptides' exhibit remarkable evolutionary complexity. Conserved peptides encoded by mille-pattes (mlpt)/polished rice (pri)/tarsal less (tal) are essential for embryo segmentation in Tribolium but, in Drosophila, function in terminal epidermal differentiation and patterning of adult legs. Here, we show that a molecular complex identified in Drosophila epidermal differentiation, comprising Mlpt peptides, ubiquitin-ligase Ubr3 and transcription factor Shavenbaby (Svb), represents an ancient developmental module required for early insect embryo patterning. We find that loss of segmentation function for this module in flies evolved concomitantly with restriction of Svb expression in early Drosophila embryos. Consistent with this observation, artificially restoring early Svb expression in flies causes segmentation defects that depend on mlpt function, demonstrating enduring potency of an ancestral developmental switch despite evolving embryonic patterning modes. These results highlight the evolutionary plasticity of conserved molecular complexes under the constraints of essential genetic networks. Editorial note This article has been through an editorial process in which the authors decide how to respond to the issues raised during peer review. The Reviewing Editor's assessment is that all the issues have been addressed (see decision letter).
Collapse
Affiliation(s)
- Suparna Ray
- Department of Biology, Developmental BiologyUniversity of Erlangen-NurembergErlangenGermany
| | - Miriam I Rosenberg
- Department of Ecology, Evolution and BehaviorHebrew University of JerusalemJerusalemIsrael
| | | | | | - Barbara Schwertner
- Department of Biology, Developmental BiologyUniversity of Erlangen-NurembergErlangenGermany
| | | | - Tzach Auman
- Department of Ecology, Evolution and BehaviorHebrew University of JerusalemJerusalemIsrael
| | - Irene Schnellhammer
- Department of Biology, Developmental BiologyUniversity of Erlangen-NurembergErlangenGermany
| | - Matthias Teuscher
- Department of Biology, Developmental BiologyUniversity of Erlangen-NurembergErlangenGermany
| | - Philippe Valenti
- Centre de Biologie du Développement, Université Paul Sabatier de ToulouseToulouseFrance
| | | | - Martin Klingler
- Department of Biology, Developmental BiologyUniversity of Erlangen-NurembergErlangenGermany
| | - François Payre
- Centre de Biologie du Développement, Université Paul Sabatier de ToulouseToulouseFrance
| |
Collapse
|
19
|
Mat-Sharani S, Firdaus-Raih M. Computational discovery and annotation of conserved small open reading frames in fungal genomes. BMC Bioinformatics 2019; 19:551. [PMID: 30717662 PMCID: PMC7394265 DOI: 10.1186/s12859-018-2550-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Accepted: 11/30/2018] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Small open reading frames (smORF/sORFs) that encode short protein sequences are often overlooked during the standard gene prediction process thus leading to many sORFs being left undiscovered and/or misannotated. For many genomes, a second round of sORF targeted gene prediction can complement the existing annotation. In this study, we specifically targeted the identification of ORFs encoding for 80 amino acid residues or less from 31 fungal genomes. We then compared the predicted sORFs and analysed those that are highly conserved among the genomes. RESULTS A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized. CONCLUSIONS It is evident that numerous open reading frames that could potentially encode for polypeptides consisting of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. From our results, additional targeted reannotation of genomes is clearly able to complement standard genome annotation to identify sORFs. Due to the lack of, and limitations with experimental validation, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.
Collapse
Affiliation(s)
- Shuhaila Mat-Sharani
- Centre for Frontier Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia.,Malaysia Genome Institute, Ministry of Science, Technology & Innovation, Jalan Bangi, 43000, Kajang, Selangor, Malaysia
| | - Mohd Firdaus-Raih
- Centre for Frontier Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia. .,Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia.
| |
Collapse
|
20
|
Jiménez-Guri E, Wotton KR, Jaeger J. tarsal-less is expressed as a gap gene but has no gap gene phenotype in the moth midge Clogmia albipunctata. ROYAL SOCIETY OPEN SCIENCE 2018; 5:180458. [PMID: 30225035 PMCID: PMC6124123 DOI: 10.1098/rsos.180458] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Accepted: 07/18/2018] [Indexed: 06/08/2023]
Abstract
Gap genes are involved in segment determination during early development of the vinegar fly Drosophila melanogaster and other dipteran insects (flies, midges and mosquitoes). They are expressed in overlapping domains along the antero-posterior (A-P) axis of the blastoderm embryo. While gap domains cover the entire length of the A-P axis in Drosophila, there is a region in the blastoderm of the moth midge Clogmia albipunctata, which lacks canonical gap gene expression. Is a non-canonical gap gene functioning in this area? Here, we characterize tarsal-less (tal) in C. albipunctata. The homologue of tal in the flour beetle Tribolium castaneum (called milles-pattes, mlpt) is a bona fide gap gene. We find that Ca-tal is expressed in the region previously reported as lacking gap gene expression. Using RNA interference, we study the interaction of Ca-tal with gap genes. We show that Ca-tal is regulated by gap genes, but only has a very subtle effect on tailless (Ca-tll), while not affecting other gap genes at all. Moreover, cuticle phenotypes of Ca-tal depleted embryos do not show any gap phenotype. We conclude that Ca-tal is expressed and regulated like a gap gene, but does not function as a gap gene in C. albipunctata.
Collapse
Affiliation(s)
- Eva Jiménez-Guri
- EMBL/CRG Research Unit in Systems Biology, Centre de Regulació Genòmica (CRG), The Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Centre for Ecology and Conservation, College of Life and Environmental Sciences, University of Exeter, Penryn, Cornwall TR10 9EZ, UK
| | - Karl R. Wotton
- EMBL/CRG Research Unit in Systems Biology, Centre de Regulació Genòmica (CRG), The Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Johannes Jaeger
- EMBL/CRG Research Unit in Systems Biology, Centre de Regulació Genòmica (CRG), The Barcelona Institute of Science and Technology (BIST), Dr. Aiguader 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| |
Collapse
|
21
|
Functions and impact of tal-like genes in animals with regard to applied aspects. Appl Microbiol Biotechnol 2018; 102:6841-6845. [PMID: 29909570 DOI: 10.1007/s00253-018-9159-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Revised: 06/04/2018] [Accepted: 06/11/2018] [Indexed: 02/03/2023]
Abstract
A large number of DNAs in eukaryote genomes can code for atypical transcripts, and their functions are controversial. It has been reported that the transcripts contain many small open reading frames (sORFs), which were originally considered as non-translatable RNAs. However, increasing evidence has suggested that some of these sORFs can encode for small peptides and some are conserved across large evolutionary distances. It has been reported that the small peptides have functions and may be involved in varieties of cellular processes, playing important roles in development, physiology, and metabolism. Among the sORFs, studies of the non-canonical gene polished rice/tarsal-less (pri/tal) in Drosophila and mille-pattes(mlpt) in Tribolium have been more thoroughly studied. The genes similar to pri/tal in other species have been defined as the tarsal-less-related gene family, tal-like gene. In this review, we described recent progress in the discovery and functional characterization of the small peptides encoded by the tal-like gene and their possible functional potentials.
Collapse
|
22
|
Gene regulatory network architecture in different developmental contexts influences the genetic basis of morphological evolution. PLoS Genet 2018; 14:e1007375. [PMID: 29723190 PMCID: PMC5953500 DOI: 10.1371/journal.pgen.1007375] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2017] [Revised: 05/15/2018] [Accepted: 04/23/2018] [Indexed: 01/09/2023] Open
Abstract
Convergent phenotypic evolution is often caused by recurrent changes at particular nodes in the underlying gene regulatory networks (GRNs). The genes at such evolutionary ‘hotspots’ are thought to maximally affect the phenotype with minimal pleiotropic consequences. This has led to the suggestion that if a GRN is understood in sufficient detail, the path of evolution may be predictable. The repeated evolutionary loss of larval trichomes among Drosophila species is caused by the loss of shavenbaby (svb) expression. svb is also required for development of leg trichomes, but the evolutionary gain of trichomes in the ‘naked valley’ on T2 femurs in Drosophila melanogaster is caused by reduced microRNA-92a (miR-92a) expression rather than changes in svb. We compared the expression and function of components between the larval and leg trichome GRNs to investigate why the genetic basis of trichome pattern evolution differs in these developmental contexts. We found key differences between the two networks in both the genes employed, and in the regulation and function of common genes. These differences in the GRNs reveal why mutations in svb are unlikely to contribute to leg trichome evolution and how instead miR-92a represents the key evolutionary switch in this context. Our work shows that variability in GRNs across different developmental contexts, as well as whether a morphological feature is lost versus gained, influence the nodes at which a GRN evolves to cause morphological change. Therefore, our findings have important implications for understanding the pathways and predictability of evolution. A major goal of biology is to identify the genetic causes of organismal diversity. Convergent evolution of traits is often caused by changes in the same genes–evolutionary ‘hotspots’. shavenbaby is a ‘hotspot’ for larval trichome loss in Drosophila, but microRNA-92a underlies the gain of leg trichomes. To understand this difference in the genetics of phenotypic evolution, we compared the expression and function of genes in the underlying regulatory networks. We found that the pathway of evolution is influenced by differences in gene regulatory network architecture in different developmental contexts, as well as by whether a trait is lost or gained. Therefore, hotspots in one context may not readily evolve in a different context. This has important implications for understanding the genetic basis of phenotypic change and the predictability of evolution.
Collapse
|
23
|
VanOrsdel CE, Kelly JP, Burke BN, Lein CD, Oufiero CE, Sanchez JF, Wimmers LE, Hearn DJ, Abuikhdair FJ, Barnhart KR, Duley ML, Ernst SEG, Kenerson BA, Serafin AJ, Hemm MR. Identifying New Small Proteins in Escherichia coli. Proteomics 2018; 18:e1700064. [PMID: 29645342 PMCID: PMC6001520 DOI: 10.1002/pmic.201700064] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2017] [Revised: 03/05/2018] [Indexed: 12/11/2022]
Abstract
The number of small proteins (SPs) encoded in the Escherichia coli genome is unknown, as current bioinformatics and biochemical techniques make short gene and small protein identification challenging. One method of small protein identification involves adding an epitope tag to the 3′ end of a short open reading frame (sORF) on the chromosome, with synthesis confirmed by immunoblot assays. In this study, this strategy was used to identify new E. coli small proteins, tagging 80 sORFs in the E. coli genome, and assayed for protein synthesis. The selected sORFs represent diverse sequence characteristics, including degrees of sORF conservation, predicted transmembrane domains, sORF direction with respect to flanking genes, ribosome binding site (RBS) prediction, and ribosome profiling results. Of 80 sORFs, 36 resulted in encoded synthesized proteins—a 45% success rate. Modeling of detected versus non‐detected small proteins analysis showed predictions based on RBS prediction, transcription data, and ribosome profiling had statistically‐significant correlation with protein synthesis; however, there was no correlation between current sORF annotation and protein synthesis. These results suggest substantial numbers of small proteins remain undiscovered in E. coli, and existing bioinformatics techniques must continue to improve to facilitate identification.
Collapse
Affiliation(s)
- Caitlin E VanOrsdel
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - John P Kelly
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Brittany N Burke
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Christina D Lein
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | | | - Joseph F Sanchez
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Larry E Wimmers
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - David J Hearn
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Fatimeh J Abuikhdair
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Kathryn R Barnhart
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Michelle L Duley
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Sarah E G Ernst
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Briana A Kenerson
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Aubrey J Serafin
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| | - Matthew R Hemm
- Department of Biological Sciences, Smith Hall, Towson University, Towson, MD, USA
| |
Collapse
|
24
|
Pang Y, Mao C, Liu S. Encoding activities of non-coding RNAs. Am J Cancer Res 2018; 8:2496-2507. [PMID: 29721095 PMCID: PMC5928905 DOI: 10.7150/thno.24677] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2018] [Accepted: 02/25/2018] [Indexed: 12/14/2022] Open
Abstract
The universal expression of various non-coding RNAs (ncRNAs) is now considered the main feature of organisms' genomes. Many regions in the genome are transcribed but not annotated to encode proteins, yet contain small open reading frames (smORFs). A widely accepted opinion is that a vast majority of ncRNAs are not further translated. However, increasing evidence underlines a series of intriguing translational events from the ncRNAs, which were previously considered to lack coding potential. Recent studies also suggest that products derived from such novel translational events display important regulatory functions in many fundamental biological and pathological processes. Here we give a critical review on the potential coding capacity of ncRNAs, in particular, about what is known and unknown in this emerging area. We also discuss the possible underlying coding mechanisms of these extraordinary ncRNAs and possible roles of peptides or proteins derived from the ncRNAs in disease development and theranostics. Our review offers an extensive resource for studying the biology of ncRNAs and sheds light into the use of ncRNAs and their corresponding peptides or proteins for disease diagnosis and therapy.
Collapse
|
25
|
The influence of transcript assembly on the proteogenomics discovery of microproteins. PLoS One 2018; 13:e0194518. [PMID: 29584760 PMCID: PMC5870951 DOI: 10.1371/journal.pone.0194518] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Accepted: 03/05/2018] [Indexed: 11/19/2022] Open
Abstract
Proteogenomics methods have identified many non-annotated protein-coding genes in the human genome. Many of the newly discovered protein-coding genes encode peptides and small proteins, referred to collectively as microproteins. Microproteins are produced through ribosome translation of small open reading frames (smORFs). The discovery of many smORFs reveals a blind spot in traditional gene-finding algorithms for these genes. Biological studies have found roles for microproteins in cell biology and physiology, and the potential that there exists additional bioactive microproteins drives the interest in detection and discovery of these molecules. A key step in any proteogenomics workflow is the assembly of RNA-Seq data into likely mRNA transcripts that are then used to create a searchable protein database. Here we demonstrate that specific features of the assembled transcriptome impact microprotein detection by shotgun proteomics. By tailoring transcript assembly for downstream mass spectrometry searching, we show that we can detect more than double the number of high-quality microprotein candidates and introduce a novel open-source mRNA assembler for proteogenomics (MAPS) that incorporates all of these features. By integrating our specialized assembler, MAPS, and a popular generalized assembler into our proteogenomics pipeline, we detect 45 novel human microproteins from a high quality proteogenomics dataset of a human cell line. We then characterize the features of the novel microproteins, identifying two classes of microproteins. Our work highlights the importance of specialized transcriptome assembly upstream of proteomics validation when searching for short and potentially rare and poorly conserved proteins.
Collapse
|
26
|
Identification of tarsal-less peptides from the silkworm Bombyx mori. Appl Microbiol Biotechnol 2018; 102:1809-1822. [PMID: 29306967 DOI: 10.1007/s00253-017-8708-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Revised: 11/22/2017] [Accepted: 12/11/2017] [Indexed: 10/18/2022]
Abstract
The polycistronic and non-canonical gene tarsal-less (tal, known as pri) was reported to be required for embryonic and imaginal development in Drosophila; however, there are few reports of the tal gene in the silkworm Bombyx mori. Here, we cloned a tal-like (Bmtal) gene, and a sequence analysis showed that the Bmtal cDNA (1661 bp) contains five small open reading frames (smORFs) (A1, A2, A3, A4, and B) that encode short peptides of 11-12 (A1-A4) amino acid residues containing an LDPTG(E)L(Q)(V)Y motif that is conserved in Drosophila Tal, as well as a 32-amino-acid B peptide. Reverse transcription-quantitative polymerase chain reaction showed that the expression of the Bmtal gene was highest in the trachea, followed by the silk gland and Malpighian tubule, in day 3 fifth-instar larvae. Subcellular localization showed that BmTal localized in the nucleus. By regulating the expression of the full-length Bmtal gene and the functional smORFs of Bmtal, we showed that the expression levels of the Bmovo gene and genes related to the Notch, transforming growth factor-β, and Hippo signaling pathways changed with changes in BmTal peptide expression. A co-immunoprecipitation assay showed that BmTal interacts with polyubiquitin, which triggered degradation and/or processing of the 14-3-3 protein zeta. A comparative transcriptome analysis showed that 2843 (2045) genes were up- (down)-regulated after Bmtal gene expression was up-regulated. The up- (down)-regulated differentially expressed genes were enriched in 326 (278) gene ontology terms (P ≤ 0.05) and 54 (59) Kyoto Encyclopedia of Genes and Genomes pathways (P ≤ 0.05), and the results indicated that the BmTal peptides could function as mediators of hormone levels or the activities of multiple pathways, including the peroxisome proliferator-activated receptor, Hedgehog, mitogen-activated protein kinase, adipocytokine, and gonadotropin-releasing hormone signaling pathways, as well as the innate immune response. These results increase our understanding of the function and mechanism of BmTal at the genome-wide level.
Collapse
|
27
|
|
28
|
Chu Q, Rathore A, Diedrich JK, Donaldson CJ, Yates JR, Saghatelian A. Identification of Microprotein-Protein Interactions via APEX Tagging. Biochemistry 2017; 56:3299-3306. [PMID: 28589727 PMCID: PMC5499098 DOI: 10.1021/acs.biochem.7b00265] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2017] [Revised: 05/17/2017] [Indexed: 02/08/2023]
Abstract
Microproteins are peptides and small proteins encoded by small open reading frames (smORFs). Newer technologies have led to the recent discovery of hundreds to thousands of new microproteins. The biological functions of a few microproteins have been elucidated, and these microproteins have fundamental roles in biology ranging from limb development to muscle function, highlighting the value of characterizing these molecules. The identification of microprotein-protein interactions (MPIs) has proven to be a successful approach to the functional characterization of these genes; however, traditional immunoprecipitation methods result in the enrichment of nonspecific interactions for microproteins. Here, we test and apply an in situ proximity tagging method that relies on an engineered ascorbate peroxidase 2 (APEX) to elucidate MPIs. The results demonstrate that APEX tagging is superior to traditional immunoprecipitation methods for microproteins. Furthermore, the application of APEX tagging to an uncharacterized microprotein called C11orf98 revealed that this microprotein interacts with nucleolar proteins nucleophosmin and nucleolin, demonstrating the ability of this approach to identify novel hypothesis-generating MPIs.
Collapse
Affiliation(s)
- Qian Chu
- Clayton
Foundation Laboratories for Peptide Biology, The Salk Institute for Biological Studies, 10010 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Annie Rathore
- Clayton
Foundation Laboratories for Peptide Biology, The Salk Institute for Biological Studies, 10010 North Torrey Pines Road, La Jolla, California 92037, United States
- Division
of Biological Sciences, University of California,
San Diego, 9500 Gilman
Drive, La Jolla, California 92093, United States
| | - Jolene K. Diedrich
- Clayton
Foundation Laboratories for Peptide Biology, The Salk Institute for Biological Studies, 10010 North Torrey Pines Road, La Jolla, California 92037, United States
- Department
of Chemical Physiology, The Scripps Research
Institute, 10550 North
Torrey Pines Road, La Jolla, California 92037, United States
| | - Cynthia J. Donaldson
- Clayton
Foundation Laboratories for Peptide Biology, The Salk Institute for Biological Studies, 10010 North Torrey Pines Road, La Jolla, California 92037, United States
| | - John R. Yates
- Department
of Chemical Physiology, The Scripps Research
Institute, 10550 North
Torrey Pines Road, La Jolla, California 92037, United States
| | - Alan Saghatelian
- Clayton
Foundation Laboratories for Peptide Biology, The Salk Institute for Biological Studies, 10010 North Torrey Pines Road, La Jolla, California 92037, United States
| |
Collapse
|
29
|
Functions of long non-coding RNAs in human disease and their conservation in Drosophila development. Biochem Soc Trans 2017; 45:895-904. [PMID: 28673935 DOI: 10.1042/bst20160428] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Revised: 05/18/2017] [Accepted: 05/31/2017] [Indexed: 02/06/2023]
Abstract
Genomic analysis has found that the transcriptome in both humans and Drosophila melanogaster features large numbers of long non-coding RNA transcripts (lncRNAs). This recently discovered class of RNAs regulates gene expression in diverse ways and has been involved in a large variety of important biological functions. Importantly, an increasing number of lncRNAs have also been associated with a range of human diseases, including cancer. Comparative analyses of their functions among these organisms suggest that some of their modes of action appear to be conserved. This highlights the importance of model organisms such as Drosophila, which shares many gene regulatory networks with humans, in understanding lncRNA function and its possible impact in human health. This review discusses some known functions and mechanisms of action of lncRNAs and their implication in human diseases, together with their functional conservation and relevance in Drosophila development.
Collapse
|
30
|
Tissue-specific enhancer repression through molecular integration of cell signaling inputs. PLoS Genet 2017; 13:e1006718. [PMID: 28394894 PMCID: PMC5402979 DOI: 10.1371/journal.pgen.1006718] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Revised: 04/24/2017] [Accepted: 03/27/2017] [Indexed: 11/19/2022] Open
Abstract
Drosophila leg morphogenesis occurs under the control of a relatively well-known genetic cascade, which mobilizes both cell signaling pathways and tissue-specific transcription factors. However, their cross-regulatory interactions, deployed to refine leg patterning, remain poorly characterized at the gene expression level. Within the genetically interacting landscape that governs limb development, the bric-à-brac2 (bab2) gene is required for distal leg segmentation. We have previously shown that the Distal-less (Dll) homeodomain and Rotund (Rn) zinc-finger activating transcription factors control limb-specific bab2 expression by binding directly a single critical leg/antennal enhancer (LAE) within the bric-à-brac locus. By genetic and molecular analyses, we show here that the EGFR-responsive C15 homeodomain and the Notch-regulated Bowl zinc-finger transcription factors also interact directly with the LAE enhancer as a repressive duo. The appendage patterning gene bab2 is the first identified direct target of the Bowl repressor, an Odd-skipped/Osr family member. Moreover, we show that C15 acts on LAE activity independently of its regular partner, the Aristaless homeoprotein. Instead, we find that C15 interacts physically with the Dll activator through contacts between their homeodomain and binds competitively with Dll to adjacent cognate sites on LAE, adding potential new layers of regulation by C15. Lastly, we show that C15 and Bowl activities regulate also rn expression. Our findings shed light on how the concerted action of two transcriptional repressors, in response to cell signaling inputs, shapes and refines gene expression along the limb proximo-distal axis in a timely manner. Limb morphogenesis is controlled by a well-known genetic cascade, mobilizing both cell signaling and tissue-specific transcription factors (TFs). However, how their concerted action refines gene expression remains to be deciphered. It is thus crucial to understand how cell signaling inputs are integrated by transcriptional “enhancers”. The Drosophila leg provides a good paradigm to dissect the molecular mechanisms underlying gene regulation. Here, we used the bric-a-brac2 (bab2) gene as a model to study the integrated regulation of patterning genes implicated in tarsal segmentation. bab2 expression in the leg primordium is dynamic and complex, going from initial broad distal expression to precisely positioned tarsal rings. By genetic and molecular analyses, we show here that the cell signaling-responding TFs C15 and Bowl interact directly with the limb-specific bab2 enhancer as a repressive duo. Moreover, C15 acts independently of its partner Aristaless through physical interaction with the Dll activator. We propose that Dll induces early circular bab2 expression pattern, then EGFR signaling-induced C15 in the distalmost cells competes with Dll for LAE binding and resolves bab2 pattern as a ring. Taken together our data shed light on how the concerted action of a quartet of transcription factors reshapes gene expression during limb proximo-distal axis development.
Collapse
|
31
|
Kojima T. Developmental mechanism of the tarsus in insect legs. CURRENT OPINION IN INSECT SCIENCE 2017; 19:36-42. [PMID: 28521941 DOI: 10.1016/j.cois.2016.11.002] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 11/16/2016] [Accepted: 11/22/2016] [Indexed: 06/07/2023]
Abstract
Insects show a tremendous morphological variety and have been a subject of studying morphological evolution. In legs, the tarsus is especially variable in the number of subsegments (tarsal segments) and their proportion unlike other leg segments. Recent studies in Drosophila melanogaster have revealed details of the tarsal development: regionalization of the tarsal region through integration of regulatory network and its growth, determination of the joint-forming region in each segment through strict regulation of Notch activity, changes in tissue morphology through regulation of RhoGTPases regulators and localized cell death, and finally, the morphogenetic mechanism of the ball-and-socket joint between tarsal segments. The substantial knowledge of the tarsal development makes it a suitable model for studying mechanisms of morphological evolution and diversification.
Collapse
Affiliation(s)
- Tetsuya Kojima
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562, Japan.
| |
Collapse
|
32
|
Serum stress responsive gene EhslncRNA of Entamoeba histolytica is a novel long noncoding RNA. Sci Rep 2016; 6:27476. [PMID: 27273618 PMCID: PMC4895391 DOI: 10.1038/srep27476] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2016] [Accepted: 05/19/2016] [Indexed: 12/15/2022] Open
Abstract
Non coding RNAs are known to play important roles in regulating gene expression at the transcriptional and posttranscriptional levels in metazoans. There is very little information available about non coding RNAs in protists such as Entamoeba histolytica. Antisense and micro RNAs have been reported in E. histolytica, however no long non coding RNAs has been reported yet. Here, we report our findings on an in vitro serum stress-inducible gene EhslncRNA, a member of B1 transmembrane kinase family of E. histolytica. EhslncRNA encodes a transcript of 2.6 kb and sequence analysis revealed that there is no ORF >150 bp within this transcript. The transcript was found to be polyadenylated and mainly associated with monosomes in the cytoplasm under serum starvation. In normal proliferating cells this RNA is mainly present in the nucleus. The promoter element was mapped between 437 to 346 nucleotides upstream of transcriptional start site and has both positive and negative regulatory elements. Deletion of the negative element converted the promoter to serum inducible type. Oxygen and heat stress also increased expression levels of EhslncRNA. These observations suggest that EhslncRNA may be a long non coding RNA and likely to help cells withstand stressful conditions in the host.
Collapse
|
33
|
Chicken genome modelling for the benefit of science. WORLD POULTRY SCI J 2016. [DOI: 10.1017/s0043933915002652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
34
|
Saghatelian A, Couso JP. Discovery and characterization of smORF-encoded bioactive polypeptides. Nat Chem Biol 2015; 11:909-16. [PMID: 26575237 PMCID: PMC4956473 DOI: 10.1038/nchembio.1964] [Citation(s) in RCA: 197] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2015] [Accepted: 10/19/2015] [Indexed: 12/13/2022]
Abstract
Analysis of genomes, transcriptomes and proteomes reveals the existence of hundreds to thousands of translated, yet non-annotated, short open reading frames (small ORFs or smORFs). The discovery of smORFs and their protein products, smORF-encoded polypeptides (SEPs), points to a fundamental gap in our knowledge of protein-coding genes. Various studies have identified central roles for smORFs in metabolism, apoptosis and development. The discovery of these bioactive SEPs emphasizes the functional potential of this unexplored class of biomolecules. Here, we provide an overview of this emerging field and highlight the opportunities for chemical biology to answer fundamental questions about these novel genes. Such studies will provide new insights into the protein-coding potential of genomes and identify functional genes with roles in biology and disease.
Collapse
Affiliation(s)
- Alan Saghatelian
- Clayton Foundation Laboratories for Peptide Biology, Helmsley Center for Genomic Medicine, Salk Institute for Biological Studies, San Diego, CA 92037
| | - Juan Pablo Couso
- School of Life Sciences, University of Sussex, Falmer, Brighton, BN1 6PU, UK
| |
Collapse
|
35
|
Li Q, Barish S, Okuwa S, Volkan PC. Examination of Endogenous Rotund Expression and Function in Developing Drosophila Olfactory System Using CRISPR-Cas9-Mediated Protein Tagging. G3 (BETHESDA, MD.) 2015; 5:2809-16. [PMID: 26497147 PMCID: PMC4683652 DOI: 10.1534/g3.115.021857] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Accepted: 10/12/2015] [Indexed: 02/05/2023]
Abstract
The zinc-finger protein Rotund (Rn) plays a critical role in controlling the development of the fly olfactory system. However, little is known about its molecular function in vivo. Here, we added protein tags to the rn locus using CRISPR-Cas9 technology in Drosophila to investigate its subcellular localization and the genes that it regulates . We previously used a reporter construct to show that rn is expressed in a subset of olfactory receptor neuron (ORN) precursors and it is required for the diversification of ORN fates. Here, we show that tagged endogenous Rn protein is functional based on the analysis of ORN phenotypes. Using this method, we also mapped the expression pattern of the endogenous isoform-specific tags in vivo with increased precision. Comparison of the Rn expression pattern from this study with previously published results using GAL4 reporters showed that Rn is mainly present in early steps in antennal disc patterning, but not in pupal stages when ORNs are born. Finally, using chromatin immunoprecipitation, we showed a direct binding of Rotund to a previously identified regulatory element upstream of the bric-a-brac gene locus in the developing antennal disc.
Collapse
Affiliation(s)
- Qingyun Li
- Department of Biology, Duke University, Durham, North Carolina 27708
| | - Scott Barish
- Department of Biology, Duke University, Durham, North Carolina 27708
| | - Sumie Okuwa
- Department of Biology, Duke University, Durham, North Carolina 27708
| | - Pelin C Volkan
- Department of Biology, Duke University, Durham, North Carolina 27708 Duke Institute for Brain Sciences, Duke University, Durham, North Carolina 27708
| |
Collapse
|
36
|
Couzigou JM, Lauressergues D, Bécard G, Combier JP. miRNA-encoded peptides (miPEPs): A new tool to analyze the roles of miRNAs in plant biology. RNA Biol 2015; 12:1178-80. [PMID: 26400469 DOI: 10.1080/15476286.2015.1094601] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
MicroRNAs (miRNAs) are short RNA molecules negatively regulating the expression of many important genes in plants and animals. We have recently shown that plant primary transcripts of miRNAs encode peptides (miPEPs) able to increase specifically the transcription of their associated miRNA.(1) We discuss here the possibility of using miPEPs as a new tool for functional analysis of single members of miRNA families in plants, including in non-model plants, that could avoid transgenic transformation and minimize artifactual interpretation. We also raise several fundamental and crucial questions that need to be address for a deeper understanding of the cellular and molecular mechanisms underlining the regulatory activity of miPEPs.
Collapse
Affiliation(s)
- Jean-Malo Couzigou
- a Université de Toulouse; UPS; UMR5546; Laboratoire de Recherche en Sciences Végétales ; Castanet-Tolosan , France.,b CNRS; UMR5546 ; Castanet-Tolosan , France
| | - Dominique Lauressergues
- a Université de Toulouse; UPS; UMR5546; Laboratoire de Recherche en Sciences Végétales ; Castanet-Tolosan , France.,b CNRS; UMR5546 ; Castanet-Tolosan , France
| | - Guillaume Bécard
- a Université de Toulouse; UPS; UMR5546; Laboratoire de Recherche en Sciences Végétales ; Castanet-Tolosan , France.,b CNRS; UMR5546 ; Castanet-Tolosan , France
| | - Jean-Philippe Combier
- a Université de Toulouse; UPS; UMR5546; Laboratoire de Recherche en Sciences Végétales ; Castanet-Tolosan , France.,b CNRS; UMR5546 ; Castanet-Tolosan , France
| |
Collapse
|
37
|
Albuquerque JP, Tobias-Santos V, Rodrigues AC, Mury FB, da Fonseca RN. small ORFs: A new class of essential genes for development. Genet Mol Biol 2015; 38:278-83. [PMID: 26500431 PMCID: PMC4612599 DOI: 10.1590/s1415-475738320150009] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2015] [Accepted: 03/30/2015] [Indexed: 12/02/2022] Open
Abstract
Genes that contain small open reading frames (smORFs) constitute a new group of eukaryotic genes and are expected to represent 5% of the Drosophila melanogaster transcribed genes. In this review we provide a historical perspective of their recent discovery, describe their general mechanism and discuss the importance of smORFs for future genomic and transcriptomic studies. Finally, we discuss the biological role of the most studied smORF so far, the Mlpt/Pri/Tal gene in arthropods. The pleiotropic action of Mlpt/Pri/Tal in D. melanogaster suggests a complex evolutionary scenario that can be used to understand the origins, evolution and integration of smORFs into complex gene regulatory networks.
Collapse
Affiliation(s)
- João Paulo Albuquerque
- Laboratório Integrado de Bioquímica Hatisaburo Masuda, Núcleo em Ecologia e Desenvolvimento Sócio-Ambiental, Universidade Federal de Rio de Janeiro, Macaé, RJ, Brazil
| | - Vitória Tobias-Santos
- Laboratório Integrado de Bioquímica Hatisaburo Masuda, Núcleo em Ecologia e Desenvolvimento Sócio-Ambiental, Universidade Federal de Rio de Janeiro, Macaé, RJ, Brazil
| | - Aline Cáceres Rodrigues
- Laboratório Integrado de Bioquímica Hatisaburo Masuda, Núcleo em Ecologia e Desenvolvimento Sócio-Ambiental, Universidade Federal de Rio de Janeiro, Macaé, RJ, Brazil
| | - Flávia Borges Mury
- Laboratório Integrado de Bioquímica Hatisaburo Masuda, Núcleo em Ecologia e Desenvolvimento Sócio-Ambiental, Universidade Federal de Rio de Janeiro, Macaé, RJ, Brazil. ; Entomologia Molecular, Instituto Nacional de Ciência e Tecnologia, Universidade Federal de Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Rodrigo Nunes da Fonseca
- Laboratório Integrado de Bioquímica Hatisaburo Masuda, Núcleo em Ecologia e Desenvolvimento Sócio-Ambiental, Universidade Federal de Rio de Janeiro, Macaé, RJ, Brazil. ; Entomologia Molecular, Instituto Nacional de Ciência e Tecnologia, Universidade Federal de Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| |
Collapse
|
38
|
Linz DM, Tomoyasu Y. RNAi screening of developmental toolkit genes: a search for novel wing genes in the red flour beetle, Tribolium castaneum. Dev Genes Evol 2015; 225:11-22. [PMID: 25613748 DOI: 10.1007/s00427-015-0488-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2014] [Accepted: 01/05/2015] [Indexed: 10/24/2022]
Abstract
The amazing array of diversity among insect wings offers a powerful opportunity to study the mechanisms guiding morphological evolution. Studies in Drosophila (the fruit fly) have identified dozens of genes important for wing development. These genes are often called candidate genes, serving as an ideal starting point to study wing development in other insects. However, we also need to explore beyond the candidate genes to gain a more comprehensive view of insect wing evolution. As a first step away from the traditional candidate genes, we utilized Tribolium (the red flour beetle) as a model and assessed the potential involvement of a group of developmental toolkit genes (embryonic patterning genes) in beetle wing development. We hypothesized that the highly pleiotropic nature of these developmental genes would increase the likelihood of finding novel wing genes in Tribolium. Through the RNA interference screening, we found that Tc-cactus has a less characterized (but potentially evolutionarily conserved) role in wing development. We also found that the odd-skipped family genes are essential for the formation of the thoracic pleural plates, including the recently discovered wing serial homologs in Tribolium. In addition, we obtained several novel insights into the function of these developmental genes, such as the involvement of mille-pattes and Tc-odd-paired in metamorphosis. Despite these findings, no gene we examined was found to have novel wing-related roles unique in Tribolium. These results suggest a relatively conserved nature of developmental toolkit genes and highlight the limited degree to which these genes are co-opted during insect wing evolution.
Collapse
Affiliation(s)
- David M Linz
- Department of Biology, Miami University, 700E High St., Oxford, OH, 45056, USA
| | | |
Collapse
|
39
|
Pri peptides are mediators of ecdysone for the temporal control of development. Nat Cell Biol 2014; 16:1035-44. [DOI: 10.1038/ncb3052] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Accepted: 09/15/2014] [Indexed: 02/08/2023]
|
40
|
Aspden JL, Eyre-Walker YC, Phillips RJ, Amin U, Mumtaz MAS, Brocard M, Couso JP. Extensive translation of small Open Reading Frames revealed by Poly-Ribo-Seq. eLife 2014; 3:e03528. [PMID: 25144939 PMCID: PMC4359375 DOI: 10.7554/elife.03528] [Citation(s) in RCA: 258] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Accepted: 08/19/2014] [Indexed: 12/11/2022] Open
Abstract
Thousands of small Open Reading Frames (smORFs) with the potential to encode small peptides of fewer than 100 amino acids exist in our genomes. However, the number of smORFs actually translated, and their molecular and functional roles are still unclear. In this study, we present a genome-wide assessment of smORF translation by ribosomal profiling of polysomal fractions in Drosophila. We detect two types of smORFs bound by multiple ribosomes and thus undergoing productive translation. The ‘longer’ smORFs of around 80 amino acids resemble canonical proteins in translational metrics and conservation, and display a propensity to contain transmembrane motifs. The ‘dwarf’ smORFs are in general shorter (around 20 amino-acid long), are mostly found in 5′-UTRs and non-coding RNAs, are less well conserved, and have no bioinformatic indicators of peptide function. Our findings indicate that thousands of smORFs are translated in metazoan genomes, reinforcing the idea that smORFs are an abundant and fundamental genome component. DOI:http://dx.doi.org/10.7554/eLife.03528.001 To produce a protein, a stretch of DNA must first be transcribed to produce a molecule of messenger RNA (mRNA). The genetic information copied from the DNA is then read three letters at a time, in groups called codons. Each codon either encodes a particular amino acid to be added into a protein or provides further instructions: ‘start codons’ mark the beginning of a protein; ‘stop codons’ mark its end. The DNA between these two points is called an open reading frame (or ORF)—however, not all ORFs produce proteins. Most proteins are made of several hundred amino acids, but the genomes of animals contain thousands of ORFs that would generate much smaller proteins made of fewer than 100 amino acids, if they were translated. It is, however, unclear how many of these small ORFs are converted into mRNA molecules and functional proteins. Ribosomes are large molecular machines that translate the code in mRNA molecules and join together the appropriate amino acids in the right order to make a protein. Ribosome profiling is a technique that identifies which mRNA molecules are translated into proteins by determining the sequences of all the mRNA molecules bound to ribosomes at a particular moment. The mRNA sequences can then be compared with the sequence of the whole genome to work out which ORFs they correspond to. Ribosome profiling has been used to detect translated small ORFs, but the method yields a relatively high false positive rate as some mRNAs can bind to ribosomes without being translated. To better detect small protein-producing ORFs, Aspden et al. developed a technique based on ribosome profiling called Poly-Ribo-Seq. The method takes advantage of the fact that during active translation, clusters of multiple ribosomes, called polysomes, bind mRNAs. Poly-Ribo-Seq isolates these polysomes and determines the sequence bound by each of the ribosomes, thereby reducing the number of false positives. Applying Poly-Ribo-Seq to cells from the fruit fly Drosophila allowed Aspden et al. to identify two types of short ORF. The first type codes for proteins that are around 80 amino acids long and are translated with the same efficiency as larger ORFs. The sequences of these ORFs are found in other species, match at least in part sequences of known functional ORFs, and the proteins produced are found in specific locations inside cells. These small proteins may contribute to membrane integrity or function. Together, these properties suggest that these mRNAs create functional small proteins. The second pool consists of very small ORFs (‘dwarf smORFs’) that code for around 20 amino acids, which are not translated so often and do not show many similarities with other species. As the findings of Aspden et al. suggest that a large fraction of Drosophila small ORFs are translated into proteins, the next challenge will be to determine the roles of these small proteins in cells. DOI:http://dx.doi.org/10.7554/eLife.03528.002
Collapse
Affiliation(s)
- Julie L Aspden
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | | | - Rose J Phillips
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Unum Amin
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | | | - Michele Brocard
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Juan-Pablo Couso
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
41
|
Crappé J, Van Criekinge W, Menschaert G. Little things make big things happen: A summary of micropeptide encoding genes. EUPA OPEN PROTEOMICS 2014. [DOI: 10.1016/j.euprot.2014.02.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
42
|
Bazzini AA, Johnstone TG, Christiano R, Mackowiak SD, Obermayer B, Fleming ES, Vejnar CE, Lee MT, Rajewsky N, Walther TC, Giraldez AJ. Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation. EMBO J 2014; 33:981-93. [PMID: 24705786 DOI: 10.1002/embj.201488411] [Citation(s) in RCA: 501] [Impact Index Per Article: 45.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Identification of the coding elements in the genome is a fundamental step to understanding the building blocks of living systems. Short peptides (< 100 aa) have emerged as important regulators of development and physiology, but their identification has been limited by their size. We have leveraged the periodicity of ribosome movement on the mRNA to define actively translated ORFs by ribosome footprinting. This approach identifies several hundred translated small ORFs in zebrafish and human. Computational prediction of small ORFs from codon conservation patterns corroborates and extends these findings and identifies conserved sequences in zebrafish and human, suggesting functional peptide products (micropeptides). These results identify micropeptide-encoding genes in vertebrates, providing an entry point to define their function in vivo.
Collapse
Affiliation(s)
- Ariel A Bazzini
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Ericson M, Janes MA, Butter F, Mann M, Ullu E, Tschudi C. On the extent and role of the small proteome in the parasitic eukaryote Trypanosoma brucei. BMC Biol 2014; 12:14. [PMID: 24552149 PMCID: PMC3942054 DOI: 10.1186/1741-7007-12-14] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 02/06/2014] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Although technical advances in genomics and proteomics research have yielded a better understanding of the coding capacity of a genome, one major challenge remaining is the identification of all expressed proteins, especially those less than 100 amino acids in length. Such information can be particularly relevant to human pathogens, such as Trypanosoma brucei, the causative agent of African trypanosomiasis, since it will provide further insight into the parasite biology and life cycle. RESULTS Starting with 993 T. brucei transcripts, previously shown by RNA-Sequencing not to coincide with annotated coding sequences (CDS), homology searches revealed that 173 predicted short open reading frames in these transcripts are conserved across kinetoplastids with 13 also conserved in representative eukaryotes. Mining mass spectrometry data sets revealed 42 transcripts encoding at least one matching peptide. RNAi-induced down-regulation of these 42 transcripts revealed seven to be essential in insect-form trypanosomes with two also required for the bloodstream life cycle stage. To validate the specificity of the RNAi results, each lethal phenotype was rescued by co-expressing an RNAi-resistant construct of each corresponding CDS. These previously non-annotated essential small proteins localized to a variety of cell compartments, including the cell surface, mitochondria, nucleus and cytoplasm, inferring the diverse biological roles they are likely to play in T. brucei. We also provide evidence that one of these small proteins is required for replicating the kinetoplast (mitochondrial) DNA. CONCLUSIONS Our studies highlight the presence and significance of small proteins in a protist and expose potential new targets to block the survival of trypanosomes in the insect vector and/or the mammalian host.
Collapse
Affiliation(s)
- Megan Ericson
- Department of Epidemiology of Microbial Diseases, School of Public Health, Yale University, New Haven, CT, USA
| | - Michael A Janes
- Department of Epidemiology of Microbial Diseases, School of Public Health, Yale University, New Haven, CT, USA
- Current address: San Francisco General Hospital, Pulmonary & Critical Care, San Francisco, CA, USA
| | - Falk Butter
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
- Current address: Institute of Molecular Biology gGmbH, Mainz, Germany
| | - Matthias Mann
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Elisabetta Ullu
- Department of Cell Biology and Internal Medicine, School of Medicine, Yale University, New Haven, CT, USA
| | - Christian Tschudi
- Department of Epidemiology of Microbial Diseases, School of Public Health, Yale University, New Haven, CT, USA
| |
Collapse
|
44
|
The zinc finger homeodomain-2 gene of Drosophila controls Notch targets and regulates apoptosis in the tarsal segments. Dev Biol 2013; 385:350-65. [PMID: 24144920 DOI: 10.1016/j.ydbio.2013.10.011] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2013] [Revised: 10/06/2013] [Accepted: 10/12/2013] [Indexed: 12/19/2022]
Abstract
The development of the Drosophila leg is a good model to study processes of pattern formation, cell death and segmentation. Such processes require the coordinate activity of different genes and signaling pathways that progressively subdivide the leg territory into smaller domains. One of the main pathways needed for leg development is the Notch pathway, required for determining the proximo-distal axis of the leg and for the formation of the joints that separate different leg segments. The mechanisms required to coordinate such events are largely unknown. We describe here that the zinc finger homeodomain-2 (zfh-2) gene is highly expressed in cells that will form the leg joints and needed to establish a correct size and pattern in the distal leg. There is an early requirement of zfh-2 to establish the correct proximo-distal axis, but zfh-2 is also needed at late third instar to form the joint between the fourth and fifth tarsal segments. The expression of zfh-2 requires Notch activity but zfh-2 is necessary, in turn, to activate Notch targets such as Enhancer of split and big brain. zfh-2 is controlled by the Drosophila activator protein 2 gene and regulates the late expression of tarsal-less. In the absence of zfh-2 many cells ectopically express the pro-apoptotic gene head involution defective, activate caspase-3 and are positive for acridine orange, indicating they undergo apoptosis. Our results demonstrate the key role of zfh-2 in the control of cell death and Notch signaling during leg development.
Collapse
|
45
|
Drosophila distal-less and Rotund bind a single enhancer ensuring reliable and robust bric-a-brac2 expression in distinct limb morphogenetic fields. PLoS Genet 2013; 9:e1003581. [PMID: 23825964 PMCID: PMC3694829 DOI: 10.1371/journal.pgen.1003581] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Accepted: 05/05/2013] [Indexed: 11/19/2022] Open
Abstract
Most identified Drosophila appendage-patterning genes encode DNA-binding proteins, whose cross-regulatory interactions remain to be better characterized at the molecular level, notably by studying their direct binding to tissue-specific transcriptional enhancers. A fine-tuned spatio-temporal expression of bric-a-brac2 (bab2) along concentric rings is essential for proper proximo-distal (P-D) differentiation of legs and antennae. However, within the genetic interaction landscape governing limb development, no transcription factor directly controlling bab2 expression has been identified to date. Using site-targeted GFP reporter assay and BAC recombineering, we show here that restricted bab2 expression in leg and antennal imaginal discs relies on a single 567-bp-long cis-regulatory module (CRM), termed LAE (for leg and antennal enhancer). We show that this CRM (i) is necessary and sufficient to ensure normal bab2 activity in developing leg and antenna, and (ii) is structurally and functionally conserved among Drosophilidae. Through deletion and site-directed mutagenesis approaches, we identified within the LAE essential sequence motifs required in both leg and antennal tissues. Using genetic and biochemical tests, we establish that in the LAE (i) a key TAAT-rich activator motif interacts with the homeodomain P-D protein Distal-less (Dll) and (ii) a single T-rich activator motif binds the C2H2 zinc-finger P-D protein Rotund (Rn), leading to bab2 up-regulation respectively in all or specifically in the proximal-most ring(s), both in leg and antenna. Joint ectopic expression of Dll and Rn is sufficient to cell-autonomously activate endogenous bab2 and LAE-driven reporter expression in wing and haltere cells. Our findings indicate that accuracy, reliability and robustness of developmental gene expression do not necessarily require cis-regulatory information redundancy. In insects, leg and antenna are homologous limbs, though derive from a single ancestral appendage. In Drosophila, leg and antennal development along the proximo-distal (P-D) axis relies on relatively-well known genetic cascades, in which most appendage-patterning genes encode transcription factors (TF). However, their cross-regulatory interactions remain to be better characterized at the molecular level. A fine-tuned expression of the bric-a-brac2 (bab2) gene is essential for normal leg and antennal segmentation. However, within the genetic cascades governing P-D limb development, no TF directly controlling bab2 expression has been identified to date. We show here that restricted bab2 expression in developing leg and antenna is governed by a single enhancer, termed LAE, which is necessary and sufficient in-vivo to ensure bab2 functions there. We show that leg and antennal cis-regulatory elements are closely associated and that essential LAE sites interact with Distal-less (Dll) and Rotund (Rn) TFs, leading to bab2 activation in all or specifically in the proximal-most expressing cells, respectively. Finally, joint ectopic expression of Dll and Rn is sufficient to instruct wing and haltere cells to up-regulate bab2. Taken together, our work indicates that a single enhancer is necessary and sufficient to reliably govern bab2 expression in distinct morphogenetic fields.
Collapse
|
46
|
Roy A, Shimizu S, Kiya T, Mita K, Iwami M. Identification of 20-hydroxyecdysone-inducible genes from larval brain of the silkworm, Bombyx mori, and their expression analysis. Zoolog Sci 2012; 29:333-9. [PMID: 22559968 DOI: 10.2108/zsj.29.333] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The insect brain secretes prothoracicotropic hormone (PTTH), which stimulates the prothoracic gland to synthesize ecdysone. The active metabolite of ecdysone, 20-hydroxyecdysone (20E), works through ecdysone receptor (EcR) and ultraspiracle (USP) to initiate molting and metamorphosis by regulating downstream genes. Previously, we found that EcR was expressed in the PTTH-producing neurosecretory cells (PTPCs) in larval brain of the silkworm Bombyx mori, suggesting that PTPCs function as the master cells of development under the regulation of 20E. To gain a better understanding of the molecular mechanism of the 20E control of PTPCs, we performed a comprehensive screening of genes induced by 20E using DNA microarray with brains of day-2 fifth instar silkworm larvae. Forty-one genes showed greater than twofold changes caused by artificial application of 20E. A subsequent semiquantitative screening identified ten genes upregulated by 20E, four of which were novel or not previously identified as 20E-response genes. Developmental profiling determined that two genes, UP4 and UP5, were correlated with the endogenous ecdysteroid titer. Whole-mount in situ hybridization showed exclusive expression of these two genes in two pairs of cells in the larval brain in response to 20E-induction, suggesting that the cells are PTPCs. BLAST searches revealed that UP4 and UP5 are Bombyx homologs of vrille and tarsal-less, respectively. The present study identifies 20E-induced genes that may be involved in the ecdysone signal hierarchies underlying pupal-adult development and/or the 20E regulation of PTPCs.
Collapse
Affiliation(s)
- Anuradha Roy
- Division of Life Sciences, Graduate school of Natural Science and Technology, Kanazawa University, Kakuma-machi, Kanazawa 920-1192, Japan
| | | | | | | | | |
Collapse
|
47
|
Estella C, Voutev R, Mann RS. A dynamic network of morphogens and transcription factors patterns the fly leg. Curr Top Dev Biol 2012; 98:173-98. [PMID: 22305163 DOI: 10.1016/b978-0-12-386499-4.00007-0] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Animal appendages require a proximodistal (PD) axis, which forms orthogonally from the two main body axes, anteroposterior and dorsoventral. In this review, we discuss recent advances that begin to provide insights into the molecular mechanisms controlling PD axis formation in the Drosophila leg. In this case, two morphogens, Wingless (Wg) and Decapentaplegic (Dpp), initiate a genetic cascade that, together with growth of the leg imaginal disc, establishes the PD axis. The analysis of cis-regulatory modules (CRMs) that control the expression of genes at different positions along the PD axis has been particularly valuable in dissecting this complex process. From these experiments, it appears that only one concentration of Wg and Dpp are required to initiate PD axis formation by inducing the expression of Distal-less (Dll), a homeodomain-encoding gene that is required for leg development. Once Dll is turned on, it activates the medially expressed gene dachshund (dac). Cross-regulation between Dll and dac, together with cell proliferation in the growing leg imaginal disc, results in the formation of a rudimentary PD axis. Wg and Dpp also initiate the expression of ligands for the EGFR pathway, which in turn induces the expression of a series of target genes that pattern the distal-most portion of the leg.
Collapse
Affiliation(s)
- Carlos Estella
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, USA
| | | | | |
Collapse
|
48
|
Progressive tarsal patterning in the Drosophila by temporally dynamic regulation of transcription factor genes. Dev Biol 2012; 361:450-62. [DOI: 10.1016/j.ydbio.2011.10.031] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Revised: 10/24/2011] [Accepted: 10/25/2011] [Indexed: 10/15/2022]
|
49
|
Ladoukakis E, Pereira V, Magny EG, Eyre-Walker A, Couso JP. Hundreds of putatively functional small open reading frames in Drosophila. Genome Biol 2011; 12:R118. [PMID: 22118156 PMCID: PMC3334604 DOI: 10.1186/gb-2011-12-11-r118] [Citation(s) in RCA: 129] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2011] [Revised: 11/04/2011] [Accepted: 11/25/2011] [Indexed: 12/22/2022] Open
Abstract
Background The relationship between DNA sequence and encoded information is still an unsolved puzzle. The number of protein-coding genes in higher eukaryotes identified by genome projects is lower than was expected, while a considerable amount of putatively non-coding transcription has been detected. Functional small open reading frames (smORFs) are known to exist in several organisms. However, coding sequence detection methods are biased against detecting such very short open reading frames. Thus, a substantial number of non-canonical coding regions encoding short peptides might await characterization. Results Using bio-informatics methods, we have searched for smORFs of less than 100 amino acids in the putatively non-coding euchromatic DNA of Drosophila melanogaster, and initially identified nearly 600,000 of them. We have studied the pattern of conservation of these smORFs as coding entities between D. melanogaster and Drosophila pseudoobscura, their presence in syntenic and in transcribed regions of the genome, and their ratio of conservative versus non-conservative nucleotide changes. For negative controls, we compared the results with those obtained using random short sequences, while a positive control was provided by smORFs validated by proteomics data. Conclusions The combination of these analyses led us to postulate the existence of at least 401 functional smORFs in Drosophila, with the possibility that as many as 4,561 such functional smORFs may exist.
Collapse
|
50
|
Pueyo JI, Couso JP. Tarsal-less peptides control Notch signalling through the Shavenbaby transcription factor. Dev Biol 2011; 355:183-93. [PMID: 21527259 PMCID: PMC3940869 DOI: 10.1016/j.ydbio.2011.03.033] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2011] [Revised: 03/28/2011] [Accepted: 03/29/2011] [Indexed: 11/24/2022]
Abstract
The formation of signalling boundaries is one of the strategies employed by the Notch (N) pathway to give rise to two distinct signalling populations of cells. Unravelling the mechanisms involved in the regulation of these signalling boundaries is essential to understanding the role of N during development and diseases. The function of N in the segmentation of the Drosophila leg provides a good system to pursue these mechanisms at the molecular level. Transcriptional and post-transcriptional regulation of the N ligands, Serrate (Ser) and Delta (Dl) generates a signalling boundary that allows the directional activation of N in the distalmost part of the segment, the presumptive joint. A negative feedback loop between odd-skipped-related genes and the N pathway maintains this signalling boundary throughout development in the true joints. However, the mechanisms controlling N signalling boundaries in the tarsal joints are unknown. Here we show that the non-canonical tarsal-less (tal) gene (also known as pri), which encodes for four small related peptides, is expressed in the N-activated region and required for joint development in the tarsi during pupal development. This function of tal is both temporally and functionally separate from the tal-mediated tarsal intercalation during mid-third instar that we reported previously. In the pupal function described here, N signalling activates tal expression and reciprocally Tal peptides feedback on N by repressing the transcription of Dl in the tarsal joints. This Tal-induced repression of Dl is mediated by the post-transcriptional activation of the Shavenbaby transcription factor, in a similar manner as it has been recently described in the embryo. Thus, a negative feedback loop involving Tal regulates the formation and maintenance of a Dl+/Dl- boundary in the tarsal segments highlighting an ancient mechanism for the regulation of N signalling based on the action of small cell signalling peptides.
Collapse
Affiliation(s)
| | - Juan Pablo Couso
- Corresponding author: School of Life Sciences, University of Sussex, Falmer, Brighton, United Kingdom, BN1 9QG.
| |
Collapse
|