1
|
Margasyuk S, Kuznetsova A, Zavileyskiy L, Vlasenok M, Skvortsov D, Pervouchine D. Human introns contain conserved tissue-specific cryptic poison exons. NAR Genom Bioinform 2024; 6:lqae163. [PMID: 39664813 PMCID: PMC11632617 DOI: 10.1093/nargab/lqae163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Revised: 10/10/2024] [Accepted: 11/10/2024] [Indexed: 12/13/2024] Open
Abstract
Eukaryotic cells express a large number of transcripts from a single gene due to alternative splicing. Despite hundreds of thousands of splice isoforms being annotated in databases, it has been reported that the current exon catalogs remain incomplete. At the same time, introns of human protein-coding (PC) genes contain a large number of evolutionarily conserved elements with unknown function. Here, we explore the possibility that some of them represent cryptic exons that are expressed in rare conditions. We identified a group of cryptic exons that are similar to the annotated exons in terms of evolutionary conservation and RNA-seq read coverage in the Genotype-Tissue Expression dataset. Most of them were poison, i.e. generated an nonsense-mediated decay (NMD) isoform upon inclusion, and many showed signs of tissue-specific and cancer-specific expression and regulation. We performed RNA-seq in A549 cell line treated with cycloheximide to inactivate NMD and confirmed using quantitative polymerase chain reaction that seven of eight exons tested are, indeed, expressed. This study shows that introns of human PC genes contain cryptic poison exons, which reside in conserved intronic regions and remain not fully annotated due to insufficient representation in RNA-seq libraries.
Collapse
Affiliation(s)
- Sergey Margasyuk
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar, 30, 121205, Moscow, Russia
| | - Antonina Kuznetsova
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar, 30, 121205, Moscow, Russia
| | - Lev Zavileyskiy
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar, 30, 121205, Moscow, Russia
| | - Maria Vlasenok
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar, 30, 121205, Moscow, Russia
| | - Dmitry Skvortsov
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar, 30, 121205, Moscow, Russia
- Faculty of Chemistry, Moscow State University, Ul Kolmogorova, 1, 119991, Moscow, Russia
| | - Dmitri D Pervouchine
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar, 30, 121205, Moscow, Russia
| |
Collapse
|
2
|
Fajac A, Simeonova I, Leemput J, Gabriel M, Morin A, Lejour V, Hamon A, Rakotopare J, Vaysse-Zinkhöfer W, Eldawra E, Pinskaya M, Morillon A, Bourdon JC, Bardot B, Toledo F. Mutant mice lacking alternatively spliced p53 isoforms unveil Ackr4 as a male-specific prognostic factor in Myc-driven B-cell lymphomas. eLife 2024; 13:RP92774. [PMID: 39298333 PMCID: PMC11412721 DOI: 10.7554/elife.92774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/21/2024] Open
Abstract
The Trp53 gene encodes several isoforms of elusive biological significance. Here, we show that mice lacking the Trp53 alternatively spliced (AS) exon, thereby expressing the canonical p53 protein but not isoforms with the AS C-terminus, have unexpectedly lost a male-specific protection against Myc-induced B-cell lymphomas. Lymphomagenesis was delayed in Trp53+/+Eμ-Myc males compared to Trp53ΔAS/ΔAS Eμ-Myc males, but also compared to Trp53+/+Eμ-Myc and Trp53ΔAS/ΔAS Eμ-Myc females. Pre-tumoral splenic cells from Trp53+/+Eμ-Myc males exhibited a higher expression of Ackr4, encoding an atypical chemokine receptor with tumor suppressive effects. We identified Ackr4 as a p53 target gene whose p53-mediated transactivation is inhibited by estrogens, and as a male-specific factor of good prognosis relevant for murine Eμ-Myc-induced and human Burkitt lymphomas. Furthermore, the knockout of ACKR4 increased the chemokine-guided migration of Burkitt lymphoma cells. These data demonstrate the functional relevance of alternatively spliced p53 isoforms and reveal sex disparities in Myc-driven lymphomagenesis.
Collapse
Affiliation(s)
- Anne Fajac
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Iva Simeonova
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Julia Leemput
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Marc Gabriel
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
- Non Coding RNA, Epigenetic and Genome Fluidity, Institut CurieParisFrance
| | - Aurélie Morin
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Vincent Lejour
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Annaïg Hamon
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Jeanne Rakotopare
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Wilhelm Vaysse-Zinkhöfer
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Eliana Eldawra
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Marina Pinskaya
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
- Non Coding RNA, Epigenetic and Genome Fluidity, Institut CurieParisFrance
| | - Antonin Morillon
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
- School of Medicine, Ninewells Hospital, University of DundeeDundeeUnited Kingdom
| | | | - Boris Bardot
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| | - Franck Toledo
- Genetics of Tumor Suppression, Institut CurieParisFrance
- CNRS UMR3244ParisFrance
- Sorbonne UniversityParisFrance
- PSL Research UniversityParisFrance
| |
Collapse
|
3
|
Kiseleva OI, Arzumanian VA, Kurbatov IY, Poverennaya EV. In silico and in cellulo approaches for functional annotation of human protein splice variants. BIOMEDITSINSKAIA KHIMIIA 2024; 70:315-328. [PMID: 39324196 DOI: 10.18097/pbmc20247005315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/27/2024]
Abstract
The elegance of pre-mRNA splicing mechanisms continues to interest scientists even after over a half century, since the discovery of the fact that coding regions in genes are interrupted by non-coding sequences. The vast majority of human genes have several mRNA variants, coding structurally and functionally different protein isoforms in a tissue-specific manner and with a linkage to specific developmental stages of the organism. Alteration of splicing patterns shifts the balance of functionally distinct proteins in living systems, distorts normal molecular pathways, and may trigger the onset and progression of various pathologies. Over the past two decades, numerous studies have been conducted in various life sciences disciplines to deepen our understanding of splicing mechanisms and the extent of their impact on the functioning of living systems. This review aims to summarize experimental and computational approaches used to elucidate the functions of splice variants of a single gene based on our experience accumulated in the laboratory of interactomics of proteoforms at the Institute of Biomedical Chemistry (IBMC) and best global practices.
Collapse
Affiliation(s)
- O I Kiseleva
- Institute of Biomedical Chemistry, Moscow, Russia
| | | | | | | |
Collapse
|
4
|
Fair B, Buen Abad Najar CF, Zhao J, Lozano S, Reilly A, Mossian G, Staley JP, Wang J, Li YI. Global impact of unproductive splicing on human gene expression. Nat Genet 2024; 56:1851-1861. [PMID: 39223315 PMCID: PMC11387194 DOI: 10.1038/s41588-024-01872-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 07/16/2024] [Indexed: 09/04/2024]
Abstract
Alternative splicing (AS) in human genes is widely viewed as a mechanism for enhancing proteomic diversity. AS can also impact gene expression levels without increasing protein diversity by producing 'unproductive' transcripts that are targeted for rapid degradation by nonsense-mediated decay (NMD). However, the relative importance of this regulatory mechanism remains underexplored. To better understand the impact of AS-NMD relative to other regulatory mechanisms, we analyzed population-scale genomic data across eight molecular assays, covering various stages from transcription to cytoplasmic decay. We report threefold more unproductive splicing compared with prior estimates using steady-state RNA. This unproductive splicing compounds across multi-intronic genes, resulting in 15% of transcript molecules from protein-coding genes being unproductive. Leveraging genetic variation across cell lines, we find that GWAS trait-associated loci explained by AS are as often associated with NMD-induced expression level differences as with differences in protein isoform usage. Our findings suggest that much of the impact of AS is mediated by NMD-induced changes in gene expression rather than diversification of the proteome.
Collapse
Affiliation(s)
- Benjamin Fair
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
| | | | - Junxing Zhao
- Department of Medicinal Chemistry, University of Kansas, Lawrence, KS, USA
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Stephanie Lozano
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
- Center for Neuroscience, University of California Davis, Davis, CA, USA
| | - Austin Reilly
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Gabriela Mossian
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Jonathan P Staley
- Department of Molecular Genetics and Cell Biology, University of Chicago, Chicago, IL, USA
| | - Jingxin Wang
- Department of Medicinal Chemistry, University of Kansas, Lawrence, KS, USA
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Yang I Li
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA.
- Department of Human Genetics, University of Chicago, Chicago, IL, USA.
| |
Collapse
|
5
|
Ulicevic J, Shao Z, Jasnovidova O, Bressin A, Gajos M, Ng AH, Annaldasula S, Meierhofer D, Church GM, Busskamp V, Mayer A. Uncovering the dynamics and consequences of RNA isoform changes during neuronal differentiation. Mol Syst Biol 2024; 20:767-798. [PMID: 38755290 PMCID: PMC11219738 DOI: 10.1038/s44320-024-00039-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 04/16/2024] [Accepted: 04/18/2024] [Indexed: 05/18/2024] Open
Abstract
Static gene expression programs have been extensively characterized in stem cells and mature human cells. However, the dynamics of RNA isoform changes upon cell-state-transitions during cell differentiation, the determinants and functional consequences have largely remained unclear. Here, we established an improved model for human neurogenesis in vitro that is amenable for systems-wide analyses of gene expression. Our multi-omics analysis reveals that the pronounced alterations in cell morphology correlate strongly with widespread changes in RNA isoform expression. Our approach identifies thousands of new RNA isoforms that are expressed at distinct differentiation stages. RNA isoforms mainly arise from exon skipping and the alternative usage of transcription start and polyadenylation sites during human neurogenesis. The transcript isoform changes can remodel the identity and functions of protein isoforms. Finally, our study identifies a set of RNA binding proteins as a potential determinant of differentiation stage-specific global isoform changes. This work supports the view of regulated isoform changes that underlie state-transitions during neurogenesis.
Collapse
Affiliation(s)
- Jelena Ulicevic
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Department of Biology, Chemistry and Pharmacy, Freie Universität Berlin, Berlin, Germany
| | - Zhihao Shao
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| | - Olga Jasnovidova
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Annkatrin Bressin
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Martyna Gajos
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin, Germany
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| | - Alex Hm Ng
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, USA
- Wyss Institute for Biologically Inspired Engineering at Harvard University, Boston, USA
| | - Siddharth Annaldasula
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - David Meierhofer
- Mass Spectrometry Facility, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - George M Church
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, USA
- Wyss Institute for Biologically Inspired Engineering at Harvard University, Boston, USA
| | - Volker Busskamp
- Department of Ophthalmology, University Hospital Bonn, Medical Faculty, Bonn, Germany
| | - Andreas Mayer
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Berlin, Germany.
| |
Collapse
|
6
|
Qin Z, Yang J, Zhang K, Gao X, Ran Q, Xu Y, Wang Z, Lou D, Huang C, Zellmer L, Meng G, Chen N, Ma H, Wang Z, Liao DJ. Updating mRNA variants of the human RSK4 gene and their expression in different stressed situations. Heliyon 2024; 10:e27475. [PMID: 38560189 PMCID: PMC10980951 DOI: 10.1016/j.heliyon.2024.e27475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2024] [Revised: 02/11/2024] [Accepted: 02/29/2024] [Indexed: 04/04/2024] Open
Abstract
We determined RNA spectrum of the human RSK4 (hRSK4) gene (also called RPS6KA6) and identified 29 novel mRNA variants derived from alternative splicing, which, plus the NCBI-documented ones and the five we reported previously, totaled 50 hRSK4 RNAs that, by our bioinformatics analyses, encode 35 hRSK4 protein isoforms of 35-762 amino acids. Many of the mRNAs are bicistronic or tricistronic for hRSK4. The NCBI-normalized NM_014496.5 and the protein it encodes are designated herein as the Wt-1 mRNA and protein, respectively, whereas the NM_001330512.1 and the long protein it encodes are designated as the Wt-2 mRNA and protein, respectively. Many of the mRNA variants responded differently to different situations of stress, including serum starvation, a febrile temperature, treatment with ethanol or ethanol-extracted clove buds (an herbal medicine), whereas the same stressed situation often caused quite different alterations among different mRNA variants in different cell lines. Mosifloxacin, an antibiotics and also a functional inhibitor of hRSK4, could inhibit the expression of certain hRSK4 mRNA variants. The hRSK4 gene likely uses alternative splicing as a handy tool to adapt to different stressed situations, and the mRNA and protein multiplicities may partly explain the incongruous literature on its expression and comports.
Collapse
Affiliation(s)
- Zhenwei Qin
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Jianglin Yang
- Center for Clinical Laboratories, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Rd, Guiyang, 550004, Guizhou Province, China
- Key Lab of Endemic and Ethnic Diseases of the Ministry of Education of China in Guizhou Medical University, Guiyang, 550004, Guizhou Province, China
| | - Keyin Zhang
- Department of Pathology, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Xia Gao
- Department of Pathology, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Qianchuan Ran
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Yuanhong Xu
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Zhi Wang
- Department of Pathology, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Didong Lou
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Chunhua Huang
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Lucas Zellmer
- Department of Medicine, Hennepin County Medical Center, 730 South 8th St., Minneapolis, MN, 55415, USA
| | - Guangxue Meng
- Department of Oral and Maxillofacial Surgery, School of Stomatology, Guizhou Medical University, 9 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Na Chen
- Department of Oral and Maxillofacial Surgery, School of Stomatology, Guizhou Medical University, 9 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Hong Ma
- Department of Oral and Maxillofacial Surgery, School of Stomatology, Guizhou Medical University, 9 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Zhe Wang
- State Key Laboratory of Cancer Biology, Department of Pathology, Xijing Hospital, Air Force Medical University, 169 Changle West Road, Xi'an, 710032, China
| | - Dezhong Joshua Liao
- Center for Clinical Laboratories, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Rd, Guiyang, 550004, Guizhou Province, China
- Key Lab of Endemic and Ethnic Diseases of the Ministry of Education of China in Guizhou Medical University, Guiyang, 550004, Guizhou Province, China
| |
Collapse
|
7
|
Bénitière F, Necsulea A, Duret L. Random genetic drift sets an upper limit on mRNA splicing accuracy in metazoans. eLife 2024; 13:RP93629. [PMID: 38470242 DOI: 10.7554/elife.93629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024] Open
Abstract
Most eukaryotic genes undergo alternative splicing (AS), but the overall functional significance of this process remains a controversial issue. It has been noticed that the complexity of organisms (assayed by the number of distinct cell types) correlates positively with their genome-wide AS rate. This has been interpreted as evidence that AS plays an important role in adaptive evolution by increasing the functional repertoires of genomes. However, this observation also fits with a totally opposite interpretation: given that 'complex' organisms tend to have small effective population sizes (Ne), they are expected to be more affected by genetic drift, and hence more prone to accumulate deleterious mutations that decrease splicing accuracy. Thus, according to this 'drift barrier' theory, the elevated AS rate in complex organisms might simply result from a higher splicing error rate. To test this hypothesis, we analyzed 3496 transcriptome sequencing samples to quantify AS in 53 metazoan species spanning a wide range of Ne values. Our results show a negative correlation between Ne proxies and the genome-wide AS rates among species, consistent with the drift barrier hypothesis. This pattern is dominated by low abundance isoforms, which represent the vast majority of the splice variant repertoire. We show that these low abundance isoforms are depleted in functional AS events, and most likely correspond to errors. Conversely, the AS rate of abundant isoforms, which are relatively enriched in functional AS events, tends to be lower in more complex species. All these observations are consistent with the hypothesis that variation in AS rates across metazoans reflects the limits set by drift on the capacity of selection to prevent gene expression errors.
Collapse
Affiliation(s)
- Florian Bénitière
- Laboratoire de Biometrie et Biologie Evolutive, CNRS, Universite Lyon 1, Villeurbanne, France
| | - Anamaria Necsulea
- Laboratoire de Biometrie et Biologie Evolutive, CNRS, Universite Lyon 1, Villeurbanne, France
| | - Laurent Duret
- Laboratoire de Biometrie et Biologie Evolutive, CNRS, Universite Lyon 1, Villeurbanne, France
| |
Collapse
|
8
|
Fair B, Najar CBA, Zhao J, Lozano S, Reilly A, Mossian G, Staley JP, Wang J, Li YI. Global impact of aberrant splicing on human gene expression levels. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.13.557588. [PMID: 37745605 PMCID: PMC10515962 DOI: 10.1101/2023.09.13.557588] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Alternative splicing (AS) is pervasive in human genes, yet the specific function of most AS events remains unknown. It is widely assumed that the primary function of AS is to diversify the proteome, however AS can also influence gene expression levels by producing transcripts rapidly degraded by nonsense-mediated decay (NMD). Currently, there are no precise estimates for how often the coupling of AS and NMD (AS-NMD) impacts gene expression levels because rapidly degraded NMD transcripts are challenging to capture. To better understand the impact of AS on gene expression levels, we analyzed population-scale genomic data in lymphoblastoid cell lines across eight molecular assays that capture gene regulation before, during, and after transcription and cytoplasmic decay. Sequencing nascent mRNA transcripts revealed frequent aberrant splicing of human introns, which results in remarkably high levels of mRNA transcripts subject to NMD. We estimate that ~15% of all protein-coding transcripts are degraded by NMD, and this estimate increases to nearly half of all transcripts for lowly-expressed genes with many introns. Leveraging genetic variation across cell lines, we find that GWAS trait-associated loci explained by AS are similarly likely to associate with NMD-induced expression level differences as with differences in protein isoform usage. Additionally, we used the splice-switching drug risdiplam to perturb AS at hundreds of genes, finding that ~3/4 of the splicing perturbations induce NMD. Thus, we conclude that AS-NMD substantially impacts the expression levels of most human genes. Our work further suggests that much of the molecular impact of AS is mediated by changes in protein expression levels rather than diversification of the proteome.
Collapse
Affiliation(s)
- Benjamin Fair
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Carlos Buen Abad Najar
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Junxing Zhao
- Department of Medicinal Chemistry, University of Kansas, Lawrence, KS 66047, USA
| | - Stephanie Lozano
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
- Present address: Center for Neuroscience, University of California Davis, Davis, CA 95618, USA
| | - Austin Reilly
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Gabriela Mossian
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Jonathan P Staley
- Department of Molecular Genetics and Cell Biology, University of Chicago, Chicago, IL 60637, USA
| | - Jingxin Wang
- Department of Medicinal Chemistry, University of Kansas, Lawrence, KS 66047, USA
| | - Yang I Li
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| |
Collapse
|
9
|
Dou Y, Liu Y, Yi X, Olsen LK, Zhu H, Gao Q, Zhou H, Zhang B. SEPepQuant enhances the detection of possible isoform regulations in shotgun proteomics. Nat Commun 2023; 14:5809. [PMID: 37726316 PMCID: PMC10509223 DOI: 10.1038/s41467-023-41558-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 09/06/2023] [Indexed: 09/21/2023] Open
Abstract
Shotgun proteomics is essential for protein identification and quantification in biomedical research, but protein isoform characterization is challenging due to the extensive number of peptides shared across proteins, hindering our understanding of protein isoform regulation and their roles in normal and disease biology. We systematically assess the challenge and opportunities of shotgun proteomics-based protein isoform characterization using in silico and experimental data, and then present SEPepQuant, a graph theory-based approach to maximize isoform characterization. Using published data from one induced pluripotent stem cell study and two human hepatocellular carcinoma studies, we demonstrate the ability of SEPepQuant in addressing the key limitations of existing methods, providing more comprehensive isoform-level characterization, identifying hundreds of isoform-level regulation events, and facilitating streamlined cross-study comparisons. Our analysis provides solid evidence to support a widespread role of protein isoform regulation in normal and disease processes, and SEPepQuant has broad applications to biological and translational research.
Collapse
Affiliation(s)
- Yongchao Dou
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Yuejia Liu
- School of Chinese Materia Medica, Nanjing University of Chinese Medicine, 138 Xianlin Avenue, 210023, Nanjing, Jiangsu, China
| | - Xinpei Yi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Lindsey K Olsen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Hongwen Zhu
- Department of Analytical Chemistry, State Key Laboratory of Drug Research and CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, 201203, Shanghai, China
| | - Qiang Gao
- Department of Liver Surgery and Transplantation, Liver Cancer Institute, Zhongshan Hospital, Fudan University, and Key Laboratory of Carcinogenesis and Cancer Invasion of Ministry of Education, 180 Fenglin Road, 200032, Shanghai, China
| | - Hu Zhou
- School of Chinese Materia Medica, Nanjing University of Chinese Medicine, 138 Xianlin Avenue, 210023, Nanjing, Jiangsu, China
- Department of Analytical Chemistry, State Key Laboratory of Drug Research and CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, 201203, Shanghai, China
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.
| |
Collapse
|
10
|
Varabyou A, Erdogdu B, Salzberg SL, Pertea M. Investigating Open Reading Frames in Known and Novel Transcripts using ORFanage. NATURE COMPUTATIONAL SCIENCE 2023; 3:700-708. [PMID: 38098813 PMCID: PMC10718564 DOI: 10.1038/s43588-023-00496-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 07/05/2023] [Indexed: 12/17/2023]
Abstract
ORFanage is a system designed to assign open reading frames (ORFs) to known and novel gene transcripts while maximizing similarity to annotated proteins. The primary intended use of ORFanage is the identification of ORFs in the assembled results of RNA sequencing experiments, a capability that most transcriptome assembly methods do not have. Our experiments demonstrate how ORFanage can be used to find novel protein variants in RNA-seq datasets, and to improve the annotations of ORFs in tens of thousands of transcript models in the human annotation databases. Through its implementation of a highly accurate and efficient pseudo-alignment algorithm, ORFanage is substantially faster than other ORF annotation methods, enabling its application to very large datasets. When used to analyze transcriptome assemblies, ORFanage can aid in the separation of signal from transcriptional noise and the identification of likely functional transcript variants, ultimately advancing our understanding of biology and medicine.
Collapse
Affiliation(s)
- Ales Varabyou
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21211, USA
| | - Beril Erdogdu
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Steven L. Salzberg
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
- Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Mihaela Pertea
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
| |
Collapse
|
11
|
Varabyou A, Erdogdu B, Salzberg SL, Pertea M. Investigating Open Reading Frames in Known and Novel Transcripts using ORFanage. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.23.533704. [PMID: 36993373 PMCID: PMC10055401 DOI: 10.1101/2023.03.23.533704] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]
Abstract
ORFanage is a system designed to assign open reading frames (ORFs) to both known and novel gene transcripts while maximizing similarity to annotated proteins. The primary intended use of ORFanage is the identification of ORFs in the assembled results of RNA sequencing (RNA-seq) experiments, a capability that most transcriptome assembly methods do not have. Our experiments demonstrate how ORFanage can be used to find novel protein variants in RNA-seq datasets, and to improve the annotations of ORFs in tens of thousands of transcript models in the RefSeq and GENCODE human annotation databases. Through its implementation of a highly accurate and efficient pseudo-alignment algorithm, ORFanage is substantially faster than other ORF annotation methods, enabling its application to very large datasets. When used to analyze transcriptome assemblies, ORFanage can aid in the separation of signal from transcriptional noise and the identification of likely functional transcript variants, ultimately advancing our understanding of biology and medicine.
Collapse
Affiliation(s)
- Ales Varabyou
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21211, USA
| | - Beril Erdogdu
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Steven L Salzberg
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
- Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Mihaela Pertea
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
| |
Collapse
|
12
|
Manuel JM, Guilloy N, Khatir I, Roucou X, Laurent B. Re-evaluating the impact of alternative RNA splicing on proteomic diversity. Front Genet 2023; 14:1089053. [PMID: 36845399 PMCID: PMC9947481 DOI: 10.3389/fgene.2023.1089053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 01/23/2023] [Indexed: 02/11/2023] Open
Abstract
Alternative splicing (AS) constitutes a mechanism by which protein-coding genes and long non-coding RNA (lncRNA) genes produce more than a single mature transcript. From plants to humans, AS is a powerful process that increases transcriptome complexity. Importantly, splice variants produced from AS can potentially encode for distinct protein isoforms which can lose or gain specific domains and, hence, differ in their functional properties. Advances in proteomics have shown that the proteome is indeed diverse due to the presence of numerous protein isoforms. For the past decades, with the help of advanced high-throughput technologies, numerous alternatively spliced transcripts have been identified. However, the low detection rate of protein isoforms in proteomic studies raised debatable questions on whether AS contributes to proteomic diversity and on how many AS events are really functional. We propose here to assess and discuss the impact of AS on proteomic complexity in the light of the technological progress, updated genome annotation, and current scientific knowledge.
Collapse
Affiliation(s)
- Jeru Manoj Manuel
- Research Center on Aging, Centre Intégré Universitaire de Santé et Services Sociaux de l’Estrie-Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, QC, Canada,Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Noé Guilloy
- Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Inès Khatir
- Research Center on Aging, Centre Intégré Universitaire de Santé et Services Sociaux de l’Estrie-Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, QC, Canada,Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Xavier Roucou
- Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada,Centre de Recherche du Centre Hospitalier Universitaire de Sherbrooke (CRCHUS), Sherbrooke, QC, Canada,Quebec Network for Research on Protein Function Structure and Engineering, PROTEO, Québec, QC, Canada
| | - Benoit Laurent
- Research Center on Aging, Centre Intégré Universitaire de Santé et Services Sociaux de l’Estrie-Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, QC, Canada,Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada,*Correspondence: Benoit Laurent,
| |
Collapse
|
13
|
Martinez-Gomez L, Cerdán-Vélez D, Abascal F, Tress ML. Origins and Evolution of Human Tandem Duplicated Exon Substitution Events. Genome Biol Evol 2022; 14:6809199. [PMID: 36346145 PMCID: PMC9741552 DOI: 10.1093/gbe/evac162] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 10/25/2022] [Accepted: 10/29/2022] [Indexed: 11/10/2022] Open
Abstract
The mutually exclusive splicing of tandem duplicated exons produces protein isoforms that are identical save for a homologous region that allows for the fine tuning of protein function. Tandem duplicated exon substitution events are rare, yet highly important alternative splicing events. Most events are ancient, their isoforms are highly expressed, and they have significantly more pathogenic mutations than other splice events. Here, we analyzed the physicochemical properties and functional roles of the homologous polypeptide regions produced by the 236 tandem duplicated exon substitutions annotated in the human gene set. We find that the most important structural and functional residues in these homologous regions are maintained, and that most changes are conservative rather than drastic. Three quarters of the isoforms produced from tandem duplicated exon substitution events are tissue-specific, particularly in nervous and cardiac tissues, and tandem duplicated exon substitution events are enriched in functional terms related to structures in the brain and skeletal muscle. We find considerable evidence for the convergent evolution of tandem duplicated exon substitution events in vertebrates, arthropods, and nematodes. Twelve human gene families have orthologues with tandem duplicated exon substitution events in both Drosophila melanogaster and Caenorhabditis elegans. Six of these gene families are ion transporters, suggesting that tandem exon duplication in genes that control the flow of ions into the cell has an adaptive benefit. The ancient origins, the strong indications of tissue-specific functions, and the evidence of convergent evolution suggest that these events may have played important roles in the evolution of animal tissues and organs.
Collapse
Affiliation(s)
- Laura Martinez-Gomez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), C. Melchor Fernandez Almagro, 3, 28029 Madrid, Spain
| | - Daniel Cerdán-Vélez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), C. Melchor Fernandez Almagro, 3, 28029 Madrid, Spain
| | - Federico Abascal
- Somatic Evolution Group, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom
| | | |
Collapse
|
14
|
Zhou D, Tran Y, Abou Elela S, Scott MS. SAPFIR: A webserver for the identification of alternative protein features. BMC Bioinformatics 2022; 23:250. [PMID: 35751026 PMCID: PMC9229502 DOI: 10.1186/s12859-022-04804-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 06/20/2022] [Indexed: 11/29/2022] Open
Abstract
Background Alternative splicing can increase the diversity of gene functions by generating multiple isoforms with different sequences and functions. However, the extent to which splicing events have functional consequences remains unclear and predicting the impact of splicing events on protein activity is limited to gene-specific analysis. Results To accelerate the identification of functionally relevant alternative splicing events we created SAPFIR, a predictor of protein features associated with alternative splicing events. This webserver tool uses InterProScan to predict protein features such as functional domains, motifs and sites in the human and mouse genomes and link them to alternative splicing events. Alternative protein features are displayed as functions of the transcripts and splice sites. SAPFIR could be used to analyze proteins generated from a single gene or a group of genes and can directly identify alternative protein features in large sequence data sets. The accuracy and utility of SAPFIR was validated by its ability to rediscover previously validated alternative protein domains. In addition, our de novo analysis of public datasets using SAPFIR indicated that only a small portion of alternative protein domains was conserved between human and mouse, and that in human, genes involved in nervous system process, regulation of DNA-templated transcription and aging are more likely to produce isoforms missing functional domains due to alternative splicing. Conclusion Overall SAPFIR represents a new tool for the rapid identification of functional alternative splicing events and enables the identification of cellular functions affected by a defined splicing program. SAPFIR is freely available at https://bioinfo-scottgroup.med.usherbrooke.ca/sapfir/, a website implemented in Python, with all major browsers supported. The source code is available at https://github.com/DelongZHOU/SAPFIR. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04804-w.
Collapse
Affiliation(s)
- Delong Zhou
- Département de Microbiologie et d'infectiologie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Sherbrooke, QC, J1E 4K8, Canada
| | - Yvan Tran
- Département de Biochimie et Génomique Fonctionnelle, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Sherbrooke, QC, J1E 4K8, Canada
| | - Sherif Abou Elela
- Département de Microbiologie et d'infectiologie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Sherbrooke, QC, J1E 4K8, Canada.
| | - Michelle S Scott
- Département de Biochimie et Génomique Fonctionnelle, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Sherbrooke, QC, J1E 4K8, Canada.
| |
Collapse
|
15
|
Miller RM, Jordan BT, Mehlferber MM, Jeffery ED, Chatzipantsiou C, Kaur S, Millikin RJ, Dai Y, Tiberi S, Castaldi PJ, Shortreed MR, Luckey CJ, Conesa A, Smith LM, Deslattes Mays A, Sheynkman GM. Enhanced protein isoform characterization through long-read proteogenomics. Genome Biol 2022; 23:69. [PMID: 35241129 PMCID: PMC8892804 DOI: 10.1186/s13059-022-02624-y] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 02/02/2022] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND The detection of physiologically relevant protein isoforms encoded by the human genome is critical to biomedicine. Mass spectrometry (MS)-based proteomics is the preeminent method for protein detection, but isoform-resolved proteomic analysis relies on accurate reference databases that match the sample; neither a subset nor a superset database is ideal. Long-read RNA sequencing (e.g., PacBio or Oxford Nanopore) provides full-length transcripts which can be used to predict full-length protein isoforms. RESULTS We describe here a long-read proteogenomics approach for integrating sample-matched long-read RNA-seq and MS-based proteomics data to enhance isoform characterization. We introduce a classification scheme for protein isoforms, discover novel protein isoforms, and present the first protein inference algorithm for the direct incorporation of long-read transcriptome data to enable detection of protein isoforms previously intractable to MS-based detection. We have released an open-source Nextflow pipeline that integrates long-read sequencing in a proteomic workflow for isoform-resolved analysis. CONCLUSIONS Our work suggests that the incorporation of long-read sequencing and proteomic data can facilitate improved characterization of human protein isoform diversity. Our first-generation pipeline provides a strong foundation for future development of long-read proteogenomics and its adoption for both basic and translational research.
Collapse
Affiliation(s)
- Rachel M Miller
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Ben T Jordan
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - Madison M Mehlferber
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
| | - Erin D Jeffery
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | | | - Simi Kaur
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Robert J Millikin
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Yunxiang Dai
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Simone Tiberi
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Peter J Castaldi
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of General Medicine and Primary Care, Brigham and Women's Hospital, Boston, MA, USA
| | | | - Chance John Luckey
- Department of Pathology, University of Virginia, Charlottesville, VA, USA
| | - Ana Conesa
- Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain
- Microbiology and Cell Science Department, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Anne Deslattes Mays
- Office of Data Science and Sharing, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Rockville, MD, USA
| | - Gloria M Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA.
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA.
- UVA Cancer Center, University of Virginia, Charlottesville, VA, USA.
| |
Collapse
|
16
|
Singh P, Ahi EP. The importance of alternative splicing in adaptive evolution. Mol Ecol 2022; 31:1928-1938. [DOI: 10.1111/mec.16377] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 01/06/2022] [Accepted: 01/25/2022] [Indexed: 11/26/2022]
Affiliation(s)
- Pooja Singh
- Department of Biological Sciences University of Calgary Calgary Canada
- Institute of Ecology and Evolution University of Bern Bern Switzerland
- Swiss Federal Institute of Aquatic Science and Technology (EAWAG) Kastanienbaum Switzerland
| | - Ehsan Pashay Ahi
- Organismal and Evolutionary Biology Research Programme University of Helsinki Helsinki Finland
| |
Collapse
|
17
|
Zou X, Schaefke B, Li Y, Jia F, Sun W, Li G, Liang W, Reif T, Heyd F, Gao Q, Tian S, Li Y, Tang Y, Fang L, Hu Y, Chen W. Mammalian splicing divergence is shaped by drift, buffering in trans, and a scaling law. Life Sci Alliance 2022; 5:5/4/e202101333. [PMID: 34969779 PMCID: PMC8739531 DOI: 10.26508/lsa.202101333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 12/20/2021] [Accepted: 12/20/2021] [Indexed: 11/24/2022] Open
Abstract
This study globally investigates the allelic splicing pattern in multiple tissues of an F1 hybrid mouse and reveals the underlying driving forces shaping such tissue-dependent splicing divergence. Alternative splicing is ubiquitous, but the mechanisms underlying its pattern of evolutionary divergence across mammalian tissues are still underexplored. Here, we investigated the cis-regulatory divergences and their relationship with tissue-dependent trans-regulation in multiple tissues of an F1 hybrid between two mouse species. Large splicing changes between tissues are highly conserved and likely reflect functional tissue-dependent regulation. In particular, micro-exons frequently exhibit this pattern with high inclusion levels in the brain. Cis-divergence of splicing appears to be largely non-adaptive. Although divergence is in general associated with higher densities of sequence variants in regulatory regions, events with high usage of the dominant isoform apparently tolerate more mutations, explaining why their exon sequences are highly conserved but their intronic splicing site flanking regions are not. Moreover, we demonstrate that non-adaptive mutations are often masked in tissues where accurate splicing likely is more important, and experimentally attribute such buffering effect to trans-regulatory splicing efficiency.
Collapse
Affiliation(s)
- Xudong Zou
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Bernhard Schaefke
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China
| | - Yisheng Li
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Fujian Jia
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Wei Sun
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Guipeng Li
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China
| | - Weizheng Liang
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Tristan Reif
- Institute for Biochemistry, Freie Universität Berlin, Berlin, Germany
| | - Florian Heyd
- Institute for Biochemistry, Freie Universität Berlin, Berlin, Germany
| | - Qingsong Gao
- Laboratory for Systems Biology and Functional Genomics, Berlin Institute for Medical Systems Biology, Max-Delbrück-Centrum für Molekulare Medizin, Berlin, Germany
| | - Shuye Tian
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Yanping Li
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Yisen Tang
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Liang Fang
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China
| | - Yuhui Hu
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Wei Chen
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China .,Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.,Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
18
|
Kashkan I, Timofeyenko K, Růžička K. How alternative splicing changes the properties of plant proteins. QUANTITATIVE PLANT BIOLOGY 2022; 3:e14. [PMID: 37077961 PMCID: PMC10095807 DOI: 10.1017/qpb.2022.9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/07/2021] [Revised: 05/01/2022] [Accepted: 05/03/2022] [Indexed: 05/03/2023]
Abstract
Most plant primary transcripts undergo alternative splicing (AS), and its impact on protein diversity is a subject of intensive investigation. Several studies have uncovered various mechanisms of how particular protein splice isoforms operate. However, the common principles behind the AS effects on protein function in plants have rarely been surveyed. Here, on the selected examples, we highlight diverse tissue expression patterns, subcellular localization, enzymatic activities, abilities to bind other molecules and other relevant features. We describe how the protein isoforms mutually interact to underline their intriguing roles in altering the functionality of protein complexes. Moreover, we also discuss the known cases when these interactions have been placed inside the autoregulatory loops. This review is particularly intended for plant cell and developmental biologists who would like to gain inspiration on how the splice variants encoded by their genes of interest may coordinately work.
Collapse
Affiliation(s)
- Ivan Kashkan
- Laboratory of Hormonal Regulations in Plants, Institute of Experimental Botany, Czech Academy of Sciences, Prague, Czech Republic
- Functional Genomics and Proteomics of Plants, Central European Institute of Technology and National Centre for Biomolecular Research, Masaryk University, Brno62500, Czech Republic
| | - Ksenia Timofeyenko
- Laboratory of Hormonal Regulations in Plants, Institute of Experimental Botany, Czech Academy of Sciences, Prague, Czech Republic
- Functional Genomics and Proteomics of Plants, Central European Institute of Technology and National Centre for Biomolecular Research, Masaryk University, Brno62500, Czech Republic
| | - Kamil Růžička
- Laboratory of Hormonal Regulations in Plants, Institute of Experimental Botany, Czech Academy of Sciences, Prague, Czech Republic
- Author for correspondence: K. Růžička, E-mail:
| |
Collapse
|
19
|
Lill JR, Mathews WR, Rose CM, Schirle M. Proteomics in the pharmaceutical and biotechnology industry: a look to the next decade. Expert Rev Proteomics 2021; 18:503-526. [PMID: 34320887 DOI: 10.1080/14789450.2021.1962300] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
INTRODUCTION Pioneering technologies such as proteomics have helped fuel the biotechnology and pharmaceutical industry with the discovery of novel targets and an intricate understanding of the activity of therapeutics and their various activities in vitro and in vivo. The field of proteomics is undergoing an inflection point, where new sensitive technologies are allowing intricate biological pathways to be better understood, and novel biochemical tools are pivoting us into a new era of chemical proteomics and biomarker discovery. In this review, we describe these areas of innovation, and discuss where the fields are headed in terms of fueling biotechnological and pharmacological research and discuss current gaps in the proteomic technology landscape. AREAS COVERED Single cell sequencing and single molecule sequencing. Chemoproteomics. Biological matrices and clinical samples including biomarkers. Computational tools including instrument control software, data analysis. EXPERT OPINION Proteomics will likely remain a key technology in the coming decade, but will have to evolve with respect to type and granularity of data, cost and throughput of data generation as well as integration with other technologies to fulfill its promise in drug discovery.
Collapse
Affiliation(s)
- Jennie R Lill
- Department of Microchemistry, Lipidomics and Next Generation Sequencing, Genentech Inc. DNA Way, South San Francisco, CA, USA
| | - William R Mathews
- OMNI Department, Genentech Inc. 1 DNA Way, South San Francisco, CA, USA
| | - Christopher M Rose
- Department of Microchemistry, Lipidomics and Next Generation Sequencing, Genentech Inc. DNA Way, South San Francisco, CA, USA
| | - Markus Schirle
- Chemical Biology and Therapeutics Department, Novartis Institutes for Biomedical Research, Cambridge, MA, USA
| |
Collapse
|
20
|
Sellis D, Guérin F, Arnaiz O, Pett W, Lerat E, Boggetto N, Krenek S, Berendonk T, Couloux A, Aury JM, Labadie K, Malinsky S, Bhullar S, Meyer E, Sperling L, Duret L, Duharcourt S. Massive colonization of protein-coding exons by selfish genetic elements in Paramecium germline genomes. PLoS Biol 2021; 19:e3001309. [PMID: 34324490 PMCID: PMC8354472 DOI: 10.1371/journal.pbio.3001309] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Revised: 08/10/2021] [Accepted: 06/04/2021] [Indexed: 11/18/2022] Open
Abstract
Ciliates are unicellular eukaryotes with both a germline genome and a somatic genome in the same cytoplasm. The somatic macronucleus (MAC), responsible for gene expression, is not sexually transmitted but develops from a copy of the germline micronucleus (MIC) at each sexual generation. In the MIC genome of Paramecium tetraurelia, genes are interrupted by tens of thousands of unique intervening sequences called internal eliminated sequences (IESs), which have to be precisely excised during the development of the new MAC to restore functional genes. To understand the evolutionary origin of this peculiar genomic architecture, we sequenced the MIC genomes of 9 Paramecium species (from approximately 100 Mb in Paramecium aurelia species to >1.5 Gb in Paramecium caudatum). We detected several waves of IES gains, both in ancestral and in more recent lineages. While the vast majority of IESs are single copy in present-day genomes, we identified several families of mobile IESs, including nonautonomous elements acquired via horizontal transfer, which generated tens to thousands of new copies. These observations provide the first direct evidence that transposable elements can account for the massive proliferation of IESs in Paramecium. The comparison of IESs of different evolutionary ages indicates that, over time, IESs shorten and diverge rapidly in sequence while they acquire features that allow them to be more efficiently excised. We nevertheless identified rare cases of IESs that are under strong purifying selection across the aurelia clade. The cases examined contain or overlap cellular genes that are inactivated by excision during development, suggesting conserved regulatory mechanisms. Similar to the evolution of introns in eukaryotes, the evolution of Paramecium IESs highlights the major role played by selfish genetic elements in shaping the complexity of genome architecture and gene expression. A comparative genomics study of nine Paramecium species reveals successful invasion of genes by transposable elements in their germline genomes, showing that the internal eliminated sequences (IESs) followed an evolutionary trajectory remarkably similar to that of spliceosomal introns.
Collapse
Affiliation(s)
- Diamantis Sellis
- Université de Lyon, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Villeurbanne, France
| | - Frédéric Guérin
- Université de Paris, CNRS, Institut Jacques Monod, Paris, France
| | - Olivier Arnaiz
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette, France
| | - Walker Pett
- Université de Lyon, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Villeurbanne, France
| | - Emmanuelle Lerat
- Université de Lyon, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Villeurbanne, France
| | - Nicole Boggetto
- Université de Paris, CNRS, Institut Jacques Monod, Paris, France
| | - Sascha Krenek
- TU Dresden, Institute of Hydrobiology, Dresden, Germany
| | | | - Arnaud Couloux
- Génomique Métabolique, Genoscope, Institut de biologie François Jacob, CEA, CNRS, Université d’Évry, Université Paris-Saclay, Evry, France
| | - Jean-Marc Aury
- Génomique Métabolique, Genoscope, Institut de biologie François Jacob, CEA, CNRS, Université d’Évry, Université Paris-Saclay, Evry, France
| | - Karine Labadie
- Genoscope, Institut de biologie François-Jacob, Commissariat à l’Energie Atomique (CEA), Université Paris-Saclay, Evry, France
| | - Sophie Malinsky
- Institut de Biologie de l’Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
- Université de Paris, Paris, France
| | - Simran Bhullar
- Institut de Biologie de l’Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
| | - Eric Meyer
- Institut de Biologie de l’Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, Université PSL, Paris, France
| | - Linda Sperling
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette, France
| | - Laurent Duret
- Université de Lyon, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Villeurbanne, France
- * E-mail: (LD); (SD)
| | - Sandra Duharcourt
- Université de Paris, CNRS, Institut Jacques Monod, Paris, France
- * E-mail: (LD); (SD)
| |
Collapse
|
21
|
Martinez Gomez L, Pozo F, Walsh TA, Abascal F, Tress ML. The clinical importance of tandem exon duplication-derived substitutions. Nucleic Acids Res 2021; 49:8232-8246. [PMID: 34302486 PMCID: PMC8373072 DOI: 10.1093/nar/gkab623] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 07/21/2021] [Indexed: 01/04/2023] Open
Abstract
Most coding genes in the human genome are annotated with multiple alternative transcripts. However, clear evidence for the functional relevance of the protein isoforms produced by these alternative transcripts is often hard to find. Alternative isoforms generated from tandem exon duplication-derived substitutions are an exception. These splice events are rare, but have important functional consequences. Here, we have catalogued the 236 tandem exon duplication-derived substitutions annotated in the GENCODE human reference set. We find that more than 90% of the events have a last common ancestor in teleost fish, so are at least 425 million years old, and twenty-one can be traced back to the Bilateria clade. Alternative isoforms generated from tandem exon duplication-derived substitutions also have significantly more clinical impact than other alternative isoforms. Tandem exon duplication-derived substitutions have >25 times as many pathogenic and likely pathogenic mutations as other alternative events. Tandem exon duplication-derived substitutions appear to have vital functional roles in the cell and may have played a prominent part in metazoan evolution.
Collapse
Affiliation(s)
- Laura Martinez Gomez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), C. Melchor Fernandez Almagro, 3, 28029 Madrid, Spain
| | - Fernando Pozo
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), C. Melchor Fernandez Almagro, 3, 28029 Madrid, Spain
| | - Thomas A Walsh
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), C. Melchor Fernandez Almagro, 3, 28029 Madrid, Spain.,Eukaryotic Annotation Team, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SA. UK
| | - Federico Abascal
- Somatic Evolution Group, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK
| | - Michael L Tress
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), C. Melchor Fernandez Almagro, 3, 28029 Madrid, Spain
| |
Collapse
|
22
|
Mazin PV, Khaitovich P, Cardoso-Moreira M, Kaessmann H. Alternative splicing during mammalian organ development. Nat Genet 2021; 53:925-934. [PMID: 33941934 PMCID: PMC8187152 DOI: 10.1038/s41588-021-00851-w] [Citation(s) in RCA: 98] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Accepted: 03/19/2021] [Indexed: 12/30/2022]
Abstract
Alternative splicing (AS) is pervasive in mammalian genomes, yet cross-species comparisons have been largely restricted to adult tissues and the functionality of most AS events remains unclear. We assessed AS patterns across pre- and postnatal development of seven organs in six mammals and a bird. Our analyses revealed that developmentally dynamic AS events, which are especially prevalent in the brain, are substantially more conserved than nondynamic ones. Cassette exons with increasing inclusion frequencies during development show the strongest signals of conserved and regulated AS. Newly emerged cassette exons are typically incorporated late in testis development, but those retained during evolution are predominantly brain specific. Our work suggests that an intricate interplay of programs controlling gene expression levels and AS is fundamental to organ development, especially for the brain and heart. In these regulatory networks, AS affords substantial functional diversification of genes through the generation of tissue- and time-specific isoforms from broadly expressed genes.
Collapse
Affiliation(s)
- Pavel V Mazin
- V. Zelman Center for Neurobiology and Brain Restoration, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Philipp Khaitovich
- V. Zelman Center for Neurobiology and Brain Restoration, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Margarida Cardoso-Moreira
- Center for Molecular Biology of Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, Heidelberg, Germany.
- Evolutionary Developmental Biology Laboratory, The Francis Crick Institute, London, UK.
| | - Henrik Kaessmann
- Center for Molecular Biology of Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, Heidelberg, Germany.
| |
Collapse
|
23
|
Pozo F, Martinez-Gomez L, Walsh TA, Rodriguez JM, Di Domenico T, Abascal F, Vazquez J, Tress ML. Assessing the functional relevance of splice isoforms. NAR Genom Bioinform 2021; 3:lqab044. [PMID: 34046593 PMCID: PMC8140736 DOI: 10.1093/nargab/lqab044] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Revised: 04/22/2021] [Accepted: 05/17/2021] [Indexed: 12/20/2022] Open
Abstract
Alternative splicing of messenger RNA can generate an array of mature transcripts, but it is not clear how many go on to produce functionally relevant protein isoforms. There is only limited evidence for alternative proteins in proteomics analyses and data from population genetic variation studies indicate that most alternative exons are evolving neutrally. Determining which transcripts produce biologically important isoforms is key to understanding isoform function and to interpreting the real impact of somatic mutations and germline variations. Here we have developed a method, TRIFID, to classify the functional importance of splice isoforms. TRIFID was trained on isoforms detected in large-scale proteomics analyses and distinguishes these biologically important splice isoforms with high confidence. Isoforms predicted as functionally important by the algorithm had measurable cross species conservation and significantly fewer broken functional domains. Additionally, exons that code for these functionally important protein isoforms are under purifying selection, while exons from low scoring transcripts largely appear to be evolving neutrally. TRIFID has been developed for the human genome, but it could in principle be applied to other well-annotated species. We believe that this method will generate valuable insights into the cellular importance of alternative splicing.
Collapse
Affiliation(s)
- Fernando Pozo
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Laura Martinez-Gomez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Thomas A Walsh
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - José Manuel Rodriguez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid, Spain
| | - Tomas Di Domenico
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Federico Abascal
- Somatic Evolution Group, Wellcome Sanger Institute, Hinxton CB10 1SA, UK
| | - Jesús Vazquez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid, Spain
| | - Michael L Tress
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| |
Collapse
|
24
|
An extended catalogue of tandem alternative splice sites in human tissue transcriptomes. PLoS Comput Biol 2021; 17:e1008329. [PMID: 33826604 PMCID: PMC8055015 DOI: 10.1371/journal.pcbi.1008329] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 04/19/2021] [Accepted: 03/22/2021] [Indexed: 12/18/2022] Open
Abstract
Tandem alternative splice sites (TASS) is a special class of alternative splicing events that are characterized by a close tandem arrangement of splice sites. Most TASS lack functional characterization and are believed to arise from splicing noise. Based on the RNA-seq data from the Genotype Tissue Expression project, we present an extended catalogue of TASS in healthy human tissues and analyze their tissue-specific expression. The expression of TASS is usually dominated by one major splice site (maSS), while the expression of minor splice sites (miSS) is at least an order of magnitude lower. Among 46k miSS with sufficient read support, 9k (20%) are significantly expressed above the expected noise level, and among them 2.5k are expressed tissue-specifically. We found significant correlations between tissue-specific expression of RNA-binding proteins (RBP), tissue-specific expression of miSS, and miSS response to RBP inactivation by shRNA. In combination with RBP profiling by eCLIP, this allowed prediction of novel cases of tissue-specific splicing regulation including a miSS in QKI mRNA that is likely regulated by PTBP1. The analysis of human primary cell transcriptomes suggested that both tissue-specific and cell-type-specific factors contribute to the regulation of miSS expression. More than 20% of tissue-specific miSS affect structured protein regions and may adjust protein-protein interactions or modify the stability of the protein core. The significantly expressed miSS evolve under the same selection pressure as maSS, while other miSS lack signatures of evolutionary selection and conservation. Using mixture models, we estimated that not more than 15% of maSS and not more than 54% of tissue-specific miSS are noisy, while the proportion of noisy splice sites among non-significantly expressed miSS is above 63%. Pre-mRNA splicing is an important step in the processing of the genomic information during gene expression. During splicing, introns are excised from a gene transcript, and the remaining exons are ligated. Our work concerns one its particular subtype, which involves the so-called tandem alternative splice sites, a group of closely located exon borders that are used alternatively. We analyzed RNA-seq measurements of gene expression provided by the Genotype-Tissue Expression (GTEx) project, the largest to-date collection of such measurements in healthy human tissues, and constructed a detailed catalogue of tandem alternative splice sites. Within this catalogue, we characterized patterns of tissue-specific expression, regulation, impact on protein structure, and evolutionary selection acting on tandem alternative splice sites. In a number of genes, we predicted regulatory mechanisms that could be responsible for choosing one of many tandem alternative splice sites. The results of this study provide an invaluable resource for molecular biologists studying alternative splicing.
Collapse
|
25
|
Salovska B, Zhu H, Gandhi T, Frank M, Li W, Rosenberger G, Wu C, Germain PL, Zhou H, Hodny Z, Reiter L, Liu Y. Isoform-resolved correlation analysis between mRNA abundance regulation and protein level degradation. Mol Syst Biol 2021; 16:e9170. [PMID: 32175694 PMCID: PMC7073818 DOI: 10.15252/msb.20199170] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2019] [Revised: 02/06/2020] [Accepted: 02/12/2020] [Indexed: 12/15/2022] Open
Abstract
Profiling of biological relationships between different molecular layers dissects regulatory mechanisms that ultimately determine cellular function. To thoroughly assess the role of protein post‐translational turnover, we devised a strategy combining pulse stable isotope‐labeled amino acids in cells (pSILAC), data‐independent acquisition mass spectrometry (DIA‐MS), and a novel data analysis framework that resolves protein degradation rate on the level of mRNA alternative splicing isoforms and isoform groups. We demonstrated our approach by the genome‐wide correlation analysis between mRNA amounts and protein degradation across different strains of HeLa cells that harbor a high grade of gene dosage variation. The dataset revealed that specific biological processes, cellular organelles, spatial compartments of organelles, and individual protein isoforms of the same genes could have distinctive degradation rate. The protein degradation diversity thus dissects the corresponding buffering or concerting protein turnover control across cancer cell lines. The data further indicate that specific mRNA splicing events such as intron retention significantly impact the protein abundance levels. Our findings support the tight association between transcriptome variability and proteostasis and provide a methodological foundation for studying functional protein degradation.
Collapse
Affiliation(s)
- Barbora Salovska
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA.,Department of Genome Integrity, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
| | - Hongwen Zhu
- Department of Analytical Chemistry and CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| | | | - Max Frank
- European Molecular Biology Laboratory, Heidelberg, Germany
| | - Wenxue Li
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA
| | | | - Chongde Wu
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA
| | - Pierre-Luc Germain
- Institute for Neuroscience, D-HEST, ETH Zurich, Zurich, Switzerland.,Statistical Bioinformatics Lab, DMLS, University of Zürich, Zurich, Switzerland
| | - Hu Zhou
- Department of Analytical Chemistry and CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| | - Zdenek Hodny
- Department of Genome Integrity, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
| | | | - Yansheng Liu
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA.,Department of Pharmacology, Yale University School of Medicine, New Haven, CT, USA
| |
Collapse
|
26
|
Varabyou A, Salzberg SL, Pertea M. Effects of transcriptional noise on estimates of gene and transcript expression in RNA sequencing experiments. Genome Res 2021; 31:301-308. [PMID: 33361112 PMCID: PMC7849408 DOI: 10.1101/gr.266213.120] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 12/18/2020] [Indexed: 12/25/2022]
Abstract
RNA sequencing is widely used to measure gene expression across a vast range of animal and plant tissues and conditions. Most studies of computational methods for gene expression analysis use simulated data to evaluate the accuracy of these methods. These simulations typically include reads generated from known genes at varying levels of expression. Until now, simulations did not include reads from noisy transcripts, which might include erroneous transcription, erroneous splicing, and other processes that affect transcription in living cells. Here we examine the effects of realistic amounts of transcriptional noise on the ability of leading computational methods to assemble and quantify the genes and transcripts in an RNA sequencing experiment. We show that the inclusion of noise leads to systematic errors in the ability of these programs to measure expression, including systematic underestimates of transcript abundance levels and large increases in the number of false-positive genes and transcripts. Our results also suggest that alignment-free computational methods sometimes fail to detect transcripts expressed at relatively low levels.
Collapse
Affiliation(s)
- Ales Varabyou
- Center for Computational Biology, Johns Hopkins University, Baltimore, Maryland 21211, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218, USA
| | - Steven L Salzberg
- Center for Computational Biology, Johns Hopkins University, Baltimore, Maryland 21211, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, USA
- Department of Biostatistics, Johns Hopkins University, Baltimore, Maryland 21205, USA
| | - Mihaela Pertea
- Center for Computational Biology, Johns Hopkins University, Baltimore, Maryland 21211, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, USA
| |
Collapse
|
27
|
Abstract
Males and females of the same species share the majority of their genomes, yet they are frequently exposed to conflicting selection pressures. Gene regulation is widely assumed to resolve these conflicting sex-specific selection pressures, and although there has been considerable focus on elucidating the role of gene expression level in sex-specific adaptation, other regulatory mechanisms have been overlooked. Alternative splicing enables different transcripts to be generated from the same gene, meaning that exons which have sex-specific beneficial effects can in theory be retained in the gene product, whereas exons with detrimental effects can be skipped. However, at present, little is known about how sex-specific selection acts on broad patterns of alternative splicing. Here, we investigate alternative splicing across males and females of multiple bird species. We identify hundreds of genes that have sex-specific patterns of splicing and establish that sex differences in splicing are correlated with phenotypic sex differences. Additionally, we find that alternatively spliced genes have evolved rapidly as a result of sex-specific selection and suggest that sex differences in splicing offer another route to sex-specific adaptation when gene expression level changes are limited by functional constraints. Overall, our results shed light on how a diverse transcriptional framework can give rise to the evolution of phenotypic sexual dimorphism.
Collapse
Affiliation(s)
- Thea F Rogers
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Daniela H Palmer
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Alison E Wright
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| |
Collapse
|
28
|
Sciarrillo R, Wojtuszkiewicz A, Assaraf YG, Jansen G, Kaspers GJL, Giovannetti E, Cloos J. The role of alternative splicing in cancer: From oncogenesis to drug resistance. Drug Resist Updat 2020; 53:100728. [PMID: 33070093 DOI: 10.1016/j.drup.2020.100728] [Citation(s) in RCA: 153] [Impact Index Per Article: 30.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Revised: 09/17/2020] [Accepted: 09/21/2020] [Indexed: 12/15/2022]
Abstract
Alternative splicing is a tightly regulated process whereby non-coding sequences of pre-mRNA are removed and protein-coding segments are assembled in diverse combinations, ultimately giving rise to proteins with distinct or even opposing functions. In the past decade, whole genome/transcriptome sequencing studies revealed the high complexity of splicing regulation, which occurs co-transcriptionally and is influenced by chromatin status and mRNA modifications. Consequently, splicing profiles of both healthy and malignant cells display high diversity and alternative splicing was shown to be widely deregulated in multiple cancer types. In particular, mutations in pre-mRNA regulatory sequences, splicing regulators and chromatin modifiers, as well as differential expression of splicing factors are important contributors to cancer pathogenesis. It has become clear that these aberrations contribute to many facets of cancer, including oncogenic transformation, cancer progression, response to anticancer drug treatment as well as resistance to therapy. In this respect, alternative splicing was shown to perturb the expression a broad spectrum of relevant genes involved in drug uptake/metabolism (i.e. SLC29A1, dCK, FPGS, and TP), activation of nuclear receptor pathways (i.e. GR, AR), regulation of apoptosis (i.e. MCL1, BCL-X, and FAS) and modulation of response to immunotherapy (CD19). Furthermore, aberrant splicing constitutes an important source of novel cancer biomarkers and the spliceosome machinery represents an attractive target for a novel and rapidly expanding class of therapeutic agents. Small molecule inhibitors targeting SF3B1 or splice factor kinases were highly cytotoxic against a wide range of cancer models, including drug-resistant cells. Importantly, these effects are enhanced in specific cancer subsets, such as splicing factor-mutated and c-MYC-driven tumors. Furthermore, pre-clinical studies report synergistic effects of spliceosome modulators in combination with conventional antitumor agents. These strategies based on the use of low dose splicing modulators could shift the therapeutic window towards decreased toxicity in healthy tissues. Here we provide an extensive overview of the latest findings in the field of regulation of splicing in cancer, including molecular mechanisms by which cancer cells harness alternative splicing to drive oncogenesis and evade anticancer drug treatment as well as splicing-based vulnerabilities that can provide novel treatment opportunities. Furthermore, we discuss current challenges arising from genome-wide detection and prediction methods of aberrant splicing, as well as unravelling functional relevance of the plethora of cancer-related splicing alterations.
Collapse
Affiliation(s)
- Rocco Sciarrillo
- Department of Hematology, Amsterdam UMC, VU University Medical Center, Cancer Center Amsterdam, Amsterdam, Netherlands; Department of Pediatric Oncology, Emma's Children's Hospital, Amsterdam UMC, VU University Medical Center, Cancer Center Amsterdam, Amsterdam, Netherlands; Department of Medical Oncology, Amsterdam UMC, VU University Medical Center, Cancer Center Amsterdam, Amsterdam, Netherlands
| | - Anna Wojtuszkiewicz
- Department of Hematology, Amsterdam UMC, VU University Medical Center, Cancer Center Amsterdam, Amsterdam, Netherlands
| | - Yehuda G Assaraf
- The Fred Wyszkowski Cancer Research Laboratory, Department of Biology, Technion-Israel Institute of Technology, Haifa 3200003, Israel
| | - Gerrit Jansen
- Amsterdam Immunology and Rheumatology Center, Amsterdam UMC, VU University Medical Center, Cancer Center Amsterdam, Amsterdam, Netherlands
| | - Gertjan J L Kaspers
- Department of Pediatric Oncology, Emma's Children's Hospital, Amsterdam UMC, VU University Medical Center, Cancer Center Amsterdam, Amsterdam, Netherlands; Princess Máxima Center for Pediatric Oncology, Utrecht, Netherlands
| | - Elisa Giovannetti
- Department of Medical Oncology, Amsterdam UMC, VU University Medical Center, Cancer Center Amsterdam, Amsterdam, Netherlands; Fondazione Pisana per la Scienza, Pisa, Italy
| | - Jacqueline Cloos
- Department of Hematology, Amsterdam UMC, VU University Medical Center, Cancer Center Amsterdam, Amsterdam, Netherlands.
| |
Collapse
|
29
|
Role of Alternatively Spliced Messenger RNA (mRNA) Isoforms of the Insulin-Like Growth Factor 1 (IGF1) in Selected Human Tumors. Int J Mol Sci 2020; 21:ijms21196995. [PMID: 32977489 PMCID: PMC7582825 DOI: 10.3390/ijms21196995] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2020] [Revised: 09/20/2020] [Accepted: 09/21/2020] [Indexed: 02/07/2023] Open
Abstract
Insulin-like growth factor 1 (IGF1) is a key regulator of tissue growth and development that is also implicated in the initiation and progression of various cancers. The human IGF1 gene contains six exons and five long introns, the transcription of which is controlled by two promoters (P1 and P2). Alternate promoter usage, as well as alternative splicing (AS) of IGF1, results in the expression of six various variants (isoforms) of mRNA, i.e., IA, IB, IC, IIA, IIB, and IIC. A mature 70-kDa IGF1 protein is coded only by exons 3 and 4, while exons 5 and 6 are alternatively spliced code for the three C-terminal E peptides: Ea (exon 6), Eb (exon 5), and Ec (fragments of exons 5 and 6). The most abundant of those transcripts is IGF1Ea, followed by IGF1Eb and IGF1Ec (also known as mechano-growth factor, MGF). The presence of different IGF1 transcripts suggests tissue-specific auto- and/or paracrine action, as well as separate regulation of both of these gene promoters. In physiology, the role of different IGF1 mRNA isoforms and pro-peptides is best recognized in skeletal muscle tissue. Their functions include the development and regeneration of muscles, as well as maintenance of proper muscle mass. In turn, in nervous tissue, a neuroprotective function of short peptides, produced as a result of IGF1 expression and characterized by significant blood-brain barrier penetrance, has been described and could be a potential therapeutic target. When it comes to the regulation of carcinogenesis, the potential biological role of different var iants of IGF1 mRNAs and pro-peptides is also intensively studied. This review highlights the role of IGF1 isoform expression (mRNAs, proteins) in physiology and different types of human tumors (e.g., breast cancer, cervical cancer, colorectal cancer, osteosarcoma, prostate and thyroid cancers), as well as mechanisms of IGF1 spliced variants involvement in tumor biology.
Collapse
|
30
|
Lau E, Han Y, Williams DR, Thomas CT, Shrestha R, Wu JC, Lam MPY. Splice-Junction-Based Mapping of Alternative Isoforms in the Human Proteome. Cell Rep 2020; 29:3751-3765.e5. [PMID: 31825849 PMCID: PMC6961840 DOI: 10.1016/j.celrep.2019.11.026] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Revised: 09/24/2019] [Accepted: 11/06/2019] [Indexed: 12/18/2022] Open
Abstract
The protein-level translational status and function of many alternative splicing events remain poorly understood. We use an RNA sequencing (RNA-seq)-guided proteomics method to identify protein alternative splicing isoforms in the human proteome by constructing tissue-specific protein databases that prioritize transcript splice junction pairs with high translational potential. Using the custom databases to reanalyze ~80 million mass spectra in public proteomics datasets, we identify more than 1,500 noncanonical protein isoforms across 12 human tissues, including ~400 sequences undocumented on TrEMBL and RefSeq databases. We apply the method to original quantitative mass spectrometry experiments and observe widespread isoform regulation during human induced pluripotent stem cell cardiomyocyte differentiation. On a proteome scale, alternative isoform regions overlap frequently with disordered sequences and post-translational modification sites, suggesting that alternative splicing may regulate protein function through modulating intrinsically disordered regions. The described approach may help elucidate functional consequences of alternative splicing and expand the scope of proteomics investigations in various systems. The translation and function of many alternative splicing events await confirmation at the protein level. Lau et al. use an integrated proteotranscriptomics approach to identify non-canonical and undocumented isoforms from 12 organs in the human proteome. Alternative isoforms interfere with functional sequence features and are differentially regulated during iPSC cardiomyocyte differentiation.
Collapse
Affiliation(s)
- Edward Lau
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA
| | - Yu Han
- Consortium for Fibrosis Research and Translation, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA; Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA
| | - Damon R Williams
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA
| | - Cody T Thomas
- Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA
| | - Rajani Shrestha
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA
| | - Joseph C Wu
- Stanford Cardiovascular Institute, Department of Medicine, Stanford University, Palo Alto, CA, USA; Department of Radiology, School of Medicine, Stanford University, Palo Alto, CA, USA
| | - Maggie P Y Lam
- Consortium for Fibrosis Research and Translation, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA; Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado, Aurora, CO, USA.
| |
Collapse
|
31
|
Mishra SK, Muthye V, Kandoi G. Computational Methods for Predicting Functions at the mRNA Isoform Level. Int J Mol Sci 2020; 21:ijms21165686. [PMID: 32784445 PMCID: PMC7460821 DOI: 10.3390/ijms21165686] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 08/05/2020] [Accepted: 08/06/2020] [Indexed: 11/16/2022] Open
Abstract
Multiple mRNA isoforms of the same gene are produced via alternative splicing, a biological mechanism that regulates protein diversity while maintaining genome size. Alternatively spliced mRNA isoforms of the same gene may sometimes have very similar sequence, but they can have significantly diverse effects on cellular function and regulation. The products of alternative splicing have important and diverse functional roles, such as response to environmental stress, regulation of gene expression, human heritable, and plant diseases. The mRNA isoforms of the same gene can have dramatically different functions. Despite the functional importance of mRNA isoforms, very little has been done to annotate their functions. The recent years have however seen the development of several computational methods aimed at predicting mRNA isoform level biological functions. These methods use a wide array of proteo-genomic data to develop machine learning-based mRNA isoform function prediction tools. In this review, we discuss the computational methods developed for predicting the biological function at the individual mRNA isoform level.
Collapse
|
32
|
Calviello L, Hirsekorn A, Ohler U. Quantification of translation uncovers the functions of the alternative transcriptome. Nat Struct Mol Biol 2020; 27:717-725. [DOI: 10.1038/s41594-020-0450-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Accepted: 05/13/2020] [Indexed: 11/09/2022]
|
33
|
de la Fuente L, Arzalluz-Luque Á, Tardáguila M, Del Risco H, Martí C, Tarazona S, Salguero P, Scott R, Lerma A, Alastrue-Agudo A, Bonilla P, Newman JRB, Kosugi S, McIntyre LM, Moreno-Manzano V, Conesa A. tappAS: a comprehensive computational framework for the analysis of the functional impact of differential splicing. Genome Biol 2020; 21:119. [PMID: 32423416 PMCID: PMC7236505 DOI: 10.1186/s13059-020-02028-w] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 04/23/2020] [Indexed: 12/26/2022] Open
Abstract
Recent advances in long-read sequencing solve inaccuracies in alternative transcript identification of full-length transcripts in short-read RNA-Seq data, which encourages the development of methods for isoform-centered functional analysis. Here, we present tappAS, the first framework to enable a comprehensive Functional Iso-Transcriptomics (FIT) analysis, which is effective at revealing the functional impact of context-specific post-transcriptional regulation. tappAS uses isoform-resolved annotation of coding and non-coding functional domains, motifs, and sites, in combination with novel analysis methods to interrogate different aspects of the functional readout of transcript variants and isoform regulation. tappAS software and documentation are available at https://app.tappas.org.
Collapse
Affiliation(s)
- Lorena de la Fuente
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
- Present Address: Bioinformatics Unit, IIS Fundación Jiménez Díaz, Madrid, Spain
| | - Ángeles Arzalluz-Luque
- Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
| | - Manuel Tardáguila
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
- Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Héctor Del Risco
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
| | - Cristina Martí
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
| | - Sonia Tarazona
- Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
| | - Pedro Salguero
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
| | - Raymond Scott
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
| | - Alberto Lerma
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
| | - Ana Alastrue-Agudo
- Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Pablo Bonilla
- Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Jeremy R B Newman
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Department of Pathology, University of Florida, Gainesville, FL, USA
| | - Shunichi Kosugi
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Laboratory for Statistical and Translational Genetics, Center for Integrative Medical Sciences, RIKEN, Wako, Japan
| | - Lauren M McIntyre
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, USA
| | | | - Ana Conesa
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA.
- Genetics Institute, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
34
|
Reixachs-Solé M, Ruiz-Orera J, Albà MM, Eyras E. Ribosome profiling at isoform level reveals evolutionary conserved impacts of differential splicing on the proteome. Nat Commun 2020; 11:1768. [PMID: 32286305 PMCID: PMC7156646 DOI: 10.1038/s41467-020-15634-w] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 03/11/2020] [Indexed: 12/14/2022] Open
Abstract
The differential production of transcript isoforms from gene loci is a key cellular mechanism. Yet, its impact in protein production remains an open question. Here, we describe ORQAS (ORF quantification pipeline for alternative splicing), a pipeline for the translation quantification of individual transcript isoforms using ribosome-protected mRNA fragments (ribosome profiling). We find evidence of translation for 40-50% of the expressed isoforms in human and mouse, with 53% of the expressed genes having more than one translated isoform in human, and 33% in mouse. Differential splicing analysis revealed that about 40% of the splicing changes at RNA level are concordant with changes in translation. Furthermore, orthologous cassette exons between human and mouse preserve the directionality of the change, and are enriched in microexons in a comparison between glia and glioma. ORQAS leverages ribosome profiling to uncover a widespread and evolutionarily conserved impact of differential splicing on translation, particularly of microexon-containing isoforms.
Collapse
Affiliation(s)
- Marina Reixachs-Solé
- The John Curtin School of Medical Research, Australian National University, Canberra, ACT, 2601, Australia
- EMBL Australia Partner Laboratory Network at the Australian National University, Canberra, ACT, 2601, Australia
| | - Jorge Ruiz-Orera
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, 13125, Germany
| | - M Mar Albà
- IMIM - Hospital del Mar Medical Research Institute, E08003, Barcelona, Spain
- Catalan Institution for Research and Advanced Studies, E08010, Barcelona, Spain
- Pompeu Fabra University, E08003, Barcelona, Spain
| | - Eduardo Eyras
- The John Curtin School of Medical Research, Australian National University, Canberra, ACT, 2601, Australia.
- EMBL Australia Partner Laboratory Network at the Australian National University, Canberra, ACT, 2601, Australia.
- IMIM - Hospital del Mar Medical Research Institute, E08003, Barcelona, Spain.
- Catalan Institution for Research and Advanced Studies, E08010, Barcelona, Spain.
| |
Collapse
|
35
|
Analgesic treatment with buprenorphine should be adapted to the mouse strain. Pharmacol Biochem Behav 2020; 191:172877. [DOI: 10.1016/j.pbb.2020.172877] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Revised: 02/19/2020] [Accepted: 02/19/2020] [Indexed: 11/23/2022]
|
36
|
Sequence variants of human tropoelastin affecting assembly, structural characteristics and functional properties of polymeric elastin in health and disease. Matrix Biol 2019; 84:68-80. [DOI: 10.1016/j.matbio.2019.06.010] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2019] [Revised: 05/23/2019] [Accepted: 06/25/2019] [Indexed: 12/19/2022]
|
37
|
Hatje K, Mühlhausen S, Simm D, Kollmar M. The Protein-Coding Human Genome: Annotating High-Hanging Fruits. Bioessays 2019; 41:e1900066. [PMID: 31544971 DOI: 10.1002/bies.201900066] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Revised: 08/07/2019] [Indexed: 12/19/2022]
Abstract
The major transcript variants of human protein-coding genes are annotated to a certain degree of accuracy combining manual curation, transcript data, and proteomics evidence. However, there is considerable disagreement on the annotation of about 2000 genes-they can be protein-coding, noncoding, or pseudogenes-and on the annotation of most of the predicted alternative transcripts. Pure transcriptome mapping approaches seem to be limited in discriminating functional expression from noise. These limitations have partially been overcome by dedicated algorithms to detect alternative spliced micro-exons and wobble splice variants. Recently, knowledge about splice mechanism and protein structure are incorporated into an algorithm to predict neighboring homologous exons, often spliced in a mutually exclusive manner. Predicted exons are evaluated by transcript data, structural compatibility, and evolutionary conservation, revealing hundreds of novel coding exons and splice mechanism re-assignments. The emerging human pan-genome is necessitating distinctive annotations incorporating differences between individuals and between populations.
Collapse
Affiliation(s)
- Klas Hatje
- Roche Pharmaceutical Research and Early Development, Pharmaceutical Sciences, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Grenzacherstr. 124, 4070, Basel, Switzerland
| | - Stefanie Mühlhausen
- Group Systems Biology of Motor Proteins, Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, 37077, Göttingen, Germany
| | - Dominic Simm
- Group Systems Biology of Motor Proteins, Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, 37077, Göttingen, Germany.,Theoretical Computer Science and Algorithmic Methods, Institute of Computer Science, Georg-August-University Göttingen, Goldschmidtstr. 7, 37077, Göttingen, Germany
| | - Martin Kollmar
- Group Systems Biology of Motor Proteins, Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, 37077, Göttingen, Germany
| |
Collapse
|
38
|
Mudge JM, Jungreis I, Hunt T, Gonzalez JM, Wright JC, Kay M, Davidson C, Fitzgerald S, Seal R, Tweedie S, He L, Waterhouse RM, Li Y, Bruford E, Choudhary JS, Frankish A, Kellis M. Discovery of high-confidence human protein-coding genes and exons by whole-genome PhyloCSF helps elucidate 118 GWAS loci. Genome Res 2019; 29:2073-2087. [PMID: 31537640 PMCID: PMC6886504 DOI: 10.1101/gr.246462.118] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 09/09/2019] [Indexed: 12/15/2022]
Abstract
The most widely appreciated role of DNA is to encode protein, yet the exact portion of the human genome that is translated remains to be ascertained. We previously developed PhyloCSF, a widely used tool to identify evolutionary signatures of protein-coding regions using multispecies genome alignments. Here, we present the first whole-genome PhyloCSF prediction tracks for human, mouse, chicken, fly, worm, and mosquito. We develop a workflow that uses machine learning to predict novel conserved protein-coding regions and efficiently guide their manual curation. We analyze more than 1000 high-scoring human PhyloCSF regions and confidently add 144 conserved protein-coding genes to the GENCODE gene set, as well as additional coding regions within 236 previously annotated protein-coding genes, and 169 pseudogenes, most of them disabled after primates diverged. The majority of these represent new discoveries, including 70 previously undetected protein-coding genes. The novel coding genes are additionally supported by single-nucleotide variant evidence indicative of continued purifying selection in the human lineage, coding-exon splicing evidence from new GENCODE transcripts using next-generation transcriptomic data sets, and mass spectrometry evidence of translation for several new genes. Our discoveries required simultaneous comparative annotation of other vertebrate genomes, which we show is essential to remove spurious ORFs and to distinguish coding from pseudogene regions. Our new coding regions help elucidate disease-associated regions by revealing that 118 GWAS variants previously thought to be noncoding are in fact protein altering. Altogether, our PhyloCSF data sets and algorithms will help researchers seeking to interpret these genomes, while our new annotations present exciting loci for further experimental characterization.
Collapse
Affiliation(s)
- Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Irwin Jungreis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Toby Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Jose Manuel Gonzalez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - James C Wright
- Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, London SW7 3RP, United Kingdom
| | - Mike Kay
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Claire Davidson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Stephen Fitzgerald
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, United Kingdom
| | - Ruth Seal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.,Department of Haematology, University of Cambridge, Cambridge CB2 0PT, United Kingdom
| | - Susan Tweedie
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Liang He
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Robert M Waterhouse
- Department of Ecology and Evolution, University of Lausanne, Lausanne 1015, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Yue Li
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Elspeth Bruford
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.,Department of Haematology, University of Cambridge, Cambridge CB2 0PT, United Kingdom
| | - Jyoti S Choudhary
- Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, London SW7 3RP, United Kingdom
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Manolis Kellis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| |
Collapse
|
39
|
Rodriguez JM, Rodriguez-Rivas J, Di Domenico T, Vázquez J, Valencia A, Tress ML. APPRIS 2017: principal isoforms for multiple gene sets. Nucleic Acids Res 2019; 46:D213-D217. [PMID: 29069475 PMCID: PMC5753224 DOI: 10.1093/nar/gkx997] [Citation(s) in RCA: 90] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 10/19/2017] [Indexed: 01/23/2023] Open
Abstract
The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants.
Collapse
Affiliation(s)
- Jose Manuel Rodriguez
- Spanish National Bioinformatics Institute (INB), Spanish National Cancer Research Centre (CNIO), Madrid 28029, Spain
| | - Juan Rodriguez-Rivas
- Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), Madrid 28029, Spain
| | - Tomás Di Domenico
- Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), Madrid 28029, Spain
| | - Jesús Vázquez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), 28029 Madrid, Spain.,CIBER de Enfermedades Cardiovasculares (CIBERCV), 28029 Madrid, Spain
| | - Alfonso Valencia
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona E-08010, Spain.,Life Sciences Department, Barcelona Supercomputing Centre (BSC-CNS), Barcelona E-08034, Spain
| | - Michael L Tress
- Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), Madrid 28029, Spain
| |
Collapse
|
40
|
Abascal F, Juan D, Jungreis I, Kellis M, Martinez L, Rigau M, Rodriguez JM, Vazquez J, Tress ML. Loose ends: almost one in five human genes still have unresolved coding status. Nucleic Acids Res 2019; 46:7070-7084. [PMID: 29982784 PMCID: PMC6101605 DOI: 10.1093/nar/gky587] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Accepted: 06/18/2018] [Indexed: 12/16/2022] Open
Abstract
Seventeen years after the sequencing of the human genome, the human proteome is still under revision. One in eight of the 22 210 coding genes listed by the Ensembl/GENCODE, RefSeq and UniProtKB reference databases are annotated differently across the three sets. We have carried out an in-depth investigation on the 2764 genes classified as coding by one or more sets of manual curators and not coding by others. Data from large-scale genetic variation analyses suggests that most are not under protein-like purifying selection and so are unlikely to code for functional proteins. A further 1470 genes annotated as coding in all three reference sets have characteristics that are typical of non-coding genes or pseudogenes. These potential non-coding genes also appear to be undergoing neutral evolution and have considerably less supporting transcript and protein evidence than other coding genes. We believe that the three reference databases currently overestimate the number of human coding genes by at least 2000, complicating and adding noise to large-scale biomedical experiments. Determining which potential non-coding genes do not code for proteins is a difficult but vitally important task since the human reference proteome is a fundamental pillar of most basic research and supports almost all large-scale biomedical projects.
Collapse
Affiliation(s)
- Federico Abascal
- Wellcome Trust Sanger Institute, Hinxton CB10 1SA, Cambridgeshire, UK
| | - David Juan
- Comparative Genomics Lab, Instituto de Biologica Evolutiva, Universitat Pompeu Fabra, Barcelona, Spain
| | - Irwin Jungreis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA and Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Laura Martinez
- Bioinformatics Unit, Spanish National Cancer Research Centre, Madrid, Spain
| | - Maria Rigau
- Computational Biology Life Sciences Group, Barcelona Supercomputing Center, Barcelona, Spain
| | - Jose Manuel Rodriguez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares, Madrid, Spain
| | - Jesus Vazquez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares, Madrid, Spain
| | - Michael L Tress
- Bioinformatics Unit, Spanish National Cancer Research Centre, Madrid, Spain
| |
Collapse
|
41
|
Wang H, Lekbaby B, Fares N, Augustin J, Attout T, Schnuriger A, Cassard AM, Panasyuk G, Perlemuter G, Bieche I, Vacher S, Selves J, Péron JM, Bancel B, Merle P, Kremsdorf D, Hall J, Chemin I, Soussan P. Alteration of splicing factors' expression during liver disease progression: impact on hepatocellular carcinoma outcome. Hepatol Int 2019; 13:454-467. [PMID: 31140152 DOI: 10.1007/s12072-019-09950-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 04/29/2019] [Indexed: 12/29/2022]
Abstract
PURPOSE Trans-acting splicing factors (SF) shape the eukaryotic transcriptome by regulating alternative splicing (AS). This process is recurrently modulated in liver cancer suggesting its direct contribution to the course of liver disease. The aim of our study was to investigate the relationship between the regulation of SFs expression and liver damage. METHODS The expression profile of 10 liver-specific SF and the AS events of 7 genes associated with liver disorders was assessed by western-blotting in 6 murine models representing different stages of liver damage, from inflammation to hepatocellular carcinoma (HCC). Relevant SFs (PSF, SRSF3, and SRSF6) and target genes (INSR, SRSF3, and SLK) modulated in mice were investigated in a cohort of 179 HCC patients. RESULTS Each murine model of liver disease was characterized by a unique SF expression profile. Changes in the SF profile did not affect AS events of the selected genes despite the presence of corresponding splicing sites. In human HCC expression of SFs, including the tumor-suppressor SRSF3, and AS regulation of genes studied were frequently upregulated in tumor versus non-tumor tissues. Risk of tumor recurrence positively correlated with AS isoform of the INSR gene. In contrast, increased levels of SFs expression correlated with an extended overall survival of patients. CONCLUSIONS Dysregulation of SF expression is an early event occurring during liver injury and not just at the stage of HCC. Besides impacting on AS regulation, overexpression of SF may contribute to preserving hepatocyte homeostasis during liver pathogenesis.
Collapse
Affiliation(s)
- Hualin Wang
- INSERM U1135, Centre d'immunologie et de maladie infectieuse, 91 boulevard de l'Hôpital, 75013, Paris, France
- Sorbonne Université, Paris, France
| | - Bouchra Lekbaby
- INSERM U1135, Centre d'immunologie et de maladie infectieuse, 91 boulevard de l'Hôpital, 75013, Paris, France
- Sorbonne Université, Paris, France
| | - Nadim Fares
- Centre de Recherche en Cancérologie de Lyon, UMR INSERM 1052, CNRS 5286, Lyon Cedex 03, France
| | - Jeremy Augustin
- INSERM U1135, Centre d'immunologie et de maladie infectieuse, 91 boulevard de l'Hôpital, 75013, Paris, France
- Sorbonne Université, Paris, France
| | - Tarik Attout
- INSERM U1135, Centre d'immunologie et de maladie infectieuse, 91 boulevard de l'Hôpital, 75013, Paris, France
- Sorbonne Université, Paris, France
| | - Aurelie Schnuriger
- INSERM U1135, Centre d'immunologie et de maladie infectieuse, 91 boulevard de l'Hôpital, 75013, Paris, France
- Sorbonne Université, Paris, France
- Département de Virologie, Hôpitaux Est Parisien, Paris, France
| | - Anne-Marie Cassard
- Faculté de médecine Paris-Sud, Université Paris-Sud, 94270, Kremlin-Bicêtre, France
| | - Ganna Panasyuk
- Institut Necker-Enfants Malades, Université Paris Descartes, Paris, France
- INSERM U1151/CNRS Unité Mixte de Recherche (UMR) 8253, Paris, France
| | - Gabriel Perlemuter
- Faculté de médecine Paris-Sud, Université Paris-Sud, 94270, Kremlin-Bicêtre, France
- AP-HP, Hôpital Antoine Béclère, Service d'hépato-gastroentérologie, 92140, Clamart, France
| | | | | | - Janick Selves
- Institut Universitaire de Cancérologie de Toulouse Oncopole, Université Paul Sabatier, Toulouse, France
| | - Jean-Marie Péron
- Institut Universitaire de Cancérologie de Toulouse Oncopole, Université Paul Sabatier, Toulouse, France
| | - Brigitte Bancel
- Centre de Recherche en Cancérologie de Lyon, UMR INSERM 1052, CNRS 5286, Lyon Cedex 03, France
| | - Philippe Merle
- Centre de Recherche en Cancérologie de Lyon, UMR INSERM 1052, CNRS 5286, Lyon Cedex 03, France
| | - Dina Kremsdorf
- INSERM U1135, Centre d'immunologie et de maladie infectieuse, 91 boulevard de l'Hôpital, 75013, Paris, France
- Sorbonne Université, Paris, France
| | - Janet Hall
- Centre de Recherche en Cancérologie de Lyon, UMR INSERM 1052, CNRS 5286, Lyon Cedex 03, France
| | - Isabelle Chemin
- Centre de Recherche en Cancérologie de Lyon, UMR INSERM 1052, CNRS 5286, Lyon Cedex 03, France
| | - Patrick Soussan
- INSERM U1135, Centre d'immunologie et de maladie infectieuse, 91 boulevard de l'Hôpital, 75013, Paris, France.
- Sorbonne Université, Paris, France.
- Département de Virologie, Hôpitaux Est Parisien, Paris, France.
| |
Collapse
|
42
|
Zhang D, Yang JF, Gao B, Liu TY, Hao GF, Yang GF, Fu LJ, Chen MX, Zhang J. Identification, evolution and alternative splicing profile analysis of the splicing factor 30 (SPF30) in plant species. PLANTA 2019; 249:1997-2014. [PMID: 30904945 DOI: 10.1007/s00425-019-03146-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Accepted: 03/19/2019] [Indexed: 06/09/2023]
Abstract
The work offers a comprehensive evaluation on the phylogenetics and conservation of splicing patterns of the plant SPF30 splicing factor gene family. In eukaryotes, one pre-mRNA can generate multiple mRNA transcripts by alternative splicing (AS), which expands transcriptome and proteome diversity. Splicing factor 30 (SPF30), also known as survival motor neuron domain containing protein 1 (SMNDC1), is a spliceosomal protein that plays an essential role in spliceosomal assembly. Although SPF30 genes have been well characterised in human and yeast, little is known about their homologues in plants. Here, we report the genome-wide identification and phylogenetic analysis of SPF30 genes in the plant kingdom. In total, 82 SPF30 genes were found in 64 plant species from algae to land plants. Alternative transcripts were found in many SPF30 genes and splicing profile analysis revealed that the second intron in SPF30 genome is frequently associated with AS events and contributed to the birth of novel exons in a few SPF30 members. In addition, different conserved sequences were observed at these putative splice sites among moss, monocots and dicots, respectively. Our findings will facilitate further functional characterization of plant SPF30 genes as putative splicing factors.
Collapse
Affiliation(s)
- Di Zhang
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
| | - Jing-Fang Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, 430079, China
| | - Bei Gao
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Tie-Yuan Liu
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Ge-Fei Hao
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, 430079, China
| | - Guang-Fu Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, 430079, China
| | - Li-Jun Fu
- Fujian Provincial Key Laboratory of Ecology-Toxicological Effects & Control for Emerging Contaminants, Putian University, Putian, 351100, Fujian, People's Republic of China
| | - Mo-Xian Chen
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China.
| | - Jianhua Zhang
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China.
- Department of Biology, Hong Kong Baptist University and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong.
| |
Collapse
|
43
|
Lopez-Perolio I, Leman R, Behar R, Lattimore V, Pearson JF, Castéra L, Martins A, Vaur D, Goardon N, Davy G, Garre P, García-Barberán V, Llovet P, Pérez-Segura P, Díaz-Rubio E, Caldés T, Hruska KS, Hsuan V, Wu S, Pesaran T, Karam R, Vallon-Christersson J, Borg A, Valenzuela-Palomo A, Velasco EA, Southey M, Vreeswijk MPG, Devilee P, Kvist A, Spurdle AB, Walker LC, Krieger S, de la Hoya M. Alternative splicing and ACMG-AMP-2015-based classification of PALB2 genetic variants: an ENIGMA report. J Med Genet 2019; 56:453-460. [PMID: 30890586 PMCID: PMC6591742 DOI: 10.1136/jmedgenet-2018-105834] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Revised: 01/10/2019] [Accepted: 02/06/2019] [Indexed: 11/04/2022]
Abstract
BACKGROUND PALB2 monoallelic loss-of-function germ-line variants confer a breast cancer risk comparable to the average BRCA2 pathogenic variant. Recommendations for risk reduction strategies in carriers are similar. Elaborating robust criteria to identify loss-of-function variants in PALB2-without incurring overprediction-is thus of paramount clinical relevance. Towards this aim, we have performed a comprehensive characterisation of alternative splicing in PALB2, analysing its relevance for the classification of truncating and splice site variants according to the 2015 American College of Medical Genetics and Genomics-Association for Molecular Pathology guidelines. METHODS Alternative splicing was characterised in RNAs extracted from blood, breast and fimbriae/ovary-related human specimens (n=112). RNAseq, RT-PCR/CE and CloneSeq experiments were performed by five contributing laboratories. Centralised revision/curation was performed to assure high-quality annotations. Additional splicing analyses were performed in PALB2 c.212-1G>A, c.1684+1G>A, c.2748+2T>G, c.3113+5G>A, c.3350+1G>A, c.3350+4A>C and c.3350+5G>A carriers. The impact of the findings on PVS1 status was evaluated for truncating and splice site variant. RESULTS We identified 88 naturally occurring alternative splicing events (81 newly described), including 4 in-frame events predicted relevant to evaluate PVS1 status of splice site variants. We did not identify tissue-specific alternate gene transcripts in breast or ovarian-related samples, supporting the clinical relevance of blood-based splicing studies. CONCLUSIONS PVS1 is not necessarily warranted for splice site variants targeting four PALB2 acceptor sites (exons 2, 5, 7 and 10). As a result, rare variants at these splice sites cannot be assumed pathogenic/likely pathogenic without further evidences. Our study puts a warning in up to five PALB2 genetic variants that are currently reported as pathogenic/likely pathogenic in ClinVar.
Collapse
Affiliation(s)
- Irene Lopez-Perolio
- Molecular Oncology Laboratory CIBERONC, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| | - Raphaël Leman
- Laboratory of Clinical Biology and Oncology, Centre François Baclesse, Inserm U1245 Genomics and Personalized Medicine in Cancer and Neurological Disorders, Normandy University, Caen, France
| | - Raquel Behar
- Molecular Oncology Laboratory CIBERONC, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| | - Vanessa Lattimore
- Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand
| | - John F Pearson
- Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand
| | - Laurent Castéra
- Laboratory of Clinical Biology and Oncology, Centre François Baclesse, Inserm U1245 Genomics and Personalized Medicine in Cancer and Neurological Disorders, Normandy University, Caen, France
| | - Alexandra Martins
- Inserm U1245 Genomics and Personalized Medecine in Cancer and Neurological Disorders, UNIROUEN, Normandie Université, Normandy Centre for Genomic and Personalized Medicine, Rouen, France
| | - Dominique Vaur
- Laboratory of Clinical Biology and Oncology, Centre François Baclesse, Inserm U1245 Genomics and Personalized Medicine in Cancer and Neurological Disorders, Normandy University, Caen, France
| | - Nicolas Goardon
- Laboratory of Clinical Biology and Oncology, Centre François Baclesse, Inserm U1245 Genomics and Personalized Medicine in Cancer and Neurological Disorders, Normandy University, Caen, France
| | - Grégoire Davy
- Laboratory of Clinical Biology and Oncology, Centre François Baclesse, Inserm U1245 Genomics and Personalized Medicine in Cancer and Neurological Disorders, Normandy University, Caen, France
| | - Pilar Garre
- Molecular Oncology Laboratory CIBERONC, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| | - Vanesa García-Barberán
- Molecular Oncology Laboratory CIBERONC, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| | - Patricia Llovet
- Molecular Oncology Laboratory CIBERONC, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| | - Pedro Pérez-Segura
- Molecular Oncology Laboratory CIBERONC, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| | - Eduardo Díaz-Rubio
- Molecular Oncology Laboratory CIBERONC, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| | - Trinidad Caldés
- Molecular Oncology Laboratory CIBERONC, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| | | | | | - Sitao Wu
- Ambry Genetics, Aliso Viejo, CA, USA
| | | | | | - Johan Vallon-Christersson
- Division of Oncology and Pathology, Department of Clinical Sciences Lund, Lund University, Lund, Sweden
| | - Ake Borg
- Division of Oncology and Pathology, Department of Clinical Sciences Lund, Lund University, Lund, Sweden
| | -
- Peter MacCallum Cancer Centre, Melbourne, VIC, Australia.,The Sir Peter MacCallum Department of Oncology, University of Melbourne, Parkville, Australia
| | - Alberto Valenzuela-Palomo
- Splicing and genetic susceptibility to cancer, Instituto de Biología y Genética Molecular (CSIC-UVa), Valladolid, Spain
| | - Eladio A Velasco
- Splicing and genetic susceptibility to cancer, Instituto de Biología y Genética Molecular (CSIC-UVa), Valladolid, Spain
| | - Melissa Southey
- Genetic Epidemiology Laboratory, Department of Clinical Pathology, The University of Melbourne, Melbourne, VIC, Australia
| | - Maaike P G Vreeswijk
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Peter Devilee
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Anders Kvist
- Division of Oncology and Pathology, Department of Clinical Sciences Lund, Lund University, Lund, Sweden
| | - Amanda B Spurdle
- Molecular Cancer Epidemiology Laboratory, QIMR Berghofer Medical Research Institute, Brisbane, Australia
| | - Logan C Walker
- Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand
| | - Sophie Krieger
- Laboratory of Clinical Biology and Oncology, Centre François Baclesse, Inserm U1245 Genomics and Personalized Medicine in Cancer and Neurological Disorders, Normandy University, Caen, France
| | - Miguel de la Hoya
- Molecular Oncology Laboratory CIBERONC, Hospital Clínico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| |
Collapse
|
44
|
Abstract
Alternative splicing is a widespread, essential, and complex component of gene regulation. Apicomplexan parasites have long been recognized to produce alternatively spliced transcripts for some genes and can produce multiple protein products that are essential for parasite growth. Alternative splicing is a widespread, essential, and complex component of gene regulation. Apicomplexan parasites have long been recognized to produce alternatively spliced transcripts for some genes and can produce multiple protein products that are essential for parasite growth. Recent approaches are now providing more wide-ranging surveys of the extent of alternative splicing; some indicate that alternative splicing is less widespread than in other model eukaryotes, whereas others suggest levels comparable to those of previously studied groups. In many cases, apicomplexan alternative splicing events appear not to generate multiple alternative proteins but instead produce aberrant or noncoding transcripts. Nonetheless, appropriate regulation of alternative splicing is clearly essential in Plasmodium and Toxoplasma parasites, suggesting a biological role for at least some of the alternative splicing observed. Several studies have now disrupted conserved regulators of alternative splicing and demonstrated lethal effects in apicomplexans. This minireview discusses methods to accurately determine the extent of alternative splicing in Apicomplexa and discuss potential biological roles for this conserved process in a phylum of parasites with compact genomes.
Collapse
|
45
|
Chaudhary S, Khokhar W, Jabre I, Reddy ASN, Byrne LJ, Wilson CM, Syed NH. Alternative Splicing and Protein Diversity: Plants Versus Animals. FRONTIERS IN PLANT SCIENCE 2019; 10:708. [PMID: 31244866 PMCID: PMC6581706 DOI: 10.3389/fpls.2019.00708] [Citation(s) in RCA: 116] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Accepted: 05/13/2019] [Indexed: 05/11/2023]
Abstract
Plants, unlike animals, exhibit a very high degree of plasticity in their growth and development and employ diverse strategies to cope with the variations during diurnal cycles and stressful conditions. Plants and animals, despite their remarkable morphological and physiological differences, share many basic cellular processes and regulatory mechanisms. Alternative splicing (AS) is one such gene regulatory mechanism that modulates gene expression in multiple ways. It is now well established that AS is prevalent in all multicellular eukaryotes including plants and humans. Emerging evidence indicates that in plants, as in animals, transcription and splicing are coupled. Here, we reviewed recent evidence in support of co-transcriptional splicing in plants and highlighted similarities and differences between plants and humans. An unsettled question in the field of AS is the extent to which splice isoforms contribute to protein diversity. To take a critical look at this question, we presented a comprehensive summary of the current status of research in this area in both plants and humans, discussed limitations with the currently used approaches and suggested improvements to current methods and alternative approaches. We end with a discussion on the potential role of epigenetic modifications and chromatin state in splicing memory in plants primed with stresses.
Collapse
Affiliation(s)
- Saurabh Chaudhary
- School of Human and Life Sciences, Canterbury Christ Church University, Canterbury, United Kingdom
| | - Waqas Khokhar
- School of Human and Life Sciences, Canterbury Christ Church University, Canterbury, United Kingdom
| | - Ibtissam Jabre
- School of Human and Life Sciences, Canterbury Christ Church University, Canterbury, United Kingdom
| | - Anireddy S. N. Reddy
- Department of Biology and Program in Cell and Molecular Biology, Colorado State University, Fort Collins, CO, United States
| | - Lee J. Byrne
- School of Human and Life Sciences, Canterbury Christ Church University, Canterbury, United Kingdom
| | - Cornelia M. Wilson
- School of Human and Life Sciences, Canterbury Christ Church University, Canterbury, United Kingdom
| | - Naeem H. Syed
- School of Human and Life Sciences, Canterbury Christ Church University, Canterbury, United Kingdom
- *Correspondence: Naeem H. Syed,
| |
Collapse
|
46
|
Sterne-Weiler T, Weatheritt RJ, Best AJ, Ha KC, Blencowe BJ. Efficient and Accurate Quantitative Profiling of Alternative Splicing Patterns of Any Complexity on a Laptop. Mol Cell 2018; 72:187-200.e6. [DOI: 10.1016/j.molcel.2018.08.018] [Citation(s) in RCA: 84] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2018] [Revised: 06/24/2018] [Accepted: 08/09/2018] [Indexed: 01/08/2023]
|
47
|
Ten-Caten F, Vêncio RZN, Lorenzetti APR, Zaramela LS, Santana AC, Koide T. Internal RNAs overlapping coding sequences can drive the production of alternative proteins in archaea. RNA Biol 2018; 15:1119-1132. [PMID: 30175688 PMCID: PMC6161675 DOI: 10.1080/15476286.2018.1509661] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Prokaryotic genomes show a high level of information compaction often with different molecules transcribed from the same locus. Although antisense RNAs have been relatively well studied, RNAs in the same strand, internal RNAs (intraRNAs), are still poorly understood. The question of how common is the translation of overlapping reading frames remains open. We address this question in the model archaeon Halobacterium salinarum. In the present work we used differential RNA-seq (dRNA-seq) in H. salinarum NRC-1 to locate intraRNA signals in subsets of internal transcription start sites (iTSS) and establish the open reading frames associated to them (intraORFs). Using C-terminally flagged proteins, we experimentally observed isoforms accurately predicted by intraRNA translation for kef1, acs3 and orc4 genes. We also recovered from the literature and mass spectrometry databases several instances of protein isoforms consistent with intraRNA translation such as the gas vesicle protein gene gvpC1. We found evidence for intraRNAs in horizontally transferred genes such as the chaperone dnaK and the aerobic respiration related cydA in both H. salinarum and Escherichia coli. Also, intraRNA translation evidence in H. salinarum, E. coli and yeast of a universal elongation factor (aEF-2, fusA and eEF-2) suggests that this is an ancient phenomenon present in all domains of life.
Collapse
Affiliation(s)
- Felipe Ten-Caten
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Ricardo Z N Vêncio
- b Department of Computation and Mathematics, Faculdade de Filosofia , Ciências e Letras de Ribeirão Preto, University of São Paulo , Ribeirão Preto , Brazil
| | - Alan Péricles R Lorenzetti
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Livia Soares Zaramela
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Ana Carolina Santana
- c Department of Cell and Molecular Biology and Pathogenic Bioagents , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Tie Koide
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| |
Collapse
|
48
|
Bhuiyan SA, Ly S, Phan M, Huntington B, Hogan E, Liu CC, Liu J, Pavlidis P. Systematic evaluation of isoform function in literature reports of alternative splicing. BMC Genomics 2018; 19:637. [PMID: 30153812 PMCID: PMC6114036 DOI: 10.1186/s12864-018-5013-2] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2018] [Accepted: 08/14/2018] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Although most genes in mammalian genomes have multiple isoforms, an ongoing debate is whether these isoforms are all functional as well as the extent to which they increase the functional repertoire of the genome. To ground this debate in data, it would be helpful to have a corpus of experimentally-verified cases of genes which have functionally distinct splice isoforms (FDSIs). RESULTS We established a curation framework for evaluating experimental evidence of FDSIs, and analyzed over 700 human and mouse genes, strongly biased towards genes that are prominent in the alternative splicing literature. Despite this bias, we found experimental evidence meeting the classical definition for functionally distinct isoforms for ~ 5% of the curated genes. If we relax our criteria for inclusion to include weaker forms of evidence, the fraction of genes with evidence of FDSIs remains low (~ 13%). We provide evidence that this picture will not change substantially with further curation and conclude there is a large gap between the presumed impact of splicing on gene function and the experimental evidence. Furthermore, many functionally distinct isoforms were not traceable to a specific isoform in Ensembl, a database that forms the basis for much computational research. CONCLUSIONS We conclude that the claim that alternative splicing vastly increases the functional repertoire of the genome is an extrapolation from a limited number of empirically supported cases. We also conclude that more work is needed to integrate experimental evidence and genome annotation databases. Our work should help shape research around the role of splicing on gene function from presuming large general effects to acknowledging the need for stronger experimental evidence.
Collapse
Affiliation(s)
- Shamsuddin A. Bhuiyan
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
- Department of Psychiatry, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
- Graduate Program in Bioinformatics, University of British Columbia, Vancouver, Canada
| | - Sophia Ly
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
| | - Minh Phan
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
| | - Brandon Huntington
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
| | - Ellie Hogan
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
| | - Chao Chun Liu
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
| | - James Liu
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
| | - Paul Pavlidis
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
- Department of Psychiatry, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
| |
Collapse
|
49
|
Abstract
Single-cell RNAseq and alternative splicing studies have recently become two of the most prominent applications of RNAseq. However, the combination of both is still challenging, and few research efforts have been dedicated to the intersection between them. Cell-level insight on isoform expression is required to fully understand the biology of alternative splicing, but it is still an open question to what extent isoform expression analysis at the single-cell level is actually feasible. Here, we establish a set of four conditions that are required for a successful single-cell-level isoform study and evaluate how these conditions are met by these technologies in published research.
Collapse
Affiliation(s)
- Ángeles Arzalluz-Luque
- Genomics of Gene Expression Laboratory, Centro de Investigación Principe Felipe (CIPF), 46012, Valencia, Spain
| | - Ana Conesa
- Genomics of Gene Expression Laboratory, Centro de Investigación Principe Felipe (CIPF), 46012, Valencia, Spain.
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, Genetics Institute, University of Florida, Gainesville, Florida, 32611, USA.
| |
Collapse
|
50
|
Schaefke B, Sun W, Li YS, Fang L, Chen W. The evolution of posttranscriptional regulation. WILEY INTERDISCIPLINARY REVIEWS-RNA 2018; 9:e1485. [PMID: 29851258 DOI: 10.1002/wrna.1485] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2018] [Revised: 04/23/2018] [Accepted: 04/26/2018] [Indexed: 12/13/2022]
Abstract
"DNA makes RNA makes protein." After transcription, mRNAs undergo a series of intertwining processes to be finally translated into functional proteins. The "posttranscriptional" regulation (PTR) provides cells an extended option to fine-tune their proteomes. To meet the demands of complex organism development and the appropriate response to environmental stimuli, every step in these processes needs to be finely regulated. Moreover, changes in these regulatory processes are important driving forces underlying the evolution of phenotypic differences across different species. The major PTR mechanisms discussed in this review include the regulation of splicing, polyadenylation, decay, and translation. For alternative splicing and polyadenylation, we mainly discuss their evolutionary dynamics and the genetic changes underlying the regulatory differences in cis-elements versus trans-factors. For mRNA decay and translation, which, together with transcription, determine the cellular RNA or protein abundance, we focus our discussion on how their divergence coordinates with transcriptional changes to shape the evolution of gene expression. Then to highlight the importance of PTR in the evolution of higher complexity, we focus on their roles in two major phenomena during eukaryotic evolution: the evolution of multicellularity and the division of labor between different cell types and tissues; and the emergence of diverse, often highly specialized individual phenotypes, especially those concerning behavior in eusocial insects. This article is categorized under: RNA Evolution and Genomics > RNA and Ribonucleoprotein Evolution Translation > Translation Regulation RNA Processing > Splicing Regulation/Alternative Splicing.
Collapse
Affiliation(s)
- Bernhard Schaefke
- Department of Biology, Southern University of Science and Technology, Shenzhen, China
| | - Wei Sun
- Department of Biology, Southern University of Science and Technology, Shenzhen, China.,Department of Pharmaceutical Chemistry and Cardiovascular Research Institute, University of California San Francisco, San Francisco
| | - Yi-Sheng Li
- Department of Biology, Southern University of Science and Technology, Shenzhen, China
| | - Liang Fang
- Department of Biology, Southern University of Science and Technology, Shenzhen, China.,Medi-X Institute, SUSTech Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China
| | - Wei Chen
- Department of Biology, Southern University of Science and Technology, Shenzhen, China.,Medi-X Institute, SUSTech Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|