1
|
Choquet K, Chaumont LP, Bache S, Baxter-Koenigs AR, Churchman LS. Genetic regulation of nascent RNA maturation revealed by direct RNA nanopore sequencing. Genome Res 2025; 35:712-724. [PMID: 39952678 PMCID: PMC12047268 DOI: 10.1101/gr.279203.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 10/31/2024] [Indexed: 02/17/2025]
Abstract
Quantitative trait loci analyses have revealed an important role for genetic variants in regulating alternative splicing (AS) and alternative cleavage and polyadenylation (APA) in humans. Yet, these studies are generally performed with mature mRNA, so they report on the outcome rather than the processes of RNA maturation and thus may overlook how variants directly modulate pre-mRNA processing. The order in which the many introns of a human gene are removed can substantially influence AS, while nascent RNA polyadenylation can affect RNA stability and decay. However, how splicing order and poly(A) tail length are regulated by genetic variation has never been explored. Here, we used direct RNA nanopore sequencing to investigate allele-specific pre-mRNA maturation in 12 human lymphoblastoid cell lines. We find frequent splicing order differences between alleles and uncover significant single-nucleotide polymorphism (SNP)-splicing order associations in 17 genes. This includes SNPs located in or near splice sites as well as more distal intronic and exonic SNPs. Moreover, several genes showed allele-specific poly(A) tail lengths, many of which also have a skewed allelic abundance ratio. HLA class I transcripts, which encode proteins that play an essential role in antigen presentation, show the most allele-specific splicing orders, which frequently co-occur with allele-specific AS, APA, or poly(A) tail length differences. Together, our results expose new layers of genetic regulation of pre-mRNA maturation and highlight the power of long-read RNA sequencing for allele-specific analyses.
Collapse
Affiliation(s)
- Karine Choquet
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke J1E 4K8, Canada;
- Research Centre on Aging, CIUSSS de l'Estrie-CHUS, Sherbrooke J1H 2J7, Canada
| | - Louis-Philippe Chaumont
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke J1E 4K8, Canada
- Research Centre on Aging, CIUSSS de l'Estrie-CHUS, Sherbrooke J1H 2J7, Canada
| | - Simon Bache
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke J1E 4K8, Canada
- Research Centre on Aging, CIUSSS de l'Estrie-CHUS, Sherbrooke J1H 2J7, Canada
| | | | - L Stirling Churchman
- Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
| |
Collapse
|
2
|
Ament IH, DeBruyne N, Wang F, Lin L. Long-read RNA sequencing: A transformative technology for exploring transcriptome complexity in human diseases. Mol Ther 2025; 33:883-894. [PMID: 39563027 PMCID: PMC11897757 DOI: 10.1016/j.ymthe.2024.11.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Revised: 10/30/2024] [Accepted: 11/15/2024] [Indexed: 11/21/2024] Open
Abstract
Long-read RNA sequencing (RNA-seq) is emerging as a powerful and versatile technology for studying human transcriptomes. By enabling the end-to-end sequencing of full-length transcripts, long-read RNA-seq opens up avenues for investigating various RNA species and features that cannot be reliably interrogated by standard short-read RNA-seq methods. In this review, we present an overview of long-read RNA-seq, delineating its strengths over short-read RNA-seq, as well as summarizing recent advances in experimental and computational approaches to boost the power of long-read-based transcriptomics. We describe a wide range of applications of long-read RNA-seq, and highlight its expanding role as a foundational technology for exploring transcriptome variations in human diseases.
Collapse
Affiliation(s)
| | - Nicole DeBruyne
- Graduate Group in Cell and Molecular Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Feng Wang
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.
| | - Lan Lin
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.
| |
Collapse
|
3
|
Choquet K, Chaumont LP, Bache S, Baxter-Koenigs AR, Churchman LS. Genetic regulation of nascent RNA maturation revealed by direct RNA nanopore sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.29.610338. [PMID: 39257732 PMCID: PMC11383983 DOI: 10.1101/2024.08.29.610338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
Quantitative trait loci analyses have revealed an important role for genetic variants in regulating alternative splicing (AS) and alternative cleavage and polyadenylation (APA) in humans. Yet, these studies are generally performed with mature mRNA, so they report on the outcome rather than the processes of RNA maturation and thus may overlook how variants directly modulate pre-mRNA processing. The order in which the many introns of a human gene are removed can substantially influence AS, while nascent RNA polyadenylation can affect RNA stability and decay. However, how splicing order and poly(A) tail length are regulated by genetic variation has never been explored. Here, we used direct RNA nanopore sequencing to investigate allele-specific pre-mRNA maturation in 12 human lymphoblastoid cell lines. We found frequent splicing order differences between alleles and uncovered significant single nucleotide polymorphism (SNP)-splicing order associations in 17 genes. This included SNPs located in or near splice sites as well as more distal intronic and exonic SNPs. Moreover, several genes showed allele-specific poly(A) tail lengths, many of which also had a skewed allelic abundance ratio. HLA class I transcripts, which encode proteins that play an essential role in antigen presentation, showed the most allele-specific splicing orders, which frequently co-occurred with allele-specific AS, APA or poly(A) tail length differences. Together, our results expose new layers of genetic regulation of pre-mRNA maturation and highlight the power of long-read RNA sequencing for allele-specific analyses.
Collapse
Affiliation(s)
- Karine Choquet
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Canada
- Research Centre on Aging, CIUSSS de l’Estrie-CHUS, Sherbrooke, Canada
| | - Louis-Philippe Chaumont
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Canada
- Research Centre on Aging, CIUSSS de l’Estrie-CHUS, Sherbrooke, Canada
| | - Simon Bache
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Canada
- Research Centre on Aging, CIUSSS de l’Estrie-CHUS, Sherbrooke, Canada
| | | | | |
Collapse
|
4
|
Wang Y, Xie Z, Kutschera E, Adams JI, Kadash-Edmondson KE, Xing Y. rMATS-turbo: an efficient and flexible computational tool for alternative splicing analysis of large-scale RNA-seq data. Nat Protoc 2024; 19:1083-1104. [PMID: 38396040 DOI: 10.1038/s41596-023-00944-2] [Citation(s) in RCA: 47] [Impact Index Per Article: 47.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Accepted: 11/02/2023] [Indexed: 02/25/2024]
Abstract
Pre-mRNA alternative splicing is a prevalent mechanism for diversifying eukaryotic transcriptomes and proteomes. Regulated alternative splicing plays a role in many biological processes, and dysregulated alternative splicing is a feature of many human diseases. Short-read RNA sequencing (RNA-seq) is now the standard approach for transcriptome-wide analysis of alternative splicing. Since 2011, our laboratory has developed and maintained Replicate Multivariate Analysis of Transcript Splicing (rMATS), a computational tool for discovering and quantifying alternative splicing events from RNA-seq data. Here we provide a protocol for the contemporary version of rMATS, rMATS-turbo, a fast and scalable re-implementation that maintains the statistical framework and user interface of the original rMATS software, while incorporating a revamped computational workflow with a substantial improvement in speed and data storage efficiency. The rMATS-turbo software scales up to massive RNA-seq datasets with tens of thousands of samples. To illustrate the utility of rMATS-turbo, we describe two representative application scenarios. First, we describe a broadly applicable two-group comparison to identify differential alternative splicing events between two sample groups, including both annotated and novel alternative splicing events. Second, we describe a quantitative analysis of alternative splicing in a large-scale RNA-seq dataset (~1,000 samples), including the discovery of alternative splicing events associated with distinct cell states. We detail the workflow and features of rMATS-turbo that enable efficient parallel processing and analysis of large-scale RNA-seq datasets on a compute cluster. We anticipate that this protocol will help the broad user base of rMATS-turbo make the best use of this software for studying alternative splicing in diverse biological systems.
Collapse
Affiliation(s)
- Yuanyuan Wang
- Bioinformatics Interdepartmental Graduate Program, University of California, Los Angeles, Los Angeles, CA, USA
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Zhijie Xie
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Eric Kutschera
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Jenea I Adams
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Genomics and Computational Biology Graduate Program, University of Pennsylvania, Philadelphia, PA, USA
| | - Kathryn E Kadash-Edmondson
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Yi Xing
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA.
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Biomedical and Health Informatics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA.
| |
Collapse
|
5
|
Xu F, Liu S, Zhao A, Shang M, Wang Q, Jiang S, Cheng Q, Chen X, Zhai X, Zhang J, Wang X, Yan J. iFLAS: positive-unlabeled learning facilitates full-length transcriptome-based identification and functional exploration of alternatively spliced isoforms in maize. THE NEW PHYTOLOGIST 2024; 241:2606-2620. [PMID: 38291701 DOI: 10.1111/nph.19554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 01/06/2024] [Indexed: 02/01/2024]
Abstract
The advent of full-length transcriptome sequencing technologies has accelerated the discovery of novel splicing isoforms. However, existing alternative splicing (AS) tools are either tailored for short-read RNA-Seq data or designed for human and animal studies. The disparities in AS patterns between plants and animals still pose a challenge to the reliable identification and functional exploration of novel isoforms in plants. Here, we developed integrated full-length alternative splicing analysis (iFLAS), a plant-optimized AS toolkit that introduced a semi-supervised machine learning method known as positive-unlabeled (PU) learning to accurately identify novel isoforms. iFLAS also enables the investigation of AS functions from various perspectives, such as differential AS, poly(A) tail length, and allele-specific AS (ASAS) analyses. By applying iFLAS to three full-length transcriptome sequencing datasets, we systematically identified and functionally characterized maize (Zea mays) AS patterns. We found intron retention not only introduces premature termination codons, resulting in lower expression levels of isoforms, but may also regulate the length of 3'UTR and poly(A) tail, thereby affecting the functional differentiation of isoforms. Moreover, we observed distinct ASAS patterns in two genes within heterosis offspring, highlighting their potential value in breeding. These results underscore the broad applicability of iFLAS in plant full-length transcriptome-based AS research.
Collapse
Affiliation(s)
- Feng Xu
- State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
| | - Songyu Liu
- State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
| | - Anwen Zhao
- State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
| | - Meiqi Shang
- State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
| | - Qian Wang
- State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
| | - Shuqin Jiang
- State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
| | - Qian Cheng
- State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
| | - Xingming Chen
- Molbreeding Biotechnology Co., Ltd, Shijiazhuang, Hebei Province, 051430, China
| | - Xiaoguang Zhai
- Molbreeding Biotechnology Co., Ltd, Shijiazhuang, Hebei Province, 051430, China
| | - Jianan Zhang
- Molbreeding Biotechnology Co., Ltd, Shijiazhuang, Hebei Province, 051430, China
| | - Xiangfeng Wang
- State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
| | - Jun Yan
- State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
| |
Collapse
|
6
|
Wang R, Helbig I, Edmondson AC, Lin L, Xing Y. Splicing defects in rare diseases: transcriptomics and machine learning strategies towards genetic diagnosis. Brief Bioinform 2023; 24:bbad284. [PMID: 37580177 PMCID: PMC10516351 DOI: 10.1093/bib/bbad284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 07/10/2023] [Accepted: 07/20/2023] [Indexed: 08/16/2023] Open
Abstract
Genomic variants affecting pre-messenger RNA splicing and its regulation are known to underlie many rare genetic diseases. However, common workflows for genetic diagnosis and clinical variant interpretation frequently overlook splice-altering variants. To better serve patient populations and advance biomedical knowledge, it has become increasingly important to develop and refine approaches for detecting and interpreting pathogenic splicing variants. In this review, we will summarize a few recent developments and challenges in using RNA sequencing technologies for rare disease investigation. Moreover, we will discuss how recent computational splicing prediction tools have emerged as complementary approaches for revealing disease-causing variants underlying splicing defects. We speculate that continuous improvements to sequencing technologies and predictive modeling will not only expand our understanding of splicing regulation but also bring us closer to filling the diagnostic gap for rare disease patients.
Collapse
Affiliation(s)
- Robert Wang
- Center for Computational and Genomic Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Genomics and Computational Biology Graduate Program, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Ingo Helbig
- The Epilepsy NeuroGenetics Initiative, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Biomedical and Health Informatics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Neurology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Andrew C Edmondson
- Center for Computational and Genomic Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Pediatrics, Division of Human Genetics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Lan Lin
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Yi Xing
- Center for Computational and Genomic Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Biomedical and Health Informatics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
7
|
Wu EY, Singh NP, Choi K, Zakeri M, Vincent M, Churchill GA, Ackert-Bicknell CL, Patro R, Love MI. SEESAW: detecting isoform-level allelic imbalance accounting for inferential uncertainty. Genome Biol 2023; 24:165. [PMID: 37438847 PMCID: PMC10337143 DOI: 10.1186/s13059-023-03003-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 06/29/2023] [Indexed: 07/14/2023] Open
Abstract
Detecting allelic imbalance at the isoform level requires accounting for inferential uncertainty, caused by multi-mapping of RNA-seq reads. Our proposed method, SEESAW, uses Salmon and Swish to offer analysis at various levels of resolution, including gene, isoform, and aggregating isoforms to groups by transcription start site. The aggregation strategies strengthen the signal for transcripts with high uncertainty. The SEESAW suite of methods is shown to have higher power than other allelic imbalance methods when there is isoform-level allelic imbalance. We also introduce a new test for detecting imbalance that varies across a covariate, such as time.
Collapse
Affiliation(s)
- Euphy Y Wu
- Department of Biostatistics, University of North Carolina-Chapel Hill, Chapel Hill, NC, USA
| | - Noor P Singh
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | | | - Mohsen Zakeri
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | | | | | - Cheryl L Ackert-Bicknell
- Department of Orthopedics, School of Medicine, University of Colorado, Anschutz Campus, Aurora, CO, USA
| | - Rob Patro
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Michael I Love
- Department of Biostatistics, University of North Carolina-Chapel Hill, Chapel Hill, NC, USA.
- Department of Genetics, University of North Carolina-Chapel Hill, Chapel Hill, NC, USA.
| |
Collapse
|
8
|
Dong L, Wang J, Wang G. ASAS-EGB: A statistical framework for estimating allele-specific alternative splicing events using transcriptome data. Comput Biol Med 2023; 160:106981. [PMID: 37163967 DOI: 10.1016/j.compbiomed.2023.106981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Revised: 04/15/2023] [Accepted: 04/27/2023] [Indexed: 05/12/2023]
Abstract
Ubiquitous Alternative splicing (AS) in eukaryotes greatly increases the biodiversity of proteins and involves in disease and cancer. Allele-specific AS studies can facilitate the identification of cis-acting elements because both alleles share the same cellular environment. Due to the limited information provided on the exons defined by AS events, we propose a statistical framework and algorithm ASAS-EGB for ASAS analysis using the gene transcriptome. The framework obtains exclusively compatible sets of gene isoforms supporting each event isoform, and uses both phased and non-phased SNPs within the exons on the gene isoforms for inference. Using this strategy, we have demonstrated ASAS-EGB can yield better ASAS inferential performance than using event isoforms. ASAS-EGB supports both single-end and paired-end RNA-seq data, and we have proved its robustness using RNA-seq replicates of individual NA12878. ASAS-EGB builds Bayesian models for ASAS analysis, and the MCMC method is used to solve the problem. With more detailed annotations for individual genomes and transcriptomes appearing in the future, the algorithm proposed by the paper can provide better support for these data to reveal the regulatory mechanisms of individual genomes.
Collapse
Affiliation(s)
- Lili Dong
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, 150001, China
| | - Jianan Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, 150001, China
| | - Guohua Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, 150001, China.
| |
Collapse
|
9
|
Allele-specific expression analysis for complex genetic phenotypes applied to a unique dilated cardiomyopathy cohort. Sci Rep 2023; 13:564. [PMID: 36631531 PMCID: PMC9834222 DOI: 10.1038/s41598-023-27591-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 01/04/2023] [Indexed: 01/13/2023] Open
Abstract
Allele-specific expression (ASE) analysis detects the relative abundance of alleles at heterozygous loci as a proxy for cis-regulatory variation, which affects the personal transcriptome and proteome. This study describes the development and application of an ASE analysis pipeline on a unique cohort of 87 well phenotyped and RNA sequenced patients from the Maastricht Cardiomyopathy Registry with dilated cardiomyopathy (DCM), a complex genetic disorder with a remaining gap in explained heritability. Regulatory processes for which ASE is a proxy might explain this gap. We found an overrepresentation of known DCM-associated genes among the significant results across the cohort. In addition, we were able to find genes of interest that have not been associated with DCM through conventional methods such as genome-wide association or differential gene expression studies. The pipeline offers RNA sequencing data processing, individual and population level ASE analyses as well as group comparisons and several intuitive visualizations such as Manhattan plots and protein-protein interaction networks. With this pipeline, we found evidence supporting the case that cis-regulatory variation contributes to the phenotypic heterogeneity of DCM. Additionally, our results highlight that ASE analysis offers an additional layer to conventional genomic and transcriptomic analyses for candidate gene identification and biological insight.
Collapse
|
10
|
Pervasive occurrence of splice-site-creating mutations and their possible involvement in genetic disorders. NPJ Genom Med 2022; 7:22. [PMID: 35304488 PMCID: PMC8933504 DOI: 10.1038/s41525-022-00294-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Accepted: 02/15/2022] [Indexed: 01/06/2023] Open
Abstract
The search for causative mutations in human genetic disorders has mainly focused on mutations that disrupt coding regions or splice sites. Recently, however, it has been reported that mutations creating splice sites can also cause a range of genetic disorders. In this study, we identified 5656 candidate splice-site-creating mutations (SCMs), of which 3942 are likely to be pathogenic, in 4054 genes responsible for genetic disorders. Reanalysis of exome data obtained from ciliopathy patients led us to identify 38 SCMs as candidate causative mutations. We estimate that, by focusing on SCMs, the increase in diagnosis rate is approximately 5.9–8.5% compared to the number of already known pathogenic variants. This finding suggests that SCMs are mutations worth focusing on in the search for causative mutations of genetic disorders.
Collapse
|
11
|
Clark KC, Kwitek AE. Multi-Omic Approaches to Identify Genetic Factors in Metabolic Syndrome. Compr Physiol 2021; 12:3045-3084. [PMID: 34964118 PMCID: PMC9373910 DOI: 10.1002/cphy.c210010] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Metabolic syndrome (MetS) is a highly heritable disease and a major public health burden worldwide. MetS diagnosis criteria are met by the simultaneous presence of any three of the following: high triglycerides, low HDL/high LDL cholesterol, insulin resistance, hypertension, and central obesity. These diseases act synergistically in people suffering from MetS and dramatically increase risk of morbidity and mortality due to stroke and cardiovascular disease, as well as certain cancers. Each of these component features is itself a complex disease, as is MetS. As a genetically complex disease, genetic risk factors for MetS are numerous, but not very powerful individually, often requiring specific environmental stressors for the disease to manifest. When taken together, all sequence variants that contribute to MetS disease risk explain only a fraction of the heritable variance, suggesting additional, novel loci have yet to be discovered. In this article, we will give a brief overview on the genetic concepts needed to interpret genome-wide association studies (GWAS) and quantitative trait locus (QTL) data, summarize the state of the field of MetS physiological genomics, and to introduce tools and resources that can be used by the physiologist to integrate genomics into their own research on MetS and any of its component features. There is a wealth of phenotypic and molecular data in animal models and humans that can be leveraged as outlined in this article. Integrating these multi-omic QTL data for complex diseases such as MetS provides a means to unravel the pathways and mechanisms leading to complex disease and promise for novel treatments. © 2022 American Physiological Society. Compr Physiol 12:1-40, 2022.
Collapse
Affiliation(s)
- Karen C Clark
- Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| | - Anne E Kwitek
- Department of Physiology, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| |
Collapse
|
12
|
Jiang Y, Fu X, Zhang Y, Wang SF, Zhu H, Wang WK, Zhang L, Wu P, Wong CCL, Li J, Ma J, Guan JS, Huang Y, Hui J. Rett syndrome linked to defects in forming the MeCP2/Rbfox/LASR complex in mouse models. Nat Commun 2021; 12:5767. [PMID: 34599184 PMCID: PMC8486766 DOI: 10.1038/s41467-021-26084-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 09/13/2021] [Indexed: 01/01/2023] Open
Abstract
Rett syndrome (RTT) is a severe neurological disorder and a leading cause of intellectual disability in young females. RTT is mainly caused by mutations found in the X-linked gene encoding methyl-CpG binding protein 2 (MeCP2). Despite extensive studies, the molecular mechanism underlying RTT pathogenesis is still poorly understood. Here, we report MeCP2 as a key subunit of a higher-order multiunit protein complex Rbfox/LASR. Defective MeCP2 in RTT mouse models disrupts the assembly of the MeCP2/Rbfox/LASR complex, leading to reduced binding of Rbfox proteins to target pre-mRNAs and aberrant splicing of Nrxns and Nlgn1 critical for synaptic plasticity. We further show that MeCP2 disease mutants display defective condensate properties and fail to promote phase-separated condensates with Rbfox proteins in vitro and in cultured cells. These data link an impaired function of MeCP2 with disease mutation in splicing control to its defective properties in mediating the higher-order assembly of the MeCP2/Rbfox/LASR complex. MeCP2 mutations can cause Rett syndrome, a severe childhood neurological disorder. Here the authors show that MeCP2 mediates the higher-order assembly of a large splicing complex Rbfox/LASR, which is disrupted in the mouse models of Rett syndrome.
Collapse
Affiliation(s)
- Yan Jiang
- State Key Laboratory of Molecular Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, 200031, Shanghai, China
| | - Xing Fu
- Shanghai Center for Plant Stress Biology, Chinese Academy of Sciences, 201602, Shanghai, China
| | - Yuhan Zhang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Centre of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China.,Department of General Surgery, Shanghai Key Laboratory of Biliary Tract Disease Research, State Key Laboratory of Oncogenes and Related Genes, Xinhua Hospital, Shanghai Jiao Tong University, 200092, Shanghai, China
| | - Shen-Fei Wang
- State Key Laboratory of Molecular Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, 200031, Shanghai, China
| | - Hong Zhu
- State Key Laboratory of Molecular Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, 200031, Shanghai, China
| | - Wei-Kang Wang
- State Key Laboratory of Molecular Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, 200031, Shanghai, China
| | - Lin Zhang
- State Key Laboratory of Cell Biology, Shanghai Key Laboratory of Molecular Andrology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, University of Chinese Academy of Sciences, 200031, Shanghai, China
| | - Ping Wu
- National Facility for Protein Science in Shanghai, Zhangjiang Lab, Shanghai Advanced Research Institute, Chinese Academy of Sciences, 201210, Shanghai, China
| | - Catherine C L Wong
- National Facility for Protein Science in Shanghai, Zhangjiang Lab, Shanghai Advanced Research Institute, Chinese Academy of Sciences, 201210, Shanghai, China.,Center for Precision Medicine Multi-Omics Research, Peking University Health Science Center, School of Basic Medical Sciences, Peking University, 100191, Beijing, China
| | - Jinsong Li
- State Key Laboratory of Cell Biology, Shanghai Key Laboratory of Molecular Andrology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, University of Chinese Academy of Sciences, 200031, Shanghai, China
| | - Jinbiao Ma
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Centre of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
| | - Ji-Song Guan
- School of Life Science and Technology, ShanghaiTech University, 201210, Shanghai, China.,Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, 200031, Shanghai, China
| | - Ying Huang
- Department of General Surgery, Shanghai Key Laboratory of Biliary Tract Disease Research, State Key Laboratory of Oncogenes and Related Genes, Xinhua Hospital, Shanghai Jiao Tong University, 200092, Shanghai, China.
| | - Jingyi Hui
- State Key Laboratory of Molecular Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, 200031, Shanghai, China.
| |
Collapse
|
13
|
Matallana-Ramirez LP, Whetten RW, Sanchez GM, Payn KG. Breeding for Climate Change Resilience: A Case Study of Loblolly Pine ( Pinus taeda L.) in North America. FRONTIERS IN PLANT SCIENCE 2021; 12:606908. [PMID: 33995428 PMCID: PMC8119900 DOI: 10.3389/fpls.2021.606908] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 04/08/2021] [Indexed: 05/25/2023]
Abstract
Earth's atmosphere is warming and the effects of climate change are becoming evident. A key observation is that both the average levels and the variability of temperature and precipitation are changing. Information and data from new technologies are developing in parallel to provide multidisciplinary opportunities to address and overcome the consequences of these changes in forest ecosystems. Changes in temperature and water availability impose multidimensional environmental constraints that trigger changes from the molecular to the forest stand level. These can represent a threat for the normal development of the tree from early seedling recruitment to adulthood both through direct mortality, and by increasing susceptibility to pathogens, insect attack, and fire damage. This review summarizes the strengths and shortcomings of previous work in the areas of genetic variation related to cold and drought stress in forest species with particular emphasis on loblolly pine (Pinus taeda L.), the most-planted tree species in North America. We describe and discuss the implementation of management and breeding strategies to increase resilience and adaptation, and discuss how new technologies in the areas of engineering and genomics are shaping the future of phenotype-genotype studies. Lessons learned from the study of species important in intensively-managed forest ecosystems may also prove to be of value in helping less-intensively managed forest ecosystems adapt to climate change, thereby increasing the sustainability and resilience of forestlands for the future.
Collapse
Affiliation(s)
- Lilian P. Matallana-Ramirez
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, Raleigh, NC, United States
| | - Ross W. Whetten
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, Raleigh, NC, United States
| | - Georgina M. Sanchez
- Center for Geospatial Analytics, North Carolina State University, Raleigh, Raleigh, NC, United States
| | - Kitt G. Payn
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, Raleigh, NC, United States
| |
Collapse
|
14
|
Park E, Jiang Y, Hao L, Hui J, Xing Y. Genetic variation and microRNA targeting of A-to-I RNA editing fine tune human tissue transcriptomes. Genome Biol 2021; 22:77. [PMID: 33685485 PMCID: PMC7942016 DOI: 10.1186/s13059-021-02287-1] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2019] [Accepted: 02/02/2021] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND A-to-I RNA editing diversifies the transcriptome and has multiple downstream functional effects. Genetic variation contributes to RNA editing variability between individuals and has the potential to impact phenotypic variability. RESULTS We analyze matched genetic and transcriptomic data in 49 tissues across 437 individuals to identify RNA editing events that are associated with genetic variation. Using an RNA editing quantitative trait loci (edQTL) mapping approach, we identify 3117 unique RNA editing events associated with a cis genetic polymorphism. Fourteen percent of these edQTL events are also associated with genetic variation in their gene expression. A subset of these events are associated with genome-wide association study signals of complex traits or diseases. We determine that tissue-specific levels of ADAR and ADARB1 are able to explain a subset of tissue-specific edQTL events. We find that certain microRNAs are able to differentiate between the edited and unedited isoforms of their targets. Furthermore, microRNAs can generate an expression quantitative trait loci (eQTL) signal from an edQTL locus by microRNA-mediated transcript degradation in an editing-specific manner. By integrative analyses of edQTL, eQTL, and microRNA expression profiles, we computationally discover and experimentally validate edQTL-microRNA pairs for which the microRNA may generate an eQTL signal from an edQTL locus in a tissue-specific manner. CONCLUSIONS Our work suggests a mechanism in which RNA editing variability can influence the phenotypes of complex traits and diseases by altering the stability and steady-state level of critical RNA molecules.
Collapse
Affiliation(s)
- Eddie Park
- Center for Computational and Genomic Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
| | - Yan Jiang
- State Key Laboratory of Molecular Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, 200031 China
| | - Lili Hao
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Jingyi Hui
- State Key Laboratory of Molecular Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, 200031 China
| | - Yi Xing
- Center for Computational and Genomic Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA 19104 USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104 USA
| |
Collapse
|