1. Zhang M, Wang Y, Wu Q, Sun Y, Zhao C, Ge M, Zhou L, Zhang T, Zhang W, Qian Y, Ruan L, Zhao H. Time-course transcriptomic analysis reveals transcription factors involved in modulating nitrogen sensibility in maize. J Genet Genomics 2025; 52:400-410. [PMID: 39395686 DOI: 10.1016/j.jgg.2024.09.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/09/2024] [Revised: 09/29/2024] [Accepted: 09/30/2024] [Indexed: 10/14/2024]
Abstract
Nitrogen (N) serves both as a vital macronutrient and as a signaling molecule for plants. Identifying key regulators of N metabolism helps dissect its underlying mechanisms, which is essential for developing maize with high N use efficiency. Two maize lines, B73 and Ki11, show differential chlorate and low-N tolerance. Time-course transcriptomic analysis reveals that the expression of N utilization genes (NUGs) in B73 and Ki11 has distinct response patterns to nitrate variation. Coexpression network analysis reveals significant differences between B73 and Ki11 in the number of N response modules and in the regulatory networks of transcription factors (TFs). There are 23 unique TFs in B73 and 41 unique TFs in Ki11. MADS26 is a unique TF in the B73 N response network, with different expression levels and N response patterns in B73 and Ki11. Overexpression of MADS26 enhances sensitivity to chlorate and the utilization of nitrate in maize, at least partially explaining the differential chlorate tolerance and low-N sensitivity between B73 and Ki11. The findings in this work provide unique insights and promising candidates for maize breeding to reduce N overuse.
Affiliation(s)
- Mingliang Zhang
  - Institute of Crop Germplasm and Biotechnology, Jiangsu Provincial Key Laboratory of Agrobiology, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu 210014, China
- Yuancong Wang
  - Institute of Crop Germplasm and Biotechnology, Jiangsu Provincial Key Laboratory of Agrobiology, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu 210014, China
- Qi Wu
  - Institute of Crop Germplasm and Biotechnology, Jiangsu Provincial Key Laboratory of Agrobiology, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu 210014, China
- Yangming Sun
  - Institute of Crop Germplasm and Biotechnology, Jiangsu Provincial Key Laboratory of Agrobiology, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu 210014, China
- Chenxu Zhao
  - Institute of Crop Germplasm and Biotechnology, Jiangsu Provincial Key Laboratory of Agrobiology, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu 210014, China
- Min Ge
  - Institute of Crop Germplasm and Biotechnology, Jiangsu Provincial Key Laboratory of Agrobiology, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu 210014, China
- Ling Zhou
  - Institute of Crop Germplasm and Biotechnology, Jiangsu Provincial Key Laboratory of Agrobiology, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu 210014, China
- Tifu Zhang
  - Institute of Crop Germplasm and Biotechnology, Jiangsu Provincial Key Laboratory of Agrobiology, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu 210014, China
- Wei Zhang
  - Crop Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui 230041, China
- Yiliang Qian
  - Crop Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui 230041, China
- Long Ruan
  - Crop Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui 230041, China
- Han Zhao
  - Institute of Crop Germplasm and Biotechnology, Jiangsu Provincial Key Laboratory of Agrobiology, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu 210014, China
2. Quinones-Valdez G, Amoah K, Xiao X. Long-read RNA-seq demarcates cis- and trans-directed alternative RNA splicing. bioRxiv: the preprint server for biology 2024:2024.06.14.599101. [PMID: 38915585 PMCID: PMC11195283 DOI: 10.1101/2024.06.14.599101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/26/2024]
Abstract
Genetic regulation of alternative splicing constitutes an important link between genetic variation and disease. Nonetheless, RNA splicing is regulated by both cis-acting elements and trans-acting splicing factors. Determining which splicing events are directed primarily by cis- or trans-acting mechanisms will greatly inform our understanding of the genetic basis of disease. Here, we show that long-read RNA-seq, combined with our new method isoLASER, enables a clear segregation of cis- and trans-directed splicing events for individual samples. The genetic linkage of splicing is largely individual-specific, in stark contrast to the tissue-specific pattern of splicing profiles. Analysis of long-read RNA-seq data from human and mouse revealed thousands of cis-directed splicing events susceptible to genetic regulation. We highlight such events in the HLA genes, whose analysis was challenging with short-read data. We also highlight novel cis-directed splicing events in Alzheimer's disease-relevant genes such as MAPT and BIN1. Together, the clear demarcation of cis- and trans-directed splicing paves the way for future studies of the genetic basis of disease.
Affiliation(s)
- Giovanni Quinones-Valdez
  - Department of Integrative Biology and Physiology, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Kofi Amoah
  - Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Xinshu Xiao
  - Department of Integrative Biology and Physiology, University of California, Los Angeles, Los Angeles, CA 90095, USA
  - Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA
3. Nanni A, Titus-McQuillan J, Bankole KS, Pardo-Palacios F, Signor S, Vlaho S, Moskalenko O, Morse A, Rogers RL, Conesa A, McIntyre LM. Nucleotide-level distance metrics to quantify alternative splicing implemented in TranD. Nucleic Acids Res 2024; 52:e28. [PMID: 38340337 PMCID: PMC10954468 DOI: 10.1093/nar/gkae056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/21/2023] [Revised: 11/29/2023] [Accepted: 01/18/2024] [Indexed: 02/12/2024] [Open Access]
Abstract
Advances in affordable transcriptome sequencing, combined with better exon and gene prediction, have motivated many to compare transcription across the tree of life. We develop a mathematical framework to calculate complexity and compare transcript models. Structural features, i.e. intron retention (IR), donor/acceptor site variation, alternative exon cassettes, and alternative 5'/3' UTRs, are compared, and the distance between transcript models is calculated with nucleotide-level precision. All metrics are implemented in a PyPI package, TranD, and the output can be used to summarize splicing patterns for a transcriptome (1GTF) and between transcriptomes (2GTF). TranD output enables quantitative comparisons between: annotations augmented by empirical RNA-seq data and the original transcript models; transcript model prediction tools for long-read RNA-seq (e.g. FLAIR versus Isoseq3); alternate annotations for a species (e.g. RefSeq vs Ensembl); and closely related species. In C. elegans, Z. mays, D. melanogaster, D. simulans and H. sapiens, alternative exons were observed more frequently in combination with an alternative donor/acceptor than alone. Transcript models in RefSeq and Ensembl are linked, and both have unique transcript models with empirical support. D. melanogaster and D. simulans share many transcript models, and long-read RNA-seq data suggest that both species are under-annotated. We recommend combined references.
Affiliation(s)
- Adalena Nanni
  - Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL 32611, USA
  - University of Florida Genetics Institute, University of Florida, Gainesville, FL 32611, USA
- James Titus-McQuillan
  - Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, USA
- Kinfeosioluwa S Bankole
  - Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL 32611, USA
  - University of Florida Genetics Institute, University of Florida, Gainesville, FL 32611, USA
- Sarah Signor
  - Department of Biological Sciences, North Dakota State University, Fargo, ND, USA
- Srna Vlaho
  - Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA
- Oleksandr Moskalenko
  - University of Florida Research Computing, University of Florida, Gainesville, FL 32611, USA
- Alison M Morse
  - Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL 32611, USA
  - University of Florida Genetics Institute, University of Florida, Gainesville, FL 32611, USA
- Rebekah L Rogers
  - Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, USA
- Ana Conesa
  - Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Spain
- Lauren M McIntyre
  - Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL 32611, USA
  - University of Florida Genetics Institute, University of Florida, Gainesville, FL 32611, USA
4. Xu F, Liu S, Zhao A, Shang M, Wang Q, Jiang S, Cheng Q, Chen X, Zhai X, Zhang J, Wang X, Yan J. iFLAS: positive-unlabeled learning facilitates full-length transcriptome-based identification and functional exploration of alternatively spliced isoforms in maize. The New Phytologist 2024; 241:2606-2620. [PMID: 38291701 DOI: 10.1111/nph.19554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/24/2023] [Accepted: 01/06/2024] [Indexed: 02/01/2024]
Abstract
The advent of full-length transcriptome sequencing technologies has accelerated the discovery of novel splicing isoforms. However, existing alternative splicing (AS) tools are either tailored for short-read RNA-Seq data or designed for human and animal studies. The disparities in AS patterns between plants and animals still pose a challenge to the reliable identification and functional exploration of novel isoforms in plants. Here, we developed integrated full-length alternative splicing analysis (iFLAS), a plant-optimized AS toolkit that introduced a semi-supervised machine learning method known as positive-unlabeled (PU) learning to accurately identify novel isoforms. iFLAS also enables the investigation of AS functions from various perspectives, such as differential AS, poly(A) tail length, and allele-specific AS (ASAS) analyses. By applying iFLAS to three full-length transcriptome sequencing datasets, we systematically identified and functionally characterized maize (Zea mays) AS patterns. We found intron retention not only introduces premature termination codons, resulting in lower expression levels of isoforms, but may also regulate the length of 3'UTR and poly(A) tail, thereby affecting the functional differentiation of isoforms. Moreover, we observed distinct ASAS patterns in two genes within heterosis offspring, highlighting their potential value in breeding. These results underscore the broad applicability of iFLAS in plant full-length transcriptome-based AS research.
Affiliation(s)
- Feng Xu
  - State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
- Songyu Liu
  - State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
- Anwen Zhao
  - State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
- Meiqi Shang
  - State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
- Qian Wang
  - State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
- Shuqin Jiang
  - State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
- Qian Cheng
  - State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
- Xingming Chen
  - Molbreeding Biotechnology Co., Ltd, Shijiazhuang, Hebei Province, 051430, China
- Xiaoguang Zhai
  - Molbreeding Biotechnology Co., Ltd, Shijiazhuang, Hebei Province, 051430, China
- Jianan Zhang
  - Molbreeding Biotechnology Co., Ltd, Shijiazhuang, Hebei Province, 051430, China
- Xiangfeng Wang
  - State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
- Jun Yan
  - State Key Laboratory of Maize Bio-Breeding, National Maize Improvement Center, Frontiers Science Center for Molecular Design Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100094, China
5. Nguyen TA, Heng JWJ, Ng YT, Sun R, Fisher S, Oguz G, Kaewsapsak P, Xue S, Reversade B, Ramasamy A, Eisenberg E, Tan MH. Deep transcriptome profiling reveals limited conservation of A-to-I RNA editing in Xenopus. BMC Biol 2023; 21:251. [PMID: 37946231 PMCID: PMC10636886 DOI: 10.1186/s12915-023-01756-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/25/2023] [Accepted: 11/02/2023] [Indexed: 11/12/2023] [Open Access]
Abstract
BACKGROUND: Xenopus has served as a valuable model system for biomedical research over the past decades. Notably, ADAR was first detected in frog oocytes and embryos as an activity that unwinds RNA duplexes. However, the scope of A-to-I RNA editing by the ADAR enzymes in Xenopus remains underexplored. RESULTS: Here, we identify millions of editing events in Xenopus with high accuracy and systematically map the editome across developmental stages, adult organs, and species. We report diverse spatiotemporal patterns of editing, with deamination activity highest in early embryogenesis before zygotic genome activation and in the ovary. Strikingly, editing events are poorly conserved across different Xenopus species. Even sites that are detected in both X. laevis and X. tropicalis show largely divergent editing levels or developmental profiles. In protein-coding regions, only a small subset of sites, found mostly in the brain, are well conserved between frogs and mammals. CONCLUSIONS: Collectively, our work provides fresh insights into ADAR activity in vertebrates and suggests that species-specific editing may play a role in each animal's unique physiology or environmental adaptation.
Affiliation(s)
- Tram Anh Nguyen
  - School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore, Singapore
  - Genome Institute of Singapore, Agency for Science Technology and Research, Singapore, Singapore
- Jia Wei Joel Heng
  - School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore, Singapore
  - Genome Institute of Singapore, Agency for Science Technology and Research, Singapore, Singapore
- Yan Ting Ng
  - School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore, Singapore
  - School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
- Rui Sun
  - School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore, Singapore
  - Genome Institute of Singapore, Agency for Science Technology and Research, Singapore, Singapore
- Shira Fisher
  - The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel
- Gokce Oguz
  - Genome Institute of Singapore, Agency for Science Technology and Research, Singapore, Singapore
- Pornchai Kaewsapsak
  - Genome Institute of Singapore, Agency for Science Technology and Research, Singapore, Singapore
  - Department of Biochemistry, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
- Shifeng Xue
  - Institute of Molecular and Cell Biology, Agency for Science Technology and Research, Singapore, Singapore
  - Department of Biological Sciences, National University of Singapore, Singapore, Singapore
- Bruno Reversade
  - Genome Institute of Singapore, Agency for Science Technology and Research, Singapore, Singapore
  - Institute of Molecular and Cell Biology, Agency for Science Technology and Research, Singapore, Singapore
  - Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
  - Department of Medical Genetics, School of Medicine (KUSoM), Koç University, Istanbul, Turkey
- Adaikalavan Ramasamy
  - Genome Institute of Singapore, Agency for Science Technology and Research, Singapore, Singapore
- Eli Eisenberg
  - Raymond and Beverly Sackler School of Physics and Astronomy, Tel Aviv University, Tel Aviv, Israel
- Meng How Tan
  - School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore, Singapore
  - Genome Institute of Singapore, Agency for Science Technology and Research, Singapore, Singapore
  - HP-NTU Digital Manufacturing Corporate Lab, Nanyang Technological University, Singapore, Singapore
6. Pardo-Palacios FJ, Arzalluz-Luque A, Kondratova L, Salguero P, Mestre-Tomás J, Amorín R, Estevan-Morió E, Liu T, Nanni A, McIntyre L, Tseng E, Conesa A. SQANTI3: curation of long-read transcriptomes for accurate identification of known and novel isoforms. bioRxiv: the preprint server for biology 2023:2023.05.17.541248. [PMID: 37398077 PMCID: PMC10312485 DOI: 10.1101/2023.05.17.541248] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Indexed: 07/04/2023]
Abstract
The emergence of long-read RNA sequencing (lrRNA-seq) has provided an unprecedented opportunity to analyze transcriptomes at isoform resolution. However, the technology is not free from biases, and transcript models inferred from these data require quality control and curation. In this study, we introduce SQANTI3, a tool specifically designed to perform quality analysis on transcriptomes constructed using lrRNA-seq data. SQANTI3 provides an extensive naming framework to describe transcript model diversity in comparison to the reference transcriptome. Additionally, the tool incorporates a wide range of metrics to characterize various structural properties of transcript models, such as transcription start and end sites, splice junctions, and other structural features. These metrics can be utilized to filter out potential artifacts. Moreover, SQANTI3 includes a Rescue module that prevents the loss of known genes and transcripts exhibiting evidence of expression but displaying low-quality features. Lastly, SQANTI3 incorporates IsoAnnotLite, which enables functional annotation at the isoform level and facilitates functional iso-transcriptomics analyses. We demonstrate the versatility of SQANTI3 in analyzing different data types, isoform reconstruction pipelines, and sequencing platforms, and how it provides novel biological insights into isoform biology. The SQANTI3 software is available at https://github.com/ConesaLab/SQANTI3.
7. de Souza VBC, Jordan BT, Tseng E, Nelson EA, Hirschi KK, Sheynkman G, Robinson MD. Transformation of alignment files improves performance of variant callers for long-read RNA sequencing data. Genome Biol 2023; 24:91. [PMID: 37095564 PMCID: PMC10123983 DOI: 10.1186/s13059-023-02923-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/23/2022] [Accepted: 04/05/2023] [Indexed: 04/26/2023] [Open Access]
Abstract
Long-read RNA sequencing (lrRNA-seq) produces detailed information about full-length transcripts, including novel and sample-specific isoforms. Furthermore, there is an opportunity to call variants directly from lrRNA-seq data. However, most state-of-the-art variant callers have been developed for genomic DNA. Here, there are two objectives: first, we perform a mini-benchmark on GATK, DeepVariant, Clair3, and NanoCaller, primarily on PacBio Iso-Seq data but also on Nanopore and Illumina RNA-seq data; second, we propose a pipeline to process spliced-alignment files, making them suitable for variant calling with DNA-based callers. With such manipulations, high calling performance can be achieved using DeepVariant on Iso-Seq data.
Affiliation(s)
- Vladimir B C de Souza
  - Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, 8057, Zurich, Switzerland
- Ben T Jordan
  - Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
- Elizabeth A Nelson
  - Department of Cell Biology and Cardiovascular Research Center, University of Virginia School of Medicine, Charlottesville, VA, 22908, USA
- Karen K Hirschi
  - Department of Cell Biology and Cardiovascular Research Center, University of Virginia School of Medicine, Charlottesville, VA, 22908, USA
  - Department of Medicine, Yale University School of Medicine, New Haven, CT, 06511, USA
  - Department of Genetics, Yale University School of Medicine, New Haven, CT, 06511, USA
  - Yale Cardiovascular Research Center, Yale University School of Medicine, New Haven, CT, 06511, USA
- Gloria Sheynkman
  - Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
  - Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
  - Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
  - UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, VA, USA
- Mark D Robinson
  - Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, 8057, Zurich, Switzerland
8. Gladman N, Goodwin S, Chougule K, Richard McCombie W, Ware D. Era of gapless plant genomes: innovations in sequencing and mapping technologies revolutionize genomics and breeding. Curr Opin Biotechnol 2023; 79:102886. [PMID: 36640454 PMCID: PMC9899316 DOI: 10.1016/j.copbio.2022.102886] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Received: 10/24/2022] [Revised: 12/03/2022] [Accepted: 12/13/2022] [Indexed: 01/15/2023]
Abstract
Whole-genome sequencing and assembly have revolutionized plant genetics and molecular biology over the last two decades. However, significant shortcomings in first- and second-generation technology resulted in imperfect reference genomes: numerous and large gaps of low quality or undeterminable sequence in areas of highly repetitive DNA along with limited chromosomal phasing restricted the ability of researchers to characterize regulatory noncoding elements and genic regions that underwent recent duplication events. Recently, advances in long-read sequencing have resulted in the first gapless, telomere-to-telomere (T2T) assemblies of plant genomes. This leap forward has the potential to increase the speed and confidence of genomics and molecular experimentation while reducing costs for the research community.
Affiliation(s)
- Nicholas Gladman
  - U.S. Department of Agriculture-Agricultural Research Service, NEA Robert W. Holley Center for Agriculture and Health, 538 Tower Rd, Ithaca, NY 14853, USA; Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA
- Sara Goodwin
  - Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA
- Kapeel Chougule
  - Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA
- Doreen Ware
  - U.S. Department of Agriculture-Agricultural Research Service, NEA Robert W. Holley Center for Agriculture and Health, 538 Tower Rd, Ithaca, NY 14853, USA; Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, NY 11724, USA
9. Tseng E, Underwood JG, Evans Hutzenbiler BD, Trojahn S, Kingham B, Shevchenko O, Bernberg E, Vierra M, Robbins CT, Jansen HT, Kelley JL. Long-read isoform sequencing reveals tissue-specific isoform expression between active and hibernating brown bears (Ursus arctos). G3 (Bethesda, Md.) 2022; 12:6472356. [PMID: 35100340 PMCID: PMC9210309 DOI: 10.1093/g3journal/jkab422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/12/2021] [Accepted: 11/17/2021] [Indexed: 11/13/2022]
Abstract
Understanding hibernation in brown bears (Ursus arctos) can provide insight into some human diseases. During hibernation, brown bears experience periods of insulin resistance, physical inactivity, extreme bradycardia, obesity, and the absence of urine production. These states closely mimic aspects of human diseases such as type 2 diabetes, muscle atrophy, as well as renal and heart failure. The reversibility of these states from hibernation to active season enables the identification of mediators with possible therapeutic value for humans. Recent studies have identified genes and pathways that are differentially expressed between active and hibernation seasons in bears. However, little is known about the role of differential expression of gene isoforms on hibernation physiology. To identify both distinct and novel mRNA isoforms, full-length RNA-sequencing (Iso-Seq) was performed on adipose, skeletal muscle, and liver from three individual bears sampled during both active and hibernation seasons. The existing reference genome annotation was improved by combining it with the Iso-Seq data. Short-read RNA-sequencing data from six individuals were mapped to the new reference annotation to quantify differential isoform usage (DIU) between tissues and seasons. We identified differentially expressed isoforms in all three tissues, to varying degrees. Adipose had a high level of DIU with isoform switching, regardless of whether the genes were differentially expressed. Our analyses revealed that DIU, even in the absence of differential gene expression, is an important mechanism for modulating genes during hibernation. These findings demonstrate the value of isoform expression studies and will serve as the basis for deeper exploration into hibernation biology.
Affiliation(s)
- Brandon D Evans Hutzenbiler
  - Department of Integrative Physiology and Neuroscience, Washington State University, Pullman, WA 99164, USA; School of the Environment, Washington State University, Pullman, WA 99164, USA
- Shawn Trojahn
  - School of Biological Sciences, Washington State University, Pullman, WA 99164, USA
- Brewster Kingham
  - Sequencing & Genotyping Center, Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
- Olga Shevchenko
  - Sequencing & Genotyping Center, Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
- Erin Bernberg
  - Sequencing & Genotyping Center, Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
- Charles T Robbins
  - School of the Environment, Washington State University, Pullman, WA 99164, USA; School of Biological Sciences, Washington State University, Pullman, WA 99164, USA
- Heiko T Jansen
  - Department of Integrative Physiology and Neuroscience, Washington State University, Pullman, WA 99164, USA
- Joanna L Kelley
  - School of Biological Sciences, Washington State University, Pullman, WA 99164, USA
10. Mendieta JP, Marand AP, Ricci WA, Zhang X, Schmitz RJ. Leveraging histone modifications to improve genome annotations. G3 (Bethesda, Md.) 2021; 11:jkab263. [PMID: 34568920 PMCID: PMC8473982 DOI: 10.1093/g3journal/jkab263] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 05/21/2021] [Accepted: 07/15/2021] [Indexed: 12/27/2022]
Abstract
Accurate genome annotations are essential to modern biology; however, they remain challenging to produce. Variation in gene structure and expression across species, as well as within an organism, makes correctly annotating genes arduous, an issue exacerbated by pitfalls in current in silico methods. These issues necessitate complementary approaches to add confidence and rectify potential misannotations. Integration of epigenomic data into genome annotation is one such approach. In this study, we utilized sets of histone modification data, which are precisely distributed at either gene bodies or promoters, to evaluate the annotation of the Zea mays genome. We leveraged these data genome wide, allowing for identification of annotations discordant with empirical data. In total, 13,159 annotation discrepancies were found in Z. mays upon integrating data across three different tissues, which were corroborated using RNA-based approaches. Upon correction, genes were extended by an average of 2128 base pairs, and we identified 2529 novel genes. Application of this method to five additional plant genomes identified a series of misannotations as well as novel genes, including 13,836 in Asparagus officinalis, 2724 in Setaria viridis, 2446 in Sorghum bicolor, 8631 in Glycine max, and 2585 in Phaseolus vulgaris. This study demonstrates that histone modification data can be leveraged to rapidly improve current genome annotations across diverse plant lineages.
Affiliation(s)
- William A Ricci
  - Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
- Xuan Zhang
  - Department of Genetics, University of Georgia, Athens, GA 30602, USA
- Robert J Schmitz
  - Department of Genetics, University of Georgia, Athens, GA 30602, USA
11. Cosentino RO, Brink BG, Siegel TN. Allele-specific assembly of a eukaryotic genome corrects apparent frameshifts and reveals a lack of nonsense-mediated mRNA decay. NAR Genom Bioinform 2021; 3:lqab082. [PMID: 34541528 PMCID: PMC8445201 DOI: 10.1093/nargab/lqab082] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 06/18/2021] [Revised: 08/25/2021] [Accepted: 09/06/2021] [Indexed: 11/14/2022] [Open Access]
Abstract
To date, most reference genomes represent a mosaic consensus sequence in which the homologous chromosomes are collapsed into one sequence. This approach produces sequence artefacts and impedes analyses of allele-specific mechanisms. Here, we report an allele-specific genome assembly of the diploid parasite Trypanosoma brucei and reveal allelic variants affecting gene expression. Using long-read sequencing and chromosome conformation capture data, we could assign 99.5% of all heterozygote variants to a specific homologous chromosome and build a 66 Mb long allele-specific genome assembly. The phasing of haplotypes allowed us to resolve hundreds of artefacts present in the previous mosaic consensus assembly. In addition, it revealed allelic recombination events, visible as regions of low allelic heterozygosity, enabling the lineage tracing of T. brucei isolates. Interestingly, analyses of transcriptome and translatome data of genes with allele-specific premature termination codons point to the absence of a nonsense-mediated decay mechanism in trypanosomes. Taken together, this study delivers a reference quality allele-specific genome assembly of T. brucei and demonstrates the importance of such assemblies for the study of gene expression control. We expect the new genome assembly will increase the awareness of allele-specific phenomena and provide a platform to investigate them.
Affiliation(s)
- Raúl O Cosentino
- Division of Experimental Parasitology, Faculty of Veterinary Medicine, Ludwig-Maximilians-Universität in Munich, Lena-Christ-Str. 48, Planegg-Martinsried 82152, Germany
- Benedikt G Brink
- Division of Experimental Parasitology, Faculty of Veterinary Medicine, Ludwig-Maximilians-Universität in Munich, Lena-Christ-Str. 48, Planegg-Martinsried 82152, Germany
- T Nicolai Siegel
- Division of Experimental Parasitology, Faculty of Veterinary Medicine, Ludwig-Maximilians-Universität in Munich, Lena-Christ-Str. 48, Planegg-Martinsried 82152, Germany
|
12
|
Freire R, Weisweiler M, Guerreiro R, Baig N, Hüttel B, Obeng-Hinneh E, Renner J, Hartje S, Muders K, Truberg B, Rosen A, Prigge V, Bruckmüller J, Lübeck J, Stich B. Chromosome-scale reference genome assembly of a diploid potato clone derived from an elite variety. G3 Genes Genomes Genetics 2021; 11:6371871. [PMID: 34534288 PMCID: PMC8664475 DOI: 10.1093/g3journal/jkab330] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 05/31/2021] [Accepted: 09/08/2021] [Indexed: 01/27/2023]
Abstract
Potato (Solanum tuberosum L.) is one of the most important crops with a worldwide production of 370 million metric tons. The objectives of this study were (1) to create a high-quality consensus sequence across the two haplotypes of a diploid clone derived from a tetraploid elite variety and assess the sequence divergence from the available potato genome assemblies, as well as among the two haplotypes; (2) to evaluate the new assembly’s usefulness for various genomic methods; and (3) to assess the performance of phasing in diploid and tetraploid clones, using linked-read sequencing technology. We used PacBio long reads coupled with 10x Genomics reads and proximity ligation scaffolding to create the dAg1_v1.0 reference genome sequence. With a final assembly size of 812 Mb, where 750 Mb are anchored to 12 chromosomes, our assembly is larger than other available potato reference sequences and high proportions of properly paired reads were observed for clones unrelated by pedigree to dAg1. Comparisons of the new dAg1_v1.0 sequence to other potato genome sequences point out the high divergence between the different potato varieties and illustrate the potential of using dAg1_v1.0 sequence in breeding applications.
Affiliation(s)
- Ruth Freire
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225 Düsseldorf, Germany
- Marius Weisweiler
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225 Düsseldorf, Germany
- Ricardo Guerreiro
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225 Düsseldorf, Germany
- Nadia Baig
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225 Düsseldorf, Germany
- Bruno Hüttel
- Max Planck-Genome-centre Cologne, Max Planck Institute for Plant Breeding, Carl-von-Linne-Weg 10, 50829 Köln, Germany
- Evelyn Obeng-Hinneh
- Böhm-Nordkartoffel Agrarproduktion GmbH & Co. OHG, Strehlow 19, 17111 Hohenmocker, Germany
- Juliane Renner
- Böhm-Nordkartoffel Agrarproduktion GmbH & Co. OHG, Strehlow 19, 17111 Hohenmocker, Germany
- Stefanie Hartje
- Böhm-Nordkartoffel Agrarproduktion GmbH & Co. OHG, Strehlow 19, 17111 Hohenmocker, Germany
- Katja Muders
- Nordring-Kartoffelzucht- und Vermehrungs-GmbH, Parkweg 4, 18190 Sanitz, Germany
- Bernd Truberg
- Nordring-Kartoffelzucht- und Vermehrungs-GmbH, Parkweg 4, 18190 Sanitz, Germany
- Arne Rosen
- Nordring-Kartoffelzucht- und Vermehrungs-GmbH, Parkweg 4, 18190 Sanitz, Germany
- Vanessa Prigge
- SaKa Pflanzenzucht GmbH & Co. KG, Zuchtstation Windeby, Eichenallee 9, 24340 Windeby, Germany
- Jens Lübeck
- Solana Research GmbH, Eichenallee 9, 24340 Windeby, Germany
- Benjamin Stich
- Institute for Quantitative Genetics and Genomics of Plants, Universitätsstraße 1, 40225 Düsseldorf, Germany
- Cluster of Excellence on Plant Sciences, From Complex Traits towards Synthetic Modules, Universitätsstraße 1, 40225 Düsseldorf, Germany
|
13
|
Williams AM, Itgen MW, Broz AK, Carter OG, Sloan DB. Long-read transcriptome and other genomic resources for the angiosperm Silene noctiflora. G3 (Bethesda, Md.) 2021; 11:jkab189. [PMID: 34849814 PMCID: PMC8496259 DOI: 10.1093/g3journal/jkab189] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 04/23/2021] [Accepted: 05/20/2021] [Indexed: 01/04/2023]
Abstract
The angiosperm genus Silene is a model system for several traits of ecological and evolutionary significance in plants, including breeding system and sex chromosome evolution, host-pathogen interactions, invasive species biology, heavy metal tolerance, and cytonuclear interactions. Despite its importance, genomic resources for this large genus of approximately 850 species are scarce, with only one published whole-genome sequence (from the dioecious species Silene latifolia). Here, we provide genomic and transcriptomic resources for a hermaphroditic representative of this genus (S. noctiflora), including a PacBio Iso-Seq transcriptome, which uses long-read, single-molecule sequencing technology to analyze full-length mRNA transcripts. Using these data, we have assembled and annotated high-quality full-length cDNA sequences for approximately 14,126 S. noctiflora genes and 25,317 isoforms. We demonstrated the utility of these data to distinguish between recent and highly similar gene duplicates by identifying novel paralogous genes in an essential protease complex. Furthermore, we provide a draft assembly for the approximately 2.7-Gb genome of this species, which is near the upper range of genome-size values reported for diploids in this genus and threefold larger than the 0.9-Gb genome of Silene conica, another species in the same subgenus. Karyotyping confirmed that S. noctiflora is a diploid, indicating that its large genome size is not due to polyploidization. These resources should facilitate further study and development of this genus as a model in plant ecology and evolution.
Affiliation(s)
- Alissa M Williams
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
- Cell and Molecular Biology Graduate Program, Colorado State University, Fort Collins, CO 80523, USA
- Michael W Itgen
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
- Amanda K Broz
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
- Olivia G Carter
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
- Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
|
14
|
Feng JW, Lu Y, Shao L, Zhang J, Li H, Chen LL. Phasing analysis of the transcriptome and epigenome in a rice hybrid reveals the inheritance and difference in DNA methylation and allelic transcription regulation. Plant Communications 2021; 2:100185. [PMID: 34327321 PMCID: PMC8299081 DOI: 10.1016/j.xplc.2021.100185] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 12/25/2020] [Revised: 03/14/2021] [Accepted: 04/13/2021] [Indexed: 05/16/2023]
Abstract
Hybrids are always a focus of botanical research and have a high practical value in agricultural production. To better understand allele regulation and differences in DNA methylation in hybrids, we developed a phasing pipeline for hybrid rice based on two parental genomes (PP2PG), which is applicable for Iso-Seq, RNA-Seq, and Bisulfite sequencing (BS-Seq). Using PP2PG, we analyzed differences in gene transcription, alternative splicing, and DNA methylation in an allele-specific manner between parents and progeny or different progeny alleles. The phasing of Iso-Seq data provided a great advantage in separating the whole gene structure and producing a significantly higher separation ratio than RNA-Seq. The interaction of hybrid alleles was studied by constructing an allele co-expression network that revealed the dominant allele effect in the network. The expression variation between parents and the parental alleles in progeny showed tissue- or environment-specific patterns, which implied a preference for trans-acting regulation under different conditions. In addition, by comparing allele-specific DNA methylation, we found that CG methylation was more likely to be inherited than CHG and CHH methylation, and its enrichment in genic regions was connected to gene structure. In addition to an effective phasing pipeline, we also identified differentiation in OsWAK38 gene structure that may have led to the expansion of allele functions in hybrids. In summary, we developed a phasing pipeline and provided valuable insights into alternative splicing, interaction networks, trans-acting regulation, and the inheritance of DNA methylation in hybrid rice.
Affiliation(s)
- Jia-Wu Feng
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
- College of Life Science and Technology, Guangxi University, Nanning 530004, China
- Yue Lu
- Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-Innovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional Genomics of the Ministry of Education, Yangzhou University, Yangzhou 225009, China
- Lin Shao
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
- Jianwei Zhang
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
- Huan Li
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
- Corresponding author
- Ling-Ling Chen
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
- College of Life Science and Technology, Guangxi University, Nanning 530004, China
- Corresponding author
|
15
|
Chintalaphani SR, Pineda SS, Deveson IW, Kumar KR. An update on the neurological short tandem repeat expansion disorders and the emergence of long-read sequencing diagnostics. Acta Neuropathol Commun 2021; 9:98. [PMID: 34034831 PMCID: PMC8145836 DOI: 10.1186/s40478-021-01201-x] [Citation(s) in RCA: 106] [Impact Index Per Article: 26.5] [Received: 03/31/2021] [Accepted: 05/17/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND: Short tandem repeat (STR) expansion disorders are an important cause of human neurological disease. They have an established role in more than 40 different phenotypes including the myotonic dystrophies, Fragile X syndrome, Huntington's disease, the hereditary cerebellar ataxias, amyotrophic lateral sclerosis and frontotemporal dementia.
MAIN BODY: STR expansions are difficult to detect and may explain unsolved diseases, as highlighted by recent findings including: the discovery of a biallelic intronic 'AAGGG' repeat in RFC1 as the cause of cerebellar ataxia, neuropathy, and vestibular areflexia syndrome (CANVAS); and the finding of 'CGG' repeat expansions in NOTCH2NLC as the cause of neuronal intranuclear inclusion disease and a range of clinical phenotypes. However, established laboratory techniques for diagnosis of repeat expansions (repeat-primed PCR and Southern blot) are cumbersome, low-throughput and poorly suited to parallel analysis of multiple gene regions. While next generation sequencing (NGS) has been increasingly used, established short-read NGS platforms (e.g., Illumina) are unable to genotype large and/or complex repeat expansions. Long-read sequencing platforms recently developed by Oxford Nanopore Technologies and Pacific Biosciences promise to overcome these limitations to deliver enhanced diagnosis of repeat expansion disorders in a rapid and cost-effective fashion.
CONCLUSION: We anticipate that long-read sequencing will rapidly transform the detection of short tandem repeat expansion disorders for both clinical diagnosis and gene discovery.
Affiliation(s)
- Sanjog R. Chintalaphani
- School of Medicine, University of New South Wales, Sydney, 2052 Australia
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Sandy S. Pineda
- Garvan-Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Brain and Mind Centre, University of Sydney, Camperdown, NSW 2050 Australia
- Ira W. Deveson
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Faculty of Medicine, St Vincent’s Clinical School, University of New South Wales, Sydney, NSW 2010 Australia
- Kishore R. Kumar
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Molecular Medicine Laboratory and Neurology Department, Central Clinical School, Concord Repatriation General Hospital, University of Sydney, Concord, NSW 2137 Australia
|
16
|
Fujita MK, Singhal S, Brunes TO, Maldonado JA. Evolutionary Dynamics and Consequences of Parthenogenesis in Vertebrates. Annual Review of Ecology, Evolution, and Systematics 2020. [DOI: 10.1146/annurev-ecolsys-011720-114900] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Indexed: 11/09/2022]
Abstract
Parthenogenesis is asexual reproduction without any required participation from males and, as such, is a null model for sexual reproduction. In a comparative context, we can expand our understanding of the evolution and ecology of sex by investigating the consequences of parthenogenesis. In this review, we examine the theoretical predictions of and empirical results on the evolution of asexual reproduction in vertebrates, focusing on recent studies addressing the origins and geographic spread of parthenogenetic lineages and the genomic consequences of an asexual life history. With advances in computational methods and genome technologies, researchers are poised to make rapid and significant progress in studying the origin and evolution of parthenogenesis in vertebrates, thus providing an important perspective on understanding biodiversity patterns of both asexual and sexual populations.
Affiliation(s)
- Matthew K. Fujita
- Amphibian and Reptile Diversity Research Center and Department of Biology, University of Texas at Arlington, Arlington, Texas 76019, USA
- Sonal Singhal
- Department of Biology, California State University, Dominguez Hills, Carson, California 90747, USA
- Tuliana O. Brunes
- Departamento de Zoologia, Instituto de Biociências, Universidade de São Paulo, São Paulo 05508-090, Brazil
- Jose A. Maldonado
- Amphibian and Reptile Diversity Research Center and Department of Biology, University of Texas at Arlington, Arlington, Texas 76019, USA
|