1
|
Sneddon A, Ravindran A, Shanmuganandam S, Kanchi M, Hein N, Jiang S, Shirokikh N, Eyras E. Biochemical-free enrichment or depletion of RNA classes in real-time during direct RNA sequencing with RISER. Nat Commun 2024; 15:4422. [PMID: 38789440 PMCID: PMC11126589 DOI: 10.1038/s41467-024-48673-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 05/07/2024] [Indexed: 05/26/2024] Open
Abstract
The heterogeneous composition of cellular transcriptomes poses a major challenge for detecting weakly expressed RNA classes, as they can be obscured by abundant RNAs. Although biochemical protocols can enrich or deplete specified RNAs, they are time-consuming, expensive and can compromise RNA integrity. Here we introduce RISER, a biochemical-free technology for the real-time enrichment or depletion of RNA classes. RISER performs selective rejection of molecules during direct RNA sequencing by identifying RNA classes directly from nanopore signals with deep learning and communicating with the sequencing hardware in real time. By targeting the dominant messenger and mitochondrial RNA classes for depletion, RISER reduces their respective read counts by more than 85%, resulting in an increase in sequencing depth of 47% on average for long non-coding RNAs. We also apply RISER for the depletion of globin mRNA in whole blood, achieving a decrease in globin reads by more than 90% as well as an increase in non-globin reads by 16% on average. Furthermore, using a GPU or a CPU, RISER is faster than GPU-accelerated basecalling and mapping. RISER's modular and retrainable software and intuitive command-line interface allow easy adaptation to other RNA classes. RISER is available at https://github.com/comprna/riser .
Collapse
Affiliation(s)
- Alexandra Sneddon
- EMBL Australia Partner Laboratory Network at the Australian National University, Canberra, ACT 2601, Australia
- Centre for Computational Biomedical Sciences, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia
- The Shine-Dalgarno Centre for RNA Innovation, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia
| | - Agin Ravindran
- EMBL Australia Partner Laboratory Network at the Australian National University, Canberra, ACT 2601, Australia
- Centre for Computational Biomedical Sciences, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia
- The Shine-Dalgarno Centre for RNA Innovation, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia
| | - Somasundhari Shanmuganandam
- Department of Immunity, Inflammation and Infection, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia
- Centre for Personalised Immunology, NHMRC Centre for Research Excellence, Australian National University, Canberra, ACT 2601, Australia
| | - Madhu Kanchi
- The Shine-Dalgarno Centre for RNA Innovation, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia
| | - Nadine Hein
- ACRF Department of Cancer Biology and Therapeutics, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia
| | - Simon Jiang
- Department of Immunity, Inflammation and Infection, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia
- Centre for Personalised Immunology, NHMRC Centre for Research Excellence, Australian National University, Canberra, ACT 2601, Australia
- Department of Renal Medicine, The Canberra Hospital, Canberra, ACT 2605, Australia
| | - Nikolay Shirokikh
- The Shine-Dalgarno Centre for RNA Innovation, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia.
| | - Eduardo Eyras
- EMBL Australia Partner Laboratory Network at the Australian National University, Canberra, ACT 2601, Australia.
- Centre for Computational Biomedical Sciences, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia.
- The Shine-Dalgarno Centre for RNA Innovation, The John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia.
| |
Collapse
|
2
|
Adams M, Vollmers C. Generation and analysis of a mouse multi-tissue genome annotation atlas. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.31.578267. [PMID: 38352519 PMCID: PMC10862843 DOI: 10.1101/2024.01.31.578267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/24/2024]
Abstract
Generating an accurate and complete genome annotation for an organism is complex because the cells within each tissue can express a unique set of transcript isoforms from a unique set of genes. A comprehensive genome annotation should contain information on what tissues express what transcript isoforms at what level. This tissue-level isoform information can then inform a wide range of research questions as well as experiment designs. Long-read sequencing technology combined with advanced full-length cDNA library preparation methods has now achieved throughput and accuracy where generating these types of annotations is achievable. Here, we show this by generating a genome annotation of the mouse (Mus musculus). We used the nanopore-based R2C2 long-read sequencing method to generate 64 million highly accurate full length cDNA consensus reads - averaging 5.4 million reads per tissue for a dozen tissues. Using the Mandalorion tool we processed these reads to generate the Tissue-level Atlas of Mouse Isoforms (TAMI - available at https://genome.ucsc.edu/s/vollmers/TAMI) which we believe will be a valuable complement to conventional, manually curated reference genome annotations.
Collapse
Affiliation(s)
- Matthew Adams
- Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz
| | | |
Collapse
|
3
|
Zhang C, Fang Y, Chen W, Chen Z, Zhang Y, Xie Y, Chen W, Xie Z, Guo M, Wang J, Tan C, Wang H, Tang C. Improving the RNA velocity approach with single-cell RNA lifecycle (nascent, mature and degrading RNAs) sequencing technologies. Nucleic Acids Res 2023; 51:e112. [PMID: 37941145 PMCID: PMC10711548 DOI: 10.1093/nar/gkad969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 09/27/2023] [Accepted: 10/14/2023] [Indexed: 11/10/2023] Open
Abstract
We presented an experimental method called FLOUR-seq, which combines BD Rhapsody and nanopore sequencing to detect the RNA lifecycle (including nascent, mature, and degrading RNAs) in cells. Additionally, we updated our HIT-scISOseq V2 to discover a more accurate RNA lifecycle using 10x Chromium and Pacbio sequencing. Most importantly, to explore how single-cell full-length RNA sequencing technologies could help improve the RNA velocity approach, we introduced a new algorithm called 'Region Velocity' to more accurately configure cellular RNA velocity. We applied this algorithm to study spermiogenesis and compared the performance of FLOUR-seq with Pacbio-based HIT-scISOseq V2. Our findings demonstrated that 'Region Velocity' is more suitable for analyzing single-cell full-length RNA data than traditional RNA velocity approaches. These novel methods could be useful for researchers looking to discover full-length RNAs in single cells and comprehensively monitor RNA lifecycle in cells.
Collapse
Affiliation(s)
| | | | - Weitian Chen
- BGI, Shenzhen 518000, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China
| | | | - Ying Zhang
- Guangdong Provincial Reproductive Science Institute (Guangdong Provincial Fertility Hospital), Guangzhou, China; NHC Key Laboratory of Male Reproduction and Genetics, Guangzhou, China
| | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Schwenk V, Leal Silva RM, Scharf F, Knaust K, Wendlandt M, Häusser T, Pickl JMA, Steinke-Lange V, Laner A, Morak M, Holinski-Feder E, Wolf DA. Transcript capture and ultradeep long-read RNA sequencing (CAPLRseq) to diagnose HNPCC/Lynch syndrome. J Med Genet 2023; 60:747-759. [PMID: 36593122 PMCID: PMC10423559 DOI: 10.1136/jmg-2022-108931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 12/10/2022] [Indexed: 01/03/2023]
Abstract
PURPOSE Whereas most human genes encode multiple mRNA isoforms with distinct function, clinical workflows for assessing this heterogeneity are not readily available. This is a substantial shortcoming, considering that up to 25% of disease-causing gene variants are suspected of disrupting mRNA splicing or mRNA abundance. Long-read sequencing can readily portray mRNA isoform diversity, but its sensitivity is relatively low due to insufficient transcriptome penetration. METHODS We developed and applied capture-based target enrichment from patient RNA samples combined with Oxford Nanopore long-read sequencing for the analysis of 123 hereditary cancer transcripts (capture and ultradeep long-read RNA sequencing (CAPLRseq)). RESULTS Validating CAPLRseq, we confirmed 17 cases of hereditary non-polyposis colorectal cancer/Lynch syndrome based on the demonstration of splicing defects and loss of allele expression of mismatch repair genes MLH1, PMS2, MSH2 and MSH6. Using CAPLRseq, we reclassified two variants of uncertain significance in MSH6 and PMS2 as either likely pathogenic or benign. CONCLUSION Our data show that CAPLRseq is an automatable and adaptable workflow for effective transcriptome-based identification of disease variants in a clinical diagnostic setting.
Collapse
Affiliation(s)
| | | | | | | | | | - Tanja Häusser
- Medizinisch Genetisches Zentrum (MGZ), Munich, Germany
| | - Julia M A Pickl
- Medizinisch Genetisches Zentrum (MGZ), Munich, Germany
- Klinikum der Universität München, Munich, Germany
| | | | - Andreas Laner
- Medizinisch Genetisches Zentrum (MGZ), Munich, Germany
| | - Monika Morak
- Medizinisch Genetisches Zentrum (MGZ), Munich, Germany
- Klinikum der Universität München, Munich, Germany
| | - Elke Holinski-Feder
- Medizinisch Genetisches Zentrum (MGZ), Munich, Germany
- Medizinische Klinik und Poliklinik IV, Campus Innenstadt, Klinikum der Universität München, Munich, Germany
| | - Dieter A Wolf
- Medizinisch Genetisches Zentrum (MGZ), Munich, Germany
- Department of Medicine II, Technical University Munich, Munich, Germany
| |
Collapse
|
5
|
Volden R, Schimke KD, Byrne A, Dubocanin D, Adams M, Vollmers C. Identifying and quantifying isoforms from accurate full-length transcriptome sequencing reads with Mandalorion. Genome Biol 2023; 24:167. [PMID: 37461039 DOI: 10.1186/s13059-023-02999-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 06/28/2023] [Indexed: 07/20/2023] Open
Abstract
In this manuscript, we introduce and benchmark Mandalorion v4.1 for the identification and quantification of full-length transcriptome sequencing reads. It further improves upon the already strong performance of Mandalorion v3.6 used in the LRGASP consortium challenge. By processing real and simulated data, we show three main features of Mandalorion: first, Mandalorion-based isoform identification has very high precision and maintains high recall even in the absence of any genome annotation. Second, isoform read counts as quantified by Mandalorion show a high correlation with simulated read counts. Third, isoforms identified by Mandalorion closely reflect the full-length transcriptome sequencing data sets they are based on.
Collapse
Affiliation(s)
- Roger Volden
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
- Present Address: Pacific Biosciences, Menlo Park, CA, 94025, USA
| | - Kayla D Schimke
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Ashley Byrne
- Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
- Present Address: Genentech, San Francisco, CA, 94080, USA
| | - Danilo Dubocanin
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Matthew Adams
- Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Christopher Vollmers
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, 95064, USA.
| |
Collapse
|
6
|
Zee A, Deng DZQ, Adams M, Schimke KD, Corbett-Detig R, Russell SL, Zhang X, Schmitz RJ, Vollmers C. Sequencing Illumina libraries at high accuracy on the ONT MinION using R2C2. Genome Res 2022; 32:2092-2106. [PMID: 36351772 PMCID: PMC9808628 DOI: 10.1101/gr.277031.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Accepted: 10/21/2022] [Indexed: 11/11/2022]
Abstract
High-throughput short-read sequencing has taken on a central role in research and diagnostics. Hundreds of different assays take advantage of Illumina short-read sequencers, the predominant short-read sequencing technology available today. Although other short-read sequencing technologies exist, the ubiquity of Illumina sequencers in sequencing core facilities and the high capital costs of these technologies have limited their adoption. Among a new generation of sequencing technologies, Oxford Nanopore Technologies (ONT) holds a unique position because the ONT MinION, an error-prone long-read sequencer, is associated with little to no capital cost. Here we show that we can make short-read Illumina libraries compatible with the ONT MinION by using the rolling circle to concatemeric consensus (R2C2) method to circularize and amplify the short library molecules. This results in longer DNA molecules containing tandem repeats of the original short library molecules. This longer DNA is ideally suited for the ONT MinION, and after sequencing, the tandem repeats in the resulting raw reads can be converted into high-accuracy consensus reads with similar error rates to that of the Illumina MiSeq. We highlight this capability by producing and benchmarking RNA-seq, ChIP-seq, and regular and target-enriched Tn5 libraries. We also explore the use of this approach for rapid evaluation of sequencing library metrics by implementing a real-time analysis workflow.
Collapse
Affiliation(s)
- Alexander Zee
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Dori Z Q Deng
- Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Matthew Adams
- Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Kayla D Schimke
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Russell Corbett-Detig
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Shelbi L Russell
- Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Xuan Zhang
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
| | - Robert J Schmitz
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
| | - Christopher Vollmers
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA
| |
Collapse
|
7
|
Lin J, Ge L, Mei X, Niu Y, Chen C, Hou S, Liu X. Integrated ONT Full-Length Transcriptome and Metabolism Reveal the Mechanism Affecting Ovulation in Muscovy Duck (Cairina moschata). Front Vet Sci 2022; 9:890979. [PMID: 35873698 PMCID: PMC9305713 DOI: 10.3389/fvets.2022.890979] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Accepted: 06/16/2022] [Indexed: 11/13/2022] Open
Abstract
Ovulation is a complicated physiological process that is regulated by a multitude of different pathways. In comparison to mammalian studies, there are few reports of ovulation in Muscovy ducks, and the molecular mechanism of ovarian development remained unclear. In order to identify candidate genes and metabolites related to Muscovy duck follicular ovulation, the study combined Oxford Nanopore Technologies (ONT) full-length transcriptome and metabolomics to analyze the differences in gene expression and metabolite accumulation in the ovaries between pre-ovulation (PO) and consecutive ovulation (CO) Muscovy ducks. 83 differentially accumulated metabolites (DAMs) were identified using metabolomics analysis, 33 of which are related to lipids. Combined with data from previous transcriptomic analyses found that DEGs and DAMs were particularly enriched in processes including the regulation of glycerophospholipid metabolism pathway, arachidonic acid metabolic pathway and the steroid biosynthetic pathway. In summary, the novel potential mechanisms that affect ovulation in Muscovy ducks may be related to lipid metabolism, and the findings provide new insights into the mechanisms of ovulation in waterfowl and will contribute to a better understanding of changes in the waterfowl ovarian development regulatory network.
Collapse
Affiliation(s)
- Junyuan Lin
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Liyan Ge
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Xiang Mei
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Yurui Niu
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Chu Chen
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Shuisheng Hou
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
- Ministry of Agriculture Key Laboratory of Animal Genetics Breeding and Reproduction (Poultry), Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
- *Correspondence: Shuisheng Hou
| | - Xiaolin Liu
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
- Xiaolin Liu
| |
Collapse
|
8
|
Laidre KL, Supple MA, Born EW, Regehr EV, Wiig Ø, Ugarte F, Aars J, Dietz R, Sonne C, Hegelund P, Isaksen C, Akse GB, Cohen B, Stern HL, Moon T, Vollmers C, Corbett-Detig R, Paetkau D, Shapiro B. Glacial ice supports a distinct and undocumented polar bear subpopulation persisting in late 21st-century sea-ice conditions. Science 2022; 376:1333-1338. [PMID: 35709290 DOI: 10.1126/science.abk2793] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Polar bears are susceptible to climate warming because of their dependence on sea ice, which is declining rapidly. We present the first evidence for a genetically distinct and functionally isolated group of polar bears in Southeast Greenland. These bears occupy sea-ice conditions resembling those projected for the High Arctic in the late 21st century, with an annual ice-free period that is >100 days longer than the estimated fasting threshold for the species. Whereas polar bears in most of the Arctic depend on annual sea ice to catch seals, Southeast Greenland bears have a year-round hunting platform in the form of freshwater glacial mélange. This suggests that marine-terminating glaciers, although of limited availability, may serve as previously unrecognized climate refugia. Conservation of Southeast Greenland polar bears, which meet criteria for recognition as the world's 20th polar bear subpopulation, is necessary to preserve the genetic diversity and evolutionary potential of the species.
Collapse
Affiliation(s)
- Kristin L Laidre
- Polar Science Center, Applied Physics Laboratory, University of Washington, Seattle, WA 98105, USA.,Greenland Institute of Natural Resources, 3900 Nuuk, Greenland
| | - Megan A Supple
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Erik W Born
- Greenland Institute of Natural Resources, 3900 Nuuk, Greenland
| | - Eric V Regehr
- Polar Science Center, Applied Physics Laboratory, University of Washington, Seattle, WA 98105, USA
| | - Øystein Wiig
- Natural History Museum, University of Oslo, Blindern, NO-0318 Oslo, Norway
| | - Fernando Ugarte
- Greenland Institute of Natural Resources, 3900 Nuuk, Greenland
| | - Jon Aars
- Norwegian Polar Institute, Fram Centre, NO-9296 Tromsø, Norway
| | - Rune Dietz
- Department of Ecoscience and Arctic Research Centre, Aarhus University, DK-4000 Roskilde, Denmark
| | - Christian Sonne
- Department of Ecoscience and Arctic Research Centre, Aarhus University, DK-4000 Roskilde, Denmark
| | - Peter Hegelund
- Greenland Institute of Natural Resources, 3900 Nuuk, Greenland
| | - Carl Isaksen
- Greenland Institute of Natural Resources, 3900 Nuuk, Greenland
| | | | - Benjamin Cohen
- Polar Science Center, Applied Physics Laboratory, University of Washington, Seattle, WA 98105, USA
| | - Harry L Stern
- Polar Science Center, Applied Physics Laboratory, University of Washington, Seattle, WA 98105, USA
| | - Twila Moon
- National Snow and Ice Data Center, Cooperative Institute for Research In Environmental Sciences (CIRES), University of Colorado, Boulder, CO 80309, USA
| | - Christopher Vollmers
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Russ Corbett-Detig
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - David Paetkau
- Wildlife Genetics International, Nelson, BC V1L 5P9, Canada
| | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.,Howard Hughes Medical Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| |
Collapse
|
9
|
Tseng E, Underwood JG, Evans Hutzenbiler BD, Trojahn S, Kingham B, Shevchenko O, Bernberg E, Vierra M, Robbins CT, Jansen HT, Kelley JL. Long-read isoform sequencing reveals tissue-specific isoform expression between active and hibernating brown bears (Ursus arctos). G3 (BETHESDA, MD.) 2022; 12:6472356. [PMID: 35100340 PMCID: PMC9210309 DOI: 10.1093/g3journal/jkab422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Accepted: 11/17/2021] [Indexed: 11/13/2022]
Abstract
Understanding hibernation in brown bears (Ursus arctos) can provide insight into some human diseases. During hibernation, brown bears experience periods of insulin resistance, physical inactivity, extreme bradycardia, obesity, and the absence of urine production. These states closely mimic aspects of human diseases such as type 2 diabetes, muscle atrophy, as well as renal and heart failure. The reversibility of these states from hibernation to active season enables the identification of mediators with possible therapeutic value for humans. Recent studies have identified genes and pathways that are differentially expressed between active and hibernation seasons in bears. However, little is known about the role of differential expression of gene isoforms on hibernation physiology. To identify both distinct and novel mRNA isoforms, full-length RNA-sequencing (Iso-Seq) was performed on adipose, skeletal muscle, and liver from three individual bears sampled during both active and hibernation seasons. The existing reference genome annotation was improved by combining it with the Iso-Seq data. Short-read RNA-sequencing data from six individuals were mapped to the new reference annotation to quantify differential isoform usage (DIU) between tissues and seasons. We identified differentially expressed isoforms in all three tissues, to varying degrees. Adipose had a high level of DIU with isoform switching, regardless of whether the genes were differentially expressed. Our analyses revealed that DIU, even in the absence of differential gene expression, is an important mechanism for modulating genes during hibernation. These findings demonstrate the value of isoform expression studies and will serve as the basis for deeper exploration into hibernation biology.
Collapse
Affiliation(s)
| | | | - Brandon D Evans Hutzenbiler
- Department of Integrative Physiology and Neuroscience, Washington State University, Pullman, WA 99164, USA.,School of the Environment, Washington State University, Pullman, WA 99164, USA
| | - Shawn Trojahn
- School of Biological Sciences, Washington State University, Pullman, WA 99164, USA
| | - Brewster Kingham
- Sequencing & Genotyping Center, Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
| | - Olga Shevchenko
- Sequencing & Genotyping Center, Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
| | - Erin Bernberg
- Sequencing & Genotyping Center, Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
| | | | - Charles T Robbins
- School of the Environment, Washington State University, Pullman, WA 99164, USA.,School of Biological Sciences, Washington State University, Pullman, WA 99164, USA
| | - Heiko T Jansen
- Department of Integrative Physiology and Neuroscience, Washington State University, Pullman, WA 99164, USA
| | - Joanna L Kelley
- School of Biological Sciences, Washington State University, Pullman, WA 99164, USA
| |
Collapse
|
10
|
Volden R, Vollmers C. Single-cell isoform analysis in human immune cells. Genome Biol 2022; 23:47. [PMID: 35130954 PMCID: PMC8819920 DOI: 10.1186/s13059-022-02615-z] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 01/20/2022] [Indexed: 02/06/2023] Open
Abstract
High-throughput single-cell analysis today is facilitated by protocols like the 10X Genomics platform or Drop-Seq which generate cDNA pools in which the origin of a transcript is encoded at its 5′ or 3′ end. Here, we used R2C2 to sequence and demultiplex 12 million full-length cDNA molecules generated by the 10X Genomics platform from ~3000 peripheral blood mononuclear cells. We use these reads, independent from Illumina data, to identify B cell, T cell, and monocyte clusters and generate isoform-level transcriptomes for cells and cell types. Finally, we extract paired adaptive immune receptor sequences unique to each T and B cell.
Collapse
Affiliation(s)
- Roger Volden
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, 95064, USA
| | - Christopher Vollmers
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, 95064, USA.
| |
Collapse
|
11
|
Halstead MM, Islas-Trejo A, Goszczynski DE, Medrano JF, Zhou H, Ross PJ. Large-Scale Multiplexing Permits Full-Length Transcriptome Annotation of 32 Bovine Tissues From a Single Nanopore Flow Cell. Front Genet 2021; 12:664260. [PMID: 34093657 PMCID: PMC8173071 DOI: 10.3389/fgene.2021.664260] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 04/06/2021] [Indexed: 12/18/2022] Open
Abstract
A comprehensive annotation of transcript isoforms in domesticated species is lacking. Especially considering that transcriptome complexity and splicing patterns are not well-conserved between species, this presents a substantial obstacle to genomic selection programs that seek to improve production, disease resistance, and reproduction. Recent advances in long-read sequencing technology have made it possible to directly extrapolate the structure of full-length transcripts without the need for transcript reconstruction. In this study, we demonstrate the power of long-read sequencing for transcriptome annotation by coupling Oxford Nanopore Technology (ONT) with large-scale multiplexing of 93 samples, comprising 32 tissues collected from adult male and female Hereford cattle. More than 30 million uniquely mapping full-length reads were obtained from a single ONT flow cell, and used to identify and characterize the expression dynamics of 99,044 transcript isoforms at 31,824 loci. Of these predicted transcripts, 21% exactly matched a reference transcript, and 61% were novel isoforms of reference genes, substantially increasing the ratio of transcript variants per gene, and suggesting that the complexity of the bovine transcriptome is comparable to that in humans. Over 7,000 transcript isoforms were extremely tissue-specific, and 61% of these were attributed to testis, which exhibited the most complex transcriptome of all interrogated tissues. Despite profiling over 30 tissues, transcription was only detected at about 60% of reference loci. Consequently, additional studies will be necessary to continue characterizing the bovine transcriptome in additional cell types, developmental stages, and physiological conditions. However, by here demonstrating the power of ONT sequencing coupled with large-scale multiplexing, the task of exhaustively annotating the bovine transcriptome - or any mammalian transcriptome - appears significantly more feasible.
Collapse
Affiliation(s)
| | | | | | | | | | - Pablo J. Ross
- Department of Animal Science, University of California, Davis, Davis, CA, United States
| |
Collapse
|
12
|
Vollmers AC, Mekonen HE, Campos S, Carpenter S, Vollmers C. Generation of an isoform-level transcriptome atlas of macrophage activation. J Biol Chem 2021; 296:100784. [PMID: 34000296 PMCID: PMC8191339 DOI: 10.1016/j.jbc.2021.100784] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 05/05/2021] [Accepted: 05/10/2021] [Indexed: 01/26/2023] Open
Abstract
RNA-seq is routinely used to measure gene expression changes in response to cell perturbation. Genes upregulated or downregulated following some perturbation are designated as genes of interest, and their most expressed isoform(s) would then be selected for follow-up experimentation. However, because of its need to fragment RNA molecules, RNA-seq is limited in its ability to capture gene isoforms and their expression patterns. This lack of isoform-specific data means that isoforms would be selected based on annotation databases that are incomplete, not tissue specific, or do not provide key information on expression levels. As a result, minority or nonexistent isoforms might be selected for follow-up, leading to loss in valuable resources and time. There is therefore a great need to comprehensively identify gene isoforms along with their corresponding levels of expression. Using the long-read nanopore-based R2C2 method, which does not fragment RNA molecules, we generated an Isoform-level transcriptome Atlas of Macrophage Activation that identifies full-length isoforms in primary human monocyte-derived macrophages. Macrophages are critical innate immune cells important for recognizing pathogens through binding of pathogen-associated molecular patterns to toll-like receptors, culminating in the initiation of host defense pathways. We characterized isoforms for most moderately-to-highly expressed genes in resting and toll-like receptor–activated monocyte-derived macrophages, identified isoforms differentially expressed between conditions, and validated these isoforms by RT-qPCR. We compiled these data into a user-friendly data portal within the UCSC Genome Browser (https://genome.ucsc.edu/s/vollmers/IAMA). Our atlas represents a valuable resource for innate immune research, providing unprecedented isoform information for primary human macrophages.
Collapse
Affiliation(s)
- Apple Cortez Vollmers
- Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, California, USA
| | - Honey E Mekonen
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, USA
| | - Sophia Campos
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, USA
| | - Susan Carpenter
- Department of Molecular, Cellular, and Developmental Biology, University of California Santa Cruz, Santa Cruz, California, USA.
| | - Christopher Vollmers
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, USA.
| |
Collapse
|
13
|
Banerjee SM, Stoll JA, Allen CD, Lynch JM, Harris HS, Kenyon L, Connon RE, Sterling EJ, Naro-Maciel E, McFadden K, Lamont MM, Benge J, Fernandez NB, Seminoff JA, Benson SR, Lewison RL, Eguchi T, Summers TM, Hapdei JR, Rice MR, Martin S, Jones TT, Dutton PH, Balazs GH, Komoroske LM. Species and population specific gene expression in blood transcriptomes of marine turtles. BMC Genomics 2021; 22:346. [PMID: 33985425 PMCID: PMC8117300 DOI: 10.1186/s12864-021-07656-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 04/23/2021] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Transcriptomic data has demonstrated utility to advance the study of physiological diversity and organisms' responses to environmental stressors. However, a lack of genomic resources and challenges associated with collecting high-quality RNA can limit its application for many wild populations. Minimally invasive blood sampling combined with de novo transcriptomic approaches has great potential to alleviate these barriers. Here, we advance these goals for marine turtles by generating high quality de novo blood transcriptome assemblies to characterize functional diversity and compare global transcriptional profiles between tissues, species, and foraging aggregations. RESULTS We generated high quality blood transcriptome assemblies for hawksbill (Eretmochelys imbricata), loggerhead (Caretta caretta), green (Chelonia mydas), and leatherback (Dermochelys coriacea) turtles. The functional diversity in assembled blood transcriptomes was comparable to those from more traditionally sampled tissues. A total of 31.3% of orthogroups identified were present in all four species, representing a core set of conserved genes expressed in blood and shared across marine turtle species. We observed strong species-specific expression of these genes, as well as distinct transcriptomic profiles between green turtle foraging aggregations that inhabit areas of greater or lesser anthropogenic disturbance. CONCLUSIONS Obtaining global gene expression data through non-lethal, minimally invasive sampling can greatly expand the applications of RNA-sequencing in protected long-lived species such as marine turtles. The distinct differences in gene expression signatures between species and foraging aggregations provide insight into the functional genomics underlying the diversity in this ancient vertebrate lineage. The transcriptomic resources generated here can be used in further studies examining the evolutionary ecology and anthropogenic impacts on marine turtles.
Collapse
Affiliation(s)
- Shreya M Banerjee
- Department of Environmental Conservation, University of Massachusetts, Amherst, MA, USA
| | - Jamie Adkins Stoll
- Department of Environmental Conservation, University of Massachusetts, Amherst, MA, USA
| | - Camryn D Allen
- Marine Turtle Biology and Assessment Program, Protected Species Division, Pacific Islands Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Honolulu, HI, USA.,Marine Mammal and Turtle Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, La Jolla, CA, USA
| | - Jennifer M Lynch
- Chemical Sciences Division, National Institute of Standards and Technology, Hawai'i Pacific University, Waimanalo, HI, USA
| | - Heather S Harris
- Marine Mammal and Turtle Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, La Jolla, CA, USA
| | - Lauren Kenyon
- Department of Environmental Conservation, University of Massachusetts, Amherst, MA, USA
| | - Richard E Connon
- Department of Anatomy, Physiology and Cell Biology, University of California, Davis, Davis, CA, USA
| | - Eleanor J Sterling
- Center for Biodiversity and Conservation, American Museum of Natural History, New York, NY, USA
| | | | - Kathryn McFadden
- School of Agricultural, Forest, and Environmental Sciences, Clemson University, Clemson, SC, USA
| | - Margaret M Lamont
- United States Geological Survey, Wetland and Aquatic Research Center, Gainesville, FL, USA
| | - James Benge
- Section of Molecular Biology, Division of Biological Sciences, University of California, San Diego, La Jolla, CA, USA
| | - Nadia B Fernandez
- Department of Environmental Conservation, University of Massachusetts, Amherst, MA, USA
| | - Jeffrey A Seminoff
- Marine Mammal and Turtle Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, La Jolla, CA, USA
| | - Scott R Benson
- Marine Mammal and Turtle Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Moss Landing, CA, 95039, USA.,Moss Landing Marine Laboratories, San Jose State University, Moss Landing, CA, 95039, USA
| | - Rebecca L Lewison
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Tomoharu Eguchi
- Marine Mammal and Turtle Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, La Jolla, CA, USA
| | | | - Jessy R Hapdei
- Jessy's Tag Services, Saipan, Commonwealth of the Northern Mariana Islands, USA
| | - Marc R Rice
- Hawai'i Preparatory Academy, Kamuela, HI, USA
| | - Summer Martin
- Marine Turtle Biology and Assessment Program, Protected Species Division, Pacific Islands Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Honolulu, HI, USA
| | - T Todd Jones
- Marine Turtle Biology and Assessment Program, Protected Species Division, Pacific Islands Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Honolulu, HI, USA
| | - Peter H Dutton
- Marine Mammal and Turtle Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, La Jolla, CA, USA
| | | | - Lisa M Komoroske
- Department of Environmental Conservation, University of Massachusetts, Amherst, MA, USA. .,Marine Mammal and Turtle Division, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, La Jolla, CA, USA.
| |
Collapse
|
14
|
Sahlin K, Medvedev P. Error correction enables use of Oxford Nanopore technology for reference-free transcriptome analysis. Nat Commun 2021; 12:2. [PMID: 33397972 PMCID: PMC7782715 DOI: 10.1038/s41467-020-20340-8] [Citation(s) in RCA: 67] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 11/25/2020] [Indexed: 01/24/2023] Open
Abstract
Oxford Nanopore (ONT) is a leading long-read technology which has been revolutionizing transcriptome analysis through its capacity to sequence the majority of transcripts from end-to-end. This has greatly increased our ability to study the diversity of transcription mechanisms such as transcription initiation, termination, and alternative splicing. However, ONT still suffers from high error rates which have thus far limited its scope to reference-based analyses. When a reference is not available or is not a viable option due to reference-bias, error correction is a crucial step towards the reconstruction of the sequenced transcripts and downstream sequence analysis of transcripts. In this paper, we present a novel computational method to error correct ONT cDNA sequencing data, called isONcorrect. IsONcorrect is able to jointly use all isoforms from a gene during error correction, thereby allowing it to correct reads at low sequencing depths. We are able to obtain a median accuracy of 98.9-99.6%, demonstrating the feasibility of applying cost-effective cDNA full transcript length sequencing for reference-free transcriptome analysis.
Collapse
Affiliation(s)
- Kristoffer Sahlin
- Department of Mathematics, Science for Life Laboratory, Stockholm University, 106 91, Stockholm, Sweden
| | - Paul Medvedev
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA, USA.
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA.
- Center for Computational Biology and Bioinformatics, The Pennsylvania State University, University Park, PA, USA.
| |
Collapse
|
15
|
Prezza G, Heckel T, Dietrich S, Homberger C, Westermann AJ, Vogel J. Improved bacterial RNA-seq by Cas9-based depletion of ribosomal RNA reads. RNA (NEW YORK, N.Y.) 2020; 26:1069-1078. [PMID: 32345633 PMCID: PMC7373992 DOI: 10.1261/rna.075945.120] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Accepted: 05/01/2020] [Indexed: 05/08/2023]
Abstract
A major challenge for RNA-seq analysis of gene expression is to achieve sufficient coverage of informative nonribosomal transcripts. In eukaryotic samples, this is typically achieved by selective oligo(dT)-priming of messenger RNAs to exclude ribosomal RNA (rRNA) during cDNA synthesis. However, this strategy is not compatible with prokaryotes in which functional transcripts are generally not polyadenylated. To overcome this, we adopted DASH (depletion of abundant sequences by hybridization), initially developed for eukaryotic cells, to improve both the sensitivity and depth of bacterial RNA-seq. DASH uses the Cas9 nuclease to remove unwanted cDNA sequences prior to library amplification. We report the design, evaluation, and optimization of DASH experiments for standard bacterial short-read sequencing approaches, including software for automated guide RNA (gRNA) design for Cas9-mediated cleavage in bacterial rDNA sequences. Using these gRNA pools, we effectively removed rRNA reads (56%-86%) in RNA-seq libraries from two different model bacteria, the Gram-negative pathogen Salmonella enterica and the anaerobic gut commensal Bacteroides thetaiotaomicron DASH works robustly, even with subnanogram amounts of input RNA. Its efficiency, high sensitivity, ease of implementation, and low cost (∼$5 per sample) render DASH an attractive alternative to rRNA removal protocols, in particular for material-constrained studies where conventional ribodepletion techniques fail.
Collapse
Affiliation(s)
- Gianluca Prezza
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, 97080, Germany
| | - Tobias Heckel
- Core Unit Systems Medicine, University of Würzburg, Würzburg, 97080, Germany
| | - Sascha Dietrich
- Core Unit Systems Medicine, University of Würzburg, Würzburg, 97080, Germany
| | - Christina Homberger
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, 97080, Germany
| | - Alexander J Westermann
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, 97080, Germany
- Institute of Molecular Infection Biology (IMIB), University of Würzburg, Würzburg, 97080, Germany
| | - Jörg Vogel
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, 97080, Germany
- Institute of Molecular Infection Biology (IMIB), University of Würzburg, Würzburg, 97080, Germany
| |
Collapse
|
16
|
Complete characterization of the human immune cell transcriptome using accurate full-length cDNA sequencing. Genome Res 2020; 30:589-601. [PMID: 32312742 PMCID: PMC7197476 DOI: 10.1101/gr.257188.119] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2019] [Accepted: 04/03/2020] [Indexed: 01/06/2023]
Abstract
The human immune system relies on highly complex and diverse transcripts and the proteins they encode. These include transcripts encoding human leukocyte antigen (HLA) receptors as well as B cell and T cell receptors (BCR and TCR). Determining which alleles an individual possesses for each HLA gene (high-resolution HLA typing) is essential to establish donor–recipient compatibility in organ and bone marrow transplantations. In turn, the repertoires of millions of unique BCR and TCR transcripts in each individual carry a vast amount of health-relevant information. Both short-read RNA-seq-based HLA typing and BCR/TCR repertoire sequencing (AIRR-seq) currently rely on our incomplete knowledge of the genetic diversity at HLA and BCR/TCR loci. Here, we generated over 10,000,000 full-length cDNA sequences at a median accuracy of 97.9% using our nanopore sequencing-based Rolling Circle Amplification to Concatemeric Consensus (R2C2) protocol. We used this data set to (1) show that deep and accurate full-length cDNA sequencing can be used to provide isoform-level transcriptome analysis for more than 9000 loci, (2) generate accurate sequences of HLA alleles, and (3) extract detailed AIRR data for the analysis of the adaptive immune system. The HLA and AIRR analysis approaches we introduce here are untargeted and therefore do not require prior knowledge of the composition or genetic diversity of HLA and BCR/TCR loci.
Collapse
|