1
|
De La Cerda GY, Landis JB, Eifler E, Hernandez AI, Li F, Zhang J, Tribble CM, Karimi N, Chan P, Givnish T, Strickler SR, Specht CD. Balancing read length and sequencing depth: Optimizing Nanopore long-read sequencing for monocots with an emphasis on the Liliales. Appl Plant Sci 2023; 11:e11524. [PMID: 37342170 PMCID: PMC10278932 DOI: 10.1002/aps3.11524] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 01/20/2023] [Accepted: 01/30/2023] [Indexed: 06/22/2023]
Abstract
PREMISE We present approaches used to generate long-read Nanopore sequencing reads for the Liliales and demonstrate how modifications to standard protocols directly impact read length and total output. The goal is to help those interested in generating long-read sequencing data determine which steps may be necessary for optimizing output and results. METHODS Four species of Calochortus (Liliaceae) were sequenced. Modifications made to sodium dodecyl sulfate (SDS) extractions and cleanup protocols included grinding with a mortar and pestle, using cut or wide-bore tips, chloroform cleaning, bead cleaning, eliminating short fragments, and using highly purified DNA. RESULTS Steps taken to maximize read length can decrease overall output. Notably, the number of pores in a flow cell is correlated with the overall output, yet we did not see an association between the pore number and the read length or the number of reads produced. DISCUSSION Many factors contribute to the overall success of a Nanopore sequencing run. We showed the direct impact that several modifications to the DNA extraction and cleaning steps have on the total sequencing output, read size, and number of reads generated. We show a tradeoff between read length and the number of reads and, to a lesser extent, the total sequencing output, all of which are important factors for successful de novo genome assembly.
Collapse
Affiliation(s)
- Gisel Y. De La Cerda
- School of Integrative Plant Science, Section of Plant Biology and the L. H. Bailey HortoriumCornell UniversityIthacaNew York14853USA
| | - Jacob B. Landis
- School of Integrative Plant Science, Section of Plant Biology and the L. H. Bailey HortoriumCornell UniversityIthacaNew York14853USA
- BTI Computational Biology CenterBoyce Thompson InstituteIthacaNew York14853USA
| | - Evan Eifler
- Department of BotanyUniversity of Wisconsin–MadisonMadisonWisconsin53706USA
| | - Adriana I. Hernandez
- School of Integrative Plant Science, Section of Plant Biology and the L. H. Bailey HortoriumCornell UniversityIthacaNew York14853USA
| | - Fay‐Wei Li
- BTI Computational Biology CenterBoyce Thompson InstituteIthacaNew York14853USA
| | - Jing Zhang
- BTI Computational Biology CenterBoyce Thompson InstituteIthacaNew York14853USA
| | - Carrie M. Tribble
- School of Life SciencesUniversity of Hawaiʻi, MānoaHonoluluHawaiʻi96822USA
| | - Nisa Karimi
- Department of BotanyUniversity of Wisconsin–MadisonMadisonWisconsin53706USA
| | - Patricia Chan
- Department of BotanyUniversity of Wisconsin–MadisonMadisonWisconsin53706USA
| | - Thomas Givnish
- Department of BotanyUniversity of Wisconsin–MadisonMadisonWisconsin53706USA
| | - Susan R. Strickler
- BTI Computational Biology CenterBoyce Thompson InstituteIthacaNew York14853USA
- Present address:
Plant Science and ConservationChicago Botanic GardenGlencoeIllinois60022USA
- Present address:
Plant Biology and Conservation ProgramNorthwestern UniversityEvanstonIllinois60208USA
| | - Chelsea D. Specht
- School of Integrative Plant Science, Section of Plant Biology and the L. H. Bailey HortoriumCornell UniversityIthacaNew York14853USA
| |
Collapse
|
2
|
Goll JB, Bosinger SE, Jensen TL, Walum H, Grimes T, Tharp GK, Natrajan MS, Blazevic A, Head RD, Gelber CE, Steenbergen KJ, Patel NB, Sanz P, Rouphael NG, Anderson EJ, Mulligan MJ, Hoft DF. Corrigendum: The Vacc-SeqQC project: Benchmarking RNA-Seq for clinical vaccine studies. Front Immunol 2023; 14:1163550. [PMID: 36911714 PMCID: PMC9996330 DOI: 10.3389/fimmu.2023.1163550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 02/13/2023] [Indexed: 02/25/2023] Open
Abstract
[This corrects the article DOI: 10.3389/fimmu.2022.1093242.].
Collapse
Affiliation(s)
- Johannes B Goll
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Steven E Bosinger
- Division of Microbiology & Immunology, Emory National Primate Research Center, Emory University, Atlanta, GA, United States.,Department of Pathology & Laboratory Medicine, School of Medicine, Emory University, Atlanta, GA, United States.,Emory NPRC Genomics Core, Emory National Primate Research Center, Emory University, Atlanta, GA, United States.,Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States
| | - Travis L Jensen
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Hasse Walum
- Division of Microbiology & Immunology, Emory National Primate Research Center, Emory University, Atlanta, GA, United States
| | - Tyler Grimes
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Gregory K Tharp
- Emory NPRC Genomics Core, Emory National Primate Research Center, Emory University, Atlanta, GA, United States
| | - Muktha S Natrajan
- Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States.,Hope Clinic of the Emory Vaccine Center, Emory University, Atlanta, GA, United States
| | - Azra Blazevic
- Division of Infectious Diseases, Allergy, and Immunology, Department of Internal Medicine, Saint Louis University School of Medicine, St. Louis, MO, United States
| | - Richard D Head
- McDonnell Genome Institute, Washington University, St. Louis, MO, United States
| | - Casey E Gelber
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Kristen J Steenbergen
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Nirav B Patel
- Emory NPRC Genomics Core, Emory National Primate Research Center, Emory University, Atlanta, GA, United States
| | - Patrick Sanz
- Office of Biodefense, Research Resources and Translational Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, MD, United States
| | - Nadine G Rouphael
- Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States.,Hope Clinic of the Emory Vaccine Center, Emory University, Atlanta, GA, United States.,Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Emory University, Atlanta, GA, United States
| | - Evan J Anderson
- Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Emory University, Atlanta, GA, United States.,Center for Childhood Infections and Vaccines (CCIV) of Children's Healthcare of Atlanta and Department of Pediatrics, Emory University School of Medicine, Atlanta, GA, United States
| | - Mark J Mulligan
- Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States.,Hope Clinic of the Emory Vaccine Center, Emory University, Atlanta, GA, United States.,Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Emory University, Atlanta, GA, United States.,New York University Vaccine Center, New York, NY, United States
| | - Daniel F Hoft
- Division of Infectious Diseases, Allergy, and Immunology, Department of Internal Medicine, Saint Louis University School of Medicine, St. Louis, MO, United States.,Department of Molecular Microbiology & Immunology, Saint Louis University, St. Louis, MO, United States
| |
Collapse
|
3
|
Goll JB, Bosinger SE, Jensen TL, Walum H, Grimes T, Tharp GK, Natrajan MS, Blazevic A, Head RD, Gelber CE, Steenbergen KJ, Patel NB, Sanz P, Rouphael NG, Anderson EJ, Mulligan MJ, Hoft DF. The Vacc-SeqQC project: Benchmarking RNA-Seq for clinical vaccine studies. Front Immunol 2022; 13:1093242. [PMID: 36741404 PMCID: PMC9893923 DOI: 10.3389/fimmu.2022.1093242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 12/30/2022] [Indexed: 01/20/2023] Open
Abstract
Introduction Over the last decade, the field of systems vaccinology has emerged, in which high throughput transcriptomics and other omics assays are used to probe changes of the innate and adaptive immune system in response to vaccination. The goal of this study was to benchmark key technical and analytical parameters of RNA sequencing (RNA-seq) in the context of a multi-site, double-blind randomized vaccine clinical trial. Methods We collected longitudinal peripheral blood mononuclear cell (PBMC) samples from 10 subjects before and after vaccination with a live attenuated Francisella tularensis vaccine and performed RNA-Seq at two different sites using aliquots from the same sample to generate two replicate datasets (5 time points for 50 samples each). We evaluated the impact of (i) filtering lowly-expressed genes, (ii) using external RNA controls, (iii) fold change and false discovery rate (FDR) filtering, (iv) read length, and (v) sequencing depth on differential expressed genes (DEGs) concordance between replicate datasets. Using synthetic mRNA spike-ins, we developed a method for empirically establishing minimal read-count thresholds for maintaining fold change accuracy on a per-experiment basis. We defined a reference PBMC transcriptome by pooling sequence data and established the impact of sequencing depth and gene filtering on transcriptome representation. Lastly, we modeled statistical power to detect DEGs for a range of sample sizes, effect sizes, and sequencing depths. Results and Discussion Our results showed that (i) filtering lowly-expressed genes is recommended to improve fold-change accuracy and inter-site agreement, if possible guided by mRNA spike-ins (ii) read length did not have a major impact on DEG detection, (iii) applying fold-change cutoffs for DEG detection reduced inter-set agreement and should be used with caution, if at all, (iv) reduction in sequencing depth had a minimal impact on statistical power but reduced the identifiable fraction of the PBMC transcriptome, (v) after sample size, effect size (i.e. the magnitude of fold change) was the most important driver of statistical power to detect DEG. The results from this study provide RNA sequencing benchmarks and guidelines for planning future similar vaccine studies.
Collapse
Affiliation(s)
- Johannes B Goll
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Steven E Bosinger
- Division of Microbiology & Immunology, Emory National Primate Research Center, Emory University, Atlanta, GA, United States.,Department of Pathology & Laboratory Medicine, School of Medicine, Emory University, Atlanta, GA, United States.,Emory NPRC Genomics Core, Emory National Primate Research Center, Emory University, Atlanta, GA, United States.,Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States
| | - Travis L Jensen
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Hasse Walum
- Division of Microbiology & Immunology, Emory National Primate Research Center, Emory University, Atlanta, GA, United States
| | - Tyler Grimes
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Gregory K Tharp
- Emory NPRC Genomics Core, Emory National Primate Research Center, Emory University, Atlanta, GA, United States
| | - Muktha S Natrajan
- Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States.,Hope Clinic of the Emory Vaccine Center, Emory University, Atlanta, GA, United States
| | - Azra Blazevic
- Division of Infectious Diseases, Allergy, and Immunology, Department of Internal Medicine, Saint Louis University School of Medicine, St. Louis, MO, United States
| | - Richard D Head
- McDonnell Genome Institute, Washington University, St. Louis, MO, United States
| | - Casey E Gelber
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Kristen J Steenbergen
- Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
| | - Nirav B Patel
- Emory NPRC Genomics Core, Emory National Primate Research Center, Emory University, Atlanta, GA, United States
| | - Patrick Sanz
- Office of Biodefense, Research Resources and Translational Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, MD, United States
| | - Nadine G Rouphael
- Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States.,Hope Clinic of the Emory Vaccine Center, Emory University, Atlanta, GA, United States.,Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Emory University, Atlanta, GA, United States
| | - Evan J Anderson
- Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Emory University, Atlanta, GA, United States.,Center for Childhood Infections and Vaccines (CCIV) of Children's Healthcare of Atlanta and Department of Pediatrics, Emory University School of Medicine, Atlanta, GA, United States
| | - Mark J Mulligan
- Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States.,Hope Clinic of the Emory Vaccine Center, Emory University, Atlanta, GA, United States.,Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Emory University, Atlanta, GA, United States.,New York University Vaccine Center, New York, NY, United States
| | - Daniel F Hoft
- Division of Infectious Diseases, Allergy, and Immunology, Department of Internal Medicine, Saint Louis University School of Medicine, St. Louis, MO, United States.,Department of Molecular Microbiology & Immunology, Saint Louis University, St. Louis, MO, United States
| |
Collapse
|