Reference Citation Analysis

For: Poplawski A, Binder H. Feasibility of sample size calculation for RNA-seq studies. Brief Bioinform 2019;19:713-720. [PMID: 28100468 DOI: 10.1093/bib/bbw144] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 09/12/2016] [Indexed: 11/12/2022]

Cited by Other Article(s)

Mascardi MF, Mazzini FN, Suárez B, Ruda VM, Marciano S, Casciato P, Narvaez A, Haddad L, Anders M, Orozco F, Tamaroff AJ, Cook F, Gounarides J, Gutt S, Gadano A, García CM, Marro ML, Penas Steinhardt A, Trinks J. Integrated analysis of the transcriptome and its interaction with the metabolome in metabolic associated fatty liver disease: Gut microbiome signatures, correlation networks, and effect of PNPLA3 genotype. Proteomics 2023;23:e2200414. [PMID: 37525333 DOI: 10.1002/pmic.202200414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/08/2022] [Revised: 07/12/2023] [Accepted: 07/12/2023] [Indexed: 08/02/2023]

Affiliation(s)

María Florencia Mascardi Instituto de Medicina Traslacional e Ingeniería Biomédica (IMTIB) - CONICET - Instituto Universitario del Hospital Italiano (IUHI) - Hospital Italiano de Buenos Aires (HIBA), Buenos Aires, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
Flavia Noelia Mazzini Instituto de Medicina Traslacional e Ingeniería Biomédica (IMTIB) - CONICET - Instituto Universitario del Hospital Italiano (IUHI) - Hospital Italiano de Buenos Aires (HIBA), Buenos Aires, Argentina
Bárbara Suárez Instituto de Medicina Traslacional e Ingeniería Biomédica (IMTIB) - CONICET - Instituto Universitario del Hospital Italiano (IUHI) - Hospital Italiano de Buenos Aires (HIBA), Buenos Aires, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
Vera M Ruda Biotherapeutic and Analytical Technologies, Novartis Institutes for Biomedical Research (NIBR), Cambridge, Massachusetts, USA
Sebastián Marciano Liver Unit of Hospital Italiano de Buenos Aires, Buenos Aires, Argentina
Paola Casciato Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina; Liver Unit of Hospital Italiano de Buenos Aires, Buenos Aires, Argentina
Adrián Narvaez Liver Unit of Hospital Italiano de Buenos Aires, Buenos Aires, Argentina
Leila Haddad Liver Unit of Hospital Italiano de Buenos Aires, Buenos Aires, Argentina
Margarita Anders Liver Unit of Hospital Alemán, Buenos Aires, Argentina
Federico Orozco Liver Unit of Hospital Alemán, Buenos Aires, Argentina
Ana Jesica Tamaroff Nutrition Department of Hospital Italiano de Buenos Aires, Buenos Aires, Argentina
Frank Cook Analytical Sciences & Imaging Department, NIBR, Cambridge, Massachusetts, USA
John Gounarides Analytical Sciences & Imaging Department, NIBR, Cambridge, Massachusetts, USA
Susana Gutt Nutrition Department of Hospital Italiano de Buenos Aires, Buenos Aires, Argentina
Adrián Gadano Liver Unit of Hospital Italiano de Buenos Aires, Buenos Aires, Argentina
Celia Méndez García Chemical Biology & Therapeutics Department, NIBR, Cambridge, Massachusetts, USA
Martin L Marro Cardiovascular and Metabolic Disease Area, NIBR, Cambridge, Massachusetts, USA
Alberto Penas Steinhardt Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina; Departamento de Ciencias Básicas, Laboratorio de Genómica Computacional, Universidad Nacional de Luján, Lujan, Buenos Aires, Argentina
Julieta Trinks Instituto de Medicina Traslacional e Ingeniería Biomédica (IMTIB) - CONICET - Instituto Universitario del Hospital Italiano (IUHI) - Hospital Italiano de Buenos Aires (HIBA), Buenos Aires, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina

Albaz N, Agha S. Medical Education in the Context of the Eastern Mediterranean Region: Professional Development Activity of Health Professionals. Adv Med Educ Pract 2023;14:463-471. [PMID: 37168458 PMCID: PMC10166095 DOI: 10.2147/amep.s395015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/08/2022] [Accepted: 04/20/2023] [Indexed: 05/13/2023]

Abstract

Background

The development and maintenance of a high-quality medical education workforce depend on continuing medical education (CME). Traditionally, CME is delivered face-to-face, but due to COVID-19 and geographical distances, it is challenging to conduct professional development activities for several days. Using a webinar on advancement in medical education in the context of the eastern Mediterranean, we aimed to assess the participants' perspectives towards the professional development activity using a synchronous learning approach.

Methods

We used a cross-sectional survey-based study design. We invited faculty members from King Saud bin Abdulaziz University for Health Sciences (KSAU-HS) and United Arab Emirates University for Health Sciences (UAEU). We assessed their perspectives on the relevance of the content and the effectiveness of the activity on their knowledge and skills after the two-day webinar series. A self-designed questionnaire was administered immediately after the webinar. Open-ended responses were analyzed thematically.

Results

One hundred thirty-six registered healthcare professionals attended day 1, and 97 registered participants joined on the second day of the webinar. Most participants appreciated the diversity of the contents, the quality of the presentations, and the expertise of the facilitators. They reported that the content optimized their knowledge and understanding of new concepts such as assessment in simulation teaching, programmatic assessment, insight into the implementation of IPE and EPAs in CBME, and so on. The e-learning platform's user accessibility, online tutor interaction, and the addition of more scenario-based case studies were all recommended for improvement.

Conclusion

Overall, the presentations in the two-day webinar series were informative and highlighted the transformation in medical practices. Suggestions to improve the quality of the webinars and content were discussed.

Wang L, Ye H, Huang D, Lu C, Lin W, Chen X. Comprehensive circRNA Analyses in Human Vertebrae of GIOP and Its Molecular Mechanism. Evid Based Complement Alternat Med 2022;2022:4203161. [PMID: 35178103 DOI: 10.1155/2022/4203161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/10/2021] [Revised: 12/23/2021] [Accepted: 01/10/2022] [Indexed: 11/18/2022]

Goll JB, Bosinger SE, Jensen TL, Walum H, Grimes T, Tharp GK, Natrajan MS, Blazevic A, Head RD, Gelber CE, Steenbergen KJ, Patel NB, Sanz P, Rouphael NG, Anderson EJ, Mulligan MJ, Hoft DF. The Vacc-SeqQC project: Benchmarking RNA-Seq for clinical vaccine studies. Front Immunol 2022;13:1093242. [PMID: 36741404 PMCID: PMC9893923 DOI: 10.3389/fimmu.2022.1093242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/08/2022] [Accepted: 12/30/2022] [Indexed: 01/20/2023]

Abstract

Introduction

Over the last decade, the field of systems vaccinology has emerged, in which high throughput transcriptomics and other omics assays are used to probe changes of the innate and adaptive immune system in response to vaccination. The goal of this study was to benchmark key technical and analytical parameters of RNA sequencing (RNA-seq) in the context of a multi-site, double-blind randomized vaccine clinical trial.

Methods

We collected longitudinal peripheral blood mononuclear cell (PBMC) samples from 10 subjects before and after vaccination with a live attenuated Francisella tularensis vaccine and performed RNA-Seq at two different sites using aliquots from the same sample to generate two replicate datasets (5 time points for 50 samples each). We evaluated the impact of (i) filtering lowly-expressed genes, (ii) using external RNA controls, (iii) fold change and false discovery rate (FDR) filtering, (iv) read length, and (v) sequencing depth on differentially expressed gene (DEG) concordance between replicate datasets. Using synthetic mRNA spike-ins, we developed a method for empirically establishing minimal read-count thresholds for maintaining fold change accuracy on a per-experiment basis. We defined a reference PBMC transcriptome by pooling sequence data and established the impact of sequencing depth and gene filtering on transcriptome representation. Lastly, we modeled statistical power to detect DEGs for a range of sample sizes, effect sizes, and sequencing depths.

Results and Discussion

Our results showed that (i) filtering lowly-expressed genes is recommended to improve fold-change accuracy and inter-site agreement, if possible guided by mRNA spike-ins, (ii) read length did not have a major impact on DEG detection, (iii) applying fold-change cutoffs for DEG detection reduced inter-set agreement and should be used with caution, if at all, (iv) reduction in sequencing depth had a minimal impact on statistical power but reduced the identifiable fraction of the PBMC transcriptome, and (v) after sample size, effect size (i.e. the magnitude of fold change) was the most important driver of statistical power to detect DEGs. The results from this study provide RNA sequencing benchmarks and guidelines for planning future similar vaccine studies.

Affiliation(s)

Johannes B Goll Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
Steven E Bosinger Division of Microbiology & Immunology, Emory National Primate Research Center, Emory University, Atlanta, GA, United States; Department of Pathology & Laboratory Medicine, School of Medicine, Emory University, Atlanta, GA, United States; Emory NPRC Genomics Core, Emory National Primate Research Center, Emory University, Atlanta, GA, United States; Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States
Travis L Jensen Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
Hasse Walum Division of Microbiology & Immunology, Emory National Primate Research Center, Emory University, Atlanta, GA, United States
Tyler Grimes Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
Gregory K Tharp Emory NPRC Genomics Core, Emory National Primate Research Center, Emory University, Atlanta, GA, United States
Muktha S Natrajan Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States; Hope Clinic of the Emory Vaccine Center, Emory University, Atlanta, GA, United States
Azra Blazevic Division of Infectious Diseases, Allergy, and Immunology, Department of Internal Medicine, Saint Louis University School of Medicine, St. Louis, MO, United States
Richard D Head McDonnell Genome Institute, Washington University, St. Louis, MO, United States
Casey E Gelber Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
Kristen J Steenbergen Department of Biomedical Data Science and Bioinformatics, The Emmes Company, LLC, Rockville, MD, United States
Nirav B Patel Emory NPRC Genomics Core, Emory National Primate Research Center, Emory University, Atlanta, GA, United States
Patrick Sanz Office of Biodefense, Research Resources and Translational Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, MD, United States
Nadine G Rouphael Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States; Hope Clinic of the Emory Vaccine Center, Emory University, Atlanta, GA, United States; Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Emory University, Atlanta, GA, United States
Evan J Anderson Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Emory University, Atlanta, GA, United States; Center for Childhood Infections and Vaccines (CCIV) of Children's Healthcare of Atlanta and Department of Pediatrics, Emory University School of Medicine, Atlanta, GA, United States
Mark J Mulligan Emory Vaccine Center, Emory University School of Medicine, Atlanta, GA, United States; Hope Clinic of the Emory Vaccine Center, Emory University, Atlanta, GA, United States; Department of Medicine, Division of Infectious Diseases, Emory University School of Medicine, Emory University, Atlanta, GA, United States; New York University Vaccine Center, New York, NY, United States
Daniel F Hoft Division of Infectious Diseases, Allergy, and Immunology, Department of Internal Medicine, Saint Louis University School of Medicine, St. Louis, MO, United States; Department of Molecular Microbiology & Immunology, Saint Louis University, St. Louis, MO, United States

Schmid KT, Höllbacher B, Cruceanu C, Böttcher A, Lickert H, Binder EB, Theis FJ, Heinig M. scPower accelerates and optimizes the design of multi-sample single cell transcriptomic studies. Nat Commun 2021;12:6625. [PMID: 34785648 PMCID: PMC8595682 DOI: 10.1038/s41467-021-26779-7] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Received: 03/15/2021] [Accepted: 10/22/2021] [Indexed: 12/13/2022]

Affiliation(s)

Katharina T Schmid Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany; Department of Informatics, Technical University Munich, Munich, Germany
Barbara Höllbacher Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany; Department of Informatics, Technical University Munich, Munich, Germany
Cristiana Cruceanu Department of Translational Research, Max Planck Institute for Psychiatry, Munich, Germany
Anika Böttcher Institute of Diabetes and Regeneration Research, Helmholtz Diabetes Center, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany; German Center for Diabetes Research (DZD), Neuherberg, Germany; School of Medicine, Technical University of Munich, Munich, Germany
Heiko Lickert Institute of Diabetes and Regeneration Research, Helmholtz Diabetes Center, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany; German Center for Diabetes Research (DZD), Neuherberg, Germany; School of Medicine, Technical University of Munich, Munich, Germany
Elisabeth B Binder Department of Translational Research, Max Planck Institute for Psychiatry, Munich, Germany; Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Georgia, USA
Fabian J Theis Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany; Department of Mathematics, Technical University Munich, Munich, Germany
Matthias Heinig Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany; Department of Informatics, Technical University Munich, Munich, Germany

Tarazona S, Balzano-Nogueira L, Gómez-Cabrero D, Schmidt A, Imhof A, Hankemeier T, Tegnér J, Westerhuis JA, Conesa A. Harmonization of quality metrics and power calculation in multi-omic studies. Nat Commun 2020;11:3092. [PMID: 32555183 PMCID: PMC7303201 DOI: 10.1038/s41467-020-16937-8] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Received: 07/27/2018] [Accepted: 05/29/2020] [Indexed: 12/20/2022]

Affiliation(s)

Sonia Tarazona Department of Applied Statistics, Operations Research and Quality, Universitat Politècnica de València, Valencia, Spain
Leandro Balzano-Nogueira Microbiology and Cell Science Department, Institute for Food and Agricultural Research, University of Florida, Gainesville, FL, USA
David Gómez-Cabrero Unit of Computational Medicine, Department of Medicine, Solna, Center for Molecular Medicine, Karolinska Institutet, Stockholm, Sweden; Science for Life Laboratory, Solna, Sweden; Mucosal & Salivary Biology Division, King's College London Dental Institute, London, UK; Navarrabiomed, Complejo Hospitalario de Navarra (CHN), Universidad Pública de Navarra (UPNA), IdiSNA, Pamplona, Spain
Andreas Schmidt Protein Analysis Unit, Biomedical Center, Faculty of Medicine, LMU Munich, Planegg-Martinsried, Germany
Axel Imhof Protein Analysis Unit, Biomedical Center, Faculty of Medicine, LMU Munich, Planegg-Martinsried, Germany; Munich Center of Integrated Protein Science LMU Munich, Planegg-Martinsried, Germany
Thomas Hankemeier Division Analytical Biosciences, Leiden/Amsterdam Center for Drug Research, Leiden, The Netherlands
Jesper Tegnér Unit of Computational Medicine, Department of Medicine, Solna, Center for Molecular Medicine, Karolinska Institutet, Stockholm, Sweden; Science for Life Laboratory, Solna, Sweden; Biological and Environmental Sciences and Engineering Division, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Johan A Westerhuis Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, The Netherlands; Department of Statistics, Faculty of Natural Sciences, North-West University (Potchefstroom Campus), Potchefstroom, South Africa
Ana Conesa Microbiology and Cell Science Department, Institute for Food and Agricultural Research, University of Florida, Gainesville, FL, USA; Genetics Institute, University of Florida, Gainesville, FL, USA

Höllbacher B, Balázs K, Heinig M, Uhlenhaut NH. Seq-ing answers: Current data integration approaches to uncover mechanisms of transcriptional regulation. Comput Struct Biotechnol J 2020;18:1330-1341. [PMID: 32612756 PMCID: PMC7306512 DOI: 10.1016/j.csbj.2020.05.018] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 02/27/2020] [Revised: 05/21/2020] [Accepted: 05/23/2020] [Indexed: 02/06/2023]

Chowdhury HA, Bhattacharyya DK, Kalita JK. Differential Expression Analysis of RNA-seq Reads: Overview, Taxonomy, and Tools. IEEE/ACM Trans Comput Biol Bioinform 2020;17:566-586. [PMID: 30281477 DOI: 10.1109/tcbb.2018.2873010] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Indexed: 06/08/2023]

Alpern D, Gardeux V, Russeil J, Mangeat B, Meireles-Filho ACA, Breysse R, Hacker D, Deplancke B. BRB-seq: ultra-affordable high-throughput transcriptomics enabled by bulk RNA barcoding and sequencing. Genome Biol 2019;20:71. [PMID: 30999927 PMCID: PMC6474054 DOI: 10.1186/s13059-019-1671-x] [Citation(s) in RCA: 88] [Impact Index Per Article: 17.6] [Received: 06/25/2018] [Accepted: 03/07/2019] [Indexed: 01/10/2023]

Baccarella A, Williams CR, Parrish JZ, Kim CC. Empirical assessment of the impact of sample number and read depth on RNA-Seq analysis workflow performance. BMC Bioinformatics 2018;19:423. [PMID: 30428853 PMCID: PMC6234607 DOI: 10.1186/s12859-018-2445-2] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Received: 04/23/2018] [Accepted: 10/23/2018] [Indexed: 12/24/2022]

Abstract

BACKGROUND

RNA-Sequencing analysis methods are rapidly evolving, and the tool choice for each step of one common workflow, differential expression analysis, which includes read alignment, expression modeling, and differentially expressed gene identification, has a dramatic impact on performance characteristics. Although a number of workflows are emerging as high performers that are robust to diverse input types, the relative performance characteristics of these workflows when either read depth or sample number is limited (a common occurrence in real-world practice) remain unexplored.

RESULTS

Here, we evaluate the impact of varying read depth and sample number on the performance of differential gene expression identification workflows, as measured by precision, or the fraction of genes correctly identified as differentially expressed, and by recall, or the fraction of differentially expressed genes identified. We focus our analysis on 30 high-performing workflows, systematically varying the read depth and number of biological replicates of patient monocyte samples provided as input. We find that, in general for most workflows, read depth has little effect on workflow performance when held above two million reads per sample, with reduced workflow performance below this threshold. The greatest impact of decreased sample number is seen below seven samples per group, when more heterogeneity in workflow performance is observed. The choice of differential expression identification tool, in particular, has a large impact on the response to limited inputs.

CONCLUSIONS

Among the tested workflows, the recall/precision balance remains relatively stable at a range of read depths and sample numbers, although some workflows are more sensitive to input restriction. At ranges typically recommended for biological studies, performance is more greatly impacted by the number of biological replicates than by read depth. Caution should be used when selecting analysis workflows and interpreting results from low sample number experiments, as all workflows exhibit poorer performance at lower sample numbers near typically reported values, with variable impact on recall versus precision. These analyses highlight the performance characteristics of common differential gene expression workflows at varying read depths and sample numbers, and provide empirical guidance in experimental and analytical design.
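
The precision and recall measures defined in the abstract above can be illustrated with a minimal sketch. The gene identifiers and the helper name `precision_recall` are hypothetical and chosen only for illustration; they are not taken from the cited study.

```python
def precision_recall(called, truth):
    """Return (precision, recall) for a set of genes called differentially expressed.

    precision: fraction of called genes that are truly differentially expressed.
    recall: fraction of truly differentially expressed genes that were called.
    """
    called, truth = set(called), set(truth)
    true_positives = len(called & truth)
    precision = true_positives / len(called) if called else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


# Hypothetical example data, not from the cited study.
called_genes = {"GENE_A", "GENE_B", "GENE_C", "GENE_D"}
true_de_genes = {"GENE_A", "GENE_B", "GENE_E"}

precision, recall = precision_recall(called_genes, true_de_genes)
print(f"precision={precision:.3f}, recall={recall:.3f}")
```

In this toy case two of the four called genes are truly differentially expressed, and two of the three true genes are recovered.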

Vieth B, Ziegenhain C, Parekh S, Enard W, Hellmann I. powsimR: power analysis for bulk and single cell RNA-seq experiments. Bioinformatics 2018;33:3486-3488. [PMID: 29036287 DOI: 10.1093/bioinformatics/btx435] [Citation(s) in RCA: 84] [Impact Index Per Article: 14.0] [Received: 03/15/2017] [Accepted: 07/04/2017] [Indexed: 11/14/2022]

Kwan STC, King JH, Grenier JK, Yan J, Jiang X, Roberson MS, Caudill MA. Maternal Choline Supplementation during Normal Murine Pregnancy Alters the Placental Epigenome: Results of an Exploratory Study. Nutrients 2018;10:nu10040417. [PMID: 29597262 PMCID: PMC5946202 DOI: 10.3390/nu10040417] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Received: 01/29/2018] [Revised: 03/23/2018] [Accepted: 03/26/2018] [Indexed: 12/14/2022]

Lamarre S, Frasse P, Zouine M, Labourdette D, Sainderichin E, Hu G, Le Berre-Anton V, Bouzayen M, Maza E. Optimization of an RNA-Seq Differential Gene Expression Analysis Depending on Biological Replicate Number and Library Size. Front Plant Sci 2018;9:108. [PMID: 29491871 PMCID: PMC5817962 DOI: 10.3389/fpls.2018.00108] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Received: 07/31/2017] [Accepted: 01/19/2018] [Indexed: 05/23/2023]

Abstract

RNA-Seq is a widely used technology that allows an efficient genome-wide quantification of gene expressions for, for example, differential expression (DE) analysis. After a brief review of the main issues, methods and tools related to the DE analysis of RNA-Seq data, this article focuses on the impact of both the replicate number and library size in such analyses. While the main drawback of previous relevant studies is the lack of generality, we conducted both an analysis of a two-condition experiment (with eight biological replicates per condition) to compare the results with previous benchmark studies, and a meta-analysis of 17 experiments with up to 18 biological conditions, eight biological replicates and 100 million (M) reads per sample. As a global trend, we concluded that the replicate number has a larger impact than the library size on the power of the DE analysis, except for low-expressed genes, for which both parameters seem to have the same impact. Our study also provides new insights for practitioners aiming to enhance their experimental designs. For instance, by analyzing both the sensitivity and specificity of the DE analysis, we showed that the optimal threshold to control the false discovery rate (FDR) is approximately 2^-r, where r is the replicate number. Furthermore, we showed that the false positive rate (FPR) is rather well controlled by all three studied R packages: DESeq, DESeq2, and edgeR. We also analyzed the impact of both the replicate number and library size on gene ontology (GO) enrichment analysis. Interestingly, we concluded that increases in the replicate number and library size tend to enhance the sensitivity and specificity, respectively, of the GO analysis. Finally, we recommend to RNA-Seq practitioners the production of a pilot data set to strictly analyze the power of their experimental design, or the use of a public data set, which should be similar to the data set they will obtain. 
For individuals working on tomato research, on the basis of the meta-analysis, we recommend at least four biological replicates per condition and 20 M reads per sample to be almost sure of obtaining about 1000 DE genes if they exist.
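
As a rough illustration of the rule of thumb reported in the abstract above (an FDR threshold of approximately 2^-r for r biological replicates), the sketch below tabulates the threshold for a few replicate numbers. The function name `suggested_fdr_threshold` is hypothetical, and the code only evaluates the stated formula; it does not reproduce the authors' analysis.

```python
def suggested_fdr_threshold(replicates):
    """Evaluate the approximate rule 2^-r quoted in the abstract (r = replicate number)."""
    if replicates < 1:
        raise ValueError("replicate number must be at least 1")
    return 2.0 ** -replicates


# Tabulate the threshold for a few illustrative replicate numbers.
thresholds = {r: suggested_fdr_threshold(r) for r in (2, 3, 4, 8)}
for r, alpha in thresholds.items():
    print(f"r={r}: FDR threshold ~ {alpha}")
```

Under this rule, the four replicates per condition recommended for tomato studies would correspond to a threshold of about 0.0625.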
