1
|
Degnan DJ, Claborne DM, Richardson RE, Strauch CW, Glasscock EC, Veličković D, Burnum-Johnson KE, Webb-Robertson BJM, Stratton KG, Bramer LM. MODE: A Web Application for Interactive Visualization and Exploration of Omics Data. J Proteome Res 2025; 24:911-918. [PMID: 39835806 DOI: 10.1021/acs.jproteome.4c00650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2025]
Abstract
Studies generating transcriptomics, proteomics, lipidomics, and metabolomics (colloquially referred to as "omics") data allow researchers to find biomarkers or molecular targets or understand complex biological structures and functions by identifying changes in biomolecule abundance and expression between experimental conditions. Omics data are multidimensional, and oftentimes summarization techniques such as principal component analysis (PCA) are used to identify high-level patterns in data. Though useful, these summaries do not allow exploration of detailed patterns in omics data that may have biological relevance. The use of interactive HTML displays with plots allows researchers to interact with omics data at a detailed level, but building these displays requires significant coding expertise. To overcome this barrier, the software MODE was built to empower users to build their own interactive HTML displays to support scientific discovery. These displays are easily shareable, do not depend on a specific operating system, and allow users to sort and filter plots by categorical or numerical variables called metas. MODE allows users to build and share these displays with several options for plot design and meta selection. The MODE web application and its capabilities are presented and then demonstrated on lipidomics data from a leaf wounding study.
Collapse
|
2
|
Laub V, Nan E, Elias L, Donaldson IJ, Bentsen M, Rusling LA, Schupp J, Lun JH, Plate KH, Looso M, Langer JD, Günther S, Bobola N, Schulte D. Integrated multi-omics analysis of PBX1 in mouse adult neural stem- and progenitor cells identifies a transcriptional module that functionally links PBX1 to TCF3/4. Nucleic Acids Res 2024; 52:12262-12280. [PMID: 39377397 PMCID: PMC11551771 DOI: 10.1093/nar/gkae864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 08/22/2024] [Accepted: 09/23/2024] [Indexed: 10/09/2024] Open
Abstract
Developmental transcription factors act in networks, but how these networks achieve cell- and tissue specificity is still poorly understood. Here, we explored pre-B cell leukemia homeobox 1 (PBX1) in adult neurogenesis combining genomic, transcriptomic, and proteomic approaches. ChIP-seq analysis uncovered PBX1 binding to numerous genomic sites. Integration of PBX1 ChIP-seq with ATAC-seq data predicted interaction partners, which were subsequently validated by mass spectrometry. Whole transcriptome spatial RNA analysis revealed shared expression dynamics of Pbx1 and interacting factors. Among these were class I bHLH proteins TCF3 and TCF4. RNA-seq following Pbx1, Tcf3 or Tcf4 knockdown identified proliferation- and differentiation associated genes as shared targets, while sphere formation assays following knockdown argued for functional cooperativity of PBX1 and TCF3 in progenitor cell proliferation. Notably, while physiological PBX1-TCF interaction has not yet been described, chromosomal translocation resulting in genomic TCF3::PBX1 fusion characterizes a subtype of acute lymphoblastic leukemia. Introducing Pbx1 into Nalm6 cells, a pre-B cell line expressing TCF3 but lacking PBX1, upregulated the leukemogenic genes BLK and NOTCH3, arguing that functional PBX1-TCF cooperativity likely extends to hematopoiesis. Our study hence uncovers a transcriptional module orchestrating the balance between progenitor cell proliferation and differentiation in adult neurogenesis with potential implications for leukemia etiology.
Collapse
Affiliation(s)
- Vera Laub
- Goethe University, University Hospital Frankfurt, Neurological Institute (Edinger Institute), 60528 Frankfurt am Main, Germany
| | - Elisabeth Nan
- Goethe University, University Hospital Frankfurt, Neurological Institute (Edinger Institute), 60528 Frankfurt am Main, Germany
| | - Lena Elias
- Goethe University, University Hospital Frankfurt, Neurological Institute (Edinger Institute), 60528 Frankfurt am Main, Germany
| | - Ian J Donaldson
- University of Manchester, Faculty of Biology, Medicine and Health, Bioinformatics Core Facility, Manchester, M13 9PT, UK
| | - Mette Bentsen
- Max Planck Institute for Heart and Lung Research, Bioinformatics Core Unit (BCU), 61231 Bad Nauheim, Germany
| | - Leona A Rusling
- Max Planck Institute for Biophysics, Proteomics, and Max Planck Institute for Brain Research, 60438 Frankfurt am Main, Germany
| | - Jonathan Schupp
- Goethe University, University Hospital Frankfurt, Neurological Institute (Edinger Institute), 60528 Frankfurt am Main, Germany
- Goethe University, Frankfurt Cancer Institute, 60528 Frankfurt am Main, Germany
| | - Jennifer H Lun
- Goethe University, University Hospital Frankfurt, Neurological Institute (Edinger Institute), 60528 Frankfurt am Main, Germany
- Goethe University, Frankfurt Cancer Institute, 60528 Frankfurt am Main, Germany
| | - Karl H Plate
- Goethe University, University Hospital Frankfurt, Neurological Institute (Edinger Institute), 60528 Frankfurt am Main, Germany
- Goethe University, Frankfurt Cancer Institute, 60528 Frankfurt am Main, Germany
| | - Mario Looso
- Max Planck Institute for Heart and Lung Research, Bioinformatics Core Unit (BCU), 61231 Bad Nauheim, Germany
| | - Julian D Langer
- Max Planck Institute for Biophysics, Proteomics, and Max Planck Institute for Brain Research, 60438 Frankfurt am Main, Germany
| | - Stefan Günther
- Max Planck Institute for Heart and Lung Research, Bioinformatics and Deep Sequencing Platform, 61231 Bad Nauheim, Germany
| | - Nicoletta Bobola
- University of Manchester, Faculty of Biology, Medicine and Health, Manchester, M13 9PT, UK
| | - Dorothea Schulte
- Goethe University, University Hospital Frankfurt, Neurological Institute (Edinger Institute), 60528 Frankfurt am Main, Germany
| |
Collapse
|
3
|
Chua EW, Ooi DJ, Nor Muhammad NA. A concise guide to essential R packages for analyses of DNA, RNA, and proteins. Mol Cells 2024; 47:100120. [PMID: 39374792 PMCID: PMC11541695 DOI: 10.1016/j.mocell.2024.100120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 09/30/2024] [Accepted: 09/30/2024] [Indexed: 10/09/2024] Open
Abstract
R is widely regarded as unrivaled by other high-level programming languages for its statistical functions. The popularity of R as a statistical language has led many to overlook its applications outside the statistical realm. In this brief review, we present a list of R packages for supporting projects that entail analyses of DNA, RNA, and proteins. These R packages span the gamut of important molecular techniques, from routine quantitative polymerase chain reaction (qPCR) and Western blotting to high-throughput sequencing and proteomics generating very large datasets. The text-mining power of R can also be harnessed to facilitate literature reviews and predict future research trends and avenues. We encourage researchers to make full use of R in their work, given the versatility of the language, as well as its straightforward syntax which eases the initial learning curve.
Collapse
Affiliation(s)
- Eng Wee Chua
- Centre for Drug and Herbal Development, Faculty of Pharmacy, Universiti Kebangsaan Malaysia, 50300 Kuala Lumpur, Malaysia.
| | - Der Jiun Ooi
- Department of Preclinical Sciences, Faculty of Dentistry, MAHSA University, 42610 Jenjarom, Selangor, Malaysia
| | - Nor Azlan Nor Muhammad
- Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia, 43600 Bangi, Selangor, Malaysia
| |
Collapse
|
4
|
Goumenaki P, Günther S, Kikhi K, Looso M, Marín-Juez R, Stainier DYR. The innate immune regulator MyD88 dampens fibrosis during zebrafish heart regeneration. NATURE CARDIOVASCULAR RESEARCH 2024; 3:1158-1176. [PMID: 39271818 PMCID: PMC11399109 DOI: 10.1038/s44161-024-00538-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Accepted: 08/06/2024] [Indexed: 09/15/2024]
Abstract
The innate immune response is triggered rapidly after injury and its spatiotemporal dynamics are critical for regeneration; however, many questions remain about its exact role. Here we show that MyD88, a key component of the innate immune response, controls not only the inflammatory but also the fibrotic response during zebrafish cardiac regeneration. We find in cryoinjured myd88-/- ventricles a significant reduction in neutrophil and macrophage numbers and the expansion of a collagen-rich endocardial population. Further analyses reveal compromised PI3K/AKT pathway activation in the myd88-/- endocardium and increased myofibroblasts and scarring. Notably, endothelial-specific overexpression of myd88 reverses these neutrophil, fibrotic and scarring phenotypes. Mechanistically, we identify the endocardial-derived chemokine gene cxcl18b as a target of the MyD88 signaling pathway, and using loss-of-function and gain-of-function tools, we show that it controls neutrophil recruitment. Altogether, these findings shed light on the pivotal role of MyD88 in modulating inflammation and fibrosis during tissue regeneration.
Collapse
Affiliation(s)
- Pinelopi Goumenaki
- Department of Developmental Genetics, Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
- DZHK German Centre for Cardiovascular Research, Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
| | - Stefan Günther
- DZHK German Centre for Cardiovascular Research, Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
- Bioinformatics and Deep Sequencing Platform, Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Khrievono Kikhi
- Flow Cytometry Service Group, Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Mario Looso
- DZHK German Centre for Cardiovascular Research, Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Rubén Marín-Juez
- Centre Hospitalier Universitaire Sainte-Justine Research Centre, Montreal, Quebec, Canada
- Department of Pathology and Cell Biology, University of Montreal, Montreal, Quebec, Canada
| | - Didier Y R Stainier
- Department of Developmental Genetics, Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany.
- DZHK German Centre for Cardiovascular Research, Partner Site Rhine-Main, Bad Nauheim, Germany.
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany.
| |
Collapse
|
5
|
da Silva AR, Gunawan F, Boezio GLM, Faure E, Théron A, Avierinos JF, Lim S, Jha SG, Ramadass R, Guenther S, Looso M, Zaffran S, Juan T, Stainier DYR. egr3 is a mechanosensitive transcription factor gene required for cardiac valve morphogenesis. SCIENCE ADVANCES 2024; 10:eadl0633. [PMID: 38748804 PMCID: PMC11095463 DOI: 10.1126/sciadv.adl0633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 04/11/2024] [Indexed: 05/19/2024]
Abstract
Biomechanical forces, and their molecular transducers, including key mechanosensitive transcription factor genes, such as KLF2, are required for cardiac valve morphogenesis. However, klf2 mutants fail to completely recapitulate the valveless phenotype observed under no-flow conditions. Here, we identify the transcription factor EGR3 as a conserved biomechanical force transducer critical for cardiac valve formation. We first show that egr3 null zebrafish display a complete and highly penetrant loss of valve leaflets, leading to severe blood regurgitation. Using tissue-specific loss- and gain-of-function tools, we find that during cardiac valve formation, Egr3 functions cell-autonomously in endothelial cells, and identify one of its effectors, the nuclear receptor Nr4a2b. We further find that mechanical forces up-regulate egr3/EGR3 expression in the developing zebrafish heart and in porcine valvular endothelial cells, as well as during human aortic valve remodeling. Altogether, these findings reveal that EGR3 is necessary to transduce the biomechanical cues required for zebrafish cardiac valve morphogenesis, and potentially for pathological aortic valve remodeling in humans.
Collapse
Affiliation(s)
- Agatha Ribeiro da Silva
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim, Germany
- German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
| | - Felix Gunawan
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim, Germany
- German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
| | - Giulia L. M. Boezio
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim, Germany
- German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
| | - Emilie Faure
- Aix Marseille Université, INSERM, MMG, U1251, 13005 Marseille, France
| | - Alexis Théron
- Aix Marseille Université, INSERM, MMG, U1251, 13005 Marseille, France
- Service de Chirurgie Cardiaque, AP-HM, Hôpital de la Timone, 13005 Marseille, France
| | - Jean-François Avierinos
- Aix Marseille Université, INSERM, MMG, U1251, 13005 Marseille, France
- Service de Cardiologie, AP-HM, Hôpital de la Timone, 13005 Marseille, France
| | - SoEun Lim
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim, Germany
| | - Shivam Govind Jha
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim, Germany
| | - Radhan Ramadass
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim, Germany
| | - Stefan Guenther
- German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
- Bioinformatics and Deep Sequencing Platform, Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Mario Looso
- German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
- Bioinformatics Core Unit (BCU), Max Planck Institute for Heart and Lung Research, Bad Nauheim, Germany
| | - Stéphane Zaffran
- Aix Marseille Université, INSERM, MMG, U1251, 13005 Marseille, France
| | - Thomas Juan
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim, Germany
- German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
| | - Didier Y. R. Stainier
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim, Germany
- German Centre for Cardiovascular Research (DZHK), Partner Site Rhine-Main, Bad Nauheim, Germany
- Cardio-Pulmonary Institute (CPI), Bad Nauheim, Germany
| |
Collapse
|
6
|
Ernst TR, Blischak JD, Nordlund P, Dalen J, Moore J, Bhamidipati A, Dwivedi P, LoGrasso J, Curado MR, Engelmann BW. OmicNavigator: open-source software for the exploration, visualization, and archival of omic studies. BMC Bioinformatics 2024; 25:162. [PMID: 38658834 PMCID: PMC11040775 DOI: 10.1186/s12859-024-05743-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 03/13/2024] [Indexed: 04/26/2024] Open
Abstract
BACKGROUND The results of high-throughput biology ('omic') experiments provide insight into biological mechanisms but can be challenging to explore, archive and share. The scale of these challenges continues to grow as omic research volume expands and multiple analytical technologies, bioinformatic pipelines, and visualization preferences have emerged. Multiple software applications exist that support omic study exploration and/or archival. However, an opportunity remains for open-source software that can archive and present the results of omic analyses with broad accommodation of study-specific analytical approaches and visualizations with useful exploration features. RESULTS We present OmicNavigator, an R package for the archival, visualization and interactive exploration of omic studies. OmicNavigator enables bioinformaticians to create web applications that interactively display their custom visualizations and analysis results linked with app-derived analytical tools, graphics, and tables. Studies created with OmicNavigator can be viewed within an interactive R session or hosted on a server for shared access. CONCLUSIONS OmicNavigator can be found at https://github.com/abbvie-external/OmicNavigator.
Collapse
Affiliation(s)
| | - John D Blischak
- AbbVie Inc., 1 North Waukegan Rd, North Chicago, IL, 60064, USA
| | - Paul Nordlund
- AbbVie Inc., 1 North Waukegan Rd, North Chicago, IL, 60064, USA
| | - Joe Dalen
- AbbVie Inc., 1 North Waukegan Rd, North Chicago, IL, 60064, USA
| | - Justin Moore
- AbbVie Inc., 1 North Waukegan Rd, North Chicago, IL, 60064, USA
- Current Address: Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston, TX, USA
| | | | - Pankaj Dwivedi
- AbbVie Inc., 1 North Waukegan Rd, North Chicago, IL, 60064, USA
- Proteovant Therapeutics, 2500 Renaissance Blvd, King of Prussia, PA, USA
| | - Joe LoGrasso
- AbbVie Inc., 1 North Waukegan Rd, North Chicago, IL, 60064, USA
| | | | | |
Collapse
|
7
|
Laub V, Devraj K, Elias L, Schulte D. Bioinformatics for wet-lab scientists: practical application in sequencing analysis. BMC Genomics 2023; 24:382. [PMID: 37420172 PMCID: PMC10326960 DOI: 10.1186/s12864-023-09454-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 06/15/2023] [Indexed: 07/09/2023] Open
Abstract
BACKGROUND Genomics data is available to the scientific community after publication of research projects and can be investigated for a multitude of research questions. However, in many cases deposited data is only assessed and used for the initial publication, resulting in valuable resources not being exploited to their full depth. MAIN: A likely reason for this is that many wetlab-based researchers are not formally trained to apply bioinformatic tools and may therefore assume that they lack the necessary experience to do so themselves. In this article, we present a series of freely available, predominantly web-based platforms and bioinformatic tools that can be combined in analysis pipelines to interrogate different types of next-generation sequencing data. Additionally to the presented exemplary route, we also list a number of alternative tools that can be combined in a mix-and-match fashion. We place special emphasis on tools that can be followed and used correctly without extensive prior knowledge in programming. Such analysis pipelines can be applied to existing data downloaded from the public domain or be compared to the results of own experiments. CONCLUSION Integrating transcription factor binding to chromatin (ChIP-seq) with transcriptional output (RNA-seq) and chromatin accessibility (ATAC-seq) can not only assist to form a deeper understanding of the molecular interactions underlying transcriptional regulation but will also help establishing new hypotheses and pre-testing them in silico.
Collapse
Affiliation(s)
- Vera Laub
- Neurological Institute (Edinger Institute), University Hospital Frankfurt, Goethe University, Frankfurt, Germany.
| | - Kavi Devraj
- Neurological Institute (Edinger Institute), University Hospital Frankfurt, Goethe University, Frankfurt, Germany
- Department of Biological Sciences, Birla Institute of Technology and Science Pilani, Hyderabad Campus, Hyderabad, Telangana, India
| | - Lena Elias
- Neurological Institute (Edinger Institute), University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Dorothea Schulte
- Neurological Institute (Edinger Institute), University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| |
Collapse
|
8
|
i2dash: Creation of Flexible, Interactive, and Web-based Dashboards for Visualization of Omics Data. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:568-577. [PMID: 34280547 PMCID: PMC9801041 DOI: 10.1016/j.gpb.2021.01.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Revised: 11/25/2020] [Accepted: 02/01/2021] [Indexed: 01/26/2023]
Abstract
Data visualization and interactive data exploration are important aspects of illustrating complex concepts and results from analyses of omics data. A suitable visualization has to be intuitive and accessible. Web-based dashboards have become popular tools for the arrangement, consolidation, and display of such visualizations. However, the combination of automated data processing pipelines handling omics data and dynamically generated, interactive dashboards is poorly solved. Here, we present i2dash, an R package intended to encapsulate functionality for the programmatic creation of customized dashboards. It supports interactive and responsive (linked) visualizations across a set of predefined graphical layouts. i2dash addresses the needs of data analysts/software developers for a tool that is compatible and attachable to any R-based analysis pipeline, thereby fostering the separation of data visualization on one hand and data analysis tasks on the other hand. In addition, the generic design of i2dash enables the development of modular extensions for specific needs. As a proof of principle, we provide an extension of i2dash optimized for single-cell RNA sequencing analysis, supporting the creation of dashboards for the visualization needs of such experiments. Equipped with these features, i2dash is suitable for extensive use in large-scale sequencing/bioinformatics facilities. Along this line, we provide i2dash as a containerized solution, enabling a straightforward large-scale deployment and sharing of dashboards using cloud services. i2dash is freely available via the R package archive CRAN (https://CRAN.R-project.org/package=i2dash).
Collapse
|
9
|
Mullan KA, Bramberger LM, Munday PR, Goncalves G, Revote J, Mifsud NA, Illing PT, Anderson A, Kwan P, Purcell AW, Li C. ggVolcanoR: A Shiny app for customizable visualization of differential expression datasets. Comput Struct Biotechnol J 2021; 19:5735-5740. [PMID: 34745458 PMCID: PMC8551465 DOI: 10.1016/j.csbj.2021.10.020] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 10/12/2021] [Accepted: 10/12/2021] [Indexed: 12/28/2022] Open
Abstract
Volcano and other analytical plots (e.g., correlation plots, upset plots, and heatmaps) serve as important data visualization methods for transcriptomic and proteomic analyses. Customizable generation of these plots is fundamentally important for a better understanding of dysregulated expression data and is therefore instrumental for the ensuing pathway analysis and biomarker identification. Here, we present an R-based Shiny application, termed ggVolcanoR, to allow for customizable generation and visualization of volcano plots, correlation plots, upset plots, and heatmaps for differential expression datasets, via a user-friendly interactive interface in both local executable version and web-based application without requiring programming expertise. Compared to currently existing packages, ggVolcanoR offers more practical options to optimize the generation of publication-quality volcano and other analytical plots for analyzing and comparing dysregulated genes/proteins across multiple differential expression datasets. In addition, ggVolcanoR provides an option to download the customized list of the filtered dysregulated expression data, which can be directly used as input for downstream pathway analysis. The source code of ggVolcanoR is available at https://github.com/KerryAM-R/ggVolcanoR and the webserver of ggVolcanoR 1.0 has been deployed and is freely available for academic purposes at https://ggvolcanor.erc.monash.edu/.
Collapse
Affiliation(s)
- Kerry A. Mullan
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Liesl M. Bramberger
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Prithvi Raj Munday
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Gabriel Goncalves
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Jerico Revote
- Monash eResearch Centre, Monash University, Melbourne, VIC 3800, Australia
| | - Nicole A. Mifsud
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Patricia T. Illing
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Alison Anderson
- Department of Neuroscience, Central Clinical School, Monash University, Melbourne, Victoria, Australia
- Departments of Medicine and Neurology, University of Melbourne, Royal Melbourne Hospital, Melbourne, Victoria, Australia
| | - Patrick Kwan
- Department of Neuroscience, Central Clinical School, Monash University, Melbourne, Victoria, Australia
- Departments of Medicine and Neurology, University of Melbourne, Royal Melbourne Hospital, Melbourne, Victoria, Australia
- Department of Neurology, Alfred Health, Melbourne, Victoria, Australia
| | - Anthony W. Purcell
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Chen Li
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| |
Collapse
|
10
|
Gao B, Zhu J, Negi S, Zhang X, Gyoneva S, Casey F, Wei R, Zhang B. Quickomics: exploring omics data in an intuitive, interactive and informative manner. Bioinformatics 2021; 37:3670-3672. [PMID: 33901288 DOI: 10.1093/bioinformatics/btab255] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Revised: 03/08/2021] [Accepted: 04/19/2021] [Indexed: 01/22/2023] Open
Abstract
SUMMARY We developed Quickomics, a feature-rich R Shiny-powered tool to enable biologists to fully explore complex omics statistical analysis results and perform advanced analysis in an easy-to-use interactive interface. It covers a broad range of secondary and tertiary analytical tasks after primary analysis of omics data is completed. Each functional module is equipped with customizable options and generates both interactive and publication-ready plots to uncover biological insights from data. The modular design makes the tool extensible with ease. AVAILABILITY Researchers can experience the functionalities with their own data or demo RNA-Seq and proteomics datasets by using the app hosted at http://quickomics.bxgenomics.com and following the tutorial, https://bit.ly/3rXIyhL. The source code under GPLv3 license is provided at https://github.com/interactivereport/Quickomics for local installation. SUPPLEMENTARY INFORMATION Supplementary materials are available at https://bit.ly/37HP17g.
Collapse
Affiliation(s)
- Benbo Gao
- Biogen Inc., Cambridge, Massachusetts, USA
| | - Jing Zhu
- Biogen Inc., Cambridge, Massachusetts, USA
| | | | | | | | | | - Ru Wei
- Biogen Inc., Cambridge, Massachusetts, USA
| | | |
Collapse
|
11
|
Li Y, Ma L, Wu D, Chen G. Advances in bulk and single-cell multi-omics approaches for systems biology and precision medicine. Brief Bioinform 2021; 22:6189773. [PMID: 33778867 DOI: 10.1093/bib/bbab024] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Revised: 12/31/2020] [Accepted: 01/20/2021] [Indexed: 12/13/2022] Open
Abstract
Multi-omics allows the systematic understanding of the information flow across different omics layers, while single omics can mainly reflect one aspect of the biological system. The advancement of bulk and single-cell sequencing technologies and related computational methods for multi-omics largely facilitated the development of system biology and precision medicine. Single-cell approaches have the advantage of dissecting cellular dynamics and heterogeneity, whereas traditional bulk technologies are limited to individual/population-level investigation. In this review, we first summarize the technologies for producing bulk and single-cell multi-omics data. Then, we survey the computational approaches for integrative analysis of bulk and single-cell multimodal data, respectively. Moreover, the databases and data storage for multi-omics, as well as the tools for visualizing multimodal data are summarized. We also outline the integration between bulk and single-cell data, and discuss the applications of multi-omics in precision medicine. Finally, we present the challenges and perspectives for multi-omics development.
Collapse
Affiliation(s)
| | - Lu Ma
- China Normal University, China
| | | | | |
Collapse
|
12
|
Willforss J, Siino V, Levander F. OmicLoupe: facilitating biological discovery by interactive exploration of multiple omic datasets and statistical comparisons. BMC Bioinformatics 2021; 22:107. [PMID: 33663372 PMCID: PMC7931979 DOI: 10.1186/s12859-021-04043-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Accepted: 02/22/2021] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Visual exploration of gene product behavior across multiple omic datasets can pinpoint technical limitations in data and reveal biological trends. Still, such exploration is challenging as there is a need for visualizations that are tailored for the purpose. RESULTS The OmicLoupe software was developed to facilitate visual data exploration and provides more than 15 interactive cross-dataset visualizations for omics data. It expands visualizations to multiple datasets for quality control, statistical comparisons and overlap and correlation analyses, while allowing for rapid inspection and downloading of selected features. The usage of OmicLoupe is demonstrated in three different studies, where it allowed for detection of both technical data limitations and biological trends across different omic layers. An example is an analysis of SARS-CoV-2 infection based on two previously published studies, where OmicLoupe facilitated the identification of gene products with consistent expression changes across datasets at both the transcript and protein levels. CONCLUSIONS OmicLoupe provides fast exploration of omics data with tailored visualizations for comparisons within and across data layers. The interactive visualizations are highly informative and are expected to be useful in various analyses of both newly generated and previously published data. OmicLoupe is available at quantitativeproteomics.org/omicloupe.
Collapse
Affiliation(s)
- Jakob Willforss
- Department of Immunotechnology, Lund University, Lund, Sweden
| | - Valentina Siino
- Department of Immunotechnology, Lund University, Lund, Sweden
| | - Fredrik Levander
- Department of Immunotechnology, Lund University, Lund, Sweden.
- Science for Life Laboratory, National Bioinformatics Infrastructure Sweden (NBIS), Lund University, Lund, Sweden.
| |
Collapse
|
13
|
Ethephon Activates the Transcription of Senescence-Associated Genes and Nitrogen Mobilization in Grapevine Leaves ( Vitis vinifera cv. Riesling). PLANTS 2021; 10:plants10020333. [PMID: 33572361 PMCID: PMC7916130 DOI: 10.3390/plants10020333] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 02/05/2021] [Accepted: 02/05/2021] [Indexed: 01/17/2023]
Abstract
Nitrogen (N) remobilization in the context of leaf senescence is of considerable importance for the viability of perennial plants. In late-ripening crops, such as Vitis vinifera, it may also affect berry ripening and fruit quality. Numerous studies on the model plant Arabidopsis thaliana have confirmed an involvement of the plant hormone ethylene in the regulation of senescence. However, ethylene research on grapevine was mostly focused on its involvement in berry ripening and stress tolerance until now. To investigate the effect of ethylene on the initiation, regulation, and progress of senescence-dependent N mobilization in grapevine leaves, we treated field-grown Vitis vinifera cv. Riesling vines with 25 mM ethephon at the end of berry ripening. Ethephon induced premature chlorophyll degradation and caused a shift of the leaf transcriptome equivalent to developmental leaf senescence. The upregulated metabolic processes covered the entire N remobilization process chain, altered the amino acid composition in the leaves, and resulted in an average 60% decrease in leaf N. Our findings increase the fundamental knowledge about the initiation and manipulation of leaf N remobilization in perennial woody plants by ethephon. This offers a methodological approach to the targeted induction of senescence and thus to an improvement in the N supply of grapes.
Collapse
|
14
|
Marini F, Linke J, Binder H. ideal: an R/Bioconductor package for interactive differential expression analysis. BMC Bioinformatics 2020; 21:565. [PMID: 33297942 PMCID: PMC7724894 DOI: 10.1186/s12859-020-03819-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 10/15/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND RNA sequencing (RNA-seq) is an ever increasingly popular tool for transcriptome profiling. A key point to make the best use of the available data is to provide software tools that are easy to use but still provide flexibility and transparency in the adopted methods. Despite the availability of many packages focused on detecting differential expression, a method to streamline this type of bioinformatics analysis in a comprehensive, accessible, and reproducible way is lacking. RESULTS We developed the ideal software package, which serves as a web application for interactive and reproducible RNA-seq analysis, while producing a wealth of visualizations to facilitate data interpretation. ideal is implemented in R using the Shiny framework, and is fully integrated with the existing core structures of the Bioconductor project. Users can perform the essential steps of the differential expression analysis workflow in an assisted way, and generate a broad spectrum of publication-ready outputs, including diagnostic and summary visualizations in each module, all the way down to functional analysis. ideal also offers the possibility to seamlessly generate a full HTML report for storing and sharing results together with code for reproducibility. CONCLUSION ideal is distributed as an R package in the Bioconductor project ( http://bioconductor.org/packages/ideal/ ), and provides a solution for performing interactive and reproducible analyses of summarized RNA-seq expression data, empowering researchers with many different profiles (life scientists, clinicians, but also experienced bioinformaticians) to make the ideal use of the data at hand.
Collapse
Affiliation(s)
- Federico Marini
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
| | - Jan Linke
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
| | - Harald Binder
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Stefan-Meier-Str. 26, 79104 Freiburg, Germany
| |
Collapse
|
15
|
Tekman M, Batut B, Ostrovsky A, Antoniewski C, Clements D, Ramirez F, Etherington GJ, Hotz HR, Scholtalbers J, Manning JR, Bellenger L, Doyle MA, Heydarian M, Huang N, Soranzo N, Moreno P, Mautner S, Papatheodorou I, Nekrutenko A, Taylor J, Blankenberg D, Backofen R, Grüning B. A single-cell RNA-sequencing training and analysis suite using the Galaxy framework. Gigascience 2020; 9:5931798. [PMID: 33079170 PMCID: PMC7574357 DOI: 10.1093/gigascience/giaa102] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 08/30/2020] [Indexed: 11/25/2022] Open
Abstract
Background The vast ecosystem of single-cell RNA-sequencing tools has until recently been plagued by an excess of diverging analysis strategies, inconsistent file formats, and compatibility issues between different software suites. The uptake of 10x Genomics datasets has begun to calm this diversity, and the bioinformatics community leans once more towards the large computing requirements and the statistically driven methods needed to process and understand these ever-growing datasets. Results Here we outline several Galaxy workflows and learning resources for single-cell RNA-sequencing, with the aim of providing a comprehensive analysis environment paired with a thorough user learning experience that bridges the knowledge gap between the computational methods and the underlying cell biology. The Galaxy reproducible bioinformatics framework provides tools, workflows, and trainings that not only enable users to perform 1-click 10x preprocessing but also empower them to demultiplex raw sequencing from custom tagged and full-length sequencing protocols. The downstream analysis supports a range of high-quality interoperable suites separated into common stages of analysis: inspection, filtering, normalization, confounder removal, and clustering. The teaching resources cover concepts from computer science to cell biology. Access to all resources is provided at the singlecell.usegalaxy.eu portal. Conclusions The reproducible and training-oriented Galaxy framework provides a sustainable high-performance computing environment for users to run flexible analyses on both 10x and alternative platforms. The tutorials from the Galaxy Training Network along with the frequent training workshops hosted by the Galaxy community provide a means for users to learn, publish, and teach single-cell RNA-sequencing analysis.
Collapse
Affiliation(s)
- Mehmet Tekman
- Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
| | - Bérénice Batut
- Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
| | - Alexander Ostrovsky
- Department of Biology, Johns Hopkins University, Mudd Hall 144, 3400 N. Charles Street, Baltimore, MD 21218, USA
| | - Christophe Antoniewski
- ARTbio, Sorbonne Université, CNRS FR 3631, Inserm US 037, Paris, France.,Institut de Biologie Paris Seine, 9 Quai Saint-Bernard Université Pierre et Marie Curie, Campus Jussieu, Bâtiments A-B-C, 75005 Paris, France
| | - Dave Clements
- Department of Biology, Johns Hopkins University, Mudd Hall 144, 3400 N. Charles Street, Baltimore, MD 21218, USA
| | - Fidel Ramirez
- Boehringer Ingelheim International GmbH, Binger Strasse 173, 55216 Ingelheim am Rhein, Biberach, Germany
| | | | - Hans-Rudolf Hotz
- Friedrich Miescher Institute for Biomedical Research, Maulbeerstrasse 66, 4058 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Maulbeerstrasse 66, 4058 Basel, Switzerland
| | - Jelle Scholtalbers
- European Molecular Biology Laboratory, Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Jonathan R Manning
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Lea Bellenger
- ARTbio, Sorbonne Université, CNRS FR 3631, Inserm US 037, Paris, France
| | - Maria A Doyle
- Research Computing Facility, Peter MacCallum Cancer Centre, Melbourne, 305 Grattan Street, Victoria 3000, Australia.,Sir Peter MacCallum Department of Oncology, The University of Melbourne, Victoria 3010, Australia
| | - Mohammad Heydarian
- Department of Biology, Johns Hopkins University, Mudd Hall 144, 3400 N. Charles Street, Baltimore, MD 21218, USA
| | - Ni Huang
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK.,Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - Nicola Soranzo
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Pablo Moreno
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Stefan Mautner
- Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
| | - Irene Papatheodorou
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Anton Nekrutenko
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - James Taylor
- Department of Biology, Johns Hopkins University, Mudd Hall 144, 3400 N. Charles Street, Baltimore, MD 21218, USA
| | - Daniel Blankenberg
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, 9500 Euclid Avenue, NB21 Cleveland, OH 44195, USA
| | - Rolf Backofen
- Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
| | - Björn Grüning
- Department of Bioinformatics, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany
| |
Collapse
|
16
|
Rutter L, Cook D. bigPint: A Bioconductor visualization package that makes big data pint-sized. PLoS Comput Biol 2020; 16:e1007912. [PMID: 32542031 PMCID: PMC7347224 DOI: 10.1371/journal.pcbi.1007912] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 07/09/2020] [Accepted: 04/27/2020] [Indexed: 11/20/2022] Open
Abstract
Interactive data visualization is imperative in the biological sciences. The development of independent layers of interactivity has been in pursuit in the visualization community. We developed bigPint, a data visualization package available on Bioconductor under the GPL-3 license (https://bioconductor.org/packages/release/bioc/html/bigPint.html). Our software introduces new visualization technology that enables independent layers of interactivity using Plotly in R, which aids in the exploration of large biological datasets. The bigPint package presents modernized versions of scatterplot matrices, volcano plots, and litre plots through the implementation of layered interactivity. These graphics have detected normalization issues, differential expression designation problems, and common analysis errors in public RNA-sequencing datasets. Researchers can apply bigPint graphics to their data by following recommended pipelines written in reproducible code in the user manual. In this paper, we explain how we achieved the independent layers of interactivity that are behind bigPint graphics. Pseudocode and source code are provided. Computational scientists can leverage our open-source code to expand upon our layered interactive technology and/or apply it in new ways toward other computational biology tasks. Biological disciplines face the challenge of increasingly large and complex data. One necessary approach toward eliciting information is data visualization. Newer visualization tools incorporate interactive capabilities that allow scientists to extract information more efficiently than static counterparts. In this paper, we introduce technology that allows multiple independent layers of interactive visualization written in open-source code. This technology can be repurposed across various biological problems. Here, we apply this technology to RNA-sequencing data, a popular next-generation sequencing approach that provides snapshots of RNA quantity in biological samples at given moments in time. It can be used to investigate cellular differences between health and disease, cellular changes in response to external stimuli, and additional biological inquiries. RNA-sequencing data is large, noisy, and biased. It requires sophisticated normalization. The most popular open-source RNA-sequencing data analysis software focuses on models, with little emphasis on integrating effective visualization tools. This is despite sound evidence that RNA-sequencing data is most effectively explored using graphical and numerical approaches in a complementary fashion. The software we introduce can make it easier for researchers to use models and visuals in an integrated fashion during RNA-sequencing data analysis.
Collapse
Affiliation(s)
- Lindsay Rutter
- Bioinformatics and Computational Biology Program, Iowa State University, Ames, Iowa, United States of America
- * E-mail:
| | - Dianne Cook
- Econometrics and Business Statistics, Monash University, Clayton, VIC, Australia
| |
Collapse
|
17
|
O'Shea K, Misra BB. Software tools, databases and resources in metabolomics: updates from 2018 to 2019. Metabolomics 2020; 16:36. [PMID: 32146531 DOI: 10.1007/s11306-020-01657-3] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 03/01/2020] [Indexed: 12/24/2022]
Abstract
Metabolomics has evolved as a discipline from a discovery and functional genomics tool, and is now a cornerstone in the era of big data-driven precision medicine. Sample preparation strategies and analytical technologies have seen enormous growth, and keeping pace with data analytics is challenging, to say the least. This review introduces and briefly presents around 100 metabolomics software resources, tools, databases, and other utilities that have surfaced or have improved in 2019. Table 1 provides the computational dependencies of the tools, categorizes the resources based on utility and ease of use, and provides hyperlinks to webpages where the tools can be downloaded or used. This review intends to keep the community of metabolomics researchers up to date with all the software tools, resources, and databases developed in 2019, in one place.
Collapse
Affiliation(s)
- Keiron O'Shea
- Institute of Biological, Environmental, and Rural Studies, Aberystwyth University, Ceredigion, Wales, SY23 3DA, UK
| | - Biswapriya B Misra
- Center for Precision Medicine, Department of Internal Medicine, Section of Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, Winston-Salem, NC, 27157, USA.
| |
Collapse
|
18
|
Akhmedov M, Martinelli A, Geiger R, Kwee I. Omics Playground: a comprehensive self-service platform for visualization, analytics and exploration of Big Omics Data. NAR Genom Bioinform 2020; 2:lqz019. [PMID: 33575569 PMCID: PMC7671354 DOI: 10.1093/nargab/lqz019] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Accepted: 11/19/2019] [Indexed: 12/16/2022] Open
Abstract
As the cost of sequencing drops rapidly, the amount of 'omics data increases exponentially, making data visualization and interpretation-'tertiary' analysis a bottleneck. Specialized analytical tools requiring technical expertise are available. However, consolidated and multi-faceted tools that are easy to use for life scientists is highly needed and currently lacking. Here we present Omics Playground, a user-friendly and interactive self-service bioinformatics platform for the in-depth analysis, visualization and interpretation of transcriptomics and proteomics data. It provides a large number of different tools in which special attention has been paid to single cell data. With Omics Playground, life scientists can easily perform complex data analysis and visualization without coding, and significantly reduce the time to discovery.
Collapse
Affiliation(s)
- Murodzhon Akhmedov
- Institute for Research in Biomedicine, Faculty of Biomedical Sciences, Università della Svizzera Italiana, 6500 Bellinzona, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
- BigOmics Analytics, 6500 Bellinzona, Switzerland
| | - Axel Martinelli
- Institute for Research in Biomedicine, Faculty of Biomedical Sciences, Università della Svizzera Italiana, 6500 Bellinzona, Switzerland
- BigOmics Analytics, 6500 Bellinzona, Switzerland
| | - Roger Geiger
- Institute for Research in Biomedicine, Faculty of Biomedical Sciences, Università della Svizzera Italiana, 6500 Bellinzona, Switzerland
| | - Ivo Kwee
- Institute for Research in Biomedicine, Faculty of Biomedical Sciences, Università della Svizzera Italiana, 6500 Bellinzona, Switzerland
- BigOmics Analytics, 6500 Bellinzona, Switzerland
| |
Collapse
|
19
|
Marini F, Binder H. pcaExplorer: an R/Bioconductor package for interacting with RNA-seq principal components. BMC Bioinformatics 2019; 20:331. [PMID: 31195976 PMCID: PMC6567655 DOI: 10.1186/s12859-019-2879-1] [Citation(s) in RCA: 165] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Accepted: 05/07/2019] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Principal component analysis (PCA) is frequently used in genomics applications for quality assessment and exploratory analysis in high-dimensional data, such as RNA sequencing (RNA-seq) gene expression assays. Despite the availability of many software packages developed for this purpose, an interactive and comprehensive interface for performing these operations is lacking. RESULTS We developed the pcaExplorer software package to enhance commonly performed analysis steps with an interactive and user-friendly application, which provides state saving as well as the automated creation of reproducible reports. pcaExplorer is implemented in R using the Shiny framework and exploits data structures from the open-source Bioconductor project. Users can easily generate a wide variety of publication-ready graphs, while assessing the expression data in the different modules available, including a general overview, dimension reduction on samples and genes, as well as functional interpretation of the principal components. CONCLUSION pcaExplorer is distributed as an R package in the Bioconductor project ( http://bioconductor.org/packages/pcaExplorer/ ), and is designed to assist a broad range of researchers in the critical step of interactive data exploration.
Collapse
Affiliation(s)
- Federico Marini
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, Mainz, 55131 Germany
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University Mainz, Langenbeckstr. 1, Mainz, 55131 Germany
| | - Harald Binder
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center - University of Freiburg, Stefan-Meier-Str. 26, Freiburg, 79104 Germany
| |
Collapse
|