1
|
Keel BN, Lindholm-Perry AK. Recent developments and future directions in meta-analysis of differential gene expression in livestock RNA-Seq. Front Genet 2022; 13:983043. [PMID: 36199583 PMCID: PMC9527320 DOI: 10.3389/fgene.2022.983043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 08/24/2022] [Indexed: 11/13/2022] Open
Abstract
Decreases in the costs of high-throughput sequencing technologies have led to continually increasing numbers of livestock RNA-Seq studies in the last decade. Although the number of studies has increased dramatically, most livestock RNA-Seq experiments are limited by cost to a small number of biological replicates. Meta-analysis procedures can be used to integrate and jointly analyze data from multiple independent studies. Meta-analyses increase the sample size, which in turn increase both statistical power and robustness of the results. In this work, we discuss cutting edge approaches to combining results from multiple independent RNA-Seq studies to improve livestock transcriptomics research. We review currently published RNA-Seq meta-analyses in livestock, describe many of the key issues specific to RNA-Seq meta-analysis in livestock species, and discuss future perspectives.
Collapse
|
2
|
Winter C, Camarão AAR, Steffen I, Jung K. Network meta-analysis of transcriptome expression changes in different manifestations of dengue virus infection. BMC Genomics 2022; 23:165. [PMID: 35220956 PMCID: PMC8882220 DOI: 10.1186/s12864-022-08390-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 02/15/2022] [Indexed: 11/10/2022] Open
Abstract
Abstract
Background
Several studies have been performed to study transcriptome profiles after dengue virus infections with partly different results. Due to slightly different settings of the individual studies, different genes and enriched gene sets are reported in these studies. The main aim of this network meta-analysis was to aggregate a selection of these studies to identify genes and gene sets that are more generally associated with dengue virus infection, i.e. with less dependence on the individual study settings.
Methods
We performed network meta-analysis by different approaches using publicly available gene expression data of five selected studies from the Gene Expression Omnibus database. The study network includes dengue fever (DF), hemorrhagic fever (DHF), shock syndrome (DSS) patients as well as convalescent and healthy control individuals. After data merging and missing value imputation, study-specific batch effects were removed. Pairwise differential expression analysis and subsequent gene-set enrichment analysis were performed between the five study groups. Furthermore, mutual information networks were derived from the top genes of each group comparison, and the separability between the three patient groups was studied by machine learning models.
Results
From the 10 possible pairwise group comparisons in the study network, six genes (IFI27, TPX2, CDT1, DTL, KCTD14 and CDCA3) occur with a noticeable frequency among the top listed genes of each comparison. Thus, there is an increased evidence that these genes play a general role in dengue virus infections. IFI27 and TPX2 have also been highlighted in the context of dengue virus infection by other studies. A few of the identified gene sets from the network meta-analysis overlap with findings from the original studies. Mutual information networks yield additional genes for which the observed pairwise correlation is different between the patient groups. Machine learning analysis shows a moderate separability of samples from the DF, DHF and DSS groups (accuracy about 80%).
Conclusions
Due to an increased sample size, the network meta-analysis could reveal additional genes which are called differentially expressed between the studied groups and that may help to better understand the molecular basis of this disease.
Collapse
|
3
|
Cervantes-Gracia K, Chahwan R, Husi H. Integrative OMICS Data-Driven Procedure Using a Derivatized Meta-Analysis Approach. Front Genet 2022; 13:828786. [PMID: 35186042 PMCID: PMC8855827 DOI: 10.3389/fgene.2022.828786] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 01/12/2022] [Indexed: 12/24/2022] Open
Abstract
The wealth of high-throughput data has opened up new opportunities to analyze and describe biological processes at higher resolution, ultimately leading to a significant acceleration of scientific output using high-throughput data from the different omics layers and the generation of databases to store and report raw datasets. The great variability among the techniques and the heterogeneous methodologies used to produce this data have placed meta-analysis methods as one of the approaches of choice to correlate the resultant large-scale datasets from different research groups. Through multi-study meta-analyses, it is possible to generate results with greater statistical power compared to individual analyses. Gene signatures, biomarkers and pathways that provide new insights of a phenotype of interest have been identified by the analysis of large-scale datasets in several fields of science. However, despite all the efforts, a standardized regulation to report large-scale data and to identify the molecular targets and signaling networks is still lacking. Integrative analyses have also been introduced as complementation and augmentation for meta-analysis methodologies to generate novel hypotheses. Currently, there is no universal method established and the different methods available follow different purposes. Herein we describe a new unifying, scalable and straightforward methodology to meta-analyze different omics outputs, but also to integrate the significant outcomes into novel pathways describing biological processes of interest. The significance of using proper molecular identifiers is highlighted as well as the potential to further correlate molecules from different regulatory levels. To show the methodology’s potential, a set of transcriptomic datasets are meta-analyzed as an example.
Collapse
Affiliation(s)
| | - Richard Chahwan
- Institute of Experimental Immunology, University of Zurich, Zurich, Switzerland
- *Correspondence: Richard Chahwan, ; Holger Husi,
| | - Holger Husi
- Institute of Cardiovascular and Medical Sciences, University of Glasgow, Glasgow, United Kingdom
- Division of Biomedical Sciences, Centre for Health Science, University of the Highlands and Islands, Inverness, United Kingdom
- *Correspondence: Richard Chahwan, ; Holger Husi,
| |
Collapse
|
4
|
Saei H, Govahi A, Abiri A, Eghbali M, Abiri M. Comprehensive transcriptome mining identified the gene expression signature and differentially regulated pathways of the late-onset preeclampsia. Pregnancy Hypertens 2021; 25:91-102. [PMID: 34098523 DOI: 10.1016/j.preghy.2021.05.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 01/11/2021] [Accepted: 05/08/2021] [Indexed: 01/18/2023]
Abstract
Preeclampsia (PE) is categorized as a pregnancy-related hypertensive disorder and is a serious concern in pregnancies. Several factors, including genetic factors (placenta gene expression, and imprinting), oxidative stress, the inaccurate immune response of the mother, and the environmental factors are responsible for PE development, but still, the exact mechanism of the pathogenesis has remained unknown. The main aim of the present study is to identify the gene expression signature in placenta tissue, to unveil disease etiology mechanisms. The GEO, PubMed, and ArrayExpress databases have selected to identify gene expression datasets on placenta samples of both preeclampsia and the normotensive controls. A comprehensive gene expression meta-analysis of fourteen publicly available microarray data of preeclampsia disease has performed to identify gene expression signature and responsible biological pathways and processes. Using two different meta-analysis pipeline (in-house and INMEX) we have identified a total of 1234 differentially expressed genes (DEGs) with in-house method, including 713 overexpressed and 356 under-expressed genes whereas 272 DEGs (131 over and 141 under-expressed) have identified with INMEX, across PEs and healthy controls. Comprehensive functional enrichment and pathway analysis was performed by EnrichR library, whic revealed "Asparagine N-linked glycosylation Homo sapiens", "Nef and signal transduction", "Hemostasis", and "immune system" among the most enriched terms. The present study sets out to explain a novel database of candidate genetic markers and biological pathways that play a critical role in PE development, which might aid in the identification of diagnostic, prognostic, and therapeutic informative molecules.
Collapse
Affiliation(s)
- Hassan Saei
- Department of Medical Genetics and Molecular Biology, Faculty of Medicine, Iran University of Medical Sciences, Tehran, Iran
| | - Ali Govahi
- Department of Medical Immunology, Faculty of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Ameneh Abiri
- Perinatology Department, Arash Women's Hospital, Tehran University of Medical Sciences, Tehran, Iran
| | - Maryam Eghbali
- Department of Medical Genetics, Tehran University of Medical Sciences, Tehran, Iran
| | - Maryam Abiri
- Department of Medical Genetics and Molecular Biology, Faculty of Medicine, Iran University of Medical Sciences, Tehran, Iran; Shahid Akbarabadi Clinical Research Development Unit (ShACRDU), Iran University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
5
|
Mitochondrial Consequences of Organ Preservation Techniques during Liver Transplantation. Int J Mol Sci 2021; 22:ijms22062816. [PMID: 33802177 PMCID: PMC7998211 DOI: 10.3390/ijms22062816] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 02/26/2021] [Accepted: 03/03/2021] [Indexed: 02/08/2023] Open
Abstract
Allograft ischemia during liver transplantation (LT) adversely affects the function of mitochondria, resulting in impairment of oxidative phosphorylation and compromised post-transplant recovery of the affected organ. Several preservation methods have been developed to improve donor organ quality; however, their effects on mitochondrial functions have not yet been compared. This study aimed to summarize the available data on mitochondrial effects of graft preservation methods in preclinical models of LT. Furthermore, a network meta-analysis was conducted to determine if any of these treatments provide a superior benefit, suggesting that they might be used on humans. A systematic search was conducted using electronic databases (EMBASE, MEDLINE (via PubMed), the Cochrane Central Register of Controlled Trials (CENTRAL) and Web of Science) for controlled animal studies using preservation methods for LT. The ATP content of the graft was the primary outcome, as this is an indicator overall mitochondrial function. Secondary outcomes were the respiratory activity of mitochondrial complexes, cytochrome c and aspartate aminotransferase (ALT) release. Both a random-effects model and the SYRCLE risk of bias analysis for animal studies were used. After a comprehensive search of the databases, 25 studies were enrolled in the analysis. Treatments that had the most significant protective effect on ATP content included hypothermic and subnormothermic machine perfusion (HMP and SNMP) (MD = −1.0, 95% CI: (−2.3, 0.3) and MD = −1.1, 95% CI: (−3.2, 1.02)), while the effects of warm ischemia (WI) without cold storage (WI) and normothermic machine perfusion (NMP) were less pronounced (MD = −1.8, 95% CI: (−2.9, −0.7) and MD = −2.1 MD; CI: (−4.6; 0.4)). The subgroup of static cold storage (SCS) with shorter preservation time (< 12 h) yielded better results than SCS ≥ 12 h, NMP and WI, in terms of ATP preservation and the respiratory capacity of complexes. HMP and SNMP stand out in terms of mitochondrial protection when compared to other treatments for LT in animals. The shorter storage time at lower temperatures, together with the dynamic preservation, provided superior protection for the grafts in terms of mitochondrial function. Additional clinical studies on human patients including marginal donors and longer ischemia times are needed to confirm any superiority of preservation methods with respect to mitochondrial function.
Collapse
|
6
|
Zhang Q, Xu X, Wu M, Qin T, Wu S, Liu H. MiRNA Polymorphisms and Hepatocellular Carcinoma Susceptibility: A Systematic Review and Network Meta-Analysis. Front Oncol 2021; 10:562019. [PMID: 33542895 PMCID: PMC7851082 DOI: 10.3389/fonc.2020.562019] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 12/03/2020] [Indexed: 12/11/2022] Open
Abstract
Background Hepatocellular carcinoma (HCC) is an intractable public health threat worldwide, representing the second leading cause of cancer-related mortality, with limited early detection and therapeutic options. Recent findings have revealed that the susceptibility of HCC is closely related to microRNA (miRNA). We performed this systematic review with a network meta-analysis to investigated four single nucleotide polymorphisms (SNPs) that most regularly reported in miRNAs, exploring their involvement in HCC susceptibility and interaction with hepatitis B virus (HBV). Methods Databases were reviewed for related studies published up to May 2019 to identify all studies that compared genotypes of miR-146a rs2910164, miR-149 rs2292832, miR-196a2 rs11614913, and miR-499 rs3746444 with no language and date restrictions. A pairwise meta-analysis was performed to estimate pooled odds ratios and 95% confidence intervals incorporating heterogeneity to assess the relationship between four miRNA polymorphisms and HCC. To further clarify the effect of polymorphisms on HCC, a Bayesian network meta-analysis was conducted to combine the effective sizes of direct and indirect comparisons. Calculations were performed by R version 3.6.1 and STATA 14.0. All steps were performed according to PRISMA guidelines. Results A total of 20 studies were enrolled in this network meta-analysis, providing 5,337 hepatocellular carcinoma cases and 6,585 controls. All included studies had an acceptable quality. Pairwise meta-analysis demonstrated that miR-196a2 rs11614913 was significantly associated with the susceptibility of HCC, while the other three SNPs were not found to have a significant association. In the analysis of HCC patients under different HBV infection status, only miR-196a2 revealed correlation of threefold risk. The network results showed no significant difference in the distribution of genotype frequencies except for miR-196a2, which appeared to have the highest superiority index when comparing and ranking four SNPs. Conclusion MiR-196a2 rs11614913 was significantly associated with the susceptibility of HCC, especially for HBV- related HCC, and that individuals with TC/CC were more susceptible. No significant association was found in the other three miRNA genes. MiR-196a2 could serve as the best predictor of susceptibility in HCC.
Collapse
Affiliation(s)
- Qimeng Zhang
- Department of Health Statistics, School of Public Health, China Medical University, Shenyang, China
| | - Xueying Xu
- Department of Health Statistics, School of Public Health, China Medical University, Shenyang, China
| | - Mingcheng Wu
- Department of Health Statistics, School of Public Health, China Medical University, Shenyang, China
| | - Tiantian Qin
- Department of Health Statistics, School of Public Health, China Medical University, Shenyang, China
| | - Shaoning Wu
- Department of Health Statistics, School of Public Health, China Medical University, Shenyang, China
| | - Hongbo Liu
- Department of Health Statistics, School of Public Health, China Medical University, Shenyang, China
| |
Collapse
|
7
|
Abstract
Public databases featuring original, raw data from "Omics" experiments enable researchers to perform meta-analyses by combining either the raw data or the summarized results of several independent studies. In proteomics, high-throughput protein expression data is measured by diverse techniques such as mass spectrometry, 2-D gel electrophoresis or protein arrays yielding data of different scales. Therefore, direct data merging can be problematic, and combining the summarized data of the individual studies can be advantageous. A special form of meta-analysis is network meta-analysis, where studies with different settings of experimental groups can be combined. However, all studies must be linked by one experimental group that has to appear in each study. Usually that is the control group. Then, a study network is formed and indirect statistical inferences can also be made between study groups that appear not in each of the studies.In this chapter, we describe the working principle of and available software for network meta-analysis. The applicability to high-throughput protein expression data is demonstrated in an example from breast cancer research. We also describe the special challenges when applying this method.
Collapse
Affiliation(s)
- Christine Winter
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine Hannover, Foundation, Hannover, Germany
| | - Klaus Jung
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine Hannover, Foundation, Hannover, Germany.
| |
Collapse
|
8
|
Fonseca PAS, Suárez-Vega A, Cánovas A. Weighted Gene Correlation Network Meta-Analysis Reveals Functional Candidate Genes Associated with High- and Sub-Fertile Reproductive Performance in Beef Cattle. Genes (Basel) 2020; 11:genes11050543. [PMID: 32408659 PMCID: PMC7290847 DOI: 10.3390/genes11050543] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 05/04/2020] [Accepted: 05/06/2020] [Indexed: 12/13/2022] Open
Abstract
Improved reproductive efficiency could lead to economic benefits for the beef industry, once the intensive selection pressure has led to a decreased fertility. However, several factors limit our understanding of fertility traits, including genetic differences between populations and statistical limitations. In the present study, the RNA-sequencing data from uterine samples of high-fertile (HF) and sub-fertile (SF) animals was integrated using co-expression network meta-analysis, weighted gene correlation network analysis, identification of upstream regulators, variant calling, and network topology approaches. Using this pipeline, top hub-genes harboring fixed variants (HF × SF) were identified in differentially co-expressed gene modules (DcoExp). The functional prioritization analysis identified the genes with highest potential to be key-regulators of the DcoExp modules between HF and SF animals. Consequently, 32 functional candidate genes (10 upstream regulators and 22 top hub-genes of DcoExp modules) were identified. These genes were associated with the regulation of relevant biological processes for fertility, such as embryonic development, germ cell proliferation, and ovarian hormone regulation. Additionally, 100 candidate variants (single nucleotide polymorphisms (SNPs) and insertions and deletions (INDELs)) were identified within those genes. In the long-term, the results obtained here may help to reduce the frequency of subfertility in beef herds, reducing the associated economic losses caused by this condition.
Collapse
Affiliation(s)
- Pablo A. S. Fonseca
- Correspondence: (P.A.S.F.); (A.C.); Tel.: +1-519-824-4120 (ext. 56295) (A.C.)
| | | | - Angela Cánovas
- Correspondence: (P.A.S.F.); (A.C.); Tel.: +1-519-824-4120 (ext. 56295) (A.C.)
| |
Collapse
|
9
|
Jean-Quartier C, Jeanquartier F, Holzinger A. Open Data for Differential Network Analysis in Glioma. Int J Mol Sci 2020; 21:E547. [PMID: 31952211 PMCID: PMC7013918 DOI: 10.3390/ijms21020547] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Revised: 12/29/2019] [Accepted: 01/03/2020] [Indexed: 12/20/2022] Open
Abstract
The complexity of cancer diseases demands bioinformatic techniques and translational research based on big data and personalized medicine. Open data enables researchers to accelerate cancer studies, save resources and foster collaboration. Several tools and programming approaches are available for analyzing data, including annotation, clustering, comparison and extrapolation, merging, enrichment, functional association and statistics. We exploit openly available data via cancer gene expression analysis, we apply refinement as well as enrichment analysis via gene ontology and conclude with graph-based visualization of involved protein interaction networks as a basis for signaling. The different databases allowed for the construction of huge networks or specified ones consisting of high-confidence interactions only. Several genes associated to glioma were isolated via a network analysis from top hub nodes as well as from an outlier analysis. The latter approach highlights a mitogen-activated protein kinase next to a member of histondeacetylases and a protein phosphatase as genes uncommonly associated with glioma. Cluster analysis from top hub nodes lists several identified glioma-associated gene products to function within protein complexes, including epidermal growth factors as well as cell cycle proteins or RAS proto-oncogenes. By using selected exemplary tools and open-access resources for cancer research and differential network analysis, we highlight disturbed signaling components in brain cancer subtypes of glioma.
Collapse
|