Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Ho YY, Parmigiani G, Louis TA, Cope LM. Modeling liquid association. Biometrics 2011;67:133-41. [PMID: 20528865 DOI: 10.1111/j.1541-0420.2010.01440.x] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Number

Cited by Other Article(s)

Khayer N, Jalessi M, Farhadi M, Azad Z. S100a9 might act as a modulator of the Toll-like receptor 4 transduction pathway in chronic rhinosinusitis with nasal polyps. Sci Rep 2024;14:9722. [PMID: 38678138 PMCID: PMC11055867 DOI: 10.1038/s41598-024-60205-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Accepted: 04/19/2024] [Indexed: 04/29/2024] Open

Zhang W, Ma Z, Wang L, Fan D, Ho YY. Genome-wide search algorithms for identifying dynamic gene co-expression via Bayesian variable selection. Stat Med 2023;42:5616-5629. [PMID: 37806971 DOI: 10.1002/sim.9928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Revised: 08/08/2023] [Accepted: 09/19/2023] [Indexed: 10/10/2023]

Tu D, Mahony B, Moore TM, Bertolero MA, Alexander-Bloch AF, Gur R, Bassett DS, Satterthwaite TD, Raznahan A, Shinohara RT. CoCoA: conditional correlation models with association size. Biostatistics 2023;25:154-170. [PMID: 35939558 PMCID: PMC10724258 DOI: 10.1093/biostatistics/kxac032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 07/14/2022] [Accepted: 07/18/2022] [Indexed: 11/13/2022] Open

Barton S, Broad Z, Ortiz-Barrientos D, Donovan D, Lefevre J. Hypergraphs and centrality measures identifying key features in gene expression data. Math Biosci 2023;366:109089. [PMID: 37914024 DOI: 10.1016/j.mbs.2023.109089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 10/16/2023] [Accepted: 10/18/2023] [Indexed: 11/03/2023]

Ma Z, Davis SW, Ho YY. Flexible copula model for integrating correlated multi-omics data from single-cell experiments. Biometrics 2022. [PMID: 35622236 DOI: 10.1111/biom.13701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Accepted: 05/18/2022] [Indexed: 11/27/2022]

Li L, Zeng J, Zhang X. Generalized Liquid Association Analysis for Multimodal Data Integration. J Am Stat Assoc 2022;118:1984-1996. [PMID: 38099062 PMCID: PMC10720690 DOI: 10.1080/01621459.2021.2024437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 12/27/2021] [Indexed: 10/19/2022]

Shokati Eshkiki Z, Khayer N, Talebi A, Karbalaei R, Akbari A. Novel insight into pancreatic adenocarcinoma pathogenesis using liquid association analysis. BMC Med Genomics 2022;15:30. [PMID: 35180880 PMCID: PMC8855560 DOI: 10.1186/s12920-022-01174-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2021] [Accepted: 02/01/2022] [Indexed: 11/10/2022] Open

Khayer N, Jalessi M, Jahanbakhshi A, Tabib Khooei A, Mirzaie M. Nkx3-1 and Fech genes might be switch genes involved in pituitary non-functioning adenoma invasiveness. Sci Rep 2021;11:20943. [PMID: 34686726 PMCID: PMC8536755 DOI: 10.1038/s41598-021-00431-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Accepted: 10/12/2021] [Indexed: 12/12/2022] Open

Abstract

Non-functioning pituitary adenomas (NFPAs) are typical pituitary macroadenomas in adults associated with increased mortality and morbidity. Although pituitary adenomas are commonly considered slow-growing benign brain tumors, numerous of them possess an invasive nature. Such tumors destroy sella turcica and invade the adjacent tissues such as the cavernous sinus and sphenoid sinus. In these cases, the most critical obstacle for complete surgical removal is the high risk of damaging adjacent vital structures. Therefore, the development of novel therapeutic strategies for either early diagnosis through biomarkers or medical therapies to reduce the recurrence rate of NFPAs is imperative. Identification of gene interactions has paved the way for decoding complex molecular mechanisms, including disease-related pathways, and identifying the most momentous genes involved in a specific disease. Currently, our knowledge of the invasion of the pituitary adenoma at the molecular level is not sufficient. The current study aimed to identify critical biomarkers and biological pathways associated with invasiveness in the NFPAs using a three-way interaction model for the first time. In the current study, the Liquid association method was applied to capture the statistically significant triplets involved in NFPAs invasiveness. Subsequently, Random Forest analysis was applied to select the most important switch genes. Finally, gene set enrichment (GSE) and gene regulatory network (GRN) analyses were applied to trace the biological relevance of the statistically significant triplets. The results of this study suggest that "mRNA processing" and "spindle organization" biological processes are important in NFAPs invasiveness. Specifically, our results suggest Nkx3-1 and Fech as two switch genes in NFAPs invasiveness that may be potential biomarkers or target genes in this pathology.

Collapse

Cao X, Pounds S. Gene-set distance analysis (GSDA): a powerful tool for gene-set association analysis. BMC Bioinformatics 2021;22:207. [PMID: 33882829 PMCID: PMC8059024 DOI: 10.1186/s12859-021-04110-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 03/30/2021] [Indexed: 11/23/2022] Open

Abstract

Background

Identifying sets of related genes (gene sets) that are empirically associated with a treatment or phenotype often yields valuable biological insights. Several methods effectively identify gene sets in which individual genes have simple monotonic relationships with categorical, quantitative, or censored event-time variables. Some distance-based methods, such as distance correlations, may detect complex non-monotone associations of a gene-set with a quantitative variable that elude other methods. However, the distance correlations have yet to be generalized to associate gene-sets with categorical and censored event-time endpoints. Also, there is a need to determine which genes empirically drive the significance of an association of a gene set with an endpoint.

Results

We develop gene-set distance analysis (GSDA) by generalizing distance correlations to evaluate the association of a gene set with categorical and censored event-time variables. We also develop a backward elimination procedure to identify a subset of genes that empirically drive significant associations. In simulation studies, GSDA more effectively identified complex non-monotone gene-set associations than did six other published methods. In the analysis of a pediatric acute myeloid leukemia (AML) data set, GSDA was the only method to discover that event-free survival (EFS) was associated with the 56-gene AML pathway gene-set, narrow that result down to 5 genes, and confirm the association of those 5 genes with EFS in a separate validation cohort. These results indicate that GSDA effectively identifies and characterizes complex non-monotonic gene-set associations that are missed by other methods.

Conclusion

GSDA is a powerful and flexible method to detect gene-set association with categorical, quantitative, or censored event-time variables, especially to detect complex non-monotonic gene-set associations. Available at https://CRAN.R-project.org/package=GSDA.

Supplementary information

The online version contains supplementary material available at 10.1186/s12859-021-04110-x.

Collapse

Yang Z, Ho YY. Modeling dynamic correlation in zero-inflated bivariate count data with applications to single-cell RNA sequencing data. Biometrics 2021;78:766-776. [PMID: 33720414 PMCID: PMC8477913 DOI: 10.1111/biom.13457] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Revised: 03/03/2021] [Accepted: 03/08/2021] [Indexed: 12/13/2022]

Wu G, Ge L, Zhao N, Liu F, Shi Z, Zheng N, Zhou D, Jiang X, Halverson L, Xie B. Environment dependent microbial co-occurrences across a cyanobacterial bloom in a freshwater lake. Environ Microbiol 2020;23:327-339. [PMID: 33185973 DOI: 10.1111/1462-2920.15315] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Revised: 10/28/2020] [Accepted: 11/09/2020] [Indexed: 11/29/2022]

Rps27a might act as a controller of microglia activation in triggering neurodegenerative diseases. PLoS One 2020;15:e0239219. [PMID: 32941527 PMCID: PMC7498011 DOI: 10.1371/journal.pone.0239219] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2020] [Accepted: 09/01/2020] [Indexed: 01/10/2023] Open

Abstract

Neurodegenerative diseases (NDDs) are increasing serious menaces to human health in the recent years. Despite exhibiting different clinical phenotypes and selective neuronal loss, there are certain common features in these disorders, suggesting the presence of commonly dysregulated pathways. Identifying causal genes and dysregulated pathways can be helpful in providing effective treatment in these diseases. Interestingly, in spite of the considerable researches on NDDs, to the best of our knowledge, no dysregulated genes and/or pathways were reported in common across all the major NDDs so far. In this study, for the first time, we have applied the three-way interaction model, as an approach to unravel sophisticated gene interactions, to trace switch genes and significant pathways that are involved in six major NDDs. Subsequently, a gene regulatory network was constructed to investigate the regulatory communication of statistically significant triplets. Finally, KEGG pathway enrichment analysis was applied to find possible common pathways. Because of the central role of neuroinflammation and immune system responses in both pathogenic and protective mechanisms in the NDDs, we focused on immune genes in this study. Our results suggest that "cytokine-cytokine receptor interaction" pathway is enriched in all of the studied NDDs, while "osteoclast differentiation" and "natural killer cell mediated cytotoxicity" pathways are enriched in five of the NDDs each. The results of this study indicate that three pathways that include "osteoclast differentiation", "natural killer cell mediated cytotoxicity" and "cytokine-cytokine receptor interaction" are common in five, five and six NDDs, respectively. Additionally, our analysis showed that Rps27a as a switch gene, together with the gene pair {Il-18, Cx3cl1} form a statistically significant and biologically relevant triplet in the major NDDs. More specifically, we suggested that Cx3cl1 might act as a potential upstream regulator of Il-18 in microglia activation, and in turn, might be controlled with Rps27a in triggering NDDs.

Collapse

Lu J, Lu Y, Ding Y, Xiao Q, Liu L, Cai Q, Kong Y, Bai Y, Yu T. DNLC: differential network local consistency analysis. BMC Bioinformatics 2019;20:489. [PMID: 31874600 PMCID: PMC6929334 DOI: 10.1186/s12859-019-3046-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Accepted: 08/21/2019] [Indexed: 12/04/2022] Open

A hypergraph-based method for large-scale dynamic correlation study at the transcriptomic scale. BMC Genomics 2019;20:397. [PMID: 31117943 PMCID: PMC6530038 DOI: 10.1186/s12864-019-5787-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Accepted: 05/09/2019] [Indexed: 12/22/2022] Open

Abstract

Background

The biological regulatory system is highly dynamic. Correlations between functionally related genes change over different biological conditions, which are often unobserved in the data. At the gene level, the dynamic correlations result in three-way gene interactions involving a pair of genes that change correlation, and a third gene that reflects the underlying cellular conditions. This type of ternary relation can be quantified by the Liquid Association statistic. Studying these three-way interactions at the gene triplet level have revealed important regulatory mechanisms in the biological system. Currently, due to the extremely large amount of possible combinations of triplets within a high-throughput gene expression dataset, no method is available to examine the ternary relationship at the biological system level and formally address the false discovery issue.

Results

Here we propose a new method, Hypergraph for Dynamic Correlation (HDC), to construct module-level three-way interaction networks. The method is able to present integrative uniform hypergraphs to reflect the global dynamic correlation pattern in the biological system, providing guidance to down-stream gene triplet-level analyses. To validate the method’s ability, we conducted two real data experiments using a melanoma RNA-seq dataset from The Cancer Genome Atlas (TCGA) and a yeast cell cycle dataset. The resulting hypergraphs are clearly biologically plausible, and suggest novel relations relevant to the biological conditions in the data.

Conclusions

We believe the new approach provides a valuable alternative method to analyze omics data that can extract higher order structures. The software is at https://github.com/yunchuankong/HypergraphDynamicCorrelation.

Electronic supplementary material

The online version of this article (10.1186/s12864-019-5787-x) contains supplementary material, which is available to authorized users.

Collapse

Ai D, Li X, Pan H, Chen J, Cram JA, Xia LC. Explore mediated co-varying dynamics in microbial community using integrated local similarity and liquid association analysis. BMC Genomics 2019;20:185. [PMID: 30967122 PMCID: PMC6456937 DOI: 10.1186/s12864-019-5469-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Abstract

BACKGROUND

Discovering the key microbial species and environmental factors of microbial community and characterizing their relationships with other members are critical to ecosystem studies. The microbial co-occurrence patterns across a variety of environmental settings have been extensively characterized. However, previous studies were limited by their restriction toward pairwise relationships, while there was ample evidence of third-party mediated co-occurrence in microbial communities.

METHODS

We implemented and applied the triplet-based liquid association analysis in combination with the local similarity analysis procedure to microbial ecology data. We developed an intuitive scheme to visualize those complex triplet associations along with pairwise correlations. Using a time series from the marine microbial ecosystem as example, we identified pairs of operational taxonomic units (OTUs) where the strength of their associations appeared to relate to the values of a third "mediator" variable. These "mediator" variables appear to modulate the associations between pairs of bacteria.

RESULTS

Using this analysis, we were able to assess the OTUs' ability to regulate its functional partners in the community, typically not manifested in the pairwise correlation patterns. For example, we identified Flavobacteria as a multifaceted player in the marine microbial ecosystem, and its clades were involved in mediating other OTU pairs. By contrast, SAR11 clades were not active mediators of the community, despite being abundant and highly correlated with other OTUs. Our results suggested that Flavobacteria are more likely to respond to situations where particles and unusual sources of dissolved organic material are prevalent, such as after a plankton bloom. On the other hand, SAR11s are oligotrophic chemoheterotrophs with inflexible metabolisms, and their relationships with other organisms may be less governed by environmental or biological factors.

CONCLUSIONS

By integrating liquid association with local similarity analysis to explore the mediated co-varying dynamics, we presented a novel perspective and a useful toolkit to analyze and interpret time series data from microbial community. Our augmented association network analysis is thus more representative of the true underlying dynamic structure of the microbial community. The analytic software in this study was implemented as new functionalities of the ELSA (Extended local similarity analysis) tool, which is available for free download ( http://bitbucket.org/charade/elsa ).

Collapse

Kinzy TG, Starr TK, Tseng GC, Ho YY. Meta-analytic framework for modeling genetic coexpression dynamics. Stat Appl Genet Mol Biol 2019;18:/j/sagmb.ahead-of-print/sagmb-2017-0052/sagmb-2017-0052.xml. [DOI: 10.1515/sagmb-2017-0052] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Khayer N, Mirzaie M, Marashi SA, Rezaei-Tavirani M, Goshadrou F. Three-way interaction model with switching mechanism as an effective strategy for tracing functionally-related genes. Expert Rev Proteomics 2018;16:161-169. [PMID: 30556756 DOI: 10.1080/14789450.2019.1559734] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]

Yu T. A new dynamic correlation algorithm reveals novel functional aspects in single cell and bulk RNA-seq data. PLoS Comput Biol 2018;14:e1006391. [PMID: 30080856 PMCID: PMC6095616 DOI: 10.1371/journal.pcbi.1006391] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2018] [Revised: 08/16/2018] [Accepted: 07/24/2018] [Indexed: 01/21/2023] Open

Abstract

Dynamic correlations are pervasive in high-throughput data. Large numbers of gene pairs can change their correlation patterns in response to observed/unobserved changes in physiological states. Finding changes in correlation patterns can reveal important regulatory mechanisms. Currently there is no method that can effectively detect global dynamic correlation patterns in a dataset. Given the challenging nature of the problem, the currently available methods use genes as surrogate measurements of physiological states, which cannot faithfully represent true underlying biological signals. In this study we develop a new method that directly identifies strong latent dynamic correlation signals from the data matrix, named DCA: Dynamic Correlation Analysis. At the center of the method is a new metric for the identification of pairs of variables that are highly likely to be dynamically correlated, without knowing the underlying physiological states that govern the dynamic correlation. We validate the performance of the method with extensive simulations. We applied the method to three real datasets: a single cell RNA-seq dataset, a bulk RNA-seq dataset, and a microarray gene expression dataset. In all three datasets, the method reveals novel latent factors with clear biological meaning, bringing new insights into the data.

Dynamic correlation is an important area in expression data. However it hasn’t received much attention because of the lack of effective methods that can unravel the complex relationship. Here we describe a new method that represents a substantial improvement over existing approaches. It achieves the goal of efficiently finding patterns of dynamic correlation in RNA-seq data, as well as detecting biological functions associated with the dynamic correlation patterns. Unlike traditional methods that focus on first-order structures, linear or nonlinear, our method finds second-order patterns that bring insights into the regulations of the complex system. Some of the interesting discoveries by the new method, such as immunological functions of some intestinal epithelial cells, are validated by recent biological publications.

Collapse

Xu X, Wang M, Li L, Che R, Li P, Pei L, Li H. Genome-wide trait-trait dynamics correlation study dissects the gene regulation pattern in maize kernels. BMC PLANT BIOLOGY 2017;17:163. [PMID: 29037150 PMCID: PMC5644097 DOI: 10.1186/s12870-017-1119-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/02/2017] [Accepted: 10/09/2017] [Indexed: 06/07/2023]

Abstract

BACKGROUND

Dissecting the genetic basis and regulatory mechanisms for the biosynthesis and accumulation of nutrients in maize could lead to the improved nutritional quality of this crop. Gene expression is regulated at the genomic, transcriptional, and post-transcriptional levels, all of which can produce diversity among traits. However, the expression of most genes connected with a particular trait usually does not have a direct association with the variation of that trait. In addition, expression profiles of genes involved in a single pathway may vary as the intrinsic cellular state changes. To work around these issues, we utilized a statistical method, liquid association (LA) to investigate the complex pattern of gene regulation in maize kernels.

RESULTS

We applied LA to the expression profiles of 28,769 genes to dissect dynamic trait-trait correlation patterns in maize kernels. Among the 1000 LA pairs (LAPs) with the largest LA scores, 686 LAPs were identified conditional correlation. We also identified 830 and 215 LA-scouting leaders based on the positive and negative LA scores, which were significantly enriched for some biological processes and molecular functions. Our analysis of the dynamic co-expression patterns in the carotene biosynthetic pathway clearly indicated the important role of lcyE, CYP97A, ZEP1, and VDE in this pathway, which may change the direction of carotene biosynthesis by controlling the influx and efflux of the substrate. The dynamic trait-trait correlation patterns between gene expression and oil concentration in the fatty acid metabolic pathway and its complex regulatory network were also assessed. 23 of 26 oil-associated genes were correlated with oil concentration conditioning on 580 LA-scoutinggenes, and 5% of these LA-scouting genes were annotated as enzymes in the oil metabolic pathway.

CONCLUSIONS

By focusing on the carotenoid and oil biosynthetic pathways in maize, we showed that a genome-wide LA analysis provides a novel and effective way to detect transcriptional regulatory relationships. This method will help us understand the biological role of maize kernel genes and will benefit maize breeding programs.

Collapse

Khayer N, Marashi SA, Mirzaie M, Goshadrou F. Three-way interaction model to trace the mechanisms involved in Alzheimer's disease transgenic mice. PLoS One 2017;12:e0184697. [PMID: 28934252 PMCID: PMC5608283 DOI: 10.1371/journal.pone.0184697] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Accepted: 08/29/2017] [Indexed: 11/19/2022] Open

Wang L, Liu S, Ding Y, Yuan SS, Ho YY, Tseng GC. Meta-analytic framework for liquid association. Bioinformatics 2017;33:2140-2147. [PMID: 28334340 PMCID: PMC6044323 DOI: 10.1093/bioinformatics/btx138] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2016] [Revised: 02/11/2017] [Accepted: 03/09/2017] [Indexed: 11/14/2022] Open

Abstract

MOTIVATION

Although coexpression analysis via pair-wise expression correlation is popularly used to elucidate gene-gene interactions at the whole-genome scale, many complicated multi-gene regulations require more advanced detection methods. Liquid association (LA) is a powerful tool to detect the dynamic correlation of two gene variables depending on the expression level of a third variable (LA scouting gene). LA detection from single transcriptomic study, however, is often unstable and not generalizable due to cohort bias, biological variation and limited sample size. With the rapid development of microarray and NGS technology, LA analysis combining multiple gene expression studies can provide more accurate and stable results.

RESULTS

In this article, we proposed two meta-analytic approaches for LA analysis (MetaLA and MetaMLA) to combine multiple transcriptomic studies. To compensate demanding computing, we also proposed a two-step fast screening algorithm for more efficient genome-wide screening: bootstrap filtering and sign filtering. We applied the methods to five Saccharomyces cerevisiae datasets related to environmental changes. The fast screening algorithm reduced 98% of running time. When compared with single study analysis, MetaLA and MetaMLA provided stronger detection signal and more consistent and stable results. The top triplets are highly enriched in fundamental biological processes related to environmental changes. Our method can help biologists understand underlying regulatory mechanisms under different environmental exposure or disease states.

AVAILABILITY AND IMPLEMENTATION

A MetaLA R package, data and code for this article are available at http://tsenglab.biostat.pitt.edu/software.htm.

CONTACT

ctseng@pitt.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Cao X, Crews KR, Downing J, Lamba J, Pounds SB. CC-PROMISE effectively integrates two forms of molecular data with multiple biologically related endpoints. BMC Bioinformatics 2016;17:382. [PMID: 27766934 PMCID: PMC5073973 DOI: 10.1186/s12859-016-1217-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open

Yuan H, Li Z, Tang NLS, Deng M. A network based covariance test for detecting multivariate eQTL in saccharomyces cerevisiae. BMC SYSTEMS BIOLOGY 2016;10 Suppl 1:8. [PMID: 26818242 PMCID: PMC4895706 DOI: 10.1186/s12918-015-0245-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Gunderson T, Ho YY. An efficient algorithm to explore liquid association on a genome-wide scale. BMC Bioinformatics 2014;15:371. [PMID: 25431229 PMCID: PMC4255454 DOI: 10.1186/s12859-014-0371-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2014] [Accepted: 10/30/2014] [Indexed: 01/04/2023] Open

Wang L, Zheng W, Zhao H, Deng M. Statistical analysis reveals co-expression patterns of many pairs of genes in yeast are jointly regulated by interacting loci. PLoS Genet 2013;9:e1003414. [PMID: 23555313 PMCID: PMC3610942 DOI: 10.1371/journal.pgen.1003414] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2012] [Accepted: 02/11/2013] [Indexed: 11/30/2022] Open

Abstract

Expression quantitative trait loci (eQTL) studies have generated large amounts of data in different organisms. The analyses of these data have led to many novel findings and biological insights on expression regulations. However, the role of epistasis in the joint regulation of multiple genes has not been explored. This is largely due to the computational complexity involved when multiple traits are simultaneously considered against multiple markers if an exhaustive search strategy is adopted. In this article, we propose a computationally feasible approach to identify pairs of chromosomal regions that interact to regulate co-expression patterns of pairs of genes. Our approach is built on a bivariate model whose covariance matrix depends on the joint genotypes at the candidate loci. We also propose a filtering process to reduce the computational burden. When we applied our method to a yeast eQTL dataset profiled under both the glucose and ethanol conditions, we identified a total of 225 and 224 modules, with each module consisting of two genes and two eQTLs where the two eQTLs epistatically regulate the co-expression patterns of the two genes. We found that many of these modules have biological interpretations. Under the glucose condition, ribosome biogenesis was co-regulated with the signaling and carbohydrate catabolic processes, whereas silencing and aging related genes were co-regulated under the ethanol condition with the eQTLs containing genes involved in oxidative stress response process.

eQTL studies collect both gene expression and genotype data, and they are highly informative as to how genes regulate expressions. Although much progress has been made in the analysis of such data, most studies have considered one marker at a time. As a result, those markers with weak marginal yet strong interactive effects may not be inferred from these single-marker-based analyses. In this article, using joint expression patterns between two genes (versus one gene) as the primary phenotype, we propose a novel statistical method to conduct an exhaustive search for joint marker analysis. When our method is applied to a well-studied dataset, we were able to identify many novel features that were overlooked by existing methods. Our general strategy has general applicability to other scientific problems.

Collapse

Qiu P, Zhang L. Identification of markers associated with global changes in DNA methylation regulation in cancers. BMC Bioinformatics 2012;13 Suppl 13:S7. [PMID: 23320390 PMCID: PMC3426805 DOI: 10.1186/1471-2105-13-s13-s7] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open