1
|
Sun L, Zhang A, Liang F. Time-varying dynamic Bayesian network learning for an fMRI study of emotion processing. Stat Med 2024; 43:2713-2733. [PMID: 38690642 PMCID: PMC11195441 DOI: 10.1002/sim.10096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2023] [Revised: 04/01/2024] [Accepted: 04/19/2024] [Indexed: 05/02/2024]
Abstract
This article presents a novel method for learning time-varying dynamic Bayesian networks. The proposed method breaks down the dynamic Bayesian network learning problem into a sequence of regression inference problems and tackles each problem using the Markov neighborhood regression technique. Notably, the method demonstrates scalability concerning data dimensionality, accommodates time-varying network structure, and naturally handles multi-subject data. The proposed method exhibits consistency and offers superior performance compared to existing methods in terms of estimation accuracy and computational efficiency, as supported by extensive numerical experiments. To showcase its effectiveness, we apply the proposed method to an fMRI study investigating the effective connectivity among various regions of interest (ROIs) during an emotion-processing task. Our findings reveal the pivotal role of the subcortical-cerebellum in emotion processing.
Collapse
Affiliation(s)
- Lizhe Sun
- Beijing International Center for Mathematical Research, Peking University and Department of Statistics, Purdue University
| | | | - Faming Liang
- Department of Statistics, Purdue University, West Lafayette, IN 47907
| |
Collapse
|
2
|
Shahid AH, Singh M. Computational intelligence techniques for medical diagnosis and prognosis: Problems and current developments. Biocybern Biomed Eng 2019. [DOI: 10.1016/j.bbe.2019.05.010] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
3
|
Yu D, Lim J, Wang X, Liang F, Xiao G. Enhanced construction of gene regulatory networks using hub gene information. BMC Bioinformatics 2017; 18:186. [PMID: 28335719 PMCID: PMC5364645 DOI: 10.1186/s12859-017-1576-1] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2016] [Accepted: 03/03/2017] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Gene regulatory networks reveal how genes work together to carry out their biological functions. Reconstructions of gene networks from gene expression data greatly facilitate our understanding of underlying biological mechanisms and provide new opportunities for biomarker and drug discoveries. In gene networks, a gene that has many interactions with other genes is called a hub gene, which usually plays an essential role in gene regulation and biological processes. In this study, we developed a method for reconstructing gene networks using a partial correlation-based approach that incorporates prior information about hub genes. Through simulation studies and two real-data examples, we compare the performance in estimating the network structures between the existing methods and the proposed method. RESULTS In simulation studies, we show that the proposed strategy reduces errors in estimating network structures compared to the existing methods. When applied to Escherichia coli, the regulation network constructed by our proposed ESPACE method is more consistent with current biological knowledge than the SPACE method. Furthermore, application of the proposed method in lung cancer has identified hub genes whose mRNA expression predicts cancer progress and patient response to treatment. CONCLUSIONS We have demonstrated that incorporating hub gene information in estimating network structures can improve the performance of the existing methods.
Collapse
Affiliation(s)
- Donghyeon Yu
- Department of Statistics, Inha University, Incheon, Korea
| | - Johan Lim
- Department of Statistics, Seoul National University, Seoul, Korea
| | - Xinlei Wang
- Department of Statistical Science, Southern Methodist University, 6425 Boaz Lane, Dallas, TX 75205 USA
| | - Faming Liang
- Department of Biostatistics, University of Florida, 2004 Mowry Road, Gainesville, FL 32611 USA
| | - Guanghua Xiao
- Department of Clinical Sciences, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75390 USA
| |
Collapse
|
4
|
Fuster-Parra P, Vidal-Conti J, Borràs PA, Palou P. Bayesian networks to identify statistical dependencies. A case study of Spanish university students' habits. Inform Health Soc Care 2016; 42:166-179. [PMID: 27245256 DOI: 10.1080/17538157.2016.1178117] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
OBJECTIVE The present study sought to discover the relationships among different features characterizing Spanish university students' habits through a Bayesian network (BN). The set of features with the strongest influence in specific features can be determined. METHODS A BN was built from a dataset composed of 13 relevant features, determining the dependencies and conditional independencies from empirical data in a multivariate context. The structure was learned with the bnlearn package in R language introducing prior knowledge, and the parameters were obtained with Netica software. Three reasoning patterns were considered to make inferences: intercausal, evidential, and causal reasoning. RESULTS BN determined the different relationships. Through inference several conclusions were achieved, for instance a high probability value of physical activity in low state was obtained when active peers were instantiated to none state, self-rated fitness to fair state, bmi to normal weight, sitting time to moderate, age to 22-25, and gender to woman state. CONCLUSIONS Bayesian networks may help to characterize Spanish University students' habits.
Collapse
Affiliation(s)
- P Fuster-Parra
- a Department of Mathematics and Computer Science , Universitat Illes Balears , Palma de Mallorca , Baleares , Spain
| | - J Vidal-Conti
- b Department of Education , Universitat Illes Balears , Palma de Mallorca , Baleares , Spain
| | - P A Borràs
- b Department of Education , Universitat Illes Balears , Palma de Mallorca , Baleares , Spain
| | - P Palou
- b Department of Education , Universitat Illes Balears , Palma de Mallorca , Baleares , Spain
| |
Collapse
|
5
|
Fuster-Parra P, Tauler P, Bennasar-Veny M, Ligęza A, López-González AA, Aguiló A. Bayesian network modeling: A case study of an epidemiologic system analysis of cardiovascular risk. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2016; 126:128-142. [PMID: 26777431 DOI: 10.1016/j.cmpb.2015.12.010] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Revised: 11/28/2015] [Accepted: 12/11/2015] [Indexed: 06/05/2023]
Abstract
An extensive, in-depth study of cardiovascular risk factors (CVRF) seems to be of crucial importance in the research of cardiovascular disease (CVD) in order to prevent (or reduce) the chance of developing or dying from CVD. The main focus of data analysis is on the use of models able to discover and understand the relationships between different CVRF. In this paper a report on applying Bayesian network (BN) modeling to discover the relationships among thirteen relevant epidemiological features of heart age domain in order to analyze cardiovascular lost years (CVLY), cardiovascular risk score (CVRS), and metabolic syndrome (MetS) is presented. Furthermore, the induced BN was used to make inference taking into account three reasoning patterns: causal reasoning, evidential reasoning, and intercausal reasoning. Application of BN tools has led to discovery of several direct and indirect relationships between different CVRF. The BN analysis showed several interesting results, among them: CVLY was highly influenced by smoking being the group of men the one with highest risk in CVLY; MetS was highly influence by physical activity (PA) being again the group of men the one with highest risk in MetS, and smoking did not show any influence. BNs produce an intuitive, transparent, graphical representation of the relationships between different CVRF. The ability of BNs to predict new scenarios when hypothetical information is introduced makes BN modeling an Artificial Intelligence (AI) tool of special interest in epidemiological studies. As CVD is multifactorial the use of BNs seems to be an adequate modeling tool.
Collapse
Affiliation(s)
- P Fuster-Parra
- Department of Mathematics and Computer Science, Universitat Illes Balears, Palma de Mallorca, Baleares E-07122, Spain; Research Group on Evidence, Lifestyles & Health, Research Institute on Health Sciences (IUNICS), Universitat Illes Balears, Palma de Mallorca, Baleares E-07122, Spain.
| | - P Tauler
- Research Group on Evidence, Lifestyles & Health, Research Institute on Health Sciences (IUNICS), Universitat Illes Balears, Palma de Mallorca, Baleares E-07122, Spain
| | - M Bennasar-Veny
- Research Group on Evidence, Lifestyles & Health, Research Institute on Health Sciences (IUNICS), Universitat Illes Balears, Palma de Mallorca, Baleares E-07122, Spain
| | - A Ligęza
- Department of Applied Computer Science, AGH University of Science and Technology, Kraków PL-30-059, Poland
| | - A A López-González
- Prevention of Occupational Risks in Health Services, GESMA, Balearic Islands Health Service, Hospital de Manacor, Manacor, Baleares E-07500, Spain
| | - A Aguiló
- Research Group on Evidence, Lifestyles & Health, Research Institute on Health Sciences (IUNICS), Universitat Illes Balears, Palma de Mallorca, Baleares E-07122, Spain
| |
Collapse
|
6
|
Weak Convergence Rates of Population Versus Single-Chain Stochastic Approximation MCMC Algorithms. ADV APPL PROBAB 2016. [DOI: 10.1017/s0001867800007540] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
In this paper we establish the theory of weak convergence (toward a normal distribution) for both single-chain and population stochastic approximation Markov chain Monte Carlo (MCMC) algorithms (SAMCMC algorithms). Based on the theory, we give an explicit ratio of convergence rates for the population SAMCMC algorithm and the single-chain SAMCMC algorithm. Our results provide a theoretic guarantee that the population SAMCMC algorithms are asymptotically more efficient than the single-chain SAMCMC algorithms when the gain factor sequence decreases slower than O(1 / t), where t indexes the number of iterations. This is of interest for practical applications.
Collapse
|
7
|
Song Q, Wu M, Liang F. Weak Convergence Rates of Population Versus Single-Chain Stochastic Approximation MCMC Algorithms. ADV APPL PROBAB 2016. [DOI: 10.1239/aap/1418396243] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In this paper we establish the theory of weak convergence (toward a normal distribution) for both single-chain and population stochastic approximation Markov chain Monte Carlo (MCMC) algorithms (SAMCMC algorithms). Based on the theory, we give an explicit ratio of convergence rates for the population SAMCMC algorithm and the single-chain SAMCMC algorithm. Our results provide a theoretic guarantee that the population SAMCMC algorithms are asymptotically more efficient than the single-chain SAMCMC algorithms when the gain factor sequence decreases slower than O(1 / t), where t indexes the number of iterations. This is of interest for practical applications.
Collapse
|
8
|
Fuster-Parra P, García-Mas A, Ponseti FJ, Palou P, Cruz J. A Bayesian network to discover relationships between negative features in sport: a case study of teen players. ACTA ACUST UNITED AC 2013. [DOI: 10.1007/s11135-013-9848-y] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
9
|
Yoo C. Bayesian Method for Causal Discovery of Latent-Variable Models from a Mixture of Experimental and Observational Data. Comput Stat Data Anal 2012; 56:2183-2205. [PMID: 32831439 DOI: 10.1016/j.csda.2012.01.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
This paper describes a Bayesian method for learning causal Bayesian networks through networks that contain latent variables from an arbitrary mixture of observational and experimental data. The paper presents Bayesian methods (including a new method) for learning the causal structure and parameters of the underlying causal process that is generating the data, given that the data contain a mixture of observational and experimental cases. These learning methods were applied using as input various mixtures of experimental and observational data that were generated from the ALARM causal Bayesian network. The paper reports how these structure predictions and parameter estimates compare with the true causal structures and parameters as given by the ALARM network. The paper shows that (1) the new method for learning Bayesian network structure from a mixture of data that this paper introduce, Gibbs Volume method, best estimates the probability of the data given the latent variable model and (2) using large data (>10,000 cases), another model, the implicit latent variable method, is asymptotically correct and efficient.
Collapse
Affiliation(s)
- Changwon Yoo
- Department of Biostatistics, Florida International University, 11200 SW 8 St., AHC2 580, Miami, FL 33199, / Tel: 305-348-4906
| |
Collapse
|
10
|
Allen JD, Xie Y, Chen M, Girard L, Xiao G. Comparing statistical methods for constructing large scale gene networks. PLoS One 2012; 7:e29348. [PMID: 22272232 PMCID: PMC3260142 DOI: 10.1371/journal.pone.0029348] [Citation(s) in RCA: 126] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2011] [Accepted: 11/25/2011] [Indexed: 12/14/2022] Open
Abstract
The gene regulatory network (GRN) reveals the regulatory relationships among genes and can provide a systematic understanding of molecular mechanisms underlying biological processes. The importance of computer simulations in understanding cellular processes is now widely accepted; a variety of algorithms have been developed to study these biological networks. The goal of this study is to provide a comprehensive evaluation and a practical guide to aid in choosing statistical methods for constructing large scale GRNs. Using both simulation studies and a real application in E. coli data, we compare different methods in terms of sensitivity and specificity in identifying the true connections and the hub genes, the ease of use, and computational speed. Our results show that these algorithms performed reasonably well, and each method has its own advantages: (1) GeneNet, WGCNA (Weighted Correlation Network Analysis), and ARACNE (Algorithm for the Reconstruction of Accurate Cellular Networks) performed well in constructing the global network structure; (2) GeneNet and SPACE (Sparse PArtial Correlation Estimation) performed well in identifying a few connections with high specificity.
Collapse
Affiliation(s)
- Jeffrey D Allen
- University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | | | | | | | | |
Collapse
|
11
|
Zhou Q. Multi-Domain Sampling With Applications to Structural Inference of Bayesian Networks. J Am Stat Assoc 2011. [DOI: 10.1198/jasa.2011.ap10346] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
12
|
Abstract
AbstractBayesian networks have become a widely used method in the modelling of uncertain knowledge. Owing to the difficulty domain experts have in specifying them, techniques that learn Bayesian networks from data have become indispensable. Recently, however, there have been many important new developments in this field. This work takes a broad look at the literature on learning Bayesian networks—in particular their structure—from data. Specific topics are not focused on in detail, but it is hoped that all the major fields in the area are covered. This article is not intended to be a tutorial—for this, there are many books on the topic, which will be presented. However, an effort has been made to locate all the relevant publications, so that this paper can be used as a ready reference to find the works on particular sub-topics.
Collapse
|
13
|
|