1
|
Yu JZ, Wu M, Bichler G, Aros-Vera F, Gao J. Reconstructing Sparse Multiplex Networks with Application to Covert Networks. ENTROPY (BASEL, SWITZERLAND) 2023; 25:142. [PMID: 36673283 PMCID: PMC9857694 DOI: 10.3390/e25010142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Revised: 01/07/2023] [Accepted: 01/08/2023] [Indexed: 06/17/2023]
Abstract
Network structure provides critical information for understanding the dynamic behavior of complex systems. However, the complete structure of real-world networks is often unavailable, thus it is crucially important to develop approaches to infer a more complete structure of networks. In this paper, we integrate the configuration model for generating random networks into an Expectation-Maximization-Aggregation (EMA) framework to reconstruct the complete structure of multiplex networks. We validate the proposed EMA framework against the Expectation-Maximization (EM) framework and random model on several real-world multiplex networks, including both covert and overt ones. It is found that the EMA framework generally achieves the best predictive accuracy compared to the EM framework and the random model. As the number of layers increases, the performance improvement of EMA over EM decreases. The inferred multiplex networks can be leveraged to inform the decision-making on monitoring covert networks as well as allocating limited resources for collecting additional information to improve reconstruction accuracy. For law enforcement agencies, the inferred complete network structure can be used to develop more effective strategies for covert network interdiction.
Collapse
Affiliation(s)
- Jin-Zhu Yu
- Department of Civil Engineering, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Mincheng Wu
- State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou 310058, China
| | - Gisela Bichler
- School of Criminology and Criminal Justice, California State University, San Bernardino, CA 92407, USA
| | - Felipe Aros-Vera
- Department of Industrial and Systems Engineering, Ohio University, Athens, OH 45701, USA
| | - Jianxi Gao
- Department of Computer Science, Rensselaer Polytechnic Institute (RPI), Troy, NY 12180, USA
- Network Science and Technology Center, Rensselaer Polytechnic Institute (RPI), Troy, NY 12180, USA
| |
Collapse
|
2
|
Taborsky P, Vermue L, Korzepa M, Morup M. The Bayesian Cut. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2021; 43:4111-4124. [PMID: 32406825 DOI: 10.1109/tpami.2020.2994396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
An important task in the analysis of graphs is separating nodes into densely connected groups with little interaction between each other. Prominent methods here include flow based graph cutting procedures as well as statistical network modeling approaches. However, adequately accounting for this, the so-called community structure, in complex networks remains a major challenge. We present a novel generic Bayesian probabilistic model for graph cutting in which we derive an analytical solution to the marginalization of nuisance parameters under constraints enforcing community structure. As a part of the solution a large scale approximation for integrals involving multiple incomplete gamma functions is derived. Our multiple cluster solution presents a generic tool for Bayesian inference on Poisson weighted graphs across different domains. Applied on three real world social networks as well as three image segmentation problems our approach shows on par or better performance to existing spectral graph cutting and community detection methods, while learning the underlying parameter space. The developed procedure provides a principled statistical framework for graph cutting and the Bayesian Cut source code provided enables easy adoption of the procedure as an alternative to existing graph cutting methods.
Collapse
|
3
|
Wang J, Li K. Community structure exploration considering latent link patterns in complex networks. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.06.032] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
4
|
Albers KJ, Ambrosen KS, Liptrot MG, Dyrby TB, Schmidt MN, Mørup M. Using connectomics for predictive assessment of brain parcellations. Neuroimage 2021; 238:118170. [PMID: 34087365 DOI: 10.1016/j.neuroimage.2021.118170] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 04/19/2021] [Accepted: 05/10/2021] [Indexed: 12/29/2022] Open
Abstract
The organization of the human brain remains elusive, yet is of great importance to the mechanisms of integrative brain function. At the macroscale, its structural and functional interpretation is conventionally assessed at the level of cortical units. However, the definition and validation of such cortical parcellations are problematic due to the absence of a true gold standard. We propose a framework for quantitative evaluation of brain parcellations via statistical prediction of connectomics data. Specifically, we evaluate the extent in which the network representation at the level of cortical units (defined as parcels) accounts for high-resolution brain connectivity. Herein, we assess the pertinence and comparative ranking of ten existing parcellation atlases to account for functional (FC) and structural connectivity (SC) data based on data from the Human Connectome Project (HCP), and compare them to data-driven as well as spatially-homogeneous geometric parcellations including geodesic parcellations with similar size distributions as the atlases. We find substantial discrepancy in parcellation structures that well characterize FC and SC and differences in what well represents an individual's functional connectome when compared against the FC structure that is preserved across individuals. Surprisingly, simple spatial homogenous parcellations generally provide good representations of both FC and SC, but are inferior when their within-parcellation distribution of individual parcel sizes is matched to that of a valid atlas. This suggests that the choice of fine grained and coarse representations used by existing atlases are important. However, we find that resolution is more critical than the exact border location of parcels.
Collapse
Affiliation(s)
- Kristoffer J Albers
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, Richard Petersens Plads, Building 324, DK-2800 Kgs. Lyngby, Denmark
| | - Karen S Ambrosen
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, Richard Petersens Plads, Building 324, DK-2800 Kgs. Lyngby, Denmark; Danish Research Centre for Magnetic Resonance,Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Amager and Hvidovre, Copenhagen, Denmark
| | - Matthew G Liptrot
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, Richard Petersens Plads, Building 324, DK-2800 Kgs. Lyngby, Denmark
| | - Tim B Dyrby
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, Richard Petersens Plads, Building 324, DK-2800 Kgs. Lyngby, Denmark; Danish Research Centre for Magnetic Resonance,Centre for Functional and Diagnostic Imaging and Research, Copenhagen University Hospital Amager and Hvidovre, Copenhagen, Denmark
| | - Mikkel N Schmidt
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, Richard Petersens Plads, Building 324, DK-2800 Kgs. Lyngby, Denmark
| | - Morten Mørup
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, Richard Petersens Plads, Building 324, DK-2800 Kgs. Lyngby, Denmark.
| |
Collapse
|
5
|
Yu Q, Yu Z, Wang Z, Wang X, Wang Y. Overlapping community detection for count-value networks. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES 2019. [DOI: 10.1186/s13673-019-0202-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Abstract
Detecting network overlapping community has become a very hot research topic in the literature. However, overlapping community detection for count-value networks that naturally arise and are pervasive in our modern life, has not yet been thoroughly studied. We propose a generative model for count-value networks with overlapping community structure and use the Indian buffet process to model the community assignment matrix Z; thus, provide a flexible nonparametric Bayesian scheme that can allow the number of communities K to increase as more and more data are encountered instead of to be fixed in advance. Both collapsed and uncollapsed Gibbs sampler for the generative model have been derived. We conduct extensive experiments on simulated network data and real network data, and estimate the inference quality on single variable parameters. We find that the proposed model and inference procedure can bring us the desired experimental results.
Collapse
|
6
|
Tan LSL, Iorio MD. Dynamic degree-corrected blockmodels for social networks: A nonparametric approach. STAT MODEL 2018. [DOI: 10.1177/1471082x18770760] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
A nonparametric approach to the modelling of social networks using degree-corrected stochastic blockmodels is proposed. The model for static network consists of a stochastic blockmodel using a probit regression formulation, and popularity parameters are incorporated to account for degree heterogeneity. We specify a Dirichlet process prior to detect community structure as well as to induce clustering in the popularity parameters. This approach is flexible yet parsimonious as it allows the appropriate number of communities and popularity clusters to be determined automatically by the data. We further discuss and implement extensions of the static model to dynamic networks. In a Bayesian framework, we perform posterior inference through MCMC algorithms. The models are illustrated using several real-world benchmark social networks.
Collapse
Affiliation(s)
- Linda SL Tan
- Department of Statistics and Applied Probability, Faculty of Science, National University of Singapore, Singapore
| | - Maria De Iorio
- Department of Statistical Science, University College London, London, UK
| |
Collapse
|
7
|
Abstract
Statistical network modelling has focused on representing the graph as a discrete structure, namely the adjacency matrix. When assuming exchangeability of this array-which can aid in modelling, computations and theoretical analysis-the Aldous-Hoover theorem informs us that the graph is necessarily either dense or empty. We instead consider representing the graph as an exchangeable random measure and appeal to the Kallenberg representation theorem for this object. We explore using completely random measures (CRMs) to define the exchangeable random measure, and we show how our CRM construction enables us to achieve sparse graphs while maintaining the attractive properties of exchangeability. We relate the sparsity of the graph to the Lévy measure defining the CRM. For a specific choice of CRM, our graphs can be tuned from dense to sparse on the basis of a single parameter. We present a scalable Hamiltonian Monte Carlo algorithm for posterior inference, which we use to analyse network properties in a range of real data sets, including networks with hundreds of thousands of nodes and millions of edges.
Collapse
|
8
|
Hinne M, Meijers A, Bakker R, Tiesinga PHE, Mørup M, van Gerven MAJ. The missing link: Predicting connectomes from noisy and partially observed tract tracing data. PLoS Comput Biol 2017; 13:e1005374. [PMID: 28141820 PMCID: PMC5308841 DOI: 10.1371/journal.pcbi.1005374] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2016] [Revised: 02/14/2017] [Accepted: 01/23/2017] [Indexed: 12/23/2022] Open
Abstract
Our understanding of the wiring map of the brain, known as the connectome, has increased greatly in the last decade, mostly due to technological advancements in neuroimaging techniques and improvements in computational tools to interpret the vast amount of available data. Despite this, with the exception of the C. elegans roundworm, no definitive connectome has been established for any species. In order to obtain this, tracer studies are particularly appealing, as these have proven highly reliable. The downside of tract tracing is that it is costly to perform, and can only be applied ex vivo. In this paper, we suggest that instead of probing all possible connections, hitherto unknown connections may be predicted from the data that is already available. Our approach uses a 'latent space model' that embeds the connectivity in an abstract physical space. Regions that are close in the latent space have a high chance of being connected, while regions far apart are most likely disconnected in the connectome. After learning the latent embedding from the connections that we did observe, the latent space allows us to predict connections that have not been probed previously. We apply the methodology to two connectivity data sets of the macaque, where we demonstrate that the latent space model is successful in predicting unobserved connectivity, outperforming two baselines and an alternative model in nearly all cases. Furthermore, we show how the latent spatial embedding may be used to integrate multimodal observations (i.e. anterograde and retrograde tracers) for the mouse neocortex. Finally, our probabilistic approach enables us to make explicit which connections are easy to predict and which prove difficult, allowing for informed follow-up studies.
Collapse
Affiliation(s)
- Max Hinne
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, the Netherlands
| | - Annet Meijers
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, the Netherlands
| | - Rembrandt Bakker
- Institute of Neuroscience and Medicine, Institute for Advanced Simulation and JARA BRAIN Institute I, Jülich Research Centre, Jülich, Germany
| | - Paul H. E. Tiesinga
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, the Netherlands
| | - Morten Mørup
- Technical University of Denmark, DTU Compute, Kgs. Lyngby, Denmark
| | - Marcel A. J. van Gerven
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, the Netherlands
| |
Collapse
|