1
|
Contisciani M, Hobbhahn M, Power EA, Hennig P, De Bacco C. Flexible inference in heterogeneous and attributed multilayer networks. PNAS NEXUS 2025; 4:pgaf005. [PMID: 39850077 PMCID: PMC11756377 DOI: 10.1093/pnasnexus/pgaf005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Accepted: 12/20/2024] [Indexed: 01/25/2025]
Abstract
Networked datasets can be enriched by different types of information about individual nodes or edges. However, most existing methods for analyzing such datasets struggle to handle the complexity of heterogeneous data, often requiring substantial model-specific analysis. In this article, we develop a probabilistic generative model to perform inference in multilayer networks with arbitrary types of information. Our approach employs a Bayesian framework combined with the Laplace matching technique to ease interpretation of inferred parameters. Furthermore, the algorithmic implementation relies on automatic differentiation, avoiding the need for explicit derivations. This makes our model scalable and flexible to adapt to any combination of input data. We demonstrate the effectiveness of our method in detecting overlapping community structures and performing various prediction tasks on heterogeneous multilayer data, where nodes and edges have different types of attributes. Additionally, we showcase its ability to unveil a variety of patterns in a social support network among villagers in rural India by effectively utilizing all input information in a meaningful way.
Collapse
Affiliation(s)
| | - Marius Hobbhahn
- Tübingen AI Center, University of Tübingen, Tübingen 72076, Germany
| | - Eleanor A Power
- Department of Methodology, London School of Economics and Political Sciences, London WC2A 2AE, United Kingdom
- Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
| | - Philipp Hennig
- Tübingen AI Center, University of Tübingen, Tübingen 72076, Germany
| | - Caterina De Bacco
- Max Planck Institute for Intelligent Systems, Tübingen 72076, Germany
| |
Collapse
|
2
|
Mangold L, Roth C. Quantifying metadata relevance to network block structure using description length. COMMUNICATIONS PHYSICS 2024; 7:331. [PMID: 39398491 PMCID: PMC11469959 DOI: 10.1038/s42005-024-01819-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 09/30/2024] [Indexed: 10/15/2024]
Abstract
Network analysis is often enriched by including an examination of node metadata. In the context of understanding the mesoscale of networks it is often assumed that node groups based on metadata and node groups based on connectivity patterns are intrinsically linked. This assumption is increasingly being challenged, whereby metadata might be entirely unrelated to structure or, similarly, multiple sets of metadata might be relevant to the structure of a network in different ways. We propose the metablox tool to quantify the relationship between a network's node metadata and its mesoscale structure, measuring the strength of the relationship and the type of structural arrangement exhibited by the metadata. We show on a number of synthetic and empirical networks that our tool distinguishes relevant metadata and allows for this in a comparative setting, demonstrating that it can be used as part of systematic meta analyses for the comparison of networks from different domains.
Collapse
Affiliation(s)
- Lena Mangold
- Centre d’Analyse et de Mathématique Sociales (CNRS/EHESS), 54 Bd Raspail, 75006 Paris, France
- Computational Social Science Team, Centre Marc Bloch (CNRS/MEAE), Friedrichstr. 191, 10117 Berlin, Germany
| | - Camille Roth
- Centre d’Analyse et de Mathématique Sociales (CNRS/EHESS), 54 Bd Raspail, 75006 Paris, France
- Computational Social Science Team, Centre Marc Bloch (CNRS/MEAE), Friedrichstr. 191, 10117 Berlin, Germany
| |
Collapse
|
3
|
Guha S, Rodriguez-Acosta J, Dinov ID. A Bayesian Multiplex Graph Classifier of Functional Brain Connectivity Across Diverse Tasks of Cognitive Control. Neuroinformatics 2024; 22:457-472. [PMID: 38861097 DOI: 10.1007/s12021-024-09670-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/22/2024] [Indexed: 06/12/2024]
Abstract
This article seeks to investigate the impact of aging on functional connectivity across different cognitive control scenarios, particularly emphasizing the identification of brain regions significantly associated with early aging. By conceptualizing functional connectivity within each cognitive control scenario as a graph, with brain regions as nodes, the statistical challenge revolves around devising a regression framework to predict a binary scalar outcome (aging or normal) using multiple graph predictors. Popular regression methods utilizing multiplex graph predictors often face limitations in effectively harnessing information within and across graph layers, leading to potentially less accurate inference and predictive accuracy, especially for smaller sample sizes. To address this challenge, we propose the Bayesian Multiplex Graph Classifier (BMGC). Accounting for multiplex graph topology, our method models edge coefficients at each graph layer using bilinear interactions between the latent effects associated with the two nodes connected by the edge. This approach also employs a variable selection framework on node-specific latent effects from all graph layers to identify influential nodes linked to observed outcomes. Crucially, the proposed framework is computationally efficient and quantifies the uncertainty in node identification, coefficient estimation, and binary outcome prediction. BMGC outperforms alternative methods in terms of the aforementioned metrics in simulation studies. An additional BMGC validation was completed using an fMRI study of brain networks in adults. The proposed BMGC technique identified that sensory motor brain network obeys certain lateral symmetries, whereas the default mode network exhibits significant brain asymmetries associated with early aging.
Collapse
Affiliation(s)
- Sharmistha Guha
- Department of Statistics, Texas A&M University, 3143 TAMU, College Station, 77843, TX, USA.
| | - Jose Rodriguez-Acosta
- Department of Statistics, Texas A&M University, 3143 TAMU, College Station, 77843, TX, USA
| | - Ivo D Dinov
- Statistics Online Computational Resource, University of Michigan, 426 N. Ingalls St., Ann Arbor, 48109, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, 48109, MI, USA
| |
Collapse
|
4
|
Badalyan A, Ruggeri N, De Bacco C. Structure and inference in hypergraphs with node attributes. Nat Commun 2024; 15:7073. [PMID: 39152121 PMCID: PMC11329712 DOI: 10.1038/s41467-024-51388-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 08/06/2024] [Indexed: 08/19/2024] Open
Abstract
Many networked datasets with units interacting in groups of two or more, encoded with hypergraphs, are accompanied by extra information about nodes, such as the role of an individual in a workplace. Here we show how these node attributes can be used to improve our understanding of the structure resulting from higher-order interactions. We consider the problem of community detection in hypergraphs and develop a principled model that combines higher-order interactions and node attributes to better represent the observed interactions and to detect communities more accurately than using either of these types of information alone. The method learns automatically from the input data the extent to which structure and attributes contribute to explain the data, down weighing or discarding attributes if not informative. Our algorithmic implementation is efficient and scales to large hypergraphs and interactions of large numbers of units. We apply our method to a variety of systems, showing strong performance in hyperedge prediction tasks and in selecting community divisions that correlate with attributes when these are informative, but discarding them otherwise. Our approach illustrates the advantage of using informative node attributes when available with higher-order data.
Collapse
Affiliation(s)
- Anna Badalyan
- Max Planck Institute for Intelligent Systems, Cyber Valley, Tübingen, Germany
| | - Nicolò Ruggeri
- Max Planck Institute for Intelligent Systems, Cyber Valley, Tübingen, Germany.
- Department of Computer Science, ETH, Zürich, Switzerland.
| | - Caterina De Bacco
- Max Planck Institute for Intelligent Systems, Cyber Valley, Tübingen, Germany.
| |
Collapse
|
5
|
Liu Y, Li A, Zeng A, Zhou J, Fan Y, Di Z. Motif-based community detection in heterogeneous multilayer networks. Sci Rep 2024; 14:8769. [PMID: 38627531 PMCID: PMC11021438 DOI: 10.1038/s41598-024-59120-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 04/08/2024] [Indexed: 04/19/2024] Open
Abstract
Multilayer networks composed of intralayer edges and interlayer edges are an important type of complex networks. Considering the heterogeneity of nodes and edges, it is necessary to design more reasonable and diverse community detection methods for multilayer networks. Existing research on community detection in multilayer networks mainly focuses on multiplexing networks (where the nodes are homogeneous and the edges are heterogeneous), but few studies have focused on heterogeneous multilayer networks where both nodes and edges represent different semantics. In this paper, we studied community detection on heterogeneous multilayer networks and proposed a motif-based detection algorithm. First, the communities and motifs of multilayer networks are defined, especially the interlayer motifs. Then, the modularity of multilayer networks based on these motifs is designed, and the community structure of the multilayer network is detected by maximizing the modularity of multilayer networks. Finally, we verify the effectiveness of the detection algorithm on synthetic networks. In the experiments on synthetic networks, comparing with the classical community detection algorithms (without considering interlayer heterogeneity), the motif-based modularity community detection algorithm can obtain better results under different evaluation indexes, and we found that there exists a certain relationship between motifs and communities. In addition, the proposed algorithm is applied in the empirical network, which shows its practicability in the real world. This study provides a solution for the investigation of heterogeneous information in multilayer networks.
Collapse
Affiliation(s)
- Yafang Liu
- School of Systems Science, Beijing Normal University, Beijing, 100875, People's Republic of China
| | - Aiwen Li
- School of Systems Science, Beijing Normal University, Beijing, 100875, People's Republic of China
| | - An Zeng
- School of Systems Science, Beijing Normal University, Beijing, 100875, People's Republic of China
| | - Jianlin Zhou
- School of Systems Science, Beijing Normal University, Beijing, 100875, People's Republic of China.
| | - Ying Fan
- School of Systems Science, Beijing Normal University, Beijing, 100875, People's Republic of China.
| | - Zengru Di
- School of Systems Science, Beijing Normal University, Beijing, 100875, People's Republic of China
| |
Collapse
|
6
|
Ruggeri N, Battiston F, De Bacco C. Framework to generate hypergraphs with community structure. Phys Rev E 2024; 109:034309. [PMID: 38632750 DOI: 10.1103/physreve.109.034309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 01/11/2024] [Indexed: 04/19/2024]
Abstract
In recent years hypergraphs have emerged as a powerful tool to study systems with multibody interactions which cannot be trivially reduced to pairs. While highly structured methods to generate synthetic data have proved fundamental for the standardized evaluation of algorithms and the statistical study of real-world networked data, these are scarcely available in the context of hypergraphs. Here we propose a flexible and efficient framework for the generation of hypergraphs with many nodes and large hyperedges, which allows specifying general community structures and tune different local statistics. We illustrate how to use our model to sample synthetic data with desired features (assortative or disassortative communities, mixed or hard community assignments, etc.), analyze community detection algorithms, and generate hypergraphs structurally similar to real-world data. Overcoming previous limitations on the generation of synthetic hypergraphs, our work constitutes a substantial advancement in the statistical modeling of higher-order systems.
Collapse
Affiliation(s)
- Nicolò Ruggeri
- Max Planck Institute for Intelligent Systems, Cyber Valley, 72076 Tübingen, Germany
- Department of Computer Science, ETH, 8004 Zürich, Switzerland
| | - Federico Battiston
- Department of Network and Data Science, Central European University, 1100 Vienna, Austria
| | - Caterina De Bacco
- Max Planck Institute for Intelligent Systems, Cyber Valley, 72076 Tübingen, Germany
| |
Collapse
|
7
|
Mauri D, Bonelli S, Ozella L. The "Second Life" of laboratory rats ( Rattus norvegicus): Assessment of social behavior of a colony of rats based on social network analysis. J APPL ANIM WELF SCI 2023; 26:693-707. [PMID: 36217647 DOI: 10.1080/10888705.2022.2132826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Rattus norvegicus is a social animal and holds a significant economic value, considering its use in scientific research. Here, we use the Social Network Analysis (SNA) approach to study the social interactions of a group of rats held in a post-laboratory animal care facility. We collected interaction data during four study periods, for a total of 60 days. At the group level, rats presented two communities for each study period, consisting mainly of littermates. At individual level, we found that the rats preferred to interact with individuals of the same strain and laboratory of origin and with littermates. At temporal level, we studied how stable social interactions were over time. During the first study period, we found high social stability, whereas the introduction of new individuals in the subsequent period caused social rearrangements; however, the initial social stability was restored. Our findings have shown that studying the social behavior of rats using SNA is a valuable tool for advancing our understanding of the social system of this species, which has the potential to enhance management and welfare practices.
Collapse
Affiliation(s)
- Diana Mauri
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
- "La Collina dei Conigli" non-profit Organization, Monza, Italy
| | - Simona Bonelli
- Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy
| | - Laura Ozella
- Department of Veterinary Sciences, University of Turin, Turin, Italy
| |
Collapse
|
8
|
Ruggeri N, Contisciani M, Battiston F, De Bacco C. Community detection in large hypergraphs. SCIENCE ADVANCES 2023; 9:eadg9159. [PMID: 37436987 PMCID: PMC10337898 DOI: 10.1126/sciadv.adg9159] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 06/12/2023] [Indexed: 07/14/2023]
Abstract
Hypergraphs, describing networks where interactions take place among any number of units, are a natural tool to model many real-world social and biological systems. Here, we propose a principled framework to model the organization of higher-order data. Our approach recovers community structure with accuracy exceeding that of currently available state-of-the-art algorithms, as tested in synthetic benchmarks with both hard and overlapping ground-truth partitions. Our model is flexible and allows capturing both assortative and disassortative community structures. Moreover, our method scales orders of magnitude faster than competing algorithms, making it suitable for the analysis of very large hypergraphs, containing millions of nodes and interactions among thousands of nodes. Our work constitutes a practical and general tool for hypergraph analysis, broadening our understanding of the organization of real-world higher-order systems.
Collapse
Affiliation(s)
- Nicolò Ruggeri
- Max Planck Institute for Intelligent Systems, Cyber Valley, 72076 Tübingen, Germany
- Department of Computer Science, ETH, 8004 Zürich, Switzerland
| | - Martina Contisciani
- Max Planck Institute for Intelligent Systems, Cyber Valley, 72076 Tübingen, Germany
| | - Federico Battiston
- Department of Network and Data Science, Central European University, 1100 Vienna, Austria
| | - Caterina De Bacco
- Max Planck Institute for Intelligent Systems, Cyber Valley, 72076 Tübingen, Germany
| |
Collapse
|
9
|
Inference of hyperedges and overlapping communities in hypergraphs. Nat Commun 2022; 13:7229. [PMID: 36433942 PMCID: PMC9700742 DOI: 10.1038/s41467-022-34714-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 11/02/2022] [Indexed: 11/27/2022] Open
Abstract
Hypergraphs, encoding structured interactions among any number of system units, have recently proven a successful tool to describe many real-world biological and social networks. Here we propose a framework based on statistical inference to characterize the structural organization of hypergraphs. The method allows to infer missing hyperedges of any size in a principled way, and to jointly detect overlapping communities in presence of higher-order interactions. Furthermore, our model has an efficient numerical implementation, and it runs faster than dyadic algorithms on pairwise records projected from higher-order data. We apply our method to a variety of real-world systems, showing strong performance in hyperedge prediction tasks, detecting communities well aligned with the information carried by interactions, and robustness against addition of noisy hyperedges. Our approach illustrates the fundamental advantages of a hypergraph probabilistic model when modeling relational systems with higher-order interactions.
Collapse
|
10
|
Mihelčić M. Redescription mining on data with background network information. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.110109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
11
|
Gribel D, Gendreau M, Vidal T. Semi-supervised clustering with inaccurate pairwise annotations. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.05.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
12
|
Sia J, Zhang W, Jonckheere E, Cook D, Bogdan P. Inferring functional communities from partially observed biological networks exploiting geometric topology and side information. Sci Rep 2022; 12:10883. [PMID: 35760826 PMCID: PMC9237089 DOI: 10.1038/s41598-022-14631-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 06/09/2022] [Indexed: 11/16/2022] Open
Abstract
Cellular biological networks represent the molecular interactions that shape function of living cells. Uncovering the organization of a biological network requires efficient and accurate algorithms to determine the components, termed communities, underlying specific processes. Detecting functional communities is challenging because reconstructed biological networks are always incomplete due to technical bias and biological complexity, and the evaluation of putative communities is further complicated by a lack of known ground truth. To address these challenges, we developed a geometric-based detection framework based on Ollivier-Ricci curvature to exploit information about network topology to perform community detection from partially observed biological networks. We further improved this approach by integrating knowledge of gene function, termed side information, into the Ollivier-Ricci curvature algorithm to aid in community detection. This approach identified essential conserved and varied biological communities from partially observed Arabidopsis protein interaction datasets better than the previously used methods. We show that Ollivier-Ricci curvature with side information identified an expanded auxin community to include an important protein stability complex, the Cop9 signalosome, consistent with previous reported links to auxin response and root development. The results show that community detection based on Ollivier-Ricci curvature with side information can uncover novel components and novel communities in biological networks, providing novel insight into the organization and function of complex networks.
Collapse
Affiliation(s)
- Jayson Sia
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, 90089, USA
| | - Wei Zhang
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA
| | - Edmond Jonckheere
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, 90089, USA
| | - David Cook
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA.
| | - Paul Bogdan
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, 90089, USA.
| |
Collapse
|
13
|
Iacovissi L, De Bacco C. The interplay between ranking and communities in networks. Sci Rep 2022; 12:8992. [PMID: 35637266 PMCID: PMC9151911 DOI: 10.1038/s41598-022-12730-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Accepted: 05/12/2022] [Indexed: 11/08/2022] Open
Abstract
Community detection and hierarchy extraction are usually thought of as separate inference tasks on networks. Considering only one of the two when studying real-world data can be an oversimplification. In this work, we present a generative model based on an interplay between community and hierarchical structures. It assumes that each node has a preference in the interaction mechanism and nodes with the same preference are more likely to interact, while heterogeneous interactions are still allowed. The sparsity of the network is exploited for implementing a more efficient algorithm. We demonstrate our method on synthetic and real-world data and compare performance with two standard approaches for community detection and ranking extraction. We find that the algorithm accurately retrieves the overall node's preference in different scenarios, and we show that it can distinguish small subsets of nodes that behave differently than the majority. As a consequence, the model can recognize whether a network has an overall preferred interaction mechanism. This is relevant in situations where there is no clear "a priori" information about what structure explains the observed network datasets well. Our model allows practitioners to learn this automatically from the data.
Collapse
Affiliation(s)
- Laura Iacovissi
- Max Planck Institute for Intelligent Systems, Cyber Valley, 72076, Tübingen, Germany
- Bosch Industry on Campus Lab, University of Tübingen, Tübingen, Germany
| | - Caterina De Bacco
- Max Planck Institute for Intelligent Systems, Cyber Valley, 72076, Tübingen, Germany.
| |
Collapse
|
14
|
Redhead D, Power EA. Social hierarchies and social networks in humans. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200440. [PMID: 35000451 PMCID: PMC8743884 DOI: 10.1098/rstb.2020.0440] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 08/04/2021] [Indexed: 01/06/2023] Open
Abstract
Across species, social hierarchies are often governed by dominance relations. In humans, where there are multiple culturally valued axes of distinction, social hierarchies can take a variety of forms and need not rest on dominance relations. Consequently, humans navigate multiple domains of status, i.e. relative standing. Importantly, while these hierarchies may be constructed from dyadic interactions, they are often more fundamentally guided by subjective peer evaluations and group perceptions. Researchers have typically focused on the distinct elements that shape individuals' relative standing, with some emphasizing individual-level attributes and others outlining emergent macro-level structural outcomes. Here, we synthesize work across the social sciences to suggest that the dynamic interplay between individual-level and meso-level properties of the social networks in which individuals are embedded are crucial for understanding the diverse processes of status differentiation across groups. More specifically, we observe that humans not only navigate multiple social hierarchies at any given time but also simultaneously operate within multiple, overlapping social networks. There are important dynamic feedbacks between social hierarchies and the characteristics of social networks, as the types of social relationships, their structural properties, and the relative position of individuals within them both influence and are influenced by status differentiation. This article is part of the theme issue 'The centennial of the pecking order: current state and future prospects for the study of dominance hierarchies'.
Collapse
Affiliation(s)
- Daniel Redhead
- Department of Human Behaviour, Ecology and Culture, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Eleanor A. Power
- Department of Methodology, London School of Economics and Political Science, London WC2A 2AE, UK
| |
Collapse
|
15
|
Correlation and dimension relevance in multidimensional networks: a systematic taxonomy. SOCIAL NETWORK ANALYSIS AND MINING 2021. [DOI: 10.1007/s13278-021-00801-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
16
|
Abstract
AbstractGrouping well-connected nodes that also result in label-homogeneous clusters is a task often known as attribute-aware community discovery. While approaching node-enriched graph clustering methods, rigorous tools need to be developed for evaluating the quality of the resulting partitions. In this work, we present X-Mark, a model that generates synthetic node-attributed graphs with planted communities. Its novelty consists in forming communities and node labels contextually while handling categorical or continuous attributive information. Moreover, we propose a comparison between attribute-aware algorithms, testing them against our benchmark. Accordingly to different classification schema from recent state-of-the-art surveys, our results suggest that X-Mark can shed light on the differences between several families of algorithms.
Collapse
|