1
|
Ding X, Yang H, Zhang J, Yang J, Xiang X. CEO: Identifying Overlapping Communities via Construction, Expansion and Optimization. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.03.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
2
|
Overlapping communities and roles in networks with node attributes: Probabilistic graphical modeling, Bayesian formulation and variational inference. ARTIF INTELL 2022. [DOI: 10.1016/j.artint.2021.103580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
3
|
Ponomarenko A, Pitsoulis L, Shamshetdinov M. Overlapping community detection in networks based on link partitioning and partitioning around medoids. PLoS One 2021; 16:e0255717. [PMID: 34432789 PMCID: PMC8386890 DOI: 10.1371/journal.pone.0255717] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Accepted: 07/22/2021] [Indexed: 11/24/2022] Open
Abstract
In this paper, we present a new method for detecting overlapping communities in networks with a predefined number of clusters called LPAM (Link Partitioning Around Medoids). The overlapping communities in the graph are obtained by detecting the disjoint communities in the associated line graph employing link partitioning and partitioning around medoids which are done through the use of a distance function defined on the set of nodes. We consider both the commute distance and amplified commute distance as distance functions. The performance of the LPAM method is evaluated with computational experiments on real life instances, as well as synthetic network benchmarks. For small and medium-size networks, the exact solution was found, while for large networks we found solutions with a heuristic version of the LPAM method.
Collapse
|
4
|
Dynamic Analyses of Contagion Risk and Module Evolution on the SSE A-Shares Market Based on Minimum Information Entropy. ENTROPY 2021; 23:e23040434. [PMID: 33917234 PMCID: PMC8068080 DOI: 10.3390/e23040434] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 04/04/2021] [Accepted: 04/05/2021] [Indexed: 01/25/2023]
Abstract
The interactive effect is significant in the Chinese stock market, exacerbating the abnormal market volatilities and risk contagion. Based on daily stock returns in the Shanghai Stock Exchange (SSE) A-shares, this paper divides the period between 2005 and 2018 into eight bull and bear market stages to investigate interactive patterns in the Chinese financial market. We employ the Least Absolute Shrinkage and Selection Operator (LASSO) method to construct the stock network, compare the heterogeneity of bull and bear markets, and further use the Map Equation method to analyse the evolution of modules in the SSE A-shares market. Empirical results show that (1) the connected effect is more significant in bear markets than bull markets and gives rise to abnormal volatilities in the stock market; (2) a system module can be found in the network during the first four stages, and the industry aggregation effect leads to module differentiation in the last four stages; (3) some stocks have leading effects on others throughout eight periods, and medium- and small-cap stocks with poor financial conditions are more likely to become risk sources, especially in bear markets. Our conclusions are beneficial to improving investment strategies and making regulatory policies.
Collapse
|
5
|
Sewell DK. Model-Based Edge Clustering. J Comput Graph Stat 2020. [DOI: 10.1080/10618600.2020.1811104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
6
|
Ding X, Zhang J, Yang J. Node-community membership diversifies community structures: An overlapping community detection algorithm based on local expansion and boundary re-checking. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.105935] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
7
|
Emmons S, Mucha PJ. Map equation with metadata: Varying the role of attributes in community detection. Phys Rev E 2019; 100:022301. [PMID: 31574624 DOI: 10.1103/physreve.100.022301] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Indexed: 11/07/2022]
Abstract
Much of the community detection literature studies structural communities, communities defined solely by the connectivity patterns of the network. Often networks contain additional metadata which can inform community detection such as the grade and gender of students in a high school social network. In this work, we introduce a tuning parameter to the content map equation that allows users of the Infomap community detection algorithm to control the metadata's relative importance for identifying network structure. On synthetic networks, we show that our algorithm can overcome the structural detectability limit when the metadata are well aligned with community structure. On real-world networks, we show how our algorithm can achieve greater mutual information with the metadata at a cost in the traditional map equation. Our tuning parameter, like the focusing knob of a microscope, allows users to "zoom in" and "zoom out" on communities with varying levels of focus on the metadata.
Collapse
Affiliation(s)
- Scott Emmons
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Peter J Mucha
- Carolina Center for Interdisciplinary Applied Mathematics, Department of Mathematics, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| |
Collapse
|
8
|
Dong S, Jain R. Overlapping community detection algorithm based on improved KNN. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-190724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Affiliation(s)
- Shi Dong
- School of Computer Science and Technology Zhoukou Normal University, Zhoukou, China
- School of Computer Science and Technology Huazhong Universtiy of Science and Technology, Wuhan, China
| | - Raj Jain
- School of Computer Science and Engineering Washington University in St Louis, Saint Louis, USA
| |
Collapse
|
9
|
A Central Edge Selection Based Overlapping Community Detection Algorithm for the Detection of Overlapping Structures in Protein⁻Protein Interaction Networks. Molecules 2018; 23:molecules23102633. [PMID: 30322177 PMCID: PMC6222769 DOI: 10.3390/molecules23102633] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2018] [Revised: 10/08/2018] [Accepted: 10/09/2018] [Indexed: 02/06/2023] Open
Abstract
Overlapping structures of protein⁻protein interaction networks are very prevalent in different biological processes, which reflect the sharing mechanism to common functional components. The overlapping community detection (OCD) algorithm based on central node selection (CNS) is a traditional and acceptable algorithm for OCD in networks. The main content of CNS is the central node selection and the clustering procedure. However, the original CNS does not consider the influence among the nodes and the importance of the division of the edges in networks. In this paper, an OCD algorithm based on a central edge selection (CES) algorithm for detection of overlapping communities of protein⁻protein interaction (PPI) networks is proposed. Different from the traditional CNS algorithms for OCD, the proposed algorithm uses community magnetic interference (CMI) to obtain more reasonable central edges in the process of CES, and employs a new distance between the non-central edge and the set of the central edges to divide the non-central edge into the correct cluster during the clustering procedure. In addition, the proposed CES improves the strategy of overlapping nodes pruning (ONP) to make the division more precisely. The experimental results on three benchmark networks and three biological PPI networks of Mus. musculus, Escherichia coli, and Cerevisiae show that the CES algorithm performs well.
Collapse
|
10
|
Micro-blog user community discovery using generalized SimRank edge weighting method. PLoS One 2018; 13:e0196447. [PMID: 29734358 PMCID: PMC5937763 DOI: 10.1371/journal.pone.0196447] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Accepted: 04/09/2018] [Indexed: 11/19/2022] Open
Abstract
Community discovery is one of the most popular issues in analyzing and understanding a network. Previous research suggests that the discovery can be enhanced by assigning weights to the edges of the network. This paper proposes a novel edge weighting method, which balances both local and global weighting based on the idea of shared neighbor ranging between users and the interpersonal significance of the social network community. We assume that users belonging to the same community have similar relationship network structures. By controlling the measure of "neighborhood", this method can adequately adapt to real-world networks. Therefore, the famous similarity calculation method-SimRank-can be regarded as a special case of our method. According to the practical significance of social networks, we propose a new evaluation method that uses the communication rate to measure its divided demerit to better express users' interaction relations than the ordinary modularity Q. Furthermore, the fast Newman algorithm is extended to weighted networks. In addition, we use four real networks in the largest Chinese micro-blog website Sina. The results of experiments demonstrate that the proposed method easily meets the balancing requirements and is more robust to different kinds of networks. The experimental results also indicate that the proposed algorithm outperforms several conventional weighting methods.
Collapse
|
11
|
Inverse Resolution Limit of Partition Density and Detecting Overlapping Communities by Link-Surprise. Sci Rep 2017; 7:12399. [PMID: 28963540 PMCID: PMC5622083 DOI: 10.1038/s41598-017-12432-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2017] [Accepted: 09/04/2017] [Indexed: 11/16/2022] Open
Abstract
Finding overlapping communities of complex networks remains a challenge in network science. To address this challenge, one of the widely used approaches is finding the communities of links by optimizing the objective function, partition density. In this study, we show that partition density suffers from inverse resolution limit; it has a strong preference to triangles. This resolution limit makes partition density an improper objective function for global optimization. The conditions where partition density prefers triangles to larger link community structures are analytically derived and confirmed with global optimization calculations using synthetic and real-world networks. To overcome this limitation of partition density, we suggest an alternative measure, Link Surprise, to find link communities, which is suitable for global optimization. Benchmark studies demonstrate that global optimization of Link Surprise yields meaningful and more accurate link community structures than partition density optimization.
Collapse
|
12
|
GLEAM: a graph clustering framework based on potential game optimization for large-scale social networks. Knowl Inf Syst 2017. [DOI: 10.1007/s10115-017-1105-6] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
13
|
|
14
|
Discovering communities in complex networks by edge label propagation. Sci Rep 2016; 6:22470. [PMID: 26926830 PMCID: PMC4772381 DOI: 10.1038/srep22470] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Accepted: 02/16/2016] [Indexed: 12/31/2022] Open
Abstract
The discovery of the community structure of real-world networks is still an open problem. Many methods have been proposed to shed light on this problem, and most of these have focused on discovering node community. However, link community is also a powerful framework for discovering overlapping communities. Here we present a novel edge label propagation algorithm (ELPA), which combines the natural advantage of link communities with the efficiency of the label propagation algorithm (LPA). ELPA can discover both link communities and node communities. We evaluated ELPA on both synthetic and real-world networks, and compared it with five state-of-the-art methods. The results demonstrate that ELPA performs competitively with other algorithms.
Collapse
|
15
|
de Reus MA, Saenger VM, Kahn RS, van den Heuvel MP. An edge-centric perspective on the human connectome: link communities in the brain. Philos Trans R Soc Lond B Biol Sci 2015; 369:rstb.2013.0527. [PMID: 25180305 DOI: 10.1098/rstb.2013.0527] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Brain function depends on efficient processing and integration of information within a complex network of neural interactions, known as the connectome. An important aspect of connectome architecture is the existence of community structure, providing an anatomical basis for the occurrence of functional specialization. Typically, communities are defined as groups of densely connected network nodes, representing clusters of brain regions. Looking at the connectome from a different perspective, instead focusing on the interconnecting links or edges, we find that the white matter pathways between brain regions also exhibit community structure. Eleven link communities were identified: five spanning through the midline fissure, three through the left hemisphere and three through the right hemisphere. We show that these link communities are consistently identifiable and investigate the network characteristics of their underlying white matter pathways. Furthermore, examination of the relationship between link communities and brain regions revealed that the majority of brain regions participate in multiple link communities. In particular, the highly connected and central hub regions showed a rich level of community participation, supporting the notion that these hubs play a pivotal role as confluence zones in which neural information from different domains merges.
Collapse
Affiliation(s)
- Marcel A de Reus
- Brain Center Rudolf Magnus, Department of Psychiatry, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands
| | - Victor M Saenger
- Brain Center Rudolf Magnus, Department of Psychiatry, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands
| | - René S Kahn
- Brain Center Rudolf Magnus, Department of Psychiatry, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands
| | - Martijn P van den Heuvel
- Brain Center Rudolf Magnus, Department of Psychiatry, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands
| |
Collapse
|
16
|
He D, Jin D, Chen Z, Zhang W. Identification of hybrid node and link communities in complex networks. Sci Rep 2015; 5:8638. [PMID: 25728010 PMCID: PMC4345336 DOI: 10.1038/srep08638] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 01/27/2015] [Indexed: 12/03/2022] Open
Abstract
Identifying communities in complex networks is an effective means for analyzing complex systems, with applications in diverse areas such as social science, engineering, biology and medicine. Finding communities of nodes and finding communities of links are two popular schemes for network analysis. These schemes, however, have inherent drawbacks and are inadequate to capture complex organizational structures in real networks. We introduce a new scheme and an effective approach for identifying complex mixture structures of node and link communities, called hybrid node-link communities. A central piece of our approach is a probabilistic model that accommodates node, link and hybrid node-link communities. Our extensive experiments on various real-world networks, including a large protein-protein interaction network and a large network of semantically associated words, illustrated that the scheme for hybrid communities is superior in revealing network characteristics. Moreover, the new approach outperformed the existing methods for finding node or link communities separately.
Collapse
Affiliation(s)
- Dongxiao He
- School of Computer Science and Technology, Tianjin University, Tianjin. 300072, P. R. China
| | - Di Jin
- School of Computer Science and Technology, Tianjin University, Tianjin. 300072, P. R. China
| | - Zheng Chen
- Department of Computer Science and Engineering, Washington University, St. Louis. MO 63130, USA
- Institute for Systems Biology, Jianghan University, Wuhan. Hubei 430056, P. R. China
| | - Weixiong Zhang
- Department of Computer Science and Engineering, Washington University, St. Louis. MO 63130, USA
- Institute for Systems Biology, Jianghan University, Wuhan. Hubei 430056, P. R. China
- Department of Genetics, Washington University, St. Louis. MO 63130, USA
| |
Collapse
|
17
|
Jin D, Gabrys B, Dang J. Combined node and link partitions method for finding overlapping communities in complex networks. Sci Rep 2015; 5:8600. [PMID: 25715829 PMCID: PMC4341207 DOI: 10.1038/srep08600] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2014] [Accepted: 01/28/2015] [Indexed: 11/09/2022] Open
Abstract
Community detection in complex networks is a fundamental data analysis task in various domains, and how to effectively find overlapping communities in real applications is still a challenge. In this work, we propose a new unified model and method for finding the best overlapping communities on the basis of the associated node and link partitions derived from the same framework. Specifically, we first describe a unified model that accommodates node and link communities (partitions) together, and then present a nonnegative matrix factorization method to learn the parameters of the model. Thereafter, we infer the overlapping communities based on the derived node and link communities, i.e., determine each overlapped community between the corresponding node and link community with a greedy optimization of a local community function conductance. Finally, we introduce a model selection method based on consensus clustering to determine the number of communities. We have evaluated our method on both synthetic and real-world networks with ground-truths, and compared it with seven state-of-the-art methods. The experimental results demonstrate the superior performance of our method over the competing ones in detecting overlapping communities for all analysed data sets. Improved performance is particularly pronounced in cases of more complicated networked community structures.
Collapse
Affiliation(s)
- Di Jin
- School of Computer Science and Technology, Tianjin University, Tianjin 300073, P. R. China
| | - Bogdan Gabrys
- Data Science Institute, Faculty of Science and Technology, Bournemouth University, Poole, Dorset BH12 5BB, UK
| | - Jianwu Dang
- 1] School of Computer Science and Technology, Tianjin University, Tianjin 300073, P. R. China [2] School of Information Science, Japan Advanced Institute of Science and Technology, Japan
| |
Collapse
|
18
|
Mobile recommendation based on link community detection. ScientificWorldJournal 2014; 2014:259156. [PMID: 25243204 PMCID: PMC4163396 DOI: 10.1155/2014/259156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2014] [Revised: 07/20/2014] [Accepted: 07/22/2014] [Indexed: 11/24/2022] Open
Abstract
Since traditional mobile recommendation systems have difficulty in acquiring complete and accurate user information in mobile networks, the accuracy of recommendation is not high. In order to solve this problem, this paper proposes a novel mobile recommendation algorithm based on link community detection (MRLD). MRLD executes link label diffusion algorithm and maximal extended modularity (EQ) of greedy search to obtain the link community structure, and overlapping nodes belonging analysis (ONBA) is adopted to adjust the overlapping nodes in order to get the more accurate community structure. MRLD is tested on both synthetic and real-world networks, and the experimental results show that our approach is valid and feasible.
Collapse
|
19
|
He D, Jin D, Baquero C, Liu D. Link community detection using generative model and nonnegative matrix factorization. PLoS One 2014; 9:e86899. [PMID: 24489803 PMCID: PMC3904957 DOI: 10.1371/journal.pone.0086899] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2013] [Accepted: 12/16/2013] [Indexed: 11/18/2022] Open
Abstract
Discovery of communities in complex networks is a fundamental data analysis problem with applications in various domains. While most of the existing approaches have focused on discovering communities of nodes, recent studies have shown the advantages and uses of link community discovery in networks. Generative models provide a promising class of techniques for the identification of modular structures in networks, but most generative models mainly focus on the detection of node communities rather than link communities. In this work, we propose a generative model, which is based on the importance of each node when forming links in each community, to describe the structure of link communities. We proceed to fit the model parameters by taking it as an optimization problem, and solve it using nonnegative matrix factorization. Thereafter, in order to automatically determine the number of communities, we extend the above method by introducing a strategy of iterative bipartition. This extended method not only finds the number of communities all by itself, but also obtains high efficiency, and thus it is more suitable to deal with large and unexplored real networks. We test this approach on both synthetic benchmarks and real-world networks including an application on a large biological network, and compare it with two highly related methods. Results demonstrate the superior performance of our approach over competing methods for the detection of link communities.
Collapse
Affiliation(s)
- Dongxiao He
- College of Computer Science and Technology, Jilin University, Changchun, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
| | - Di Jin
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
- School of Computer Science and Technology, Tianjin University, Tianjin, China
- School of Design, Engineering, and Computing, Bournemouth University, Poole, Dorset, United Kingdom
- * E-mail: (DJ); (DL)
| | - Carlos Baquero
- HASLab, INESC TEC and University of Minho, Braga, Portugal
| | - Dayou Liu
- College of Computer Science and Technology, Jilin University, Changchun, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
- * E-mail: (DJ); (DL)
| |
Collapse
|
20
|
Probabilistic analysis of communities and inner roles in networks: Bayesian generative models and approximate inference. SOCIAL NETWORK ANALYSIS AND MINING 2013. [DOI: 10.1007/s13278-013-0130-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
21
|
Havemann F, Gläser J, Heinz M, Struck A. Identifying overlapping and hierarchical thematic structures in networks of scholarly papers: a comparison of three approaches. PLoS One 2012; 7:e33255. [PMID: 22479376 PMCID: PMC3314014 DOI: 10.1371/journal.pone.0033255] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2011] [Accepted: 02/12/2012] [Indexed: 11/19/2022] Open
Abstract
The aim of this paper is to introduce and assess three algorithms for the identification of overlapping thematic structures in networks of papers. We implemented three recently proposed approaches to the identification of overlapping and hierarchical substructures in graphs and applied the corresponding algorithms to a network of 492 information-science papers coupled via their cited sources. The thematic substructures obtained and overlaps produced by the three hierarchical cluster algorithms were compared to a content-based categorisation, which we based on the interpretation of titles, abstracts, and keywords. We defined sets of papers dealing with three topics located on different levels of aggregation: h-index, webometrics, and bibliometrics. We identified these topics with branches in the dendrograms produced by the three cluster algorithms and compared the overlapping topics they detected with one another and with the three predefined paper sets. We discuss the advantages and drawbacks of applying the three approaches to paper networks in research fields.
Collapse
Affiliation(s)
- Frank Havemann
- Institut für Bibliotheks- und Informationswissenschaft, Humboldt-Universität zu Berlin, Berlin, Germany.
| | | | | | | |
Collapse
|