1
|
Rajeh S, Cherifi H. A community-aware centrality framework based on overlapping modularity. SOCIAL NETWORK ANALYSIS AND MINING 2023. [DOI: 10.1007/s13278-023-01040-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
|
2
|
Rau N, Lücke J, Hartmann AK. Phase transition for parameter learning of hidden Markov models. Phys Rev E 2021; 104:044105. [PMID: 34781434 DOI: 10.1103/physreve.104.044105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 09/15/2021] [Indexed: 11/07/2022]
Abstract
We study a phase transition in parameter learning of hidden Markov models (HMMs). We do this by generating sequences of observed symbols from given discrete HMMs with uniformly distributed transition probabilities and a noise level encoded in the output probabilities. We apply the Baum-Welch (BW) algorithm, an expectation-maximization algorithm from the field of machine learning. By using the BW algorithm we then try to estimate the parameters of each investigated realization of an HMM. We study HMMs with n=4,8, and 16 states. By changing the amount of accessible learning data and the noise level, we observe a phase-transition-like change in the performance of the learning algorithm. For bigger HMMs and more learning data, the learning behavior improves tremendously below a certain threshold in the noise strength. For a noise level above the threshold, learning is not possible. Furthermore, we use an overlap parameter applied to the results of a maximum a posteriori (Viterbi) algorithm to investigate the accuracy of the hidden state estimation around the phase transition.
Collapse
Affiliation(s)
- Nikita Rau
- Institut für Physik, Universität Oldenburg, D-26111 Oldenburg, Germany
| | - Jörg Lücke
- Department of Medical Physics and Acoustics, Universität Oldenburg, D-26111 Oldenburg, Germany
| | | |
Collapse
|
3
|
Lovas JR, Yuste R. Ensemble synchronization in the reassembly of Hydra's nervous system. Curr Biol 2021; 31:3784-3796.e3. [PMID: 34297913 DOI: 10.1016/j.cub.2021.06.047] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 05/14/2021] [Accepted: 06/16/2021] [Indexed: 11/25/2022]
Abstract
Although much is known about how the structure of the nervous system develops, it is still unclear how its functional modularity arises. A dream experiment would be to observe the entire development of a nervous system, correlating the emergence of functional units with their associated behaviors. This is possible in the cnidarian Hydra vulgaris, which, after its complete dissociation into individual cells, can reassemble itself back together into a normal animal. We used calcium imaging to monitor the complete neuronal activity of dissociated Hydra as they reaggregated over several days. Initially uncoordinated neuronal activity became synchronized into coactive neuronal ensembles. These local modules then synchronized with others, building larger functional ensembles that eventually extended throughout the entire reaggregate, generating neuronal rhythms similar to those of intact animals. Global synchronization was not due to neurite outgrowth but to strengthening of functional connections between ensembles. We conclude that Hydra's nervous system achieves its functional reassembly through the hierarchical modularity of neuronal ensembles.
Collapse
Affiliation(s)
- Jonathan R Lovas
- Neurotechnology Center, Department Biological Sciences, Columbia University, New York, NY 10027, USA; Marine Biological Laboratory, Woods Hole, MA 02354, USA.
| | - Rafael Yuste
- Neurotechnology Center, Department Biological Sciences, Columbia University, New York, NY 10027, USA; Marine Biological Laboratory, Woods Hole, MA 02354, USA
| |
Collapse
|
4
|
Qiao HH, Deng ZH, Li HJ, Hu J, Song Q, Gao L. Research on historical phase division of terrorism: An analysis method by time series complex network. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.07.125] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
5
|
Foley TT, Kidder KM, Shell MS, Noid WG. Exploring the landscape of model representations. Proc Natl Acad Sci U S A 2020; 117:24061-24068. [PMID: 32929015 PMCID: PMC7533877 DOI: 10.1073/pnas.2000098117] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The success of any physical model critically depends upon adopting an appropriate representation for the phenomenon of interest. Unfortunately, it remains generally challenging to identify the essential degrees of freedom or, equivalently, the proper order parameters for describing complex phenomena. Here we develop a statistical physics framework for exploring and quantitatively characterizing the space of order parameters for representing physical systems. Specifically, we examine the space of low-resolution representations that correspond to particle-based coarse-grained (CG) models for a simple microscopic model of protein fluctuations. We employ Monte Carlo (MC) methods to sample this space and determine the density of states for CG representations as a function of their ability to preserve the configurational information, I, and large-scale fluctuations, Q, of the microscopic model. These two metrics are uncorrelated in high-resolution representations but become anticorrelated at lower resolutions. Moreover, our MC simulations suggest an emergent length scale for coarse-graining proteins, as well as a qualitative distinction between good and bad representations of proteins. Finally, we relate our work to recent approaches for clustering graphs and detecting communities in networks.
Collapse
Affiliation(s)
- Thomas T Foley
- Department of Chemistry, The Pennsylvania State University, University Park, PA 16802
- Department of Physics, The Pennsylvania State University, University Park, PA 16802
| | - Katherine M Kidder
- Department of Chemistry, The Pennsylvania State University, University Park, PA 16802
| | - M Scott Shell
- Department of Chemical Engineering, University of California, Santa Barbara, CA 93106
| | - W G Noid
- Department of Chemistry, The Pennsylvania State University, University Park, PA 16802;
| |
Collapse
|
6
|
Paret J, Jack RL, Coslovich D. Assessing the structural heterogeneity of supercooled liquids through community inference. J Chem Phys 2020; 152:144502. [DOI: 10.1063/5.0004732] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Joris Paret
- Laboratoire Charles Coulomb (L2C), Université de Montpellier, CNRS, Montpellier, France
| | - Robert L. Jack
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Wilberforce Road, Cambridge CB3 0WA, United Kingdom
| | - Daniele Coslovich
- Laboratoire Charles Coulomb (L2C), Université de Montpellier, CNRS, Montpellier, France
| |
Collapse
|
7
|
Hu K, Xiang J, Yu YX, Tang L, Xiang Q, Li JM, Tang YH, Chen YJ, Zhang Y. Significance-based multi-scale method for network community detection and its application in disease-gene prediction. PLoS One 2020; 15:e0227244. [PMID: 32196490 PMCID: PMC7083276 DOI: 10.1371/journal.pone.0227244] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 12/16/2019] [Indexed: 11/18/2022] Open
Abstract
Community detection in complex networks is an important issue in network science. Several statistical measures have been proposed and widely applied to detecting the communities in various complex networks. However, due to the lack of flexibility resolution, some of them have to encounter the resolution limit and thus are not compatible with multi-scale structures of complex networks. In this paper, we investigated a statistical measure of interest for community detection, Significance [Sci. Rep. 3 (2013) 2930], and analyzed its critical behaviors based on the theoretical derivation of critical number of communities and the phase diagram in community-partition transition. It was revealed that Significance exhibits far higher resolution than the traditional Modularity when the intra- and inter-link densities of communities are obviously different. Following the critical analysis, we developed a multi-resolution version of Significance for identifying communities in the multi-scale networks. Experimental tests in several typical networks have been performed and confirmed that the generalized Significance can be competent for the multi-scale communities detection. Moreover, it can effectively relax the first- and second-type resolution limits. Finally, we displayed an important potential application of the multi-scale Significance in computational biology: disease-gene identification, showing that extracting information from the perspective of multi-scale module mining is helpful for disease gene prediction.
Collapse
Affiliation(s)
- Ke Hu
- School of Physics and Optoelectronic Engineering, Xiangtan University, Xiangtan, Hunan, People’s Republic of China
| | - Ju Xiang
- School of Computer Science and Engineering, Central South University, Changsha, Hunan, People’s Republic of China
- School of Basic Medical Sciences, Changsha Medical University, Changsha, Hunan, People’s Republic of China
| | - Yun-Xia Yu
- School of Physics and Optoelectronic Engineering, Xiangtan University, Xiangtan, Hunan, People’s Republic of China
| | - Liang Tang
- School of Basic Medical Sciences, Changsha Medical University, Changsha, Hunan, People’s Republic of China
| | - Qin Xiang
- School of Basic Medical Sciences, Changsha Medical University, Changsha, Hunan, People’s Republic of China
| | - Jian-Ming Li
- School of Basic Medical Sciences, Changsha Medical University, Changsha, Hunan, People’s Republic of China
- Department of Neurology, Xiang-ya Hospital, Central South University, Changsha, Hunan, People’s Republic of China
- Department of Rehabilitation, Xiangya Boai Rehabilitation Hospital, Changsha, Hunan, People’s Republic of China
- Department of Neurology, Nanhua Affiliated Hospital, University of South China, Hengyang, Hunan, People’s Republic of China
| | - Yong-Hong Tang
- Department of Neurology, Nanhua Affiliated Hospital, University of South China, Hengyang, Hunan, People’s Republic of China
| | - Yong-Jun Chen
- Department of Neurology, Nanhua Affiliated Hospital, University of South China, Hengyang, Hunan, People’s Republic of China
| | - Yan Zhang
- School of Computer Science and Engineering, Central South University, Changsha, Hunan, People’s Republic of China
- School of Basic Medical Sciences, Changsha Medical University, Changsha, Hunan, People’s Republic of China
| |
Collapse
|
8
|
Nia AM, Chen T, Barnette BL, Khanipov K, Ullrich RL, Bhavnani SK, Emmett MR. Efficient identification of multiple pathways: RNA-Seq analysis of livers from 56Fe ion irradiated mice. BMC Bioinformatics 2020; 21:118. [PMID: 32192433 PMCID: PMC7082965 DOI: 10.1186/s12859-020-3446-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 03/06/2020] [Indexed: 12/25/2022] Open
Abstract
Background mRNA interaction with other mRNAs and other signaling molecules determine different biological pathways and functions. Gene co-expression network analysis methods have been widely used to identify correlation patterns between genes in various biological contexts (e.g., cancer, mouse genetics, yeast genetics). A challenge remains to identify an optimal partition of the networks where the individual modules (clusters) are neither too small to make any general inferences, nor too large to be biologically interpretable. Clustering thresholds for identification of modules are not systematically determined and depend on user-settable parameters requiring optimization. The absence of systematic threshold determination may result in suboptimal module identification and a large number of unassigned features. Results In this study, we propose a new pipeline to perform gene co-expression network analysis. The proposed pipeline employs WGCNA, a software widely used to perform different aspects of gene co-expression network analysis, and Modularity Maximization algorithm, to analyze novel RNA-Seq data to understand the effects of low-dose 56Fe ion irradiation on the formation of hepatocellular carcinoma in mice. The network results, along with experimental validation, show that using WGCNA combined with Modularity Maximization, provides a more biologically interpretable network in our dataset, than that obtainable using WGCNA alone. The proposed pipeline showed better performance than the existing clustering algorithm in WGCNA, and identified a module that was biologically validated by a mitochondrial complex I assay. Conclusions We present a pipeline that can reduce the problem of parameter selection that occurs with the existing algorithm in WGCNA, for applicable RNA-Seq datasets. This may assist in the future discovery of novel mRNA interactions, and elucidation of their potential downstream molecular effects.
Collapse
Affiliation(s)
- Anna M Nia
- Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Texas, USA
| | - Tianlong Chen
- Institute for Translational Sciences, The University of Texas Medical Branch, Galveston, Texas, USA
| | - Brooke L Barnette
- Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Texas, USA
| | - Kamil Khanipov
- Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas, USA
| | | | - Suresh K Bhavnani
- Institute for Translational Sciences, The University of Texas Medical Branch, Galveston, Texas, USA
| | - Mark R Emmett
- Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Texas, USA. .,Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas, USA. .,Radiation Oncology, The University of Texas Medical Branch, Galveston, Texas, USA.
| |
Collapse
|
9
|
Kaalia R, Rajapakse JC. Refining modules to determine functionally significant clusters in molecular networks. BMC Genomics 2019; 20:901. [PMID: 31874644 PMCID: PMC6929267 DOI: 10.1186/s12864-019-6294-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Accepted: 11/15/2019] [Indexed: 12/12/2022] Open
Abstract
Background Module detection algorithms relying on modularity maximization suffer from an inherent resolution limit that hinders detection of small topological modules, especially in molecular networks where most biological processes are believed to form small and compact communities. We propose a novel modular refinement approach that helps finding functionally significant modules of molecular networks. Results The module refinement algorithm improves the quality of topological modules in protein-protein interaction networks by finding biologically functionally significant modules. The algorithm is based on the fact that functional modules in biology do not necessarily represent those corresponding to maximum modularity. Larger modules corresponding to maximal modularity are incrementally re-modularized again under specific constraints so that smaller yet topologically and biologically valid modules are recovered. We show improvement in quality and functional coverage of modules using experiments on synthetic and real protein-protein interaction networks. We also compare our results with six existing methods available for clustering biological networks. Conclusion The proposed algorithm finds smaller but functionally relevant modules that are undetected by classical quality maximization approaches for modular detection. The refinement procedure helps to detect more functionally enriched modules in protein-protein interaction networks, which are also more coherent with functionally characterised gene sets.
Collapse
Affiliation(s)
- Rama Kaalia
- School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore.
| | - Jagath C Rajapakse
- School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
10
|
Organization in complex brain networks: Energy distributions and phase shift. J Theor Biol 2019; 476:30-35. [PMID: 31129077 DOI: 10.1016/j.jtbi.2019.05.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Revised: 05/22/2019] [Accepted: 05/23/2019] [Indexed: 11/24/2022]
Abstract
The Hamiltonian function of a network, derived from the intrinsic distributions of nodes and edges, magnified by resolution parameter has information on the distribution of energy in the network. In brain networks, the Hamiltonian function follows hierarchical features reflecting a power-law behavior which can be a signature of self-organization. Further, the transition of three distinct phases driven by resolution parameter is observed which could correspond to various important brain states. This resolution parameter could thus reflect a key parameter that controls and balances the energy distribution in the brain network.
Collapse
|
11
|
Kishore R, Gogineni AK, Nussinov Z, Sahu KK. A nature inspired modularity function for unsupervised learning involving spatially embedded networks. Sci Rep 2019; 9:2631. [PMID: 30796343 PMCID: PMC6385190 DOI: 10.1038/s41598-019-39180-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 01/18/2019] [Indexed: 11/09/2022] Open
Abstract
The quality of network clustering is often measured in terms of a commonly used metric known as "modularity". Modularity compares the clusters found in a network to those present in a random graph (a "null model"). Unfortunately, modularity is somewhat ill suited for studying spatially embedded networks, since a random graph contains no basic geometrical notions. Regardless of their distance, the null model assigns a nonzero probability for an edge to appear between any pair of nodes. Here, we propose a variant of modularity that does not rely on the use of a null model. To demonstrate the essentials of our method, we analyze networks generated from granular ensemble. We show that our method performs better than the most commonly used Newman-Girvan (NG) modularity in detecting the best (physically transparent) partitions in those systems. Our measure further properly detects hierarchical structures, whenever these are present.
Collapse
Affiliation(s)
- Raj Kishore
- School of Minerals, Metallurgical and Materials Engineering, Indian Institute of Technology Bhubaneswar-, Bhubaneswar, 752050, India
| | - Ajay K Gogineni
- School of Electrical Sciences, Indian Institute of Technology Bhubaneswar, Bhubaneswar, 752050, India
| | - Zohar Nussinov
- Department of Physics, Washington University in Saint Louis, Saint Louis, MO, 63130-4899, USA
| | - Kisor K Sahu
- School of Minerals, Metallurgical and Materials Engineering, Indian Institute of Technology Bhubaneswar-, Bhubaneswar, 752050, India.
| |
Collapse
|
12
|
Critical analysis of (Quasi-)Surprise for community detection in complex networks. Sci Rep 2018; 8:14459. [PMID: 30262896 PMCID: PMC6160439 DOI: 10.1038/s41598-018-32582-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2017] [Accepted: 05/08/2018] [Indexed: 02/07/2023] Open
Abstract
Module or community structures widely exist in complex networks, and optimizing statistical measures is one of the most popular approaches for revealing and identifying such structures in real-world applications. In this paper, we focus on critical behaviors of (Quasi-)Surprise, a type of statistical measure of interest for community structure, accompanied by a series of comparisons with other measures. Specially, the effect of various network parameters on the measures is thoroughly investigated. The critical number of dense subgraphs in partition transition is derived, and a kind of phase diagrams is provided to display and compare the phase transitions of the measures. The effect of “potential well” for (Quasi-)Surprise is revealed, which may be difficult to get across by general greedy (agglomerative or divisive) algorithms. Finally, an extension of Quasi-Surprise is introduced for the study of multi-scale structures. Experimental results are of help for understanding the critical behaviors of (Quasi-)Surprise, and may provide useful insight for the design of effective tools for community detection.
Collapse
|
13
|
Faqeeh A, Osat S, Radicchi F. Characterizing the Analogy Between Hyperbolic Embedding and Community Structure of Complex Networks. PHYSICAL REVIEW LETTERS 2018; 121:098301. [PMID: 30230906 DOI: 10.1103/physrevlett.121.098301] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2018] [Revised: 05/21/2018] [Indexed: 06/08/2023]
Abstract
We show that the community structure of a network can be used as a coarse version of its embedding in a hidden space with hyperbolic geometry. The finding emerges from a systematic analysis of several real-world and synthetic networks. We take advantage of the analogy for reinterpreting results originally obtained through network hyperbolic embedding in terms of community structure only. First, we show that the robustness of a multiplex network can be controlled by tuning the correlation between the community structures across different layers. Second, we deploy an efficient greedy protocol for network navigability that makes use of routing tables based on community structure.
Collapse
Affiliation(s)
- Ali Faqeeh
- MACSI, Department of Mathematics and Statistics, University of Limerick, Limerick V94 T9PX, Ireland
- Center for Complex Networks and Systems Research, School of Informatics, Computing, and Engineering, Indiana University, Bloomington, Indiana 47408, USA
| | - Saeed Osat
- Quantum Complexity Science Initiative, Skolkovo Institute of Science and Technology, Skoltech Building 3, Moscow 143026, Russia
| | - Filippo Radicchi
- Center for Complex Networks and Systems Research, School of Informatics, Computing, and Engineering, Indiana University, Bloomington, Indiana 47408, USA
| |
Collapse
|
14
|
Bordier C, Nicolini C, Forcellini G, Bifone A. Disrupted modular organization of primary sensory brain areas in schizophrenia. Neuroimage Clin 2018; 18:682-693. [PMID: 29876260 PMCID: PMC5987872 DOI: 10.1016/j.nicl.2018.02.035] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2017] [Revised: 02/21/2018] [Accepted: 02/28/2018] [Indexed: 12/29/2022]
Abstract
Abnormal brain resting-state functional connectivity has been consistently observed in patients affected by schizophrenia (SCZ) using functional MRI and other neuroimaging techniques. Graph theoretical methods provide a framework to investigate these defective functional interactions and their effects on the organization of brain connectivity networks. A few studies have shown altered distribution of connectivity within and between functional modules in SCZ patients, an indication of imbalanced functional segregation ad integration. However, no major alterations of modular organization have been reported in patients, and unambiguous identification of the neural substrates affected remains elusive. Recently, it has been demonstrated that current modularity analysis methods suffer from a fundamental and severe resolution limit, as they fail to detect features that are smaller than a scale determined by the size of the entire connectivity network. This resolution limit is likely to have hampered the ability to resolve differences between patients and controls in previous studies. Here, we apply Surprise, a novel resolution limit-free approach, to study the modular organization of resting state functional connectivity networks in a large cohort of SCZ patients and in matched healthy controls. Leveraging these important methodological advances we find new evidence of substantial fragmentation and reorganization involving primary sensory, auditory and visual areas in SCZ patients. Conversely, frontal and prefrontal areas, typically associated with higher cognitive functions, appear to be largely unaffected, with changes selectively involving language and speech processing areas. Our findings support the hypothesis that cognitive dysfunction in SCZ may involve deficits occurring already at early stages of sensory processing.
Collapse
Affiliation(s)
- Cécile Bordier
- Center for Neuroscience and Cognitive Systems, Istituto Italiano di Tecnologia, Rovereto, TN, Italy.
| | - Carlo Nicolini
- Center for Neuroscience and Cognitive Systems, Istituto Italiano di Tecnologia, Rovereto, TN, Italy; University of Verona, Verona, Italy
| | - Giulia Forcellini
- Center for Neuroscience and Cognitive Systems, Istituto Italiano di Tecnologia, Rovereto, TN, Italy; Center for Mind/Brain Sciences, CIMeC, University of Trento, Rovereto, Italy
| | - Angelo Bifone
- Center for Neuroscience and Cognitive Systems, Istituto Italiano di Tecnologia, Rovereto, TN, Italy.
| |
Collapse
|
15
|
Community Detection Based on Differential Evolution Using Social Spider Optimization. Symmetry (Basel) 2017. [DOI: 10.3390/sym9090183] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
16
|
Li HJ, Xiang J. Explore of the fuzzy community structure integrating the directed line graph and likelihood optimization. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2017. [DOI: 10.3233/jifs-169214] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Hui-Jia Li
- School of Management Science and Engineering, Central University of Finance and Economics, Beijing, China
| | - Ju Xiang
- Neuroscience Research Center, Changsha Medical University, Changsha, Hunan, China
| |
Collapse
|
17
|
A semi-synchronous label propagation algorithm with constraints for community detection in complex networks. Sci Rep 2017; 7:45836. [PMID: 28374836 PMCID: PMC5379178 DOI: 10.1038/srep45836] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2016] [Accepted: 03/03/2017] [Indexed: 01/12/2023] Open
Abstract
Community structure is an important feature of a complex network, where detection of the community structure can shed some light on the properties of such a complex network. Amongst the proposed community detection methods, the label propagation algorithm (LPA) emerges as an effective detection method due to its time efficiency. Despite this advantage in computational time, the performance of LPA is affected by randomness in the algorithm. A modified LPA, called CLPA-GNR, was proposed recently and it succeeded in handling the randomness issues in the LPA. However, it did not remove the tendency for trivial detection in networks with a weak community structure. In this paper, an improved CLPA-GNR is therefore proposed. In the new algorithm, the unassigned and assigned nodes are updated synchronously while the assigned nodes are updated asynchronously. A similarity score, based on the Sørensen-Dice index, is implemented to detect the initial communities and for breaking ties during the propagation process. Constraints are utilised during the label propagation and community merging processes. The performance of the proposed algorithm is evaluated on various benchmark and real-world networks. We find that it is able to avoid trivial detection while showing substantial improvement in the quality of detection.
Collapse
|
18
|
Li H. Detecting fuzzy network communities based on semi-supervised label propagation. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2016. [DOI: 10.3233/jifs-169171] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
19
|
Nicolini C, Bordier C, Bifone A. Community detection in weighted brain connectivity networks beyond the resolution limit. Neuroimage 2016; 146:28-39. [PMID: 27865921 PMCID: PMC5312822 DOI: 10.1016/j.neuroimage.2016.11.026] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2016] [Revised: 11/08/2016] [Accepted: 11/12/2016] [Indexed: 12/02/2022] Open
Abstract
Graph theory provides a powerful framework to investigate brain functional connectivity networks and their modular organization. However, most graph-based methods suffer from a fundamental resolution limit that may have affected previous studies and prevented detection of modules, or "communities", that are smaller than a specific scale. Surprise, a resolution-limit-free function rooted in discrete probability theory, has been recently introduced and applied to brain networks, revealing a wide size-distribution of functional modules (Nicolini and Bifone, 2016), in contrast with many previous reports. However, the use of Surprise is limited to binary networks, while brain networks are intrinsically weighted, reflecting a continuous distribution of connectivity strengths between different brain regions. Here, we propose Asymptotical Surprise, a continuous version of Surprise, for the study of weighted brain connectivity networks, and validate this approach in synthetic networks endowed with a ground-truth modular structure. We compare Asymptotical Surprise with leading community detection methods currently in use and show its superior sensitivity in the detection of small modules even in the presence of noise and intersubject variability such as those observed in fMRI data. We apply our novel approach to functional connectivity networks from resting state fMRI experiments, and demonstrate a heterogeneous modular organization, with a wide distribution of clusters spanning multiple scales. Finally, we discuss the implications of these findings for the identification of connector hubs, the brain regions responsible for the integration of the different network elements, showing that the improved resolution afforded by Asymptotical Surprise leads to a different classification compared to current methods. Methods to study modularity of brain connectivity networks have a resolution limit. Asymptotical Surprise, a nearly resolution-limit-free method for weighted graphs, is proposed. Improved sensitivity and specificity are demonstrated in model networks. Resting state functional connectivity networks consist of heterogeneous modules. Classification of hubs in function connectivity networks should be revised.
Collapse
Affiliation(s)
- Carlo Nicolini
- Center for Neuroscience and Cognitive Systems, Istituto Italiano di Tecnologia, Rovereto (TN), Italy; University of Verona, Verona, Italy.
| | - Cécile Bordier
- Center for Neuroscience and Cognitive Systems, Istituto Italiano di Tecnologia, Rovereto (TN), Italy
| | - Angelo Bifone
- Center for Neuroscience and Cognitive Systems, Istituto Italiano di Tecnologia, Rovereto (TN), Italy.
| |
Collapse
|
20
|
Analysis of Network Clustering Algorithms and Cluster Quality Metrics at Scale. PLoS One 2016; 11:e0159161. [PMID: 27391786 PMCID: PMC4938516 DOI: 10.1371/journal.pone.0159161] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2016] [Accepted: 06/28/2016] [Indexed: 11/30/2022] Open
Abstract
Overview Notions of community quality underlie the clustering of networks. While studies surrounding network clustering are increasingly common, a precise understanding of the realtionship between different cluster quality metrics is unknown. In this paper, we examine the relationship between stand-alone cluster quality metrics and information recovery metrics through a rigorous analysis of four widely-used network clustering algorithms—Louvain, Infomap, label propagation, and smart local moving. We consider the stand-alone quality metrics of modularity, conductance, and coverage, and we consider the information recovery metrics of adjusted Rand score, normalized mutual information, and a variant of normalized mutual information used in previous work. Our study includes both synthetic graphs and empirical data sets of sizes varying from 1,000 to 1,000,000 nodes. Cluster Quality Metrics We find significant differences among the results of the different cluster quality metrics. For example, clustering algorithms can return a value of 0.4 out of 1 on modularity but score 0 out of 1 on information recovery. We find conductance, though imperfect, to be the stand-alone quality metric that best indicates performance on the information recovery metrics. Additionally, our study shows that the variant of normalized mutual information used in previous work cannot be assumed to differ only slightly from traditional normalized mutual information. Network Clustering Algorithms Smart local moving is the overall best performing algorithm in our study, but discrepancies between cluster evaluation metrics prevent us from declaring it an absolutely superior algorithm. Interestingly, Louvain performed better than Infomap in nearly all the tests in our study, contradicting the results of previous work in which Infomap was superior to Louvain. We find that although label propagation performs poorly when clusters are less clearly defined, it scales efficiently and accurately to large graphs with well-defined clusters.
Collapse
|
21
|
Burgess M, Adar E, Cafarella M. Link-Prediction Enhanced Consensus Clustering for Complex Networks. PLoS One 2016; 11:e0153384. [PMID: 27203750 PMCID: PMC4874693 DOI: 10.1371/journal.pone.0153384] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Accepted: 03/29/2016] [Indexed: 11/19/2022] Open
Abstract
Many real networks that are collected or inferred from data are incomplete due to missing edges. Missing edges can be inherent to the dataset (Facebook friend links will never be complete) or the result of sampling (one may only have access to a portion of the data). The consequence is that downstream analyses that "consume" the network will often yield less accurate results than if the edges were complete. Community detection algorithms, in particular, often suffer when critical intra-community edges are missing. We propose a novel consensus clustering algorithm to enhance community detection on incomplete networks. Our framework utilizes existing community detection algorithms that process networks imputed by our link prediction based sampling algorithm and merges their multiple partitions into a final consensus output. On average our method boosts performance of existing algorithms by 7% on artificial data and 17% on ego networks collected from Facebook.
Collapse
Affiliation(s)
- Matthew Burgess
- Computer Science & Engineering, University of Michigan, Ann Arbor, MI, United States of America
| | - Eytan Adar
- Computer Science & Engineering, University of Michigan, Ann Arbor, MI, United States of America
- School of Information, University of Michigan, Ann Arbor, MI, United States of America
| | - Michael Cafarella
- Computer Science & Engineering, University of Michigan, Ann Arbor, MI, United States of America
| |
Collapse
|
22
|
Chin JH, Ratnavelu K. Detecting Community Structure by Using a Constrained Label Propagation Algorithm. PLoS One 2016; 11:e0155320. [PMID: 27176470 PMCID: PMC4866740 DOI: 10.1371/journal.pone.0155320] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Accepted: 04/27/2016] [Indexed: 11/18/2022] Open
Abstract
Community structure is considered one of the most interesting features in complex networks. Many real-world complex systems exhibit community structure, where individuals with similar properties form a community. The identification of communities in a network is important for understanding the structure of said network, in a specific perspective. Thus, community detection in complex networks gained immense interest over the last decade. A lot of community detection methods were proposed, and one of them is the label propagation algorithm (LPA). The simplicity and time efficiency of the LPA make it a popular community detection method. However, the LPA suffers from instability detection due to randomness that is induced in the algorithm. The focus of this paper is to improve the stability and accuracy of the LPA, while retaining its simplicity. Our proposed algorithm will first detect the main communities in a network by using the number of mutual neighbouring nodes. Subsequently, nodes are added into communities by using a constrained LPA. Those constraints are then gradually relaxed until all nodes are assigned into groups. In order to refine the quality of the detected communities, nodes in communities can be switched to another community or removed from their current communities at various stages of the algorithm. We evaluated our algorithm on three types of benchmark networks, namely the Lancichinetti-Fortunato-Radicchi (LFR), Relaxed Caveman (RC) and Girvan-Newman (GN) benchmarks. We also apply the present algorithm to some real-world networks of various sizes. The current results show some promising potential, of the proposed algorithm, in terms of detecting communities accurately. Furthermore, our constrained LPA has a robustness and stability that are significantly better than the simple LPA as it is able to yield deterministic results.
Collapse
Affiliation(s)
- Jia Hou Chin
- Institute of Mathematical Science, University of Malaya, Kuala Lumpur, Malaysia
| | - Kuru Ratnavelu
- Institute of Mathematical Science, University of Malaya, Kuala Lumpur, Malaysia
- * E-mail:
| |
Collapse
|
23
|
Angulo-Garcia D, Berke JD, Torcini A. Cell Assembly Dynamics of Sparsely-Connected Inhibitory Networks: A Simple Model for the Collective Activity of Striatal Projection Neurons. PLoS Comput Biol 2016; 12:e1004778. [PMID: 26915024 PMCID: PMC4767417 DOI: 10.1371/journal.pcbi.1004778] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2015] [Accepted: 01/27/2016] [Indexed: 11/19/2022] Open
Abstract
Striatal projection neurons form a sparsely-connected inhibitory network, and this arrangement may be essential for the appropriate temporal organization of behavior. Here we show that a simplified, sparse inhibitory network of Leaky-Integrate-and-Fire neurons can reproduce some key features of striatal population activity, as observed in brain slices. In particular we develop a new metric to determine the conditions under which sparse inhibitory networks form anti-correlated cell assemblies with time-varying activity of individual cells. We find that under these conditions the network displays an input-specific sequence of cell assembly switching, that effectively discriminates similar inputs. Our results support the proposal that GABAergic connections between striatal projection neurons allow stimulus-selective, temporally-extended sequential activation of cell assemblies. Furthermore, we help to show how altered intrastriatal GABAergic signaling may produce aberrant network-level information processing in disorders such as Parkinson's and Huntington's diseases.
Collapse
Affiliation(s)
- David Angulo-Garcia
- CNR - Consiglio Nazionale delle Ricerche - Istituto dei Sistemi Complessi, Sesto Fiorentino, Italy
- Aix-Marseille Université, Inserm, INMED UMR 901 and Institut de Neurosciences des Systèmes UMR 1106, Marseille, France
| | - Joshua D. Berke
- Department of Psychology and Biomedical Engineering, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Alessandro Torcini
- CNR - Consiglio Nazionale delle Ricerche - Istituto dei Sistemi Complessi, Sesto Fiorentino, Italy
- Aix-Marseille Université, Inserm, INMED UMR 901 and Institut de Neurosciences des Systèmes UMR 1106, Marseille, France
- Aix-Marseille Université, Université de Toulon, CNRS, CPT, UMR 7332, Marseille, France
- INFN Sez. Firenze, via Sansone, Sesto Fiorentino, Italy
- * E-mail:
| |
Collapse
|
24
|
Nicolini C, Bifone A. Modular structure of brain functional networks: breaking the resolution limit by Surprise. Sci Rep 2016; 6:19250. [PMID: 26763931 PMCID: PMC4725862 DOI: 10.1038/srep19250] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Accepted: 12/02/2015] [Indexed: 11/08/2022] Open
Abstract
The modular organization of brain networks has been widely investigated using graph theoretical approaches. Recently, it has been demonstrated that graph partitioning methods based on the maximization of global fitness functions, like Newman's Modularity, suffer from a resolution limit, as they fail to detect modules that are smaller than a scale determined by the size of the entire network. Here we explore the effects of this limitation on the study of brain connectivity networks. We demonstrate that the resolution limit prevents detection of important details of the brain modular structure, thus hampering the ability to appreciate differences between networks and to assess the topological roles of nodes. We show that Surprise, a recently proposed fitness function based on probability theory, does not suffer from these limitations. Surprise maximization in brain co-activation and functional connectivity resting state networks reveals the presence of a rich structure of heterogeneously distributed modules, and differences in networks' partitions that are undetectable by resolution-limited methods. Moreover, Surprise leads to a more accurate identification of the network's connector hubs, the elements that integrate the brain modules into a cohesive structure.
Collapse
Affiliation(s)
- Carlo Nicolini
- University of Verona, Verona, Italy
- Istituto Italiano di Tecnologia, Center for Neuroscience and Cognitive Systems, Rovereto (TN), Italy
| | - Angelo Bifone
- Istituto Italiano di Tecnologia, Center for Neuroscience and Cognitive Systems, Rovereto (TN), Italy
| |
Collapse
|
25
|
Li HJ. The comparison of significance of fuzzy community partition across optimization methods. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2015. [DOI: 10.3233/ifs-151974] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
26
|
Gu Y, Qian X, Li Q, Wang M, Hong R, Tian Q. Image Annotation by Latent Community Detection and Multikernel Learning. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2015; 24:3450-3463. [PMID: 26068319 DOI: 10.1109/tip.2015.2443501] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Automatic image annotation is an attractive service for users and administrators of online photo sharing websites. In this paper, we propose an image annotation approach that exploits latent semantic community of labels and multikernel learning (LCMKL). First, a concept graph is constructed for labels indicating the relationship between the concepts. Based on the concept graph, semantic communities are explored using an automatic community detection method. For an image to be annotated, a multikernel support vector machine is used to determine the image's latent community from its visual features. Then, a candidate label ranking based approach is determined by intracommunity and intercommunity ranking. Experiments on the NUS-WIDE database and IAPR TC-12 data set demonstrate that LCMKL outperforms some state-of-the-art approaches.
Collapse
|
27
|
Traag VA. Faster unfolding of communities: speeding up the Louvain algorithm. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2015; 92:032801. [PMID: 26465522 DOI: 10.1103/physreve.92.032801] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Indexed: 06/05/2023]
Abstract
Many complex networks exhibit a modular structure of densely connected groups of nodes. Usually, such a modular structure is uncovered by the optimization of some quality function. Although flawed, modularity remains one of the most popular quality functions. The Louvain algorithm was originally developed for optimizing modularity, but has been applied to a variety of methods. As such, speeding up the Louvain algorithm enables the analysis of larger graphs in a shorter time for various methods. We here suggest to consider moving nodes to a random neighbor community, instead of the best neighbor community. Although incredibly simple, it reduces the theoretical runtime complexity from O(m) to O(nlog〈k〉) in networks with a clear community structure. In benchmark networks, it speeds up the algorithm roughly 2-3 times, while in some real networks it even reaches 10 times faster runtimes. This improvement is due to two factors: (1) a random neighbor is likely to be in a "good" community and (2) random neighbors are likely to be hubs, helping the convergence. Finally, the performance gain only slightly diminishes the quality, especially for modularity, thus providing a good quality-performance ratio. However, these gains are less pronounced, or even disappear, for some other measures such as significance or surprise.
Collapse
Affiliation(s)
- V A Traag
- Royal Netherlands Institute of Southeast Asian and Caribbean Studies, Reuvensplaats 2, 2311 BE Leiden, the Netherlands and e-Humanities group, Royal Netherlands Academy of Arts and Sciences, Joan Muyskenweg 25, 1096 CJ Amsterdam, the Netherlands
| |
Collapse
|
28
|
Traag VA, Aldecoa R, Delvenne JC. Detecting communities using asymptotical surprise. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2015; 92:022816. [PMID: 26382463 DOI: 10.1103/physreve.92.022816] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Indexed: 06/05/2023]
Abstract
Nodes in real-world networks are repeatedly observed to form dense clusters, often referred to as communities. Methods to detect these groups of nodes usually maximize an objective function, which implicitly contains the definition of a community. We here analyze a recently proposed measure called surprise, which assesses the quality of the partition of a network into communities. In its current form, the formulation of surprise is rather difficult to analyze. We here therefore develop an accurate asymptotic approximation. This allows for the development of an efficient algorithm for optimizing surprise. Incidentally, this leads to a straightforward extension of surprise to weighted graphs. Additionally, the approximation makes it possible to analyze surprise more closely and compare it to other methods, especially modularity. We show that surprise is (nearly) unaffected by the well-known resolution limit, a particular problem for modularity. However, surprise may tend to overestimate the number of communities, whereas they may be underestimated by modularity. In short, surprise works well in the limit of many small communities, whereas modularity works better in the limit of few large communities. In this sense, surprise is more discriminative than modularity and may find communities where modularity fails to discern any structure.
Collapse
Affiliation(s)
- V A Traag
- Royal Netherlands Institute of Southeast Asian and Caribbean Studies, Leiden, The Netherlands
- e-Humanities Group, Royal Netherlands Academy of Arts and Sciences, Amsterdam, The Netherlands
| | - R Aldecoa
- Department of Physics, Northeastern University, Boston, Massachusetts 02115, USA
| | - J-C Delvenne
- ICTEAM, Université catholique de Louvain, Louvain-la-Neuve, Belgium
- CORE, Université catholique de Louvain, Louvain-la-Neuve, Belgium
| |
Collapse
|
29
|
Kawamoto T, Rosvall M. Estimating the resolution limit of the map equation in community detection. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2015; 91:012809. [PMID: 25679659 DOI: 10.1103/physreve.91.012809] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2014] [Indexed: 06/04/2023]
Abstract
A community detection algorithm is considered to have a resolution limit if the scale of the smallest modules that can be resolved depends on the size of the analyzed subnetwork. The resolution limit is known to prevent some community detection algorithms from accurately identifying the modular structure of a network. In fact, any global objective function for measuring the quality of a two-level assignment of nodes into modules must have some sort of resolution limit or an external resolution parameter. However, it is yet unknown how the resolution limit affects the so-called map equation, which is known to be an efficient objective function for community detection. We derive an analytical estimate and conclude that the resolution limit of the map equation is set by the total number of links between modules instead of the total number of links in the full network as for modularity. This mechanism makes the resolution limit much less restrictive for the map equation than for modularity; in practice, it is orders of magnitudes smaller. Furthermore, we argue that the effect of the resolution limit often results from shoehorning multilevel modular structures into two-level descriptions. As we show, the hierarchical map equation effectively eliminates the resolution limit for networks with nested multilevel modular structures.
Collapse
Affiliation(s)
- Tatsuro Kawamoto
- Department of Computational Intelligence and Systems Science, Tokyo Institute of Technology, 4259-G5-22, Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8502, Japan
| | - Martin Rosvall
- Integrated Science Lab, Department of Physics, Umeå University, SE-901 87 Umeå, Sweden
| |
Collapse
|
30
|
Hüffner F, Komusiewicz C, Liebtrau A, Niedermeier R. Partitioning Biological Networks into Highly Connected Clusters with Maximum Edge Coverage. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:455-467. [PMID: 26356014 DOI: 10.1109/tcbb.2013.177] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
A popular clustering algorithm for biological networks which was proposed by Hartuv and Shamir identifies nonoverlapping highly connected components. We extend the approach taken by this algorithm by introducing the combinatorial optimization problem Highly Connected Deletion, which asks for removing as few edges as possible from a graph such that the resulting graph consists of highly connected components. We show that Highly Connected Deletion is NP-hard and provide a fixed-parameter algorithm and a kernelization. We propose exact and heuristic solution strategies, based on polynomial-time data reduction rules and integer linear programming with column generation. The data reduction typically identifies 75 percent of the edges that are deleted for an optimal solution; the column generation method can then optimally solve protein interaction networks with up to 6,000 vertices and 13,500 edges within five hours. Additionally, we present a new heuristic that finds more clusters than the method by Hartuv and Shamir.
Collapse
|
31
|
Darst RK, Nussinov Z, Fortunato S. Improving the performance of algorithms to find communities in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 89:032809. [PMID: 24730901 DOI: 10.1103/physreve.89.032809] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Indexed: 06/03/2023]
Abstract
Most algorithms to detect communities in networks typically work without any information on the cluster structure to be found, as one has no a priori knowledge of it, in general. Not surprisingly, knowing some features of the unknown partition could help its identification, yielding an improvement of the performance of the method. Here we show that, if the number of clusters was known beforehand, standard methods, like modularity optimization, would considerably gain in accuracy, mitigating the severe resolution bias that undermines the reliability of the results of the original unconstrained version. The number of clusters can be inferred from the spectra of the recently introduced nonbacktracking and flow matrices, even in benchmark graphs with realistic community structure. The limit of such a two-step procedure is the overhead of the computation of the spectra.
Collapse
Affiliation(s)
- Richard K Darst
- Department of Biomedical Engineering and Computational Science, Aalto University School of Science, P.O. Box 12200, FI-00076, Finland
| | - Zohar Nussinov
- Physics Department, Washington University in St. Louis, CB 1105, One Brookings Drive, St. Louis, Missouri 63130-4899, USA
| | - Santo Fortunato
- Department of Biomedical Engineering and Computational Science, Aalto University School of Science, P.O. Box 12200, FI-00076, Finland
| |
Collapse
|
32
|
Aldecoa R, Marín I. Exploring the limits of community detection strategies in complex networks. Sci Rep 2014; 3:2216. [PMID: 23860510 PMCID: PMC3713530 DOI: 10.1038/srep02216] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 06/18/2013] [Indexed: 12/05/2022] Open
Abstract
The characterization of network community structure has profound implications in several scientific areas. Therefore, testing the algorithms developed to establish the optimal division of a network into communities is a fundamental problem in the field. We performed here a highly detailed evaluation of community detection algorithms, which has two main novelties: 1) using complex closed benchmarks, which provide precise ways to assess whether the solutions generated by the algorithms are optimal; and, 2) A novel type of analysis, based on hierarchically clustering the solutions suggested by multiple community detection algorithms, which allows to easily visualize how different are those solutions. Surprise, a global parameter that evaluates the quality of a partition, confirms the power of these analyses. We show that none of the community detection algorithms tested provide consistently optimal results in all networks and that Surprise maximization, obtained by combining multiple algorithms, obtains quasi-optimal performances in these difficult benchmarks.
Collapse
Affiliation(s)
- Rodrigo Aldecoa
- Instituto de Biomedicina de Valencia, Consejo Superior de Investigaciones Científicas, Calle Jaime Roig 11, 46010, Valencia, Spain
| | | |
Collapse
|
33
|
Thomas CJ, Lambrechts J, Wolanski E, Traag VA, Blondel VD, Deleersnijder E, Hanert E. Numerical modelling and graph theory tools to study ecological connectivity in the Great Barrier Reef. Ecol Modell 2014. [DOI: 10.1016/j.ecolmodel.2013.10.002] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
34
|
Aldecoa R, Marin I. SurpriseMe: an integrated tool for network community structure characterization using Surprise maximization. Bioinformatics 2013; 30:1041-2. [DOI: 10.1093/bioinformatics/btt741] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
|
35
|
Hu D, Sarder P, Ronhovde P, Orthaus S, Achilefu S, Nussinov Z. Automatic segmentation of fluorescence lifetime microscopy images of cells using multiresolution community detection--a first study. J Microsc 2013; 253:54-64. [PMID: 24251410 DOI: 10.1111/jmi.12097] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2012] [Accepted: 10/09/2013] [Indexed: 11/30/2022]
Abstract
Inspired by a multiresolution community detection based network segmentation method, we suggest an automatic method for segmenting fluorescence lifetime (FLT) imaging microscopy (FLIM) images of cells in a first pilot investigation on two selected images. The image processing problem is framed as identifying segments with respective average FLTs against the background in FLIM images. The proposed method segments a FLIM image for a given resolution of the network defined using image pixels as the nodes and similarity between the FLTs of the pixels as the edges. In the resulting segmentation, low network resolution leads to larger segments, and high network resolution leads to smaller segments. Furthermore, using the proposed method, the mean-square error in estimating the FLT segments in a FLIM image was found to consistently decrease with increasing resolution of the corresponding network. The multiresolution community detection method appeared to perform better than a popular spectral clustering-based method in performing FLIM image segmentation. At high resolution, the spectral segmentation method introduced noisy segments in its output, and it was unable to achieve a consistent decrease in mean-square error with increasing resolution.
Collapse
Affiliation(s)
- D Hu
- Department of Physics, Washington University, St. Louis, Missouri, USA
| | | | | | | | | | | |
Collapse
|
36
|
Sun PG, Gao L, Yang Y. Maximizing modularity intensity for community partition and evolution. Inf Sci (N Y) 2013. [DOI: 10.1016/j.ins.2013.02.032] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
37
|
Surprise maximization reveals the community structure of complex networks. Sci Rep 2013; 3:1060. [PMID: 23320141 PMCID: PMC3544010 DOI: 10.1038/srep01060] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2012] [Accepted: 12/28/2012] [Indexed: 11/09/2022] Open
Abstract
How to determine the community structure of complex networks is an open question. It is critical to establish the best strategies for community detection in networks of unknown structure. Here, using standard synthetic benchmarks, we show that none of the algorithms hitherto developed for community structure characterization perform optimally. Significantly, evaluating the results according to their modularity, the most popular measure of the quality of a partition, systematically provides mistaken solutions. However, a novel quality function, called Surprise, can be used to elucidate which is the optimal division into communities. Consequently, we show that the best strategy to find the community structure of all the networks examined involves choosing among the solutions provided by multiple algorithms the one with the highest Surprise value. We conclude that Surprise maximization precisely reveals the community structure of complex networks.
Collapse
|
38
|
Hu D, Ronhovde P, Nussinov Z. Stability-to-instability transition in the structure of large-scale networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 86:066106. [PMID: 23368003 DOI: 10.1103/physreve.86.066106] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2012] [Indexed: 06/01/2023]
Abstract
We examine phase transitions between the "easy," "hard," and "unsolvable" phases when attempting to identify structure in large complex networks ("community detection") in the presence of disorder induced by network "noise" (spurious links that obscure structure), heat bath temperature T, and system size N. The partition of a graph into q optimally disjoint subgraphs or "communities" inherently requires Potts-type variables. In earlier work [Philos. Mag. 92, 406 (2012)], when examining power law and other networks (and general associated Potts models), we illustrated that transitions in the computational complexity of the community detection problem typically correspond to spin-glass-type transitions (and transitions to chaotic dynamics in mechanical analogs) at both high and low temperatures and/or noise. The computationally "hard" phase exhibits spin-glass type behavior including memory effects. The region over which the hard phase extends in the noise and temperature phase diagram decreases as N increases while holding the average number of nodes per community fixed. This suggests that in the thermodynamic limit a direct sharp transition may occur between the easy and unsolvable phases. When present, transitions at low temperature or low noise correspond to entropy driven (or "order by disorder") annealing effects, wherein stability may initially increase as temperature or noise is increased before becoming unsolvable at sufficiently high temperature or noise. Additional transitions between contending viable solutions (such as those at different natural scales) are also possible. Identifying community structure via a dynamical approach where "chaotic-type" transitions were found earlier. The correspondence between the spin-glass-type complexity transitions and transitions into chaos in dynamical analogs might extend to other hard computational problems. In this work, we examine large networks (with a power law distribution in cluster size) that have a large number of communities (q≫1). We infer that large systems at a constant ratio of q to the number of nodes N asymptotically tend towards insolvability in the limit of large N for any positive T. The asymptotic behavior of temperatures below which structure identification might be possible, T_{×}=O[1/lnq], decreases slowly, so for practical system sizes, there remains an accessible, and generally easy, global solvable phase at low temperature. We further employ multivariate Tutte polynomials to show that increasing q emulates increasing T for a general Potts model, leading to a similar stability region at low T. Given the relation between Tutte and Jones polynomials, our results further suggest a link between the above complexity transitions and transitions associated with random knots.
Collapse
Affiliation(s)
- Dandan Hu
- Department of Physics, Washington University in St. Louis, Campus Box 1105, 1 Brookings Drive, St. Louis, Missouri 63130, USA
| | | | | |
Collapse
|
39
|
Chen C, Fushing H. Multiscale community geometry in a network and its application. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 86:041120. [PMID: 23214542 DOI: 10.1103/physreve.86.041120] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2012] [Revised: 08/17/2012] [Indexed: 06/01/2023]
Abstract
We introduce a between-ness-based distance metric to extract local and global information for each pair of nodes (or "vertices" used interchangeably) located in a binary network. Since this distance then superimposes a weighted graph upon such a binary network, a multiscale clustering mechanism, called data cloud geometry, is applicable to discover hierarchical communities within a binary network. This approach resolves many shortcomings of community finding approaches, which are primarily based on modularity optimization. Using several contrived and real binary networks, our community hierarchies compare favorably with results derived from a recently proposed approach based on time-scale differences of random walks and has already demonstrated significant improvements over module-based approaches, especially on the multiscale and the determination of the number of communities.
Collapse
Affiliation(s)
- Chen Chen
- University of California, Davis, California 95616, USA
| | | |
Collapse
|
40
|
Bettinelli A, Hansen P, Liberti L. Algorithm for parametric community detection in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 86:016107. [PMID: 23005491 DOI: 10.1103/physreve.86.016107] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2011] [Revised: 04/17/2012] [Indexed: 06/01/2023]
Abstract
Modularity maximization is extensively used to detect communities in complex networks. It has been shown, however, that this method suffers from a resolution limit: Small communities may be undetectable in the presence of larger ones even if they are very dense. To alleviate this defect, various modifications of the modularity function have been proposed as well as multiresolution methods. In this paper we systematically study a simple model (proposed by Pons and Latapy [Theor. Comput. Sci. 412, 892 (2011)] and similar to the parametric model of Reichardt and Bornholdt [Phys. Rev. E 74, 016110 (2006)]) with a single parameter α that balances the fraction of within community edges and the expected fraction of edges according to the configuration model. An exact algorithm is proposed to find optimal solutions for all values of α as well as the corresponding successive intervals of α values for which they are optimal. This algorithm relies upon a routine for exact modularity maximization and is limited to moderate size instances. An agglomerative hierarchical heuristic is therefore proposed to address parametric modularity detection in large networks. At each iteration the smallest value of α for which it is worthwhile to merge two communities of the current partition is found. Then merging is performed and the data are updated accordingly. An implementation is proposed with the same time and space complexity as the well-known Clauset-Newman-Moore (CNM) heuristic [Phys. Rev. E 70, 066111 (2004)]. Experimental results on artificial and real world problems show that (i) communities are detected by both exact and heuristic methods for all values of the parameter α; (ii) the dendrogram summarizing the results of the heuristic method provides a useful tool for substantive analysis, as illustrated particularly on a Les Misérables data set; (iii) the difference between the parametric modularity values given by the exact method and those given by the heuristic is moderate; (iv) the heuristic version of the proposed parametric method, viewed as a modularity maximization tool, gives better results than the CNM heuristic for large instances.
Collapse
Affiliation(s)
- Andrea Bettinelli
- Dipartimento di Tecnologie dell'Infomazione, Università degli Studi di Milano, via Bramante 65, Crema, Italy.
| | | | | |
Collapse
|
41
|
Ronhovde P, Chakrabarty S, Hu D, Sahu M, Sahu KK, Kelton KF, Mauro NA, Nussinov Z. Detection of hidden structures for arbitrary scales in complex physical systems. Sci Rep 2012; 2:329. [PMID: 22461970 PMCID: PMC3314987 DOI: 10.1038/srep00329] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2011] [Accepted: 02/29/2012] [Indexed: 11/14/2022] Open
Abstract
Recent decades have experienced the discovery of numerous complex materials. At the root of the complexity underlying many of these materials lies a large number of contending atomic- and largerscale configurations. In order to obtain a more detailed understanding of such systems, we need tools that enable the detection of pertinent structures on all spatial and temporal scales. Towards this end, we suggest a new method that applies to both static and dynamic systems which invokes ideas from network analysis and information theory. Our approach efficiently identifies basic unit cells, topological defects, and candidate natural structures. The method is particularly useful where a clear definition of order is lacking, and the identified features may constitute a natural point of departure for further analysis.
Collapse
|
42
|
Abstract
Communication is essential. It is vital between cells in multi-cellular organisms, and within cells. A signaling molecule binds to a receptor protein, and initiates a cascade of dynamic events. Signaling is a multistep pathway, which allows signal amplification: if some of the molecules in a pathway transmit the signal to multiple molecules, the result can be a large number of activated molecules across the cell and multiple reactions. That is how a small number of extracellular signaling molecules can produce a major cellular response. The pathway can relay signals from the extracellular space to the nucleus. How do signals travel efficiently over long-distances across the cell? Here we argue that evolution has utilized three properties: a modular functional organization of the cellular network; sequences in some key regions of proteins, such as linkers or loops, which were pre-encoded by evolution to facilitate signaling among domains; and compact interactions between proteins which is achieved via conformational disorder.
Collapse
Affiliation(s)
- Ruth Nussinov
- Basic Research Program, SAIC-Frederick, Inc., Center for Cancer Research Nanobiology Program, NCI-Frederick, Frederick, MD 21702, USA.
| |
Collapse
|
43
|
Hu D, Ronhovde P, Nussinov Z. Replica inference approach to unsupervised multiscale image segmentation. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 85:016101. [PMID: 22400619 DOI: 10.1103/physreve.85.016101] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2011] [Indexed: 05/31/2023]
Abstract
We apply a replica-inference-based Potts model method to unsupervised image segmentation on multiple scales. This approach was inspired by the statistical mechanics problem of "community detection" and its phase diagram. Specifically, the problem is cast as identifying tightly bound clusters ("communities" or "solutes") against a background or "solvent." Within our multiresolution approach, we compute information-theory-based correlations among multiple solutions ("replicas") of the same graph over a range of resolutions. Significant multiresolution structures are identified by replica correlations manifest by information theory overlaps. We further employ such information theory measures (such as normalized mutual information and variation of information), thermodynamic quantities such as the system entropy and energy, and dynamic measures monitoring the convergence time to viable solutions as metrics for transitions between various solvable and unsolvable phases. Within the solvable phase, transitions between contending solutions (such as those corresponding to segmentations on different scales) may also appear. With the aid of these correlations as well as thermodynamic measures, the phase diagram of the corresponding Potts model is analyzed at both zero and finite temperatures. Optimal parameters corresponding to a sensible unsupervised segmentations appear within the "easy phase" of the Potts model. Our algorithm is fast and shown to be at least as accurate as the best algorithms to date and to be especially suited to the detection of camouflaged images.
Collapse
Affiliation(s)
- Dandan Hu
- Department of Physics, Washington University, Campus Box 1105, 1 Brookings Drive, St. Louis, Missouri 63130, USA
| | | | | |
Collapse
|
44
|
Lancichinetti A, Fortunato S. Limits of modularity maximization in community detection. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 84:066122. [PMID: 22304170 DOI: 10.1103/physreve.84.066122] [Citation(s) in RCA: 113] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2011] [Revised: 10/17/2011] [Indexed: 05/21/2023]
Abstract
Modularity maximization is the most popular technique for the detection of community structure in graphs. The resolution limit of the method is supposedly solvable with the introduction of modified versions of the measure, with tunable resolution parameters. We show that multiresolution modularity suffers from two opposite coexisting problems: the tendency to merge small subgraphs, which dominates when the resolution is low; the tendency to split large subgraphs, which dominates when the resolution is high. In benchmark networks with heterogeneous distributions of cluster sizes, the simultaneous elimination of both biases is not possible and multiresolution modularity is not capable to recover the planted community structure, not even when it is pronounced and easily detectable by other methods, for any value of the resolution parameter. This holds for other multiresolution techniques and it is likely to be a general problem of methods based on global optimization.
Collapse
Affiliation(s)
- Andrea Lancichinetti
- Complex Networks and Systems Lagrange Lab, Institute for Scientific Interchange, I-10133 Torino, Italy
| | | |
Collapse
|
45
|
Ronhovde P, Chakrabarty S, Hu D, Sahu M, Sahu KK, Kelton KF, Mauro NA, Nussinov Z. Detecting hidden spatial and spatio-temporal structures in glasses and complex physical systems by multiresolution network clustering. THE EUROPEAN PHYSICAL JOURNAL. E, SOFT MATTER 2011; 34:105. [PMID: 21959545 DOI: 10.1140/epje/i2011-11105-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2011] [Revised: 08/03/2011] [Accepted: 08/05/2011] [Indexed: 05/31/2023]
Abstract
We elaborate on a general method that we recently introduced for characterizing the "natural" structures in complex physical systems via multi-scale network analysis. The method is based on "community detection" wherein interacting particles are partitioned into an "ideal gas" of optimally decoupled groups of particles. Specifically, we construct a set of network representations ("replicas") of the physical system based on interatomic potentials and apply a multiscale clustering ("multiresolution community detection") analysis using information-based correlations among the replicas. Replicas may i) be different representations of an identical static system, ii) embody dynamics by considering replicas to be time separated snapshots of the system (with a tunable time separation), or iii) encode general correlations when different replicas correspond to different representations of the entire history of the system as it evolves in space-time. Inputs for our method are the inter-particle potentials or experimentally measured two (or higher order) particle correlations. We apply our method to computer simulations of a binary Kob-Andersen Lennard-Jones system in a mixture ratio of A(80)B(20) , a ternary model system with components "A", "B", and "C" in ratios of A(88)B(7)C(5) (as in Al(88)Y(7)Fe(5) , and to atomic coordinates in a Zr(80)Pt(20) system as gleaned by reverse Monte Carlo analysis of experimentally determined structure factors. We identify the dominant structures (disjoint or overlapping) and general length scales by analyzing extrema of the information theory measures. We speculate on possible links between i) physical transitions or crossovers and ii) changes in structures found by this method as well as phase transitions associated with the computational complexity of the community detection problem. We also briefly consider continuum approaches and discuss rigidity and the shear penetration depth in amorphous systems; this latter length scale increases as the system becomes progressively rigid.
Collapse
Affiliation(s)
- P Ronhovde
- Department of Physics, Washington University in St. Louis, Campus Box 1105, 1 Brookings Drive, St. Louis, MO 63130, USA
| | | | | | | | | | | | | | | |
Collapse
|
46
|
Traag VA, Van Dooren P, Nesterov Y. Narrow scope for resolution-limit-free community detection. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 84:016114. [PMID: 21867264 DOI: 10.1103/physreve.84.016114] [Citation(s) in RCA: 101] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Revised: 06/27/2011] [Indexed: 05/12/2023]
Abstract
Detecting communities in large networks has drawn much attention over the years. While modularity remains one of the more popular methods of community detection, the so-called resolution limit remains a significant drawback. To overcome this issue, it was recently suggested that instead of comparing the network to a random null model, as is done in modularity, it should be compared to a constant factor. However, it is unclear what is meant exactly by "resolution-limit-free," that is, not suffering from the resolution limit. Furthermore, the question remains what other methods could be classified as resolution-limit-free. In this paper we suggest a rigorous definition and derive some basic properties of resolution-limit-free methods. More importantly, we are able to prove exactly which class of community detection methods are resolution-limit-free. Furthermore, we analyze which methods are not resolution-limit-free, suggesting there is only a limited scope for resolution-limit-free community detection methods. Finally, we provide such a natural formulation, and show it performs superbly.
Collapse
Affiliation(s)
- V A Traag
- ICTEAM, Université Catholique de Louvain, Louvain-la Neuve, Belgium.
| | | | | |
Collapse
|
47
|
Zhan W, Zhang Z, Guan J, Zhou S. Evolutionary method for finding communities in bipartite networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 83:066120. [PMID: 21797454 DOI: 10.1103/physreve.83.066120] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2010] [Revised: 03/03/2011] [Indexed: 05/31/2023]
Abstract
An important step in unveiling the relation between network structure and dynamics defined on networks is to detect communities, and numerous methods have been developed separately to identify community structure in different classes of networks, such as unipartite networks, bipartite networks, and directed networks. Here, we show that the finding of communities in such networks can be unified in a general framework-detection of community structure in bipartite networks. Moreover, we propose an evolutionary method for efficiently identifying communities in bipartite networks. To this end, we show that both unipartite and directed networks can be represented as bipartite networks, and their modularity is completely consistent with that for bipartite networks, the detection of modular structure on which can be reformulated as modularity maximization. To optimize the bipartite modularity, we develop a modified adaptive genetic algorithm (MAGA), which is shown to be especially efficient for community structure detection. The high efficiency of the MAGA is based on the following three improvements we make. First, we introduce a different measure for the informativeness of a locus instead of the standard deviation, which can exactly determine which loci mutate. This measure is the bias between the distribution of a locus over the current population and the uniform distribution of the locus, i.e., the Kullback-Leibler divergence between them. Second, we develop a reassignment technique for differentiating the informative state a locus has attained from the random state in the initial phase. Third, we present a modified mutation rule which by incorporating related operations can guarantee the convergence of the MAGA to the global optimum and can speed up the convergence process. Experimental results show that the MAGA outperforms existing methods in terms of modularity for both bipartite and unipartite networks.
Collapse
Affiliation(s)
- Weihua Zhan
- Department of Computer Science and Technology, Tongji University, 4800 Cao'an Road, Shanghai 201804, China.
| | | | | | | |
Collapse
|
48
|
Subelj L, Bajec M. Unfolding communities in large complex networks: combining defensive and offensive label propagation for core extraction. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 83:036103. [PMID: 21517554 DOI: 10.1103/physreve.83.036103] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Revised: 11/06/2010] [Indexed: 05/30/2023]
Abstract
Label propagation has proven to be a fast method for detecting communities in large complex networks. Recent developments have also improved the accuracy of the approach; however, a general algorithm is still an open issue. We present an advanced label propagation algorithm that combines two unique strategies of community formation, namely, defensive preservation and offensive expansion of communities. The two strategies are combined in a hierarchical manner to recursively extract the core of the network and to identify whisker communities. The algorithm was evaluated on two classes of benchmark networks with planted partition and on 23 real-world networks ranging from networks with tens of nodes to networks with several tens of millions of edges. It is shown to be comparable to the current state-of-the-art community detection algorithms and superior to all previous label propagation algorithms, with comparable time complexity. In particular, analysis on real-world networks has proven that the algorithm has almost linear complexity, O(m¹·¹⁹), and scales even better than the basic label propagation algorithm (m is the number of edges in the network).
Collapse
Affiliation(s)
- Lovro Subelj
- Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia.
| | | |
Collapse
|