301
|
Lai D, Lu H, Nardini C. Enhanced modularity-based community detection by random walk network preprocessing. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:066118. [PMID: 20866489 DOI: 10.1103/physreve.81.066118] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2010] [Revised: 04/21/2010] [Indexed: 05/29/2023]
Abstract
The representation of real systems with network models is becoming increasingly common and critical to both capture and simplify systems' complexity, notably, via the partitioning of networks into communities. In this respect, the definition of modularity, a common and broadly used quality measure for networks partitioning, has induced a surge of efficient modularity-based community detection algorithms. However, recently, the optimization of modularity has been found to show a resolution limit, which reduces its effectiveness and range of applications. Therefore, one recent trend in this area of research has been related to the definition of novel quality functions, alternative to modularity. In this paper, however, instead of laying aside the important body of knowledge developed so far for modularity-based algorithms, we propose to use a strategy to preprocess networks before feeding them into modularity-based algorithms. This approach is based on the observation that dynamic processes triggered on vertices in the same community possess similar behavior patterns but dissimilar on vertices in different communities. Validations on real-world and synthetic networks demonstrate that network preprocessing can enhance the modularity-based community detection algorithms to find more natural clusters and effectively alleviates the problem of resolution limit.
Collapse
Affiliation(s)
- Darong Lai
- Department of Computer Science and Engineering, Shanghai Jiao Tong University, 200240 Shanghai, China.
| | | | | |
Collapse
|
302
|
|
303
|
Cafieri S, Hansen P, Liberti L. Loops and multiple edges in modularity maximization of networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:046102. [PMID: 20481781 DOI: 10.1103/physreve.81.046102] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2009] [Indexed: 05/29/2023]
Abstract
The modularity maximization model proposed by Newman and Girvan for the identification of communities in networks works for general graphs possibly with loops and multiple edges. However, the applications usually correspond to simple graphs. These graphs are compared to a null model where the degree distribution is maintained but edges are placed at random. Therefore, in this null model there will be loops and possibly multiple edges. Sharp bounds on the expected number of loops, and their impact on the modularity, are derived. Then, building upon the work of Massen and Doye, but using algebra rather than simulation, we propose modified null models associated with graphs without loops but with multiple edges, graphs with loops but without multiple edges and graphs without loops nor multiple edges. We validate our models by using the exact algorithm for clique partitioning of Grötschel and Wakabayashi.
Collapse
Affiliation(s)
- Sonia Cafieri
- Department Mathématiques et Informatique, Ecole Nationale de l'Aviation Civile, 7 av E Belin, F-31055 Toulouse, France.
| | | | | |
Collapse
|
304
|
Steinhaeuser K, Chawla NV. Identifying and evaluating community structure in complex networks. Pattern Recognit Lett 2010. [DOI: 10.1016/j.patrec.2009.11.001] [Citation(s) in RCA: 140] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
305
|
Lancichinetti A, Radicchi F, Ramasco JJ. Statistical significance of communities in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:046110. [PMID: 20481789 DOI: 10.1103/physreve.81.046110] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2009] [Revised: 03/08/2010] [Indexed: 05/12/2023]
Abstract
Nodes in real-world networks are usually organized in local modules. These groups, called communities, are intuitively defined as subgraphs with a larger density of internal connections than of external links. In this work, we define a measure aimed at quantifying the statistical significance of single communities. Extreme and order statistics are used to predict the statistics associated with individual clusters in random graphs. These distributions allows us to define one community significance as the probability that a generic clustering algorithm finds such a group in a random graph. The method is successfully applied in the case of real-world networks for the evaluation of the significance of their communities.
Collapse
Affiliation(s)
- Andrea Lancichinetti
- Complex Networks Lagrange Laboratory, Turin, Italy and Physics Department, Politecnico di Torino, ISI Foundation, Turin, Italy
| | | | | |
Collapse
|
306
|
Chin A, Chignell M, Wang H. Tracking cohesive subgroups over time in inferred social networks. NEW REV HYPERMEDIA M 2010. [DOI: 10.1080/13614568.2010.496132] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
307
|
Good BH, de Montjoye YA, Clauset A. Performance of modularity maximization in practical contexts. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:046106. [PMID: 20481785 DOI: 10.1103/physreve.81.046106] [Citation(s) in RCA: 320] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2009] [Indexed: 05/12/2023]
Abstract
Although widely used in practice, the behavior and accuracy of the popular module identification technique called modularity maximization is not well understood in practical contexts. Here, we present a broad characterization of its performance in such situations. First, we revisit and clarify the resolution limit phenomenon for modularity maximization. Second, we show that the modularity function Q exhibits extreme degeneracies: it typically admits an exponential number of distinct high-scoring solutions and typically lacks a clear global maximum. Third, we derive the limiting behavior of the maximum modularity Qmax for one model of infinitely modular networks, showing that it depends strongly both on the size of the network and on the number of modules it contains. Finally, using three real-world metabolic networks as examples, we show that the degenerate solutions can fundamentally disagree on many, but not all, partition properties such as the composition of the largest modules and the distribution of module sizes. These results imply that the output of any modularity maximization procedure should be interpreted cautiously in scientific contexts. They also explain why many heuristics are often successful at finding high-scoring partitions in practice and why different heuristics can disagree on the modular structure of the same network. We conclude by discussing avenues for mitigating some of these behaviors, such as combining information from many degenerate solutions or using generative models.
Collapse
Affiliation(s)
- Benjamin H Good
- Department of Physics, Swarthmore College, Swarthmore, Pennsylvania 19081, USA and Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, New Mexico 87501, USA.
| | | | | |
Collapse
|
308
|
Cafieri S, Hansen P, Liberti L. Edge ratio and community structure in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:026105. [PMID: 20365629 DOI: 10.1103/physreve.81.026105] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2009] [Revised: 12/12/2009] [Indexed: 05/29/2023]
Abstract
A hierarchical divisive algorithm is proposed for identifying communities in complex networks. To that effect, the definition of community in the weak sense of Radicchi [Proc. Natl. Acad. Sci. U.S.A. 101, 2658 (2004)] is extended into a criterion for a bipartition to be optimal: one seeks to maximize the minimum for both classes of the bipartition of the ratio of inner edges to cut edges. A mathematical program is used within a dichotomous search to do this in an optimal way for each bipartition. This includes an exact solution of the problem of detecting indivisible communities. The resulting hierarchical divisive algorithm is compared with exact modularity maximization on both artificial and real world data sets. For two problems of the former kind optimal solutions are found; for five problems of the latter kind the edge ratio algorithm always appears to be competitive. Moreover, it provides additional information in several cases, notably through the use of the dendrogram summarizing the resolution. Finally, both algorithms are compared on reduced versions of the data sets of Girvan and Newman [Proc. Natl. Acad. Sci. U.S.A. 99, 7821 (2002)] and of Lancichinetti [Phys. Rev. E 78, 046110 (2008)]. Results for these instances appear to be comparable.
Collapse
Affiliation(s)
- Sonia Cafieri
- LIX, Ecole Polytechnique, F-91128 Palaiseau, France.
| | | | | |
Collapse
|
309
|
Ma X, Gao L, Yong X, Fu L. Semi-supervised clustering algorithm for community structure detection in complex networks. PHYSICA A: STATISTICAL MECHANICS AND ITS APPLICATIONS 2010; 389:187-197. [DOI: 10.1016/j.physa.2009.09.018] [Citation(s) in RCA: 97] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
310
|
|
311
|
Chauhan S, Girvan M, Ott E. Spectral properties of networks with community structure. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 80:056114. [PMID: 20365050 DOI: 10.1103/physreve.80.056114] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2009] [Indexed: 05/29/2023]
Abstract
In this paper, we discuss the eigenspectra of networks with community structure. It is shown that in many cases, the spectrum of eigenvalues of the adjacency matrix of a network with community structure gives a clear indication of the number of communities in the network. In particular, for a network with N nodes and N_(c) communities, there will typically be N_(c) eigenvalues that are significantly larger than the magnitudes of all the other (N-N_(c)) eigenvalues. We discuss this property as well as its use and limitations for determining N_(c) .
Collapse
Affiliation(s)
- Sanjeev Chauhan
- Department of Physics, University of Maryland, College Park, MD 20742, USA.
| | | | | |
Collapse
|
312
|
Liang X, Liu Z, Li B. Weak signal transmission in complex networks and its application in detecting connectivity. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 80:046102. [PMID: 19905385 DOI: 10.1103/physreve.80.046102] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2008] [Revised: 03/26/2009] [Indexed: 05/28/2023]
Abstract
We present a network model of coupled oscillators to study how a weak signal is transmitted in complex networks. Through both theoretical analysis and numerical simulations, we find that the response of other nodes to the weak signal decays exponentially with their topological distance to the signal source and the coupling strength between two neighboring nodes can be figured out by the responses. This finding can be conveniently used to detect the topology of unknown network, such as the degree distribution, clustering coefficient and community structure, etc., by repeatedly choosing different nodes as the signal source. Through four typical networks, i.e., the regular one dimensional, small world, random, and scale-free networks, we show that the features of network can be approximately given by investigating many fewer nodes than the network size, thus our approach to detect the topology of unknown network may be efficient in practical situations with large network size.
Collapse
Affiliation(s)
- Xiaoming Liang
- Institute of Theoretical Physics, Department of Physics, East China Normal University, Shanghai, China
| | | | | |
Collapse
|
313
|
Traag VA, Bruggeman J. Community detection in networks with positive and negative links. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 80:036115. [PMID: 19905188 DOI: 10.1103/physreve.80.036115] [Citation(s) in RCA: 137] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2008] [Revised: 07/09/2009] [Indexed: 05/28/2023]
Abstract
Detecting communities in complex networks accurately is a prime challenge, preceding further analyses of network characteristics and dynamics. Until now, community detection took into account only positively valued links, while many actual networks also feature negative links. We extend an existing Potts model to incorporate negative links as well, resulting in a method similar to the clustering of signed graphs, as dealt with in social balance theory, but more general. To illustrate our method, we applied it to a network of international alliances and disputes. Using data from 1993-2001, it turns out that the world can be divided into six power blocs similar to Huntington's civilizations, with some notable exceptions.
Collapse
Affiliation(s)
- V A Traag
- Department of Mathematical Engineering, Université Catholique de Louvain, Bâtiment Euler, Avenue G Lemaître, 4 B-1348 Louvain-la-neuve, Belgium.
| | | |
Collapse
|
314
|
Lü Z, Huang W. Iterated tabu search for identifying community structure in complex networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 80:026130. [PMID: 19792223 DOI: 10.1103/physreve.80.026130] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2009] [Revised: 07/20/2009] [Indexed: 05/28/2023]
Abstract
This paper presents an iterated tabu search (denoted by ITS) algorithm for optimizing the modularity of community structure in complex networks. The proposed algorithm follows a general framework composed of two phases: basic optimization and postrefinement. When the basic optimization cannot improve the modularity any more, a postrefinement procedure is employed to further optimize the objective function with a global view. For both these two phases, iterated tabu search algorithm is employed to optimize the objective function. Computational results show the high effectiveness of the proposed ITS algorithm compared with six state-of-the-art algorithms in the literature. In particular, our ITS algorithm improves the previous best known modularity for several small and medium size networks.
Collapse
Affiliation(s)
- Zhipeng Lü
- School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.
| | | |
Collapse
|
315
|
Gómez S, Jensen P, Arenas A. Analysis of community structure in networks of correlated data. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 80:016114. [PMID: 19658781 DOI: 10.1103/physreve.80.016114] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2008] [Revised: 03/16/2009] [Indexed: 05/14/2023]
Abstract
We present a reformulation of modularity that allows the analysis of the community structure in networks of correlated data. The modularity preserves the probabilistic semantics of the original definition even when the network is directed, weighted, signed, and has self-loops. This is the most general condition one can find in the study of any network, in particular those defined from correlated data. We apply our results to a real network of correlated data between stores in the city of Lyon (France).
Collapse
Affiliation(s)
- Sergio Gómez
- Departament d'Enginyeria Informàtica i Matemàtiques, Universitat Rovira i Virgili, 43007 Tarragona, Spain
| | | | | |
Collapse
|
316
|
Medus AD, Dorso CO. Alternative approach to community detection in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 79:066111. [PMID: 19658568 DOI: 10.1103/physreve.79.066111] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2008] [Revised: 05/21/2009] [Indexed: 05/28/2023]
Abstract
The problem of community detection is relevant in many disciplines of science and modularity optimization is the widely accepted method for this purpose. It has recently been shown that this approach presents a resolution limit by which it is not possible to detect communities with sizes smaller than a threshold, which depends on the network size. Moreover, it might happen that the communities resulting from such an approach do not satisfy the usual qualitative definition of commune; i.e., nodes in a commune are more connected among themselves than to nodes outside the commune. In this paper we present a different method for community detection in complex networks. We define merit factors based on the weak and strong community definitions formulated by Radicchi [Proc. Natl. Acad. Sci. U.S.A. 101, 2658 (2004)] and we show that these local definitions avoid the resolution limit problem found in the modularity optimization approach.
Collapse
Affiliation(s)
- A D Medus
- Departamento de Física, Universidad de Buenos Aires, Ciudad Autónoma de Buenos Aires 1428, Argentina.
| | | |
Collapse
|
317
|
Stone EA, Ayroles JF. Modulated modularity clustering as an exploratory tool for functional genomic inference. PLoS Genet 2009; 5:e1000479. [PMID: 19424432 PMCID: PMC2673040 DOI: 10.1371/journal.pgen.1000479] [Citation(s) in RCA: 105] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2008] [Accepted: 04/10/2009] [Indexed: 11/18/2022] Open
Abstract
In recent years, the advent of high-throughput assays, coupled with their diminishing cost, has facilitated a systems approach to biology. As a consequence, massive amounts of data are currently being generated, requiring efficient methodology aimed at the reduction of scale. Whole-genome transcriptional profiling is a standard component of systems-level analyses, and to reduce scale and improve inference clustering genes is common. Since clustering is often the first step toward generating hypotheses, cluster quality is critical. Conversely, because the validation of cluster-driven hypotheses is indirect, it is critical that quality clusters not be obtained by subjective means. In this paper, we present a new objective-based clustering method and demonstrate that it yields high-quality results. Our method, modulated modularity clustering (MMC), seeks community structure in graphical data. MMC modulates the connection strengths of edges in a weighted graph to maximize an objective function (called modularity) that quantifies community structure. The result of this maximization is a clustering through which tightly-connected groups of vertices emerge. Our application is to systems genetics, and we quantitatively compare MMC both to the hierarchical clustering method most commonly employed and to three popular spectral clustering approaches. We further validate MMC through analyses of human and Drosophila melanogaster expression data, demonstrating that the clusters we obtain are biologically meaningful. We show MMC to be effective and suitable to applications of large scale. In light of these features, we advocate MMC as a standard tool for exploration and hypothesis generation.
Collapse
Affiliation(s)
- Eric A Stone
- Department of Statistics, North Carolina State University, Raleigh, NC, USA.
| | | |
Collapse
|
318
|
|
319
|
Ren W, Yan G, Liao X, Xiao L. Simple probabilistic algorithm for detecting community structure. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 79:036111. [PMID: 19392022 DOI: 10.1103/physreve.79.036111] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2007] [Revised: 12/01/2008] [Indexed: 05/27/2023]
Abstract
With the growing number of available social and biological networks, the problem of detecting the network community structure is becoming more and more important which acts as the first step to analyze these data. The community structure is generally regarded as that nodes in the same community tend to have more edges and less if they are in different communities. We propose a simple probabilistic algorithm for detecting community structure which employs expectation-maximization (SPAEM). We also give a criterion based on the minimum description length to identify the optimal number of communities. SPAEM can detect overlapping nodes and handle weighted networks. It turns out to be powerful and effective by testing simulation data and some widely known data sets.
Collapse
Affiliation(s)
- Wei Ren
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, No. 55 Zhongguncun East Road, Beijing, China.
| | | | | | | |
Collapse
|
320
|
Yen L, Fouss F, Decaestecker C, Francq P, Saerens M. Graph nodes clustering with the sigmoid commute-time kernel: A comparative study. DATA KNOWL ENG 2009. [DOI: 10.1016/j.datak.2008.10.006] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
321
|
|
322
|
|
323
|
|
324
|
Uncovering Overlap Community Structure in Complex Networks Using Particle Competition. ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE 2009. [DOI: 10.1007/978-3-642-05253-8_68] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
325
|
Orman GK, Labatut V. A Comparison of Community Detection Algorithms on Artificial Networks. DISCOVERY SCIENCE 2009. [DOI: 10.1007/978-3-642-04747-3_20] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
326
|
Teitelbaum T, Balenzuela P, Cano P, Buldú JM. Community structures and role detection in music networks. CHAOS (WOODBURY, N.Y.) 2008; 18:043105. [PMID: 19123615 DOI: 10.1063/1.2988285] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
We analyze the existence of community structures in two different social networks using data obtained from similarity and collaborative features between musical artists. Our analysis reveals some characteristic organizational patterns and provides information about the driving forces behind the growth of the networks. In the similarity network, we find a strong correlation between clusters of artists and musical genres. On the other hand, the collaboration network shows two different kinds of communities: rather small structures related to music bands and geographic zones, and much bigger communities built upon collaborative clusters with a high number of participants related through the period the artists were active. Finally, we detect the leading artists inside their corresponding communities and analyze their roles in the network by looking at a few topological properties of the nodes.
Collapse
Affiliation(s)
- T Teitelbaum
- Departamento de Fisica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and CONICET, Buenos Aires, Argentina
| | | | | | | |
Collapse
|
327
|
Carmi S, Krapivsky PL, Ben-Avraham D. Partition of networks into basins of attraction. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 78:066111. [PMID: 19256909 DOI: 10.1103/physreve.78.066111] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/05/2008] [Indexed: 05/27/2023]
Abstract
We study partition of networks into basins of attraction based on a steepest ascent search for the node of highest degree. Each node is associated with, or "attracted" to its neighbor of maximal degree, as long as the degree is increasing. A node that has no neighbors of higher degree is a peak, attracting all the nodes in its basin. Maximally random scale-free networks exhibit different behavior based on their degree distribution exponent gamma : For small gamma (broad distribution) networks are dominated by a giant basin, whereas for large gamma (narrow distribution) there are numerous basins, with peaks attracting mainly their nearest neighbors. We derive expressions for the first two moments of the number of basins. We also obtain the complete distribution of basin sizes for a class of hierarchical deterministic scale-free networks that resemble random nets. Finally, we generalize the problem to regular networks and lattices where all degrees are equal, and thus the attractiveness of a node must be determined by an assigned weight, rather than the degree. We derive the complete distribution of basins of attraction resulting from randomly assigned weights in one-dimensional chains.
Collapse
Affiliation(s)
- Shai Carmi
- Center for Polymer Studies, Boston University, Boston, Massachusetts 02215, USA
| | | | | |
Collapse
|
328
|
Wang RS, Zhang S, Wang Y, Zhang XS, Chen L. Clustering complex networks and biological networks by nonnegative matrix factorization with various similarity measures. Neurocomputing 2008. [DOI: 10.1016/j.neucom.2007.12.043] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
329
|
Saez-Rodriguez J, Gayer S, Ginkel M, Gilles ED. Automatic decomposition of kinetic models of signaling networks minimizing the retroactivity among modules. ACTA ACUST UNITED AC 2008; 24:i213-9. [PMID: 18689828 DOI: 10.1093/bioinformatics/btn289] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION The modularity of biochemical networks in general, and signaling networks in particular, has been extensively studied over the past few years. It has been proposed to be a useful property to analyze signaling networks: by decomposing the network into subsystems, more manageable units are obtained that are easier to analyze. While many powerful algorithms are available to identify modules in protein interaction networks, less attention has been paid to signaling networks de.ned as chemical systems. Such a decomposition would be very useful as most quantitative models are de.ned using the latter, more detailed formalism. RESULTS Here, we introduce a novel method to decompose biochemical networks into modules so that the bidirectional (retroactive) couplings among the modules are minimized. Our approach adapts a method to detect community structures, and applies it to the so-called retroactivity matrix that characterizes the couplings of the network. Only the structure of the network, e.g. in SBML format, is required. Furthermore, the modularized models can be loaded into ProMoT, a modeling tool which supports modular modeling. This allows visualization of the models, exploiting their modularity and easy generation of models of one or several modules for further analysis. The method is applied to several relevant cases, including an entangled model of the EGF-induced MAPK cascade and a comprehensive model of EGF signaling, demonstrating its ability to uncover meaningful modules. Our approach can thus help to analyze large networks, especially when little a priori knowledge on the structure of the network is available. AVAILABILITY The decomposition algorithms implemented in MATLAB (Mathworks, Inc.) are freely available upon request. ProMoT is freely available at http://www.mpi-magdeburg.mpg.de/projects/promot. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Julio Saez-Rodriguez
- Max-Planck-Institute for Dynamics of Complex Technical Systems, Sandtorstr 1, 39106 Magdeburg, Germany
| | | | | | | |
Collapse
|
330
|
Lancichinetti A, Fortunato S, Radicchi F. Benchmark graphs for testing community detection algorithms. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 78:046110. [PMID: 18999496 DOI: 10.1103/physreve.78.046110] [Citation(s) in RCA: 648] [Impact Index Per Article: 38.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2008] [Revised: 09/11/2008] [Indexed: 05/11/2023]
Abstract
Community structure is one of the most important features of real networks and reveals the internal organization of the nodes. Many algorithms have been proposed but the crucial issue of testing, i.e., the question of how good an algorithm is, with respect to others, is still open. Standard tests include the analysis of simple artificial graphs with a built-in community structure, that the algorithm has to recover. However, the special graphs adopted in actual tests have a structure that does not reflect the real properties of nodes and communities found in real networks. Here we introduce a class of benchmark graphs, that account for the heterogeneity in the distributions of node degrees and of community sizes. We use this benchmark to test two popular methods of community detection, modularity optimization, and Potts model clustering. The results show that the benchmark poses a much more severe test to algorithms than standard benchmarks, revealing limits that may not be apparent at a first analysis.
Collapse
Affiliation(s)
- Andrea Lancichinetti
- Complex Systems Lagrange Laboratory (CNLL), Institute for Scientific Interchange (ISI), Viale S. Severo 65, 10133, Torino, Italy
| | | | | |
Collapse
|
331
|
Ye Z, Hu S, Yu J. Adaptive clustering algorithm for community detection in complex networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 78:046115. [PMID: 18999501 DOI: 10.1103/physreve.78.046115] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2008] [Indexed: 05/27/2023]
Abstract
Community structure is common in various real-world networks; methods or algorithms for detecting such communities in complex networks have attracted great attention in recent years. We introduced a different adaptive clustering algorithm capable of extracting modules from complex networks with considerable accuracy and robustness. In this approach, each node in a network acts as an autonomous agent demonstrating flocking behavior where vertices always travel toward their preferable neighboring groups. An optimal modular structure can emerge from a collection of these active nodes during a self-organization process where vertices constantly regroup. In addition, we show that our algorithm appears advantageous over other competing methods (e.g., the Newman-fast algorithm) through intensive evaluation. The applications in three real-world networks demonstrate the superiority of our algorithm to find communities that are parallel with the appropriate organization in reality.
Collapse
Affiliation(s)
- Zhenqing Ye
- James D. Watson Institute of Genome Sciences, Zhejiang University, Hangzhou, China
| | | | | |
Collapse
|
332
|
Schuetz P, Caflisch A. Multistep greedy algorithm identifies community structure in real-world and computer-generated networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 78:026112. [PMID: 18850902 DOI: 10.1103/physreve.78.026112] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2008] [Indexed: 05/26/2023]
Abstract
We have recently introduced a multistep extension of the greedy algorithm for modularity optimization. The extension is based on the idea that merging l pairs of communities (l>1) at each iteration prevents premature condensation into few large communities. Here, an empirical formula is presented for the choice of the step width l that generates partitions with (close to) optimal modularity for 17 real-world and 1100 computer-generated networks. Furthermore, an in-depth analysis of the communities of two real-world networks (the metabolic network of the bacterium E. coli and the graph of coappearing words in the titles of papers coauthored by Martin Karplus) provides evidence that the partition obtained by the multistep greedy algorithm is superior to the one generated by the original greedy algorithm not only with respect to modularity, but also according to objective criteria. In other words, the multistep extension of the greedy algorithm reduces the danger of getting trapped in local optima of modularity and generates more reasonable partitions.
Collapse
Affiliation(s)
- Philipp Schuetz
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland.
| | | |
Collapse
|
333
|
Estrada E, Higham DJ, Hatano N. Communicability and multipartite structures in complex networks at negative absolute temperatures. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 78:026102. [PMID: 18850892 DOI: 10.1103/physreve.78.026102] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2008] [Indexed: 05/26/2023]
Abstract
We here present a method of clearly identifying multipartite subgraphs in a network. The method is based on a recently introduced concept of the communicability, which very clearly identifies communities in a complex network. We here show that, while the communicability at a positive temperature is useful in identifying communities, the communicability at a negative temperature is useful in identifying multipartite subgraphs; the latter quantity between two nodes is positive when the two nodes belong to the same subgraph and is negative when they do not. The method is able to discover "almost" multipartite structures, where intercommunity connections vastly outweigh intracommunity connections. We illustrate the relevance of this work to real-life food web and protein-protein interaction networks.
Collapse
Affiliation(s)
- Ernesto Estrada
- Institute of Complexity Science, Department of Physics and Department of Mathematics, University of Strathclyde, Glasgow G1 1XH, United Kingdom.
| | | | | |
Collapse
|
334
|
Hu Y, Chen H, Zhang P, Li M, Di Z, Fan Y. Comparative definition of community and corresponding identifying algorithm. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 78:026121. [PMID: 18850911 DOI: 10.1103/physreve.78.026121] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2008] [Indexed: 05/26/2023]
Abstract
A comparative definition for community in networks is proposed, and the corresponding detecting algorithm is given. A community is defined as a set of nodes, which satisfies the requirement that each node's degree inside the community should not be smaller than the node's degree toward any other community. In the algorithm, the attractive force of a community to a node is defined as the connections between them. Then employing an attractive-force-based self-organizing process, without any extra parameter, the best communities can be detected. Several artificial and real-world networks, including the Zachary karate club, college football, and large scientific collaboration networks, are analyzed. The algorithm works well in detecting communities, and it also gives a nice description of network division and group formation.
Collapse
Affiliation(s)
- Yanqing Hu
- Department of Systems Science, School of Management, Center for Complexity Research, Beijing Normal University, Beijing 100875, China
| | | | | | | | | | | |
Collapse
|
335
|
Chin A, Chignell M. Automatic detection of cohesive subgroups within social hypertext: A heuristic approach. NEW REV HYPERMEDIA M 2008. [DOI: 10.1080/13614560802357180] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
336
|
Hu Y, Li M, Zhang P, Fan Y, Di Z. Community detection by signaling on complex networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 78:016115. [PMID: 18764028 DOI: 10.1103/physreve.78.016115] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2007] [Revised: 04/23/2008] [Indexed: 05/26/2023]
Abstract
Based on a signaling process of complex networks, a method for identification of community structure is proposed. For a network with n nodes, every node is assumed to be a system which can send, receive, and record signals. Each node is taken as the initial signal source to excite the whole network one time. Then the source node is associated with an n -dimensional vector which records the effects of the signaling process. By this process, the topological relationship of nodes on the network could be transferred into a geometrical structure of vectors in n -dimensional Euclidean space. Then the best partition of groups is determined by F statistics and the final community structure is given by the K -means clustering method. This method can detect community structure both in unweighted and weighted networks. It has been applied to ad hoc networks and some real networks such as the Zachary karate club network and football team network. The results indicate that the algorithm based on the signaling process works well.
Collapse
Affiliation(s)
- Yanqing Hu
- Department of Systems Science, School of Management, Center for Complexity Research, Beijing Normal University, Beijing 100875, China
| | | | | | | | | |
Collapse
|
337
|
E W, Li T, Vanden-Eijnden E. Optimal partition and effective dynamics of complex networks. Proc Natl Acad Sci U S A 2008; 105:7907-12. [PMID: 18303119 PMCID: PMC2786939 DOI: 10.1073/pnas.0707563105] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2007] [Indexed: 11/18/2022] Open
Abstract
Given a large and complex network, we would like to find the best partition of this network into a small number of clusters. This question has been addressed in many different ways. Here we propose a strategy along the lines of optimal prediction for the Markov chains associated with the dynamics on these networks. We develop the necessary ingredients for such an optimal partition strategy, and we compare our strategy with the previous ones. We show that when the Markov chain is lumpable, we recover the partition with respect to which the chain is lumpable. We also discuss the case of well-clustered networks. Finally, we illustrate our strategy on several examples.
Collapse
Affiliation(s)
- Weinan E
- Department of Mathematics and Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ 08544
- School of Mathematical Sciences, Peking University, Beijing 100871, China
| | - Tiejun Li
- Laboratory of Mathematics and Applied Mathematics and School of Mathematical Sciences, Peking University, Beijing 100871, China; and
| | | |
Collapse
|
338
|
Krawczyk MJ. Differential equations as a tool for community identification. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 77:065701. [PMID: 18643329 DOI: 10.1103/physreve.77.065701] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2008] [Indexed: 05/26/2023]
Abstract
We consider the task of identification of a cluster structure in random networks. The results of two methods are presented: (i) the Newman algorithm [M. E. J. Newman and M. Girvan, Phys. Rev. E 69, 026113 (2004)]; and (ii) our method based on differential equations. A series of computer experiments is performed to check if in applying these methods we are able to determine the structure of the network. The trial networks consist initially of well-defined clusters and are disturbed by introducing noise into their connectivity matrices. Further, we show that an improvement of the previous version of our method is possible by an appropriate choice of the threshold parameter beta . With this change, the results obtained by the two methods above are similar, and our method works better, for all the computer experiments we have done.
Collapse
Affiliation(s)
- Małgorzata J Krawczyk
- Faculty of Physics and Applied Computer Science, AGH University of Science and Technology, al. Mickiewicza 30, 30-059 Kraków, Poland.
| |
Collapse
|
339
|
Newman MEJ, Ghoshal G. Bicomponents and the robustness of networks to failure. PHYSICAL REVIEW LETTERS 2008; 100:138701. [PMID: 18518004 DOI: 10.1103/physrevlett.100.138701] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2007] [Indexed: 05/26/2023]
Abstract
We study bicomponents in networks, sets of nodes such that each pair in the set is connected by at least two independent paths, so that the failure of no single node in the network can cause them to become disconnected. We show that standard network models predict there to be essentially no small bicomponents in most networks, but there may be a giant bicomponent, whose presence coincides with the presence of the ordinary giant component, and we find that real networks seem by and large to follow this pattern, although there are some interesting exceptions. We also study the size of the giant bicomponent as nodes in the network fail and find in some cases that our networks are quite robust to failure, with large bicomponents persisting until almost all vertices have been removed.
Collapse
Affiliation(s)
- M E J Newman
- Department of Physics, University of Michigan, Ann Arbor, Michigan 48109, USA
| | | |
Collapse
|
340
|
Lozano S, Arenas A, Sánchez A. Mesoscopic structure conditions the emergence of cooperation on social networks. PLoS One 2008; 3:e1892. [PMID: 18382673 PMCID: PMC2274863 DOI: 10.1371/journal.pone.0001892] [Citation(s) in RCA: 92] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2007] [Accepted: 02/25/2008] [Indexed: 11/19/2022] Open
Abstract
Background We study the evolutionary Prisoner's Dilemma on two social networks substrates obtained from actual relational data. Methodology/Principal Findings We find very different cooperation levels on each of them that cannot be easily understood in terms of global statistical properties of both networks. We claim that the result can be understood at the mesoscopic scale, by studying the community structure of the networks. We explain the dependence of the cooperation level on the temptation parameter in terms of the internal structure of the communities and their interconnections. We then test our results on community-structured, specifically designed artificial networks, finding a good agreement with the observations in both real substrates. Conclusion Our results support the conclusion that studies of evolutionary games on model networks and their interpretation in terms of global properties may not be sufficient to study specific, real social systems. Further, the study allows us to define new quantitative parameters that summarize the mesoscopic structure of any network. In addition, the community perspective may be helpful to interpret the origin and behavior of existing networks as well as to design structures that show resilient cooperative behavior.
Collapse
Affiliation(s)
- Sergi Lozano
- ETH Zurich, Swiss Federal Institute of Technology, Zurich, Switzerland.
| | | | | |
Collapse
|
341
|
Schuetz P, Caflisch A. Efficient modularity optimization by multistep greedy algorithm and vertex mover refinement. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 77:046112. [PMID: 18517695 DOI: 10.1103/physreve.77.046112] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2007] [Revised: 02/22/2008] [Indexed: 05/26/2023]
Abstract
Identifying strongly connected substructures in large networks provides insight into their coarse-grained organization. Several approaches based on the optimization of a quality function, e.g., the modularity, have been proposed. We present here a multistep extension of the greedy algorithm (MSG) that allows the merging of more than one pair of communities at each iteration step. The essential idea is to prevent the premature condensation into few large communities. Upon convergence of the MSG a simple refinement procedure called "vertex mover" (VM) is used for reassigning vertices to neighboring communities to improve the final modularity value. With an appropriate choice of the step width, the combined MSG-VM algorithm is able to find solutions of higher modularity than those reported previously. The multistep extension does not alter the scaling of computational cost of the greedy algorithm.
Collapse
Affiliation(s)
- Philipp Schuetz
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | | |
Collapse
|
342
|
Karrer B, Levina E, Newman MEJ. Robustness of community structure in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 77:046119. [PMID: 18517702 DOI: 10.1103/physreve.77.046119] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2007] [Indexed: 05/21/2023]
Abstract
The discovery of community structure is a common challenge in the analysis of network data. Many methods have been proposed for finding community structure, but few have been proposed for determining whether the structure found is statistically significant or whether, conversely, it could have arisen purely as a result of chance. In this paper we show that the significance of community structure can be effectively quantified by measuring its robustness to small perturbations in network structure. We propose a suitable method for perturbing networks and a measure of the resulting change in community structure and use them to assess the significance of community structure in a variety of networks, both real and computer generated.
Collapse
Affiliation(s)
- Brian Karrer
- Department of Physics, University of Michigan, Ann Arbor, Michigan 48109, USA
| | | | | |
Collapse
|
343
|
Rubensson EH, Bock N, Holmström E, Niklasson AMN. Recursive inverse factorization. J Chem Phys 2008; 128:104105. [DOI: 10.1063/1.2884921] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
344
|
Li Z, Zhang S, Wang RS, Zhang XS, Chen L. Quantitative function for community detection. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 77:036109. [PMID: 18517463 DOI: 10.1103/physreve.77.036109] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2007] [Revised: 12/02/2007] [Indexed: 05/26/2023]
Abstract
We propose a quantitative function for community partition -- i.e., modularity density or D value. We demonstrate that this quantitative function is superior to the widely used modularity Q and also prove its equivalence with the objective function of the kernel k means. Both theoretical and numerical results show that optimizing the new criterion not only can resolve detailed modules that existing approaches cannot achieve, but also can correctly identify the number of communities.
Collapse
|
345
|
Ruan J, Zhang W. Identifying network communities with a high resolution. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 77:016104. [PMID: 18351912 DOI: 10.1103/physreve.77.016104] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2007] [Revised: 10/05/2007] [Indexed: 05/26/2023]
Abstract
Community structure is an important property of complex networks. The automatic discovery of such structure is a fundamental task in many disciplines, including sociology, biology, engineering, and computer science. Recently, several community discovery algorithms have been proposed based on the optimization of a modularity function (Q) . However, the problem of modularity optimization is NP-hard and the existing approaches often suffer from a prohibitively long running time or poor quality. Furthermore, it has been recently pointed out that algorithms based on optimizing Q will have a resolution limit; i.e., communities below a certain scale may not be detected. In this research, we first propose an efficient heuristic algorithm QCUT, which combines spectral graph partitioning and local search to optimize Q . Using both synthetic and real networks, we show that QCUT can find higher modularities and is more scalable than the existing algorithms. Furthermore, using QCUT as an essential component, we propose a recursive algorithm HQCUT to solve the resolution limit problem. We show that HQCUT can successfully detect communities at a much finer scale or with a higher accuracy than the existing algorithms. We also discuss two possible reasons that can cause the resolution limit problem and provide a method to distinguish them. Finally, we apply QCUT and HQCUT to study a protein-protein interaction network and show that the combination of the two algorithms can reveal interesting biological results that may be otherwise undetected.
Collapse
Affiliation(s)
- Jianhua Ruan
- Department of Computer Science and Engineering, Washington University, St. Louis, Missouri 63130, USA
| | | |
Collapse
|
346
|
|
347
|
Barber MJ. Modularity and community detection in bipartite networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2007; 76:066102. [PMID: 18233893 DOI: 10.1103/physreve.76.066102] [Citation(s) in RCA: 244] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2007] [Revised: 09/13/2007] [Indexed: 05/14/2023]
Abstract
The modularity of a network quantifies the extent, relative to a null model network, to which vertices cluster into community groups. We define a null model appropriate for bipartite networks, and use it to define a bipartite modularity. The bipartite modularity is presented in terms of a modularity matrix B; some key properties of the eigenspectrum of B are identified and used to describe an algorithm for identifying modules in bipartite networks. The algorithm is based on the idea that the modules in the two parts of the network are dependent, with each part mutually being used to induce the vertices for the other part into the modules. We apply the algorithm to real-world network data, showing that the algorithm successfully identifies the modular structure of bipartite networks.
Collapse
Affiliation(s)
- Michael J Barber
- Austrian Research Centers GmbH-ARC, Bereich Systems Research, Vienna, Austria.
| |
Collapse
|
348
|
Pan RK, Sinha S. Modular networks emerge from multiconstraint optimization. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2007; 76:045103. [PMID: 17995048 DOI: 10.1103/physreve.76.045103] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2007] [Revised: 07/17/2007] [Indexed: 05/25/2023]
Abstract
Modular structure is ubiquitous among complex networks. We note that most such systems are subject to multiple structural and functional constraints, e.g., minimizing the average path length and the total number of links, while maximizing robustness against perturbations in node activity. We show that the optimal networks satisfying these three constraints are characterized by the existence of multiple subnetworks (modules) sparsely connected to each other. In addition, these modules have distinct hubs, resulting in an overall heterogeneous degree distribution.
Collapse
Affiliation(s)
- Raj Kumar Pan
- The Institute of Mathematical Sciences, CIT Campus, Taramani, Chennai 600 113, India
| | | |
Collapse
|
349
|
Raghavan UN, Albert R, Kumara S. Near linear time algorithm to detect community structures in large-scale networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2007; 76:036106. [PMID: 17930305 DOI: 10.1103/physreve.76.036106] [Citation(s) in RCA: 570] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2007] [Indexed: 05/23/2023]
Abstract
Community detection and analysis is an important methodology for understanding the organization of various real-world networks and has applications in problems as diverse as consensus formation in social communities or the identification of functional modules in biochemical networks. Currently used algorithms that identify the community structures in large-scale real-world networks require a priori information such as the number and sizes of communities or are computationally expensive. In this paper we investigate a simple label propagation algorithm that uses the network structure alone as its guide and requires neither optimization of a predefined objective function nor prior information about the communities. In our algorithm every node is initialized with a unique label and at every step each node adopts the label that most of its neighbors currently have. In this iterative process densely connected groups of nodes form a consensus on a unique label to form communities. We validate the algorithm by applying it to networks whose community structures are known. We also demonstrate that the algorithm takes an almost linear time and hence it is computationally less expensive than what was possible so far.
Collapse
Affiliation(s)
- Usha Nandini Raghavan
- Department of Industrial Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | | |
Collapse
|
350
|
Guimerà R, Sales-Pardo M, Amaral LAN. Module identification in bipartite and directed networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2007; 76:036102. [PMID: 17930301 PMCID: PMC2262941 DOI: 10.1103/physreve.76.036102] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2007] [Indexed: 05/08/2023]
Abstract
Modularity is one of the most prominent properties of real-world complex networks. Here, we address the issue of module identification in two important classes of networks: bipartite networks and directed unipartite networks. Nodes in bipartite networks are divided into two nonoverlapping sets, and the links must have one end node from each set. Directed unipartite networks only have one type of node, but links have an origin and an end. We show that directed unipartite networks can be conveniently represented as bipartite networks for module identification purposes. We report on an approach especially suited for module detection in bipartite networks, and we define a set of random networks that enable us to validate the approach.
Collapse
Affiliation(s)
- Roger Guimerà
- Northwestern Institute on Complex Systems (NICO) and Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA
| | | | | |
Collapse
|