501
|
|
502
|
Song DM, Tumminello M, Zhou WX, Mantegna RN. Evolution of worldwide stock markets, correlation structure, and correlation-based graphs. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 84:026108. [PMID: 21929065 DOI: 10.1103/physreve.84.026108] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/21/2011] [Indexed: 05/31/2023]
Abstract
We investigate the daily correlation present among market indices of stock exchanges located all over the world in the time period January 1996 to July 2009. We discover that the correlation among market indices presents both a fast and a slow dynamics. The slow dynamics reflects the development and consolidation of globalization. The fast dynamics is associated with critical events that originate in a specific country or region of the world and rapidly affect the global system. We provide evidence that the short term time scale of correlation among market indices is less than 3 trading months (about 60 trading days). The average values of the nondiagonal elements of the correlation matrix, correlation-based graphs, and the spectral properties of the largest eigenvalues and eigenvectors of the correlation matrix are carrying information about the fast and slow dynamics of the correlation of market indices. We introduce a measure of mutual information based on link co-occurrence in networks in order to detect the fast dynamics of successive changes of correlation-based graphs in a quantitative way.
Collapse
Affiliation(s)
- Dong-Ming Song
- School of Business, East China University of Science and Technology, Shanghai 200237, China
| | | | | | | |
Collapse
|
503
|
Yan X, Kelley S, Goldberg M, Biswal BB. Detecting overlapped functional clusters in resting state fMRI with Connected Iterative Scan: A graph theory based clustering algorithm. J Neurosci Methods 2011; 199:108-18. [DOI: 10.1016/j.jneumeth.2011.05.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2010] [Revised: 11/30/2010] [Accepted: 12/01/2010] [Indexed: 11/25/2022]
|
504
|
Traag VA, Van Dooren P, Nesterov Y. Narrow scope for resolution-limit-free community detection. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 84:016114. [PMID: 21867264 DOI: 10.1103/physreve.84.016114] [Citation(s) in RCA: 101] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Revised: 06/27/2011] [Indexed: 05/12/2023]
Abstract
Detecting communities in large networks has drawn much attention over the years. While modularity remains one of the more popular methods of community detection, the so-called resolution limit remains a significant drawback. To overcome this issue, it was recently suggested that instead of comparing the network to a random null model, as is done in modularity, it should be compared to a constant factor. However, it is unclear what is meant exactly by "resolution-limit-free," that is, not suffering from the resolution limit. Furthermore, the question remains what other methods could be classified as resolution-limit-free. In this paper we suggest a rigorous definition and derive some basic properties of resolution-limit-free methods. More importantly, we are able to prove exactly which class of community detection methods are resolution-limit-free. Furthermore, we analyze which methods are not resolution-limit-free, suggesting there is only a limited scope for resolution-limit-free community detection methods. Finally, we provide such a natural formulation, and show it performs superbly.
Collapse
Affiliation(s)
- V A Traag
- ICTEAM, Université Catholique de Louvain, Louvain-la Neuve, Belgium.
| | | | | |
Collapse
|
505
|
Papadopoulos S, Kompatsiaris Y, Vakali A, Spyridonos P. Community detection in Social Media. Data Min Knowl Discov 2011. [DOI: 10.1007/s10618-011-0224-z] [Citation(s) in RCA: 197] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
506
|
Sohn Y, Choi MK, Ahn YY, Lee J, Jeong J. Topological cluster analysis reveals the systemic organization of the Caenorhabditis elegans connectome. PLoS Comput Biol 2011; 7:e1001139. [PMID: 21625578 PMCID: PMC3098222 DOI: 10.1371/journal.pcbi.1001139] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2009] [Accepted: 04/20/2011] [Indexed: 11/18/2022] Open
Abstract
The modular organization of networks of individual neurons interwoven through synapses has not been fully explored due to the incredible complexity of the connectivity architecture. Here we use the modularity-based community detection method for directed, weighted networks to examine hierarchically organized modules in the complete wiring diagram (connectome) of Caenorhabditis elegans (C. elegans) and to investigate their topological properties. Incorporating bilateral symmetry of the network as an important cue for proper cluster assignment, we identified anatomical clusters in the C. elegans connectome, including a body-spanning cluster, which correspond to experimentally identified functional circuits. Moreover, the hierarchical organization of the five clusters explains the systemic cooperation (e.g., mechanosensation, chemosensation, and navigation) that occurs among the structurally segregated biological circuits to produce higher-order complex behaviors. Caenorhabditis elegans (C. elegans) is a tiny worm whose neuronal network is fully revealed. Since the modular organization in a network of individual neurons interwoven through synapses is not yet fully explored owing to incredibly complex connectivity architecture, this study is designed to investigate hierarchically organized modules in this complete wiring diagram (connectome) of this worm. We used the modularity-based community detection algorithm and found that C. elegans had 5 anatomical clusters in the C. elegans connectome, which corresponded to experimentally-identified functional circuits. We found that the hierarchical organization of the 5 clusters explains the systemic cooperation including mechanosensation, chemosensation, and navigation that occurs among the structurally-segregated biological circuits to produce higher-order complex behaviors.
Collapse
Affiliation(s)
- Yunkyu Sohn
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea
- Department of Political Science, University of California, San Diego, California, United States of America
| | - Myung-Kyu Choi
- Research Center for Cellulomics, Institute of Molecular Biology and Genetics School of Biological Sciences, Department of Biophysics and Chemical Biology, Seoul National University, Seoul, Republic of Korea
| | - Yong-Yeol Ahn
- Center for Complex Network Research, Department of Physics, Northeastern University, Boston, Massachusetts, United States of America
- Center for Cancer Systems Biology, Dana-Farber Cancer Institute, Harvard University, Boston, Massachusetts, United States of America
| | - Junho Lee
- Research Center for Cellulomics, Institute of Molecular Biology and Genetics School of Biological Sciences, Department of Biophysics and Chemical Biology, Seoul National University, Seoul, Republic of Korea
| | - Jaeseung Jeong
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea
- * E-mail:
| |
Collapse
|
507
|
Abstract
Modularity is a widely used quality measure for graph clusterings. Its exact maximization is NP-hard and prohibitively expensive for large graphs. Popular heuristics first perform a coarsening phase, where local search starting from singleton clusters is used to compute a preliminary clustering, and then optionally a refinement phase, where this clustering is improved by moving vertices between clusters. As a generalization, multilevel heuristics coarsen in several stages, and refine by moving entire clusters from each of these stages, not only individual vertices.
This article organizes existing and new single-level and multilevel heuristics into a coherent design space, and compares them experimentally with respect to their effectiveness (achieved modularity) and runtime. For coarsening by iterated cluster joining, it turns out that the most widely used criterion for joining clusters (modularity increase) is outperformed by other simple criteria, that a recent multistep algorithm [Schuetz and Caflisch 2008] is no improvement over simple single-step coarsening for these criteria, and that the recent multilevel coarsening by iterated vertex moving [Blondel et al. 2008] is somewhat faster but slightly less effective (with refinement). The new multilevel refinement is significantly more effective than the conventional single-level refinement or no refinement, in reasonable runtime.
A comparison with published benchmark results and algorithm implementations shows that multilevel local search heuristics, despite their relative simplicity, are competitive with the best algorithms in the literature.
Collapse
Affiliation(s)
- Randolf Rotta
- Brandenburgische Technische Universität Cottbus, Germany
| | - Andreas Noack
- Brandenburgische Technische Universität Cottbus, Germany
| |
Collapse
|
508
|
Tibély G, Kovanen L, Karsai M, Kaski K, Kertész J, Saramäki J. Communities and beyond: mesoscopic analysis of a large social network with complementary methods. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 83:056125. [PMID: 21728623 DOI: 10.1103/physreve.83.056125] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2010] [Revised: 03/15/2011] [Indexed: 05/31/2023]
Abstract
Community detection methods have so far been tested mostly on small empirical networks and on synthetic benchmarks. Much less is known about their performance on large real-world networks, which nonetheless are a significant target for application. We analyze the performance of three state-of-the-art community detection methods by using them to identify communities in a large social network constructed from mobile phone call records. We find that all methods detect communities that are meaningful in some respects but fall short in others, and that there often is a hierarchical relationship between communities detected by different methods. Our results suggest that community detection methods could be useful in studying the general mesoscale structure of networks, as opposed to only trying to identify dense structures.
Collapse
Affiliation(s)
- Gergely Tibély
- Institute of Physics and HAS-BME Condensed Matter Group, BME, Budapest, Budafoki út 8., H-1111, Hungary
| | | | | | | | | | | |
Collapse
|
509
|
Lancichinetti A, Radicchi F, Ramasco JJ, Fortunato S. Finding statistically significant communities in networks. PLoS One 2011; 6:e18961. [PMID: 21559480 PMCID: PMC3084717 DOI: 10.1371/journal.pone.0018961] [Citation(s) in RCA: 252] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2010] [Accepted: 03/14/2011] [Indexed: 11/18/2022] Open
Abstract
Community structure is one of the main structural features of networks, revealing both their internal organization and the similarity of their elementary units. Despite the large variety of methods proposed to detect communities in graphs, there is a big need for multi-purpose techniques, able to handle different types of datasets and the subtleties of community structure. In this paper we present OSLOM (Order Statistics Local Optimization Method), the first method capable to detect clusters in networks accounting for edge directions, edge weights, overlapping communities, hierarchies and community dynamics. It is based on the local optimization of a fitness function expressing the statistical significance of clusters with respect to random fluctuations, which is estimated with tools of Extreme and Order Statistics. OSLOM can be used alone or as a refinement procedure of partitions/covers delivered by other techniques. We have also implemented sequential algorithms combining OSLOM with other fast techniques, so that the community structure of very large networks can be uncovered. Our method has a comparable performance as the best existing algorithms on artificial benchmark graphs. Several applications on real networks are shown as well. OSLOM is implemented in a freely available software (http://www.oslom.org), and we believe it will be a valuable tool in the analysis of networks.
Collapse
Affiliation(s)
- Andrea Lancichinetti
- Complex Networks and Systems Lagrange
Laboratory, Institute for Scientific Interchange (ISI), Torino,
Italy
- Physics Department, Politecnico di Torino,
Torino, Italy
| | - Filippo Radicchi
- Howard Hughes Medical Institute (HHMI),
Northwestern University, Evanston, Illinois, United States of
America
| | - José J. Ramasco
- Complex Networks and Systems Lagrange
Laboratory, Institute for Scientific Interchange (ISI), Torino,
Italy
- Instituto de Física Interdisciplinar y
Sistemas Complejos IFISC (CSIC-UIB), Palma de Mallorca, Spain
| | - Santo Fortunato
- Complex Networks and Systems Lagrange
Laboratory, Institute for Scientific Interchange (ISI), Torino,
Italy
| |
Collapse
|
510
|
Scalco R, Caflisch A. Equilibrium Distribution from Distributed Computing (Simulations of Protein Folding). J Phys Chem B 2011; 115:6358-65. [DOI: 10.1021/jp2014918] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Riccardo Scalco
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Amedeo Caflisch
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| |
Collapse
|
511
|
Multilevel compression of random walks on networks reveals hierarchical organization in large integrated systems. PLoS One 2011; 6:e18209. [PMID: 21494658 PMCID: PMC3072965 DOI: 10.1371/journal.pone.0018209] [Citation(s) in RCA: 128] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2011] [Accepted: 02/22/2011] [Indexed: 11/29/2022] Open
Abstract
To comprehend the hierarchical organization of large integrated systems, we introduce the hierarchical map equation, which reveals multilevel structures in networks. In this information-theoretic approach, we exploit the duality between compression and pattern detection; by compressing a description of a random walker as a proxy for real flow on a network, we find regularities in the network that induce this system-wide flow. Finding the shortest multilevel description of the random walker therefore gives us the best hierarchical clustering of the network — the optimal number of levels and modular partition at each level — with respect to the dynamics on the network. With a novel search algorithm, we extract and illustrate the rich multilevel organization of several large social and biological networks. For example, from the global air traffic network we uncover countries and continents, and from the pattern of scientific communication we reveal more than 100 scientific fields organized in four major disciplines: life sciences, physical sciences, ecology and earth sciences, and social sciences. In general, we find shallow hierarchical structures in globally interconnected systems, such as neural networks, and rich multilevel organizations in systems with highly separated regions, such as road networks.
Collapse
|
512
|
Khadivi A, Ajdari Rad A, Hasler M. Network community-detection enhancement by proper weighting. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 83:046104. [PMID: 21599237 DOI: 10.1103/physreve.83.046104] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2010] [Revised: 12/22/2010] [Indexed: 05/30/2023]
Abstract
In this paper, we show how proper assignment of weights to the edges of a complex network can enhance the detection of communities and how it can circumvent the resolution limit and the extreme degeneracy problems associated with modularity. Our general weighting scheme takes advantage of graph theoretic measures and it introduces two heuristics for tuning its parameters. We use this weighting as a preprocessing step for the greedy modularity optimization algorithm of Newman to improve its performance. The result of the experiments of our approach on computer-generated and real-world data networks confirm that the proposed approach not only mitigates the problems of modularity but also improves the modularity optimization.
Collapse
Affiliation(s)
- Alireza Khadivi
- Laboratory of Nonlinear Systems, School of Computer and Communication Sciences, Ecole Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | | | | |
Collapse
|
513
|
Subelj L, Bajec M. Unfolding communities in large complex networks: combining defensive and offensive label propagation for core extraction. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 83:036103. [PMID: 21517554 DOI: 10.1103/physreve.83.036103] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Revised: 11/06/2010] [Indexed: 05/30/2023]
Abstract
Label propagation has proven to be a fast method for detecting communities in large complex networks. Recent developments have also improved the accuracy of the approach; however, a general algorithm is still an open issue. We present an advanced label propagation algorithm that combines two unique strategies of community formation, namely, defensive preservation and offensive expansion of communities. The two strategies are combined in a hierarchical manner to recursively extract the core of the network and to identify whisker communities. The algorithm was evaluated on two classes of benchmark networks with planted partition and on 23 real-world networks ranging from networks with tens of nodes to networks with several tens of millions of edges. It is shown to be comparable to the current state-of-the-art community detection algorithms and superior to all previous label propagation algorithms, with comparable time complexity. In particular, analysis on real-world networks has proven that the algorithm has almost linear complexity, O(m¹·¹⁹), and scales even better than the basic label propagation algorithm (m is the number of edges in the network).
Collapse
Affiliation(s)
- Lovro Subelj
- Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia.
| | | |
Collapse
|
514
|
Zlatić V, Gabrielli A, Caldarelli G. Topologically biased random walk and community finding in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 82:066109. [PMID: 21230707 DOI: 10.1103/physreve.82.066109] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/17/2010] [Revised: 10/26/2010] [Indexed: 05/30/2023]
Abstract
We present an approach of topology biased random walks for undirected networks. We focus on a one-parameter family of biases, and by using a formal analogy with perturbation theory in quantum mechanics we investigate the features of biased random walks. This analogy is extended through the use of parametric equations of motion to study the features of random walks vs parameter values. Furthermore, we show an analysis of the spectral gap maximum associated with the value of the second eigenvalue of the transition matrix related to the relaxation rate to the stationary state. Applications of these studies allow ad hoc algorithms for the exploration of complex networks and their communities.
Collapse
Affiliation(s)
- Vinko Zlatić
- Istituto Sistemi Complessi-CNR, UOS Sapienza, Dipartimento di Fisica, Università Sapienza, Rome, Italy
| | | | | |
Collapse
|
515
|
Lancichinetti A, Kivelä M, Saramäki J, Fortunato S. Characterizing the community structure of complex networks. PLoS One 2010; 5:e11976. [PMID: 20711338 PMCID: PMC2920821 DOI: 10.1371/journal.pone.0011976] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2010] [Accepted: 07/12/2010] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Community structure is one of the key properties of complex networks and plays a crucial role in their topology and function. While an impressive amount of work has been done on the issue of community detection, very little attention has been so far devoted to the investigation of communities in real networks. METHODOLOGY/PRINCIPAL FINDINGS We present a systematic empirical analysis of the statistical properties of communities in large information, communication, technological, biological, and social networks. We find that the mesoscopic organization of networks of the same category is remarkably similar. This is reflected in several characteristics of community structure, which can be used as "fingerprints" of specific network categories. While community size distributions are always broad, certain categories of networks consist mainly of tree-like communities, while others have denser modules. Average path lengths within communities initially grow logarithmically with community size, but the growth saturates or slows down for communities larger than a characteristic size. This behaviour is related to the presence of hubs within communities, whose roles differ across categories. Also the community embeddedness of nodes, measured in terms of the fraction of links within their communities, has a characteristic distribution for each category. CONCLUSIONS/SIGNIFICANCE Our findings, verified by the use of two fundamentally different community detection methods, allow for a classification of real networks and pave the way to a realistic modelling of networks' evolution.
Collapse
Affiliation(s)
- Andrea Lancichinetti
- Complex Networks and Systems Lagrange Laboratory, Institute for Scientific Interchange (ISI), Torino, Italy
| | - Mikko Kivelä
- Department of Biomedical Engineering and Computational Science, Aalto University, Espoo, Finland
| | - Jari Saramäki
- Department of Biomedical Engineering and Computational Science, Aalto University, Espoo, Finland
| | - Santo Fortunato
- Complex Networks and Systems Lagrange Laboratory, Institute for Scientific Interchange (ISI), Torino, Italy
| |
Collapse
|
516
|
Ma X, Gao L, Yong X. Eigenspaces of networks reveal the overlapping and hierarchical community structure more precisely. JOURNAL OF STATISTICAL MECHANICS: THEORY AND EXPERIMENT 2010; 2010:P08012. [DOI: 10.1088/1742-5468/2010/08/p08012] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
517
|
Chen X, Yan GY, Liao XP. A Novel Candidate Disease Genes Prioritization Method Based on Module Partition and Rank Fusion. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2010; 14:337-56. [DOI: 10.1089/omi.2009.0143] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Xing Chen
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, People's Republic of China
- Graduate University of Chinese Academy of Sciences, Beijing 100190, People's Republic of China
| | - Gui-Ying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, People's Republic of China
| | - Xiao-Ping Liao
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, People's Republic of China
- Graduate University of Chinese Academy of Sciences, Beijing 100190, People's Republic of China
| |
Collapse
|
518
|
Lewis ACF, Jones NS, Porter MA, Deane CM. The function of communities in protein interaction networks at multiple scales. BMC SYSTEMS BIOLOGY 2010; 4:100. [PMID: 20649971 PMCID: PMC2917431 DOI: 10.1186/1752-0509-4-100] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2010] [Accepted: 07/22/2010] [Indexed: 01/13/2023]
Abstract
BACKGROUND If biology is modular then clusters, or communities, of proteins derived using only protein interaction network structure should define protein modules with similar biological roles. We investigate the link between biological modules and network communities in yeast and its relationship to the scale at which we probe the network. RESULTS Our results demonstrate that the functional homogeneity of communities depends on the scale selected, and that almost all proteins lie in a functionally homogeneous community at some scale. We judge functional homogeneity using a novel test and three independent characterizations of protein function, and find a high degree of overlap between these measures. We show that a high mean clustering coefficient of a community can be used to identify those that are functionally homogeneous. By tracing the community membership of a protein through multiple scales we demonstrate how our approach could be useful to biologists focusing on a particular protein. CONCLUSIONS We show that there is no one scale of interest in the community structure of the yeast protein interaction network, but we can identify the range of resolution parameters that yield the most functionally coherent communities, and predict which communities are most likely to be functionally homogeneous.
Collapse
Affiliation(s)
- Anna C F Lewis
- Department of Statistics, University of Oxford, Oxford, UK
| | | | | | | |
Collapse
|
519
|
Ahn YY, Bagrow JP, Lehmann S. Link communities reveal multiscale complexity in networks. Nature 2010; 466:761-4. [PMID: 20562860 DOI: 10.1038/nature09182] [Citation(s) in RCA: 551] [Impact Index Per Article: 36.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2009] [Accepted: 05/13/2010] [Indexed: 02/07/2023]
Abstract
Networks have become a key approach to understanding systems of interacting objects, unifying the study of diverse phenomena including biological organisms and human society. One crucial step when studying the structure and dynamics of networks is to identify communities: groups of related nodes that correspond to functional subunits such as protein complexes or social spheres. Communities in networks often overlap such that nodes simultaneously belong to several groups. Meanwhile, many networks are known to possess hierarchical organization, where communities are recursively grouped into a hierarchical structure. However, the fact that many real networks have communities with pervasive overlap, where each and every node belongs to more than one group, has the consequence that a global hierarchy of nodes cannot capture the relationships between overlapping groups. Here we reinvent communities as groups of links rather than nodes and show that this unorthodox approach successfully reconciles the antagonistic organizing principles of overlapping communities and hierarchy. In contrast to the existing literature, which has entirely focused on grouping nodes, link communities naturally incorporate overlap while revealing hierarchical organization. We find relevant link communities in many networks, including major biological networks such as protein-protein interaction and metabolic networks, and show that a large social network contains hierarchically organized community structures spanning inner-city to regional scales while maintaining pervasive overlap. Our results imply that link communities are fundamental building blocks that reveal overlap and hierarchical organization in networks to be two aspects of the same phenomenon.
Collapse
Affiliation(s)
- Yong-Yeol Ahn
- Center for Complex Network Research, Department of Physics, Northeastern University, Boston, Massachusetts 02115, USA
| | | | | |
Collapse
|
520
|
Ronhovde P, Nussinov Z. Local resolution-limit-free Potts model for community detection. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:046114. [PMID: 20481793 DOI: 10.1103/physreve.81.046114] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2009] [Revised: 02/28/2010] [Indexed: 05/29/2023]
Abstract
We report on an exceptionally accurate spin-glass-type Potts model for community detection. With a simple algorithm, we find that our approach is at least as accurate as the best currently available algorithms and robust to the effects of noise. It is also competitive with the best currently available algorithms in terms of speed and size of solvable systems. We find that the computational demand often exhibits superlinear scaling O(L1.3) where L is the number of edges in the system, and we have applied the algorithm to synthetic systems as large as 40 x 10(6) nodes and over 1 x 10(9) edges. A previous stumbling block encountered by popular community detection methods is the so-called "resolution limit." Being a "local" measure of community structure, our Potts model is free from this resolution-limit effect, and it further remains a local measure on weighted and directed graphs. We also address the mitigation of resolution-limit effects for two other popular Potts models.
Collapse
Affiliation(s)
- Peter Ronhovde
- Department of Physics, Washington University, St Louis, Missouri 63130, USA
| | | |
Collapse
|
521
|
Abstract
BACKGROUND As the size of the known human interactome grows, biologists increasingly rely on computational tools to identify patterns that represent protein complexes and pathways. Previous studies have shown that densely connected network components frequently correspond to community structure and functionally related modules. In this work, we present a novel method to identify densely connected and bipartite network modules based on a log odds score for shared neighbours. RESULTS To evaluate the performance of our method (NeMo), we compare it to other widely used tools for community detection including kMetis, MCODE, and spectral clustering. We test these methods on a collection of synthetically constructed networks and the set of MIPS human complexes. We apply our method to the CXC chemokine pathway and find a high scoring functional module of 12 disconnected phospholipase isoforms. CONCLUSION We present a novel method that combines a unique neighbour-sharing score with hierarchical agglomerative clustering to identify diverse network communities. The approach is unique in that we identify both dense network and dense bipartite network structures in a single approach. Our results suggest that the performance of NeMo is better than or competitive with leading approaches on both real and synthetic datasets. We minimize model complexity and generalization error in the Bayesian spirit by integrating out nuisance parameters. An implementation of our method is freely available for download as a plugin to Cytoscape through our website and through Cytoscape itself.
Collapse
Affiliation(s)
- Corban G Rivera
- Department of Biomedical Engineering and High-Throughput Biology Center, Johns Hopkins School of Medicine, Baltimore, MD 21218, USA.
| | | | | |
Collapse
|