401
|
Lu Z, Zhu Y, Li W, Wu W, Cheng X. Influence-based community partition for social networks. COMPUTATIONAL SOCIAL NETWORKS 2014. [DOI: 10.1186/s40649-014-0001-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
402
|
Esmailian P, Abtahi SE, Jalili M. Mesoscopic analysis of online social networks: the role of negative ties. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 90:042817. [PMID: 25375559 DOI: 10.1103/physreve.90.042817] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2014] [Indexed: 05/24/2023]
Abstract
A class of networks are those with both positive and negative links. In this manuscript, we studied the interplay between positive and negative ties on mesoscopic level of these networks, i.e., their community structure. A community is considered as a tightly interconnected group of actors; therefore, it does not borrow any assumption from balance theory and merely uses the well-known assumption in the community detection literature. We found that if one detects the communities based on only positive relations (by ignoring the negative ones), the majority of negative relations are already placed between the communities. In other words, negative ties do not have a major role in community formation of signed networks. Moreover, regarding the internal negative ties, we proved that most unbalanced communities are maximally balanced, and hence they cannot be partitioned into k nonempty sub-clusters with higher balancedness (k≥2). Furthermore, we showed that although the mediator triad ++- (hostile-mediator-hostile) is underrepresented, it constitutes a considerable portion of triadic relations among communities. Hence, mediator triads should not be ignored by community detection and clustering algorithms. As a result, if one uses a clustering algorithm that operates merely based on social balance, mesoscopic structure of signed networks significantly remains hidden.
Collapse
Affiliation(s)
- Pouya Esmailian
- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
| | - Seyed Ebrahim Abtahi
- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
| | - Mahdi Jalili
- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
| |
Collapse
|
403
|
Tang X, Xu T, Feng X, Yang G. Uncovering community structures with initialized Bayesian nonnegative matrix factorization. PLoS One 2014; 9:e107884. [PMID: 25268494 PMCID: PMC4182427 DOI: 10.1371/journal.pone.0107884] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2014] [Accepted: 08/19/2014] [Indexed: 11/18/2022] Open
Abstract
Uncovering community structures is important for understanding networks. Currently, several nonnegative matrix factorization algorithms have been proposed for discovering community structure in complex networks. However, these algorithms exhibit some drawbacks, such as unstable results and inefficient running times. In view of the problems, a novel approach that utilizes an initialized Bayesian nonnegative matrix factorization model for determining community membership is proposed. First, based on singular value decomposition, we obtain simple initialized matrix factorizations from approximate decompositions of the complex network’s adjacency matrix. Then, within a few iterations, the final matrix factorizations are achieved by the Bayesian nonnegative matrix factorization method with the initialized matrix factorizations. Thus, the network’s community structure can be determined by judging the classification of nodes with a final matrix factor. Experimental results show that the proposed method is highly accurate and offers competitive performance to that of the state-of-the-art methods even though it is not designed for the purpose of modularity maximization.
Collapse
Affiliation(s)
- Xianchao Tang
- School of Computer Science and Technology, Tianjin University, Tianjin, China
- * E-mail:
| | - Tao Xu
- School of Computer Science and Technology, Civil Aviation University of China, Tianjin, China
- Information Technology Research Base of Civil Aviation Administration of China, Tianjin, China
| | - Xia Feng
- School of Computer Science and Technology, Civil Aviation University of China, Tianjin, China
- Information Technology Research Base of Civil Aviation Administration of China, Tianjin, China
| | - Guoqing Yang
- School of Computer Science and Technology, Tianjin University, Tianjin, China
- Information Technology Research Base of Civil Aviation Administration of China, Tianjin, China
| |
Collapse
|
404
|
Lipowski A, Lipowska D. Generic criticality of community structure in random graphs. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 90:032815. [PMID: 25314489 DOI: 10.1103/physreve.90.032815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2013] [Indexed: 06/04/2023]
Abstract
We examine a community structure in random graphs of size n and link probability p/n determined with the Newman greedy optimization of modularity. Calculations show that for p<1 communities are nearly identical with clusters. For p=1 the average sizes of a community s(av) and of the giant community s(g) show a power-law increase s(av)∼n(α') and s(g)∼n(α). From numerical results we estimate α'≈0.26(1) and α≈0.50(1) and using the probability distribution of sizes of communities we suggest that α'=α/2 should hold. For p>1 the community structure remains critical: (i) s(av) and s(g) have a power-law increase with α'≈α<1 and (ii) the probability distribution of sizes of communities is very broad and nearly flat for all sizes up to s(g). For large p the modularity Q decays as Q∼p(-0.55), which is intermediate between some previous estimations. To check the validity of the results, we also determine the community structure using another method, namely, a nongreedy optimization of modularity. Tests with some benchmark networks show that the method outperforms the greedy version. For random graphs, however, the characteristics of the community structure determined using both greedy and nongreedy optimizations are, within small statistical fluctuations, the same.
Collapse
Affiliation(s)
- Adam Lipowski
- Faculty of Physics, Adam Mickiewicz University, 61-614 Poznań, Poland
| | - Dorota Lipowska
- Faculty of Modern Languages and Literature, Adam Mickiewicz University, 61-614 Poznań, Poland
| |
Collapse
|
405
|
Wilson JD, Wang S, Mucha PJ, Bhamidi S, Nobel AB. A testing based extraction algorithm for identifying significant communities in networks. Ann Appl Stat 2014. [DOI: 10.1214/14-aoas760] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
406
|
Fushing H, Chen C, Liu SY, Koehl P. Bootstrapping on undirected binary networks via statistical mechanics. JOURNAL OF STATISTICAL PHYSICS 2014; 156:823-842. [PMID: 25071295 PMCID: PMC4111278 DOI: 10.1007/s10955-014-1043-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
We propose a new method inspired from statistical mechanics for extracting geometric information from undirected binary networks and generating random networks that conform to this geometry. In this method an undirected binary network is perceived as a thermodynamic system with a collection of permuted adjacency matrices as its states. The task of extracting information from the network is then reformulated as a discrete combinatorial optimization problem of searching for its ground state. To solve this problem, we apply multiple ensembles of temperature regulated Markov chains to establish an ultrametric geometry on the network. This geometry is equipped with a tree hierarchy that captures the multiscale community structure of the network. We translate this geometry into a Parisi adjacency matrix, which has a relative low energy level and is in the vicinity of the ground state. The Parisi adjacency matrix is then further optimized by making block permutations subject to the ultrametric geometry. The optimal matrix corresponds to the macrostate of the original network. An ensemble of random networks is then generated such that each of these networks conforms to this macrostate; the corresponding algorithm also provides an estimate of the size of this ensemble. By repeating this procedure at different scales of the ultrametric geometry of the network, it is possible to compute its evolution entropy, i.e. to estimate the evolution of its complexity as we move from a coarse to a ne description of its geometric structure. We demonstrate the performance of this method on simulated as well as real data networks.
Collapse
Affiliation(s)
- Hsieh Fushing
- Department of Statistics, University of California, Davis, 1 Shields Ave, Davis, CA 95616
| | - Chen Chen
- Department of Statistics, University of California, Davis, 1 Shields Ave, Davis, CA 95616
| | - Shan-Yu Liu
- Department of Statistics, University of California, Davis, 1 Shields Ave, Davis, CA 95616
| | - Patrice Koehl
- Department of Computer Science, University of California, Davis, 1 Shields Ave, Davis, CA 95616
| |
Collapse
|
407
|
Robust brain parcellation using sparse representation on resting-state fMRI. Brain Struct Funct 2014; 220:3565-79. [PMID: 25156576 PMCID: PMC4575697 DOI: 10.1007/s00429-014-0874-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2014] [Accepted: 08/07/2014] [Indexed: 11/13/2022]
Abstract
Resting-state fMRI (rs-fMRI) has been widely used to segregate the brain into individual modules based on the presence of distinct connectivity patterns. Many parcellation methods have been proposed for brain parcellation using rs-fMRI, but their results have been somewhat inconsistent, potentially due to various types of noise. In this study, we provide a robust parcellation method for rs-fMRI-based brain parcellation, which constructs a sparse similarity graph based on the sparse representation coefficients of each seed voxel and then uses spectral clustering to identify distinct modules. Both the local time-varying BOLD signals and whole-brain connectivity patterns may be used as features and yield similar parcellation results. The robustness of our method was tested on both simulated and real rs-fMRI datasets. In particular, on simulated rs-fMRI data, sparse representation achieved good performance across different noise levels, including high accuracy of parcellation and high robustness to noise. On real rs-fMRI data, stable parcellation of the medial frontal cortex (MFC) and parietal operculum (OP) were achieved on three different datasets, with high reproducibility within each dataset and high consistency across these results. Besides, the parcellation of MFC was little influenced by the degrees of spatial smoothing. Furthermore, the consistent parcellation of OP was also well corresponding to cytoarchitectonic subdivisions and known somatotopic organizations. Our results demonstrate a new promising approach to robust brain parcellation using resting-state fMRI by sparse representation.
Collapse
|
408
|
Vitali S, Battiston S. The community structure of the global corporate network. PLoS One 2014; 9:e104655. [PMID: 25126722 PMCID: PMC4134229 DOI: 10.1371/journal.pone.0104655] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2013] [Accepted: 07/16/2014] [Indexed: 11/26/2022] Open
Abstract
We investigate the community structure of the global ownership network of transnational corporations. We find a pronounced organization in communities that cannot be explained by randomness. Despite the global character of this network, communities reflect first of all the geographical location of firms, while the industrial sector plays only a marginal role. We also analyze the meta-network in which the nodes are the communities and the links are obtained by aggregating the links among firms belonging to pairs of communities. We analyze the network centrality of the top 50 communities and we provide a quantitative assessment of the financial sector role in connecting the global economy.
Collapse
Affiliation(s)
- Stefania Vitali
- Department of Banking and Finance, University of Zurich, Zurich, Switzerland; Dipartimento di Scienze Economiche e Sociali, Università Politecnica delle Marche, Ancona, Italy
| | - Stefano Battiston
- Department of Banking and Finance, University of Zurich, Zurich, Switzerland
| |
Collapse
|
409
|
|
410
|
Stock portfolio structure of individual investors infers future trading behavior. PLoS One 2014; 9:e103006. [PMID: 25068302 PMCID: PMC4113374 DOI: 10.1371/journal.pone.0103006] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2014] [Accepted: 06/24/2014] [Indexed: 11/19/2022] Open
Abstract
Although the understanding of and motivation behind individual trading behavior is an important puzzle in finance, little is known about the connection between an investor's portfolio structure and her trading behavior in practice. In this paper, we investigate the relation between what stocks investors hold, and what stocks they buy, and show that investors with similar portfolio structures to a great extent trade in a similar way. With data from the central register of shareholdings in Sweden, we model the market in a similarity network, by considering investors as nodes, connected with links representing portfolio similarity. From the network, we find investor groups that not only identify different investment strategies, but also represent individual investors trading in a similar way. These findings suggest that the stock portfolios of investors hold meaningful information, which could be used to earn a better understanding of stock market dynamics.
Collapse
|
411
|
Pan G, Zhang W, Wu Z, Li S. Online community detection for large complex networks. PLoS One 2014; 9:e102799. [PMID: 25061683 PMCID: PMC4111306 DOI: 10.1371/journal.pone.0102799] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2013] [Accepted: 06/23/2014] [Indexed: 11/22/2022] Open
Abstract
Complex networks describe a wide range of systems in nature and society. To understand complex networks, it is crucial to investigate their community structure. In this paper, we develop an online community detection algorithm with linear time complexity for large complex networks. Our algorithm processes a network edge by edge in the order that the network is fed to the algorithm. If a new edge is added, it just updates the existing community structure in constant time, and does not need to re-compute the whole network. Therefore, it can efficiently process large networks in real time. Our algorithm optimizes expected modularity instead of modularity at each step to avoid poor performance. The experiments are carried out using 11 public data sets, and are measured by two criteria, modularity and NMI (Normalized Mutual Information). The results show that our algorithm's running time is less than the commonly used Louvain algorithm while it gives competitive performance.
Collapse
Affiliation(s)
- Gang Pan
- Department of Computer Science, Zhejiang University, Hangzhou, Zhejiang, China
- * E-mail:
| | - Wangsheng Zhang
- Department of Computer Science, Zhejiang University, Hangzhou, Zhejiang, China
| | - Zhaohui Wu
- Department of Computer Science, Zhejiang University, Hangzhou, Zhejiang, China
| | - Shijian Li
- Department of Computer Science, Zhejiang University, Hangzhou, Zhejiang, China
| |
Collapse
|
412
|
Harenberg S, Bello G, Gjeltema L, Ranshous S, Harlalka J, Seay R, Padmanabhan K, Samatova N. Community detection in large‐scale networks: a survey and empirical evaluation. ACTA ACUST UNITED AC 2014. [DOI: 10.1002/wics.1319] [Citation(s) in RCA: 133] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Steve Harenberg
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Gonzalo Bello
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - L. Gjeltema
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Stephen Ranshous
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Jitendra Harlalka
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Ramona Seay
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Kanchana Padmanabhan
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Nagiza Samatova
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| |
Collapse
|
413
|
Abstract
Network methods have had profound influence in many domains and disciplines in the past decade. Community structure is a very important property of complex networks, but the accurate definition of a community remains an open problem. Here we defined community based on three properties, and then propose a simple and novel framework to detect communities based on network topology. We analyzed 16 different types of networks, and compared our partitions with Infomap, LPA, Fastgreedy and Walktrap, which are popular algorithms for community detection. Most of the partitions generated using our approach compare favorably to those generated by these other algorithms. Furthermore, we define overlapping nodes that combine community structure with shortest paths. We also analyzed the E. Coli. transcriptional regulatory network in detail, and identified modules with strong functional coherence.
Collapse
|
414
|
Tomasello MV, Perra N, Tessone CJ, Karsai M, Schweitzer F. The role of endogenous and exogenous mechanisms in the formation of R&D networks. Sci Rep 2014; 4:5679. [PMID: 25022561 PMCID: PMC4097357 DOI: 10.1038/srep05679] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2014] [Accepted: 06/18/2014] [Indexed: 11/09/2022] Open
Abstract
We develop an agent-based model of strategic link formation in Research and Development (R&D) networks. Empirical evidence has shown that the growth of these networks is driven by mechanisms which are both endogenous to the system (that is, depending on existing alliances patterns) and exogenous (that is, driven by an exploratory search for newcomer firms). Extant research to date has not investigated both mechanisms simultaneously in a comparative manner. To overcome this limitation, we develop a general modeling framework to shed light on the relative importance of these two mechanisms. We test our model against a comprehensive dataset, listing cross-country and cross-sectoral R&D alliances from 1984 to 2009. Our results show that by fitting only three macroscopic properties of the network topology, this framework is able to reproduce a number of micro-level measures, including the distributions of degree, local clustering, path length and component size, and the emergence of network clusters. Furthermore, by estimating the link probabilities towards newcomers and established firms from the data, we find that endogenous mechanisms are predominant over the exogenous ones in the network formation, thus quantifying the importance of existing structures in selecting partner firms.
Collapse
Affiliation(s)
- Mario V. Tomasello
- Chair of Systems Design, Department of Management, Technology and Economics (D-MTEC), ETH Zurich, Weinbergstrasse 56/58, 8092 Zurich, Switzerland
| | - Nicola Perra
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA 02115, USA
| | - Claudio J. Tessone
- Chair of Systems Design, Department of Management, Technology and Economics (D-MTEC), ETH Zurich, Weinbergstrasse 56/58, 8092 Zurich, Switzerland
| | - Márton Karsai
- Laboratoire de l'Informatique du Parallélisme, INRIA-UMR 5668, IXXI, ENS de Lyon, 69364 Lyon, France
| | - Frank Schweitzer
- Chair of Systems Design, Department of Management, Technology and Economics (D-MTEC), ETH Zurich, Weinbergstrasse 56/58, 8092 Zurich, Switzerland
| |
Collapse
|
415
|
Sobolevsky S, Campari R, Belyi A, Ratti C. General optimization technique for high-quality community detection in complex networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 90:012811. [PMID: 25122346 DOI: 10.1103/physreve.90.012811] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Indexed: 05/05/2023]
Abstract
Recent years have witnessed the development of a large body of algorithms for community detection in complex networks. Most of them are based upon the optimization of objective functions, among which modularity is the most common, though a number of alternatives have been suggested in the scientific literature. We present here an effective general search strategy for the optimization of various objective functions for community detection purposes. When applied to modularity, on both real-world and synthetic networks, our search strategy substantially outperforms the best existing algorithms in terms of final scores of the objective function. In terms of execution time for modularity optimization this approach also outperforms most of the alternatives present in literature with the exception of fastest but usually less efficient greedy algorithms. The networks of up to 30000 nodes can be analyzed in time spans ranging from minutes to a few hours on average workstations, making our approach readily applicable to tasks not limited by strict time constraints but requiring the quality of partitioning to be as high as possible. Some examples are presented in order to demonstrate how this quality could be affected by even relatively small changes in the modularity score stressing the importance of optimization accuracy.
Collapse
Affiliation(s)
- Stanislav Sobolevsky
- SENSEable City Laboratory, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
| | - Riccardo Campari
- SENSEable City Laboratory, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
| | - Alexander Belyi
- Belarusian State University, 4 Nezavisimosti Avenue, Minsk, Belarus and SENSEable City Laboratory, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
| | - Carlo Ratti
- SENSEable City Laboratory, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
| |
Collapse
|
416
|
Liu X, Murata T, Wakita K. Detecting network communities beyond assortativity-related attributes. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 90:012806. [PMID: 25122341 DOI: 10.1103/physreve.90.012806] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2013] [Indexed: 06/03/2023]
Abstract
In network science, assortativity refers to the tendency of links to exist between nodes with similar attributes. In social networks, for example, links tend to exist between individuals of similar age, nationality, location, race, income, educational level, religious belief, and language. Thus, various attributes jointly affect the network topology. An interesting problem is to detect community structure beyond some specific assortativity-related attributes ρ, i.e., to take out the effect of ρ on network topology and reveal the hidden community structures which are due to other attributes. An approach to this problem is to redefine the null model of the modularity measure, so as to simulate the effect of ρ on network topology. However, a challenge is that we do not know to what extent the network topology is affected by ρ and by other attributes. In this paper, we propose a distance modularity, which allows us to freely choose any suitable function to simulate the effect of ρ. Such freedom can help us probe the effect of ρ and detect the hidden communities which are due to other attributes. We test the effectiveness of distance modularity on synthetic benchmarks and two real-world networks.
Collapse
Affiliation(s)
- Xin Liu
- Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro, Tokyo, 152-8552 Japan and CREST, Japan Science and Technology Agency, K's Gobancho, 7 Gobancho, Chiyoda, Tokyo, 102-0076 Japan and Wuhan University of Technology, 122 Luoshi Road, Wuhan, Hubei, 430070 China
| | - Tsuyoshi Murata
- Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro, Tokyo, 152-8552 Japan
| | - Ken Wakita
- Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro, Tokyo, 152-8552 Japan and CREST, Japan Science and Technology Agency, K's Gobancho, 7 Gobancho, Chiyoda, Tokyo, 102-0076 Japan
| |
Collapse
|
417
|
Exploring function prediction in protein interaction networks via clustering methods. PLoS One 2014; 9:e99755. [PMID: 24972109 PMCID: PMC4074043 DOI: 10.1371/journal.pone.0099755] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2014] [Accepted: 05/17/2014] [Indexed: 01/06/2023] Open
Abstract
Complex networks have recently become the focus of research in many fields. Their structure reveals crucial information for the nodes, how they connect and share information. In our work we analyze protein interaction networks as complex networks for their functional modular structure and later use that information in the functional annotation of proteins within the network. We propose several graph representations for the protein interaction network, each having different level of complexity and inclusion of the annotation information within the graph. We aim to explore what the benefits and the drawbacks of these proposed graphs are, when they are used in the function prediction process via clustering methods. For making this cluster based prediction, we adopt well established approaches for cluster detection in complex networks using most recent representative algorithms that have been proven as efficient in the task at hand. The experiments are performed using a purified and reliable Saccharomyces cerevisiae protein interaction network, which is then used to generate the different graph representations. Each of the graph representations is later analysed in combination with each of the clustering algorithms, which have been possibly modified and implemented to fit the specific graph. We evaluate results in regards of biological validity and function prediction performance. Our results indicate that the novel ways of presenting the complex graph improve the prediction process, although the computational complexity should be taken into account when deciding on a particular approach.
Collapse
|
418
|
Sah P, Singh LO, Clauset A, Bansal S. Exploring community structure in biological networks with random graphs. BMC Bioinformatics 2014; 15:220. [PMID: 24965130 PMCID: PMC4094994 DOI: 10.1186/1471-2105-15-220] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 05/20/2014] [Indexed: 11/25/2022] Open
Abstract
Background Community structure is ubiquitous in biological networks. There has been an increased interest in unraveling the community structure of biological systems as it may provide important insights into a system’s functional components and the impact of local structures on dynamics at a global scale. Choosing an appropriate community detection algorithm to identify the community structure in an empirical network can be difficult, however, as the many algorithms available are based on a variety of cost functions and are difficult to validate. Even when community structure is identified in an empirical system, disentangling the effect of community structure from other network properties such as clustering coefficient and assortativity can be a challenge. Results Here, we develop a generative model to produce undirected, simple, connected graphs with a specified degrees and pattern of communities, while maintaining a graph structure that is as random as possible. Additionally, we demonstrate two important applications of our model: (a) to generate networks that can be used to benchmark existing and new algorithms for detecting communities in biological networks; and (b) to generate null models to serve as random controls when investigating the impact of complex network features beyond the byproduct of degree and modularity in empirical biological networks. Conclusion Our model allows for the systematic study of the presence of community structure and its impact on network function and dynamics. This process is a crucial step in unraveling the functional consequences of the structural properties of biological systems and uncovering the mechanisms that drive these systems.
Collapse
Affiliation(s)
| | | | | | - Shweta Bansal
- Department of Biology, Georgetown University, 20057 Washington DC, USA.
| |
Collapse
|
419
|
Controllability and observability analysis for vertex domination centrality in directed networks. Sci Rep 2014; 4:5399. [PMID: 24954137 PMCID: PMC4066263 DOI: 10.1038/srep05399] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2013] [Accepted: 06/03/2014] [Indexed: 11/25/2022] Open
Abstract
Topological centrality is a significant measure for characterising the relative importance of a node in a complex network. For directed networks that model dynamic processes, however, it is of more practical importance to quantify a vertex's ability to dominate (control or observe) the state of other vertices. In this paper, based on the determination of controllable and observable subspaces under the global minimum-cost condition, we introduce a novel direction-specific index, domination centrality, to assess the intervention capabilities of vertices in a directed network. Statistical studies demonstrate that the domination centrality is, to a great extent, encoded by the underlying network's degree distribution and that most network positions through which one can intervene in a system are vertices with high domination centrality rather than network hubs. To analyse the interaction and functional dependence between vertices when they are used to dominate a network, we define the domination similarity and detect significant functional modules in glossary and metabolic networks through clustering analysis. The experimental results provide strong evidence that our indices are effective and practical in accurately depicting the structure of directed networks.
Collapse
|
420
|
Chakraborty T, Srinivasan S, Ganguly N, Bhowmick S, Mukherjee A. Constant communities in complex networks. Sci Rep 2014; 3:1825. [PMID: 23661107 PMCID: PMC6504828 DOI: 10.1038/srep01825] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Accepted: 04/16/2013] [Indexed: 11/30/2022] Open
Abstract
Identifying community structure is a fundamental problem in network analysis. Most community detection algorithms are based on optimizing a combinatorial parameter, for example modularity. This optimization is generally NP-hard, thus merely changing the vertex order can alter their assignments to the community. However, there has been less study on how vertex ordering influences the results of the community detection algorithms. Here we identify and study the properties of invariant groups of vertices (constant communities) whose assignment to communities are, quite remarkably, not affected by vertex ordering. The percentage of constant communities can vary across different applications and based on empirical results we propose metrics to evaluate these communities. Using constant communities as a pre-processing step, one can significantly reduce the variation of the results. Finally, we present a case study on phoneme network and illustrate that constant communities, quite strikingly, form the core functional units of the larger communities.
Collapse
Affiliation(s)
- Tanmoy Chakraborty
- Dept. of Computer Science & Engg., Indian Institute of Technology, Kharagpur, India - 721302
| | | | | | | | | |
Collapse
|
421
|
Analyzing evolution of research topics with NEViewer: a new method based on dynamic co-word networks. Scientometrics 2014. [DOI: 10.1007/s11192-014-1347-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
422
|
Mall R, Langone R, Suykens JAK. Multilevel hierarchical kernel spectral clustering for real-life large scale complex networks. PLoS One 2014; 9:e99966. [PMID: 24949877 PMCID: PMC4065034 DOI: 10.1371/journal.pone.0099966] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Accepted: 05/20/2014] [Indexed: 11/19/2022] Open
Abstract
Kernel spectral clustering corresponds to a weighted kernel principal component analysis problem in a constrained optimization framework. The primal formulation leads to an eigen-decomposition of a centered Laplacian matrix at the dual level. The dual formulation allows to build a model on a representative subgraph of the large scale network in the training phase and the model parameters are estimated in the validation stage. The KSC model has a powerful out-of-sample extension property which allows cluster affiliation for the unseen nodes of the big data network. In this paper we exploit the structure of the projections in the eigenspace during the validation stage to automatically determine a set of increasing distance thresholds. We use these distance thresholds in the test phase to obtain multiple levels of hierarchy for the large scale network. The hierarchical structure in the network is determined in a bottom-up fashion. We empirically showcase that real-world networks have multilevel hierarchical organization which cannot be detected efficiently by several state-of-the-art large scale hierarchical community detection techniques like the Louvain, OSLOM and Infomap methods. We show that a major advantage of our proposed approach is the ability to locate good quality clusters at both the finer and coarser levels of hierarchy using internal cluster quality metrics on 7 real-life networks.
Collapse
|
423
|
|
424
|
Ma L, Huang H, He Q, Chiew K, Liu Z. Toward seed-insensitive solutions to local community detection. J Intell Inf Syst 2014. [DOI: 10.1007/s10844-014-0315-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
425
|
Nguyen NP, Dinh TN, Shen Y, Thai MT. Dynamic social community detection and its applications. PLoS One 2014; 9:e91431. [PMID: 24722164 PMCID: PMC3982965 DOI: 10.1371/journal.pone.0091431] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2013] [Accepted: 12/18/2013] [Indexed: 11/19/2022] Open
Abstract
Community structure is one of the most commonly observed features of Online Social Networks (OSNs) in reality. The knowledge of this feature is of great advantage: it not only provides helpful insights into developing more efficient social-aware solutions but also promises a wide range of applications enabled by social and mobile networking, such as routing strategies in Mobile Ad Hoc Networks (MANETs) and worm containment in OSNs. Unfortunately, understanding this structure is very challenging, especially in dynamic social networks where social interactions are evolving rapidly. Our work focuses on the following questions: How can we efficiently identify communities in dynamic social networks? How can we adaptively update the network community structure based on its history instead of recomputing from scratch? To this end, we present Quick Community Adaptation (QCA), an adaptive modularity-based framework for not only discovering but also tracing the evolution of network communities in dynamic OSNs. QCA is very fast and efficient in the sense that it adaptively updates and discovers the new community structure based on its history together with the network changes only. This flexible approach makes QCA an ideal framework applicable for analyzing large-scale dynamic social networks due to its lightweight computing-resource requirement. To illustrate the effectiveness of our framework, we extensively test QCA on both synthesized and real-world social networks including Enron, arXiv e-print citation, and Facebook networks. Finally, we demonstrate the applicability of QCA in real applications: (1) A social-aware message forwarding strategy in MANETs, and (2) worm propagation containment in OSNs. Competitive results in comparison with other methods reveal that social-based techniques employing QCA as a community detection core outperform current available methods.
Collapse
Affiliation(s)
- Nam P. Nguyen
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, Florida, United States of America
- * E-mail:
| | - Thang N. Dinh
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, Florida, United States of America
| | - Yilin Shen
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, Florida, United States of America
| | - My T. Thai
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, Florida, United States of America
| |
Collapse
|
426
|
|
427
|
Lung RI, Chira C, Andreica A. Game theory and extremal optimization for community detection in complex dynamic networks. PLoS One 2014; 9:e86891. [PMID: 24586257 PMCID: PMC3935827 DOI: 10.1371/journal.pone.0086891] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2013] [Accepted: 12/17/2013] [Indexed: 11/23/2022] Open
Abstract
The detection of evolving communities in dynamic complex networks is a challenging problem that recently received attention from the research community. Dynamics clearly add another complexity dimension to the difficult task of community detection. Methods should be able to detect changes in the network structure and produce a set of community structures corresponding to different timestamps and reflecting the evolution in time of network data. We propose a novel approach based on game theory elements and extremal optimization to address dynamic communities detection. Thus, the problem is formulated as a mathematical game in which nodes take the role of players that seek to choose a community that maximizes their profit viewed as a fitness function. Numerical results obtained for both synthetic and real-world networks illustrate the competitive performance of this game theoretical approach.
Collapse
Affiliation(s)
- Rodica Ioana Lung
- Department of Statistics, Forecasting and Mathematics, Babeş-Bolyai University, Cluj Napoca, Romania
- * E-mail:
| | - Camelia Chira
- Department of Computer Science, Babeş-Bolyai University, Cluj Napoca, Romania
| | - Anca Andreica
- Department of Computer Science, Babeş-Bolyai University, Cluj Napoca, Romania
| |
Collapse
|
428
|
Zaki N, Mora A. A comparative analysis of computational approaches and algorithms for protein subcomplex identification. Sci Rep 2014; 4:4262. [PMID: 24584908 PMCID: PMC3939454 DOI: 10.1038/srep04262] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2013] [Accepted: 02/14/2014] [Indexed: 11/09/2022] Open
Abstract
High-throughput AP-MS methods have allowed the identification of many protein complexes. However, most post-processing methods of this type of data have been focused on detection of protein complexes and not its subcomplexes. Here, we review the results of some existing methods that may allow subcomplex detection and propose alternative methods in order to detect subcomplexes from AP-MS data. We assessed and drew comparisons between the use of overlapping clustering methods, methods based in the core-attachment model and our own prediction strategy (TRIBAL). The hypothesis behind TRIBAL is that subcomplex-building information may be concealed in the multiple edges generated by an interaction repeated in different contexts in raw data. The CACHET method offered the best results when the evaluation of the predicted subcomplexes was carried out using both the hypergeometric and geometric scores. TRIBAL offered the best performance when using a strict meet-min score.
Collapse
Affiliation(s)
- Nazar Zaki
- College of Information Technology, United Arab Emirates University, Al AinP.O. Box 17551, United Arab Emirates
| | - Antonio Mora
- 1] College of Information Technology, United Arab Emirates University, Al AinP.O. Box 17551, United Arab Emirates [2] Laboratory of Integrative Systems Medicine (LISM), Institute of Clinical Physiology (IFC), CNR, Pisa, Italy
| |
Collapse
|
429
|
Darst RK, Nussinov Z, Fortunato S. Improving the performance of algorithms to find communities in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 89:032809. [PMID: 24730901 DOI: 10.1103/physreve.89.032809] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Indexed: 06/03/2023]
Abstract
Most algorithms to detect communities in networks typically work without any information on the cluster structure to be found, as one has no a priori knowledge of it, in general. Not surprisingly, knowing some features of the unknown partition could help its identification, yielding an improvement of the performance of the method. Here we show that, if the number of clusters was known beforehand, standard methods, like modularity optimization, would considerably gain in accuracy, mitigating the severe resolution bias that undermines the reliability of the results of the original unconstrained version. The number of clusters can be inferred from the spectra of the recently introduced nonbacktracking and flow matrices, even in benchmark graphs with realistic community structure. The limit of such a two-step procedure is the overhead of the computation of the spectra.
Collapse
Affiliation(s)
- Richard K Darst
- Department of Biomedical Engineering and Computational Science, Aalto University School of Science, P.O. Box 12200, FI-00076, Finland
| | - Zohar Nussinov
- Physics Department, Washington University in St. Louis, CB 1105, One Brookings Drive, St. Louis, Missouri 63130-4899, USA
| | - Santo Fortunato
- Department of Biomedical Engineering and Computational Science, Aalto University School of Science, P.O. Box 12200, FI-00076, Finland
| |
Collapse
|
430
|
Didimo W, Montecchiani F. Fast layout computation of clustered networks: Algorithmic advances and experimental analysis. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2013.09.048] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
431
|
Aldecoa R, Marín I. Exploring the limits of community detection strategies in complex networks. Sci Rep 2014; 3:2216. [PMID: 23860510 PMCID: PMC3713530 DOI: 10.1038/srep02216] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 06/18/2013] [Indexed: 12/05/2022] Open
Abstract
The characterization of network community structure has profound implications in several scientific areas. Therefore, testing the algorithms developed to establish the optimal division of a network into communities is a fundamental problem in the field. We performed here a highly detailed evaluation of community detection algorithms, which has two main novelties: 1) using complex closed benchmarks, which provide precise ways to assess whether the solutions generated by the algorithms are optimal; and, 2) A novel type of analysis, based on hierarchically clustering the solutions suggested by multiple community detection algorithms, which allows to easily visualize how different are those solutions. Surprise, a global parameter that evaluates the quality of a partition, confirms the power of these analyses. We show that none of the community detection algorithms tested provide consistently optimal results in all networks and that Surprise maximization, obtained by combining multiple algorithms, obtains quasi-optimal performances in these difficult benchmarks.
Collapse
Affiliation(s)
- Rodrigo Aldecoa
- Instituto de Biomedicina de Valencia, Consejo Superior de Investigaciones Científicas, Calle Jaime Roig 11, 46010, Valencia, Spain
| | | |
Collapse
|
432
|
Darabos C, White MJ, Graham BE, Leung DN, Williams SM, Moore JH. The multiscale backbone of the human phenotype network based on biological pathways. BioData Min 2014; 7:1. [PMID: 24460644 PMCID: PMC3924922 DOI: 10.1186/1756-0381-7-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Accepted: 12/19/2013] [Indexed: 01/17/2023] Open
Abstract
Background Networks are commonly used to represent and analyze large and complex systems of interacting elements. In systems biology, human disease networks show interactions between disorders sharing common genetic background. We built pathway-based human phenotype network (PHPN) of over 800 physical attributes, diseases, and behavioral traits; based on about 2,300 genes and 1,200 biological pathways. Using GWAS phenotype-to-genes associations, and pathway data from Reactome, we connect human traits based on the common patterns of human biological pathways, detecting more pleiotropic effects, and expanding previous studies from a gene-centric approach to that of shared cell-processes. Results The resulting network has a heavily right-skewed degree distribution, placing it in the scale-free region of the network topologies spectrum. We extract the multi-scale information backbone of the PHPN based on the local densities of the network and discarding weak connection. Using a standard community detection algorithm, we construct phenotype modules of similar traits without applying expert biological knowledge. These modules can be assimilated to the disease classes. However, we are able to classify phenotypes according to shared biology, and not arbitrary disease classes. We present examples of expected clinical connections identified by PHPN as proof of principle. Conclusions We unveil a previously uncharacterized connection between phenotype modules and discuss potential mechanistic connections that are obvious only in retrospect. The PHPN shows tremendous potential to become a useful tool both in the unveiling of the diseases’ common biology, and in the elaboration of diagnosis and treatments.
Collapse
Affiliation(s)
| | | | | | | | | | - Jason H Moore
- Department of Genetics, Institute for Quantitative Biomedical Sciences, Dartmouth College, Hanover, NH, USA.
| |
Collapse
|
433
|
Nacher JC, Keith B, Schwartz JM. Network medicine analysis of chondrocyte proteins towards new treatments of osteoarthritis. Proc Biol Sci 2014; 281:20132907. [PMID: 24430851 DOI: 10.1098/rspb.2013.2907] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Osteoarthritis (OA) is a progressive disorder with high incidence in the ageing human population that still has no treatment currently. This disorder induces the breakdown of articular cartilage, leading to the exposure and damage of bone surfaces. For a global understanding of OA development, the systematic integration of known OA-related proteins with protein-protein interaction (PPI) networks is required. In this work, the OA-related interactome was reconstructed using multiple data sources to have the most up-to-date information on OA-related proteins and their interactions. We then combined emergent concepts in network medicine to detect new unclassified OA-related proteins. The mapping of known OA-related proteins with PPI networks showed that these proteins are locally connected to each other and agglomerated in a large component. To expand this module, we applied a diffusion-based algorithm that probabilistically induces more searches in the vicinity of the seed OA-related proteins. As a result, the 10 topmost ranked proteins were connected to the OA disease module, supporting the local hypothesis. We computed structural modules and selected those that had the highest enrichment of OA-related proteins. The identified molecules show a link between structural topology and disease dysfunctionality. Interestingly, the protein Q6EEV6 was highlighted for OA association by both methods, reinforcing the potential involvement of this protein. These results suggest that similar disease-connected modules may exist in different human disorders, which could lead to systematic identification of genes or proteins that have a joint role in specific disease phenotypes.
Collapse
Affiliation(s)
- Jose C Nacher
- Department of Information Science, Faculty of Science, Toho University, , Miyama 2-2-1, Funabashi, Chiba 274-8510, Japan, Faculty of Life Sciences, University of Manchester, , Manchester M13 9PT, UK
| | | | | |
Collapse
|
434
|
Scibetta M, Boano F, Revelli R, Ridolfi L. Community Detection as a Tool for District Metered Areas Identification. ACTA ACUST UNITED AC 2014. [DOI: 10.1016/j.proeng.2014.02.167] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
435
|
|
436
|
|
437
|
Thomas CJ, Lambrechts J, Wolanski E, Traag VA, Blondel VD, Deleersnijder E, Hanert E. Numerical modelling and graph theory tools to study ecological connectivity in the Great Barrier Reef. Ecol Modell 2014. [DOI: 10.1016/j.ecolmodel.2013.10.002] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
438
|
Landi P, Piccardi C. Community analysis in directed networks: in-, out-, and pseudocommunities. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 89:012814. [PMID: 24580288 DOI: 10.1103/physreve.89.012814] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2013] [Indexed: 06/03/2023]
Abstract
When analyzing important classes of complex interconnected systems, link directionality can hardly be neglected if a precise and effective picture of the structure and function of the system is needed. If community analysis is performed, the notion of "community" itself is called into question, since the property of having a comparatively looser external connectivity could refer to the inbound or outbound links only or to both categories. In this paper, we introduce the notions of in-, out-, and in-/out-community in order to correctly classify the directedness of the interaction of a subnetwork with the rest of the system. Furthermore, we extend the scope of community analysis by introducing the notions of in-, out-, and in-/out-pseudocommunity. They are subnetworks having strong internal connectivity but also important interactions with the rest of the system, the latter taking place by means of a minority of its nodes only. The various types of (pseudo-)communities are qualified and distinguished by a suitable set of indicators and, on a given network, they can be discovered by using a "local" searching algorithm. The application to a broad set of benchmark networks and real-world examples proves that the proposed approach is able to effectively disclose the different types of structures above defined and to usefully classify the directionality of their interactions with the rest of the system.
Collapse
Affiliation(s)
- Pietro Landi
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, I-20133 Milano, Italy
| | - Carlo Piccardi
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, I-20133 Milano, Italy
| |
Collapse
|
439
|
Floretta L, Liechti J, Flammini A, De Los Rios P. Stochastic fluctuations and the detectability limit of network communities. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 88:060801. [PMID: 24483374 DOI: 10.1103/physreve.88.060801] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/18/2013] [Revised: 10/01/2013] [Indexed: 06/03/2023]
Abstract
We have analyzed the detectability limits of network communities in the framework of the popular Girvan and Newman benchmark. By carefully taking into account the inevitable stochastic fluctuations that affect the construction of each and every instance of the benchmark, we come to the conclusion that the native, putative partition of the network is completely lost even before the in-degree/out-degree ratio becomes equal to that of a structureless Erdös-Rényi network. We develop a simple iterative scheme, analytically well described by an infinite branching process, to provide an estimate of the true detectability limit. Using various algorithms based on modularity optimization, we show that all of them behave (semiquantitatively) in the same way, with the same functional form of the detectability threshold as a function of the network parameters. Because the same behavior has also been found by further modularity-optimization methods and for methods based on different heuristics implementations, we conclude that indeed a correct definition of the detectability limit must take into account the stochastic fluctuations of the network construction.
Collapse
Affiliation(s)
- Lucio Floretta
- Laboratoire de Biophysique Statistique, Ecole Polytechnique Fédérale de Lausanne (EPFL), CH-1015 Lausanne, Switzerland
| | - Jonas Liechti
- Laboratoire de Biophysique Statistique, Ecole Polytechnique Fédérale de Lausanne (EPFL), CH-1015 Lausanne, Switzerland
| | - Alessandro Flammini
- School of Informatics and Computing, Indiana University, Bloomington, Indiana 47406, USA
| | - Paolo De Los Rios
- Laboratoire de Biophysique Statistique, Ecole Polytechnique Fédérale de Lausanne (EPFL), CH-1015 Lausanne, Switzerland
| |
Collapse
|
440
|
Zhang ZY, Sun KD, Wang SQ. Enhanced community structure detection in complex networks with partial background information. Sci Rep 2013; 3:3241. [PMID: 24247657 PMCID: PMC4894381 DOI: 10.1038/srep03241] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Accepted: 10/31/2013] [Indexed: 11/09/2022] Open
Abstract
Community structure detection in complex networks is important since it can help better understand the network topology and how the network works. However, there is still not a clear and widely-accepted definition of community structure, and in practice, different models may give very different results of communities, making it hard to explain the results. In this paper, different from the traditional methodologies, we design an enhanced semi-supervised learning framework for community detection, which can effectively incorporate the available prior information to guide the detection process and can make the results more explainable. By logical inference, the prior information is more fully utilized. The experiments on both the synthetic and the real-world networks confirm the effectiveness of the framework.
Collapse
Affiliation(s)
- Zhong-Yuan Zhang
- School of Statistics and Mathematics, Central University of Finance and Economics, P.R.China
| | | | | |
Collapse
|
441
|
Kikuchi M, Ogishima S, Miyamoto T, Miyashita A, Kuwano R, Nakaya J, Tanaka H. Identification of unstable network modules reveals disease modules associated with the progression of Alzheimer's disease. PLoS One 2013; 8:e76162. [PMID: 24348898 PMCID: PMC3858171 DOI: 10.1371/journal.pone.0076162] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2013] [Accepted: 08/20/2013] [Indexed: 11/18/2022] Open
Abstract
Alzheimer's disease (AD), the most common cause of dementia, is associated with aging, and it leads to neuron death. Deposits of amyloid β and aberrantly phosphorylated tau protein are known as pathological hallmarks of AD, but the underlying mechanisms have not yet been revealed. A high-throughput gene expression analysis previously showed that differentially expressed genes accompanying the progression of AD were more down-regulated than up-regulated in the later stages of AD. This suggested that the molecular networks and their constituent modules collapsed along with AD progression. In this study, by using gene expression profiles and protein interaction networks (PINs), we identified the PINs expressed in three brain regions: the entorhinal cortex (EC), hippocampus (HIP) and superior frontal gyrus (SFG). Dividing the expressed PINs into modules, we examined the stability of the modules with AD progression and with normal aging. We found that in the AD modules, the constituent proteins, interactions and cellular functions were not maintained between consecutive stages through all brain regions. Interestingly, the modules were collapsed with AD progression, specifically in the EC region. By identifying the modules that were affected by AD pathology, we found the transcriptional regulation-associated modules that interact with the proteasome-associated module via UCHL5 hub protein, which is a deubiquitinating enzyme. Considering PINs as a system made of network modules, we found that the modules relevant to the transcriptional regulation are disrupted in the EC region, which affects the ubiquitin-proteasome system.
Collapse
Affiliation(s)
- Masataka Kikuchi
- Department of Bioinformatics, Tokyo Medical and Dental University, Bunkyo-ku, Tokyo, Japan
| | - Soichi Ogishima
- Department of Bioclinical Informatics, Tohoku Medical Megabank Organization, Tohoku University, Sendai-shi, Miyagi, Japan
- * E-mail:
| | - Tadashi Miyamoto
- Department of Bioinformatics, Tokyo Medical and Dental University, Bunkyo-ku, Tokyo, Japan
| | - Akinori Miyashita
- Bioresource Science Branch, Center for Bioresources, Brain Research Institute, Niigata University, Niigata-shi, Niigata, Japan
| | - Ryozo Kuwano
- Bioresource Science Branch, Center for Bioresources, Brain Research Institute, Niigata University, Niigata-shi, Niigata, Japan
| | - Jun Nakaya
- Department of Bioclinical Informatics, Tohoku Medical Megabank Organization, Tohoku University, Sendai-shi, Miyagi, Japan
| | - Hiroshi Tanaka
- Department of Bioinformatics, Tokyo Medical and Dental University, Bunkyo-ku, Tokyo, Japan
- Department of Bioclinical Informatics, Tohoku Medical Megabank Organization, Tohoku University, Sendai-shi, Miyagi, Japan
| |
Collapse
|
442
|
|
443
|
Rabbany R, Takaffoli M, Fagnan J, Zaïane OR, Campello RJGB. Communities validity: methodical evaluation of community mining algorithms. SOCIAL NETWORK ANALYSIS AND MINING 2013. [DOI: 10.1007/s13278-013-0132-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
444
|
Mall R, Langone R, Suykens JAK. FURS: Fast and Unique Representative Subset selection retaining large-scale community structure. SOCIAL NETWORK ANALYSIS AND MINING 2013. [DOI: 10.1007/s13278-013-0144-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
445
|
Traag VA, Krings G, Van Dooren P. Significant scales in community structure. Sci Rep 2013; 3:2930. [PMID: 24121597 PMCID: PMC3796307 DOI: 10.1038/srep02930] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2013] [Accepted: 09/24/2013] [Indexed: 11/09/2022] Open
Abstract
Many complex networks show signs of modular structure, uncovered by community detection. Although many methods succeed in revealing various partitions, it remains difficult to detect at what scale some partition is significant. This problem shows foremost in multi-resolution methods. We here introduce an efficient method for scanning for resolutions in one such method. Additionally, we introduce the notion of "significance" of a partition, based on subgraph probabilities. Significance is independent of the exact method used, so could also be applied in other methods, and can be interpreted as the gain in encoding a graph by making use of a partition. Using significance, we can determine "good" resolution parameters, which we demonstrate on benchmark networks. Moreover, optimizing significance itself also shows excellent performance. We demonstrate our method on voting data from the European Parliament. Our analysis suggests the European Parliament has become increasingly ideologically divided and that nationality plays no role.
Collapse
Affiliation(s)
- V A Traag
- 1] ICTEAM, Université catholique de Louvain [2] Royal Netherlands Institute of Southeast Asian and Caribbean Studies
| | | | | |
Collapse
|
446
|
Processing of natural sounds: characterization of multipeak spectral tuning in human auditory cortex. J Neurosci 2013; 33:11888-98. [PMID: 23864678 DOI: 10.1523/jneurosci.5306-12.2013] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
We examine the mechanisms by which the human auditory cortex processes the frequency content of natural sounds. Through mathematical modeling of ultra-high field (7 T) functional magnetic resonance imaging responses to natural sounds, we derive frequency-tuning curves of cortical neuronal populations. With a data-driven analysis, we divide the auditory cortex into five spatially distributed clusters, each characterized by a spectral tuning profile. Beyond neuronal populations with simple single-peaked spectral tuning (grouped into two clusters), we observe that ∼60% of auditory populations are sensitive to multiple frequency bands. Specifically, we observe sensitivity to multiple frequency bands (1) at exactly one octave distance from each other, (2) at multiple harmonically related frequency intervals, and (3) with no apparent relationship to each other. We propose that beyond the well known cortical tonotopic organization, multipeaked spectral tuning amplifies selected combinations of frequency bands. Such selective amplification might serve to detect behaviorally relevant and complex sound features, aid in segregating auditory scenes, and explain prominent perceptual phenomena such as octave invariance.
Collapse
|
447
|
Probabilistic analysis of communities and inner roles in networks: Bayesian generative models and approximate inference. SOCIAL NETWORK ANALYSIS AND MINING 2013. [DOI: 10.1007/s13278-013-0130-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
448
|
Comin CH, da Fontoura Costa L. Shape, connectedness and dynamics in neuronal networks. J Neurosci Methods 2013; 220:100-15. [PMID: 23954264 DOI: 10.1016/j.jneumeth.2013.08.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2013] [Revised: 08/01/2013] [Accepted: 08/02/2013] [Indexed: 10/26/2022]
Abstract
The morphology of neurons is directly related to several aspects of the nervous system, including its connectedness, health, development, evolution, dynamics and, ultimately, behavior. Such interplays of the neuronal morphology can be understood within the more general shape-function paradigm. The current article reviews, in an introductory way, some key issues regarding the role of neuronal morphology in the nervous system, with emphasis on works developed in the authors' group. The following topics are addressed: (a) characterization of neuronal shape; (b) stochastic synthesis of neurons and neuronal systems; (c) characterization of the connectivity of neuronal networks by using complex networks concepts; and (d) investigations of influences of neuronal shape on network dynamics. The presented concepts and methods are useful also for several other multiple object systems, such as protein-protein interaction, tissues, aggregates and polymers.
Collapse
Affiliation(s)
- Cesar Henrique Comin
- Instituto de Física de São Carlos, Universidade de São Paulo, São Carlos, SP, Caixa Postal 369, 13560-970, Brazil.
| | | |
Collapse
|
449
|
West JD, Jacquet J, King MM, Correll SJ, Bergstrom CT. The role of gender in scholarly authorship. PLoS One 2013; 8:e66212. [PMID: 23894278 PMCID: PMC3718784 DOI: 10.1371/journal.pone.0066212] [Citation(s) in RCA: 320] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2013] [Accepted: 05/07/2013] [Indexed: 11/18/2022] Open
Abstract
Gender disparities appear to be decreasing in academia according to a number of metrics, such as grant funding, hiring, acceptance at scholarly journals, and productivity, and it might be tempting to think that gender inequity will soon be a problem of the past. However, a large-scale analysis based on over eight million papers across the natural sciences, social sciences, and humanities reveals a number of understated and persistent ways in which gender inequities remain. For instance, even where raw publication counts seem to be equal between genders, close inspection reveals that, in certain fields, men predominate in the prestigious first and last author positions. Moreover, women are significantly underrepresented as authors of single-authored papers. Academics should be aware of the subtle ways that gender disparities can occur in scholarly authorship.
Collapse
Affiliation(s)
- Jevin D West
- Department of Biology, University of Washington, Seattle, Washington, USA
| | | | | | | | | |
Collapse
|
450
|
Sun PG, Gao L, Yang Y. Maximizing modularity intensity for community partition and evolution. Inf Sci (N Y) 2013. [DOI: 10.1016/j.ins.2013.02.032] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|