501
|
|
502
|
Finn J, Brownscombe J, Haak C, Cooke S, Cormier R, Gagne T, Danylchuk A. Applying network methods to acoustic telemetry data: Modeling the movements of tropical marine fishes. Ecol Modell 2014. [DOI: 10.1016/j.ecolmodel.2013.12.014] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
503
|
|
504
|
Le Thi HA, Nguyen MC, Dinh TP. A DC Programming Approach for Finding Communities in Networks. Neural Comput 2014; 26:2827-54. [DOI: 10.1162/neco_a_00673] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Automatic discovery of community structures in complex networks is a fundamental task in many disciplines, including physics, biology, and the social sciences. The most used criterion for characterizing the existence of a community structure in a network is modularity, a quantitative measure proposed by Newman and Girvan ( 2004 ). The discovery community can be formulated as the so-called modularity maximization problem that consists of finding a partition of nodes of a network with the highest modularity. In this letter, we propose a fast and scalable algorithm called DCAM, based on DC (difference of convex function) programming and DCA (DC algorithms), an innovative approach in nonconvex programming framework for solving the modularity maximization problem. The special structure of the problem considered here has been well exploited to get an inexpensive DCA scheme that requires only a matrix-vector product at each iteration. Starting with a very large number of communities, DCAM furnishes, as output results, an optimal partition together with the optimal number of communities [Formula: see text]; that is, the number of communities is discovered automatically during DCAM’s iterations. Numerical experiments are performed on a variety of real-world network data sets with up to 4,194,304 nodes and 30,359,198 edges. The comparative results with height reference algorithms show that the proposed approach outperforms them not only on quality and rapidity but also on scalability. Moreover, it realizes a very good trade-off between the quality of solutions and the run time.
Collapse
Affiliation(s)
- Hoai An Le Thi
- Laboratory of Theoretical and Applied Computer Science, University of Lorraine, Ile du Saulcy, 57045 Metz, France
| | - Manh Cuong Nguyen
- Laboratory of Theoretical and Applied Computer Science, University of Lorraine, Ile du Saulcy, 57045 Metz, France
| | - Tao Pham Dinh
- Laboratoire of Mathematics, National Institute for Applied Sciences—Rouen, 76801 Saint-Étienne-du-Rouvray cedex, France
| |
Collapse
|
505
|
Label propagation with α-degree neighborhood impact for network community detection. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2014; 2014:130689. [PMID: 25525425 PMCID: PMC4265519 DOI: 10.1155/2014/130689] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Revised: 10/30/2014] [Accepted: 11/12/2014] [Indexed: 11/17/2022]
Abstract
Community detection is an important task for mining the structure and function of complex networks. In this paper, a novel label propagation approach with α-degree neighborhood impact is proposed for efficiently and effectively detecting communities in networks. Firstly, we calculate the neighborhood impact of each node in a network within the scope of its α-degree neighborhood network by using an iterative approach. To mitigate the problems of visiting order correlation and convergence difficulty when updating the node labels asynchronously, our method updates the labels in an ascending order on the α-degree neighborhood impact of all the nodes. The α-degree neighborhood impact is also taken as the updating weight value, where the parameter impact scope α can be set to a positive integer. Experimental results from several real-world and synthetic networks show that our method can reveal the community structure in networks rapidly and accurately. The performance of our method is better than other label propagation based methods.
Collapse
|
506
|
Hollander CD, Wu AS. Gossip-based solutions for discrete rendezvous in populations of communicating agents. PLoS One 2014; 9:e112612. [PMID: 25397882 PMCID: PMC4232358 DOI: 10.1371/journal.pone.0112612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Accepted: 10/19/2014] [Indexed: 11/18/2022] Open
Abstract
The objective of the rendezvous problem is to construct a method that enables a population of agents to agree on a spatial (and possibly temporal) meeting location. We introduce the buffered gossip algorithm as a general solution to the rendezvous problem in a discrete domain with direct communication between decentralized agents. We compare the performance of the buffered gossip algorithm against the well known uniform gossip algorithm. We believe that a buffered solution is preferable to an unbuffered solution, such as the uniform gossip algorithm, because the use of a buffer allows an agent to use multiple information sources when determining its desired rendezvous point, and that access to multiple information sources may improve agent decision making by reinforcing or contradicting an initial choice. To show that the buffered gossip algorithm is an actual solution for the rendezvous problem, we construct a theoretical proof of convergence and derive the conditions under which the buffered gossip algorithm is guaranteed to produce a consensus on rendezvous location. We use these results to verify that the uniform gossip algorithm also solves the rendezvous problem. We then use a multi-agent simulation to conduct a series of simulation experiments to compare the performance between the buffered and uniform gossip algorithms. Our results suggest that the buffered gossip algorithm can solve the rendezvous problem faster than the uniform gossip algorithm; however, the relative performance between these two solutions depends on the specific constraints of the problem and the parameters of the buffered gossip algorithm.
Collapse
Affiliation(s)
- Christopher D. Hollander
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL, United States of America
- * E-mail:
| | - Annie S. Wu
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL, United States of America
| |
Collapse
|
507
|
Cheng J, Leng M, Li L, Zhou H, Chen X. Active semi-supervised community detection based on must-link and cannot-link constraints. PLoS One 2014; 9:e110088. [PMID: 25329660 PMCID: PMC4201489 DOI: 10.1371/journal.pone.0110088] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2014] [Accepted: 09/16/2014] [Indexed: 12/04/2022] Open
Abstract
Community structure detection is of great importance because it can help in discovering the relationship between the function and the topology structure of a network. Many community detection algorithms have been proposed, but how to incorporate the prior knowledge in the detection process remains a challenging problem. In this paper, we propose a semi-supervised community detection algorithm, which makes full utilization of the must-link and cannot-link constraints to guide the process of community detection and thereby extracts high-quality community structures from networks. To acquire the high-quality must-link and cannot-link constraints, we also propose a semi-supervised component generation algorithm based on active learning, which actively selects nodes with maximum utility for the proposed semi-supervised community detection algorithm step by step, and then generates the must-link and cannot-link constraints by accessing a noiseless oracle. Extensive experiments were carried out, and the experimental results show that the introduction of active learning into the problem of community detection makes a success. Our proposed method can extract high-quality community structures from networks, and significantly outperforms other comparison methods.
Collapse
Affiliation(s)
- Jianjun Cheng
- School of Information Science and Engineering, Lanzhou University, Lanzhou, Gansu Province, China
- * E-mail: (JC); (XC)
| | - Mingwei Leng
- School of Information Science and Engineering, Lanzhou University, Lanzhou, Gansu Province, China
| | - Longjie Li
- School of Information Science and Engineering, Lanzhou University, Lanzhou, Gansu Province, China
| | - Hanhai Zhou
- School of Information Science and Engineering, Lanzhou University, Lanzhou, Gansu Province, China
| | - Xiaoyun Chen
- School of Information Science and Engineering, Lanzhou University, Lanzhou, Gansu Province, China
- * E-mail: (JC); (XC)
| |
Collapse
|
508
|
Detecting community structures in networks by label propagation with prediction of percolation transition. ScientificWorldJournal 2014; 2014:148686. [PMID: 25110725 PMCID: PMC4119666 DOI: 10.1155/2014/148686] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2014] [Revised: 06/17/2014] [Accepted: 06/17/2014] [Indexed: 11/21/2022] Open
Abstract
Though label propagation algorithm (LPA) is one of the fastest algorithms for community detection in complex networks, the problem of trivial solutions frequently occurring in the algorithm affects its performance. We propose a label propagation algorithm with prediction of percolation transition (LPAp). After analyzing the reason for multiple solutions of LPA, by transforming the process of community detection into network construction process, a trivial solution in label propagation is considered as a giant component in the percolation transition. We add a prediction process of percolation transition in label propagation to delay the occurrence of trivial solutions, which makes small communities easier to be found. We also give an incomplete update condition which considers both neighbor purity and the contribution of small degree vertices to community detection to reduce the computation time of LPAp. Numerical tests are conducted. Experimental results on synthetic networks and real-world networks show that the LPAp is more accurate, more sensitive to small community, and has the ability to identify a single community structure. Moreover, LPAp with the incomplete update process can use less computation time than LPA, nearly without modularity loss.
Collapse
|
509
|
|
510
|
Harenberg S, Bello G, Gjeltema L, Ranshous S, Harlalka J, Seay R, Padmanabhan K, Samatova N. Community detection in large‐scale networks: a survey and empirical evaluation. ACTA ACUST UNITED AC 2014. [DOI: 10.1002/wics.1319] [Citation(s) in RCA: 133] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Steve Harenberg
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Gonzalo Bello
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - L. Gjeltema
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Stephen Ranshous
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Jitendra Harlalka
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Ramona Seay
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Kanchana Padmanabhan
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| | - Nagiza Samatova
- Department of Computer Science North Carolina State University Raleigh NC USA
- Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN USA
| |
Collapse
|
511
|
Abstract
Network methods have had profound influence in many domains and disciplines in the past decade. Community structure is a very important property of complex networks, but the accurate definition of a community remains an open problem. Here we defined community based on three properties, and then propose a simple and novel framework to detect communities based on network topology. We analyzed 16 different types of networks, and compared our partitions with Infomap, LPA, Fastgreedy and Walktrap, which are popular algorithms for community detection. Most of the partitions generated using our approach compare favorably to those generated by these other algorithms. Furthermore, we define overlapping nodes that combine community structure with shortest paths. We also analyzed the E. Coli. transcriptional regulatory network in detail, and identified modules with strong functional coherence.
Collapse
|
512
|
Srivastava A, Kumar S, Ramaswamy R. Two-layer modular analysis of gene and protein networks in breast cancer. BMC SYSTEMS BIOLOGY 2014; 8:81. [PMID: 24997799 PMCID: PMC4105126 DOI: 10.1186/1752-0509-8-81] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Accepted: 06/26/2014] [Indexed: 02/05/2023]
Abstract
Background Genomic, proteomic and high-throughput gene expression data, when integrated, can be used to map the interaction networks between genes and proteins. Different approaches have been used to analyze these networks, especially in cancer, where mutations in biologically related genes that encode mutually interacting proteins are believed to be involved. This system of integrated networks as a whole exhibits emergent biological properties that are not obvious at the individual network level. We analyze the system in terms of modules, namely a set of densely interconnected nodes that can be further divided into submodules that are expected to participate in multiple biological activities in coordinated manner. Results In the present work we construct two layers of the breast cancer network: the gene layer, where the correlation network of breast cancer genes is analyzed to identify gene modules, and the protein layer, where each gene module is extended to map out the network of expressed proteins and their interactions in order to identify submodules. Each module and its associated submodules are analyzed to test the robustness of their topological distribution. The constituent biological phenomena are explored through the use of the Gene Ontology. We thus construct a “network of networks”, and demonstrate that both the gene and protein interaction networks are modular in nature. By focusing on the ontological classification, we are able to determine the entire GO profiles that are distributed at different levels of hierarchy. Within each submodule most of the proteins are biologically correlated, and participate in groups of distinct biological activities. Conclusions The present approach is an effective method for discovering coherent gene modules and protein submodules. We show that this also provides a means of determining biological pathways (both novel and as well those that have been reported previously) that are related, in the present instance, to breast cancer. Similar strategies are likely to be useful in the analysis of other diseases as well.
Collapse
Affiliation(s)
- Alok Srivastava
- C R RAO Advanced Institute of Mathematics, Statistics and Computer Science, University of Hyderabad Campus, Hyderabad 500046, India.
| | | | | |
Collapse
|
513
|
Liu X, Murata T, Wakita K. Detecting network communities beyond assortativity-related attributes. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 90:012806. [PMID: 25122341 DOI: 10.1103/physreve.90.012806] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2013] [Indexed: 06/03/2023]
Abstract
In network science, assortativity refers to the tendency of links to exist between nodes with similar attributes. In social networks, for example, links tend to exist between individuals of similar age, nationality, location, race, income, educational level, religious belief, and language. Thus, various attributes jointly affect the network topology. An interesting problem is to detect community structure beyond some specific assortativity-related attributes ρ, i.e., to take out the effect of ρ on network topology and reveal the hidden community structures which are due to other attributes. An approach to this problem is to redefine the null model of the modularity measure, so as to simulate the effect of ρ on network topology. However, a challenge is that we do not know to what extent the network topology is affected by ρ and by other attributes. In this paper, we propose a distance modularity, which allows us to freely choose any suitable function to simulate the effect of ρ. Such freedom can help us probe the effect of ρ and detect the hidden communities which are due to other attributes. We test the effectiveness of distance modularity on synthetic benchmarks and two real-world networks.
Collapse
Affiliation(s)
- Xin Liu
- Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro, Tokyo, 152-8552 Japan and CREST, Japan Science and Technology Agency, K's Gobancho, 7 Gobancho, Chiyoda, Tokyo, 102-0076 Japan and Wuhan University of Technology, 122 Luoshi Road, Wuhan, Hubei, 430070 China
| | - Tsuyoshi Murata
- Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro, Tokyo, 152-8552 Japan
| | - Ken Wakita
- Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro, Tokyo, 152-8552 Japan and CREST, Japan Science and Technology Agency, K's Gobancho, 7 Gobancho, Chiyoda, Tokyo, 102-0076 Japan
| |
Collapse
|
514
|
Sah P, Singh LO, Clauset A, Bansal S. Exploring community structure in biological networks with random graphs. BMC Bioinformatics 2014; 15:220. [PMID: 24965130 PMCID: PMC4094994 DOI: 10.1186/1471-2105-15-220] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 05/20/2014] [Indexed: 11/25/2022] Open
Abstract
Background Community structure is ubiquitous in biological networks. There has been an increased interest in unraveling the community structure of biological systems as it may provide important insights into a system’s functional components and the impact of local structures on dynamics at a global scale. Choosing an appropriate community detection algorithm to identify the community structure in an empirical network can be difficult, however, as the many algorithms available are based on a variety of cost functions and are difficult to validate. Even when community structure is identified in an empirical system, disentangling the effect of community structure from other network properties such as clustering coefficient and assortativity can be a challenge. Results Here, we develop a generative model to produce undirected, simple, connected graphs with a specified degrees and pattern of communities, while maintaining a graph structure that is as random as possible. Additionally, we demonstrate two important applications of our model: (a) to generate networks that can be used to benchmark existing and new algorithms for detecting communities in biological networks; and (b) to generate null models to serve as random controls when investigating the impact of complex network features beyond the byproduct of degree and modularity in empirical biological networks. Conclusion Our model allows for the systematic study of the presence of community structure and its impact on network function and dynamics. This process is a crucial step in unraveling the functional consequences of the structural properties of biological systems and uncovering the mechanisms that drive these systems.
Collapse
Affiliation(s)
| | | | | | - Shweta Bansal
- Department of Biology, Georgetown University, 20057 Washington DC, USA.
| |
Collapse
|
515
|
|
516
|
A node influence based label propagation algorithm for community detection in networks. ScientificWorldJournal 2014; 2014:627581. [PMID: 24999491 PMCID: PMC4066938 DOI: 10.1155/2014/627581] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2014] [Revised: 04/25/2014] [Accepted: 05/13/2014] [Indexed: 11/20/2022] Open
Abstract
Label propagation algorithm (LPA) is an extremely fast community detection method and is widely used in large scale networks. In spite of the advantages of LPA, the issue of its poor stability has not yet been well addressed. We propose a novel node influence based label propagation algorithm for community detection (NIBLPA), which improves the performance of LPA by improving the node orders of label updating and the mechanism of label choosing when more than one label is contained by the maximum number of nodes. NIBLPA can get more stable results than LPA since it avoids the complete randomness of LPA. The experimental results on both synthetic and real networks demonstrate that NIBLPA maintains the efficiency of the traditional LPA algorithm, and, at the same time, it has a superior performance to some representative methods.
Collapse
|
517
|
|
518
|
Ma L, Huang H, He Q, Chiew K, Liu Z. Toward seed-insensitive solutions to local community detection. J Intell Inf Syst 2014. [DOI: 10.1007/s10844-014-0315-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
519
|
Decomposition-based multiobjective evolutionary algorithm for community detection in dynamic social networks. ScientificWorldJournal 2014; 2014:402345. [PMID: 24723806 PMCID: PMC3958684 DOI: 10.1155/2014/402345] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2013] [Accepted: 12/22/2013] [Indexed: 11/17/2022] Open
Abstract
Community structure is one of the most important properties in social networks. In dynamic networks, there are two conflicting criteria that need to be considered. One is the snapshot quality, which evaluates the quality of the community partitions at the current time step. The other is the temporal cost, which evaluates the difference between communities at different time steps. In this paper, we propose a decomposition-based multiobjective community detection algorithm to simultaneously optimize these two objectives to reveal community structure and its evolution in dynamic networks. It employs the framework of multiobjective evolutionary algorithm based on decomposition to simultaneously optimize the modularity and normalized mutual information, which quantitatively measure the quality of the community partitions and temporal cost, respectively. A local search strategy dealing with the problem-specific knowledge is incorporated to improve the effectiveness of the new algorithm. Experiments on computer-generated and real-world networks demonstrate that the proposed algorithm can not only find community structure and capture community evolution more accurately, but also be steadier than the two compared algorithms.
Collapse
|
520
|
Abstract
Traditional approaches to community detection, as studied by physicists, sociologists, and more recently computer scientists, aim at simply partitioning the social network graph. However, with the advent of online social networking sites, richer data has become available: beyond the link information, each user in the network is annotated with additional information, for example, demographics, shopping behavior, or interests. In this context, it is therefore important to develop mining methods which can take advantage of all available information. In the case of community detection, this means finding
good communities
(a set of nodes cohesive in the social graph) which are associated with
good descriptions
in terms of user information (node attributes).
Having good descriptions associated to our models make them understandable by domain experts and thus more useful in real-world applications. Another requirement dictated by real-world applications, is to develop methods that can use, when available, any domain-specific background knowledge. In the case of community detection the background knowledge could be a vague description of the communities sought in a specific application, or some prototypical nodes (e.g., good customers in the past), that represent what the analyst is looking for (a community of similar users).
Towards this goal, in this article, we define and study the problem of finding a diverse set of cohesive communities with concise descriptions. We propose an effective algorithm that alternates between two phases: a hill-climbing phase producing (possibly overlapping) communities, and a description induction phase which uses techniques from supervised pattern set mining. Our framework has the nice feature of being able to build well-described cohesive communities starting from any given description or seed set of nodes, which makes it very flexible and easily applicable in real-world applications.
Our experimental evaluation confirms that the proposed method discovers cohesive communities with concise descriptions in realistic and large online social networks such as D
elicious
, F
lickr
, and L
ast
FM.
Collapse
|
521
|
Sun Y, Zhao Y. Overlapping-box-covering method for the fractal dimension of complex networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 89:042809. [PMID: 24827295 DOI: 10.1103/physreve.89.042809] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2013] [Indexed: 06/03/2023]
Abstract
The fractality and self-similarity of complex networks have been widely investigated by evaluating the fractal dimension, the crux of which is how to locate the optimal solution or how to tile the network with the fewest boxes. The results yielded by the box-covering method with separated boxes possess great randomness or large errors. In this paper, we adopt the overlapping box to tile the entire network, called the overlapping-box-covering method. In such a case, for verifying its validity, we propose an overlapping-box-covering algorithm; we first apply it to three deterministic networks, then to four real-world fractal networks. It produces optimums or more accurate fractal dimension for the former; the quantities of boxes finally obtained for the latter are fewer and more deterministic, with the redundant box reaching up to 33.3%. The experimental results show that the overlapping-box-covering method is available and that the overlapping box outperforms the previous case, rendering the errors smaller. Moreover, we conclude that the overlapping box is an important determinant to acquire the fewest boxes for complex networks.
Collapse
Affiliation(s)
- Yuanyuan Sun
- College of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Yujie Zhao
- College of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
| |
Collapse
|
522
|
Atzmueller M, Becker M, Kibanov M, Scholz C, Doerfel S, Hotho A, Macek BE, Mitzlaff F, Mueller J, Stumme G. Ubicon and its applications for ubiquitous social computing. NEW REV HYPERMEDIA M 2014. [DOI: 10.1080/13614568.2013.873488] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
523
|
Aldecoa R, Marín I. Exploring the limits of community detection strategies in complex networks. Sci Rep 2014; 3:2216. [PMID: 23860510 PMCID: PMC3713530 DOI: 10.1038/srep02216] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 06/18/2013] [Indexed: 12/05/2022] Open
Abstract
The characterization of network community structure has profound implications in several scientific areas. Therefore, testing the algorithms developed to establish the optimal division of a network into communities is a fundamental problem in the field. We performed here a highly detailed evaluation of community detection algorithms, which has two main novelties: 1) using complex closed benchmarks, which provide precise ways to assess whether the solutions generated by the algorithms are optimal; and, 2) A novel type of analysis, based on hierarchically clustering the solutions suggested by multiple community detection algorithms, which allows to easily visualize how different are those solutions. Surprise, a global parameter that evaluates the quality of a partition, confirms the power of these analyses. We show that none of the community detection algorithms tested provide consistently optimal results in all networks and that Surprise maximization, obtained by combining multiple algorithms, obtains quasi-optimal performances in these difficult benchmarks.
Collapse
Affiliation(s)
- Rodrigo Aldecoa
- Instituto de Biomedicina de Valencia, Consejo Superior de Investigaciones Científicas, Calle Jaime Roig 11, 46010, Valencia, Spain
| | | |
Collapse
|
524
|
|
525
|
Cai Q, Gong M, Ma L, Jiao L. A Novel Clonal Selection Algorithm for Community Detection in Complex Networks. Comput Intell 2014. [DOI: 10.1111/coin.12031] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Affiliation(s)
- Qing Cai
- Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education; Xidian University, Xi'an; China
| | - Maoguo Gong
- Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education; Xidian University, Xi'an; China
| | - Lijia Ma
- Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education; Xidian University, Xi'an; China
| | - Licheng Jiao
- Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education; Xidian University, Xi'an; China
| |
Collapse
|
526
|
Gandon F, Buffa M, Cabrio E, Corby O, Faron-Zucker C, Giboin A, Le Thanh N, Mirbel I, Sander P, Tettamanzi A, Villata S. Challenges in Bridging Social Semantics and Formal Semantics on the Web. ENTERP INF SYST-UK 2014. [DOI: 10.1007/978-3-319-09492-2_1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
527
|
Exploiting behaviors of communities of twitter users for link prediction. SOCIAL NETWORK ANALYSIS AND MINING 2013. [DOI: 10.1007/s13278-013-0142-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
528
|
ABACUS: frequent pAttern mining-BAsed Community discovery in mUltidimensional networkS. Data Min Knowl Discov 2013. [DOI: 10.1007/s10618-013-0331-0] [Citation(s) in RCA: 90] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
529
|
Jia C, Carson MB, Yu J. A fast weak motif-finding algorithm based on community detection in graphs. BMC Bioinformatics 2013; 14:227. [PMID: 23865838 PMCID: PMC3726413 DOI: 10.1186/1471-2105-14-227] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2012] [Accepted: 07/12/2013] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Identification of transcription factor binding sites (also called 'motif discovery') in DNA sequences is a basic step in understanding genetic regulation. Although many successful programs have been developed, the problem is far from being solved on account of diversity in gene expression/regulation and the low specificity of binding sites. State-of-the-art algorithms have their own constraints (e.g., high time or space complexity for finding long motifs, low precision in identification of weak motifs, or the OOPS constraint: one occurrence of the motif instance per sequence) which limit their scope of application. RESULTS In this paper, we present a novel and fast algorithm we call TFBSGroup. It is based on community detection from a graph and is used to discover long and weak (l,d) motifs under the ZOMOPS constraint (zero, one or multiple occurrence(s) of the motif instance(s) per sequence), where l is the length of a motif and d is the maximum number of mutations between a motif instance and the motif itself. Firstly, TFBSGroup transforms the (l, d) motif search in sequences to focus on the discovery of dense subgraphs within a graph. It identifies these subgraphs using a fast community detection method for obtaining coarse-grained candidate motifs. Next, it greedily refines these candidate motifs towards the true motif within their own communities. Empirical studies on synthetic (l, d) samples have shown that TFBSGroup is very efficient (e.g., it can find true (18, 6), (24, 8) motifs within 30 seconds). More importantly, the algorithm has succeeded in rapidly identifying motifs in a large data set of prokaryotic promoters generated from the Escherichia coli database RegulonDB. The algorithm has also accurately identified motifs in ChIP-seq data sets for 12 mouse transcription factors involved in ES cell pluripotency and self-renewal. CONCLUSIONS Our novel heuristic algorithm, TFBSGroup, is able to quickly identify nearly exact matches for long and weak (l, d) motifs in DNA sequences under the ZOMOPS constraint. It is also capable of finding motifs in real applications. The source code for TFBSGroup can be obtained from http://bioinformatics.bioengr.uic.edu/TFBSGroup/.
Collapse
Affiliation(s)
- Caiyan Jia
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China.
| | | | | |
Collapse
|
530
|
What’s in Twitter, I know what parties are popular and who you are supporting now! SOCIAL NETWORK ANALYSIS AND MINING 2013. [DOI: 10.1007/s13278-013-0120-1] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
531
|
|
532
|
|
533
|
|
534
|
Boldi P, Rosa M, Vigna S. Robustness of social and web graphs to node removal. SOCIAL NETWORK ANALYSIS AND MINING 2013. [DOI: 10.1007/s13278-013-0096-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
535
|
Surprise maximization reveals the community structure of complex networks. Sci Rep 2013; 3:1060. [PMID: 23320141 PMCID: PMC3544010 DOI: 10.1038/srep01060] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2012] [Accepted: 12/28/2012] [Indexed: 11/09/2022] Open
Abstract
How to determine the community structure of complex networks is an open question. It is critical to establish the best strategies for community detection in networks of unknown structure. Here, using standard synthetic benchmarks, we show that none of the algorithms hitherto developed for community structure characterization perform optimally. Significantly, evaluating the results according to their modularity, the most popular measure of the quality of a partition, systematically provides mistaken solutions. However, a novel quality function, called Surprise, can be used to elucidate which is the optimal division into communities. Consequently, we show that the best strategy to find the community structure of all the networks examined involves choosing among the solutions provided by multiple algorithms the one with the highest Surprise value. We conclude that Surprise maximization precisely reveals the community structure of complex networks.
Collapse
|
536
|
Liu D, Jin D, Baquero C, He D, Yang B, Yu Q. Genetic Algorithm with a Local Search Strategy for Discovering Communities in Complex Networks. INT J COMPUT INT SYS 2013. [DOI: 10.1080/18756891.2013.773175] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
|
537
|
|
538
|
Yang B, Di J, Liu J, Liu D. Hierarchical community detection with applications to real-world network analysis. DATA KNOWL ENG 2013. [DOI: 10.1016/j.datak.2012.09.002] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
539
|
Bin Lu. Community detection algorithm using the definition of community. 2012 INTERNATIONAL CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICWAMTIP) 2012. [DOI: 10.1109/icwamtip.2012.6413429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
540
|
Hu D, Ronhovde P, Nussinov Z. Stability-to-instability transition in the structure of large-scale networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 86:066106. [PMID: 23368003 DOI: 10.1103/physreve.86.066106] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2012] [Indexed: 06/01/2023]
Abstract
We examine phase transitions between the "easy," "hard," and "unsolvable" phases when attempting to identify structure in large complex networks ("community detection") in the presence of disorder induced by network "noise" (spurious links that obscure structure), heat bath temperature T, and system size N. The partition of a graph into q optimally disjoint subgraphs or "communities" inherently requires Potts-type variables. In earlier work [Philos. Mag. 92, 406 (2012)], when examining power law and other networks (and general associated Potts models), we illustrated that transitions in the computational complexity of the community detection problem typically correspond to spin-glass-type transitions (and transitions to chaotic dynamics in mechanical analogs) at both high and low temperatures and/or noise. The computationally "hard" phase exhibits spin-glass type behavior including memory effects. The region over which the hard phase extends in the noise and temperature phase diagram decreases as N increases while holding the average number of nodes per community fixed. This suggests that in the thermodynamic limit a direct sharp transition may occur between the easy and unsolvable phases. When present, transitions at low temperature or low noise correspond to entropy driven (or "order by disorder") annealing effects, wherein stability may initially increase as temperature or noise is increased before becoming unsolvable at sufficiently high temperature or noise. Additional transitions between contending viable solutions (such as those at different natural scales) are also possible. Identifying community structure via a dynamical approach where "chaotic-type" transitions were found earlier. The correspondence between the spin-glass-type complexity transitions and transitions into chaos in dynamical analogs might extend to other hard computational problems. In this work, we examine large networks (with a power law distribution in cluster size) that have a large number of communities (q≫1). We infer that large systems at a constant ratio of q to the number of nodes N asymptotically tend towards insolvability in the limit of large N for any positive T. The asymptotic behavior of temperatures below which structure identification might be possible, T_{×}=O[1/lnq], decreases slowly, so for practical system sizes, there remains an accessible, and generally easy, global solvable phase at low temperature. We further employ multivariate Tutte polynomials to show that increasing q emulates increasing T for a general Potts model, leading to a similar stability region at low T. Given the relation between Tutte and Jones polynomials, our results further suggest a link between the above complexity transitions and transitions associated with random knots.
Collapse
Affiliation(s)
- Dandan Hu
- Department of Physics, Washington University in St. Louis, Campus Box 1105, 1 Brookings Drive, St. Louis, Missouri 63130, USA
| | | | | |
Collapse
|
541
|
Kawadia V, Sreenivasan S. Sequential detection of temporal communities by estrangement confinement. Sci Rep 2012; 2:794. [PMID: 23145317 PMCID: PMC3494021 DOI: 10.1038/srep00794] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2012] [Accepted: 09/24/2012] [Indexed: 11/09/2022] Open
Abstract
Temporal communities are the result of a consistent partitioning of nodes across multiple snapshots of an evolving network, and they provide insights into how dense clusters in a network emerge, combine, split and decay over time. To reliably detect temporal communities we need to not only find a good community partition in a given snapshot but also ensure that it bears some similarity to the partition(s) found in the previous snapshot(s), a particularly difficult task given the extreme sensitivity of community structure yielded by current methods to changes in the network structure. Here, motivated by the inertia of inter-node relationships, we present a new measure of partition distance called estrangement, and show that constraining estrangement enables one to find meaningful temporal communities at various degrees of temporal smoothness in diverse real-world datasets. Estrangement confinement thus provides a principled approach to uncovering temporal communities in evolving networks.
Collapse
|
542
|
Li J, Song Y. Community detection in complex networks using extended compact genetic algorithm. Soft comput 2012. [DOI: 10.1007/s00500-012-0942-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
543
|
Abstract
We consider the problem of finding the set of rankings that best represents a given group of orderings on the same collection of elements (preference lists). This problem arises from social choice and voting theory, in which each voter gives a preference on a set of alternatives, and a system outputs a single preference order based on the observed voters' preferences. In this paper, we observe that, if the given set of preference lists is not homogeneous, a unique true underling ranking might not exist. Moreover only the lists that share the highest amount of information should be aggregated, and thus multiple rankings might provide a more feasible solution to the problem. In this light, we propose Network Selection, an algorithm that, given a heterogeneous group of rankings, first discovers the different communities of homogeneous rankings and then combines only the rank orderings belonging to the same community into a single final ordering. Our novel approach is inspired by graph theory; indeed our set of lists can be loosely read as the nodes of a network. As a consequence, only the lists populating the same community in the network would then be aggregated. In order to highlight the strength of our proposal, we show an application both on simulated and on two real datasets, namely a financial and a biological dataset. Experimental results on simulated data show that Network Selection can significantly outperform existing related methods. The other way around, the empirical evidence achieved on real financial data reveals that Network Selection is also able to select the most relevant variables in data mining predictive models, providing a clear superiority in terms of predictive power of the models built. Furthermore, we show the potentiality of our proposal in the bioinformatics field, providing an application to a biological microarray dataset.
Collapse
|
544
|
Mandala S, Kumara S, Yao T. Detecting alternative graph clusterings. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 86:016111. [PMID: 23005495 DOI: 10.1103/physreve.86.016111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2011] [Revised: 05/31/2012] [Indexed: 06/01/2023]
Abstract
The problem of graph clustering or community detection has enjoyed a lot of attention in complex networks literature. A quality function, modularity, quantifies the strength of clustering and on maximization yields sensible partitions. However, in most real world networks, there are an exponentially large number of near-optimal partitions with some being very different from each other. Therefore, picking an optimal clustering among the alternatives does not provide complete information about network topology. To tackle this problem, we propose a graph perturbation scheme which can be used to identify an ensemble of near-optimal and diverse clusterings. We establish analytical properties of modularity function under the perturbation which ensures diversity. Our approach is algorithm independent and therefore can leverage any of the existing modularity maximizing algorithms. We numerically show that our methodology can systematically identify very different partitions on several existing data sets. The knowledge of diverse partitions sheds more light into the topological organization and helps gain a more complete understanding of the underlying complex network.
Collapse
Affiliation(s)
- Supreet Mandala
- Industrial Engineering Department, Pennsylvania State University, University Park, PA 16802, USA
| | | | | |
Collapse
|
545
|
Bettinelli A, Hansen P, Liberti L. Algorithm for parametric community detection in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 86:016107. [PMID: 23005491 DOI: 10.1103/physreve.86.016107] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2011] [Revised: 04/17/2012] [Indexed: 06/01/2023]
Abstract
Modularity maximization is extensively used to detect communities in complex networks. It has been shown, however, that this method suffers from a resolution limit: Small communities may be undetectable in the presence of larger ones even if they are very dense. To alleviate this defect, various modifications of the modularity function have been proposed as well as multiresolution methods. In this paper we systematically study a simple model (proposed by Pons and Latapy [Theor. Comput. Sci. 412, 892 (2011)] and similar to the parametric model of Reichardt and Bornholdt [Phys. Rev. E 74, 016110 (2006)]) with a single parameter α that balances the fraction of within community edges and the expected fraction of edges according to the configuration model. An exact algorithm is proposed to find optimal solutions for all values of α as well as the corresponding successive intervals of α values for which they are optimal. This algorithm relies upon a routine for exact modularity maximization and is limited to moderate size instances. An agglomerative hierarchical heuristic is therefore proposed to address parametric modularity detection in large networks. At each iteration the smallest value of α for which it is worthwhile to merge two communities of the current partition is found. Then merging is performed and the data are updated accordingly. An implementation is proposed with the same time and space complexity as the well-known Clauset-Newman-Moore (CNM) heuristic [Phys. Rev. E 70, 066111 (2004)]. Experimental results on artificial and real world problems show that (i) communities are detected by both exact and heuristic methods for all values of the parameter α; (ii) the dendrogram summarizing the results of the heuristic method provides a useful tool for substantive analysis, as illustrated particularly on a Les Misérables data set; (iii) the difference between the parametric modularity values given by the exact method and those given by the heuristic is moderate; (iv) the heuristic version of the proposed parametric method, viewed as a modularity maximization tool, gives better results than the CNM heuristic for large instances.
Collapse
Affiliation(s)
- Andrea Bettinelli
- Dipartimento di Tecnologie dell'Infomazione, Università degli Studi di Milano, via Bramante 65, Crema, Italy.
| | | | | |
Collapse
|
546
|
Lancichinetti A, Fortunato S. Consensus clustering in complex networks. Sci Rep 2012; 2:336. [PMID: 22468223 PMCID: PMC3313482 DOI: 10.1038/srep00336] [Citation(s) in RCA: 381] [Impact Index Per Article: 29.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2012] [Accepted: 03/09/2012] [Indexed: 01/20/2023] Open
Abstract
The community structure of complex networks reveals both their organization and hidden relationships among their constituents. Most community detection methods currently available are not deterministic, and their results typically depend on the specific random seeds, initial conditions and tie-break rules adopted for their execution. Consensus clustering is used in data analysis to generate stable results out of a set of partitions delivered by stochastic methods. Here we show that consensus clustering can be combined with any existing method in a self-consistent way, enhancing considerably both the stability and the accuracy of the resulting partitions. This framework is also particularly suitable to monitor the evolution of community structure in temporal networks. An application of consensus clustering to a large citation network of physics papers demonstrates its capability to keep track of the birth, death and diversification of topics.
Collapse
|
547
|
Nishikawa T, Motter AE. Discovering network structure beyond communities. Sci Rep 2012; 1:151. [PMID: 22355667 PMCID: PMC3240966 DOI: 10.1038/srep00151] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2011] [Accepted: 09/26/2011] [Indexed: 11/16/2022] Open
Abstract
To understand the formation, evolution, and function of complex systems, it is crucial to understand the internal organization of their interaction networks. Partly due to the impossibility of visualizing large complex networks, resolving network structure remains a challenging problem. Here we overcome this difficulty by combining the visual pattern recognition ability of humans with the high processing speed of computers to develop an exploratory method for discovering groups of nodes characterized by common network properties, including but not limited to communities of densely connected nodes. Without any prior information about the nature of the groups, the method simultaneously identifies the number of groups, the group assignment, and the properties that define these groups. The results of applying our method to real networks suggest the possibility that most group structures lurk undiscovered in the fast-growing inventory of social, biological, and technological networks of scientific interest.
Collapse
Affiliation(s)
- Takashi Nishikawa
- Department of Mathematics, Clarkson University, Potsdam, NY 13699, USA.
| | | |
Collapse
|
548
|
Treviño S, Sun Y, Cooper TF, Bassler KE. Robust detection of hierarchical communities from Escherichia coli gene expression data. PLoS Comput Biol 2012; 8:e1002391. [PMID: 22383870 PMCID: PMC3285575 DOI: 10.1371/journal.pcbi.1002391] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2011] [Accepted: 01/03/2012] [Indexed: 11/19/2022] Open
Abstract
Determining the functional structure of biological networks is a central goal of systems biology. One approach is to analyze gene expression data to infer a network of gene interactions on the basis of their correlated responses to environmental and genetic perturbations. The inferred network can then be analyzed to identify functional communities. However, commonly used algorithms can yield unreliable results due to experimental noise, algorithmic stochasticity, and the influence of arbitrarily chosen parameter values. Furthermore, the results obtained typically provide only a simplistic view of the network partitioned into disjoint communities and provide no information of the relationship between communities. Here, we present methods to robustly detect co-regulated and functionally enriched gene communities and demonstrate their application and validity for Escherichia coli gene expression data. Applying a recently developed community detection algorithm to the network of interactions identified with the context likelihood of relatedness (CLR) method, we show that a hierarchy of network communities can be identified. These communities significantly enrich for gene ontology (GO) terms, consistent with them representing biologically meaningful groups. Further, analysis of the most significantly enriched communities identified several candidate new regulatory interactions. The robustness of our methods is demonstrated by showing that a core set of functional communities is reliably found when artificial noise, modeling experimental noise, is added to the data. We find that noise mainly acts conservatively, increasing the relatedness required for a network link to be reliably assigned and decreasing the size of the core communities, rather than causing association of genes into new communities.
Collapse
Affiliation(s)
- Santiago Treviño
- Department of Physics, University of Houston, Houston, Texas, United States of America
- Texas Center for Superconductivity, University of Houston, Houston, Texas, United States of America
| | - Yudong Sun
- Department of Physics, University of Houston, Houston, Texas, United States of America
- Texas Center for Superconductivity, University of Houston, Houston, Texas, United States of America
| | - Tim F. Cooper
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Kevin E. Bassler
- Department of Physics, University of Houston, Houston, Texas, United States of America
- Texas Center for Superconductivity, University of Houston, Houston, Texas, United States of America
- * E-mail:
| |
Collapse
|
549
|
Grabowicz PA, Ramasco JJ, Moro E, Pujol JM, Eguiluz VM. Social features of online networks: the strength of intermediary ties in online social media. PLoS One 2012; 7:e29358. [PMID: 22247773 PMCID: PMC3256152 DOI: 10.1371/journal.pone.0029358] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2011] [Accepted: 11/27/2011] [Indexed: 11/29/2022] Open
Abstract
An increasing fraction of today's social interactions occur using online social media as communication channels. Recent worldwide events, such as social movements in Spain or revolts in the Middle East, highlight their capacity to boost people's coordination. Online networks display in general a rich internal structure where users can choose among different types and intensity of interactions. Despite this, there are still open questions regarding the social value of online interactions. For example, the existence of users with millions of online friends sheds doubts on the relevance of these relations. In this work, we focus on Twitter, one of the most popular online social networks, and find that the network formed by the basic type of connections is organized in groups. The activity of the users conforms to the landscape determined by such groups. Furthermore, Twitter's distinction between different types of interactions allows us to establish a parallelism between online and offline social networks: personal interactions are more likely to occur on internal links to the groups (the weakness of strong ties); events transmitting new information go preferentially through links connecting different groups (the strength of weak ties) or even more through links connecting to users belonging to several groups that act as brokers (the strength of intermediary ties).
Collapse
Affiliation(s)
- Przemyslaw A. Grabowicz
- Instituto de Fisica Interdisciplinaria y Sistemas Complejos (CSIC-UIB), Palma de Mallorca, Spain
| | - José J. Ramasco
- Instituto de Fisica Interdisciplinaria y Sistemas Complejos (CSIC-UIB), Palma de Mallorca, Spain
- * E-mail:
| | - Esteban Moro
- Instituto de Ingeniera del Conocimiento, Universidad Autónoma de Madrid, Madrid, Spain
- Instituto de Ciencias Matemáticas CSIC-UAM-UC3M-UCM, Departamento de Matemáticas y GISC, Universidad Carlos III de Madrid, Leganés, Spain
| | - Josep M. Pujol
- Telefónica Research, Barcelona, Spain
- 3scale Networks, Barcelona, Spain
| | - Victor M. Eguiluz
- Instituto de Fisica Interdisciplinaria y Sistemas Complejos (CSIC-UIB), Palma de Mallorca, Spain
| |
Collapse
|
550
|
Towards Linear Time Overlapping Community Detection in Social Networks. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING 2012. [DOI: 10.1007/978-3-642-30220-6_3] [Citation(s) in RCA: 133] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|