1
|
Ferdowsi S, Foulsham T, Rahmani A, Ognibene D, Citi L, Li W. Identifying the human olfactory and chemosignaling neural networks using event related fMRI and graph theory. Sci Rep 2025; 15:12000. [PMID: 40200074 PMCID: PMC11978775 DOI: 10.1038/s41598-025-96355-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Accepted: 03/27/2025] [Indexed: 04/10/2025] Open
Abstract
This study aims to characterize and compare the functional neural networks associated with different olfactory stimuli, including air, non-social odours, and human body odours. We introduce a novel processing pipeline based on event-related functional magnetic resonance imaging (fMRI) and graph theory for network identification. To ensure the stability and small worldness of the characterized networks, we conduct statistical validations, network modularity assessments, and robustness measurement against local attacks. The key hypothesis is that human body odours (so-called social odours) and non-social odours engage distinct neural networks, particularly in regions responsible for social processing. We found that the posterior medial orbitofrontal cortex (pmOFC) and fusiform face area (FFA) demonstrate stronger centrality in the body odour network than the non-social odour and air networks. This observation supports the idea that social and olfactory information are integrated in the body odour network. Additionally, the anterior insula (INSa), posterior piriform cortex (PPC), and amygdala (AMY) exhibit high influence in air and odour networks by achieving higher centrality indices and playing a major role in improving the global efficiency. These findings offer impactful insight into how air, non-social, and social odours recruit distinct neural circuits, reinforcing the role of olfaction in human social behavior.
Collapse
Affiliation(s)
- Saideh Ferdowsi
- School of Mathematics, Statistics and Actuarial Science, University of Essex, Colchester, UK.
| | - Tom Foulsham
- Department of Psychology, University of Essex, Colchester, UK
| | | | - Dimitri Ognibene
- Department of Psychology, University of Milano-Bicocca, Milan, Italy
| | - Luca Citi
- School of Computer Science and Electronic Engineering, University of Essex, Colchester, UK
| | - Wen Li
- Louis A. Faillace, MD, Department of Psychiatry and Behavioral Sciences, University of Texas Health Science Center, Houston, TX, USA
| |
Collapse
|
2
|
Su XS, Zhang YB, Jin WJ, Zhang ZJ, Xie ZK, Wang RY, Wang YJ, Qiu Y. Lily viruses regulate the viral community of the Lanzhou lily rhizosphere and indirectly affect rhizosphere carbon and nitrogen cycling. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 955:176808. [PMID: 39396785 DOI: 10.1016/j.scitotenv.2024.176808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 10/05/2024] [Accepted: 10/06/2024] [Indexed: 10/15/2024]
Abstract
The rhizosphere, where plant roots interact intensely with the soil, is a crucial but understudied area in terms of the impact of virus infection. In this study, we investigated the effects of lily symptomless virus (LSV) and cucumber mosaic virus (CMV) on the Lanzhou lily (Lilium davidii var. unicolor) rhizosphere using metagenomics and bioinformatics analysis. We found that virus infection significantly altered soil pH, inorganic carbon, nitrate nitrogen, and total sulfur. Co-infection with LSV and CMV had a greater influence than single infections on the α- and β-diversity of the rhizosphere viral community in which the absolute abundance of certain virus families (Siphoviridae, Podoviridae, and Myoviridae) increased significantly, whereas bacteria, fungi, and archaea remained relatively unaffected. These altered virus populations influenced the rhizosphere microbial carbon and nitrogen cycles by exerting top-down control on bacteria. Co-infection potentially weakened rhizosphere carbon fixation and promoted processes such as methane oxidation, nitrification, and denitrification. In addition, the co-occurrence network of bacteria and viruses in the rhizosphere revealed substantial changes in microbial community composition under co-infection. Our partial-least-squares path model confirmed that the diversity of the rhizosphere viral community indirectly regulated the carbon and nitrogen cycling functions of the microbial community, thus affecting the accumulation of carbon and nitrogen nutrients in the soil. Our results are the first report of the effects of virus infection on the lily rhizosphere, particularly for co-infection; they therefore complement research on the plant virus pathogenic mechanisms, and increase our understanding of the ecological role of rhizosphere soil viruses.
Collapse
Affiliation(s)
- Xue-Si Su
- Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China; University of Chinese Academy of Sciences, Beijing 100049, China; Gansu Gaolan Field Scientific Observation and Research Station for Agricultural Ecosystem, Lanzhou 730000, China; Key Laboratory of Stress Physiology and Ecology in Cold and Arid Region, Gansu Province, Lanzhou 730000, China.
| | - Yu-Bao Zhang
- Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China; Gansu Gaolan Field Scientific Observation and Research Station for Agricultural Ecosystem, Lanzhou 730000, China; Key Laboratory of Stress Physiology and Ecology in Cold and Arid Region, Gansu Province, Lanzhou 730000, China.
| | - Wei-Jie Jin
- Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China; University of Chinese Academy of Sciences, Beijing 100049, China; Gansu Gaolan Field Scientific Observation and Research Station for Agricultural Ecosystem, Lanzhou 730000, China; Key Laboratory of Stress Physiology and Ecology in Cold and Arid Region, Gansu Province, Lanzhou 730000, China.
| | - Zhan-Jun Zhang
- Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China; University of Chinese Academy of Sciences, Beijing 100049, China; Gansu Gaolan Field Scientific Observation and Research Station for Agricultural Ecosystem, Lanzhou 730000, China; Key Laboratory of Stress Physiology and Ecology in Cold and Arid Region, Gansu Province, Lanzhou 730000, China
| | - Zhong-Kui Xie
- Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China; Gansu Gaolan Field Scientific Observation and Research Station for Agricultural Ecosystem, Lanzhou 730000, China; Key Laboratory of Stress Physiology and Ecology in Cold and Arid Region, Gansu Province, Lanzhou 730000, China.
| | - Ruo-Yu Wang
- Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China; Gansu Gaolan Field Scientific Observation and Research Station for Agricultural Ecosystem, Lanzhou 730000, China; Key Laboratory of Stress Physiology and Ecology in Cold and Arid Region, Gansu Province, Lanzhou 730000, China.
| | - Ya-Jun Wang
- Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China; Gansu Gaolan Field Scientific Observation and Research Station for Agricultural Ecosystem, Lanzhou 730000, China; Key Laboratory of Stress Physiology and Ecology in Cold and Arid Region, Gansu Province, Lanzhou 730000, China
| | - Yang Qiu
- Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China; Gansu Gaolan Field Scientific Observation and Research Station for Agricultural Ecosystem, Lanzhou 730000, China; Key Laboratory of Stress Physiology and Ecology in Cold and Arid Region, Gansu Province, Lanzhou 730000, China.
| |
Collapse
|
3
|
Large-scale community detection based on core node and layer-by-layer label propagation. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.02.090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
|
4
|
KO: Modularity optimization in community detection. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08284-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
5
|
Bouguessa M, Nouri K. BiNeTClus. ACM T INTEL SYST TEC 2021. [DOI: 10.1145/3423067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
We investigate the problem of community detection in bipartite networks that are characterized by the presence of two types of nodes such that connections exist only between nodes of different types. While some approaches have been proposed to identify community structures in bipartite networks, there are a number of problems still to solve. In fact, the majority of the proposed approaches suffer from one or even more of the following limitations: (1) difficulty in detecting communities in the presence of many non-discriminating nodes with atypical connections that hide the community structures, (2) loss of relevant topological information due to the transformation of the bipartite network to standard plain graphs, and (3) manually specifying several input parameters, including the number of communities to be identified. To alleviate these problems, we propose BiNeTClus, a parameter-free community detection algorithm in bipartite networks that operates in two phases. The first phase focuses on identifying an initial grouping of nodes through a transactional data model capable of dealing with the situation that involves networks with many atypical connections, that is, sparsely connected nodes and nodes of one type that massively connect to all other nodes of the second type. The second phase aims to refine the clustering results of the first phase via an optimization strategy of the bipartite modularity to identify the final community structures. Our experiments on both synthetic and real networks illustrate the suitability of the proposed approach.
Collapse
Affiliation(s)
| | - Khaled Nouri
- University of Quebec at Montreal, Montreal, Canada
| |
Collapse
|
6
|
Luo L, Liu K, Guo B, Ma J. User interaction-oriented community detection based on cascading analysis. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2019.09.022] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
7
|
|
8
|
Yu W, Wang W, Jiao P, Li X. Evolutionary clustering via graph regularized nonnegative matrix factorization for exploring temporal networks. Knowl Based Syst 2019. [DOI: 10.1016/j.knosys.2019.01.024] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
9
|
IntraClusTSP—An Incremental Intra-Cluster Refinement Heuristic Algorithm for Symmetric Travelling Salesman Problem. Symmetry (Basel) 2018. [DOI: 10.3390/sym10120663] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
The Symmetric Traveling Salesman Problem (sTSP) is an intensively studied NP-hard problem. It has many important real-life applications such as logistics, planning, manufacturing of microchips and DNA sequencing. In this paper we propose a cluster level incremental tour construction method called Intra-cluster Refinement Heuristic (IntraClusTSP). The proposed method can be used both to extend the tour with a new node and to improve the existing tour. The refinement step generates a local optimal tour for a cluster of neighbouring nodes and this local optimal tour is then merged into the global optimal tour. Based on the performed evaluation tests the proposed IntraClusTSP method provides an efficient incremental tour generation and it can improve the tour efficiency for every tested state-of-the-art methods including the most efficient Chained Lin-Kernighan refinement algorithm. As an application example, we apply IntraClusTSP to automatically determine the optimal number of clusters in a cluster analysis problem. The standard methods like Silhouette index, Elbow method or Gap statistic method, to estimate the number of clusters support only partitional (single level) clustering, while in many application areas, the hierarchical (multi-level) clustering provides a better clustering model. Our proposed method can discover hierarchical clustering structure and provides an outstanding performance both in accuracy and execution time.
Collapse
|
10
|
|
11
|
|
12
|
Memetic algorithm using node entropy and partition entropy for community detection in networks. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.02.063] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
13
|
Lu C, Ding Y, Zhang C. Understanding the impact change of a highly cited article: a content-based citation analysis. Scientometrics 2017. [DOI: 10.1007/s11192-017-2398-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
14
|
Multi-objective evolutionary algorithm using problem-specific genetic operators for community detection in networks. Neural Comput Appl 2017. [DOI: 10.1007/s00521-017-2884-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
15
|
Wang W, Jiao P, He D, Jin D, Pan L, Gabrys B. Autonomous overlapping community detection in temporal networks: A dynamic Bayesian nonnegative matrix factorization approach. Knowl Based Syst 2016. [DOI: 10.1016/j.knosys.2016.07.021] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
16
|
Ju Y, Zhang S, Ding N, Zeng X, Zhang X. Complex Network Clustering by a Multi-objective Evolutionary Algorithm Based on Decomposition and Membrane Structure. Sci Rep 2016; 6:33870. [PMID: 27670156 PMCID: PMC5037381 DOI: 10.1038/srep33870] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2016] [Accepted: 09/05/2016] [Indexed: 12/25/2022] Open
Abstract
The field of complex network clustering is gaining considerable attention in recent years. In this study, a multi-objective evolutionary algorithm based on membranes is proposed to solve the network clustering problem. Population are divided into different membrane structures on average. The evolutionary algorithm is carried out in the membrane structures. The population are eliminated by the vector of membranes. In the proposed method, two evaluation objectives termed as Kernel J-means and Ratio Cut are to be minimized. Extensive experimental studies comparison with state-of-the-art algorithms proves that the proposed algorithm is effective and promising.
Collapse
Affiliation(s)
- Ying Ju
- School of Information Science and Technology, Xiamen University, Xiamen, China
| | - Songming Zhang
- School of Information Science and Technology, Xiamen University, Xiamen, China
| | - Ningxiang Ding
- School of Information Science and Technology, Xiamen University, Xiamen, China
| | - Xiangxiang Zeng
- School of Information Science and Technology, Xiamen University, Xiamen, China
| | - Xingyi Zhang
- School of Computer Science and Technology, Anhui University, Anhui, China
| |
Collapse
|
17
|
A Novel Clustering Methodology Based on Modularity Optimisation for Detecting Authorship Affinities in Shakespearean Era Plays. PLoS One 2016; 11:e0157988. [PMID: 27571416 PMCID: PMC5003342 DOI: 10.1371/journal.pone.0157988] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2015] [Accepted: 06/08/2016] [Indexed: 01/22/2023] Open
Abstract
In this study we propose a novel, unsupervised clustering methodology for analyzing large datasets. This new, efficient methodology converts the general clustering problem into the community detection problem in graph by using the Jensen-Shannon distance, a dissimilarity measure originating in Information Theory. Moreover, we use graph theoretic concepts for the generation and analysis of proximity graphs. Our methodology is based on a newly proposed memetic algorithm (iMA-Net) for discovering clusters of data elements by maximizing the modularity function in proximity graphs of literary works. To test the effectiveness of this general methodology, we apply it to a text corpus dataset, which contains frequencies of approximately 55,114 unique words across all 168 written in the Shakespearean era (16th and 17th centuries), to analyze and detect clusters of similar plays. Experimental results and comparison with state-of-the-art clustering methods demonstrate the remarkable performance of our new method for identifying high quality clusters which reflect the commonalities in the literary style of the plays.
Collapse
|
18
|
Shang R, Zhang W, Jiao L. Circularly Searching Core Nodes Based Label Propagation Algorithm for Community Detection. INT J PATTERN RECOGN 2016. [DOI: 10.1142/s0218001416590242] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
With the application of community detection in complex networks becoming more and more extensive, the application of more and more algorithms for community detection are proposed and improved. Among these algorithms, the label propagation algorithm is simple, easy to perform and its time complexity is linear, but it has a strong randomness. Small communities in the label propagation process are easy to be swallowed. Therefore, this paper proposes a method to improve the partition results of label propagation algorithm based on the pre-partition by circularly searching core nodes and assigning label for nodes according to similarity of nodes. First, the degree of each node of the network is calculated. We go through the whole network to find the nodes with the maximal degrees in the neighbors as the core nodes. Next, we assign the core nodes’ labels to their neighbors according to the similarity between them, which can reduce the randomness of the label propagation algorithm. Then, we arrange the nodes whose labels had not been changed as the new network and find the new core nodes. After that, we update the labels of neighbor nodes according to the similarity between them again until the end of the iteration, to complete the pre-partition. The approach of circularly searching for core nodes increases the diversity of the network partition and prevents the smaller potential communities being swallowed in the process of partition. Then, we implement the label propagation algorithm on the whole network after the pre-partition. Finally, we adopt a modified method based on the degree of membership determined by the bidirectional attraction of nodes and their neighbor communities. This method can reduce the possibility of the error in partition of few nodes. Experiments on artificial and real networks show that the proposed algorithm can accurately divide the network and get higher degree of modularity compared with five existing algorithms.
Collapse
Affiliation(s)
- Ronghua Shang
- Key Laboratory of Intelligent Perception and Image, Understanding of Ministry of Education of China, Xidian University, Xi’an 710071, P. R. China
| | - Weitong Zhang
- Key Laboratory of Intelligent Perception and Image, Understanding of Ministry of Education of China, Xidian University, Xi’an 710071, P. R. China
| | - Licheng Jiao
- Key Laboratory of Intelligent Perception and Image, Understanding of Ministry of Education of China, Xidian University, Xi’an 710071, P. R. China
| |
Collapse
|
19
|
Abstract
Cancer therapy is challenged by the diversity of molecular implementations of oncogenic processes and by the resulting variation in therapeutic responses. Projects such as The Cancer Genome Atlas (TCGA) provide molecular tumor maps in unprecedented detail. The interpretation of these maps remains a major challenge. Here we distilled thousands of genetic and epigenetic features altered in cancers to ~500 selected functional events (SFEs). Using this simplified description, we derived a hierarchical classification of 3,299 TCGA tumors from 12 cancer types. The top classes are dominated by either mutations (M class) or copy number changes (C class). This distinction is clearest at the extremes of genomic instability, indicating the presence of different oncogenic processes. The full hierarchy shows functional event patterns characteristic of multiple cross-tissue groups of tumors, termed oncogenic signature classes. Targetable functional events in a tumor class are suggestive of class-specific combination therapy. These results may assist in the definition of clinical trials to match actionable oncogenic signatures with personalized therapies.
Collapse
|
20
|
Le Thi HA, Nguyen MC, Dinh TP. A DC Programming Approach for Finding Communities in Networks. Neural Comput 2014; 26:2827-54. [DOI: 10.1162/neco_a_00673] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Automatic discovery of community structures in complex networks is a fundamental task in many disciplines, including physics, biology, and the social sciences. The most used criterion for characterizing the existence of a community structure in a network is modularity, a quantitative measure proposed by Newman and Girvan ( 2004 ). The discovery community can be formulated as the so-called modularity maximization problem that consists of finding a partition of nodes of a network with the highest modularity. In this letter, we propose a fast and scalable algorithm called DCAM, based on DC (difference of convex function) programming and DCA (DC algorithms), an innovative approach in nonconvex programming framework for solving the modularity maximization problem. The special structure of the problem considered here has been well exploited to get an inexpensive DCA scheme that requires only a matrix-vector product at each iteration. Starting with a very large number of communities, DCAM furnishes, as output results, an optimal partition together with the optimal number of communities [Formula: see text]; that is, the number of communities is discovered automatically during DCAM’s iterations. Numerical experiments are performed on a variety of real-world network data sets with up to 4,194,304 nodes and 30,359,198 edges. The comparative results with height reference algorithms show that the proposed approach outperforms them not only on quality and rapidity but also on scalability. Moreover, it realizes a very good trade-off between the quality of solutions and the run time.
Collapse
Affiliation(s)
- Hoai An Le Thi
- Laboratory of Theoretical and Applied Computer Science, University of Lorraine, Ile du Saulcy, 57045 Metz, France
| | - Manh Cuong Nguyen
- Laboratory of Theoretical and Applied Computer Science, University of Lorraine, Ile du Saulcy, 57045 Metz, France
| | - Tao Pham Dinh
- Laboratoire of Mathematics, National Institute for Applied Sciences—Rouen, 76801 Saint-Étienne-du-Rouvray cedex, France
| |
Collapse
|
21
|
Detecting community structures in networks by label propagation with prediction of percolation transition. ScientificWorldJournal 2014; 2014:148686. [PMID: 25110725 PMCID: PMC4119666 DOI: 10.1155/2014/148686] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2014] [Revised: 06/17/2014] [Accepted: 06/17/2014] [Indexed: 11/21/2022] Open
Abstract
Though label propagation algorithm (LPA) is one of the fastest algorithms for community detection in complex networks, the problem of trivial solutions frequently occurring in the algorithm affects its performance. We propose a label propagation algorithm with prediction of percolation transition (LPAp). After analyzing the reason for multiple solutions of LPA, by transforming the process of community detection into network construction process, a trivial solution in label propagation is considered as a giant component in the percolation transition. We add a prediction process of percolation transition in label propagation to delay the occurrence of trivial solutions, which makes small communities easier to be found. We also give an incomplete update condition which considers both neighbor purity and the contribution of small degree vertices to community detection to reduce the computation time of LPAp. Numerical tests are conducted. Experimental results on synthetic networks and real-world networks show that the LPAp is more accurate, more sensitive to small community, and has the ability to identify a single community structure. Moreover, LPAp with the incomplete update process can use less computation time than LPA, nearly without modularity loss.
Collapse
|
22
|
|
23
|
Aldecoa R, Marín I. Exploring the limits of community detection strategies in complex networks. Sci Rep 2014; 3:2216. [PMID: 23860510 PMCID: PMC3713530 DOI: 10.1038/srep02216] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 06/18/2013] [Indexed: 12/05/2022] Open
Abstract
The characterization of network community structure has profound implications in several scientific areas. Therefore, testing the algorithms developed to establish the optimal division of a network into communities is a fundamental problem in the field. We performed here a highly detailed evaluation of community detection algorithms, which has two main novelties: 1) using complex closed benchmarks, which provide precise ways to assess whether the solutions generated by the algorithms are optimal; and, 2) A novel type of analysis, based on hierarchically clustering the solutions suggested by multiple community detection algorithms, which allows to easily visualize how different are those solutions. Surprise, a global parameter that evaluates the quality of a partition, confirms the power of these analyses. We show that none of the community detection algorithms tested provide consistently optimal results in all networks and that Surprise maximization, obtained by combining multiple algorithms, obtains quasi-optimal performances in these difficult benchmarks.
Collapse
Affiliation(s)
- Rodrigo Aldecoa
- Instituto de Biomedicina de Valencia, Consejo Superior de Investigaciones Científicas, Calle Jaime Roig 11, 46010, Valencia, Spain
| | | |
Collapse
|
24
|
Peixoto TP. Efficient Monte Carlo and greedy heuristic for the inference of stochastic block models. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 89:012804. [PMID: 24580278 DOI: 10.1103/physreve.89.012804] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2013] [Indexed: 05/22/2023]
Abstract
We present an efficient algorithm for the inference of stochastic block models in large networks. The algorithm can be used as an optimized Markov chain Monte Carlo (MCMC) method, with a fast mixing time and a much reduced susceptibility to getting trapped in metastable states, or as a greedy agglomerative heuristic, with an almost linear O(Nln2N) complexity, where N is the number of nodes in the network, independent of the number of blocks being inferred. We show that the heuristic is capable of delivering results which are indistinguishable from the more exact and numerically expensive MCMC method in many artificial and empirical networks, despite being much faster. The method is entirely unbiased towards any specific mixing pattern, and in particular it does not favor assortative community structures.
Collapse
Affiliation(s)
- Tiago P Peixoto
- Institut für Theoretische Physik, Universität Bremen, Hochschulring 18, D-28359 Bremen, Germany
| |
Collapse
|
25
|
Dormann CF, Strauss R. A method for detecting modules in quantitative bipartite networks. Methods Ecol Evol 2013. [DOI: 10.1111/2041-210x.12139] [Citation(s) in RCA: 322] [Impact Index Per Article: 26.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Carsten F. Dormann
- Biometry and Environmental System Analysis; University of Freiburg; Freiburg Germany
| | - Rouven Strauss
- Department of Computer Science; Technion-Israel Institute of Technology; Haifa Israel
| |
Collapse
|
26
|
Surprise maximization reveals the community structure of complex networks. Sci Rep 2013; 3:1060. [PMID: 23320141 PMCID: PMC3544010 DOI: 10.1038/srep01060] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2012] [Accepted: 12/28/2012] [Indexed: 11/09/2022] Open
Abstract
How to determine the community structure of complex networks is an open question. It is critical to establish the best strategies for community detection in networks of unknown structure. Here, using standard synthetic benchmarks, we show that none of the algorithms hitherto developed for community structure characterization perform optimally. Significantly, evaluating the results according to their modularity, the most popular measure of the quality of a partition, systematically provides mistaken solutions. However, a novel quality function, called Surprise, can be used to elucidate which is the optimal division into communities. Consequently, we show that the best strategy to find the community structure of all the networks examined involves choosing among the solutions provided by multiple algorithms the one with the highest Surprise value. We conclude that Surprise maximization precisely reveals the community structure of complex networks.
Collapse
|
27
|
Lee J, Gross SP, Lee J. Modularity optimization by conformational space annealing. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 85:056702. [PMID: 23004898 DOI: 10.1103/physreve.85.056702] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2012] [Indexed: 06/01/2023]
Abstract
We propose a modularity optimization method, Mod-CSA, based on stochastic global optimization algorithm, conformational space annealing (CSA). Our method outperforms simulated annealing in terms of both efficiency and accuracy, finding higher modularity partitions with less computational resources required. The high modularity values found by our method are higher than, or equal to, the largest values previously reported. In addition, the method can be combined with other heuristic methods, and implemented in parallel fashion, allowing it to be applicable to large graphs with more than 10,000 nodes.
Collapse
Affiliation(s)
- Juyong Lee
- School of Computational Sciences, Korea Institute of Advanced Study, Seoul, Korea.
| | | | | |
Collapse
|
28
|
Chang YT, Leahy RM, Pantazis D. Modularity-based graph partitioning using conditional expected models. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 85:016109. [PMID: 22400627 PMCID: PMC3880576 DOI: 10.1103/physreve.85.016109] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2011] [Revised: 10/03/2011] [Indexed: 05/31/2023]
Abstract
Modularity-based partitioning methods divide networks into modules by comparing their structure against random networks conditioned to have the same number of nodes, edges, and degree distribution. We propose a novel way to measure modularity and divide graphs, based on conditional probabilities of the edge strength of random networks. We provide closed-form solutions for the expected strength of an edge when it is conditioned on the degrees of the two neighboring nodes, or alternatively on the degrees of all nodes comprising the network. We analytically compute the expected network under the assumptions of Gaussian and Bernoulli distributions. When the Gaussian distribution assumption is violated, we prove that our expression is the best linear unbiased estimator. Finally, we investigate the performance of our conditional expected model in partitioning simulated and real-world networks.
Collapse
Affiliation(s)
- Yu-Teng Chang
- Department of Electrical Engineering, Signal and Image Processing Institute, University of Southern California, Los Angeles, California 90089, USA
| | | | | |
Collapse
|
29
|
Cafieri S, Hansen P, Liberti L. Locally optimal heuristic for modularity maximization of networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 83:056105. [PMID: 21728603 DOI: 10.1103/physreve.83.056105] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2010] [Revised: 02/08/2011] [Indexed: 05/31/2023]
Abstract
Community detection in networks based on modularity maximization is currently done with hierarchical divisive or agglomerative as well as partitioning heuristics, hybrids, and, in a few papers, exact algorithms. We consider here the case of hierarchical networks in which communities should be detected and propose a divisive heuristic which is locally optimal in the sense that each of the successive bipartitions is done in a provably optimal way. This heuristic is compared with the spectral-based hierarchical divisive heuristic of Newman [Proc. Natl. Acad. Sci. USA 103, 8577 (2006).] and with the hierarchical agglomerative heuristic of Clauset, Newman, and Moore [Phys. Rev. E 70, 066111 (2004).]. Computational results are given for a series of problems of the literature with up to 4941 vertices and 6594 edges. They show that the proposed divisive heuristic gives better results than the divisive heuristic of Newman and than the agglomerative heuristic of Clauset et al.
Collapse
Affiliation(s)
- Sonia Cafieri
- Laboratoire MAIA, École Nationale de l'Aviation Civile, 7 Avenue Edouard Belin, F-31055 Toulouse, France.
| | | | | |
Collapse
|
30
|
Abstract
Modularity is a widely used quality measure for graph clusterings. Its exact maximization is NP-hard and prohibitively expensive for large graphs. Popular heuristics first perform a coarsening phase, where local search starting from singleton clusters is used to compute a preliminary clustering, and then optionally a refinement phase, where this clustering is improved by moving vertices between clusters. As a generalization, multilevel heuristics coarsen in several stages, and refine by moving entire clusters from each of these stages, not only individual vertices.
This article organizes existing and new single-level and multilevel heuristics into a coherent design space, and compares them experimentally with respect to their effectiveness (achieved modularity) and runtime. For coarsening by iterated cluster joining, it turns out that the most widely used criterion for joining clusters (modularity increase) is outperformed by other simple criteria, that a recent multistep algorithm [Schuetz and Caflisch 2008] is no improvement over simple single-step coarsening for these criteria, and that the recent multilevel coarsening by iterated vertex moving [Blondel et al. 2008] is somewhat faster but slightly less effective (with refinement). The new multilevel refinement is significantly more effective than the conventional single-level refinement or no refinement, in reasonable runtime.
A comparison with published benchmark results and algorithm implementations shows that multilevel local search heuristics, despite their relative simplicity, are competitive with the best algorithms in the literature.
Collapse
Affiliation(s)
- Randolf Rotta
- Brandenburgische Technische Universität Cottbus, Germany
| | - Andreas Noack
- Brandenburgische Technische Universität Cottbus, Germany
| |
Collapse
|
31
|
Khadivi A, Ajdari Rad A, Hasler M. Network community-detection enhancement by proper weighting. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 83:046104. [PMID: 21599237 DOI: 10.1103/physreve.83.046104] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2010] [Revised: 12/22/2010] [Indexed: 05/30/2023]
Abstract
In this paper, we show how proper assignment of weights to the edges of a complex network can enhance the detection of communities and how it can circumvent the resolution limit and the extreme degeneracy problems associated with modularity. Our general weighting scheme takes advantage of graph theoretic measures and it introduces two heuristics for tuning its parameters. We use this weighting as a preprocessing step for the greedy modularity optimization algorithm of Newman to improve its performance. The result of the experiments of our approach on computer-generated and real-world data networks confirm that the proposed approach not only mitigates the problems of modularity but also improves the modularity optimization.
Collapse
Affiliation(s)
- Alireza Khadivi
- Laboratory of Nonlinear Systems, School of Computer and Communication Sciences, Ecole Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | | | | |
Collapse
|
32
|
Subelj L, Bajec M. Unfolding communities in large complex networks: combining defensive and offensive label propagation for core extraction. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 83:036103. [PMID: 21517554 DOI: 10.1103/physreve.83.036103] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Revised: 11/06/2010] [Indexed: 05/30/2023]
Abstract
Label propagation has proven to be a fast method for detecting communities in large complex networks. Recent developments have also improved the accuracy of the approach; however, a general algorithm is still an open issue. We present an advanced label propagation algorithm that combines two unique strategies of community formation, namely, defensive preservation and offensive expansion of communities. The two strategies are combined in a hierarchical manner to recursively extract the core of the network and to identify whisker communities. The algorithm was evaluated on two classes of benchmark networks with planted partition and on 23 real-world networks ranging from networks with tens of nodes to networks with several tens of millions of edges. It is shown to be comparable to the current state-of-the-art community detection algorithms and superior to all previous label propagation algorithms, with comparable time complexity. In particular, analysis on real-world networks has proven that the algorithm has almost linear complexity, O(m¹·¹⁹), and scales even better than the basic label propagation algorithm (m is the number of edges in the network).
Collapse
Affiliation(s)
- Lovro Subelj
- Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia.
| | | |
Collapse
|
33
|
Aloise D, Cafieri S, Caporossi G, Hansen P, Perron S, Liberti L. Column generation algorithms for exact modularity maximization in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 82:046112. [PMID: 21230350 DOI: 10.1103/physreve.82.046112] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2010] [Indexed: 05/30/2023]
Abstract
Finding modules, or clusters, in networks currently attracts much attention in several domains. The most studied criterion for doing so, due to Newman and Girvan [Phys. Rev. E 69, 026113 (2004)], is modularity maximization. Many heuristics have been proposed for maximizing modularity and yield rapidly near optimal solution or sometimes optimal ones but without a guarantee of optimality. There are few exact algorithms, prominent among which is a paper by Xu [Eur. Phys. J. B 60, 231 (2007)]. Modularity maximization can also be expressed as a clique partitioning problem and the row generation algorithm of Grötschel and Wakabayashi [Math. Program. 45, 59 (1989)] applied. We propose to extend both of these algorithms using the powerful column generation methods for linear and non linear integer programming. Performance of the four resulting algorithms is compared on problems from the literature. Instances with up to 512 entities are solved exactly. Moreover, the computing time of previously solved problems are reduced substantially.
Collapse
Affiliation(s)
- Daniel Aloise
- Department of Production Engineering, Universidade Federal do Rio Grande do Norte, Campus Universitário s/n, Natal, RN 59072-970, Brazil.
| | | | | | | | | | | |
Collapse
|
34
|
Radicchi F, Lancichinetti A, Ramasco JJ. Combinatorial approach to modularity. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 82:026102. [PMID: 20866871 DOI: 10.1103/physreve.82.026102] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2010] [Indexed: 05/29/2023]
Abstract
Communities are clusters of nodes with a higher than average density of internal connections. Their detection is of great relevance to better understand the structure and hierarchies present in a network. Modularity has become a standard tool in the area of community detection, providing at the same time a way to evaluate partitions and, by maximizing it, a method to find communities. In this work, we study the modularity from a combinatorial point of view. Our analysis (as the modularity definition) relies on the use of the configurational model, a technique that given a graph produces a series of randomized copies keeping the degree sequence invariant. We develop an approach that enumerates the null model partitions and can be used to calculate the probability distribution function of the modularity. Our theory allows for a deep inquiry of several interesting features characterizing modularity such as its resolution limit and the statistics of the partitions that maximize it. Additionally, the study of the probability of extremes of the modularity in the random graph partitions opens the way for a definition of the statistical significance of network partitions.
Collapse
Affiliation(s)
- Filippo Radicchi
- Complex Networks Lagrange Laboratory, ISI Foundation, Turin, Italy
| | | | | |
Collapse
|
35
|
Liu X, Murata T. An Efficient Algorithm for Optimizing Bipartite Modularity in Bipartite Networks. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2010. [DOI: 10.20965/jaciii.2010.p0408] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Modularity evaluates the quality of a division of network nodes into communities, and modularity optimization is the most widely used class of methods for detecting communities in networks. In bipartite networks, there are correspondingly bipartite modularity and bipartite modularity optimization. LPAb, a very fast label propagation algorithm based on bipartite modularity optimization, tends to become stuck in poor local maxima, yielding suboptimal community divisions with low bipartite modularity. We therefore propose LPAb+, a hybrid algorithm combining modified LPAb, or LPAb’, and MSG, a multistep greedy agglomerative algorithm, with the objective of using MSG to drive LPAb out of local maxima. We use four commonly used real-world bipartite networks to demonstrate LPAb+ capability in detecting community divisions with remarkably higher bipartite modularity than LPAb. We show how LPAb+ outperforms other bipartite modularity optimization algorithms, without compromising speed.
Collapse
|
36
|
Cafieri S, Hansen P, Liberti L. Loops and multiple edges in modularity maximization of networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:046102. [PMID: 20481781 DOI: 10.1103/physreve.81.046102] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2009] [Indexed: 05/29/2023]
Abstract
The modularity maximization model proposed by Newman and Girvan for the identification of communities in networks works for general graphs possibly with loops and multiple edges. However, the applications usually correspond to simple graphs. These graphs are compared to a null model where the degree distribution is maintained but edges are placed at random. Therefore, in this null model there will be loops and possibly multiple edges. Sharp bounds on the expected number of loops, and their impact on the modularity, are derived. Then, building upon the work of Massen and Doye, but using algebra rather than simulation, we propose modified null models associated with graphs without loops but with multiple edges, graphs with loops but without multiple edges and graphs without loops nor multiple edges. We validate our models by using the exact algorithm for clique partitioning of Grötschel and Wakabayashi.
Collapse
Affiliation(s)
- Sonia Cafieri
- Department Mathématiques et Informatique, Ecole Nationale de l'Aviation Civile, 7 av E Belin, F-31055 Toulouse, France.
| | | | | |
Collapse
|
37
|
Cafieri S, Hansen P, Liberti L. Edge ratio and community structure in networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:026105. [PMID: 20365629 DOI: 10.1103/physreve.81.026105] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2009] [Revised: 12/12/2009] [Indexed: 05/29/2023]
Abstract
A hierarchical divisive algorithm is proposed for identifying communities in complex networks. To that effect, the definition of community in the weak sense of Radicchi [Proc. Natl. Acad. Sci. U.S.A. 101, 2658 (2004)] is extended into a criterion for a bipartition to be optimal: one seeks to maximize the minimum for both classes of the bipartition of the ratio of inner edges to cut edges. A mathematical program is used within a dichotomous search to do this in an optimal way for each bipartition. This includes an exact solution of the problem of detecting indivisible communities. The resulting hierarchical divisive algorithm is compared with exact modularity maximization on both artificial and real world data sets. For two problems of the former kind optimal solutions are found; for five problems of the latter kind the edge ratio algorithm always appears to be competitive. Moreover, it provides additional information in several cases, notably through the use of the dendrogram summarizing the resolution. Finally, both algorithms are compared on reduced versions of the data sets of Girvan and Newman [Proc. Natl. Acad. Sci. U.S.A. 99, 7821 (2002)] and of Lancichinetti [Phys. Rev. E 78, 046110 (2008)]. Results for these instances appear to be comparable.
Collapse
Affiliation(s)
- Sonia Cafieri
- LIX, Ecole Polytechnique, F-91128 Palaiseau, France.
| | | | | |
Collapse
|
38
|
Lai Z, Su J, Chen W, Wang C. Uncovering the properties of energy-weighted conformation space networks with a hydrophobic-hydrophilic model. Int J Mol Sci 2009; 10:1808-1823. [PMID: 19468340 PMCID: PMC2680648 DOI: 10.3390/ijms10041808] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2009] [Revised: 03/30/2009] [Accepted: 04/07/2009] [Indexed: 11/16/2022] Open
Abstract
The conformation spaces generated by short hydrophobic-hydrophilic (HP) lattice chains are mapped to conformation space networks (CSNs). The vertices (nodes) of the network are the conformations and the links are the transitions between them. It has been found that these networks have "small-world" properties without considering the interaction energy of the monomers in the chain, i. e. the hydrophobic or hydrophilic amino acids inside the chain. When the weight based on the interaction energy of the monomers in the chain is added to the CSNs, it is found that the weighted networks show the "scale-free" characteristic. In addition, it reveals that there is a connection between the scale-free property of the weighted CSN and the folding dynamics of the chain by investigating the relationship between the scale-free structure of the weighted CSN and the noted parameter Z score. Moreover, the modular (community) structure of weighted CSNs is also studied. These results are helpful to understand the topological properties of the CSN and the underlying free-energy landscapes.
Collapse
Affiliation(s)
- Zaizhi Lai
- College of Life Science and Bioengineering, Beijing University of Technology, Beijing, 100124, P.R. China; E-Mails:
(Z.L.);
(J.S.);
(W.C.)
| | - Jiguo Su
- College of Life Science and Bioengineering, Beijing University of Technology, Beijing, 100124, P.R. China; E-Mails:
(Z.L.);
(J.S.);
(W.C.)
- College of Science, Yanshan University, Qinhuangdao, 066004, P.R. China
| | - Weizu Chen
- College of Life Science and Bioengineering, Beijing University of Technology, Beijing, 100124, P.R. China; E-Mails:
(Z.L.);
(J.S.);
(W.C.)
| | - Cunxin Wang
- College of Life Science and Bioengineering, Beijing University of Technology, Beijing, 100124, P.R. China; E-Mails:
(Z.L.);
(J.S.);
(W.C.)
| |
Collapse
|
39
|
Noack A. Modularity clustering is force-directed layout. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 79:026102. [PMID: 19391801 DOI: 10.1103/physreve.79.026102] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2008] [Indexed: 05/26/2023]
Abstract
Two natural and widely used representations for the community structure of networks are clusterings, which partition the vertex set into disjoint subsets, and layouts, which assign the vertices to positions in a metric space. This paper unifies prominent characterizations of layout quality and clustering quality, by showing that energy models of pairwise attraction and repulsion subsume Newman and Girvan's modularity measure. Layouts with optimal energy are relaxations of, and are thus consistent with, clusterings with optimal modularity, which is of practical relevance because the two representations are complementary and often used together.
Collapse
Affiliation(s)
- Andreas Noack
- Institute of Computer Science, Brandenburg University of Technology, 03013 Cottbus, Germany
| |
Collapse
|
40
|
|
41
|
Schuetz P, Caflisch A. Multistep greedy algorithm identifies community structure in real-world and computer-generated networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 78:026112. [PMID: 18850902 DOI: 10.1103/physreve.78.026112] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2008] [Indexed: 05/26/2023]
Abstract
We have recently introduced a multistep extension of the greedy algorithm for modularity optimization. The extension is based on the idea that merging l pairs of communities (l>1) at each iteration prevents premature condensation into few large communities. Here, an empirical formula is presented for the choice of the step width l that generates partitions with (close to) optimal modularity for 17 real-world and 1100 computer-generated networks. Furthermore, an in-depth analysis of the communities of two real-world networks (the metabolic network of the bacterium E. coli and the graph of coappearing words in the titles of papers coauthored by Martin Karplus) provides evidence that the partition obtained by the multistep greedy algorithm is superior to the one generated by the original greedy algorithm not only with respect to modularity, but also according to objective criteria. In other words, the multistep extension of the greedy algorithm reduces the danger of getting trapped in local optima of modularity and generates more reasonable partitions.
Collapse
Affiliation(s)
- Philipp Schuetz
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland.
| | | |
Collapse
|