1
|
|
2
|
Alcalá-Corona SA, Sandoval-Motta S, Espinal-Enríquez J, Hernández-Lemus E. Modularity in Biological Networks. Front Genet 2021; 12:701331. [PMID: 34594357 PMCID: PMC8477004 DOI: 10.3389/fgene.2021.701331] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 08/23/2021] [Indexed: 01/13/2023] Open
Abstract
Network modeling, from the ecological to the molecular scale has become an essential tool for studying the structure, dynamics and complex behavior of living systems. Graph representations of the relationships between biological components open up a wide variety of methods for discovering the mechanistic and functional properties of biological systems. Many biological networks are organized into a modular structure, so methods to discover such modules are essential if we are to understand the biological system as a whole. However, most of the methods used in biology to this end, have a limited applicability, as they are very specific to the system they were developed for. Conversely, from the statistical physics and network science perspective, graph modularity has been theoretically studied and several methods of a very general nature have been developed. It is our perspective that in particular for the modularity detection problem, biology and theoretical physics/network science are less connected than they should. The central goal of this review is to provide the necessary background and present the most applicable and pertinent methods for community detection in a way that motivates their further usage in biological research.
Collapse
Affiliation(s)
- Sergio Antonio Alcalá-Corona
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico.,Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Santiago Sandoval-Motta
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico.,Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City, Mexico.,National Council on Science and Technology, Mexico City, Mexico
| | - Jesús Espinal-Enríquez
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico.,Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Enrique Hernández-Lemus
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico.,Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City, Mexico
| |
Collapse
|
3
|
Luo S, Zhang Z, Zhang Y, Ma S. Co-Association Matrix-Based Multi-Layer Fusion for Community Detection in Attributed Networks. ENTROPY 2019; 21:e21010095. [PMID: 33266811 PMCID: PMC7514206 DOI: 10.3390/e21010095] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Revised: 01/17/2019] [Accepted: 01/17/2019] [Indexed: 11/21/2022]
Abstract
Community detection is a challenging task in attributed networks, due to the data inconsistency between network topological structure and node attributes. The problem of how to effectively and robustly fuse multi-source heterogeneous data plays an important role in community detection algorithms. Although some algorithms taking both topological structure and node attributes into account have been proposed in recent years, the fusion strategy is simple and usually adopts the linear combination method. As a consequence of this, the detected community structure is vulnerable to small variations of the input data. In order to overcome this challenge, we develop a novel two-layer representation to capture the latent knowledge from both topological structure and node attributes in attributed networks. Then, we propose a weighted co-association matrix-based fusion algorithm (WCMFA) to detect the inherent community structure in attributed networks by using multi-layer fusion strategies. It extends the community detection method from a single-view to a multi-view style, which is consistent with the thinking model of human beings. Experiments show that our method is superior to the state-of-the-art community detection algorithms for attributed networks.
Collapse
Affiliation(s)
- Sheng Luo
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China
| | - Zhifei Zhang
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China
- State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China
- Correspondence:
| | - Yuanjian Zhang
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China
| | - Shuwen Ma
- Research Center of Big Data and Network Security, Tongji University, Shanghai 200092, China
| |
Collapse
|
4
|
Li HJ, Bu Z, Wang Z, Cao J, Shi Y. Enhance the Performance of Network Computation by a Tunable Weighting Strategy. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2018. [DOI: 10.1109/tetci.2018.2829906] [Citation(s) in RCA: 80] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
5
|
Micro-blog user community discovery using generalized SimRank edge weighting method. PLoS One 2018; 13:e0196447. [PMID: 29734358 PMCID: PMC5937763 DOI: 10.1371/journal.pone.0196447] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Accepted: 04/09/2018] [Indexed: 11/19/2022] Open
Abstract
Community discovery is one of the most popular issues in analyzing and understanding a network. Previous research suggests that the discovery can be enhanced by assigning weights to the edges of the network. This paper proposes a novel edge weighting method, which balances both local and global weighting based on the idea of shared neighbor ranging between users and the interpersonal significance of the social network community. We assume that users belonging to the same community have similar relationship network structures. By controlling the measure of "neighborhood", this method can adequately adapt to real-world networks. Therefore, the famous similarity calculation method-SimRank-can be regarded as a special case of our method. According to the practical significance of social networks, we propose a new evaluation method that uses the communication rate to measure its divided demerit to better express users' interaction relations than the ordinary modularity Q. Furthermore, the fast Newman algorithm is extended to weighted networks. In addition, we use four real networks in the largest Chinese micro-blog website Sina. The results of experiments demonstrate that the proposed method easily meets the balancing requirements and is more robust to different kinds of networks. The experimental results also indicate that the proposed algorithm outperforms several conventional weighting methods.
Collapse
|
6
|
|
7
|
Shakeri H, Poggi-Corradini P, Albin N, Scoglio C. Network clustering and community detection using modulus of families of loops. Phys Rev E 2017; 95:012316. [PMID: 28208387 DOI: 10.1103/physreve.95.012316] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Indexed: 11/07/2022]
Abstract
We study the structure of loops in networks using the notion of modulus of loop families. We introduce an alternate measure of network clustering by quantifying the richness of families of (simple) loops. Modulus tries to minimize the expected overlap among loops by spreading the expected link usage optimally. We propose weighting networks using these expected link usages to improve classical community detection algorithms. We show that the proposed method enhances the performance of certain algorithms, such as spectral partitioning and modularity maximization heuristics, on standard benchmarks.
Collapse
Affiliation(s)
- Heman Shakeri
- Electrical and Computer Engineering Department, Kansas State University, Manhattan, Kansas 66506, USA
| | | | - Nathan Albin
- Mathematics Department, Kansas State University, Manhattan, Kansas 66506, USA
| | - Caterina Scoglio
- Electrical and Computer Engineering Department, Kansas State University, Manhattan, Kansas 66506, USA
| |
Collapse
|
8
|
Hu Y, Yang B, Wong HS. A weighted local view method based on observation over ground truth for community detection. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.03.028] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
9
|
Hu Y, Yang B. Characterizing the structure of large real networks to improve community detection. Neural Comput Appl 2016. [DOI: 10.1007/s00521-016-2264-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
10
|
Hoang SA, Bekiranov S. The network architecture of the Saccharomyces cerevisiae genome. PLoS One 2013; 8:e81972. [PMID: 24349163 PMCID: PMC3857230 DOI: 10.1371/journal.pone.0081972] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2013] [Accepted: 10/18/2013] [Indexed: 11/19/2022] Open
Abstract
We propose a network-based approach for surmising the spatial organization of genomes from high-throughput interaction data. Our strategy is based on methods for inferring architectural features of networks. Specifically, we employ a community detection algorithm to partition networks of genomic interactions. These community partitions represent an intuitive interpretation of genomic organization from interaction data. Furthermore, they are able to recapitulate known aspects of the spatial organization of the Saccharomyces cerevisiae genome, such as the rosette conformation of the genome, the clustering of centromeres, as well as tRNAs, and telomeres. We also demonstrate that simple architectural features of genomic interaction networks, such as cliques, can give meaningful insight into the functional role of the spatial organization of the genome. We show that there is a correlation between inter-chromosomal clique size and replication timing, as well as cohesin enrichment. Together, our network-based approach represents an effective and intuitive framework for interpreting high-throughput genomic interaction data. Importantly, there is a great potential for this strategy, given the rich literature and extensive set of existing tools in the field of network analysis.
Collapse
Affiliation(s)
- Stephen A. Hoang
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, Virginia, United States of America
| | - Stefan Bekiranov
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, Virginia, United States of America
| |
Collapse
|
11
|
Zhang S, Zhao H. Normalized modularity optimization method for community identification with degree adjustment. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 88:052802. [PMID: 24329313 DOI: 10.1103/physreve.88.052802] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2013] [Revised: 07/11/2013] [Indexed: 06/03/2023]
Abstract
As a fundamental problem in network study, community identification has attracted much attention from different fields. Representing a seminal work in this area, the modularity optimization method has been widely applied and studied. However, this method has issues in resolution limit and extreme degeneracy and may not perform well for networks with unbalanced structures. Although several methods have been proposed to overcome these limitations, they are all based on the original idea of defining modularity through comparing the total number of edges within the putative communities in the observed network with that in an equivalent randomly generated network. In this paper, we show that this modularity definition is not suitable to analyze some networks such as those with unbalanced structures. Instead, we propose to define modularity through the average degree within the communities and formulate modularity as comparing the sum of average degree within communities of the observed network to that of an equivalent randomly generated network. In addition, we also propose a degree-adjusted approach for further improvement when there are unbalanced structures. We analyze the theoretical properties of our degree adjusted method. Numerical experiments for both artificial networks and real networks demonstrate that average degree plays an important role in network community identification, and our proposed methods have better performance than existing ones.
Collapse
Affiliation(s)
- Shuqin Zhang
- Center for Computational Systems Biology, School of Mathematical Sciences, Fudan University, Shanghai 200433, China
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut 06520, USA
| |
Collapse
|
12
|
|
13
|
Zhang S, Zhao H. Community identification in networks with unbalanced structure. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 85:066114. [PMID: 23005169 DOI: 10.1103/physreve.85.066114] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2011] [Revised: 04/18/2012] [Indexed: 06/01/2023]
Abstract
Community (module) structure is a common and important property of many types of networks, such as social networks and biological networks. Several classes of algorithms have been proposed for community structure detection and identification, including clustering techniques, modularity optimization, and other methods. Among these methods, the modularity optimization method has attracted a great deal of attention and much related research has been published. However, the existing modularity optimization method does not perform well in the presence of unbalanced community structures. In this paper, we introduce a metric to characterize the community structure better than other metrics in this situation, and we propose a method to infer the number of communities, which may solve the resolution limit problem. We then develop an algorithm for community structure identification based on eigendecompositions, and we give both simulated and real data examples to illustrate the better performance of our approach.
Collapse
Affiliation(s)
- Shuqin Zhang
- Center for Computational Systems Biology, School of Mathematical Sciences, Fudan University, Shanghai 200433, China.
| | | |
Collapse
|
14
|
Lou X, Suykens JAK. Finding communities in weighted networks through synchronization. CHAOS (WOODBURY, N.Y.) 2011; 21:043116. [PMID: 22225353 DOI: 10.1063/1.3655371] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Community detection in weighted networks is an important challenge. In this paper, we introduce a local weight ratio scheme for identifying the community structures of weighted networks within the context of the Kuramoto model by taking into account weights of links. The proposed scheme takes full advantage of the information of the link density among vertices and the closeness of relations between each vertex and its neighbors. By means of this scheme, we explore the connection between community structures and dynamic time scales of synchronization. Moreover, we can also unravel the hierarchical structures of weighted networks with a well-defined connectivity pattern by the synchronization process. The performance of the proposed method is evaluated on both computer-generated benchmark graphs and real-world networks.
Collapse
Affiliation(s)
- Xuyang Lou
- Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Jiangnan University, Wuxi 214122, China.
| | | |
Collapse
|
15
|
Sun S, Dong X, Fu Y, Tian W. An iterative network partition algorithm for accurate identification of dense network modules. Nucleic Acids Res 2011; 40:e18. [PMID: 22121225 PMCID: PMC3273790 DOI: 10.1093/nar/gkr1103] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A key step in network analysis is to partition a complex network into dense modules. Currently, modularity is one of the most popular benefit functions used to partition network modules. However, recent studies suggested that it has an inherent limitation in detecting dense network modules. In this study, we observed that despite the limitation, modularity has the advantage of preserving the primary network structure of the undetected modules. Thus, we have developed a simple iterative Network Partition (iNP) algorithm to partition a network. The iNP algorithm provides a general framework in which any modularity-based algorithm can be implemented in the network partition step. Here, we tested iNP with three modularity-based algorithms: multi-step greedy (MSG), spectral clustering and Qcut. Compared with the original three methods, iNP achieved a significant improvement in the quality of network partition in a benchmark study with simulated networks, identified more modules with significantly better enrichment of functionally related genes in both yeast protein complex network and breast cancer gene co-expression network, and discovered more cancer-specific modules in the cancer gene co-expression network. As such, iNP should have a broad application as a general method to assist in the analysis of biological networks.
Collapse
Affiliation(s)
- Siqi Sun
- State Key Laboratory of Genetic Engineering, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai 200433, PR China
| | | | | | | |
Collapse
|