1
|
Robin V, Bodein A, Scott-Boyer MP, Leclercq M, Périn O, Droit A. Overview of methods for characterization and visualization of a protein-protein interaction network in a multi-omics integration context. Front Mol Biosci 2022; 9:962799. [PMID: 36158572 PMCID: PMC9494275 DOI: 10.3389/fmolb.2022.962799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 08/16/2022] [Indexed: 11/26/2022] Open
Abstract
At the heart of the cellular machinery through the regulation of cellular functions, protein-protein interactions (PPIs) have a significant role. PPIs can be analyzed with network approaches. Construction of a PPI network requires prediction of the interactions. All PPIs form a network. Different biases such as lack of data, recurrence of information, and false interactions make the network unstable. Integrated strategies allow solving these different challenges. These approaches have shown encouraging results for the understanding of molecular mechanisms, drug action mechanisms, and identification of target genes. In order to give more importance to an interaction, it is evaluated by different confidence scores. These scores allow the filtration of the network and thus facilitate the representation of the network, essential steps to the identification and understanding of molecular mechanisms. In this review, we will discuss the main computational methods for predicting PPI, including ones confirming an interaction as well as the integration of PPIs into a network, and we will discuss visualization of these complex data.
Collapse
Affiliation(s)
- Vivian Robin
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Antoine Bodein
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Marie-Pier Scott-Boyer
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Mickaël Leclercq
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Olivier Périn
- Digital Sciences Department, L'Oréal Advanced Research, Aulnay-sous-bois, France
| | - Arnaud Droit
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| |
Collapse
|
2
|
Hao J, Zhu W. Deep graph clustering with enhanced feature representations for community detection. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03381-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
3
|
Meng X, Li W, Peng X, Li Y, Li M. Protein interaction networks: centrality, modularity, dynamics, and applications. FRONTIERS OF COMPUTER SCIENCE 2021; 15:156902. [DOI: 10.1007/s11704-020-8179-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Accepted: 08/12/2020] [Indexed: 01/03/2025]
|
4
|
Multi-Objective Evolutionary Algorithms to Find Community Structures in Large Networks. MATHEMATICS 2020. [DOI: 10.3390/math8112048] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Real-world complex systems are often modeled by networks such that the elements are represented by vertices and their interactions are represented by edges. An important characteristic of these networks is that they contain clusters of vertices densely linked amongst themselves and more sparsely connected to nodes outside the cluster. Community detection in networks has become an emerging area of investigation in recent years, but most papers aim to solve single-objective formulations, often focused on optimizing structural metrics, including the modularity measure. However, several studies have highlighted that considering modularityas a unique objective often involves resolution limit and imbalance inconveniences. This paper opens a new avenue of research in the study of multi-objective variants of the classical community detection problem by applying multi-objective evolutionary algorithms that simultaneously optimize different objectives. In particular, they analyzed two multi-objective variants involving not only modularity but also the conductance metric and the imbalance in the number of nodes of the communities. With this aim, a new Pareto-based multi-objective evolutionary algorithm is presented that includes advanced initialization strategies and search operators. The results obtained when solving large-scale networks representing real-life power systems show the good performance of these methods and demonstrate that it is possible to obtain a balanced number of nodes in the clusters formed while also having high modularity and conductance values.
Collapse
|
5
|
Li W, Kang Q, Kong H, Liu C, Kang Y. A novel iterated greedy algorithm for detecting communities in complex network. SOCIAL NETWORK ANALYSIS AND MINING 2020. [DOI: 10.1007/s13278-020-00641-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
6
|
Duren Z, Chen X, Xin J, Wang Y, Wong WH. Time course regulatory analysis based on paired expression and chromatin accessibility data. Genome Res 2020; 30:622-634. [PMID: 32188700 PMCID: PMC7197475 DOI: 10.1101/gr.257063.119] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Accepted: 03/09/2020] [Indexed: 12/19/2022]
Abstract
A time course experiment is a widely used design in the study of cellular processes such as differentiation or response to stimuli. In this paper, we propose time course regulatory analysis (TimeReg) as a method for the analysis of gene regulatory networks based on paired gene expression and chromatin accessibility data from a time course. TimeReg can be used to prioritize regulatory elements, to extract core regulatory modules at each time point, to identify key regulators driving changes of the cellular state, and to causally connect the modules across different time points. We applied the method to analyze paired chromatin accessibility and gene expression data from a retinoic acid (RA)-induced mouse embryonic stem cells (mESCs) differentiation experiment. The analysis identified 57,048 novel regulatory elements regulating cerebellar development, synapse assembly, and hindbrain morphogenesis, which substantially extended our knowledge of cis-regulatory elements during differentiation. Using single-cell RNA-seq data, we showed that the core regulatory modules can reflect the properties of different subpopulations of cells. Finally, the driver regulators are shown to be important in clarifying the relations between modules across adjacent time points. As a second example, our method on Ascl1-induced direct reprogramming from fibroblast to neuron time course data identified Id1/2 as driver regulators of early stage of reprogramming.
Collapse
Affiliation(s)
- Zhana Duren
- Department of Statistics, Stanford University, Stanford, California 94305, USA
| | - Xi Chen
- Department of Statistics, Stanford University, Stanford, California 94305, USA
| | - Jingxue Xin
- Department of Statistics, Stanford University, Stanford, California 94305, USA
| | - Yong Wang
- CEMS, NCMIS, MDIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100080, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China
| | - Wing Hung Wong
- Department of Statistics, Stanford University, Stanford, California 94305, USA
- Department of Biomedical Data Science, Bio-X Program, Center for Personal Dynamic Regulomes, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
7
|
Tang L, Mostafa S, Liao B, Wu FX. A network clustering based feature selection strategy for classifying autism spectrum disorder. BMC Med Genomics 2019; 12:153. [PMID: 31888621 PMCID: PMC6936069 DOI: 10.1186/s12920-019-0598-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Accepted: 10/09/2019] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND Advanced non-invasive neuroimaging techniques offer new approaches to study functions and structures of human brains. Whole-brain functional networks obtained from resting state functional magnetic resonance imaging has been widely used to study brain diseases like autism spectrum disorder (ASD). Auto-classification of ASD has become an important issue. Existing classification methods for ASD are based on features extracted from the whole-brain functional networks, which may be not discriminant enough for good performance. METHODS In this study, we propose a network clustering based feature selection strategy for classifying ASD. In our proposed method, we first apply symmetric non-negative matrix factorization to divide brain networks into four modules. Then we extract features from one of four modules called default mode network (DMN) and use them to train several classifiers for ASD classification. RESULTS The computational experiments show that our proposed method achieves better performances than those trained with features extracted from the whole brain network. CONCLUSION It is a good strategy to train the classifiers for ASD based on features from the default mode subnetwork.
Collapse
Affiliation(s)
- Lingkai Tang
- Department of Mechanical Engineering, University of Saskatchewan, Saskatoon, S7N 5A9 Canada
| | - Sakib Mostafa
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, S7N 5A9 Canada
| | - Bo Liao
- School of Mathematics and Statistics, Hainan Normal University, Haikou, 571158 China
| | - Fang-Xiang Wu
- Department of Mechanical Engineering, University of Saskatchewan, Saskatoon, S7N 5A9 Canada
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, S7N 5A9 Canada
| |
Collapse
|
8
|
Everson J, Hollingsworth JM, Adler-Milstein J. Comparing methods of grouping hospitals. Health Serv Res 2019; 54:1090-1098. [PMID: 31197825 DOI: 10.1111/1475-6773.13188] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
OBJECTIVE To compare the performance of widely used approaches for defining groups of hospitals and a new approach based on network analysis of shared patient volume. STUDY SETTING Non-federal acute care hospitals in the United States. STUDY DESIGN We assessed the measurement properties of four methods of grouping hospitals: hospital referral regions (HRRs), metropolitan statistical areas (MSAs), core-based statistical areas (CBSAs), and community detection algorithms (CDAs). DATA EXTRACTION METHODS We combined data from the 2014 American Hospital Association Annual Survey, the Census Bureau, the Dartmouth Atlas, and Medicare data on interhospital patient travel patterns. We then evaluated the distinctiveness of each grouping, reliability over time, and generalizability across populations. PRINCIPLE FINDINGS Hospital groups defined by CDAs were the most distinctive (modularity = 0.86 compared to 0.75 for HRRs and 0.83 for MSAs; 0.72 for CBSA), were reliable to alternative specifications, and had greater generalizability than HRRs, MSAs, or CBSAs. CDAs had lower reliability over time than MSAs or CBSAs (normalized mutual information between 2012 and 2014 CDAs = 0.93). CONCLUSIONS Community detection algorithm-defined hospital groups offer high validity, reliability to different specifications, and generalizability to many uses when compared to approaches in widespread use today. They may, therefore, offer a better choice for efforts seeking to analyze the behaviors and dynamics of groups of hospitals. Measures of modularity, shared information, inclusivity, and shared behavior can be used to evaluate different approaches to grouping providers.
Collapse
Affiliation(s)
- Jordan Everson
- Department of Health Policy, Vanderbilt University School of Medicine, Nashville, Tennessee
| | - John M Hollingsworth
- The Institute for Healthcare Policy and Innovation, Ann Arbor, Michigan.,Dow Division of Health Services Research, Department of Urology, University of Michigan Medical School, Ann Arbor, Michigan
| | - Julia Adler-Milstein
- Department of Medicine, University of California, San Francisco, San Francisco, California
| |
Collapse
|
9
|
Li R, Ye F, Xie S, Chen C, Zheng Z. Digging into it: Community detection via hidden attributes analysis. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2018.11.059] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
10
|
|
11
|
Fuzzy clustering in community detection based on nonnegative matrix factorization with two novel evaluation criteria. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2016.12.019] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
12
|
Liu G, Chai B, Yang K, Yu J, Zhou X. Overlapping functional modules detection in PPI network with pair-wise constrained non-negative matrix tri-factorisation. IET Syst Biol 2018. [PMID: 29533217 PMCID: PMC8687432 DOI: 10.1049/iet-syb.2017.0084] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
A large amount of available protein–protein interaction (PPI) data has been generated by high‐throughput experimental techniques. Uncovering functional modules from PPI networks will help us better understand the underlying mechanisms of cellular functions. Numerous computational algorithms have been designed to identify functional modules automatically in the past decades. However, most community detection methods (non‐overlapping or overlapping types) are unsupervised models, which cannot incorporate the well‐known protein complexes as a priori. The authors propose a novel semi‐supervised model named pairwise constrains nonnegative matrix tri‐factorisation (PCNMTF), which takes full advantage of the well‐known protein complexes to find overlapping functional modules based on protein module indicator matrix and module correlation matrix simultaneously from PPI networks. PCNMTF determinately models and learns the mixed module memberships of each protein by considering the correlation among modules simultaneously based on the non‐negative matrix tri‐factorisation. The experiment results on both synthetic and real‐world biological networks demonstrate that PCNMTF gains more precise functional modules than that of state‐of‐the‐art methods.
Collapse
Affiliation(s)
- Guangming Liu
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, No. 3 Shangyuancun Haidian District, Beijing, People's Republic of China
| | - Bianfang Chai
- Department of Information Engineering, Hebei GEO University, Shijiazhuang, People's Republic of China
| | - Kuo Yang
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, No. 3 Shangyuancun Haidian District, Beijing, People's Republic of China
| | - Jian Yu
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, No. 3 Shangyuancun Haidian District, Beijing, People's Republic of China
| | - Xuezhong Zhou
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, No. 3 Shangyuancun Haidian District, Beijing, People's Republic of China.
| |
Collapse
|
13
|
Guerrero M, Montoya FG, Baños R, Alcayde A, Gil C. Adaptive community detection in complex networks using genetic algorithms. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2017.05.029] [Citation(s) in RCA: 83] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
14
|
Chen Z, Xie Z, Zhang Q. Community detection based on local topological information and its application in power grid. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2015.04.093] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
15
|
Yang L, Cao X, Jin D, Wang X, Meng D. A Unified Semi-Supervised Community Detection Framework Using Latent Space Graph Regularization. IEEE TRANSACTIONS ON CYBERNETICS 2015; 45:2585-2598. [PMID: 25532203 DOI: 10.1109/tcyb.2014.2377154] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Community structure is one of the most important properties of complex networks and is a foundational concept in exploring and understanding networks. In real world, topology information alone is often inadequate to accurately find community structure due to its sparsity and noises. However, potential useful prior information can be obtained from domain knowledge in many applications. Thus, how to improve the community detection performance by combining network topology with prior information becomes an interesting and challenging problem. Previous efforts on utilizing such priors are either dedicated or insufficient. In this paper, we firstly present a unified interpretation to a group of existing community detection methods. And then based on this interpretation, we propose a unified semi-supervised framework to integrate network topology with prior information for community detection. If the prior information indicates that some nodes belong to the same community, we encode it by adding a graph regularization term to penalize the latent space dissimilarity of these nodes. This framework can be applied to many widely-used matrix-based community detection methods satisfying our interpretation, such as nonnegative matrix factorization, spectral clustering, and their variants. Extensive experiments on both synthetic and real networks show that the proposed framework significantly improves the accuracy of community detection, especially on networks with unclear structures.
Collapse
|
16
|
Psorakis I, Roberts S, Ebden M, Sheldon B. Overlapping community detection using Bayesian non-negative matrix factorization. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 83:066114. [PMID: 21797448 DOI: 10.1103/physreve.83.066114] [Citation(s) in RCA: 85] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2011] [Revised: 05/17/2011] [Indexed: 05/31/2023]
Abstract
Identifying overlapping communities in networks is a challenging task. In this work we present a probabilistic approach to community detection that utilizes a Bayesian non-negative matrix factorization model to extract overlapping modules from a network. The scheme has the advantage of soft-partitioning solutions, assignment of node participation scores to modules, and an intuitive foundation. We present the performance of the method against a variety of benchmark problems and compare and contrast it to several other algorithms for community detection.
Collapse
Affiliation(s)
- Ioannis Psorakis
- Pattern Analysis and Machine Learning Research Group, Department of Engineering Science, University of Oxford, Oxford, United Kingdom.
| | | | | | | |
Collapse
|
17
|
Huang X, Zheng X, Yuan W, Wang F, Zhu S. Enhanced clustering of biomedical documents using ensemble non-negative matrix factorization. Inf Sci (N Y) 2011. [DOI: 10.1016/j.ins.2011.01.029] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
18
|
Ma X, Gao L, Yong X. Eigenspaces of networks reveal the overlapping and hierarchical community structure more precisely. JOURNAL OF STATISTICAL MECHANICS: THEORY AND EXPERIMENT 2010; 2010:P08012. [DOI: 10.1088/1742-5468/2010/08/p08012] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
19
|
Ma X, Gao L, Yong X, Fu L. Semi-supervised clustering algorithm for community structure detection in complex networks. PHYSICA A: STATISTICAL MECHANICS AND ITS APPLICATIONS 2010; 389:187-197. [DOI: 10.1016/j.physa.2009.09.018] [Citation(s) in RCA: 97] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|