1
|
Ding Q, Yang W, Xue G, Liu H, Cai Y, Que J, Jin X, Luo M, Pang F, Yang Y, Lin Y, Liu Y, Sun H, Tan R, Wang P, Xu Z, Jiang Q. Dimension reduction, cell clustering, and cell-cell communication inference for single-cell transcriptomics with DcjComm. Genome Biol 2024; 25:241. [PMID: 39252099 PMCID: PMC11382422 DOI: 10.1186/s13059-024-03385-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 08/30/2024] [Indexed: 09/11/2024] Open
Abstract
Advances in single-cell transcriptomics provide an unprecedented opportunity to explore complex biological processes. However, computational methods for analyzing single-cell transcriptomics still have room for improvement especially in dimension reduction, cell clustering, and cell-cell communication inference. Herein, we propose a versatile method, named DcjComm, for comprehensive analysis of single-cell transcriptomics. DcjComm detects functional modules to explore expression patterns and performs dimension reduction and clustering to discover cellular identities by the non-negative matrix factorization-based joint learning model. DcjComm then infers cell-cell communication by integrating ligand-receptor pairs, transcription factors, and target genes. DcjComm demonstrates superior performance compared to state-of-the-art methods.
Collapse
Affiliation(s)
- Qian Ding
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Wenyi Yang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Guangfu Xue
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Hongxin Liu
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Yideng Cai
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Jinhao Que
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Xiyun Jin
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Meng Luo
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Fenglan Pang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Yuexin Yang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China
| | - Yi Lin
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Yusong Liu
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Haoxiu Sun
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Renjie Tan
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China
| | - Pingping Wang
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China.
| | - Zhaochun Xu
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China.
| | - Qinghua Jiang
- Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150000, China.
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, 150076, China.
- State Key Laboratory of Frigid Zone Cardiovascular Diseases (SKLFZCD), Harbin Medical University, Harbin, 150076, China.
| |
Collapse
|
2
|
Guo Z, Zhao X, Yao L, Long Z. Improved brain community structure detection by two-step weighted modularity maximization. PLoS One 2023; 18:e0295428. [PMID: 38064462 PMCID: PMC10707683 DOI: 10.1371/journal.pone.0295428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 11/21/2023] [Indexed: 12/18/2023] Open
Abstract
The human brain can be regarded as a complex network with interacting connections between brain regions. Complex brain network analyses have been widely applied to functional magnetic resonance imaging (fMRI) data and have revealed the existence of community structures in brain networks. The identification of communities may provide insight into understanding the topological functions of brain networks. Among various community detection methods, the modularity maximization (MM) method has the advantages of model conciseness, fast convergence and strong adaptability to large-scale networks and has been extended from single-layer networks to multilayer networks to investigate the community structure changes of brain networks. However, the problems of MM, suffering from instability and failing to detect hierarchical community structure in networks, largely limit the application of MM in the community detection of brain networks. In this study, we proposed the weighted modularity maximization (WMM) method by using the weight matrix to weight the adjacency matrix and improve the performance of MM. Moreover, we further proposed the two-step WMM method to detect the hierarchical community structures of networks by utilizing node attributes. The results of the synthetic networks without node attributes demonstrated that WMM showed better partition accuracy than both MM and robust MM and better stability than MM. The two-step WMM method showed better accuracy of community partitioning than WMM for synthetic networks with node attributes. Moreover, the results of resting state fMRI (rs-fMRI) data showed that two-step WMM had the advantage of detecting the hierarchical communities over WMM and was more insensitive to the density of the rs-fMRI networks than WMM.
Collapse
Affiliation(s)
- Zhitao Guo
- School of Artificial Intelligence, Beijing Normal University, Beijing, China
| | - Xiaojie Zhao
- School of Artificial Intelligence, Beijing Normal University, Beijing, China
| | - Li Yao
- School of Artificial Intelligence, Beijing Normal University, Beijing, China
| | - Zhiying Long
- School of Artificial Intelligence, Beijing Normal University, Beijing, China
| |
Collapse
|
3
|
Xiao G, Guan R, Cao Y, Huang Z, Xu Y. KISL: knowledge-injected semi-supervised learning for biological co-expression network modules. Front Genet 2023; 14:1151962. [PMID: 37205122 PMCID: PMC10185879 DOI: 10.3389/fgene.2023.1151962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 04/11/2023] [Indexed: 05/21/2023] Open
Abstract
The exploration of important biomarkers associated with cancer development is crucial for diagnosing cancer, designing therapeutic interventions, and predicting prognoses. The analysis of gene co-expression provides a systemic perspective on gene networks and can be a valuable tool for mining biomarkers. The main objective of co-expression network analysis is to discover highly synergistic sets of genes, and the most widely used method is weighted gene co-expression network analysis (WGCNA). With the Pearson correlation coefficient, WGCNA measures gene correlation, and uses hierarchical clustering to identify gene modules. The Pearson correlation coefficient reflects only the linear dependence between variables, and the main drawback of hierarchical clustering is that once two objects are clustered together, the process cannot be reversed. Hence, readjusting inappropriate cluster divisions is not possible. Existing co-expression network analysis methods rely on unsupervised methods that do not utilize prior biological knowledge for module delineation. Here we present a method for identification of outstanding modules in a co-expression network using a knowledge-injected semi-supervised learning approach (KISL), which utilizes apriori biological knowledge and a semi-supervised clustering method to address the issue existing in the current GCN-based clustering methods. To measure the linear and non-linear dependence between genes, we introduce a distance correlation due to the complexity of the gene-gene relationship. Eight RNA-seq datasets of cancer samples are used to validate its effectiveness. In all eight datasets, the KISL algorithm outperformed WGCNA when comparing the silhouette coefficient, Calinski-Harabasz index and Davies-Bouldin index evaluation metrics. According to the results, KISL clusters had better cluster evaluation values and better gene module aggregation. Enrichment analysis of the recognition modules demonstrated their effectiveness in discovering modular structures in biological co-expression networks. In addition, as a general method, KISL can be applied to various co-expression network analyses based on similarity metrics. Source codes for the KISL and the related scripts are available online at https://github.com/Mowonhoo/KISL.git.
Collapse
Affiliation(s)
- Gangyi Xiao
- College of Computer Science and Technology, Jilin University, Changchun, China
| | - Renchu Guan
- College of Computer Science and Technology, Jilin University, Changchun, China
| | - Yangkun Cao
- School of Artificial Intelligence Jilin University, Changchun, China
| | - Zhenyu Huang
- College of Computer Science and Technology, Jilin University, Changchun, China
- *Correspondence: Ying Xu, ; Zhenyu Huang,
| | - Ying Xu
- School of Medicine, Southern University of Science and Technology, Shenzhen, Guangdong, China
- *Correspondence: Ying Xu, ; Zhenyu Huang,
| |
Collapse
|
4
|
Manipur I, Giordano M, Piccirillo M, Parashuraman S, Maddalena L. Community Detection in Protein-Protein Interaction Networks and Applications. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:217-237. [PMID: 34951849 DOI: 10.1109/tcbb.2021.3138142] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The ability to identify and characterize not only the protein-protein interactions but also their internal modular organization through network analysis is fundamental for understanding the mechanisms of biological processes at the molecular level. Indeed, the detection of the network communities can enhance our understanding of the molecular basis of disease pathology, and promote drug discovery and disease treatment in personalized medicine. This work gives an overview of recent computational methods for the detection of protein complexes and functional modules in protein-protein interaction networks, also providing a focus on some of its applications. We propose a systematic reformulation of frequently adopted taxonomies for these methods, also proposing new categories to keep up with the most recent research. We review the literature of the last five years (2017-2021) and provide links to existing data and software resources. Finally, we survey recent works exploiting module identification and analysis, in the context of a variety of disease processes for biomarker identification and therapeutic target detection. Our review provides the interested reader with an up-to-date and self-contained view of the existing research, with links to state-of-the-art literature and resources, as well as hints on open issues and future research directions in complex detection and its applications.
Collapse
|
5
|
Large-scale prediction of key dynamic interacting proteins in multiple cancers. Int J Biol Macromol 2022; 220:1124-1132. [PMID: 36027989 DOI: 10.1016/j.ijbiomac.2022.08.125] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Revised: 08/15/2022] [Accepted: 08/17/2022] [Indexed: 11/21/2022]
Abstract
Tracking cancer dynamic protein-protein interactions (PPIs) and deciphering their pathogenesis remain a challenge. We presented a dynamic PPIs' hypothesis: permanent and transient interactions might achieve dynamic switchings from normal cells to malignancy, which could cause maintenance functions to be interrupted and transient functions to be sustained. Based on the hypothesis, we first predicted >1400 key cancer genes (KCG) by applying PPI-express we proposed to 18 cancer gene expression datasets. We then further screened out key dynamic interactions (KDI) of cancer based on KCG and transient and permanent interactions under both conditions. Two prominent functional characteristics, "Cell cycle-related" and "Immune-related", were presented for KCG, suggesting that these might be their general characteristics. We found that, compared to permanent to transient KDI pairs (P2T) in the network, transient to permanent (T2P) have significantly higher edge betweenness (EB), and P2T pairs tending to locate intra-functional modules may play roles in maintaining normal biological functions, while T2P KDI pairs tending to locate inter-modules may play roles in biological signal transduction. It was consistent with our hypothesis. Also, we analyzed network characteristics of KDI pairs and their functions. Our findings of KDI may serve to understand and explain a few hallmarks of cancer.
Collapse
|
6
|
Li N, Jin D, Wei J, Huang Y, Xu J. Functional brain abnormalities in major depressive disorder using a multiscale community detection approach. Neuroscience 2022; 501:1-10. [PMID: 35964834 DOI: 10.1016/j.neuroscience.2022.08.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 08/04/2022] [Accepted: 08/05/2022] [Indexed: 11/28/2022]
Abstract
Major depressive disorder (MDD) is a serious disease associated with abnormal brain regions, however, the interconnection between specific brain regions related to depression has not been fully explored. To solve this problem, the paper proposes a novel multiscale community detection method to compare the differences in brain regions between normal controls (NC) and MDD patients. This study adopted the Brainnetome Atlas to divide the brain into 246 regions and extract the time series of each region. The Pearson correlation was used to measure the similarity among different brain regions to conduct the brain functional network and to perform multiscale community detection. The optimal brain community structure of each group was further explored based on the modularized Qcut algorithm, normalized mutual information (NMI), and variation of information (VI). The Jaccard index was then applied to compare the abnormalities of each brain region from different community environments between the brain function networks of NC and MDD patients. The experiments revealed several abnormal brain regions between NC and MDD, including the superior frontal gyrus, middle frontal gyrus, inferior frontal gyrus, orbital gyrus, superior temporal gyrus, middle temporal gyrus, inferior temporal gyrus, posterior superior temporal sulcus, inferior parietal gyrus, precuneus, postcentral gyrus, insular gyrus, cingulate gyrus, hippocampus and basal ganglia. Finally, a new subnetwork related to cognitive function was discovered, which was composed of the island gyrus and inferior frontal gyrus. All experiments indicated that the proposed method is useful in detecting functional brain abnormalities in MDD, and it can provide valuable insights into the diagnosis and treatment of MDD.
Collapse
Affiliation(s)
- Na Li
- Tianjin Key Lab of Cognitive Computing and Application, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Di Jin
- Tianjin Key Lab of Cognitive Computing and Application, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Jianguo Wei
- Tianjin Key Lab of Cognitive Computing and Application, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Yuxiao Huang
- Columbian College of Arts & Sciences, George Washington University, Washington D.C., USA
| | - Junhai Xu
- Tianjin Key Lab of Cognitive Computing and Application, College of Intelligence and Computing, Tianjin University, Tianjin, China.
| |
Collapse
|
7
|
Sharma P, Rai P, Pandey AK. Detecting Protein Complexes Using Multiple Node Property of PPI Networks. 2021 4TH INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTER SCIENCE AND TECHNOLOGY (ICRTCST) 2022:218-222. [DOI: 10.1109/icrtcst54752.2022.9781859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Affiliation(s)
| | - Pankaj Rai
- BIT Sindri,Department of EE,Dhanbad,828123
| | | |
Collapse
|
8
|
Abstract
Network modeling transforms data into a structure of nodes and edges such that edges represent relationships between pairs of objects, then extracts clusters of densely connected nodes in order to capture high-dimensional relationships hidden in the data. This efficient and flexible strategy holds potential for unveiling complex patterns concealed within massive datasets, but standard implementations overlook several key issues that can undermine research efforts. These issues range from data imputation and discretization to correlation metrics, clustering methods, and validation of results. Here, we enumerate these pitfalls and provide practical strategies for alleviating their negative effects. These guidelines increase prospects for future research endeavors as they reduce type I and type II (false-positive and false-negative) errors and are generally applicable for network modeling applications across diverse domains.
Collapse
Affiliation(s)
- Sharlee Climer
- Department of Computer Science, University of Missouri – St. Louis, St. Louis, MO, USA
| |
Collapse
|
9
|
Mining social applications network from business perspective using modularity maximization for community detection. SOCIAL NETWORK ANALYSIS AND MINING 2021. [DOI: 10.1007/s13278-021-00798-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
10
|
Lyu C, Shi Y, Sun L. A Novel Local Community Detection Method Using Evolutionary Computation. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:3348-3360. [PMID: 31449039 DOI: 10.1109/tcyb.2019.2933041] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The local community detection is a significant branch of the community detection problems. It aims at finding the local community to which a given starting node belongs. The local community detection plays an important role in analyzing the complex networks and recently has drawn much attention from the researchers. In the past few years, several local community detection algorithms have been proposed. However, the previous methods only make use of the limited local information of networks but overlook the other valuable information. In this article, we propose an evolutionary computation-based algorithm called evolutionary-based local community detection (ELCD) algorithm to detect local communities in the complex networks by taking advantages of the entire obtained information. The performance of the proposed algorithm is evaluated on both synthetic and real-world benchmark networks. The experimental results show that the proposed algorithm has a superior performance compared with the state-of-the-art local community detection methods. Furthermore, we test the proposed algorithm on incomplete real-world networks to show its effectiveness on the networks whose global information cannot be obtained.
Collapse
|
11
|
Liu P, Tian W. Identification of DNA methylation patterns and biomarkers for clear-cell renal cell carcinoma by multi-omics data analysis. PeerJ 2020; 8:e9654. [PMID: 32832275 PMCID: PMC7409785 DOI: 10.7717/peerj.9654] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 07/13/2020] [Indexed: 12/30/2022] Open
Abstract
Background Tumorigenesis is highly heterogeneous, and using clinicopathological signatures only is not enough to effectively distinguish clear cell renal cell carcinoma (ccRCC) and improve risk stratification of patients. DNA methylation (DNAm) with the stability and reversibility often occurs in the early stage of tumorigenesis. Disorders of transcription and metabolism are also an important molecular mechanisms of tumorigenesis. Therefore, it is necessary to identify effective biomarkers involved in tumorigenesis through multi-omics analysis, and these biomarkers also provide new potential therapeutic targets. Method The discovery stage involved 160 pairs of ccRCC and matched normal tissues for investigation of DNAm and biomarkers as well as 318 cases of ccRCC including clinical signatures. Correlation analysis of epigenetic, transcriptomic and metabolomic data revealed the connection and discordance among multi-omics and the deregulated functional modules. Diagnostic or prognostic biomarkers were obtained by the correlation analysis, the Least Absolute Shrinkage and Selection Operator (LASSO) and the LASSO-Cox methods. Two classifiers were established based on random forest (RF) and LASSO-Cox algorithms in training datasets. Seven independent datasets were used to evaluate robustness and universality. The molecular biological function of biomarkers were investigated using DAVID and GeneMANIA. Results Based on multi-omics analysis, the epigenetic measurements uniquely identified DNAm dysregulation of cellular mechanisms resulting in transcriptomic alterations, including cell proliferation, immune response and inflammation. Combination of the gene co-expression network and metabolic network identified 134 CpG sites (CpGs) as potential biomarkers. Based on the LASSO and RF algorithms, five CpGs were obtained to build a diagnostic classifierwith better classification performance (AUC > 99%). A eight-CpG-based prognostic classifier was obtained to improve risk stratification (hazard ratio (HR) > 4; log-rank test, p-value < 0.01). Based on independent datasets and seven additional cancers, the diagnostic and prognostic classifiers also had better robustness and stability. The molecular biological function of genes with abnormal methylation were significantly associated with glycolysis/gluconeogenesis and signal transduction. Conclusion The present study provides a comprehensive analysis of ccRCC using multi-omics data. These findings indicated that multi-omics analysis could identify some novel epigenetic factors, which were the most important causes of advanced cancer and poor clinical prognosis. Diagnostic and prognostic biomarkers were identified, which provided a promising avenue to develop effective therapies for ccRCC.
Collapse
Affiliation(s)
- Pengfei Liu
- State Key Laboratory of Genetic Engineering and Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China.,Department of Biostatistics and Computational Biology, School of Life Sciences, Fudan University, Shanghai, China
| | - Weidong Tian
- State Key Laboratory of Genetic Engineering and Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China.,Department of Biostatistics and Computational Biology, School of Life Sciences, Fudan University, Shanghai, China.,Children's Hospital of Fudan University, Shanghai, China
| |
Collapse
|
12
|
Abstract
In this paper, we proposed a novel community detection method based on the network structure transformation, that utilized deep learning. The probability transfer matrix of the network adjacency matrix was calculated, and the probability transfer matrix was used as the input of the deep learning network. We use a denoising autoencoder to nonlinearly map the probability transfer matrix into a new sub space. The community detection was calculated with the deep learning nonlinear transform of the network structure. The network nodes were clustered in the new space with the K-means clustering algorithm. The division of the community structure was obtained. We conducted extensive experimental tests on the benchmark networks and the standard networks (known as the initial division of communities). We tested the clustering results of the different types, and compared with the three base algorithms. The results showed that the proposed community detection model was effective. We compared the results with other traditional community detection methods. The empirical results on datasets of varying sizes demonstrated that our proposed method outperformed the other community detection methods for this task.
Collapse
|
13
|
Wang T, Peng J, Peng Q, Wang Y, Chen J. FSM: Fast and scalable network motif discovery for exploring higher-order network organizations. Methods 2019; 173:83-93. [PMID: 31306744 DOI: 10.1016/j.ymeth.2019.07.008] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Revised: 06/30/2019] [Accepted: 07/09/2019] [Indexed: 01/06/2023] Open
Abstract
Networks exhibit rich and diverse higher-order organizational structures. Network motifs, which are recurring significant patterns of inter-connections, are recognized as fundamental units to study the higher-order organizations of networks. However, the principle of selecting representative network motifs for local motif based clustering remains largely unexplored. We present a scalable algorithm called FSM for network motif discovery. FSM is advantageous in twofold. First, it accelerates the motif discovery process by effectively reducing the number of times for subgraph isomorphism labeling. Second, FSM adopts multiple heuristic optimizations for subgraph enumeration and classification to further improve its performance. Experimental results on biological networks show that, comparing with the existing network motif discovery algorithm, FSM is more efficient on computational efficiency and memory usage. Furthermore, with the large, frequent, and sparse network motifs discovered by FSM, the higher-order organizational structures of biological networks were successfully revealed, indicating that FSM is suitable to select network representative network motifs for exploring high-order network organizations.
Collapse
Affiliation(s)
- Tao Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Jiajie Peng
- School of Computer Science, Northwestern Polytechnical University, Xi'an, China
| | - Qidi Peng
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Yadong Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.
| | - Jin Chen
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY, USA.
| |
Collapse
|
14
|
|
15
|
Rasti S, Vogiatzis C. A survey of computational methods in protein–protein interaction networks. ANNALS OF OPERATIONS RESEARCH 2019; 276:35-87. [DOI: 10.1007/s10479-018-2956-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
16
|
Sharma P, Bhattacharyya D, Kalita J. Detecting protein complexes based on a combination of topological and biological properties in protein-protein interaction network. J Genet Eng Biotechnol 2018; 16:217-226. [PMID: 30647725 PMCID: PMC6296571 DOI: 10.1016/j.jgeb.2017.11.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Revised: 11/01/2017] [Accepted: 11/17/2017] [Indexed: 01/04/2023]
Abstract
Protein complexes are known to play a major role in controlling cellular activity in a living being. Identifying complexes from raw protein protein interactions (PPIs) is an important area of research. Earlier work has been limited mostly to yeast. Such protein complex identification methods, when applied to large human PPIs often give poor performance. We introduce a novel method called CSC to detect protein complexes. The method is evaluated in terms of positive predictive value, sensitivity and accuracy using the datasets of the model organism, yeast and humans. CSC outperforms several other competing algorithms for both organisms. Further, we present a framework to establish the usefulness of CSC in analyzing the influence of a given disease gene in a complex topologically as well as biologically considering eight major association factors.
Collapse
Affiliation(s)
- Pooja Sharma
- Department of Computer Science & Engineering, Tezpur University Napaam, Tezpur 784028, Assam, India
| | - D.K. Bhattacharyya
- Department of Computer Science & Engineering, Tezpur University Napaam, Tezpur 784028, Assam, India
| | - J.K. Kalita
- Department of Computer Science, University of Colorado at Colorado, Springs, CO 80933-7150, USA
| |
Collapse
|
17
|
Abstract
Protein complexes are known to play a major role in controlling cellular activity in a living being. Identifying complexes from raw protein-protein interactions (PPIs) is an important area of research. Earlier work has been limited mostly to yeast and a few other model organisms. Such protein complex identification methods, when applied to large human PPIs often give poor performance. We introduce a novel method called ComFiR to detect such protein complexes and further rank diseased complexes based on a query disease. We have shown that it has better performance in identifying protein complexes from human PPI data. This method is evaluated in terms of positive predictive value, sensitivity and accuracy. We have introduced a ranking approach and showed its application on Alzheimer's disease.
Collapse
|
18
|
|
19
|
Sharma P, Bhattacharyya D. DCRS: A Multi-objective Protein Complex Finding Method. LECTURE NOTES IN NETWORKS AND SYSTEMS 2018:801-809. [DOI: 10.1007/978-981-10-6890-4_76] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
20
|
Liu C, Brattico E, Abu-Jamous B, Pereira CS, Jacobsen T, Nandi AK. Effect of Explicit Evaluation on Neural Connectivity Related to Listening to Unfamiliar Music. Front Hum Neurosci 2017; 11:611. [PMID: 29311874 PMCID: PMC5742221 DOI: 10.3389/fnhum.2017.00611] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2017] [Accepted: 11/30/2017] [Indexed: 12/26/2022] Open
Abstract
People can experience different emotions when listening to music. A growing number of studies have investigated the brain structures and neural connectivities associated with perceived emotions. However, very little is known about the effect of an explicit act of judgment on the neural processing of emotionally-valenced music. In this study, we adopted the novel consensus clustering paradigm, called binarisation of consensus partition matrices (Bi-CoPaM), to study whether and how the conscious aesthetic evaluation of the music would modulate brain connectivity networks related to emotion and reward processing. Participants listened to music under three conditions - one involving a non-evaluative judgment, one involving an explicit evaluative aesthetic judgment, and one involving no judgment at all (passive listening only). During non-evaluative attentive listening we obtained auditory-limbic connectivity whereas when participants were asked to decide explicitly whether they liked or disliked the music excerpt, only two clusters of intercommunicating brain regions were found: one including areas related to auditory processing and action observation, and the other comprising higher-order structures involved with visual processing. Results indicate that explicit evaluative judgment has an impact on the neural auditory-limbic connectivity during affective processing of music.
Collapse
Affiliation(s)
- Chao Liu
- Department of Electronic and Computer Engineering, Brunel University London, Uxbridge, United Kingdom
| | - Elvira Brattico
- Department of Clinical Medicine, Center for Music in the Brain, Aarhus University & Royal Academy of Music Aarhus/Aalborg, Aarhus, Denmark.,AMI Centre, School of Science, Aalto University, Espoo, Finland
| | - Basel Abu-Jamous
- Department of Electronic and Computer Engineering, Brunel University London, Uxbridge, United Kingdom
| | | | - Thomas Jacobsen
- Experimental Psychology Unit, Helmut Schmidt University, University of Federal Armed Forces, Hamburg, Germany
| | - Asoke K Nandi
- Department of Electronic and Computer Engineering, Brunel University London, Uxbridge, United Kingdom.,The Key Laboratory of Embedded Systems and Service Computing, College of Electronic and Information Engineering, Tongji University, Shanghai, China
| |
Collapse
|
21
|
Alluri V, Toiviainen P, Burunat I, Kliuchko M, Vuust P, Brattico E. Connectivity patterns during music listening: Evidence for action-based processing in musicians. Hum Brain Mapp 2017; 38:2955-2970. [PMID: 28349620 DOI: 10.1002/hbm.23565] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2016] [Revised: 02/23/2017] [Accepted: 03/02/2017] [Indexed: 12/13/2022] Open
Abstract
Musical expertise is visible both in the morphology and functionality of the brain. Recent research indicates that functional integration between multi-sensory, somato-motor, default-mode (DMN), and salience (SN) networks of the brain differentiates musicians from non-musicians during resting state. Here, we aimed at determining whether brain networks differentially exchange information in musicians as opposed to non-musicians during naturalistic music listening. Whole-brain graph-theory analyses were performed on participants' fMRI responses. Group-level differences revealed that musicians' primary hubs comprised cerebral and cerebellar sensorimotor regions whereas non-musicians' dominant hubs encompassed DMN-related regions. Community structure analyses of the key hubs revealed greater integration of motor and somatosensory homunculi representing the upper limbs and torso in musicians. Furthermore, musicians who started training at an earlier age exhibited greater centrality in the auditory cortex, and areas related to top-down processes, attention, emotion, somatosensory processing, and non-verbal processing of speech. We here reveal how brain networks organize themselves in a naturalistic music listening situation wherein musicians automatically engage neural networks that are action-based while non-musicians use those that are perception-based to process an incoming auditory stream. Hum Brain Mapp 38:2955-2970, 2017. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Vinoo Alluri
- Department of Music, University of Jyväskylä, Jyväskylä, Finland
| | - Petri Toiviainen
- Department of Music, University of Jyväskylä, Jyväskylä, Finland
| | - Iballa Burunat
- Department of Music, University of Jyväskylä, Jyväskylä, Finland
| | - Marina Kliuchko
- Cognitive Brain Research Unit, Institute of Behavioural Sciences, University of Helsinki, Helsinki, Finland
| | - Peter Vuust
- Center for Music in the Brain (MIB), Department of Clinical Medicine, Aarhus University & Royal Academy of Music Aarhus/Aalborg, Denmark
| | - Elvira Brattico
- Center for Music in the Brain (MIB), Department of Clinical Medicine, Aarhus University & Royal Academy of Music Aarhus/Aalborg, Denmark.,Advanced Magnetic Imaging (AMI) Centre, Aalto University School of Science, Espoo, Finland
| |
Collapse
|
22
|
Goulas A, Stiers P, Hutchison RM, Everling S, Petrides M, Margulies DS. Intrinsic functional architecture of the macaque dorsal and ventral lateral frontal cortex. J Neurophysiol 2017; 117:1084-1099. [PMID: 28003408 PMCID: PMC5340881 DOI: 10.1152/jn.00486.2016] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Accepted: 11/17/2016] [Indexed: 11/22/2022] Open
Abstract
Investigations of the cellular and connectional organization of the lateral frontal cortex (LFC) of the macaque monkey provide indispensable knowledge for generating hypotheses about the human LFC. However, despite numerous investigations, there are still debates on the organization of this brain region. In vivo neuroimaging techniques such as resting-state functional magnetic resonance imaging (fMRI) can be used to define the functional circuitry of brain areas, producing results largely consistent with gold-standard invasive tract-tracing techniques and offering the opportunity for cross-species comparisons within the same modality. Our results using resting-state fMRI from macaque monkeys to uncover the intrinsic functional architecture of the LFC corroborate previous findings and inform current debates. Specifically, within the dorsal LFC, we show that 1) the region along the midline and anterior to the superior arcuate sulcus is divided in two areas separated by the posterior supraprincipal dimple, 2) the cytoarchitectonically defined area 6DC/F2 contains two connectional divisions, and 3) a distinct area occupies the cortex around the spur of the arcuate sulcus, updating what was previously proposed to be the border between dorsal and ventral motor/premotor areas. Within the ventral LFC, the derived parcellation clearly suggests the presence of distinct areas: 1) an area with a somatomotor/orofacial connectional signature (putative area 44), 2) an area with an oculomotor connectional signature (putative frontal eye fields), and 3) premotor areas possibly hosting laryngeal and arm representations. Our results illustrate in detail the intrinsic functional architecture of the macaque LFC, thus providing valuable evidence for debates on its organization.NEW & NOTEWORTHY Resting-state functional MRI is used as a complementary method to invasive techniques to inform current debates on the organization of the macaque lateral frontal cortex. Given that the macaque cortex serves as a model for the human cortex, our results help generate more fine-tuned hypothesis for the organization of the human lateral frontal cortex.
Collapse
Affiliation(s)
- Alexandros Goulas
- Max Planck Research Group Neuroanatomy and Connectivity, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany;
| | - Peter Stiers
- Faculty of Psychology and Neuroscience, Department of Neuropsychology and Psychopharmacology, Maastricht University, Maastricht, The Netherlands
| | | | - Stefan Everling
- Robarts Research Institute, University of Western Ontario, London, Ontario, Canada; and
| | - Michael Petrides
- Cognitive Neuroscience Unit, Montreal Neurological Institute, McGill University, Montreal, Quebec, Canada
| | - Daniel S Margulies
- Max Planck Research Group Neuroanatomy and Connectivity, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| |
Collapse
|
23
|
Abstract
Recent work in the sociology of music suggests a declining importance of genre categories. Yet other work in this research stream and in the sociology of classification argues for the continued prevalence of genres as a meaningful tool through which creators, critics and consumers focus their attention in the topology of available works. Building from work in the study of categories and categorization we examine how boundary strength and internal differentiation structure the genre pairings of some 3 million musicians and groups. Using a range of network-based and statistical techniques, we uncover three musical “complexes,” which are collectively constituted by 16 smaller genre communities. Our analysis shows that the musical universe is not monolithically organized but rather composed of multiple worlds that are differently structured—i.e., uncentered, single-centered, and multi-centered.
Collapse
Affiliation(s)
- Daniel Silver
- Department of Sociology, University of Toronto Scarborough, Toronto, Ontario, Canada
- * E-mail:
| | - Monica Lee
- Department of Sociology, University of Chicago, Chicago, Illinois, United States of America
| | - C. Clayton Childress
- Department of Sociology, University of Toronto Scarborough, Toronto, Ontario, Canada
| |
Collapse
|
24
|
Ma Y, Hu J, Zhang N, Dong X, Li Y, Yang B, Tian W, Wang X. Prediction of Candidate Drugs for Treating Pancreatic Cancer by Using a Combined Approach. PLoS One 2016; 11:e0149896. [PMID: 26910401 PMCID: PMC4765895 DOI: 10.1371/journal.pone.0149896] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Accepted: 02/05/2016] [Indexed: 01/15/2023] Open
Abstract
Pancreatic cancer is the leading cause of death from solid malignancies worldwide. Currently, gemcitabine is the only drug approved for treating pancreatic cancer. Developing new therapeutic drugs for this disease is, therefore, an urgent need. The C-Map project has provided a wealth of gene expression data that can be mined for repositioning drugs, a promising approach to new drug discovery. Typically, a drug is considered potentially useful for treating a disease if the drug-induced differential gene expression profile is negatively correlated with the differentially expressed genes in the target disease. However, many of the potentially useful drugs (PUDs) identified by gene expression profile correlation are likely false positives because, in C-Map, the cultured cell lines to which the drug is applied are not derived from diseased tissues. To solve this problem, we developed a combined approach for predicting candidate drugs for treating pancreatic cancer. We first identified PUDs for pancreatic cancer by using C-Map-based gene expression correlation analyses. We then applied an algorithm (Met-express) to predict key pancreatic cancer (KPC) enzymes involved in pancreatic cancer metabolism. Finally, we selected candidates from the PUDs by requiring that their targets be KPC enzymes or the substrates/products of KPC enzymes. Using this combined approach, we predicted seven candidate drugs for treating pancreatic cancer, three of which are supported by literature evidence, and three were experimentally validated to be inhibitory to pancreatic cancer celllines.
Collapse
Affiliation(s)
- Yanfen Ma
- Department of Clinical Laboratory, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi province, P.R. China
- Health Science Center of Xi'an Jiaotong University, Xi'an, Shaanxi province, P.R. China
| | - Jian Hu
- Department of Clinical Laboratory, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi province, P.R. China
| | - Ning Zhang
- Department of Clinical Laboratory, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi province, P.R. China
- Health Science Center of Xi'an Jiaotong University, Xi'an, Shaanxi province, P.R. China
| | - Xinran Dong
- Department of Biostatistics and Computational Biology, School of Life Science, Fudan University, Shanghai, China
| | - Ying Li
- Health Science Center of Xi'an Jiaotong University, Xi'an, Shaanxi province, P.R. China
- SHAANXI Kang Fu Hospital, Xi'an, Shaanxi province, P.R. China
| | - Bo Yang
- Department of Clinical Laboratory, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi province, P.R. China
| | - Weidong Tian
- Department of Biostatistics and Computational Biology, School of Life Science, Fudan University, Shanghai, China
| | - Xiaoqin Wang
- Department of Clinical Laboratory, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi province, P.R. China
| |
Collapse
|
25
|
Al-Harazi O, Al Insaif S, Al-Ajlan MA, Kaya N, Dzimiri N, Colak D. Integrated Genomic and Network-Based Analyses of Complex Diseases and Human Disease Network. J Genet Genomics 2015; 43:349-67. [PMID: 27318646 DOI: 10.1016/j.jgg.2015.11.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2015] [Revised: 10/22/2015] [Accepted: 11/20/2015] [Indexed: 12/16/2022]
Abstract
A disease phenotype generally reflects various pathobiological processes that interact in a complex network. The highly interconnected nature of the human protein interaction network (interactome) indicates that, at the molecular level, it is difficult to consider diseases as being independent of one another. Recently, genome-wide molecular measurements, data mining and bioinformatics approaches have provided the means to explore human diseases from a molecular basis. The exploration of diseases and a system of disease relationships based on the integration of genome-wide molecular data with the human interactome could offer a powerful perspective for understanding the molecular architecture of diseases. Recently, subnetwork markers have proven to be more robust and reliable than individual biomarker genes selected based on gene expression profiles alone, and achieve higher accuracy in disease classification. We have applied one of these methodologies to idiopathic dilated cardiomyopathy (IDCM) data that we have generated using a microarray and identified significant subnetworks associated with the disease. In this paper, we review the recent endeavours in this direction, and summarize the existing methodologies and computational tools for network-based analysis of complex diseases and molecular relationships among apparently different disorders and human disease network. We also discuss the future research trends and topics of this promising field.
Collapse
Affiliation(s)
- Olfat Al-Harazi
- Department of Biostatistics, Epidemiology and Scientific Computing, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia
| | - Sadiq Al Insaif
- Department of Biostatistics, Epidemiology and Scientific Computing, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia
| | - Monirah A Al-Ajlan
- Department of Biostatistics, Epidemiology and Scientific Computing, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia; College of Computer and Information Sciences, King Saud University, Riyadh 11451, Saudi Arabia
| | - Namik Kaya
- Department of Genetics, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia
| | - Nduna Dzimiri
- Department of Genetics, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia
| | - Dilek Colak
- Department of Biostatistics, Epidemiology and Scientific Computing, King Faisal Specialist Hospital and Research Centre, Riyadh 11211, Saudi Arabia.
| |
Collapse
|
26
|
Quantitative assessment of gene expression network module-validation methods. Sci Rep 2015; 5:15258. [PMID: 26470848 PMCID: PMC4607977 DOI: 10.1038/srep15258] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2015] [Accepted: 09/21/2015] [Indexed: 02/01/2023] Open
Abstract
Validation of pluripotent modules in diverse networks holds enormous potential for systems biology and network pharmacology. An arising challenge is how to assess the accuracy of discovering all potential modules from multi-omic networks and validating their architectural characteristics based on innovative computational methods beyond function enrichment and biological validation. To display the framework progress in this domain, we systematically divided the existing Computational Validation Approaches based on Modular Architecture (CVAMA) into topology-based approaches (TBA) and statistics-based approaches (SBA). We compared the available module validation methods based on 11 gene expression datasets, and partially consistent results in the form of homogeneous models were obtained with each individual approach, whereas discrepant contradictory results were found between TBA and SBA. The TBA of the Zsummary value had a higher Validation Success Ratio (VSR) (51%) and a higher Fluctuation Ratio (FR) (80.92%), whereas the SBA of the approximately unbiased (AU) p-value had a lower VSR (12.3%) and a lower FR (45.84%). The Gray area simulated study revealed a consistent result for these two models and indicated a lower Variation Ratio (VR) (8.10%) of TBA at 6 simulated levels. Despite facing many novel challenges and evidence limitations, CVAMA may offer novel insights into modular networks.
Collapse
|
27
|
Modular Patterns of Phase Desynchronization Networks During a Simple Visuomotor Task. Brain Topogr 2015; 29:118-29. [DOI: 10.1007/s10548-015-0451-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Accepted: 09/07/2015] [Indexed: 10/23/2022]
|
28
|
Alavash M, Hilgetag CC, Thiel CM, Gießing C. Persistency and flexibility of complex brain networks underlie dual-task interference. Hum Brain Mapp 2015; 36:3542-62. [PMID: 26095953 PMCID: PMC6869626 DOI: 10.1002/hbm.22861] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2015] [Revised: 04/27/2015] [Accepted: 05/19/2015] [Indexed: 12/29/2022] Open
Abstract
Previous studies on multitasking suggest that performance decline during concurrent task processing arises from interfering brain modules. Here, we used graph-theoretical network analysis to define functional brain modules and relate the modular organization of complex brain networks to behavioral dual-task costs. Based on resting-state and task fMRI we explored two organizational aspects potentially associated with behavioral interference when human subjects performed a visuospatial and speech task simultaneously: the topological overlap between persistent single-task modules, and the flexibility of single-task modules in adaptation to the dual-task condition. Participants showed a significant decline in visuospatial accuracy in the dual-task compared with single visuospatial task. Global analysis of topological similarity between modules revealed that the overlap between single-task modules significantly correlated with the decline in visuospatial accuracy. Subjects with larger overlap between single-task modules showed higher behavioral interference. Furthermore, lower flexible reconfiguration of single-task modules in adaptation to the dual-task condition significantly correlated with larger decline in visuospatial accuracy. Subjects with lower modular flexibility showed higher behavioral interference. At the regional level, higher overlap between single-task modules and less modular flexibility in the somatomotor cortex positively correlated with the decline in visuospatial accuracy. Additionally, higher modular flexibility in cingulate and frontal control areas and lower flexibility in right-lateralized nodes comprising the middle occipital and superior temporal gyri supported dual-tasking. Our results suggest that persistency and flexibility of brain modules are important determinants of dual-task costs. We conclude that efficient dual-tasking benefits from a specific balance between flexibility and rigidity of functional brain modules.
Collapse
Affiliation(s)
- Mohsen Alavash
- Department of Psychology, Biological Psychology LabEuropean Medical School, Carl von Ossietzky Universität Oldenburg26111OldenburgGermany
| | - Claus C. Hilgetag
- Department of Computational NeuroscienceUniversity Medical Center Hamburg‐Eppendorf20246HamburgGermany
- Department of Health SciencesBoston UniversityBostonMassachusetts02215
| | - Christiane M. Thiel
- Department of Psychology, Biological Psychology LabEuropean Medical School, Carl von Ossietzky Universität Oldenburg26111OldenburgGermany
- Research Center Neurosensory ScienceCarl von Ossietzky Universität Oldenburg26111OldenburgGermany
| | - Carsten Gießing
- Department of Psychology, Biological Psychology LabEuropean Medical School, Carl von Ossietzky Universität Oldenburg26111OldenburgGermany
- Research Center Neurosensory ScienceCarl von Ossietzky Universität Oldenburg26111OldenburgGermany
| |
Collapse
|
29
|
Wang J, Zhong J, Chen G, Li M, Wu FX, Pan Y. ClusterViz: A Cytoscape APP for Cluster Analysis of Biological Network. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:815-822. [PMID: 26357321 DOI: 10.1109/tcbb.2014.2361348] [Citation(s) in RCA: 92] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Cluster analysis of biological networks is one of the most important approaches for identifying functional modules and predicting protein functions. Furthermore, visualization of clustering results is crucial to uncover the structure of biological networks. In this paper, ClusterViz, an APP of Cytoscape 3 for cluster analysis and visualization, has been developed. In order to reduce complexity and enable extendibility for ClusterViz, we designed the architecture of ClusterViz based on the framework of Open Services Gateway Initiative. According to the architecture, the implementation of ClusterViz is partitioned into three modules including interface of ClusterViz, clustering algorithms and visualization and export. ClusterViz fascinates the comparison of the results of different algorithms to do further related analysis. Three commonly used clustering algorithms, FAG-EC, EAGLE and MCODE, are included in the current version. Due to adopting the abstract interface of algorithms in module of the clustering algorithms, more clustering algorithms can be included for the future use. To illustrate usability of ClusterViz, we provided three examples with detailed steps from the important scientific articles, which show that our tool has helped several research teams do their research work on the mechanism of the biological networks.
Collapse
|
30
|
Telesford QK, Laurienti PJ, Davenport AT, Friedman DP, Kraft RA, Daunais JB. The effects of chronic alcohol self-administration in nonhuman primate brain networks. Alcohol Clin Exp Res 2015; 39:659-71. [PMID: 25833027 PMCID: PMC6724209 DOI: 10.1111/acer.12688] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2014] [Accepted: 01/21/2015] [Indexed: 11/28/2022]
Abstract
BACKGROUND Long-term alcohol abuse is associated with change in behavior, brain structure, and brain function. However, the nature of these changes is not well understood. In this study, we used network science to analyze a nonhuman primate model of ethanol self-administration to evaluate functional differences between animals with chronic alcohol use and animals with no exposure to alcohol. Of particular interest was how chronic alcohol exposure may affect the resting state network. METHODS Baseline resting state functional magnetic resonance imaging was acquired in a cohort of vervet monkeys. Animals underwent an induction period where they were exposed to an isocaloric maltose dextrin solution (control) or ethanol in escalating doses over three 30-day epochs. Following induction, animals were given ad libitum access to water and a maltose dextrin solution (control) or water and ethanol for 22 h/d over 12 months. Cross-sectional analyses examined region of interests in hubs and community structure across animals to determine differences between drinking and nondrinking animals after the 12-month free access period. RESULTS Animals were classified as lighter (<2.0 g/kg/d) or heavier drinkers (≥2.0 g/kg/d) based on a median split of their intake pattern during the 12-month ethanol free access period. Statistical analysis of hub connectivity showed significant differences in heavier drinkers for hubs in the precuneus, posterior parietal cortices, superior temporal gyrus, subgenual cingulate, and sensorimotor cortex. Heavier drinkers were also shown to have less consistent communities across the brain compared to lighter drinkers. The different level of consumption between the lighter and heavier drinking monkeys suggests that differences in connectivity may be intake dependent. CONCLUSIONS Animals that consume alcohol show topological differences in brain network organization, particularly in animals that drink heavily. Differences in the resting state network were linked to areas that are associated with spatial association, working memory, and visuomotor processing.
Collapse
Affiliation(s)
- Qawi K Telesford
- School of Biomedical Engineering and Sciences, Virginia Tech-Wake Forest University, Winston-Salem, North Carolina
| | | | | | | | | | | |
Collapse
|
31
|
Bennett L, Kittas A, Liu S, Papageorgiou LG, Tsoka S. Community structure detection for overlapping modules through mathematical programming in protein interaction networks. PLoS One 2014; 9:e112821. [PMID: 25412367 PMCID: PMC4239042 DOI: 10.1371/journal.pone.0112821] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2014] [Accepted: 10/15/2014] [Indexed: 12/05/2022] Open
Abstract
Community structure detection has proven to be important in revealing the underlying properties of complex networks. The standard problem, where a partition of disjoint communities is sought, has been continually adapted to offer more realistic models of interactions in these systems. Here, a two-step procedure is outlined for exploring the concept of overlapping communities. First, a hard partition is detected by employing existing methodologies. We then propose a novel mixed integer non linear programming (MINLP) model, known as OverMod, which transforms disjoint communities to overlapping. The procedure is evaluated through its application to protein-protein interaction (PPI) networks of the rat, E. coli, yeast and human organisms. Connector nodes of hard partitions exhibit topological and functional properties indicative of their suitability as candidates for multiple module membership. OverMod identifies two types of connector nodes, inter and intra-connector, each with their own particular characteristics pertaining to their topological and functional role in the organisation of the network. Inter-connector proteins are shown to be highly conserved proteins participating in pathways that control essential cellular processes, such as proliferation, differentiation and apoptosis and their differences with intra-connectors is highlighted. Many of these proteins are shown to possess multiple roles of distinct nature through their participation in different network modules, setting them apart from proteins that are simply ‘hubs’, i.e. proteins with many interaction partners but with a more specific biochemical role.
Collapse
Affiliation(s)
- Laura Bennett
- Centre for Process Systems Engineering, Department of Chemical Engineering, UCL (University College London), Torrington Place, WC1E 7JE, London, United Kingdom
| | - Aristotelis Kittas
- Department of Informatics, King's College London, Strand, WC2R 2LS, London, United Kingdom
| | - Songsong Liu
- Centre for Process Systems Engineering, Department of Chemical Engineering, UCL (University College London), Torrington Place, WC1E 7JE, London, United Kingdom
| | - Lazaros G. Papageorgiou
- Centre for Process Systems Engineering, Department of Chemical Engineering, UCL (University College London), Torrington Place, WC1E 7JE, London, United Kingdom
| | - Sophia Tsoka
- Department of Informatics, King's College London, Strand, WC2R 2LS, London, United Kingdom
- * E-mail:
| |
Collapse
|
32
|
Wilkins RW, Hodges DA, Laurienti PJ, Steen M, Burdette JH. Network science and the effects of music preference on functional brain connectivity: from Beethoven to Eminem. Sci Rep 2014; 4:6130. [PMID: 25167363 PMCID: PMC5385828 DOI: 10.1038/srep06130] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Accepted: 07/16/2014] [Indexed: 12/31/2022] Open
Abstract
Most people choose to listen to music that they prefer or 'like' such as classical, country or rock. Previous research has focused on how different characteristics of music (i.e., classical versus country) affect the brain. Yet, when listening to preferred music--regardless of the type--people report they often experience personal thoughts and memories. To date, understanding how this occurs in the brain has remained elusive. Using network science methods, we evaluated differences in functional brain connectivity when individuals listened to complete songs. We show that a circuit important for internally-focused thoughts, known as the default mode network, was most connected when listening to preferred music. We also show that listening to a favorite song alters the connectivity between auditory brain areas and the hippocampus, a region responsible for memory and social emotion consolidation. Given that musical preferences are uniquely individualized phenomena and that music can vary in acoustic complexity and the presence or absence of lyrics, the consistency of our results was unexpected. These findings may explain why comparable emotional and mental states can be experienced by people listening to music that differs as widely as Beethoven and Eminem. The neurobiological and neurorehabilitation implications of these results are discussed.
Collapse
Affiliation(s)
- R W Wilkins
- 1] Laboratory for Complex Brain Networks, Wake Forest School of Medicine, Winston-Salem, NC 27157 [2] Neuroimaging Laboratory for Complex Systems, Gateway MRI Center, Joint School for Nanoscience and Nanoengineering, University of North Carolina Greensboro, NC 27401 [3] Music Research Institute, University of North Carolina Greensboro, NC 27403.
| | - D A Hodges
- Music Research Institute, University of North Carolina Greensboro, NC 27403
| | - P J Laurienti
- Laboratory for Complex Brain Networks, Wake Forest School of Medicine, Winston-Salem, NC 27157
| | - M Steen
- Laboratory for Complex Brain Networks, Wake Forest School of Medicine, Winston-Salem, NC 27157
| | - J H Burdette
- Laboratory for Complex Brain Networks, Wake Forest School of Medicine, Winston-Salem, NC 27157
| |
Collapse
|
33
|
Liang YH, Cai B, Chen F, Wang G, Wang M, Zhong Y, Cheng ZM(M. Construction and validation of a gene co-expression network in grapevine (Vitis vinifera. L.). HORTICULTURE RESEARCH 2014; 1:14040. [PMID: 26504546 PMCID: PMC4596334 DOI: 10.1038/hortres.2014.40] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2014] [Revised: 05/20/2014] [Accepted: 06/09/2014] [Indexed: 05/04/2023]
Abstract
Gene co-expression analysis has been widely used for predicting gene functions because genes within modules of a co-expression network may be involved in similar biological processes and exhibit similar biological functions. To detect gene relationships in the grapevine genome, we constructed a grapevine gene co-expression network (GGCN) by compiling a total of 374 publically available grapevine microarray datasets. The GGCN consisted of 557 modules containing a total of 3834 nodes with 13 479 edges. The functions of the subnetwork modules were inferred by Gene ontology (GO) enrichment analysis. In 127 of the 557 modules containing two or more GO terms, 38 modules exhibited the most significantly enriched GO terms, including 'protein catabolism process', 'photosynthesis', 'cell biosynthesis process', 'biosynthesis of plant cell wall', 'stress response' and other important biological processes. The 'response to heat' GO term was highly represented in module 17, which is composed of many heat shock proteins. To further determine the potential functions of genes in module 17, we performed a Pearson correlation coefficient test, analyzed orthologous relationships with Arabidopsis genes and established gene expression correlations with real-time quantitative reverse transcriptase PCR (qRT-PCR). Our results indicated that many genes in module 17 were upregulated during the heat shock and recovery processes and downregulated in response to low temperature. Furthermore, two putative genes, Vit_07s0185g00040 and Vit_02s0025g04060, were highly expressed in response to heat shock and recovery. This study provides insight into GGCN gene modules and offers important references for gene functions and the discovery of new genes at the module level.
Collapse
Affiliation(s)
- Ying-Hai Liang
- College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
- Institute of Pomology, Academy of Agricultural Sciences of Jilin Province, Gong Zhuling 136100, China
| | - Bin Cai
- College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Fei Chen
- College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Gang Wang
- College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Min Wang
- College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Yan Zhong
- College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Zong-Ming (Max) Cheng
- College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
- Department of Plant Sciences, University of Tennessee, Knoxville 37996, TN, USA
| |
Collapse
|
34
|
Pan G, Zhang W, Wu Z, Li S. Online community detection for large complex networks. PLoS One 2014; 9:e102799. [PMID: 25061683 PMCID: PMC4111306 DOI: 10.1371/journal.pone.0102799] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2013] [Accepted: 06/23/2014] [Indexed: 11/22/2022] Open
Abstract
Complex networks describe a wide range of systems in nature and society. To understand complex networks, it is crucial to investigate their community structure. In this paper, we develop an online community detection algorithm with linear time complexity for large complex networks. Our algorithm processes a network edge by edge in the order that the network is fed to the algorithm. If a new edge is added, it just updates the existing community structure in constant time, and does not need to re-compute the whole network. Therefore, it can efficiently process large networks in real time. Our algorithm optimizes expected modularity instead of modularity at each step to avoid poor performance. The experiments are carried out using 11 public data sets, and are measured by two criteria, modularity and NMI (Normalized Mutual Information). The results show that our algorithm's running time is less than the commonly used Louvain algorithm while it gives competitive performance.
Collapse
Affiliation(s)
- Gang Pan
- Department of Computer Science, Zhejiang University, Hangzhou, Zhejiang, China
- * E-mail:
| | - Wangsheng Zhang
- Department of Computer Science, Zhejiang University, Hangzhou, Zhejiang, China
| | - Zhaohui Wu
- Department of Computer Science, Zhejiang University, Hangzhou, Zhejiang, China
| | - Shijian Li
- Department of Computer Science, Zhejiang University, Hangzhou, Zhejiang, China
| |
Collapse
|
35
|
Batson G, Migliarese SJ, Soriano C, H. Burdette J, Laurienti PJ. Effects of Improvisational Dance on Balance in Parkinson's Disease: A Two-Phase fMRI Case Study. PHYSICAL & OCCUPATIONAL THERAPY IN GERIATRICS 2014. [DOI: 10.3109/02703181.2014.927946] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
36
|
Tully JP, Hill AE, Ahmed HMR, Whitley R, Skjellum A, Mukhtar MS. Expression-based network biology identifies immune-related functional modules involved in plant defense. BMC Genomics 2014; 15:421. [PMID: 24888606 PMCID: PMC4070563 DOI: 10.1186/1471-2164-15-421] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2013] [Accepted: 05/27/2014] [Indexed: 01/12/2023] Open
Abstract
Background Plants respond to diverse environmental cues including microbial perturbations by coordinated regulation of thousands of genes. These intricate transcriptional regulatory interactions depend on the recognition of specific promoter sequences by regulatory transcription factors. The combinatorial and cooperative action of multiple transcription factors defines a regulatory network that enables plant cells to respond to distinct biological signals. The identification of immune-related modules in large-scale transcriptional regulatory networks can reveal the mechanisms by which exposure to a pathogen elicits a precise phenotypic immune response. Results We have generated a large-scale immune co-expression network using a comprehensive set of Arabidopsis thaliana (hereafter Arabidopsis) transcriptomic data, which consists of a wide spectrum of immune responses to pathogens or pathogen-mimicking stimuli treatments. We employed both linear and non-linear models to generate Arabidopsis immune co-expression regulatory (AICR) network. We computed network topological properties and ascertained that this newly constructed immune network is densely connected, possesses hubs, exhibits high modularity, and displays hallmarks of a “real” biological network. We partitioned the network and identified 156 novel modules related to immune functions. Gene Ontology (GO) enrichment analyses provided insight into the key biological processes involved in determining finely tuned immune responses. We also developed novel software called OCCEAN (One Click Cis-regulatory Elements ANalysis) to discover statistically enriched promoter elements in the upstream regulatory regions of Arabidopsis at a whole genome level. We demonstrated that OCCEAN exhibits higher precision than the existing promoter element discovery tools. In light of known and newly discovered cis-regulatory elements, we evaluated biological significance of two key immune-related functional modules and proposed mechanism(s) to explain how large sets of diverse GO genes coherently function to mount effective immune responses. Conclusions We used a network-based, top-down approach to discover immune-related modules from transcriptomic data in Arabidopsis. Detailed analyses of these functional modules reveal new insight into the topological properties of immune co-expression networks and a comprehensive understanding of multifaceted plant defense responses. We present evidence that our newly developed software, OCCEAN, could become a popular tool for the Arabidopsis research community as well as potentially expand to analyze other eukaryotic genomes. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-421) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | - M Shahid Mukhtar
- Department of Biology, University of Alabama at Birmingham, Birmingham, AL, 35294-1170, USA.
| |
Collapse
|
37
|
Moussa MN, Wesley MJ, Porrino LJ, Hayasaka S, Bechara A, Burdette JH, Laurienti PJ. Age-related differences in advantageous decision making are associated with distinct differences in functional community structure. Brain Connect 2014; 4:193-202. [PMID: 24575804 DOI: 10.1089/brain.2013.0184] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Human decision making is dependent on not only the function of several brain regions but also their synergistic interaction. The specific function of brain areas within the ventromedial prefrontal cortex has long been studied in an effort to understand choice evaluation and decision making. These data specifically focus on whole-brain functional interconnectivity using the principles of network science. The Iowa Gambling Task (IGT) was the first neuropsychological task used to model real-life decisions in a way that factors reward, punishment, and uncertainty. Clinically, it has been used to detect decision-making impairments characteristic of patients with prefrontal cortex lesions. Here, we used performance on repeated blocks of the IGT as a behavioral measure of advantageous and disadvantageous decision making in young and mature adults. Both adult groups performed poorly by predominately making disadvantageous selections in the beginning stages of the task. In later phases of the task, young adults shifted to more advantageous selections and outperformed mature adults. Modularity analysis revealed stark underlying differences in visual, sensorimotor and medial prefrontal cortex community structure. In addition, changes in orbitofrontal cortex connectivity predicted behavioral deficits in IGT performance. Contrasts were driven by a difference in age but may also prove relevant to neuropsychiatric disorders associated with poor decision making, including the vulnerability to alcohol and/or drug addiction.
Collapse
Affiliation(s)
- Malaak Nasser Moussa
- 1 Laboratory for Complex Brain Networks, Wake Forest University School of Medicine , Winston-Salem, North Carolina
| | | | | | | | | | | | | |
Collapse
|
38
|
Survey of network-based approaches to research of cardiovascular diseases. BIOMED RESEARCH INTERNATIONAL 2014; 2014:527029. [PMID: 24772427 PMCID: PMC3977459 DOI: 10.1155/2014/527029] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/26/2013] [Accepted: 02/07/2014] [Indexed: 01/08/2023]
Abstract
Cardiovascular diseases (CVDs) are the leading health problem worldwide. Investigating causes and mechanisms of CVDs calls for an integrative approach that would take into account its complex etiology. Biological networks generated from available data on biomolecular interactions are an excellent platform for understanding interconnectedness of all processes within a living cell, including processes that underlie diseases. Consequently, topology of biological networks has successfully been used for identifying genes, pathways, and modules that govern molecular actions underlying various complex diseases. Here, we review approaches that explore and use relationships between topological properties of biological networks and mechanisms underlying CVDs.
Collapse
|
39
|
Palsetia D, Patwary MMA, Agrawal A, Choudhary A. Excavating social circles via user interests. SOCIAL NETWORK ANALYSIS AND MINING 2014. [DOI: 10.1007/s13278-014-0170-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
40
|
Pizzuti C, Rombo SE. Algorithms and tools for protein-protein interaction networks clustering, with a special focus on population-based stochastic methods. ACTA ACUST UNITED AC 2014; 30:1343-52. [PMID: 24458952 DOI: 10.1093/bioinformatics/btu034] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION Protein-protein interaction (PPI) networks are powerful models to represent the pairwise protein interactions of the organisms. Clustering PPI networks can be useful for isolating groups of interacting proteins that participate in the same biological processes or that perform together specific biological functions. Evolutionary orthologies can be inferred this way, as well as functions and properties of yet uncharacterized proteins. RESULTS We present an overview of the main state-of-the-art clustering methods that have been applied to PPI networks over the past decade. We distinguish five specific categories of approaches, describe and compare their main features and then focus on one of them, i.e. population-based stochastic search. We provide an experimental evaluation, based on some validation measures widely used in the literature, of techniques in this class, that are as yet less explored than the others. In particular, we study how the capability of Genetic Algorithms (GAs) to extract clusters in PPI networks varies when different topology-based fitness functions are used, and we compare GAs with the main techniques in the other categories. The experimental campaign shows that predictions returned by GAs are often more accurate than those produced by the contestant methods. Interesting issues still remain open about possible generalizations of GAs allowing for cluster overlapping. AVAILABILITY AND IMPLEMENTATION We point out which methods and tools described here are publicly available. CONTACT simona.rombo@math.unipa.it SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Clara Pizzuti
- Institute for High Performance Computing and Networking (ICAR), National Research Council of Italy (CNR), Via P. Bucci 41C, 87036 Rende (CS) and Department of Mathematics and Computer Science, University of Palermo, Via Archirafi 34, 90123 Palermo (PA), Italy
| | | |
Collapse
|
41
|
Hayasaka S. Functional connectivity networks with and without global signal correction. Front Hum Neurosci 2013; 7:880. [PMID: 24385961 PMCID: PMC3866385 DOI: 10.3389/fnhum.2013.00880] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2013] [Accepted: 12/03/2013] [Indexed: 11/17/2022] Open
Abstract
In functional connectivity analyses in BOLD (blood oxygenation level dependent) fMRI data, there is an ongoing debate on whether to correct global signals in fMRI time series data. Although the discussion has been ongoing in the fMRI community since the early days of fMRI data analyses, this subject has gained renewed attention in recent years due to the surging popularity of functional connectivity analyses, in particular graph theory-based network analyses. However, the impact of correcting (or not correcting) for global signals has not been systematically characterized in the context of network analyses. Thus, in this work, I examined the effect of global signal correction on an fMRI network analysis. In particular, voxel-based resting-state fMRI networks were constructed with and without global signal correction. The resulting functional connectivity networks were compared. Without global signal correction, the distributions of the correlation coefficients were positively biased. I also found that, without global signal correction, nodes along the interhemisphic fissure were highly connected whereas some nodes and subgraphs around white-matter tracts became disconnected from the rest of the network. These results from this study show differences between the networks with or without global signal correction.
Collapse
Affiliation(s)
- Satoru Hayasaka
- Department of Biostatistical Sciences, Wake Forest School of Medicine Winston-Salem, NC, USA ; Department of Radiology, Wake Forest School of Medicine Winston-Salem, NC, USA
| |
Collapse
|
42
|
Lei C, Tamim S, Bishop AJR, Ruan J. Fully automated protein complex prediction based on topological similarity and community structure. Proteome Sci 2013; 11:S9. [PMID: 24564887 PMCID: PMC3908383 DOI: 10.1186/1477-5956-11-s1-s9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
To understand the function of protein complexes and their association with biological processes, a lot of studies have been done towards analyzing the protein-protein interaction (PPI) networks. However, the advancement in high-throughput technology has resulted in a humongous amount of data for analysis. Moreover, high level of noise, sparseness, and skewness in degree distribution of PPI networks limits the performance of many clustering algorithms and further analysis of their interactions.
Collapse
Affiliation(s)
- Chengwei Lei
- Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Saleh Tamim
- Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Alexander JR Bishop
- Greehey Children's Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
- Department of Cellular and Structural Biology, The University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| | - Jianhua Ruan
- Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX 78249, USA
| |
Collapse
|
43
|
Hofree M, Shen JP, Carter H, Gross A, Ideker T. Network-based stratification of tumor mutations. Nat Methods 2013; 10:1108-15. [PMID: 24037242 PMCID: PMC3866081 DOI: 10.1038/nmeth.2651] [Citation(s) in RCA: 530] [Impact Index Per Article: 44.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2013] [Accepted: 08/12/2013] [Indexed: 12/30/2022]
Abstract
Many forms of cancer have multiple subtypes with different causes and clinical outcomes. Somatic tumor genome sequences provide a rich new source of data for uncovering these subtypes but have proven difficult to compare, as two tumors rarely share the same mutations. Here we introduce network-based stratification (NBS), a method to integrate somatic tumor genomes with gene networks. This approach allows for stratification of cancer into informative subtypes by clustering together patients with mutations in similar network regions. We demonstrate NBS in ovarian, uterine and lung cancer cohorts from The Cancer Genome Atlas. For each tissue, NBS identifies subtypes that are predictive of clinical outcomes such as patient survival, response to therapy or tumor histology. We identify network regions characteristic of each subtype and show how mutation-derived subtypes can be used to train an mRNA expression signature, which provides similar information in the absence of DNA sequence.
Collapse
Affiliation(s)
- Matan Hofree
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, California USA
| | - John P Shen
- Department of Medicine, University of California, San Diego, La Jolla, California USA
| | - Hannah Carter
- Department of Medicine, University of California, San Diego, La Jolla, California USA
| | - Andrew Gross
- Department of Bioengineering, University of California, San Diego, La Jolla, California USA
| | - Trey Ideker
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, California USA
- Department of Medicine, University of California, San Diego, La Jolla, California USA
- Department of Bioengineering, University of California, San Diego, La Jolla, California USA
| |
Collapse
|
44
|
Telesford QK, Laurienti PJ, Friedman DP, Kraft RA, Daunais JB. The effects of alcohol on the nonhuman primate brain: a network science approach to neuroimaging. Alcohol Clin Exp Res 2013; 37:1891-900. [PMID: 23905720 PMCID: PMC3812370 DOI: 10.1111/acer.12181] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2012] [Accepted: 04/06/2013] [Indexed: 11/26/2022]
Abstract
BACKGROUND Animal studies have long been an important tool for basic research as they offer a degree of control often lacking in clinical studies. Of particular value is the use of nonhuman primates (NHPs) for neuroimaging studies. Currently, studies have been published using functional magnetic resonance imaging (fMRI) to understand the default-mode network in the NHP brain. Network science provides an alternative approach to neuroimaging allowing for evaluation of whole-brain connectivity. In this study, we used network science to build NHP brain networks from fMRI data to understand the basic functional organization of the NHP brain. We also explored how the brain network is affected following an acute ethanol (EtOH) pharmacological challenge. METHODS Baseline resting-state fMRI was acquired in an adult male rhesus macaque (n = 1) and a cohort of vervet monkeys (n = 10). A follow-up scan was conducted in the rhesus macaque to assess network variability and to assess the effects of an acute EtOH challenge on the brain network. RESULTS The most connected regions in the resting-state networks were similar across species and matched regions identified as the default-mode network in previous NHP fMRI studies. Under an acute EtOH challenge, the functional organization of the brain was significantly impacted. CONCLUSIONS Network science offers a great opportunity to understand the brain as a complex system and how pharmacological conditions can affect the system globally. These models are sensitive to changes in the brain and may prove to be a valuable tool in long-term studies on alcohol exposure.
Collapse
Affiliation(s)
- Qawi K Telesford
- School of Biomedical Engineering and Sciences , Virginia Tech-Wake Forest University, Winston-Salem, North Carolina
| | | | | | | | | |
Collapse
|
45
|
Zhang S, Zhao H. Normalized modularity optimization method for community identification with degree adjustment. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 88:052802. [PMID: 24329313 DOI: 10.1103/physreve.88.052802] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2013] [Revised: 07/11/2013] [Indexed: 06/03/2023]
Abstract
As a fundamental problem in network study, community identification has attracted much attention from different fields. Representing a seminal work in this area, the modularity optimization method has been widely applied and studied. However, this method has issues in resolution limit and extreme degeneracy and may not perform well for networks with unbalanced structures. Although several methods have been proposed to overcome these limitations, they are all based on the original idea of defining modularity through comparing the total number of edges within the putative communities in the observed network with that in an equivalent randomly generated network. In this paper, we show that this modularity definition is not suitable to analyze some networks such as those with unbalanced structures. Instead, we propose to define modularity through the average degree within the communities and formulate modularity as comparing the sum of average degree within communities of the observed network to that of an equivalent randomly generated network. In addition, we also propose a degree-adjusted approach for further improvement when there are unbalanced structures. We analyze the theoretical properties of our degree adjusted method. Numerical experiments for both artificial networks and real networks demonstrate that average degree plays an important role in network community identification, and our proposed methods have better performance than existing ones.
Collapse
Affiliation(s)
- Shuqin Zhang
- Center for Computational Systems Biology, School of Mathematical Sciences, Fudan University, Shanghai 200433, China
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut 06520, USA
| |
Collapse
|
46
|
Ghasemi M, Rahgozar M, Bidkhori G, Masoudi-Nejad A. C-element: a new clustering algorithm to find high quality functional modules in PPI networks. PLoS One 2013; 8:e72366. [PMID: 24039752 PMCID: PMC3764100 DOI: 10.1371/journal.pone.0072366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2013] [Accepted: 07/16/2013] [Indexed: 11/18/2022] Open
Abstract
Graph clustering algorithms are widely used in the analysis of biological networks. Extracting functional modules in protein-protein interaction (PPI) networks is one such use. Most clustering algorithms whose focuses are on finding functional modules try either to find a clique like sub networks or to grow clusters starting from vertices with high degrees as seeds. These algorithms do not make any difference between a biological network and any other networks. In the current research, we present a new procedure to find functional modules in PPI networks. Our main idea is to model a biological concept and to use this concept for finding good functional modules in PPI networks. In order to evaluate the quality of the obtained clusters, we compared the results of our algorithm with those of some other widely used clustering algorithms on three high throughput PPI networks from Sacchromyces Cerevisiae, Homo sapiens and Caenorhabditis elegans as well as on some tissue specific networks. Gene Ontology (GO) analyses were used to compare the results of different algorithms. Each algorithm's result was then compared with GO-term derived functional modules. We also analyzed the effect of using tissue specific networks on the quality of the obtained clusters. The experimental results indicate that the new algorithm outperforms most of the others, and this improvement is more significant when tissue specific networks are used.
Collapse
Affiliation(s)
- Mahdieh Ghasemi
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
- Database Research Group (DBRG), Control and Intelligent Processing Center of Excellence (CIPCE), School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Maseud Rahgozar
- Database Research Group (DBRG), Control and Intelligent Processing Center of Excellence (CIPCE), School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Gholamreza Bidkhori
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Ali Masoudi-Nejad
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
- * E-mail:
| |
Collapse
|
47
|
Chen J, Ma M, Shen N, Xi JJ, Tian W. Integration of cancer gene co-expression network and metabolic network to uncover potential cancer drug targets. J Proteome Res 2013; 12:2354-64. [PMID: 23590569 DOI: 10.1021/pr400162t] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Cell metabolism is critical for cancer cell transformation and progression. In this study, we have developed a novel method, named Met-express, that integrates a cancer gene co-expression network with the metabolic network to predict key enzyme-coding genes and metabolites in cancer cell metabolism. Met-express successfully identified a group of key enzyme-coding genes and metabolites in lung, leukemia, and breast cancers. Literature reviews suggest that approximately 33-53% of the predicted genes are either known or suggested anti-cancer drug targets, while 22% of the predicted metabolites are known or high-potential drug compounds in therapeutic use. Furthermore, experimental validations prove that 90% of the selected genes and 70% of metabolites demonstrate the significant anti-cancer phenotypes in cancer cells, implying that they may play important roles in cancer metabolism. Therefore, Met-express is a powerful tool for uncovering novel therapeutic biomarkers.
Collapse
Affiliation(s)
- Jingqi Chen
- State Key Laboratory of Genetic Engineering, Department of Biostatistics and Computational Biology, School of Life Sciences, Fudan University, Shanghai, China
| | | | | | | | | |
Collapse
|
48
|
Chen Z, Zhang W. Integrative analysis using module-guided random forests reveals correlated genetic factors related to mouse weight. PLoS Comput Biol 2013; 9:e1002956. [PMID: 23505362 PMCID: PMC3591263 DOI: 10.1371/journal.pcbi.1002956] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2012] [Accepted: 01/14/2013] [Indexed: 01/07/2023] Open
Abstract
Complex traits such as obesity are manifestations of intricate interactions of multiple genetic factors. However, such relationships are difficult to identify. Thanks to the recent advance in high-throughput technology, a large amount of data has been collected for various complex traits, including obesity. These data often measure different biological aspects of the traits of interest, including genotypic variations at the DNA level and gene expression alterations at the RNA level. Integration of such heterogeneous data provides promising opportunities to understand the genetic components and possibly genetic architecture of complex traits. In this paper, we propose a machine learning based method, module-guided Random Forests (mgRF), to integrate genotypic and gene expression data to investigate genetic factors and molecular mechanism underlying complex traits. mgRF is an augmented Random Forests method enhanced by a network analysis for identifying multiple correlated variables of different types. We applied mgRF to genetic markers and gene expression data from a cohort of F2 female mouse intercross. mgRF outperformed several existing methods in our extensive comparison. Our new approach has an improved performance when combining both genotypic and gene expression data compared to using either one of the two types of data alone. The resulting predictive variables identified by mgRF provide information of perturbed pathways that are related to body weight. More importantly, the results uncovered intricate interactions among genetic markers and genes that have been overlooked if only one type of data was examined. Our results shed light on genetic mechanisms of obesity and our approach provides a promising complementary framework to the “genetics of gene expression” analysis for integrating genotypic and gene expression information for analyzing complex traits. Obesity has become a perilous global epidemic that can lead to complex diseases, such as diabetes and cardiovascular diseases. Much effort has been devoted to the studies of the genetic mechanisms that pillow the manifestation of obesity. Although a large quantity of experimental data has been accumulated lately using high-throughput techniques, our understanding of genetic mechanisms of obesity is still limited. The proposed method is motivated to address three critical issues that have impeded the existing methods. The first is the curse of dimensionality in selecting a subset of genetic elements related to the traits of interest from a large number of candidates. The second is genetic multiplicity underlying non-Mendelian traits, in which multiple genes are in interplay. The third issue is the integration of data from multiple sources in light of genetic multiplicity and curse of dimensionality. Here, we propose a new method, which augments the Random Forests method with a network-based analysis, to integrate genotypic and gene expression information and identify correlated multiple genetic elements underlying mouse weight. Our results shed light on complex genetic interactions underlying obesity, which can form viable hypotheses worthy of further investigation.
Collapse
Affiliation(s)
- Zheng Chen
- Department of Computer Science and Engineering, Washington University, St. Louis, Missouri, United States of America
| | - Weixiong Zhang
- Department of Computer Science and Engineering, Washington University, St. Louis, Missouri, United States of America
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, United States of America
- * E-mail:
| |
Collapse
|
49
|
Wang X, Yao J, Sun Y, Mai V. M-pick, a modularity-based method for OTU picking of 16S rRNA sequences. BMC Bioinformatics 2013; 14:43. [PMID: 23387433 PMCID: PMC3599145 DOI: 10.1186/1471-2105-14-43] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2012] [Accepted: 01/30/2013] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Binning 16S rRNA sequences into operational taxonomic units (OTUs) is an initial crucial step in analyzing large sequence datasets generated to determine microbial community compositions in various environments including that of the human gut. Various methods have been developed, but most suffer from either inaccuracies or from being unable to handle millions of sequences generated in current studies. Furthermore, existing binning methods usually require a priori decisions regarding binning parameters such as a distance level for defining an OTU. RESULTS We present a novel modularity-based approach (M-pick) to address the aforementioned problems. The new method utilizes ideas from community detection in graphs, where sequences are viewed as vertices on a weighted graph, each pair of sequences is connected by an imaginary edge, and the similarity of a pair of sequences represents the weight of the edge. M-pick first generates a graph based on pairwise sequence distances and then applies a modularity-based community detection technique on the graph to generate OTUs to capture the community structures in sequence data. To compare the performance of M-pick with that of existing methods, specifically CROP and ESPRIT-Tree, sequence data from different hypervariable regions of 16S rRNA were used and binning results were compared. CONCLUSIONS A new modularity-based clustering method for OTU picking of 16S rRNA sequences is developed in this study. The algorithm does not require a predetermined cut-off level, and our simulation studies suggest that it is superior to existing methods that require specified distance levels to define OTUs. The source code is available at http://plaza.ufl.edu/xywang/Mpick.htm.
Collapse
Affiliation(s)
- Xiaoyu Wang
- Department of Epidemiology, College of Public Health and Health Professions and College of Medicine, Emerging Pathogens Institute, University of Florida, Gainesville, FL 32610, USA
| | | | | | | |
Collapse
|
50
|
Gillis J, Pavlidis P. Assessing identity, redundancy and confounds in Gene Ontology annotations over time. ACTA ACUST UNITED AC 2013; 29:476-82. [PMID: 23297035 DOI: 10.1093/bioinformatics/bts727] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION The Gene Ontology (GO) is heavily used in systems biology, but the potential for redundancy, confounds with other data sources and problems with stability over time have been little explored. RESULTS We report that GO annotations are stable over short periods, with 3% of genes not being most semantically similar to themselves between monthly GO editions. However, we find that genes can alter their 'functional identity' over time, with 20% of genes not matching to themselves (by semantic similarity) after 2 years. We further find that annotation bias in GO, in which some genes are more characterized than others, has declined in yeast, but generally increased in humans. Finally, we discovered that many entries in protein interaction databases are owing to the same published reports that are used for GO annotations, with 66% of assessed GO groups exhibiting this confound. We provide a case study to illustrate how this information can be used in analyses of gene sets and networks. AVAILABILITY Data available at http://chibi.ubc.ca/assessGO.
Collapse
Affiliation(s)
- Jesse Gillis
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, 192B Genome Research Center, 500 Sunnyside Boulevard, Woodbury, NY 11797, USA
| | | |
Collapse
|