1
|
Gollapalli P, Ashok AK, G TS. System-level protein interaction network analysis and molecular dynamics study reveal interaction of ferulic acid with PTGS2 as a natural radioprotector. J Biomol Struct Dyn 2024; 42:2765-2781. [PMID: 37144749 DOI: 10.1080/07391102.2023.2208224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 04/20/2023] [Indexed: 05/06/2023]
Abstract
Ferulic acid is a crucial bioactive component of broccoli, wheat, and rice bran and is also an essential natural product that has undergone significant research. Ferulic acid's precise mode of action and effect on system-level protein networks have not been thoroughly investigated. An interactome was built using the STRING database and Cytoscape tools, utilizing 788 key proteins collected from PubMed literature to identify the ferulic acid-governed regulatory action on protein interaction network (PIN). The scale-free biological network of ferulic acid-rewired PIN is highly interconnected. We discovered 15 sub-modules using the MCODE tool for sub-modulization analysis and 153 enriched signaling pathways. Further, functional enrichment of top bottleneck proteins revealed the FoxO signaling pathway involved in enhancing cellular defense against oxidative stress. The selection of the critical regulatory proteins of the ferulic acid-rewired PIN was completed by performing analyses of topological characteristics such as GO term/pathways analysis, degree, bottleneck, molecular docking, and dynamics investigations. The current research derives a precise molecular mechanism for ferulic acid's action on the body. This in-depth in silico model would aid in understanding how ferulic acid origins its antioxidant and scavenging properties in the human body.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Pavan Gollapalli
- Center for Bioinformatics and Biostatistics, Nitte (Deemed to be University), Mangalore, Karnataka, India
| | - Avinash Karkada Ashok
- Department of Biotechnology, Siddaganga Institute of Technology, Tumakuru, Karnataka, India
| | - Tamizh Selvan G
- Central Research Laboratory, KS Hegde Medical Academy, Nitte (Deemed to be University), Mangalore, Karnataka, India
| |
Collapse
|
2
|
Voigt A, Almaas E. Complex Network Analysis in Microbial Systems: Theory and Examples. Methods Mol Biol 2022; 2349:167-191. [PMID: 34718996 DOI: 10.1007/978-1-0716-1585-0_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
A central driver for the field of systems biology is to develop an understanding of how interactions between components affect the functioning of a system as a whole. Network analysis is an approach that is uniquely suited to uncover patterns and organizing principles in a wide variety of complex systems. In this chapter, we will give a detailed description of basic concepts for characterizing empirical networks, frequently used random network models, and how to compute properties of networks using Python packages. We will demonstrate the application of network analysis by investigating several biological networks.
Collapse
Affiliation(s)
- André Voigt
- Department of Biotechnology, Norwegian University of Science & Technology, NTNU, Trondheim, Norway
| | - Eivind Almaas
- Department of Biotechnology, Norwegian University of Science & Technology, NTNU, Trondheim, Norway.
| |
Collapse
|
3
|
De la Fuente IM, Martínez L, Carrasco-Pujante J, Fedetz M, López JI, Malaina I. Self-Organization and Information Processing: From Basic Enzymatic Activities to Complex Adaptive Cellular Behavior. Front Genet 2021; 12:644615. [PMID: 34093645 PMCID: PMC8176287 DOI: 10.3389/fgene.2021.644615] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 04/16/2021] [Indexed: 11/13/2022] Open
Abstract
One of the main aims of current biology is to understand the origin of the molecular organization that underlies the complex dynamic architecture of cellular life. Here, we present an overview of the main sources of biomolecular order and complexity spanning from the most elementary levels of molecular activity to the emergence of cellular systemic behaviors. First, we have addressed the dissipative self-organization, the principal source of molecular order in the cell. Intensive studies over the last four decades have demonstrated that self-organization is central to understand enzyme activity under cellular conditions, functional coordination between enzymatic reactions, the emergence of dissipative metabolic networks (DMN), and molecular rhythms. The second fundamental source of order is molecular information processing. Studies on effective connectivity based on transfer entropy (TE) have made possible the quantification in bits of biomolecular information flows in DMN. This information processing enables efficient self-regulatory control of metabolism. As a consequence of both main sources of order, systemic functional structures emerge in the cell; in fact, quantitative analyses with DMN have revealed that the basic units of life display a global enzymatic structure that seems to be an essential characteristic of the systemic functional metabolism. This global metabolic structure has been verified experimentally in both prokaryotic and eukaryotic cells. Here, we also discuss how the study of systemic DMN, using Artificial Intelligence and advanced tools of Statistic Mechanics, has shown the emergence of Hopfield-like dynamics characterized by exhibiting associative memory. We have recently confirmed this thesis by testing associative conditioning behavior in individual amoeba cells. In these Pavlovian-like experiments, several hundreds of cells could learn new systemic migratory behaviors and remember them over long periods relative to their cell cycle, forgetting them later. Such associative process seems to correspond to an epigenetic memory. The cellular capacity of learning new adaptive systemic behaviors represents a fundamental evolutionary mechanism for cell adaptation.
Collapse
Affiliation(s)
- Ildefonso M. De la Fuente
- Department of Nutrition, CEBAS-CSIC Institute, Murcia, Spain
- Department of Mathematics, Faculty of Science and Technology, University of the Basque Country, UPV/EHU, Leioa, Spain
| | - Luis Martínez
- Department of Mathematics, Faculty of Science and Technology, University of the Basque Country, UPV/EHU, Leioa, Spain
- Basque Center of Applied Mathematics (BCAM), Bilbao, Spain
| | - Jose Carrasco-Pujante
- Department of Cell Biology and Histology, Faculty of Medicine and Nursing, University of the Basque Country, UPV/EHU, Leioa, Spain
| | - Maria Fedetz
- Department of Cell Biology and Immunology, Institute of Parasitology and Biomedicine “López-Neyra”, CSIC, Granada, Spain
| | - José I. López
- Department of Pathology, Cruces University Hospital, Biocruces-Bizkaia Health Research Institute, Barakaldo, Spain
| | - Iker Malaina
- Department of Mathematics, Faculty of Science and Technology, University of the Basque Country, UPV/EHU, Leioa, Spain
| |
Collapse
|
4
|
Wood ZT, Palkovacs EP, Olsen BJ, Kinnison MT. The Importance of Eco-evolutionary Potential in the Anthropocene. Bioscience 2021. [DOI: 10.1093/biosci/biab010] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Abstract
Humans are dominant global drivers of ecological and evolutionary change, rearranging ecosystems and natural selection. In the present article, we show increasing evidence that human activity also plays a disproportionate role in shaping the eco-evolutionary potential of systems—the likelihood of ecological change generating evolutionary change and vice versa. We suggest that the net outcome of human influences on trait change, ecology, and the feedback loops that link them will often (but not always) be to increase eco-evolutionary potential, with important consequences for stability and resilience of populations, communities, and ecosystems. We also integrate existing ecological and evolutionary metrics to predict and manage the eco-evolutionary dynamics of human-affected systems. To support this framework, we use a simple eco–evo feedback model to show that factors affecting eco-evolutionary potential are major determinants of eco-evolutionary dynamics. Our framework suggests that proper management of anthropogenic effects requires a science of human effects on eco-evolutionary potential.
Collapse
Affiliation(s)
- Zachary T Wood
- School of Biology and Ecology and with the Maine Center for Genetics in the Environment at the University of Maine, Orono, Maine, United States
| | - Eric P Palkovacs
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, California, United States
| | - Brian J Olsen
- School of Biology and Ecology and with the Maine Center for Genetics in the Environment at the University of Maine, Orono, Maine, United States
| | - Michael T Kinnison
- School of Biology and Ecology and with the Maine Center for Genetics in the Environment at the University of Maine, Orono, Maine, United States
| |
Collapse
|
5
|
Kowalski A. A survey of human histone H1 subtypes interaction networks: Implications for histones H1 functioning. Proteins 2021; 89:792-810. [PMID: 33550666 DOI: 10.1002/prot.26059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Revised: 12/23/2020] [Accepted: 01/31/2021] [Indexed: 11/08/2022]
Abstract
To show a spectrum of histone H1 subtypes (H1.1-H1.5) activity realized through the protein-protein interactions, data selected from APID resources were processed with sequence-based bioinformatics approaches. Histone H1 subtypes participate in over half a thousand interactions with nuclear and cytosolic proteins (ComPPI database) engaged in the enzymatic activity and binding of nucleic acids and proteins (SIFTER tool). Small-scale networks of H1 subtypes (STRING network) have similar topological parameters (P > .05) which are, however, different for networks hubs between subtype H1.1 and H1.4 and subtype H1.3 and H1.5 (P < .05) (Cytoscape software). Based on enriched GO terms (g:Profiler toolset) of interacting proteins, molecular function and biological process of networks hubs is related to RNA binding and ribosome biogenesis (subtype H1.1 and H1.4), cell cycle and cell division (subtype H1.3 and H1.5) and protein ubiquitination and degradation (subtype H1.2). The residue propensity (BIPSPI predictor) and secondary structures of interacting surfaces (GOR algorithm) as well as a value of equilibrium dissociation constant (ISLAND predictor) indicate that a type of H1 subtypes interactions is transient in term of the stability and medium-strong in relation to the strength of binding. Histone H1 subtypes bind interacting partners in the intrinsic disorder-dependent mode (FoldIndex, PrDOS predictor), according to the coupled folding and binding and mutual synergistic folding mechanism. These results evidence that multifunctional H1 subtypes operate via protein interactions in the networks of crucial cellular processes and, therefore, confirm a new histone H1 paradigm relating to its functioning in the protein-protein interaction networks.
Collapse
Affiliation(s)
- Andrzej Kowalski
- Division of Medical Biology, Institute of Biology, Jan Kochanowski University in Kielce, Kielce, Poland
| |
Collapse
|
6
|
Desmet S, Brouckaert M, Boerjan W, Morreel K. Seeing the forest for the trees: Retrieving plant secondary biochemical pathways from metabolome networks. Comput Struct Biotechnol J 2020; 19:72-85. [PMID: 33384856 PMCID: PMC7753198 DOI: 10.1016/j.csbj.2020.11.050] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 11/26/2020] [Accepted: 11/28/2020] [Indexed: 02/06/2023] Open
Abstract
Over the last decade, a giant leap forward has been made in resolving the main bottleneck in metabolomics, i.e., the structural characterization of the many unknowns. This has led to the next challenge in this research field: retrieving biochemical pathway information from the various types of networks that can be constructed from metabolome data. Searching putative biochemical pathways, referred to as biotransformation paths, is complicated because several flaws occur during the construction of metabolome networks. Multiple network analysis tools have been developed to deal with these flaws, while in silico retrosynthesis is appearing as an alternative approach. In this review, the different types of metabolome networks, their flaws, and the various tools to trace these biotransformation paths are discussed.
Collapse
Affiliation(s)
- Sandrien Desmet
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Marlies Brouckaert
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Wout Boerjan
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Kris Morreel
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| |
Collapse
|
7
|
Polak D, Sanui T, Nishimura F, Shapira L. Diabetes as a risk factor for periodontal disease-plausible mechanisms. Periodontol 2000 2020; 83:46-58. [PMID: 32385872 DOI: 10.1111/prd.12298] [Citation(s) in RCA: 81] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The present narrative review examines the scientific evidence of the biological mechanisms that may link periodontitis and diabetes, as a source of comorbidity. Publications regarding periodontitis and diabetes, in human, animals, and in vitro were screened for their relevance. Periodontal microbiome studies indicate a possible association between altered glucose metabolism in prediabetes and diabetes and changes in the periodontal microbiome. Coinciding with this, hyperglycemia enhances expression of pathogen receptors, which enhance host response to the dysbiotic microbiome. Hyperglycemia also promotes pro-inflammatory response independently or via the advanced glycation end product/receptor for advanced glycation end product pathway. These processes excite cellular tissue destruction functions, which further enhance pro-inflammatory cytokines expression and alteration in the RANKL/osteoprotegerin ratio, promoting formation and activation of osteoclasts. The evidence supports the role of several pathogenic mechanisms in the path of true causal comorbidity between poorly controlled diabetes and periodontitis. However, further research is needed to better understand these mechanisms and to explore other mechanisms.
Collapse
Affiliation(s)
- David Polak
- Department of Periodontology, Hebrew University-Hadassah Faculty of Dental Medicine, Jerusalem, Israel
| | - Terukazu Sanui
- Section of Periodontology, Division of Oral Rehabilitation, Kyushu University Faculty of Dental Science, Fukuoka, Japan
| | - Fusanori Nishimura
- Section of Periodontology, Division of Oral Rehabilitation, Kyushu University Faculty of Dental Science, Fukuoka, Japan
| | - Lior Shapira
- Department of Periodontology, Hebrew University-Hadassah Faculty of Dental Medicine, Jerusalem, Israel
| |
Collapse
|
8
|
Dhasmana A, Uniyal S, Anukriti, Kashyap VK, Somvanshi P, Gupta M, Bhardwaj U, Jaggi M, Yallapu MM, Haque S, Chauhan SC. Topological and system-level protein interaction network (PIN) analyses to deduce molecular mechanism of curcumin. Sci Rep 2020; 10:12045. [PMID: 32694520 PMCID: PMC7374742 DOI: 10.1038/s41598-020-69011-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 06/12/2020] [Indexed: 12/14/2022] Open
Abstract
Curcumin is an important bioactive component of turmeric and also one of the important natural products, which has been investigated extensively. The precise mode of action of curcumin and its impact on system level protein networks are still not well studied. To identify the curcumin governed regulatory action on protein interaction network (PIN), an interectome was created based on 788 key proteins, extracted from PubMed literatures, and constructed by using STRING and Cytoscape programs. The PIN rewired by curcumin was a scale-free, extremely linked biological system. MCODE plug-in was used for sub-modulization analysis, wherein we identified 25 modules; ClueGo plug-in was used for the pathway’s enrichment analysis, wherein 37 enriched signalling pathways were obtained. Most of them were associated with human diseases groups, particularly carcinogenesis, inflammation, and infectious diseases. Finally, the analysis of topological characteristic like bottleneck, degree, GO term/pathways analysis, bio-kinetics simulation, molecular docking, and dynamics studies were performed for the selection of key regulatory proteins of curcumin-rewired PIN. The current findings deduce a precise molecular mechanism that curcumin might exert in the system. This comprehensive in-silico study will help to understand how curcumin induces its anti-cancerous, anti-inflammatory, and anti-microbial effects in the human body.
Collapse
Affiliation(s)
- Anupam Dhasmana
- Department of Immunology and Microbiology, School of Medicine, University of Texas Rio Grande Valley, Edinburg, TX, USA.,Department of Biosciences and Cancer Research Institute, Himalayan Institute of Medical Sciences, Swami Rama Himalayan University, Dehradun, India
| | - Swati Uniyal
- School of Biotechnology, Gautam Buddha University, Greater Noida, India
| | - Anukriti
- Department of Biosciences and Cancer Research Institute, Himalayan Institute of Medical Sciences, Swami Rama Himalayan University, Dehradun, India
| | - Vivek Kumar Kashyap
- Department of Immunology and Microbiology, School of Medicine, University of Texas Rio Grande Valley, Edinburg, TX, USA
| | - Pallavi Somvanshi
- Department of Biotechnology, TERI School of Advanced Studies, 10, Institutional Area, Vasant Kunj,, New Delhi, India
| | - Meenu Gupta
- Department of Biosciences and Cancer Research Institute, Himalayan Institute of Medical Sciences, Swami Rama Himalayan University, Dehradun, India
| | - Uma Bhardwaj
- Department of Biosciences and Cancer Research Institute, Himalayan Institute of Medical Sciences, Swami Rama Himalayan University, Dehradun, India
| | - Meena Jaggi
- Department of Immunology and Microbiology, School of Medicine, University of Texas Rio Grande Valley, Edinburg, TX, USA
| | - Murali M Yallapu
- Department of Immunology and Microbiology, School of Medicine, University of Texas Rio Grande Valley, Edinburg, TX, USA
| | - Shafiul Haque
- Research and Scientific Studies Unit, College of Nursing and Allied Health Sciences, Jazan University, Jazan, Saudi Arabia
| | - Subhash C Chauhan
- Department of Immunology and Microbiology, School of Medicine, University of Texas Rio Grande Valley, Edinburg, TX, USA.
| |
Collapse
|
9
|
Zhu L, Zhang J, Zhang Y, Lang J, Xiang J, Bai X, Yan N, Tian G, Zhang H, Yang J. NAIGO: An Improved Method to Align PPI Networks Based on Gene Ontology and Graphlets. Front Bioeng Biotechnol 2020; 8:547. [PMID: 32637398 PMCID: PMC7318716 DOI: 10.3389/fbioe.2020.00547] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Accepted: 05/06/2020] [Indexed: 11/24/2022] Open
Abstract
With the development of high throughput technologies, there are more and more protein–protein interaction (PPI) networks available, which provide a need for efficient computational tools for network alignment. Network alignment is widely used to predict functions of certain proteins, identify conserved network modules, and study the evolutionary relationship across species or biological entities. However, network alignment is an NP-complete problem, and previous algorithms are usually slow or less accurate in aligning big networks like human vs. yeast. In this study, we proposed a fast yet accurate algorithm called Network Alignment by Integrating Biological Process (NAIGO). Specifically, we first divided the networks into subnets taking the advantage of known prior knowledge, such as gene ontology. For each subnet pair, we then developed a novel method to align them by considering both protein orthologous information and their local structural information. After that, we expanded the obtained local network alignments in a greedy manner. Taking the aligned pairs as seeds, we formulated the global network alignment problem as an assignment problem based on similarity matrix, which was solved by the Hungarian method. We applied NAIGO to align human and Saccharomyces cerevisiae S288c PPI network and compared the results with other popular methods like IsoRank, GRAAL, SANA, and NABEECO. As a result, our method outperformed the competitors by aligning more orthologous proteins or matched interactions. In addition, we found a few potential functional orthologous proteins such as RRM2B in human and DNA2 in S. cerevisiae S288c, which are related to DNA repair. We also identified a conserved subnet with six orthologous proteins EXO1, MSH3, MSH2, MLH1, MLH3, and MSH6, and six aligned interactions. All these proteins are associated with mismatch repair. Finally, we predicted a few proteins of S. cerevisiae S288c potentially involving in certain biological processes like autophagosome assembly.
Collapse
Affiliation(s)
- Lijuan Zhu
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, China
| | - Ju Zhang
- Institute of Infectious Diseases, Beijing Ditan Hospital, Capital Medical University, and Beijing Key Laboratory of Emerging Infectious Diseases, Beijing, China
| | - Yi Zhang
- Department of Mathematics, Hebei University of Science & Technology, Shijiazhuang, China
| | | | - Ju Xiang
- Neuroscience Research Center & Department of Basic Medical Sciences, Changsha Medical University, Changsha, China.,School of Computer Science and Engineering, Central South University, Changsha, China
| | - Xiaogang Bai
- Department of Mathematics, Hebei University of Science & Technology, Shijiazhuang, China
| | - Na Yan
- Geneis Beijing Co., Ltd., Beijing, China
| | - Geng Tian
- Geneis Beijing Co., Ltd., Beijing, China
| | - Huajun Zhang
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, China
| | | |
Collapse
|
10
|
Gysi DM, Nowick K. Construction, comparison and evolution of networks in life sciences and other disciplines. J R Soc Interface 2020; 17:20190610. [PMID: 32370689 PMCID: PMC7276545 DOI: 10.1098/rsif.2019.0610] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Accepted: 04/09/2020] [Indexed: 12/12/2022] Open
Abstract
Network approaches have become pervasive in many research fields. They allow for a more comprehensive understanding of complex relationships between entities as well as their group-level properties and dynamics. Many networks change over time, be it within seconds or millions of years, depending on the nature of the network. Our focus will be on comparative network analyses in life sciences, where deciphering temporal network changes is a core interest of molecular, ecological, neuropsychological and evolutionary biologists. Further, we will take a journey through different disciplines, such as social sciences, finance and computational gastronomy, to present commonalities and differences in how networks change and can be analysed. Finally, we envision how borrowing ideas from these disciplines could enrich the future of life science research.
Collapse
Affiliation(s)
- Deisy Morselli Gysi
- Department of Computer Science, Interdisciplinary Center of Bioinformatics, University of Leipzig, 04109 Leipzig, Germany
- Swarm Intelligence and Complex Systems Group, Faculty of Mathematics and Computer Science, University of Leipzig, 04109 Leipzig, Germany
- Center for Complex Networks Research, Northeastern University, 177 Huntington Avenue, Boston, MA 02115, USA
| | - Katja Nowick
- Human Biology Group, Institute for Biology, Faculty of Biology, Chemistry, Pharmacy, Freie Universität Berlin, Königin-Luise-Straβe 1-3, 14195 Berlin, Germany
| |
Collapse
|
11
|
Chang HC, Chu CP, Lin SJ, Hsiao CK. Network hub-node prioritization of gene regulation with intra-network association. BMC Bioinformatics 2020; 21:101. [PMID: 32164570 PMCID: PMC7069025 DOI: 10.1186/s12859-020-3444-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Accepted: 03/06/2020] [Indexed: 11/10/2022] Open
Abstract
Background To identify and prioritize the influential hub genes in a gene-set or biological pathway, most analyses rely on calculation of marginal effects or tests of statistical significance. These procedures may be inappropriate since hub nodes are common connection points and therefore may interact with other nodes more often than non-hub nodes do. Such dependence among gene nodes can be conjectured based on the topology of the pathway network or the correlation between them. Results Here we develop a pathway activity score incorporating the marginal (local) effects of gene nodes as well as intra-network affinity measures. This score summarizes the expression levels in a gene-set/pathway for each sample, with weights on local and network information, respectively. The score is next used to examine the impact of each node through a leave-one-out evaluation. To illustrate the procedure, two cancer studies, one involving RNA-Seq from breast cancer patients with high-grade ductal carcinoma in situ and one microarray expression data from ovarian cancer patients, are used to assess the performance of the procedure, and to compare with existing methods, both ones that do and do not take into consideration correlation and network information. The hub nodes identified by the proposed procedure in the two cancer studies are known influential genes; some have been included in standard treatments and some are currently considered in clinical trials for target therapy. The results from simulation studies show that when marginal effects are mild or weak, the proposed procedure can still identify causal nodes, whereas methods relying only on marginal effect size cannot. Conclusions The NetworkHub procedure proposed in this research can effectively utilize the network information in combination with local effects derived from marker values, and provide a useful and complementary list of recommendations for prioritizing causal hubs.
Collapse
Affiliation(s)
- Hung-Ching Chang
- Division of Biostatistics, Institute of Epidemiology and Preventive Medicine, National Taiwan University, No. 17, Xu-Zhou Road, Taipei, 10055, Taiwan
| | - Chiao-Pei Chu
- Division of Biostatistics, Institute of Epidemiology and Preventive Medicine, National Taiwan University, No. 17, Xu-Zhou Road, Taipei, 10055, Taiwan
| | - Shu-Ju Lin
- Institute of Statistical Science, Academia Sinica, Taipei, 11529, Taiwan
| | - Chuhsing Kate Hsiao
- Division of Biostatistics, Institute of Epidemiology and Preventive Medicine, National Taiwan University, No. 17, Xu-Zhou Road, Taipei, 10055, Taiwan. .,Bioinformatics and Biostatistics Core, Center of Genomic Medicine, National Taiwan University, Taipei, 10055, Taiwan.
| |
Collapse
|
12
|
Akram P, Liao L. Prediction of comorbid diseases using weighted geometric embedding of human interactome. BMC Med Genomics 2019; 12:161. [PMID: 31888634 PMCID: PMC6936100 DOI: 10.1186/s12920-019-0605-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 10/16/2019] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND Comorbidity is the phenomenon of two or more diseases occurring simultaneously not by random chance and presents great challenges to accurate diagnosis and treatment. As an effort toward better understanding the genetic causes of comorbidity, in this work, we have developed a computational method to predict comorbid diseases. Two diseases sharing common genes tend to increase their comorbidity. Previous work shows that after mapping the associated genes onto the human interactome the distance between the two disease modules (subgraphs) is correlated with comorbidity. METHODS To fully incorporate structural characteristics of interactome as features into prediction of comorbidity, our method embeds the human interactome into a high dimensional geometric space with weights assigned to the network edges and uses the projection onto different dimension to "fingerprint" disease modules. A supervised machine learning classifier is then trained to discriminate comorbid diseases versus non-comorbid diseases. RESULTS In cross-validation using a benchmark dataset of more than 10,000 disease pairs, we report that our model achieves remarkable performance of ROC score = 0.90 for comorbidity threshold at relative risk RR = 0 and 0.76 for comorbidity threshold at RR = 1, and significantly outperforms the previous method and the interactome generated by annotated data. To further incorporate prior knowledge pathways association with diseases, we weight the protein-protein interaction network edges according to their frequency of occurring in those pathways in such a way that edges with higher frequency will more likely be selected in the minimum spanning tree for geometric embedding. Such weighted embedding is shown to lead to further improvement of comorbid disease prediction. CONCLUSION The work demonstrates that embedding the two-dimension planar graph of human interactome into a high dimensional geometric space allows for characterizing and capturing disease modules (subgraphs formed by the disease associated genes) from multiple perspectives, and hence provides enriched features for a supervised classifier to discriminate comorbid disease pairs from non-comorbid disease pairs more accurately than based on simply the module separation.
Collapse
Affiliation(s)
- Pakeeza Akram
- School of Electrical Engineering and Computer Science (SEECS), National University of Sciences and Technology (NUST), H-12, Islamabad, Pakistan
- Department of Computer Science, University of Delaware, Newark, USA
| | - Li Liao
- Department of Computer Science, University of Delaware, Newark, USA
| |
Collapse
|
13
|
The Eminence of Co-Expressed Ties in Schizophrenia Network Communities. DATA 2019. [DOI: 10.3390/data4040149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Exploring gene networks is crucial for identifying significant biological interactions occurring in a disease condition. These interactions can be acknowledged by modeling the tie structure of networks. Such tie orientations are often detected within embedded community structures. However, most of the prevailing community detection modules are intended to capture information from nodes and its attributes, usually ignoring the ties. In this study, a modularity maximization algorithm is proposed based on nonlinear representation of local tangent space alignment (LTSA). Initially, the tangent coordinates are computed locally to identify k-nearest neighbors across the genes. These local neighbors are further optimized by generating a nonlinear network embedding function for detecting gene communities based on eigenvector decomposition. Experimental results suggest that this algorithm detects gene modules with a better modularity index of 0.9256, compared to other traditional community detection algorithms. Furthermore, co-expressed genes across these communities are identified by discovering the characteristic tie structures. These detected ties are known to have substantial biological influence in the progression of schizophrenia, thereby signifying the influence of tie patterns in biological networks. This technique can be extended logically on other diseases networks for detecting substantial gene “hotspots”.
Collapse
|
14
|
Haider S, Ponnusamy K, Singh RKB, Chakraborti A, Bamezai RNK. Hamiltonian energy as an efficient approach to identify the significant key regulators in biological networks. PLoS One 2019; 14:e0221463. [PMID: 31442253 PMCID: PMC6707611 DOI: 10.1371/journal.pone.0221463] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 08/07/2019] [Indexed: 12/27/2022] Open
Abstract
The topological characteristics of biological networks enable us to identify the key nodes in terms of modularity. However, due to a large size of the biological networks with many hubs and functional modules across intertwined layers within the network, it often becomes difficult to accomplish the task of identifying potential key regulators. We use for the first time a generalized formalism of Hamiltonian Energy (HE) with a recursive approach. The concept, when applied to the Apoptosis Regulatory Gene Network (ARGN), helped us identify 11 Motif hubs (MHs), which influenced the network up to motif levels. The approach adopted allowed to classify MHs into 5 significant motif hubs (S-MHs) and 6 non-significant motif hubs (NS-MHs). The significant motif hubs had a higher HE value and were considered as high-active key regulators; while the non-significant motif hubs had a relatively lower HE value and were considered as low-active key regulators, in network control mechanism. Further, we compared the results of the HE analyses with the topological characterization, after subjecting to the three conditions independently: (i) removing all MHs, (ii) removing only S-MHs, and (iii) removing only NS-MHs from the ARGN. This procedure allowed us to cross-validate the role of 5 S-MHs, NFk-B1, BRCA1, CEBPB, AR, and POU2F1 as the potential key regulators. The changes in HE calculations further showed that the removal of 5 S-MHs could cause perturbation at all levels of the network, a feature not discernible by topological analysis alone.
Collapse
Affiliation(s)
- Shazia Haider
- Department of Neurology, All India Institute of Medical Science (AIIMS), New Delhi, India
| | | | - R. K. Brojen Singh
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
- * E-mail: (RKBS); (AC); (RNKB)
| | - Anirban Chakraborti
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
- * E-mail: (RKBS); (AC); (RNKB)
| | - Rameshwar N. K. Bamezai
- Formerly at National Centre of Applied Human Genetics, School of Life Sciences, Jawaharlal Nehru University, New Delhi, India
- * E-mail: (RKBS); (AC); (RNKB)
| |
Collapse
|
15
|
Investigation of Precise Molecular Mechanistic Action of Tobacco-Associated Carcinogen `NNK´ Induced Carcinogenesis: A System Biology Approach. Genes (Basel) 2019; 10:genes10080564. [PMID: 31357510 PMCID: PMC6723528 DOI: 10.3390/genes10080564] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2019] [Revised: 07/22/2019] [Accepted: 07/24/2019] [Indexed: 12/21/2022] Open
Abstract
Cancer is the second deadliest disease listed by the WHO. One of the major causes of cancer disease is tobacco and consumption possibly due to its main component, 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone (NNK). A plethora of studies have been conducted in the past aiming to decipher the association of NNK with other diseases. However, it is strongly linked with cancer development. Despite these studies, a clear molecular mechanism and the impact of NNK on various system-level networks is not known. In the present study, system biology tools were employed to understand the key regulatory mechanisms and the perturbations that will happen in the cellular processes due to NNK. To investigate the system level influence of the carcinogen, NNK rewired protein–protein interaction network (PPIN) was generated from 544 reported proteins drawn out from 1317 articles retrieved from PubMed. The noise was removed from PPIN by the method of modulation. Gene ontology (GO) enrichment was performed on the seed proteins extracted from various modules to find the most affected pathways by the genes/proteins. For the modulation, Molecular COmplex DEtection (MCODE) was used to generate 19 modules containing 115 seed proteins. Further, scrutiny of the targeted biomolecules was done by the graph theory and molecular docking. GO enrichment analysis revealed that mostly cell cycle regulatory proteins were affected by NNK.
Collapse
|
16
|
Sood U, Hira P, Kumar R, Bajaj A, Rao DLN, Lal R, Shakarad M. Comparative Genomic Analyses Reveal Core-Genome-Wide Genes Under Positive Selection and Major Regulatory Hubs in Outlier Strains of Pseudomonas aeruginosa. Front Microbiol 2019; 10:53. [PMID: 30787911 PMCID: PMC6372532 DOI: 10.3389/fmicb.2019.00053] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2018] [Accepted: 01/14/2019] [Indexed: 12/11/2022] Open
Abstract
Genomic information for outlier strains of Pseudomonas aeruginosa is exiguous when compared with classical strains. We sequenced and constructed the complete genome of an environmental strain CR1 of P. aeruginosa and performed the comparative genomic analysis. It clustered with the outlier group, hence we scaled up the analyses to understand the differences in environmental and clinical outlier strains. We identified eight new regions of genomic plasticity and a plasmid pCR1 with a VirB/D4 complex followed by trimeric auto-transporter that can induce virulence phenotype in the genome of strain CR1. Virulence genotype analysis revealed that strain CR1 lacked hemolytic phospholipase C and D, three genes for LPS biosynthesis and had reduced antibiotic resistance genes when compared with clinical strains. Genes belonging to proteases, bacterial exporters and DNA stabilization were found to be under strong positive selection, thus facilitating pathogenicity and survival of the outliers. The outliers had the complete operon for the production of vibrioferrin, a siderophore present in plant growth promoting bacteria. The competence to acquire multidrug resistance and new virulence factors makes these strains a potential threat. However, we identified major regulatory hubs that can be used as drug targets against both the classical and outlier groups.
Collapse
Affiliation(s)
- Utkarsh Sood
- Department of Zoology, University of Delhi, New Delhi, India
- PhiXGen Private Limited, Gurugram, India
| | - Princy Hira
- Department of Zoology, University of Delhi, New Delhi, India
| | - Roshan Kumar
- Department of Zoology, University of Delhi, New Delhi, India
- PhiXGen Private Limited, Gurugram, India
- Department of Veterinary & Biomedical Sciences, South Dakota State University, Brookings, SD, United States
| | - Abhay Bajaj
- Department of Zoology, University of Delhi, New Delhi, India
- National Centre for Microbial Resource, National Centre for Cell Science, Pune, India
| | | | - Rup Lal
- Department of Zoology, University of Delhi, New Delhi, India
- PhiXGen Private Limited, Gurugram, India
| | | |
Collapse
|
17
|
Martínez-Adriano CA, Díaz-Castelazo C, Aguirre-Jaimes A. Flower-mediated plant-butterfly interactions in an heterogeneous tropical coastal ecosystem. PeerJ 2018; 6:e5493. [PMID: 30210938 PMCID: PMC6130237 DOI: 10.7717/peerj.5493] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 07/26/2018] [Indexed: 11/27/2022] Open
Abstract
BACKGROUND Interspecific interactions play an important role in determining species richness and persistence in a given locality. However at some sites, the studies, especially for interaction networks on adult butterflies are scarce. The present study aimed the following objectives: (1) determine butterfly species richness and diversity that visit flowering plants, (2) compare species richness and diversity in butterfly-plant interactions among six different vegetation types and (3) analyze the structure of butterfly-flowering plant interaction networks mediated by flowers. METHODS The study was developed in six vegetation types within the natural reserve of La Mancha, located in Veracruz, Mexico. In each vegetation type, we recorded the frequency of flower visits by butterflies monthly in round plots (of radius 5 m) for 12 months. We calculated Shannon diversity for butterfly species and diversity of interactions per vegetation type. We determined the classic Jaccard similarity index among vegetation types and estimated parameters at network and species-level. RESULTS We found 123 species of butterflies belonging to 11 families and 87 genera. The highest number of species belonged to Hesperiidae (46 species), followed by Nymphalidae (28) and Pieridae (14). The highest butterfly diversity and interaction diversity was observed in pioneer dune vegetation (PDV), coastal dune scrub (CDS) and tropical deciduous flooding forest and wetland (TDF-W). The same order of vegetation types was found for interaction diversity. Highest species similarity was found between PDV-CDS and PDV-TDF. The butterfly-plant interaction network showed a nested structure with one module. The species Ascia monuste, Euptoieta hegesia and Leptotes cassius were the most generalist in the network, while Horama oedippus, E. hegesia, and L. cassius were the species with highest dependencies per plant species. DISCUSSION Our study is important because it constitutes a pioneer study of butterfly-plant interactions in this protected area, at least for adult butterflies; it shows the diversity of interactions among flowering plants and butterflies. Our research constitutes the first approach (at a community level) to explore the functional role of pollination services that butterflies provide to plant communities. We highlighted that open areas show a higher diversity and these areas shared a higher number of species that shaded sites. In the interaction networks parameters, our results highlighted the higher dependence of butterflies by the flowers on which they feed than vice versa. In conclusion, the plant species (as a feeding resource) seem to limit the presence of butterfly species. Thus, this protected area is highly relevant for Lepidoptera diversity and the interaction between these insects and flowering plants. We suggest that studying plant and butterfly diversity in tropical habitats will provide insight into their interspecific interactions and community structure.
Collapse
Affiliation(s)
| | - Cecilia Díaz-Castelazo
- Red de Interacciones Multitróficas, Instituto de Ecología, A. C., Xalapa, Veracruz, México
| | - Armando Aguirre-Jaimes
- Red de Interacciones Multitróficas, Instituto de Ecología, A. C., Xalapa, Veracruz, México
| |
Collapse
|
18
|
Network approach of the conformational change of c-Src, a tyrosine kinase, by molecular dynamics simulation. Sci Rep 2018; 8:5673. [PMID: 29618744 PMCID: PMC5884825 DOI: 10.1038/s41598-018-23964-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2017] [Accepted: 03/20/2018] [Indexed: 12/31/2022] Open
Abstract
Non-receptor tyrosine kinase c-Src plays a critical role in numerous cellular signalling pathways. Activation of c-Src from its inactive to the active state involves large-scale conformational changes, and is controlled by the phosphorylation state of two major phosphorylation sites, Tyr416 and Tyr527. A detailed mechanism for the entire conformational transition of c-Src via phosphorylation control of Tyr416 and Tyr527 is still elusive. In this study, we investigated the inactive-to-active conformational change of c-Src by targeted molecular dynamics simulation. Based on the simulation, we proposed a dynamical scenario for the activation process of c-Src. A detailed study of the conformational transition pathway based on network analysis suggests that Lys321 plays a key role in the c-Src activation process.
Collapse
|
19
|
Zhang Q, Ma C, Gearing M, Wang PG, Chin LS, Li L. Integrated proteomics and network analysis identifies protein hubs and network alterations in Alzheimer's disease. Acta Neuropathol Commun 2018; 6:19. [PMID: 29490708 PMCID: PMC5831854 DOI: 10.1186/s40478-018-0524-2] [Citation(s) in RCA: 112] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Accepted: 02/22/2018] [Indexed: 12/12/2022] Open
Abstract
Although the genetic causes for several rare, familial forms of Alzheimer’s disease (AD) have been identified, the etiology of the sporadic form of AD remains unclear. Here, we report a systems-level study of disease-associated proteome changes in human frontal cortex of sporadic AD patients using an integrated approach that combines mass spectrometry-based quantitative proteomics, differential expression analysis, and co-expression network analysis. Our analyses of 16 human brain tissues from AD patients and age-matched controls showed organization of the cortical proteome into a network of 24 biologically meaningful modules of co-expressed proteins. Of these, 5 modules are positively correlated to AD phenotypes with hub proteins that are up-regulated in AD, and 6 modules are negatively correlated to AD phenotypes with hub proteins that are down-regulated in AD. Our study generated a molecular blueprint of altered protein networks in AD brain and uncovered the dysregulation of multiple pathways and processes in AD brain, including altered proteostasis, RNA homeostasis, immune response, neuroinflammation, synaptic transmission, vesicular transport, cell signaling, cellular metabolism, lipid homeostasis, mitochondrial dynamics and function, cytoskeleton organization, and myelin-axon interactions. Our findings provide new insights into AD pathogenesis and suggest novel candidates for future diagnostic and therapeutic development.
Collapse
|
20
|
Akram P, Liao L. Prediction of missing common genes for disease pairs using network based module separation on incomplete human interactome. BMC Genomics 2017; 18:902. [PMID: 29244004 PMCID: PMC5731604 DOI: 10.1186/s12864-017-4272-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Background Identification of common genes associated with comorbid diseases can be critical in understanding their pathobiological mechanism. This work presents a novel method to predict missing common genes associated with a disease pair. Searching for missing common genes is formulated as an optimization problem to minimize network based module separation from two subgraphs produced by mapping genes associated with disease onto the interactome. Results Using cross validation on more than 600 disease pairs, our method achieves significantly higher average receiver operating characteristic ROC Score of 0.95 compared to a baseline ROC score 0.60 using randomized data. Conclusion Missing common genes prediction is aimed to complete gene set associated with comorbid disease for better understanding of biological intervention. It will also be useful for gene targeted therapeutics related to comorbid diseases. This method can be further considered for prediction of missing edges to complete the subgraph associated with disease pair.
Collapse
Affiliation(s)
- Pakeeza Akram
- Department of Computer & Information Sciences, University of Delaware, Newark, DE, USA
| | - Li Liao
- Department of Computer & Information Sciences, University of Delaware, Newark, DE, USA.
| |
Collapse
|
21
|
Comparative Genomic Analysis Reveals Habitat-Specific Genes and Regulatory Hubs within the Genus Novosphingobium. mSystems 2017; 2:mSystems00020-17. [PMID: 28567447 PMCID: PMC5443232 DOI: 10.1128/msystems.00020-17] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 04/17/2017] [Indexed: 11/24/2022] Open
Abstract
This study highlights the significant role of the genetic repertoire of a microorganism in the similarity between Novosphingobium strains. The results suggest that the phylogenetic relationships were mostly influenced by metabolic trait enrichment, which is possibly governed by the microenvironment of each microbe’s respective niche. Using core genome analysis, the enrichment of a certain set of genes specific to a particular habitat was determined, which provided insights on the influence of habitat on the distribution of metabolic traits for Novosphingobium strains. We also identified habitat-specific protein hubs, which suggested delineation of Novosphingobium strains based on their habitat. Examining the available genomes of ecologically diverse bacterial species and analyzing the habitat-specific genes are useful for understanding the distribution and evolution of functional and phylogenetic diversity in the genus Novosphingobium. Species belonging to the genus Novosphingobium are found in many different habitats and have been identified as metabolically versatile. Through comparative genomic analysis, we identified habitat-specific genes and regulatory hubs that could determine habitat selection for Novosphingobium spp. Genomes from 27 Novosphingobium strains isolated from diverse habitats such as rhizosphere soil, plant surfaces, heavily contaminated soils, and marine and freshwater environments were analyzed. Genome size and coding potential were widely variable, differing significantly between habitats. Phylogenetic relationships between strains were less likely to describe functional genotype similarity than the habitat from which they were isolated. In this study, strains (19 out of 27) with a recorded habitat of isolation, and at least 3 representative strains per habitat, comprised four ecological groups—rhizosphere, contaminated soil, marine, and freshwater. Sulfur acquisition and metabolism were the only core genomic traits to differ significantly in proportion between these ecological groups; for example, alkane sulfonate (ssuABCD) assimilation was found exclusively in all of the rhizospheric isolates. When we examined osmolytic regulation in Novosphingobium spp. through ectoine biosynthesis, which was assumed to be marine habitat specific, we found that it was also present in isolates from contaminated soil, suggesting its relevance beyond the marine system. Novosphingobium strains were also found to harbor a wide variety of mono- and dioxygenases, responsible for the metabolism of several aromatic compounds, suggesting their potential to act as degraders of a variety of xenobiotic compounds. Protein-protein interaction analysis revealed β-barrel outer membrane proteins as habitat-specific hubs in each of the four habitats—freshwater (Saro_1868), marine water (PP1Y_AT17644), rhizosphere (PMI02_00367), and soil (V474_17210). These outer membrane proteins could play a key role in habitat demarcation and extend our understanding of the metabolic versatility of the Novosphingobium species. IMPORTANCE This study highlights the significant role of a microorganism’s genetic repertoire in structuring the similarity between Novosphingobium strains. The results suggest that the phylogenetic relationships were mostly influenced by metabolic trait enrichment, which is possibly governed by the microenvironment of each microbe’s respective niche. Using core genome analysis, the enrichment of a certain set of genes specific to a particular habitat was determined, which provided insights on the influence of habitat on the distribution of metabolic traits in Novosphingobium strains. We also identified habitat-specific protein hubs, which suggested delineation of Novosphingobium strains based on their habitat. Examining the available genomes of ecologically diverse bacterial species and analyzing the habitat-specific genes are useful for understanding the distribution and evolution of functional and phylogenetic diversity in the genus Novosphingobium.
Collapse
|
22
|
Puig-Butille JA, Gimenez-Xavier P, Visconti A, Nsengimana J, Garcia-García F, Tell-Marti G, Escamez MJ, Newton-Bishop J, Bataille V, del Río M, Dopazo J, Falchi M, Puig S. Genomic expression differences between cutaneous cells from red hair color individuals and black hair color individuals based on bioinformatic analysis. Oncotarget 2017; 8:11589-11599. [PMID: 28030792 PMCID: PMC5355288 DOI: 10.18632/oncotarget.14140] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Accepted: 11/21/2016] [Indexed: 12/11/2022] Open
Abstract
The MC1R gene plays a crucial role in pigmentation synthesis. Loss-of-function MC1R variants, which impair protein function, are associated with red hair color (RHC) phenotype and increased skin cancer risk. Cultured cutaneous cells bearing loss-of-function MC1R variants show a distinct gene expression profile compared to wild-type MC1R cultured cutaneous cells. We analysed the gene signature associated with RHC co-cultured melanocytes and keratinocytes by Protein-Protein interaction (PPI) network analysis to identify genes related with non-functional MC1R variants. From two detected networks, we selected 23 nodes as hub genes based on topological parameters. Differential expression of hub genes was then evaluated in healthy skin biopsies from RHC and black hair color (BHC) individuals. We also compared gene expression in melanoma tumors from individuals with RHC versus BHC. Gene expression in normal skin from RHC cutaneous cells showed dysregulation in 8 out of 23 hub genes (CLN3, ATG10, WIPI2, SNX2, GABARAPL2, YWHA, PCNA and GBAS). Hub genes did not differ between melanoma tumors in RHC versus BHC individuals. The study suggests that healthy skin cells from RHC individuals present a constitutive genomic deregulation associated with the red hair phenotype and identify novel genes involved in melanocyte biology.
Collapse
Affiliation(s)
- Joan Anton Puig-Butille
- Biochemistry and Molecular Genetics Department, Melanoma Unit, Hospital Clinic & IDIBAPS, CIBER de Enfermedades Raras (CIBERER), Barcelona, Spain
| | - Pol Gimenez-Xavier
- Dermatology Department, Melanoma Unit, Hospital Clinic & IDIBAPS, CIBER de Enfermedades Raras (CIBERER), Barcelona, Spain
| | - Alessia Visconti
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Jérémie Nsengimana
- Section of Epidemiology and Biostatistics, Leeds Institute of Cancer and Pathology, University of Leeds, Leeds, UK
| | - Francisco Garcia-García
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Gemma Tell-Marti
- Dermatology Department, Melanoma Unit, Hospital Clinic & IDIBAPS, CIBER de Enfermedades Raras (CIBERER), Barcelona, Spain
| | - Maria José Escamez
- Departamento de Bioingeniería, Universidad Carlos III de Madrid, CIEMAT, IIS-Fundación Jiménez Díaz, CIBER de Enfermedades Raras (CIBERER), Madrid, Spain
| | - Julia Newton-Bishop
- Section of Epidemiology and Biostatistics, Leeds Institute of Cancer and Pathology, University of Leeds, Leeds, UK
| | - Veronique Bataille
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Marcela del Río
- Departamento de Bioingeniería, Universidad Carlos III de Madrid, CIEMAT, IIS-Fundación Jiménez Díaz, CIBER de Enfermedades Raras (CIBERER), Madrid, Spain
| | - Joaquín Dopazo
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
- Functional Genomics Node, (INB) at CIPF, Valencia, Spain
- CIBER de Enfermedades Raras (CIBERER), Valencia, Spain
| | - Mario Falchi
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Susana Puig
- Dermatology Department, Melanoma Unit, Hospital Clinic & IDIBAPS, CIBER de Enfermedades Raras (CIBERER), Barcelona, Spain
| |
Collapse
|
23
|
Nandi S, Subramanian A, Sarkar RR. An integrative machine learning strategy for improved prediction of essential genes in Escherichia coli metabolism using flux-coupled features. MOLECULAR BIOSYSTEMS 2017; 13:1584-1596. [DOI: 10.1039/c7mb00234c] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
We propose an integrated machine learning process to predict gene essentiality in Escherichia coli K-12 MG1655 metabolism that outperforms known methods.
Collapse
Affiliation(s)
- Sutanu Nandi
- Chemical Engineering and Process Development
- CSIR-National Chemical Laboratory
- Pune-411008
- India
- Academy of Scientific & Innovative Research (AcSIR)
| | - Abhishek Subramanian
- Chemical Engineering and Process Development
- CSIR-National Chemical Laboratory
- Pune-411008
- India
- Academy of Scientific & Innovative Research (AcSIR)
| | - Ram Rup Sarkar
- Chemical Engineering and Process Development
- CSIR-National Chemical Laboratory
- Pune-411008
- India
- Academy of Scientific & Innovative Research (AcSIR)
| |
Collapse
|
24
|
Ko Y, Cho M, Lee JS, Kim J. Identification of disease comorbidity through hidden molecular mechanisms. Sci Rep 2016; 6:39433. [PMID: 27991583 PMCID: PMC5172201 DOI: 10.1038/srep39433] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Accepted: 11/22/2016] [Indexed: 12/27/2022] Open
Abstract
Despite multiple diseases co-occur, their underlying common molecular mechanisms remain elusive. Identification of comorbid diseases by considering the interactions between molecular components is a key to understand the underlying disease mechanisms. Here, we developed a novel approach utilizing both common disease-causing genes and underlying molecular pathways to identify comorbid diseases. Our approach enables the analysis of common pathologies shared by comorbid diseases through molecular interaction networks. We found that the integration of direct genetic sharing and indirect high-level molecular associations revealed significantly strong consistency with known comorbid diseases. In addition, neoplasm-related diseases showed high comorbidity patterns within themselves as well as with other diseases, indicating severe complications. This study demonstrated that molecular pathway information could be used to discover disease comorbidity and hidden biological mechanism to understand pathogenesis and provide new insight on disease pathology.
Collapse
Affiliation(s)
- Younhee Ko
- Department of Clinical Genetics, Department of Pediatrics, Yonsei University College of Medicine, Seoul 03722, South Korea
| | - Minah Cho
- Department of Stem Cell and Regenerative Biology, Konkuk University, Seoul 05029, South Korea
| | - Jin-Sung Lee
- Department of Clinical Genetics, Department of Pediatrics, Yonsei University College of Medicine, Seoul 03722, South Korea
| | - Jaebum Kim
- Department of Stem Cell and Regenerative Biology, Konkuk University, Seoul 05029, South Korea
| |
Collapse
|
25
|
Multiplatform serum metabolic phenotyping combined with pathway mapping to identify biochemical differences in smokers. Bioanalysis 2016; 8:2023-43. [DOI: 10.4155/bio-2016-0108] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Aim: Determining perturbed biochemical functions associated with tobacco smoking should be helpful for establishing causal relationships between exposure and adverse events. Results: A multiplatform comparison of serum of smokers (n = 55) and never-smokers (n = 57) using nuclear magnetic resonance spectroscopy, UPLC–MS and statistical modeling revealed clustering of the classes, distinguished by metabolic biomarkers. The identified metabolites were subjected to metabolic pathway enrichment, modeling adverse biological events using available databases. Perturbation of metabolites involved in chronic obstructive pulmonary disease, cardiovascular diseases and cancer were identified and discussed. Conclusion: Combining multiplatform metabolic phenotyping with knowledge-based mapping gives mechanistic insights into disease development, which can be applied to next-generation tobacco and nicotine products for comparative risk assessment.
Collapse
|
26
|
Comparative genomic analysis of novel Acinetobacter symbionts: A combined systems biology and genomics approach. Sci Rep 2016; 6:29043. [PMID: 27378055 PMCID: PMC4932630 DOI: 10.1038/srep29043] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 06/08/2016] [Indexed: 12/20/2022] Open
Abstract
The increasing trend of antibiotic resistance in Acinetobacter drastically limits the range of therapeutic agents required to treat multidrug resistant (MDR) infections. This study focused on analysis of novel Acinetobacter strains using a genomics and systems biology approach. Here we used a network theory method for pathogenic and non-pathogenic Acinetobacter spp. to identify the key regulatory proteins (hubs) in each strain. We identified nine key regulatory proteins, guaA, guaB, rpsB, rpsI, rpsL, rpsE, rpsC, rplM and trmD, which have functional roles as hubs in a hierarchical scale-free fractal protein-protein interaction network. Two key hubs (guaA and guaB) were important for insect-associated strains, and comparative analysis identified guaA as more important than guaB due to its role in effective module regulation. rpsI played a significant role in all the novel strains, while rplM was unique to sheep-associated strains. rpsM, rpsB and rpsI were involved in the regulation of overall network topology across all Acinetobacter strains analyzed in this study. Future analysis will investigate whether these hubs are useful as drug targets for treating Acinetobacter infections.
Collapse
|
27
|
Bahmani R, Kim DG, Kim JA, Hwang S. The Density and Length of Root Hairs Are Enhanced in Response to Cadmium and Arsenic by Modulating Gene Expressions Involved in Fate Determination and Morphogenesis of Root Hairs in Arabidopsis. FRONTIERS IN PLANT SCIENCE 2016; 7:1763. [PMID: 27933081 PMCID: PMC5120091 DOI: 10.3389/fpls.2016.01763] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 11/08/2016] [Indexed: 05/19/2023]
Abstract
Root hairs are tubular outgrowths that originate from epidermal cells. Exposure of Arabidopsis to cadmium (Cd) and arsenic [arsenite, As(III)] increases root hair density and length. To examine the underlying mechanism, we measured the expression of genes involved in fate determination and morphogenesis of root hairs. Cd and As(III) downregulated TTG1 and GL2 (negative regulators of fate determination) and upregulated GEM (positive regulator), suggesting that root hair fate determination is stimulated by Cd and As(III). Cd and As(III) increased the transcript levels of genes involved in root hair initiation (RHD6 and AXR2) and root hair elongation (AUX1, AXR1, ETR1, and EIN2) except CTR1. DR5::GUS transgenic Arabidopsis showed a higher DR5 expression in the root tip, suggesting that Cd and As(III) increased the auxin content in the root tip. Knockdown of TTG1 in Arabidopsis resulted in increased root hair density and decreased root hair length compared with the control (Col-0) on 1/2 MS media. This phenotype may be attributed to the downregulation of GL2 and CTR1 and upregulation of RHD6. By contrast, gem mutant plants displayed a decrease in root hair density and length with reduced expression of RHD6, AXR2, AUX1, AXR1, ETR1, CTR1, and EIN2. Taken together, our results indicate that fate determination, initiation, and elongation of root hairs are stimulated in response to Cd and As(III) through the modulation of the expression of genes involved in these processes in Arabidopsis.
Collapse
Affiliation(s)
- Ramin Bahmani
- Department of Molecular Biology, Sejong UniversitySeoul, South Korea
- Department of Bioindustry and Bioresource Engineering, Sejong UniversitySeoul, South Korea
- Plant Engineering Research Institute, Sejong UniversitySeoul, South Korea
| | - Dong G. Kim
- Department of Molecular Biology, Sejong UniversitySeoul, South Korea
- Department of Bioindustry and Bioresource Engineering, Sejong UniversitySeoul, South Korea
- Plant Engineering Research Institute, Sejong UniversitySeoul, South Korea
| | - Jin A. Kim
- Department of Molecular Biology, Sejong UniversitySeoul, South Korea
- Department of Bioindustry and Bioresource Engineering, Sejong UniversitySeoul, South Korea
| | - Seongbin Hwang
- Department of Molecular Biology, Sejong UniversitySeoul, South Korea
- Department of Bioindustry and Bioresource Engineering, Sejong UniversitySeoul, South Korea
- Plant Engineering Research Institute, Sejong UniversitySeoul, South Korea
- *Correspondence: Seongbin Hwang,
| |
Collapse
|
28
|
Nafis S, Ponnusamy K, Husain M, Singh RKB, Bamezai RNK. Identification of key regulators and their controlling mechanism in a combinatorial apoptosis network: a systems biology approach. MOLECULAR BIOSYSTEMS 2016; 12:3357-3369. [DOI: 10.1039/c6mb00526h] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
NFKB1, SP1 and hsa-let-7a, were identified as key regulators of apoptosis, by network theory through probability of signal propagation, hub-removal and motif analysis.
Collapse
Affiliation(s)
- Shazia Nafis
- Department of Biotechnology
- Jamia Millia Islamia (Central University)
- New Delhi
- India
- School of Computational and Integrative Sciences
| | - Kalaiarasan Ponnusamy
- National Centre of Applied Human Genetics
- School of Life Sciences
- Jawaharlal Nehru University
- New Delhi
- India
| | - Mohammad Husain
- Department of Biotechnology
- Jamia Millia Islamia (Central University)
- New Delhi
- India
| | - R. K. Brojen Singh
- School of Computational and Integrative Sciences
- Jawaharlal Nehru University
- New Delhi
- India
| | - Rameshwar N. K. Bamezai
- School of Computational and Integrative Sciences
- Jawaharlal Nehru University
- New Delhi
- India
- National Centre of Applied Human Genetics
| |
Collapse
|
29
|
Zhang Q, Li J, Xue H, Kong L, Wang Y. Network-based methods for identifying critical pathways of complex diseases: a survey. MOLECULAR BIOSYSTEMS 2016; 12:1082-1089. [DOI: 10.1039/c5mb00815h] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/29/2023]
Abstract
We review seven major network-based pathway analysis methods and enumerate their benefits and limitations from an algorithmic perspective to provide a reference for the next generation of pathway analysis methods.
Collapse
Affiliation(s)
- Qiaosheng Zhang
- School of Computer Science and Technology
- Harbin Institute of Technology
- China
- Heilongjiang Bayi Agricultural University
- China
| | - Jie Li
- School of Computer Science and Technology
- Harbin Institute of Technology
- China
| | - Hanqing Xue
- School of Computer Science and Technology
- Harbin Institute of Technology
- China
| | - Leilei Kong
- School of Computer Science and Technology
- Heilongjiang Institute of Technology
- China
| | - Yadong Wang
- School of Computer Science and Technology
- Harbin Institute of Technology
- China
| |
Collapse
|
30
|
Gan Y, Zheng S, Baak JP, Zhao S, Zheng Y, Luo N, Liao W, Fu C. Prediction of the anti-inflammatory mechanisms of curcumin by module-based protein interaction network analysis. Acta Pharm Sin B 2015; 5:590-5. [PMID: 26713275 PMCID: PMC4675814 DOI: 10.1016/j.apsb.2015.09.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Revised: 08/14/2015] [Accepted: 09/11/2015] [Indexed: 12/16/2022] Open
Abstract
Curcumin, the medically active component from Curcuma longa (Turmeric), is widely used to treat inflammatory diseases. Protein interaction network (PIN) analysis was used to predict its mechanisms of molecular action. Targets of curcumin were obtained based on ChEMBL and STITCH databases. Protein–protein interactions (PPIs) were extracted from the String database. The PIN of curcumin was constructed by Cytoscape and the function modules identified by gene ontology (GO) enrichment analysis based on molecular complex detection (MCODE). A PIN of curcumin with 482 nodes and 1688 interactions was constructed, which has scale-free, small world and modular properties. Based on analysis of these function modules, the mechanism of curcumin is proposed. Two modules were found to be intimately associated with inflammation. With function modules analysis, the anti-inflammatory effects of curcumin were related to SMAD, ERG and mediation by the TLR family. TLR9 may be a potential target of curcumin to treat inflammation.
Collapse
Key Words
- Anti-inflammatory
- Curcumin
- Cytoscape
- ETS, erythroblast transformation-specific
- GO, gene ontology
- Gene ontology enrichment analysis
- IFNs, interferons
- IL, interleukin
- JAK-STAT, Janus kinase-STAT
- MAPK, mitogen-activated protein kinase
- MCODE, molecular complex detection
- Module
- Molecular complex detection
- Molecular mechanism
- NF-κB, nuclear factor kappa B
- PIN, protein interaction network
- PPIs, protein–protein interactions
- Protein interaction network
- STATs, signal transducer and activator of transcription complexes
- TLR, toll-like receptor
Collapse
|
31
|
The Mechanism Research of Qishen Yiqi Formula by Module-Network Analysis. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2015; 2015:497314. [PMID: 26379745 PMCID: PMC4561322 DOI: 10.1155/2015/497314] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 08/04/2015] [Indexed: 01/18/2023]
Abstract
Qishen Yiqi formula (QSYQ) has the effect of tonifying Qi and promoting blood circulation, which is widely used to treat the cardiovascular diseases with Qi deficiency and blood stasis syndrome. However, the mechanism of QSYQ to tonify Qi and promote blood circulation is rarely reported at molecular or systems level. This study aimed to elucidate the mechanism of QSYQ based on the protein interaction network (PIN) analysis. The targets' information of the active components was obtained from ChEMBL and STITCH databases and was further used to search against protein-protein interactions by String database. Next, the PINs of QSYQ were constructed by Cytoscape and were analyzed by gene ontology enrichment analysis based on Markov Cluster algorithm. Finally, based on the topological parameters, the properties of scale-free, small world, and modularity of the QSYQ's PINs were analyzed. And based on function modules, the mechanism of QSYQ was elucidated. The results indicated that Qi-tonifying efficacy of QSYQ may be partly attributed to the regulation of amino acid metabolism, carbohydrate metabolism, lipid metabolism, and cAMP metabolism, while QSYQ improves the blood stasis through the regulation of blood coagulation and cardiac muscle contraction. Meanwhile, the “synergy” of formula compatibility was also illuminated.
Collapse
|
32
|
Resendis-Antonio O, González-Torres C, Jaime-Muñoz G, Hernandez-Patiño CE, Salgado-Muñoz CF. Modeling metabolism: A window toward a comprehensive interpretation of networks in cancer. Semin Cancer Biol 2015; 30:79-87. [DOI: 10.1016/j.semcancer.2014.04.003] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2014] [Revised: 04/01/2014] [Accepted: 04/04/2014] [Indexed: 12/01/2022]
|
33
|
Networks and Hierarchies: Approaching Complexity in Evolutionary Theory. INTERDISCIPLINARY EVOLUTION RESEARCH 2015. [DOI: 10.1007/978-3-319-15045-1_6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
34
|
Nafis S, Kalaiarasan P, Brojen Singh RK, Husain M, Bamezai RNK. Apoptosis regulatory protein-protein interaction demonstrates hierarchical scale-free fractal network. Brief Bioinform 2014; 16:675-99. [PMID: 25256288 DOI: 10.1093/bib/bbu036] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2014] [Accepted: 08/21/2014] [Indexed: 12/29/2022] Open
Abstract
Dysregulation or inhibition of apoptosis favors cancer and many other diseases. Understanding of the network interaction of the genes involved in apoptotic pathway, therefore, is essential, to look for targets of therapeutic intervention. Here we used the network theory methods, using experimentally validated 25 apoptosis regulatory proteins and identified important genes for apoptosis regulation, which demonstrated a hierarchical scale-free fractal protein-protein interaction network. TP53, BRCA1, UBIQ and CASP3 were recognized as a four key regulators. BRCA1 and UBIQ were also individually found to control highly clustered modules and play an important role in the stability of the overall network. The connection among the BRCA1, UBIQ and TP53 proteins was found to be important for regulation, which controlled their own respective communities and the overall network topology. The feedback loop regulation motif was identified among NPM1, BRCA1 and TP53, and these crucial motif topologies were also reflected in high frequency. The propagation of the perturbed signal from hubs was found to be active upto some distance, after which propagation started decreasing and TP53 was the most efficient signal propagator. From the functional enrichment analysis, most of the apoptosis regulatory genes associated with cardiovascular diseases and highly expressed in brain tissues were identified. Apart from TP53, BRCA1 was observed to regulate apoptosis by influencing motif, propagation of signals and module regulation, reflecting their biological significance. In future, biochemical investigation of the observed hub-interacting partners could provide further understanding about their role in the pathophysiology of cancer.
Collapse
|
35
|
Adenovirus E1A targets the DREF nuclear factor to regulate virus gene expression, DNA replication, and growth. J Virol 2014; 88:13469-81. [PMID: 25210186 DOI: 10.1128/jvi.02538-14] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
UNLABELLED The adenovirus E1A gene is the first gene expressed upon viral infection. E1A remodels the cellular environment to maximize permissivity for viral replication. E1A is also the major transactivator of viral early gene expression and a coregulator of a large number of cellular genes. E1A carries out its functions predominantly by binding to cellular regulatory proteins and altering their activities. The unstructured nature of E1A enables it to bind to a large variety of cellular proteins and form new molecular complexes with novel functions. The C terminus of E1A is the least-characterized region of the protein, with few known binding partners. Here we report the identification of cellular factor DREF (ZBED1) as a novel and direct binding partner of E1A. Our studies identify a dual role for DREF in the viral life cycle. DREF contributes to activation of gene expression from all viral promoters early in infection. Unexpectedly, it also functions as a growth restriction factor for adenovirus as knockdown of DREF enhances virus growth and increases viral genome copy number late in the infection. We also identify DREF as a component of viral replication centers. E1A affects the subcellular distribution of DREF within PML bodies and enhances DREF SUMOylation. Our findings identify DREF as a novel E1A C terminus binding partner and provide evidence supporting a role for DREF in viral replication. IMPORTANCE This work identifies the putative transcription factor DREF as a new target of the E1A oncoproteins of human adenovirus. DREF was found to primarily localize with PML nuclear bodies in uninfected cells and to relocalize into virus replication centers during infection. DREF was also found to be SUMOylated, and this was enhanced in the presence of E1A. Knockdown of DREF reduced the levels of viral transcripts detected at 20 h, but not at 40 h, postinfection, increased overall virus yield, and enhanced viral DNA replication. DREF was also found to localize to viral promoters during infection together with E1A. These results suggest that DREF contributes to activation of viral gene expression. However, like several other PML-associated proteins, DREF also appears to function as a growth restriction factor for adenovirus infection.
Collapse
|
36
|
Minervini G, Panizzoni E, Giollo M, Masiero A, Ferrari C, Tosatto SCE. Design and analysis of a Petri net model of the Von Hippel-Lindau (VHL) tumor suppressor interaction network. PLoS One 2014; 9:e96986. [PMID: 24886840 PMCID: PMC4041725 DOI: 10.1371/journal.pone.0096986] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2013] [Accepted: 04/14/2014] [Indexed: 02/01/2023] Open
Abstract
Von Hippel-Lindau (VHL) syndrome is a hereditary condition predisposing to the development of different cancer forms, related to germline inactivation of the homonymous tumor suppressor pVHL. The best characterized function of pVHL is the ubiquitination dependent degradation of Hypoxia Inducible Factor (HIF) via the proteasome. It is also involved in several cellular pathways acting as a molecular hub and interacting with more than 200 different proteins. Molecular details of pVHL plasticity remain in large part unknown. Here, we present a novel manually curated Petri Net (PN) model of the main pVHL functional pathways. The model was built using functional information derived from the literature. It includes all major pVHL functions and is able to credibly reproduce VHL syndrome at the molecular level. The reliability of the PN model also allowed in silico knockout experiments, driven by previous model analysis. Interestingly, PN analysis suggests that the variability of different VHL manifestations is correlated with the concomitant inactivation of different metabolic pathways.
Collapse
Affiliation(s)
| | | | - Manuel Giollo
- Dept. of Biomedical Sciences, University of Padua, Padua, Italy
- Dept. of Information Engineering, University of Padua, Padua, Italy
| | | | - Carlo Ferrari
- Dept. of Information Engineering, University of Padua, Padua, Italy
| | | |
Collapse
|
37
|
Liu X, Pan L. Detection of driver metabolites in the human liver metabolic network using structural controllability analysis. BMC SYSTEMS BIOLOGY 2014; 8:51. [PMID: 24885538 PMCID: PMC4024020 DOI: 10.1186/1752-0509-8-51] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2014] [Accepted: 04/28/2014] [Indexed: 12/20/2022]
Abstract
Background Abnormal states in human liver metabolism are major causes of human liver diseases ranging from hepatitis to hepatic tumor. The accumulation in relevant data makes it feasible to derive a large-scale human liver metabolic network (HLMN) and to discover important biological principles or drug-targets based on network analysis. Some studies have shown that interesting biological phenomenon and drug-targets could be discovered by applying structural controllability analysis (which is a newly prevailed concept in networks) to biological networks. The exploration on the connections between structural controllability theory and the HLMN could be used to uncover valuable information on the human liver metabolism from a fresh perspective. Results We applied structural controllability analysis to the HLMN and detected driver metabolites. The driver metabolites tend to have strong ability to influence the states of other metabolites and weak susceptibility to be influenced by the states of others. In addition, the metabolites were classified into three classes: critical, high-frequency and low-frequency driver metabolites. Among the identified 36 critical driver metabolites, 27 metabolites were found to be essential; the high-frequency driver metabolites tend to participate in different metabolic pathways, which are important in regulating the whole metabolic systems. Moreover, we explored some other possible connections between the structural controllability theory and the HLMN, and find that transport reactions and the environment play important roles in the human liver metabolism. Conclusion There are interesting connections between the structural controllability theory and the human liver metabolism: driver metabolites have essential biological functions; the crucial role of extracellular metabolites and transport reactions in controlling the HLMN highlights the importance of the environment in the health of human liver metabolism.
Collapse
Affiliation(s)
| | - Linqiang Pan
- Key Laboratory of Image Information Processing and Intelligent Control, School of Automation, Huazhong University of Science and Technology, Luoyu Road 1037, 430074 Wuhan, China.
| |
Collapse
|
38
|
Sengupta J, Ghosh D. Multi-level and multi-scale integrative approach to the understanding of human blastocyst implantation. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2014; 114:49-60. [DOI: 10.1016/j.pbiomolbio.2013.12.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2013] [Accepted: 12/04/2013] [Indexed: 11/25/2022]
|
39
|
Wang HQ, Tsai CJ. CorSig: a general framework for estimating statistical significance of correlation and its application to gene co-expression analysis. PLoS One 2013; 8:e77429. [PMID: 24194884 PMCID: PMC3806744 DOI: 10.1371/journal.pone.0077429] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2013] [Accepted: 09/02/2013] [Indexed: 11/19/2022] Open
Abstract
UNLABELLED With the rapid increase of omics data, correlation analysis has become an indispensable tool for inferring meaningful associations from a large number of observations. Pearson correlation coefficient (PCC) and its variants are widely used for such purposes. However, it remains challenging to test whether an observed association is reliable both statistically and biologically. We present here a new method, CorSig, for statistical inference of correlation significance. CorSig is based on a biology-informed null hypothesis, i.e., testing whether the true PCC (ρ) between two variables is statistically larger than a user-specified PCC cutoff (τ), as opposed to the simple null hypothesis of ρ = 0 in existing methods, i.e., testing whether an association can be declared without a threshold. CorSig incorporates Fisher's Z transformation of the observed PCC (r), which facilitates use of standard techniques for p-value computation and multiple testing corrections. We compared CorSig against two methods: one uses a minimum PCC cutoff while the other (Zhu's procedure) controls correlation strength and statistical significance in two discrete steps. CorSig consistently outperformed these methods in various simulation data scenarios by balancing between false positives and false negatives. When tested on real-world Populus microarray data, CorSig effectively identified co-expressed genes in the flavonoid pathway, and discriminated between closely related gene family members for their differential association with flavonoid and lignin pathways. The p-values obtained by CorSig can be used as a stand-alone parameter for stratification of co-expressed genes according to their correlation strength in lieu of an arbitrary cutoff. CorSig requires one single tunable parameter, and can be readily extended to other correlation measures. Thus, CorSig should be useful for a wide range of applications, particularly for network analysis of high-dimensional genomic data. SOFTWARE AVAILABILITY A web server for CorSig is provided at http://202.127.200.1:8080/probeWeb. R code for CorSig is freely available for non-commercial use at http://aspendb.uga.edu/downloads.
Collapse
Affiliation(s)
- Hong-Qiang Wang
- Intelligent Computing Lab, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China
- * E-mail: (HQW); (CJT)
| | - Chung-Jui Tsai
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
- Warnell School of Forestry and Natural Resources, University of Georgia, Athens, Georgia, United States of America
- * E-mail: (HQW); (CJT)
| |
Collapse
|
40
|
|
41
|
Higareda-Almaraz JC, Valtierra-Gutiérrez IA, Hernandez-Ortiz M, Contreras S, Hernandez E, Encarnacion S. Analysis and prediction of pathways in HeLa cells by integrating biological levels of organization with systems-biology approaches. PLoS One 2013; 8:e65433. [PMID: 23785426 PMCID: PMC3680226 DOI: 10.1371/journal.pone.0065433] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2012] [Accepted: 04/30/2013] [Indexed: 11/18/2022] Open
Abstract
It has recently begun to be considered that cancer is a systemic disease and that it must be studied at every level of complexity using many of the currently available approaches, including high-throughput technologies and bioinformatics. To achieve such understanding in cervical cancer, we collected information on gene, protein and phosphoprotein expression of the HeLa cell line and performed a comprehensive analysis of the different signaling pathways, transcription networks and metabolic events in which they participate. A total expression analysis by RNA-Seq of the HeLa cell line showed that 19,974 genes were transcribed. Of these, 3,360 were over-expressed, and 2,129 under-expressed when compared to the NHEK cell line. A protein-protein interaction network was derived from the over-expressed genes and used to identify central elements and, together with the analysis of over-represented transcription factor motifs, to predict active signaling and regulatory pathways. This was further validated by Metal-Oxide Affinity Chromatography (MOAC) and Tandem Mass Spectrometry (MS/MS) assays which retrieved phosphorylated proteins. The 14-3-3 family members emerge as important regulators in carcinogenesis and as possible clinical targets. We observed that the different over- and under-regulated pathways in cervical cancer could be interrelated through elements that participate in crosstalks, therefore belong to what we term "meta-pathways". Additionally, we highlighted the relations of each one of the differentially represented pathways to one or more of the ten hallmarks of cancer. These features could be maintained in many other types of cancer, regardless of mutations or genomic rearrangements, and favor their robustness, adaptations and the evasion of tissue control. Probably, this could explain why cancer cells are not eliminated by selective pressure and why therapy trials directed against molecular targets are not as effective as expected.
Collapse
Affiliation(s)
- Juan Carlos Higareda-Almaraz
- Functional Genomics of Prokaryotes Research Program, Center for Genomic Sciences, National Autonomous University of Mexico, Cuernavaca, Morelos, Mexico
| | - Ilse A. Valtierra-Gutiérrez
- Functional Genomics of Prokaryotes Research Program, Center for Genomic Sciences, National Autonomous University of Mexico, Cuernavaca, Morelos, Mexico
- Undergraduate Program on Genomic Sciences, National Autonomous University of Mexico, Cuernavaca, Morelos, Mexico
| | - Magdalena Hernandez-Ortiz
- Functional Genomics of Prokaryotes Research Program, Center for Genomic Sciences, National Autonomous University of Mexico, Cuernavaca, Morelos, Mexico
| | - Sandra Contreras
- Functional Genomics of Prokaryotes Research Program, Center for Genomic Sciences, National Autonomous University of Mexico, Cuernavaca, Morelos, Mexico
| | - Erika Hernandez
- Undergraduate Program on Genomic Sciences, National Autonomous University of Mexico, Cuernavaca, Morelos, Mexico
| | - Sergio Encarnacion
- Functional Genomics of Prokaryotes Research Program, Center for Genomic Sciences, National Autonomous University of Mexico, Cuernavaca, Morelos, Mexico
- * E-mail:
| |
Collapse
|
42
|
Grinev VV, Ramanouskaya TV, Gloushen SV. Multidimensional control of cell structural robustness. Cell Biol Int 2013; 37:1023-37. [PMID: 23686647 DOI: 10.1002/cbin.10128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2013] [Accepted: 04/21/2013] [Indexed: 11/12/2022]
Abstract
Ample adaptive and functional opportunities of a living cell are determined by the complexity of its structural organisation. However, such complexity gives rise to a problem of maintenance of the coherence of inner processes in macroscopic interims and in macroscopic volumes which is necessary to support the structural robustness of a cell. The solution to this problem lies in multidimensional control of the adaptive and functional changes of a cell as well as its self-renewing processes in the context of environmental conditions. Six mechanisms (principles) form the basis of this multidimensional control: regulatory circuits with feedback loops, redundant inner diversity within a cell, multilevel distributed network organisation of a cell, molecular selection within a cell, continuous informational flows and functioning with a reserve of power. In the review we provide detailed analysis of these mechanisms, discuss their specific functions and the role of the superposition of these mechanisms in the maintenance of cell structural robustness in a wide range of environmental conditions.
Collapse
Affiliation(s)
- Vasily V Grinev
- Biology Faculty, Department of Genetics, Belarusian State University, 220030, Minsk, Belarus.
| | | | | |
Collapse
|
43
|
Langfelder P, Mischel PS, Horvath S. When is hub gene selection better than standard meta-analysis? PLoS One 2013; 8:e61505. [PMID: 23613865 PMCID: PMC3629234 DOI: 10.1371/journal.pone.0061505] [Citation(s) in RCA: 219] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2012] [Accepted: 03/12/2013] [Indexed: 01/15/2023] Open
Abstract
Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to gene expression data and presents novel R functions for carrying out consensus network analysis, network based screening, and meta analysis.
Collapse
Affiliation(s)
- Peter Langfelder
- Department of Human Genetics, University of California Los Angeles, Los Angeles, California, United States of America
| | - Paul S. Mischel
- Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, California, United States of America
| | - Steve Horvath
- Department of Human Genetics, University of California Los Angeles, Los Angeles, California, United States of America
- Departments of Human Genetics and Biostatistics, University of California Los Angeles, Los Angeles, California, United States of America
- * E-mail:
| |
Collapse
|
44
|
De la Fuente IM, Cortes JM, Pelta DA, Veguillas J. Attractor metabolic networks. PLoS One 2013; 8:e58284. [PMID: 23554883 PMCID: PMC3598861 DOI: 10.1371/journal.pone.0058284] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2012] [Accepted: 02/01/2013] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND The experimental observations and numerical studies with dissipative metabolic networks have shown that cellular enzymatic activity self-organizes spontaneously leading to the emergence of a Systemic Metabolic Structure in the cell, characterized by a set of different enzymatic reactions always locked into active states (metabolic core) while the rest of the catalytic processes are only intermittently active. This global metabolic structure was verified for Escherichia coli, Helicobacter pylori and Saccharomyces cerevisiae, and it seems to be a common key feature to all cellular organisms. In concordance with these observations, the cell can be considered a complex metabolic network which mainly integrates a large ensemble of self-organized multienzymatic complexes interconnected by substrate fluxes and regulatory signals, where multiple autonomous oscillatory and quasi-stationary catalytic patterns simultaneously emerge. The network adjusts the internal metabolic activities to the external change by means of flux plasticity and structural plasticity. METHODOLOGY/PRINCIPAL FINDINGS In order to research the systemic mechanisms involved in the regulation of the cellular enzymatic activity we have studied different catalytic activities of a dissipative metabolic network under different external stimuli. The emergent biochemical data have been analysed using statistical mechanic tools, studying some macroscopic properties such as the global information and the energy of the system. We have also obtained an equivalent Hopfield network using a Boltzmann machine. Our main result shows that the dissipative metabolic network can behave as an attractor metabolic network. CONCLUSIONS/SIGNIFICANCE We have found that the systemic enzymatic activities are governed by attractors with capacity to store functional metabolic patterns which can be correctly recovered from specific input stimuli. The network attractors regulate the catalytic patterns, modify the efficiency in the connection between the multienzymatic complexes, and stably retain these modifications. Here for the first time, we have introduced the general concept of attractor metabolic network, in which this dynamic behavior is observed.
Collapse
Affiliation(s)
- Ildefonso M De la Fuente
- Quantitative Biomedicine Unit, BioCruces Health Research Institute, Barakaldo, Basque Country, Spain.
| | | | | | | |
Collapse
|
45
|
A kinetic model of the evolution of a protein interaction network. BMC Genomics 2013; 14:172. [PMID: 23497092 PMCID: PMC3751699 DOI: 10.1186/1471-2164-14-172] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2012] [Accepted: 03/08/2013] [Indexed: 11/10/2022] Open
Abstract
Background Known protein interaction networks have very particular properties. Old proteins tend to have more interactions than new ones. One of the best statistical representatives of this property is the node degree distribution (distribution of proteins having a given number of interactions). It has previously been shown that this distribution is very close to the sum of two distinct exponential components. In this paper, we asked: What are the possible mechanisms of evolution for such types of networks? To answer this question, we tested a kinetic model for simplified evolution of a protein interactome. Our proposed model considers the emergence of new genes and interactions and the loss of old ones. We assumed that there are generally two coexisting classes of proteins. Proteins constituting the first class are essential only for ecological adaptations and are easily lost when ecological conditions change. Proteins of the second class are essential for basic life processes and, hence, are always effectively protected against deletion. All proteins can transit between the above classes in both directions. We also assumed that the phenomenon of gene duplication is always related to ecological adaptation and that a new copy of a duplicated gene is not essential. According to this model, all proteins gain new interactions with a rate that preferentially increases with the number of interactions (the rich get richer). Proteins can also gain interactions because of duplication. Proteins lose their interactions both with and without the loss of partner genes. Results The proposed model reproduces the main properties of protein-protein interaction networks very well. The connectivity of the oldest part of the interaction network is densest, and the node degree distribution follows the sum of two shifted power-law functions, which is a theoretical generalization of the previous finding. The above distribution covers the wide range of values of node degrees very well, much better than a power law or generalized power law supplemented with an exponential cut-off. The presented model also relates the total number of interactome links to the total number of interacting proteins. The theoretical results were for the interactomes of A. thaliana, B. taurus, C. elegans, D. melanogaster, E. coli, H. pylori, H. sapiens, M. musculus, R. norvegicus and S. cerevisiae. Conclusions Using these approaches, the kinetic parameters could be estimated. Finally, the model revealed the evolutionary kinetics of proteome formation, the phenomenon of protein differentiation and the process of gaining new interactions.
Collapse
|
46
|
Mukherjee S, Sambarey A, Prashanthi K, Chandra N. Current trends in modeling host–pathogen interactions. WIRES DATA MINING AND KNOWLEDGE DISCOVERY 2013; 3:109-128. [DOI: 10.1002/widm.1085] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Abstract
AbstractThe rapid emergence of infectious diseases calls for immediate attention to determine practical solutions for intervention strategies. To this end, it becomes necessary to obtain a holistic view of the complex host–pathogen interactome. Advances in omics and related technology have resulted in massive generation of data for the interacting systems at unprecedented levels of detail. Systems‐level studies with the aid of mathematical tools contribute to a deeper understanding of biological systems, where intuitive reasoning alone does not suffice. In this review, we discuss different aspects of host–pathogen interactions (HPIs) and the available data resources and tools used to study them. We discuss in detail models of HPIs at various levels of abstraction, along with their applications and limitations. We also enlist a few case studies, which incorporate different modeling approaches, providing significant insights into disease. © 2013 Wiley Periodicals, Inc.This article is categorized under:
Algorithmic Development > Biological Data Mining
Collapse
|
47
|
Horvath S, Zhang Y, Langfelder P, Kahn RS, Boks MPM, van Eijk K, van den Berg LH, Ophoff RA. Aging effects on DNA methylation modules in human brain and blood tissue. Genome Biol 2012; 13:R97. [PMID: 23034122 PMCID: PMC4053733 DOI: 10.1186/gb-2012-13-10-r97] [Citation(s) in RCA: 471] [Impact Index Per Article: 36.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2012] [Accepted: 10/03/2012] [Indexed: 11/30/2022] Open
Abstract
Background Several recent studies reported aging effects on DNA methylation levels of individual CpG dinucleotides. But it is not yet known whether aging-related consensus modules, in the form of clusters of correlated CpG markers, can be found that are present in multiple human tissues. Such a module could facilitate the understanding of aging effects on multiple tissues. Results We therefore employed weighted correlation network analysis of 2,442 Illumina DNA methylation arrays from brain and blood tissues, which enabled the identification of an age-related co-methylation module. Module preservation analysis confirmed that this module can also be found in diverse independent data sets. Biological evaluation showed that module membership is associated with Polycomb group target occupancy counts, CpG island status and autosomal chromosome location. Functional enrichment analysis revealed that the aging-related consensus module comprises genes that are involved in nervous system development, neuron differentiation and neurogenesis, and that it contains promoter CpGs of genes known to be down-regulated in early Alzheimer's disease. A comparison with a standard, non-module based meta-analysis revealed that selecting CpGs based on module membership leads to significantly increased gene ontology enrichment, thus demonstrating that studying aging effects via consensus network analysis enhances the biological insights gained. Conclusions Overall, our analysis revealed a robustly defined age-related co-methylation module that is present in multiple human tissues, including blood and brain. We conclude that blood is a promising surrogate for brain tissue when studying the effects of age on DNA methylation profiles.
Collapse
|
48
|
Oldham MC, Langfelder P, Horvath S. Network methods for describing sample relationships in genomic datasets: application to Huntington's disease. BMC SYSTEMS BIOLOGY 2012; 6:63. [PMID: 22691535 PMCID: PMC3441531 DOI: 10.1186/1752-0509-6-63] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2012] [Accepted: 05/03/2012] [Indexed: 01/08/2023]
Abstract
BACKGROUND Genomic datasets generated by new technologies are increasingly prevalent in disparate areas of biological research. While many studies have sought to characterize relationships among genomic features, commensurate efforts to characterize relationships among biological samples have been less common. Consequently, the full extent of sample variation in genomic studies is often under-appreciated, complicating downstream analytical tasks such as gene co-expression network analysis. RESULTS Here we demonstrate the use of network methods for characterizing sample relationships in microarray data generated from human brain tissue. We describe an approach for identifying outlying samples that does not depend on the choice or use of clustering algorithms. We introduce a battery of measures for quantifying the consistency and integrity of sample relationships, which can be compared across disparate studies, technology platforms, and biological systems. Among these measures, we provide evidence that the correlation between the connectivity and the clustering coefficient (two important network concepts) is a sensitive indicator of homogeneity among biological samples. We also show that this measure, which we refer to as cor(K,C), can distinguish biologically meaningful relationships among subgroups of samples. Specifically, we find that cor(K,C) reveals the profound effect of Huntington's disease on samples from the caudate nucleus relative to other brain regions. Furthermore, we find that this effect is concentrated in specific modules of genes that are naturally co-expressed in human caudate nucleus, highlighting a new strategy for exploring the effects of disease on sets of genes. CONCLUSIONS These results underscore the importance of systematically exploring sample relationships in large genomic datasets before seeking to analyze genomic feature activity. We introduce a standardized platform for this purpose using freely available R software that has been designed to enable iterative and interactive exploration of sample networks.
Collapse
Affiliation(s)
- Michael C Oldham
- Department of Neurology, The Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, University of California, San Francisco, USA.
| | | | | |
Collapse
|
49
|
Navid A. Applications of system-level models of metabolism for analysis of bacterial physiology and identification of new drug targets. Brief Funct Genomics 2012; 10:354-64. [PMID: 22199377 DOI: 10.1093/bfgp/elr034] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
For nearly all of the 20th century, biologists gained considerable insights into the fundamental principles of cellular dynamics by examining select modules of biochemical processes. This form of analysis provides detailed information about the workings of the examined pathways. However, any attempt to alter the normal function of bacteria (perhaps for industrial or medicinal goals) requires a detailed global understanding of cellular mechanisms. The reductionist mode of analysis cannot provide the required information for developing the needed perspective on the complex interactions of biochemical pathways. Thankfully, the increasing availability of microbial genomic, transcriptomic, proteomic and other high-throughput data permits system-level analyses of microbiology. During the past two decades, systems biologists have developed constraint-based genome-scale models (GSM) of metabolism for a variety of pathogens. These models are important tools for assessing the metabolic capabilities of various genotypes. Simultaneously, new computational methods have been developed that use these network reconstructions to answer an array of important immunological questions. The objective of this article is to briefly review some of the uses of GSMs for studying bacterial metabolism under different conditions and to discuss how the calculated solutions can be used for rational design of drugs.
Collapse
Affiliation(s)
- Ali Navid
- Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA 94551, USA.
| |
Collapse
|
50
|
Topological analysis and interactive visualization of biological networks and protein structures. Nat Protoc 2012; 7:670-85. [PMID: 22422314 DOI: 10.1038/nprot.2012.004] [Citation(s) in RCA: 332] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Computational analysis and interactive visualization of biological networks and protein structures are common tasks for gaining insight into biological processes. This protocol describes three workflows based on the NetworkAnalyzer and RINalyzer plug-ins for Cytoscape, a popular software platform for networks. NetworkAnalyzer has become a standard Cytoscape tool for comprehensive network topology analysis. In addition, RINalyzer provides methods for exploring residue interaction networks derived from protein structures. The first workflow uses NetworkAnalyzer to perform a topological analysis of biological networks. The second workflow applies RINalyzer to study protein structure and function and to compute network centrality measures. The third workflow combines NetworkAnalyzer and RINalyzer to compare residue networks. The full protocol can be completed in ∼2 h.
Collapse
|