651
|
Hu B, Yang YCT, Huang Y, Zhu Y, Lu ZJ. POSTAR: a platform for exploring post-transcriptional regulation coordinated by RNA-binding proteins. Nucleic Acids Res 2016; 45:D104-D114. [PMID: 28053162 PMCID: PMC5210617 DOI: 10.1093/nar/gkw888] [Citation(s) in RCA: 66] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Revised: 09/23/2016] [Accepted: 09/27/2016] [Indexed: 01/01/2023] Open
Abstract
We present POSTAR (http://POSTAR.ncrnalab.org), a resource of POST-trAnscriptional Regulation coordinated by RNA-binding proteins (RBPs). Precise characterization of post-transcriptional regulatory maps has accelerated dramatically in the past few years. Based on new studies and resources, POSTAR supplies the largest collection of experimentally probed (∼23 million) and computationally predicted (approximately 117 million) RBP binding sites in the human and mouse transcriptomes. POSTAR annotates every transcript and its RBP binding sites using extensive information regarding various molecular regulatory events (e.g., splicing, editing, and modification), RNA secondary structures, disease-associated variants, and gene expression and function. Moreover, POSTAR provides a friendly, multi-mode, integrated search interface, which helps users to connect multiple RBP binding sites with post-transcriptional regulatory events, phenotypes, and diseases. Based on our platform, we were able to obtain novel insights into post-transcriptional regulation, such as the putative association between CPSF6 binding, RNA structural domains, and Li-Fraumeni syndrome SNPs. In summary, POSTAR represents an early effort to systematically annotate post-transcriptional regulatory maps and explore the putative roles of RBPs in human diseases.
Collapse
Affiliation(s)
- Boqin Hu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Center for Plant Biology and Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Yu-Cheng T Yang
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Center for Plant Biology and Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China.,Department of Statistics, University of California Los Angeles, Los Angeles, CA 90095-1554, USA
| | - Yiming Huang
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Center for Plant Biology and Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Yumin Zhu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Center for Plant Biology and Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Zhi John Lu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Center for Plant Biology and Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China
| |
Collapse
|
652
|
Xia P, Zhang X, Xie Y, Guan M, Villeneuve DL, Yu H. Functional Toxicogenomic Assessment of Triclosan in Human HepG2 Cells Using Genome-Wide CRISPR-Cas9 Screening. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2016; 50:10682-10692. [PMID: 27459410 DOI: 10.1021/acs.est.6b02328] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
There are thousands of chemicals used by humans and detected in the environment for which limited or no toxicological data are available. Rapid and cost-effective approaches for assessing the toxicological properties of chemicals are needed. We used CRISPR-Cas9 functional genomic screening to identify the potential molecular mechanism of a widely used antimicrobial triclosan (TCS) in HepG2 cells. Resistant genes at IC50 (the concentration causing a 50% reduction in cell viability) were significantly enriched in the adherens junction pathway, MAPK signaling pathway, and PPAR signaling pathway, suggesting a potential role in the molecular mechanism of TCS-induced cytotoxicity. Evaluation of the top-ranked resistant genes, FTO (encoding an mRNA demethylase) and MAP2K3 (a MAP kinase kinase family gene), revealed that their loss conferred resistance to TCS. In contrast, sensitive genes at IC10 and IC20 were specifically enriched in pathways involved with immune responses, which was concordant with transcriptomic profiling of TCS at concentrations of <IC10. It is suggested that the CRISPR-Cas9 fingerprint may reveal the patterns of TCS toxicity at low concentration levels. Moreover, we retrieved the potential connection between CRISPR-Cas9 fingerprint and disease terms, obesity, and breast cancer from an existing chemical-gene-disease database. Overall, CRISPR-Cas9 functional genomic screening offers an alternative approach for chemical toxicity testing.
Collapse
Affiliation(s)
- Pu Xia
- State Key Laboratory of Pollution Control & Resource Reuse, School of the Environment, Nanjing University , Nanjing 210023, People's Republic of China
| | - Xiaowei Zhang
- State Key Laboratory of Pollution Control & Resource Reuse, School of the Environment, Nanjing University , Nanjing 210023, People's Republic of China
| | - Yuwei Xie
- State Key Laboratory of Pollution Control & Resource Reuse, School of the Environment, Nanjing University , Nanjing 210023, People's Republic of China
| | - Miao Guan
- State Key Laboratory of Pollution Control & Resource Reuse, School of the Environment, Nanjing University , Nanjing 210023, People's Republic of China
| | - Daniel L Villeneuve
- Mid-Continent Ecology Division, United States Environmental Protection Agency , Duluth, Minnesota 55804, United States
| | - Hongxia Yu
- State Key Laboratory of Pollution Control & Resource Reuse, School of the Environment, Nanjing University , Nanjing 210023, People's Republic of China
| |
Collapse
|
653
|
Li P, Nie Y, Yu J. Fusing literature and full network data improves disease similarity computation. BMC Bioinformatics 2016; 17:326. [PMID: 27578323 PMCID: PMC5006367 DOI: 10.1186/s12859-016-1205-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2016] [Accepted: 08/24/2016] [Indexed: 01/01/2023] Open
Abstract
Background Identifying relatedness among diseases could help deepen understanding for the underlying pathogenic mechanisms of diseases, and facilitate drug repositioning projects. A number of methods for computing disease similarity had been developed; however, none of them were designed to utilize information of the entire protein interaction network, using instead only those interactions involving disease causing genes. Most of previously published methods required gene-disease association data, unfortunately, many diseases still have very few or no associated genes, which impeded broad adoption of those methods. In this study, we propose a new method (MedNetSim) for computing disease similarity by integrating medical literature and protein interaction network. MedNetSim consists of a network-based method (NetSim), which employs the entire protein interaction network, and a MEDLINE-based method (MedSim), which computes disease similarity by mining the biomedical literature. Results Among function-based methods, NetSim achieved the best performance. Its average AUC (area under the receiver operating characteristic curve) reached 95.2 %. MedSim, whose performance was even comparable to some function-based methods, acquired the highest average AUC in all semantic-based methods. Integration of MedSim and NetSim (MedNetSim) further improved the average AUC to 96.4 %. We further studied the effectiveness of different data sources. It was found that quality of protein interaction data was more important than its volume. On the contrary, higher volume of gene-disease association data was more beneficial, even with a lower reliability. Utilizing higher volume of disease-related gene data further improved the average AUC of MedNetSim and NetSim to 97.5 % and 96.7 %, respectively. Conclusions Integrating biomedical literature and protein interaction network can be an effective way to compute disease similarity. Lacking sufficient disease-related gene data, literature-based methods such as MedSim can be a great addition to function-based algorithms. It may be beneficial to steer more resources torward studying gene-disease associations and improving the quality of protein interaction data. Disease similarities can be computed using the proposed methods at http://www.digintelli.com:8000/. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1205-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ping Li
- State Key Laboratory of Biochemical Engineering, Institute of Process Engineering, Chinese Academy of Sciences, Beijing, 100190, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yaling Nie
- State Key Laboratory of Biochemical Engineering, Institute of Process Engineering, Chinese Academy of Sciences, Beijing, 100190, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Jingkai Yu
- State Key Laboratory of Biochemical Engineering, Institute of Process Engineering, Chinese Academy of Sciences, Beijing, 100190, China.
| |
Collapse
|
654
|
Talevi A. Computational approaches for innovative antiepileptic drug discovery. Expert Opin Drug Discov 2016; 11:1001-16. [DOI: 10.1080/17460441.2016.1216965] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
655
|
In Search of 'Birth Month Genes': Using Existing Data Repositories to Locate Genes Underlying Birth Month-Disease Relationships. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2016; 2016:189-98. [PMID: 27570668 PMCID: PMC5001771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Prenatal and perinatal exposures vary seasonally (e.g., sunlight, allergens) and many diseases are linked with variance in exposure. Epidemiologists often measure these changes using birth month as a proxy for seasonal variance. Likewise, Genome-Wide Association Studies have associated or implicated these same diseases with many genes. Both disparate data types (epidemiological and genetic) can provide key insights into the underlying disease biology. We developed an algorithm that links 1) epidemiological data from birth month studies with 2) genetic data from published gene-disease association studies. Our framework uses existing data repositories - PubMed, DisGeNET and Gene Ontology - to produce a bipartite network that connects enriched seasonally varying biofactorss with birth month dependent diseases (BMDDs) through their overlapping developmental gene sets. As a proof-of-concept, we investigate 7 known BMDDs and highlight three important biological networks revealed by our algorithm and explore some interesting genetic mechanisms potentially responsible for the seasonal contribution to BMDDs.
Collapse
|
656
|
Plaisier CL, O'Brien S, Bernard B, Reynolds S, Simon Z, Toledo CM, Ding Y, Reiss DJ, Paddison PJ, Baliga NS. Causal Mechanistic Regulatory Network for Glioblastoma Deciphered Using Systems Genetics Network Analysis. Cell Syst 2016; 3:172-186. [PMID: 27426982 DOI: 10.1016/j.cels.2016.06.006] [Citation(s) in RCA: 83] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2016] [Revised: 04/20/2016] [Accepted: 06/09/2016] [Indexed: 12/30/2022]
Abstract
We developed the transcription factor (TF)-target gene database and the Systems Genetics Network Analysis (SYGNAL) pipeline to decipher transcriptional regulatory networks from multi-omic and clinical patient data, and we applied these tools to 422 patients with glioblastoma multiforme (GBM). The resulting gbmSYGNAL network predicted 112 somatically mutated genes or pathways that act through 74 TFs and 37 microRNAs (miRNAs) (67 not previously associated with GBM) to dysregulate 237 distinct co-regulated gene modules associated with patient survival or oncogenic processes. The regulatory predictions were associated to cancer phenotypes using CRISPR-Cas9 and small RNA perturbation studies and also demonstrated GBM specificity. Two pairwise combinations (ETV6-NFKB1 and romidepsin-miR-486-3p) predicted by the gbmSYGNAL network had synergistic anti-proliferative effects. Finally, the network revealed that mutations in NF1 and PIK3CA modulate IRF1-mediated regulation of MHC class I antigen processing and presentation genes to increase tumor lymphocyte infiltration and worsen prognosis. Importantly, SYGNAL is widely applicable for integrating genomic and transcriptomic measurements from other human cohorts.
Collapse
Affiliation(s)
| | - Sofie O'Brien
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109-5234, USA
| | - Brady Bernard
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109-5234, USA
| | - Sheila Reynolds
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109-5234, USA
| | - Zac Simon
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109-5234, USA
| | - Chad M Toledo
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Molecular and Cellular Biology Program, University of Washington, Seattle, WA 98195, USA
| | - Yu Ding
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - David J Reiss
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109-5234, USA
| | - Patrick J Paddison
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Molecular and Cellular Biology Program, University of Washington, Seattle, WA 98195, USA
| | - Nitin S Baliga
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109-5234, USA.
| |
Collapse
|
657
|
Using the Semantic Web for Rapid Integration of WikiPathways with Other Biological Online Data Resources. PLoS Comput Biol 2016; 12:e1004989. [PMID: 27336457 PMCID: PMC4918977 DOI: 10.1371/journal.pcbi.1004989] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Accepted: 05/17/2016] [Indexed: 12/21/2022] Open
Abstract
The diversity of online resources storing biological data in different formats provides a challenge for bioinformaticians to integrate and analyse their biological data. The semantic web provides a standard to facilitate knowledge integration using statements built as triples describing a relation between two objects. WikiPathways, an online collaborative pathway resource, is now available in the semantic web through a SPARQL endpoint at http://sparql.wikipathways.org. Having biological pathways in the semantic web allows rapid integration with data from other resources that contain information about elements present in pathways using SPARQL queries. In order to convert WikiPathways content into meaningful triples we developed two new vocabularies that capture the graphical representation and the pathway logic, respectively. Each gene, protein, and metabolite in a given pathway is defined with a standard set of identifiers to support linking to several other biological resources in the semantic web. WikiPathways triples were loaded into the Open PHACTS discovery platform and are available through its Web API (https://dev.openphacts.org/docs) to be used in various tools for drug development. We combined various semantic web resources with the newly converted WikiPathways content using a variety of SPARQL query types and third-party resources, such as the Open PHACTS API. The ability to use pathway information to form new links across diverse biological data highlights the utility of integrating WikiPathways in the semantic web.
Collapse
|
658
|
Purifying selection shapes the coincident SNP distribution of primate coding sequences. Sci Rep 2016; 6:27272. [PMID: 27255481 PMCID: PMC4891680 DOI: 10.1038/srep27272] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2016] [Accepted: 05/17/2016] [Indexed: 12/13/2022] Open
Abstract
Genome-wide analysis has observed an excess of coincident single nucleotide polymorphisms (coSNPs) at human-chimpanzee orthologous positions, and suggested that this is due to cryptic variation in the mutation rate. While this phenomenon primarily corresponds with non-coding coSNPs, the situation in coding sequences remains unclear. Here we calculate the observed-to-expected ratio of coSNPs (coSNPO/E) to estimate the prevalence of human-chimpanzee coSNPs, and show that the excess of coSNPs is also present in coding regions. Intriguingly, coSNPO/E is much higher at zero-fold than at nonzero-fold degenerate sites; such a difference is due to an elevation of coSNPO/E at zero-fold degenerate sites, rather than a reduction at nonzero-fold degenerate ones. These trends are independent of chimpanzee subpopulation, population size, or sequencing techniques; and hold in broad generality across primates. We find that this discrepancy cannot fully explained by sequence contexts, shared ancestral polymorphisms, SNP density, and recombination rate, and that coSNPO/E in coding sequences is significantly influenced by purifying selection. We also show that selection and mutation rate affect coSNPO/E independently, and coSNPs tend to be less damaging and more correlated with human diseases than non-coSNPs. These suggest that coSNPs may represent a “signature” during primate protein evolution.
Collapse
|
659
|
Digles D, Zdrazil B, Neefs JM, Van Vlijmen H, Herhaus C, Caracoti A, Brea J, Roibás B, Loza MI, Queralt-Rosinach N, Furlong LI, Gaulton A, Bartek L, Senger S, Chichester C, Engkvist O, Evelo CT, Franklin NI, Marren D, Ecker GF, Jacoby E. Open PHACTS computational protocols for in silico target validation of cellular phenotypic screens: knowing the knowns. MEDCHEMCOMM 2016; 7:1237-1244. [PMID: 27774140 PMCID: PMC5063042 DOI: 10.1039/c6md00065g] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/01/2016] [Accepted: 05/10/2016] [Indexed: 01/09/2023]
Abstract
Phenotypic screening is in a renaissance phase and is expected by many academic and industry leaders to accelerate the discovery of new drugs for new biology. Given that phenotypic screening is per definition target agnostic, the emphasis of in silico and in vitro follow-up work is on the exploration of possible molecular mechanisms and efficacy targets underlying the biological processes interrogated by the phenotypic screening experiments. Herein, we present six exemplar computational protocols for the interpretation of cellular phenotypic screens based on the integration of compound, target, pathway, and disease data established by the IMI Open PHACTS project. The protocols annotate phenotypic hit lists and allow follow-up experiments and mechanistic conclusions. The annotations included are from ChEMBL, ChEBI, GO, WikiPathways and DisGeNET. Also provided are protocols which select from the IUPHAR/BPS Guide to PHARMACOLOGY interaction file selective compounds to probe potential targets and a correlation robot which systematically aims to identify an overlap of active compounds in both the phenotypic as well as any kinase assay. The protocols are applied to a phenotypic pre-lamin A/C splicing assay selected from the ChEMBL database to illustrate the process. The computational protocols make use of the Open PHACTS API and data and are built within the Pipeline Pilot and KNIME workflow tools.
Collapse
Affiliation(s)
- D Digles
- Department of Pharmaceutical Chemistry , University of Vienna , Pharmacoinformatics Research Group , Althanstraße 14 , 1090 Wien , Austria .
| | - B Zdrazil
- Department of Pharmaceutical Chemistry , University of Vienna , Pharmacoinformatics Research Group , Althanstraße 14 , 1090 Wien , Austria .
| | - J-M Neefs
- Janssen Research & Development , Turnhoutseweg 30 , B-2340 Beerse , Belgium .
| | - H Van Vlijmen
- Janssen Research & Development , Turnhoutseweg 30 , B-2340 Beerse , Belgium .
| | - C Herhaus
- Merck KGaA, Merck Serono R&D , Computational Chemistry , Frankfurter Straße 250 , 64293 Darmstadt , Germany
| | - A Caracoti
- BIOVIA , a Dassault Systèmes brand , 334 Cambridge Science Park , Cambridge CB4 0WN , UK
| | - J Brea
- Grupo BioFarma-USEF , Departamento de Farmacología , Facultad de Farmacia , Campus Universitario Sur s/n , 15782 Santiago de Compostela , Spain
| | - B Roibás
- Grupo BioFarma-USEF , Departamento de Farmacología , Facultad de Farmacia , Campus Universitario Sur s/n , 15782 Santiago de Compostela , Spain
| | - M I Loza
- Grupo BioFarma-USEF , Departamento de Farmacología , Facultad de Farmacia , Campus Universitario Sur s/n , 15782 Santiago de Compostela , Spain
| | - N Queralt-Rosinach
- Research Programme on Biomedical Informatics (GRIB) , Hospital del Mar Medical Research Institute (IMIM) , Department of Experimental and Health Sciences , Universitat Pompeu Fabra , C/Dr Aiguader 88 , E-08003 Barcelona , Spain
| | - L I Furlong
- Research Programme on Biomedical Informatics (GRIB) , Hospital del Mar Medical Research Institute (IMIM) , Department of Experimental and Health Sciences , Universitat Pompeu Fabra , C/Dr Aiguader 88 , E-08003 Barcelona , Spain
| | - A Gaulton
- European Molecular Biology Laboratory , European Bioinformatics Institute (EMBL-EBI) , Wellcome Genome Campus , Hinxton , Cambridge CB10 1SD , UK
| | - L Bartek
- GlaxoSmithKline , Medicines Research Centre , Stevenage SG1 2NY , UK
| | - S Senger
- GlaxoSmithKline , Medicines Research Centre , Stevenage SG1 2NY , UK
| | - C Chichester
- Swiss Institute of Bioinformatics , CALIPHO Group , CMU Rue Michel-Servet 1 , 1211 Geneva 4 , Switzerland ; Nestlé Institute of Health Sciences SA , EPFL Innovation Park, Bâtiment H , 1015 Lausanne , Switzerland
| | - O Engkvist
- Chemistry Innovation Centre , Discovery Sciences , AstraZeneca R&D Gothenburg , SE-431 83 Mölndal , Sweden
| | - C T Evelo
- Department of Bioinformatics - BiGCaT , P.O. Box 616 , UNS50 Box19 , NL-6200MD Maastricht , The Netherlands
| | - N I Franklin
- Open Innovation Drug Discovery , Discovery Chemistry Eli Lilly and Company , Lilly Corporate Center , DC 1920 , Indianapolis , IN 46285 , USA
| | - D Marren
- Eli Lilly and Company Ltd. , Lilly Research Centre , Erl Wood Manor, Sunninghill Road , Windlesham , Surrey GU20 6PH , England , UK
| | - G F Ecker
- Department of Pharmaceutical Chemistry , University of Vienna , Pharmacoinformatics Research Group , Althanstraße 14 , 1090 Wien , Austria .
| | - E Jacoby
- Janssen Research & Development , Turnhoutseweg 30 , B-2340 Beerse , Belgium .
| |
Collapse
|
660
|
Yang J, Wu SJ, Yang SY, Peng JW, Wang SN, Wang FY, Song YX, Qi T, Li YX, Li YY. DNetDB: The human disease network database based on dysfunctional regulation mechanism. BMC SYSTEMS BIOLOGY 2016; 10:36. [PMID: 27209279 PMCID: PMC4875653 DOI: 10.1186/s12918-016-0280-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Accepted: 05/05/2016] [Indexed: 11/18/2022]
Abstract
Disease similarity study provides new insights into disease taxonomy, pathogenesis, which plays a guiding role in diagnosis and treatment. The early studies were limited to estimate disease similarities based on clinical manifestations, disease-related genes, medical vocabulary concepts or registry data, which were inevitably biased to well-studied diseases and offered small chance of discovering novel findings in disease relationships. In other words, genome-scale expression data give us another angle to address this problem since simultaneous measurement of the expression of thousands of genes allows for the exploration of gene transcriptional regulation, which is believed to be crucial to biological functions. Although differential expression analysis based methods have the potential to explore new disease relationships, it is difficult to unravel the upstream dysregulation mechanisms of diseases. We therefore estimated disease similarities based on gene expression data by using differential coexpression analysis, a recently emerging method, which has been proved to be more potential to capture dysfunctional regulation mechanisms than differential expression analysis. A total of 1,326 disease relationships among 108 diseases were identified, and the relevant information constituted the human disease network database (DNetDB). Benefiting from the use of differential coexpression analysis, the potential common dysfunctional regulation mechanisms shared by disease pairs (i.e. disease relationships) were extracted and presented. Statistical indicators, common disease-related genes and drugs shared by disease pairs were also included in DNetDB. In total, 1,326 disease relationships among 108 diseases, 5,598 pathways, 7,357 disease-related genes and 342 disease drugs are recorded in DNetDB, among which 3,762 genes and 148 drugs are shared by at least two diseases. DNetDB is the first database focusing on disease similarity from the viewpoint of gene regulation mechanism. It provides an easy-to-use web interface to search and browse the disease relationships and thus helps to systematically investigate etiology and pathogenesis, perform drug repositioning, and design novel therapeutic interventions. Database URL: http://app.scbit.org/DNetDB/#.
Collapse
Affiliation(s)
- Jing Yang
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China.,Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, P.R. China
| | - Su-Juan Wu
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China
| | - Shao-You Yang
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China.,Shanghai Industrial Technology Institute, 1278 Keyuan Road, Shanghai, 201203, P.R. China
| | - Jia-Wei Peng
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China.,Shanghai Industrial Technology Institute, 1278 Keyuan Road, Shanghai, 201203, P.R. China
| | - Shi-Nuo Wang
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China.,Shanghai Industrial Technology Institute, 1278 Keyuan Road, Shanghai, 201203, P.R. China
| | - Fu-Yan Wang
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China.,Shanghai Industrial Technology Institute, 1278 Keyuan Road, Shanghai, 201203, P.R. China
| | - Yu-Xing Song
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China.,Shanghai Industrial Technology Institute, 1278 Keyuan Road, Shanghai, 201203, P.R. China
| | - Ting Qi
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China.,Shanghai Industrial Technology Institute, 1278 Keyuan Road, Shanghai, 201203, P.R. China
| | - Yi-Xue Li
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China. .,Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, P.R. China. .,Shanghai Industrial Technology Institute, 1278 Keyuan Road, Shanghai, 201203, P.R. China. .,Shanghai Engineering Research Center of Pharmaceutical Translation, 1278 Keyuan Road, Shanghai, 201203, P.R. China.
| | - Yuan-Yuan Li
- Shanghai Center for Bioinformation Technology, Shanghai, 200235, P.R. China. .,Shanghai Industrial Technology Institute, 1278 Keyuan Road, Shanghai, 201203, P.R. China. .,Shanghai Engineering Research Center of Pharmaceutical Translation, 1278 Keyuan Road, Shanghai, 201203, P.R. China.
| |
Collapse
|
661
|
Breen MS, Uhlmann A, Nday CM, Glatt SJ, Mitt M, Metsalpu A, Stein DJ, Illing N. Candidate gene networks and blood biomarkers of methamphetamine-associated psychosis: an integrative RNA-sequencing report. Transl Psychiatry 2016; 6:e802. [PMID: 27163203 PMCID: PMC5070070 DOI: 10.1038/tp.2016.67] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Revised: 01/21/2016] [Accepted: 01/24/2016] [Indexed: 02/08/2023] Open
Abstract
The clinical presentation, course and treatment of methamphetamine (METH)-associated psychosis (MAP) are similar to that observed in schizophrenia (SCZ) and subsequently MAP has been hypothesized as a pharmacological and environmental model of SCZ. However, several challenges currently exist in diagnosing MAP accurately at the molecular and neurocognitive level before the MAP model can contribute to the discovery of SCZ biomarkers. We directly assessed subcortical brain structural volumes and clinical parameters of MAP within the framework of an integrative genome-wide RNA-Seq blood transcriptome analysis of subjects diagnosed with MAP (N=10), METH dependency without psychosis (MA; N=10) and healthy controls (N=10). First, we identified discrete groups of co-expressed genes (that is, modules) and tested them for functional annotation and phenotypic relationships to brain structure volumes, life events and psychometric measurements. We discovered one MAP-associated module involved in ubiquitin-mediated proteolysis downregulation, enriched with 61 genes previously found implicated in psychosis and SCZ across independent blood and post-mortem brain studies using convergent functional genomic (CFG) evidence. This module demonstrated significant relationships with brain structure volumes including the anterior corpus callosum (CC) and the nucleus accumbens. Furthermore, a second MAP and psychoticism-associated module involved in circadian clock upregulation was also enriched with 39 CFG genes, further associated with the CC. Subsequently, a machine-learning analysis of differentially expressed genes identified single blood-based biomarkers able to differentiate controls from methamphetamine dependents with 87% accuracy and MAP from MA subjects with 95% accuracy. CFG evidence validated a significant proportion of these putative MAP biomarkers in independent studies including CLN3, FBP1, TBC1D2 and ZNF821 (RNA degradation), ELK3 and SINA3 (circadian clock) and PIGF and UHMK1 (ubiquitin-mediated proteolysis). Finally, focusing analysis on brain structure volumes revealed significantly lower bilateral hippocampal volumes in MAP subjects. Overall, these results suggest similar molecular and neurocognitive mechanisms underlying the pathophysiology of psychosis and SCZ regardless of substance abuse and provide preliminary evidence supporting the MAP paradigm as an exemplar for SCZ biomarker discovery.
Collapse
Affiliation(s)
- M S Breen
- Department of Clinical and
Experimental Sciences, Faculty of Medicine, University of Southampton,
Southampton, UK
| | - A Uhlmann
- Department of Psychiatry and MRC Unit
on Anxiety and Stress Disorders, Groote Schuur Hospital (J-2), University of
Cape Town, Cape Town, South Africa
| | - C M Nday
- Department of Molecular and Cellular
Biology, University of Cape Town, Cape Town, South
Africa
| | - S J Glatt
- Psychiatric Genetic Epidemiology and
Neurobiology Laboratory, Departments of Psychiatry and Behavioral Sciences
and Neuroscience and Physiology, Medical Genetics Research Center, SUNY
Upstate Medical University, Syracuse, NY,
USA
| | - M Mitt
- The Estonian Genome Center,
University of Tartu, Tartu, Estonia
| | - A Metsalpu
- The Estonian Genome Center,
University of Tartu, Tartu, Estonia
| | - D J Stein
- Department of Psychiatry and MRC Unit
on Anxiety and Stress Disorders, Groote Schuur Hospital (J-2), University of
Cape Town, Cape Town, South Africa
| | - N Illing
- Department of Molecular and Cellular
Biology, University of Cape Town, Cape Town, South
Africa
| |
Collapse
|
662
|
Soliman M, Nasraoui O, Cooper NGF. Building a glaucoma interaction network using a text mining approach. BioData Min 2016; 9:17. [PMID: 27152122 PMCID: PMC4857381 DOI: 10.1186/s13040-016-0096-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Accepted: 04/23/2016] [Indexed: 11/21/2022] Open
Abstract
Background The volume of biomedical literature and its underlying knowledge base is rapidly expanding, making it beyond the ability of a single human being to read through all the literature. Several automated methods have been developed to help make sense of this dilemma. The present study reports on the results of a text mining approach to extract gene interactions from the data warehouse of published experimental results which are then used to benchmark an interaction network associated with glaucoma. To the best of our knowledge, there is, as yet, no glaucoma interaction network derived solely from text mining approaches. The presence of such a network could provide a useful summative knowledge base to complement other forms of clinical information related to this disease. Results A glaucoma corpus was constructed from PubMed Central and a text mining approach was applied to extract genes and their relations from this corpus. The extracted relations between genes were checked using reference interaction databases and classified generally as known or new relations. The extracted genes and relations were then used to construct a glaucoma interaction network. Analysis of the resulting network indicated that it bears the characteristics of a small world interaction network. Our analysis showed the presence of seven glaucoma linked genes that defined the network modularity. A web-based system for browsing and visualizing the extracted glaucoma related interaction networks is made available at http://neurogene.spd.louisville.edu/GlaucomaINViewer/Form1.aspx. Conclusions This study has reported the first version of a glaucoma interaction network using a text mining approach. The power of such an approach is in its ability to cover a wide range of glaucoma related studies published over many years. Hence, a bigger picture of the disease can be established. To the best of our knowledge, this is the first glaucoma interaction network to summarize the known literature. The major findings were a set of relations that could not be found in existing interaction databases and that were found to be new, in addition to a smaller subnetwork consisting of interconnected clusters of seven glaucoma genes. Future improvements can be applied towards obtaining a better version of this network. Electronic supplementary material The online version of this article (doi:10.1186/s13040-016-0096-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Maha Soliman
- Department of Anatomical Sciences and Neurobiology, University of Louisville, School of Medicine, Louisville, KY USA
| | - Olfa Nasraoui
- Knowledge Discovery & Web Mining Lab, Department of Computer Engineering & Computer Science, University of Louisville, J.B Speed School of Engineering, Louisville, KY USA
| | - Nigel G F Cooper
- Department of Anatomical Sciences and Neurobiology, University of Louisville, School of Medicine, Louisville, KY USA
| |
Collapse
|
663
|
Queralt-Rosinach N, Piñero J, Bravo À, Sanz F, Furlong LI. DisGeNET-RDF: harnessing the innovative power of the Semantic Web to explore the genetic basis of diseases. Bioinformatics 2016; 32:2236-8. [PMID: 27153650 PMCID: PMC4937199 DOI: 10.1093/bioinformatics/btw214] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2015] [Accepted: 04/14/2016] [Indexed: 11/13/2022] Open
Abstract
Motivation: DisGeNET-RDF makes available knowledge on the genetic basis of human diseases in the Semantic Web. Gene-disease associations (GDAs) and their provenance metadata are published as human-readable and machine-processable web resources. The information on GDAs included in DisGeNET-RDF is interlinked to other biomedical databases to support the development of bioinformatics approaches for translational research through evidence-based exploitation of a rich and fully interconnected linked open data. Availability and implementation:http://rdf.disgenet.org/ Contact:support@disgenet.org
Collapse
Affiliation(s)
- Núria Queralt-Rosinach
- Integrative Biomedical Informatics (IBI) Group, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), C/Doctor Aiguader 88, E-08003 Barcelona, Spain
| | - Janet Piñero
- Integrative Biomedical Informatics (IBI) Group, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), C/Doctor Aiguader 88, E-08003 Barcelona, Spain
| | - Àlex Bravo
- Integrative Biomedical Informatics (IBI) Group, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), C/Doctor Aiguader 88, E-08003 Barcelona, Spain
| | - Ferran Sanz
- Integrative Biomedical Informatics (IBI) Group, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), C/Doctor Aiguader 88, E-08003 Barcelona, Spain
| | - Laura I Furlong
- Integrative Biomedical Informatics (IBI) Group, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), C/Doctor Aiguader 88, E-08003 Barcelona, Spain
| |
Collapse
|
664
|
Piñero J, Berenstein A, Gonzalez-Perez A, Chernomoretz A, Furlong LI. Uncovering disease mechanisms through network biology in the era of Next Generation Sequencing. Sci Rep 2016; 6:24570. [PMID: 27080396 PMCID: PMC4832203 DOI: 10.1038/srep24570] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2015] [Accepted: 03/31/2016] [Indexed: 12/25/2022] Open
Abstract
Characterizing the behavior of disease genes in the context of biological networks has the potential to shed light on disease mechanisms, and to reveal both new candidate disease genes and therapeutic targets. Previous studies addressing the network properties of disease genes have produced contradictory results. Here we have explored the causes of these discrepancies and assessed the relationship between the network roles of disease genes and their tolerance to deleterious germline variants in human populations leveraging on: the abundance of interactome resources, a comprehensive catalog of disease genes and exome variation data. We found that the most salient network features of disease genes are driven by cancer genes and that genes related to different types of diseases play network roles whose centrality is inversely correlated to their tolerance to likely deleterious germline mutations. This proved to be a multiscale signature, including global, mesoscopic and local network centrality features. Cancer driver genes, the most sensitive to deleterious variants, occupy the most central positions, followed by dominant disease genes and then by recessive disease genes, which are tolerant to variants and isolated within their network modules.
Collapse
Affiliation(s)
- Janet Piñero
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), DCEXS, Pompeu Fabra University (UPF). C/Dr. Aiguader, 88. 08003- Barcelona, Spain
| | - Ariel Berenstein
- Departamento de Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires. Pabellón 1, Ciudad Universitaria, Buenos Aires, Argentina.,Instituto de Física de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas. Pabellón 1, Ciudad Universitaria, Buenos Aires, Argentina
| | - Abel Gonzalez-Perez
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), DCEXS, Pompeu Fabra University (UPF). C/Dr. Aiguader, 88. 08003- Barcelona, Spain
| | - Ariel Chernomoretz
- Departamento de Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires. Pabellón 1, Ciudad Universitaria, Buenos Aires, Argentina.,Instituto de Física de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas. Pabellón 1, Ciudad Universitaria, Buenos Aires, Argentina.,Laboratorio de Biología de Sistemas Integrativa, Fundación Instituto Leloir, Buenos Aires, Argentina
| | - Laura I Furlong
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), DCEXS, Pompeu Fabra University (UPF). C/Dr. Aiguader, 88. 08003- Barcelona, Spain
| |
Collapse
|
665
|
Gompert Z. A Continuous Correlated Beta Process Model for Genetic Ancestry in Admixed Populations. PLoS One 2016; 11:e0151047. [PMID: 26966908 PMCID: PMC4788345 DOI: 10.1371/journal.pone.0151047] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Accepted: 02/23/2016] [Indexed: 11/24/2022] Open
Abstract
Admixture and recombination create populations and genomes with genetic ancestry from multiple source populations. Analyses of genetic ancestry in admixed populations are relevant for trait and disease mapping, studies of speciation, and conservation efforts. Consequently, many methods have been developed to infer genome-average ancestry and to deconvolute ancestry into continuous local ancestry blocks or tracts within individuals. Current methods for local ancestry inference perform well when admixture occurred recently or hybridization is ongoing, or when admixture occurred in the distant past such that local ancestry blocks have fixed in the admixed population. However, methods to infer local ancestry frequencies in isolated admixed populations still segregating for ancestry do not exist. In the current paper, I develop and test a continuous correlated beta process model to fill this analytical gap. The method explicitly models autocorrelations in ancestry frequencies at the population-level and uses discriminant analysis of SNP windows to take advantage of ancestry blocks within individuals. Analyses of simulated data sets show that the method is generally accurate such that ancestry frequency estimates exhibited low root-mean-square error and were highly correlated with the true values, particularly when large (±10 or ±20) SNP windows were used. Along these lines, the proposed method outperformed post hoc inference of ancestry frequencies from a traditional hidden Markov model (i.e., the linkage model in structure), particularly when admixture occurred more distantly in the past with little on-going gene flow or was followed by natural selection. The reliability and utility of the method was further assessed by analyzing genetic ancestry in an admixed human population (Uyghur) and three populations from a hybrid zone between Mus domesticus and M. musculus. Considerable variation in ancestry frequencies was detected within and among chromosomes in the Uyghur, with a large region of excess French ancestry harboring a gene with a known disease association. Similar variation was detected in the mouse hybrid zone, with notable constancy in regions of excess ancestry among admixed populations. By filling what has been an analytical gap, the proposed method should be a useful tool for many biologists. A computer program (popanc), written in C++, has been developed based on the proposed method and is available on-line at http://sourceforge.net/projects/popanc/.
Collapse
Affiliation(s)
- Zachariah Gompert
- Department of Biology, Utah State University, Logan, UT, United States of America
- * E-mail:
| |
Collapse
|
666
|
Berto S, Perdomo-Sabogal A, Gerighausen D, Qin J, Nowick K. A Consensus Network of Gene Regulatory Factors in the Human Frontal Lobe. Front Genet 2016; 7:31. [PMID: 27014338 PMCID: PMC4782181 DOI: 10.3389/fgene.2016.00031] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Accepted: 02/18/2016] [Indexed: 01/29/2023] Open
Abstract
Cognitive abilities, such as memory, learning, language, problem solving, and planning, involve the frontal lobe and other brain areas. Not much is known yet about the molecular basis of cognitive abilities, but it seems clear that cognitive abilities are determined by the interplay of many genes. One approach for analyzing the genetic networks involved in cognitive functions is to study the coexpression networks of genes with known importance for proper cognitive functions, such as genes that have been associated with cognitive disorders like intellectual disability (ID) or autism spectrum disorders (ASD). Because many of these genes are gene regulatory factors (GRFs) we aimed to provide insights into the gene regulatory networks active in the human frontal lobe. Using genome wide human frontal lobe expression data from 10 independent data sets, we first derived 10 individual coexpression networks for all GRFs including their potential target genes. We observed a high level of variability among these 10 independently derived networks, pointing out that relying on results from a single study can only provide limited biological insights. To instead focus on the most confident information from these 10 networks we developed a method for integrating such independently derived networks into a consensus network. This consensus network revealed robust GRF interactions that are conserved across the frontal lobes of different healthy human individuals. Within this network, we detected a strong central module that is enriched for 166 GRFs known to be involved in brain development and/or cognitive disorders. Interestingly, several hubs of the consensus network encode for GRFs that have not yet been associated with brain functions. Their central role in the network suggests them as excellent new candidates for playing an essential role in the regulatory network of the human frontal lobe, which should be investigated in future studies.
Collapse
Affiliation(s)
- Stefano Berto
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University LeipzigLeipzig, Germany; Paul-Flechsig Institute for Brain Research, University of LeipzigLeipzig, Germany; Department of Neuroscience, University of Texas Southwestern Medical CenterDallas, TX, USA
| | - Alvaro Perdomo-Sabogal
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University Leipzig Leipzig, Germany
| | - Daniel Gerighausen
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University Leipzig Leipzig, Germany
| | - Jing Qin
- Department of Mathematics and Computer Sciences, University of Southern DenmarkOdense, Denmark; Institute for Theoretical Chemistry, University of ViennaVienna, Austria
| | - Katja Nowick
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University LeipzigLeipzig, Germany; Paul-Flechsig Institute for Brain Research, University of LeipzigLeipzig, Germany
| |
Collapse
|
667
|
Vafaee F. Using Multi-objective Optimization to Identify Dynamical Network Biomarkers as Early-warning Signals of Complex Diseases. Sci Rep 2016; 6:22023. [PMID: 26906975 PMCID: PMC4764930 DOI: 10.1038/srep22023] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Accepted: 02/03/2016] [Indexed: 12/31/2022] Open
Abstract
Biomarkers have gained immense scientific interest and clinical value in the practice of medicine. With unprecedented advances in high-throughput technologies, research interest in identifying novel and customized disease biomarkers for early detection, diagnosis, or drug responses is rapidly growing. Biomarkers can be identified in different levels of molecular biomarkers, networks biomarkers and dynamical network biomarkers (DNBs). The latter is a recently developed concept which relies on the idea that a cell is a complex system whose behavior is emerged from interplay of various molecules, and this network of molecules dynamically changes over time. A DNB can serve as an early-warning signal of disease progression, or as a leading network that drives the system into the disease state, and thus unravels mechanisms of disease initiation and progression. It is therefore of great importance to identify DNBs efficiently and reliably. In this work, the problem of DNB identification is defined as a multi-objective optimization problem, and a framework to identify DNBs out of time-course high-throughput data is proposed. Temporal gene expression data of a lung injury with carbonyl chloride inhalation exposure has been used as a case study, and the functional role of the discovered biomarker in the pathogenesis of lung injury has been thoroughly analyzed.
Collapse
Affiliation(s)
- Fatemeh Vafaee
- Charles Perkins Centre, University of Sydney, Sydney, Australia
- School of Mathematics and Statistics, University of Sydney, Sydney, Australia
| |
Collapse
|
668
|
Chagoyen M, Pazos F. Characterization of clinical signs in the human interactome. Bioinformatics 2016; 32:1761-5. [PMID: 26861820 DOI: 10.1093/bioinformatics/btw054] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Accepted: 01/22/2016] [Indexed: 01/05/2023] Open
Abstract
MOTIVATION Many diseases are related by shared associated molecules and pathways, exhibiting comorbidities and common phenotypes, an indication of the continuous nature of the human pathological landscape. Although it is continuous, this landscape is always partitioned into discrete diseases when studied at the molecular level. Clinical signs are also important phenotypic descriptors that can reveal the molecular mechanisms that underlie pathological states, but have seldom been the subject of systemic research. Here, we quantify the modular nature of the clinical signs associated with genetic diseases in the human interactome. RESULTS We found that clinical signs are reflected as modules at the molecular network level, to at least to the same extent as diseases. They can thus serve as a valid complementary partition of the human pathological landscape, with implications for etiology research, diagnosis and treatment. CONTACT monica.chagoyen@cnb.csic.es SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Monica Chagoyen
- Computational Systems Biology Group, National Centre for Biotechnology (CNB-CSIC), Darwin 3, Madrid 28049, Spain
| | - Florencio Pazos
- Computational Systems Biology Group, National Centre for Biotechnology (CNB-CSIC), Darwin 3, Madrid 28049, Spain
| |
Collapse
|
669
|
dbAARD & AGP: A computational pipeline for the prediction of genes associated with age related disorders. J Biomed Inform 2016; 60:153-61. [PMID: 26836976 DOI: 10.1016/j.jbi.2016.01.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2015] [Revised: 11/11/2015] [Accepted: 01/12/2016] [Indexed: 01/01/2023]
Abstract
The atrocious behavioral and physiological shift with aging accelerate occurrence of deleterious disorders. Contemporary research is focused at uncovering the role of genetic associations in age-related disorders (ARDs). While the completion of the Human Genome Project and the HapMap project has generated huge amount of data on genetic variations; Genome-Wide Association Studies (GWAS) have identified genetic variations, essentially SNPs associated with several disorders including ARDs. However, a repository that houses all such ARD associations is lacking. The present work is aimed at filling this void. A database, dbAARD (database of Aging and Age Related Disorders) has been developed which hosts information on more than 3000 genetic variations significantly (p-value <0.05) associated with 51 ARDs. Furthermore, a machine learning based gene prediction tool AGP (Age Related Disorders Gene Prediction) has been constructed by employing rotation forest algorithm, to prioritize genes associated with ARDs. The tool achieved an overall accuracy in terms of precision 75%, recall 76%, F-measure 76% and AUC 0.85. Both the web resources have been made available online at http://genomeinformatics.dce.edu/dbAARD/ and http://genomeinformatics.dce.edu/AGP/ respectively for easy retrieval and usage by the scientific community. We believe that this work may facilitate the analysis of plethora of variants associated with ARDs and provide cues for deciphering the biology of aging.
Collapse
|
670
|
The Importance of Drug Repurposing in the Field of Antiepileptic Drug Development. METHODS IN PHARMACOLOGY AND TOXICOLOGY 2016. [DOI: 10.1007/978-1-4939-6355-3_19] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
671
|
Joseph S, Barai RS, Bhujbalrao R, Idicula-Thomas S. PCOSKB: A KnowledgeBase on genes, diseases, ontology terms and biochemical pathways associated with PolyCystic Ovary Syndrome. Nucleic Acids Res 2015; 44:D1032-5. [PMID: 26578565 PMCID: PMC4702829 DOI: 10.1093/nar/gkv1146] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2015] [Accepted: 10/19/2015] [Indexed: 12/22/2022] Open
Abstract
Polycystic ovary syndrome (PCOS) is one of the major causes of female subfertility worldwide and ≈7–10% of women in reproductive age are affected by it. The affected individuals exhibit varying types and levels of comorbid conditions, along with the classical PCOS symptoms. Extensive studies on PCOS across diverse ethnic populations have resulted in a plethora of information on dysregulated genes, gene polymorphisms and diseases linked to PCOS. However, efforts have not been taken to collate and link these data. Our group, for the first time, has compiled PCOS-related information available through scientific literature; cross-linked it with molecular, biochemical and clinical databases and presented it as a user-friendly, web-based online knowledgebase for the benefit of the scientific and clinical community. Manually curated information on associated genes, single nucleotide polymorphisms, diseases, gene ontology terms and pathways along with supporting reference literature has been collated and included in PCOSKB (http://pcoskb.bicnirrh.res.in).
Collapse
Affiliation(s)
- Shaini Joseph
- Biomedical Informatics Center of Indian Council of Medical Research, National Institute for Research in Reproductive Health, Mumbai-400012, India
| | - Ram Shankar Barai
- Biomedical Informatics Center of Indian Council of Medical Research, National Institute for Research in Reproductive Health, Mumbai-400012, India
| | | | - Susan Idicula-Thomas
- Biomedical Informatics Center of Indian Council of Medical Research, National Institute for Research in Reproductive Health, Mumbai-400012, India
| |
Collapse
|
672
|
Guan X, Yi Y, Huang Y, Hu Y, Li X, Wang X, Fan H, Wang G, Wang D. Revealing potential molecular targets bridging colitis and colorectal cancer based on multidimensional integration strategy. Oncotarget 2015; 6:37600-12. [PMID: 26461477 PMCID: PMC4741951 DOI: 10.18632/oncotarget.6067] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2015] [Accepted: 09/24/2015] [Indexed: 02/05/2023] Open
Abstract
Chronic inflammation may play a vital role in the pathogenesis of inflammation-associated tumors. However, the underlying mechanisms bridging ulcerative colitis (UC) and colorectal cancer (CRC) remain unclear. Here, we integrated multidimensional interaction resources, including gene expression profiling, protein-protein interactions (PPIs), transcriptional and post-transcriptional regulation data, and virus-host interactions, to tentatively explore potential molecular targets that functionally link UC and CRC at a systematic level. In this work, by deciphering the overlapping genes, crosstalking genes and pivotal regulators of both UC- and CRC-associated functional module pairs, we revealed a variety of genes (including FOS and DUSP1, etc.), transcription factors (including SMAD3 and ETS1, etc.) and miRNAs (including miR-155 and miR-196b, etc.) that may have the potential to complete the connections between UC and CRC. Interestingly, further analyses of the virus-host interaction network demonstrated that several virus proteins (including EBNA-LP of EBV and protein E7 of HPV) frequently inter-connected to UC- and CRC-associated module pairs with their validated targets significantly enriched in both modules of the host. Together, our results suggested that multidimensional integration strategy provides a novel approach to discover potential molecular targets that bridge the connections between UC and CRC, which could also be extensively applied to studies on other inflammation-related cancers.
Collapse
Affiliation(s)
- Xu Guan
- Department of Colorectal Cancer Surgery, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Ying Yi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yan Huang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yongfei Hu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Xiaobo Li
- Department of Pathology, Harbin Medical University, Harbin, China
| | - Xishan Wang
- Department of Colorectal Cancer Surgery, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Huihui Fan
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Guiyu Wang
- Department of Colorectal Cancer Surgery, the Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Dong Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- Department of Biochemistry and Molecular Biology, Shantou University Medical College, Shantou, China
| |
Collapse
|
673
|
Abascal MF, Besso MJ, Rosso M, Mencucci MV, Aparicio E, Szapiro G, Furlong LI, Vazquez-Levin MH. CDH1/E-cadherin and solid tumors. An updated gene-disease association analysis using bioinformatics tools. Comput Biol Chem 2015; 60:9-20. [PMID: 26674224 DOI: 10.1016/j.compbiolchem.2015.10.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2015] [Revised: 10/17/2015] [Accepted: 10/19/2015] [Indexed: 12/13/2022]
Abstract
Cancer is a group of diseases that causes millions of deaths worldwide. Among cancers, Solid Tumors (ST) stand-out due to their high incidence and mortality rates. Disruption of cell-cell adhesion is highly relevant during tumor progression. Epithelial-cadherin (protein: E-cadherin, gene: CDH1) is a key molecule in cell-cell adhesion and an abnormal expression or/and function(s) contributes to tumor progression and is altered in ST. A systematic study was carried out to gather and summarize current knowledge on CDH1/E-cadherin and ST using bioinformatics resources. The DisGeNET database was exploited to survey CDH1-associated diseases. Reported mutations in specific ST were obtained by interrogating COSMIC and IntOGen tools. CDH1 Single Nucleotide Polymorphisms (SNP) were retrieved from the dbSNP database. DisGeNET analysis identified 609 genes annotated to ST, among which CDH1 was listed. Using CDH1 as query term, 26 disease concepts were found, 21 of which were neoplasms-related terms. Using DisGeNET ALL Databases, 172 disease concepts were identified. Of those, 80 ST disease-related terms were subjected to manual curation and 75/80 (93.75%) associations were validated. On selected ST, 489 CDH1 somatic mutations were listed in COSMIC and IntOGen databases. Breast neoplasms had the highest CDH1-mutation rate. CDH1 was positioned among the 20 genes with highest mutation frequency and was confirmed as driver gene in breast cancer. Over 14,000 SNP for CDH1 were found in the dbSNP database. This report used DisGeNET to gather/compile current knowledge on gene-disease association for CDH1/E-cadherin and ST; data curation expanded the number of terms that relate them. An updated list of CDH1 somatic mutations was obtained with COSMIC and IntOGen databases and of SNP from dbSNP. This information can be used to further understand the role of CDH1/E-cadherin in health and disease.
Collapse
Affiliation(s)
- María Florencia Abascal
- Laboratory of Cell-Cell Interaction in Cancer and Reproduction, Instituto de Biología & Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Fundación IBYME (FIBYME), Vuelta de Obligado 2490, Zip Code C1428ADN, Buenos Aires, Argentina.
| | - María José Besso
- Laboratory of Cell-Cell Interaction in Cancer and Reproduction, Instituto de Biología & Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Fundación IBYME (FIBYME), Vuelta de Obligado 2490, Zip Code C1428ADN, Buenos Aires, Argentina.
| | - Marina Rosso
- Laboratory of Cell-Cell Interaction in Cancer and Reproduction, Instituto de Biología & Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Fundación IBYME (FIBYME), Vuelta de Obligado 2490, Zip Code C1428ADN, Buenos Aires, Argentina.
| | - María Victoria Mencucci
- Laboratory of Cell-Cell Interaction in Cancer and Reproduction, Instituto de Biología & Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Fundación IBYME (FIBYME), Vuelta de Obligado 2490, Zip Code C1428ADN, Buenos Aires, Argentina.
| | - Evangelina Aparicio
- Laboratory of Cell-Cell Interaction in Cancer and Reproduction, Instituto de Biología & Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Fundación IBYME (FIBYME), Vuelta de Obligado 2490, Zip Code C1428ADN, Buenos Aires, Argentina.
| | - Gala Szapiro
- Laboratory of Cell-Cell Interaction in Cancer and Reproduction, Instituto de Biología & Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Fundación IBYME (FIBYME), Vuelta de Obligado 2490, Zip Code C1428ADN, Buenos Aires, Argentina.
| | - Laura Inés Furlong
- Research Programme on Biomedical Informatics (GRIB) (IMIM), DCEXS, Universitat Pompeu Fabra, C/Dr Aiguader 88, Zip Code 08003, Barcelona, Spain.
| | - Mónica Hebe Vazquez-Levin
- Laboratory of Cell-Cell Interaction in Cancer and Reproduction, Instituto de Biología & Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Fundación IBYME (FIBYME), Vuelta de Obligado 2490, Zip Code C1428ADN, Buenos Aires, Argentina; Laboratory of Cell-Cell Interaction in Cancer and Reproduction, Instituto de Biología y Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Fundación IBYME (FIBYME), Vuelta de Obligado 2490, Zip Code C1428ADN, Buenos Aires, Argentina.
| |
Collapse
|
674
|
A functional module-based exploration between inflammation and cancer in esophagus. Sci Rep 2015; 5:15340. [PMID: 26489668 PMCID: PMC4614801 DOI: 10.1038/srep15340] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Accepted: 09/23/2015] [Indexed: 12/26/2022] Open
Abstract
Inflammation contributing to the underlying progression of diverse human cancers has been generally appreciated, however, explorations into the molecular links between inflammation and cancer in esophagus are still at its early stage. In our study, we presented a functional module-based approach, in combination with multiple data resource (gene expression, protein-protein interactions (PPI), transcriptional and post-transcriptional regulations) to decipher the underlying links. Via mapping differentially expressed disease genes, functional disease modules were identified. As indicated, those common genes and interactions tended to play important roles in linking inflammation and cancer. Based on crosstalk analysis, we demonstrated that, although most disease genes were not shared by both kinds of modules, they might act through participating in the same or similar functions to complete the molecular links. Additionally, we applied pivot analysis to extract significant regulators for per significant crosstalk module pair. As shown, pivot regulators might manipulate vital parts of the module subnetworks, and then work together to bridge inflammation and cancer in esophagus. Collectively, based on our functional module analysis, we demonstrated that shared genes or interactions, significant crosstalk modules, and those significant pivot regulators were served as different functional parts underlying the molecular links between inflammation and cancer in esophagus.
Collapse
|
675
|
Kutmon M, Riutta A, Nunes N, Hanspers K, Willighagen EL, Bohler A, Mélius J, Waagmeester A, Sinha SR, Miller R, Coort SL, Cirillo E, Smeets B, Evelo CT, Pico AR. WikiPathways: capturing the full diversity of pathway knowledge. Nucleic Acids Res 2015; 44:D488-94. [PMID: 26481357 PMCID: PMC4702772 DOI: 10.1093/nar/gkv1024] [Citation(s) in RCA: 310] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2015] [Accepted: 09/28/2015] [Indexed: 12/19/2022] Open
Abstract
WikiPathways (http://www.wikipathways.org) is an open, collaborative platform for capturing and disseminating models of biological pathways for data visualization and analysis. Since our last NAR update, 4 years ago, WikiPathways has experienced massive growth in content, which continues to be contributed by hundreds of individuals each year. New aspects of the diversity and depth of the collected pathways are described from the perspective of researchers interested in using pathway information in their studies. We provide updates on extensions and services to support pathway analysis and visualization via popular standalone tools, i.e. PathVisio and Cytoscape, web applications and common programming environments. We introduce the Quick Edit feature for pathway authors and curators, in addition to new means of publishing pathways and maintaining custom pathway collections to serve specific research topics and communities. In addition to the latest milestones in our pathway collection and curation effort, we also highlight the latest means to access the content as publishable figures, as standard data files, and as linked data, including bulk and programmatic access.
Collapse
Affiliation(s)
- Martina Kutmon
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | - Anders Riutta
- Gladstone Institutes, San Francisco, California, CA 94158, USA
| | - Nuno Nunes
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | | | - Egon L Willighagen
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | - Anwesha Bohler
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | - Jonathan Mélius
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | - Andra Waagmeester
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands Micelio, Antwerp, 2180 Antwerp, Belgium
| | - Sravanthi R Sinha
- Keshav Memorial Institute of Technology, Hyderabad, Telangana 500029, India
| | - Ryan Miller
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | - Susan L Coort
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | - Elisa Cirillo
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | - Bart Smeets
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | - Chris T Evelo
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| | - Alexander R Pico
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6229 ER Maastricht, The Netherlands
| |
Collapse
|
676
|
Profiling RNA editing in human tissues: towards the inosinome Atlas. Sci Rep 2015; 5:14941. [PMID: 26449202 PMCID: PMC4598827 DOI: 10.1038/srep14941] [Citation(s) in RCA: 163] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 09/09/2015] [Indexed: 12/26/2022] Open
Abstract
Adenine to Inosine RNA editing is a widespread co- and post-transcriptional mechanism mediated by ADAR enzymes acting on double stranded RNA. It has a plethora of biological effects, appears to be particularly pervasive in humans with respect to other mammals, and is implicated in a number of diverse human pathologies. Here we present the first human inosinome atlas comprising 3,041,422 A-to-I events identified in six tissues from three healthy individuals. Matched directional total-RNA-Seq and whole genome sequence datasets were generated and analysed within a dedicated computational framework, also capable of detecting hyper-edited reads. Inosinome profiles are tissue specific and edited gene sets consistently show enrichment of genes involved in neurological disorders and cancer. Overall frequency of editing also varies, but is strongly correlated with ADAR expression levels. The inosinome database is available at: http://srv00.ibbe.cnr.it/editing/.
Collapse
|
677
|
Faner R, Gutiérrez-Sacristán A, Castro-Acosta A, Grosdidier S, Gan W, Sánchez-Mayor M, Lopez-Campos JL, Pozo-Rodriguez F, Sanz F, Mannino D, Furlong LI, Agusti A. Molecular and clinical diseasome of comorbidities in exacerbated COPD patients. Eur Respir J 2015; 46:1001-10. [PMID: 26250499 DOI: 10.1183/13993003.00763-2015] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2015] [Accepted: 06/11/2015] [Indexed: 01/05/2023]
Abstract
The frequent occurrence of comorbidities in patients with chronic obstructive pulmonary disease (COPD) suggests that they may share pathobiological processes and/or risk factors.To explore these possibilities we compared the clinical diseasome and the molecular diseasome of 5447 COPD patients hospitalised because of an exacerbation of the disease. The clinical diseasome is a network representation of the relationships between diseases, in which diseases are connected if they co-occur more than expected at random; in the molecular diseasome, diseases are linked if they share associated genes or interaction between proteins.The results showed that about half of the disease pairs identified in the clinical diseasome had a biological counterpart in the molecular diseasome, particularly those related to inflammation and vascular tone regulation. Interestingly, the clinical diseasome of these patients appears independent of age, cumulative smoking exposure or severity of airflow limitation.These results support the existence of shared molecular mechanisms among comorbidities in COPD.
Collapse
Affiliation(s)
- Rosa Faner
- Fundació Privada Clinic per a la Recerca Biomèdica, Barcelona, Spain CIBER Enfermedades Respiratorias (CIBERES), Spain Co-primary authors
| | - Alba Gutiérrez-Sacristán
- Integrative Biomedical Informatics Group, Research Program on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Universitat Pompeu Fabra, Barcelona, Spain Co-primary authors
| | - Ady Castro-Acosta
- CIBER Enfermedades Respiratorias (CIBERES), Spain Instituto de Investigación, Hospital 12 de Octubre, Madrid, Spain
| | - Solène Grosdidier
- Integrative Biomedical Informatics Group, Research Program on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Universitat Pompeu Fabra, Barcelona, Spain
| | - Wenqi Gan
- Dept of Preventive Medicine and Environmental Health, University of Kentucky College of Public Health, Lexington, KY, USA
| | - Milagros Sánchez-Mayor
- Integrative Biomedical Informatics Group, Research Program on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Universitat Pompeu Fabra, Barcelona, Spain
| | - Jose Luis Lopez-Campos
- CIBER Enfermedades Respiratorias (CIBERES), Spain Unidad Médico-Quirúrgica de Enfermedades Respiratorias, Instituto de Biomedicina de Sevilla (IBiS), Hospital Universitario Virgen del Rocio/Universidad de Sevilla, Sevilla, Spain
| | - Francisco Pozo-Rodriguez
- CIBER Enfermedades Respiratorias (CIBERES), Spain Instituto de Investigación, Hospital 12 de Octubre, Madrid, Spain
| | - Ferran Sanz
- Integrative Biomedical Informatics Group, Research Program on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Universitat Pompeu Fabra, Barcelona, Spain
| | - David Mannino
- Dept of Preventive Medicine and Environmental Health, University of Kentucky College of Public Health, Lexington, KY, USA
| | - Laura I Furlong
- Integrative Biomedical Informatics Group, Research Program on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Universitat Pompeu Fabra, Barcelona, Spain
| | - Alvar Agusti
- Fundació Privada Clinic per a la Recerca Biomèdica, Barcelona, Spain CIBER Enfermedades Respiratorias (CIBERES), Spain Thorax Institute, Hospital Clinic, IDIBAPS, Univ. Barcelona, Barcelona, Spain
| |
Collapse
|
678
|
Bello SM, Smith CL, Eppig JT. Allele, phenotype and disease data at Mouse Genome Informatics: improving access and analysis. Mamm Genome 2015; 26:285-94. [PMID: 26162703 PMCID: PMC4534497 DOI: 10.1007/s00335-015-9582-y] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Accepted: 06/23/2015] [Indexed: 11/16/2022]
Abstract
A core part of the Mouse Genome Informatics (MGI) resource is the collection of mouse mutations and the annotation phenotypes and diseases displayed by mice carrying these mutations. These data are integrated with the rest of data in MGI and exported to numerous other resources. The use of mouse phenotype data to drive translational research into human disease has expanded rapidly with the improvements in sequencing technology. MGI has implemented many improvements in allele and phenotype data annotation, search, and display to facilitate access to these data through multiple avenues. For example, the description of alleles has been modified to include more detailed categories of allele attributes. This allows improved discrimination between mutation types. Further, connections have been created between mutations involving multiple genes and each of the genes overlapping the mutation. This allows users to readily find all mutations affecting a gene and see all genes affected by a mutation. In a similar manner, the genes expressed by transgenic or knock-in alleles are now connected to these alleles. The advanced search forms and public reports have been updated to take advantage of these improvements. These search forms and reports are used by an expanding number of researchers to identify novel human disease genes and mouse models of human disease.
Collapse
Affiliation(s)
- Susan M Bello
- Mouse Genome Informatics, The Jackson Laboratory, Bar Harbor, ME, 04609, USA,
| | | | | |
Collapse
|
679
|
A pipeline for the systematic identification of non-redundant full-ORF cDNAs for polymorphic and evolutionary divergent genomes: Application to the ascidian Ciona intestinalis. Dev Biol 2015; 404:149-63. [PMID: 26025923 PMCID: PMC4528069 DOI: 10.1016/j.ydbio.2015.05.014] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2014] [Revised: 05/11/2015] [Accepted: 05/12/2015] [Indexed: 12/17/2022]
Abstract
Genome-wide resources, such as collections of cDNA clones encoding for complete proteins (full-ORF clones), are crucial tools for studying the evolution of gene function and genetic interactions. Non-model organisms, in particular marine organisms, provide a rich source of functional diversity. Marine organism genomes are, however, frequently highly polymorphic and encode proteins that diverge significantly from those of well-annotated model genomes. The construction of full-ORF clone collections from non-model organisms is hindered by the difficulty of predicting accurately the N-terminal ends of proteins, and distinguishing recent paralogs from highly polymorphic alleles. We report a computational strategy that overcomes these difficulties, and allows for accurate gene level clustering of transcript data followed by the automated identification of full-ORFs with correct 5'- and 3'-ends. It is robust to polymorphism, includes paralog calling and does not require evolutionary proximity to well annotated model organisms. We developed this pipeline for the ascidian Ciona intestinalis, a highly polymorphic member of the divergent sister group of the vertebrates, emerging as a powerful model organism to study chordate gene function, Gene Regulatory Networks and molecular mechanisms underlying human pathologies. Using this pipeline we have generated the first full-ORF collection for a highly polymorphic marine invertebrate. It contains 19,163 full-ORF cDNA clones covering 60% of Ciona coding genes, and full-ORF orthologs for approximately half of curated human disease-associated genes.
Collapse
|
680
|
Gutiérrez-Sacristán A, Grosdidier S, Valverde O, Torrens M, Bravo À, Piñero J, Sanz F, Furlong LI. PsyGeNET: a knowledge platform on psychiatric disorders and their genes. Bioinformatics 2015; 31:3075-7. [PMID: 25964630 PMCID: PMC4565028 DOI: 10.1093/bioinformatics/btv301] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2015] [Accepted: 05/05/2015] [Indexed: 12/29/2022] Open
Abstract
Summary: PsyGeNET (Psychiatric disorders and Genes association NETwork) is a knowledge platform for the exploratory analysis of psychiatric diseases and their associated genes. PsyGeNET is composed of a database and a web interface supporting data search, visualization, filtering and sharing. PsyGeNET integrates information from DisGeNET and data extracted from the literature by text mining, which has been curated by domain experts. It currently contains 2642 associations between 1271 genes and 37 psychiatric disease concepts. In its first release, PsyGeNET is focused on three psychiatric disorders: major depression, alcohol and cocaine use disorders. PsyGeNET represents a comprehensive, open access resource for the analysis of the molecular mechanisms underpinning psychiatric disorders and their comorbidities. Availability and implementation: The PysGeNET platform is freely available at http://www.psygenet.org/. The PsyGeNET database is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). Contact:lfurlong@imim.es Supplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Alba Gutiérrez-Sacristán
- Research Group on Integrative Biomedical Informatics, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), Neurobiology of Behaviour Research Group (GReNeC), IMIM, DCEXS, UPF and Institute of Neuropsychiatry and Addiction, Parc de Salut Mar, Universitat Autònoma de Barcelona, Barcelona 08003, Spain
| | - Solène Grosdidier
- Research Group on Integrative Biomedical Informatics, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), Neurobiology of Behaviour Research Group (GReNeC), IMIM, DCEXS, UPF and Institute of Neuropsychiatry and Addiction, Parc de Salut Mar, Universitat Autònoma de Barcelona, Barcelona 08003, Spain
| | - Olga Valverde
- Research Group on Integrative Biomedical Informatics, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), Neurobiology of Behaviour Research Group (GReNeC), IMIM, DCEXS, UPF and Institute of Neuropsychiatry and Addiction, Parc de Salut Mar, Universitat Autònoma de Barcelona, Barcelona 08003, Spain
| | - Marta Torrens
- Research Group on Integrative Biomedical Informatics, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), Neurobiology of Behaviour Research Group (GReNeC), IMIM, DCEXS, UPF and Institute of Neuropsychiatry and Addiction, Parc de Salut Mar, Universitat Autònoma de Barcelona, Barcelona 08003, Spain
| | - Àlex Bravo
- Research Group on Integrative Biomedical Informatics, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), Neurobiology of Behaviour Research Group (GReNeC), IMIM, DCEXS, UPF and Institute of Neuropsychiatry and Addiction, Parc de Salut Mar, Universitat Autònoma de Barcelona, Barcelona 08003, Spain
| | - Janet Piñero
- Research Group on Integrative Biomedical Informatics, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), Neurobiology of Behaviour Research Group (GReNeC), IMIM, DCEXS, UPF and Institute of Neuropsychiatry and Addiction, Parc de Salut Mar, Universitat Autònoma de Barcelona, Barcelona 08003, Spain
| | - Ferran Sanz
- Research Group on Integrative Biomedical Informatics, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), Neurobiology of Behaviour Research Group (GReNeC), IMIM, DCEXS, UPF and Institute of Neuropsychiatry and Addiction, Parc de Salut Mar, Universitat Autònoma de Barcelona, Barcelona 08003, Spain
| | - Laura I Furlong
- Research Group on Integrative Biomedical Informatics, Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences (DCEXS), Universitat Pompeu Fabra (UPF), Neurobiology of Behaviour Research Group (GReNeC), IMIM, DCEXS, UPF and Institute of Neuropsychiatry and Addiction, Parc de Salut Mar, Universitat Autònoma de Barcelona, Barcelona 08003, Spain
| |
Collapse
|