Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles

120
(from Reference Citation Analysis)

Article PDFs (39)

Cited by > 0 (107)

Searched Name

Jaap Heringa

Liu T, Feenstra KA, Huang Z, Heringa J. Mining literature and pathway data to explore the relations of ketamine with neurotransmitters and gut microbiota using a knowledge-graph. Bioinformatics 2024;40:btad771. [PMID: 38147362 PMCID: PMC10769815 DOI: 10.1093/bioinformatics/btad771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/16/2023] [Revised: 11/06/2023] [Accepted: 12/25/2023] [Indexed: 12/27/2023] Open

Abstract

MOTIVATION

Up-to-date pathway knowledge is usually presented in scientific publications for human reading, making it difficult to utilize these resources for semantic integration and computational analysis of biological pathways. We here present an approach to mining knowledge graphs by combining manual curation with automated named entity recognition and automated relation extraction. This approach allows us to study pathway-related questions in detail, which we here show using the ketamine pathway, aiming to help improve understanding of the role of gut microbiota in the antidepressant effects of ketamine.

RESULTS

The thus devised ketamine pathway 'KetPath' knowledge graph comprises five parts: (i) manually curated pathway facts from images; (ii) recognized named entities in biomedical texts; (iii) identified relations between named entities; (iv) our previously constructed microbiota and pre-/probiotics knowledge bases; and (v) multiple community-accepted public databases. We first assessed the performance of automated extraction of relations between named entities using the specially designed state-of-the-art tool BioKetBERT. The query results show that we can retrieve drug actions, pathway relations, co-occurring entities, and their relations. These results uncover several biological findings, such as various gut microbes leading to increased expression of BDNF, which may contribute to the sustained antidepressant effects of ketamine. We envision that the methods and findings from this research will aid researchers who wish to integrate and query data and knowledge from multiple biomedical databases and literature simultaneously.

AVAILABILITY AND IMPLEMENTATION

Data and query protocols are available in the KetPath repository at https://dx.doi.org/10.5281/zenodo.8398941 and https://github.com/tingcosmos/KetPath.

Gavai A, Bouzembrak Y, Mu W, Martin F, Kaliyaperumal R, van Soest J, Choudhury A, Heringa J, Dekker A, Marvin HJP. Author Correction: Applying federated learning to combat food fraud in food supply chains. NPJ Sci Food 2023;7:57. [PMID: 37857631 PMCID: PMC10587136 DOI: 10.1038/s41538-023-00232-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/21/2023] Open

Gavai A, Bouzembrak Y, Mu W, Martin F, Kaliyaperumal R, van Soest J, Choudhury A, Heringa J, Dekker A, Marvin HJP. Applying federated learning to combat food fraud in food supply chains. NPJ Sci Food 2023;7:46. [PMID: 37658060 PMCID: PMC10474077 DOI: 10.1038/s41538-023-00220-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 01/09/2023] [Accepted: 08/16/2023] [Indexed: 09/03/2023] Open

Lakbir S, Lahoz S, Cuatrecasas M, Camps J, Glas RA, Heringa J, Meijer GA, Abeln S, Fijneman RJA. Tumour break load is a biologically relevant feature of genomic instability with prognostic value in colorectal cancer. Eur J Cancer 2022;177:94-102. [PMID: 36334560 DOI: 10.1016/j.ejca.2022.09.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/06/2022] [Revised: 09/28/2022] [Accepted: 09/30/2022] [Indexed: 01/06/2023]

Affiliation(s)

Soufyan Lakbir Bioinformatics Group, Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam 1081HV, the Netherlands; Department of Pathology, Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066CX, the Netherlands
Sara Lahoz Translational Colorectal Cancer Genomics, Gastrointestinal and Pancreatic Oncology Team, Institut D'Investigacions Biomèdiques August Pi I Sunyer (IDIBAPS), Hospital Clínic de Barcelona, Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), Barcelona, 08036, Spain
Miriam Cuatrecasas Pathology Department, Biomedical Diagnostic Center (CDB), Hospital Clínic de Barcelona, Institut D'Investigacions Biomèdiques August Pi I Sunyer (IDIBAPS), Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), Universitat de Barcelona (UB), Barcelona, 08036, Spain
Jordi Camps Translational Colorectal Cancer Genomics, Gastrointestinal and Pancreatic Oncology Team, Institut D'Investigacions Biomèdiques August Pi I Sunyer (IDIBAPS), Hospital Clínic de Barcelona, Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), Barcelona, 08036, Spain; Department of Cell Biology, Physiology and Immunology, Faculty of Medicine, Autonomous University of Barcelona, Bellaterra, 08193, Spain
Roel A Glas Bioinformatics Group, Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam 1081HV, the Netherlands; Department of Pathology, Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066CX, the Netherlands
Jaap Heringa Bioinformatics Group, Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam 1081HV, the Netherlands; AIMMS - Amsterdam Institute for Molecules Medicines and Systems, Vrije Universiteit Amsterdam, Amsterdam 1081HV, the Netherlands
Gerrit A Meijer Department of Pathology, Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066CX, the Netherlands
Sanne Abeln Bioinformatics Group, Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam 1081HV, the Netherlands; Life Sciences and Health Research Group, Centrum Wiskunde & Informatica (CWI), Science Park 123, Amsterdam 1098 XG, the Netherlands.
Remond J A Fijneman Department of Pathology, Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066CX, the Netherlands.

van Bree E, Alarcón CR, Lakbir S, Stelloo E, Buranelli C, Hondema A, van 't Erve I, Vessies D, Delis-van Diemen P, Tijssen M, Bolijn A, Lanfermeijer M, Linders D, Swennenhuis J, van den Broek D, Heringa J, Meijer G, Carvalho B, Feitsma H, Abeln S, Fijneman RJA. Abstract A020: Structural variants in the pathogenesis of colorectal cancer: The elephant in the room. Cancer Res 2022. [DOI: 10.1158/1538-7445.crc22-a020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/05/2022]

Abstract

Background: Cancer is caused by somatic DNA alterations, comprising single/small nucleotide variants (SNVs), somatic copy number alterations (SCNAs) and chromosomal rearrangement structural variants (SVs). We previously demonstrated that SVs are recurrently identified in hundreds of genes and are highly prevalent in common fragile site genes, e.g., in MACROD2 in >40% of colorectal cancers (CRCs). However, computational methods that discriminate SV-driver from SV-passenger events are lacking and laboratory methods to detect SVs at nucleotide resolution from routinely obtained formalin-fixed paraffin-embedded (FFPE) tumor tissue material are underdeveloped. Therefore, despite the abundant presence of SVs, knowledge about their biological and clinical impact is limited.

Aim: The aim of our studies is to identify genes of which the function is frequently affected by SV, to understand how these genes contribute to CRC pathogenesis, and to translate these SVs into clinically relevant biomarkers.

Methods: We made use of publicly available deep whole genome DNA sequencing data and tumor-matched RNA sequencing data from the Hartwig Medical Foundation to develop the algorithm ‘CoBRA’: Computation of Biologically Relevant Alterations. Adenoma-derived organoids were used for CRISPR/Cas9-mediated gene modulation for functional analysis of SV-driver events. Cergentis’ targeted locus capture (FFPE-TLC) technology was used to detect SVs at nucleotide resolution from FFPE material, which were translated into droplet digital PCR (ddPCR) assays for the detection of SVs in cell-free circulating tumor DNA (ctDNA) in liquid biopsies.

Results: The CoBRA algorithm associated the presence of SV-events in frequently affected genes to the extent in which genome-wide RNA sequencing data were altered. In this way, CoBRA ranked SV-events in genes according to their putative impact on tumor biology. SVs in MACROD2 ranked among those with the highest impact on tumor biology. Therefore, we generated focal deletions in MACROD2 in adenoma-derived organoids for functional analyses. Moreover, using FFPE tumor tissue material we detected SVs at nucleotide resolution in MACROD2 and three other genes in 21 out of 29 patients. SVs were verified by PCR on tumor tissue and subsequently translated into ddPCR biomarker assays for detection of SVs in ctDNA in blood from the same patients.

Conclusions: We developed the computational method CoBRA and succeeded to detect SVs with high impact on tumor biology. These SVs are prioritized for functional analysis in pre-malignant adenoma-derived organoids; for targeted detection in routinely obtained FFPE tumor tissue material; and for translation into liquid biopsy ctDNA assays. Proof of concept was delivered for MACROD2. Our novel computational and laboratory methodologies provide valuable tools to effectively explore the biological and clinical impact of SVs, which will contribute to our understanding of these common recurrent somatic alterations in CRC and their translation into clinically relevant biomarker applications.

Citation Format: Elise van Bree, Carmen Rubio Alarcón, Soufyan Lakbir, Ellen Stelloo, Caterina Buranelli, Amber Hondema, Iris van 't Erve, Daan Vessies, Pien Delis-van Diemen, Marianne Tijssen, Anne Bolijn, Mirthe Lanfermeijer, Dorothe Linders, Joost Swennenhuis, Daan van den Broek, Jaap Heringa, Gerrit Meijer, Beatriz Carvalho, Harma Feitsma, Sanne Abeln, Remond J. A. Fijneman. Structural variants in the pathogenesis of colorectal cancer: The elephant in the room [abstract]. In: Proceedings of the AACR Special Conference on Colorectal Cancer; 2022 Oct 1-4; Portland, OR. Philadelphia (PA): AACR; Cancer Res 2022;82(23 Suppl_1):Abstract nr A020.

Liu T, Lan G, Feenstra KA, Huang Z, Heringa J. Towards a knowledge graph for pre-/probiotics and microbiota-gut-brain axis diseases. Sci Rep 2022;12:18977. [PMID: 36347868 PMCID: PMC9643397 DOI: 10.1038/s41598-022-21735-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 05/23/2022] [Accepted: 09/30/2022] [Indexed: 11/09/2022] Open

Stringer B, de Ferrante H, Abeln S, Heringa J, Feenstra KA, Haydarlou R. PIPENN: protein interface prediction from sequence with an ensemble of neural nets. Bioinformatics 2022;38:2111-2118. [PMID: 35150231 PMCID: PMC9004643 DOI: 10.1093/bioinformatics/btac071] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 09/03/2021] [Revised: 01/16/2022] [Accepted: 02/04/2022] [Indexed: 02/03/2023] Open

Abstract

MOTIVATION

The interactions between proteins and other molecules are essential to many biological and cellular processes. Experimental identification of interface residues is a time-consuming, costly and challenging task, while protein sequence data are ubiquitous. Consequently, many computational and machine learning approaches have been developed over the years to predict such interface residues from sequence. However, the effectiveness of different Deep Learning (DL) architectures and learning strategies for protein-protein, protein-nucleotide and protein-small molecule interface prediction has not yet been investigated in great detail. Therefore, we here explore the prediction of protein interface residues using six DL architectures and various learning strategies with sequence-derived input features.

RESULTS

We constructed a large dataset dubbed BioDL, comprising protein-protein interactions from the PDB, and DNA/RNA and small molecule interactions from the BioLip database. We also constructed six DL architectures, and evaluated them on the BioDL benchmarks. This shows that no single architecture performs best on all instances. An ensemble architecture, which combines all six architectures, does consistently achieve peak prediction accuracy. We confirmed these results on the published benchmark set by Zhang and Kurgan (ZK448), and on our own existing curated homo- and heteromeric protein interaction dataset. Our PIPENN sequence-based ensemble predictor outperforms current state-of-the-art sequence-based protein interface predictors on ZK448 on all interaction types, achieving an AUC-ROC of 0.718 for protein-protein, 0.823 for protein-nucleotide and 0.842 for protein-small molecule.

AVAILABILITY AND IMPLEMENTATION

Source code and datasets are available at https://github.com/ibivu/pipenn/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Hou Q, Stringer B, Waury K, Capel H, Haydarlou R, Xue F, Abeln S, Heringa J, Feenstra KA. SeRenDIP-CE: Sequence-based Interface Prediction for Conformational Epitopes. Bioinformatics 2021;37:3421-3427. [PMID: 33974039 PMCID: PMC8136078 DOI: 10.1093/bioinformatics/btab321] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 11/19/2020] [Revised: 03/26/2021] [Accepted: 04/26/2021] [Indexed: 11/21/2022] Open

Liu T, Pan X, Wang X, Feenstra KA, Heringa J, Huang Z. Predicting the relationships between gut microbiota and mental disorders with knowledge graphs. Health Inf Sci Syst 2020;9:3. [PMID: 33262885 PMCID: PMC7686388 DOI: 10.1007/s13755-020-00128-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 08/24/2020] [Accepted: 09/30/2020] [Indexed: 01/14/2023] Open

Dijkstra MJJ, van der Ploeg AJ, Feenstra KA, Fokkink WJ, Abeln S, Heringa J. Tailor-made multiple sequence alignments using the PRALINE 2 alignment toolkit. Bioinformatics 2020;35:5315-5317. [PMID: 31368486 PMCID: PMC6954659 DOI: 10.1093/bioinformatics/btz572] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 05/29/2019] [Revised: 05/29/2019] [Accepted: 07/29/2019] [Indexed: 12/03/2022] Open

Hou Q, De Geest PFG, Griffioen CJ, Abeln S, Heringa J, Feenstra KA. SeRenDIP: SEquential REmasteriNg to DerIve profiles for fast and accurate predictions of PPI interface positions. Bioinformatics 2020;35:4794-4796. [PMID: 31116381 DOI: 10.1093/bioinformatics/btz428] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Received: 04/09/2019] [Revised: 05/12/2019] [Accepted: 05/17/2019] [Indexed: 11/13/2022] Open

Jacobsen A, Ivanova O, Amini S, Heringa J, Kemmeren P, Feenstra KA. A framework for exhaustive modelling of genetic interaction patterns using Petri nets. Bioinformatics 2020;36:2142-2149. [PMID: 31845959 DOI: 10.1093/bioinformatics/btz917] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 12/16/2018] [Revised: 07/09/2019] [Accepted: 12/13/2019] [Indexed: 11/13/2022] Open

Jacobsen A, de Miranda Azevedo R, Juty N, Batista D, Coles S, Cornet R, Courtot M, Crosas M, Dumontier M, Evelo CT, Goble C, Guizzardi G, Hansen KK, Hasnain A, Hettne K, Heringa J, Hooft RW, Imming M, Jeffery KG, Kaliyaperumal R, Kersloot MG, Kirkpatrick CR, Kuhn T, Labastida I, Magagna B, McQuilton P, Meyers N, Montesanti A, van Reisen M, Rocca-Serra P, Pergl R, Sansone SA, da Silva Santos LOB, Schneider J, Strawn G, Thompson M, Waagmeester A, Weigel T, Wilkinson MD, Willighagen EL, Wittenburg P, Roos M, Mons B, Schultes E. FAIR Principles: Interpretations and Implementation Considerations. Data Intelligence 2020. [DOI: 10.1162/dint_r_00024] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Indexed: 11/04/2022] Open

Affiliation(s)

Annika Jacobsen Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
Ricardo de Miranda Azevedo Institute of Data Science, Maastricht University, Universiteitssingel 60, Maastricht 6229 ER, The Netherlands
Nick Juty Department of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PL, UK
Dominique Batista Oxford e-Research Centre, Department of Engineering Sciences, University of Oxford, Oxford OX13PJ, UK
Simon Coles School of Chemistry, Faculty of Engineering and Physical Sciences, University of Southampton, SO17 1BJ, UK
Ronald Cornet Amsterdam UMC, University of Amsterdam, Amsterdam 1000 GG, The Netherlands
Mélanie Courtot European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
Mercè Crosas Harvard University, Cambridge, Massachusetts 02138, USA
Michel Dumontier Institute of Data Science, Maastricht University, Universiteitssingel 60, Maastricht 6229 ER, The Netherlands
Chris T. Evelo Department of Bioinformatics – BiGCaT, NUTRIM, Maastricht University, Maastricht 6229 ER, The Netherlands
Carole Goble Department of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PL, UK
Giancarlo Guizzardi Conceptual and Cognitive Modeling Research Group (CORE), Free University of Bozen-Bolzano, Bolzano 39100, Italy
Karsten Kryger Hansen Aalborg University, Aalborg DK-9220, Denmark
Ali Hasnain Insight Centre for Data Analytics, National University of Ireland Galway, H91 TK33, Ireland
Kristina Hettne Centre for Digital Scholarship, Leiden University Libraries, Leiden, 2333 ZA, The Netherlands
Jaap Heringa Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 1105, 1081 HV Amsterdam, The Netherlands
Rob W.W. Hooft Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 1105, 1081 HV Amsterdam, The Netherlands; Dutch Techcentre for Life Sciences (DTL), Utrecht, The Netherlands
Melanie Imming SURF, Utrecht 3511 EP, The Netherlands
Keith G. Jeffery Keith G Jeffery Consultants, Faringdon, UK
Rajaram Kaliyaperumal Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
Martijn G. Kersloot Amsterdam UMC, University of Amsterdam, Amsterdam 1000 GG, The Netherlands; Castor EDC, Paasheuvelweg 25, Wing 5D, 1105 BP, Amsterdam, The Netherlands
Christine R. Kirkpatrick San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, USA
Tobias Kuhn Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 1105, 1081 HV Amsterdam, The Netherlands
Ignasi Labastida Learning and Research Resources Centre (CRAI), Universitat de Barcelona, 08007 Barcelona, Spain
Barbara Magagna Environment Agency Austria, A-1090 Vienna, Austria
Peter McQuilton Oxford e-Research Centre, Department of Engineering Sciences, University of Oxford, Oxford OX13PJ, UK
Natalie Meyers University of Notre Dame, 75004 Paris, France
Annalisa Montesanti Health Research Board (HRB), Dublin 2, DO2 H638, Ireland
Mirjam van Reisen Liacs Institute of Advanced Computer Science, Leiden University, 2311 GJ Leiden, The Netherlands
Philippe Rocca-Serra Oxford e-Research Centre, Department of Engineering Sciences, University of Oxford, Oxford OX13PJ, UK
Robert Pergl Czech Technical University in Prague, Faculty of Information Technology (FIT CTU), 160 00 Prague 6, Czech Republic
Susanna-Assunta Sansone Oxford e-Research Centre, Department of Engineering Sciences, University of Oxford, Oxford OX13PJ, UK
Luiz Olavo Bonino da Silva Santos GO FAIR International Support & Coordination Office (GFISCO), Leiden, The Netherlands
Juliane Schneider Harvard Catalyst Clinical and Translational Science Center, Boston, MA 02115, USA
George Strawn US National Academy of Sciences, Washington DC 20418, USA
Mark Thompson Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
Andra Waagmeester Micelio, Ekeren, Antwerp, Belgium
Tobias Weigel Deutsches Klimarechenzentrum, Bundesstrasse 45a, 20146 Hamburg, Germany
Mark D. Wilkinson Center for Plant Biotechnology and Genomics UPM-INIA, Madrid 28040, Spain
Egon L. Willighagen Department of Bioinformatics – BiGCaT, NUTRIM, Maastricht University, Maastricht 6229 ER, The Netherlands
Peter Wittenburg Max Planck Computing and Data Facility, Gießenbachstraße 2, 85748 Garching, Germany
Marco Roos Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
Barend Mons Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands; GO FAIR International Support & Coordination Office (GFISCO), Leiden, The Netherlands
Erik Schultes GO FAIR International Support & Coordination Office (GFISCO), Leiden, The Netherlands; Leiden Center for Data Science, 2311 EZ Leiden, The Netherlands

Saunders G, Baudis M, Becker R, Beltran S, Béroud C, Birney E, Brooksbank C, Brunak S, Van den Bulcke M, Drysdale R, Capella-Gutierrez S, Flicek P, Florindi F, Goodhand P, Gut I, Heringa J, Holub P, Hooyberghs J, Juty N, Keane TM, Korbel JO, Lappalainen I, Leskosek B, Matthijs G, Mayrhofer MT, Metspalu A, Navarro A, Newhouse S, Nyrönen T, Page A, Persson B, Palotie A, Parkinson H, Rambla J, Salgado D, Steinfelder E, Swertz MA, Valencia A, Varma S, Blomberg N, Scollen S. Leveraging European infrastructures to access 1 million human genomes by 2022. Nat Rev Genet 2019;20:693-701. [PMID: 31455890 PMCID: PMC7115898 DOI: 10.1038/s41576-019-0156-9] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Accepted: 07/03/2019] [Indexed: 01/22/2023]

Affiliation(s)

Gary Saunders ELIXIR Hub, Wellcome Genome Campus, Hinxton, Cambridge, UK
Michael Baudis University of Zurich, Zurich, Switzerland
Regina Becker Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
Sergi Beltran CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
Christophe Béroud Aix Marseille Univ, INSERM, MMG, Marseille, France; Département de Génétique Médicale et de Biologie Cellulaire, APHM, Hôpital d'Enfants de la Timone, Marseille, France
Ewan Birney European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Cath Brooksbank European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Søren Brunak Department of Health Technology, Technical University of Denmark, Lyngby, Denmark; Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
Marc Van den Bulcke Cancer Centre, Epidemiology and Public Health, Sciensano, Ixelles, Belgium
Rachel Drysdale ELIXIR Hub, Wellcome Genome Campus, Hinxton, Cambridge, UK
Salvador Capella-Gutierrez Barcelona Supercomputing Centre (BSC), Barcelona, Spain
Paul Flicek European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Francesco Florindi BBMRI-ERIC, Graz, Austria
Peter Goodhand Ontario Institute for Cancer Research, Toronto, Ontario, Canada; Global Alliance for Genomics and Health, Toronto, Ontario, Canada
Ivo Gut CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
Jaap Heringa Department of Computer Science, Vrije Universiteit, Amsterdam, Netherlands
Petr Holub BBMRI-ERIC, Graz, Austria
Jef Hooyberghs Flemish Institute for Technological Research, VITO, Mol, Belgium
Nick Juty School of Computer Science, The University of Manchester, Manchester, UK
Thomas M Keane European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Jan O Korbel European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
Ilkka Lappalainen CSC - IT Center for Science, Espoo, Finland
Brane Leskosek IBMI, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
Gert Matthijs Katholieke Universiteit Leuven, Leuven, Belgium
Michaela Th Mayrhofer BBMRI-ERIC, Graz, Austria
Andres Metspalu Estonian Genome Center, University of Tartu, Tartu, Estonia
Arcadi Navarro Institute of Evolutionary Biology (UPF-CSIC), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain; Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
Steven Newhouse European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Tommi Nyrönen CSC - IT Center for Science, Espoo, Finland
Angela Page Global Alliance for Genomics and Health, Toronto, Ontario, Canada; Broad Institute of MIT and Harvard, Cambridge, MA, USA
Bengt Persson Department of Cell and Molecular Biology, Science for Life Laboratory, Uppsala, Sweden
Aarno Palotie Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
Helen Parkinson European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Jordi Rambla Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
David Salgado Aix Marseille Univ, INSERM, MMG, Marseille, France
Erik Steinfelder BBMRI-ERIC, Graz, Austria
Morris A Swertz BBMRI-NL/University Medical Center Groningen, University of Groningen, Groningen, Netherlands
Alfonso Valencia Barcelona Supercomputing Centre (BSC), Barcelona, Spain; ICREA, Pg., Barcelona, Spain
Susheel Varma European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
Niklas Blomberg ELIXIR Hub, Wellcome Genome Campus, Hinxton, Cambridge, UK
Serena Scollen ELIXIR Hub, Wellcome Genome Campus, Hinxton, Cambridge, UK.

Willems SM, Abeln S, Feenstra KA, de Bree R, van der Poel EF, Baatenburg de Jong RJ, Heringa J, van den Brekel MWM. The potential use of big data in oncology. Oral Oncol 2019;98:8-12. [PMID: 31521885 DOI: 10.1016/j.oraloncology.2019.09.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 03/05/2019] [Revised: 07/31/2019] [Accepted: 09/06/2019] [Indexed: 12/16/2022]

van Gelder CWG, Hooft RWW, van Rijswijk MN, van den Berg L, Kok RG, Reinders M, Mons B, Heringa J. Bioinformatics in the Netherlands: the value of a nationwide community. Brief Bioinform 2019;20:540-550. [PMID: 28968694 PMCID: PMC6433734 DOI: 10.1093/bib/bbx087] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 03/30/2017] [Revised: 07/03/2017] [Indexed: 11/14/2022] Open

Fijneman RJA, Mekkes N, Broek EVD, Stringer B, Glas RA, Komor MA, Rausch C, Lieshout SV, Cuppen E, Smith ML, Sebra RP, Rowell WJ, Ashby M, Carvalho B, Heringa J, Meijer GA, Abeln S. Abstract 1738: Characterization of structural variants within MACROD2 in the pathogenesis of colorectal cancer. Cancer Res 2019. [DOI: 10.1158/1538-7445.am2019-1738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022]

Abstract

Background: Cancer is caused by somatic DNA alterations, which comprise small nucleotide variants (SNVs), chromosome somatic copy number alterations (SCNAs) and chromosomal breakpoint structural variants (SVs). Previously, we investigated SCNA-associated SVs in colorectal cancer (CRC) and demonstrated that SVs within the MACROD2 gene are highly prevalent. This raises the question whether SVs in MACROD2 may already be present in CRC precursor lesions, i.e. in colorectal adenomas. We have also demonstrated that loss of MACROD2 protein expression is associated with poor response to treatment with 5-fluorouracil-based adjuvant chemotherapy, indicating that MACROD2 function is clinically relevant. The aim of this study is to characterize SVs within MACROD2 in more detail in the pathogenesis of colorectal cancer.

Methods: The frequencies of SCNA-associated SVs in 466 CRCs were compared to those in 118 colorectal adenomas, using array-comparative genomic hybridization. Targeted PacBio long-read sequencing was applied to detect and characterize SVs at nucleotide resolution within MACROD2, in tens of primary CRCs. Illumina whole genome sequencing data of > 450 CRC metastatic lesions, generated by the Hartwig Medical Foundation (HMF; www.hartwigmedicalfoundation.nl), were used for validation purposes.

Results: MACROD2 SCNA-associated SVs were rarely detected among 118 colorectal adenomas (<2%) while being highly prevalent among 466 CRCs (40%). SVs in MACROD2 are currently being characterized at nucleotide resolution by analysis of targeted PacBio long-read sequencing data, the results of which will be presented during the AACR annual meeting. Preliminary analysis of HMF whole genome sequencing data confirms that at least 40% of CRC metastatic lesions are affected by SVs within the MACROD2 gene, most commonly by focal deletions.

Discussion: The current observation that SVs in MACROD2 are nearly absent in adenomas while being highly prevalent in CRCs indicates that MACROD2 is affected at a late stage of colorectal adenoma-to-carcinoma progression. A recent publication by Sakthianandeswaren et al (Cancer Discovery 2018) indicated that loss of MACROD2 promotes chromosomal instability. Taken together, these data support a model in which adenoma-to-carcinoma progression is driven, at least in part, by genomic instability caused by loss of function of the MACROD2 tumor suppressor gene.

Citation Format: Remond J A Fijneman, Nienke Mekkes, Evert van den Broek, Bas Stringer, Roel A. Glas, Malgorzata A. Komor, Christian Rausch, Stef van Lieshout, Edwin Cuppen, Melissa L. Smith, Robert P. Sebra, William J. Rowell, Meredith Ashby, Beatriz Carvalho, Jaap Heringa, Gerrit A. Meijer, Sanne Abeln. Characterization of structural variants within MACROD2 in the pathogenesis of colorectal cancer [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2019; 2019 Mar 29-Apr 3; Atlanta, GA. Philadelphia (PA): AACR; Cancer Res 2019;79(13 Suppl):Abstract nr 1738.

Amini S, Jacobsen A, Ivanova O, Lijnzaad P, Heringa J, Holstege FCP, Feenstra KA, Kemmeren P. The ability of transcription factors to differentially regulate gene expression is a crucial component of the mechanism underlying inversion, a frequently observed genetic interaction pattern. PLoS Comput Biol 2019;15:e1007061. [PMID: 31083661 PMCID: PMC6532943 DOI: 10.1371/journal.pcbi.1007061] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 06/18/2018] [Revised: 05/23/2019] [Accepted: 04/30/2019] [Indexed: 12/21/2022] Open

Abstract

Genetic interactions, a phenomenon whereby combinations of mutations lead to unexpected effects, reflect how cellular processes are wired and play an important role in complex genetic diseases. Understanding the molecular basis of genetic interactions is crucial for deciphering pathway organization as well as understanding the relationship between genetic variation and disease. Several hypothetical molecular mechanisms have been linked to different genetic interaction types. However, differences in genetic interaction patterns and their underlying mechanisms have not yet been compared systematically between different functional gene classes. Here, differences in the occurrence and types of genetic interactions are compared for two classes, gene-specific transcription factors (GSTFs) and signaling genes (kinases and phosphatases). Genome-wide gene expression data for 63 single and double deletion mutants in baker's yeast reveal that the two most common genetic interaction patterns are buffering and inversion. Buffering is typically associated with redundancy and is well understood. In inversion, genes show opposite behavior in the double mutant compared to the corresponding single mutants. The underlying mechanism is poorly understood. Although both classes show buffering and inversion patterns, the prevalence of inversion is much stronger in GSTFs. To decipher potential mechanisms, a Petri Net modeling approach was employed, where genes are represented as nodes and relationships between genes as edges. This allowed over 9 million possible three- and four-node models to be exhaustively enumerated. The models show that a quantitative difference in interaction strength is a strict requirement for obtaining inversion. In addition, this difference is frequently accompanied by a second gene that shows buffering. Taken together, these results provide a mechanistic explanation for inversion. Furthermore, the ability of transcription factors to differentially regulate expression of their targets provides a likely explanation why inversion is more prevalent for GSTFs compared to kinases and phosphatases.

Dijkstra M, Bawono P, Abeln S, Feenstra KA, Fokkink W, Heringa J. Motif-Aware PRALINE: Improving the alignment of motif regions. PLoS Comput Biol 2018;14:e1006547. [PMID: 30383764 PMCID: PMC6233922 DOI: 10.1371/journal.pcbi.1006547] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 01/11/2018] [Revised: 11/13/2018] [Accepted: 10/05/2018] [Indexed: 11/21/2022] Open

Abstract

Protein or DNA motifs are sequence regions which possess biological importance. These regions are often highly conserved among homologous sequences. The generation of multiple sequence alignments (MSAs) with a correct alignment of the conserved sequence motifs is still difficult to achieve, due to the fact that the contribution of these typically short fragments is overshadowed by the rest of the sequence. Here we extended the PRALINE multiple sequence alignment program with a novel motif-aware MSA algorithm in order to address this shortcoming. This method can incorporate explicit information about the presence of externally provided sequence motifs, which is then used in the dynamic programming step by boosting the amino acid substitution matrix towards the motif. The strength of the boost is controlled by a parameter, α. Using a benchmark set of alignments we confirm that a good compromise can be found that improves the matching of motif regions while not significantly reducing the overall alignment quality. By estimating α on an unrelated set of reference alignments we find there is indeed a strong conservation signal for motifs. A number of typical but difficult MSA use cases are explored to exemplify the problems in correctly aligning functional sequence motifs and how the motif-aware alignment method can be employed to alleviate these problems.

The most important functional parts of proteins are often small—but very specific—sequence motifs. Moreover, these motifs tend to be strongly conserved during evolution due to their functional role. Nevertheless, when trying to align protein sequences of the same family, it is often very difficult to align such motifs using standard multiple sequence alignment methods. Aligning functional residues correctly is essential to detect motif conservation, which can be used to filter out spuriously occurring motifs. Additionally, many downstream analyses, such as phylogenetics, are strongly reliant on alignment quality. We have developed a sequence alignment program named Motif-Aware PRALINE (MA-PRALINE) that incorporates information about motifs explicitly. Motifs are provided to MA-PRALINE in the PROSITE pattern syntax; it then scans the input sequences for instances of the pattern and provides a score bonus to matching sequence positions. Our method provides a reproducible alternative to editing alignments by hand in order to account for motif conservation, which is a tedious and error-prone process. We will show that MA-PRALINE allows the alignment of motif-rich regions to be fine-tuned while not degrading the rest of the alignment. MA-PRALINE is available on GitHub as open source software; this allows it to be easily tailored to similar problems. We apply MA-PRALINE on the HIV-1 envelope glycoprotein (gp120) to get an improved alignment of the N-terminal glycosylation motifs. The presence of these motifs is essential for the virus in evading the immune response of the host.
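
The motif-scanning step described above (PROSITE pattern in, score bonus on matching positions out) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not MA-PRALINE's actual code: the function names, the identity base score, and the rule "add α when both residues lie inside a motif match" are all hypothetical, and only a subset of the PROSITE syntax is handled.

```python
import re


def prosite_to_regex(pattern):
    """Translate a simple PROSITE pattern such as 'N-{P}-[ST]-{P}' into a regex.

    Handles only single residues, [..] sets, {..} exclusions, the wildcard x,
    and repeat counts like x(2) or x(2,4). Anchors ('<', '>') are not handled.
    """
    parts = []
    for element in pattern.rstrip(".").split("-"):
        match = re.fullmatch(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element)
        core, low, high = match.groups()
        if core == "x":
            core = "."
        elif core.startswith("{"):
            core = "[^" + core[1:-1] + "]"
        if low and high:
            core += "{" + low + "," + high + "}"
        elif low:
            core += "{" + low + "}"
        parts.append(core)
    return "".join(parts)


def motif_positions(sequence, pattern):
    """Return the set of 0-based sequence positions covered by any motif match."""
    # The lookahead lets overlapping matches be found as well.
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    positions = set()
    for hit in regex.finditer(sequence):
        start = hit.start(1)
        positions.update(range(start, start + len(hit.group(1))))
    return positions


def identity_score(res_a, res_b):
    """Hypothetical stand-in for a real substitution matrix."""
    return 1.0 if res_a == res_b else -1.0


def boosted_score(res_a, res_b, in_motif_a, in_motif_b, alpha, base_score=identity_score):
    """Assumed boosting rule: add alpha when both positions lie in a motif match."""
    score = base_score(res_a, res_b)
    if in_motif_a and in_motif_b:
        score += alpha
    return score


# N-glycosylation pattern N-{P}-[ST]-{P} on a toy sequence.
hits = motif_positions("MNGTAAPNPSA", "N-{P}-[ST]-{P}")
print(sorted(hits))  # [1, 2, 3, 4]
```

In a dynamic programming alignment, a function like `boosted_score` would replace the plain substitution lookup, so that α = 0 recovers the unmodified alignment and larger values pull motif positions together more strongly.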

Dijkstra M, Fokkink W, Heringa J, van Dijk E, Abeln S. The characteristics of molten globule states and folding pathways strongly depend on the sequence of a protein. Mol Phys 2018. [DOI: 10.1080/00268976.2018.1496290] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Indexed: 10/28/2022]

Feenstra KA, Abeln S, Westerhuis JA, Brancos dos Santos F, Molenaar D, Teusink B, Hoefsloot HCJ, Heringa J. Training for translation between disciplines: a philosophy for life and data sciences curricula. Bioinformatics 2018;34:i4-i12. [PMID: 29950011 PMCID: PMC6022589 DOI: 10.1093/bioinformatics/bty233] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Indexed: 12/01/2022] Open

van Gelder CWG, Hooft RWW, van Rijswijk MN, van den Berg L, Kok RG, Reinders M, Mons B, Heringa J. Bioinformatics in the Netherlands: the value of a nationwide community. Brief Bioinform 2018;19:359. [PMID: 29267862 DOI: 10.1093/bib/bbx171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/13/2022] Open

Hou Q, De Geest PFG, Vranken WF, Heringa J, Feenstra KA. Seeing the trees through the forest: sequence-based homo- and heteromeric protein-protein interaction sites prediction using random forest. Bioinformatics 2018;33:1479-1487. [PMID: 28073761 DOI: 10.1093/bioinformatics/btx005] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Received: 07/11/2016] [Accepted: 01/06/2017] [Indexed: 11/13/2022] Open

Abstract

Motivation

Genome sequencing is producing an ever-increasing amount of associated protein sequences. Few of these sequences have experimentally validated annotations, however, and computational predictions are becoming increasingly successful in producing such annotations. One key challenge remains the prediction of the amino acids in a given protein sequence that are involved in protein-protein interactions. Such predictions are typically based on machine learning methods that take advantage of the properties and sequence positions of amino acids that are known to be involved in interaction. In this paper, we evaluate the importance of various features using Random Forest (RF), and include as a novel feature backbone flexibility predicted from sequences to further optimise protein interface prediction.

Results

We observe that there is no single sequence feature that enables pinpointing interacting sites in our Random Forest models. However, combining different properties does increase the performance of interface prediction. Our homomeric-trained RF interface predictor is able to distinguish interface from non-interface residues with an area under the ROC curve of 0.72 in a homomeric test-set. The heteromeric-trained RF interface predictor performs better than existing predictors on an independent heteromeric test-set. We trained a more general predictor on the combined homomeric and heteromeric dataset, and show that in addition to predicting homomeric interfaces, it is also able to pinpoint interface residues in heterodimers. This suggests that our random forest model and the features included capture common properties of both homodimer and heterodimer interfaces.

Availability and Implementation

The predictors and test datasets used in our analyses are freely available ( http://www.ibi.vu.nl/downloads/RF_PPI/ ).

Contact

k.a.feenstra@vu.nl.

Supplementary information

Supplementary data are available at Bioinformatics online.
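
The area under the ROC curve quoted in the Results above can be read as the probability that a randomly chosen interface residue receives a higher predicted score than a randomly chosen non-interface residue. A minimal Python sketch of that rank-based definition follows; the function name and the toy labels and scores are made up for illustration and are not data from the paper.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    labels: 1 for interface residues, 0 for non-interface residues.
    scores: predicted interface propensity per residue (higher = more likely).
    Ties between a positive and a negative count as half a correct pair.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Toy example: 2 interface and 3 non-interface residues.
toy_labels = [1, 0, 1, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.3, 0.2]
print(round(roc_auc(toy_labels, toy_scores), 3))  # 0.833
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which the reported 0.72 should be read.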

Zhang C, Bijlard J, Staiger C, Scollen S, van Enckevort D, Hoogstrate Y, Senf A, Hiltemann S, Repo S, Pipping W, Bierkens M, Payralbe S, Stringer B, Heringa J, Stubbs A, Bonino Da Silva Santos LO, Belien J, Weistra W, Azevedo R, van Bochove K, Meijer G, Boiten JW, Rambla J, Fijneman R, Spalding JD, Abeln S. Systematically linking tranSMART, Galaxy and EGA for reusing human translational research data. F1000Res 2017;6. [PMID: 29123641 PMCID: PMC5657030 DOI: 10.12688/f1000research.12168.1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Accepted: 08/14/2017] [Indexed: 01/11/2023] Open

Abstract

The availability of high-throughput molecular profiling techniques has provided more accurate and informative data for regular clinical studies. Nevertheless, complex computational workflows are required to interpret these data. Over the past years, the data volume has been growing explosively, requiring robust human data management to organise and integrate the data efficiently. For this reason, we set up an ELIXIR implementation study, together with the Translational research IT (TraIT) programme, to design a data ecosystem that is able to link raw and interpreted data. In this project, the data from the TraIT Cell Line Use Case (TraIT-CLUC) are used as a test case for this system. Within this ecosystem, we use the European Genome-phenome Archive (EGA) to store raw molecular profiling data; tranSMART to collect interpreted molecular profiling data and clinical data for corresponding samples; and Galaxy to store, run and manage the computational workflows. We can integrate these data by linking their repositories systematically. To showcase our design, we have structured the TraIT-CLUC data, which contain a variety of molecular profiling data types, for storage in both tranSMART and EGA. The metadata provided allows referencing between tranSMART and EGA, fulfilling the cycle of data submission and discovery; we have also designed a data flow from EGA to Galaxy, enabling reanalysis of the raw data in Galaxy. In this way, users can select patient cohorts in tranSMART, trace them back to the raw data and perform (re)analysis in Galaxy. Our conclusion is that the majority of metadata does not necessarily need to be stored (redundantly) in both databases, but that instead FAIR persistent identifiers should be available for well-defined data ontology levels: study, data access committee, physical sample, data sample and raw data file. This approach will pave the way for the stable linkage and reuse of data.

Affiliation(s)

Chao Zhang Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam, 1081 HV, Netherlands
Jochem Bijlard Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam, 1081 HV, Netherlands; The Hyve, Utrecht, 3511 MJ, Netherlands
Christine Staiger SURFsara, Amsterdam, 1098 XG, Netherlands
Serena Scollen ELIXIR Hub, Hinxton, CB10 1SD, UK
David van Enckevort Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, 9712 CP, Netherlands
Youri Hoogstrate Department of Bioinformatics, Erasmus University Medical Center, Rotterdam, 3015 CE, Netherlands
Alexander Senf EMBL-EBI, Hinxton, CB10 1SD, UK
Saskia Hiltemann Department of Bioinformatics, Erasmus University Medical Center, Rotterdam, 3015 CE, Netherlands
Susanna Repo ELIXIR Hub, Hinxton, CB10 1SD, UK
Wibo Pipping The Hyve, Utrecht, 3511 MJ, Netherlands
Mariska Bierkens Netherlands Cancer Institute, Amsterdam, 1066 CX, Netherlands
Stefan Payralbe The Hyve, Utrecht, 3511 MJ, Netherlands
Bas Stringer Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam, 1081 HV, Netherlands
Jaap Heringa Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam, 1081 HV, Netherlands
Andrew Stubbs Department of Bioinformatics, Erasmus University Medical Center, Rotterdam, 3015 CE, Netherlands
Luiz Olavo Bonino Da Silva Santos Dutch Techcentre for Life Sciences, Utrecht, 3521 AL, Netherlands
Jeroen Belien Department of Pathology, VU University Medical Center Amsterdam, Amsterdam, 1081 HV, Netherlands
Ward Weistra The Hyve, Utrecht, 3511 MJ, Netherlands
Rita Azevedo Lygature, Utrecht, 3521 AL, Netherlands
Kees van Bochove The Hyve, Utrecht, 3511 MJ, Netherlands
Gerrit Meijer Netherlands Cancer Institute, Amsterdam, 1066 CX, Netherlands
Jan-Willem Boiten Lygature, Utrecht, 3521 AL, Netherlands
Jordi Rambla Centre for Genomic Regulation (CRG), Barcelona, 08003, Spain
Remond Fijneman Netherlands Cancer Institute, Amsterdam, 1066 CX, Netherlands
J Dylan Spalding EMBL-EBI, Hinxton, CB10 1SD, UK
Sanne Abeln Department of Computer Science, Vrije Universiteit Amsterdam, Amsterdam, 1081 HV, Netherlands

Haydarlou R, Jacobsen A, Bonzanni N, Feenstra KA, Abeln S, Heringa J. BioASF: a framework for automatically generating executable pathway models specified in BioPAX. Bioinformatics 2017;32:i60-i69. [PMID: 27307645 PMCID: PMC4908334 DOI: 10.1093/bioinformatics/btw250] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Indexed: 01/15/2023] Open

van Gool AJ, Bietrix F, Caldenhoven E, Zatloukal K, Scherer A, Litton JE, Meijer G, Blomberg N, Smith A, Mons B, Heringa J, Koot WJ, Smit MJ, Hajduch M, Rijnders T, Ussi A. Bridging the translational innovation gap through good biomarker practice. Nat Rev Drug Discov 2017;16:587-588. [DOI: 10.1038/nrd.2017.72] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Indexed: 02/06/2023]

Hoogstrate Y, Zhang C, Senf A, Bijlard J, Hiltemann S, van Enckevort D, Repo S, Heringa J, Jenster G, Fijneman RJA, Boiten JW, Meijer GA, Stubbs A, Rambla J, Spalding D, Abeln S. Integration of EGA secure data access into Galaxy. F1000Res 2017;5. [PMID: 28232859 PMCID: PMC5302147 DOI: 10.12688/f1000research.10221.1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Accepted: 11/30/2016] [Indexed: 12/31/2022] Open

Abstract

High-throughput molecular profiling techniques are routinely generating vast amounts of data for translational medicine studies. Secure, access-controlled systems are needed to manage, store, transfer and distribute these data because of their personally identifiable nature. The European Genome-phenome Archive (EGA) was created to facilitate access to and management of the long-term archival of bio-molecular data. Each data provider is responsible for ensuring a Data Access Committee is in place to grant access to data stored in the EGA. Moreover, the transfer of data during upload and download is encrypted. ELIXIR, a European research infrastructure for life-science data, initiated a project (2016 Human Data Implementation Study) to understand and document the ELIXIR requirements for secure management of controlled-access data. As part of this project, a full ecosystem was designed to connect archived raw experimental molecular profiling data with interpreted data and the computational workflows, using the CTMM Translational Research IT (CTMM-TraIT) infrastructure (http://www.ctmm-trait.nl) as an example. Here we present the first outcomes of this project, a framework to enable the download of EGA data to a Galaxy server in a secure way. Galaxy provides an intuitive user interface for molecular biologists and bioinformaticians to run and design data analysis workflows. More specifically, we developed a tool, ega_download_streamer, that can download data securely from EGA into a Galaxy server, where the data can subsequently be processed further. This tool will allow a user within the browser to run an entire analysis containing sensitive data from EGA, and to make this analysis available for other researchers in a reproducible manner, as shown with a proof of concept study. The tool ega_download_streamer is available in the Galaxy tool shed: https://toolshed.g2.bx.psu.edu/view/yhoogstrate/ega_download_streamer.

Palma A, Tinti M, Paoluzi S, Santonico E, Brandt BW, Hooft van Huijsduijnen R, Masch A, Heringa J, Schutkowski M, Castagnoli L, Cesareni G. Both Intrinsic Substrate Preference and Network Context Contribute to Substrate Selection of Classical Tyrosine Phosphatases. J Biol Chem 2017;292:4942-4952. [PMID: 28159843 DOI: 10.1074/jbc.m116.757518] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Received: 09/07/2016] [Revised: 01/31/2017] [Indexed: 01/19/2023] Open

Rajendran R, May A, Sherry L, Kean R, Williams C, Jones BL, Burgess KV, Heringa J, Abeln S, Brandt BW, Munro CA, Ramage G. Integrating Candida albicans metabolism with biofilm heterogeneity by transcriptome mapping. Sci Rep 2016;6:35436. [PMID: 27765942 PMCID: PMC5073228 DOI: 10.1038/srep35436] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Received: 07/19/2016] [Accepted: 09/29/2016] [Indexed: 12/20/2022] Open

Heringa J, Reinders M, Abeln S, de Ridder J. ECCB 2016: The 15th European Conference on Computational Biology. Bioinformatics 2016;32:i389-i392. [PMID: 27587653 DOI: 10.1093/bioinformatics/btw481] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 11/13/2022] Open

Hou Q, Lensink MF, Heringa J, Feenstra KA. CLUB-MARTINI: Selecting Favourable Interactions amongst Available Candidates, a Coarse-Grained Simulation Approach to Scoring Docking Decoys. PLoS One 2016;11:e0155251. [PMID: 27166787 PMCID: PMC4864233 DOI: 10.1371/journal.pone.0155251] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Received: 12/21/2015] [Accepted: 04/26/2016] [Indexed: 01/12/2023] Open

Abstract

Large-scale identification of native binding orientations is crucial for understanding the role of protein-protein interactions in their biological context. Measuring binding free energy is the method of choice to estimate binding strength and reveal the relevance of particular conformations in which proteins interact. In a recent study, we successfully applied coarse-grained molecular dynamics simulations to measure binding free energy for two protein complexes with similar accuracy to full-atomistic simulation, but 500-fold less time consuming. Here, we investigate the efficacy of this approach as a scoring method to identify stable binding conformations from thousands of docking decoys produced by protein docking programs. To test our method, we first applied it to calculate binding free energies of all protein conformations in a CAPRI (Critical Assessment of PRedicted Interactions) benchmark dataset, which included over 19000 protein docking solutions for 15 benchmark targets. Based on the binding free energies, we ranked all docking solutions to select the near-native binding modes under the assumption that the native solutions have the lowest binding free energies. In our top 100 ranked structures, for the ‘easy’ targets that have many near-native conformations, we obtain a strong enrichment of acceptable or better quality structures; for the ‘hard’ targets without near-native decoys, our method is still able to retain structures which have native binding contacts. Moreover, in our top 10 selections, CLUB-MARTINI shows a comparable performance when compared with other state-of-the-art docking scoring functions. As a proof of concept, CLUB-MARTINI performs remarkably well for many targets and is able to pinpoint near-native binding modes in the top selections. To the best of our knowledge, this is the first time interaction free energy calculated from MD simulations has been used to rank docking solutions at a large scale.
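
The ranking step described above (sort decoys by binding free energy, keep the top N, check how many are of acceptable quality) can be sketched in Python as below. The dictionary keys, function names and toy values are hypothetical and chosen only for illustration; they are not taken from the CLUB-MARTINI software or the CAPRI benchmark.

```python
def rank_decoys(decoys):
    """Sort decoys from lowest (most favourable) to highest binding free energy."""
    return sorted(decoys, key=lambda d: d["binding_free_energy"])


def top_n_enrichment(ranked, top_n):
    """Fraction of the top_n ranked decoys flagged as acceptable or better."""
    top = ranked[:top_n]
    return sum(1 for d in top if d["acceptable"]) / len(top)


# Toy decoys with made-up free energies and quality flags.
toy_decoys = [
    {"id": "d1", "binding_free_energy": -12.5, "acceptable": True},
    {"id": "d2", "binding_free_energy": -3.1, "acceptable": False},
    {"id": "d3", "binding_free_energy": -9.8, "acceptable": True},
    {"id": "d4", "binding_free_energy": -7.0, "acceptable": False},
]

ranked = rank_decoys(toy_decoys)
print([d["id"] for d in ranked])    # ['d1', 'd3', 'd4', 'd2']
print(top_n_enrichment(ranked, 2))  # 1.0
```

Comparing the enrichment in the top N with the fraction of acceptable decoys in the whole set (0.5 in this toy case) is one simple way to express how much the ranking helps.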

Lelieveld SH, Schütte J, Dijkstra MJJ, Bawono P, Kinston SJ, Göttgens B, Heringa J, Bonzanni N. ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites. Nucleic Acids Res 2016;44:e72. [PMID: 26721389 PMCID: PMC4856970 DOI: 10.1093/nar/gkv1518] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Received: 08/26/2014] [Revised: 12/15/2015] [Accepted: 12/16/2015] [Indexed: 12/23/2022] Open

Hou Q, Dutilh BE, Huynen MA, Heringa J, Feenstra KA. Sequence specificity between interacting and non-interacting homologs identifies interface residues--a homodimer and monomer use case. BMC Bioinformatics 2015;16:325. [PMID: 26449222 PMCID: PMC4599308 DOI: 10.1186/s12859-015-0758-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Received: 03/30/2015] [Accepted: 09/30/2015] [Indexed: 11/17/2022] Open

El-Kebir M, Soueidan H, Hume T, Beisser D, Dittrich M, Müller T, Blin G, Heringa J, Nikolski M, Wessels LFA, Klau GW. xHeinz: an algorithm for mining cross-species network modules under a flexible conservation model. Bioinformatics 2015;31:3147-55. [PMID: 26023104 DOI: 10.1093/bioinformatics/btv316] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 10/20/2014] [Accepted: 05/18/2015] [Indexed: 01/18/2023] Open

Abstract

MOTIVATION

Integrative network analysis methods provide robust interpretations of differential high-throughput molecular profile measurements. They are often used in a biomedical context to generate novel hypotheses about the underlying cellular processes or to derive biomarkers for classification and subtyping. The underlying molecular profiles are frequently measured and validated on animal or cellular models. Therefore the results are not immediately transferable to human. In particular, this is also the case in a study of the recently discovered interleukin-17 producing helper T cells (Th17), which are fundamental for anti-microbial immunity but also known to contribute to autoimmune diseases.

RESULTS

We propose a mathematical model for finding active subnetwork modules that are conserved between two species. These are sets of genes, one for each species, which (i) induce a connected subnetwork in a species-specific interaction network, (ii) show overall differential behavior and (iii) contain a large number of orthologous genes. We propose a flexible notion of conservation, which turns out to be crucial for the quality of the resulting modules in terms of biological interpretability. We propose an algorithm that finds provably optimal or near-optimal conserved active modules in our model. We apply our algorithm to understand the mechanisms underlying Th17 T cell differentiation in both mouse and human. As a main biological result, we find that the key regulation of Th17 differentiation is conserved between human and mouse.

AVAILABILITY AND IMPLEMENTATION

xHeinz, an implementation of our algorithm, as well as all input data and results, are available at http://software.cwi.nl/xheinz and as a Galaxy service at http://services.cbib.u-bordeaux2.fr/galaxy in CBiB Tools.

CONTACT

gunnar.klau@cwi.nl

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

May A, Brandt BW, El-Kebir M, Klau GW, Zaura E, Crielaard W, Heringa J, Abeln S. metaModules identifies key functional subnetworks in microbiome-related disease. Bioinformatics 2015;32:1678-85. [PMID: 26342232 DOI: 10.1093/bioinformatics/btv526] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Received: 04/20/2015] [Accepted: 09/02/2015] [Indexed: 11/13/2022] Open

Affiliation(s)

Ali May Centre for Integrative Bioinformatics VU (IBIVU), VU University Amsterdam, Amsterdam, The Netherlands, Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands, Amsterdam Institute for Molecules Medicines and Systems (AIMMS), VU University Amsterdam, Amsterdam, The Netherlands
Bernd W Brandt Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands
Mohammed El-Kebir Centre for Integrative Bioinformatics VU (IBIVU), VU University Amsterdam, Amsterdam, The Netherlands, Department of Computer Science and Center for Computational Molecular Biology, Brown University, Providence, USA and Life Sciences, Centre for Mathematics and Computer Science (CWI), Amsterdam, The Netherlands
Gunnar W Klau Centre for Integrative Bioinformatics VU (IBIVU), VU University Amsterdam, Amsterdam, The Netherlands, Amsterdam Institute for Molecules Medicines and Systems (AIMMS), VU University Amsterdam, Amsterdam, The Netherlands, Life Sciences, Centre for Mathematics and Computer Science (CWI), Amsterdam, The Netherlands
Egija Zaura Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands
Wim Crielaard Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands
Jaap Heringa Centre for Integrative Bioinformatics VU (IBIVU), VU University Amsterdam, Amsterdam, The Netherlands, Amsterdam Institute for Molecules Medicines and Systems (AIMMS), VU University Amsterdam, Amsterdam, The Netherlands
Sanne Abeln Centre for Integrative Bioinformatics VU (IBIVU), VU University Amsterdam, Amsterdam, The Netherlands, Amsterdam Institute for Molecules Medicines and Systems (AIMMS), VU University Amsterdam, Amsterdam, The Netherlands

Bawono P, van der Velde A, Abeln S, Heringa J. Quantifying the displacement of mismatches in multiple sequence alignment benchmarks. PLoS One 2015;10:e0127431. [PMID: 25993129 PMCID: PMC4438059 DOI: 10.1371/journal.pone.0127431] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 11/20/2014] [Accepted: 04/14/2015] [Indexed: 11/18/2022] Open

Abstract

Multiple Sequence Alignment (MSA) methods are typically benchmarked on sets of reference alignments. The quality of the alignment can then be represented by the sum-of-pairs (SP) or column (CS) scores, which measure the agreement between a reference and corresponding query alignment. Both the SP and CS scores treat mismatches between a query and reference alignment as equally bad, and do not take into account the separation between two amino acids in the query alignment that should have been matched according to the reference alignment. This is significant since the magnitude of alignment shifts is often of relevance in biological analyses, including homology modeling and MSA refinement/manual alignment editing. In this study we develop a new alignment benchmark scoring scheme, SPdist, that takes the degree of discordance of mismatches into account by measuring the sequence distance between mismatched residue pairs in the query alignment. Using this new score along with the standard SP score, we investigate the discriminatory behavior of the new score by assessing how well six different MSA methods perform with respect to BAliBASE reference alignments. The SP score and the SPdist score yield very similar outcomes when the reference and query alignments are close. However, for more divergent reference alignments the SPdist score is able to distinguish between methods that keep alignments approximately close to the reference and those exhibiting larger shifts. We observed that by using SPdist together with SP scoring we were able to better delineate the alignment quality difference between alternative MSA methods. With a case study we exemplify why it is important, from a biological perspective, to consider the separation of mismatches. The SPdist scoring scheme has been implemented in the VerAlign web server (http://www.ibi.vu.nl/programs/veralignwww/). The code for calculating the SPdist score is also available upon request.
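
The contrast drawn above between SP scoring and a distance-aware score can be illustrated with a minimal Python sketch. `sp_score` follows the usual sum-of-pairs definition (the fraction of residue pairs aligned in the reference that are also aligned in the query). `mean_column_shift` is a simplified stand-in invented for this illustration; it is not the published SPdist formula, which the abstract does not spell out. Alignments are assumed to be lists of equal-length gapped strings with `-` as the gap character.

```python
from itertools import combinations


def residue_columns(alignment):
    """For each sequence, map residue index (gaps skipped) to alignment column."""
    columns = []
    for row in alignment:
        mapping, index = {}, 0
        for col, char in enumerate(row):
            if char != "-":
                mapping[index] = col
                index += 1
        columns.append(mapping)
    return columns


def aligned_pairs(alignment):
    """Set of (seq_i, res_i, seq_j, res_j) tuples that share a column."""
    columns = residue_columns(alignment)
    pairs = set()
    for i, j in combinations(range(len(alignment)), 2):
        col_to_res_j = {col: res for res, col in columns[j].items()}
        for res_i, col in columns[i].items():
            if col in col_to_res_j:
                pairs.add((i, res_i, j, col_to_res_j[col]))
    return pairs


def sp_score(reference, query):
    """Fraction of reference residue pairs that are also aligned in the query."""
    ref_pairs = aligned_pairs(reference)
    return len(ref_pairs & aligned_pairs(query)) / len(ref_pairs)


def mean_column_shift(reference, query):
    """Illustrative only: mean query-column offset between residues that the
    reference aligns. Zero when every reference pair is recovered."""
    query_cols = residue_columns(query)
    shifts = [
        abs(query_cols[i][a] - query_cols[j][b])
        for i, a, j, b in aligned_pairs(reference)
    ]
    return sum(shifts) / len(shifts)


reference = ["ACDE", "ACDE"]
near_miss = ["ACDE-", "-ACDE"]      # shifted by one column
far_miss = ["ACDE---", "---ACDE"]   # shifted by three columns

print(sp_score(reference, near_miss), sp_score(reference, far_miss))  # 0.0 0.0
print(mean_column_shift(reference, near_miss))  # 1.0
print(mean_column_shift(reference, far_miss))   # 3.0
```

Both query alignments score zero on SP, yet the shift measure separates the near miss from the far miss, which is the kind of distinction the abstract attributes to SPdist.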

May A, Abeln S, Buijs MJ, Heringa J, Crielaard W, Brandt BW. NGS-eval: NGS Error analysis and novel sequence VAriant detection tooL. Nucleic Acids Res 2015;43:W301-5. [PMID: 25878034 PMCID: PMC4489229 DOI: 10.1093/nar/gkv346] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Received: 01/30/2015] [Accepted: 04/03/2015] [Indexed: 02/04/2023] Open

El-Kebir M, Brandt BW, Heringa J, Klau GW. NatalieQ: a web server for protein-protein interaction network querying. BMC Syst Biol 2014;8:40. [PMID: 24690407 PMCID: PMC3998945 DOI: 10.1186/1752-0509-8-40] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Received: 04/11/2013] [Accepted: 03/20/2014] [Indexed: 01/17/2023]

El-Kebir M, Marschall T, Wohlers I, Patterson M, Heringa J, Schönhuth A, Klau GW. Mapping proteins in the presence of paralogs using units of coevolution. BMC Bioinformatics 2014;14 Suppl 15:S18. [PMID: 24564758 PMCID: PMC3852051 DOI: 10.1186/1471-2105-14-s15-s18] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 11/18/2022] Open

May A, Abeln S, Crielaard W, Heringa J, Brandt BW. Unraveling the outcome of 16S rDNA-based taxonomy analysis through mock data and simulations. Bioinformatics 2014;30:1530-8. [PMID: 24519382 DOI: 10.1093/bioinformatics/btu085] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Indexed: 01/19/2023]

Affiliation(s)

Ali May Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands, Centre for Integrative Bioinformatics VU and AIMMS Amsterdam Institute for Molecules Medicines and Systems, VU University Amsterdam, Amsterdam, The Netherlands and NBIC Netherlands Bioinformatics Centre, Nijmegen, The Netherlands
Sanne Abeln Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands, Centre for Integrative Bioinformatics VU and AIMMS Amsterdam Institute for Molecules Medicines and Systems, VU University Amsterdam, Amsterdam, The Netherlands and NBIC Netherlands Bioinformatics Centre, Nijmegen, The Netherlands
Wim Crielaard Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands, Centre for Integrative Bioinformatics VU and AIMMS Amsterdam Institute for Molecules Medicines and Systems, VU University Amsterdam, Amsterdam, The Netherlands and NBIC Netherlands Bioinformatics Centre, Nijmegen, The Netherlands
Jaap Heringa Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands, Centre for Integrative Bioinformatics VU and AIMMS Amsterdam Institute for Molecules Medicines and Systems, VU University Amsterdam, Amsterdam, The Netherlands and NBIC Netherlands Bioinformatics Centre, Nijmegen, The Netherlands
Bernd W Brandt Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands, Centre for Integrative Bioinformatics VU and AIMMS Amsterdam Institute for Molecules Medicines and Systems, VU University Amsterdam, Amsterdam, The Netherlands and NBIC Netherlands Bioinformatics Centre, Nijmegen, The Netherlands

Bawono P, Heringa J. PRALINE: a versatile multiple sequence alignment toolkit. Methods Mol Biol 2014;1079:245-62. [PMID: 24170407 DOI: 10.1007/978-1-62703-646-7_16] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Indexed: 12/03/2022]

Bonzanni N, Garg A, Feenstra KA, Schütte J, Kinston S, Miranda-Saavedra D, Heringa J, Xenarios I, Göttgens B. Hard-wired heterogeneity in blood stem cells revealed using a dynamic regulatory network model. Bioinformatics 2013;29:i80-8. [PMID: 23813012 PMCID: PMC3694641 DOI: 10.1093/bioinformatics/btt243] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.5] [Indexed: 12/11/2022] Open

Abstract

Motivation: Combinatorial interactions of transcription factors with cis-regulatory elements control the dynamic progression through successive cellular states and thus underpin all metazoan development. The construction of network models of cis-regulatory elements, therefore, has the potential to generate fundamental insights into cellular fate and differentiation. Haematopoiesis has long served as a model system to study mammalian differentiation, yet modelling based on experimentally informed cis-regulatory interactions has so far been restricted to pairs of interacting factors. Here, we have generated a Boolean network model based on detailed cis-regulatory functional data connecting 11 haematopoietic stem/progenitor cell (HSPC) regulator genes.

Results: Despite its apparent simplicity, the model exhibits surprisingly complex behaviour that we charted using strongly connected components and shortest-path analysis in its Boolean state space. This analysis of our model predicts that HSPCs display heterogeneous expression patterns and possess many intermediate states that can act as ‘stepping stones’ for the HSPC to achieve a final differentiated state. Importantly, an external perturbation or ‘trigger’ is required to exit the stem cell state, with distinct triggers characterizing maturation into the various different lineages. By focusing on intermediate states occurring during erythrocyte differentiation, from our model we predicted a novel negative regulation of Fli1 by Gata1, which we confirmed experimentally thus validating our model. In conclusion, we demonstrate that an advanced mammalian regulatory network model based on experimentally validated cis-regulatory interactions has allowed us to make novel, experimentally testable hypotheses about transcriptional mechanisms that control differentiation of mammalian stem cells.

Contact: j.heringa@vu.nl or ioannis.xenarios@isb-sib.ch or bg200@cam.ac.uk

Supplementary information: Supplementary data are available at Bioinformatics online.
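
As a rough illustration of what exploring a Boolean state space involves, the sketch below enumerates every state of a three-gene toy network and collects its attractors under synchronous updating. Everything here is an assumption made for illustration: the genes `A`, `B` and `C` and their rules are hypothetical placeholders, not the 11 HSPC regulators or the curated interactions of the paper, and synchronous updating with attractor search is a simpler technique than the strongly connected component and shortest-path analysis the abstract describes.

```python
from itertools import product

# Hypothetical toy rules (placeholders, not taken from the paper).
RULES = {
    "A": lambda s: s["A"] and not s["C"],  # A sustains itself unless C is on
    "B": lambda s: s["A"] or s["B"],       # B is switched on by A and then stays on
    "C": lambda s: s["C"] and not s["A"],  # C sustains itself unless A is on
}
GENES = tuple(RULES)


def step(state):
    """Apply all rules at once (synchronous update) to a tuple of booleans."""
    named = dict(zip(GENES, state))
    return tuple(bool(RULES[g](named)) for g in GENES)


def attractor(state):
    """Follow the trajectory from `state` until it revisits a state; return the cycle."""
    seen = []
    while state not in seen:
        seen.append(state)
        state = step(state)
    return tuple(seen[seen.index(state):])


def all_attractors():
    """Collect the distinct attractors reachable from every possible state."""
    found = set()
    for state in product((False, True), repeat=len(GENES)):
        cycle = attractor(state)
        k = cycle.index(min(cycle))  # rotate so each cycle has one canonical form
        found.add(cycle[k:] + cycle[:k])
    return found
```

In this toy network every attractor is a steady state. The state with A and C both on is transient and falls into the state where only B is on, which loosely mirrors the idea of intermediate states on the way to a stable expression pattern.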

Gijsbers EF, Feenstra KA, van Nuenen AC, Navis M, Heringa J, Schuitemaker H, Kootstra NA. HIV-1 replication fitness of HLA-B*57/58:01 CTL escape variants is restored by the accumulation of compensatory mutations in gag. PLoS One 2013;8:e81235. [PMID: 24339913 PMCID: PMC3855271 DOI: 10.1371/journal.pone.0081235] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Received: 04/29/2013] [Accepted: 10/10/2013] [Indexed: 11/30/2022] Open

May A, Pool R, van Dijk E, Bijlard J, Abeln S, Heringa J, Feenstra KA. Coarse-grained versus atomistic simulations: realistic interaction free energies for real proteins. Bioinformatics 2013;30:326-34. [PMID: 24273239 DOI: 10.1093/bioinformatics/btt675] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Indexed: 11/14/2022]

van den Kerkhof TLGM, Feenstra KA, Euler Z, van Gils MJ, Rijsdijk LWE, Boeser-Nunnink BD, Heringa J, Schuitemaker H, Sanders RW. HIV-1 envelope glycoprotein signatures that correlate with the development of cross-reactive neutralizing activity. Retrovirology 2013;10:102. [PMID: 24059682 PMCID: PMC3849187 DOI: 10.1186/1742-4690-10-102] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Received: 05/13/2013] [Accepted: 09/12/2013] [Indexed: 01/08/2023] Open

Affiliation(s)

Tom L G M van den Kerkhof Department of Experimental Immunology and Landsteiner Laboratory, Academic Medical Center, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands
K Anton Feenstra Center for Integrative Bioinformatics VU (IBIVU) and Amsterdam Institute for Molecules, Medicine and Systems (AIMMS), VU University Amsterdam, 1081 HV Amsterdam, the Netherlands Netherlands Bioinformatics Center (NBIC), 6525 GA Nijmegen, the Netherlands
Zelda Euler Department of Experimental Immunology and Landsteiner Laboratory, Academic Medical Center, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands
Marit J van Gils Department of Experimental Immunology and Landsteiner Laboratory, Academic Medical Center, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands Department of Medical Microbiology, Academic Medical Center, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands
Linda W E Rijsdijk Center for Integrative Bioinformatics VU (IBIVU) and Amsterdam Institute for Molecules, Medicine and Systems (AIMMS), VU University Amsterdam, 1081 HV Amsterdam, the Netherlands
Brigitte D Boeser-Nunnink Department of Experimental Immunology and Landsteiner Laboratory, Academic Medical Center, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands
Jaap Heringa Center for Integrative Bioinformatics VU (IBIVU) and Amsterdam Institute for Molecules, Medicine and Systems (AIMMS), VU University Amsterdam, 1081 HV Amsterdam, the Netherlands Netherlands Bioinformatics Center (NBIC), 6525 GA Nijmegen, the Netherlands Department of Medical Microbiology, Academic Medical Center, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands
Hanneke Schuitemaker Department of Experimental Immunology and Landsteiner Laboratory, Academic Medical Center, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands Crucell Holland BV, 2333 CN Leiden, the Netherlands
Rogier W Sanders Department of Medical Microbiology, Academic Medical Center, University of Amsterdam, 1105 AZ Amsterdam, the Netherlands Department of Microbiology and Immunology, Weill Medical College, Cornell University, New York, NY 10065 USA

Hettling H, Alders DJC, Heringa J, Binsl TW, Groeneveld ABJ, van Beek JHGM. Computational estimation of tricarboxylic acid cycle fluxes using noisy NMR data from cardiac biopsies. BMC Syst Biol 2013;7:82. [PMID: 23965343 PMCID: PMC3765389 DOI: 10.1186/1752-0509-7-82] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Received: 01/31/2013] [Accepted: 08/15/2013] [Indexed: 11/16/2022]

Abstract

Background

The aerobic energy metabolism of cardiac muscle cells is of major importance for the contractile function of the heart. Because energy metabolism is very heterogeneously distributed in heart tissue, especially during coronary disease, a method to quantify metabolic fluxes in small tissue samples is desirable. Taking tissue biopsies after infusion of substrates labeled with stable carbon isotopes makes this possible in animal experiments. However, the appreciable noise level in NMR spectra of extracted tissue samples makes computational estimation of metabolic fluxes challenging and a good method to define confidence regions was not yet available.

Results

Here we present a computational analysis method for nuclear magnetic resonance (NMR) measurements of tricarboxylic acid (TCA) cycle metabolites. The method was validated using measurements on extracts of single tissue biopsies taken from porcine heart in vivo. Isotopic enrichment of glutamate was measured by NMR spectroscopy in tissue samples taken at a single time point after the timed infusion of ¹³C labeled substrates for the TCA cycle. The NMR intensities for glutamate were analyzed with a computational model describing carbon transitions in the TCA cycle and carbon exchange with amino acids. The model dynamics depended on five flux parameters, which were optimized to fit the NMR measurements. To determine confidence regions for the estimated fluxes, we used the Metropolis-Hastings algorithm for Markov chain Monte Carlo (MCMC) sampling to generate extensive ensembles of feasible flux combinations that describe the data within measurement precision limits. To validate our method, we compared myocardial oxygen consumption calculated from the TCA cycle flux with in vivo blood gas measurements for 38 hearts under several experimental conditions, e.g. during coronary artery narrowing.

Conclusions

Despite the appreciable NMR noise level, the oxygen consumption in the tissue samples, estimated from the NMR spectra, correlates with blood-gas oxygen uptake measurements for the whole heart. The MCMC method provides confidence regions for the estimated metabolic fluxes in single cardiac biopsies, taking the quantified measurement noise level and the nonlinear dependencies between parameters fully into account.
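
The way Metropolis-Hastings sampling yields a confidence region for a fitted parameter can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the authors' implementation: it replaces the five-flux TCA cycle model with a hypothetical one-parameter saturation curve `1 - exp(-k * t)`, assumes Gaussian measurement noise of known standard deviation `sigma`, and uses a flat prior on positive `k`. All names and default values are illustrative.

```python
import math
import random


def log_likelihood(k, times, observed, sigma):
    """Gaussian log-likelihood (up to a constant) of the toy model 1 - exp(-k t)."""
    if k <= 0:
        return -math.inf  # flat prior restricted to positive rates
    residuals = (y - (1.0 - math.exp(-k * t)) for t, y in zip(times, observed))
    return -sum(r * r for r in residuals) / (2.0 * sigma ** 2)


def metropolis_hastings(times, observed, sigma, n_steps=20000,
                        step_size=0.05, k0=1.0, seed=0):
    """Random-walk Metropolis-Hastings over the single parameter k."""
    rng = random.Random(seed)
    k, ll = k0, log_likelihood(k0, times, observed, sigma)
    samples = []
    for _ in range(n_steps):
        proposal = k + rng.gauss(0.0, step_size)  # symmetric proposal
        ll_new = log_likelihood(proposal, times, observed, sigma)
        # Accept with probability min(1, L_new / L_old).
        if rng.random() < math.exp(min(0.0, ll_new - ll)):
            k, ll = proposal, ll_new
        samples.append(k)
    return samples


def credible_interval(samples, burn_in=2000, level=0.95):
    """Central interval from the sorted post-burn-in samples."""
    kept = sorted(samples[burn_in:])
    lo = kept[int((1.0 - level) / 2.0 * len(kept))]
    hi = kept[int((1.0 + level) / 2.0 * len(kept)) - 1]
    return lo, hi
```

A typical use would be to pass measured time points and intensities to `metropolis_hastings` and read the interval from `credible_interval`. With several parameters the same accept/reject loop applies, and the retained samples then trace out the joint confidence region, including nonlinear dependencies between parameters.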

Schütte J, Bonzanni N, Kinston S, Lelieveld S, Moignard V, Heringa J, Feenstra A, Gottgens B. Reconstructing a core regulatory network model for blood stem/progenitor cells. Exp Hematol 2013. [DOI: 10.1016/j.exphem.2013.05.266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/26/2022]

Abeln S, Molenaar D, Feenstra KA, Hoefsloot HCJ, Teusink B, Heringa J. Bioinformatics and systems biology: bridging the gap between heterogeneous student backgrounds. Brief Bioinform 2013;14:589-98. [PMID: 23603092 DOI: 10.1093/bib/bbt023] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Indexed: 11/14/2022] Open

Pool R, Heringa J, Hoefling M, Schulz R, Smith JC, Feenstra KA. Enabling grand-canonical Monte Carlo: Extending the flexibility of GROMACS through the GromPy python interface module. J Comput Chem 2012;33:1207-14. [DOI: 10.1002/jcc.22947] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 07/12/2011] [Revised: 12/21/2011] [Accepted: 01/09/2012] [Indexed: 11/06/2022]

Binsl TW, De Graaf AA, Venema K, Heringa J, Maathuis A, De Waard P, Van Beek JHGM. Measuring non-steady-state metabolic fluxes in starch-converting faecal microbiota in vitro. Benef Microbes 2011;1:391-405. [PMID: 21831778 DOI: 10.3920/bm2010.0038] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Indexed: 11/19/2022]

Abstract

This paper explores human gut bacterial metabolism of starch using a combined analytical and computational modelling approach for metabolite and flux analysis. Non-steady-state isotopic labelling experiments were performed with human faecal microbiota in a well-established in vitro model of the human colon. After culture stabilisation, [U-13C] starch was added and samples were taken at regular intervals. Metabolite concentrations and 13C isotopomeric distributions were measured amongst other things for acetate, propionate and butyrate by mass spectrometry and NMR. The vast majority of metabolic flux analysis methods based on isotopomer analysis published to date are not applicable to metabolic non-steady-state experiments. We therefore developed a new ordinary differential equation-based representation of a metabolic model of human faecal microbiota to determine eleven metabolic parameters that characterised the metabolic flux distribution in the isotope labelling experiment. The feasibility of the model parameter quantification was demonstrated on noisy in silico data using a downhill simplex optimisation, matching simulated labelling patterns of isotopically labelled metabolites with measured metabolite and isotope labelling data. Using the experimental data, we determined an increasing net label influx from starch during the experiment from 94±1 µmol/l/min to 133±3 µmol/l/min. Only about 12% of the total carbon flux from starch reached propionate. Propionate production mainly proceeded via succinate with a small contribution via acrylate. The remaining flux from starch yielded acetate (35%) and butyrate (53%). Interpretation of 13C NMR multiplet signals further revealed that butyrate, valerate and caproate were mainly synthesised via cross-feeding, using acetate as a co-substrate. This study demonstrates for the first time that the experimental design and the analysis of the results by computational modelling allows the determination of time-resolved effects of nutrition on the flux distribution within human faecal microbiota in metabolic non-steady-state.