1
|
Balaur I, Nickerson DP, Welter D, Wodke JA, Ancien F, Gebhardt T, Grouès V, Hermjakob H, König M, Radde N, Rougny A, Schneider R, Malik-Sheriff RS, Shiferaw KB, Stefan M, Satagopam V, Waltemath D. FAIRification of computational models in biology. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.21.644517. [PMID: 40196669 PMCID: PMC11974689 DOI: 10.1101/2025.03.21.644517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
Computational models are essential for studying complex systems which, particularly in clinical settings, need to be quality-approved and transparent. To enhance the communication of a model's features and capabilities, we propose an adaptation of the Findability, Accessibility, Interoperability and Reusability (FAIR) indicators published by the Research Data Alliance to assess models encoded in domain-specific standards, such as those established by COMBINE. The assessments guide FAIRification and add value to models.
Collapse
Affiliation(s)
- Irina Balaur
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg
| | | | - Danielle Welter
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg
- Luxembourg National Data Service (LNDS), Luxembourg
| | - Judith A.H. Wodke
- Medical Informatics Laboratory, University Medicine Greifswald, Germany
| | - Francois Ancien
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg
| | - Tom Gebhardt
- Medical Informatics Laboratory, University Medicine Greifswald, Germany
| | - Valentin Grouès
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), UK
| | - Matthias König
- Institute for Biology, Institute for Theoretical Biology, Humboldt University of Berlin, Germany
| | - Nicole Radde
- Institute for Stochastics and Applications, University Stuttgart, Germany
| | - Adrien Rougny
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg
| | | | | | | | - Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg
| | - Dagmar Waltemath
- Medical Informatics Laboratory, University Medicine Greifswald, Germany
| |
Collapse
|
2
|
van Niekerk DD, Rust E, Bruggeman F, Westerhoff HV, Snoep JL. Is distance from equilibrium a good indicator for a reaction's flux control? Biosystems 2023; 232:104988. [PMID: 37541333 DOI: 10.1016/j.biosystems.2023.104988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 07/26/2023] [Accepted: 07/27/2023] [Indexed: 08/06/2023]
Abstract
By analysing a large set of models obtained from the JWS Online and Biomodels databases, we tested to what extent the disequilibrium ratio can be used as an estimator for the flux control of a reaction, a discussion point that was already raised by Kacser and Burns, and Heinrich and Rapoport in their seminal MCA manuscripts. Whereas no functional relation was observed, the disequilibrium ratio can be used as an estimator for the maximal flux control of a reaction step. We extended the original analysis of the relationship by incorporating the overall pathway disequilibrium ratio in the expression, which made it possible to make explicit expressions for flux control coefficients.
Collapse
Affiliation(s)
| | - Erik Rust
- Department of Biochemistry, Stellenbosch University, South Africa
| | - Frank Bruggeman
- Molecular Cell Biology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Hans V Westerhoff
- Molecular Cell Biology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands; Swammerdam Institute for Life Sciences, University of Amsterdam, Sciencepark 904, 1098 XH Amsterdam, The Netherlands; School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester M13 9PT, UK; Stellenbosch Institute of Advanced Studies (STIAS), Wallenberg Research Centre at Stellenbosch University, Stellenbosch 7600, South Africa
| | - Jacky L Snoep
- Department of Biochemistry, Stellenbosch University, South Africa; Molecular Cell Biology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands.
| |
Collapse
|
3
|
Malik-Sheriff RS, Glont M, Nguyen TVN, Tiwari K, Roberts MG, Xavier A, Vu MT, Men J, Maire M, Kananathan S, Fairbanks EL, Meyer JP, Arankalle C, Varusai TM, Knight-Schrijver V, Li L, Dueñas-Roca C, Dass G, Keating SM, Park YM, Buso N, Rodriguez N, Hucka M, Hermjakob H. BioModels-15 years of sharing computational models in life science. Nucleic Acids Res 2020; 48:D407-D415. [PMID: 31701150 PMCID: PMC7145643 DOI: 10.1093/nar/gkz1055] [Citation(s) in RCA: 151] [Impact Index Per Article: 30.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2019] [Revised: 10/22/2019] [Accepted: 11/06/2019] [Indexed: 01/05/2023] Open
Abstract
Computational modelling has become increasingly common in life science research. To provide a platform to support universal sharing, easy accessibility and model reproducibility, BioModels (https://www.ebi.ac.uk/biomodels/), a repository for mathematical models, was established in 2005. The current BioModels platform allows submission of models encoded in diverse modelling formats, including SBML, CellML, PharmML, COMBINE archive, MATLAB, Mathematica, R, Python or C++. The models submitted to BioModels are curated to verify the computational representation of the biological process and the reproducibility of the simulation results in the reference publication. The curation also involves encoding models in standard formats and annotation with controlled vocabularies following MIRIAM (minimal information required in the annotation of biochemical models) guidelines. BioModels now accepts large-scale submission of auto-generated computational models. With gradual growth in content over 15 years, BioModels currently hosts about 2000 models from the published literature. With about 800 curated models, BioModels has become the world’s largest repository of curated models and emerged as the third most used data resource after PubMed and Google Scholar among the scientists who use modelling in their research. Thus, BioModels benefits modellers by providing access to reliable and semantically enriched curated models in standard formats that are easy to share, reproduce and reuse.
Collapse
Affiliation(s)
- Rahuman S Malik-Sheriff
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mihai Glont
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tung V N Nguyen
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Krishna Tiwari
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.,Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Matthew G Roberts
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ashley Xavier
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Manh T Vu
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jinghao Men
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matthieu Maire
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sarubini Kananathan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Emma L Fairbanks
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Johannes P Meyer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Chinmay Arankalle
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thawfeek M Varusai
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Lu Li
- Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Corina Dueñas-Roca
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gaurhari Dass
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sarah M Keating
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Young M Park
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nicola Buso
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nicolas Rodriguez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.,Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Michael Hucka
- California Institute of Technology, Pasadena, 91125, CA, USA
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Lifeomics, National Center for Protein Sciences (The PHOENIX Center, Beijing), Beijing 102206, China
| |
Collapse
|
4
|
Neal ML, König M, Nickerson D, Mısırlı G, Kalbasi R, Dräger A, Atalag K, Chelliah V, Cooling MT, Cook DL, Crook S, de Alba M, Friedman SH, Garny A, Gennari JH, Gleeson P, Golebiewski M, Hucka M, Juty N, Myers C, Olivier BG, Sauro HM, Scharm M, Snoep JL, Touré V, Wipat A, Wolkenhauer O, Waltemath D. Harmonizing semantic annotations for computational models in biology. Brief Bioinform 2019; 20:540-550. [PMID: 30462164 PMCID: PMC6433895 DOI: 10.1093/bib/bby087] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Revised: 08/08/2018] [Accepted: 08/17/2018] [Indexed: 02/06/2023] Open
Abstract
Life science researchers use computational models to articulate and test hypotheses about the behavior of biological systems. Semantic annotation is a critical component for enhancing the interoperability and reusability of such models as well as for the integration of the data needed for model parameterization and validation. Encoded as machine-readable links to knowledge resource terms, semantic annotations describe the computational or biological meaning of what models and data represent. These annotations help researchers find and repurpose models, accelerate model composition and enable knowledge integration across model repositories and experimental data stores. However, realizing the potential benefits of semantic annotation requires the development of model annotation standards that adhere to a community-based annotation protocol. Without such standards, tool developers must account for a variety of annotation formats and approaches, a situation that can become prohibitively cumbersome and which can defeat the purpose of linking model elements to controlled knowledge resource terms. Currently, no consensus protocol for semantic annotation exists among the larger biological modeling community. Here, we report on the landscape of current annotation practices among the COmputational Modeling in BIology NEtwork community and provide a set of recommendations for building a consensus approach to semantic annotation.
Collapse
Affiliation(s)
- Maxwell Lewis Neal
- Seattle Children’s Research Institute, Center for Global Infectious Disease Research, Seattle, USA
| | - Matthias König
- Department of Biology, Humboldt-University Berlin, Institute for Theoretical Biology, Berlin, Germany
| | - David Nickerson
- Auckland Bioengineering Institute, University of Auckland, Auckland, NZ
| | - Göksel Mısırlı
- School of Computing and Mathematics, Keele University, Keele, UK
| | - Reza Kalbasi
- Auckland Bioengineering Institute, University of Auckland, Auckland, NZ
| | - Andreas Dräger
- Computational Systems Biology of Infection and Antimicrobial-Resistant Pathogens, Center for Bioinformatics Tübingen (ZBIT), University of Tübingen, Tübingen, Germany
- Department of Computer Science, University of Tübingen, Tübingen, Germany
| | - Koray Atalag
- Auckland Bioengineering Institute, University of Auckland, Auckland, NZ
| | - Vijayalakshmi Chelliah
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Michael T Cooling
- Auckland Bioengineering Institute, University of Auckland, Auckland, NZ
| | - Daniel L Cook
- Department of Physiology and Biophysics, University of Washington, Seattle, WA, USA
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
| | - Sharon Crook
- School of Mathematical and Statistical Sciences, Arizona State University, Tempe, USA
| | - Miguel de Alba
- German Federal Institute for Risk Assessment, Berlin, Germany
| | | | - Alan Garny
- Auckland Bioengineering Institute, University of Auckland, Auckland, NZ
| | - John H Gennari
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
| | - Padraig Gleeson
- Department of Neuroscience, Physiology and Pharmacology, University College London, London, UK
| | - Martin Golebiewski
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Heidelberg, Germany
| | - Michael Hucka
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA
| | - Nick Juty
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Chris Myers
- Department of Electrical and Computer Engineering, University of Utah, Salt Lake City, UT, USA
| | - Brett G Olivier
- Systems Bioinformatics, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
- Modelling of Biological Processes, BioQUANT/COS, Heidelberg University, Germany
| | - Herbert M Sauro
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Martin Scharm
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Germany
| | - Jacky L Snoep
- Department of Biochemistry, Stellenbosch University, Matieland, South Africa
- Department of Molecular Cell Physiology, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
- Manchester Institute for Biotechnology, University of Manchester, Manchester, UK
| | - Vasundra Touré
- Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
| | - Anil Wipat
- School of Computing Science, Newcastle University, Newcastle upon Tyne, UK
| | - Olaf Wolkenhauer
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Germany
- Stellenbosch Institute for Advanced Study (STIAS), Stellenbosch, South Africa
| | - Dagmar Waltemath
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Germany
| |
Collapse
|
5
|
Viswan NA, HarshaRani GV, Stefan MI, Bhalla US. FindSim: A Framework for Integrating Neuronal Data and Signaling Models. Front Neuroinform 2018; 12:38. [PMID: 29997492 PMCID: PMC6028806 DOI: 10.3389/fninf.2018.00038] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 06/05/2018] [Indexed: 12/30/2022] Open
Abstract
Current experiments touch only small but overlapping parts of very complex subcellular signaling networks in neurons. Even with modern optical reporters and pharmacological manipulations, a given experiment can only monitor and control a very small subset of the diverse, multiscale processes of neuronal signaling. We have developed FindSim (Framework for Integrating Neuronal Data and SIgnaling Models) to anchor models to structured experimental datasets. FindSim is a framework for integrating many individual electrophysiological and biochemical experiments with large, multiscale models so as to systematically refine and validate the model. We use a structured format for encoding the conditions of many standard physiological and pharmacological experiments, specifying which parts of the model are involved, and comparing experiment outcomes with model output. A database of such experiments is run against successive generations of composite cellular models to iteratively improve the model against each experiment, while retaining global model validity. We suggest that this toolchain provides a principled and scalable way to tackle model complexity and diversity of data sources.
Collapse
Affiliation(s)
- Nisha A Viswan
- National Centre for Biological Sciences, Bangalore, India.,Tata Institute of Fundamental Research, The University of Trans-Disciplinary Health Sciences and Technology, Bangalore, India
| | | | - Melanie I Stefan
- Centre for Discovery Brain Sciences, University of Edinburgh, Edinburgh, United Kingdom.,ZJU-UoE Institute, Zhejiang University, Hangzhou, China
| | | |
Collapse
|
6
|
Lambusch F, Waltemath D, Wolkenhauer O, Sandkuhl K, Rosenke C, Henkel R. Identifying frequent patterns in biochemical reaction networks: a workflow. Database (Oxford) 2018; 2018:5048438. [PMID: 29992320 PMCID: PMC6030809 DOI: 10.1093/database/bay051] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Revised: 04/27/2018] [Accepted: 04/29/2018] [Indexed: 11/15/2022]
Abstract
Computational models in biology encode molecular and cell biological processes. Many of these models can be represented as biochemical reaction networks. Studying such networks, one is mostly interested in systems that share similar reactions and mechanisms. Typical goals of an investigation thus include understanding of model parts, identification of reoccurring patterns and recognition of biologically relevant motifs. The large number and size of available models, however, require automated methods to support researchers in achieving their goals. Specifically for the problem of finding patterns in large networks only partial solutions exist. We propose a workflow that identifies frequent structural patterns in biochemical reaction networks encoded in the Systems Biology Markup Language. The workflow utilizes a subgraph mining algorithm to detect the network patterns. Once patterns are identified, the textual pattern description can automatically be converted into a graphical representation. Furthermore, information about the distribution of patterns among a selected set of models can be retrieved. The workflow was validated with 575 models from the curated branch of BioModels. In this paper, we highlight interesting and frequent structural patterns. Furthermore, we provide exemplary patterns that incorporate terms from the Systems Biology Ontology. Our workflow can be applied to a custom set of models or to models already existing in our graph database MaSyMoS. The occurrences of frequent patterns may give insight into the encoding of central biological processes, evaluate postulated biological motifs or serve as a similarity measure for models that share common structures.Database URL: https://github.com/FabienneL/BioNet-Mining.
Collapse
Affiliation(s)
- Fabienne Lambusch
- Business Information Systems, University of Rostock, Rostock, Mecklenburg-Vorpommern, Germany
| | - Dagmar Waltemath
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Mecklenburg-Vorpommern, Germany
| | - Olaf Wolkenhauer
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Mecklenburg-Vorpommern, Germany
- Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre, Stellenbosch University, Stellenbosch, South Africa
| | - Kurt Sandkuhl
- Business Information Systems, University of Rostock, Rostock, Mecklenburg-Vorpommern, Germany
- ITMO University, 49 Kronverksky Pr., St. Petersburg, Russia
| | - Christian Rosenke
- Visual Computing and Computer Graphics, University of Rostock, Rostock, Mecklenburg-Vorpommern, Germany
| | - Ron Henkel
- Scientific Databases and Visualization, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| |
Collapse
|
7
|
Henkel R, Hoehndorf R, Kacprowski T, Knüpfer C, Liebermeister W, Waltemath D. Notions of similarity for systems biology models. Brief Bioinform 2018; 19:77-88. [PMID: 27742665 PMCID: PMC5862271 DOI: 10.1093/bib/bbw090] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Revised: 08/28/2016] [Indexed: 01/23/2023] Open
Abstract
Systems biology models are rapidly increasing in complexity, size and numbers. When building large models, researchers rely on software tools for the retrieval, comparison, combination and merging of models, as well as for version control. These tools need to be able to quantify the differences and similarities between computational models. However, depending on the specific application, the notion of 'similarity' may greatly vary. A general notion of model similarity, applicable to various types of models, is still missing. Here we survey existing methods for the comparison of models, introduce quantitative measures for model similarity, and discuss potential applications of combined similarity measures. To frame model comparison as a general problem, we describe a theoretical approach to defining and computing similarities based on a combination of different model aspects. The six aspects that we define as potentially relevant for similarity are underlying encoding, references to biological entities, quantitative behaviour, qualitative behaviour, mathematical equations and parameters and network structure. We argue that future similarity measures will benefit from combining these model aspects in flexible, problem-specific ways to mimic users' intuition about model similarity, and to support complex model searches in databases.
Collapse
Affiliation(s)
| | | | | | | | | | - Dagmar Waltemath
- Department of Systems Biology and Bioinformatics, Institute of Computer Science, University of Rostock, Rostock, Germany
| |
Collapse
|
8
|
Dongliang X, Jingchang P, Bailing W. Multiple kernels learning-based biological entity relationship extraction method. J Biomed Semantics 2017; 8:38. [PMID: 29297359 PMCID: PMC5763518 DOI: 10.1186/s13326-017-0138-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Background Automatic extracting protein entity interaction information from biomedical literature can help to build protein relation network and design new drugs. There are more than 20 million literature abstracts included in MEDLINE, which is the most authoritative textual database in the field of biomedicine, and follow an exponential growth over time. This frantic expansion of the biomedical literature can often be difficult to absorb or manually analyze. Thus efficient and automated search engines are necessary to efficiently explore the biomedical literature using text mining techniques. Results The P, R, and F value of tag graph method in Aimed corpus are 50.82, 69.76, and 58.61%, respectively. The P, R, and F value of tag graph kernel method in other four evaluation corpuses are 2–5% higher than that of all-paths graph kernel. And The P, R and F value of feature kernel and tag graph kernel fuse methods is 53.43, 71.62 and 61.30%, respectively. The P, R and F value of feature kernel and tag graph kernel fuse methods is 55.47, 70.29 and 60.37%, respectively. It indicated that the performance of the two kinds of kernel fusion methods is better than that of simple kernel. Conclusion In comparison with the all-paths graph kernel method, the tag graph kernel method is superior in terms of overall performance. Experiments show that the performance of the multi-kernels method is better than that of the three separate single-kernel method and the dual-mutually fused kernel method used hereof in five corpus sets.
Collapse
Affiliation(s)
- Xu Dongliang
- School of Mechanical, Electrical and Information Engineering, ShanDong University, WenHua West Road, WeiHai, 264209, China
| | - Pan Jingchang
- School of Mechanical, Electrical and Information Engineering, ShanDong University, WenHua West Road, WeiHai, 264209, China.
| | - Wang Bailing
- School of Computer Science and Technology, Harbin Institute of Technology, WenHua West Road, WeiHai, 264209, China
| |
Collapse
|