1
|
Nakulugamuwa Gamage H, Chetty M, Lim S, Hallinan J. MICFuzzy: A maximal information content based fuzzy approach for reconstructing genetic networks. PLoS One 2023; 18:e0288174. [PMID: 37418430 DOI: 10.1371/journal.pone.0288174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 06/21/2023] [Indexed: 07/09/2023] Open
Abstract
In systems biology, the accurate reconstruction of Gene Regulatory Networks (GRNs) is crucial since these networks can facilitate the solving of complex biological problems. Amongst the plethora of methods available for GRN reconstruction, information theory and fuzzy concepts-based methods have abiding popularity. However, most of these methods are not only complex, incurring a high computational burden, but they may also produce a high number of false positives, leading to inaccurate inferred networks. In this paper, we propose a novel hybrid fuzzy GRN inference model called MICFuzzy which involves the aggregation of the effects of Maximal Information Coefficient (MIC). This model has an information theory-based pre-processing stage, the output of which is applied as an input to the novel fuzzy model. In this preprocessing stage, the MIC component filters relevant genes for each target gene to significantly reduce the computational burden of the fuzzy model when selecting the regulatory genes from these filtered gene lists. The novel fuzzy model uses the regulatory effect of the identified activator-repressor gene pairs to determine target gene expression levels. This approach facilitates accurate network inference by generating a high number of true regulatory interactions while significantly reducing false regulatory predictions. The performance of MICFuzzy was evaluated using DREAM3 and DREAM4 challenge data, and the SOS real gene expression dataset. MICFuzzy outperformed the other state-of-the-art methods in terms of F-score, Matthews Correlation Coefficient, Structural Accuracy, and SS_mean, and outperformed most of them in terms of efficiency. MICFuzzy also had improved efficiency compared with the classical fuzzy model since the design of MICFuzzy leads to a reduction in combinatorial computation.
Collapse
Affiliation(s)
| | - Madhu Chetty
- Health Innovation and Transformation Centre, Federation University, Churchill, Victoria, Australia
| | - Suryani Lim
- Health Innovation and Transformation Centre, Federation University, Churchill, Victoria, Australia
| | | |
Collapse
|
2
|
Majumder S, Thakran Y, Pal V, Singh K. Fuzzy and Rough Set Theory Based Computational Framework for Mining Genetic Interaction Triplets From Gene Expression Profiles for Lung Adenocarcinoma. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3469-3481. [PMID: 34665736 DOI: 10.1109/tcbb.2021.3120844] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Genetic interactions are very helpful in understanding different disease and discovering drugs for it. Compared to the gene pairs that represent the genetic interactions between two genes, the gene triplets are more informative and useful. However, existing works on genetic interactions among gene triplets have primarily focused on detecting gene triplets from time series gene expression profiles. Generating the time series gene expression profiles for humans is quite impracticable but the labeled gene expression profiles are available for different diseases in case of humans. In this paper, a computational framework has been proposed to detect gene triplets from labeled gene expression profiles. First, it employs Rough Set Theory for extracting the key genes and then designs a fuzzy inference system for generating possible gene triplets. Further, Root Mean Squared Error measure has been used to prune out the irrelevant gene triplets. In the present work, the proposed computational framework has been applied to labeled lung adenocarcinoma dataset and can be applied to any other labeled gene expression dataset. The extracted gene triplets and their functionalities have been verified with existing biological literature and benchmark databases and the results of verification signify that the proposed framework is promising in terms of finding useful genetic triplets. Further, the proposed framework has been found more efficient as compared to an existing mutual information-based technique in terms of detecting known genetic interactions.
Collapse
|
3
|
Liu F, Heiner M, Gilbert D. Fuzzy Petri nets for modelling of uncertain biological systems. Brief Bioinform 2018; 21:198-210. [PMID: 30590430 DOI: 10.1093/bib/bby118] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Revised: 10/09/2018] [Accepted: 11/16/2018] [Indexed: 12/28/2022] Open
Abstract
The modelling of biological systems is accompanied with epistemic uncertainties that range from structural uncertainty to parametric uncertainty due to such limitations as insufficient understanding of the underlying mechanism and incomplete measurement data of a system. Fuzzy logic approaches such as fuzzy Petri nets (FPNs) are effective in addressing these issues. In this paper, we review FPNs that have been used for modelling uncertain biological systems, which we classify in three categories: basic fuzzy Petri nets, fuzzy quantitative Petri nets and Petri nets with fuzzy kinetic parameters. For each category of these FPNs, we summarize its modelling capabilities and current applications, discuss its merits and drawbacks and give suggestions for further research. This understanding on how to use FPNs for modelling uncertain biological systems will assist readers in selecting appropriate FPN classes for specific modelling circumstances. This review may also promote the extensive research and application of FPNs in the systems biology area.
Collapse
Affiliation(s)
- Fei Liu
- School of Software Engineering, South China University of Technology, Guangzhou, P. R. China
| | - Monika Heiner
- Department of Computer Science, Brandenburg University of Technology Cottbus-Senftenberg, Cottbus, Germany
| | - David Gilbert
- Department of Computer Science, Brunel University London, Middlesex, UK
| |
Collapse
|
4
|
Khayer N, Mirzaie M, Marashi SA, Rezaei-Tavirani M, Goshadrou F. Three-way interaction model with switching mechanism as an effective strategy for tracing functionally-related genes. Expert Rev Proteomics 2018; 16:161-169. [PMID: 30556756 DOI: 10.1080/14789450.2019.1559734] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Introduction: Identification of functionally-related genes is an important step in understanding biological systems. The most popular strategy to infer functional dependence is to study pairwise correlations between gene expression levels. However, certain functionally-related genes may have a low expression correlation due to their nonlinear interactions. The use of a three-way interaction (3WI) model with switching mechanism (SM) is a relatively new strategy to trace functionally-related genes. The 3WI model traces the dynamic and nonlinear nature of the co-expression relationship of two genes by introducing their link to the expression level of a third gene. Areas covered: In this paper, we reviewed a variety of existing methods for tracing the 3WIs. Furthermore, we provide a comprehensive review of the previous biological studies based on 3WI models. Expert commentary: Comparison of features of these methods indicates that the modified liquid association algorithm has the best efficiency for tracing 3WI between others. The limited number of biological studies based on the 3WI suggests that high computational demand of the available algorithms is a major challenge to apply this approach for analyzing high-throughput omics data.
Collapse
Affiliation(s)
- Nasibeh Khayer
- a Department of Basic Sciences, Faculty of Paramedical Sciences , Shahid Beheshti University of Medical Sciences , Tehran , Iran
| | - Mehdi Mirzaie
- b Department of Applied Mathematics, Faculty of Mathematical Sciences , Tarbiat Modares University , Tehran , Iran
| | - Sayed-Amir Marashi
- c Department of Biotechnology , College of Science, University of Tehran , Tehran , Iran
| | - Mostafa Rezaei-Tavirani
- d Proteomics Research Center , Shahid Beheshti University of Medical Sciences , Tehran , Iran
| | - Fatemeh Goshadrou
- a Department of Basic Sciences, Faculty of Paramedical Sciences , Shahid Beheshti University of Medical Sciences , Tehran , Iran
| |
Collapse
|
5
|
Raza K. Fuzzy logic based approaches for gene regulatory network inference. Artif Intell Med 2018; 97:189-203. [PMID: 30573378 DOI: 10.1016/j.artmed.2018.12.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Revised: 12/10/2018] [Accepted: 12/12/2018] [Indexed: 12/26/2022]
Abstract
The rapid advancements in high-throughput techniques have fueled large-scale production of biological data at very affordable costs. Some of these techniques are microarrays and next-generation sequencing that provide genome level insight of living cells. As a result, the size of most of the biological databases, such as NCBI-GEO, NCBI-SRA, etc., is growing exponentially. These biological data are analyzed using various computational techniques for knowledge discovery - which is also one of the objectives of bioinformatics research. Gene regulatory network (GRN) is a gene-gene interaction network which plays a pivotal role in understanding gene regulation processes and disease mechanism at the molecular level. From last couple of decades, researchers are interested in developing computational algorithms for GRN inference (GRNI) from high-throughput experimental data. Several computational approaches have been proposed for inferring GRN from gene expression data including statistical techniques (correlation coefficient), information theory (mutual information), regression-based approaches, probabilistic approaches (Bayesian networks, naïve byes), artificial neural networks and fuzzy logic. The fuzzy logic, along with its hybridization with other intelligent approaches, is a well-studied technique in GRNI due to its several advantages. In this paper, we present a consolidated review on fuzzy logic and its hybrid approaches developed during last two decades for GRNI.
Collapse
Affiliation(s)
- Khalid Raza
- Department of Computer Science, Jamia Millia Islamia, New Delhi, India.
| |
Collapse
|
6
|
Małysiak-Mrozek B. Uncertainty, imprecision, and many-valued logics in protein bioinformatics. Math Biosci 2018; 309:143-162. [PMID: 30118719 DOI: 10.1016/j.mbs.2018.08.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 07/24/2018] [Accepted: 08/09/2018] [Indexed: 11/15/2022]
Abstract
Understanding proteins, their structures, functions, mutual interactions, activity in cellular reactions, interactions with drugs, and expression in body cells is a key to efficient medical diagnosis, drug production, and treatment of patients. Machine learning and data exploration methods supported by many-valued logics allow to grasp the imprecision and uncertainties that naturally occur in proteins and other biomolecules. Many-valued logics, like Łukasiewicz logic or fuzzy logic, are non-classical logics that do not restrict the number of truth values to only two values of true or false, but they allow for a larger set of truth degrees. In this paper, we briefly review the use of many-valued logics, especially the fuzzy logic, in bioinformatics. Then, we focus on protein bioinformatics, and present selected applications of many-valued logics in the analysis of complex protein structures, including; (1) potential-based protein similarity searching, (2) matching proteins on the basis of secondary structures, (3) 3D protein structure alignment, (4) prediction of intrinsically disordered proteins, and (5) fuzzy querying in large collections of Big macromolecular Data. Results of presented studies show that the utilization of many-valued logics can enrich the investigations of protein molecules, in which uncertainty and imprecision are prevalent problems. The paper discusses all observed benefits brought by the application of many-valued logics in investigations related to selected protein analyzes carried out by the author.
Collapse
Affiliation(s)
- Bożena Małysiak-Mrozek
- Institute of Informatics, Silesian University of Technology, Akademicka 16, Gliwice 44-100, Poland.
| |
Collapse
|
7
|
Li X, Li Y, Liu Y, Wang L. Genetic Expression Level Prediction Based on Extended Fuzzy Petri Nets. INT J PATTERN RECOGN 2017. [DOI: 10.1142/s0218001417500367] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
With the advances in technique for high throughput data gathering such as microarrays, DNA sequencing machines and continuous development of human genome project, the traditional physical and chemical methods have been more difficult to meet the requests of time consuming and results accuracy. Exploring and understanding the causal relationship of complex gene regulatory networks and transforming the massive data of large-scale biological research to useful biological knowledge are the present challenge. As a result, there are two typical applications both the confidence value prediction of DNA sequence and influence degree prediction of gene expression which have become the hot issues in our daily life. In this paper, two extended fuzzy Petri nets approaches are proposed, based on the existing fuzzy Petri net model, to model and analyze for the hot issues respectively. One is the fuzzy colored Petri net, which combines fuzzy Petri net with colored Petri net to model fuzzy rule-based reasoning and determine confidence values for bases called in DNA sequence. The other is extended fuzzy Petri net, which integrates reverse reasoning into fuzzy Petri net and is proposed to model gene regulatory network. It can predict the change in expression level of target based on the input expression level of activator/repressor. Compared with the method of fuzzy Petri net, the two extended fuzzy Petri nets models perform more accurately in the following typical experiment reasoning outcomes and show that the proposed methods are feasible and available.
Collapse
Affiliation(s)
- Xiaozhong Li
- College of Computer Science and Information Engineering, Tianjin University of Science and Technology, Tianjin, 300222, P. R. China
| | - Yong Li
- College of Computer Science and Information Engineering, Tianjin University of Science and Technology, Tianjin, 300222, P. R. China
| | - Ying Liu
- College of Computer Science and Information Engineering, Tianjin University of Science and Technology, Tianjin, 300222, P. R. China
| | - Long Wang
- College of Computer Science and Information Engineering, Tianjin University of Science and Technology, Tianjin, 300222, P. R. China
| |
Collapse
|
8
|
Liu F, Heiner M, Yang M. Fuzzy Stochastic Petri Nets for Modeling Biological Systems with Uncertain Kinetic Parameters. PLoS One 2016; 11:e0149674. [PMID: 26910830 PMCID: PMC4766190 DOI: 10.1371/journal.pone.0149674] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Accepted: 01/29/2016] [Indexed: 12/27/2022] Open
Abstract
Stochastic Petri nets (SPNs) have been widely used to model randomness which is an inherent feature of biological systems. However, for many biological systems, some kinetic parameters may be uncertain due to incomplete, vague or missing kinetic data (often called fuzzy uncertainty), or naturally vary, e.g., between different individuals, experimental conditions, etc. (often called variability), which has prevented a wider application of SPNs that require accurate parameters. Considering the strength of fuzzy sets to deal with uncertain information, we apply a specific type of stochastic Petri nets, fuzzy stochastic Petri nets (FSPNs), to model and analyze biological systems with uncertain kinetic parameters. FSPNs combine SPNs and fuzzy sets, thereby taking into account both randomness and fuzziness of biological systems. For a biological system, SPNs model the randomness, while fuzzy sets model kinetic parameters with fuzzy uncertainty or variability by associating each parameter with a fuzzy number instead of a crisp real value. We introduce a simulation-based analysis method for FSPNs to explore the uncertainties of outputs resulting from the uncertainties associated with input parameters, which works equally well for bounded and unbounded models. We illustrate our approach using a yeast polarization model having an infinite state space, which shows the appropriateness of FSPNs in combination with simulation-based analysis for modeling and analyzing biological systems with uncertain information.
Collapse
Affiliation(s)
- Fei Liu
- Control and Simulation Center, Harbin Institute of Technology, Harbin, 150080 China
- * E-mail: (FL); (MY)
| | - Monika Heiner
- Department of Computer Science, Brandenburg University of Technology Cottbus-Senftenberg, Cottbus, 03013 Germany
| | - Ming Yang
- Control and Simulation Center, Harbin Institute of Technology, Harbin, 150080 China
- * E-mail: (FL); (MY)
| |
Collapse
|
9
|
Bordon J, Moskon M, Zimic N, Mraz M. Fuzzy Logic as a Computational Tool for Quantitative Modelling of Biological Systems with Uncertain Kinetic Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:1199-1205. [PMID: 26451831 DOI: 10.1109/tcbb.2015.2424424] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Quantitative modelling of biological systems has become an indispensable computational approach in the design of novel and analysis of existing biological systems. However, kinetic data that describe the system's dynamics need to be known in order to obtain relevant results with the conventional modelling techniques. These data are often hard or even impossible to obtain. Here, we present a quantitative fuzzy logic modelling approach that is able to cope with unknown kinetic data and thus produce relevant results even though kinetic data are incomplete or only vaguely defined. Moreover, the approach can be used in the combination with the existing state-of-the-art quantitative modelling techniques only in certain parts of the system, i.e., where kinetic data are missing. The case study of the approach proposed here is performed on the model of three-gene repressilator.
Collapse
|