1
|
Chakraborty M, Biswas SK, Purkayastha B. Rule extraction using ensemble of neural network ensembles. COGN SYST RES 2022. [DOI: 10.1016/j.cogsys.2022.07.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
2
|
Johnson ZJ, Krutkin DD, Bohutskyi P, Kalyuzhnaya MG. Metals and methylotrophy: Via global gene expression studies. Methods Enzymol 2021; 650:185-213. [PMID: 33867021 DOI: 10.1016/bs.mie.2021.01.046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
A number of minerals, such as copper, cobalt, and rare earth elements (REE), are essential modulators of microbial one-carbon metabolism. This chapter provides an overview of the gene expression study design and analysis protocols for uncovering REE-induced changes in methylotrophic bacteria. By interrogating relationships and differences in total gene expression induced by mineral micronutrients, a deeper understanding of gene regulation at a systems scale can be gained. With careful design and execution of RNA-sequencing experiments, thorough processing and assessment of read quality can be utilized to assess and adjust for possible biases. By ensuring only quality data are utilized in downstream processes, differential gene expression, overrepresented analyses, and gene-set enrichment analyses provide reliable and reproducible representation of pathways and functions which are being affected by changes in environmental conditions.
Collapse
Affiliation(s)
- Zachary J Johnson
- Department of Biology, San Diego State University, San Diego, CA, United States
| | - Dennis D Krutkin
- Department of Biology, San Diego State University, San Diego, CA, United States
| | - Pavlo Bohutskyi
- Pacific Northwest National Laboratory, Richland, WA, United States
| | - Marina G Kalyuzhnaya
- Department of Biology, San Diego State University, San Diego, CA, United States.
| |
Collapse
|
3
|
Wang Y, Wang L, Yang F, Di W, Chang Q. Advantages of direct input-to-output connections in neural networks: The Elman network for stock index forecasting. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2020.09.031] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
4
|
Tang Z, Chai X, Wang Y, Cao S. Gene Regulatory Network Construction Based on a Particle Swarm Optimization of a Long Short-term Memory Network. Curr Bioinform 2020. [DOI: 10.2174/1574893614666191023115224] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
The Gene Regulatory Network (GRN) is a model for studying the
function and behavior of genes by treating the genome as a whole, which can reveal the gene
expression mechanism. However, due to the dynamics, nonlinearity, and complexity of gene
expression data, it is a challenging task to construct a GRN precisely. And in the circulating
cooling water system, the Slime-Forming Bacteria (SFB) is one of the bacteria that helps to form
dirt. In order to explore the microbial fouling mechanism of SFB, constructing a GRN for the
fouling-forming genes of SFB is significant.
Objective:
Propose an effective GRN construction method and construct a GRN for the foulingforming
genes of SFB.
Methods:
In this paper, a combination method of Long Short-Term Memory Network (LSTM) and
Mean Impact Value (MIV) was applied for GRN reconstruction. Firstly, LSTM was employed to
establish a gene expression prediction model. To improve the performance of LSTM, a Particle
Swarm Optimization (PSO) was introduced to optimize the weight and learning rate. Then, the
MIV was used to infer the regulation among genes. In view of the fouling-forming problem of
SFB, we have designed electromagnetic field experiments and transcriptome sequencing
experiments to locate the fouling-forming genes and obtain gene expression data.
Results:
In order to test the proposed approach, the proposed method was applied to three datasets:
a simulated dataset and two real biology datasets. By comparing with other methods, the
experimental results indicate that the proposed method has higher modeling accuracy and it can be
used to effectively construct a GRN. And at last, a GRN for fouling-forming genes of SFB was
constructed using the proposed approach.
Conclusion:
The experiments indicated that the proposed approach can reconstruct a GRN
precisely, and compared with other approaches, the proposed approach performs better in
extracting the regulations among genes.
Collapse
Affiliation(s)
- Zhenhao Tang
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
| | - Xiangying Chai
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
| | - Yu Wang
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
| | - Shengxian Cao
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
| |
Collapse
|
5
|
Li H, Bai J, Cui X, Li Y, Sun S. A new secondary decomposition-ensemble approach with cuckoo search optimization for air cargo forecasting. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2020.106161] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
6
|
Abstract
Background:Time series expression data of genes contain relations among different genes, which are difficult to model precisely. Slime-forming bacteria is one of the three major harmful bacteria types in industrial circulating cooling water systems.Objective:This study aimed at constructing gene regulation network(GRN) for slime-forming bacteria to understand the microbial fouling mechanism.Methods:For this purpose, an Adaptive Elman Neural Network (AENN) to reveal the relationships among genes using gene expression time series is proposed. The parameters of Elman neural network were optimized adaptively by a Genetic Algorithm (GA). And a Pearson correlation analysis is applied to discover the relationships among genes. In addition, the gene expression data of slime-forming bacteria by transcriptome gene sequencing was presented.Results:To evaluate our proposed method, we compared several alternative data-driven approaches, including a Neural Fuzzy Recurrent Network (NFRN), a basic Elman Neural Network (ENN), and an ensemble network. The experimental results of simulated and real datasets demonstrate that the proposed approach has a promising performance for modeling Gene Regulation Networks (GRNs). We also applied the proposed method for the GRN construction of slime-forming bacteria and at last a GRN for 6 genes was constructed.Conclusion:The proposed GRN construction method can effectively extract the regulations among genes. This is also the first report to construct the GRN for slime-forming bacteria.
Collapse
Affiliation(s)
- Shengxian Cao
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
| | - Yu Wang
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
| | - Zhenhao Tang
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
| |
Collapse
|
7
|
A Comparison Study on Rule Extraction from Neural Network Ensembles, Boosted Shallow Trees, and SVMs. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING 2018. [DOI: 10.1155/2018/4084850] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
One way to make the knowledge stored in an artificial neural network more intelligible is to extract symbolic rules. However, producing rules from Multilayer Perceptrons (MLPs) is an NP-hard problem. Many techniques have been introduced to generate rules from single neural networks, but very few were proposed for ensembles. Moreover, experiments were rarely assessed by 10-fold cross-validation trials. In this work, based on the Discretized Interpretable Multilayer Perceptron (DIMLP), experiments were performed on 10 repetitions of stratified 10-fold cross-validation trials over 25 binary classification problems. The DIMLP architecture allowed us to produce rules from DIMLP ensembles, boosted shallow trees (BSTs), and Support Vector Machines (SVM). The complexity of rulesets was measured with the average number of generated rules and average number of antecedents per rule. From the 25 used classification problems, the most complex rulesets were generated from BSTs trained by “gentle boosting” and “real boosting.” Moreover, we clearly observed that the less complex the rules were, the better their fidelity was. In fact, rules generated from decision stumps trained by modest boosting were, for almost all the 25 datasets, the simplest with the highest fidelity. Finally, in terms of average predictive accuracy and average ruleset complexity, the comparison of some of our results to those reported in the literature proved to be competitive.
Collapse
|
8
|
Kordmahalleh MM, Sefidmazgi MG, Harrison SH, Homaifar A. Identifying time-delayed gene regulatory networks via an evolvable hierarchical recurrent neural network. BioData Min 2017; 10:29. [PMID: 28785315 PMCID: PMC5543747 DOI: 10.1186/s13040-017-0146-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Accepted: 07/14/2017] [Indexed: 01/25/2023] Open
Abstract
BACKGROUND The modeling of genetic interactions within a cell is crucial for a basic understanding of physiology and for applied areas such as drug design. Interactions in gene regulatory networks (GRNs) include effects of transcription factors, repressors, small metabolites, and microRNA species. In addition, the effects of regulatory interactions are not always simultaneous, but can occur after a finite time delay, or as a combined outcome of simultaneous and time delayed interactions. Powerful biotechnologies have been rapidly and successfully measuring levels of genetic expression to illuminate different states of biological systems. This has led to an ensuing challenge to improve the identification of specific regulatory mechanisms through regulatory network reconstructions. Solutions to this challenge will ultimately help to spur forward efforts based on the usage of regulatory network reconstructions in systems biology applications. METHODS We have developed a hierarchical recurrent neural network (HRNN) that identifies time-delayed gene interactions using time-course data. A customized genetic algorithm (GA) was used to optimize hierarchical connectivity of regulatory genes and a target gene. The proposed design provides a non-fully connected network with the flexibility of using recurrent connections inside the network. These features and the non-linearity of the HRNN facilitate the process of identifying temporal patterns of a GRN. RESULTS Our HRNN method was implemented with the Python language. It was first evaluated on simulated data representing linear and nonlinear time-delayed gene-gene interaction models across a range of network sizes and variances of noise. We then further demonstrated the capability of our method in reconstructing GRNs of the Saccharomyces cerevisiae synthetic network for in vivo benchmarking of reverse-engineering and modeling approaches (IRMA). We compared the performance of our method to TD-ARACNE, HCC-CLINDE, TSNI and ebdbNet across different network sizes and levels of stochastic noise. We found our HRNN method to be superior in terms of accuracy for nonlinear data sets with higher amounts of noise. CONCLUSIONS The proposed method identifies time-delayed gene-gene interactions of GRNs. The topology-based advancement of our HRNN worked as expected by more effectively modeling nonlinear data sets. As a non-fully connected network, an added benefit to HRNN was how it helped to find the few genes which regulated the target gene over different time delays.
Collapse
Affiliation(s)
- Mina Moradi Kordmahalleh
- Department of Electrical and Computer Engineering, North Carolina A&T State University, 1601 E. Market Street, Greensboro, 27411 NC USA
| | - Mohammad Gorji Sefidmazgi
- Department of Electrical and Computer Engineering, North Carolina A&T State University, 1601 E. Market Street, Greensboro, 27411 NC USA
| | - Scott H Harrison
- Department of Biology, North Carolina A&T State University, 1601 E. Market Street, Greensboro, 27411 NC USA
| | - Abdollah Homaifar
- Department of Electrical and Computer Engineering, North Carolina A&T State University, 1601 E. Market Street, Greensboro, 27411 NC USA
| |
Collapse
|
9
|
Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems. Adv Bioinformatics 2017; 2017:4827171. [PMID: 28250767 PMCID: PMC5303608 DOI: 10.1155/2017/4827171] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2016] [Revised: 10/10/2016] [Accepted: 10/19/2016] [Indexed: 11/17/2022] Open
Abstract
Gene regulatory network (GRN) reconstruction is the process of identifying regulatory gene interactions from experimental data through computational analysis. One of the main reasons for the reduced performance of previous GRN methods had been inaccurate prediction of cascade motifs. Cascade error is defined as the wrong prediction of cascade motifs, where an indirect interaction is misinterpreted as a direct interaction. Despite the active research on various GRN prediction methods, the discussion on specific methods to solve problems related to cascade errors is still lacking. In fact, the experiments conducted by the past studies were not specifically geared towards proving the ability of GRN prediction methods in avoiding the occurrences of cascade errors. Hence, this research aims to propose Multiple Linear Regression (MLR) to infer GRN from gene expression data and to avoid wrongly inferring of an indirect interaction (A → B → C) as a direct interaction (A → C). Since the number of observations of the real experiment datasets was far less than the number of predictors, some predictors were eliminated by extracting the random subnetworks from global interaction networks via an established extraction method. In addition, the experiment was extended to assess the effectiveness of MLR in dealing with cascade error by using a novel experimental procedure that had been proposed in this work. The experiment revealed that the number of cascade errors had been very minimal. Apart from that, the Belsley collinearity test proved that multicollinearity did affect the datasets used in this experiment greatly. All the tested subnetworks obtained satisfactory results, with AUROC values above 0.5.
Collapse
|
10
|
|
11
|
|
12
|
Yang B, Chen Y, Jiang M. Reverse engineering of gene regulatory networks using flexible neural tree models. Neurocomputing 2013. [DOI: 10.1016/j.neucom.2012.07.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|