1
|
Babalola OO, Fadiji AE, Enagbonma BJ, Alori ET, Ayilara MS, Ayangbenro AS. The Nexus Between Plant and Plant Microbiome: Revelation of the Networking Strategies. Front Microbiol 2020; 11:548037. [PMID: 33013781 PMCID: PMC7499240 DOI: 10.3389/fmicb.2020.548037] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 08/12/2020] [Indexed: 12/16/2022] Open
Abstract
The diversity of plant-associated microbes is enormous and complex. These microbiomes are structured and form complex interconnected microbial networks that are important in plant health and ecosystem functioning. Understanding the composition of the microbiome and their core function is important in unraveling their networking strategies and their potential influence on plant performance. The network is altered by the host plant species, which in turn influence the microbial interaction dynamics and co-evolution. We discuss the plant microbiome and the complex interplay among microbes and between their host plants. We provide an overview of how plant performance is influenced by the microbiome diversity and function.
Collapse
Affiliation(s)
- Olubukola Oluranti Babalola
- Food Security and Safety Niche, Faculty of Natural and Agricultural Sciences, North-West University, Mmabatho, South Africa
| | - Ayomide E Fadiji
- Food Security and Safety Niche, Faculty of Natural and Agricultural Sciences, North-West University, Mmabatho, South Africa
| | - Ben J Enagbonma
- Food Security and Safety Niche, Faculty of Natural and Agricultural Sciences, North-West University, Mmabatho, South Africa
| | - Elizabeth T Alori
- Department of Crop and Soil Sciences, Landmark University, Omu-Aran, Nigeria
| | - Modupe S Ayilara
- Food Security and Safety Niche, Faculty of Natural and Agricultural Sciences, North-West University, Mmabatho, South Africa
| | - Ayansina S Ayangbenro
- Food Security and Safety Niche, Faculty of Natural and Agricultural Sciences, North-West University, Mmabatho, South Africa
| |
Collapse
|
2
|
An L, Doerge RW. Dynamic clustering of gene expression. ISRN BIOINFORMATICS 2012; 2012:537217. [PMID: 25969750 PMCID: PMC4393063 DOI: 10.5402/2012/537217] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2012] [Accepted: 08/05/2012] [Indexed: 11/23/2022]
Abstract
It is well accepted that genes are simultaneously involved in multiple biological processes and that genes are coordinated over the duration of such events. Unfortunately, clustering methodologies that group genes for the purpose of novel gene discovery fail to acknowledge the dynamic nature of biological processes and provide static clusters, even when the expression of genes is assessed across time or developmental stages. By taking advantage of techniques and theories from time frequency analysis, periodic gene expression profiles are dynamically clustered based on the assumption that different spectral frequencies characterize different biological processes. A two-step cluster validation approach is proposed to statistically estimate both the optimal number of clusters and to distinguish significant clusters from noise. The resulting clusters reveal coordinated coexpressed genes. This novel dynamic clustering approach has broad applicability to a vast range of sequential data scenarios where the order of the series is of interest.
Collapse
Affiliation(s)
- Lingling An
- Department of Agricultural and Biosystems Engineering, University of Arizona, Tucson, AZ 85721, USA
| | - R. W. Doerge
- Department of Statistics, Purdue University, West Lafayette, IN 47907, USA
| |
Collapse
|
3
|
Abstract
Over the past 20 years, Omics technologies emerged as the consensual denomination of holistic molecular profiling. These techniques enable parallel measurements of biological -omes, or "all constituents considered collectively", and utilize the latest advancements in transcriptomics, proteomics, metabolomics, imaging, and bioinformatics. The technological accomplishments in increasing the sensitivity and throughput of the analytical devices, the standardization of the protocols and the widespread availability of reagents made the capturing of static molecular portraits of biological systems a routine task. The next generation of time course molecular profiling already allows for extensive molecular snapshots to be taken along the trajectory of time evolution of the investigated biological systems. Such datasets provide the basis for application of the inverse scientific approach. It consists in the inference of scientific hypotheses and theories about the structure and dynamics of the investigated biological system without any a priori knowledge, solely relying on data analysis to unveil the underlying patterns. However, most temporal Omics data still contain a limited number of time points, taken over arbitrary time intervals, through measurements on biological processes shifted in time. The analysis of the resulting short and noisy time series data sets is a challenge. Traditional statistical methods for the study of static Omics datasets are of limited relevance and new methods are required. This chapter discusses such algorithms which enable the application of the inverse analysis approach to short Omics time series.
Collapse
|
4
|
Gao S, Hartman JL, Carter JL, Hessner MJ, Wang X. Global analysis of phase locking in gene expression during cell cycle: the potential in network modeling. BMC SYSTEMS BIOLOGY 2010; 4:167. [PMID: 21129191 PMCID: PMC3017040 DOI: 10.1186/1752-0509-4-167] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/18/2010] [Accepted: 12/03/2010] [Indexed: 11/10/2022]
Abstract
Background In nonlinear dynamic systems, synchrony through oscillation and frequency modulation is a general control strategy to coordinate multiple modules in response to external signals. Conversely, the synchrony information can be utilized to infer interaction. Increasing evidence suggests that frequency modulation is also common in transcription regulation. Results In this study, we investigate the potential of phase locking analysis, a technique to study the synchrony patterns, in the transcription network modeling of time course gene expression data. Using the yeast cell cycle data, we show that significant phase locking exists between transcription factors and their targets, between gene pairs with prior evidence of physical or genetic interactions, and among cell cycle genes. When compared with simple correlation we found that the phase locking metric can identify gene pairs that interact with each other more efficiently. In addition, it can automatically address issues of arbitrary time lags or different dynamic time scales in different genes, without the need for alignment. Interestingly, many of the phase locked gene pairs exhibit higher order than 1:1 locking, and significant phase lags with respect to each other. Based on these findings we propose a new phase locking metric for network reconstruction using time course gene expression data. We show that it is efficient at identifying network modules of focused biological themes that are important to cell cycle regulation. Conclusions Our result demonstrates the potential of phase locking analysis in transcription network modeling. It also suggests the importance of understanding the dynamics underlying the gene expression patterns.
Collapse
Affiliation(s)
- Shouguo Gao
- Department of Physics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
| | | | | | | | | |
Collapse
|
5
|
Tang B, Wu X, Tan G, Chen SS, Jing Q, Shen B. Computational inference and analysis of genetic regulatory networks via a supervised combinatorial-optimization pattern. BMC SYSTEMS BIOLOGY 2010; 4 Suppl 2:S3. [PMID: 20840730 PMCID: PMC2982690 DOI: 10.1186/1752-0509-4-s2-s3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Background Post-genome era brings about diverse categories of omics data. Inference and analysis of genetic regulatory networks act prominently in extracting inherent mechanisms, discovering and interpreting the related biological nature and living principles beneath mazy phenomena, and eventually promoting the well-beings of humankind. Results A supervised combinatorial-optimization pattern based on information and signal-processing theories is introduced into the inference and analysis of genetic regulatory networks. An associativity measure is proposed to define the regulatory strength/connectivity, and a phase-shift metric determines regulatory directions among components of the reconstructed networks. Thus, it solves the undirected regulatory problems arising from most of current linear/nonlinear relevance methods. In case of computational and topological redundancy, we constrain the classified group size of pair candidates within a multiobjective combinatorial optimization (MOCO) pattern. Conclusions We testify the proposed approach on two real-world microarray datasets of different statistical characteristics. Thus, we reveal the inherent design mechanisms for genetic networks by quantitative means, facilitating further theoretic analysis and experimental design with diverse research purposes. Qualitative comparisons with other methods and certain related focuses needing further work are illustrated within the discussion section.
Collapse
Affiliation(s)
- Binhua Tang
- Department of Bioinformatics, Tongji University, Shanghai, China.
| | | | | | | | | | | |
Collapse
|
6
|
Hu X, Xu P, Wu S, Asgari S, Bergsneider M. A data mining framework for time series estimation. J Biomed Inform 2009; 43:190-9. [PMID: 19900575 DOI: 10.1016/j.jbi.2009.11.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2008] [Revised: 10/13/2009] [Accepted: 11/02/2009] [Indexed: 11/30/2022]
Abstract
Time series estimation techniques are usually employed in biomedical research to derive variables less accessible from a set of related and more accessible variables. These techniques are traditionally built from systems modeling approaches including simulation, blind decovolution, and state estimation. In this work, we define target time series (TTS) and its related time series (RTS) as the output and input of a time series estimation process, respectively. We then propose a novel data mining framework for time series estimation when TTS and RTS represent different sets of observed variables from the same dynamic system. This is made possible by mining a database of instances of TTS, its simultaneously recorded RTS, and the input/output dynamic models between them. The key mining strategy is to formulate a mapping function for each TTS-RTS pair in the database that translates a feature vector extracted from RTS to the dissimilarity between true TTS and its estimate from the dynamic model associated with the same TTS-RTS pair. At run time, a feature vector is extracted from an inquiry RTS and supplied to the mapping function associated with each TTS-RTS pair to calculate a dissimilarity measure. An optimal TTS-RTS pair is then selected by analyzing these dissimilarity measures. The associated input/output model of the selected TTS-RTS pair is then used to simulate the TTS given the inquiry RTS as an input. An exemplary implementation was built to address a biomedical problem of noninvasive intracranial pressure assessment. The performance of the proposed method was superior to that of a simple training-free approach of finding the optimal TTS-RTS pair by a conventional similarity-based search on RTS features.
Collapse
Affiliation(s)
- Xiao Hu
- Neural Systems and Dynamics Lab, Department of Neurosurgery, Geffen School of Medicine at University of California, Los Angeles, CA 90095, USA.
| | | | | | | | | |
Collapse
|
7
|
Spectral preprocessing for clustering time-series gene expressions. EURASIP JOURNAL ON BIOINFORMATICS & SYSTEMS BIOLOGY 2009:713248. [PMID: 19381338 DOI: 10.1155/2009/713248] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2008] [Accepted: 01/19/2009] [Indexed: 11/17/2022]
Abstract
Based on gene expression profiles, genes can be partitioned into clusters, which might be associated with biological processes or functions, for example, cell cycle, circadian rhythm, and so forth. This paper proposes a novel clustering preprocessing strategy which combines clustering with spectral estimation techniques so that the time information present in time series gene expressions is fully exploited. By comparing the clustering results with a set of biologically annotated yeast cell-cycle genes, the proposed clustering strategy is corroborated to yield significantly different clusters from those created by the traditional expression-based schemes. The proposed technique is especially helpful in grouping genes participating in time-regulated processes.
Collapse
|
8
|
Hu J, Hu F. Estimating equation-based causality analysis with application to microarray time series data. Biostatistics 2009; 10:468-80. [PMID: 19329818 DOI: 10.1093/biostatistics/kxp005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Microarray time-course data can be used to explore interactions among genes and infer gene network. The crucial step in constructing gene network is to develop an appropriate causality test. In this regard, the expression profile of each gene can be treated as a time series. A typical existing method establishes the Granger causality based on Wald type of test, which relies on the homoscedastic normality assumption of the data distribution. However, this assumption can be seriously violated in real microarray experiments and thus may lead to inconsistent test results and false scientific conclusions. To overcome the drawback, we propose an estimating equation-based method which is robust to both heteroscedasticity and nonnormality of the gene expression data. In fact, it only requires the residuals to be uncorrelated. We will use simulation studies and a real-data example to demonstrate the applicability of the proposed method.
Collapse
Affiliation(s)
- Jianhua Hu
- Department of Biostatistics, Division of Quantitative Sciences, University of Texas M. D. Anderson Cancer Center, Houston, TX, USA.
| | | |
Collapse
|
9
|
Zhu D, Hero AO, Qin ZS, Swaroop A. High throughput screening of co-expressed gene pairs with controlled false discovery rate (FDR) and minimum acceptable strength (MAS). J Comput Biol 2008; 12:1029-45. [PMID: 16201920 DOI: 10.1089/cmb.2005.12.1029] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Many exploratory microarray data analysis tools such as gene clustering and relevance networks rely on detecting pairwise gene co-expression. Traditional screening of pairwise co-expression either controls biological significance or statistical significance, but not both. The former approach does not provide stochastic error control, and the later approach screens many co-expressions with excessively low correlation. We have designed and implemented a statistically sound two-stage co-expression detection algorithm that controls both statistical significance (false discovery rate, FDR) and biological significance (minimum acceptable strength, MAS) of the discovered co-expressions. Based on estimation of pairwise gene correlation, the algorithm provides an initial co-expression discovery that controls only FDR, which is then followed by a second stage co-expression discovery which controls both FDR and MAS. It also computes and thresholds the set of FDR p-values for each correlation that satisfied the MAS criterion. Using simulated data, we validated asymptotic null distributions of the Pearson and Kendall correlation coefficients and the two-stage error-control procedure; we also compared our two-stage test procedure with another two-stage test procedure using the receiver operating characteristic (ROC) curve. We then used yeast galactose metabolism data to illustrate the advantage of our method for clustering genes and constructing a relevance network. The method has been implemented in an R package "GeneNT" that is freely available from the Comprehensive R Archive Network (CRAN): www.cran.r-project.org/.
Collapse
Affiliation(s)
- Dongxiao Zhu
- Bioinformatics Program, University of Michigan, Ann Arbor, MI 48109, USA.
| | | | | | | |
Collapse
|
10
|
Kiesel J, Miller C, Abu-Amer Y, Aurora R. Systems level analysis of osteoclastogenesis reveals intrinsic and extrinsic regulatory interactions. Dev Dyn 2007; 236:2181-97. [PMID: 17584858 DOI: 10.1002/dvdy.21206] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Osteoclasts are bone-resorbing cells derived from the myeloid lineage that play a central role in bone remodeling and inflammatory bone erosion diseases. The receptor activator of NF-kappaB ligand (RANKL) produced by osteoblasts and activated immune cells initiates the development of osteoclasts in the bone marrow. Using time series gene expression data, the intrinsic processes and the extrinsic factors that control osteoclastogenesis were identified. The gene expression profiles display distinct commitment and differentiation phases. Analysis of the time course revealed several mechanistic details, including the complex role of cholesterol in osteoclast development. Epistatic interactions and the coordination between cellular processes that regulate development were inferred from the coexpression network. The coexpression network indicated that osteoclasts induce angiogenesis and recruit T-cells to the site of osteoclastogenesis early in the commitment phase. The resulting model provides an essential framework for a better understanding of the epigenetic program of osteoclastogenesis.
Collapse
Affiliation(s)
- Jennifer Kiesel
- Department of Molecular Microbiology and Immunology, Saint Louis University School of Medicine, Saint Louis, Missouri, USA
| | | | | | | |
Collapse
|
11
|
Mukhopadhyay ND, Chatterjee S. Causality and pathway search in microarray time series experiment. Bioinformatics 2006; 23:442-9. [PMID: 17158516 DOI: 10.1093/bioinformatics/btl598] [Citation(s) in RCA: 91] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Interaction among time series can be explored in many ways. All the approach has the usual problem of low power and high dimensional model. Here we attempted to build a causality network among a set of time series. The causality has been established by Granger causality, and then constructing the pathway has been implemented by finding the Minimal Spanning Tree within each connected component of the inferred network. False discovery rate measurement has been used to identify the most significant causalities. RESULTS Simulation shows good convergence and accuracy of the algorithm. Robustness of the procedure has been demonstrated by applying the algorithm in a non-stationary time series setup. Application of the algorithm in a real dataset identified many causalities, with some overlap with previously known ones. Assembled network of the genes reveals features of the network that are common wisdom about naturally occurring networks.
Collapse
|
12
|
Abstract
UNLABELLED TimeView is a MATLAB program that compares multiple temporal datasets from microarray experiments under two or more conditions, for example, temporal variation of cellular response upon exposure to different drugs. The current paucity of programs designed to efficiently compare and visualise gene expression profiles in such datasets led us to design TimeView, which also enhances data visualisation by plotting the expression profiles of a large number of genes on a single screen. AVAILABILITY TimeView is available free of charge to all users at http://hugroup.cems.umn.edu/Research/Genomics/Timeview/timeview.htm. To use TimeView, users will require access to the commercial software MATLAB (version 6.5). A help document is available on the TimeView website.
Collapse
Affiliation(s)
- Mugdha Gadgil
- Department of Chemical Engineering and Materials Science, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | | | | | | |
Collapse
|
13
|
|