1
|
Cortijo S, Bhattarai M, Locke JCW, Ahnert SE. Co-expression Networks From Gene Expression Variability Between Genetically Identical Seedlings Can Reveal Novel Regulatory Relationships. FRONTIERS IN PLANT SCIENCE 2020; 11:599464. [PMID: 33384705 PMCID: PMC7770228 DOI: 10.3389/fpls.2020.599464] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 11/05/2020] [Indexed: 06/12/2023]
Abstract
Co-expression networks are a powerful tool to understand gene regulation. They have been used to identify new regulation and function of genes involved in plant development and their response to the environment. Up to now, co-expression networks have been inferred using transcriptomes generated on plants experiencing genetic or environmental perturbation, or from expression time series. We propose a new approach by showing that co-expression networks can be constructed in the absence of genetic and environmental perturbation, for plants at the same developmental stage. For this, we used transcriptomes that were generated from genetically identical individual plants that were grown under the same conditions and for the same amount of time. Twelve time points were used to cover the 24-h light/dark cycle. We used variability in gene expression between individual plants of the same time point to infer a co-expression network. We show that this network is biologically relevant and use it to suggest new gene functions and to identify new targets for the transcriptional regulators GI, PIF4, and PRR5. Moreover, we find different co-regulation in this network based on changes in expression between individual plants, compared to the usual approach requiring environmental perturbation. Our work shows that gene co-expression networks can be identified using variability in gene expression between individual plants, without the need for genetic or environmental perturbations. It will allow further exploration of gene regulation in contexts with subtle differences between plants, which could be closer to what individual plants in a population might face in the wild.
Collapse
Affiliation(s)
- Sandra Cortijo
- The Sainsbury Laboratory, University of Cambridge, Cambridge, United Kingdom
- UMR5004 Biochimie et Physiologie Moléculaire des Plantes, Univ Montpellier, CNRS, INRAE, Institut Agro, Montpellier, France
| | - Marcel Bhattarai
- The Sainsbury Laboratory, University of Cambridge, Cambridge, United Kingdom
| | - James C. W. Locke
- The Sainsbury Laboratory, University of Cambridge, Cambridge, United Kingdom
| | - Sebastian E. Ahnert
- The Sainsbury Laboratory, University of Cambridge, Cambridge, United Kingdom
- Theory of Condensed Matter, Cavendish Laboratory, University of Cambridge, Cambridge, United Kingdom
- Department of Chemical Engineering and Biotechnology, Philippa Fawcett Drive, University of Cambridge, Cambridge, United Kingdom
- The Alan Turing Institute, British Library, London, United Kingdom
| |
Collapse
|
2
|
Zhang F, Liu X, Zhang A, Jiang Z, Chen L, Zhang X. Genome-wide dynamic network analysis reveals a critical transition state of flower development in Arabidopsis. BMC PLANT BIOLOGY 2019; 19:11. [PMID: 30616516 PMCID: PMC6323737 DOI: 10.1186/s12870-018-1589-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Accepted: 12/04/2018] [Indexed: 05/06/2023]
Abstract
BACKGROUND The flowering transition which is controlled by a complex and intricate gene regulatory network plays an important role in the reproduction for offspring of plants. It is a challenge to identify the critical transition state as well as the genes that control the transition of flower development. With the emergence of massively parallel sequencing, a great number of time-course transcriptome data greatly facilitate the exploration of the developmental phase transition in plants. Although some network-based bioinformatics analyses attempted to identify the genes that control the phase transition, they generally overlooked the dynamics of regulation and resulted in unreliable results. In addition, the results of these methods cannot be self-explained. RESULTS In this work, to reveal a critical transition state and identify the transition-specific genes of flower development, we implemented a genome-wide dynamic network analysis on temporal gene expression data in Arabidopsis by dynamic network biomarker (DNB) method. In the analysis, DNB model which can exploit collective fluctuations and correlations of different metabolites at a network level was used to detect the imminent critical transition state or the tipping point. The genes that control the phase transition can be identified by the difference of weighted correlations between the genes interested and the other genes in the global network. To construct the gene regulatory network controlling the flowering transition, we applied NARROMI algorithm which can reduce the noisy, redundant and indirect regulations on the expression data of the transition-specific genes. In the results, the critical transition state detected during the formation of flowers corresponded to the development of flowering on the 7th to 9th day in Arabidopsis. Among of 233 genes identified to be highly fluctuated at the transition state, a high percentage of genes with maximum expression in pollen was detected, and 24 genes were validated to participate in stress reaction process, as well as other floral-related pathways. Composed of three major subnetworks, a gene regulatory network with 150 nodes and 225 edges was found to be highly correlated with flowering transition. The gene ontology (GO) annotation of pathway enrichment analysis revealed that the identified genes are enriched in the catalytic activity, metabolic process and cellular process. CONCLUSIONS This study provides a novel insight to identify the real causality of the phase transition with genome-wide dynamic network analysis.
Collapse
Affiliation(s)
- Fuping Zhang
- Key Laboratory of Plant Germplasm Enhancement and Specially Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074 China
- University of Chinese Academy of Sciences, Beijing, 10049 China
| | - Xiaoping Liu
- Key Laboratory of Systems Biology, Innovation Center for Cell Signaling Network, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031 China
| | - Aidi Zhang
- Key Laboratory of Plant Germplasm Enhancement and Specially Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074 China
| | - Zhonglin Jiang
- Key Laboratory of Systems Biology, Innovation Center for Cell Signaling Network, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031 China
| | - Luonan Chen
- Key Laboratory of Systems Biology, Innovation Center for Cell Signaling Network, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031 China
| | - Xiujun Zhang
- Key Laboratory of Plant Germplasm Enhancement and Specially Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074 China
| |
Collapse
|
3
|
Differential Coexpression Analysis Reveals Extensive Rewiring of Arabidopsis Gene Coexpression in Response to Pseudomonas syringae Infection. Sci Rep 2016; 6:35064. [PMID: 27721457 PMCID: PMC5056366 DOI: 10.1038/srep35064] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 09/23/2016] [Indexed: 01/21/2023] Open
Abstract
Plant defense responses to pathogens involve massive transcriptional reprogramming. Recently, differential coexpression analysis has been developed to study the rewiring of gene networks through microarray data, which is becoming an important complement to traditional differential expression analysis. Using time-series microarray data of Arabidopsis thaliana infected with Pseudomonas syringae, we analyzed Arabidopsis defense responses to P. syringae through differential coexpression analysis. Overall, we found that differential coexpression was a common phenomenon of plant immunity. Genes that were frequently involved in differential coexpression tend to be related to plant immune responses. Importantly, many of those genes have similar average expression levels between normal plant growth and pathogen infection but have different coexpression partners. By integrating the Arabidopsis regulatory network into our analysis, we identified several transcription factors that may be regulators of differential coexpression during plant immune responses. We also observed extensive differential coexpression between genes within the same metabolic pathways. Several metabolic pathways, such as photosynthesis light reactions, exhibited significant changes in expression correlation between normal growth and pathogen infection. Taken together, differential coexpression analysis provides a new strategy for analyzing transcriptional data related to plant defense responses and new insights into the understanding of plant-pathogen interactions.
Collapse
|
4
|
Li H, Yang S, Wang C, Zhou Y, Zhang Z. AraPPISite: a database of fine-grained protein-protein interaction site annotations for Arabidopsis thaliana. PLANT MOLECULAR BIOLOGY 2016; 92:105-16. [PMID: 27338257 DOI: 10.1007/s11103-016-0498-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Accepted: 05/26/2016] [Indexed: 05/18/2023]
Abstract
Knowledge about protein interaction sites provides detailed information of protein-protein interactions (PPIs). To date, nearly 20,000 of PPIs from Arabidopsis thaliana have been identified. Nevertheless, the interaction site information has been largely missed by previously published PPI databases. Here, AraPPISite, a database that presents fine-grained interaction details for A. thaliana PPIs is established. First, the experimentally determined 3D structures of 27 A. thaliana PPIs are collected from the Protein Data Bank database and the predicted 3D structures of 3023 A. thaliana PPIs are modeled by using two well-established template-based docking methods. For each experimental/predicted complex structure, AraPPISite not only provides an interactive user interface for browsing interaction sites, but also lists detailed evolutionary and physicochemical properties of these sites. Second, AraPPISite assigns domain-domain interactions or domain-motif interactions to 4286 PPIs whose 3D structures cannot be modeled. In this case, users can easily query protein interaction regions at the sequence level. AraPPISite is a free and user-friendly database, which does not require user registration or any configuration on local machines. We anticipate AraPPISite can serve as a helpful database resource for the users with less experience in structural biology or protein bioinformatics to probe the details of PPIs, and thus accelerate the studies of plant genetics and functional genomics. AraPPISite is available at http://systbio.cau.edu.cn/arappisite/index.html .
Collapse
Affiliation(s)
- Hong Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Shiping Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Chuan Wang
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA, 94305, USA
| | - Yuan Zhou
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China.
| | - Ziding Zhang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
5
|
Network-Based Comparative Analysis of Arabidopsis Immune Responses to Golovinomyces orontii and Botrytis cinerea Infections. Sci Rep 2016; 6:19149. [PMID: 26750561 PMCID: PMC4707498 DOI: 10.1038/srep19149] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 12/07/2015] [Indexed: 12/14/2022] Open
Abstract
A comprehensive exploration of common and specific plant responses to biotrophs and necrotrophs is necessary for a better understanding of plant immunity. Here, we compared the Arabidopsis defense responses evoked by the biotrophic fungus Golovinomyces orontii and the necrotrophic fungus Botrytis cinerea through integrative network analysis. Two time-course transcriptional datasets were integrated with an Arabidopsis protein-protein interaction (PPI) network to construct a G. orontii conditional PPI sub-network (gCPIN) and a B. cinerea conditional PPI sub-network (bCPIN). We found that hubs in gCPIN and bCPIN played important roles in disease resistance. Hubs in bCPIN evolved faster than hubs in gCPIN, indicating the different selection pressures imposed on plants by different pathogens. By analyzing the common network from gCPIN and bCPIN, we identified two network components in which the genes were heavily involved in defense and development, respectively. The co-expression relationships between interacting proteins connecting the two components were different under G. orontii and B. cinerea infection conditions. Closer inspection revealed that auxin-related genes were overrepresented in the interactions connecting these two components, suggesting a critical role of auxin signaling in regulating the different co-expression relationships. Our work may provide new insights into plant defense responses against pathogens with different lifestyles.
Collapse
|
6
|
Competition-cooperation relationship networks characterize the competition and cooperation between proteins. Sci Rep 2015; 5:11619. [PMID: 26108281 PMCID: PMC4479874 DOI: 10.1038/srep11619] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Accepted: 06/01/2015] [Indexed: 01/04/2023] Open
Abstract
By analyzing protein-protein interaction (PPI) networks, one can find that a protein may have multiple binding partners. However, it is difficult to determine whether the interactions with these partners occur simultaneously from binary PPIs alone. Here, we construct the yeast and human competition-cooperation relationship networks (CCRNs) based on protein structural interactomes to clearly exhibit the relationship (competition or cooperation) between two partners of the same protein. If two partners compete for the same interaction interface, they would be connected by a competitive edge; otherwise, they would be connected by a cooperative edge. The properties of three kinds of hubs (i.e., competitive, modest, and cooperative hubs) are analyzed in the CCRNs. Our results show that competitive hubs have higher clustering coefficients and form clusters in the human CCRN, but these tendencies are not observed in the yeast CCRN. We find that the human-specific proteins contribute significantly to these differences. Subsequently, we conduct a series of computational experiments to investigate the regulatory mechanisms that avoid competition between proteins. Our comprehensive analyses reveal that for most yeast and human protein competitors, transcriptional regulation plays an important role. Moreover, the human-specific proteins have a particular preference for other regulatory mechanisms, such as alternative splicing.
Collapse
|
7
|
Dong X, Jiang Z, Peng YL, Zhang Z. Revealing shared and distinct gene network organization in Arabidopsis immune responses by integrative analysis. PLANT PHYSIOLOGY 2015; 167:1186-203. [PMID: 25614062 PMCID: PMC4348776 DOI: 10.1104/pp.114.254292] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Pattern-triggered immunity (PTI) and effector-triggered immunity (ETI) are two main plant immune responses to counter pathogen invasion. Genome-wide gene network organizing principles leading to quantitative differences between PTI and ETI have remained elusive. We combined an advanced machine learning method and modular network analysis to systematically characterize the organizing principles of Arabidopsis (Arabidopsis thaliana) PTI and ETI at three network resolutions. At the single network node/edge level, we ranked genes and gene interactions based on their ability to distinguish immune response from normal growth and successfully identified many immune-related genes associated with PTI and ETI. Topological analysis revealed that the top-ranked gene interactions tend to link network modules. At the subnetwork level, we identified a subnetwork shared by PTI and ETI encompassing 1,159 genes and 1,289 interactions. This subnetwork is enriched in interactions linking network modules and is also a hotspot of attack by pathogen effectors. The subnetwork likely represents a core component in the coordination of multiple biological processes to favor defense over development. Finally, we constructed modular network models for PTI and ETI to explain the quantitative differences in the global network architecture. Our results indicate that the defense modules in ETI are organized into relatively independent structures, explaining the robustness of ETI to genetic mutations and effector attacks. Taken together, the multiscale comparisons of PTI and ETI provide a systems biology perspective on plant immunity and emphasize coordination among network modules to establish a robust immune response.
Collapse
Affiliation(s)
- Xiaobao Dong
- State Key Laboratory of Agrobiotechnology (X.D., Z.J., Y.-L.P., Z.Z.), College of Biological Sciences (X.D., Z.J., Z.Z.), and Ministry of Agriculture Key Laboratory for Plant Pathology (Y.-L.P.), China Agricultural University, Beijing 100193, China
| | - Zhenhong Jiang
- State Key Laboratory of Agrobiotechnology (X.D., Z.J., Y.-L.P., Z.Z.), College of Biological Sciences (X.D., Z.J., Z.Z.), and Ministry of Agriculture Key Laboratory for Plant Pathology (Y.-L.P.), China Agricultural University, Beijing 100193, China
| | - You-Liang Peng
- State Key Laboratory of Agrobiotechnology (X.D., Z.J., Y.-L.P., Z.Z.), College of Biological Sciences (X.D., Z.J., Z.Z.), and Ministry of Agriculture Key Laboratory for Plant Pathology (Y.-L.P.), China Agricultural University, Beijing 100193, China
| | - Ziding Zhang
- State Key Laboratory of Agrobiotechnology (X.D., Z.J., Y.-L.P., Z.Z.), College of Biological Sciences (X.D., Z.J., Z.Z.), and Ministry of Agriculture Key Laboratory for Plant Pathology (Y.-L.P.), China Agricultural University, Beijing 100193, China
| |
Collapse
|
8
|
Santoni D, Swiercz A, Zmieńko A, Kasprzak M, Blazewicz M, Bertolazzi P, Felici G. An integrated approach (CLuster Analysis Integration Method) to combine expression data and protein-protein interaction networks in agrigenomics: application on Arabidopsis thaliana. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2014; 18:155-65. [PMID: 24404838 DOI: 10.1089/omi.2013.0050] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Experimental co-expression data and protein-protein interaction networks are frequently used to analyze the interactions among genes or proteins. Recent studies have investigated methods to integrate these two sources of information. We propose a new method to integrate co-expression data obtained through DNA microarray analysis (MA) and protein-protein interaction (PPI) network data, and apply it to Arabidopsis thaliana. The proposed method identifies small subsets of highly interacting proteins. Based on the analysis of the basis of co-localization and mRNA developmental expression, we show that these groups provide important biological insights; additionally, these subsets are significantly enriched with respect to KEGG Pathways and can be used to predict successfully whether proteins belong to known pathways. Thus, the method is able to provide relevant biological information and support the functional identification of complex genetic traits of economic value in plant agrigenomics research. The method has been implemented in a prototype software tool named CLAIM (CLuster Analysis Integration Method) and can be downloaded from http://bio.cs.put.poznan.pl/research_fields . CLAIM is based on the separate clustering of MA and PPI data; the clusters are merged in a special graph; cliques of this graph are subsets of strongly connected proteins. The proposed method was successfully compared with existing methods. CLAIM appears to be a useful semi-automated tool for protein functional analysis and warrants further evaluation in agrigenomics research.
Collapse
Affiliation(s)
- Daniele Santoni
- 1 Institute for Systems Analysis and Computer Science "Antonio Ruberti" , National Research Council of Italy, Rome, Italy
| | | | | | | | | | | | | |
Collapse
|
9
|
Molineris I, Ala U, Provero P, Di Cunto F. Drug repositioning for orphan genetic diseases through Conserved Anticoexpressed Gene Clusters (CAGCs). BMC Bioinformatics 2013; 14:288. [PMID: 24088245 PMCID: PMC3851137 DOI: 10.1186/1471-2105-14-288] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2013] [Accepted: 09/24/2013] [Indexed: 12/12/2022] Open
Abstract
Background The development of new therapies for orphan genetic diseases represents an extremely important medical and social challenge. Drug repositioning, i.e. finding new indications for approved drugs, could be one of the most cost- and time-effective strategies to cope with this problem, at least in a subset of cases. Therefore, many computational approaches based on the analysis of high throughput gene expression data have so far been proposed to reposition available drugs. However, most of these methods require gene expression profiles directly relevant to the pathologic conditions under study, such as those obtained from patient cells and/or from suitable experimental models. In this work we have developed a new approach for drug repositioning, based on identifying known drug targets showing conserved anti-correlated expression profiles with human disease genes, which is completely independent from the availability of ‘ad hoc’ gene expression data-sets. Results By analyzing available data, we provide evidence that the genes displaying conserved anti-correlation with drug targets are antagonistically modulated in their expression by treatment with the relevant drugs. We then identified clusters of genes associated to similar phenotypes and showing conserved anticorrelation with drug targets. On this basis, we generated a list of potential candidate drug-disease associations. Importantly, we show that some of the proposed associations are already supported by independent experimental evidence. Conclusions Our results support the hypothesis that the identification of gene clusters showing conserved anticorrelation with drug targets can be an effective method for drug repositioning and provide a wide list of new potential drug-disease associations for experimental validation.
Collapse
Affiliation(s)
- Ivan Molineris
- Molecular Biotechnology Centre, Department of Molecular Biotechnology and Health Sciences, University of Torino, Via Nizza 52, 10126, Torino, Italy.
| | | | | | | |
Collapse
|
10
|
Braun P, Aubourg S, Van Leene J, De Jaeger G, Lurin C. Plant protein interactomes. ANNUAL REVIEW OF PLANT BIOLOGY 2013; 64:161-87. [PMID: 23330791 DOI: 10.1146/annurev-arplant-050312-120140] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Protein-protein interactions are a critical element of biological systems, and the analysis of interaction partners can provide valuable hints about unknown functions of a protein. In recent years, several large-scale protein interaction studies have begun to unravel the complex networks through which plant proteins exert their functions. Two major classes of experimental approaches are used for protein interaction mapping: analysis of direct interactions using binary methods such as yeast two-hybrid or split ubiquitin, and analysis of protein complexes through affinity purification followed by mass spectrometry. In addition, bioinformatics predictions can suggest interactions that have evaded detection by other methods or those of proteins that have not been investigated. Here we review the major approaches to construct, analyze, use, and carry out quality control on plant protein interactome networks. We present experimental and computational approaches for large-scale mapping, methods for validation or smaller-scale functional studies, important bioinformatics resources, and findings from recently published large-scale plant interactome network maps.
Collapse
Affiliation(s)
- Pascal Braun
- Department of Plant Systems Biology, Center for Life and Food Sciences Weihenstephan, Technische Universität München (TUM), 85354 Freising-Weihenstephan, Germany.
| | | | | | | | | |
Collapse
|
11
|
Hosseinpour B, HajiHoseini V, Kashfi R, Ebrahimie E, Hemmatzadeh F. Protein interaction network of Arabidopsis thaliana female gametophyte development identifies novel proteins and relations. PLoS One 2012; 7:e49931. [PMID: 23239973 PMCID: PMC3519845 DOI: 10.1371/journal.pone.0049931] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2011] [Accepted: 10/17/2012] [Indexed: 01/01/2023] Open
Abstract
Although the female gametophyte in angiosperms consists of just seven cells, it has a complex biological network. In this study, female gametophyte microarray data from Arabidopsis thaliana were integrated into the Arabidopsis interactome database to generate a putative interaction map of the female gametophyte development including proteome map based on biological processes and molecular functions of proteins. Biological and functional groups as well as topological characteristics of the network were investigated by analyzing phytohormones, plant defense, cell death, transporters, regulatory factors, and hydrolases. This approach led to the prediction of critical members and bottlenecks of the network. Seventy-four and 24 upregulated genes as well as 171 and 3 downregulated genes were identified in subtracted networks based on biological processes and molecular function respectively, including novel genes such as the pathogenesis-related protein 4, ER type Ca(2+) ATPase 3, dihydroflavonol reductase, and ATP disulfate isomerase. Biologically important relationships between genes, critical nodes, and new essential proteins such as AT1G26830, AT5G20850, CYP74A, AT1G42396, PR4 and MEA were found in the interactome's network. The positions of novel genes, both upregulated and downregulated, and their relationships with biological pathways, in particular phytohormones, were highlighted in this study.
Collapse
Affiliation(s)
- Batool Hosseinpour
- Institute of Agriculture, Iranian Research Organization for Science and Technology, Tehran, Iran
| | - Vahid HajiHoseini
- Department of Biotechnology, College of Science, University of Tehran, Tehran, Iran
| | - Rafieh Kashfi
- Department of Crop Production & Plant Breeding, College of Agriculture, Shiraz University, Shiraz, Iran
| | - Esmaeil Ebrahimie
- Department of Crop Production & Plant Breeding, College of Agriculture, Shiraz University, Shiraz, Iran
- School of Molecular and Biomedical Science, The University of Adelaide, Adelaide, South Australia, Australia
- * E-mail: (EE); (FH)
| | - Farhid Hemmatzadeh
- School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, South Australia, Australia
- * E-mail: (EE); (FH)
| |
Collapse
|
12
|
Why assembling plant genome sequences is so challenging. BIOLOGY 2012; 1:439-59. [PMID: 24832233 PMCID: PMC4009782 DOI: 10.3390/biology1020439] [Citation(s) in RCA: 81] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 07/16/2012] [Revised: 09/05/2012] [Accepted: 09/06/2012] [Indexed: 12/16/2022]
Abstract
In spite of the biological and economic importance of plants, relatively few plant species have been sequenced. Only the genome sequence of plants with relatively small genomes, most of them angiosperms, in particular eudicots, has been determined. The arrival of next-generation sequencing technologies has allowed the rapid and efficient development of new genomic resources for non-model or orphan plant species. But the sequencing pace of plants is far from that of animals and microorganisms. This review focuses on the typical challenges of plant genomes that can explain why plant genomics is less developed than animal genomics. Explanations about the impact of some confounding factors emerging from the nature of plant genomes are given. As a result of these challenges and confounding factors, the correct assembly and annotation of plant genomes is hindered, genome drafts are produced, and advances in plant genomics are delayed.
Collapse
|
13
|
Zhou Y, Zhou YS, He F, Song J, Zhang Z. Can simple codon pair usage predict protein-protein interaction? MOLECULAR BIOSYSTEMS 2012; 8:1396-404. [PMID: 22392100 DOI: 10.1039/c2mb05427b] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Deciphering functional interactions between proteins is one of the great challenges in biology. Sequence-based homology-free encoding schemes have been increasingly applied to develop promising protein-protein interaction (PPI) predictors by means of statistical or machine learning methods. Here we analyze the relationship between codon pair usage and PPIs in yeast. We show that codon pair usage of interacting protein pairs differs significantly from randomly expected. This motivates the development of a novel approach for predicting PPIs, with codon pair frequency difference as input to a Support Vector Machine predictor, termed as CCPPI. 10-fold cross-validation tests based on yeast PPI datasets with balanced positive-to-negative ratios indicate that CCPPI performs better than other sequence-based encoding schemes. Moreover, it ranks the best when tested on an unbalanced large-scale dataset. Although CCPPI is subjected to high false positive rates like many PPI predictors, statistical analyses of the predicted true positives confirm that the success of CCPPI is partly ascribed to its capability to capture proteomic co-expression and functional similarities between interacting protein pairs. Our findings suggest that codon pairs of interacting protein pairs evolve in a coordinated manner and consequently they provide additional information beyond amino acids-based encoding schemes. CCPPI has been made freely available at: http://protein.cau.edu.cn/ccppi.
Collapse
Affiliation(s)
- Yuan Zhou
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | | | | | | | | |
Collapse
|
14
|
Sreenivasulu N, Schnurbusch T. A genetic playground for enhancing grain number in cereals. TRENDS IN PLANT SCIENCE 2012; 17:91-101. [PMID: 22197176 DOI: 10.1016/j.tplants.2011.11.003] [Citation(s) in RCA: 125] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2011] [Revised: 11/03/2011] [Accepted: 11/14/2011] [Indexed: 05/18/2023]
Abstract
Improving the yield stability of cereal crops with a view to bolstering global food security is an important priority. The components of final grain number per plant at harvest are determined by fertile spikes per plant, number of fertile spikelets per spike and number of grains per spikelet. In this review article, we focus on the genetic factors of floral development and inflorescence architecture known to influence grain number and provide a broad overview of genes and genetic pathways that potentially can be manipulated to increase the yield of cereal crops, in particular wheat (Triticum aestivum) and barley (Hordeum vulgare). In addition, we discuss the outcome of multidisciplinary genomics knowledge to identify potential gene targets to develop conceptual ideotypes to meet the future demand.
Collapse
Affiliation(s)
- Nese Sreenivasulu
- Leibniz-Institute of Plant Genetics and Crop Plant Research (IPK), Interdisciplinary Center for Crop Plant Research (IZN) Research Group Stress Genomics, Corrensstr. 3, 06466 Gatersleben, Germany
| | | |
Collapse
|
15
|
Integrating Rare-Variant Testing, Function Prediction, and Gene Network in Composite Resequencing-Based Genome-Wide Association Studies (CR-GWAS). G3-GENES GENOMES GENETICS 2011; 1:233-43. [PMID: 22384334 PMCID: PMC3276137 DOI: 10.1534/g3.111.000364] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/19/2011] [Accepted: 07/05/2011] [Indexed: 01/08/2023]
Abstract
High-density array-based genome-wide association studies (GWAS) are complemented by exome sequencing and whole-genome resequencing-based association studies. Here we present a composite resequencing-based genome-wide association study (CR-GWAS) strategy that systematically exploits collective biological information and analytical tools for a robust analysis. We showcased the utility of this strategy by using Arabidopsis (Arabidopsis thaliana) resequencing data. Bioinformatic predictions of biological function alteration at each locus were integrated into the process of association testing of both common and rare variants for complex traits with a suite of statistics. Significant signals were then filtered with a priori candidate loci generated from genome database and gene network models to obtain a posteriori candidate loci. A probabilistic gene network (AraNet) that interrogates network neighborhoods of genes was then used to expand the filtering power to examine the significant testing signals. Using this strategy, we confirmed the known true positives and identified several new promising associations. Promising genes (AP1, FCA, FRI, FLC, FLM, SPL5, FY, and DCL2) were shown to control for flowering time through either common variants or rare variants within a diverse set of Arabidopsis accessions. Although many of these candidate genes were cloned earlier with mutational studies, identifying their allele variation contribution to overall phenotypic variation among diverse natural accessions is critical. Our rare allele testing established a greater number of connections than previous analyses in which this issue was not addressed. More importantly, our results demonstrated the potential of integrating various biological, statistical, and bioinformatic tools into complex trait dissection.
Collapse
|