1
|
Cooper M, Messina CD. Breeding crops for drought-affected environments and improved climate resilience. THE PLANT CELL 2023; 35:162-186. [PMID: 36370076 PMCID: PMC9806606 DOI: 10.1093/plcell/koac321] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 11/01/2022] [Indexed: 05/12/2023]
Abstract
Breeding climate-resilient crops with improved levels of abiotic and biotic stress resistance as a response to climate change presents both opportunities and challenges. Applying the framework of the "breeder's equation," which is used to predict the response to selection for a breeding program cycle, we review methodologies and strategies that have been used to successfully breed crops with improved levels of drought resistance, where the target population of environments (TPEs) is a spatially and temporally heterogeneous mixture of drought-affected and favorable (water-sufficient) environments. Long-term improvement of temperate maize for the US corn belt is used as a case study and compared with progress for other crops and geographies. Integration of trait information across scales, from genomes to ecosystems, is needed to accurately predict yield outcomes for genotypes within the current and future TPEs. This will require transdisciplinary teams to explore, identify, and exploit novel opportunities to accelerate breeding program outcomes; both improved germplasm resources and improved products (cultivars, hybrids, clones, and populations) that outperform and replace the products in use by farmers, in combination with modified agronomic management strategies suited to their local environments.
Collapse
Affiliation(s)
- Mark Cooper
- Queensland Alliance for Agriculture and Food Innovation (QAAFI), The University of Queensland, Brisbane, Queensland 4072, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Carlos D Messina
- Horticultural Sciences Department, University of Florida, Gainesville, Florida 32611, USA
| |
Collapse
|
2
|
Depuydt T, Vandepoele K. Multi-omics network-based functional annotation of unknown Arabidopsis genes. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 108:1193-1212. [PMID: 34562334 DOI: 10.1111/tpj.15507] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 09/20/2021] [Indexed: 06/13/2023]
Abstract
Unraveling gene function is pivotal to understanding the signaling cascades that control plant development and stress responses. As experimental profiling is costly and labor intensive, there is a clear need for high-confidence computational annotation. In contrast to detailed gene-specific functional information, transcriptomics data are widely available for both model and crop species. Here, we describe a novel automated function prediction method, which leverages complementary information from multiple expression datasets by analyzing study-specific gene co-expression networks. First, we benchmarked the prediction performance on recently characterized Arabidopsis thaliana genes, and showed that our method outperforms state-of-the-art expression-based approaches. Next, we predicted biological process annotations for known (n = 15 790) and unknown (n = 11 865) genes in A. thaliana and validated our predictions using experimental protein-DNA and protein-protein interaction data (covering >220 000 interactions in total), obtaining a set of high-confidence functional annotations. Our method assigned at least one validated annotation to 5054 (42.6%) unknown genes, and at least one novel validated function to 3408 (53.0%) genes with computational annotations only. These omics-supported functional annotations shed light on a variety of developmental processes and molecular responses, such as flower and root development, defense responses to fungi and bacteria, and phytohormone signaling, and help fill the information gap on biological process annotations in Arabidopsis. An in-depth analysis of two context-specific networks, modeling seed development and response to water deprivation, shows how previously uncharacterized genes function within the respective networks. Moreover, our automated function prediction approach can be applied in future studies to facilitate gene discovery for crop improvement.
Collapse
Affiliation(s)
- Thomas Depuydt
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, Vlaams Instituut voor Biotechnologie, Ghent, Belgium
| | - Klaas Vandepoele
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, Vlaams Instituut voor Biotechnologie, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| |
Collapse
|
3
|
Zwaenepoel A, Diels T, Amar D, Van Parys T, Shamir R, Van de Peer Y, Tzfadia O. MorphDB: Prioritizing Genes for Specialized Metabolism Pathways and Gene Ontology Categories in Plants. FRONTIERS IN PLANT SCIENCE 2018; 9:352. [PMID: 29616063 PMCID: PMC5867296 DOI: 10.3389/fpls.2018.00352] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2018] [Accepted: 03/02/2018] [Indexed: 05/20/2023]
Abstract
Recent times have seen an enormous growth of "omics" data, of which high-throughput gene expression data are arguably the most important from a functional perspective. Despite huge improvements in computational techniques for the functional classification of gene sequences, common similarity-based methods often fall short of providing full and reliable functional information. Recently, the combination of comparative genomics with approaches in functional genomics has received considerable interest for gene function analysis, leveraging both gene expression based guilt-by-association methods and annotation efforts in closely related model organisms. Besides the identification of missing genes in pathways, these methods also typically enable the discovery of biological regulators (i.e., transcription factors or signaling genes). A previously built guilt-by-association method is MORPH, which was proven to be an efficient algorithm that performs particularly well in identifying and prioritizing missing genes in plant metabolic pathways. Here, we present MorphDB, a resource where MORPH-based candidate genes for large-scale functional annotations (Gene Ontology, MapMan bins) are integrated across multiple plant species. Besides a gene centric query utility, we present a comparative network approach that enables researchers to efficiently browse MORPH predictions across functional gene sets and species, facilitating efficient gene discovery and candidate gene prioritization. MorphDB is available at http://bioinformatics.psb.ugent.be/webtools/morphdb/morphDB/index/. We also provide a toolkit, named "MORPH bulk" (https://github.com/arzwa/morph-bulk), for running MORPH in bulk mode on novel data sets, enabling researchers to apply MORPH to their own species of interest.
Collapse
Affiliation(s)
- Arthur Zwaenepoel
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Tim Diels
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - David Amar
- Stanford Center for Inherited Cardiovascular Disease, Stanford University, Stanford, CA, United States
| | - Thomas Van Parys
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Ron Shamir
- Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
- Genomics Research Institute, University of Pretoria, Pretoria, South Africa
- *Correspondence: Yves Van de Peer
| | - Oren Tzfadia
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
- Oren Tzfadia
| |
Collapse
|
4
|
de Abreu E Lima F, Westhues M, Cuadros-Inostroza Á, Willmitzer L, Melchinger AE, Nikoloski Z. Metabolic robustness in young roots underpins a predictive model of maize hybrid performance in the field. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 90:319-329. [PMID: 28122143 DOI: 10.1111/tpj.13495] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2016] [Revised: 12/14/2016] [Accepted: 01/23/2017] [Indexed: 05/05/2023]
Abstract
Heterosis has been extensively exploited for yield gain in maize (Zea mays L.). Here we conducted a comparative metabolomics-based analysis of young roots from in vitro germinating seedlings and from leaves of field-grown plants in a panel of inbred lines from the Dent and Flint heterotic patterns as well as selected F1 hybrids. We found that metabolite levels in hybrids were more robust than in inbred lines. Using state-of-the-art modeling techniques, the most robust metabolites from roots and leaves explained up to 37 and 44% of the variance in the biomass from plants grown in two distinct field trials. In addition, a correlation-based analysis highlighted the trade-off between defense-related metabolites and hybrid performance. Therefore, our findings demonstrated the potential of metabolic profiles from young maize roots grown under tightly controlled conditions to predict hybrid performance in multiple field trials, thus bridging the greenhouse-field gap.
Collapse
Affiliation(s)
| | - Matthias Westhues
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Fruwirthstr. 21, 70593, Stuttgart, Germany
| | | | - Lothar Willmitzer
- Max-Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam, Germany
| | - Albrecht E Melchinger
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Fruwirthstr. 21, 70593, Stuttgart, Germany
| | - Zoran Nikoloski
- Max-Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam, Germany
| |
Collapse
|
5
|
Brockmöller T, Ling Z, Li D, Gaquerel E, Baldwin IT, Xu S. Nicotiana attenuata Data Hub (NaDH): an integrative platform for exploring genomic, transcriptomic and metabolomic data in wild tobacco. BMC Genomics 2017; 18:79. [PMID: 28086860 PMCID: PMC5237228 DOI: 10.1186/s12864-016-3465-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Accepted: 12/23/2016] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Nicotiana attenuata (coyote tobacco) is an ecological model for studying plant-environment interactions and plant gene function under real-world conditions. During the last decade, large amounts of genomic, transcriptomic and metabolomic data have been generated with this plant which has provided new insights into how native plants interact with herbivores, pollinators and microbes. However, an integrative and open access platform that allows for the efficient mining of these -omics data remained unavailable until now. DESCRIPTION We present the Nicotiana attenuata Data Hub (NaDH) as a centralized platform for integrating and visualizing genomic, phylogenomic, transcriptomic and metabolomic data in N. attenuata. The NaDH currently hosts collections of predicted protein coding sequences of 11 plant species, including two recently sequenced Nicotiana species, and their functional annotations, 222 microarray datasets from 10 different experiments, a transcriptomic atlas based on 20 RNA-seq expression profiles and a metabolomic atlas based on 895 metabolite spectra analyzed by mass spectrometry. We implemented several visualization tools, including a modified version of the Electronic Fluorescent Pictograph (eFP) browser, co-expression networks and the Interactive Tree Of Life (iTOL) for studying gene expression divergence among duplicated homologous. In addition, the NaDH allows researchers to query phylogenetic trees of 16,305 gene families and provides tools for analyzing their evolutionary history. Furthermore, we also implemented tools to identify co-expressed genes and metabolites, which can be used for predicting the functions of genes. Using the transcription factor NaMYB8 as an example, we illustrate that the tools and data in NaDH can facilitate identification of candidate genes involved in the biosynthesis of specialized metabolites. CONCLUSION The NaDH provides interactive visualization and data analysis tools that integrate the expression and evolutionary history of genes in Nicotiana, which can facilitate rapid gene discovery and comparative genomic analysis. Because N. attenuata shares many genome-wide features with other Nicotiana species including cultivated tobacco, and hence NaDH can be a resource for exploring the function and evolution of genes in Nicotiana species in general. The NaDH can be accessed at: http://nadh.ice.mpg.de/ .
Collapse
Affiliation(s)
- Thomas Brockmöller
- Department of Molecular Ecology, Max Planck Institute for Chemical Ecology, Hans-Knöll-Straße 8, D-07745 Jena, Germany
| | - Zhihao Ling
- Department of Molecular Ecology, Max Planck Institute for Chemical Ecology, Hans-Knöll-Straße 8, D-07745 Jena, Germany
| | - Dapeng Li
- Department of Molecular Ecology, Max Planck Institute for Chemical Ecology, Hans-Knöll-Straße 8, D-07745 Jena, Germany
| | - Emmanuel Gaquerel
- Centre for Organismal Studies, Heidelberg University, Im Neuenheimer Feld 360, Heidelberg, D-69120 Germany
| | - Ian T. Baldwin
- Department of Molecular Ecology, Max Planck Institute for Chemical Ecology, Hans-Knöll-Straße 8, D-07745 Jena, Germany
| | - Shuqing Xu
- Department of Molecular Ecology, Max Planck Institute for Chemical Ecology, Hans-Knöll-Straße 8, D-07745 Jena, Germany
| |
Collapse
|
6
|
Gu X, Lu T. Commentary: Comparative Transcriptome Analysis of Raphanus sativus Tissues. FRONTIERS IN PLANT SCIENCE 2016; 6:1191. [PMID: 26779225 PMCID: PMC4700402 DOI: 10.3389/fpls.2015.01191] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Accepted: 12/11/2015] [Indexed: 05/31/2023]
|
7
|
Rosado-Souza L, Scossa F, Chaves IS, Kleessen S, Salvador LFD, Milagre JC, Finger F, Bhering LL, Sulpice R, Araújo WL, Nikoloski Z, Fernie AR, Nunes-Nesi A. Exploring natural variation of photosynthetic, primary metabolism and growth parameters in a large panel of Capsicum chinense accessions. PLANTA 2015; 242:677-691. [PMID: 26007687 DOI: 10.1007/s00425-015-2332-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2015] [Accepted: 05/13/2015] [Indexed: 06/04/2023]
Abstract
Collectively, the results presented improve upon the utility of an important genetic resource and attest to a complex genetic basis for differences in both leaf metabolism and fruit morphology between natural populations. Diversity of accessions within the same species provides an alternative method to identify physiological and metabolic traits that have large effects on growth regulation, biomass and fruit production. Here, we investigated physiological and metabolic traits as well as parameters related to plant growth and fruit production of 49 phenotypically diverse pepper accessions of Capsicum chinense grown ex situ under controlled conditions. Although single-trait analysis identified up to seven distinct groups of accessions, working with the whole data set by multivariate analyses allowed the separation of the 49 accessions in three clusters. Using all 23 measured parameters and data from the geographic origin for these accessions, positive correlations between the combined phenotypes and geographic origin were observed, supporting a robust pattern of isolation-by-distance. In addition, we found that fruit set was positively correlated with photosynthesis-related parameters, which, however, do not explain alone the differences in accession susceptibility to fruit abortion. Our results demonstrated that, although the accessions belong to the same species, they exhibit considerable natural intraspecific variation with respect to physiological and metabolic parameters, presenting diverse adaptation mechanisms and being a highly interesting source of information for plant breeders. This study also represents the first study combining photosynthetic, primary metabolism and growth parameters for Capsicum to date.
Collapse
Affiliation(s)
- Laise Rosado-Souza
- Max-Planck Partner Group at the Departamento de Biologia Vegetal, Universidade Federal de Viçosa, Viçosa, Minas Gerais, 36570-900, Brazil
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Omranian N, Kleessen S, Tohge T, Klie S, Basler G, Mueller-Roeber B, Fernie AR, Nikoloski Z. Differential metabolic and coexpression networks of plant metabolism. TRENDS IN PLANT SCIENCE 2015; 20:266-268. [PMID: 25791509 DOI: 10.1016/j.tplants.2015.02.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Revised: 02/04/2015] [Accepted: 02/19/2015] [Indexed: 05/14/2023]
Abstract
Recent analyses have demonstrated that plant metabolic networks do not differ in their structural properties and that genes involved in basic metabolic processes show smaller coexpression than genes involved in specialized metabolism. By contrast, our analysis reveals differences in the structure of plant metabolic networks and patterns of coexpression for genes in (non)specialized metabolism. Here we caution that conclusions concerning the organization of plant metabolism based on network-driven analyses strongly depend on the computational approaches used.
Collapse
Affiliation(s)
- Nooshin Omranian
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, Potsdam, Germany; Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | | | - Takayuki Tohge
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, Potsdam, Germany
| | | | - Georg Basler
- Estación Experimental del Zaidín CSIC, 18008 Granada, Spain
| | - Bernd Mueller-Roeber
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, Potsdam, Germany; Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | - Alisdair R Fernie
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, Potsdam, Germany
| | - Zoran Nikoloski
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, Potsdam, Germany.
| |
Collapse
|
9
|
Hu C, Shi J, Quan S, Cui B, Kleessen S, Nikoloski Z, Tohge T, Alexander D, Guo L, Lin H, Wang J, Cui X, Rao J, Luo Q, Zhao X, Fernie AR, Zhang D. Metabolic variation between japonica and indica rice cultivars as revealed by non-targeted metabolomics. Sci Rep 2014; 4:5067. [PMID: 24861081 PMCID: PMC5381408 DOI: 10.1038/srep05067] [Citation(s) in RCA: 108] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2013] [Accepted: 05/08/2014] [Indexed: 01/07/2023] Open
Abstract
Seed metabolites are critically important both for plant development and human nutrition; however, the natural variation in their levels remains poorly characterized. Here we profiled 121 metabolites in mature seeds of a wide panel Oryza sativa japonica and indica cultivars, revealing correlations between the metabolic phenotype and geographic origin of the rice seeds. Moreover, japonica and indica subspecies differed significantly not only in the relative abundances of metabolites but also in their corresponding metabolic association networks. These findings provide important insights into metabolic adaptation in rice subgroups, bridging the gap between genome and phenome, and facilitating the identification of genetic control of metabolic properties that can serve as a basis for the future improvement of rice quality via metabolic engineering.
Collapse
Affiliation(s)
- Chaoyang Hu
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- These authors contributed equally to this work
| | - Jianxin Shi
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- These authors contributed equally to this work
| | - Sheng Quan
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- These authors contributed equally to this work
| | - Bo Cui
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Sabrina Kleessen
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Germany
| | - Zoran Nikoloski
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Germany
| | - Takayuki Tohge
- Central Metabolism Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Germany
| | | | - Lining Guo
- Metabolon Inc., Durham, North Carolina 27713, USA
| | - Hong Lin
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Jing Wang
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xiao Cui
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Jun Rao
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Qian Luo
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xiangxiang Zhao
- Jiangsu Key Laboratory for Eco-Agricultural Biotechnology around Hongze Lake, Huaiyin Normal University, Huaian, Jiangsu, 223300, China
| | - Alisdair R. Fernie
- Central Metabolism Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Germany
| | - Dabing Zhang
- National Center for Molecular Characterization of Genetically Modified Organisms, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| |
Collapse
|
10
|
Montojo J, Zuberi K, Shao Q, Bader GD, Morris Q. Network Assessor: an automated method for quantitative assessment of a network's potential for gene function prediction. Front Genet 2014; 5:123. [PMID: 24904632 PMCID: PMC4032932 DOI: 10.3389/fgene.2014.00123] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2014] [Accepted: 04/21/2014] [Indexed: 01/17/2023] Open
Abstract
Significant effort has been invested in network-based gene function prediction algorithms based on the guilt by association (GBA) principle. Existing approaches for assessing prediction performance typically compute evaluation metrics, either averaged across all functions being considered, or strictly from properties of the network. Since the success of GBA algorithms depends on the specific function being predicted, evaluation metrics should instead be computed for each function. We describe a novel method for computing the usefulness of a network by measuring its impact on gene function cross validation prediction performance across all gene functions. We have implemented this in software called Network Assessor, and describe its use in the GeneMANIA (GM) quality control system. Network Assessor is part of the GM command line tools.
Collapse
Affiliation(s)
- Jason Montojo
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto Toronto, ON, Canada
| | - Khalid Zuberi
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto Toronto, ON, Canada
| | - Quentin Shao
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto Toronto, ON, Canada
| | - Gary D Bader
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto Toronto, ON, Canada
| | - Quaid Morris
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto Toronto, ON, Canada
| |
Collapse
|
11
|
Bellucci E, Bitocchi E, Ferrarini A, Benazzo A, Biagetti E, Klie S, Minio A, Rau D, Rodriguez M, Panziera A, Venturini L, Attene G, Albertini E, Jackson SA, Nanni L, Fernie AR, Nikoloski Z, Bertorelle G, Delledonne M, Papa R. Decreased Nucleotide and Expression Diversity and Modified Coexpression Patterns Characterize Domestication in the Common Bean. THE PLANT CELL 2014; 26:1901-1912. [PMID: 24850850 PMCID: PMC4079357 DOI: 10.1105/tpc.114.124040] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 04/15/2014] [Accepted: 04/29/2014] [Indexed: 05/02/2023]
Abstract
Using RNA sequencing technology and de novo transcriptome assembly, we compared representative sets of wild and domesticated accessions of common bean (Phaseolus vulgaris) from Mesoamerica. RNA was extracted at the first true-leaf stage, and de novo assembly was used to develop a reference transcriptome; the final data set consists of ∼190,000 single nucleotide polymorphisms from 27,243 contigs in expressed genomic regions. A drastic reduction in nucleotide diversity (∼60%) is evident for the domesticated form, compared with the wild form, and almost 50% of the contigs that are polymorphic were brought to fixation by domestication. In parallel, the effects of domestication decreased the diversity of gene expression (18%). While the coexpression networks for the wild and domesticated accessions demonstrate similar seminal network properties, they show distinct community structures that are enriched for different molecular functions. After simulating the demographic dynamics during domestication, we found that 9% of the genes were actively selected during domestication. We also show that selection induced a further reduction in the diversity of gene expression (26%) and was associated with 5-fold enrichment of differentially expressed genes. While there is substantial evidence of positive selection associated with domestication, in a few cases, this selection has increased the nucleotide diversity in the domesticated pool at target loci associated with abiotic stress responses, flowering time, and morphology.
Collapse
Affiliation(s)
- Elisa Bellucci
- Department of Agricultural, Food, and Environmental Sciences, Università Politecnica delle Marche, 60131 Ancona, Italy
| | - Elena Bitocchi
- Department of Agricultural, Food, and Environmental Sciences, Università Politecnica delle Marche, 60131 Ancona, Italy
| | - Alberto Ferrarini
- Department of Biotechnology, University of Verona, 37134 Verona, Italy
| | - Andrea Benazzo
- Department of Life Sciences and Biotechnology, University of Ferrara, 44100 Ferrara, Italy
| | - Eleonora Biagetti
- Department of Agricultural, Food, and Environmental Sciences, Università Politecnica delle Marche, 60131 Ancona, Italy
| | - Sebastian Klie
- Genes and Small Molecules Group, Max Planck Institute of Molecular Plant Physiology, Potsdam-Golm 14476, Germany
| | - Andrea Minio
- Department of Biotechnology, University of Verona, 37134 Verona, Italy
| | - Domenico Rau
- Department of Agriculture, University of Sassari, 07100 Sassari, Italy
| | - Monica Rodriguez
- Department of Agriculture, University of Sassari, 07100 Sassari, Italy
| | - Alex Panziera
- Department of Life Sciences and Biotechnology, University of Ferrara, 44100 Ferrara, Italy Department of Biodiversity and Molecular Ecology, Fondazione Edmund Mach, 38010 S. Michele all'Adige, Italy
| | - Luca Venturini
- Department of Biotechnology, University of Verona, 37134 Verona, Italy
| | - Giovanna Attene
- Department of Agriculture, University of Sassari, 07100 Sassari, Italy
| | - Emidio Albertini
- Department of Applied Biology, University of Perugia, 06121 Perugia, Italy
| | - Scott A Jackson
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia 30602
| | - Laura Nanni
- Department of Agricultural, Food, and Environmental Sciences, Università Politecnica delle Marche, 60131 Ancona, Italy
| | - Alisdair R Fernie
- Central Metabolism Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Germany
| | - Zoran Nikoloski
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam-Golm 14476, Germany
| | - Giorgio Bertorelle
- Department of Life Sciences and Biotechnology, University of Ferrara, 44100 Ferrara, Italy
| | | | - Roberto Papa
- Department of Agricultural, Food, and Environmental Sciences, Università Politecnica delle Marche, 60131 Ancona, Italy Consiglio per la Ricerca e Sperimentazione in Agricoltura, Cereal Research Centre (CRA-CER), 71122 Foggia, Italy
| |
Collapse
|
12
|
Ma C, Xin M, Feldmann KA, Wang X. Machine learning-based differential network analysis: a study of stress-responsive transcriptomes in Arabidopsis. THE PLANT CELL 2014; 26:520-37. [PMID: 24520154 PMCID: PMC3967023 DOI: 10.1105/tpc.113.121913] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2013] [Revised: 12/13/2013] [Accepted: 01/10/2014] [Indexed: 05/18/2023]
Abstract
Machine learning (ML) is an intelligent data mining technique that builds a prediction model based on the learning of prior knowledge to recognize patterns in large-scale data sets. We present an ML-based methodology for transcriptome analysis via comparison of gene coexpression networks, implemented as an R package called machine learning-based differential network analysis (mlDNA) and apply this method to reanalyze a set of abiotic stress expression data in Arabidopsis thaliana. The mlDNA first used a ML-based filtering process to remove nonexpressed, constitutively expressed, or non-stress-responsive "noninformative" genes prior to network construction, through learning the patterns of 32 expression characteristics of known stress-related genes. The retained "informative" genes were subsequently analyzed by ML-based network comparison to predict candidate stress-related genes showing expression and network differences between control and stress networks, based on 33 network topological characteristics. Comparative evaluation of the network-centric and gene-centric analytic methods showed that mlDNA substantially outperformed traditional statistical testing-based differential expression analysis at identifying stress-related genes, with markedly improved prediction accuracy. To experimentally validate the mlDNA predictions, we selected 89 candidates out of the 1784 predicted salt stress-related genes with available SALK T-DNA mutagenesis lines for phenotypic screening and identified two previously unreported genes, mutants of which showed salt-sensitive phenotypes.
Collapse
|
13
|
Hansen BO, Vaid N, Musialak-Lange M, Janowski M, Mutwil M. Elucidating gene function and function evolution through comparison of co-expression networks of plants. FRONTIERS IN PLANT SCIENCE 2014; 5:394. [PMID: 25191328 PMCID: PMC4137175 DOI: 10.3389/fpls.2014.00394] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2014] [Accepted: 07/23/2014] [Indexed: 05/20/2023]
Abstract
The analysis of gene expression data has shown that transcriptionally coordinated (co-expressed) genes are often functionally related, enabling scientists to use expression data in gene function prediction. This Focused Review discusses our original paper (Large-scale co-expression approach to dissect secondary cell wall formation across plant species, Frontiers in Plant Science 2:23). In this paper we applied cross-species analysis to co-expression networks of genes involved in cellulose biosynthesis. We showed that the co-expression networks from different species are highly similar, indicating that whole biological pathways are conserved across species. This finding has two important implications. First, the analysis can transfer gene function annotation from well-studied plants, such as Arabidopsis, to other, uncharacterized plant species. As the analysis finds genes that have similar sequence and similar expression pattern across different organisms, functionally equivalent genes can be identified. Second, since co-expression analyses are often noisy, a comparative analysis should have higher performance, as parts of co-expression networks that are conserved are more likely to be functionally relevant. In this Focused Review, we outline the comparative analysis done in the original paper and comment on the recent advances and approaches that allow comparative analyses of co-function networks. We hypothesize that in comparison to simple co-expression analysis, comparative analysis would yield more accurate gene function predictions. Finally, by combining comparative analysis with genomic information of green plants, we propose a possible composition of cellulose biosynthesis machinery during earlier stages of plant evolution.
Collapse
|