1
|
Simões ASB, Borges MM, Grazina L, Nunes J. Stone Pine ( Pinus pinea L.) High-Added-Value Genetics: An Overview. Genes (Basel) 2024; 15:84. [PMID: 38254973 PMCID: PMC10815827 DOI: 10.3390/genes15010084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 01/05/2024] [Accepted: 01/08/2024] [Indexed: 01/24/2024] Open
Abstract
Stone pine (Pinus pinea L.) has received limited attention in terms of genetic research. However, genomic techniques hold promise for decoding the stone pine genome and contributing to developing a more resilient bioeconomy. Retrotransposon and specific genetic markers are effective tools for determining population-specific genomic diversity. Studies on the transcriptome and proteome have identified differentially expressed genes PAS1, CLV1, ATAF1, and ACBF involved in shoot bud formation. The stone pine proteome shows variation among populations and shows the industrial potential of the enzyme pinosylvin. Microsatellite studies have revealed low levels of polymorphism and a unique genetic diversity in stone pine, which may contribute to its environmental adaptation. Transcriptomic and proteomic analyses uncover the genetic and molecular responses of stone pine to fungal infections and nematode infestations, elucidating the defense activation, gene regulation, and the potential role of terpenes in pathogen resistance. Transcriptomics associated with carbohydrate metabolism, dehydrins, and transcription factors show promise as targets for improving stone pine's drought stress response and water retention capabilities. Stone pine presents itself as an important model tree for studying climate change adaptation due to its characteristics. While knowledge gaps exist, stone pine's genetic resources hold significant potential, and ongoing advancements in techniques offer prospects for future exploration.
Collapse
Affiliation(s)
- Ana Sofia B. Simões
- Association BLC3–Technology and Innovation Campus, Centre Bio R&D Unit, Rua Nossa Senhora da Conceição 2, Lagares da Beira, 3405-155 Oliveira do Hospital, Portugal; (M.M.B.); (L.G.); (J.N.)
| | - Margarida Machado Borges
- Association BLC3–Technology and Innovation Campus, Centre Bio R&D Unit, Rua Nossa Senhora da Conceição 2, Lagares da Beira, 3405-155 Oliveira do Hospital, Portugal; (M.M.B.); (L.G.); (J.N.)
| | - Liliana Grazina
- Association BLC3–Technology and Innovation Campus, Centre Bio R&D Unit, Rua Nossa Senhora da Conceição 2, Lagares da Beira, 3405-155 Oliveira do Hospital, Portugal; (M.M.B.); (L.G.); (J.N.)
| | - João Nunes
- Association BLC3–Technology and Innovation Campus, Centre Bio R&D Unit, Rua Nossa Senhora da Conceição 2, Lagares da Beira, 3405-155 Oliveira do Hospital, Portugal; (M.M.B.); (L.G.); (J.N.)
- BLC3 Evolution Lda, 3405-155 Oliveira do Hospital, Portugal
| |
Collapse
|
2
|
de Miguel M, Rodríguez-Quilón I, Heuertz M, Hurel A, Grivet D, Jaramillo-Correa JP, Vendramin GG, Plomion C, Majada J, Alía R, Eckert AJ, González-Martínez SC. Polygenic adaptation and negative selection across traits, years and environments in a long-lived plant species (Pinus pinaster Ait., Pinaceae). Mol Ecol 2022; 31:2089-2105. [PMID: 35075727 DOI: 10.1111/mec.16367] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 11/30/2021] [Accepted: 01/11/2022] [Indexed: 11/26/2022]
Abstract
A decade of genetic association studies in multiple organisms suggests that most complex traits are polygenic, i.e., they have a genetic architecture determined by numerous loci each with small effect-size. Thus, determining the degree of polygenicity and its variation across traits, environments and time is crucial to understand the genetic basis of phenotypic variation. We applied multilocus approaches to estimate the degree of polygenicity of fitness-related traits in a long-lived plant (Pinus pinaster Ait., maritime pine) and to analyze this variation across environments and years. We evaluated five categories of fitness-related traits (survival, height, phenology, functional, and biotic-stress response traits) in a clonal common-garden network, planted in contrasted environments (over 12,500 trees). Most of the analyzed traits showed evidence of local adaptation based on Qst -Fst comparisons. We further observed a remarkably stable degree of polygenicity, averaging 6% (range of 0-27%), across traits, environments and years. We detected evidence of negative selection, which could explain, at least partially, the high degree of polygenicity. Because polygenic adaptation can occur rapidly, our results suggest that current predictions on the capacity of natural forest tree populations to adapt to new environments should be revised, especially in the current context of climate change.
Collapse
Affiliation(s)
- Marina de Miguel
- INRAE, Univ. Bordeaux, BIOGECO, F-33610, Cestas, France.,EGFV, Univ. Bordeaux, Bordeaux Sciences Agro, INRAE, ISVV, F-33882, Villenave d'Ornon, France
| | - Isabel Rodríguez-Quilón
- Department of Forest Ecology and Genetics, Forest Research Centre, INIA, Carretera de la Coruña km 7.5, 28040, Madrid, Spain
| | | | - Agathe Hurel
- INRAE, Univ. Bordeaux, BIOGECO, F-33610, Cestas, France
| | - Delphine Grivet
- Department of Forest Ecology and Genetics, Forest Research Centre, INIA, Carretera de la Coruña km 7.5, 28040, Madrid, Spain
| | - Juan-Pablo Jaramillo-Correa
- Department of Evolutionary Ecology, Institute of Ecology, Universidad Nacional Autónoma de México, AP 70-275, México City, CDMX 04510, Mexico
| | - Giovanni G Vendramin
- Institute of Biosciences and Bioresources, Division of Florence, National Research Council, 50019, Sesto Fiorentino (FI), Italy
| | | | - Juan Majada
- Sección Forestal, SERIDA, Finca Experimental ''La Mata'', 33820, Grado, Principado de Asturias, Spain
| | - Ricardo Alía
- EGFV, Univ. Bordeaux, Bordeaux Sciences Agro, INRAE, ISVV, F-33882, Villenave d'Ornon, France
| | - Andrew J Eckert
- Department of Biology, Virginia Commonwealth University, Richmond, VA, 23284, USA
| | | |
Collapse
|
3
|
Olsson S, Lorenzo Z, Zabal-Aguirre M, Piotti A, Vendramin GG, González-Martínez SC, Grivet D. Evolutionary history of the mediterranean Pinus halepensis-brutia species complex using gene-resequencing and transcriptomic approaches. PLANT MOLECULAR BIOLOGY 2021; 106:367-380. [PMID: 33934278 DOI: 10.1007/s11103-021-01155-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 04/22/2021] [Indexed: 06/12/2023]
Abstract
Complementary gene-resequencing and transcriptomic approaches reveal contrasted evolutionary histories in a species complex. Pinus halepensis and Pinus brutia are closely related species that can intercross, but occupy different geographical ranges and bioclimates. To study the evolution of this species complex and to provide genomic resources for further research, we produce and analyze two new complementary sets of genetic resources: (i) a set of 172 re-sequenced genomic target loci analyzed in 45 individuals, and (ii) a set of 11 transcriptome assemblies. These two datasets provide insights congruent with previous studies: P. brutia displays high level of genetic diversity and no genetic sub-structure, while P. halepensis shows three main genetic clusters, the western Mediterranean and North African clusters displaying much lower genetic diversity than the eastern Mediterranean cluster, the latter cluster having similar genetic diversity to P. brutia. In addition, these datasets provide new insights on the timing of the species-complex history: the two species would have split at the end of the tertiary, and the changing climatic conditions of the Mediterranean region at the end of the Tertiary-beginning of the Quaternary, together with the distinct species tolerance to harsh climatic conditions would have resulted in different geographic distributions, demographic histories and genetic patterns of the two pines. The multiple glacial-interglacial cycles during the Quaternary would have led to the expansion of P. brutia in the Middle East, while P. halepensis would have been through bottlenecks. The last glaciations, from 0.6 Mya on, would have affected further the Western genetic pool of P. halepensis.
Collapse
Affiliation(s)
- Sanna Olsson
- Department of Forest Ecology & Genetics, Forest Research Centre, INIA-CSIC, Carretera de la Coruña km 7.5, 28040, Madrid, Spain.
| | - Zaida Lorenzo
- Department of Forest Ecology & Genetics, Forest Research Centre, INIA-CSIC, Carretera de la Coruña km 7.5, 28040, Madrid, Spain
| | - Mario Zabal-Aguirre
- Department of Forest Ecology & Genetics, Forest Research Centre, INIA-CSIC, Carretera de la Coruña km 7.5, 28040, Madrid, Spain
| | - Andrea Piotti
- Institute of Biosciences and Bioresources, Division of Florence, National Research Council, 50019, Sesto Fiorentino, Florence, Italy
| | - Giovanni G Vendramin
- Institute of Biosciences and Bioresources, Division of Florence, National Research Council, 50019, Sesto Fiorentino, Florence, Italy
| | - Santiago C González-Martínez
- UMR BIOGECO, INRAE, University of Bordeaux, 33610, Cestas, France
- Sustainable Forest Management Research Institute, INIA - University of Valladolid, Avda. Madrid 44, 34004, Palencia, Spain
| | - Delphine Grivet
- Department of Forest Ecology & Genetics, Forest Research Centre, INIA-CSIC, Carretera de la Coruña km 7.5, 28040, Madrid, Spain.
- Sustainable Forest Management Research Institute, INIA - University of Valladolid, Avda. Madrid 44, 34004, Palencia, Spain.
| |
Collapse
|
4
|
Seoane P, Espigares M, Carmona R, Polonio Á, Quintana J, Cretazzo E, Bota J, Pérez-García A, Dios Alché JD, Gómez L, Claros MG. TransFlow: a modular framework for assembling and assessing accurate de novo transcriptomes in non-model organisms. BMC Bioinformatics 2018; 19:416. [PMID: 30453874 PMCID: PMC6245506 DOI: 10.1186/s12859-018-2384-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND The advances in high-throughput sequencing technologies are allowing more and more de novo assembling of transcriptomes from many new organisms. Some degree of automation and evaluation is required to warrant reproducibility, repetitivity and the selection of the best possible transcriptome. Workflows and pipelines are becoming an absolute requirement for such a purpose, but the issue of assembling evaluation for de novo transcriptomes in organisms lacking a sequenced genome remains unsolved. An automated, reproducible and flexible framework called TransFlow to accomplish this task is described. RESULTS TransFlow with its five independent modules was designed to build different workflows depending on the nature of the original reads. This architecture enables different combinations of Illumina and Roche/454 sequencing data, and can be extended to other sequencing platforms. Its capabilities are illustrated with the selection of reliable plant reference transcriptomes and the assembling six transcriptomes (three case studies for grapevine leaves, olive tree pollen, and chestnut stem, and other three for haustorium, epiphytic structures and their combination for the phytopathogenic fungus Podosphaera xanthii). Arabidopsis and poplar transcriptomes revealed to be the best references. A common result regarding de novo assemblies is that Illumina paired-end reads of 100 nt in length assembled with OASES can provide reliable transcriptomes, while the contribution of longer reads is noticeable only when they complement a set of short, single-reads. CONCLUSIONS TransFlow can handle up to 181 different assembling strategies. Evaluation based on principal component analyses allows its self-adaptation to different sets of reads to provide a suitable transcriptome for each combination of reads and assemblers. As a result, each case study has its own behaviour, prioritises evaluation parameters, and gives an objective and automated way for detecting the best transcriptome within a pool of them. Sequencing data type and quantity (preferably several hundred millions of 2×100 nt or longer), assemblers (OASES for Illumina, MIRA4 and EULER-SR reconciled with CAP3 for Roche/454) and strategy (preferably scaffolding with OASES, and probably merging with Roche/454 when available) arise as the most impacting factors.
Collapse
Affiliation(s)
- Pedro Seoane
- Departmento de Biología Molecular y Bioquímica, Universidad de Málaga, Campus de Teatinos s/n, Malaga, 29071 Spain
| | - Marina Espigares
- Departmento de Biología Molecular y Bioquímica, Universidad de Málaga, Campus de Teatinos s/n, Malaga, 29071 Spain
| | - Rosario Carmona
- Plant Reproductive Biology Laboratory, Department of Biochemistry, Cell and Molecular Biology of Plants. Estación Experimental del Zaidín. CSIC, Prof. Albareda, 1, Granada, 18160 Spain
| | - Álvaro Polonio
- Departamento de Microbiología, and Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora”, Universidad de Málaga, Consejo Superior de Investigaciones Científicas (IHSM-UMA-CSIC), Campus de Teatinos s/n, Malaga, 29071 Spain
| | - Julia Quintana
- Department of Chemistry and Biochemistry, Worcester Polytechnic Institute, 100 Institute Road, Worcester, MA, 01609-2280 USA
| | - Enrico Cretazzo
- Instituto Andaluz de Investigación y Formación Agraria (IFAPA), Centro de Churriana, Cortijo de la Cruz s/n, Churriana, 29140 Spain
| | - Josefina Bota
- Grup de Recerca en Biologia de les Plantes en Condicions Mediterrànies, Departament de Biologia, Universitat de les Illes Balears, Carretera de Valldemossa, km 7.5, Palma de Mallorca, 07122 Spain
| | - Alejandro Pérez-García
- Departamento de Microbiología, and Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora”, Universidad de Málaga, Consejo Superior de Investigaciones Científicas (IHSM-UMA-CSIC), Campus de Teatinos s/n, Malaga, 29071 Spain
| | - Juan de Dios Alché
- Plant Reproductive Biology Laboratory, Department of Biochemistry, Cell and Molecular Biology of Plants. Estación Experimental del Zaidín. CSIC, Prof. Albareda, 1, Granada, 18160 Spain
| | - Luis Gómez
- Departamento de Sistemas y Recursos Naturales, ETSI Forestal, de Montes y del Medio Natural, Universidad Politécnica de Madrid, Ciudad Universitaria, Madrid, 28040 Spain
- CBGP, INIA-Universidad Politécnica de Madrid, Campus de Montegancedo, Pozuelo de Alarcón, 28223 Spain
| | - M. Gonzalo Claros
- Departmento de Biología Molecular y Bioquímica, Universidad de Málaga, Campus de Teatinos s/n, Malaga, 29071 Spain
| |
Collapse
|
5
|
Telfer E, Graham N, Macdonald L, Sturrock S, Wilcox P, Stanbra L. Approaches to variant discovery for conifer transcriptome sequencing. PLoS One 2018; 13:e0205835. [PMID: 30395612 PMCID: PMC6218030 DOI: 10.1371/journal.pone.0205835] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2018] [Accepted: 10/02/2018] [Indexed: 12/30/2022] Open
Abstract
There is a wide diversity of bioinformatic tools available for the assembly of next generation sequence and subsequence variant calling to identify genetic markers at scale. Integration of genomics tools such as genomic selection, association studies, pedigree analysis and analysis of genetic diversity, into operational breeding is a goal for New Zealand’s most widely planted exotic tree species, Pinus radiata. In the absence of full reference genomes for large megagenomes such as in conifers, RNA sequencing in a range of genotypes and tissue types, offers a rich source of genetic markers for downstream application. We compared nine different assembler and variant calling software combinations in a single transcriptomic library and found that Single Nucleotide Polymorphism (SNPs) discovery could vary by as much as an order of magnitude (8,061 SNPs up to 86,815 SNPs). The assembler with the best realignment of the packages trialled, Trinity, in combination with several variant callers was then applied to a much larger multi-genotype, multi-tissue transcriptome and identified 683,135 in silico SNPs across a predicted 449,951 exons when mapped to the Pinus taeda ver 1.01e reference.
Collapse
Affiliation(s)
- Emily Telfer
- New Zealand Forest Research Institute LTD. trading as Scion, Rotorua, New Zealand
- * E-mail:
| | - Natalie Graham
- New Zealand Forest Research Institute LTD. trading as Scion, Rotorua, New Zealand
| | - Lucy Macdonald
- New Zealand Forest Research Institute LTD. trading as Scion, Rotorua, New Zealand
| | - Shane Sturrock
- New Zealand Forest Research Institute LTD. trading as Scion, Rotorua, New Zealand
- Real Time Genomics, Hamilton, New Zealand
| | - Phillip Wilcox
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand
| | - Lisa Stanbra
- New Zealand Forest Research Institute LTD. trading as Scion, Rotorua, New Zealand
| |
Collapse
|
6
|
Zhao YJ, Cao Y, Wang J, Xiong Z. Transcriptome sequencing of Pinus kesiya var. langbianensis and comparative analysis in the Pinus phylogeny. BMC Genomics 2018; 19:725. [PMID: 30285615 PMCID: PMC6171231 DOI: 10.1186/s12864-018-5127-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Accepted: 09/27/2018] [Indexed: 11/10/2022] Open
Abstract
Background Pines are widely distributed in the Northern Hemisphere and have a long evolutionary history. The availability of transcriptome data has facilitated comparative transcriptomics for studying the evolutionary patterns associated with the different geographical distributions of species in the Pinus phylogeny. Results The transcriptome of Pinus kesiya var. langbianensis was sequenced using the Illumina HiSeq 2000 platform, and a total of 68,881 unigenes were assembled by Trinity. Transcriptome sequences of another 12 conifer species were downloaded from public databases. All of the pairwise orthologues were identified by comparative transcriptome analysis in 13 conifer species, from which the rate of diversification was calculated and a phylogenetic tree inferred. All of the fast-evolving positive selection sequences were identified, and some salt-, drought-, and abscisic acid-resistance genes were discovered. Conclusions mRNA sequences of P. kesiya var. langbianensis were obtained by transcriptome sequencing, and a large number of simple sequence repeat and short nucleotide polymorphism loci were detected. These data can be used in molecular marker-assisted selected in pine breeding. Divergence times were estimated in the 13 conifer species using comparative transcriptomic analysis. A number of positive selection genes were found to be related to environmental factors. Salt- and abscisic acid-related genes exhibited different selection patterns between coastal and inland Pinus. Our findings help elucidate speciation patterns in the Pinus lineage. Electronic supplementary material The online version of this article (10.1186/s12864-018-5127-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- You-Jie Zhao
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, Southwest Forestry University, Kunming, 650224, Yunnan, People's Republic of China.,College of Big data and Intelligent Engineering, Southwest Forestry University, Kunming, 650224, Yunnan, People's Republic of China
| | - Yong Cao
- College of Big data and Intelligent Engineering, Southwest Forestry University, Kunming, 650224, Yunnan, People's Republic of China
| | - Juan Wang
- Eco-development Academy, Southwest Forestry University, Kunming, 650224, Yunnan, People's Republic of China
| | - Zhi Xiong
- College of Light industry and Food, Southwest Forestry University, Kunming, 650224, Yunnan, People's Republic of China.
| |
Collapse
|
7
|
Schenck CA, Holland CK, Schneider MR, Men Y, Lee SG, Jez JM, Maeda HA. Molecular basis of the evolution of alternative tyrosine biosynthetic routes in plants. Nat Chem Biol 2017; 13:1029-1035. [PMID: 28671678 DOI: 10.1038/nchembio.2414] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 04/11/2017] [Indexed: 11/09/2022]
Abstract
L-Tyrosine (Tyr) is essential for protein synthesis and is a precursor of numerous specialized metabolites crucial for plant and human health. Tyr can be synthesized via two alternative routes by different key regulatory TyrA family enzymes, prephenate dehydrogenase (PDH, also known as TyrAp) or arogenate dehydrogenase (ADH, also known as TyrAa), representing a unique divergence of primary metabolic pathways. The molecular foundation underlying the evolution of these alternative Tyr pathways is currently unknown. Here we characterized recently diverged plant PDH and ADH enzymes, obtained the X-ray crystal structure of soybean PDH, and identified a single amino acid residue that defines TyrA substrate specificity and regulation. Structures of mutated PDHs co-crystallized with Tyr indicate that substitutions of Asn222 confer ADH activity and Tyr sensitivity. Reciprocal mutagenesis of the corresponding residue in divergent plant ADHs further introduced PDH activity and relaxed Tyr sensitivity, highlighting the critical role of this residue in TyrA substrate specificity that underlies the evolution of alternative Tyr biosynthetic pathways in plants.
Collapse
Affiliation(s)
- Craig A Schenck
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Cynthia K Holland
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Matthew R Schneider
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Yusen Men
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Soon Goo Lee
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Joseph M Jez
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Hiroshi A Maeda
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, USA
| |
Collapse
|
8
|
Pfannebecker KC, Lange M, Rupp O, Becker A. An Evolutionary Framework for Carpel Developmental Control Genes. Mol Biol Evol 2017; 34:330-348. [PMID: 28049761 DOI: 10.1093/molbev/msw229] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Carpels are the female reproductive organs of flowering plants (angiosperms), enclose the ovules, and develop into fruits. The presence of carpels unites angiosperms, and they are suggested to be the most important autapomorphy of the angiosperms, e.g., they prevent inbreeding and allow efficient seed dispersal. Many transcriptional regulators and coregulators essential for carpel development are encoded by diverse gene families and well characterized in Arabidopsis thaliana. Among these regulators are AGAMOUS (AG), ETTIN (ETT), LEUNIG (LUG), SEUSS (SEU), SHORT INTERNODE/STYLISH (SHI/STY), and SEPALLATA1, 2, 3, 4 (SEP1, 2, 3, 4). However, the timing of the origin and their subsequent molecular evolution of these carpel developmental regulators are largely unknown. Here, we have sampled homologs of these carpel developmental regulators from the sequenced genomes of a wide taxonomic sampling of the land plants, such as Physcomitrella patens, Selaginella moellendorfii, Picea abies, and several angiosperms. Careful phylogenetic analyses were carried out that provide a phylogenetic background for the different gene families and provide minimal estimates for the ages of these developmental regulators. Our analyses and published work show that LUG-, SEU-, and SHI/STY-like genes were already present in the Most Recent Common Ancestor (MRCA) of all land plants, AG- and SEP-like genes were present in the MRCA of seed plants and their origin may coincide with the ξ Whole Genome Duplication. Our work shows that the carpel development regulatory network was, in part, recruited from preexisting network components that were present in the MRCA of angiosperms and modified to regulate gynoecium development.
Collapse
Affiliation(s)
- Kai C Pfannebecker
- Department of Biology and Chemistry, Institute of Botany, Justus-Liebig-University, Gießen, Germany
| | - Matthias Lange
- Department of Biology and Chemistry, Institute of Botany, Justus-Liebig-University, Gießen, Germany
| | - Oliver Rupp
- Department of Biology and Chemistry, Institute of Bioinformatics and Systems Biology, Justus-Liebig-University, Gießen, Germany
| | - Annette Becker
- Department of Biology and Chemistry, Institute of Botany, Justus-Liebig-University, Gießen, Germany
| |
Collapse
|
9
|
Prunier J, Verta JP, MacKay JJ. Conifer genomics and adaptation: at the crossroads of genetic diversity and genome function. THE NEW PHYTOLOGIST 2016; 209:44-62. [PMID: 26206592 DOI: 10.1111/nph.13565] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2015] [Accepted: 06/14/2015] [Indexed: 05/21/2023]
Abstract
Conifers have been understudied at the genomic level despite their worldwide ecological and economic importance but the situation is rapidly changing with the development of next generation sequencing (NGS) technologies. With NGS, genomics research has simultaneously gained in speed, magnitude and scope. In just a few years, genomes of 20-24 gigabases have been sequenced for several conifers, with several others expected in the near future. Biological insights have resulted from recent sequencing initiatives as well as genetic mapping, gene expression profiling and gene discovery research over nearly two decades. We review the knowledge arising from conifer genomics research emphasizing genome evolution and the genomic basis of adaptation, and outline emerging questions and knowledge gaps. We discuss future directions in three areas with potential inputs from NGS technologies: the evolutionary impacts of adaptation in conifers based on the adaptation-by-speciation model; the contributions of genetic variability of gene expression in adaptation; and the development of a broader understanding of genetic diversity and its impacts on genome function. These research directions promise to sustain research aimed at addressing the emerging challenges of adaptation that face conifer trees.
Collapse
Affiliation(s)
- Julien Prunier
- Centre for Forest Research and Institute for Systems and Integrative Biology, Université Laval, Quebec, QC, G1V 0A6, Canada
| | - Jukka-Pekka Verta
- Friedrich Miescher Laboratory of the Max Planck Society, Spemannstrasse 39, Tübingen, 72076, Germany
| | - John J MacKay
- Centre for Forest Research and Institute for Systems and Integrative Biology, Université Laval, Quebec, QC, G1V 0A6, Canada
| |
Collapse
|
10
|
Visser EA, Wegrzyn JL, Steenkmap ET, Myburg AA, Naidoo S. Combined de novo and genome guided assembly and annotation of the Pinus patula juvenile shoot transcriptome. BMC Genomics 2015; 16:1057. [PMID: 26652261 PMCID: PMC4676862 DOI: 10.1186/s12864-015-2277-7] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2015] [Accepted: 12/06/2015] [Indexed: 11/25/2022] Open
Abstract
Background Pines are the most important tree species to the international forestry industry, covering 42 % of the global industrial forest plantation area. One of the most pressing threats to cultivation of some pine species is the pitch canker fungus, Fusarium circinatum, which can have devastating effects in both the field and nursery. Investigation of the Pinus-F. circinatum host-pathogen interaction is crucial for development of effective disease management strategies. As with many non-model organisms, investigation of host-pathogen interactions in pine species is hampered by limited genomic resources. This was partially alleviated through release of the 22 Gbp Pinus taeda v1.01 genome sequence (http://pinegenome.org/pinerefseq/) in 2014. Despite the fact that the fragmented state of the genome may hamper comprehensive transcriptome analysis, it is possible to leverage the inherent redundancy resulting from deep RNA sequencing with Illumina short reads to assemble transcripts in the absence of a completed reference sequence. These data can then be integrated with available genomic data to produce a comprehensive transcriptome resource. The aim of this study was to provide a foundation for gene expression analysis of disease response mechanisms in Pinus patula through transcriptome assembly. Results Eighteen de novo and two reference based assemblies were produced for P. patula shoot tissue. For this purpose three transcriptome assemblers, Trinity, Velvet/OASES and SOAPdenovo-Trans, were used to maximise diversity and completeness of assembled transcripts. Redundancy in the assembly was reduced using the EvidentialGene pipeline. The resulting 52 Mb P. patula v1.0 shoot transcriptome consists of 52 112 unigenes, 60 % of which could be functionally annotated. Conclusions The assembled transcriptome will serve as a major genomic resource for future investigation of P. patula and represents the largest gene catalogue produced to date for this species. Furthermore, this assembly can help detect gene-based genetic markers for P. patula and the comparative assembly workflow could be applied to generate similar resources for other non-model species. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2277-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Erik A Visser
- Department of Genetics, Forestry and Agricultural Biotechnology Institute (FABI), Genomics Research Institute (GRI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa.
| | - Jill L Wegrzyn
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269, USA.
| | - Emma T Steenkmap
- Department of Microbiology and Plant Pathology, Forestry and Agricultural Biotechnology Institute (FABI), Genomics Research Institute (GRI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa.
| | - Alexander A Myburg
- Department of Genetics, Forestry and Agricultural Biotechnology Institute (FABI), Genomics Research Institute (GRI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa.
| | - Sanushka Naidoo
- Department of Genetics, Forestry and Agricultural Biotechnology Institute (FABI), Genomics Research Institute (GRI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa.
| |
Collapse
|
11
|
Carmona R, Zafra A, Seoane P, Castro AJ, Guerrero-Fernández D, Castillo-Castillo T, Medina-García A, Cánovas FM, Aldana-Montes JF, Navas-Delgado I, Alché JDD, Claros MG. ReprOlive: a database with linked data for the olive tree (Olea europaea L.) reproductive transcriptome. FRONTIERS IN PLANT SCIENCE 2015; 6:625. [PMID: 26322066 PMCID: PMC4531244 DOI: 10.3389/fpls.2015.00625] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2015] [Accepted: 07/28/2015] [Indexed: 05/18/2023]
Abstract
Plant reproductive transcriptomes have been analyzed in different species due to the agronomical and biotechnological importance of plant reproduction. Here we presented an olive tree reproductive transcriptome database with samples from pollen and pistil at different developmental stages, and leaf and root as control vegetative tissues http://reprolive.eez.csic.es). It was developed from 2,077,309 raw reads to 1,549 Sanger sequences. Using a pre-defined workflow based on open-source tools, sequences were pre-processed, assembled, mapped, and annotated with expression data, descriptions, GO terms, InterPro signatures, EC numbers, KEGG pathways, ORFs, and SSRs. Tentative transcripts (TTs) were also annotated with the corresponding orthologs in Arabidopsis thaliana from TAIR and RefSeq databases to enable Linked Data integration. It results in a reproductive transcriptome comprising 72,846 contigs with average length of 686 bp, of which 63,965 (87.8%) included at least one functional annotation, and 55,356 (75.9%) had an ortholog. A minimum of 23,568 different TTs was identified and 5,835 of them contain a complete ORF. The representative reproductive transcriptome can be reduced to 28,972 TTs for further gene expression studies. Partial transcriptomes from pollen, pistil, and vegetative tissues as control were also constructed. ReprOlive provides free access and download capability to these results. Retrieval mechanisms for sequences and transcript annotations are provided. Graphical localization of annotated enzymes into KEGG pathways is also possible. Finally, ReprOlive has included a semantic conceptualisation by means of a Resource Description Framework (RDF) allowing a Linked Data search for extracting the most updated information related to enzymes, interactions, allergens, structures, and reactive oxygen species.
Collapse
Affiliation(s)
- Rosario Carmona
- Department of Biochemistry, Cell and Molecular Biology of Plants, Estación Experimental del Zaidín, Consejo Superior de Investigaciones CientíficasGranada, Spain
- Plataforma Andaluza de Bioinformática, Edificio de Bioinnovación, Universidad de MálagaMálaga, Spain
| | - Adoración Zafra
- Department of Biochemistry, Cell and Molecular Biology of Plants, Estación Experimental del Zaidín, Consejo Superior de Investigaciones CientíficasGranada, Spain
| | - Pedro Seoane
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Universidad de MálagaMálaga, Spain
| | - Antonio J. Castro
- Department of Biochemistry, Cell and Molecular Biology of Plants, Estación Experimental del Zaidín, Consejo Superior de Investigaciones CientíficasGranada, Spain
| | - Darío Guerrero-Fernández
- Plataforma Andaluza de Bioinformática, Edificio de Bioinnovación, Universidad de MálagaMálaga, Spain
| | | | - Ana Medina-García
- Departamento de Lenguajes y Ciencias de la Computación, Universidad de MálagaMálaga, Spain
| | - Francisco M. Cánovas
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Universidad de MálagaMálaga, Spain
| | - José F. Aldana-Montes
- Departamento de Lenguajes y Ciencias de la Computación, Universidad de MálagaMálaga, Spain
| | - Ismael Navas-Delgado
- Departamento de Lenguajes y Ciencias de la Computación, Universidad de MálagaMálaga, Spain
| | - Juan de Dios Alché
- Department of Biochemistry, Cell and Molecular Biology of Plants, Estación Experimental del Zaidín, Consejo Superior de Investigaciones CientíficasGranada, Spain
| | - M. Gonzalo Claros
- Plataforma Andaluza de Bioinformática, Edificio de Bioinnovación, Universidad de MálagaMálaga, Spain
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Universidad de MálagaMálaga, Spain
- *Correspondence: M. Gonzalo Claros, Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Universidad de Málaga, Campus de Teatinos, 29071 Málaga, Spain,
| |
Collapse
|
12
|
Benzekri H, Armesto P, Cousin X, Rovira M, Crespo D, Merlo MA, Mazurais D, Bautista R, Guerrero-Fernández D, Fernandez-Pozo N, Ponce M, Infante C, Zambonino JL, Nidelet S, Gut M, Rebordinos L, Planas JV, Bégout ML, Claros MG, Manchado M. De novo assembly, characterization and functional annotation of Senegalese sole (Solea senegalensis) and common sole (Solea solea) transcriptomes: integration in a database and design of a microarray. BMC Genomics 2014; 15:952. [PMID: 25366320 PMCID: PMC4232633 DOI: 10.1186/1471-2164-15-952] [Citation(s) in RCA: 65] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2014] [Accepted: 10/15/2014] [Indexed: 12/26/2022] Open
Abstract
Background Senegalese sole (Solea senegalensis) and common sole (S. solea) are two economically and evolutionary important flatfish species both in fisheries and aquaculture. Although some genomic resources and tools were recently described in these species, further sequencing efforts are required to establish a complete transcriptome, and to identify new molecular markers. Moreover, the comparative analysis of transcriptomes will be useful to understand flatfish evolution. Results A comprehensive characterization of the transcriptome for each species was carried out using a large set of Illumina data (more than 1,800 millions reads for each sole species) and 454 reads (more than 5 millions reads only in S. senegalensis), providing coverages ranging from 1,384x to 2,543x. After a de novo assembly, 45,063 and 38,402 different transcripts were obtained, comprising 18,738 and 22,683 full-length cDNAs in S. senegalensis and S. solea, respectively. A reference transcriptome with the longest unique transcripts and putative non-redundant new transcripts was established for each species. A subset of 11,953 reference transcripts was qualified as highly reliable orthologs (>97% identity) between both species. A small subset of putative species-specific, lineage-specific and flatfish-specific transcripts were also identified. Furthermore, transcriptome data permitted the identification of single nucleotide polymorphisms and simple-sequence repeats confirmed by FISH to be used in further genetic and expression studies. Moreover, evidences on the retention of crystallins crybb1, crybb1-like and crybb3 in the two species of soles are also presented. Transcriptome information was applied to the design of a microarray tool in S. senegalensis that was successfully tested and validated by qPCR. Finally, transcriptomic data were hosted and structured at SoleaDB. Conclusions Transcriptomes and molecular markers identified in this study represent a valuable source for future genomic studies in these economically important species. Orthology analysis provided new clues regarding sole genome evolution indicating a divergent evolution of crystallins in flatfish. The design of a microarray and establishment of a reference transcriptome will be useful for large-scale gene expression studies. Moreover, the integration of transcriptomic data in the SoleaDB will facilitate the management of genomic information in these important species. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-952) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Manuel Manchado
- IFAPA Centro El Toruño, IFAPA, Consejeria de Agricultura y Pesca, 11500 El Puerto de Santa María, Cádiz, Spain.
| |
Collapse
|
13
|
Cañas RA, Canales J, Gómez-Maldonado J, Ávila C, Cánovas FM. Transcriptome analysis in maritime pine using laser capture microdissection and 454 pyrosequencing. TREE PHYSIOLOGY 2014; 34:1278-88. [PMID: 24391165 DOI: 10.1093/treephys/tpt113] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Maritime pine (Pinus pinaster Aiton) is one of the most advanced conifer models for genomics research. Conifer genomes are extremely large and major advances have recently been made in the characterization of transcriptomes. The combination of laser capture microdissection (LCM) and next-generation sequencing is a powerful tool with which to resolve the entire transcriptome of specific cell types and tissues. In the current work, we have developed a protocol for transcriptomic analyses of conifer tissue types using LCM and 454 pyrosequencing. Tissue sections were isolated using non-fixed flash-frozen samples processed by LCM. Complementary DNA synthesis and amplification from tiny amounts of total RNA from LCM samples was performed using an adapted protocol for C: onifer R: NA A: mplification (CRA+). The cDNA amplification yield and cDNA quality provided by CRA+ were adequate for 454 pyrosequencing. Furthermore, read length and quality results of the 454 runs were near the optimal parameters considered by Roche for transcriptome sequencing. Using the CRA+ protocol, non-specific amplifications were prevented, problems derived from poly(A:T) tails in the 454 sequencing technology were reduced, and read length and read number considerably enhanced. This technical approach will facilitate global gene expression analysis in individual tissues of conifers and may also be applied to other plant species.
Collapse
Affiliation(s)
- Rafael A Cañas
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Instituto Andaluz de Biotecnología, Universidad de Málaga, Campus Universitario de Teatinos s/n, Málaga 29071, Spain
| | - Javier Canales
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Instituto Andaluz de Biotecnología, Universidad de Málaga, Campus Universitario de Teatinos s/n, Málaga 29071, Spain
| | - Josefa Gómez-Maldonado
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Instituto Andaluz de Biotecnología, Universidad de Málaga, Campus Universitario de Teatinos s/n, Málaga 29071, Spain
| | - Concepción Ávila
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Instituto Andaluz de Biotecnología, Universidad de Málaga, Campus Universitario de Teatinos s/n, Málaga 29071, Spain
| | - Francisco M Cánovas
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Instituto Andaluz de Biotecnología, Universidad de Málaga, Campus Universitario de Teatinos s/n, Málaga 29071, Spain
| |
Collapse
|
14
|
Canales J, Bautista R, Label P, Gómez-Maldonado J, Lesur I, Fernández-Pozo N, Rueda-López M, Guerrero-Fernández D, Castro-Rodríguez V, Benzekri H, Cañas RA, Guevara MA, Rodrigues A, Seoane P, Teyssier C, Morel A, Ehrenmann F, Le Provost G, Lalanne C, Noirot C, Klopp C, Reymond I, García-Gutiérrez A, Trontin JF, Lelu-Walter MA, Miguel C, Cervera MT, Cantón FR, Plomion C, Harvengt L, Avila C, Gonzalo Claros M, Cánovas FM. De novo assembly of maritime pine transcriptome: implications for forest breeding and biotechnology. PLANT BIOTECHNOLOGY JOURNAL 2014; 12:286-99. [PMID: 24256179 DOI: 10.1111/pbi.12136] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2013] [Revised: 09/24/2013] [Accepted: 09/26/2013] [Indexed: 05/21/2023]
Abstract
Maritime pine (Pinus pinasterAit.) is a widely distributed conifer species in Southwestern Europe and one of the most advanced models for conifer research. In the current work, comprehensive characterization of the maritime pine transcriptome was performed using a combination of two different next-generation sequencing platforms, 454 and Illumina. De novo assembly of the transcriptome provided a catalogue of 26 020 unique transcripts in maritime pine trees and a collection of 9641 full-length cDNAs. Quality of the transcriptome assembly was validated by RT-PCR amplification of selected transcripts for structural and regulatory genes. Transcription factors and enzyme-encoding transcripts were annotated. Furthermore, the available sequencing data permitted the identification of polymorphisms and the establishment of robust single nucleotide polymorphism (SNP) and simple-sequence repeat (SSR) databases for genotyping applications and integration of translational genomics in maritime pine breeding programmes. All our data are freely available at SustainpineDB, the P. pinaster expressional database. Results reported here on the maritime pine transcriptome represent a valuable resource for future basic and applied studies on this ecological and economically important pine species.
Collapse
Affiliation(s)
- Javier Canales
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Universidad de Málaga, Málaga, Spain
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Wegrzyn JL, Liechty JD, Stevens KA, Wu LS, Loopstra CA, Vasquez-Gross HA, Dougherty WM, Lin BY, Zieve JJ, Martínez-García PJ, Holt C, Yandell M, Zimin AV, Yorke JA, Crepeau MW, Puiu D, Salzberg SL, de Jong PJ, Mockaitis K, Main D, Langley CH, Neale DB. Unique features of the loblolly pine (Pinus taeda L.) megagenome revealed through sequence annotation. Genetics 2014; 196:891-909. [PMID: 24653211 PMCID: PMC3948814 DOI: 10.1534/genetics.113.159996] [Citation(s) in RCA: 129] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2013] [Accepted: 12/13/2013] [Indexed: 01/08/2023] Open
Abstract
The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of their genomes (∼20-40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of loblolly pine (Pinus taeda L.), which comprises 20.1 Gb of sequence. The MAKER-P annotation pipeline combined evidence-based alignments and ab initio predictions to generate 50,172 gene models, of which 15,653 are classified as high confidence. Clustering these gene models with 13 other plant species resulted in 20,646 gene families, of which 1554 are predicted to be unique to conifers. Among the conifer gene families, 159 are composed exclusively of loblolly pine members. The gene models for loblolly pine have the highest median and mean intron lengths of 24 fully sequenced plant genomes. Conifer genomes are full of repetitive DNA, with the most significant contributions from long-terminal-repeat retrotransposons. In depth analysis of the tandem and interspersed repetitive content yielded a combined estimate of 82%.
Collapse
Affiliation(s)
- Jill L. Wegrzyn
- Department of Plant Sciences, University of California, Davis, California 95616
| | - John D. Liechty
- Department of Plant Sciences, University of California, Davis, California 95616
| | - Kristian A. Stevens
- Department of Evolution and Ecology, University of California, Davis, California 95616
| | - Le-Shin Wu
- National Center for Genome Analysis Support, Indiana University, Bloomington, Indiana 47405
| | - Carol A. Loopstra
- Department of Ecosystem Science and Management, Texas A&M University, College Station, Texas 77843
| | | | - William M. Dougherty
- Department of Evolution and Ecology, University of California, Davis, California 95616
| | - Brian Y. Lin
- Department of Plant Sciences, University of California, Davis, California 95616
| | - Jacob J. Zieve
- Department of Plant Sciences, University of California, Davis, California 95616
| | | | - Carson Holt
- Department of Human Genetics, University of Utah, Salt Lake City, Utah 84112
| | - Mark Yandell
- Department of Human Genetics, University of Utah, Salt Lake City, Utah 84112
| | - Aleksey V. Zimin
- Institute for Physical Sciences and Technology, University of Maryland, College Park, Maryland 20742
| | - James A. Yorke
- Institute for Physical Sciences and Technology, University of Maryland, College Park, Maryland 20742
- Departments of Mathematics and Physics, University of Maryland, College Park, Maryland 20742
| | - Marc W. Crepeau
- Department of Evolution and Ecology, University of California, Davis, California 95616
| | - Daniela Puiu
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, The Johns Hopkins University, Baltimore, Maryland 21205
| | - Steven L. Salzberg
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, The Johns Hopkins University, Baltimore, Maryland 21205
| | - Pieter J. de Jong
- Children’s Hospital Oakland Research Institute, Oakland, California 94609
| | | | - Doreen Main
- Department of Horticulture, Washington State University, Pullman, Washington 99163
| | - Charles H. Langley
- Department of Evolution and Ecology, University of California, Davis, California 95616
| | - David B. Neale
- Department of Plant Sciences, University of California, Davis, California 95616
| |
Collapse
|
16
|
Mann IK, Wegrzyn JL, Rajora OP. Generation, functional annotation and comparative analysis of black spruce (Picea mariana) ESTs: an important conifer genomic resource. BMC Genomics 2013; 14:702. [PMID: 24119028 PMCID: PMC4007752 DOI: 10.1186/1471-2164-14-702] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Accepted: 10/08/2013] [Indexed: 12/01/2022] Open
Abstract
Background EST (expressed sequence tag) sequences and their annotation provide a highly valuable resource for gene discovery, genome sequence annotation, and other genomics studies that can be applied in genetics, breeding and conservation programs for non-model organisms. Conifers are long-lived plants that are ecologically and economically important globally, and have a large genome size. Black spruce (Picea mariana), is a transcontinental species of the North American boreal and temperate forests. However, there are limited transcriptomic and genomic resources for this species. The primary objective of our study was to develop a black spruce transcriptomic resource to facilitate on-going functional genomics projects related to growth and adaptation to climate change. Results We conducted bidirectional sequencing of cDNA clones from a standard cDNA library constructed from black spruce needle tissues. We obtained 4,594 high quality (2,455 5' end and 2,139 3' end) sequence reads, with an average read-length of 532 bp. Clustering and assembly of ESTs resulted in 2,731 unique sequences, consisting of 2,234 singletons and 497 contigs. Approximately two-thirds (63%) of unique sequences were functionally annotated. Genes involved in 36 molecular functions and 90 biological processes were discovered, including 24 putative transcription factors and 232 genes involved in photosynthesis. Most abundantly expressed transcripts were associated with photosynthesis, growth factors, stress and disease response, and transcription factors. A total of 216 full-length genes were identified. About 18% (493) of the transcripts were novel, representing an important addition to the Genbank EST database (dbEST). Fifty-seven di-, tri-, tetra- and penta-nucleotide simple sequence repeats were identified. Conclusions We have developed the first high quality EST resource for black spruce and identified 493 novel transcripts, which may be species-specific related to life history and ecological traits. We have also identified full-length genes and microsatellite-containing ESTs. Based on EST sequence similarities, black spruce showed close evolutionary relationships with congeneric Picea glauca and Picea sitchensis compared to other Pinaceae members and angiosperms. The EST sequences reported here provide an important resource for genome annotation, functional and comparative genomics, molecular breeding, conservation and management studies and applications in black spruce and related conifer species.
Collapse
Affiliation(s)
- Ishminder K Mann
- Forest Genetics and Biotechnology Group, Department of Biology, Life Sciences Centre, Dalhousie University, 1355 Oxford Street, Halifax, NS B3H 4J1, Canada.
| | | | | |
Collapse
|
17
|
de Vega-Bartol JJ, Simões M, Lorenz WW, Rodrigues AS, Alba R, Dean JFD, Miguel CM. Transcriptomic analysis highlights epigenetic and transcriptional regulation during zygotic embryo development of Pinus pinaster. BMC PLANT BIOLOGY 2013; 13:123. [PMID: 23987738 PMCID: PMC3844413 DOI: 10.1186/1471-2229-13-123] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2013] [Accepted: 08/24/2013] [Indexed: 05/18/2023]
Abstract
BACKGROUND It is during embryogenesis that the plant body plan is established and the meristems responsible for all post-embryonic growth are specified. The molecular mechanisms governing conifer embryogenesis are still largely unknown. Their elucidation may contribute valuable information to clarify if the distinct features of embryo development in angiosperms and gymnosperms result from differential gene regulation. To address this issue, we have performed the first transcriptomic analysis of zygotic embryo development in a conifer species (Pinus pinaster) focusing our study in particular on regulatory genes playing important roles during plant embryo development, namely epigenetic regulators and transcription factors. RESULTS Microarray analysis of P. pinaster zygotic embryogenesis was performed at five periods of embryo development from early developing to mature embryos. Our results show that most changes in transcript levels occurred in the first and the last embryo stage-to-stage transitions, namely early to pre-cotyledonary embryo and cotyledonary to mature embryo. An analysis of functional categories for genes that were differentially expressed through embryogenesis highlighted several epigenetic regulation mechanisms. While putative orthologs of transcripts associated with mechanisms that target transposable elements and repetitive sequences were strongly expressed in early embryogenesis, PRC2-mediated repression of genes seemed more relevant during late embryogenesis. On the other hand, functions related to sRNA pathways appeared differentially regulated across all stages of embryo development with a prevalence of miRNA functions in mid to late embryogenesis. Identification of putative transcription factor genes differentially regulated between consecutive embryo stages was strongly suggestive of the relevance of auxin responses and regulation of auxin carriers during early embryogenesis. Such responses could be involved in establishing embryo patterning. Later in development, transcripts with homology to genes acting on modulation of auxin flow and determination of adaxial-abaxial polarity were up-regulated, as were putative orthologs of genes required for meristem formation and function as well as establishment of organ boundaries. Comparative analysis with A. thaliana embryogenesis also highlighted genes involved in auxin-mediated responses, as well as epigenetic regulation, indicating highly correlated transcript profiles between the two species. CONCLUSIONS This is the first report of a time-course transcriptomic analysis of zygotic embryogenesis in a conifer. Taken together our results show that epigenetic regulation and transcriptional control related to auxin transport and response are critical during early to mid stages of pine embryogenesis and that important events during embryogenesis seem to be coordinated by putative orthologs of major developmental regulators in angiosperms.
Collapse
Affiliation(s)
- José J de Vega-Bartol
- iBET - Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2780-901 Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - Marta Simões
- iBET - Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2780-901 Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - W Walter Lorenz
- Warnell School of Forestry and Natural Resources, The University of Georgia, Athens, GA 30602, USA
| | - Andreia S Rodrigues
- iBET - Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2780-901 Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - Rob Alba
- Monsanto Company, Mailstop CC4, 700 Chesterfield Parkway West, Chesterfield, MO 63017, USA
| | - Jeffrey F D Dean
- Warnell School of Forestry and Natural Resources, The University of Georgia, Athens, GA 30602, USA
| | - Célia M Miguel
- iBET - Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2780-901 Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| |
Collapse
|
18
|
Neale DB, Langley CH, Salzberg SL, Wegrzyn JL. Open access to tree genomes: the path to a better forest. Genome Biol 2013; 14:120. [PMID: 23796049 PMCID: PMC3706761 DOI: 10.1186/gb-2013-14-6-120] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
An open-access culture and a well-developed comparative-genomics infrastructure must be developed in forest trees to derive the full potential of genome sequencing in this diverse group of plants that are the dominant species in much of the earth's terrestrial ecosystems.
Collapse
|
19
|
Craven-Bartle B, Pascual MB, Cánovas FM, Avila C. A Myb transcription factor regulates genes of the phenylalanine pathway in maritime pine. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 74:755-66. [PMID: 23451763 DOI: 10.1111/tpj.12158] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2012] [Revised: 02/19/2013] [Accepted: 02/25/2013] [Indexed: 05/22/2023]
Abstract
During the life cycles of conifer trees, such as maritime pine (Pinus pinaster Ait.), large quantities of carbon skeletons are irreversibly immobilized in the wood. In energetic terms this is an expensive process, in which carbon from photosynthesis is channelled through the shikimate pathway for the biosynthesis of phenylpropanoids. This crucial metabolic pathway is finely regulated, primarily through transcriptional control, and because phenylalanine is the precursor for phenylpropanoid biosynthesis, the precise regulation of phenylalanine synthesis and use should occur simultaneously. The promoters of three genes encoding the enzymes prephenate aminotransferase (PAT), phenylalanine ammonia lyase (PAL) and glutamine synthetase (GS1b) contain AC elements involved in the transcriptional activation mediated by R2R3-Myb factors. We have examined the capacity of the R2R3-Myb transcription factors Myb1, Myb4 and Myb8 to co-regulate the expression of PAT, PAL and GS1b. Only Myb8 was able to activate the transcription of the three genes. Moreover, the expression of this transcription factor is higher in lignified tissues, in which a high demand for phenylpropanoids exits. In a gain-of-function experiment, we have shown that Myb8 can specifically bind a well-conserved eight-nucleotide-long AC-II element in the promoter regions of PAT, PAL and GS1b, thereby activating their expression. Our results show that Myb8 regulates the expression of these genes involved in phenylalanine metabolism, which is required for channelling photosynthetic carbon to promote wood formation. The co-localization of PAT, PAL, GS1b and MYB8 transcripts in vascular cells further supports this conclusion.
Collapse
Affiliation(s)
- Blanca Craven-Bartle
- Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Campus Universitario de Teatinos, Universidad de Málaga, 29071 Málaga, Spain
| | | | | | | |
Collapse
|
20
|
Niu SH, Li ZX, Yuan HW, Chen XY, Li Y, Li W. Transcriptome characterisation of Pinus tabuliformis and evolution of genes in the Pinus phylogeny. BMC Genomics 2013; 14:263. [PMID: 23597112 PMCID: PMC3640921 DOI: 10.1186/1471-2164-14-263] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2012] [Accepted: 04/15/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The Chinese pine (Pinus tabuliformis) is an indigenous conifer species in northern China but is relatively underdeveloped as a genomic resource; thus, limiting gene discovery and breeding. Large-scale transcriptome data were obtained using a next-generation sequencing platform to compensate for the lack of P. tabuliformis genomic information. RESULTS The increasing amount of transcriptome data on Pinus provides an excellent resource for multi-gene phylogenetic analysis and studies on how conserved genes and functions are maintained in the face of species divergence. The first P. tabuliformis transcriptome from a normalised cDNA library of multiple tissues and individuals was sequenced in a full 454 GS-FLX run, producing 911,302 sequencing reads. The high quality overlapping expressed sequence tags (ESTs) were assembled into 46,584 putative transcripts, and more than 700 SSRs and 92,000 SNPs/InDels were characterised. Comparative analysis of the transcriptome of six conifer species yielded 191 orthologues, from which we inferred a phylogenetic tree, evolutionary patterns and calculated rates of gene diversion. We also identified 938 fast evolving sequences that may be useful for identifying genes that perhaps evolved in response to positive selection and might be responsible for speciation in the Pinus lineage. CONCLUSIONS A large collection of high-quality ESTs was obtained, de novo assembled and characterised, which represents a dramatic expansion of the current transcript catalogues of P. tabuliformis and which will gradually be applied in breeding programs of P. tabuliformis. Furthermore, these data will facilitate future studies of the comparative genomics of P. tabuliformis and other related species.
Collapse
Affiliation(s)
- Shi-Hui Niu
- National Engineering Laboratory for Forest Tree Breeding, College of Biological Science and Technology, Beijing Forestry University, Beijing 100083, People's Republic of China
| | | | | | | | | | | |
Collapse
|
21
|
Chancerel E, Lamy JB, Lesur I, Noirot C, Klopp C, Ehrenmann F, Boury C, Provost GL, Label P, Lalanne C, Léger V, Salin F, Gion JM, Plomion C. High-density linkage mapping in a pine tree reveals a genomic region associated with inbreeding depression and provides clues to the extent and distribution of meiotic recombination. BMC Biol 2013; 11:50. [PMID: 23597128 PMCID: PMC3660193 DOI: 10.1186/1741-7007-11-50] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 04/16/2013] [Indexed: 09/03/2023] Open
Abstract
BACKGROUND The availability of a large expressed sequence tags (EST) resource and recent advances in high-throughput genotyping technology have made it possible to develop highly multiplexed SNP arrays for multi-objective genetic applications, including the construction of meiotic maps. Such approaches are particularly useful in species with a large genome size, precluding the use of whole-genome shotgun assembly with current technologies. RESULTS In this study, a 12 k-SNP genotyping array was developed for maritime pine from an extensive EST resource assembled into a unigene set. The offspring of three-generation outbred and inbred mapping pedigrees were then genotyped. The inbred pedigree consisted of a classical F2 population resulting from the selfing of a single inter-provenance (Landes x Corsica) hybrid tree, whereas the outbred pedigree (G2) resulted from a controlled cross of two intra-provenance (Landes x Landes) hybrid trees. This resulted in the generation of three linkage maps based on SNP markers: one from the parental genotype of the F2 population (1,131 markers in 1,708 centimorgan (cM)), and one for each parent of the G2 population (1,015 and 1,110 markers in 1,447 and 1,425 cM for the female and male parents, respectively). A comparison of segregation patterns in the progeny obtained from the two types of mating (inbreeding and outbreeding) led to the identification of a chromosomal region carrying an embryo viability locus with a semi-lethal allele. Following selfing and segregation, zygote mortality resulted in a deficit of Corsican homozygous genotypes in the F2 population. This dataset was also used to study the extent and distribution of meiotic recombination along the length of the chromosomes and the effect of sex and/or genetic background on recombination. The genetic background of trees in which meiotic recombination occurred was found to have a significant effect on the frequency of recombination. Furthermore, only a small proportion of the recombination hot- and cold-spots were common to all three genotypes, suggesting that the spatial pattern of recombination was genetically variable. CONCLUSION This study led to the development of classical genomic tools for this ecologically and economically important species. It also identified a chromosomal region bearing a semi-lethal recessive allele and demonstrated the genetic variability of recombination rate over the genome.
Collapse
|
22
|
Howe GT, Yu J, Knaus B, Cronn R, Kolpak S, Dolan P, Lorenz WW, Dean JFD. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation. BMC Genomics 2013; 14:137. [PMID: 23445355 PMCID: PMC3673906 DOI: 10.1186/1471-2164-14-137] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2012] [Accepted: 01/31/2013] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. RESULTS We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. CONCLUSIONS Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.
Collapse
Affiliation(s)
- Glenn T Howe
- Department of Forest Ecosystems and Society, Oregon State University, Corvallis, Oregon, 97331, USA
| | - Jianbin Yu
- Department of Forest Ecosystems and Society, Oregon State University, Corvallis, Oregon, 97331, USA
- Current address, DuPont Pioneer International, Willmar, Minnesota, 56201, USA
| | - Brian Knaus
- Pacific Northwest Research Station, USDA Forest Service, Corvallis, Oregon, 97331, USA
| | - Richard Cronn
- Pacific Northwest Research Station, USDA Forest Service, Corvallis, Oregon, 97331, USA
| | - Scott Kolpak
- Department of Forest Ecosystems and Society, Oregon State University, Corvallis, Oregon, 97331, USA
| | - Peter Dolan
- Department of Mathematics, University of Minnesota, Morris, MN, USA
| | - W Walter Lorenz
- Warnell School of Forestry and Natural Resources, University of Georgia, Athens, Georgia, 30602, USA
| | - Jeffrey FD Dean
- Warnell School of Forestry and Natural Resources, University of Georgia, Athens, Georgia, 30602, USA
| |
Collapse
|
23
|
Howe GT, Yu J, Knaus B, Cronn R, Kolpak S, Dolan P, Lorenz WW, Dean JFD. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation. BMC Genomics 2013. [PMID: 23445355 DOI: 10.1186/1471‐2164‐14‐137] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. RESULTS We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. CONCLUSIONS Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.
Collapse
Affiliation(s)
- Glenn T Howe
- Department of Forest Ecosystems and Society, Oregon State University, Corvallis, Oregon 97331, USA.
| | | | | | | | | | | | | | | |
Collapse
|
24
|
Mackay J, Dean JFD, Plomion C, Peterson DG, Cánovas FM, Pavy N, Ingvarsson PK, Savolainen O, Guevara MÁ, Fluch S, Vinceti B, Abarca D, Díaz-Sala C, Cervera MT. Towards decoding the conifer giga-genome. PLANT MOLECULAR BIOLOGY 2012; 80:555-69. [PMID: 22960864 DOI: 10.1007/s11103-012-9961-7] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2012] [Accepted: 08/24/2012] [Indexed: 05/21/2023]
Abstract
Several new initiatives have been launched recently to sequence conifer genomes including pines, spruces and Douglas-fir. Owing to the very large genome sizes ranging from 18 to 35 gigabases, sequencing even a single conifer genome had been considered unattainable until the recent throughput increases and cost reductions afforded by next generation sequencers. The purpose of this review is to describe the context for these new initiatives. A knowledge foundation has been acquired in several conifers of commercial and ecological interest through large-scale cDNA analyses, construction of genetic maps and gene mapping studies aiming to link phenotype and genotype. Exploratory sequencing in pines and spruces have pointed out some of the unique properties of these giga-genomes and suggested strategies that may be needed to extract value from their sequencing. The hope is that recent and pending developments in sequencing technology will contribute to rapidly filling the knowledge vacuum surrounding their structure, contents and evolution. Researchers are also making plans to use comparative analyses that will help to turn the data into a valuable resource for enhancing and protecting the world's conifer forests.
Collapse
Affiliation(s)
- John Mackay
- Center for Forest Research, Institute for Integrative and Systems Biology, Université Laval, Québec, Québec G1V 0A6, Canada
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Santos CS, Pinheiro M, Silva AI, Egas C, Vasconcelos MW. Searching for resistance genes to Bursaphelenchus xylophilus using high throughput screening. BMC Genomics 2012; 13:599. [PMID: 23134679 PMCID: PMC3542250 DOI: 10.1186/1471-2164-13-599] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2012] [Accepted: 10/30/2012] [Indexed: 11/01/2022] Open
Abstract
BACKGROUND Pine wilt disease (PWD), caused by the pinewood nematode (PWN; Bursaphelenchus xylophilus), damages and kills pine trees and is causing serious economic damage worldwide. Although the ecological mechanism of infestation is well described, the plant's molecular response to the pathogen is not well known. This is due mainly to the lack of genomic information and the complexity of the disease. High throughput sequencing is now an efficient approach for detecting the expression of genes in non-model organisms, thus providing valuable information in spite of the lack of the genome sequence. In an attempt to unravel genes potentially involved in the pine defense against the pathogen, we hereby report the high throughput comparative sequence analysis of infested and non-infested stems of Pinus pinaster (very susceptible to PWN) and Pinus pinea (less susceptible to PWN). RESULTS Four cDNA libraries from infested and non-infested stems of P. pinaster and P. pinea were sequenced in a full 454 GS FLX run, producing a total of 2,083,698 reads. The putative amino acid sequences encoded by the assembled transcripts were annotated according to Gene Ontology, to assign Pinus contigs into Biological Processes, Cellular Components and Molecular Functions categories. Most of the annotated transcripts corresponded to Picea genes-25.4-39.7%, whereas a smaller percentage, matched Pinus genes, 1.8-12.8%, probably a consequence of more public genomic information available for Picea than for Pinus. The comparative transcriptome analysis showed that when P. pinaster was infested with PWN, the genes malate dehydrogenase, ABA, water deficit stress related genes and PAR1 were highly expressed, while in PWN-infested P. pinea, the highly expressed genes were ricin B-related lectin, and genes belonging to the SNARE and high mobility group families. Quantitative PCR experiments confirmed the differential gene expression between the two pine species. CONCLUSIONS Defense-related genes triggered by nematode infestation were detected in both P. pinaster and P. pinea transcriptomes utilizing 454 pyrosequencing technology. P. pinaster showed higher abundance of genes related to transcriptional regulation, terpenoid secondary metabolism (including some with nematicidal activity) and pathogen attack. P. pinea showed higher abundance of genes related to oxidative stress and higher levels of expression in general of stress responsive genes. This study provides essential information about the molecular defense mechanisms utilized by P. pinaster and P. pinea against PWN infestation and contributes to a better understanding of PWD.
Collapse
Affiliation(s)
- Carla S Santos
- CBQF – Centro de Biotecnologia e Química Fina, Escola Superior de Biotecnologia, Centro Regional do Porto da Universidade Católica Portuguesa, Rua Dr. António Bernardino Almeida, Porto, 4200-072, Portugal
| | - Miguel Pinheiro
- Bioinformatics Unit, Biocant, Parque Tecnológico de Cantanhede, Núcleo 04, Lote 03, Cantanhede, 3060-197, Portugal
| | - Ana I Silva
- CBQF – Centro de Biotecnologia e Química Fina, Escola Superior de Biotecnologia, Centro Regional do Porto da Universidade Católica Portuguesa, Rua Dr. António Bernardino Almeida, Porto, 4200-072, Portugal
| | - Conceição Egas
- Advanced Services Unit, Biocant, Parque Tecnológico de Cantanhede, Núcleo 04, Lote 03, Cantanhede, 3060-197, Portugal
| | - Marta W Vasconcelos
- CBQF – Centro de Biotecnologia e Química Fina, Escola Superior de Biotecnologia, Centro Regional do Porto da Universidade Católica Portuguesa, Rua Dr. António Bernardino Almeida, Porto, 4200-072, Portugal
| |
Collapse
|
26
|
Why assembling plant genome sequences is so challenging. BIOLOGY 2012; 1:439-59. [PMID: 24832233 PMCID: PMC4009782 DOI: 10.3390/biology1020439] [Citation(s) in RCA: 81] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 07/16/2012] [Revised: 09/05/2012] [Accepted: 09/06/2012] [Indexed: 12/16/2022]
Abstract
In spite of the biological and economic importance of plants, relatively few plant species have been sequenced. Only the genome sequence of plants with relatively small genomes, most of them angiosperms, in particular eudicots, has been determined. The arrival of next-generation sequencing technologies has allowed the rapid and efficient development of new genomic resources for non-model or orphan plant species. But the sequencing pace of plants is far from that of animals and microorganisms. This review focuses on the typical challenges of plant genomes that can explain why plant genomics is less developed than animal genomics. Explanations about the impact of some confounding factors emerging from the nature of plant genomes are given. As a result of these challenges and confounding factors, the correct assembly and annotation of plant genomes is hindered, genome drafts are produced, and advances in plant genomics are delayed.
Collapse
|
27
|
Zhang Y, Zhang S, Han S, Li X, Qi L. Transcriptome profiling and in silico analysis of somatic embryos in Japanese larch (Larix leptolepis). PLANT CELL REPORTS 2012; 31:1637-57. [PMID: 22622308 DOI: 10.1007/s00299-012-1277-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2012] [Revised: 04/19/2012] [Accepted: 04/20/2012] [Indexed: 05/13/2023]
Abstract
UNLABELLED Japanese larch (Larix leptolepis) is an ecologically and economically important species mainly grown in northeastern China, Japan and Europe. However, erratic flowering and poor germplasm resources caused by high embryo abortion rates have hampered breeding of Larix species. Somatic embryogenesis (SE) is an effective tool for the production of L. leptolepis with desirable characteristics, such as expression of totipotency, preparation of synthetic seeds, and genetic transformation. However, public genomic resources for this species are limited. We sequenced 591,759 raw expressed sequence tags (ESTs) from a 454 sequencing cDNA library of L. leptolepis somatic embryos, resulting in 572,403 high-quality reads. These reads were assembled into 70,927 unique sequences (UniGenes), including 32,321 contigs and 38,606 singletons. After removal of low-quality sequences, 65,115 UniGenes were annotated using the UniProtKB program. Based on their sequence similarity with known proteins, the matched 30,372 sequences from 664 species were estimated to represent approximately 19,000 unique genes. Gene ontology analysis revealed 21,324 UniGenes assigned to 51 categories. By Kyoto Encyclopedia of Genes and Genomes mapping, 25,773 transcripts were associated with 160 biochemical pathways. Further analysis screened four signal transduction pathways represented by 337 enzymes and 17 secondary metabolites. In silico analysis reveals that 207 UniESTs in Larix are homologous to MAPKs genes identified from other model plants, which may be involved in regulating SE development. This study provides an initial insight into the Larix transcriptomes of the pro-embryogenic mass and is a sound basis for future studies. KEY MESSAGE We constructed a large, full-length 454 sequencing cDNA library of Larix leptolepis during somatic embryogenesis. More than 590,000 sequences were obtained and a deep-coverage EST database was constructed.
Collapse
Affiliation(s)
- Yuan Zhang
- Laboratory of Cell Biology, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | | | | | | | | |
Collapse
|
28
|
Canales J, Rueda-López M, Craven-Bartle B, Avila C, Cánovas FM. Novel insights into regulation of asparagine synthetase in conifers. FRONTIERS IN PLANT SCIENCE 2012; 3:100. [PMID: 22654888 PMCID: PMC3359511 DOI: 10.3389/fpls.2012.00100] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2012] [Accepted: 04/27/2012] [Indexed: 05/18/2023]
Abstract
Asparagine, a key amino acid for nitrogen storage and transport in plants, is synthesized via the ATP-dependent reaction catalyzed by the enzyme asparagine synthetase (AS; EC 6.3.5.4). In this work, we present the molecular analysis of two full-length cDNAs that encode asparagine synthetase in maritime pine (Pinus pinaster Ait.), PpAS1, and PpAS2. Phylogenetic analyses of the deduced amino acid sequences revealed that both genes are class II AS, suggesting an ancient origin of these genes in plants. A comparative study of PpAS1 and PpAS2 gene expression profiles showed that PpAS1 gene is highly regulated by developmental and environmental factors, while PpAS2 is expressed constitutively. To determine the molecular mechanisms underpinning the differential expression of PpAS1, the promoter region of the gene was isolated and putative binding sites for MYB transcription factors were identified. Gel mobility shift assays showed that a MYB protein from Pinus taeda (PtMYB1) was able to interact with the promoter region of PpAS1. Furthermore, transient expression analyses in pine cells revealed a negative effect of PtMYB1 on PpAS1 expression. The potential role of MYB factors in the transcriptional regulation of PpAS1 in vascular cells is discussed.
Collapse
Affiliation(s)
- Javier Canales
- Departamento de Biología Molecular y Bioquímica, Instituto Andaluz de Biotecnología, Universidad de MálagaMálaga, Spain
| | - Marina Rueda-López
- Departamento de Biología Molecular y Bioquímica, Instituto Andaluz de Biotecnología, Universidad de MálagaMálaga, Spain
| | - Blanca Craven-Bartle
- Departamento de Biología Molecular y Bioquímica, Instituto Andaluz de Biotecnología, Universidad de MálagaMálaga, Spain
| | - Concepción Avila
- Departamento de Biología Molecular y Bioquímica, Instituto Andaluz de Biotecnología, Universidad de MálagaMálaga, Spain
| | - Francisco M. Cánovas
- Departamento de Biología Molecular y Bioquímica, Instituto Andaluz de Biotecnología, Universidad de MálagaMálaga, Spain
| |
Collapse
|
29
|
Perdiguero P, Collada C, Barbero MDC, García Casado G, Cervera MT, Soto A. Identification of water stress genes in Pinus pinaster Ait. by controlled progressive stress and suppression-subtractive hybridization. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2012; 50:44-53. [PMID: 22099518 DOI: 10.1016/j.plaphy.2011.09.022] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2011] [Accepted: 09/30/2011] [Indexed: 05/04/2023]
Abstract
Climate change is a major challenge particularly for forest tree species, which will have to face the severe alterations of environmental conditions with their current genetic pool. Thus, an understanding of their adaptive responses is of the utmost interest. In this work we have selected Pinus pinaster as a model species. This pine is one of the most important conifers (for which molecular tools and knowledge are far more scarce than for angiosperms) in the Mediterranean Basin, which is characterised in all foreseen scenarios as one of the regions most drastically affected by climate change, mainly because of increasing temperature and, particularly, by increasing drought. We have induced a controlled, increasing water stress by adding PEG to a hydroponic culture. We have generated a subtractive library, with the aim of identifying the genes induced by this stress and have searched for the most reliable expressional candidate genes, based on their overexpression during water stress, as revealed by microarray analysis and confirmed by RT-PCR. We have selected a set of 67 candidate genes belonging to different functional groups that will be useful molecular tools for further studies on drought stress responses, adaptation, and population genomics in conifers, as well as in breeding programs.
Collapse
Affiliation(s)
- Pedro Perdiguero
- GENFOR Grupo de investigación en Genética y Fisiología Forestal, Universidad Politécnica de Madrid, E-28040 Madrid, Spain
| | | | | | | | | | | |
Collapse
|