1
|
Ellison EL, Zhou P, Hermanson P, Chu YH, Read A, Hirsch CN, Grotewold E, Springer NM. Mutator transposon insertions within maize genes often provide a novel outward reading promoter. Genetics 2023; 225:iyad171. [PMID: 37815810 DOI: 10.1093/genetics/iyad171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 09/04/2023] [Indexed: 10/11/2023] Open
Abstract
The highly active family of Mutator (Mu) DNA transposons has been widely used for forward and reverse genetics in maize. There are examples of Mu-suppressible alleles that result in conditional phenotypic effects based on the activity of Mu. Phenotypes from these Mu-suppressible mutations are observed in Mu-active genetic backgrounds, but absent when Mu activity is lost. For some Mu-suppressible alleles, phenotypic suppression likely results from an outward-reading promoter within Mu that is only active when the autonomous Mu element is silenced or lost. We isolated 35 Mu alleles from the UniformMu population that represent insertions in 24 different genes. Most of these mutant alleles are due to insertions within gene coding sequences, but several 5' UTR and intron insertions were included. RNA-seq and de novo transcript assembly were utilized to document the transcripts produced from 33 of these Mu insertion alleles. For 20 of the 33 alleles, there was evidence of transcripts initiating within the Mu sequence reading through the gene. This outward-reading promoter activity was detected in multiple types of Mu elements and does not depend on the orientation of Mu. Expression analyses of Mu-initiated transcripts revealed the Mu promoter often provides gene expression levels and patterns that are similar to the wild-type gene. These results suggest the Mu promoter may represent a minimal promoter that can respond to gene cis-regulatory elements. Findings from this study have implications for maize researchers using the UniformMu population, and more broadly highlight a strategy for transposons to co-exist with their host.
Collapse
Affiliation(s)
- Erika L Ellison
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108, USA
| | - Peng Zhou
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108, USA
| | - Peter Hermanson
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108, USA
| | - Yi-Hsuan Chu
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Andrew Read
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108, USA
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN 55108, USA
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN 55108, USA
| | - Erich Grotewold
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Nathan M Springer
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108, USA
| |
Collapse
|
2
|
Khuman A, Kumar V, Chaudhary B. Evolutionary expansion and expression dynamics of cytokinin-catabolizing CKX gene family in the modern amphidiploid mustard ( Brassica sp.). 3 Biotech 2022; 12:233. [PMID: 35996674 PMCID: PMC9391556 DOI: 10.1007/s13205-022-03294-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Accepted: 08/02/2022] [Indexed: 11/01/2022] Open
Abstract
Plant cytokinins (CKs) promote development and physiological processes, drought tolerance, root architecture, and ultimately crop productivity. Biologically active CKs (iP, tZ, and cZ) are precisely maintained in the vegetative and floral tissues through their irreversible degradation by developmentally regulated CK-catabolizing cytokinin oxidase/dehydrogenase (CKX) enzyme. A meta-analysis of CKX proteins was performed through an exhaustive exploration of multiple genome databases of cyanobacteria, bryophyte, monocot and eudicot plants to reveal the intricate evolutionary profiles of CKX enzymes specific to the family Brassicaceae. At least 175 unique paralogous/orthologous CKX sequences were successfully retrieved and phylogenetically clustered into distinct groups. Observations of structural divergences among paralogous sequences compared to their orthologs indicated that the progenitor CKX sequence had been subjected to massive structural modifications, possibly as a result of the evolutionary split between monocots and eudicots. An analysis of dN/dS comparisons of orthologous genes revealed that segmental CKX gene duplications have evolved primarily under purifying selection. Further, 24 CKX genes with conserved signature domain were identified in the amphidiploid Brassica juncea genome (AABB; 2n = 36). Genetic evolution of paralogous and orthologous genes was largely responsible for the expansion of CKX homoeologs in the amphidiploid Brassica genomes. Also, comparative analyses of 1.5 kb-long upstream regulatory regions of BjCKX genes identified various development- and stress-responsive elements. Spatial and temporal expression profiles of CKX genes were primarily attributed to their structural diversity observed in the 5'-regulatory regions along with species evolution. This data suggested that CKX duplicate genes had partitioned their spatial expression (= function) during evolution. These findings illustrated the evolutionary importance of CKX genes during plant development, and also suggested their deployment for future crop improvement programs. Supplementary Information The online version contains supplementary material available at 10.1007/s13205-022-03294-0.
Collapse
Affiliation(s)
| | - Vijay Kumar
- Department of Botany, Shivaji College, University of Delhi, New Delhi, 110027 India
| | - Bhupendra Chaudhary
- School of Biotechnology, Gautam Buddha University, Greater Noida, 201312 India
| |
Collapse
|
3
|
Liang Z, Anderson SN, Noshay JM, Crisp PA, Enders TA, Springer NM. Genetic and epigenetic variation in transposable element expression responses to abiotic stress in maize. PLANT PHYSIOLOGY 2021; 186:420-433. [PMID: 33591319 PMCID: PMC8154091 DOI: 10.1093/plphys/kiab073] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 02/05/2021] [Indexed: 06/12/2023]
Abstract
Transposable elements (TEs) pervade most eukaryotic genomes. The repetitive nature of TEs complicates the analysis of their expression. Evaluation of the expression of both TE families (using unique and multi-mapping reads) and specific elements (using uniquely mapping reads) in leaf tissue of three maize (Zea mays) inbred lines subjected to heat or cold stress reveals no evidence for genome-wide activation of TEs; however, some specific TE families generate transcripts only in stress conditions. There is substantial variation for which TE families exhibit stress-responsive expression in the different genotypes. In order to understand the factors that drive expression of TEs, we focused on a subset of families in which we could monitor expression of individual elements. The stress-responsive activation of a TE family can often be attributed to a small number of elements in the family that contains regions lacking DNA methylation. Comparisons of the expression of TEs in different genotypes revealed both genetic and epigenetic variation. Many of the specific TEs that are activated in stress in one inbred are not present in the other inbred, explaining the lack of activation. Among the elements that are shared in both genomes but only expressed in one genotype, we found that many exhibit differences in DNA methylation such that the genotype without expression is fully methylated. This study provides insights into the regulation of expression of TEs in normal and stress conditions and highlights the role of chromatin variation between elements in a family or between genotypes for contributing to expression variation. The highly repetitive nature of many TEs complicates the analysis of their expression. Although most TEs are not expressed, some exhibits expression in certain tissues or conditions. We monitored the expression of both TE families (using unique and multi-mapping reads) and specific elements (using uniquely mapping reads) in leaf tissue of three maize (Zea mays) inbred lines subjected to heat or cold stress. While genome-wide activation of TEs did not occur, some TE families generated transcripts only in stress conditions with variation by genotype. To better understand the factors that drive expression of TEs, we focused on a subset of families in which we could monitor expression of individual elements. In most cases, stress-responsive activation of a TE family was attributed to a small number of elements in the family. The elements that contained small regions lacking DNA methylation regions showed enriched expression while fully methylated elements were rarely expressed in control or stress conditions. The cause of varied expression in the different genotypes was due to both genetic and epigenetic variation. Many specific TEs activated by stress in one inbred were not present in the other inbred. Among the elements shared in both genomes, full methylation inhibited expression in one of the genotypes. This study provides insights into the regulation of TE expression in normal and stress conditions and highlights the role of chromatin variation between elements in a family or between genotypes for contributing to expression.
Collapse
Affiliation(s)
- Zhikai Liang
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota, USA
| | - Sarah N Anderson
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, Iowa, USA
| | - Jaclyn M Noshay
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota, USA
| | - Peter A Crisp
- School of Agriculture and Food Sciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Tara A Enders
- Department of Biology, Hofstra University, Hempstead, New York, USA
| | - Nathan M Springer
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota, USA
| |
Collapse
|
4
|
Purugganan MD, Jackson SA. Advancing crop genomics from lab to field. Nat Genet 2021; 53:595-601. [PMID: 33958781 DOI: 10.1038/s41588-021-00866-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Accepted: 03/22/2021] [Indexed: 01/23/2023]
Abstract
Crop genomics remains a key element in ensuring scientific progress to secure global food security. It has been two decades since the sequence of the first plant genome, that of Arabidopsis thaliana, was released, and soon after that the draft sequencing of the rice genome was completed. Since then, the genomes of more than 100 crops have been sequenced, plant genome research has expanded across multiple fronts and the next few years promise to bring further advances spurred by the advent of new technologies and approaches. We are likely to see continued innovations in crop genome sequencing, genetic mapping and the acquisition of multiple levels of biological data. There will be exciting opportunities to integrate genome-scale information across multiple scales of biological organization, leading to advances in our mechanistic understanding of crop biological processes, which will, in turn, provide greater impetus for translation of laboratory results to the field.
Collapse
Affiliation(s)
- Michael D Purugganan
- Center for Genomics and Systems Biology, New York University, New York, NY, USA. .,Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates.
| | | |
Collapse
|
5
|
Anderson SN, Stitzer MC, Zhou P, Ross-Ibarra J, Hirsch CD, Springer NM. Dynamic Patterns of Transcript Abundance of Transposable Element Families in Maize. G3 (BETHESDA, MD.) 2019; 9:3673-3682. [PMID: 31506319 PMCID: PMC6829137 DOI: 10.1534/g3.119.400431] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 09/08/2019] [Indexed: 12/21/2022]
Abstract
Transposable Elements (TEs) are mobile elements that contribute the majority of DNA sequences in the maize genome. Due to their repetitive nature, genomic studies of TEs are complicated by the difficulty of properly attributing multi-mapped short reads to specific genomic loci. Here, we utilize a method to attribute RNA-seq reads to TE families rather than particular loci in order to characterize transcript abundance for TE families in the maize genome. We applied this method to assess per-family expression of transposable elements in >800 published RNA-seq libraries representing a range of maize development, genotypes, and hybrids. While a relatively small proportion of TE families are transcribed, expression is highly dynamic with most families exhibiting tissue-specific expression. A large number of TE families were specifically detected in pollen and endosperm, consistent with reproductive dynamics that maintain silencing of TEs in the germ line. We find that B73 transcript abundance is a poor predictor of TE expression in other genotypes and that transcript levels can differ even for shared TEs. Finally, by assessing recombinant inbred line and hybrid transcriptomes, complex patterns of TE transcript abundance across genotypes emerged. Taken together, this study reveals a dynamic contribution of TEs to maize transcriptomes.
Collapse
Affiliation(s)
| | - Michelle C Stitzer
- Department of Evolution and Ecology and Center for Population Biology and
| | - Peng Zhou
- Department of Plant and Microbial Biology and
| | - Jeffrey Ross-Ibarra
- Department of Evolution and Ecology and Center for Population Biology and
- Genome Center, University of California, Davis, California 95616
| | - Cory D Hirsch
- Department of Plant Pathology, University of Minnesota, St. Paul, Minnesota 55108, and
| | | |
Collapse
|
6
|
Noshay JM, Anderson SN, Zhou P, Ji L, Ricci W, Lu Z, Stitzer MC, Crisp PA, Hirsch CN, Zhang X, Schmitz RJ, Springer NM. Monitoring the interplay between transposable element families and DNA methylation in maize. PLoS Genet 2019; 15:e1008291. [PMID: 31498837 PMCID: PMC6752859 DOI: 10.1371/journal.pgen.1008291] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2019] [Revised: 09/19/2019] [Accepted: 07/05/2019] [Indexed: 12/15/2022] Open
Abstract
DNA methylation and epigenetic silencing play important roles in the regulation of transposable elements (TEs) in many eukaryotic genomes. A majority of the maize genome is derived from TEs that can be classified into different orders and families based on their mechanism of transposition and sequence similarity, respectively. TEs themselves are highly methylated and it can be tempting to view them as a single uniform group. However, the analysis of DNA methylation profiles in flanking regions provides evidence for distinct groups of chromatin properties at different TE families. These differences among TE families are reproducible in different tissues and different inbred lines. TE families with varying levels of DNA methylation in flanking regions also show distinct patterns of chromatin accessibility and modifications within the TEs. The differences in the patterns of DNA methylation flanking TE families arise from a combination of non-random insertion preferences of TE families, changes in DNA methylation triggered by the insertion of the TE and subsequent selection pressure. A set of nearly 70,000 TE polymorphisms among four assembled maize genomes were used to monitor the level of DNA methylation at haplotypes with and without the TE insertions. In many cases, TE families with high levels of DNA methylation in flanking sequence are enriched for insertions into highly methylated regions. The majority of the >2,500 TE insertions into unmethylated regions result in changes in DNA methylation in haplotypes with the TE, suggesting the widespread potential for TE insertions to condition altered methylation in conserved regions of the genome. This study highlights the interplay between TEs and the methylome of a major crop species.
Collapse
Affiliation(s)
- Jaclyn M. Noshay
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul MN, United States of America
| | - Sarah N. Anderson
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul MN, United States of America
| | - Peng Zhou
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul MN, United States of America
| | - Lexiang Ji
- Institute of Bioinformatics, University of Georgia, Athens GA, United States of America
| | - William Ricci
- Department of Plant Biology, University of Georgia, Athens GA, United States of America
| | - Zefu Lu
- Department of Genetics, University of Georgia, Athens GA, United States of America
| | - Michelle C. Stitzer
- Department of Plant Sciences, University of California Davis, Davis CA, United States of America
| | - Peter A. Crisp
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul MN, United States of America
| | - Candice N. Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul MN, United States of America
| | - Xiaoyu Zhang
- Department of Plant Biology, University of Georgia, Athens GA, United States of America
| | - Robert J. Schmitz
- Department of Genetics, University of Georgia, Athens GA, United States of America
| | - Nathan M. Springer
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul MN, United States of America
| |
Collapse
|
7
|
Qi M, Li Z, Liu C, Hu W, Ye L, Xie Y, Zhuang Y, Zhao F, Teng W, Zheng Q, Fan Z, Xu L, Lang Z, Tong Y, Zhang Y. CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome. Nucleic Acids Res 2019; 46:e107. [PMID: 29931324 PMCID: PMC6182137 DOI: 10.1093/nar/gky522] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2018] [Accepted: 05/24/2018] [Indexed: 11/24/2022] Open
Abstract
Genetic diversity in plants is remarkably high. Recent whole genome sequencing (WGS) of 67 rice accessions recovered 10,872 novel genes. Comparison of the genetic architecture among divergent populations or between crops and wild relatives is essential for obtaining functional components determining crucial traits. However, many major crops have gigabase-scale genomes, which are not well-suited to WGS. Existing cost-effective sequencing approaches including re-sequencing, exome-sequencing and restriction enzyme-based methods all have difficulty in obtaining long novel genomic sequences from highly divergent population with large genome size. The present study presented a reference-independent core genome targeted sequencing approach, CGT-seq, which employed epigenomic information from both active and repressive epigenetic marks to guide the assembly of the core genome mainly composed of promoter and intragenic regions. This method was relatively easily implemented, and displayed high sensitivity and specificity for capturing the core genome of bread wheat. 95% intragenic and 89% promoter region from wheat were covered by CGT-seq read. We further demonstrated in rice that CGT-seq captured hundreds of novel genes and regulatory sequences from a previously unsequenced ecotype. Together, with specific enrichment and sequencing of regions within and nearby genes, CGT-seq is a time- and resource-effective approach to profiling functionally relevant regions in sequenced and non-sequenced populations with large genomes.
Collapse
Affiliation(s)
- Meifang Qi
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Zijuan Li
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Chunmei Liu
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Wenyan Hu
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Luhuan Ye
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Yilin Xie
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Yili Zhuang
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Fei Zhao
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Wan Teng
- University of the Chinese Academy of Sciences, Beijing 100049, China.,The State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Qi Zheng
- University of the Chinese Academy of Sciences, Beijing 100049, China.,The State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhenjun Fan
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,Henan University, school of life science
| | - Lin Xu
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Zhaobo Lang
- University of the Chinese Academy of Sciences, Beijing 100049, China.,National Key Laboratory of Plant Molecular Genetics, Shanghai Center for Plant Stress Biology and Center of Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai 200032, China
| | - Yiping Tong
- University of the Chinese Academy of Sciences, Beijing 100049, China.,The State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Yijing Zhang
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fengalin Road, Shanghai 200032, China.,University of the Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
8
|
De KK, Pal T, Mondal A, Majumder M, Ghorai A. Extended centromere and chromosomal mosaicism in some varieties of grass pea, Lathyrus sativus L. THE NUCLEUS 2019. [DOI: 10.1007/s13237-018-0245-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022] Open
|
9
|
Boone M, De Koker A, Callewaert N. Capturing the 'ome': the expanding molecular toolbox for RNA and DNA library construction. Nucleic Acids Res 2018; 46:2701-2721. [PMID: 29514322 PMCID: PMC5888575 DOI: 10.1093/nar/gky167] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2017] [Revised: 02/05/2018] [Accepted: 02/23/2018] [Indexed: 12/14/2022] Open
Abstract
All sequencing experiments and most functional genomics screens rely on the generation of libraries to comprehensively capture pools of targeted sequences. In the past decade especially, driven by the progress in the field of massively parallel sequencing, numerous studies have comprehensively assessed the impact of particular manipulations on library complexity and quality, and characterized the activities and specificities of several key enzymes used in library construction. Fortunately, careful protocol design and reagent choice can substantially mitigate many of these biases, and enable reliable representation of sequences in libraries. This review aims to guide the reader through the vast expanse of literature on the subject to promote informed library generation, independent of the application.
Collapse
Affiliation(s)
- Morgane Boone
- Center for Medical Biotechnology, VIB, Zwijnaarde 9052, Belgium
- Department of Biochemistry and Microbiology, Ghent University, Ghent 9000, Belgium
| | - Andries De Koker
- Center for Medical Biotechnology, VIB, Zwijnaarde 9052, Belgium
- Department of Biochemistry and Microbiology, Ghent University, Ghent 9000, Belgium
| | - Nico Callewaert
- Center for Medical Biotechnology, VIB, Zwijnaarde 9052, Belgium
- Department of Biochemistry and Microbiology, Ghent University, Ghent 9000, Belgium
| |
Collapse
|
10
|
Underwood CJ, Henderson IR, Martienssen RA. Genetic and epigenetic variation of transposable elements in Arabidopsis. CURRENT OPINION IN PLANT BIOLOGY 2017; 36:135-141. [PMID: 28343122 PMCID: PMC5746046 DOI: 10.1016/j.pbi.2017.03.002] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Revised: 03/02/2017] [Accepted: 03/06/2017] [Indexed: 05/03/2023]
Abstract
Transposable elements are mobile genetic elements that are prevalent in plant genomes and are silenced by epigenetic modification. Different epigenetic modification pathways play distinct roles in the control of transposable element transcription, replication and recombination. The Arabidopsis genome contains families of all of the major transposable element classes, which are differentially enriched in particular genomic regions. Whole genome sequencing and DNA methylation profiling of hundreds of natural Arabidopsis accessions has revealed that transposable elements exhibit significant intraspecific genetic and epigenetic variation, and that genetic variation often underlies epigenetic variation. Together, epigenetic modification and the forces of selection define the scope within which transposable elements can contribute to, and control, genome evolution.
Collapse
Affiliation(s)
- Charles J Underwood
- Watson School of Biological Sciences, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA; Department of Plant Sciences, Downing Street, University of Cambridge, Cambridge, UK
| | - Ian R Henderson
- Department of Plant Sciences, Downing Street, University of Cambridge, Cambridge, UK
| | - Robert A Martienssen
- Watson School of Biological Sciences, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA; HHMI-GBMF, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| |
Collapse
|
11
|
Sudheesh S, Verma P, Forster JW, Cogan NOI, Kaur S. Generation and Characterisation of a Reference Transcriptome for Lentil (Lens culinaris Medik.). Int J Mol Sci 2016; 17:E1887. [PMID: 27845747 PMCID: PMC5133886 DOI: 10.3390/ijms17111887] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2016] [Revised: 09/19/2016] [Accepted: 10/31/2016] [Indexed: 01/14/2023] Open
Abstract
RNA-Seq using second-generation sequencing technologies permits generation of a reference unigene set for a given species, in the absence of a well-annotated genome sequence, supporting functional genomics studies, gene characterisation and detailed expression analysis for specific morphophysiological or environmental stress response traits. A reference unigene set for lentil has been developed, consisting of 58,986 contigs and scaffolds with an N50 length of 1719 bp. Comparison to gene complements from related species, reference protein databases, previously published lentil transcriptomes and a draft genome sequence validated the current dataset in terms of degree of completeness and utility. A large proportion (98%) of unigenes were expressed in more than one tissue, at varying levels. Candidate genes associated with mechanisms of tolerance to both boron toxicity and time of flowering were identified, which can eventually be used for the development of gene-based markers. This study has provided a comprehensive, assembled and annotated reference gene set for lentil that can be used for multiple applications, permitting identification of genes for pathway-specific expression analysis, genetic modification approaches, development of resources for genotypic analysis, and assistance in the annotation of a future lentil genome sequence.
Collapse
Affiliation(s)
- Shimna Sudheesh
- Biosciences Research, Agriculture Victoria, AgriBio, 5 Ring Road, La Trobe University, Bundoora, VIC 3083, Australia.
| | - Preeti Verma
- Biosciences Research, Agriculture Victoria, AgriBio, 5 Ring Road, La Trobe University, Bundoora, VIC 3083, Australia.
| | - John W Forster
- Biosciences Research, Agriculture Victoria, AgriBio, 5 Ring Road, La Trobe University, Bundoora, VIC 3083, Australia.
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3086, Australia.
| | - Noel O I Cogan
- Biosciences Research, Agriculture Victoria, AgriBio, 5 Ring Road, La Trobe University, Bundoora, VIC 3083, Australia.
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3086, Australia.
| | - Sukhjiwan Kaur
- Biosciences Research, Agriculture Victoria, AgriBio, 5 Ring Road, La Trobe University, Bundoora, VIC 3083, Australia.
| |
Collapse
|
12
|
Sánchez-Martín J, Steuernagel B, Ghosh S, Herren G, Hurni S, Adamski N, Vrána J, Kubaláková M, Krattinger SG, Wicker T, Doležel J, Keller B, Wulff BBH. Rapid gene isolation in barley and wheat by mutant chromosome sequencing. Genome Biol 2016; 17:221. [PMID: 27795210 PMCID: PMC5087116 DOI: 10.1186/s13059-016-1082-1] [Citation(s) in RCA: 186] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2016] [Accepted: 10/10/2016] [Indexed: 11/18/2022] Open
Abstract
Identification of causal mutations in barley and wheat is hampered by their large genomes and suppressed recombination. To overcome these obstacles, we have developed MutChromSeq, a complexity reduction approach based on flow sorting and sequencing of mutant chromosomes, to identify induced mutations by comparison to parental chromosomes. We apply MutChromSeq to six mutants each of the barley Eceriferum-q gene and the wheat Pm2 genes. This approach unambiguously identified single candidate genes that were verified by Sanger sequencing of additional mutants. MutChromSeq enables reference-free forward genetics in barley and wheat, thus opening up their pan-genomes to functional genomics.
Collapse
Affiliation(s)
- Javier Sánchez-Martín
- Department of Plant and Microbial Biology, University of Zürich, Zollikerstrasse 107, Zürich, CH-8008 Switzerland
| | | | - Sreya Ghosh
- John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - Gerhard Herren
- Department of Plant and Microbial Biology, University of Zürich, Zollikerstrasse 107, Zürich, CH-8008 Switzerland
| | - Severine Hurni
- Department of Plant and Microbial Biology, University of Zürich, Zollikerstrasse 107, Zürich, CH-8008 Switzerland
| | - Nikolai Adamski
- John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - Jan Vrána
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, Olomouc, CZ-78371 Czech Republic
| | - Marie Kubaláková
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, Olomouc, CZ-78371 Czech Republic
| | - Simon G. Krattinger
- Department of Plant and Microbial Biology, University of Zürich, Zollikerstrasse 107, Zürich, CH-8008 Switzerland
| | - Thomas Wicker
- Department of Plant and Microbial Biology, University of Zürich, Zollikerstrasse 107, Zürich, CH-8008 Switzerland
| | - Jaroslav Doležel
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, Olomouc, CZ-78371 Czech Republic
| | - Beat Keller
- Department of Plant and Microbial Biology, University of Zürich, Zollikerstrasse 107, Zürich, CH-8008 Switzerland
| | | |
Collapse
|
13
|
Xu W, Wang T, Xu S, Xu S, Wu L, Wu Y, Bian P. Radiation-induced epigenetic bystander effects demonstrated in Arabidopsis thaliana. Radiat Res 2015; 183:511-24. [PMID: 25938771 DOI: 10.1667/rr13909.1] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Radiation-induced bystander effects (RIBE) in vivo in the higher plant Arabidopsis thaliana ( A. thaliana ) have been well demonstrated in terms of effects on development and genetics. However, there is not yet robust evidence regarding RIBE-mediated epigenetic changes in plants. To address this, in the current work the roots of A. thaliana seedlings were locally irradiated with 10 Gy of α particles, after which DNA methylation in bystander aerial plants were detected using the methylation-sensitive amplification polymorphism (MSAP) and bisulfite sequencing PCR (BSP). Results showed that irradiation of the roots led to long-distance changes in DNA methylation patterns at some CCGG sites over the whole genome, specifically from hemi-methylation to non-methylation, and the methylation ratios, mainly at CG sites, strongly indicating the existence of RIBE-mediated epigenetic changes in higher plants. Root irradiation also influenced expressions of DNA methylation-related MET1, DRM2 and SUVH4 genes and demethylation-related DML3 gene in bystander aerial plants, suggesting a modulation of RIBE to the methylation machinery in plants. In addition, the multicopy P35S:GUS in A. thaliana line L5-1, which is silenced epigenetically by DNA methylation and histone modification, was transcriptionally activated through the RIBE. The transcriptional activation could be significantly inhibited by the treatment with reactive oxygen species (ROS) scavenger dimethyl sulfoxide (DMSO), indicative of a pivotal role of ROS in RIBE-mediated epigenetic changes. Time course analyses showed that the bystander signaling molecule(s) for transcriptional activation of multicopy P35S:GUS, although of unknown chemical nature, were generated in the root cells within 24 h postirradiation.
Collapse
Affiliation(s)
- Wei Xu
- Key Laboratory of Ion Beam Bio-engineering, Hefei Institutes of Physical Science, Chinese Academy of Sciences and Anhui Province, Hefei, Anhui 230031, P.R. China
| | | | | | | | | | | | | |
Collapse
|
14
|
Evolution of DNA methylation patterns in the Brassicaceae is driven by differences in genome organization. PLoS Genet 2014; 10:e1004785. [PMID: 25393550 PMCID: PMC4230842 DOI: 10.1371/journal.pgen.1004785] [Citation(s) in RCA: 140] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2014] [Accepted: 09/26/2014] [Indexed: 12/21/2022] Open
Abstract
DNA methylation is an ancient molecular modification found in most eukaryotes. In plants, DNA methylation is not only critical for transcriptionally silencing transposons, but can also affect phenotype by altering expression of protein coding genes. The extent of its contribution to phenotypic diversity over evolutionary time is, however, unclear, because of limited stability of epialleles that are not linked to DNA mutations. To dissect the relative contribution of DNA methylation to transposon surveillance and host gene regulation, we leveraged information from three species in the Brassicaceae that vary in genome architecture, Capsella rubella, Arabidopsis lyrata, and Arabidopsis thaliana. We found that the lineage-specific expansion and contraction of transposon and repeat sequences is the main driver of interspecific differences in DNA methylation. The most heavily methylated portions of the genome are thus not conserved at the sequence level. Outside of repeat-associated methylation, there is a surprising degree of conservation in methylation at single nucleotides located in gene bodies. Finally, dynamic DNA methylation is affected more by tissue type than by environmental differences in all species, but these responses are not conserved. The majority of DNA methylation variation between species resides in hypervariable genomic regions, and thus, in the context of macroevolution, is of limited phenotypic consequence. DNA methylation is an epigenetic mark that has received a great deal of attention in plants because it can be stably transmitted across generations. However, the rate of DNA methylation change, or epimutation, is greater than that of DNA mutation. In addition, different from DNA sequence, DNA methylation can vary within an individual in response to developmental or environmental cues. Whether altered characters can be passed on to the next generation via directed modifications in DNA methylation is a question of great interest. We have compared how DNA methylation changes between species, tissues, and environments using three closely related crucifers as examples. We found that DNA methylation is different between roots and shoots and changes with temperatures, but that such changes are not conserved across species. Moreover, most of the methylated sites are not conserved between species. This suggests that DNA methylation may respond to immediate fluctuations in the environment, but this response is not retained over long evolutionary periods. Thus, in contrast to transcriptional responses, conserved epigenetic responses at the level of DNA methylation are not widespread. Instead, the patterns of DNA methylation are largely determined by the evolution of genome structure, and responsive loci are likely short-lived accidents of this process.
Collapse
|
15
|
Ilut DC, Nydam ML, Hare MP. Defining loci in restriction-based reduced representation genomic data from nonmodel species: sources of bias and diagnostics for optimal clustering. BIOMED RESEARCH INTERNATIONAL 2014; 2014:675158. [PMID: 25057498 PMCID: PMC4095725 DOI: 10.1155/2014/675158] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2014] [Revised: 06/02/2014] [Accepted: 06/04/2014] [Indexed: 01/15/2023]
Abstract
Next generation sequencing holds great promise for applications of phylogeography, landscape genetics, and population genomics in wild populations of nonmodel species, but the robustness of inferences hinges on careful experimental design and effective bioinformatic removal of predictable artifacts. Addressing this issue, we use published genomes from a tunicate, stickleback, and soybean to illustrate the potential for bioinformatic artifacts and introduce a protocol to minimize two sources of error expected from similarity-based de-novo clustering of stacked reads: the splitting of alleles into different clusters, which creates false homozygosity, and the grouping of paralogs into the same cluster, which creates false heterozygosity. We present an empirical application focused on Ciona savignyi, a tunicate with very high SNP heterozygosity (~0.05), because high diversity challenges the computational efficiency of most existing nonmodel pipelines while also potentially exacerbating paralog artifacts. The simulated and empirical data illustrate the advantages of using higher sequence difference clustering thresholds than is typical and demonstrate the utility of our protocol for efficiently identifying an optimum threshold from data without prior knowledge of heterozygosity. The empirical Ciona savignyi data also highlight null alleles as a potentially large source of false homozygosity in restriction-based reduced representation genomic data.
Collapse
Affiliation(s)
- Daniel C. Ilut
- Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14850, USA
| | - Marie L. Nydam
- Division of Science and Mathematics, Centre College, Danville, KY 40422, USA
| | - Matthew P. Hare
- Department of Natural Resources, Cornell University, Ithaca, NY 14850, USA
| |
Collapse
|
16
|
Low ETL, Rosli R, Jayanthi N, Mohd-Amin AH, Azizi N, Chan KL, Maqbool NJ, Maclean P, Brauning R, McCulloch A, Moraga R, Ong-Abdullah M, Singh R. Analyses of hypomethylated oil palm gene space. PLoS One 2014; 9:e86728. [PMID: 24497974 PMCID: PMC3907425 DOI: 10.1371/journal.pone.0086728] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2013] [Accepted: 12/15/2013] [Indexed: 12/21/2022] Open
Abstract
Demand for palm oil has been increasing by an average of ∼8% the past decade and currently accounts for about 59% of the world's vegetable oil market. This drives the need to increase palm oil production. Nevertheless, due to the increasing need for sustainable production, it is imperative to increase productivity rather than the area cultivated. Studies on the oil palm genome are essential to help identify genes or markers that are associated with important processes or traits, such as flowering, yield and disease resistance. To achieve this, 294,115 and 150,744 sequences from the hypomethylated or gene-rich regions of Elaeis guineensis and E. oleifera genome were sequenced and assembled into contigs. An additional 16,427 shot-gun sequences and 176 bacterial artificial chromosomes (BAC) were also generated to check the quality of libraries constructed. Comparison of these sequences revealed that although the methylation-filtered libraries were sequenced at low coverage, they still tagged at least 66% of the RefSeq supported genes in the BAC and had a filtration power of at least 2.0. A total 33,752 microsatellites and 40,820 high-quality single nucleotide polymorphism (SNP) markers were identified. These represent the most comprehensive collection of microsatellites and SNPs to date and would be an important resource for genetic mapping and association studies. The gene models predicted from the assembled contigs were mined for genes of interest, and 242, 65 and 14 oil palm transcription factors, resistance genes and miRNAs were identified respectively. Examples of the transcriptional factors tagged include those associated with floral development and tissue culture, such as homeodomain proteins, MADS, Squamosa and Apetala2. The E. guineensis and E. oleifera hypomethylated sequences provide an important resource to understand the molecular mechanisms associated with important agronomic traits in oil palm.
Collapse
Affiliation(s)
- Eng-Ti L. Low
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, Kajang, Selangor, Malaysia
| | - Rozana Rosli
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, Kajang, Selangor, Malaysia
| | - Nagappan Jayanthi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, Kajang, Selangor, Malaysia
| | - Ab Halim Mohd-Amin
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, Kajang, Selangor, Malaysia
| | - Norazah Azizi
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, Kajang, Selangor, Malaysia
| | - Kuang-Lim Chan
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, Kajang, Selangor, Malaysia
| | | | - Paul Maclean
- AgResearch Ruakura Research Centre, Hamilton, New Zealand
| | - Rudi Brauning
- AgResearch Invermay Agricultural Centre, Mosgiel, New Zealand
| | - Alan McCulloch
- AgResearch Invermay Agricultural Centre, Mosgiel, New Zealand
| | - Roger Moraga
- AgResearch Grasslands Research Centre, Palmerston North, New Zealand
| | - Meilina Ong-Abdullah
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, Kajang, Selangor, Malaysia
| | - Rajinder Singh
- Advanced Biotechnology and Breeding Centre, Malaysian Palm Oil Board, Kajang, Selangor, Malaysia
- * E-mail:
| |
Collapse
|
17
|
Rai KM, Singh SK, Bhardwaj A, Kumar V, Lakhwani D, Srivastava A, Jena SN, Yadav HK, Bag SK, Sawant SV. Large-scale resource development in Gossypium hirsutum L. by 454 sequencing of genic-enriched libraries from six diverse genotypes. PLANT BIOTECHNOLOGY JOURNAL 2013; 11:953-963. [PMID: 23782852 DOI: 10.1111/pbi.12088] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2013] [Revised: 04/19/2013] [Accepted: 05/04/2013] [Indexed: 06/02/2023]
Abstract
The sequence information has been proved to be an essential genomic resource in case of crop plants for their genetic improvement and better utilization by humans. To dissect the Gossypium hirsutum genome for large-scale development of genomic resources, we adopted hypomethylated restriction-based genomic enrichment strategy to sequence six diverse genotypes. Approximately 5.2-Gb data (more than 18.36 million reads) was generated which, after assembly, represents nearly 1.27-Gb genomic sequences. We predicted a total of 93,363 gene models (21,399 full length) and identified 35,923 gene models which were validated against already sequenced plant genomes. A total of 1,093 transcription factor-encoding genes, 3,135 promoter sequences and 78 miRNA (including 17 newly identified in Gossypium) were predicted. We identified significant no. of molecular markers including 47,093 novel simple sequence repeats and 66,364 novel single nucleotide polymorphisms. In addition, we developed NBRI-Comprehensive Cotton Genomics database, a web resource to provide access of cotton-related genomic resources developed at NBRI. This study contributes considerable amount of genomic resources and suggests a potential role of genic-enriched sequencing in genomic resource development for orphan crop plants.
Collapse
Affiliation(s)
- Krishan Mohan Rai
- Plant Molecular Biology Laboratory, CSIR-National Botanical Research Institute, Lucknow, India
| | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Sierro N, van Oeveren J, van Eijk MJT, Martin F, Stormo KE, Peitsch MC, Ivanov NV. Whole genome profiling physical map and ancestral annotation of tobacco Hicks Broadleaf. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 75:880-9. [PMID: 23672264 PMCID: PMC3824204 DOI: 10.1111/tpj.12247] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2013] [Revised: 04/22/2013] [Accepted: 05/09/2013] [Indexed: 05/08/2023]
Abstract
Genomics-based breeding of economically important crops such as banana, coffee, cotton, potato, tobacco and wheat is often hampered by genome size, polyploidy and high repeat content. We adapted sequence-based whole-genome profiling (WGP™) technology to obtain insight into the polyploidy of the model plant Nicotiana tabacum (tobacco). N. tabacum is assumed to originate from a hybridization event between ancestors of Nicotiana sylvestris and Nicotiana tomentosiformis approximately 200,000 years ago. This resulted in tobacco having a haploid genome size of 4500 million base pairs, approximately four times larger than the related tomato (Solanum lycopersicum) and potato (Solanum tuberosum) genomes. In this study, a physical map containing 9750 contigs of bacterial artificial chromosomes (BACs) was constructed. The mean contig size was 462 kbp, and the calculated genome coverage equaled the estimated tobacco genome size. We used a method for determination of the ancestral origin of the genome by annotation of WGP sequence tags. This assignment agreed with the ancestral annotation available from the tobacco genetic map, and may be used to investigate the evolution of homoeologous genome segments after polyploidization. The map generated is an essential scaffold for the tobacco genome. We propose the combination of WGP physical mapping technology and tag profiling of ancestral lines as a generally applicable method to elucidate the ancestral origin of genome segments of polyploid species. The physical mapping of genes and their origins will enable application of biotechnology to polyploid plants aimed at accelerating and increasing the precision of breeding for abiotic and biotic stress resistance.
Collapse
Affiliation(s)
- Nicolas Sierro
- Biological System Research, Philip Morris International R&D, Philip Morris Products SA Quai Jeanrenaud 5, 2000Neuchatel, Switzerland
| | - Jan van Oeveren
- Keygene NVAgro Business Park 90, 6708 PW, Wageningen, The Netherlands
| | | | - Florian Martin
- Biological System Research, Philip Morris International R&D, Philip Morris Products SA Quai Jeanrenaud 5, 2000Neuchatel, Switzerland
| | - Keith E Stormo
- Amplicon Express Inc.2345 NE Hopkins Court, Pullman, WA, 99163, USA
| | - Manuel C Peitsch
- Biological System Research, Philip Morris International R&D, Philip Morris Products SA Quai Jeanrenaud 5, 2000Neuchatel, Switzerland
| | - Nikolai V Ivanov
- Biological System Research, Philip Morris International R&D, Philip Morris Products SA Quai Jeanrenaud 5, 2000Neuchatel, Switzerland
| |
Collapse
|
19
|
Abstract
The long terminal repeat retrotransposon, Magnaporthe gypsy-like element (MAGGY), has been shown to be targeted for cytosine methylation in a subset of Magnaporthe oryzae field isolates. Analysis of the F1 progeny from a genetic cross between methylation-proficient (Br48) and methylation-deficient (GFSI1-7-2) isolates revealed that methylation of the MAGGY element was governed by a single dominant gene. Positional cloning followed by gene disruption and complementation experiments revealed that the responsible gene was the DNA methyltransferase, MoDMT1, an ortholog of Neurospora crassa Dim-2. A survey of MAGGY methylation in 60 Magnaporthe field isolates revealed that 42 isolates from rice, common millet, wheat, finger millet, and buffelgrass were methylation proficient while 18 isolates from foxtail millet, green bristlegrass, Japanese panicgrass, torpedo grass, Guinea grass, and crabgrass were methylation deficient. Phenotypic analyses showed that MoDMT1 plays no major role in development and pathogenicity of the fungus. Quantitative polymerase chain reaction analysis showed that the average copy number of genomic MAGGY elements was not significantly different between methylation-deficient and -proficient field isolates even though the levels of MAGGY transcript were generally higher in the former group. MoDMT1 gene sequences in the methylation-deficient isolates suggested that at least three independent mutations were responsible for the loss of MoDMT1 function. Overall, our data suggest that MoDMT1 is not essential for the natural life cycle of the fungus and raise the possibility that the genus Magnaporthe may be losing the mechanism of DNA methylation on the evolutionary time scale.
Collapse
|
20
|
Neves LG, Davis JM, Barbazuk WB, Kirst M. Whole-exome targeted sequencing of the uncharacterized pine genome. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 75:146-156. [PMID: 23551702 DOI: 10.1111/tpj.12193] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2012] [Revised: 04/26/2013] [Accepted: 03/26/2013] [Indexed: 05/20/2023]
Abstract
The large genome size of many species hinders the development and application of genomic tools to study them. For instance, loblolly pine (Pinus taeda L.), an ecologically and economically important conifer, has a large and yet uncharacterized genome of 21.7 Gbp. To characterize the pine genome, we performed exome capture and sequencing of 14 729 genes derived from an assembly of expressed sequence tags. Efficiency of sequence capture was evaluated and shown to be similar across samples with increasing levels of complexity, including haploid cDNA, haploid genomic DNA and diploid genomic DNA. However, this efficiency was severely reduced for probes that overlapped multiple exons, presumably because intron sequences hindered probe:exon hybridizations. Such regions could not be entirely avoided during probe design, because of the lack of a reference sequence. To improve the throughput and reduce the cost of sequence capture, a method to multiplex the analysis of up to eight samples was developed. Sequence data showed that multiplexed capture was reproducible among 24 haploid samples, and can be applied for high-throughput analysis of targeted genes in large populations. Captured sequences were de novo assembled, resulting in 11 396 expanded and annotated gene models, significantly improving the knowledge about the pine gene space. Interspecific capture was also evaluated with over 98% of all probes designed from P. taeda that were efficient in sequence capture, were also suitable for analysis of the related species Pinus elliottii Engelm.
Collapse
Affiliation(s)
- Leandro G Neves
- Graduate Program in Plant Molecular and Cellular Biology, University of Florida, Gainesville, FL 32611, USA
| | - John M Davis
- Graduate Program in Plant Molecular and Cellular Biology, University of Florida, Gainesville, FL 32611, USA
- School of Forest Resources and Conservation, University of Florida, Gainesville, FL 32611, USA
- University of Florida Genetics Institute, University of Florida, Gainesville, FL 32611, USA
| | - William B Barbazuk
- Graduate Program in Plant Molecular and Cellular Biology, University of Florida, Gainesville, FL 32611, USA
- University of Florida Genetics Institute, University of Florida, Gainesville, FL 32611, USA
- Department of Biology, University of Florida, Gainesville, FL 32611, USA
| | - Matias Kirst
- Graduate Program in Plant Molecular and Cellular Biology, University of Florida, Gainesville, FL 32611, USA
- School of Forest Resources and Conservation, University of Florida, Gainesville, FL 32611, USA
- University of Florida Genetics Institute, University of Florida, Gainesville, FL 32611, USA
| |
Collapse
|
21
|
Pandey N, Ranjan A, Pant P, Tripathi RK, Ateek F, Pandey HP, Patre UV, Sawant SV. CAMTA 1 regulates drought responses in Arabidopsis thaliana. BMC Genomics 2013; 14:216. [PMID: 23547968 PMCID: PMC3621073 DOI: 10.1186/1471-2164-14-216] [Citation(s) in RCA: 107] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2012] [Accepted: 03/22/2013] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Transcription factors (TF) play a crucial role in regulating gene expression and are fit to regulate diverse cellular processes by interacting with other proteins. A TF named calmodulin binding transcription activator (CAMTA) was identified in Arabidopsis thaliana (AtCAMTA1-6). To explore the role of CAMTA1 in drought response, the phenotypic differences and gene expression was studied between camta1 and Col-0 under drought condition. RESULTS In camta1, root development was abolished showing high-susceptibility to induced osmotic stress resulting in small wrinkled rosette leaves and stunted primary root. In camta1 under drought condition, we identified growth retardation, poor WUE, low photosystem II efficiency, decline in RWC and higher sensitivity to drought with reduced survivability. The microarray analysis of drought treated camta1 revealed that CAMTA1 regulates "drought recovery" as most indicative pathway along with other stress response, osmotic balance, apoptosis, DNA methylation and photosynthesis. Interestingly, majority of positively regulated genes were related to plasma membrane and chloroplast. Further, our analysis indicates that CAMTA1 regulates several stress responsive genes including RD26, ERD7, RAB18, LTPs, COR78, CBF1, HSPs etc. and promoter of these genes were enriched with CAMTA recognition cis-element. CAMTA1 probably regulate drought recovery by regulating expression of AP2-EREBP transcription factors and Abscisic acid response. CONCLUSION CAMTA1 rapidly changes broad spectrum of responsive genes of membrane integrity and photosynthetic machinery by generating ABA response for challenging drought stress. Our results demonstrate the important role of CAMTA1 in regulating drought response in Arabidopsis, thus could be genetically engineered for improving drought tolerance in crop.
Collapse
Affiliation(s)
- Neha Pandey
- Council of Scientific and Industrial Research-National Botanical Research Institute, Rana Pratap Marg, Lucknow 226001, INDIA
| | - Alok Ranjan
- Present address: Centre for AIDS Health Disparities Research, Meharry Medical College, Nashville, TN 37208, USA
| | - Poonam Pant
- Council of Scientific and Industrial Research-National Botanical Research Institute, Rana Pratap Marg, Lucknow 226001, INDIA
| | - Rajiv K Tripathi
- Council of Scientific and Industrial Research-National Botanical Research Institute, Rana Pratap Marg, Lucknow 226001, INDIA
| | - Farha Ateek
- Council of Scientific and Industrial Research-National Botanical Research Institute, Rana Pratap Marg, Lucknow 226001, INDIA
| | | | - Uday V Patre
- Council of Scientific and Industrial Research-National Botanical Research Institute, Rana Pratap Marg, Lucknow 226001, INDIA
| | - Samir V Sawant
- Council of Scientific and Industrial Research-National Botanical Research Institute, Rana Pratap Marg, Lucknow 226001, INDIA
| |
Collapse
|
22
|
Matvienko M, Kozik A, Froenicke L, Lavelle D, Martineau B, Perroud B, Michelmore R. Consequences of normalizing transcriptomic and genomic libraries of plant genomes using a duplex-specific nuclease and tetramethylammonium chloride. PLoS One 2013; 8:e55913. [PMID: 23409088 PMCID: PMC3568094 DOI: 10.1371/journal.pone.0055913] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2012] [Accepted: 01/04/2013] [Indexed: 12/22/2022] Open
Abstract
Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce.
Collapse
Affiliation(s)
- Marta Matvienko
- Genome Center, University of California Davis, Davis, California, United States of America
| | - Alexander Kozik
- Genome Center, University of California Davis, Davis, California, United States of America
| | - Lutz Froenicke
- Genome Center, University of California Davis, Davis, California, United States of America
| | - Dean Lavelle
- Genome Center, University of California Davis, Davis, California, United States of America
| | - Belinda Martineau
- Genome Center, University of California Davis, Davis, California, United States of America
| | - Bertrand Perroud
- Genome Center, University of California Davis, Davis, California, United States of America
| | - Richard Michelmore
- Genome Center, University of California Davis, Davis, California, United States of America
- Departments of Plant Sciences, Molecular and Cellular Biology, and Medical Microbiology and Immunology, University of California Davis, Davis, California, United States of America
| |
Collapse
|
23
|
Nagarajan RP, Fouse SD, Bell RJA, Costello JF. Methods for cancer epigenome analysis. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2013; 754:313-38. [PMID: 22956508 DOI: 10.1007/978-1-4419-9967-2_15] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Accurate detection of epimutations in tumor cells is crucial for -understanding the molecular pathogenesis of cancer. Alterations in DNA methylation in cancer are functionally important and clinically relevant, but even this well-studied area is continually re-evaluated in light of unanticipated results, such as the strong association between aberrant DNA methylation in adult tumors and polycomb group profiles in embryonic stem cells, cancer-associated genetic mutations in epigenetic regulators such as DNMT3A and TET family genes, and the discovery of altered 5-hydroxymethylcytosine, a product of TET proteins acting on 5-methylcytosine, in human tumors with TET mutations. The abundance and distribution of covalent histone modifications in primary cancer tissues relative to normal cells is an important but largely uncharted area, although there is good evidence for a mechanistic role of cancer-specific alterations in histone modifications in tumor etiology, drug response, and tumor progression. Meanwhile, the discovery of new epigenetic marks continues, and there are many useful methods for epigenome analysis applicable to primary tumor samples, in addition to cancer cell lines. For DNA methylation and hydroxymethylation, next-generation sequencing allows increasingly inexpensive and quantitative whole-genome profiling. Similarly, the refinement and maturation of chromatin immunoprecipitation with next-generation sequencing (ChIP-seq) has made possible genome-wide mapping of histone modifications, open chromatin, and transcription factor binding sites. Computational tools have been developed apace with these epigenome methods to better enable accurate interpretation of the profiling data.
Collapse
|
24
|
Zhou L, Holliday JA. Targeted enrichment of the black cottonwood (Populus trichocarpa) gene space using sequence capture. BMC Genomics 2012; 13:703. [PMID: 23241106 PMCID: PMC3542275 DOI: 10.1186/1471-2164-13-703] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Accepted: 12/05/2012] [Indexed: 01/04/2023] Open
Abstract
Background High-throughput re-sequencing is rapidly becoming the method of choice for studies of neutral and adaptive processes in natural populations across taxa. As re-sequencing the genome of large numbers of samples is still cost-prohibitive in many cases, methods for genome complexity reduction have been developed in attempts to capture most ecologically-relevant genetic variation. One of these approaches is sequence capture, in which oligonucleotide baits specific to genomic regions of interest are synthesized and used to retrieve and sequence those regions. Results We used sequence capture to re-sequence most predicted exons, their upstream regulatory regions, as well as numerous random genomic intervals in a panel of 48 genotypes of the angiosperm tree Populus trichocarpa (black cottonwood, or ‘poplar’). A total of 20.76Mb (5%) of the poplar genome was targeted, corresponding to 173,040 baits. With 12 indexed samples run in each of four lanes on an Illumina HiSeq instrument (2x100 paired-end), 86.8% of the bait regions were on average sequenced at a depth ≥10X. Few off-target regions (>250bp away from any bait) were present in the data, but on average ~80bp on either side of the baits were captured and sequenced to an acceptable depth (≥10X) to call heterozygous SNPs. Nucleotide diversity estimates within and adjacent to protein-coding genes were similar to those previously reported in Populus spp., while intergenic regions had higher values consistent with a relaxation of selection. Conclusions Our results illustrate the efficiency and utility of sequence capture for re-sequencing highly heterozygous tree genomes, and suggest design considerations to optimize the use of baits in future studies.
Collapse
Affiliation(s)
- Lecong Zhou
- Department of Forest Resources and Environmental Conservation, Virginia Polytechnic Institute and State University, 304 Cheatham Hall, Blacksburg, VA 24061, USA
| | | |
Collapse
|
25
|
Eichten SR, Ellis NA, Makarevitch I, Yeh CT, Gent JI, Guo L, McGinnis KM, Zhang X, Schnable PS, Vaughn MW, Dawe RK, Springer NM. Spreading of heterochromatin is limited to specific families of maize retrotransposons. PLoS Genet 2012; 8:e1003127. [PMID: 23271981 PMCID: PMC3521669 DOI: 10.1371/journal.pgen.1003127] [Citation(s) in RCA: 138] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2012] [Accepted: 10/15/2012] [Indexed: 12/25/2022] Open
Abstract
Transposable elements (TEs) have the potential to act as controlling elements to influence the expression of genes and are often subject to heterochromatic silencing. The current paradigm suggests that heterochromatic silencing can spread beyond the borders of TEs and influence the chromatin state of neighboring low-copy sequences. This would allow TEs to condition obligatory or facilitated epialleles and act as controlling elements. The maize genome contains numerous families of class I TEs (retrotransposons) that are present in moderate to high copy numbers, and many are found in regions near genes, which provides an opportunity to test whether the spreading of heterochromatin from retrotransposons is prevalent. We have investigated the extent of heterochromatin spreading into DNA flanking each family of retrotransposons by profiling DNA methylation and di-methylation of lysine 9 of histone 3 (H3K9me2) in low-copy regions of the maize genome. The effects of different retrotransposon families on local chromatin are highly variable. Some retrotransposon families exhibit enrichment of heterochromatic marks within 800–1,200 base pairs of insertion sites, while other families exhibit very little evidence for the spreading of heterochromatic marks. The analysis of chromatin state in genotypes that lack specific insertions suggests that the heterochromatin in low-copy DNA flanking retrotransposons often results from the spreading of silencing marks rather than insertion-site preferences. Genes located near TEs that exhibit spreading of heterochromatin tend to be expressed at lower levels than other genes. Our findings suggest that a subset of retrotransposon families may act as controlling elements influencing neighboring sequences, while the majority of retrotransposons have little effect on flanking sequences. Transposable elements comprise a substantial portion of many eukaryotic genomes. These mobile fragments of DNA can directly mutate genes through insertions into coding regions but may also affect the gene regulation through nearby insertions. There is evidence that the majority of transposable elements are epigenetically silenced, and in some cases this silencing may spread to neighboring sequences. This spreading of heterochromatin could create a significant fitness tradeoff between transposon silencing and gene expression. The maize genome has a complex organization with many genes flanked by retrotransposons, providing an opportunity to study the interaction of retrotransposons and genes. To survey the prevalence of heterochromatin spreading associated with different retrotransposon families, we profiled the spread of heterochromatin into nearby low copy sequences for 150 high copy retrotransposon families. While many retrotransposons exhibit little to no spreading of heterochromatin, there are some retrotransposon families that do exhibit spreading. Genes located near retrotransposons that spread heterochromatin have lower expression levels. The families of retrotransposons that spread heterochromatin marks to nearby low-copy sequences may have increased fitness costs for the host genome due to their suppression of genes located near insertions.
Collapse
Affiliation(s)
- Steven R. Eichten
- Microbial and Plant Genomics Institute, Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Nathanael A. Ellis
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Irina Makarevitch
- Biology Department, Hamline University, Saint Paul, Minnesota, United States of America
| | - Cheng-Ting Yeh
- Center for Plant Genomics and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | - Jonathan I. Gent
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Lin Guo
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Karen M. McGinnis
- Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
| | - Xiaoyu Zhang
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Patrick S. Schnable
- Center for Plant Genomics and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | - Matthew W. Vaughn
- Texas Advanced Computing Center, University of Texas at Austin, Austin, Texas, United States of America
| | - R. Kelly Dawe
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Nathan M. Springer
- Microbial and Plant Genomics Institute, Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
- * E-mail:
| |
Collapse
|
26
|
Deschamps S, Nannapaneni K, Zhang Y, Hayes K. Local assemblies of paired-end reduced representation libraries sequenced with the illumina genome analyzer in maize. INTERNATIONAL JOURNAL OF PLANT GENOMICS 2012; 2012:360598. [PMID: 23093955 PMCID: PMC3474217 DOI: 10.1155/2012/360598] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2012] [Accepted: 09/15/2012] [Indexed: 06/01/2023]
Abstract
The use of next-generation DNA sequencing technologies has greatly facilitated reference-guided variant detection in complex plant genomes. However, complications may arise when regions adjacent to a read of interest are used for marker assay development, or when reference sequences are incomplete, as short reads alone may not be long enough to ascertain their uniqueness. Here, the possibility of generating longer sequences in discrete regions of the large and complex genome of maize is demonstrated, using a modified version of a paired-end RAD library construction strategy. Reads are generated from DNA fragments first digested with a methylation-sensitive restriction endonuclease, sheared, enriched with biotin and a selective PCR amplification step, and then sequenced at both ends. Sequences are locally assembled into contigs by subgrouping pairs based on the identity of the read anchored by the restriction site. This strategy applied to two maize inbred lines (B14 and B73) generated 183,609 and 129,018 contigs, respectively, out of which at least 76% were >200 bps in length. A subset of putative single nucleotide polymorphisms from contigs aligning to the B73 reference genome with at least one mismatch was resequenced, and 90% of those in B14 were confirmed, indicating that this method is a potent approach for variant detection and marker development in species with complex genomes or lacking extensive reference sequences.
Collapse
Affiliation(s)
- Stéphane Deschamps
- DuPont Agricultural Biotechnology, P.O. Box 80353, Wilmington, DE 19880, USA
| | | | - Yun Zhang
- DuPont Pioneer, P.O. Box 1004, Johnston, IA 50131, USA
| | - Kevin Hayes
- DuPont Pioneer, P.O. Box 1004, Johnston, IA 50131, USA
| |
Collapse
|
27
|
Zellmer AJ, Hanes MM, Hird SM, Carstens BC. Deep Phylogeographic Structure and Environmental Differentiation in the Carnivorous Plant Sarracenia alata. Syst Biol 2012; 61:763-77. [DOI: 10.1093/sysbio/sys048] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Amanda J. Zellmer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; and 2 Department of Biology, Eastern Michigan University, Ypsilanti, MI 48197, USA
| | - Margaret M. Hanes
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; and 2 Department of Biology, Eastern Michigan University, Ypsilanti, MI 48197, USA
| | - Sarah M. Hird
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; and 2 Department of Biology, Eastern Michigan University, Ypsilanti, MI 48197, USA
| | - Bryan C. Carstens
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; and 2 Department of Biology, Eastern Michigan University, Ypsilanti, MI 48197, USA
| |
Collapse
|
28
|
Hamilton JP, Buell CR. Advances in plant genome sequencing. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2012; 70:177-90. [PMID: 22449051 DOI: 10.1111/j.1365-313x.2012.04894.x] [Citation(s) in RCA: 99] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The study of plant biology in the 21st century is, and will continue to be, vastly different from that in the 20th century. One driver for this has been the use of genomics methods to reveal the genetic blueprints for not one but dozens of plant species, as well as resolving genome differences in thousands of individuals at the population level. Genomics technology has advanced substantially since publication of the first plant genome sequence, that of Arabidopsis thaliana, in 2000. Plant genomics researchers have readily embraced new algorithms, technologies and approaches to generate genome, transcriptome and epigenome datasets for model and crop species that have permitted deep inferences into plant biology. Challenges in sequencing any genome include ploidy, heterozygosity and paralogy, all which are amplified in plant genomes compared to animal genomes due to the large genome sizes, high repetitive sequence content, and rampant whole- or segmental genome duplication. The ability to generate de novo transcriptome assemblies provides an alternative approach to bypass these complex genomes and access the gene space of these recalcitrant species. The field of genomics is driven by technological improvements in sequencing platforms; however, software and algorithm development has lagged behind reductions in sequencing costs, improved throughput, and quality improvements. It is anticipated that sequencing platforms will continue to improve the length and quality of output, and that the complementary algorithms and bioinformatic software needed to handle large, repetitive genomes will improve. The future is bright for an exponential improvement in our understanding of plant biology.
Collapse
Affiliation(s)
- John P Hamilton
- Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | | |
Collapse
|
29
|
Tsukahara S, Kawabe A, Kobayashi A, Ito T, Aizu T, Shin-i T, Toyoda A, Fujiyama A, Tarutani Y, Kakutani T. Centromere-targeted de novo integrations of an LTR retrotransposon of Arabidopsis lyrata. Genes Dev 2012; 26:705-13. [PMID: 22431508 DOI: 10.1101/gad.183871.111] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The plant genome evolves with rapid proliferation of LTR-type retrotransposons, which is associated with their clustered accumulation in gene-poor regions, such as centromeres. Despite their major role for plant genome evolution, no mobile LTR element with targeted integration into gene-poor regions has been identified in plants. Here, we report such targeted integrations de novo. We and others have previously shown that an ATCOPIA93 family retrotransposon in Arabidopsis thaliana is mobilized when the DNA methylation machinery is compromised. Although ATCOPIA93 family elements are low copy number in the wild-type A. thaliana genome, high-copy-number related elements are found in the wild-type Arabidopsis lyrata genome, and they show centromere-specific localization. To understand the mechanisms for the clustered accumulation of the A. lyrata elements directly, we introduced one of them, named Tal1 (Transposon of Arabidopsis lyrata 1), into A. thaliana by transformation. The introduced Tal1 was retrotransposed in A. thaliana, and most of the retrotransposed copies were found in centromeric repeats of A. thaliana, suggesting targeted integration. The targeted integration is especially surprising because the centromeric repeat sequences differ considerably between A. lyrata and A. thaliana. Our results revealed unexpectedly dynamic controls for evolution of the transposon-rich heterochromatic regions.
Collapse
Affiliation(s)
- Sayuri Tsukahara
- Department of Integrated Genetics, National Institute of Genetics, Shizuoka 411-8540, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Kilian A, Wenzl P, Huttner E, Carling J, Xia L, Blois H, Caig V, Heller-Uszynska K, Jaccoud D, Hopper C, Aschenbrenner-Kilian M, Evers M, Peng K, Cayla C, Hok P, Uszynski G. Diversity arrays technology: a generic genome profiling technology on open platforms. Methods Mol Biol 2012; 888:67-89. [PMID: 22665276 DOI: 10.1007/978-1-61779-870-2_5] [Citation(s) in RCA: 382] [Impact Index Per Article: 31.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
In the last 20 years, we have observed an exponential growth of the DNA sequence data and simular increase in the volume of DNA polymorphism data generated by numerous molecular marker technologies. Most of the investment, and therefore progress, concentrated on human genome and genomes of selected model species. Diversity Arrays Technology (DArT), developed over a decade ago, was among the first "democratizing" genotyping technologies, as its performance was primarily driven by the level of DNA sequence variation in the species rather than by the level of financial investment. DArT also proved more robust to genome size and ploidy-level differences among approximately 60 organisms for which DArT was developed to date compared to other high-throughput genotyping technologies. The success of DArT in a number of organisms, including a wide range of "orphan crops," can be attributed to the simplicity of underlying concepts: DArT combines genome complexity reduction methods enriching for genic regions with a highly parallel assay readout on a number of "open-access" microarray platforms. The quantitative nature of the assay enabled a number of applications in which allelic frequencies can be estimated from DArT arrays. A typical DArT assay tests for polymorphism tens of thousands of genomic loci with the final number of markers reported (hundreds to thousands) reflecting the level of DNA sequence variation in the tested loci. Detailed DArT methods, protocols, and a range of their application examples as well as DArT's evolution path are presented.
Collapse
Affiliation(s)
- Andrzej Kilian
- Diversity Arrays Technology Pty Ltd, Yarralumla, Canberra, ACT, Australia.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Barhoumi A, Halas NJ. Detecting Chemically Modified DNA Bases Using Surface Enhanced Raman Spectroscopy. J Phys Chem Lett 2011; 2:3118-3123. [PMID: 24427449 PMCID: PMC3888867 DOI: 10.1021/jz201423b] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Post-translational modifications of DNA- changes in the chemical structure of individual bases that occur without changes in the DNA sequence- are known to alter gene expression. They are believed to result in frequently deleterious phenotypic changes, such as cancer. Methylation of adenine, methylation and hydroxymethylation of cytosine, and guanine oxidation are the primary DNA base modifications identified to date. Here we show it is possible to use surface enhanced Raman spectroscopy (SERS) to detect these primary DNA base modifications. SERS detection of modified DNA bases is label-free and requires minimal additional sample preparation, reducing the possibility of additional chemical modifications induced prior to measurement. This approach shows the feasibility of DNA base modification assessment as a potentially routine analysis that may be further developed for clinical diagnostics.
Collapse
Affiliation(s)
- Aoune Barhoumi
- Department of Chemistry, Rice University, Houston, TX 77005 (USA)
| | - Naomi J. Halas
- Department of Chemistry, Rice University, Houston, TX 77005 (USA)
- Department of Electrical and Computer Engineering, Rice University, Houston, TX 77005 (USA)
- Department of Bioengineering, Rice University, Houston, TX 77005 (USA)
- Corresponding Author PHONE: (713) 348-5611; Fax: (+1)713-348-5686
| |
Collapse
|
32
|
Paux E, Sourdille P, Mackay I, Feuillet C. Sequence-based marker development in wheat: advances and applications to breeding. Biotechnol Adv 2011; 30:1071-88. [PMID: 21989506 DOI: 10.1016/j.biotechadv.2011.09.015] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2011] [Revised: 08/24/2011] [Accepted: 09/25/2011] [Indexed: 01/04/2023]
Abstract
In the past two decades, the wheat community has made remarkable progress in developing molecular resources for breeding. A wide variety of molecular tools has been established to accelerate genetic and physical mapping for facilitating the efficient identification of molecular markers linked to genes and QTL of agronomic interest. Already, wheat breeders are benefiting from a wide range of techniques to follow the introgression of the most favorable alleles in elite material and develop improved varieties. Breeders soon will be able to take advantage of new technological developments based on Next Generation Sequencing. In this paper, we review the molecular toolbox available to wheat scientists and breeders for performing fundamental genomic studies and breeding. Special emphasis is given on the production and detection of single nucleotide polymorphisms (SNPs) that should enable a step change in saturating the wheat genome for more efficient genetic studies and for the development of new selection methods. The perspectives offered by the access to an ordered full genome sequence for further marker development and enhanced precision breeding is also discussed. Finally, we discuss the advantages and limitations of marker-assisted selection for supporting wheat improvement.
Collapse
Affiliation(s)
- Etienne Paux
- INRA-UBP 1095, Genetics Diversity and Ecophysiology of Cereals, 234 Avenue du Brézet, Clermont-Ferrand, France
| | | | | | | |
Collapse
|
33
|
Rivarola M, Foster JT, Chan AP, Williams AL, Rice DW, Liu X, Melake-Berhan A, Huot Creasy H, Puiu D, Rosovitz MJ, Khouri HM, Beckstrom-Sternberg SM, Allan GJ, Keim P, Ravel J, Rabinowicz PD. Castor bean organelle genome sequencing and worldwide genetic diversity analysis. PLoS One 2011; 6:e21743. [PMID: 21750729 PMCID: PMC3131294 DOI: 10.1371/journal.pone.0021743] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2011] [Accepted: 06/10/2011] [Indexed: 11/26/2022] Open
Abstract
Castor bean is an important oil-producing plant in the Euphorbiaceae family. Its high-quality oil contains up to 90% of the unusual fatty acid ricinoleate, which has many industrial and medical applications. Castor bean seeds also contain ricin, a highly toxic Type 2 ribosome-inactivating protein, which has gained relevance in recent years due to biosafety concerns. In order to gain knowledge on global genetic diversity in castor bean and to ultimately help the development of breeding and forensic tools, we carried out an extensive chloroplast sequence diversity analysis. Taking advantage of the recently published genome sequence of castor bean, we assembled the chloroplast and mitochondrion genomes extracting selected reads from the available whole genome shotgun reads. Using the chloroplast reference genome we used the methylation filtration technique to readily obtain draft genome sequences of 7 geographically and genetically diverse castor bean accessions. These sequence data were used to identify single nucleotide polymorphism markers and phylogenetic analysis resulted in the identification of two major clades that were not apparent in previous population genetic studies using genetic markers derived from nuclear DNA. Two distinct sub-clades could be defined within each major clade and large-scale genotyping of castor bean populations worldwide confirmed previously observed low levels of genetic diversity and showed a broad geographic distribution of each sub-clade.
Collapse
MESH Headings
- Base Sequence
- Ricinus communis/classification
- Ricinus communis/genetics
- Ricinus communis/growth & development
- DNA, Chloroplast/chemistry
- DNA, Chloroplast/genetics
- DNA, Circular/chemistry
- DNA, Circular/genetics
- DNA, Mitochondrial/chemistry
- DNA, Mitochondrial/genetics
- DNA, Plant/chemistry
- DNA, Plant/genetics
- Genetic Variation
- Genome, Chloroplast/genetics
- Genome, Mitochondrial/genetics
- Genome, Plant/genetics
- Molecular Sequence Data
- Phylogeny
- Polymorphism, Single Nucleotide
- Sequence Analysis, DNA
- Species Specificity
Collapse
Affiliation(s)
- Maximo Rivarola
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Jeffrey T. Foster
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Agnes P. Chan
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Amber L. Williams
- Department of Biological Sciences, Environmental Genetics and Genomics Laboratory, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Danny W. Rice
- Department of Biology, Indiana University, Bloomington, Indiana, United States of America
| | - Xinyue Liu
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | | | - Heather Huot Creasy
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Daniela Puiu
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - M. J. Rosovitz
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Hoda M. Khouri
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Stephen M. Beckstrom-Sternberg
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, United States of America
- Pathogen Genomics Division, Translational Genomics Research Institute, Phoenix, Arizona, United States of America
| | - Gerard J. Allan
- Department of Biological Sciences, Environmental Genetics and Genomics Laboratory, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Paul Keim
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Jacques Ravel
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| | - Pablo D. Rabinowicz
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
- J. Craig Venter Institute, Rockville, Maryland, United States of America
- Department of Biochemistry and Molecular Biology, University of Maryland School of Medicine, Baltimore, Maryland, United States of America
| |
Collapse
|
34
|
Abstract
Single nucleotide polymorphisms (SNPs) are single base differences between haplotypes. SNPs are abundant in many species and valuable as markers for genetic map construction, modern molecular breeding programs, and quantitative genetic studies. SNPs are readily mined from genomic DNA or cDNA sequence obtained from individuals having two or more distinct genotypes. While automated Sanger sequencing has become less expensive over time, it is still costly to acquire deep Sanger sequence from several genotypes. "Next-generation" DNA sequencing technologies that utilize new chemistries and massively parallel approaches have enabled DNA sequences to be acquired at extremely high depths of coverage faster and for less cost than traditional sequencing. One such method is represented by the Roche/454 Life Sciences GS-FLX Titanium Series, which currently uses pyrosequencing to produce up to 400-600 million bases of DNA sequence/run (>1 million reads, ~400 bp/read). This chapter discusses the use of high-throughput pyrosequencing for SNP discovery by focusing on 454 sequencing of maize cDNA, the development of a computational pipeline for polymorphism detection, and the subsequent identification of over 7,000 putative SNPs between Mo17 and B73 maize. In addition, alternative alignment and polymorphism detection strategies that implement Illumina short reads, data processing and visualization tools, and reduced representation techniques that reduce the sequencing of repeat DNA, thus enabling efficient analysis of genome sequence, are discussed.
Collapse
Affiliation(s)
- W Brad Barbazuk
- Department of Biology and the Genetics Institute, University of Florida, Gainesville, FL, USA.
| | | |
Collapse
|
35
|
[Construction of a methylation filtration library in Hevea brasiliensis.]. YI CHUAN = HEREDITAS 2010; 32:1071-6. [PMID: 20943496 DOI: 10.3724/sp.j.1005.2010.01071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
In order to enrich gene encoding region of Hevea brasiliensis, a methylation filtration library was constructed using Escherichia coli McrBC restriction-modification system. The titers of the non-amplified library and the amplified library were 2.6×106 pfu/ml and 9.0×109, respectively. The rate of positive clones was 86.4%. The lengths of inserted DNA sequence ranged from 1 kb to 2.5 kb and the average size of inserts was 1.2 kb. One hundred clones were selected randomly for sequencing, resulting in splicing out of 81 non-redundant sequences, including 6 contigs and 75 singlets. The redundancy was 17.35%. Blast analysis showed that 39.5% of non-redundant sequences were homologous with the Nr database, 14.81% with the EST database, and 32.1% were unknown sequences. Some sequences were related genes for flowering, insect and disease resistance. Therefore, the rubber tree methylation library is helpful for discovery and cloning of functional genes.
Collapse
|
36
|
Chan AP, Crabtree J, Zhao Q, Lorenzi H, Orvis J, Puiu D, Melake-Berhan A, Jones KM, Redman J, Chen G, Cahoon EB, Gedil M, Stanke M, Haas BJ, Wortman JR, Fraser-Liggett CM, Ravel J, Rabinowicz PD. Draft genome sequence of the oilseed species Ricinus communis. Nat Biotechnol 2010; 28:951-6. [PMID: 20729833 PMCID: PMC2945230 DOI: 10.1038/nbt.1674] [Citation(s) in RCA: 288] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2010] [Accepted: 08/02/2010] [Indexed: 11/11/2022]
Abstract
Castor bean (Ricinus communis) is an oilseed crop that belongs to the spurge (Euphorbiaceae) family, which comprises approximately 6,300 species that include cassava (Manihot esculenta), rubber tree (Hevea brasiliensis) and physic nut (Jatropha curcas). It is primarily of economic interest as a source of castor oil, used for the production of high-quality lubricants because of its high proportion of the unusual fatty acid ricinoleic acid. However, castor bean genomics is also relevant to biosecurity as the seeds contain high levels of ricin, a highly toxic, ribosome-inactivating protein. Here we report the draft genome sequence of castor bean (4.6-fold coverage), the first for a member of the Euphorbiaceae. Whereas most of the key genes involved in oil synthesis and turnover are single copy, the number of members of the ricin gene family is larger than previously thought. Comparative genomics analysis suggests the presence of an ancient hexaploidization event that is conserved across the dicotyledonous lineage.
Collapse
Affiliation(s)
- Agnes P. Chan
- J. Craig Venter Institute (JCVI), Rockville, MD 20850, USA
| | - Jonathan Crabtree
- Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Qi Zhao
- J. Craig Venter Institute (JCVI), Rockville, MD 20850, USA
| | - Hernan Lorenzi
- J. Craig Venter Institute (JCVI), Rockville, MD 20850, USA
| | - Joshua Orvis
- Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Daniela Puiu
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| | | | - Kristine M. Jones
- Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Julia Redman
- Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Grace Chen
- United States Department of Agriculture, Agricultural Research Service, Western Regional Research Center, Crop Improvement and Utilization, Albany, CA 94710, USA
| | - Edgar B. Cahoon
- Center for Plant Science Innovation and Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, NE 68588, USA
| | - Melaku Gedil
- International Institute of Tropical Agriculture, Oyo State, Ibadan, Nigeria
| | - Mario Stanke
- Institut für Mikrobiologie und Genetik, Abteilung Bioinformatik, Universität Göttingen, Göttingen, Germany
| | - Brian J. Haas
- Broad Institute of the Massachusetts Institute of Technology and Harvard, Cambridge MA 02141, USA
| | - Jennifer R. Wortman
- Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Claire M. Fraser-Liggett
- Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Jacques Ravel
- Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Pablo D. Rabinowicz
- J. Craig Venter Institute (JCVI), Rockville, MD 20850, USA
- Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, MD 21201, USA
- Department of Biochemistry and Molecular Biology, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| |
Collapse
|
37
|
Dhillon B, Cavaletto JR, Wood KV, Goodwin SB. Accidental amplification and inactivation of a methyltransferase gene eliminates cytosine methylation in Mycosphaerella graminicola. Genetics 2010; 186:67-77. [PMID: 20610411 PMCID: PMC2940312 DOI: 10.1534/genetics.110.117408] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A de novo search for repetitive elements in the genome sequence of the wheat pathogen Mycosphaerella graminicola identified a family of repeats containing a DNA cytosine methyltransferase sequence (MgDNMT). All 23 MgDNMT sequences identified carried signatures of repeat induced point mutation (RIP). All copies were subtelomeric in location except for one on chromosome 6. Synteny with M. fijiensis implied that the nontelomeric copy on chromosome 6 served as a template for subsequent amplifications. Southern analysis revealed that the MgDNMT sequence also was amplified in 15 additional M. graminicola isolates from various geographical regions. However, this amplification event was specific to M. graminicola; a search for MgDNMT homologs identified only a single, unmutated copy in the genomes of 11 other ascomycetes. A genome-wide methylation assay revealed that M. graminicola lacks cytosine methylation, as expected if its MgDNMT gene is inactivated. Methylation was present in several other species tested, including the closest known relatives of M. graminicola, species S1 and S2. Therefore, the observed changes most likely occurred within the past 10,500 years since the divergence between M. graminicola and S1. Our data indicate that the recent amplification of a single-copy MgDNMT gene made it susceptible to RIP, resulting in complete loss of cytosine methylation in M. graminicola.
Collapse
Affiliation(s)
- Braham Dhillon
- Department of Botany and Plant Pathology, Purdue University, West Lafayette, Indiana 47907, USA
| | | | | | | |
Collapse
|
38
|
Vollbrecht E, Duvick J, Schares JP, Ahern KR, Deewatthanawong P, Xu L, Conrad LJ, Kikuchi K, Kubinec TA, Hall BD, Weeks R, Unger-Wallace E, Muszynski M, Brendel VP, Brutnell TP. Genome-wide distribution of transposed Dissociation elements in maize. THE PLANT CELL 2010; 22:1667-85. [PMID: 20581308 PMCID: PMC2910982 DOI: 10.1105/tpc.109.073452] [Citation(s) in RCA: 92] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2009] [Revised: 04/09/2010] [Accepted: 06/09/2010] [Indexed: 05/18/2023]
Abstract
The maize (Zea mays) transposable element Dissociation (Ds) was mobilized for large-scale genome mutagenesis and to study its endogenous biology. Starting from a single donor locus on chromosome 10, over 1500 elements were distributed throughout the genome and positioned on the maize physical map. Genetic strategies to enrich for both local and unlinked insertions were used to distribute Ds insertions. Global, regional, and local insertion site trends were examined. We show that Ds transposed to both linked and unlinked sites and displayed a nonuniform distribution on the genetic map around the donor r1-sc:m3 locus. Comparison of Ds and Mutator insertions reveals distinct target preferences, which provide functional complementarity of the two elements for gene tagging in maize. In particular, Ds displays a stronger preference for insertions within exons and introns, whereas Mutator insertions are more enriched in promoters and 5'-untranslated regions. Ds has no strong target site consensus sequence, but we identified properties of the DNA molecule inherent to its local structure that may influence Ds target site selection. We discuss the utility of Ds for forward and reverse genetics in maize and provide evidence that genes within a 2- to 3-centimorgan region flanking Ds insertions will serve as optimal targets for regional mutagenesis.
Collapse
Affiliation(s)
- Erik Vollbrecht
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, Iowa 50011, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Foster JT, Allan GJ, Chan AP, Rabinowicz PD, Ravel J, Jackson PJ, Keim P. Single nucleotide polymorphisms for assessing genetic diversity in castor bean (Ricinus communis). BMC PLANT BIOLOGY 2010; 10:13. [PMID: 20082707 PMCID: PMC2832895 DOI: 10.1186/1471-2229-10-13] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2009] [Accepted: 01/18/2010] [Indexed: 05/21/2023]
Abstract
BACKGROUND Castor bean (Ricinus communis) is an agricultural crop and garden ornamental that is widely cultivated and has been introduced worldwide. Understanding population structure and the distribution of castor bean cultivars has been challenging because of limited genetic variability. We analyzed the population genetics of R. communis in a worldwide collection of plants from germplasm and from naturalized populations in Florida, U.S. To assess genetic diversity we conducted survey sequencing of the genomes of seven diverse cultivars and compared the data to a reference genome assembly of a widespread cultivar (Hale). We determined the population genetic structure of 676 samples using single nucleotide polymorphisms (SNPs) at 48 loci. RESULTS Bayesian clustering indicated five main groups worldwide and a repeated pattern of mixed genotypes in most countries. High levels of population differentiation occurred between most populations but this structure was not geographically based. Most molecular variance occurred within populations (74%) followed by 22% among populations, and 4% among continents. Samples from naturalized populations in Florida indicated significant population structuring consistent with local demes. There was significant population differentiation for 56 of 78 comparisons in Florida (pairwise population phiPT values, p < 0.01). CONCLUSION Low levels of genetic diversity and mixing of genotypes have led to minimal geographic structuring of castor bean populations worldwide. Relatively few lineages occur and these are widely distributed. Our approach of determining population genetic structure using SNPs from genome-wide comparisons constitutes a framework for high-throughput analyses of genetic diversity in plants, particularly in species with limited genetic diversity.
Collapse
Affiliation(s)
- Jeffrey T Foster
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, AZ 86011-4073 USA
| | - Gerard J Allan
- Department of Biological Sciences, Environmental Genetics and Genomics Laboratory, Northern Arizona University, Flagstaff, AZ 86011-5640 USA
| | - Agnes P Chan
- J. Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850 USA
| | - Pablo D Rabinowicz
- J. Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850 USA
- Institute for Genome Sciences, University of Maryland School of Medicine, 20 Penn Street, Baltimore, MD 21201 USA
- Department of Biochemistry & Molecular Biology, University of Maryland School of Medicine, 20 Penn Street, Baltimore, MD 21201 USA
| | - Jacques Ravel
- J. Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850 USA
- Institute for Genome Sciences, University of Maryland School of Medicine, 20 Penn Street, Baltimore, MD 21201 USA
- Department of Microbiology & Immunology, University of Maryland School of Medicine, 20 Penn Street, Baltimore, MD 21201 USA
| | - Paul J Jackson
- Defense Biology Division, Lawrence Livermore National Laboratory, Livermore, CA 94551 USA
| | - Paul Keim
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, AZ 86011-4073 USA
| |
Collapse
|
40
|
Bang KH, Lee JW, Kim YC, Kim DH, Lee EH, Jeung JU. Construction of Genomic DNA Library of Korean Ginseng (Panax ginseng C. A. MEYER) and Development of Sequence-Tagged Sites. Biol Pharm Bull 2010; 33:1579-88. [DOI: 10.1248/bpb.33.1579] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Kyong-Hwan Bang
- Ginseng Research Division, National Institute of Horticultural & Herbal Science, Korean Rural Development Administration (RDA)
| | - Jei-Wan Lee
- Ginseng Research Division, National Institute of Horticultural & Herbal Science, Korean Rural Development Administration (RDA)
| | - Young-Chang Kim
- Ginseng Research Division, National Institute of Horticultural & Herbal Science, Korean Rural Development Administration (RDA)
| | - Dong-Hwi Kim
- Ginseng Research Division, National Institute of Horticultural & Herbal Science, Korean Rural Development Administration (RDA)
| | - Eung-Ho Lee
- Ginseng Research Division, National Institute of Horticultural & Herbal Science, Korean Rural Development Administration (RDA)
| | - Ji-Ung Jeung
- Rice Research Division, National Institute of Crop Science, Korean Rural Development Administration (RDA)
| |
Collapse
|
41
|
Zhou S, Wei F, Nguyen J, Bechner M, Potamousis K, Goldstein S, Pape L, Mehan MR, Churas C, Pasternak S, Forrest DK, Wise R, Ware D, Wing RA, Waterman MS, Livny M, Schwartz DC. A single molecule scaffold for the maize genome. PLoS Genet 2009; 5:e1000711. [PMID: 19936062 PMCID: PMC2774507 DOI: 10.1371/journal.pgen.1000711] [Citation(s) in RCA: 115] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2009] [Accepted: 10/05/2009] [Indexed: 11/18/2022] Open
Abstract
About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome-wide, high-resolution optical map of the maize inbred line B73 genome containing >91,000 restriction sites (averaging 1 site/∼23 kb) accrued from mapping genomic DNA molecules. Our optical map comprises 66 contigs, averaging 31.88 Mb in size and spanning 91.5% (2,103.93 Mb/∼2,300 Mb) of the maize genome. A new algorithm was created that considered both optical map and unfinished BAC sequence data for placing 60/66 (2,032.42 Mb) optical map contigs onto the maize iMap. The alignment of optical maps against numerous data sources yielded comprehensive results that proved revealing and productive. For example, gaps were uncovered and characterized within the iMap, the FPC (fingerprinted contigs) map, and the chromosome-wide pseudomolecules. Such alignments also suggested amended placements of FPC contigs on the maize genetic map and proactively guided the assembly of chromosome-wide pseudomolecules, especially within complex genomic regions. Lastly, we think that the full integration of B73 optical maps with the maize iMap would greatly facilitate maize sequence finishing efforts that would make it a valuable reference for comparative studies among cereals, or other maize inbred lines and cultivars. The maize genome contains abundant repeats interspersed by low-copy, gene-coding sequences that make it a challenge to sequence; consequently, current BAC sequence assemblies average 11 contigs per clone. The iMap deals with such complexity by the judicious integration of IBM genetic and B73 physical maps, but the B73 genome structure could differ from the IBM population because of genetic recombination and subsequent rearrangements. Accordingly, we report a genome-wide, high-resolution optical map of maize B73 genome that was constructed from the direct analysis of genomic DNA molecules without using genetic markers. The integration of optical and iMap resources with comparisons to FPC maps enabled a uniquely comprehensive and scalable assessment of a given BAC's sequence assembly, its placement within a FPC contig, and the location of this FPC contig within a chromosome-wide pseudomolecule. As such, the overall utility of the maize optical map for the validation of sequence assemblies has been significant and demonstrates the inherent advantages of single molecule platforms. Construction of the maize optical map represents the first physical map of a eukaryotic genome larger than 400 Mb that was created de novo from individual genomic DNA molecules.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, UW Biotechnology Center, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Fusheng Wei
- Department of Plant Sciences, Arizona Genomics Institute, University of Arizona, Tucson, Arizona, United States of America
| | - John Nguyen
- Departments of Mathematics, Biology, and Computer Science, University of Southern California, Los Angeles, California, United States of America
| | - Mike Bechner
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, UW Biotechnology Center, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Konstantinos Potamousis
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, UW Biotechnology Center, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Steve Goldstein
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, UW Biotechnology Center, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Louise Pape
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, UW Biotechnology Center, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Michael R. Mehan
- Departments of Mathematics, Biology, and Computer Science, University of Southern California, Los Angeles, California, United States of America
| | - Chris Churas
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, UW Biotechnology Center, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Shiran Pasternak
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Dan K. Forrest
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, UW Biotechnology Center, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Roger Wise
- Corn Insects and Crop Genetics Research, United States Department of Agriculture–Agricultural Research Service and Department of Plant Pathology, Iowa State University, Ames, Iowa, United States of America
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Plant, Soil, and Nutrition Research, United States Department of Agriculture–Agricultural Research Service, Ithaca, New York, United States of America
| | - Rod A. Wing
- Department of Plant Sciences, Arizona Genomics Institute, University of Arizona, Tucson, Arizona, United States of America
| | - Michael S. Waterman
- Departments of Mathematics, Biology, and Computer Science, University of Southern California, Los Angeles, California, United States of America
| | - Miron Livny
- Computer Sciences Department, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - David C. Schwartz
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, UW Biotechnology Center, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
- * E-mail:
| |
Collapse
|
42
|
Imelfort M, Edwards D. De novo sequencing of plant genomes using second-generation technologies. Brief Bioinform 2009; 10:609-18. [DOI: 10.1093/bib/bbp039] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
|
43
|
Mun JH, Kwon SJ, Yang TJ, Seol YJ, Jin M, Kim JA, Lim MH, Kim JS, Baek S, Choi BS, Yu HJ, Kim DS, Kim N, Lim KB, Lee SI, Hahn JH, Lim YP, Bancroft I, Park BS. Genome-wide comparative analysis of the Brassica rapa gene space reveals genome shrinkage and differential loss of duplicated genes after whole genome triplication. Genome Biol 2009; 10:R111. [PMID: 19821981 PMCID: PMC2784326 DOI: 10.1186/gb-2009-10-10-r111] [Citation(s) in RCA: 158] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2009] [Revised: 08/09/2009] [Accepted: 10/12/2009] [Indexed: 02/01/2023] Open
Abstract
Euchromatic regions of the Brassica rapa genome were sequenced and mapped onto the corresponding regions in the Arabidopsis thaliana genome. Background Brassica rapa is one of the most economically important vegetable crops worldwide. Owing to its agronomic importance and phylogenetic position, B. rapa provides a crucial reference to understand polyploidy-related crop genome evolution. The high degree of sequence identity and remarkably conserved genome structure between Arabidopsis and Brassica genomes enables comparative tiling sequencing using Arabidopsis sequences as references to select the counterpart regions in B. rapa, which is a strong challenge of structural and comparative crop genomics. Results We assembled 65.8 megabase-pairs of non-redundant euchromatic sequence of B. rapa and compared this sequence to the Arabidopsis genome to investigate chromosomal relationships, macrosynteny blocks, and microsynteny within blocks. The triplicated B. rapa genome contains only approximately twice the number of genes as in Arabidopsis because of genome shrinkage. Genome comparisons suggest that B. rapa has a distinct organization of ancestral genome blocks as a result of recent whole genome triplication followed by a unique diploidization process. A lack of the most recent whole genome duplication (3R) event in the B. rapa genome, atypical of other Brassica genomes, may account for the emergence of B. rapa from the Brassica progenitor around 8 million years ago. Conclusions This work demonstrates the potential of using comparative tiling sequencing for genome analysis of crop species. Based on a comparative analysis of the B. rapa sequences and the Arabidopsis genome, it appears that polyploidy and chromosomal diploidization are ongoing processes that collectively stabilize the B. rapa genome and facilitate its evolution.
Collapse
Affiliation(s)
- Jeong-Hwan Mun
- Department of Agricultural Biotechnology, National Academy of Agricultural Science, Rural Development Administration, 150 Suin-ro, Gwonseon-gu, Suwon 441-707, Korea.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Kronmiller BA, Wise RP. Computational finishing of large sequence contigs reveals interspersed nested repeats and gene islands in the rf1-associated region of maize. PLANT PHYSIOLOGY 2009; 151:483-495. [PMID: 19675151 PMCID: PMC2754626 DOI: 10.1104/pp.109.143370] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2009] [Accepted: 08/03/2009] [Indexed: 05/26/2023]
Abstract
The architecture of grass genomes varies on multiple levels. Large long terminal repeat retrotransposon clusters occupy significant portions of the intergenic regions, and islands of protein-encoding genes are interspersed among the repeat clusters. Hence, advanced assembly techniques are required to obtain completely finished genomes as well as to investigate gene and transposable element distributions. To characterize the organization and distribution of repeat clusters and gene islands across large grass genomes, we present 961- and 594-kb contiguous sequence contigs associated with the rf1 (for restorer of fertility1) locus in the near-centromeric region of maize (Zea mays) chromosome 3. We present two methods for computational finishing of highly repetitive bacterial artificial chromosome clones that have proved successful to close all sequence gaps caused by transposable element insertions. Sixteen repeat clusters were observed, ranging in length from 23 to 155 kb. These repeat clusters are almost exclusively long terminal repeat retrotransposons, of which the paleontology of insertion varies throughout the cluster. Gene islands contain from one to four predicted genes, resulting in a gene density of one gene per 16 kb in gene islands and one gene per 111 kb over the entire sequenced region. The two sequence contigs, when compared with the rice (Oryza sativa) and sorghum (Sorghum bicolor) genomes, retain gene colinearity of 50% and 71%, respectively, and 70% and 100%, respectively, for high-confidence gene models. Collinear genes on single gene islands show that while most expansion of the maize genome has occurred in the repeat clusters, gene islands are not immune and have experienced growth in both intragene and intergene locations.
Collapse
Affiliation(s)
- Brent A Kronmiller
- Bioinformatics and Computational Biology, Iowa State University, Ames, Iowa 50011-1020, USA
| | | |
Collapse
|
45
|
Zeng SH, Liu D, Wang Y. [Advances of gene enrichment in plant genome]. YI CHUAN = HEREDITAS 2009; 31:799-808. [PMID: 19689940 DOI: 10.3724/sp.j.1005.2009.00799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The genome size varies greatly in higher plants. Repetitive sequences account for most of the large plant genomes while low-copy or single copy genic sequences, referred to as gene space, take up only a small portion of the genomes. Considering the large amount of repetitive sequences, it is a great challenge to obtain genic sequences using high-throughout methods in non-model plants bearing large genomes. Currently, several approaches have been developed for gene enrichment on a genome-wide scale, such as cDNA library, methylation filtration library, high Cot library and transposon tagging. Here, we reviewed the technical principles, advantages and disadvantages of these methods, as well as the recent development of methylation filtration technology. An in-depth discussion was performed for selection of one method or combination of methods according to the research objectives and plant materials, especially for plants with large genomes.
Collapse
Affiliation(s)
- Shao-Hua Zeng
- Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China.
| | | | | |
Collapse
|
46
|
Langdon T, Thomas A, Huang L, Farrar K, King J, Armstead I. Fragments of the key flowering gene GIGANTEA are associated with helitron-type sequences in the Pooideae grass Lolium perenne. BMC PLANT BIOLOGY 2009; 9:70. [PMID: 19500419 PMCID: PMC2702305 DOI: 10.1186/1471-2229-9-70] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2009] [Accepted: 06/07/2009] [Indexed: 05/27/2023]
Abstract
BACKGROUND Helitrons are a class of transposable elements which have been identified in a number of species of plants, animals and fungi. They are unique in their proposed rolling-circle mode of replication, have a highly variable copy-number and have been implicated in the restructuring of coding sequences both by their insertion into existing genes and by their incorporation of transcriptionally competent gene fragments. Helitron discovery depends on identifying associated DNA signature sequences and comprehensive evaluation of helitron contribution to a particular genome requires detailed computational analysis of whole genome sequence. Therefore, the role which helitrons have played in modelling non-model plant genomes is largely unknown. RESULTS Cloning of the flowering gene GIGANTEA (GI) from a BAC library of the Pooideae grass Lolium perenne (perennial ryegrass) identified the target gene and several GI pseudogene fragments spanning the first five exons. Analysis of genomic sequence 5' and 3' of one these GI fragments revealed motifs consistent with helitron-type transposon insertion, specifically a putative 5'-A (downward arrow) T-3' insertion site containing 5'-TC and CTAG-3' borders with a sub-terminal 16 bp hairpin. Screening of a BAC library of the closely related grass species Festuca pratensis (meadow fescue) indicated similar helitron-associated GI fragments present in this genome, as well as non-helitron associated GI fragments derived from the same region of GI. In order to investigate the possible extent of ancestral helitron-activity in L. perenne, a methylation-filtered GeneThresher genomic library developed from this species was screened for potential helitron 3' hairpin sequences associated with a 3'-CTRR motif. This identified 7 potential helitron hairpin-types present between at least 9 and 51 times within the L. perenne methylation-filtered library. CONCLUSION This represents evidence for a possible ancestral role for helitrons in modelling the genomes of Lolium and related species.
Collapse
Affiliation(s)
- Tim Langdon
- Institute of Biological, Environmental and Rural Sciences, Gogerddan Campus, Aberystwyth University, Ceredigion, SY23 3EB, UK
| | - Ann Thomas
- Institute of Biological, Environmental and Rural Sciences, Gogerddan Campus, Aberystwyth University, Ceredigion, SY23 3EB, UK
| | - Lin Huang
- Institute of Biological, Environmental and Rural Sciences, Gogerddan Campus, Aberystwyth University, Ceredigion, SY23 3EB, UK
| | - Kerrie Farrar
- Institute of Biological, Environmental and Rural Sciences, Gogerddan Campus, Aberystwyth University, Ceredigion, SY23 3EB, UK
| | - Julie King
- Institute of Biological, Environmental and Rural Sciences, Gogerddan Campus, Aberystwyth University, Ceredigion, SY23 3EB, UK
| | - Ian Armstead
- Institute of Biological, Environmental and Rural Sciences, Gogerddan Campus, Aberystwyth University, Ceredigion, SY23 3EB, UK
| |
Collapse
|
47
|
He L, Dooner HK. Haplotype structure strongly affects recombination in a maize genetic interval polymorphic for Helitron and retrotransposon insertions. Proc Natl Acad Sci U S A 2009; 106:8410-6. [PMID: 19416860 PMCID: PMC2688972 DOI: 10.1073/pnas.0902972106] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2009] [Indexed: 11/18/2022] Open
Abstract
We have asked here how the remarkable variation in maize haplotype structure affects recombination. We compared recombination across a genetic interval of 9S in 2 highly dissimilar heterozygotes that shared 1 parent. The genetic interval in the common haplotype is approximately 100 kb long and contains 6 genes interspersed with gene-fragment-bearing Helitrons and retrotransposons that, together, comprise 70% of its length. In one heterozygote, most intergenic insertions are homozygous, although polymorphic, enabling us to determine whether any recombination junctions fall within them. In the other, most intergenic insertions are hemizygous and, thus, incapable of homologous recombination. Our analysis of the frequency and distribution of recombination in the interval revealed that: (i) Most junctions were circumscribed to the gene space, where they showed a highly nonuniform distribution. In both heterozygotes, more than half of the junctions fell in the stc1 gene, making it a clear recombination hotspot in the region. However, the genetic size of stc1 was 2-fold lower when flanked by a hemizygous 25-kb retrotransposon cluster. (ii) No junctions fell in the hypro1 gene in either heterozygote, making it a genic recombination coldspot. (iii) No recombination occurred within the gene fragments borne on Helitrons nor within retrotransposons, so neither insertion class contributes to the interval's genetic length. (iv) Unexpectedly, several junctions fell in an intergenic region not shared by all 3 haplotypes. (v) In general, the ability of a sequence to recombine correlated inversely with its methylation status. Our results show that haplotypic structural variability strongly affects the frequency and distribution of recombination events in maize.
Collapse
Affiliation(s)
- Limei He
- The Waksman Institute, Rutgers University, Piscataway, NJ 08855; and
| | - Hugo K. Dooner
- The Waksman Institute, Rutgers University, Piscataway, NJ 08855; and
- Department of Plant Biology, Rutgers University, New Brunswick, NJ 08901
| |
Collapse
|
48
|
Marfil CF, Camadro EL, Masuelli RW. Phenotypic instability and epigenetic variability in a diploid potato of hybrid origin, Solanum ruiz-lealii. BMC PLANT BIOLOGY 2009; 9:21. [PMID: 19232108 PMCID: PMC2656509 DOI: 10.1186/1471-2229-9-21] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/25/2008] [Accepted: 02/20/2009] [Indexed: 05/05/2023]
Abstract
BACKGROUND The wild potato Solanum ruiz-lealii Brüch. (2n = 2x = 24), a species of hybrid origin, is endemic to Mendoza province, Argentina. Recurrent flower malformations, which varied among inflorescences of the same plant, were observed in a natural population. These abnormalities could be the result of genomic instabilities, nucleus-cytoplasmic incompatibility or epigenetic changes. To shed some light on their origin, nuclear and mitochondrial DNA of plants with normal and plants with both normal and malformed flowers (from here on designated as plants with normal and plants with abnormal flower phenotypes, respectively) were analyzed by AFLP and restriction analyses, respectively. Also, the wide genome methylation status and the level of methylation of a repetitive sequence were studied by MSAP and Southern blots analyses, respectively. RESULTS AFLP markers and restriction patterns of mitochondrial DNA did not allow the differentiation of normal from abnormal flower phenotypes. However, methylation patterns of nuclear DNA discriminated normal and abnormal flower phenotypes into two different groups, indicating that abnormal phenotypes have a similar methylation status which, in turn, was different from the methylation patterns of normal phenotypes. The abnormal flower phenotype was obtained by treating a normal plant with 5-Azacytidine, a demethylating agent, giving support to the idea of the role of DNA methylation in the origin of flower abnormalities. In addition, the variability detected for DNA methylation was greater than the detected for nucleotide sequence. CONCLUSION The epigenetic nature of the observed flower abnormalities is consistent with the results and indicates that in the diploid hybrid studied, natural variation in methylation profiles of anonymous DNA sequences could be of biological significance.
Collapse
Affiliation(s)
- Carlos F Marfil
- Laboratorio de Biología Molecular, EEA La Consulta INTA, Facultad de Ciencias Agrarias, Universidad Nacional de Cuyo and CONICET, M5528AHB, Alte. Brown 500, Chacras de Coria Mendoza, Argentina
| | - Elsa L Camadro
- EEA Balcarce, INTA-FCA, UNMdP and CONICET, CC 276, 7620, Balcarce, Bs. As., Argentina
| | - Ricardo W Masuelli
- Laboratorio de Biología Molecular, EEA La Consulta INTA, Facultad de Ciencias Agrarias, Universidad Nacional de Cuyo and CONICET, M5528AHB, Alte. Brown 500, Chacras de Coria Mendoza, Argentina
| |
Collapse
|
49
|
Nelson W, Luo M, Ma J, Estep M, Estill J, He R, Talag J, Sisneros N, Kudrna D, Kim H, Ammiraju JSS, Collura K, Bharti AK, Messing J, Wing RA, SanMiguel P, Bennetzen JL, Soderlund C. Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains. BMC Genomics 2008; 9:621. [PMID: 19099592 PMCID: PMC2628917 DOI: 10.1186/1471-2164-9-621] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2008] [Accepted: 12/19/2008] [Indexed: 11/30/2022] Open
Abstract
Background Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends. Results A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary. Conclusion MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of epigenetic boundaries are barely understood at this time, MSLL technology flags both approximate boundaries and methylated genes that deserve additional investigation. MSLL and HMPR sequences provide a valuable resource for maize genome annotation, and are a uniquely valuable complement to any plant genome sequencing project. In order to make these results fully accessible to the community, a web display was developed that shows the alignment of MSLL, HMPR, and other gene-rich sequences to the BACs; this display is continually updated with the latest ESTs and BAC sequences.
Collapse
Affiliation(s)
- William Nelson
- Arizona Genomics Computational Laboratory, BIO5 Institute, University of Arizona, Tucson, Arizona, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Epigenetic Phenomena and Epigenomics in Maize. Epigenomics 2008. [DOI: 10.1007/978-1-4020-9187-2_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
|