1
|
Abstract
Techniques involving whole-genome sequencing and whole-population sequencing (metagenomics) are beginning to revolutionize the study of ecology and evolution. This revolution is furthest advanced in the Bacteria and Archaea, and more sequence data are required for genomic ecology to be fully applied to the majority of eukaryotes. Recently developed next-generation sequencing technologies provide practical, massively parallel sequencing at lower cost and without the requirement for large, automated facilities, making genome and transcriptome sequencing and resequencing possible for more projects and more species. These sequencing methods include the 454 implementation of pyrosequencing, Solexa/Illumina reversible terminator technologies, polony sequencing and AB SOLiD. All of these methods use nanotechnology to generate hundreds of thousands of small sequence reads at one time. These technologies have the potential to bring the genomics revolution to whole populations, and to organisms such as endangered species or species of ecological and evolutionary interest. A future is now foreseeable where ecologists may resequence entire genomes from wild populations and perform population genetic studies at a genome, rather than gene, level. The new technologies for high throughput sequencing, their limitations and their applicability to evolutionary and environmental studies, are discussed in this review.
Collapse
Affiliation(s)
- Matthew E Hudson
- Department of Crop Sciences, University of Illinois, Urbana, 334 NSRC, 1101 W. Peabody Blvd., IL 61801, USA
| |
Collapse
|
2
|
Varshney RK, Nayak SN, May GD, Jackson SA. Next-generation sequencing technologies and their implications for crop genetics and breeding. Trends Biotechnol 2009; 27:522-30. [PMID: 19679362 DOI: 10.1016/j.tibtech.2009.05.006] [Citation(s) in RCA: 420] [Impact Index Per Article: 26.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2009] [Revised: 05/21/2009] [Accepted: 05/27/2009] [Indexed: 10/20/2022]
Abstract
Using next-generation sequencing technologies it is possible to resequence entire plant genomes or sample entire transcriptomes more efficiently and economically and in greater depth than ever before. Rather than sequencing individual genomes, we envision the sequencing of hundreds or even thousands of related genomes to sample genetic diversity within and between germplasm pools. Identification and tracking of genetic variation are now so efficient and precise that thousands of variants can be tracked within large populations. In this review, we outline some important areas such as the large-scale development of molecular markers for linkage mapping, association mapping, wide crosses and alien introgression, epigenetic modifications, transcript profiling, population genetics and de novo genome/organellar genome assembly for which these technologies are expected to advance crop genetics and breeding, leading to crop improvement.
Collapse
Affiliation(s)
- Rajeev K Varshney
- Centre of Excellence in Genomics (CEG), International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru 502324, A.P., India.
| | | | | | | |
Collapse
|
3
|
Reddy PS, Nair S, Mallikarjuna G, Kaul T, Markandeya G, Sopory SK, Reddy MK. A high-throughput, low-cost method for the preparation of "sequencing-ready" phage DNA template. Anal Biochem 2008; 376:258-61. [PMID: 18334224 DOI: 10.1016/j.ab.2008.02.015] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2008] [Revised: 02/16/2008] [Accepted: 02/19/2008] [Indexed: 10/22/2022]
Abstract
We have developed a simple and efficient protocol for the isolation of good-quality recombinant phage DNA useful for all downstream processing, including automated sequencing. The overnight-grown phage particles were effectively precipitated (without any contaminating Escherichia coli DNA and other culture media components) by adjusting the pH of the culture medium to 5.2 with sodium acetate, followed by addition of ethanol to 25%. The phage DNA was selectively precipitated with ethanol in the presence of guanidinium thiocyanate under alkaline pH, resulting in uniform quality and quantity of phage DNA. The quality of the phage DNA preparation was demonstrated by DNA sequencing that provided an average read length of >700 bases (PHRED20 quality). This protocol for plating, picking, growing, and subsequent DNA purification of individual phage clones can be completely automated using any standard robotic platform. This protocol does not require any commercial kits and can be completed within 2h.
Collapse
Affiliation(s)
- Palakolanu Sudhakar Reddy
- International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi 110 067, India
| | | | | | | | | | | | | |
Collapse
|
4
|
Smailus DE, Warren RL, Holt RA. Constructing large DNA segments by iterative clone recombination. SYSTEMS AND SYNTHETIC BIOLOGY 2008; 1:139-44. [PMID: 19003448 PMCID: PMC2398714 DOI: 10.1007/s11693-008-9011-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2007] [Revised: 11/28/2007] [Accepted: 01/08/2008] [Indexed: 11/05/2022]
Abstract
Methods for constructing large contiguous segments of DNA will be enabling for Synthetic Biology, where the assembly of genes encoding circuits, biosynthetic pathways or even whole microbial organisms is of interest. Currently, in vitro approaches to DNA synthesis are adequate for generating DNAs that are up to 10s of kbp in length, and in vivo recombination strategies are more suitable for building DNA constructs that are 100 kbp or larger. We have developed a vector system for efficient assembly of large DNA molecules by iterative in vivo recombination of fosmid clones. Two custom fosmid vectors have been built, pFOSAMP and pFOSKAN, that support antibiotic switching. Using this technique we rebuilt two non-contiguous regions of the Haemophilus influenzae genome as episomes in recombinogenic Escherichia coli host cells. These regions together comprise190 kbp, or 10.4% of the H. influenze genome.
Collapse
Affiliation(s)
- Duane E Smailus
- Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, Suite 100, 570 West 7th Avenue, Vancouver, BC, Canada, V5Z 4S6
| | | | | |
Collapse
|
6
|
Savidor A, Donahoo RS, Hurtado-Gonzales O, Verberkmoes NC, Shah MB, Lamour KH, McDonald WH. Expressed peptide tags: an additional layer of data for genome annotation. J Proteome Res 2007; 5:3048-58. [PMID: 17081056 DOI: 10.1021/pr060134x] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
While genome sequencing is becoming ever more routine, genome annotation remains a challenging process. Identification of the coding sequences within the genomic milieu presents a tremendous challenge, especially for eukaryotes with their complex gene architectures. Here, we present a method to assist the annotation process through the use of proteomic data and bioinformatics. Mass spectra of digested protein preparations of the organism of interest were acquired and searched against a protein database created by a six-frame translation of the genome. The identified peptides were mapped back to the genome, compared to the current annotation, and then categorized as supporting or extending the current genome annotation. We named the classified peptides Expressed Peptide Tags (EPTs). The well-annotated bacterium Rhodopseudomonas palustris was used as a control for the method and showed a high degree of correlation between EPT mapping and the current annotation, with 86% of the EPTs confirming existing gene calls and less than 1% of the EPTs expanding on the current annotation. The eukaryotic plant pathogens Phytophthora ramorum and Phytophthora sojae, whose genomes have been recently sequenced and are much less well-annotated, were also subjected to this method. A series of algorithmic steps were taken to increase the confidence of EPT identification for these organisms, including generation of smaller subdatabases to be searched against, and definition of EPT criteria that accommodates the more complex eukaryotic gene architecture. As expected, the analysis of the Phytophthora species showed less correlation between EPT mapping and their current annotation. While approximately 76% of Phytophthora EPTs supported the current annotation, a portion of them (7.7% and 12.9% for P. ramorum and P. sojae, respectively) suggested modification to current gene calls or identified novel genes that were missed by the current genome annotation of these organisms.
Collapse
Affiliation(s)
- Alon Savidor
- Graduate School of Genome Science and Technology, University of Tennessee-Oak Ridge National Laboratory, Oak Ridge, Tennessee 37830, USA
| | | | | | | | | | | | | |
Collapse
|
7
|
Khattra J, Delaney AD, Zhao Y, Siddiqui A, Asano J, McDonald H, Pandoh P, Dhalla N, Prabhu AL, Ma K, Lee S, Ally A, Tam A, Sa D, Rogers S, Charest D, Stott J, Zuyderduyn S, Varhol R, Eaves C, Jones S, Holt R, Hirst M, Hoodless PA, Marra MA. Large-scale production of SAGE libraries from microdissected tissues, flow-sorted cells, and cell lines. Genome Res 2006; 17:108-16. [PMID: 17135571 PMCID: PMC1716260 DOI: 10.1101/gr.5488207] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
We describe the details of a serial analysis of gene expression (SAGE) library construction and analysis platform that has enabled the generation of >298 high-quality SAGE libraries and >30 million SAGE tags primarily from sub-microgram amounts of total RNA purified from samples acquired by microdissection. Several RNA isolation methods were used to handle the diversity of samples processed, and various measures were applied to minimize ditag PCR carryover contamination. Modifications in the SAGE protocol resulted in improved cloning and DNA sequencing efficiencies. Bioinformatic measures to automatically assess DNA sequencing results were implemented to analyze the integrity of ditag structure, linker or cross-species ditag contamination, and yield of high-quality tags per sequence read. Our analysis of singleton tag errors resulted in a method for correcting such errors to statistically determine tag accuracy. From the libraries generated, we produced an essentially complete mapping of reliable 21-base-pair tags to the mouse reference genome sequence for a meta-library of approximately 5 million tags. Our analyses led us to reject the commonly held notion that duplicate ditags are artifacts. Rather than the usual practice of discarding such tags, we conclude that they should be retained to avoid introducing bias into the results and thereby maintain the quantitative nature of the data, which is a major theoretical advantage of SAGE as a tool for global transcriptional profiling.
Collapse
Affiliation(s)
- Jaswinder Khattra
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Allen D. Delaney
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Yongjun Zhao
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Asim Siddiqui
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Jennifer Asano
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Helen McDonald
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Pawan Pandoh
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Noreen Dhalla
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Anna-liisa Prabhu
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Kevin Ma
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Stephanie Lee
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Adrian Ally
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Angela Tam
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Danne Sa
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Sean Rogers
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - David Charest
- Genome British Columbia, Vancouver, British Columbia V5Z 1C6, Canada
| | - Jeff Stott
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Scott Zuyderduyn
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Richard Varhol
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Connie Eaves
- Terry Fox Laboratory, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 1L3, Canada
| | - Steven Jones
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Robert Holt
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Martin Hirst
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
| | - Pamela A. Hoodless
- Terry Fox Laboratory, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 1L3, Canada
| | - Marco A. Marra
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada
- Corresponding author.E-mail ; fax (604) 877-6085
| |
Collapse
|
8
|
Emrich SJ, Barbazuk WB, Li L, Schnable PS. Gene discovery and annotation using LCM-454 transcriptome sequencing. Genome Res 2006; 17:69-73. [PMID: 17095711 PMCID: PMC1716268 DOI: 10.1101/gr.5145806] [Citation(s) in RCA: 248] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
454 DNA sequencing technology achieves significant throughput relative to traditional approaches. More than 261,000 ESTs were generated by 454 Life Sciences from cDNA isolated using laser capture microdissection (LCM) from the developmentally important shoot apical meristem (SAM) of maize (Zea mays L.). This single sequencing run annotated >25,000 maize genomic sequences and also captured approximately 400 expressed transcripts for which homologous sequences have not yet been identified in other species. Approximately 70% of the ESTs generated in this study had not been captured during a previous EST project conducted using a cDNA library constructed from hand-dissected apex tissue that is highly enriched for SAMs. In addition, at least 30% of the 454-ESTs do not align to any of the approximately 648,000 extant maize ESTs using conservative alignment criteria. These results indicate that the combination of LCM and the deep sequencing possible with 454 technology enriches for SAM transcripts not present in current EST collections. RT-PCR was used to validate the expression of 27 genes whose expression had been detected in the SAM via LCM-454 technology, but that lacked orthologs in GenBank. Significantly, transcripts from approximately 74% (20/27) of these validated SAM-expressed "orphans" were not detected in meristem-rich immature ears. We conclude that the coupling of LCM and 454 sequencing technologies facilitates the discovery of rare, possibly cell-type-specific transcripts.
Collapse
Affiliation(s)
- Scott J. Emrich
- Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, Iowa 50010, USA
- Department of Electrical and Computer Engineering, Iowa State University, Ames, Iowa 50010, USA
| | - W. Brad Barbazuk
- Donald Danforth Plant Science Center, St. Louis, Missouri 63132, USA
| | - Li Li
- Interdepartmental Plant Physiology Graduate Major and Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, Iowa 50010, USA
| | - Patrick S. Schnable
- Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, Iowa 50010, USA
- Interdepartmental Plant Physiology Graduate Major and Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, Iowa 50010, USA
- Department of Agronomy and Center for Plant Genomics, Iowa State University, Ames, Iowa 50010, USA
- Corresponding author.E-mail ; fax (515) 294-5256
| |
Collapse
|
9
|
Missirlis PI, Smailus DE, Holt RA. A high-throughput screen identifying sequence and promiscuity characteristics of the loxP spacer region in Cre-mediated recombination. BMC Genomics 2006; 7:73. [PMID: 16595017 PMCID: PMC1479339 DOI: 10.1186/1471-2164-7-73] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2005] [Accepted: 04/04/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Cre-loxP recombination refers to the process of site-specific recombination mediated by two loxP sequences and the Cre recombinase protein. Transgenic experiments exploit integrative recombination, where a donor plasmid carrying a loxP site and DNA of interest integrate into a recipient loxP site in a target genome. Unfortunately, integrative recombination is highly inefficient because the insert is flanked by two loxP sites, which themselves become targets for Cre and lead to subsequent excision of the insert. A small number of mutations have been discovered in parts of the loxP sequence, specifically the spacer and inverted repeat segments, that increase the efficiency of integrative recombination. In this study we introduce a high-throughput in vitro assay to rapidly detect novel loxP spacer mutants and describe the sequence characteristics of successful recombinants. RESULTS We created synthetic loxP oligonucleotides that contained a combination of inverted repeat mutations (the lox66 and lox71 mutations) and mutant spacer sequences, degenerate at 6 of the 8 positions. After in vitro Cre recombination, 3,124 recombinant clones were identified by sequencing. Included in this set were 31 unique, novel, self-recombining sequences. Using network visualization tools, we recognized 12 spacer sets with restricted promiscuity. We observed that increased guanine content at all spacer positions save for position 8 resulted in increased recombination. Interestingly, recombination between identical spacers was not preferred over non-identical spacers. We also identified a set of 16 pairs of loxP spacers that reacted at least twice with another spacer, but not themselves. Further, neither the wild-type P1 phage loxP sequence nor any of the known loxP spacer mutants appeared to be kinetically favoured by Cre recombinase. CONCLUSION This study approached loxP spacer mutant screening in an unbiased manner, assuming nothing about candidate loxP sites save for the conserved 4 and 5 spacer positions. Candidate sites were free to recombine with any other sequence in the pool of all possible sites. The subset of loxP sites identified here are candidates for in vivo serial recombination as they have already demonstrated limited promiscuity with other loxP spacer and stability in the presence of Cre.
Collapse
Affiliation(s)
- Perseus I Missirlis
- Genome Sciences Centre, BC Cancer Agency, Suite 100, 570 West 7th Ave, Vancouver, BC, V5Z 4S6, Canada
| | - Duane E Smailus
- Genome Sciences Centre, BC Cancer Agency, Suite 100, 570 West 7th Ave, Vancouver, BC, V5Z 4S6, Canada
| | - Robert A Holt
- Genome Sciences Centre, BC Cancer Agency, Suite 100, 570 West 7th Ave, Vancouver, BC, V5Z 4S6, Canada
| |
Collapse
|