1
|
Rozanova IV, Khlestkina EK. [NGS sequencing in barley breeding and genetic studies]. Vavilovskii Zhurnal Genet Selektsii 2021; 24:348-355. [PMID: 33659817 PMCID: PMC7716553 DOI: 10.18699/vj20.627] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Barley (Hordeum vulgare L.) is the one of the most important cereal species used as food and feed crops, as well as for malting and alcohol production. At the end of the last century, traditional breeding techniques were complemented by the use of DNA markers. Molecular markers have also been used extensively for molecular genetic mapping and QTL analysis. In 2012, the barley genome sequencing was completed, which provided a broad range of new opportunities - from a more efficient search for candidate genes controlling economically important traits to genomic selection. The review summarizes the results of the studies performed after barley genome sequencing, which discovered new areas of barley genetics and breeding with high throughput screening and genotyping methods. During this period, intensive studies aimed at identification of barley genomic loci associated with economically important traits have been carried out; online databases and tools for working with barley genomic data and their deposition have appeared and are being replenished. In recent years, GWAS analysis has been used for large-scale phenotypegenotype association studies, which has been widely used in barley since 2010 due to the developed SNP-arrays, as well as genotyping methods based on direct NGS sequencing of selected fractions of the genome. To date, more than 80 papers have been published that describe the results of the GWAS analysis in barley. SNP identification associated with economically important traits and their transformation into CAPS or KASP markers convenient for screening selection material significantly expands the possibilities of marker-assisted selection of barley. In addition, the currently available information on potential target genes and the quality of the whole barley genome sequence provides a good base for applying genome editing technologies to create material for the creation of varieties with desired properties.
Collapse
Affiliation(s)
- I V Rozanova
- Federal Research Center the N.I. Vavilov All-Russian Institute of Plant Genetic Resources (VIR), St. Petersburg, Russia Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - E K Khlestkina
- Federal Research Center the N.I. Vavilov All-Russian Institute of Plant Genetic Resources (VIR), St. Petersburg, Russia Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| |
Collapse
|
2
|
Abstract
Barley (Hordeum vulgare), one of the most widely cultivated cereal crops, possesses a large genome of 5.1 Gbp. Through various international collaborations, the genome has recently been sequenced and assembled at the chromosome-scale by exploiting available genetic and genomic resources. Many wild and cultivated barley accessions have been collected and preserved around the world. These accessions are crucial to obtain diverse natural and induced barley variants. The barley bioresource project aims to investigate the diversity of this crop based on purified seed and DNA samples of a large number of collected accessions. The long-term goal of this project is to analyse the genome sequences of major barley accessions worldwide. In view of technical limitations, a strategy has been employed to establish the exome structure of a selected number of accessions and to perform high-quality chromosome-scale assembly of the genomes of several major representative accessions. For the future project, an efficient annotation pipeline is essential for establishing the function of genomes and genes as well as for using this information for sequence-based digital barley breeding. In this article, the author reviews the existing barley resources along with their applications and discuss possible future directions of research in barley genomics.
Collapse
Affiliation(s)
- Kazuhiro Sato
- Institute of Plant Science and Resources, Okayama University, Kurashiki 710-0046, Japan
| |
Collapse
|
3
|
Barley Genome Sequencing and Assembly—A First Version Reference Sequence. COMPENDIUM OF PLANT GENOMES 2018. [DOI: 10.1007/978-3-319-92528-8_5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
4
|
Ariyadasa R, Mascher M, Nussbaumer T, Schulte D, Frenkel Z, Poursarebani N, Zhou R, Steuernagel B, Gundlach H, Taudien S, Felder M, Platzer M, Himmelbach A, Schmutzer T, Hedley PE, Muehlbauer GJ, Scholz U, Korol A, Mayer KF, Waugh R, Langridge P, Graner A, Stein N. A sequence-ready physical map of barley anchored genetically by two million single-nucleotide polymorphisms. PLANT PHYSIOLOGY 2014; 164:412-23. [PMID: 24243933 PMCID: PMC3875818 DOI: 10.1104/pp.113.228213] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2013] [Accepted: 11/13/2013] [Indexed: 05/18/2023]
Abstract
Barley (Hordeum vulgare) is an important cereal crop and a model species for Triticeae genomics. To lay the foundation for hierarchical map-based sequencing, a genome-wide physical map of its large and complex 5.1 billion-bp genome was constructed by high-information content fingerprinting of almost 600,000 bacterial artificial chromosomes representing 14-fold haploid genome coverage. The resultant physical map comprises 9,265 contigs with a cumulative size of 4.9 Gb representing 96% of the physical length of the barley genome. The reliability of the map was verified through extensive genetic marker information and the analysis of topological networks of clone overlaps. A minimum tiling path of 66,772 minimally overlapping clones was defined that will serve as a template for hierarchical clone-by-clone map-based shotgun sequencing. We integrated whole-genome shotgun sequence data from the individuals of two mapping populations with published bacterial artificial chromosome survey sequence information to genetically anchor the physical map. This novel approach in combination with the comprehensive whole-genome shotgun sequence data sets allowed us to independently validate and improve a previously reported physical and genetic framework. The resources developed in this study will underpin fine-mapping and cloning of agronomically important genes and the assembly of a draft genome sequence.
Collapse
|
5
|
Zuriaga E, Soriano JM, Zhebentyayeva T, Romero C, Dardick C, Cañizares J, Badenes ML. Genomic analysis reveals MATH gene(s) as candidate(s) for Plum pox virus (PPV) resistance in apricot (Prunus armeniaca L.). MOLECULAR PLANT PATHOLOGY 2013; 14:663-77. [PMID: 23672686 PMCID: PMC6638718 DOI: 10.1111/mpp.12037] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Sharka disease, caused by Plum pox virus (PPV), is the most important viral disease affecting Prunus species. A major PPV resistance locus (PPVres) has been mapped to the upper part of apricot (Prunus armeniaca) linkage group 1. In this study, a physical map of the PPVres locus in the PPV-resistant cultivar 'Goldrich' was constructed. Bacterial artificial chromosome (BAC) clones belonging to the resistant haplotype contig were sequenced using 454/GS-FLX Titanium technology. Concurrently, the whole genome of seven apricot varieties (three PPV-resistant and four PPV-susceptible) and two PPV-susceptible apricot relatives (P. sibirica var. davidiana and P. mume) were obtained using the Illumina-HiSeq2000 platform. Single nucleotide polymorphisms (SNPs) within the mapped interval, recorded from alignments against the peach genome, allowed us to narrow down the PPVres locus to a region of ∼196 kb. Searches for polymorphisms linked in coupling with the resistance led to the identification of 68 variants within 23 predicted transcripts according to peach genome annotation. Candidate resistance genes were ranked combining data from variant calling and predicted functions inferred from sequence homology. Together, the results suggest that members of a cluster of meprin and TRAF-C homology domain (MATHd)-containing proteins are the most likely candidate genes for PPV resistance in apricot. Interestingly, MATHd proteins are hypothesized to control long-distance movement (LDM) of potyviruses in Arabidopsis, and restriction for LDM is also a major component of PPV resistance in apricot. Although the PPV resistance gene(s) remains to be unambiguously identified, these results pave the way to the determination of the underlying mechanism and to the development of more accurate breeding strategies.
Collapse
Affiliation(s)
- Elena Zuriaga
- Instituto Valenciano de Investigaciones Agrarias (IVIA), Apartado Oficial, 46113 Moncada, Valencia, Spain
| | | | | | | | | | | | | |
Collapse
|
6
|
Lonardi S, Duma D, Alpert M, Cordero F, Beccuti M, Bhat PR, Wu Y, Ciardo G, Alsaihati B, Ma Y, Wanamaker S, Resnik J, Bozdag S, Luo MC, Close TJ. Combinatorial pooling enables selective sequencing of the barley gene space. PLoS Comput Biol 2013; 9:e1003010. [PMID: 23592960 PMCID: PMC3617026 DOI: 10.1371/journal.pcbi.1003010] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2012] [Accepted: 02/05/2013] [Indexed: 11/23/2022] Open
Abstract
For the vast majority of species – including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding. The problem of obtaining the full genomic sequence of an organism has been solved either via a global brute-force approach (called whole-genome shotgun) or by a divide-and-conquer strategy (called clone-by-clone). Both approaches have advantages and disadvantages in terms of cost, manual labor, and the ability to deal with sequencing errors and highly repetitive regions of the genome. With the advent of second-generation sequencing instruments, the whole-genome shotgun approach has been the preferred choice. The clone-by-clone strategy is, however, still very relevant for large complex genomes. In fact, several research groups and international consortia have produced clone libraries and physical maps for many economically or ecologically important organisms and now are in a position to proceed with sequencing. In this manuscript, we demonstrate the feasibility of this approach on the gene-space of a large, very repetitive plant genome. The novelty of our approach is that, in order to take advantage of the throughput of the current generation of sequencing instruments, we pool hundreds of clones using a special type of “smart” pooling design that allows one to establish with high accuracy the source clone from the sequenced reads in a pool. Extensive simulations and experimental results support our claims.
Collapse
Affiliation(s)
- Stefano Lonardi
- Department of Computer Science and Engineering, University of California, Riverside, California, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
A physical, genetic and functional sequence assembly of the barley genome. Nature 2012; 491:711-6. [PMID: 23075845 DOI: 10.1038/nature11543] [Citation(s) in RCA: 925] [Impact Index Per Article: 77.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2012] [Accepted: 08/30/2012] [Indexed: 12/13/2022]
Abstract
Barley (Hordeum vulgare L.) is among the world's earliest domesticated and most important crop plants. It is diploid with a large haploid genome of 5.1 gigabases (Gb). Here we present an integrated and ordered physical, genetic and functional sequence resource that describes the barley gene-space in a structured whole-genome context. We developed a physical map of 4.98 Gb, with more than 3.90 Gb anchored to a high-resolution genetic map. Projecting a deep whole-genome shotgun assembly, complementary DNA and deep RNA sequence data onto this framework supports 79,379 transcript clusters, including 26,159 'high-confidence' genes with homology support from other plant genomes. Abundant alternative splicing, premature termination codons and novel transcriptionally active regions suggest that post-transcriptional processing forms an important regulatory layer. Survey sequences from diverse accessions reveal a landscape of extensive single-nucleotide variation. Our data provide a platform for both genome-assisted research and enabling contemporary crop improvement.
Collapse
|
8
|
Advances in BAC-based physical mapping and map integration strategies in plants. J Biomed Biotechnol 2012; 2012:184854. [PMID: 22500080 PMCID: PMC3303678 DOI: 10.1155/2012/184854] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2011] [Revised: 10/26/2011] [Accepted: 11/11/2011] [Indexed: 12/29/2022] Open
Abstract
In the advent of next-generation sequencing (NGS) platforms, map-based sequencing strategy has been recently suppressed being too expensive and laborious. The detailed studies on NGS drafts alone indicated these assemblies remain far from gold standard reference quality, especially when applied on complex genomes. In this context the conventional BAC-based physical mapping has been identified as an important intermediate layer in current hybrid sequencing strategy. BAC-based physical map construction and its integration with high-density genetic maps have benefited from NGS and high-throughput array platforms. This paper addresses the current advancements of BAC-based physical mapping and high-throughput map integration strategies to obtain densely anchored well-ordered physical maps. The resulted maps are of immediate utility while providing a template to harness the maximum benefits of the current NGS platforms.
Collapse
|
9
|
Vera Ruiz EM, Soriano JM, Romero C, Zhebentyayeva T, Terol J, Zuriaga E, Llácer G, Abbott AG, Badenes ML. Narrowing down the apricot Plum pox virus resistance locus and comparative analysis with the peach genome syntenic region. MOLECULAR PLANT PATHOLOGY 2011; 12:535-47. [PMID: 21722293 PMCID: PMC6640391 DOI: 10.1111/j.1364-3703.2010.00691.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Sharka disease, caused by the Plum pox virus (PPV), is one of the main limiting factors for stone fruit crops worldwide. Only a few resistance sources have been found in apricot (Prunus armeniaca L.), and most studies have located a major PPV resistance locus (PPVres) on linkage group 1 (LG1). However, the mapping accuracy was not sufficiently reliable and PPVres was predicted within a low confidence interval. In this study, we have constructed two high-density simple sequence repeat (SSR) improved maps with 0.70 and 0.68 markers/cm, corresponding to LG1 of 'Lito' and 'Goldrich' PPV-resistant cultivars, respectively. Using these maps, and excluding genotype-phenotype incongruent individuals, a new binary trait locus (BTL) analysis for PPV resistance was performed, narrowing down the PPVres support intervals to 7.3 and 5.9 cm in 'Lito' and 'Goldrich', respectively. Subsequently, 71 overlapping oligonucleotides (overgo) probes were hybridized against an apricot bacterial artificial chromosome (BAC) library, identifying 870 single BACs from which 340 were anchored onto a map region of approximately 30-40 cm encompassing PPVres. Partial BAC contigs assigned to the two allelic haplotypes (resistant/susceptible) of the PPVres locus were built by high-information content fingerprinting (HICF). In addition, a total of 300 BAC-derived sequences were obtained, and 257 showed significant homology with the peach genome scaffold_1 corresponding to LG1. According to the peach syntenic genome sequence, PPVres was predicted within a region of 2.16 Mb in which a few candidate resistance genes were identified.
Collapse
Affiliation(s)
- Elsa María Vera Ruiz
- Instituto Valenciano de Investigaciones Agrarias (IVIA), Apartado Oficial, Moncada, Valencia, Spain
| | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Steuernagel B, Taudien S, Gundlach H, Seidel M, Ariyadasa R, Schulte D, Petzold A, Felder M, Graner A, Scholz U, Mayer KFX, Platzer M, Stein N. De novo 454 sequencing of barcoded BAC pools for comprehensive gene survey and genome analysis in the complex genome of barley. BMC Genomics 2009; 10:547. [PMID: 19930547 PMCID: PMC2784808 DOI: 10.1186/1471-2164-10-547] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2009] [Accepted: 11/20/2009] [Indexed: 01/18/2023] Open
Abstract
Background De novo sequencing the entire genome of a large complex plant genome like the one of barley (Hordeum vulgare L.) is a major challenge both in terms of experimental feasibility and costs. The emergence and breathtaking progress of next generation sequencing technologies has put this goal into focus and a clone based strategy combined with the 454/Roche technology is conceivable. Results To test the feasibility, we sequenced 91 barcoded, pooled, gene containing barley BACs using the GS FLX platform and assembled the sequences under iterative change of parameters. The BAC assemblies were characterized by N50 of ~50 kb (N80 ~31 kb, N90 ~21 kb) and a Q40 of 94%. For ~80% of the clones, the best assemblies consisted of less than 10 contigs at 24-fold mean sequence coverage. Moreover we show that gene containing regions seem to assemble completely and uninterrupted thus making the approach suitable for detecting complete and positionally anchored genes. By comparing the assemblies of four clones to their complete reference sequences generated by the Sanger method, we evaluated the distribution, quality and representativeness of the 454 sequences as well as the consistency and reliability of the assemblies. Conclusion The described multiplex 454 sequencing of barcoded BACs leads to sequence consensi highly representative for the clones. Assemblies are correct for the majority of contigs. Though the resolution of complex repetitive structures requires additional experimental efforts, our approach paves the way for a clone based strategy of sequencing the barley genome.
Collapse
Affiliation(s)
- Burkhard Steuernagel
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), D-06466 Gatersleben, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Bozdag S, Close TJ, Lonardi S. A compartmentalized approach to the assembly of physical maps. BMC Bioinformatics 2009; 10:217. [PMID: 19604400 PMCID: PMC2717093 DOI: 10.1186/1471-2105-10-217] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2008] [Accepted: 07/15/2009] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Physical maps have been historically one of the cornerstones of genome sequencing and map-based cloning strategies. They also support marker assisted breeding and EST mapping. The problem of building a high quality physical map is computationally challenging due to unavoidable noise in the input fingerprint data. RESULTS We propose a novel compartmentalized method for the assembly of high quality physical maps from fingerprinted clones. The knowledge of genetic markers enables us to group clones into clusters so that clones in the same cluster are more likely to overlap. For each cluster of clones, a local physical map is first constructed using FingerPrinted Contigs (FPC). Then, all the individual maps are carefully merged into the final physical map. Experimental results on the genomes of rice and barley demonstrate that the compartmentalized assembly produces significantly more accurate maps, and that it can detect and isolate clones that would induce "chimeric" contigs if used in the final assembly. CONCLUSION The software is available for download at http://www.cs.ucr.edu/~sbozdag/assembler/
Collapse
Affiliation(s)
- Serdar Bozdag
- National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.
| | | | | |
Collapse
|
12
|
Xia Z, Watanabe S, Chen Q, Sato S, Harada K. A novel manual pooling system for preparing three-dimensional pools of a deep coverage soybean bacterial artificial chromosome library. Mol Ecol Resour 2009; 9:516-24. [PMID: 21564681 DOI: 10.1111/j.1755-0998.2008.02503.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Compared with hybridization-based techniques, polymerase chain reaction-based screening of large insert libraries has been used widely as it is fast, easy and sensitive. However, various pooling strategies are needed to ensure efficient screening. It is time-consuming and labourious to prepare three-dimensional pools for a deep coverage bacterial artificial chromosome (BAC) library of soybean (1.12 × 10(9) bp) in the absence of robotic facility. In the present study, we describe a novel manual pooling system for preparing three-dimensional pools of a soybean BAC library. This simple technique enables a single researcher to construct three-dimensional pools for a deep-coverage (12 haploid genome equivalents) BAC library of soybean in less than 2 months without any robotic manipulation. When the prepared three-dimensional pools were screened with 29 polymerase chain reaction-based markers, an average of 9.2 clones per marker were identified. These identified clones will be useful either in quantitative trait loci gene isolation or in synteny study between soybean and other legumes including Lotus japonicus. This efficient pooling system could be applied to any other BAC libraries without the need for robotic manipulation.
Collapse
Affiliation(s)
- Zhengjun Xia
- National Institute of Agrobiological Sciences, 2-1-2 Kannondai, Tsukuba, Ibaraki 305-8602, Japan Faculty of Horticulture, Chiba University, 648 Matsudo, Chiba 271-8510, Japan Soybean Research Institute, Key Laboratory of Soybean Biology, Ministry of Education, Northeast Agricultural University Harbin, Harbin 150030, China Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba 292-0818, Japan
| | | | | | | | | |
Collapse
|
13
|
Schulte D, Close TJ, Graner A, Langridge P, Matsumoto T, Muehlbauer G, Sato K, Schulman AH, Waugh R, Wise RP, Stein N. The international barley sequencing consortium--at the threshold of efficient access to the barley genome. PLANT PHYSIOLOGY 2009; 149:142-7. [PMID: 19126706 PMCID: PMC2613708 DOI: 10.1104/pp.108.128967] [Citation(s) in RCA: 133] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2008] [Accepted: 11/03/2008] [Indexed: 05/18/2023]
Affiliation(s)
- Daniela Schulte
- Leibniz Institute of Plant Genetics and Crop Plant Research, 06466 Gatersleben, Germany
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Isolation, analysis and marker utility of novel miniature inverted repeat transposable elements from the barley genome. Mol Genet Genomics 2008; 280:275-85. [DOI: 10.1007/s00438-008-0363-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2008] [Accepted: 06/21/2008] [Indexed: 12/15/2022]
|