1
|
Wang Y, Singh R, Tong E, Tang M, Zheng L, Fang H, Li R, Guo L, Song J, Srinivasan R, Sharma A, Lin L, Trujillo JA, Manshardt R, Chen LY, Ming R, Yu Q. Positional cloning and characterization of the papaya diminutive mutant reveal a truncating mutation in the CpMMS19 gene. THE NEW PHYTOLOGIST 2020; 225:2006-2021. [PMID: 31733154 DOI: 10.1111/nph.16325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2018] [Accepted: 10/19/2019] [Indexed: 06/10/2023]
Abstract
The papaya diminutive mutant exhibits miniature stature, retarded growth and reduced fertility. This undesirable mutation appeared in the variety 'Sunset', the progenitor of the transgenic line 'SunUp', and was accidentally carried forward into breeding populations. The diminutive mutation was mapped to chromosome 2 and fine mapped to scaffold 25. Sequencing of a bacterial artificial chromosome in the fine mapped region led to the identification of the target gene responsible for the diminutive mutant, a gene orthologous to MMS19 with a 36.8 kb deletion co-segregating with the diminutive mutant. The genomic sequence of CpMMS19 is 62 kb, consisting of 20 exons and 19 introns. It encodes a protein of 1143 amino acids while the diminutive allele encodes a truncated protein of 287 amino acids. Expression of the full-length CpMMS19 was able to complement the thermosensitive growth of the yeast mms19 deletion mutant while expression of the diminutive allele resulted in increased thermosensitivity. Over-expression of the diminutive allele in Arabidopsis met18 mutant results in a high frequency of seed abortion. The papaya diminutive phenotype is caused by an alteration in gene function rather than a loss-of-function mutation. SCAR (sequence characterized amplified region) markers were developed for rapid detection of the diminutive allele in breeding populations.
Collapse
Affiliation(s)
- Ying Wang
- College of Crop Science, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Ratnesh Singh
- Texas A&M AgriLife Research Center at Dallas, Texas A&M University System, Dallas, TX, 75252, USA
| | - Eric Tong
- Hawaii Agriculture Research Center, Kunia, HI, 96759, USA
| | - Min Tang
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Liwei Zheng
- Texas A&M AgriLife Research Center at Dallas, Texas A&M University System, Dallas, TX, 75252, USA
| | - Hongkun Fang
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Ruoyu Li
- College of Crop Science, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Lin Guo
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Jinjin Song
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Rajeswari Srinivasan
- Department of Tropical Plant & Soil Sciences, University of Hawaii, Honolulu, HI, 96822, USA
| | - Anupma Sharma
- Texas A&M AgriLife Research Center at Dallas, Texas A&M University System, Dallas, TX, 75252, USA
| | - Lianyu Lin
- Texas A&M AgriLife Research Center at Dallas, Texas A&M University System, Dallas, TX, 75252, USA
- College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Jorge A Trujillo
- Department of Biology, University of Texas Rio Grande Valley, Edinburg, TX, 78539, USA
| | - Richard Manshardt
- Department of Tropical Plant & Soil Sciences, University of Hawaii, Honolulu, HI, 96822, USA
| | - Li-Yu Chen
- FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Ray Ming
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Qingyi Yu
- Texas A&M AgriLife Research Center at Dallas, Texas A&M University System, Dallas, TX, 75252, USA
- Hawaii Agriculture Research Center, Kunia, HI, 96759, USA
| |
Collapse
|
2
|
Filho JAF, de Brito LS, Leão AP, Alves AA, Formighieri EF, Júnior MTS. In Silico Approach for Characterization and Comparison of Repeats in the Genomes of Oil and Date Palms. Bioinform Biol Insights 2017; 11:1177932217702388. [PMID: 28469420 PMCID: PMC5402704 DOI: 10.1177/1177932217702388] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2016] [Accepted: 03/02/2017] [Indexed: 11/16/2022] Open
Abstract
Transposable elements (TEs) are mobile genetic elements present in almost all eukaryotic genomes. Due to their typical patterns of repetition, discovery, and characterization, they demand analysis by various bioinformatics software. Probably, as a result of the need for a complex analysis, many genomes publicly available do not have these elements annotated yet. In this study, a de novo and homology-based identification of TEs and microsatellites was performed using genomic data from 3 palm species: Elaeis oleifera (American oil palm, v.1, Embrapa, unpublished; v.8, Malaysian Palm Oil Board [MPOB], public), Elaeis guineensis (African oil palm, v.5, MPOB, public), and Phoenix dactylifera (date palm). The estimated total coverage of TEs was 50.96% (523 572 kb) and 42.31% (593 463 kb), 39.41% (605 015 kb), and 33.67% (187 361 kb), respectively. A total of 155 726 microsatellite loci were identified in the genomes of oil and date palms. This is the first detailed description of repeats in the genomes of oil and date palms. A relatively high diversity and abundance of TEs were found in the genomes, opening a range of further opportunities for applied research in these genera. The development of molecular markers (mainly simple sequence repeat), which may be immediately applied in breeding programs of those species to support the selection of superior genotypes and to enhance knowledge of the genetic structure of the breeding and natural populations, is the most notable opportunity.
Collapse
Affiliation(s)
- Jaire Alves Ferreira Filho
- Graduate Program in Plant Biotechnology, Federal University of Lavras (UFLA), Lavras, Brazil.,Embrapa Agroenergia, Parque Estação Biológica (PqEB), Brasília, Brazil.,Center of Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, Brazil
| | | | | | | | | | - Manoel Teixeira Souza Júnior
- Graduate Program in Plant Biotechnology, Federal University of Lavras (UFLA), Lavras, Brazil.,Embrapa Agroenergia, Parque Estação Biológica (PqEB), Brasília, Brazil
| |
Collapse
|
3
|
Si Z, Du B, Huo J, He S, Liu Q, Zhai H. A genome-wide BAC-end sequence survey provides first insights into sweetpotato (Ipomoea batatas (L.) Lam.) genome composition. BMC Genomics 2016; 17:945. [PMID: 27871234 PMCID: PMC5117676 DOI: 10.1186/s12864-016-3302-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2016] [Accepted: 11/15/2016] [Indexed: 11/14/2022] Open
Abstract
Background Sweetpotato, Ipomoea batatas (L.) Lam., is an important food crop widely grown in the world. However, little is known about the genome of this species because it is a highly heterozygous hexaploid. Gaining a more in-depth knowledge of sweetpotato genome is therefore necessary and imperative. In this study, the first bacterial artificial chromosome (BAC) library of sweetpotato was constructed. Clones from the BAC library were end-sequenced and analyzed to provide genome-wide information about this species. Results The BAC library contained 240,384 clones with an average insert size of 101 kb and had a 7.93–10.82 × coverage of the genome, and the probability of isolating any single-copy DNA sequence from the library was more than 99%. Both ends of 8310 BAC clones randomly selected from the library were sequenced to generate 11,542 high-quality BAC-end sequences (BESs), with an accumulative length of 7,595,261 bp and an average length of 658 bp. Analysis of the BESs revealed that 12.17% of the sweetpotato genome were known repetitive DNA, including 7.37% long terminal repeat (LTR) retrotransposons, 1.15% Non-LTR retrotransposons and 1.42% Class II DNA transposons etc., 18.31% of the genome were identified as sweetpotato-unique repetitive DNA and 10.00% of the genome were predicted to be coding regions. In total, 3,846 simple sequences repeats (SSRs) were identified, with a density of one SSR per 1.93 kb, from which 288 SSRs primers were designed and tested for length polymorphism using 20 sweetpotato accessions, 173 (60.07%) of them produced polymorphic bands. Sweetpotato BESs had significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum than those of Vitis vinifera, Theobroma cacao and Arabidopsis thaliana. Conclusions The first BAC library for sweetpotato has been successfully constructed. The high quality BESs provide first insights into sweetpotato genome composition, and have significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum. These resources as a robust platform will be used in high-resolution mapping, gene cloning, assembly of genome sequences, comparative genomics and evolution for sweetpotato. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3302-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Zengzhi Si
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China
| | - Bing Du
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China
| | - Jinxi Huo
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China
| | - Shaozhen He
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China
| | - Qingchang Liu
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China.
| | - Hong Zhai
- Beijing Key Laboratory of Crop Genetic Improvement/Laboratory of Crop Heterosis and Utilization, Ministry of Education, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
4
|
Santos AA, Penha HA, Bellec A, Munhoz CDF, Pedrosa-Harand A, Bergès H, Vieira MLC. Begin at the beginning: A BAC-end view of the passion fruit (Passiflora) genome. BMC Genomics 2014; 15:816. [PMID: 25260959 PMCID: PMC4189760 DOI: 10.1186/1471-2164-15-816] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2014] [Accepted: 09/22/2014] [Indexed: 12/16/2022] Open
Abstract
Background The passion fruit (Passiflora edulis) is a tropical crop of economic importance both for juice production and consumption as fresh fruit. The juice is also used in concentrate blends that are consumed worldwide. However, very little is known about the genome of the species. Therefore, improving our understanding of passion fruit genomics is essential and to some degree a pre-requisite if its genetic resources are to be used more efficiently. In this study, we have constructed a large-insert BAC library and provided the first view on the structure and content of the passion fruit genome, using BAC-end sequence (BES) data as a major resource. Results The library consisted of 82,944 clones and its levels of organellar DNA were very low. The library represents six haploid genome equivalents, and the average insert size was 108 kb. To check its utility for gene isolation, successful macroarray screening experiments were carried out with probes complementary to eight Passiflora gene sequences available in public databases. BACs harbouring those genes were used in fluorescent in situ hybridizations and unique signals were detected for four BACs in three chromosomes (n = 9). Then, we explored 10,000 BES and we identified reads likely to contain repetitive mobile elements (19.6% of all BES), simple sequence repeats and putative proteins, and to estimate the GC content (~42%) of the reads. Around 9.6% of all BES were found to have high levels of similarity to plant genes and ontological terms were assigned to more than half of the sequences analysed (940). The vast majority of the top-hits made by our sequences were to Populus trichocarpa (24.8% of the total occurrences), Theobroma cacao (21.6%), Ricinus communis (14.3%), Vitis vinifera (6.5%) and Prunus persica (3.8%). Conclusions We generated the first large-insert library for a member of Passifloraceae. This BAC library provides a new resource for genetic and genomic studies, as well as it represents a valuable tool for future whole genome study. Remarkably, a number of BAC-end pair sequences could be mapped to intervals of the sequenced Arabidopsis thaliana, V. vinifera and P. trichocarpa chromosomes, and putative collinear microsyntenic regions were identified. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-816) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Maria Lucia Carneiro Vieira
- Departamento de Genética, Universidade de São Paulo, Escola Superior de Agricultura "Luiz de Queiroz", P,O, Box 83, 13400-970 Piracicaba, Brazil.
| |
Collapse
|
5
|
Dereeper A, Guyot R, Tranchant-Dubreuil C, Anthony F, Argout X, de Bellis F, Combes MC, Gavory F, de Kochko A, Kudrna D, Leroy T, Poulain J, Rondeau M, Song X, Wing R, Lashermes P. BAC-end sequences analysis provides first insights into coffee (Coffea canephora P.) genome composition and evolution. PLANT MOLECULAR BIOLOGY 2013; 83:177-189. [PMID: 23708951 DOI: 10.1007/s11103-013-0077-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2012] [Accepted: 05/14/2013] [Indexed: 06/02/2023]
Abstract
Coffee is one of the world's most important agricultural commodities. Coffee belongs to the Rubiaceae family in the euasterid I clade of dicotyledonous plants, to which the Solanaceae family also belongs. Two bacterial artificial chromosome (BAC) libraries of a homozygous doubled haploid plant of Coffea canephora were constructed using two enzymes, HindIII and BstYI. A total of 134,827 high quality BAC-end sequences (BESs) were generated from the 73,728 clones of the two libraries, and 131,412 BESs were conserved for further analysis after elimination of chloroplast and mitochondrial sequences. This corresponded to almost 13 % of the estimated size of the C. canephora genome. 6.7 % of BESs contained simple sequence repeats, the most abundant (47.8 %) being mononucleotide motifs. These sequences allow the development of numerous useful marker sites. Potential transposable elements (TEs) represented 11.9 % of the full length BESs. A difference was observed between the BstYI and HindIII libraries (14.9 vs. 8.8 %). Analysis of BESs against known coding sequences of TEs indicated that 11.9 % of the genome corresponded to known repeat sequences, like for other flowering plants. The number of genes in the coffee genome was estimated at 41,973 which is probably overestimated. Comparative genome mapping revealed that microsynteny was higher between coffee and grapevine than between coffee and tomato or Arabidopsis. BESs constitute valuable resources for the first genome wide survey of coffee and provide new insights into the composition and evolution of the coffee genome.
Collapse
Affiliation(s)
- Alexis Dereeper
- Institut de Recherche pour le Développement (IRD), UMR RPB (CIRAD, IRD, UM2), BP 64501, 34394, Montpellier Cedex 5, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Sengupta S, Das B, Prasad M, Acharyya P, Ghose TK. A comparative survey of genetic diversity among a set of Caricaceae accessions using microsatellite markers. SPRINGERPLUS 2013; 2:345. [PMID: 23961410 PMCID: PMC3736075 DOI: 10.1186/2193-1801-2-345] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/04/2013] [Accepted: 07/23/2013] [Indexed: 12/03/2022]
Abstract
A preliminary survey of genetic diversity among 34 commercially popular Carica papaya cultivars from India and abroad, 6 accessions of Vasconcellea species and 1 accession of Jacaratia spinosa, was done using 20 simple sequence repeat (SSR) markers. The SSR profiles were used to find out total number of alleles, null and rare alleles, Polymorphism Information Content (PIC) values and to calculate similarity matrix using Jaccard’s coefficient. The subsequent dendrogram was made by unweighted pair-group method of arithmetic average (UPGMA) and neighbor-joining method. Based on these parameters a comparison was made between the Indian papaya cultivars and the rest of the accessions. All the markers showed polymorphism and a total of 140 alleles were identified. The average number of alleles was 7 alleles/locus. Categorically the Vasconcellea and Jacaratia species had 54 alleles, the 7 non-Indian Carica papaya accessions had 70 and the 27 Indian accessions had 102 alleles. The average PIC value was 0.735 per marker. A total of 37 rare alleles were identified. Jacaratia spinosa had 17 rare alleles. Nineteen null alleles were detected among the Carica papaya accessions. A Carica papaya accession from South Africa, Hortus Gold had 5 null alleles. The genetic similarity among the accessions ranged from 7% to 67%. In the dendrogram, the Vasconcellea and Jacaratia spinosa accessions separated as a distinct cluster from the rest of the Carica papaya accessions. The study indicated that the accessions of Indian Carica papaya cultivars included in this survey are genetically more diverse than the non-Indian Carica papaya cultivars.
Collapse
Affiliation(s)
- Samik Sengupta
- National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067 India
| | | | | | | | | |
Collapse
|
7
|
Liu D, Zeng SH, Chen JJ, Zhang YJ, Xiao G, Zhu LY, Wang Y. First insights into the large genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Ttaditional medicinal plant. Int J Mol Sci 2013; 14:13559-76. [PMID: 23807511 PMCID: PMC3742203 DOI: 10.3390/ijms140713559] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2013] [Revised: 05/16/2013] [Accepted: 06/06/2013] [Indexed: 11/30/2022] Open
Abstract
Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant.
Collapse
Affiliation(s)
- Di Liu
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
- University of Chinese Academy of Sciences, Beijing 100039, China
| | - Shao-Hua Zeng
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China; E-Mail:
| | - Jian-Jun Chen
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
| | - Yan-Jun Zhang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
| | - Gong Xiao
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
- University of Chinese Academy of Sciences, Beijing 100039, China
| | - Lin-Yao Zhu
- Wuhan Vegetable Research Station, Wuhan 430065, China; E-Mail:
| | - Ying Wang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
| |
Collapse
|
8
|
Kim C, Lee TH, Compton RO, Robertson JS, Pierce GJ, Paterson AH. A genome-wide BAC end-sequence survey of sugarcane elucidates genome composition, and identifies BACs covering much of the euchromatin. PLANT MOLECULAR BIOLOGY 2013; 81:139-47. [PMID: 23161199 DOI: 10.1007/s11103-012-9987-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2012] [Accepted: 11/07/2012] [Indexed: 05/09/2023]
Abstract
BAC-end sequences (BESs) of hybrid sugarcane cultivar R570 are presented. A total of 66,990 informative BESs were obtained from 43,874 BAC clones. Similarity search using a variety of public databases revealed that 13.5 and 42.8 % of BESs match known gene-coding and repeat regions, respectively. That 11.7 % of BESs are still unmatched to any nucleotide sequences in the current public databases despite the fact that a close relative, sorghum, is fully sequenced, indicates that there may be many sugarcane-specific or lineage-specific sequences. We found 1,742 simple sequence repeat motifs in 1,585 BESs, spanning 27,383 bp in length. As simple sequence repeat markers derived from BESs have some advantages over randomly generated markers, these may be particularly useful for comparing BAC-based physical maps with genetic maps. BES and overgo hybridization information was used for anchoring sugarcane BAC clones to the sorghum genome sequence. While sorghum and sugarcane have extensive similarity in terms of genomic structure, only 2,789 BACs (6.4 %) could be confidently anchored to the sorghum genome at the stringent threshold of having both-end information (BESs or overgos) within 300 Kb. This relatively low rate of anchoring may have been caused in part by small- or large-scale genomic rearrangements in the Saccharum genus after two rounds of whole genome duplication since its divergence from the sorghum lineage about 7.8 million years ago. Limiting consideration to only low-copy matches, 1,245 BACs were placed to 1,503 locations, covering ~198 Mb of the sorghum genome or about 78 % of the estimated 252 Mb of euchromatin. BESs and their analyses presented here may provide an early profile of the sugarcane genome as well as a basis for BAC-by-BAC sequencing of much of the basic gene set of sugarcane.
Collapse
Affiliation(s)
- Changsoo Kim
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
| | | | | | | | | | | |
Collapse
|
9
|
Zhang Q, Ma B, Li H, Chang Y, Han Y, Li J, Wei G, Zhao S, Khan MA, Zhou Y, Gu C, Zhang X, Han Z, Korban SS, Li S, Han Y. Identification, characterization, and utilization of genome-wide simple sequence repeats to identify a QTL for acidity in apple. BMC Genomics 2012; 13:537. [PMID: 23039990 PMCID: PMC3704940 DOI: 10.1186/1471-2164-13-537] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2012] [Accepted: 10/04/2012] [Indexed: 11/10/2022] Open
Abstract
Background Apple is an economically important fruit crop worldwide. Developing a genetic linkage map is a critical step towards mapping and cloning of genes responsible for important horticultural traits in apple. To facilitate linkage map construction, we surveyed and characterized the distribution and frequency of perfect microsatellites in assembled contig sequences of the apple genome. Results A total of 28,538 SSRs have been identified in the apple genome, with an overall density of 40.8 SSRs per Mb. Di-nucleotide repeats are the most frequent microsatellites in the apple genome, accounting for 71.9% of all microsatellites. AT/TA repeats are the most frequent in genomic regions, accounting for 38.3% of all the G-SSRs, while AG/GA dimers prevail in transcribed sequences, and account for 59.4% of all EST-SSRs. A total set of 310 SSRs is selected to amplify eight apple genotypes. Of these, 245 (79.0%) are found to be polymorphic among cultivars and wild species tested. AG/GA motifs in genomic regions have detected more alleles and higher PIC values than AT/TA or AC/CA motifs. Moreover, AG/GA repeats are more variable than any other dimers in apple, and should be preferentially selected for studies, such as genetic diversity and linkage map construction. A total of 54 newly developed apple SSRs have been genetically mapped. Interestingly, clustering of markers with distorted segregation is observed on linkage groups 1, 2, 10, 15, and 16. A QTL responsible for malic acid content of apple fruits is detected on linkage group 8, and accounts for ~13.5% of the observed phenotypic variation. Conclusions This study demonstrates that di-nucleotide repeats are prevalent in the apple genome and that AT/TA and AG/GA repeats are the most frequent in genomic and transcribed sequences of apple, respectively. All SSR motifs identified in this study as well as those newly mapped SSRs will serve as valuable resources for pursuing apple genetic studies, aiding the apple breeding community in marker-assisted breeding, and for performing comparative genomic studies in Rosaceae.
Collapse
Affiliation(s)
- Qiong Zhang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, the Chinese Academy of Sciences, Wuhan 430074, People's Republic of China
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Na JK, Wang J, Murray JE, Gschwend AR, Zhang W, Yu Q, Navajas-Pérez R, Feltus FA, Chen C, Kubat Z, Moore PH, Jiang J, Paterson AH, Ming R. Construction of physical maps for the sex-specific regions of papaya sex chromosomes. BMC Genomics 2012; 13:176. [PMID: 22568889 PMCID: PMC3430574 DOI: 10.1186/1471-2164-13-176] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2011] [Accepted: 03/12/2012] [Indexed: 12/26/2022] Open
Abstract
Background Papaya is a major fruit crop in tropical and subtropical regions worldwide. It is trioecious with three sex forms: male, female, and hermaphrodite. Sex determination is controlled by a pair of nascent sex chromosomes with two slightly different Y chromosomes, Y for male and Yh for hermaphrodite. The sex chromosome genotypes are XY (male), XYh (hermaphrodite), and XX (female). The papaya hermaphrodite-specific Yh chromosome region (HSY) is pericentromeric and heterochromatic. Physical mapping of HSY and its X counterpart is essential for sequencing these regions and uncovering the early events of sex chromosome evolution and to identify the sex determination genes for crop improvement. Results A reiterate chromosome walking strategy was applied to construct the two physical maps with three bacterial artificial chromosome (BAC) libraries. The HSY physical map consists of 68 overlapped BACs on the minimum tiling path, and covers all four HSY-specific Knobs. One gap remained in the region of Knob 1, the only knob structure shared between HSY and X, due to the lack of HSY-specific sequences. This gap was filled on the physical map of the HSY corresponding region in the X chromosome. The X physical map consists of 44 BACs on the minimum tiling path with one gap remaining in the middle, due to the nature of highly repetitive sequences. This gap was filled on the HSY physical map. The borders of the non-recombining HSY were defined genetically by fine mapping using 1460 F2 individuals. The genetically defined HSY spanned approximately 8.5 Mb, whereas its X counterpart extended about 5.4 Mb including a 900 Kb region containing the Knob 1 shared by the HSY and X. The 8.5 Mb HSY corresponds to 4.5 Mb of its X counterpart, showing 4 Mb (89%) DNA sequence expansion. Conclusion The 89% increase of DNA sequence in HSY indicates rapid expansion of the Yh chromosome after genetic recombination was suppressed 2–3 million years ago. The genetically defined borders coincide with the common BACs on the minimum tiling paths of HSY and X. The minimum tiling paths of HSY and its X counterpart are being used for sequencing these X and Yh-specific regions.
Collapse
Affiliation(s)
- Jong-Kuk Na
- Department of Plant Biology, University of Illinois at Urbana Champaign, Urbana, IL 61801, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Wu J, Gu YQ, Hu Y, You FM, Dandekar AM, Leslie CA, Aradhya M, Dvorak J, Luo MC. Characterizing the walnut genome through analyses of BAC end sequences. PLANT MOLECULAR BIOLOGY 2012; 78:95-107. [PMID: 22101470 DOI: 10.1007/s11103-011-9849-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2011] [Accepted: 10/29/2011] [Indexed: 05/31/2023]
Abstract
Persian walnut (Juglans regia L.) is an economically important tree for its nut crop and timber. To gain insight into the structure and evolution of the walnut genome, we constructed two bacterial artificial chromosome (BAC) libraries, containing a total of 129,024 clones, from in vitro-grown shoots of J. regia cv. Chandler using the HindIII and MboI cloning sites. A total of 48,218 high-quality BAC end sequences (BESs) were generated, with an accumulated sequence length of 31.2 Mb, representing approximately 5.1% of the walnut genome. Analysis of repeat DNA content in BESs revealed that approximately 15.42% of the genome consists of known repetitive DNA, while walnut-unique repetitive DNA identified in this study constitutes 13.5% of the genome. Among the walnut-unique repetitive DNA, Julia SINE and JrTRIM elements represent the first identified walnut short interspersed element (SINE) and terminal-repeat retrotransposon in miniature (TRIM) element, respectively; both types of elements are abundant in the genome. As in other species, these SINEs and TRIM elements could be exploited for developing repeat DNA-based molecular markers in walnut. Simple sequence repeats (SSR) from BESs were analyzed and found to be more abundant in BESs than in expressed sequence tags. The density of SSR in the walnut genome analyzed was also slightly higher than that in poplar and papaya. Sequence analysis of BESs indicated that approximately 11.5% of the walnut genome represents a coding sequence. This study is an initial characterization of the walnut genome and provides the largest genomic resource currently available; as such, it will be a valuable tool in studies aimed at genetically improving walnut.
Collapse
Affiliation(s)
- Jiajie Wu
- Department of Plant Sciences, University of California, Davis, CA 95616, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Exploiting BAC-end sequences for the mining, characterization and utility of new short sequences repeat (SSR) markers in Citrus. Mol Biol Rep 2011; 39:5373-86. [PMID: 22170603 DOI: 10.1007/s11033-011-1338-5] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2011] [Accepted: 12/03/2011] [Indexed: 10/14/2022]
Abstract
The aim of this study was to develop a large set of microsatellite markers based on publicly available BAC-end sequences (BESs), and to evaluate their transferability, discriminating capacity of genotypes and mapping ability in Citrus. A set of 1,281 simple sequence repeat (SSR) markers were developed from the 46,339 Citrus clementina BAC-end sequences (BES), of them 20.67% contained SSR longer than 20 bp, corresponding to roughly one perfect SSR per 2.04 kb. The most abundant motifs were di-nucleotide (16.82%) repeats. Among all repeat motifs (TA/AT)n is the most abundant (8.38%), followed by (AG/CT)n (4.51%). Most of the BES-SSR are located in the non-coding region, but 1.3% of BES-SSRs were found to be associated with transposable element (TE). A total of 400 novel SSR primer pairs were synthesized and their transferability and polymorphism tested on a set of 16 Citrus and Citrus relative's species. Among these 333 (83.25%) were successfully amplified and 260 (65.00%) showed cross-species transferability with Poncirus trifoliata and Fortunella sp. These cross-species transferable markers could be useful for cultivar identification, for genomic study of Citrus, Poncirus and Fortunella sp. Utility of the developed SSR marker was demonstrated by identifying a set of 118 markers each for construction of linkage map of Citrus reticulata and Poncirus trifoliata. Genetic diversity and phylogenetic relationship among 40 Citrus and its related species were conducted with the aid of 25 randomly selected SSR primer pairs and results revealed that citrus genomic SSRs are superior to genic SSR for genetic diversity and germplasm characterization of Citrus spp.
Collapse
|
13
|
Vera Ruiz EM, Soriano JM, Romero C, Zhebentyayeva T, Terol J, Zuriaga E, Llácer G, Abbott AG, Badenes ML. Narrowing down the apricot Plum pox virus resistance locus and comparative analysis with the peach genome syntenic region. MOLECULAR PLANT PATHOLOGY 2011; 12:535-47. [PMID: 21722293 PMCID: PMC6640391 DOI: 10.1111/j.1364-3703.2010.00691.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Sharka disease, caused by the Plum pox virus (PPV), is one of the main limiting factors for stone fruit crops worldwide. Only a few resistance sources have been found in apricot (Prunus armeniaca L.), and most studies have located a major PPV resistance locus (PPVres) on linkage group 1 (LG1). However, the mapping accuracy was not sufficiently reliable and PPVres was predicted within a low confidence interval. In this study, we have constructed two high-density simple sequence repeat (SSR) improved maps with 0.70 and 0.68 markers/cm, corresponding to LG1 of 'Lito' and 'Goldrich' PPV-resistant cultivars, respectively. Using these maps, and excluding genotype-phenotype incongruent individuals, a new binary trait locus (BTL) analysis for PPV resistance was performed, narrowing down the PPVres support intervals to 7.3 and 5.9 cm in 'Lito' and 'Goldrich', respectively. Subsequently, 71 overlapping oligonucleotides (overgo) probes were hybridized against an apricot bacterial artificial chromosome (BAC) library, identifying 870 single BACs from which 340 were anchored onto a map region of approximately 30-40 cm encompassing PPVres. Partial BAC contigs assigned to the two allelic haplotypes (resistant/susceptible) of the PPVres locus were built by high-information content fingerprinting (HICF). In addition, a total of 300 BAC-derived sequences were obtained, and 257 showed significant homology with the peach genome scaffold_1 corresponding to LG1. According to the peach syntenic genome sequence, PPVres was predicted within a region of 2.16 Mb in which a few candidate resistance genes were identified.
Collapse
Affiliation(s)
- Elsa María Vera Ruiz
- Instituto Valenciano de Investigaciones Agrarias (IVIA), Apartado Oficial, Moncada, Valencia, Spain
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Construction of papaya male and female BAC libraries and application in physical mapping of the sex chromosomes. J Biomed Biotechnol 2011; 2011:929472. [PMID: 21765640 PMCID: PMC3134383 DOI: 10.1155/2011/929472] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2010] [Accepted: 05/09/2011] [Indexed: 01/06/2023] Open
Abstract
Papaya is a major fruit crop in the tropics and has recently evolved sex chromosomes. Towards sequencing the papaya sex chromosomes, two bacterial artificial chromosome (BAC) libraries were constructed from papaya male and female genomic DNA. The female BAC library was constructed using restriction enzyme BstY I and consists of 36,864 clones with an average insert size of 104 kb, providing 10.3x genome equivalents. The male BAC library was constructed using restriction enzyme EcoR I and consists of 55,296 clones with an average insert size of 101 kb, providing 15.0x genome equivalents. The male BAC library was used in constructing the physical map of the male-specific region of the male Y chromosome (MSY) and in filling gaps and extending the physical map of the hermaphrodite-specific region of the Yh chromosome (HSY) and the X chromosome physical map. The female BAC library was used to extend the X physical map gap. The MSY, HSY, and X physical maps offer a unique opportunity to study chromosomal rearrangements, Y chromosome degeneration, and dosage compensation of the papaya nascent sex chromosomes.
Collapse
|
15
|
Genet C, Dehais P, Palti Y, Gao G, Gavory F, Wincker P, Quillet E, Boussaha M. Analysis of BAC-end sequences in rainbow trout: content characterization and assessment of synteny between trout and other fish genomes. BMC Genomics 2011; 12:314. [PMID: 21672188 PMCID: PMC3125269 DOI: 10.1186/1471-2164-12-314] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2010] [Accepted: 06/14/2011] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Rainbow trout (Oncorhynchus mykiss) are cultivated worldwide for aquaculture production and are widely used as a model species to gain knowledge of many aspects of fish biology. The common ancestor of the salmonids experienced a whole genome duplication event, making extant salmonids such as the rainbow trout an excellent model for studying the evolution of tetraploidization and re-diploidization in vertebrates. However, the lack of a reference genome sequence hampers research progress for both academic and applied purposes. In order to enrich the genomic tools already available in this species and provide further insight on the complexity of its genome, we sequenced a large number of rainbow trout BAC-end sequences (BES) and characterized their contents. RESULTS A total of 176,485 high quality BES, were generated, representing approximately 4% of the trout genome. BES analyses identified 6,848 simple sequence repeats (SSRs), of which 3,854 had high quality flanking sequences for PCR primers design. The first rainbow trout repeat elements database (INRA RT rep1.0) containing 735 putative repeat elements was developed, and identified almost 59.5% of the BES database in base-pairs as repetitive sequence. Approximately 55% of the BES reads (97,846) had more than 100 base pairs of contiguous non-repetitive sequences. The fractions of the 97,846 non-repetitive trout BES reads that had significant BLASTN hits against the zebrafish, medaka and stickleback genome databases were 15%, 16.2% and 17.9%, respectively, while the fractions of the non-repetitive BES reads that had significant BLASTX hits against the zebrafish, medaka, and stickleback protein databases were 10.7%, 9.5% and 9.5%, respectively. Comparative genomics using paired BAC-ends revealed several regions of conserved synteny across all the fish species analyzed in this study. CONCLUSIONS The characterization of BES provided insights on the rainbow trout genome. The discovery of specific repeat elements will facilitate analyses of sequence content (e.g. for SNPs discovery and for transcriptome characterization) and future genome sequence assemblies. The numerous microsatellites will facilitate integration of the linkage and physical maps and serve as valuable resource for fine mapping QTL and positional cloning of genes affecting aquaculture production traits. Furthermore, comparative genomics through BES can be used for identifying positional candidate genes from QTL mapping studies, aid in future assembly of a reference genome sequence and elucidating sequence content and complexity in the rainbow trout genome.
Collapse
Affiliation(s)
- Carine Genet
- INRA, UMR 1313 GABI, Génétique Animale et Biologie Intégrative, 78350 Jouy-en-Josas, France.
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Ragupathy R, Rathinavelu R, Cloutier S. Physical mapping and BAC-end sequence analysis provide initial insights into the flax (Linum usitatissimum L.) genome. BMC Genomics 2011; 12:217. [PMID: 21554714 PMCID: PMC3113786 DOI: 10.1186/1471-2164-12-217] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2011] [Accepted: 05/09/2011] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Flax (Linum usitatissimum L.) is an important source of oil rich in omega-3 fatty acids, which have proven health benefits and utility as an industrial raw material. Flax seeds also contain lignans which are associated with reducing the risk of certain types of cancer. Its bast fibres have broad industrial applications. However, genomic tools needed for molecular breeding were non existent. Hence a project, Total Utilization Flax GENomics (TUFGEN) was initiated. We report here the first genome-wide physical map of flax and the generation and analysis of BAC-end sequences (BES) from 43,776 clones, providing initial insights into the genome. RESULTS The physical map consists of 416 contigs spanning ~368 Mb, assembled from 32,025 fingerprints, representing roughly 54.5% to 99.4% of the estimated haploid genome (370-675 Mb). The N50 size of the contigs was estimated to be ~1,494 kb. The longest contig was ~5,562 kb comprising 437 clones. There were 96 contigs containing more than 100 clones. Approximately 54.6 Mb representing 8-14.8% of the genome was obtained from 80,337 BES. Annotation revealed that a large part of the genome consists of ribosomal DNA (~13.8%), followed by known transposable elements at 6.1%. Furthermore, ~7.4% of sequence was identified to harbour novel repeat elements. Homology searches against flax-ESTs and NCBI-ESTs suggested that ~5.6% of the transcriptome is unique to flax. A total of 4064 putative genomic SSRs were identified and are being developed as novel markers for their use in molecular breeding. CONCLUSION The first genome-wide physical map of flax constructed with BAC clones provides a framework for accessing target loci with economic importance for marker development and positional cloning. Analysis of the BES has provided insights into the uniqueness of the flax genome. Compared to other plant genomes, the proportion of rDNA was found to be very high whereas the proportion of known transposable elements was low. The SSRs identified from BES will be valuable in saturating existing linkage maps and for anchoring physical and genetic maps. The physical map and paired-end reads from BAC clones will also serve as scaffolds to build and validate the whole genome shotgun assembly.
Collapse
Affiliation(s)
- Raja Ragupathy
- Cereal Research Centre, Agriculture and Agri-Food Canada, 195 Dafoe Rd, Winnipeg, MB, R3T 2M9, Canada
| | - Rajkumar Rathinavelu
- Genomics & Bioinformatics Division, ITC Research & Development Centre, Bangalore, India
| | - Sylvie Cloutier
- Cereal Research Centre, Agriculture and Agri-Food Canada, 195 Dafoe Rd, Winnipeg, MB, R3T 2M9, Canada
- Department of Plant Science, University of Manitoba, 66 Dafoe Rd, Winnipeg, MB, R3T 2N2, Canada
| |
Collapse
|
17
|
Bohra A, Dubey A, Saxena RK, Penmetsa RV, Poornima KN, Kumar N, Farmer AD, Srivani G, Upadhyaya HD, Gothalwal R, Ramesh S, Singh D, Saxena K, Kishor PBK, Singh NK, Town CD, May GD, Cook DR, Varshney RK. Analysis of BAC-end sequences (BESs) and development of BES-SSR markers for genetic mapping and hybrid purity assessment in pigeonpea (Cajanus spp.). BMC PLANT BIOLOGY 2011; 11:56. [PMID: 21447154 PMCID: PMC3079640 DOI: 10.1186/1471-2229-11-56] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2010] [Accepted: 03/29/2011] [Indexed: 05/20/2023]
Abstract
BACKGROUND Pigeonpea [Cajanus cajan (L.) Millsp.] is an important legume crop of rainfed agriculture. Despite of concerted research efforts directed to pigeonpea improvement, stagnated productivity of pigeonpea during last several decades may be accounted to prevalence of various biotic and abiotic constraints and the situation is exacerbated by availability of inadequate genomic resources to undertake any molecular breeding programme for accelerated crop improvement. With the objective of enhancing genomic resources for pigeonpea, this study reports for the first time, large scale development of SSR markers from BAC-end sequences and their subsequent use for genetic mapping and hybridity testing in pigeonpea. RESULTS A set of 88,860 BAC (bacterial artificial chromosome)-end sequences (BESs) were generated after constructing two BAC libraries by using HindIII (34,560 clones) and BamHI (34,560 clones) restriction enzymes. Clustering based on sequence identity of BESs yielded a set of >52K non-redundant sequences, comprising 35 Mbp or >4% of the pigeonpea genome. These sequences were analyzed to develop annotation lists and subdivide the BESs into genome fractions (e.g., genes, retroelements, transpons and non-annotated sequences). Parallel analysis of BESs for microsatellites or simple sequence repeats (SSRs) identified 18,149 SSRs, from which a set of 6,212 SSRs were selected for further analysis. A total of 3,072 novel SSR primer pairs were synthesized and tested for length polymorphism on a set of 22 parental genotypes of 13 mapping populations segregating for traits of interest. In total, we identified 842 polymorphic SSR markers that will have utility in pigeonpea improvement. Based on these markers, the first SSR-based genetic map comprising of 239 loci was developed for this previously uncharacterized genome. Utility of developed SSR markers was also demonstrated by identifying a set of 42 markers each for two hybrids (ICPH 2671 and ICPH 2438) for genetic purity assessment in commercial hybrid breeding programme. CONCLUSION In summary, while BAC libraries and BESs should be useful for genomics studies, BES-SSR markers, and the genetic map should be very useful for linking the genetic map with a future physical map as well as for molecular breeding in pigeonpea.
Collapse
Affiliation(s)
- Abhishek Bohra
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Genetics, Osmania University, Hyderabad 500007, India
| | - Anuja Dubey
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Biotechnology and Bioinformatics Centre, Barkatullah University, Bhopal 462026, India
| | - Rachit K Saxena
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Genetics, Osmania University, Hyderabad 500007, India
| | - R Varma Penmetsa
- Department of Plant Pathology, University of California, Davis, CA 95616, USA
| | - KN Poornima
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Biotechnology, University of Agricultural Sciences (UAS), Bangalore 560065, India
| | - Naresh Kumar
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Plant Breeding and Genetics, CCS Haryana Agricultural University (CCSHAU), Hisar 125004, India
| | - Andrew D Farmer
- National Center for Genome Resources (NCGR), Santa Fe, N M 87505, USA
| | - Gudipati Srivani
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
| | - Hari D Upadhyaya
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
| | - Ragini Gothalwal
- Department of Biotechnology and Bioinformatics Centre, Barkatullah University, Bhopal 462026, India
| | - S Ramesh
- Department of Biotechnology, University of Agricultural Sciences (UAS), Bangalore 560065, India
| | - Dhiraj Singh
- Department of Plant Breeding and Genetics, CCS Haryana Agricultural University (CCSHAU), Hisar 125004, India
| | - Kulbhushan Saxena
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
| | - PB Kavi Kishor
- Department of Genetics, Osmania University, Hyderabad 500007, India
| | - Nagendra K Singh
- National Research Center on Plant Biotechnology (NRCPB), New Delhi 110012, India
| | | | - Gregory D May
- National Center for Genome Resources (NCGR), Santa Fe, N M 87505, USA
| | - Douglas R Cook
- Department of Plant Pathology, University of California, Davis, CA 95616, USA
| | - Rajeev K Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Generation Challenge Programme (GCP), c/o CIMMYT, 06600 Mexico DF, Mexico
| |
Collapse
|
18
|
Hsu CC, Chung YL, Chen TC, Lee YL, Kuo YT, Tsai WC, Hsiao YY, Chen YW, Wu WL, Chen HH. An overview of the Phalaenopsis orchid genome through BAC end sequence analysis. BMC PLANT BIOLOGY 2011; 11:3. [PMID: 21208460 PMCID: PMC3027094 DOI: 10.1186/1471-2229-11-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2010] [Accepted: 01/06/2011] [Indexed: 05/03/2023]
Abstract
BACKGROUND Phalaenopsis orchids are popular floral crops, and development of new cultivars is economically important to floricultural industries worldwide. Analysis of orchid genes could facilitate orchid improvement. Bacterial artificial chromosome (BAC) end sequences (BESs) can provide the first glimpses into the sequence composition of a novel genome and can yield molecular markers for use in genetic mapping and breeding. RESULTS We used two BAC libraries (constructed using the BamHI and HindIII restriction enzymes) of Phalaenopsis equestris to generate pair-end sequences from 2,920 BAC clones (71.4% and 28.6% from the BamHI and HindIII libraries, respectively), at a success rate of 95.7%. A total of 5,535 BESs were generated, representing 4.5 Mb, or about 0.3% of the Phalaenopsis genome. The trimmed sequences ranged from 123 to 1,397 base pairs (bp) in size, with an average edited read length of 821 bp. When these BESs were subjected to sequence homology searches, it was found that 641 (11.6%) were predicted to represent protein-encoding regions, whereas 1,272 (23.0%) contained repetitive DNA. Most of the repetitive DNA sequences were gypsy- and copia-like retrotransposons (41.9% and 12.8%, respectively), whereas only 10.8% were DNA transposons. Further, 950 potential simple sequence repeats (SSRs) were discovered. Dinucleotides were the most abundant repeat motifs; AT/TA dimer repeats were the most frequent SSRs, representing 253 (26.6%) of all identified SSRs. Microsynteny analysis revealed that more BESs mapped to the whole-genome sequences of poplar than to those of grape or Arabidopsis, and even fewer mapped to the rice genome. This work will facilitate analysis of the Phalaenopsis genome, and will help clarify similarities and differences in genome composition between orchids and other plant species. CONCLUSION Using BES analysis, we obtained an overview of the Phalaenopsis genome in terms of gene abundance, the presence of repetitive DNA and SSR markers, and the extent of microsynteny with other plant species. This work provides a basis for future physical mapping of the Phalaenopsis genome and advances our knowledge thereof.
Collapse
Affiliation(s)
- Chia-Chi Hsu
- Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan
| | - Yu-Lin Chung
- Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan
| | - Tien-Chih Chen
- Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan
| | - Yu-Ling Lee
- Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan
| | - Yi-Tzu Kuo
- Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan
| | - Wen-Chieh Tsai
- Institute of Tropical Plant Sciences, National Cheng Kung University, Tainan 701, Taiwan
- Orchid Research Center, National Cheng Kung University, Tainan 701, Taiwan
| | - Yu-Yun Hsiao
- Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan
| | - Yun-Wen Chen
- Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan
| | - Wen-Luan Wu
- Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan
- Institute of Tropical Plant Sciences, National Cheng Kung University, Tainan 701, Taiwan
- Orchid Research Center, National Cheng Kung University, Tainan 701, Taiwan
| | - Hong-Hwa Chen
- Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan
- Institute of Tropical Plant Sciences, National Cheng Kung University, Tainan 701, Taiwan
- Orchid Research Center, National Cheng Kung University, Tainan 701, Taiwan
| |
Collapse
|
19
|
González VM, Rodríguez-Moreno L, Centeno E, Benjak A, Garcia-Mas J, Puigdomènech P, Aranda MA. Genome-wide BAC-end sequencing of Cucumis melo using two BAC libraries. BMC Genomics 2010; 11:618. [PMID: 21054843 PMCID: PMC3091759 DOI: 10.1186/1471-2164-11-618] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2010] [Accepted: 11/05/2010] [Indexed: 11/10/2022] Open
Abstract
Background Although melon (Cucumis melo L.) is an economically important fruit crop, no genome-wide sequence information is openly available at the current time. We therefore sequenced BAC-ends representing a total of 33,024 clones, half of them from a previously described melon BAC library generated with restriction endonucleases and the remainder from a new random-shear BAC library. Results We generated a total of 47,140 high-quality BAC-end sequences (BES), 91.7% of which were paired-BES. Both libraries were assembled independently and then cross-assembled to obtain a final set of 33,372 non-redundant, high-quality sequences. These were grouped into 6,411 contigs (4.5 Mb) and 26,961 non-assembled BES (14.4 Mb), representing ~4.2% of the melon genome. The sequences were used to screen genomic databases, identifying 7,198 simple sequence repeats (corresponding to one microsatellite every 2.6 kb) and 2,484 additional repeats of which 95.9% represented transposable elements. The sequences were also used to screen expressed sequence tag (EST) databases, revealing 11,372 BES that were homologous to ESTs. This suggests that ~30% of the melon genome consists of coding DNA. We observed regions of microsynteny between melon paired-BES and six other dicotyledonous plant genomes. Conclusion The analysis of nearly 50,000 BES from two complementary genomic libraries covered ~4.2% of the melon genome, providing insight into properties such as microsatellite and transposable element distribution, and the percentage of coding DNA. The observed synteny between melon paired-BES and six other plant genomes showed that useful comparative genomic data can be derived through large scale BAC-end sequencing by anchoring a small proportion of the melon genome to other sequenced genomes.
Collapse
Affiliation(s)
- Víctor M González
- Molecular Genetics Department, Center for Research in Agricultural Genomics CRAG (CSIC-IRTA-UAB), Jordi Girona, 18-26, 08034 Barcelona, Spain
| | | | | | | | | | | | | |
Collapse
|
20
|
Davis TM, Shields ME, Zhang Q, Tombolato-Terzić D, Bennetzen JL, Pontaroli AC, Wang H, Yao Q, SanMiguel P, Folta KM. An examination of targeted gene neighborhoods in strawberry. BMC PLANT BIOLOGY 2010; 10:81. [PMID: 20441596 PMCID: PMC2890015 DOI: 10.1186/1471-2229-10-81] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2009] [Accepted: 05/04/2010] [Indexed: 05/19/2023]
Abstract
BACKGROUND Strawberry (Fragaria spp.) is the familiar name of a group of economically important crop plants and wild relatives that also represent an emerging system for the study of gene and genome evolution. Its small stature, rapid seed-to-seed cycle, transformability and miniscule basic genome make strawberry an attractive system to study processes related to plant physiology, development and crop production; yet it lacks substantial genomics-level resources. This report addresses this deficiency by characterizing 0.71 Mbp of gene space from a diploid species (F. vesca). The twenty large genomic tracks (30-52 kb) captured as fosmid inserts comprise gene regions with roles in flowering, disease resistance, and metabolism. RESULTS A detailed description of the studied regions reveals 131 Blastx-supported gene sites and eight additional EST-supported gene sites. Only 15 genes have complete EST coverage, enabling gene modelling, while 76 lack EST support. Instances of microcolinearity with Arabidopsis thaliana were identified in twelve inserts. A relatively high portion (25%) of targeted genes were found in unanticipated tandem duplications. The effectiveness of six FGENESH training models was assessed via comparisons among ab initio predictions and homology-based gene and start/stop codon identifications. Fourteen transposable-element-related sequences and 158 simple sequence repeat loci were delineated. CONCLUSIONS This report details the structure and content of targeted regions of the strawberry genome. The data indicate that the strawberry genome is gene-dense, with an average of one protein-encoding gene or pseudogene per 5.9 kb. Current overall EST coverage is sparse. The unexpected gene duplications and their differential patterns of EST support suggest possible subfunctionalization or pseudogenization of these sequences. This report provides a high-resolution depiction of targeted gene neighborhoods that will aid whole-genome sequence assembly, provide valuable tools for plant breeders and advance the understanding of strawberry genome evolution.
Collapse
Affiliation(s)
- Thomas M Davis
- Department of Biological Sciences, University of New Hampshire, Durham, NH 03824 USA
| | - Melanie E Shields
- Department of Molecular, Cellular & Biomedical Sciences, University of New Hampshire, Durham, NH 03824 USA
| | - Qian Zhang
- Department of Biological Sciences, University of New Hampshire, Durham, NH 03824 USA
| | - Denise Tombolato-Terzić
- Horticultural Sciences Department and Plant Molecular and Cellular Biology Program, PO Box 110690, 1301 Fifield Hall, Gainesville, FL 32611 USA
| | | | - Ana C Pontaroli
- Department of Genetics, University of Georgia, Athens, GA 30602 USA
- Estación Experimental Agropecuaria Balcarce, Instituto Nacional de Tecnología Agropecuaria (INTA) - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET); CC 276 (7620) Balcarce, Argentina
| | - Hao Wang
- Department of Genetics, University of Georgia, Athens, GA 30602 USA
| | - Qin Yao
- Department of Genetics, University of Georgia, Athens, GA 30602 USA
| | - Phillip SanMiguel
- Department of Horticulture and Landscape Architecture, Purdue Univ., West Lafayette, IN 47907 USA
| | - Kevin M Folta
- Horticultural Sciences Department and Plant Molecular and Cellular Biology Program, PO Box 110690, 1301 Fifield Hall, Gainesville, FL 32611 USA
| |
Collapse
|
21
|
SplinkBES: a splinkerette-based method for generating long end sequences from large insert DNA libraries. Biotechniques 2009; 47:681-2, 684, 686, passim. [PMID: 19737131 DOI: 10.2144/000113122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We report on the development of a novel splinkerette-based method for generating long end sequences from large insert library clones, using a carrot (Daucus carota L.) BAC library as a model. The procedure involves digestion of the BAC DNA with a 6-bp restriction enzyme, followed by ligation of splinkerette adaptors that serve as primer-annealing sites for PCR amplification of the BAC ends. The resulting amplicons are sequenced from both directions, and when overlapping, the pairs of sequences are assembled, originating two types of BAC end sequences (BESs): ungapped and gapped. The average sequence length for ungapped and gapped BESs was 698 and 1055 nucleotides, respectively, with an overall average length of 838 nucleotides. This is considerably higher than the average length typically obtained by direct end sequencing. Through the analysis of actual and in silico-generated BES of different lengths from carrot and five model organisms, we demonstrated that longer BESs are more informative, since they had more matches to the GenBank database and contained more simple sequence repeats (SSRs). A pilot high-throughput procedure is proposed for splinkerette-based end sequencing (SplinkBES). This method may contribute to generating more robust BES analysis and provide a richer source of BES-derived markers for genomics, mapping, and breeding.
Collapse
|
22
|
Gu YQ, Ma Y, Huo N, Vogel JP, You FM, Lazo GR, Nelson WM, Soderlund C, Dvorak J, Anderson OD, Luo MC. A BAC-based physical map of Brachypodium distachyon and its comparative analysis with rice and wheat. BMC Genomics 2009; 10:496. [PMID: 19860896 PMCID: PMC2774330 DOI: 10.1186/1471-2164-10-496] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2009] [Accepted: 10/27/2009] [Indexed: 11/13/2022] Open
Abstract
Background Brachypodium distachyon (Brachypodium) has been recognized as a new model species for comparative and functional genomics of cereal and bioenergy crops because it possesses many biological attributes desirable in a model, such as a small genome size, short stature, self-pollinating habit, and short generation cycle. To maximize the utility of Brachypodium as a model for basic and applied research it is necessary to develop genomic resources for it. A BAC-based physical map is one of them. A physical map will facilitate analysis of genome structure, comparative genomics, and assembly of the entire genome sequence. Results A total of 67,151 Brachypodium BAC clones were fingerprinted with the SNaPshot HICF fingerprinting method and a genome-wide physical map of the Brachypodium genome was constructed. The map consisted of 671 contigs and 2,161 clones remained as singletons. The contigs and singletons spanned 414 Mb. A total of 13,970 gene-related sequences were detected in the BAC end sequences (BES). These gene tags aligned 345 contigs with 336 Mb of rice genome sequence, showing that Brachypodium and rice genomes are generally highly colinear. Divergent regions were mainly in the rice centromeric regions. A dot-plot of Brachypodium contigs against the rice genome sequences revealed remnants of the whole-genome duplication caused by paleotetraploidy, which were previously found in rice and sorghum. Brachypodium contigs were anchored to the wheat deletion bin maps with the BES gene-tags, opening the door to Brachypodium-Triticeae comparative genomics. Conclusion The construction of the Brachypodium physical map, and its comparison with the rice genome sequence demonstrated the utility of the SNaPshot-HICF method in the construction of BAC-based physical maps. The map represents an important genomic resource for the completion of Brachypodium genome sequence and grass comparative genomics. A draft of the physical map and its comparisons with rice and wheat are available at .
Collapse
Affiliation(s)
- Yong Q Gu
- 1Genomics and Gene Discovery Research Unit, USDA-ARS, Western Regional Research Center, 800 Buchanan Street, Albany, CA 94710,USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Yu Q, Tong E, Skelton RL, Bowers JE, Jones MR, Murray JE, Hou S, Guan P, Acob RA, Luo MC, Moore PH, Alam M, Paterson AH, Ming R. A physical map of the papaya genome with integrated genetic map and genome sequence. BMC Genomics 2009; 10:371. [PMID: 19664231 PMCID: PMC3224731 DOI: 10.1186/1471-2164-10-371] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2009] [Accepted: 08/07/2009] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Papaya is a major fruit crop in tropical and subtropical regions worldwide and has primitive sex chromosomes controlling sex determination in this trioecious species. The papaya genome was recently sequenced because of its agricultural importance, unique biological features, and successful application of transgenic papaya for resistance to papaya ringspot virus. As a part of the genome sequencing project, we constructed a BAC-based physical map using a high information-content fingerprinting approach to assist whole genome shotgun sequence assembly. RESULTS The physical map consists of 963 contigs, representing 9.4x genome equivalents, and was integrated with the genetic map and genome sequence using BAC end sequences and a sequence-tagged high-density genetic map. The estimated genome coverage of the physical map is about 95.8%, while 72.4% of the genome was aligned to the genetic map. A total of 1,181 high quality overgo (overlapping oligonucleotide) probes representing conserved sequences in Arabidopsis and genetically mapped loci in Brassica were anchored on the physical map, which provides a foundation for comparative genomics in the Brassicales. The integrated genetic and physical map aligned with the genome sequence revealed recombination hotspots as well as regions suppressed for recombination across the genome, particularly on the recently evolved sex chromosomes. Suppression of recombination spread to the adjacent region of the male specific region of the Y chromosome (MSY), and recombination rates were recovered gradually and then exceeded the genome average. Recombination hotspots were observed at about 10 Mb away on both sides of the MSY, showing 7-fold increase compared with the genome wide average, demonstrating the dynamics of recombination of the sex chromosomes. CONCLUSION A BAC-based physical map of papaya was constructed and integrated with the genetic map and genome sequence. The integrated map facilitated the draft genome assembly, and is a valuable resource for comparative genomics and map-based cloning of agronomically and economically important genes and for sex chromosome research.
Collapse
Affiliation(s)
- Qingyi Yu
- Cellular and Molecular Biology Research Unit, Hawaii Agriculture Research Center, Aiea, HI 96701, USA
| | - Eric Tong
- Cellular and Molecular Biology Research Unit, Hawaii Agriculture Research Center, Aiea, HI 96701, USA
| | - Rachel L Skelton
- Cellular and Molecular Biology Research Unit, Hawaii Agriculture Research Center, Aiea, HI 96701, USA
| | - John E Bowers
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
| | - Meghan R Jones
- Cellular and Molecular Biology Research Unit, Hawaii Agriculture Research Center, Aiea, HI 96701, USA
| | - Jan E Murray
- Cellular and Molecular Biology Research Unit, Hawaii Agriculture Research Center, Aiea, HI 96701, USA
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Shaobin Hou
- Center for Advanced Studies in Genomics, Proteomics and Bioinformatics, University of Hawaii, Honolulu, HI 96822, USA
| | - Peizhu Guan
- Department of Molecular Bioscience and Bioengineering, University of Hawaii, Honolulu, HI 96822, USA
| | - Ricelle A Acob
- Department of Molecular Bioscience and Bioengineering, University of Hawaii, Honolulu, HI 96822, USA
| | - Ming-Cheng Luo
- Department of Plant Sciences, University of California, Davis, CA 95616, USA
| | - Paul H Moore
- USDA-ARS, Pacific Basin Agricultural Research Center, Hilo, HI 96720, USA
| | - Maqsudul Alam
- Center for Advanced Studies in Genomics, Proteomics and Bioinformatics, University of Hawaii, Honolulu, HI 96822, USA
| | - Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
| | - Ray Ming
- Cellular and Molecular Biology Research Unit, Hawaii Agriculture Research Center, Aiea, HI 96701, USA
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
24
|
Chapus C, Edwards SV. Genome evolution in Reptilia: in silico chicken mapping of 12,000 BAC-end sequences from two reptiles and a basal bird. BMC Genomics 2009; 10 Suppl 2:S8. [PMID: 19607659 PMCID: PMC2966332 DOI: 10.1186/1471-2164-10-s2-s8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND With the publication of the draft chicken genome and the recent production of several BAC clone libraries from non-avian reptiles and birds, it is now possible to undertake more detailed comparative genomic studies in Reptilia. Of interest in particular are the genomic events that transformed the large, repeat-rich genomes of mammals and non-avian reptiles into the minimalist chicken genome. We have used paired BAC end sequences (BESs) from the American alligator (Alligator mississippiensis), painted turtle (Chrysemys picta) and emu (Dromaius novaehollandiae) to investigate patterns of sequence divergence, gene and retroelement content, and microsynteny between these species and chicken. RESULTS From a total of 11,967 curated BESs, we successfully mapped 725, 773 and 2597 sequences in alligator, turtle, and emu, respectively, to sites in the draft chicken genome using a stringent BLAST protocol. Most commonly, sequences mapped to a single site in the chicken genome. Of 1675, 1828 and 2936 paired BESs obtained for alligator, turtle, and emu, respectively, a total of 34 (alligator, 2%), 24 (turtle, 1.3%) and 479 (emu, 16.3%) pairs were found to map with high confidence and in the correct orientation and with BAC-sized intermarker distances to single chicken chromosomes, including 25 such paired hits in emu mapping to the chicken Z chromosome. By determining the insert sizes of a subset of BAC clones from these three species, we also found a significant correlation between the intermarker distance in alligator and turtle and in chicken, with slopes as expected on the basis of the ratio of the genome sizes. CONCLUSION Our results suggest that a large number of small-scale chromosomal rearrangements and deletions in the lineage leading to chicken have drastically reduced the number of detected syntenies observed between the chicken and alligator, turtle, and emu genomes and imply that small deletions occurring widely throughout the genomes of reptilian and avian ancestors led to the ~50% reduction in genome size observed in birds compared to reptiles. We have also mapped and identified likely gene regions in hundreds of new BAC clones from these species.
Collapse
Affiliation(s)
- Charles Chapus
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| | - Scott V Edwards
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| |
Collapse
|
25
|
Jung S, Jiwan D, Cho I, Lee T, Abbott A, Sosinski B, Main D. Synteny of Prunus and other model plant species. BMC Genomics 2009; 10:76. [PMID: 19208249 PMCID: PMC2647949 DOI: 10.1186/1471-2164-10-76] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2008] [Accepted: 02/10/2009] [Indexed: 01/09/2023] Open
Abstract
Background Fragmentary conservation of synteny has been reported between map-anchored Prunus sequences and Arabidopsis. With the availability of genome sequence for fellow rosid I members Populus and Medicago, we analyzed the synteny between Prunus and the three model genomes. Eight Prunus BAC sequences and map-anchored Prunus sequences were used in the comparison. Results We found a well conserved synteny across the Prunus species – peach, plum, and apricot – and Populus using a set of homologous Prunus BACs. Conversely, we could not detect any synteny with Arabidopsis in this region. Other peach BACs also showed extensive synteny with Populus. The syntenic regions detected were up to 477 kb in Populus. Two syntenic regions between Arabidopsis and these BACs were much shorter, around 10 kb. We also found syntenic regions that are conserved between the Prunus BACs and Medicago. The array of synteny corresponded with the proposed whole genome duplication events in Populus and Medicago. Using map-anchored Prunus sequences, we detected many syntenic blocks with several gene pairs between Prunus and Populus or Arabidopsis. We observed a more complex network of synteny between Prunus-Arabidopsis, indicative of multiple genome duplication and subsequence gene loss in Arabidopsis. Conclusion Our result shows the striking microsynteny between the Prunus BACs and the genome of Populus and Medicago. In macrosynteny analysis, more distinct Prunus regions were syntenic to Populus than to Arabidopsis.
Collapse
Affiliation(s)
- Sook Jung
- Department of Horticulture and Landscape Architecture, Washington State University, Pullman, WA 99164, USA.
| | | | | | | | | | | | | |
Collapse
|
26
|
Cavagnaro PF, Chung SM, Szklarczyk M, Grzebelus D, Senalik D, Atkins AE, Simon PW. Characterization of a deep-coverage carrot (Daucus carota L.) BAC library and initial analysis of BAC-end sequences. Mol Genet Genomics 2008; 281:273-88. [PMID: 19104839 DOI: 10.1007/s00438-008-0411-9] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2008] [Accepted: 11/29/2008] [Indexed: 11/29/2022]
Abstract
Carrot is the most economically important member of the Apiaceae family and a major source of provitamin A carotenoids in the human diet. However, carrot molecular resources are relatively underdeveloped, hampering a number of genetic studies. Here, we report on the synthesis and characterization of a bacterial artificial chromosome (BAC) library of carrot. The library is 17.3-fold redundant and consists of 92,160 clones with an average insert size of 121 kb. To provide an overview of the composition and organization of the carrot nuclear genome we generated and analyzed 2,696 BAC-end sequences (BES) from nearly 2,000 BACs, totaling 1.74 Mb of BES. This analysis revealed that 14% of the BES consists of known repetitive elements, with transposable elements representing more than 80% of this fraction. Eleven novel carrot repetitive elements were identified, covering 8.5% of the BES. Analysis of microsatellites showed a comparably low frequency for these elements in the carrot BES. Comparisons of the translated BES with protein databases indicated that approximately 10% of the carrot genome represents coding sequences. Moreover, among eight dicot species used for comparison purposes, carrot BES had highest homology to protein-coding sequences from tomato. This deep-coverage library will aid carrot breeding and genetics.
Collapse
Affiliation(s)
- Pablo F Cavagnaro
- Department of Horticulture, University of Wisconsin, 1575 Linden Drive, Madison, WI 53706, USA
| | | | | | | | | | | | | |
Collapse
|
27
|
BAC end sequences corresponding to the B4 resistance gene cluster in common bean: a resource for markers and synteny analyses. Mol Genet Genomics 2008; 280:521-33. [PMID: 18813956 DOI: 10.1007/s00438-008-0384-8] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2008] [Accepted: 09/06/2008] [Indexed: 10/21/2022]
Abstract
In common bean, a complex disease resistance (R) gene cluster, harboring many specific R genes against various pathogens, is located at the end of the linkage group B4. A BAC library of the Meso-american bean genotype BAT93 was screened with PRLJ1, a probe previously shown to be specific to the B4 R gene cluster, leading to the identification of 73 positive BAC clones. BAC-end sequencing (BES) of the 73 positive BACs generated 75 kb of sequence. These BACs were organized into 6 contigs, all mapped at the B4 R gene cluster. To evaluate the potential of BES for marker development, BES-derived specific primers were used to check for linkage with two allelic anthracnose R specificities Co-3 and Co-3 ( 2 ), through the analysis of pairs of Near Isogenic Lines (NILs). Out of 32 primer pairs tested, two revealed polymorphisms between the NILs, confirming the suspected location of Co-3 and Co-3 ( 2 ) at the B4 cluster. In order to identify the orthologous region of the B4 R gene cluster in the two model legume genomes, bean BESs were used as queries in TBLASTX searches of Medicago truncatula and Lotus japonicus BAC clones. Putative orthologous regions were identified on chromosome Mt6 and Lj2, in agreement with the colinearity observed between Mt and Lj for these regions.
Collapse
|
28
|
Terol J, Naranjo MA, Ollitrault P, Talon M. Development of genomic resources for Citrus clementina: characterization of three deep-coverage BAC libraries and analysis of 46,000 BAC end sequences. BMC Genomics 2008; 9:423. [PMID: 18801166 PMCID: PMC2561056 DOI: 10.1186/1471-2164-9-423] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2008] [Accepted: 09/18/2008] [Indexed: 11/24/2022] Open
Abstract
Background Citrus species constitute one of the major tree fruit crops of the subtropical regions with great economic importance. However, their peculiar reproductive characteristics, low genetic diversity and the long-term nature of tree breeding mostly impair citrus variety improvement. In woody plants, genomic science holds promise of improvements and in the Citrus genera the development of genomic tools may be crucial for further crop improvements. In this work we report the characterization of three BAC libraries from Clementine (Citrus clementina), one of the most relevant citrus fresh fruit market cultivars, and the analyses of 46.000 BAC end sequences. Clementine is a diploid plant with an estimated haploid genome size of 367 Mb and 2n = 18 chromosomes, which makes feasible the use of genomics tools to boost genetic improvement. Results Three genomic BAC libraries of Citrus clementina were constructed through EcoRI, MboI and HindIII digestions and 56,000 clones, representing an estimated genomic coverage of 19.5 haploid genome-equivalents, were picked. BAC end sequencing (BES) of 28,000 clones produced 28.1 Mb of genomic sequence that allowed the identification of the repetitive fraction (12.5% of the genome) and estimation of gene content (31,000 genes) of this species. BES analyses identified 3,800 SSRs and 6,617 putative SNPs. Comparative genomic studies showed that citrus gene homology and microsyntheny with Populus trichocarpa was rather higher than with Arabidopsis thaliana, a species phylogenetically closer to citrus. Conclusion In this work, we report the characterization of three BAC libraries from C. clementina, and a new set of genomic resources that may be useful for isolation of genes underlying economically important traits, physical mapping and eventually crop improvement in Citrus species. In addition, BAC end sequencing has provided a first insight on the basic structure and organization of the citrus genome and has yielded valuable molecular markers for genetic mapping and cloning of genes of agricultural interest. Paired end sequences also may be very helpful for whole-genome sequencing programs.
Collapse
Affiliation(s)
- Javier Terol
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias, Carretera Moncada, Náquera, Km. 4,5 Moncada, Valencia, E46113, Spain.
| | | | | | | |
Collapse
|
29
|
Ríos G, Naranjo MA, Iglesias DJ, Ruiz-Rivero O, Geraud M, Usach A, Talón M. Characterization of hemizygous deletions in citrus using array-comparative genomic hybridization and microsynteny comparisons with the poplar genome. BMC Genomics 2008; 9:381. [PMID: 18691431 PMCID: PMC2533677 DOI: 10.1186/1471-2164-9-381] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2008] [Accepted: 08/09/2008] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Many fruit-tree species, including relevant Citrus spp varieties exhibit a reproductive biology that impairs breeding and strongly constrains genetic improvements. In citrus, juvenility increases the generation time while sexual sterility, inbreeding depression and self-incompatibility prevent the production of homozygous cultivars. Genomic technology may provide citrus researchers with a new set of tools to address these various restrictions. In this work, we report a valuable genomics-based protocol for the structural analysis of deletion mutations on an heterozygous background. RESULTS Two independent fast neutron mutants of self-incompatible clementine (Citrus clementina Hort. Ex Tan. cv. Clemenules) were the subject of the study. Both mutants, named 39B3 and 39E7, were expected to carry DNA deletions in hemizygous dosage. Array-based Comparative Genomic Hybridization (array-CGH) using a Citrus cDNA microarray allowed the identification of underrepresented genes in these two mutants. Subsequent comparison of citrus deleted genes with annotated plant genomes, especially poplar, made possible to predict the presence of a large deletion in 39B3 of about 700 kb and at least two deletions of approximately 100 and 500 kb in 39E7. The deletion in 39B3 was further characterized by PCR on available Citrus BACs, which helped us to build a partial physical map of the deletion. Among the deleted genes, ClpC-like gene coding for a putative subunit of a multifunctional chloroplastic protease involved in the regulation of chlorophyll b synthesis was directly related to the mutated phenotype since the mutant showed a reduced chlorophyll a/b ratio in green tissues. CONCLUSION In this work, we report the use of array-CGH for the successful identification of genes included in a hemizygous deletion induced by fast neutron irradiation on Citrus clementina. The study of gene content and order into the 39B3 deletion also led to the unexpected conclusion that microsynteny and local gene colinearity in this species were higher with Populus trichocarpa than with the phylogenetically closer Arabidopsis thaliana. This work corroborates the potential of Citrus genomic resources to assist mutagenesis-based approaches for functional genetics, structural studies and comparative genomics, and hence to facilitate citrus variety improvement.
Collapse
Affiliation(s)
- Gabino Ríos
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias, Carretera Moncada-Náquera km 4,5, 46113 Moncada (Valencia), Spain.
| | | | | | | | | | | | | |
Collapse
|
30
|
Chanderbali AS, Albert VA, Ashworth VETM, Clegg MT, Litz RE, Soltis DE, Soltis PS. Persea americana (avocado): bringing ancient flowers to fruit in the genomics era. Bioessays 2008; 30:386-96. [PMID: 18348249 DOI: 10.1002/bies.20721] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The avocado (Persea americana) is a major crop commodity worldwide. Moreover, avocado, a paleopolyploid, is an evolutionary "outpost" among flowering plants, representing a basal lineage (the magnoliid clade) near the origin of the flowering plants themselves. Following centuries of selective breeding, avocado germplasm has been characterized at the level of microsatellite and RFLP markers. Nonetheless, little is known beyond these general diversity estimates, and much work remains to be done to develop avocado as a major subtropical-zone crop. Among the goals of avocado improvement are to develop varieties with fruit that will "store" better on the tree, show uniform ripening and have better post-harvest storage. Avocado transcriptome sequencing, genome mapping and partial genomic sequencing will represent a major step toward the goal of sequencing the entire avocado genome, which is expected to aid in improving avocado varieties and production, as well as understanding the evolution of flowers from non-flowering seed plants (gymnosperms). Additionally, continued evolutionary and other comparative studies of flower and fruit development in different avocado strains can be accomplished at the gene expression level, including in comparison with avocado relatives, and these should provide important insights into the genetic regulation of fruit development in basal angiosperms.
Collapse
|
31
|
Datema E, Mueller LA, Buels R, Giovannoni JJ, Visser RGF, Stiekema WJ, van Ham RCHJ. Comparative BAC end sequence analysis of tomato and potato reveals overrepresentation of specific gene families in potato. BMC PLANT BIOLOGY 2008; 8:34. [PMID: 18405374 PMCID: PMC2324086 DOI: 10.1186/1471-2229-8-34] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2007] [Accepted: 04/11/2008] [Indexed: 05/18/2023]
Abstract
BACKGROUND Tomato (Solanum lycopersicon) and potato (S. tuberosum) are two economically important crop species, the genomes of which are currently being sequenced. This study presents a first genome-wide analysis of these two species, based on two large collections of BAC end sequences representing approximately 19% of the tomato genome and 10% of the potato genome. RESULTS The tomato genome has a higher repeat content than the potato genome, primarily due to a higher number of retrotransposon insertions in the tomato genome. On the other hand, simple sequence repeats are more abundant in potato than in tomato. The two genomes also differ in the frequency distribution of SSR motifs. Based on EST and protein alignments, potato appears to contain up to 6,400 more putative coding regions than tomato. Major gene families such as cytochrome P450 mono-oxygenases and serine-threonine protein kinases are significantly overrepresented in potato, compared to tomato. Moreover, the P450 superfamily appears to have expanded spectacularly in both species compared to Arabidopsis thaliana, suggesting an expanded network of secondary metabolic pathways in the Solanaceae. Both tomato and potato appear to have a low level of microsynteny with A. thaliana. A higher degree of synteny was observed with Populus trichocarpa, specifically in the region between 15.2 and 19.4 Mb on P. trichocarpa chromosome 10. CONCLUSION The findings in this paper present a first glimpse into the evolution of Solanaceous genomes, both within the family and relative to other plant species. When the complete genome sequences of these species become available, whole-genome comparisons and protein- or repeat-family specific studies may shed more light on the observations made here.
Collapse
Affiliation(s)
- Erwin Datema
- Applied Bioinformatics, Plant Research International, PO Box 16, 6700 AA, Wageningen, The Netherlands
- Laboratory of Bioinformatics, Wageningen University, Transitorium, Dreijenlaan 3, 6703 HA Wageningen, The Netherlands
| | - Lukas A Mueller
- Department of Plant Breeding and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Robert Buels
- Department of Plant Breeding and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - James J Giovannoni
- United States Department of Agriculture and Boyce Thompson Institute for Plant, Research, Cornell University, Ithaca, New York 14853, USA
| | - Richard GF Visser
- Laboratory of Plant Breeding, Wageningen University, P.O. Box 386, 6700 AJ Wageningen, The Netherlands
| | - Willem J Stiekema
- Laboratory of Bioinformatics, Wageningen University, Transitorium, Dreijenlaan 3, 6703 HA Wageningen, The Netherlands
- Centre for BioSystems Genomics (CBSG), PO Box 98, 6700 AB Wageningen, The Netherlands
| | - Roeland CHJ van Ham
- Applied Bioinformatics, Plant Research International, PO Box 16, 6700 AA, Wageningen, The Netherlands
- Laboratory of Bioinformatics, Wageningen University, Transitorium, Dreijenlaan 3, 6703 HA Wageningen, The Netherlands
| |
Collapse
|
32
|
Huo N, Lazo GR, Vogel JP, You FM, Ma Y, Hayden DM, Coleman-Derr D, Hill TA, Dvorak J, Anderson OD, Luo MC, Gu YQ. The nuclear genome of Brachypodium distachyon: analysis of BAC end sequences. Funct Integr Genomics 2007; 8:135-47. [PMID: 17985162 DOI: 10.1007/s10142-007-0062-7] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2007] [Revised: 10/04/2007] [Accepted: 10/06/2007] [Indexed: 10/22/2022]
Abstract
Due in part to its small genome (approximately 350 Mb), Brachypodium distachyon is emerging as a model system for temperate grasses, including important crops like wheat and barley. We present the analysis of 10.9% of the Brachypodium genome based on 64,696 bacterial artificial chromosome (BAC) end sequences (BES). Analysis of repeat DNA content in BES revealed that approximately 11.0% of the genome consists of known repetitive DNA. The vast majority of the Brachypodium repetitive elements are LTR retrotransposons. While Bare-1 retrotransposons are common to wheat and barley, Brachypodium repetitive element sequence-1 (BRES-1), closely related to Bare-1, is also abundant in Brachypodium. Moreover, unique Brachypodium repetitive element sequences identified constitute approximately 7.4% of its genome. Simple sequence repeats from BES were analyzed, and flanking primer sequences for SSR detection potentially useful for genetic mapping are available at http://brachypodium.pw.usda.gov . Sequence analyses of BES indicated that approximately 21.2% of the Brachypodium genome represents coding sequence. Furthermore, Brachypodium BES have more significant matches to ESTs from wheat than rice or maize, although these species have similar sizes of EST collections. A phylogenetic analysis based on 335 sequences shared among seven grass species further revealed a closer relationship between Brachypodium and Triticeae than Brachypodium and rice or maize.
Collapse
Affiliation(s)
- Naxin Huo
- Genomics and Gene Discovery Research Unit, USDA-ARS Western Regional Research Center, 800 Buchanan Street, Albany, CA 94710, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Telgmann-Rauber A, Jamsari A, Kinney MS, Pires JC, Jung C. Genetic and physical maps around the sex-determining M-locus of the dioecious plant asparagus. Mol Genet Genomics 2007; 278:221-34. [PMID: 17609979 DOI: 10.1007/s00438-007-0235-z] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2007] [Accepted: 03/26/2007] [Indexed: 11/30/2022]
Abstract
Asparagus officinalis L. is a dioecious plant. A region called the M-locus located on a pair of homomorphic sex chromosomes controls the sexual dimorphism in asparagus. The aim of this work was to clone the region determining sex in asparagus from its position in the genome. The structure of the region encompassing M should be investigated and compared to the sex-determining regions in other dioecious model species. To establish an improved basis for physical mapping, a high-resolution genetic map was enriched with AFLP markers closely linked to the target locus by carrying out a bulked segregant analysis. By screening a BAC library with AFLP- and STS-markers followed by chromosome walking, a physical map with eight contigs could be established. However, the gaps between the contigs could not be closed due to a plethora of repetitive elements. Surprisingly, two of the contigs on one side of the M-locus did not overlap although they have been established with two markers, which mapped in a distance as low as 0.25 cM flanking the sex locus. Thus, the clustering of the markers indicates a reduced recombination frequency within the M-region. On the opposite side of the M-locus, a contig was mapped in a distance of 0.38 cM. Four closely linked BAC clones were partially sequenced and 64 putative ORFs were identified. Interestingly, only 25% of the ORFs showed sequence similarity to known proteins and ESTs. In addition, an accumulation of repetitive sequences and a low gene density was revealed in the sex-determining region of asparagus. Molecular cytogenetic and sequence analysis of BACs flanking the M-locus indicate that the BACs contain highly repetitive sequences that localize to centromeric and pericentromeric locations on all asparagus chromosomes, which hindered the localization of the M-locus to the single pair of sex chromosomes. We speculate that dioecious Silene, papaya and Asparagus species may represent three stages in the evolution of XX, XY sex determination systems. Given that asparagus still rarely produces hermaphroditic flowers and has homomorphic sex chromosomes, this species may be an ideal system to further investigates early sex chromosome evolution and the origins of dioecy.
Collapse
Affiliation(s)
- Alexa Telgmann-Rauber
- Plant Breeding Institute, Christian-Albrechts-University Kiel, Olshausenstr. 40, Kiel 24098, Germany
| | | | | | | | | |
Collapse
|
34
|
Cheung F, Town CD. A BAC end view of the Musa acuminata genome. BMC PLANT BIOLOGY 2007; 7:29. [PMID: 17562019 PMCID: PMC1904220 DOI: 10.1186/1471-2229-7-29] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/28/2006] [Accepted: 06/11/2007] [Indexed: 05/04/2023]
Abstract
BACKGROUND Musa species contain the fourth most important crop in developing countries. Here, we report the analysis of 6,252 BAC end-sequences, in order to view the sequence composition of the Musa acuminata genome in a cost effective and efficient manner. RESULTS BAC end sequencing generated 6,252 reads representing 4,420,944 bp, including 2,979 clone pairs with an average read length after cleaning and filtering of 707 bp. All sequences have been submitted to GenBank, with the accession numbers DX451975-DX458350. The BAC end-sequences, were searched against several databases and significant homology was found to mitochondria and chloroplast (2.6%), transposons and repetitive sequences (36%) and proteins (11%). Functional interpretation of the protein matches was carried out by Gene Ontology assignments from matches to Arabidopsis and was shown to cover a broad range of categories. From protein matching regions of Musa BAC end-sequences, it was determined that the GC content of coding regions was 47%. Where protein matches encompassed a start codon, GC content as a function of position (5' to 3') across 129 bp sliding windows generates a "rice-like" gradient. A total of 352 potential SSR markers were discovered. The most abundant simple sequence repeats in four size categories were AT-rich. After filtering mitochondria and chloroplast matches, thousands of BAC end-sequences had a significant BLASTN match to the Oryza sativa and Arabidopsis genome sequence. Of these, a small number of BAC end-sequence pairs were shown to map to neighboring regions of the Oryza sativa genome representing regions of potential microsynteny. CONCLUSION Database searches with the BAC end-sequences and ab initio analysis identified those reads likely to contain transposons, repeat sequences, proteins and simple sequence repeats. Approximately 600 BAC end-sequences contained protein sequences that were not found in the existing available Musa expressed sequence tags, repeat or transposon databases. In addition, gene statistics, GC content and profile could also be estimated based on the region matching the top protein hit. A small number of BAC end pair sequences can be mapped to neighboring regions of the Oryza sativa representing regions of potential microsynteny. These results suggest that a large-scale BAC end sequencing strategy has the potential to anchor a small proportion of the genome of Musa acuminata to the genomes of Oryza sativa and possibly Arabidopsis.
Collapse
Affiliation(s)
- Foo Cheung
- J Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850 USA
| | - Christopher D Town
- J Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850 USA
| |
Collapse
|
35
|
Yu Q, Hou S, Hobza R, Feltus FA, Wang X, Jin W, Skelton RL, Blas A, Lemke C, Saw JH, Moore PH, Alam M, Jiang J, Paterson AH, Vyskot B, Ming R. Chromosomal location and gene paucity of the male specific region on papaya Y chromosome. Mol Genet Genomics 2007; 278:177-85. [PMID: 17520292 DOI: 10.1007/s00438-007-0243-z] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2007] [Accepted: 04/26/2007] [Indexed: 10/23/2022]
Abstract
Sex chromosomes in flowering plants evolved recently and many of them remain homomorphic, including those in papaya. We investigated the chromosomal location of papaya's small male specific region of the hermaphrodite Y (Yh) chromosome (MSY) and its genomic features. We conducted chromosome fluorescence in situ hybridization mapping of Yh-specific bacterial artificial chromosomes (BACs) and placed the MSY near the centromere of the papaya Y chromosome. Then we sequenced five MSY BACs to examine the genomic features of this specialized region, which resulted in the largest collection of contiguous genomic DNA sequences of a Y chromosome in flowering plants. Extreme gene paucity was observed in the papaya MSY with no functional gene identified in 715 kb MSY sequences. A high density of retroelements and local sequence duplications were detected in the MSY that is suppressed for recombination. Location of the papaya MSY near the centromere might have provided recombination suppression and fostered paucity of genes in the male specific region of the Y chromosome. Our findings provide critical information for deciphering the sex chromosomes in papaya and reference information for comparative studies of other sex chromosomes in animals and plants.
Collapse
Affiliation(s)
- Qingyi Yu
- Hawaii Agriculture Research Center, Aiea, HI 96701, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Freeling M, Rapaka L, Lyons E, Pedersen B, Thomas BC. G-boxes, bigfoot genes, and environmental response: characterization of intragenomic conserved noncoding sequences in Arabidopsis. THE PLANT CELL 2007; 19:1441-57. [PMID: 17496117 PMCID: PMC1913728 DOI: 10.1105/tpc.107.050419] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2007] [Revised: 03/10/2007] [Accepted: 04/19/2007] [Indexed: 05/15/2023]
Abstract
A tetraploidy left Arabidopsis thaliana with 6358 pairs of homoeologs that, when aligned, generated 14,944 intragenomic conserved noncoding sequences (CNSs). Our previous work assembled these phylogenetic footprints into a database. We show that known transcription factor (TF) binding motifs, including the G-box, are overrepresented in these CNSs. A total of 254 genes spanning long lengths of CNS-rich chromosomes (Bigfoot) dominate this database. Therefore, we made subdatabases: one containing Bigfoot genes and the other containing genes with three to five CNSs (Smallfoot). Bigfoot genes are generally TFs that respond to signals, with their modal CNS positioned 3.1 kb 5' from the ATG. Smallfoot genes encode components of signal transduction machinery, the cytoskeleton, or involve transcription. We queried each subdatabase with each possible 7-nucleotide sequence. Among hundreds of hits, most were purified from CNSs, and almost all of those significantly enriched in CNSs had no experimental history. The 7-mers in CNSs are not 5'- to 3'-oriented in Bigfoot genes but are often oriented in Smallfoot genes. CNSs with one G-box tend to have two G-boxes. CNSs were shared with the homoeolog only and with no other gene, suggesting that binding site turnover impedes detection. Bigfoot genes may function in adaptation to environmental change.
Collapse
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA.
| | | | | | | | | |
Collapse
|