1
|
Construction of Whole Genomes from Scaffolds Using Single Cell Strand-Seq Data. Int J Mol Sci 2021; 22:ijms22073617. [PMID: 33807210 PMCID: PMC8037727 DOI: 10.3390/ijms22073617] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 03/23/2021] [Accepted: 03/27/2021] [Indexed: 12/26/2022] Open
Abstract
Accurate reference genome sequences provide the foundation for modern molecular biology and genomics as the interpretation of sequence data to study evolution, gene expression, and epigenetics depends heavily on the quality of the genome assembly used for its alignment. Correctly organising sequenced fragments such as contigs and scaffolds in relation to each other is a critical and often challenging step in the construction of robust genome references. We previously identified misoriented regions in the mouse and human reference assemblies using Strand-seq, a single cell sequencing technique that preserves DNA directionality Here we demonstrate the ability of Strand-seq to build and correct full-length chromosomes by identifying which scaffolds belong to the same chromosome and determining their correct order and orientation, without the need for overlapping sequences. We demonstrate that Strand-seq exquisitely maps assembly fragments into large related groups and chromosome-sized clusters without using new assembly data. Using template strand inheritance as a bi-allelic marker, we employ genetic mapping principles to cluster scaffolds that are derived from the same chromosome and order them within the chromosome based solely on directionality of DNA strand inheritance. We prove the utility of our approach by generating improved genome assemblies for several model organisms including the ferret, pig, Xenopus, zebrafish, Tasmanian devil and the Guinea pig.
Collapse
|
2
|
Li C, Chen L, Liu X, Shi X, Guo Y, Huang R, Nie F, Zheng C, Zhang C, Ma RZ. A high-density BAC physical map covering the entire MHC region of addax antelope genome. BMC Genomics 2019; 20:479. [PMID: 31185912 PMCID: PMC6558854 DOI: 10.1186/s12864-019-5790-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2019] [Accepted: 05/10/2019] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND The mammalian major histocompatibility complex (MHC) harbours clusters of genes associated with the immunological defence of animals against infectious pathogens. At present, no complete MHC physical map is available for any of the wild ruminant species in the world. RESULTS The high-density physical map is composed of two contigs of 47 overlapping bacterial artificial chromosome (BAC) clones, with an average of 115 Kb for each BAC, covering the entire addax MHC genome. The first contig has 40 overlapping BAC clones covering an approximately 2.9 Mb region of MHC class I, class III, and class IIa, and the second contig has 7 BAC clones covering an approximately 500 Kb genomic region that harbours MHC class IIb. The relative position of each BAC corresponding to the MHC sequence was determined by comparative mapping using PCR screening of the BAC library of 192,000 clones, and the order of BACs was determined by DNA fingerprinting. The overlaps of neighboring BACs were cross-verified by both BAC-end sequencing and co-amplification of identical PCR fragments within the overlapped region, with their identities further confirmed by DNA sequencing. CONCLUSIONS We report here the successful construction of a high-quality physical map for the addax MHC region using BACs and comparative mapping. The addax MHC physical map we constructed showed one gap of approximately 18 Mb formed by an ancient autosomal inversion that divided the MHC class II into IIa and IIb. The autosomal inversion provides compelling evidence that the MHC organizations in all of the ruminant species are relatively conserved.
Collapse
Affiliation(s)
- Chaokun Li
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, S2-316 Building #2, West Beichen Road, Chaoyang District, Beijing, 100101, China
- School of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Longxin Chen
- Zhengzhou Key Laboratory of Molecular Biology, Zhengzhou Normal University, Zhengzhou, 450044, China
| | - Xuefeng Liu
- Beijing Key Laboratory of Captive Wildlife Technologies, Beijing Zoo, Beijing, 100044, China
| | - Xiaoqian Shi
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, S2-316 Building #2, West Beichen Road, Chaoyang District, Beijing, 100101, China
- School of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yu Guo
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, S2-316 Building #2, West Beichen Road, Chaoyang District, Beijing, 100101, China
- School of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Rui Huang
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, S2-316 Building #2, West Beichen Road, Chaoyang District, Beijing, 100101, China
- School of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Fangyuan Nie
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, S2-316 Building #2, West Beichen Road, Chaoyang District, Beijing, 100101, China
- School of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Changming Zheng
- Beijing Key Laboratory of Captive Wildlife Technologies, Beijing Zoo, Beijing, 100044, China
| | - Chenglin Zhang
- Beijing Key Laboratory of Captive Wildlife Technologies, Beijing Zoo, Beijing, 100044, China.
- Beijing Zoo, No. 137 West straight door Avenue, Xicheng District, Beijing, 100032, China.
| | - Runlin Z Ma
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, S2-316 Building #2, West Beichen Road, Chaoyang District, Beijing, 100101, China.
- Zhengzhou Key Laboratory of Molecular Biology, Zhengzhou Normal University, Zhengzhou, 450044, China.
- School of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
3
|
Fomin E. A Simple Approach to the Reconstruction of a Set of Points from the Multiset of n2 Pairwise Distances in n2 Steps for the Sequencing Problem: II. Algorithm. J Comput Biol 2016; 23:934-942. [DOI: 10.1089/cmb.2016.0046] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Eduard Fomin
- Institute of Cytology and Genetics, SB RAS, Novosibirsk, Russia
| |
Collapse
|
4
|
Pan Y, Wang X, Liu L, Wang H, Luo M. Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data. PLoS One 2016; 11:e0161583. [PMID: 27611682 PMCID: PMC5017645 DOI: 10.1371/journal.pone.0161583] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2016] [Accepted: 08/08/2016] [Indexed: 11/19/2022] Open
Abstract
A good physical map is essential to guide sequence assembly in de novo whole genome sequencing, especially when sequences are produced by high-throughput sequencing such as next-generation-sequencing (NGS) technology. We here present a novel method, Feature sets-based Genome Mapping (FGM). With FGM, physical map and draft whole genome sequences can be generated, anchored and integrated using the same data set of NGS sequences, independent of restriction digestion. Method model was created and parameters were inspected by simulations using the Arabidopsis genome sequence. In the simulations, when ~4.8X genome BAC library including 4,096 clones was used to sequence the whole genome, ~90% of clones were successfully connected to physical contigs, and 91.58% of genome sequences were mapped and connected to chromosomes. This method was experimentally verified using the existing physical map and genome sequence of rice. Of 4,064 clones covering 115 Mb sequence selected from ~3 tiles of 3 chromosomes of a rice draft physical map, 3,364 clones were reconstructed into physical contigs and 98 Mb sequences were integrated into the 3 chromosomes. The physical map-integrated draft genome sequences can provide permanent frameworks for eventually obtaining high-quality reference sequences by targeted sequencing, gap filling and combining other sequences.
Collapse
Affiliation(s)
- Yonglong Pan
- National Key Laboratory of Crop Genetic Improvement and College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China
| | - Xiaoming Wang
- National Key Laboratory of Crop Genetic Improvement and College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China
| | - Lin Liu
- National Key Laboratory of Crop Genetic Improvement and College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China
| | - Hao Wang
- National Key Laboratory of Crop Genetic Improvement and College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China
| | - Meizhong Luo
- National Key Laboratory of Crop Genetic Improvement and College of Life Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China
- * E-mail:
| |
Collapse
|
5
|
Jung CJ, Ménoret S, Brusselle L, Tesson L, Usal C, Chenouard V, Remy S, Ouisse LH, Poirier N, Vanhove B, de Jong PJ, Anegon I. Comparative Analysis of piggyBac, CRISPR/Cas9 and TALEN Mediated BAC Transgenesis in the Zygote for the Generation of Humanized SIRPA Rats. Sci Rep 2016; 6:31455. [PMID: 27530248 PMCID: PMC4987655 DOI: 10.1038/srep31455] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2016] [Accepted: 07/14/2016] [Indexed: 01/12/2023] Open
Abstract
BAC transgenic mammalian systems offer an important platform for recapitulating human gene expression and disease modeling. While the larger body mass, and greater genetic and physiologic similarity to humans render rats well suited for reproducing human immune diseases and evaluating therapeutic strategies, difficulties of generating BAC transgenic rats have hindered progress. Thus, an efficient method for BAC transgenesis in rats would be valuable. Immunodeficient mice carrying a human SIRPA transgene have previously been shown to support improved human cell hematopoiesis. Here, we have generated for the first time, human SIRPA BAC transgenic rats, for which the gene is faithfully expressed, functionally active, and germline transmissible. To do this, human SIRPA BAC was modified with elements to work in coordination with genome engineering technologies-piggyBac, CRISPR/Cas9 or TALEN. Our findings show that piggyBac transposition is a more efficient approach than the classical BAC transgenesis, resulting in complete BAC integration with predictable end sequences, thereby permitting precise assessment of the integration site. Neither CRISPR/Cas9 nor TALEN increased BAC transgenesis. Therefore, an efficient generation of human SIRPA transgenic rats using piggyBac opens opportunities for expansion of humanized transgenic rat models in the future to advance biomedical research and therapeutic applications.
Collapse
Affiliation(s)
- Chris J Jung
- Center for Genetics, Children's Hospital Oakland Research Institute, CA 94609, Oakland, USA
| | - Séverine Ménoret
- Platform Rat Transgenesis Immunophenomic, SFR Francois Bonamy, CNRS UMS3556 Nantes, F44093, France.,INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France
| | - Lucas Brusselle
- Platform Rat Transgenesis Immunophenomic, SFR Francois Bonamy, CNRS UMS3556 Nantes, F44093, France.,INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France
| | - Laurent Tesson
- Platform Rat Transgenesis Immunophenomic, SFR Francois Bonamy, CNRS UMS3556 Nantes, F44093, France.,INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France
| | - Claire Usal
- Platform Rat Transgenesis Immunophenomic, SFR Francois Bonamy, CNRS UMS3556 Nantes, F44093, France.,INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France
| | - Vanessa Chenouard
- Platform Rat Transgenesis Immunophenomic, SFR Francois Bonamy, CNRS UMS3556 Nantes, F44093, France.,INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France
| | - Séverine Remy
- Platform Rat Transgenesis Immunophenomic, SFR Francois Bonamy, CNRS UMS3556 Nantes, F44093, France.,INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France
| | - Laure-Hélène Ouisse
- Platform Rat Transgenesis Immunophenomic, SFR Francois Bonamy, CNRS UMS3556 Nantes, F44093, France.,INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France
| | - Nicolas Poirier
- INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France.,OSE Immunotherapeutics, 44000 Nantes, France
| | - Bernard Vanhove
- INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France.,OSE Immunotherapeutics, 44000 Nantes, France
| | - Pieter J de Jong
- Center for Genetics, Children's Hospital Oakland Research Institute, CA 94609, Oakland, USA
| | - Ignacio Anegon
- Platform Rat Transgenesis Immunophenomic, SFR Francois Bonamy, CNRS UMS3556 Nantes, F44093, France.,INSERM UMR 1064-ITUN; CHU de Nantes, Nantes F44093, France
| |
Collapse
|
6
|
Abstract
Unraveling the complex network of neural circuits that form the nervous system demands tools that can manipulate specific circuits. The recent evolution of genetic tools to target neural circuits allows an unprecedented precision in elucidating their function. Here we describe two general approaches for achieving circuit specificity. The first uses the genetic identity of a cell, such as a transcription factor unique to a circuit, to drive expression of a molecule that can manipulate cell function. The second uses the spatial connectivity of a circuit to achieve specificity: one genetic element is introduced at the origin of a circuit and the other at its termination. When the two genetic elements combine within a neuron, they can alter its function. These two general approaches can be combined to allow manipulation of neurons with a specific genetic identity by introducing a regulatory gene into the origin or termination of the circuit. We consider the advantages and disadvantages of both these general approaches with regard to specificity and efficacy of the manipulations. We also review the genetic techniques that allow gain- and loss-of-function within specific neural circuits. These approaches introduce light-sensitive channels (optogenetic) or drug sensitive channels (chemogenetic) into neurons that form specific circuits. We compare these tools with others developed for circuit-specific manipulation and describe the advantages of each. Finally, we discuss how these tools might be applied for identification of the neural circuits that mediate behavior and for repair of neural connections.
Collapse
Affiliation(s)
- Hong Geun Park
- Burke Medical Research Institute, White Plains, NY, USA.
| | - Jason B Carmel
- Burke Medical Research Institute, White Plains, NY, USA
- Brain and Mind Research Institute and Departments of Neurology and Pediatrics, Weill Cornell Medical College, New York, NY, USA
| |
Collapse
|
7
|
Major Histocompatibility Complex Genes Map to Two Chromosomes in an Evolutionarily Ancient Reptile, the Tuatara Sphenodon punctatus. G3-GENES GENOMES GENETICS 2015; 5:1439-51. [PMID: 25953959 PMCID: PMC4502378 DOI: 10.1534/g3.115.017467] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Major histocompatibility complex (MHC) genes are a central component of the vertebrate immune system and usually exist in a single genomic region. However, considerable differences in MHC organization and size exist between different vertebrate lineages. Reptiles occupy a key evolutionary position for understanding how variation in MHC structure evolved in vertebrates, but information on the structure of the MHC region in reptiles is limited. In this study, we investigate the organization and cytogenetic location of MHC genes in the tuatara (Sphenodon punctatus), the sole extant representative of the early-diverging reptilian order Rhynchocephalia. Sequencing and mapping of 12 clones containing class I and II MHC genes from a bacterial artificial chromosome library indicated that the core MHC region is located on chromosome 13q. However, duplication and translocation of MHC genes outside of the core region was evident, because additional class I MHC genes were located on chromosome 4p. We found a total of seven class I sequences and 11 class II β sequences, with evidence for duplication and pseudogenization of genes within the tuatara lineage. The tuatara MHC is characterized by high repeat content and low gene density compared with other species and we found no antigen processing or MHC framework genes on the MHC gene-containing clones. Our findings indicate substantial differences in MHC organization in tuatara compared with mammalian and avian MHCs and highlight the dynamic nature of the MHC. Further sequencing and annotation of tuatara and other reptile MHCs will determine if the tuatara MHC is representative of nonavian reptiles in general.
Collapse
|
8
|
Liu K, Zhang P, Gao J, Liu H, Li G, Qiu Z, Zhang Y, Ren J, Tan P, Ma RZ. Closing a gap in the physical map of the ovine major histocompatibility complex. Anim Genet 2015; 42:204-7. [PMID: 20497154 DOI: 10.1111/j.1365-2052.2010.02083.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A 184 kb gap in an ovine MHC physical map was successfully closed by identification of two overlapping clones (304C7 and 222G18) from a Chinese fine wool merino sheep BAC library. The location and tiling path of the two clones were confirmed by BAC-end sequencing and PCR amplification of loci in overlapping regions. Full-length sequencing of the clones identified 13 novel ovine genes in the gap between loci Notch4 and Btnl2, and eight of them belonging to the Butyrophilin-like (Btn-like or Btnl) gene family. The scattered distribution of the Btnl gene cluster at the gap provided a clue to explain the difficulties previously experienced in closing the gap. Completed BAC contigs of the ovine MHC will facilitate sequencing of the entire ovine leukocyte antigen (OLA) region, providing detailed information for comparative studies of MHC evolution.
Collapse
Affiliation(s)
- K Liu
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China.Graduate School of the Chinese Academy of Sciences, Beijing 100149, China.College of Life Sciences, Shihezi University, Xinjiang 832000, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Hoskins RA, Carlson JW, Wan KH, Park S, Mendez I, Galle SE, Booth BW, Pfeiffer BD, George RA, Svirskas R, Krzywinski M, Schein J, Accardo MC, Damia E, Messina G, Méndez-Lago M, de Pablos B, Demakova OV, Andreyeva EN, Boldyreva LV, Marra M, Carvalho AB, Dimitri P, Villasante A, Zhimulev IF, Rubin GM, Karpen GH, Celniker SE. The Release 6 reference sequence of the Drosophila melanogaster genome. Genome Res 2015; 25:445-58. [PMID: 25589440 PMCID: PMC4352887 DOI: 10.1101/gr.185579.114] [Citation(s) in RCA: 313] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Drosophila melanogaster plays an important role in molecular,
genetic, and genomic studies of heredity, development, metabolism, behavior, and
human disease. The initial reference genome sequence reported more than a decade ago
had a profound impact on progress in Drosophila research, and
improving the accuracy and completeness of this sequence continues to be important to
further progress. We previously described improvement of the 117-Mb sequence in the
euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a
whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here,
we report an improved reference sequence of the single-copy and middle-repetitive
regions of the genome, produced using cytogenetic mapping to mitotic and polytene
chromosomes, clone-based finishing and BAC fingerprint verification, ordering of
scaffolds by alignment to cDNA sequences, incorporation of other map and sequence
data, and validation by whole-genome optical restriction mapping. These data
substantially improve the accuracy and completeness of the reference sequence and the
order and orientation of sequence scaffolds into chromosome arm assemblies.
Representation of the Y chromosome and other heterochromatic regions
is particularly improved. The new 143.9-Mb reference sequence, designated Release 6,
effectively exhausts clone-based technologies for mapping and sequencing. Highly
repeat-rich regions, including large satellite blocks and functional elements such as
the ribosomal RNA genes and the centromeres, are largely inaccessible to current
sequencing and assembly methods and remain poorly represented. Further significant
improvements will require sequencing technologies that do not depend on molecular
cloning and that produce very long reads.
Collapse
Affiliation(s)
- Roger A Hoskins
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA;
| | - Joseph W Carlson
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Kenneth H Wan
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Soo Park
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Ivonne Mendez
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Samuel E Galle
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Benjamin W Booth
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Barret D Pfeiffer
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia 20147, USA
| | - Reed A George
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia 20147, USA
| | - Robert Svirskas
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia 20147, USA
| | - Martin Krzywinski
- Genome Sciences Centre, BC Cancer Agency, Vancouver, BC, V5Z 4S6, Canada
| | - Jacqueline Schein
- Genome Sciences Centre, BC Cancer Agency, Vancouver, BC, V5Z 4S6, Canada
| | - Maria Carmela Accardo
- Dipartimento di Biologia e Biotecnologie "Charles Darwin" and Istituto Pasteur Fondazione Cenci-Bolognetti, Sapienza Università di Roma, 00185 Roma, Italy
| | - Elisabetta Damia
- Dipartimento di Biologia e Biotecnologie "Charles Darwin" and Istituto Pasteur Fondazione Cenci-Bolognetti, Sapienza Università di Roma, 00185 Roma, Italy
| | - Giovanni Messina
- Dipartimento di Biologia e Biotecnologie "Charles Darwin" and Istituto Pasteur Fondazione Cenci-Bolognetti, Sapienza Università di Roma, 00185 Roma, Italy
| | - María Méndez-Lago
- Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Beatriz de Pablos
- Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Olga V Demakova
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | - Evgeniya N Andreyeva
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | - Lidiya V Boldyreva
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | - Marco Marra
- Genome Sciences Centre, BC Cancer Agency, Vancouver, BC, V5Z 4S6, Canada
| | - A Bernardo Carvalho
- Departamento de Genética, Universidade Federal do Rio de Janeiro, CEP 21944-970, Rio de Janeiro, Brazil
| | - Patrizio Dimitri
- Dipartimento di Biologia e Biotecnologie "Charles Darwin" and Istituto Pasteur Fondazione Cenci-Bolognetti, Sapienza Università di Roma, 00185 Roma, Italy
| | - Alfredo Villasante
- Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Igor F Zhimulev
- Institute of Molecular and Cellular Biology, Russian Academy of Sciences, Novosibirsk, 630090, Russia; Novosibirsk State University, Novosibirsk, 630090, Russia
| | - Gerald M Rubin
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia 20147, USA
| | - Gary H Karpen
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
| | - Susan E Celniker
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA;
| |
Collapse
|
10
|
Przysiecka Ł, Książkiewicz M, Wolko B, Naganowska B. Structure, expression profile and phylogenetic inference of chalcone isomerase-like genes from the narrow-leafed lupin (Lupinus angustifolius L.) genome. FRONTIERS IN PLANT SCIENCE 2015; 6:268. [PMID: 25954293 PMCID: PMC4404975 DOI: 10.3389/fpls.2015.00268] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 04/03/2015] [Indexed: 05/20/2023]
Abstract
Lupins, like other legumes, have a unique biosynthesis scheme of 5-deoxy-type flavonoids and isoflavonoids. A key enzyme in this pathway is chalcone isomerase (CHI), a member of CHI-fold protein family, encompassing subfamilies of CHI1, CHI2, CHI-like (CHIL), and fatty acid-binding (FAP) proteins. Here, two Lupinus angustifolius (narrow-leafed lupin) CHILs, LangCHIL1 and LangCHIL2, were identified and characterized using DNA fingerprinting, cytogenetic and linkage mapping, sequencing and expression profiling. Clones carrying CHIL sequences were assembled into two contigs. Full gene sequences were obtained from these contigs, and mapped in two L. angustifolius linkage groups by gene-specific markers. Bacterial artificial chromosome fluorescence in situ hybridization approach confirmed the localization of two LangCHIL genes in distinct chromosomes. The expression profiles of both LangCHIL isoforms were very similar. The highest level of transcription was in the roots of the third week of plant growth; thereafter, expression declined. The expression of both LangCHIL genes in leaves and stems was similar and low. Comparative mapping to reference legume genome sequences revealed strong syntenic links; however, LangCHIL2 contig had a much more conserved structure than LangCHIL1. LangCHIL2 is assumed to be an ancestor gene, whereas LangCHIL1 probably appeared as a result of duplication. As both copies are transcriptionally active, questions arise concerning their hypothetical functional divergence. Screening of the narrow-leafed lupin genome and transcriptome with CHI-fold protein sequences, followed by Bayesian inference of phylogeny and cross-genera synteny survey, identified representatives of all but one (CHI1) main subfamilies. They are as follows: two copies of CHI2, FAPa2 and CHIL, and single copies of FAPb and FAPa1. Duplicated genes are remnants of whole genome duplication which is assumed to have occurred after the divergence of Lupinus, Arachis, and Glycine.
Collapse
Affiliation(s)
- Łucja Przysiecka
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of SciencesPoznań, Poland
- NanoBioMedical Centre, Adam Mickiewicz UniversityPoznań, Poland
| | - Michał Książkiewicz
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of SciencesPoznań, Poland
- *Correspondence: Michał Książkiewicz, Department of Genomics, Institute of Plant Genetics of the Polish Academy of Sciences, Strzeszyńska 34, Poznań 60-479, Poland
| | - Bogdan Wolko
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of SciencesPoznań, Poland
| | - Barbara Naganowska
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of SciencesPoznań, Poland
| |
Collapse
|
11
|
Barakat TS, Gribnau J. Generation of knockout alleles by RFLP based BAC targeting of polymorphic embryonic stem cells. Methods Mol Biol 2015; 1227:143-80. [PMID: 25239745 DOI: 10.1007/978-1-4939-1652-8_7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The isolation of germ line competent mouse Embryonic Stem (ES) cells and the ability to modify the genome by homologous recombination has revolutionized life science research. Since its initial discovery, several approaches have been introduced to increase the efficiency of homologous recombination, including the use of isogenic DNA for the generation of targeting constructs, and the use of Bacterial Artificial Chromosomes (BACs). BACs have the advantage of combining long stretches of homologous DNA, thereby increasing targeting efficiencies, with the possibilities delivered by BAC recombineering approaches, which provide the researcher with almost unlimited possibilities to efficiently edit the genome in a controlled fashion. Despite these advantages of BAC targeting approaches, a widespread use has been hampered, mainly because of the difficulties in identifying BAC-targeted knockout alleles by conventional methods like Southern Blotting. Recently, we introduced a novel BAC targeting strategy, in which Restriction Fragment Length Polymorphisms (RFLPs) are targeted in polymorphic mouse ES cells, enabling an efficient and easy PCR-based readout to identify properly targeted alleles. Here we provide a detailed protocol for the generation of targeting constructs, targeting of ES cells, and convenient PCR-based analysis of targeted clones, which enable the user to generate knockout ES cells of almost every gene in the mouse genome within a 2-month period.
Collapse
Affiliation(s)
- Tahsin Stefan Barakat
- Department of Reproduction and Development, Erasmus MC, University Medical Center, Room Ee 09-71, PO Box 2040, 3000 CA, Rotterdam, The Netherlands,
| | | |
Collapse
|
12
|
Książkiewicz M, Zielezinski A, Wyrwa K, Szczepaniak A, Rychel S, Karlowski W, Wolko B, Naganowska B. Remnants of the Legume Ancestral Genome Preserved in Gene-Rich Regions: Insights from Lupinus angustifolius Physical, Genetic, and Comparative Mapping. PLANT MOLECULAR BIOLOGY REPORTER 2015; 33:84-101. [PMID: 25620837 PMCID: PMC4295026 DOI: 10.1007/s11105-014-0730-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The narrow-leafed lupin (Lupinus angustifolius) was recently considered as a legume reference species. Genetic resources have been developed, including a draft genome sequence, linkage maps, nuclear DNA libraries, and cytogenetic chromosome-specific landmarks. Here, we used a complex approach, involving DNA fingerprinting, sequencing, genetic mapping, and molecular cytogenetics, to localize and analyze L. angustifolius gene-rich regions (GRRs). A L. angustifolius genomic bacterial artificial chromosome (BAC) library was screened with short sequence repeat (SSR)-based probes. Selected BACs were fingerprinted and assembled into contigs. BAC-end sequence (BES) annotation allowed us to choose clones for sequencing, targeting GRRs. Additionally, BESs were aligned to the scaffolds of the genome sequence. The genetic map was supplemented with 35 BES-derived markers, distributed in 14 linkage groups and tagging 37 scaffolds. The identified GRRs had an average gene density of 19.6 genes/100 kb and physical-to-genetic distance ratios of 11 to 109 kb/cM. Physical and genetic mapping was supported by multi-BAC-fluorescence in situ hybridization (FISH), and five new linkage groups were assigned to the chromosomes. Syntenic links to the genome sequences of five legume species (Medicago truncatula, Glycine max, Lotus japonicus, Phaseolus vulgaris, and Cajanus cajan) were identified. The comparative mapping of the two largest lupin GRRs provides novel evidence for ancient duplications in all of the studied species. These regions are conserved among representatives of the main clades of Papilionoideae. Furthermore, despite the complex evolution of legumes, some segments of the nuclear genome were not substantially modified and retained their quasi-ancestral structures. Cytogenetic markers anchored in these regions constitute a platform for heterologous mapping of legume genomes.
Collapse
Affiliation(s)
- Michał Książkiewicz
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznan, Poland
| | - Andrzej Zielezinski
- Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, 61-614 Poznan, Poland
| | - Katarzyna Wyrwa
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznan, Poland
| | - Anna Szczepaniak
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznan, Poland
| | - Sandra Rychel
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznan, Poland
| | - Wojciech Karlowski
- Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, 61-614 Poznan, Poland
| | - Bogdan Wolko
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznan, Poland
| | - Barbara Naganowska
- Department of Genomics, Institute of Plant Genetics of the Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznan, Poland
| |
Collapse
|
13
|
Zhang J, Shao C, Zhang L, Liu K, Gao F, Dong Z, Xu P, Chen S. A first generation BAC-based physical map of the half-smooth tongue sole (Cynoglossus semilaevis) genome. BMC Genomics 2014; 15:215. [PMID: 24650389 PMCID: PMC3998196 DOI: 10.1186/1471-2164-15-215] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Accepted: 03/10/2014] [Indexed: 02/06/2023] Open
Abstract
Background Half-smooth tongue sole (Cynoglossus semilaevis Günther) has been exploited as a commercially important cultured marine flatfish, and female grows 2–3 times faster than male. Genetic studies, especially on the chromosomal sex-determining system of this species, have been carried out in the last decade. Although the genome of half-smooth tongue sole was relatively small (626.9 Mb), there are still some difficulties in the high-quality assembly of the next generation genome sequencing reads without the assistance of a physical map, especially for the W chromosome of this fish due to abundance of repetitive sequences. The objective of this study is to construct a bacterial artificial chromosome (BAC)-based physical map for half-smooth tongue sole with the method of high information content fingerprinting (HICF). Results A physical map of half-smooth tongue sole was constructed with 30, 294 valid fingerprints (7.5 × genome coverage) with a tolerance of 4 and an initial cutoff of 1e-60. A total of 29,709 clones were assembled into 1,485 contigs with an average length of 539 kb and a N50 length of 664 kb. There were 394 contigs longer than the N50 length, and these contigs will be a useful resource for future integration with linkage map and whole genome sequence assembly. The estimated physical length of the assembled contigs was 797 Mb, representing approximately 1.27 coverage of the half-smooth tongue sole genome. The largest contig contained 410 BAC clones with a physical length of 3.48 Mb. Almost all of the 676 BAC clones (99.9%) in the 21 randomly selected contigs were positively validated by PCR assays, thereby confirming the reliability of the assembly. Conclusions A first generation BAC-based physical map of half-smooth tongue sole was constructed with high reliability. The map will promote genetic improvement programs of this fish, especially integration of physical and genetic maps, fine-mappings of important gene and/or QTL, comparative and evolutionary genomics studies, as well as whole genome sequence assembly.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Peng Xu
- Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao 266071, China.
| | | |
Collapse
|
14
|
Varshney RK, Mir RR, Bhatia S, Thudi M, Hu Y, Azam S, Zhang Y, Jaganathan D, You FM, Gao J, Riera-Lizarazu O, Luo MC. Integrated physical, genetic and genome map of chickpea (Cicer arietinum L.). Funct Integr Genomics 2014; 14:59-73. [PMID: 24610029 PMCID: PMC4273598 DOI: 10.1007/s10142-014-0363-6] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2013] [Revised: 01/27/2014] [Accepted: 01/31/2014] [Indexed: 10/25/2022]
Abstract
Physical map of chickpea was developed for the reference chickpea genotype (ICC 4958) using bacterial artificial chromosome (BAC) libraries targeting 71,094 clones (~12× coverage). High information content fingerprinting (HICF) of these clones gave high-quality fingerprinting data for 67,483 clones, and 1,174 contigs comprising 46,112 clones and 3,256 singletons were defined. In brief, 574 Mb genome size was assembled in 1,174 contigs with an average of 0.49 Mb per contig and 3,256 singletons represent 407 Mb genome. The physical map was linked with two genetic maps with the help of 245 BAC-end sequence (BES)-derived simple sequence repeat (SSR) markers. This allowed locating some of the BACs in the vicinity of some important quantitative trait loci (QTLs) for drought tolerance and reistance to Fusarium wilt and Ascochyta blight. In addition, fingerprinted contig (FPC) assembly was also integrated with the draft genome sequence of chickpea. As a result, ~965 BACs including 163 minimum tilling path (MTP) clones could be mapped on eight pseudo-molecules of chickpea forming 491 hypothetical contigs representing 54,013,992 bp (~54 Mb) of the draft genome. Comprehensive analysis of markers in abiotic and biotic stress tolerance QTL regions led to identification of 654, 306 and 23 genes in drought tolerance "QTL-hotspot" region, Ascochyta blight resistance QTL region and Fusarium wilt resistance QTL region, respectively. Integrated physical, genetic and genome map should provide a foundation for cloning and isolation of QTLs/genes for molecular dissection of traits as well as markers for molecular breeding for chickpea improvement.
Collapse
Affiliation(s)
- Rajeev K. Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, India
| | - Reyazul Rouf Mir
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, India
| | - Sabhyata Bhatia
- National Institute of Plant Genome Research (NIPGR), New Delhi, India
| | - Mahendar Thudi
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, India
| | - Yuqin Hu
- University of California, Davis, USA
| | - Sarwar Azam
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, India
| | | | - Deepa Jaganathan
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, India
| | - Frank M. You
- Cereal Research Centre, Agriculture and Agri-Food Canada, Winnipeg, Canada
| | | | - Oscar Riera-Lizarazu
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, India
- Dow AgroSciences, Pullman, USA
| | | |
Collapse
|
15
|
Wells R, Trick M, Soumpourou E, Clissold L, Morgan C, Werner P, Gibbard C, Clarke M, Jennaway R, Bancroft I. The control of seed oil polyunsaturate content in the polyploid crop species Brassica napus. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2014; 33:349-362. [PMID: 24489479 PMCID: PMC3901927 DOI: 10.1007/s11032-013-9954-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2013] [Accepted: 09/05/2013] [Indexed: 05/18/2023]
Abstract
Many important plant species have polyploidy in their recent ancestry, complicating inferences about the genetic basis of trait variation. Although the principal locus controlling the proportion of polyunsaturated fatty acids (PUFAs) in seeds of Arabidopsis thaliana is known (fatty acid desaturase 2; FAD2), commercial cultivars of a related crop, oilseed rape (Brassica napus), with very low PUFA content have yet to be developed. We showed that a cultivar of oilseed rape with lower than usual PUFA content has non-functional alleles at three of the four orthologous FAD2 loci. To explore the genetic basis further, we developed an ethyl methanesulphonate mutagenised population, JBnaCAB_E, and used it to identify lines that also carried mutations in the remaining functional copy. This confirmed the hypothesised basis of variation, resulting in an allelic series of mutant lines showing a spectrum of PUFA contents of seed oil. Several lines had PUFA content of ~6 % and oleic acid content of ~84 %, achieving a long-standing industry objective: very high oleic, very low PUFA rapeseed without the use of genetic modification technology. The population contains a high rate of mutations and represents an important resource for research in B. napus.
Collapse
Affiliation(s)
- Rachel Wells
- John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - Martin Trick
- John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | | | - Leah Clissold
- John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
- Present Address: The Genome Analysis Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - Colin Morgan
- John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - Peter Werner
- KWS UK Ltd., 56 Church Street, Thriplow, Hertfordshire, SG8 7RE UK
| | - Carl Gibbard
- KWS UK Ltd., 56 Church Street, Thriplow, Hertfordshire, SG8 7RE UK
| | | | - Richard Jennaway
- Saaten-Union UK Ltd., Rosalie Field Station, Bradley Road, Cowlinge, Newmarket, Suffolk, CB8 9HU UK
| | - Ian Bancroft
- John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
- Present Address: Department of Biology, University of York, Heslington, York, YO41 5DD UK
| |
Collapse
|
16
|
BAIT: Organizing genomes and mapping rearrangements in single cells. Genome Med 2013; 5:82. [PMID: 24028793 PMCID: PMC3971352 DOI: 10.1186/gm486] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2013] [Accepted: 09/09/2013] [Indexed: 12/30/2022] Open
Abstract
Strand-seq is a single-cell sequencing technique to finely map sister chromatid exchanges (SCEs) and other rearrangements. To analyze these data, we introduce BAIT, software which assigns templates and identifies and localizes SCEs. We demonstrate BAIT can refine completed reference assemblies, identifying approximately 21 Mb of incorrectly oriented fragments and placing over half (2.6 Mb) of the orphan fragments in mm10/GRCm38. BAIT also stratifies scaffold-stage assemblies, potentially accelerating the assembling and finishing of reference genomes. BAIT is available at http://sourceforge.net/projects/bait/.
Collapse
|
17
|
Cação SMB, Silva NV, Domingues DS, Vieira LGE, Diniz LEC, Vinecky F, Alves GSC, Andrade AC, Carpentieri-Pipolo V, Pereira LFP. Construction and characterization of a BAC library from the Coffea arabica genotype Timor Hybrid CIFC 832/2. Genetica 2013; 141:217-26. [PMID: 23677718 DOI: 10.1007/s10709-013-9720-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2012] [Accepted: 05/02/2013] [Indexed: 10/26/2022]
Abstract
Most of the world's coffee production originates from Coffea arabica, an allotetraploid species with low genetic diversity and for which few genomic resources are available. Genomic libraries with large DNA fragment inserts are useful tools for the study of plant genomes, including the production of physical maps, integration studies of physical and genetic maps, genome structure analysis and gene isolation by positional cloning. Here, we report the construction and characterization of a Bacterial Artificial Chromosome (BAC) library from C. arabica Timor Hybrid CIFC 832/2, a parental genotype for several modern coffee cultivars. The BAC library consists of 56,832 clones with an average insert size of 118 kb, which represents a dihaploid genome coverage of five to sixfold. The content of organellar DNA was estimated at 1.04 and 0.5 % for chloroplast and mitochondrial DNA, respectively. The BAC library was screened for the NADPH-dependent mannose-6-phosphate reductase gene (CaM6PR) with markers positioned on four linkage groups of a partial C. arabica genetic map. A mixed approach using PCR and membrane hybridization of BAC pools allowed for the discovery of nine BAC clones with the CaM6PR gene and 53 BAC clones that were anchored to the genetic map with simple sequence repeat markers. This library will be a useful tool for future studies on comparative genomics and the identification of genes and regulatory elements controlling major traits in this economically important crop species.
Collapse
Affiliation(s)
- S M B Cação
- Laboratory of Plant Biotechnology, Instituto Agronomico do Paraná, CP 481 Londrina, Paraná 86001-970, Brazil
| | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Abstract
Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.
Collapse
Affiliation(s)
- Elaine R Mardis
- The Genome Institute at Washington University School of Medicine, St. Louis, Missouri 63108, USA.
| |
Collapse
|
19
|
Li G, Liu K, Jiao S, Liu H, Blair HT, Zhang P, Cui X, Tan P, Gao J, Ma RZ. A physical map of a BAC clone contig covering the entire autosome insertion between ovine MHC Class IIa and IIb. BMC Genomics 2012; 13:398. [PMID: 22897909 PMCID: PMC3475007 DOI: 10.1186/1471-2164-13-398] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2012] [Accepted: 08/03/2012] [Indexed: 11/24/2022] Open
Abstract
Background The ovine Major Histocompatibility Complex (MHC) harbors genes involved in overall resistance/susceptibility of the host to infectious diseases. Compared to human and mouse, the ovine MHC is interrupted by a large piece of autosome insertion via a hypothetical chromosome inversion that constitutes ~25% of ovine chromosome 20. The evolutionary consequence of such an inversion and an insertion (inversion/insertion) in relation to MHC function remains unknown. We previously constructed a BAC clone physical map for the ovine MHC exclusive of the insertion region. Here we report the construction of a high-density physical map covering the autosome insertion in order to address the question of what the inversion/insertion had to do with ruminants during the MHC evolution. Results A total of 119 pairs of comparative bovine oligo primers were utilized to screen an ovine BAC library for positive clones and the orders and overlapping relationships of the identified clones were determined by DNA fingerprinting, BAC-end sequencing, and sequence-specific PCR. A total of 368 positive BAC clones were identified and 108 of the effective clones were ordered into an overlapping BAC contig to cover the consensus region between ovine MHC class IIa and IIb. Therefore, a continuous physical map covering the entire ovine autosome inversion/insertion region was successfully constructed. The map confirmed the bovine sequence assembly for the same homologous region. The DNA sequences of 185 BAC-ends have been deposited into NCBI database with the access numbers HR309252 through HR309068, corresponding to dbGSS ID 30164010 through 30163826. Conclusions We have constructed a high-density BAC clone physical map for the ovine autosome inversion/insertion between the MHC class IIa and IIb. The entire ovine MHC region is now fully covered by a continuous BAC clone contig. The physical map we generated will facilitate MHC functional studies in the ovine, as well as the comparative MHC evolution in ruminants.
Collapse
Affiliation(s)
- Gang Li
- School of Life Sciences, Shihezi University, Xinjiang 832003, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Urban A, Rossier J. Genetic targeting of specific neuronal cell types in the cerebral cortex. PROGRESS IN BRAIN RESEARCH 2012; 196:163-92. [PMID: 22341326 DOI: 10.1016/b978-0-444-59426-6.00009-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Understanding the structure and function of cortical circuits requires the identification of and control over specific cell types in the cortex. To address these obstacles, recent optogenetic approaches have been developed. The capacity to activate, silence, or monitor specific cell types by combining genetics, virology, and optics will decipher the role of specific groups of neurons within circuits with a spatiotemporal resolution that overcomes standard approaches. In this review, the various strategies for selective genetic targeting of a defined neuronal population are discussed as well as the pros and cons of the use of transgenic animals and recombinant viral vectors for the expression of transgenes in a specific set of neurons.
Collapse
Affiliation(s)
- Alan Urban
- Laboratoire de Neurobiologie et Diversité Cellulaire, Centre National de la Recherche Scientifique, Unité Mixte de Recherche 7637, Ecole Supérieure de Physique et de Chimie Industrielles, Paris, France.
| | | |
Collapse
|
21
|
Hartwich H, Nothwang HG. An easy and versatile 2-step protocol for targeted modification and subcloning of DNA from bacterial artificial chromosomes using non-commercial plasmids. BMC Res Notes 2012; 5:156. [PMID: 22433714 PMCID: PMC3327625 DOI: 10.1186/1756-0500-5-156] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2011] [Accepted: 03/20/2012] [Indexed: 11/17/2022] Open
Abstract
Background Promoter-specific expression of foreign DNA in transgenic organisms often relies on bacterial artificial chromosomes (BACs). This approach requires modification and subcloning of BAC-DNA by recombineering technologies in Escherichia coli. Most current protocols rely on commercial kits or isolation of BACs, their transfer between different host strains, and their restriction. Findings In this report we present a 2-step protocol for efficient modification and subcloning of DNA from bacterial artificial chromosomes using the non-commercial plasmids pKM208 and pTP223, distributed from addgene.com. A targeting cassette was successfully integrated into a BAC and 42 kb of this construct were subcloned. Both a plasmid-derived substrate with longer homology arms and a PCR-generated substrate with short homology arms (50 bp) were used for recombination. pKM208 and pTP223 contain all required genes for recombineering, but differ in their antibiotic resistance genes. This makes the system independent of the selection markers on the DNA molecules targeted for recombination. Conclusions The time and cost saving protocol presented here compares favorably to currently used systems. Using non-commercial plasmids, it allows targeted modification and cloning of large DNA (> 40 kb) fragments in vivo without restriction and ligation. Furthermore, both steps are performed in the same host eliminating the need to isolate BAC DNA and to use different bacterial strains.
Collapse
Affiliation(s)
- Heiner Hartwich
- Department of Neurogenetics, Institute of Biology and Environmental Sciences, Carl von Ossietzky University, Carl-von-Ossietzky-Strasse 9-11, 26129 Oldenburg, Germany.
| | | |
Collapse
|
22
|
Bai Y, Sartor M, Cavalcoli J. Current status and future perspectives for sequencing livestock genomes. J Anim Sci Biotechnol 2012; 3:8. [PMID: 22958500 PMCID: PMC3436607 DOI: 10.1186/2049-1891-3-8] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2012] [Accepted: 03/01/2012] [Indexed: 11/10/2022] Open
Abstract
Only in recent years, the draft sequences for several agricultural animals have been assembled. Assembling an individual animal's entire genome sequence or specific region(s) of interest is increasingly important for agricultural researchers to perform genetic comparisons between animals with different performance. We review the current status for several sequenced agricultural species and suggest that next generation sequencing (NGS) technology with decreased sequencing cost and increased speed of sequencing can benefit agricultural researchers. By taking advantage of advanced NGS technologies, genes and chromosomal regions that are more labile to the influence of environmental factors could be pinpointed. A more long term goal would be addressing the question of how animals respond at the molecular and cellular levels to different environmental models (e.g. nutrition). Upon revealing important genes and gene-environment interactions, the rate of genetic improvement can also be accelerated. It is clear that NGS technologies will be able to assist animal scientists to efficiently raise animals and to better prevent infectious diseases so that overall costs of animal production can be decreased.
Collapse
Affiliation(s)
- Yongsheng Bai
- Center for Computational Medicine and Bioinformatics and Department of Computational Medicine and Bioinformatics, The University of Michigan, 100 Washtenaw Ave,, 2017 Palmer Commons, Ann Arbor, Michigan 48109-2218, USA.
| | | | | |
Collapse
|
23
|
Construction of BIBAC and BAC libraries from a variety of organisms for advanced genomics research. Nat Protoc 2012; 7:479-99. [PMID: 22343430 DOI: 10.1038/nprot.2011.456] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Large-insert BAC (bacterial artificial chromosome) and BIBAC (binary BAC) libraries are essential for modern genomics research for all organisms. We helped pioneer the BAC and BIBAC technologies, and by using them we have constructed hundreds of BAC and BIBAC libraries for different species of plants, animals, marine animals, insects, algae and microbes. These libraries have been used globally for different aspects of genomics research. Here we describe the procedure with the latest improvements that we have made and used for construction of BIBAC libraries. The procedure includes the preparation of BIBAC vectors, the preparation of clonable fragments of the desired size from the source DNA, the construction and transformation of BIBACs and, finally, the characterization and assembly of BIBAC libraries. We also specify the modifications necessary for construction of BAC libraries using the protocol. The entire protocol takes ∼7 d.
Collapse
|
24
|
Zhang M, Zhang Y, Scheuring CF, Wu CC, Dong JJ, Zhang HB. Preparation of megabase-sized DNA from a variety of organisms using the nuclei method for advanced genomics research. Nat Protoc 2012; 7:467-78. [PMID: 22343429 DOI: 10.1038/nprot.2011.455] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Megabase-sized DNA is crucial to modern genomics research of all organisms. Among the preparation methods developed, the nuclei method is the simplest and most widely used for preparing high-quality megabase-sized DNA from divergent organisms. In this method, nuclei are first isolated by physically grinding the source tissues. The nontarget cytoplast organellar genomes and metabolites are removed by centrifugation and washing, thus maximizing the utility of the method and substantially improving the digestibility and clonability of the resultant DNA. The nuclei are then embedded in an agarose matrix containing numerous pores, allowing the access of restriction enzymes while preventing the DNA from physical shearing. DNA is extracted from the nuclei, purified and subsequently manipulated in the agarose matrix. Here we describe the nuclei method that we have successfully used to prepare high-quality megabase-sized DNA from hundreds of plant, animal, fish, insect, algal and microbial species. The entire protocol takes ∼3 d.
Collapse
Affiliation(s)
- Meiping Zhang
- Department of Soil and Crop Sciences, Texas A&M University, College Station, Texas, USA
| | | | | | | | | | | |
Collapse
|
25
|
Philippe R, Choulet F, Paux E, van Oeveren J, Tang J, Wittenberg AHJ, Janssen A, van Eijk MJT, Stormo K, Alberti A, Wincker P, Akhunov E, van der Vossen E, Feuillet C. Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome. BMC Genomics 2012; 13:47. [PMID: 22289472 PMCID: PMC3311077 DOI: 10.1186/1471-2164-13-47] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2011] [Accepted: 01/30/2012] [Indexed: 01/28/2023] Open
Abstract
Background Sequencing projects using a clone-by-clone approach require the availability of a robust physical map. The SNaPshot technology, based on pair-wise comparisons of restriction fragments sizes, has been used recently to build the first physical map of a wheat chromosome and to complete the maize physical map. However, restriction fragments sizes shared randomly between two non-overlapping BACs often lead to chimerical contigs and mis-assembled BACs in such large and repetitive genomes. Whole Genome Profiling (WGP™) was developed recently as a new sequence-based physical mapping technology and has the potential to limit this problem. Results A subset of the wheat 3B chromosome BAC library covering 230 Mb was used to establish a WGP physical map and to compare it to a map obtained with the SNaPshot technology. We first adapted the WGP-based assembly methodology to cope with the complexity of the wheat genome. Then, the results showed that the WGP map covers the same length than the SNaPshot map but with 30% less contigs and, more importantly with 3.5 times less mis-assembled BACs. Finally, we evaluated the benefit of integrating WGP tags in different sequence assemblies obtained after Roche/454 sequencing of BAC pools. We showed that while WGP tag integration improves assemblies performed with unpaired reads and with paired-end reads at low coverage, it does not significantly improve sequence assemblies performed at high coverage (25x) with paired-end reads. Conclusions Our results demonstrate that, with a suitable assembly methodology, WGP builds more robust physical maps than the SNaPshot technology in wheat and that WGP can be adapted to any genome. Moreover, WGP tag integration in sequence assemblies improves low quality assembly. However, to achieve a high quality draft sequence assembly, a sequencing depth of 25x paired-end reads is required, at which point WGP tag integration does not provide additional scaffolding value. Finally, we suggest that WGP tags can support the efficient sequencing of BAC pools by enabling reliable assignment of sequence scaffolds to their BAC of origin, a feature that is of great interest when using BAC pooling strategies to reduce the cost of sequencing large genomes.
Collapse
Affiliation(s)
- Romain Philippe
- INRA-UBP, UMR1095, Genetics Diversity and Ecophysiology of Cereals, 234 Avenue du Brezet, 63100 Clermont- Ferrand, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Siddle HV, Deakin JE, Coggill P, Whilming LG, Harrow J, Kaufman J, Beck S, Belov K. The tammar wallaby major histocompatibility complex shows evidence of past genomic instability. BMC Genomics 2011; 12:421. [PMID: 21854592 PMCID: PMC3179965 DOI: 10.1186/1471-2164-12-421] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2010] [Accepted: 08/19/2011] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND The major histocompatibility complex (MHC) is a group of genes with a variety of roles in the innate and adaptive immune responses. MHC genes form a genetically linked cluster in eutherian mammals, an organization that is thought to confer functional and evolutionary advantages to the immune system. The tammar wallaby (Macropus eugenii), an Australian marsupial, provides a unique model for understanding MHC gene evolution, as many of its antigen presenting genes are not linked to the MHC, but are scattered around the genome. RESULTS Here we describe the 'core' tammar wallaby MHC region on chromosome 2q by ordering and sequencing 33 BAC clones, covering over 4.5 MB and containing 129 genes. When compared to the MHC region of the South American opossum, eutherian mammals and non-mammals, the wallaby MHC has a novel gene organization. The wallaby has undergone an expansion of MHC class II genes, which are separated into two clusters by the class III genes. The antigen processing genes have undergone duplication, resulting in two copies of TAP1 and three copies of TAP2. Notably, Kangaroo Endogenous Retroviral Elements are present within the region and may have contributed to the genomic instability. CONCLUSIONS The wallaby MHC has been extensively remodeled since the American and Australian marsupials last shared a common ancestor. The instability is characterized by the movement of antigen presenting genes away from the core MHC, most likely via the presence and activity of retroviral elements. We propose that the movement of class II genes away from the ancestral class II region has allowed this gene family to expand and diversify in the wallaby. The duplication of TAP genes in the wallaby MHC makes this species a unique model organism for studying the relationship between MHC gene organization and function.
Collapse
Affiliation(s)
- Hannah V Siddle
- Faculty of Veterinary Science, University of Sydney, NSW 2006, AUSTRALIA
- University of Cambridge, Department of Pathology, Cambridge CB2 1QP, UK
| | - Janine E Deakin
- ARC Centre of Excellence for Kangaroo Genomics, Research School of Biological Sciences, Australian National University, Canberra, ACT 0200, Australia
| | - Penny Coggill
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton Hall, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - Laurens G Whilming
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton Hall, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - Jennifer Harrow
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton Hall, Hinxton, Cambridgeshire, CB10 1SA, UK
| | - Jim Kaufman
- University of Cambridge, Department of Pathology, Cambridge CB2 1QP, UK
| | - Stephan Beck
- UCL Cancer Institute, University College London, London WC1E 6BT, UK
| | - Katherine Belov
- Faculty of Veterinary Science, University of Sydney, NSW 2006, AUSTRALIA
| |
Collapse
|
27
|
Schulte D, Ariyadasa R, Shi B, Fleury D, Saski C, Atkins M, deJong P, Wu CC, Graner A, Langridge P, Stein N. BAC library resources for map-based cloning and physical map construction in barley (Hordeum vulgare L.). BMC Genomics 2011; 12:247. [PMID: 21595870 PMCID: PMC3224359 DOI: 10.1186/1471-2164-12-247] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2011] [Accepted: 05/19/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Although second generation sequencing (2GS) technologies allow re-sequencing of previously gold-standard-sequenced genomes, whole genome shotgun sequencing and de novo assembly of large and complex eukaryotic genomes is still difficult. Availability of a genome-wide physical map is therefore still a prerequisite for whole genome sequencing for genomes like barley. To start such an endeavor, large insert genomic libraries, i.e. Bacterial Artificial Chromosome (BAC) libraries, which are unbiased and representing deep haploid genome coverage, need to be ready in place. RESULT Five new BAC libraries were constructed for barley (Hordeum vulgare L.) cultivar Morex. These libraries were constructed in different cloning sites (HindIII, EcoRI, MboI and BstXI) of the respective vectors. In order to enhance unbiased genome representation and to minimize the number of gaps between BAC contigs, which are often due to uneven distribution of restriction sites, a mechanically sheared library was also generated. The new BAC libraries were fully characterized in depth by scrutinizing the major quality parameters such as average insert size, degree of contamination (plate wide, neighboring, and chloroplast), empty wells and off-scale clones (clones with <30 or >250 fragments). Additionally a set of gene-based probes were hybridized to high density BAC filters and showed that genome coverage of each library is between 2.4 and 6.6 X. CONCLUSION BAC libraries representing >20 haploid genomes are available as a new resource to the barley research community. Systematic utilization of these libraries in high-throughput BAC fingerprinting should allow developing a genome-wide physical map for the barley genome, which will be instrumental for map-based gene isolation and genome sequencing.
Collapse
Affiliation(s)
- Daniela Schulte
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstr. 3, 06466 Gatersleben, Germany
- KWS SAAT AG, Grimsehlstr. 31, 37555 Einbeck, Germany
| | - Ruvini Ariyadasa
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstr. 3, 06466 Gatersleben, Germany
| | - Bujun Shi
- Australian Centre of Plant Functional Genomics, University of Adelaide, PMB 1 Glen Osmond SA 5064, Australia
| | - Delphine Fleury
- Australian Centre of Plant Functional Genomics, University of Adelaide, PMB 1 Glen Osmond SA 5064, Australia
| | - Chris Saski
- Clemson University Genomics Institute (CUGI), 51 New Cherry St. BRC 310, Clemson, SC 29634, USA
| | - Michael Atkins
- Clemson University Genomics Institute (CUGI), 51 New Cherry St. BRC 310, Clemson, SC 29634, USA
| | - Pieter deJong
- BACPAC Resources, Children's Hospital Oakland, 747 52nd St. Oakland, CA 94609, USA
| | - Cheng-Cang Wu
- Lucigen Corporation, 2120 West Greenview Dr., Middleton, WI 53562, USA
| | - Andreas Graner
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstr. 3, 06466 Gatersleben, Germany
| | - Peter Langridge
- Australian Centre of Plant Functional Genomics, University of Adelaide, PMB 1 Glen Osmond SA 5064, Australia
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstr. 3, 06466 Gatersleben, Germany
| |
Collapse
|
28
|
Slobodan J, Corbett R, Wye N, Schein JE, Marra MA, Coope RJN. High performance gel imaging with a commercial single lens reflex camera. THE REVIEW OF SCIENTIFIC INSTRUMENTS 2011; 82:034301. [PMID: 21456768 DOI: 10.1063/1.3553027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
A high performance gel imaging system was constructed using a digital single lens reflex camera with epi-illumination to image 19 × 23 cm agarose gels with up to 10,000 DNA bands each. It was found to give equivalent performance to a laser scanner in this high throughput DNA fingerprinting application using the fluorophore SYBR Green(®). The specificity and sensitivity of the imager and scanner were within 1% using the same band identification software. Low and high cost color filters were also compared and it was found that with care, good results could be obtained with inexpensive dyed acrylic filters in combination with more costly dielectric interference filters, but that very poor combinations were also possible. Methods for determining resolution, dynamic range, and optical efficiency for imagers are also proposed to facilitate comparison between systems.
Collapse
Affiliation(s)
- J Slobodan
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, Suite 100 570 W 7th Ave, Vancouver, BC V5Z 4S6, Canada
| | | | | | | | | | | |
Collapse
|
29
|
van Oeveren J, de Ruiter M, Jesse T, van der Poel H, Tang J, Yalcin F, Janssen A, Volpin H, Stormo KE, Bogden R, van Eijk MJT, Prins M. Sequence-based physical mapping of complex genomes by whole genome profiling. Genome Res 2011; 21:618-25. [PMID: 21324881 DOI: 10.1101/gr.112094.110] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
We present whole genome profiling (WGP), a novel next-generation sequencing-based physical mapping technology for construction of bacterial artificial chromosome (BAC) contigs of complex genomes, using Arabidopsis thaliana as an example. WGP leverages short read sequences derived from restriction fragments of two-dimensionally pooled BAC clones to generate sequence tags. These sequence tags are assigned to individual BAC clones, followed by assembly of BAC contigs based on shared regions containing identical sequence tags. Following in silico analysis of WGP sequence tags and simulation of a map of Arabidopsis chromosome 4 and maize, a WGP map of Arabidopsis thaliana ecotype Columbia was constructed de novo using a six-genome equivalent BAC library. Validation of the WGP map using the Columbia reference sequence confirmed that 350 BAC contigs (98%) were assembled correctly, spanning 97% of the 102-Mb calculated genome coverage. We demonstrate that WGP maps can also be generated for more complex plant genomes and will serve as excellent scaffolds to anchor genetic linkage maps and integrate whole genome sequence data.
Collapse
|
30
|
Frenkel Z, Paux E, Mester D, Feuillet C, Korol A. LTC: a novel algorithm to improve the efficiency of contig assembly for physical mapping in complex genomes. BMC Bioinformatics 2010; 11:584. [PMID: 21118513 PMCID: PMC3098104 DOI: 10.1186/1471-2105-11-584] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 11/30/2010] [Indexed: 11/25/2022] Open
Abstract
Background Physical maps are the substrate of genome sequencing and map-based cloning and their construction relies on the accurate assembly of BAC clones into large contigs that are then anchored to genetic maps with molecular markers. High Information Content Fingerprinting has become the method of choice for large and repetitive genomes such as those of maize, barley, and wheat. However, the high level of repeated DNA present in these genomes requires the application of very stringent criteria to ensure a reliable assembly with the FingerPrinted Contig (FPC) software, which often results in short contig lengths (of 3-5 clones before merging) as well as an unreliable assembly in some difficult regions. Difficulties can originate from a non-linear topological structure of clone overlaps, low power of clone ordering algorithms, and the absence of tools to identify sources of gaps in Minimal Tiling Paths (MTPs). Results To address these problems, we propose a novel approach that: (i) reduces the rate of false connections and Q-clones by using a new cutoff calculation method; (ii) obtains reliable clusters robust to the exclusion of single clone or clone overlap; (iii) explores the topological contig structure by considering contigs as networks of clones connected by significant overlaps; (iv) performs iterative clone clustering combined with ordering and order verification using re-sampling methods; and (v) uses global optimization methods for clone ordering and Band Map construction. The elements of this new analytical framework called Linear Topological Contig (LTC) were applied on datasets used previously for the construction of the physical map of wheat chromosome 3B with FPC. The performance of LTC vs. FPC was compared also on the simulated BAC libraries based on the known genome sequences for chromosome 1 of rice and chromosome 1 of maize. Conclusions The results show that compared to other methods, LTC enables the construction of highly reliable and longer contigs (5-12 clones before merging), the detection of "weak" connections in contigs and their "repair", and the elongation of contigs obtained by other assembly methods.
Collapse
Affiliation(s)
- Zeev Frenkel
- University of Haifa, Institute of Evolution, Haifa 31905, Israel.
| | | | | | | | | |
Collapse
|
31
|
Fang GC, Blackmon BP, Henry DC, Staton ME, Saski CA, Hodges SA, Tomkins JP, Luo H. Genomic tools development for Aquilegia: construction of a BAC-based physical map. BMC Genomics 2010; 11:621. [PMID: 21059242 PMCID: PMC3091760 DOI: 10.1186/1471-2164-11-621] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2010] [Accepted: 11/08/2010] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND The genus Aquilegia, consisting of approximately 70 taxa, is a member of the basal eudicot lineage, Ranuculales, which is evolutionarily intermediate between monocots and core eudicots, and represents a relatively unstudied clade in the angiosperm phylogenetic tree that bridges the gap between these two major plant groups. Aquilegia species are closely related and their distribution covers highly diverse habitats. These provide rich resources to better understand the genetic basis of adaptation to different pollinators and habitats that in turn leads to rapid speciation. To gain insights into the genome structure and facilitate gene identification, comparative genomics and whole-genome shotgun sequencing assembly, BAC-based genomics resources are of crucial importance. RESULTS BAC-based genomic resources, including two BAC libraries, a physical map with anchored markers and BAC end sequences, were established from A. formosa. The physical map was composed of a total of 50,155 BAC clones in 832 contigs and 3939 singletons, covering 21X genome equivalents. These contigs spanned a physical length of 689.8 Mb (~2.3X of the genome) suggesting the complex heterozygosity of the genome. A set of 197 markers was developed from ESTs induced by drought-stress, or involved in anthocyanin biosynthesis or floral development, and was integrated into the physical map. Among these were 87 genetically mapped markers that anchored 54 contigs, spanning 76.4 Mb (25.5%) across the genome. Analysis of a selection of 12,086 BAC end sequences (BESs) from the minimal tiling path (MTP) allowed a preview of the Aquilegia genome organization, including identification of transposable elements, simple sequence repeats and gene content. Common repetitive elements previously reported in both monocots and core eudicots were identified in Aquilegia suggesting the value of this genome in connecting the two major plant clades. Comparison with sequenced plant genomes indicated a higher similarity to grapevine (Vitis vinifera) than to rice and Arabidopsis in the transcriptomes. CONCLUSIONS The A. formosa BAC-based genomic resources provide valuable tools to study Aquilegia genome. Further integration of other existing genomics resources, such as ESTs, into the physical map should enable better understanding of the molecular mechanisms underlying adaptive radiation and elaboration of floral morphology.
Collapse
Affiliation(s)
- Guang-Chen Fang
- Department of Genetics and Biochemistry, Clemson University, SC 29634, USA
| | | | | | | | | | | | | | | |
Collapse
|
32
|
Gong S, Kus L, Heintz N. Rapid bacterial artificial chromosome modification for large-scale mouse transgenesis. Nat Protoc 2010; 5:1678-96. [PMID: 20885380 DOI: 10.1038/nprot.2010.131] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
We report here a high-throughput method for the modification of bacterial artificial chromosomes (BACs) that uses a novel two-plasmid approach. In this protocol, a vector modified in our laboratory to hold an R6Kγ origin of replication and a marker recombination cassette is inserted into a BAC in a single recombination step. Temporal control of recombination is achieved through the use of a second plasmid, pSV1.RecA, which possesses a recombinase gene and a temperature-sensitive origin of replication. This highly efficient protocol has allowed us to successfully modify more than 2,000 BACs, from which over 1,000 BAC transgenic mice have been generated. A complete cycle from BAC choice to embryo implantation takes about 5 weeks. Marker genes introduced into the mice include EGFP and EGFP-L10a. All vectors used in this project can be obtained from us by request, and the EGFP reporter mice are available through the Mutant Mouse Regional Resource Center (NINDS/GENSAT collection). CNS anatomical expression maps of the mice are available to the public at http://www.gensat.org/.
Collapse
|
33
|
Lin L, Pierce GJ, Bowers JE, Estill JC, Compton RO, Rainville LK, Kim C, Lemke C, Rong J, Tang H, Wang X, Braidotti M, Chen AH, Chicola K, Collura K, Epps E, Golser W, Grover C, Ingles J, Karunakaran S, Kudrna D, Olive J, Tabassum N, Um E, Wissotski M, Yu Y, Zuccolo A, ur Rahman M, Peterson DG, Wing RA, Wendel JF, Paterson AH. A draft physical map of a D-genome cotton species (Gossypium raimondii). BMC Genomics 2010; 11:395. [PMID: 20569427 PMCID: PMC2996926 DOI: 10.1186/1471-2164-11-395] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2010] [Accepted: 06/22/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genetically anchored physical maps of large eukaryotic genomes have proven useful both for their intrinsic merit and as an adjunct to genome sequencing. Cultivated tetraploid cottons, Gossypium hirsutum and G. barbadense, share a common ancestor formed by a merger of the A and D genomes about 1-2 million years ago. Toward the long-term goal of characterizing the spectrum of diversity among cotton genomes, the worldwide cotton community has prioritized the D genome progenitor Gossypium raimondii for complete sequencing. RESULTS A whole genome physical map of G. raimondii, the putative D genome ancestral species of tetraploid cottons was assembled, integrating genetically-anchored overgo hybridization probes, agarose based fingerprints and 'high information content fingerprinting' (HICF). A total of 13,662 BAC-end sequences and 2,828 DNA probes were used in genetically anchoring 1585 contigs to a cotton consensus genetic map, and 370 and 438 contigs, respectively to Arabidopsis thaliana (AT) and Vitis vinifera (VV) whole genome sequences. CONCLUSION Several lines of evidence suggest that the G. raimondii genome is comprised of two qualitatively different components. Much of the gene rich component is aligned to the Arabidopsis and Vitis vinifera genomes and shows promise for utilizing translational genomic approaches in understanding this important genome and its resident genes. The integrated genetic-physical map is of value both in assembling and validating a planned reference sequence.
Collapse
Affiliation(s)
- Lifeng Lin
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| | - Gary J Pierce
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - John E Bowers
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| | - James C Estill
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| | - Rosana O Compton
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Lisa K Rainville
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Changsoo Kim
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Cornelia Lemke
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Junkang Rong
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
- School of Agriculture and Food Sciences, Zhejiang Forestry University, Lin'an, Hangzhou, Zhejiang, 311300, China
| | - Haibao Tang
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
- Department of Plant and Microbiology, College of Natural Resources, University of California, Berkeley, CA, USA
| | - Xiyin Wang
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Michele Braidotti
- Arizona Genomics Institute, School of Plant Sciences and BIO5 Institute for Collaborative Research, University of Arizona, Tucson, AZ 85721, USA
| | - Amy H Chen
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Kristen Chicola
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Kristi Collura
- Arizona Genomics Institute, School of Plant Sciences and BIO5 Institute for Collaborative Research, University of Arizona, Tucson, AZ 85721, USA
| | - Ethan Epps
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Wolfgang Golser
- Arizona Genomics Institute, School of Plant Sciences and BIO5 Institute for Collaborative Research, University of Arizona, Tucson, AZ 85721, USA
| | - Corrinne Grover
- Department of Ecology, Evolution, & Organismal Biology, Iowa State University, Ames, IA 50011, USA
| | - Jennifer Ingles
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | | | - Dave Kudrna
- Arizona Genomics Institute, School of Plant Sciences and BIO5 Institute for Collaborative Research, University of Arizona, Tucson, AZ 85721, USA
| | - Jaime Olive
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Nabila Tabassum
- National Institute for Biotechnology & Genetic Engineering (NIBGE), Faisalabad, Pakistan
| | - Eareana Um
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
| | - Marina Wissotski
- Arizona Genomics Institute, School of Plant Sciences and BIO5 Institute for Collaborative Research, University of Arizona, Tucson, AZ 85721, USA
| | - Yeisoo Yu
- Arizona Genomics Institute, School of Plant Sciences and BIO5 Institute for Collaborative Research, University of Arizona, Tucson, AZ 85721, USA
| | - Andrea Zuccolo
- Arizona Genomics Institute, School of Plant Sciences and BIO5 Institute for Collaborative Research, University of Arizona, Tucson, AZ 85721, USA
| | - Mehboob ur Rahman
- National Institute for Biotechnology & Genetic Engineering (NIBGE), Faisalabad, Pakistan
| | - Daniel G Peterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
- Life Sciences & Biotechnology Institute, Mississippi State University, Mississippi State, MS 39762 USA
| | - Rod A Wing
- Arizona Genomics Institute, School of Plant Sciences and BIO5 Institute for Collaborative Research, University of Arizona, Tucson, AZ 85721, USA
| | - Jonathan F Wendel
- Department of Ecology, Evolution, & Organismal Biology, Iowa State University, Ames, IA 50011, USA
| | - Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30605, USA
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
34
|
Liu SY, Yu K, Huffner M, Park SJ, Banik M, Pauls KP, Crosby W. Construction of a BAC library and a physical map of a major QTL for CBB resistance of common bean (Phaseolus vulgaris L.). Genetica 2010; 138:709-16. [PMID: 20419470 DOI: 10.1007/s10709-010-9450-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2009] [Accepted: 02/25/2010] [Indexed: 11/29/2022]
Abstract
A major quantitative trait loci (QTL) conditioning common bacterial blight (CBB) resistance in common bean (Phaseolus vulgaris L.) lines HR45 and HR67 was derived from XAN159, a resistant line obtained from an interspecific cross between common bean lines and the tepary bean (P. acutifolius L.) line PI319443. This source of CBB resistance is widely used in bean breeding. Several other CBB resistance QTL have been identified but none of them have been physically mapped. Four molecular markers tightly linked to this QTL have been identified suitable for marker assisted selection and physical mapping of the resistance gene. A bacterial artificial chromosome (BAC) library was constructed from high molecular weight DNA of HR45 and is composed of 33,024 clones. The size of individual BAC clone inserts ranges from 30 kb to 280 kb with an average size of 107 kb. The library is estimated to represent approximately sixfold genome coverage. The BAC library was screened as BAC pools using four PCR-based molecular markers. Two to seven BAC clones were identified by each marker. Two clones were found to have both markers PV-tttc001 and STS183. One preliminary contig was assembled based on DNA finger printing of those positive BAC clones. The minimum tiling path of the contig contains 6 BAC clones spanning an estimated size of 750 kb covering the QTL region.
Collapse
Affiliation(s)
- S Y Liu
- Agriculture Agri-Food Canada, Greenhouse and Processing Crops Research Center, Harrow, ON, N0R 1G0, Canada
| | | | | | | | | | | | | |
Collapse
|
35
|
Schueler MG, Swanson W, Thomas PJ, Green ED. Adaptive evolution of foundation kinetochore proteins in primates. Mol Biol Evol 2010; 27:1585-97. [PMID: 20142441 DOI: 10.1093/molbev/msq043] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Rapid evolution is a hallmark of centromeric DNA in eukaryotic genomes. Yet, the centromere itself has a conserved functional role that is mediated by the kinetochore protein complex. To broaden our understanding about both the DNA and proteins that interact at the functional centromere, we sought to gain a detailed view of the evolutionary events that have shaped the primate kinetochore. Specifically, we performed comparative mapping and sequencing of the genomic regions encompassing the genes encoding three foundation kinetochore proteins: Centromere Proteins A, B, and C (CENP-A, CENP-B, and CENP-C). A histone H3 variant, CENP-A provides the foundation of the centromere-specific nucleosome. Comparative sequence analyses of the CENP-A gene in 14 primate species revealed encoded amino-acid residues within both the histone-fold domain and the N-terminal tail that are under strong positive selection. Similar comparative analyses of CENP-C, another foundation protein essential for centromere function, identified amino-acid residues throughout the protein under positive selection in the primate lineage, including several in the centromere localization and DNA-binding regions. Perhaps surprisingly, the gene encoding CENP-B, a kinetochore protein that binds specifically to alpha-satellite DNA, was not found to be associated with signatures of positive selection. These findings point to important and distinct evolutionary forces operating on the DNA and proteins of the primate centromere.
Collapse
Affiliation(s)
- Mary G Schueler
- Genome Technology Branch, National Institutes of Health, Bethesda, MD, USA.
| | | | | | | | | |
Collapse
|
36
|
Palti Y, Gahr SA, Purcell MK, Hadidi S, Rexroad CE, Wiens GD. Identification, characterization and genetic mapping of TLR7, TLR8a1 and TLR8a2 genes in rainbow trout (Oncorhynchus mykiss). DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2010; 34:219-233. [PMID: 19825389 DOI: 10.1016/j.dci.2009.10.002] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2009] [Revised: 10/02/2009] [Accepted: 10/05/2009] [Indexed: 05/28/2023]
Abstract
Induction of the innate immune pathways is critical for early anti-viral defense but there is limited understanding of how teleost fish recognize viral molecules and activate these pathways. In mammals, Toll-like receptors (TLR) 7 and 8 bind single-stranded RNA of viral origin and are activated by synthetic anti-viral imidazoquinoline compounds. Herein, we identify and describe the rainbow trout (Oncorhynchus mykiss) TLR7 and TLR8 gene orthologs and their mRNA expression. Two TLR7/8 loci were identified from a rainbow trout bacterial artificial chromosome (BAC) library using DNA fingerprinting and genetic linkage analyses. Direct sequencing of two representative BACs revealed intact omTLR7 and omTLR8a1 open reading frames (ORFs) located on chromosome 3 and a second locus on chromosome 22 that contains an omTLR8a2 ORF and a putative TLR7 pseudogene. We used the omTLR8a1/2 nomenclature for the two trout TLR8 genes as phylogenetic analysis revealed that they and all the other teleost TLR8 genes sequenced to date are similar to the zebrafish TLR8a, but are distinct from the zebrafish TLR8b. The duplicated trout loci exhibit conserved synteny with other fish genomes extending beyond the tandem of TLR7/8 genes. The trout TLR7 and 8a1/2 genes are composed of a single large exon similar to all other described TLR7/8 genes. The omTLR7 ORF is predicted to encode a 1049 amino acid (aa) protein with 84% similarity to the Fugu TLR7 and a conserved pattern of predicted leucine-rich repeats (LRR). The omTLR8a1 and omTLR8a2 are predicted to encode 1035- and 1034-aa proteins, respectively, and have 86% similarity to each other. omTLR8a1 is likely the ortholog of the only Atlantic salmon TLR8 gene described to date as they have 95% aa sequence similarity. The tissue expression profiles of omTLR7, omTLR8a1 and omTLR8a2 in healthy trout were highest in spleen tissue followed by anterior and then posterior kidney tissues. Rainbow trout anterior kidney leukocytes produced elevated levels of pro-inflammatory and type I interferon cytokines mRNA in response to stimulation with the human TLR7/8 agonist R848 or the TLR3 agonist poly I:C. Only poly I:C-induced IFN2 transcription was significantly suppressed in the presence of chloroquine, a compound known to block endosomal acidification and inhibit endosomal maturation. The effect of chloroquine on R848-induced cytokine expression was equivocal and so it remains questionable whether rainbow trout recognition of R848 requires endosomal maturation. TLR7 and TLR8a1 expression levels in rainbow trout anterior kidney leukocytes were not affected by poly I:C or R848 treatments, but surprisingly, TLR8a2 expression was moderately down-regulated by R848. The down-regulation of omTLR8a2 may imply that this gene has evolved to a new or altered function in rainbow trout, as often occurs when the two duplicated genes remain active.
Collapse
Affiliation(s)
- Yniv Palti
- United States Department of Agriculture, Agriculture Research Service, National Center for Cool and Cold Water Aquaculture, Kearneysville, WV 25430, USA.
| | | | | | | | | | | |
Collapse
|
37
|
Muñoz-Torres MC, Saski C, Blackmon B, Romero-Severson J, Werren JH. Development of bacterial artificial chromosome library resources for parasitoid Hymenoptera (Nasonia vitripennis and Nasonia giraulti: Pteromalidae). INSECT MOLECULAR BIOLOGY 2010; 19 Suppl 1:181-187. [PMID: 20167027 DOI: 10.1111/j.1365-2583.2009.00962.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
The species of the genus Nasonia possess qualities that make them excellent candidates for genetic and genomic studies. To increase the wealth of genomic resources for the genus we constructed publicly available bacterial artificial chromosome (BAC) libraries for Nasonia vitripennis and Nasonia giraulti. Libraries have 36 864 clones each, empty-vector contents of approximately 2% and average insert sizes of 113.1 and 97.7 Kb, respectively, representing 12 and 11 genome equivalents. The N. vitripennis library was used for genome sequence assembly and in efforts at positional cloning of a developmental gene. The genome assembly of N. vitripennis is currently composed on 6181 un-joined scaffolds. These BAC libraries can be used to identify and close regions between scaffolds of the genome assemblies of both species.
Collapse
|
38
|
New tools for investigating the comparative biology of Caenorhabditis briggsae and C. elegans. Genetics 2009; 184:853-63. [PMID: 20008572 DOI: 10.1534/genetics.109.110270] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Comparative studies of Caenorhabditis briggsae and C. elegans have provided insights into gene function and developmental control in both organisms. C. elegans is a well developed model organism with a variety of molecular and genetic tools to study gene functions. In contrast, there are only very limited tools available for its closest relative, C. briggsae. To take advantage of the full potential of this comparative approach, we have developed several genetic and molecular tools to facilitate functional analysis in C. briggsae. First, we designed and implemented an SNP-based oligonucleotide microarray for rapid mapping of genetic mutants in C. briggsae. Second, we generated a mutagenized frozen library to permit the isolation of targeted deletions and used the library to recover a deletion mutant of cbr-unc-119 for use as a transgenic marker. Third, we used the cbr-unc-119 mutant in ballistic transformation and generated fluorescently labeled strains that allow automated lineaging and cellular resolution expression analysis. Finally, we demonstrated the potential of automated lineaging by profiling expression of egl-5, hlh-1, and pha-4 at cellular resolution and by detailed phenotyping of the perturbations on the Wnt signaling pathway. These additions to the experimental toolkit for C. briggsae should greatly increase its utility in comparative studies with C. elegans. With the emerging sequence of nematode species more closely related to C. briggsae, these tools may open novel avenues of experimentation in C. briggsae itself.
Collapse
|
39
|
Kronmiller BA, Wise RP. Computational finishing of large sequence contigs reveals interspersed nested repeats and gene islands in the rf1-associated region of maize. PLANT PHYSIOLOGY 2009; 151:483-495. [PMID: 19675151 PMCID: PMC2754626 DOI: 10.1104/pp.109.143370] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2009] [Accepted: 08/03/2009] [Indexed: 05/26/2023]
Abstract
The architecture of grass genomes varies on multiple levels. Large long terminal repeat retrotransposon clusters occupy significant portions of the intergenic regions, and islands of protein-encoding genes are interspersed among the repeat clusters. Hence, advanced assembly techniques are required to obtain completely finished genomes as well as to investigate gene and transposable element distributions. To characterize the organization and distribution of repeat clusters and gene islands across large grass genomes, we present 961- and 594-kb contiguous sequence contigs associated with the rf1 (for restorer of fertility1) locus in the near-centromeric region of maize (Zea mays) chromosome 3. We present two methods for computational finishing of highly repetitive bacterial artificial chromosome clones that have proved successful to close all sequence gaps caused by transposable element insertions. Sixteen repeat clusters were observed, ranging in length from 23 to 155 kb. These repeat clusters are almost exclusively long terminal repeat retrotransposons, of which the paleontology of insertion varies throughout the cluster. Gene islands contain from one to four predicted genes, resulting in a gene density of one gene per 16 kb in gene islands and one gene per 111 kb over the entire sequenced region. The two sequence contigs, when compared with the rice (Oryza sativa) and sorghum (Sorghum bicolor) genomes, retain gene colinearity of 50% and 71%, respectively, and 70% and 100%, respectively, for high-confidence gene models. Collinear genes on single gene islands show that while most expansion of the maize genome has occurred in the repeat clusters, gene islands are not immune and have experienced growth in both intragene and intergene locations.
Collapse
Affiliation(s)
- Brent A Kronmiller
- Bioinformatics and Computational Biology, Iowa State University, Ames, Iowa 50011-1020, USA
| | | |
Collapse
|
40
|
Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. Circos: an information aesthetic for comparative genomics. Genome Res 2009; 19:1639-45. [PMID: 19541911 DOI: 10.1101/gr.092759.109] [Citation(s) in RCA: 7394] [Impact Index Per Article: 462.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We created a visualization tool called Circos to facilitate the identification and analysis of similarities and differences arising from comparisons of genomes. Our tool is effective in displaying variation in genome structure and, generally, any other kind of positional relationships between genomic intervals. Such data are routinely produced by sequence alignments, hybridization arrays, genome mapping, and genotyping studies. Circos uses a circular ideogram layout to facilitate the display of relationships between pairs of positions by the use of ribbons, which encode the position, size, and orientation of related genomic elements. Circos is capable of displaying data as scatter, line, and histogram plots, heat maps, tiles, connectors, and text. Bitmap or vector images can be created from GFF-style data inputs and hierarchical configuration files, which can be easily generated by automated tools, making Circos suitable for rapid deployment in data analysis and reporting pipelines.
Collapse
Affiliation(s)
- Martin Krzywinski
- Canada's Michael Smith Genome Sciences Center, Vancouver, British Columbia V5Z 4S6, Canada.
| | | | | | | | | | | | | | | |
Collapse
|
41
|
Bartoszewski G, Gawronski P, Szklarczyk M, Verbakel H, Havey MJ. A one-megabase physical map provides insights on gene organization in the enormous mitochondrial genome of cucumber. Genome 2009; 52:299-307. [DOI: 10.1139/g09-006] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Cucumber ( Cucumis sativus ) has one of the largest mitochondrial genomes known among all eukaryotes, due in part to the accumulation of short 20 to 60 bp repetitive DNA motifs. Recombination among these repetitive DNAs produces rearrangements affecting organization and expression of mitochondrial genes. To more efficiently identify rearrangements in the cucumber mitochondrial DNA, we built two nonoverlapping 800 and 220 kb BAC contigs and assigned major mitochondrial genes to these BACs. Polymorphism carried on the largest BAC contig was used to confirm paternal transmission. Mitochondrial genes were distributed across BACs and physically distant, although occasional clustering was observed. Introns in the nad1, nad4, and nad7 genes were larger than those reported in other plants, due in part to accumulation of short repetitive DNAs and indicating that increased intron sizes contributed to mitochondrial genome expansion in cucumber. Mitochondrial genes atp6 and atp9 are physically close to each other and cotranscribed. These physical contigs will be useful for eventual sequencing of the cucumber mitochondrial DNA, which can be exploited to more efficiently screen for unique rearrangements affecting mitochondrial gene expression.
Collapse
Affiliation(s)
- Grzegorz Bartoszewski
- Department of Plant Genetics, Breeding, and Biotechnology, Faculty of Horticulture and Landscape Architecture, Warsaw University of the Life Sciences, ul. Nowoursynowska 159, 02-776 Warsaw, Poland
- Department of Genetics, Plant Breeding, and Seed Production, Agricultural University of Krakow, al. 29 Listopada 54, 31-425 Krakow, Poland
- Keygene N.V, Agro Business Park 90, 6708 PW Wageningen, The Netherlands
- US Department of Agriculture, Department of Horticulture, Agricultural Research Service, Vegetable Crops Unit, 1575 Linden Drive, University of Wisconsin, Madison, WI 53706, USA
| | - Piotr Gawronski
- Department of Plant Genetics, Breeding, and Biotechnology, Faculty of Horticulture and Landscape Architecture, Warsaw University of the Life Sciences, ul. Nowoursynowska 159, 02-776 Warsaw, Poland
- Department of Genetics, Plant Breeding, and Seed Production, Agricultural University of Krakow, al. 29 Listopada 54, 31-425 Krakow, Poland
- Keygene N.V, Agro Business Park 90, 6708 PW Wageningen, The Netherlands
- US Department of Agriculture, Department of Horticulture, Agricultural Research Service, Vegetable Crops Unit, 1575 Linden Drive, University of Wisconsin, Madison, WI 53706, USA
| | - Marek Szklarczyk
- Department of Plant Genetics, Breeding, and Biotechnology, Faculty of Horticulture and Landscape Architecture, Warsaw University of the Life Sciences, ul. Nowoursynowska 159, 02-776 Warsaw, Poland
- Department of Genetics, Plant Breeding, and Seed Production, Agricultural University of Krakow, al. 29 Listopada 54, 31-425 Krakow, Poland
- Keygene N.V, Agro Business Park 90, 6708 PW Wageningen, The Netherlands
- US Department of Agriculture, Department of Horticulture, Agricultural Research Service, Vegetable Crops Unit, 1575 Linden Drive, University of Wisconsin, Madison, WI 53706, USA
| | - Henk Verbakel
- Department of Plant Genetics, Breeding, and Biotechnology, Faculty of Horticulture and Landscape Architecture, Warsaw University of the Life Sciences, ul. Nowoursynowska 159, 02-776 Warsaw, Poland
- Department of Genetics, Plant Breeding, and Seed Production, Agricultural University of Krakow, al. 29 Listopada 54, 31-425 Krakow, Poland
- Keygene N.V, Agro Business Park 90, 6708 PW Wageningen, The Netherlands
- US Department of Agriculture, Department of Horticulture, Agricultural Research Service, Vegetable Crops Unit, 1575 Linden Drive, University of Wisconsin, Madison, WI 53706, USA
| | - Michael J. Havey
- Department of Plant Genetics, Breeding, and Biotechnology, Faculty of Horticulture and Landscape Architecture, Warsaw University of the Life Sciences, ul. Nowoursynowska 159, 02-776 Warsaw, Poland
- Department of Genetics, Plant Breeding, and Seed Production, Agricultural University of Krakow, al. 29 Listopada 54, 31-425 Krakow, Poland
- Keygene N.V, Agro Business Park 90, 6708 PW Wageningen, The Netherlands
- US Department of Agriculture, Department of Horticulture, Agricultural Research Service, Vegetable Crops Unit, 1575 Linden Drive, University of Wisconsin, Madison, WI 53706, USA
| |
Collapse
|
42
|
Nicotra ML, Powell AE, Rosengarten RD, Moreno M, Grimwood J, Lakkis FG, Dellaporta SL, Buss LW. A hypervariable invertebrate allodeterminant. Curr Biol 2009; 19:583-9. [PMID: 19303297 DOI: 10.1016/j.cub.2009.02.040] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2009] [Revised: 02/08/2009] [Accepted: 02/09/2009] [Indexed: 11/25/2022]
Abstract
Colonial marine invertebrates, such as sponges, corals, bryozoans, and ascidians, often live in densely populated communities where they encounter other members of their species as they grow over their substratum. Such encounters typically lead to a natural histocompatibility response in which colonies either fuse to become a single, chimeric colony or reject and aggressively compete for space. These allorecognition phenomena mediate intraspecific competition, support allotypic diversity, control the level at which selection acts, and resemble allogeneic interactions in pregnancy and transplantation. Despite the ubiquity of allorecognition in colonial phyla, however, its molecular basis has not been identified beyond what is currently known about histocompatibility in vertebrates and protochordates. We positionally cloned an allorecognition gene by using inbred strains of the cnidarian, Hydractinia symbiolongicarpus, which is a model system for the study of invertebrate allorecognition. The gene identified encodes a putative transmembrane receptor expressed in all tissues capable of allorecognition that is highly polymorphic and predicts allorecognition responses in laboratory and field-derived strains. This study reveals that a previously undescribed hypervariable molecule bearing three extracellular domains with greatest sequence similarity to the immunoglobulin superfamily is an allodeterminant in a lower metazoan.
Collapse
Affiliation(s)
- Matthew L Nicotra
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| | | | | | | | | | | | | | | |
Collapse
|
43
|
Xia Z, Watanabe S, Chen Q, Sato S, Harada K. A novel manual pooling system for preparing three-dimensional pools of a deep coverage soybean bacterial artificial chromosome library. Mol Ecol Resour 2009; 9:516-24. [PMID: 21564681 DOI: 10.1111/j.1755-0998.2008.02503.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Compared with hybridization-based techniques, polymerase chain reaction-based screening of large insert libraries has been used widely as it is fast, easy and sensitive. However, various pooling strategies are needed to ensure efficient screening. It is time-consuming and labourious to prepare three-dimensional pools for a deep coverage bacterial artificial chromosome (BAC) library of soybean (1.12 × 10(9) bp) in the absence of robotic facility. In the present study, we describe a novel manual pooling system for preparing three-dimensional pools of a soybean BAC library. This simple technique enables a single researcher to construct three-dimensional pools for a deep-coverage (12 haploid genome equivalents) BAC library of soybean in less than 2 months without any robotic manipulation. When the prepared three-dimensional pools were screened with 29 polymerase chain reaction-based markers, an average of 9.2 clones per marker were identified. These identified clones will be useful either in quantitative trait loci gene isolation or in synteny study between soybean and other legumes including Lotus japonicus. This efficient pooling system could be applied to any other BAC libraries without the need for robotic manipulation.
Collapse
Affiliation(s)
- Zhengjun Xia
- National Institute of Agrobiological Sciences, 2-1-2 Kannondai, Tsukuba, Ibaraki 305-8602, Japan Faculty of Horticulture, Chiba University, 648 Matsudo, Chiba 271-8510, Japan Soybean Research Institute, Key Laboratory of Soybean Biology, Ministry of Education, Northeast Agricultural University Harbin, Harbin 150030, China Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba 292-0818, Japan
| | | | | | | | | |
Collapse
|
44
|
Scheibye-Alsing K, Hoffmann S, Frankel A, Jensen P, Stadler PF, Mang Y, Tommerup N, Gilchrist MJ, Nygård AB, Cirera S, Jørgensen CB, Fredholm M, Gorodkin J. Sequence assembly. Comput Biol Chem 2008; 33:121-36. [PMID: 19152793 DOI: 10.1016/j.compbiolchem.2008.11.003] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2008] [Revised: 11/28/2008] [Accepted: 11/28/2008] [Indexed: 01/20/2023]
Abstract
Despite the rapidly increasing number of sequenced and re-sequenced genomes, many issues regarding the computational assembly of large-scale sequencing data have remain unresolved. Computational assembly is crucial in large genome projects as well for the evolving high-throughput technologies and plays an important role in processing the information generated by these methods. Here, we provide a comprehensive overview of the current publicly available sequence assembly programs. We describe the basic principles of computational assembly along with the main concerns, such as repetitive sequences in genomic DNA, highly expressed genes and alternative transcripts in EST sequences. We summarize existing comparisons of different assemblers and provide a detailed descriptions and directions for download of assembly programs at: http://genome.ku.dk/resources/assembly/methods.html.
Collapse
Affiliation(s)
- K Scheibye-Alsing
- Division of Genetics and Bioinformatics, IBHV, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Tsipouri V, Schueler MG, Hu S, Dutra A, Pak E, Riethman H, Green ED. Comparative sequence analyses reveal sites of ancestral chromosomal fusions in the Indian muntjac genome. Genome Biol 2008; 9:R155. [PMID: 18957082 PMCID: PMC2760882 DOI: 10.1186/gb-2008-9-10-r155] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2008] [Revised: 10/15/2008] [Accepted: 10/28/2008] [Indexed: 01/22/2023] Open
Abstract
Comparative mapping and sequencing was used to characterize the sites of ancestral chromosomal fusions in the Indian muntjac genome. Background Indian muntjac (Muntiacus muntjak vaginalis) has an extreme mammalian karyotype, with only six and seven chromosomes in the female and male, respectively. Chinese muntjac (Muntiacus reevesi) has a more typical mammalian karyotype, with 46 chromosomes in both sexes. Despite this disparity, the two muntjac species are morphologically similar and can even interbreed to produce viable (albeit sterile) offspring. Previous studies have suggested that a series of telocentric chromosome fusion events involving telomeric and/or satellite repeats led to the extant Indian muntjac karyotype. Results We used a comparative mapping and sequencing approach to characterize the sites of ancestral chromosomal fusions in the Indian muntjac genome. Specifically, we screened an Indian muntjac bacterial artificial-chromosome library with a telomere repeat-specific probe. Isolated clones found by fluorescence in situ hybridization to map to interstitial regions on Indian muntjac chromosomes were further characterized, with a subset then subjected to shotgun sequencing. Subsequently, we isolated and sequenced overlapping clones extending from the ends of some of these initial clones; we also generated orthologous sequence from isolated Chinese muntjac clones. The generated Indian muntjac sequence has been analyzed for the juxtaposition of telomeric and satellite repeats and for synteny relationships relative to other mammalian genomes, including the Chinese muntjac. Conclusions The generated sequence data and comparative analyses provide a detailed genomic context for seven ancestral chromosome fusion sites in the Indian muntjac genome, which further supports the telocentric fusion model for the events leading to the unusual karyotypic differences among muntjac species.
Collapse
Affiliation(s)
- Vicky Tsipouri
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | | | |
Collapse
|
46
|
Integration of cytogenetic and genetic linkage maps unveils the physical architecture of tomato chromosome 2. Genetics 2008; 179:1211-20. [PMID: 18622034 DOI: 10.1534/genetics.108.089532] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We report the integration of the linkage map of tomato chromosome 2 with a high-density bacterial artificial chromosome fluorescence in situ hybridization (BAC-FISH)-based cytogenetic map. The euchromatic block of chromosome 2 resides between 13 and 142 cM and has a physical length of 48.12 microm, with 1 microm equivalent to 540 kb. BAC-FISH resolved a pair of loci that were 3.7-3.9 Mb apart and were not resolved on the linkage map. Most of the regions had crossover densities close to the mean of approximately 200 kb/cM. Relatively hot and cold spots of recombination were unevenly distributed along the chromosome. The distribution of centimorgan/micrometer values was similar to the previously reported recombination nodule distribution along the pachytene chromosome. FISH-based physical maps will play an important role in advanced genomics research for tomato, including map-based cloning of agronomically important traits and whole-genome sequencing.
Collapse
|
47
|
Jakše J, Meyer JDF, Suzuki G, McCallum J, Cheung F, Town CD, Havey MJ. Pilot sequencing of onion genomic DNA reveals fragments of transposable elements, low gene densities, and significant gene enrichment after methyl filtration. Mol Genet Genomics 2008; 280:287-92. [DOI: 10.1007/s00438-008-0364-z] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2008] [Accepted: 06/22/2008] [Indexed: 10/21/2022]
|
48
|
Identification and characterisation of structural maintenance of chromosome 1 (smc1) mutants of Coprinopsis cinerea. Mol Genet Genomics 2008; 280:223-32. [DOI: 10.1007/s00438-008-0358-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2008] [Accepted: 06/11/2008] [Indexed: 11/26/2022]
|
49
|
Mun JH, Kwon SJ, Yang TJ, Kim HS, Choi BS, Baek S, Kim JS, Jin M, Kim JA, Lim MH, Lee SI, Kim HI, Kim H, Lim YP, Park BS. The first generation of a BAC-based physical map of Brassica rapa. BMC Genomics 2008; 9:280. [PMID: 18549474 PMCID: PMC2432078 DOI: 10.1186/1471-2164-9-280] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2007] [Accepted: 06/12/2008] [Indexed: 11/30/2022] Open
Abstract
Background The genus Brassica includes the most extensively cultivated vegetable crops worldwide. Investigation of the Brassica genome presents excellent challenges to study plant genome evolution and divergence of gene function associated with polyploidy and genome hybridization. A physical map of the B. rapa genome is a fundamental tool for analysis of Brassica "A" genome structure. Integration of a physical map with an existing genetic map by linking genetic markers and BAC clones in the sequencing pipeline provides a crucial resource for the ongoing genome sequencing effort and assembly of whole genome sequences. Results A genome-wide physical map of the B. rapa genome was constructed by the capillary electrophoresis-based fingerprinting of 67,468 Bacterial Artificial Chromosome (BAC) clones using the five restriction enzyme SNaPshot technique. The clones were assembled into contigs by means of FPC v8.5.3. After contig validation and manual editing, the resulting contig assembly consists of 1,428 contigs and is estimated to span 717 Mb in physical length. This map provides 242 anchored contigs on 10 linkage groups to be served as seed points from which to continue bidirectional chromosome extension for genome sequencing. Conclusion The map reported here is the first physical map for Brassica "A" genome based on the High Information Content Fingerprinting (HICF) technique. This physical map will serve as a fundamental genomic resource for accelerating genome sequencing, assembly of BAC sequences, and comparative genomics between Brassica genomes. The current build of the B. rapa physical map is available at the B. rapa Genome Project website for the user community.
Collapse
Affiliation(s)
- Jeong-Hwan Mun
- Brassica Genomics Team, National Institute of Agricultural Biotechnology, Rural Development Administration, 225 Seodun-dong, Gwonseon-gu, Suwon 441-707, South Korea.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Yang XW, Gong S. An overview on the generation of BAC transgenic mice for neuroscience research. ACTA ACUST UNITED AC 2008; Chapter 5:Unit 5.20. [PMID: 18428622 DOI: 10.1002/0471142301.ns0520s31] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
This unit provides a comprehensive overview on the generation of transgenic mice using bacterial artificial chromosomes (BACs), and the application of BAC transgenic mice in neuroscience research. In the first section, advantages of the BAC transgenic approach compared to the conventional transgenic approach are summarized. In the second section, important considerations in designing BAC transgenic constructs are outlined. Four commonly used BAC transgenic construct designs are also outlined. Concepts of modifying BACs by homologous recombination in E. coli to introduce a variety of mutations into BACs, and important steps to characterize a modified BAC prior to the generation of transgenic mice are also presented. In the final section, some of the important applications of BAC transgenic mice in neuroscience research, including studying gene expression, gene function, mapping neuronal circuitry, and modeling human diseases, are described.
Collapse
Affiliation(s)
- X William Yang
- Department of Psychiatry and Biobehavioral Sciences, Center for Neurobehavioral Genetics, Neuropsychiatric Institute, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | | |
Collapse
|