1
|
Fan H, Wu Q, Wei F, Yang F, Ng BL, Hu Y. Chromosome-level genome assembly for giant panda provides novel insights into Carnivora chromosome evolution. Genome Biol 2019; 20:267. [PMID: 31810476 PMCID: PMC6898958 DOI: 10.1186/s13059-019-1889-7] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Accepted: 11/15/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Chromosome evolution is an important driver of speciation and species evolution. Previous studies have detected chromosome rearrangement events among different Carnivora species using chromosome painting strategies. However, few of these studies have focused on chromosome evolution at a nucleotide resolution due to the limited availability of chromosome-level Carnivora genomes. Although the de novo genome assembly of the giant panda is available, current short read-based assemblies are limited to moderately sized scaffolds, making the study of chromosome evolution difficult. RESULTS Here, we present a chromosome-level giant panda draft genome with a total size of 2.29 Gb. Based on the giant panda genome and published chromosome-level dog and cat genomes, we conduct six large-scale pairwise synteny alignments and identify evolutionary breakpoint regions. Interestingly, gene functional enrichment analysis shows that for all of the three Carnivora genomes, some genes located in evolutionary breakpoint regions are significantly enriched in pathways or terms related to sensory perception of smell. In addition, we find that the sweet receptor gene TAS1R2, which has been proven to be a pseudogene in the cat genome, is located in an evolutionary breakpoint region of the giant panda, suggesting that interchromosomal rearrangement may play a role in the cat TAS1R2 pseudogenization. CONCLUSIONS We show that the combined strategies employed in this study can be used to generate efficient chromosome-level genome assemblies. Moreover, our comparative genomics analyses provide novel insights into Carnivora chromosome evolution, linking chromosome evolution to functional gene evolution.
Collapse
Affiliation(s)
- Huizhong Fan
- CAS Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Qi Wu
- CAS Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Fuwen Wei
- CAS Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Fengtang Yang
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Bee Ling Ng
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Yibo Hu
- CAS Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.
| |
Collapse
|
2
|
Koch EM, Schweizer RM, Schweizer TM, Stahler DR, Smith DW, Wayne RK, Novembre J. De Novo Mutation Rate Estimation in Wolves of Known Pedigree. Mol Biol Evol 2019; 36:2536-2547. [PMID: 31297530 PMCID: PMC6805234 DOI: 10.1093/molbev/msz159] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2019] [Revised: 06/18/2019] [Accepted: 06/24/2019] [Indexed: 12/11/2022] Open
Abstract
Knowledge of mutation rates is crucial for calibrating population genetics models of demographic history in units of years. However, mutation rates remain challenging to estimate because of the need to identify extremely rare events. We estimated the nuclear mutation rate in wolves by identifying de novo mutations in a pedigree of seven wolves. Putative de novo mutations were discovered by whole-genome sequencing and were verified by Sanger sequencing of parents and offspring. Using stringent filters and an estimate of the false negative rate in the remaining observable genome, we obtain an estimate of ∼4.5 × 10-9 per base pair per generation and provide conservative bounds between 2.6 × 10-9 and 7.1 × 10-9. Although our estimate is consistent with recent mutation rate estimates from ancient DNA (4.0 × 10-9 and 3.0-4.5 × 10-9), it suggests a wider possible range. We also examined the consequences of our rate and the accompanying interval for dating several critical events in canid demographic history. For example, applying our full range of rates to coalescent models of dog and wolf demographic history implies a wide set of possible divergence times between the ancestral populations of dogs and extant Eurasian wolves (16,000-64,000 years ago) although our point estimate indicates a date between 25,000 and 33,000 years ago. Aside from one study in mice, ours provides the only direct mammalian mutation rate outside of primates and is likely to be vital to future investigations of mutation rate evolution.
Collapse
Affiliation(s)
- Evan M Koch
- Department of Ecology and Evolution, University of Chicago, Chicago, IL
| | - Rena M Schweizer
- Division of Biological Sciences, University of Montana, Missoula, MT
| | - Teia M Schweizer
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA
- Department of Biology, Colorado State University, Fort Collins, CO
| | - Daniel R Stahler
- Yellowstone Center for Resources, National Park Service, Yellowstone National Park, WY
| | - Douglas W Smith
- Yellowstone Center for Resources, National Park Service, Yellowstone National Park, WY
| | - Robert K Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA
| | - John Novembre
- Department of Ecology and Evolution, University of Chicago, Chicago, IL
- Department of Human Genetics, University of Chicago, Chicago, IL
| |
Collapse
|
3
|
de Oliveira EA, Bertollo LAC, Rab P, Ezaz T, Yano CF, Hatanaka T, Jegede OI, Tanomtong A, Liehr T, Sember A, Maruyama SR, Feldberg E, Viana PF, Cioffi MDB. Cytogenetics, genomics and biodiversity of the South American and African Arapaimidae fish family (Teleostei, Osteoglossiformes). PLoS One 2019; 14:e0214225. [PMID: 30908514 PMCID: PMC6433368 DOI: 10.1371/journal.pone.0214225] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Accepted: 03/09/2019] [Indexed: 11/19/2022] Open
Abstract
Osteoglossiformes represents one of the most ancestral teleost lineages, currently widespread over almost all continents, except for Antarctica. However, data involving advanced molecular cytogenetics or comparative genomics are yet largely limited for this fish group. Therefore, the present investigations focus on the osteoglossiform family Arapaimidae, studying a unique fish model group with advanced molecular cytogenetic genomic tools. The aim is to better explore and clarify certain events and factors that had impact on evolutionary history of this fish group. For that, both South American and African representatives of Arapaimidae, namely Arapaima gigas and Heterotis niloticus, were examined. Both species differed markedly by diploid chromosome numbers, with 2n = 56 found in A. gigas and 2n = 40 exhibited by H. niloticus. Conventional cytogenetics along with fluorescence in situ hybridization revealed some general trends shared by most osteoglossiform species analyzed thus far, such as the presence of only one chromosome pair bearing 18S and 5S rDNA sites and karyotypes dominated by acrocentric chromosomes, resembling thus the patterns of hypothetical ancestral teleost karyotype. Furthermore, the genomes of A. gigas and H. niloticus display remarkable divergence in terms of repetitive DNA content and distribution, as revealed by comparative genomic hybridization (CGH). On the other hand, genomic diversity of single copy sequences studied through principal component analyses (PCA) based on SNP alleles genotyped by the DArT seq procedure demonstrated a very low genetic distance between the South American and African Arapaimidae species; this pattern contrasts sharply with the scenario found in other osteoglossiform species. Underlying evolutionary mechanisms potentially explaining the obtained data have been suggested and discussed.
Collapse
Affiliation(s)
- Ezequiel Aguiar de Oliveira
- Departamento de Genética e Evolução, Universidade Federal de São Carlos (UFSCar), Rodovia Washington Luiz, São Carlos, SP, Brazil
- Secretaria de Estado de Educação de Mato Grosso–SEDUC-MT, Cuiabá, MT, Brazil
| | - Luiz Antonio Carlos Bertollo
- Departamento de Genética e Evolução, Universidade Federal de São Carlos (UFSCar), Rodovia Washington Luiz, São Carlos, SP, Brazil
| | - Petr Rab
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Czech Republic
| | - Tariq Ezaz
- Institute for Applied Ecology, University of Canberra, Canberra, Australia
| | - Cassia Fernanda Yano
- Departamento de Genética e Evolução, Universidade Federal de São Carlos (UFSCar), Rodovia Washington Luiz, São Carlos, SP, Brazil
| | - Terumi Hatanaka
- Departamento de Genética e Evolução, Universidade Federal de São Carlos (UFSCar), Rodovia Washington Luiz, São Carlos, SP, Brazil
| | | | - Alongklod Tanomtong
- Toxic Substances in Livestock and Aquatic Animals Research Group, KhonKaen University, Muang, KhonKaen, Thailand
| | - Thomas Liehr
- Institute of Human Genetics, University Hospital Jena, Jena, Germany
| | - Alexandr Sember
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Czech Republic
| | - Sandra Regina Maruyama
- Departamento de Genética e Evolução, Universidade Federal de São Carlos (UFSCar), Rodovia Washington Luiz, São Carlos, SP, Brazil
| | - Eliana Feldberg
- Instituto Nacional de Pesquisas da Amazônia, Coordenação de Biodiversidade, Laboratório de Genética Animal, Petrópolis, CEP: Manaus, AM, Brazil
| | - Patrik Ferreira Viana
- Instituto Nacional de Pesquisas da Amazônia, Coordenação de Biodiversidade, Laboratório de Genética Animal, Petrópolis, CEP: Manaus, AM, Brazil
| | - Marcelo de Bello Cioffi
- Departamento de Genética e Evolução, Universidade Federal de São Carlos (UFSCar), Rodovia Washington Luiz, São Carlos, SP, Brazil
| |
Collapse
|
4
|
Martinez PA, Jacobina UP, Fernandes RV, Brito C, Penone C, Amado TF, Fonseca CR, Bidau CJ. A comparative study on karyotypic diversification rate in mammals. Heredity (Edinb) 2016; 118:366-373. [PMID: 27804966 DOI: 10.1038/hdy.2016.110] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2016] [Revised: 09/19/2016] [Accepted: 09/27/2016] [Indexed: 01/23/2023] Open
Abstract
Chromosomal rearrangements have a relevant role in organismic evolution. However, little is known about the mechanisms that lead different phylogenetic clades to have different chromosomal rearrangement rates. Here, we investigate the causes behind the wide karyotypic diversity exhibited by mammals. In particular, we analyzed the role of metabolic, reproductive, biogeographic and genomic characteristics on the rates of macro- and microstructural karyotypic diversification (rKD) using comparative phylogenetic methods. We found evidence that reproductive characteristics such as larger litter size per year and longevity, by allowing a higher number of meioses in absolute time, favor a higher probability of chromosomal change. Furthermore, families with large geographic distributions but containing species with restricted geographic ranges showed a greater probability of fixation of macrostructural chromosomal changes in different geographic areas. Finally, rKD does not evolve by Brownian motion because the mutation rate depends on the concerted evolution of repetitive sequences. The decisive factors of rKD evolution will be natural selection, genetic drift and meiotic drive that will eventually allow or not the fixation of the rearrangements. Our results indicate that mammalian karyotypic diversity is influenced by historical and adaptive mechanisms where reproductive and genomic factors modulate the rate of chromosomal change.
Collapse
Affiliation(s)
- P A Martinez
- PIBi Lab-Laboratorio de Pesquisas Integrativas em Biodiversidade, Pós-Graduação em Ecologia e Conservação, Universidade Federal de Sergipe, São Cristovão, Brazil
| | | | - R V Fernandes
- Departamento de Ecologia, Universidade Federal do Rio Grande do Norte, Natal, Brazil
| | - C Brito
- Departamento de Ecologia, Universidade Federal do Rio Grande do Norte, Natal, Brazil
| | - C Penone
- Institute of Plant Science, University of Bern, Bern, Switzerland
| | - T F Amado
- BioMa-Biodiversity and Macroecology Lab, Department of Biology and Geology, Physics and Inorganic Chemistry, Rey Juan Carlos University, Mostoles, Spain
| | - C R Fonseca
- Departamento de Ecologia, Universidade Federal do Rio Grande do Norte, Natal, Brazil
| | - C J Bidau
- Paraná y Los Claveles, Garupá, Argentina
| |
Collapse
|
5
|
Adler PH, Yildirim A, Onder Z, Tasci GT, Duzlu O, Arslan MO, Ciloglu A, Sari B, Parmaksizoglu N, Inci A. Rearrangement hotspots in the sex chromosome of the Palearctic black fly Simulium bergi (Diptera, Simuliidae). Comp Cytogenet 2016; 10:295-310. [PMID: 27551350 PMCID: PMC4977804 DOI: 10.3897/compcytogen.v10i2.8855] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Accepted: 06/01/2016] [Indexed: 06/06/2023]
Abstract
An extreme example of nonrandom rearrangements, especially inversion breaks, is described in the polytene chromosomes of the black fly Simulium bergi Rubtsov, 1956 from Armenia and Turkey. A total of 48 rearrangements was discovered, relative to the standard banding sequence for the subgenus Simulium Latreille, 1802. One rearrangement, an inversion (IIS-C) in the short arm of the second chromosome, was fixed. Six (12.5%) of the rearrangements were autosomal polymorphisms, and the remaining 41 (85.4%) were sex linked. More than 40 X- and Y-linked rearrangements, predominantly inversions, were clustered in the long arm of the second chromosome (IIL), representing about 15% of the total complement. The pattern conforms to a nonrandom model of chromosome breakage, perhaps associated with an underlying molecular mechanism.
Collapse
Affiliation(s)
- Peter H. Adler
- Department of Agricultural and Environmental Sciences, 130 McGinty Court, Clemson University, Clemson, SC 29634-0310 USA
| | - Alparslan Yildirim
- Department of Parasitology, Faculty of Veterinary Medicine, Erciyes University, Kayseri, Turkey
| | - Zuhal Onder
- Department of Parasitology, Faculty of Veterinary Medicine, Erciyes University, Kayseri, Turkey
| | - G. Taskin Tasci
- Department of Parasitology, Faculty of Veterinary Medicine, Kafkas University, Kars, Turkey
| | - Onder Duzlu
- Department of Parasitology, Faculty of Veterinary Medicine, Erciyes University, Kayseri, Turkey
| | - M. Ozkan Arslan
- Department of Microbiology, Faculty of Medicine, Kafkas University, Kars, Turkey
| | - Arif Ciloglu
- Department of Parasitology, Faculty of Veterinary Medicine, Erciyes University, Kayseri, Turkey
| | - Baris Sari
- Department of Parasitology, Faculty of Veterinary Medicine, Kafkas University, Kars, Turkey
| | - Nilgun Parmaksizoglu
- Department of Parasitology, Faculty of Veterinary Medicine, Kafkas University, Kars, Turkey
| | - Abdullah Inci
- Department of Parasitology, Faculty of Veterinary Medicine, Erciyes University, Kayseri, Turkey
| |
Collapse
|
6
|
Wang M, Huang X, Li R, Xu H, Jin L, He Y. Detecting recent positive selection with high accuracy and reliability by conditional coalescent tree. Mol Biol Evol 2014; 31:3068-80. [PMID: 25135945 DOI: 10.1093/molbev/msu244] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Studies of natural selection, followed by functional validation, are shedding light on understanding of genetic mechanisms underlying human evolution and adaptation. Classic methods for detecting selection, such as the integrated haplotype score (iHS) and Fay and Wu's H statistic, are useful for candidate gene searching underlying positive selection. These methods, however, have limited capability to localize causal variants in selection target regions. In this study, we developed a novel method based on conditional coalescent tree to detect recent positive selection by counting unbalanced mutations on coalescent gene genealogies. Extensive simulation studies revealed that our method is more robust than many other approaches against biases due to various demographic effects, including population bottleneck, expansion, or stratification, while not sacrificing its power. Furthermore, our method demonstrated its superiority in localizing causal variants from massive linked genetic variants. The rate of successful localization was about 20-40% higher than that of other state-of-the-art methods on simulated data sets. On empirical data, validated functional causal variants of four well-known positive selected genes were all successfully localized by our method, such as ADH1B, MCM6, APOL1, and HBB. Finally, the computational efficiency of this new method was much higher than that of iHS implementations, that is, 24-66 times faster than the REHH package, and more than 10,000 times faster than the original iHS implementation. These magnitudes make our method suitable for applying on large sequencing data sets. Software can be downloaded from https://github.com/wavefancy/scct.
Collapse
Affiliation(s)
- Minxian Wang
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| | - Xin Huang
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| | - Ran Li
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| | - Hongyang Xu
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| | - Li Jin
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Yungang He
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
7
|
Huang J, Zhao Y, Shiraigol W, Li B, Bai D, Ye W, Daidiikhuu D, Yang L, Jin B, Zhao Q, Gao Y, Wu J, Bao W, Li A, Zhang Y, Han H, Bai H, Bao Y, Zhao L, Zhai Z, Zhao W, Sun Z, Zhang Y, Meng H, Dugarjaviin M. Analysis of horse genomes provides insight into the diversification and adaptive evolution of karyotype. Sci Rep 2014; 4:4958. [PMID: 24828444 PMCID: PMC4021364 DOI: 10.1038/srep04958] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2013] [Accepted: 04/22/2014] [Indexed: 12/22/2022] Open
Abstract
Karyotypic diversification is more prominent in Equus species than in other mammals. Here, using next generation sequencing technology, we generated and de novo assembled quality genomes sequences for a male wild horse (Przewalski's horse) and a male domestic horse (Mongolian horse), with about 93-fold and 91-fold coverage, respectively. Portion of Y chromosome from wild horse assemblies (3 M bp) and Mongolian horse (2 M bp) were also sequenced and de novo assembled. We confirmed a Robertsonian translocation event through the wild horse's chromosomes 23 and 24, which contained sequences that were highly homologous with those on the domestic horse's chromosome 5. The four main types of rearrangement, insertion of unknown origin, inserted duplication, inversion, and relocation, are not evenly distributed on all the chromosomes, and some chromosomes, such as the X chromosome, contain more rearrangements than others, and the number of inversions is far less than the number of insertions and relocations in the horse genome. Furthermore, we discovered the percentages of LINE_L1 and LTR_ERV1 are significantly increased in rearrangement regions. The analysis results of the two representative Equus species genomes improved our knowledge of Equus chromosome rearrangement and karyotype evolution.
Collapse
Affiliation(s)
- Jinlong Huang
- 1] College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China [2]
| | - Yiping Zhao
- 1] College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China [2]
| | - Wunierfu Shiraigol
- 1] College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China [2]
| | - Bei Li
- 1] College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China [2]
| | - Dongyi Bai
- 1] College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China [2]
| | - Weixing Ye
- 1] Shanghai Personal Biotechnology Limited Company, 777 Longwu Road, Shanghai 200236, P.R. China [2]
| | - Dorjsuren Daidiikhuu
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Lihua Yang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Burenqiqige Jin
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Qinan Zhao
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Yahan Gao
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Jing Wu
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Wuyundalai Bao
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Anaer Li
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Yuhong Zhang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Haige Han
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Haitang Bai
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Yanqing Bao
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| | - Lele Zhao
- School of Agriculture and Biology, Shanghai Jiaotong University; Shanghai Key Laboratory of Veterinary Biotechnology, 800 Dongchuan Road, Shanghai 200240, P. R. China
| | - Zhengxiao Zhai
- School of Agriculture and Biology, Shanghai Jiaotong University; Shanghai Key Laboratory of Veterinary Biotechnology, 800 Dongchuan Road, Shanghai 200240, P. R. China
| | - Wenjing Zhao
- School of Agriculture and Biology, Shanghai Jiaotong University; Shanghai Key Laboratory of Veterinary Biotechnology, 800 Dongchuan Road, Shanghai 200240, P. R. China
| | - Zikui Sun
- Shanghai Personal Biotechnology Limited Company, 777 Longwu Road, Shanghai 200236, P.R. China
| | - Yan Zhang
- Virginia Bioinformatics Institute, Virginia Tech, Washington Street, MC0477, Blacksburg, Virginia, 24061, USA
| | - He Meng
- School of Agriculture and Biology, Shanghai Jiaotong University; Shanghai Key Laboratory of Veterinary Biotechnology, 800 Dongchuan Road, Shanghai 200240, P. R. China
| | - Manglai Dugarjaviin
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, P.R. China
| |
Collapse
|
8
|
Abstract
Background Companion animals like dogs frequently develop tumors with age and similarly to human malignancies, display interpatient tumoral heterogeneity. Tumors are frequently characterized with regard to their mutation spectra, changes in gene expression or protein levels. Among others, these changes affect proteins involved in the DNA damage response (DDR), which served as a basis for the development of numerous clinically relevant cancer therapies. Even though the effects of different DNA damaging agents, as well as DDR kinetics, have been well characterized in mammalian cells in vitro, very little is so far known about the kinetics of DDR in tumor and normal tissues in vivo. Discussion Due to (i) the similarities between human and canine genomes, (ii) the course of spontaneous tumor development, as well as (iii) common exposure to environmental agents, canine tumors are potentially an excellent model to study DDR in vivo. This is further supported by the fact that dogs show approximately the same rate of tumor development with age as humans. Though similarities between human and dog osteosarcoma, as well as mammary tumors have been well established, only few studies using canine tumor samples addressed the importance of affected DDR pathways in tumor progression, thus leaving many questions unanswered. Summary Studies in humans showed that misregulated DDR pathways play an important role during tumor development, as well as in treatment response. Since dogs are proposed to be a good tumor model in many aspects of cancer research, we herein critically investigate the current knowledge of canine DDR and discuss (i) its future potential for studies on the in vivo level, as well as (ii) its possible translation to veterinary and human medicine.
Collapse
Affiliation(s)
| | | | - Carla Rohrer Bley
- Division of Radiation Oncology, Vetsuisse Faculty, University of Zurich, Winterthurerstrasse 260, 8057 Zurich, Switzerland.
| |
Collapse
|
9
|
Berglund J, Nevalainen EM, Molin AM, Perloski M, André C, Zody MC, Sharpe T, Hitte C, Lindblad-Toh K, Lohi H, Webster MT. Novel origins of copy number variation in the dog genome. Genome Biol 2012; 13:R73. [PMID: 22916802 PMCID: PMC4053742 DOI: 10.1186/gb-2012-13-8-r73] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2012] [Accepted: 08/23/2012] [Indexed: 11/10/2022] Open
Abstract
Background Copy number variants (CNVs) account for substantial variation between genomes and are a major source of normal and pathogenic phenotypic differences. The dog is an ideal model to investigate mutational mechanisms that generate CNVs as its genome lacks a functional ortholog of the PRDM9 gene implicated in recombination and CNV formation in humans. Here we comprehensively assay CNVs using high-density array comparative genomic hybridization in 50 dogs from 17 dog breeds and 3 gray wolves. Results We use a stringent new method to identify a total of 430 high-confidence CNV loci, which range in size from 9 kb to 1.6 Mb and span 26.4 Mb, or 1.08%, of the assayed dog genome, overlapping 413 annotated genes. Of CNVs observed in each breed, 98% are also observed in multiple breeds. CNVs predicted to disrupt gene function are significantly less common than expected by chance. We identify a significant overrepresentation of peaks of GC content, previously shown to be enriched in dog recombination hotspots, in the vicinity of CNV breakpoints. Conclusions A number of the CNVs identified by this study are candidates for generating breed-specific phenotypes. Purifying selection seems to be a major factor shaping structural variation in the dog genome, suggesting that many CNVs are deleterious. Localized peaks of GC content appear to be novel sites of CNV formation in the dog genome by non-allelic homologous recombination, potentially activated by the loss of PRDM9. These sequence features may have driven genome instability and chromosomal rearrangements throughout canid evolution.
Collapse
|
10
|
Axelsson E, Webster MT, Ratnakumar A, Ponting CP, Lindblad-Toh K. Death of PRDM9 coincides with stabilization of the recombination landscape in the dog genome. Genome Res 2012; 22:51-63. [PMID: 22006216 PMCID: PMC3246206 DOI: 10.1101/gr.124123.111] [Citation(s) in RCA: 93] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2011] [Accepted: 10/05/2011] [Indexed: 11/25/2022]
Abstract
Analysis of diverse eukaryotes has revealed that recombination events cluster in discrete genomic locations known as hotspots. In humans, a zinc-finger protein, PRDM9, is believed to initiate recombination in >40% of hotspots by binding to a specific DNA sequence motif. However, the PRDM9 coding sequence is disrupted in the dog genome assembly, raising questions regarding the nature and control of recombination in dogs. By analyzing the sequences of PRDM9 orthologs in a number of dog breeds and several carnivores, we show here that this gene was inactivated early in canid evolution. We next use patterns of linkage disequilibrium using more than 170,000 SNP markers typed in almost 500 dogs to estimate the recombination rates in the dog genome using a coalescent-based approach. Broad-scale recombination rates show good correspondence with an existing linkage-based map. Significant variation in recombination rate is observed on the fine scale, and we are able to detect over 4000 recombination hotspots with high confidence. In contrast to human hotspots, 40% of canine hotspots are characterized by a distinct peak in GC content. A comparative genomic analysis indicates that these peaks are present also as weaker peaks in the panda, suggesting that the hotspots have been continually reinforced by accelerated and strongly GC biased nucleotide substitutions, consistent with the long-term action of biased gene conversion on the dog lineage. These results are consistent with the loss of PRDM9 in canids, resulting in a greater evolutionary stability of recombination hotspots. The genetic determinants of recombination hotspots in the dog genome may thus reflect a fundamental process of relevance to diverse animal species.
Collapse
Affiliation(s)
- Erik Axelsson
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75237 Uppsala, Sweden
| | - Matthew T. Webster
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75237 Uppsala, Sweden
| | - Abhirami Ratnakumar
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75237 Uppsala, Sweden
| | - Chris P. Ponting
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, United Kingdom
| | - Kerstin Lindblad-Toh
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75237 Uppsala, Sweden
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02139, USA
| |
Collapse
|
11
|
Alekseyev MA, Pevzner PA. Comparative genomics reveals birth and death of fragile regions in mammalian evolution. Genome Biol 2010; 11:R117. [PMID: 21118492 PMCID: PMC3156956 DOI: 10.1186/gb-2010-11-11-r117] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2010] [Revised: 10/05/2010] [Accepted: 11/30/2010] [Indexed: 12/15/2022] Open
Abstract
Background An important question in genome evolution is whether there exist fragile regions (rearrangement hotspots) where chromosomal rearrangements are happening over and over again. Although nearly all recent studies supported the existence of fragile regions in mammalian genomes, the most comprehensive phylogenomic study of mammals raised some doubts about their existence. Results Here we demonstrate that fragile regions are subject to a birth and death process, implying that fragility has a limited evolutionary lifespan. Conclusions This finding implies that fragile regions migrate to different locations in different mammals, explaining why there exist only a few chromosomal breakpoints shared between different lineages. The birth and death of fragile regions as a phenomenon reinforces the hypothesis that rearrangements are promoted by matching segmental duplications and suggests putative locations of the currently active fragile regions in the human genome.
Collapse
Affiliation(s)
- Max A Alekseyev
- Department of Computer Science & Engineering, University of South Carolina, 301 Main St, Columbia, SC 29208, USA.
| | | |
Collapse
|
12
|
Abstract
The genomes of birds and nonavian reptiles (Reptilia) are critical for understanding genome evolution in mammals and amniotes generally. Despite decades of study at the chromosomal and single-gene levels, and the evidence for great diversity in genome size, karyotype, and sex chromosome diversity, reptile genomes are virtually unknown in the comparative genomics era. The recent sequencing of the chicken and zebra finch genomes, in conjunction with genome scans and the online publication of the Anolis lizard genome, has begun to clarify the events leading from an ancestral amniote genome--predicted to be large and to possess a diverse repeat landscape on par with mammals and a birdlike sex chromosome system--to the small and highly streamlined genomes of birds. Reptilia exhibit a wide range of evolutionary rates of different subgenomes and, from isochores to mitochondrial DNA, provide a critical contrast to the genomic paradigms established in mammals.
Collapse
Affiliation(s)
- Daniel E Janes
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | | | | | | | | |
Collapse
|
13
|
Das S, Nikolaidis N, Goto H, McCallister C, Li J, Hirano M, Cooper MD. Comparative genomics and evolution of the alpha-defensin multigene family in primates. Mol Biol Evol 2010; 27:2333-43. [PMID: 20457584 DOI: 10.1093/molbev/msq118] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Defensin genes encode small cationic antimicrobial peptides that form an important part of the innate immune system. They are divided into three families, alpha (α), beta (β), and theta (), according to arrangement of the disulfide bonding pattern between cysteine residues. Considering the functional importance of defensins, investigators have studied the evolution and the genomic organization of defensin genes. However, these studies have been restricted mainly to β-defensins. To understand the evolutionary dynamics of α-defensin genes among primates, we identified the α-defensin repertoires in human, chimpanzee, orangutan, macaque, and marmoset. The α-defensin genes in primates can be classified into three phylogenetic classes (class I, II, and III). The presence of all three classes in the marmoset indicates that their divergence occurred before the separation of New World and Old World monkeys. Comparative analysis of the α-defensin genomic clusters suggests that the makeup of the α-defensin gene repertoires between primates is quite different, as their genes have undergone dramatic birth-and-death evolution. Analysis of the encoded peptides of the α-defensin genes indicates that despite the overall high level of sequence divergence, certain amino acid residues or motifs are conserved within and between the three phylogenetic classes. The evolution of α-defensins in primates, therefore, appears to be governed by two opposing evolutionary forces. One force stabilizes specific amino acid residues and motifs to preserve the functional and structural integrity of the molecules and the other diversifies the sequences generating molecules with a wide range of activities against a large number of pathogens.
Collapse
Affiliation(s)
- Sabyasachi Das
- Department of Pathology and Laboratory Medicine, Emory Vaccine Center, School of Medicine, Emory University, USA.
| | | | | | | | | | | | | |
Collapse
|
14
|
|
15
|
Abstract
Recombination is typically thought of as a symmetrical process resulting in large-scale reciprocal genetic exchanges between homologous chromosomes. Recombination events, however, are also accompanied by short-scale, unidirectional exchanges known as gene conversion in the neighborhood of the initiating double-strand break. A large body of evidence suggests that gene conversion is GC-biased in many eukaryotes, including mammals and human. AT/GC heterozygotes produce more GC- than AT-gametes, thus conferring a population advantage to GC-alleles in high-recombining regions. This apparently unimportant feature of our molecular machinery has major evolutionary consequences. Structurally, GC-biased gene conversion explains the spatial distribution of GC-content in mammalian genomes-the so-called isochore structure. Functionally, GC-biased gene conversion promotes the segregation and fixation of deleterious AT --> GC mutations, thus increasing our genomic mutation load. Here we review the recent evidence for a GC-biased gene conversion process in mammals, and its consequences for genomic landscapes, molecular evolution, and human functional genomics.
Collapse
Affiliation(s)
- Laurent Duret
- Université de Lyon 1, CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622, Villeurbanne, France.
| | | |
Collapse
|
16
|
Lemaitre C, Zaghloul L, Sagot MF, Gautier C, Arneodo A, Tannier E, Audit B. Analysis of fine-scale mammalian evolutionary breakpoints provides new insight into their relation to genome organisation. BMC Genomics 2009; 10:335. [PMID: 19630943 PMCID: PMC2722678 DOI: 10.1186/1471-2164-10-335] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2009] [Accepted: 07/24/2009] [Indexed: 11/21/2022] Open
Abstract
Background The Intergenic Breakage Model, which is the current model of structural genome evolution, considers that evolutionary rearrangement breakages happen with a uniform propensity along the genome but are selected against in genes, their regulatory regions and in-between. However, a growing body of evidence shows that there exists regions along mammalian genomes that present a high susceptibility to breakage. We reconsidered this question taking advantage of a recently published methodology for the precise detection of rearrangement breakpoints based on pairwise genome comparisons. Results We applied this methodology between the genome of human and those of five sequenced eutherian mammals which allowed us to delineate evolutionary breakpoint regions along the human genome with a finer resolution (median size 26.6 kb) than obtained before. We investigated the distribution of these breakpoints with respect to genome organisation into domains of different activity. In agreement with the Intergenic Breakage Model, we observed that breakpoints are under-represented in genes. Surprisingly however, the density of breakpoints in small intergenes (1 per Mb) appears significantly higher than in gene deserts (0.1 per Mb). More generally, we found a heterogeneous distribution of breakpoints that follows the organisation of the genome into isochores (breakpoints are more frequent in GC-rich regions). We then discuss the hypothesis that regions with an enhanced susceptibility to breakage correspond to regions of high transcriptional activity and replication initiation. Conclusion We propose a model to describe the heterogeneous distribution of evolutionary breakpoints along human chromosomes that combines natural selection and a mutational bias linked to local open chromatin state.
Collapse
Affiliation(s)
- Claire Lemaitre
- Université de Bordeaux, Centre de Bioinformatique - Génomique Fonctionnelle Bordeaux, F-33000 Bordeaux, France.
| | | | | | | | | | | | | |
Collapse
|
17
|
Abstract
BACKGROUND The very recent availability of fully sequenced individual human genomes is a major revolution in biology which is certainly going to provide new insights into genetic diseases and genomic rearrangements. RESULTS We mapped the insertions, deletions and SNPs (single nucleotide polymorphisms) that are present in Craig Venter's genome, more precisely on chromosomes 17 to 22, and compared them with the human reference genome hg17. Our results show that insertions and deletions are almost absent in L1 and generally scarce in L2 isochore families (GC-poor L1+L2 isochores represent slightly over half of the human genome), whereas they increase in GC-rich isochores, largely paralleling the densities of genes, retroviral integrations and Alu sequences. The distributions of insertions/deletions are in striking contrast with those of SNPs which exhibit almost the same density across all isochore families with, however, a trend for lower concentrations in gene-rich regions. CONCLUSIONS Our study strongly suggests that the distribution of insertions/deletions is due to the structure of chromatin which is mostly open in gene-rich, GC-rich isochores, and largely closed in gene-poor, GC-poor isochores. The different distributions of insertions/deletions and SNPs are clearly related to the two different responsible mechanisms, namely recombination and point mutations.
Collapse
Affiliation(s)
- Maria Costantini
- Stazione Zoologica Anton Dohrn, Naples, Italy
- * E-mail: (MC); (GB)
| | - Giorgio Bernardi
- Stazione Zoologica Anton Dohrn, Naples, Italy
- * E-mail: (MC); (GB)
| |
Collapse
|
18
|
Abstract
Recently completed whole-genome sequencing projects marked the transition from gene-based phylogenetic studies to phylogenomics analysis of entire genomes. We developed an algorithm MGRA for reconstructing ancestral genomes and used it to study the rearrangement history of seven mammalian genomes: human, chimpanzee, macaque, mouse, rat, dog, and opossum. MGRA relies on the notion of the multiple breakpoint graphs to overcome some limitations of the existing approaches to ancestral genome reconstructions. MGRA also generates the rearrangement-based characters guiding the phylogenetic tree reconstruction when the phylogeny is unknown.
Collapse
Affiliation(s)
- Max A. Alekseyev
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, California 92093-0404, USA
| | - Pavel A. Pevzner
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, California 92093-0404, USA
| |
Collapse
|
19
|
Abstract
Multi-break rearrangements break a genome into multiple fragments and further glue them together in a new order. While 2-break rearrangements represent standard reversals, fusions, fissions, and translocations, 3-break rearrangements represent a natural generalization of transpositions. Alekseyev and Pevzner (2007a, 2008a) studied multi-break rearrangements in circular genomes and further applied them to the analysis of chromosomal evolution in mammalian genomes. In this paper, we extend these results to the more difficult case of linear genomes. In particular, we give lower bounds for the rearrangement distance between linear genomes and for the breakpoint re-use rate as functions of the number and proportion of transpositions. We further use these results to analyze comparative genomic architecture of mammalian genomes.
Collapse
Affiliation(s)
- Max A Alekseyev
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, California 92093, USA.
| |
Collapse
|
20
|
|
21
|
Marques-Bonet T, Cheng Z, She X, Eichler EE, Navarro A. The genomic distribution of intraspecific and interspecific sequence divergence of human segmental duplications relative to human/chimpanzee chromosomal rearrangements. BMC Genomics 2008; 9:384. [PMID: 18699995 DOI: 10.1186/1471-2164-9-384] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2007] [Accepted: 08/12/2008] [Indexed: 11/20/2022] Open
Abstract
Background It has been suggested that chromosomal rearrangements harbor the molecular footprint of the biological phenomena which they induce, in the form, for instance, of changes in the sequence divergence rates of linked genes. So far, all the studies of these potential associations have focused on the relationship between structural changes and the rates of evolution of single-copy DNA and have tried to exclude segmental duplications (SDs). This is paradoxical, since SDs are one of the primary forces driving the evolution of structure and function in our genomes and have been linked not only with novel genes acquiring new functions, but also with overall higher DNA sequence divergence and major chromosomal rearrangements. Results Here we take the opposite view and focus on SDs. We analyze several of the features of SDs, including the rates of intraspecific divergence between paralogous copies of human SDs and of interspecific divergence between human SDs and chimpanzee DNA. We study how divergence measures relate to chromosomal rearrangements, while considering other factors that affect evolutionary rates in single copy DNA. Conclusion We find that interspecific SD divergence behaves similarly to divergence of single-copy DNA. In contrast, old and recent paralogous copies of SDs do present different patterns of intraspecific divergence. Also, we show that some relatively recent SDs accumulate in regions that carry inversions in sister lineages.
Collapse
|
22
|
Edwards CA, Mungall AJ, Matthews L, Ryder E, Gray DJ, Pask AJ, Shaw G, Graves JA, Rogers J, Dunham I, Renfree MB, Ferguson-Smith AC. The evolution of the DLK1-DIO3 imprinted domain in mammals. PLoS Biol 2008; 6:e135. [PMID: 18532878 PMCID: PMC2408620 DOI: 10.1371/journal.pbio.0060135] [Citation(s) in RCA: 139] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2008] [Accepted: 04/18/2008] [Indexed: 11/20/2022] Open
Abstract
A comprehensive, domain-wide comparative analysis of genomic imprinting between mammals that imprint and those that do not can provide valuable information about how and why imprinting evolved. The imprinting status, DNA methylation, and genomic landscape of the Dlk1-Dio3 cluster were determined in eutherian, metatherian, and prototherian mammals including tammar wallaby and platypus. Imprinting across the whole domain evolved after the divergence of eutherian from marsupial mammals and in eutherians is under strong purifying selection. The marsupial locus at 1.6 megabases, is double that of eutherians due to the accumulation of LINE repeats. Comparative sequence analysis of the domain in seven vertebrates determined evolutionary conserved regions common to particular sub-groups and to all vertebrates. The emergence of Dlk1-Dio3 imprinting in eutherians has occurred on the maternally inherited chromosome and is associated with region-specific resistance to expansion by repetitive elements and the local introduction of noncoding transcripts including microRNAs and C/D small nucleolar RNAs. A recent mammal-specific retrotransposition event led to the formation of a completely new gene only in the eutherian domain, which may have driven imprinting at the cluster. Mammals have two copies of each gene in their somatic cells, and most of these gene pairs are regulated and expressed simultaneously. A fraction of mammalian genes, however, is subject to imprinting—a chemical modification that marks a gene according to its parental origin, so that one parent's copy is expressed while the other parent's copy is silenced. How and why this process evolved is the subject of much speculation. Here we have shown that all the genes in one genomic region, Dlk1-Dio3, which are imprinted in placental mammals such as mouse and human, are not imprinted in marsupial (wallaby) or monotreme (platypus) mammals. This is in contrast to a small number of other imprinted genes that are imprinted in marsupials and other therian mammals and indicates that imprinting arose at each genomic domain at different stages of mammalian evolution. We have compared the sequence of the Dlk1-Dio3 region between seven vertebrate species and identified sequences that are differentially represented in mammals that imprint compared to those that do not. Our data indicate that once imprinted gene regulation is acquired in a domain, it becomes evolutionarily constrained to remain unchanged. A comparative analysis of genomic imprinting between mammals that imprint and those that don't has provided insights into how and why imprinting evolved.
Collapse
Affiliation(s)
- Carol A Edwards
- Department of Physiology, Development, and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - Andrew J Mungall
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Lucy Matthews
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Edward Ryder
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Dionne J Gray
- Department of Physiology, Development, and Neuroscience, University of Cambridge, Cambridge, United Kingdom
| | - Andrew J Pask
- Department of Zoology, University of Melbourne, Victoria, Australia
| | - Geoffrey Shaw
- Department of Zoology, University of Melbourne, Victoria, Australia
| | - Jennifer A.M Graves
- Research School of Biological Sciences, The Australian National University, Canberra, Australia
| | - Jane Rogers
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | | | - Ian Dunham
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | | | - Anne C Ferguson-Smith
- Department of Physiology, Development, and Neuroscience, University of Cambridge, Cambridge, United Kingdom
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
23
|
Nguyen DQ, Webber C, Hehir-Kwa J, Pfundt R, Veltman J, Ponting CP. Reduced purifying selection prevails over positive selection in human copy number variant evolution. Genome Res 2008; 18:1711-23. [PMID: 18687881 DOI: 10.1101/gr.077289.108] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Copy number variation is a dominant contributor to genomic variation and may frequently underlie an individual's variable susceptibilities to disease. Here we question our previous proposition that copy number variants (CNVs) are often retained in the human population because of their adaptive benefit. We show that genic biases of CNVs are best explained, not by positive selection, but by reduced efficiency of selection in eliminating deleterious changes from the human population. Of four CNV data sets examined, three exhibit significant increases in protein evolutionary rates. These increases appear to be attributable to the frequent coincidence of CNVs with segmental duplications (SDs) that recombine infrequently. Furthermore, human orthologs of mouse genes, which, when disrupted, result in pre- or postnatal lethality, are unusually depleted in CNVs. Together, these findings support a model of reduced purifying selection (Hill-Robertson interference) within copy number variable regions that are enriched in nonessential genes, allowing both the fixation of slightly deleterious substitutions and increased drift of CNV alleles. Additionally, all four CNV sets exhibited increased rates of interspecies chromosomal rearrangement and nucleotide substitution and an increased gene density. We observe that sequences with high G+C contents are most prone to copy number variation. In particular, frequently duplicated human SD sequence, or CNVs that are large and/or observed frequently, tend to be elevated in G+C content. In contrast, SD sequences that appear fixed in the human population lie more frequently within low G+C sequence. These findings provide an overarching view of how CNVs arise and segregate in the human population.
Collapse
Affiliation(s)
- Duc-Quang Nguyen
- MRC Functional Genomics Unit, University of Oxford, Department of Physiology, Anatomy and Genetics, Oxford OX1 3QX, United Kingdom
| | | | | | | | | | | |
Collapse
|
24
|
Abstract
The evolution of karyotypes has been the subject of intensive study since the middle of the 20th century. This was motivated by the observation that the karyotypes of related species showed remarkable conservation. The recent emergence of whole-genome sequencing projects gives the opportunity to complement the cytogenetic approaches by addressing the conservation of karyotypes using chromosome sequence comparison. In this short review we present a description of recent advances in computational biology methods dedicated to the study of chromosome evolution and more specifically ancestral karyotype reconstruction in an attempt to provide an integrated overview of both cytogenetic and computational approaches.
Collapse
|
25
|
Lin CH, Bourque G, Tan P. A comparative synteny map of Burkholderia species links large-scale genome rearrangements to fine-scale nucleotide variation in prokaryotes. Mol Biol Evol 2007; 25:549-58. [PMID: 18162473 DOI: 10.1093/molbev/msm282] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Genome rearrangement events, including inversions and translocations, are frequently observed across related microbial species, but the impact of such events on functional diversity is unclear. To clarify this relationship, we compared 4 members of the Gram-negative Burkholderia family (Burkholderia pseudomallei, Burkholderia mallei, Burkholderia thailandensis, and Burkholderia cenocepacia) and identified a core set of 2,590 orthologs present in all 4 species (metagenes). The metagenes were organized into 255 synteny blocks whose relative order has been altered by a predicted minimum of 242 genome rearrangement events. Functionally, metagenes within individual synteny blocks were often related. The molecular divergence of metagenes adjacent to synteny breakpoints (boundary metagenes) was significantly greater compared with metagenes within blocks, suggesting an association between breakpoint locations and local fine-scale nucleotide alterations. This phenomenon, referred to as boundary element associated divergence, was also observed in Pseudomonas and Shigella, suggesting that this is a common phenomenon in prokaryotes. We also observed preferential localization of species-specific genes and insertion sequence element to synteny breakpoints in Burkholderia. Our results suggest that in prokaryotes, genome rearrangements may influence functional diversity through the enhanced divergence of boundary genes and the creation of foci for acquiring and deleting species-specific genes.
Collapse
Affiliation(s)
- Chi Ho Lin
- Genome Institute of Singapore, Singapore
| | | | | |
Collapse
|
26
|
Abstract
In a landmark paper, Nadeau and Taylor [18] formulated the random breakage model (RBM) of chromosome evolution that postulates that there are no rearrangement hotspots in the human genome. In the next two decades, numerous studies with progressively increasing levels of resolution made RBM the de facto theory of chromosome evolution. Despite the fact that RBM had prophetic prediction power, it was recently refuted by Pevzner and Tesler [4], who introduced the fragile breakage model (FBM), postulating that the human genome is a mosaic of solid regions (with low propensity for rearrangements) and fragile regions (rearrangement hotspots). However, the rebuttal of RBM caused a controversy and led to a split among researchers studying genome evolution. In particular, it remains unclear whether some complex rearrangements (e.g., transpositions) can create an appearance of rearrangement hotspots. We contribute to the ongoing debate by analyzing multi-break rearrangements that break a genome into multiple fragments and further glue them together in a new order. In particular, we demonstrate that (1) even if transpositions were a dominant force in mammalian evolution, the arguments in favor of FBM still stand, and (2) the "gene deletion" argument against FBM is flawed.
Collapse
Affiliation(s)
- Max A Alekseyev
- Department of Computer Science and Engineering, University of California San Diego, San Diego, California, United States of America.
| | | |
Collapse
|
27
|
Reed C, Fofanov V, Putonti C, Chumakov S, Slezak T, Fofanov Y. Effect of the mutation rate and background size on the quality of pathogen identification. ACTA ACUST UNITED AC 2007; 23:2665-71. [PMID: 17881407 DOI: 10.1093/bioinformatics/btm420] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Genomic-based methods have significant potential for fast and accurate identification of organisms or even genes of interest in complex environmental samples (air, water, soil, food, etc.), especially when isolation of the target organism cannot be performed by a variety of reasons. Despite this potential, the presence of the unknown, variable and usually large quantities of background DNA can cause interference resulting in false positive outcomes. RESULTS In order to estimate how the genomic diversity of the background (total length of all of the different genomes present in the background), target length and target mutation rate affect the probability of misidentifications, we introduce a mathematical definition for the quality of an individual signature in the presence of a background based on its length and number of mismatches needed to transform the signature into the closest subsequence present in the background. This definition, in conjunction with a probabilistic framework, allows one to predict the minimal signature length required to identify the target in the presence of different sizes of backgrounds and the effect of the target's mutation rate on the quality of its identification. The model assumptions and predictions were validated using both Monte Carlo simulations and real genomic data examples. The proposed model can be used to determine appropriate signature lengths for various combinations of target and background genome sizes. It also predicted that any genomic signatures will be unable to identify target if its mutation rate is >5%. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Chris Reed
- Department of Computer Science, University of Houston, 501 Philip G. Hoffman Hall, Houston, TX 77204, USA
| | | | | | | | | | | |
Collapse
|
28
|
Goodstadt L, Heger A, Webber C, Ponting CP. An analysis of the gene complement of a marsupial, Monodelphis domestica: evolution of lineage-specific genes and giant chromosomes. Genome Res 2007; 17:969-81. [PMID: 17495010 PMCID: PMC1899124 DOI: 10.1101/gr.6093907] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The newly sequenced genome of Monodelphis domestica not only provides the out-group necessary to better understand our own eutherian lineage, but it enables insights into the innovative biology of metatherians. Here, we compare Monodelphis with Homo sequences from alignments of single nucleotides, genes, and whole chromosomes. Using PhyOP, we have established orthologs in Homo for 82% (15,250) of Monodelphis gene predictions. Those with single orthologs in each species exhibited a high median synonymous substitution rate (d(S) = 1.02), thereby explaining the relative paucity of aligned regions outside of coding sequences. Orthology assignments were used to construct a synteny map that illustrates the considerable fragmentation of Monodelphis and Homo karyotypes since their therian last common ancestor. Fifteen percent of Monodelphis genes are predicted, from their low divergence at synonymous sites, to have been duplicated in the metatherian lineage. The majority of Monodelphis-specific genes possess predicted roles in chemosensation, reproduction, adaptation to specific diets, and immunity. Using alignments of Monodelphis genes to sequences from either Homo or Trichosurus vulpecula (an Australian marsupial), we show that metatherian X chromosomes have elevated silent substitution rates and high G+C contents in comparison with both metatherian autosomes and eutherian chromosomes. Each of these elevations is also a feature of subtelomeric chromosomal regions. We attribute these observations to high rates of female-specific recombination near the chromosomal ends and within the X chromosome, which act to sustain or increase G+C levels by biased gene conversion. In particular, we propose that the higher G+C content of the Monodelphis X chromosome is a direct consequence of its small size relative to the giant autosomes.
Collapse
Affiliation(s)
- Leo Goodstadt
- MRC Functional Genetics Unit, University of Oxford, Department of Physiology, Oxford, UK.
| | | | | | | |
Collapse
|
29
|
Mikkelsen TS, Wakefield MJ, Aken B, Amemiya CT, Chang JL, Duke S, Garber M, Gentles AJ, Goodstadt L, Heger A, Jurka J, Kamal M, Mauceli E, Searle SMJ, Sharpe T, Baker ML, Batzer MA, Benos PV, Belov K, Clamp M, Cook A, Cuff J, Das R, Davidow L, Deakin JE, Fazzari MJ, Glass JL, Grabherr M, Greally JM, Gu W, Hore TA, Huttley GA, Kleber M, Jirtle RL, Koina E, Lee JT, Mahony S, Marra MA, Miller RD, Nicholls RD, Oda M, Papenfuss AT, Parra ZE, Pollock DD, Ray DA, Schein JE, Speed TP, Thompson K, VandeBerg JL, Wade CM, Walker JA, Waters PD, Webber C, Weidman JR, Xie X, Zody MC, Graves JAM, Ponting CP, Breen M, Samollow PB, Lander ES, Lindblad-Toh K. Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences. Nature 2007; 447:167-77. [PMID: 17495919 DOI: 10.1038/nature05805] [Citation(s) in RCA: 508] [Impact Index Per Article: 29.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2006] [Accepted: 04/03/2007] [Indexed: 12/15/2022]
Abstract
We report a high-quality draft of the genome sequence of the grey, short-tailed opossum (Monodelphis domestica). As the first metatherian ('marsupial') species to be sequenced, the opossum provides a unique perspective on the organization and evolution of mammalian genomes. Distinctive features of the opossum chromosomes provide support for recent theories about genome evolution and function, including a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation. Comparison of opossum and eutherian genomes also reveals a sharp difference in evolutionary innovation between protein-coding and non-coding functional elements. True innovation in protein-coding genes seems to be relatively rare, with lineage-specific differences being largely due to diversification and rapid turnover in gene families involved in environmental interactions. In contrast, about 20% of eutherian conserved non-coding elements (CNEs) are recent inventions that postdate the divergence of Eutheria and Metatheria. A substantial proportion of these eutherian-specific CNEs arose from sequence inserted by transposable elements, pointing to transposons as a major creative force in the evolution of mammalian gene regulation.
Collapse
Affiliation(s)
- Tarjei S Mikkelsen
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Affiliation(s)
- Gerton Lunter
- Gerton Lunter () University of Oxford, Oxford, United Kingdom
| |
Collapse
|
31
|
Sutter NB, Bustamante CD, Chase K, Gray MM, Zhao K, Zhu L, Padhukasahasram B, Karlins E, Davis S, Jones PG, Quignon P, Johnson GS, Parker HG, Fretwell N, Mosher DS, Lawler DF, Satyaraj E, Nordborg M, Lark KG, Wayne RK, Ostrander EA. A single IGF1 allele is a major determinant of small size in dogs. Science 2007; 316:112-5. [PMID: 17412960 PMCID: PMC2789551 DOI: 10.1126/science.1137045] [Citation(s) in RCA: 426] [Impact Index Per Article: 25.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
The domestic dog exhibits greater diversity in body size than any other terrestrial vertebrate. We used a strategy that exploits the breed structure of dogs to investigate the genetic basis of size. First, through a genome-wide scan, we identified a major quantitative trait locus (QTL) on chromosome 15 influencing size variation within a single breed. Second, we examined genetic variation in the 15-megabase interval surrounding the QTL in small and giant breeds and found marked evidence for a selective sweep spanning a single gene (IGF1), encoding insulin-like growth factor 1. A single IGF1 single-nucleotide polymorphism haplotype is common to all small breeds and nearly absent from giant breeds, suggesting that the same causal sequence variant is a major contributor to body size in all small dogs.
Collapse
Affiliation(s)
- Nathan B. Sutter
- National Human Genome Research Institute, Building 50, Room 5349, 50 South Drive MSC 8000, Bethesda, MD 20892–8000, USA
| | - Carlos D. Bustamante
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY 14850, USA
| | - Kevin Chase
- Department of Biology, University of Utah, Salt Lake City, UT 84112, USA
| | - Melissa M. Gray
- Department of Ecology and Environmental Biology, University of California, Los Angeles, CA 90095, USA
| | - Keyan Zhao
- Department of Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Lan Zhu
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY 14850, USA
| | - Badri Padhukasahasram
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY 14850, USA
| | - Eric Karlins
- National Human Genome Research Institute, Building 50, Room 5349, 50 South Drive MSC 8000, Bethesda, MD 20892–8000, USA
| | - Sean Davis
- National Human Genome Research Institute, Building 50, Room 5349, 50 South Drive MSC 8000, Bethesda, MD 20892–8000, USA
| | - Paul G. Jones
- The WALTHAM Centre for Pet Nutrition, Waltham on the Wolds, Leicestershire, LE14 4RT, UK
| | - Pascale Quignon
- National Human Genome Research Institute, Building 50, Room 5349, 50 South Drive MSC 8000, Bethesda, MD 20892–8000, USA
| | - Gary S. Johnson
- Department of Veterinary Pathobiology, University of Missouri, Columbia, MO 65211, USA
| | - Heidi G. Parker
- National Human Genome Research Institute, Building 50, Room 5349, 50 South Drive MSC 8000, Bethesda, MD 20892–8000, USA
| | - Neale Fretwell
- The WALTHAM Centre for Pet Nutrition, Waltham on the Wolds, Leicestershire, LE14 4RT, UK
| | - Dana S. Mosher
- National Human Genome Research Institute, Building 50, Room 5349, 50 South Drive MSC 8000, Bethesda, MD 20892–8000, USA
| | | | | | - Magnus Nordborg
- Department of Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - K. Gordon Lark
- Department of Biology, University of Utah, Salt Lake City, UT 84112, USA
| | - Robert K. Wayne
- Department of Ecology and Environmental Biology, University of California, Los Angeles, CA 90095, USA
| | - Elaine A. Ostrander
- National Human Genome Research Institute, Building 50, Room 5349, 50 South Drive MSC 8000, Bethesda, MD 20892–8000, USA
| |
Collapse
|
32
|
Abstract
This article analyzes mammalian genome rearrangements at higher resolution than has been published to date. We identify 3171 intervals, covering approximately 92% of the human genome, within which we find no rearrangements larger than 50 kilobases (kb) in the lineages leading to human, mouse, rat, and dog from their most recent common ancestor. Combining intervals that are adjacent in all contemporary species produces 1338 segments that may contain large insertions or deletions but that are free of chromosome fissions or fusions as well as inversions or translocations >50 kb in length. We describe a new method for predicting the ancestral order and orientation of those intervals from their observed adjacencies in modern species. We combine the results from this method with data from chromosome painting experiments to produce a map of an early mammalian genome that accounts for 96.8% of the available human genome sequence data. The precision is further increased by mapping inversions as small as 31 bp. Analysis of the predicted evolutionary breakpoints in the human lineage confirms certain published observations but disagrees with others. Although only a few mammalian genomes are currently sequenced to high precision, our theoretical analyses and computer simulations indicate that our results are reasonably accurate and that they will become highly accurate in the foreseeable future. Our methods were developed as part of a project to reconstruct the genome sequence of the last ancestor of human, dogs, and most other placental mammals.
Collapse
Affiliation(s)
- Jian Ma
- Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania 16802, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Abstract
Although large-scale copy-number variation is an important contributor to conspecific genomic diversity, whether these variants frequently contribute to human phenotype differences remains unknown. If they have few functional consequences, then copy-number variants (CNVs) might be expected both to be distributed uniformly throughout the human genome and to encode genes that are characteristic of the genome as a whole. We find that human CNVs are significantly overrepresented close to telomeres and centromeres and in simple tandem repeat sequences. Additionally, human CNVs were observed to be unusually enriched in those protein-coding genes that have experienced significantly elevated synonymous and nonsynonymous nucleotide substitution rates, estimated between single human and mouse orthologues. CNV genes encode disproportionately large numbers of secreted, olfactory, and immunity proteins, although they contain fewer than expected genes associated with Mendelian disease. Despite mouse CNVs also exhibiting a significant elevation in synonymous substitution rates, in most other respects they do not differ significantly from the genomic background. Nevertheless, they encode proteins that are depleted in olfactory function, and they exhibit significantly decreased amino acid sequence divergence. Natural selection appears to have acted discriminately among human CNV genes. The significant overabundance, within human CNVs, of genes associated with olfaction, immunity, protein secretion, and elevated coding sequence divergence, indicates that a subset may have been retained in the human population due to the adaptive benefit of increased gene dosage. By contrast, the functional characteristics of mouse CNVs either suggest that advantageous gene copies have been depleted during recent selective breeding of laboratory mouse strains or suggest that they were preferentially fixed as a consequence of the larger effective population size of wild mice. It thus appears that CNV differences among mouse strains do not provide an appropriate model for large-scale sequence variations in the human population. Until recently, it was thought that most inherited human diversity results from genetic variation at single nucleotide sites. However, recent studies discovered many larger-scale differences, involving the duplication or deletion of thousands of bases. Do these large-scale differences contribute greatly to characteristics of human individuals, or are they of little consequence? For clues to solve this mystery the authors looked to the signatures of adaptive evolution written into the DNA. They reasoned that if large-scale DNA differences are beneficial, they should be enriched in genes, particularly those involved in fighting infection and sensing our environment. The authors discovered such enrichments indicating that some large-scale sequence differences have been advantageous during the last approximately 100,000 y of human history. By contrast, modern laboratory mice exhibit few signs of beneficial large-scale DNA differences, perhaps because advantageous sequences have swept rapidly through their ancestral populations. Some large-scale variations in human genomes thus appear to be a legacy of past evolutionary challenges to our species.
Collapse
Affiliation(s)
- Duc-Quang Nguyen
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | - Caleb Webber
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | - Chris P Ponting
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
34
|
|
35
|
Lindblad-Toh K, Wade CM, Mikkelsen TS, Karlsson EK, Jaffe DB, Kamal M, Clamp M, Chang JL, Kulbokas EJ, Zody MC, Mauceli E, Xie X, Breen M, Wayne RK, Ostrander EA, Ponting CP, Galibert F, Smith DR, DeJong PJ, Kirkness E, Alvarez P, Biagi T, Brockman W, Butler J, Chin CW, Cook A, Cuff J, Daly MJ, DeCaprio D, Gnerre S, Grabherr M, Kellis M, Kleber M, Bardeleben C, Goodstadt L, Heger A, Hitte C, Kim L, Koepfli KP, Parker HG, Pollinger JP, Searle SMJ, Sutter NB, Thomas R, Webber C, Baldwin J, Abebe A, Abouelleil A, Aftuck L, Ait-Zahra M, Aldredge T, Allen N, An P, Anderson S, Antoine C, Arachchi H, Aslam A, Ayotte L, Bachantsang P, Barry A, Bayul T, Benamara M, Berlin A, Bessette D, Blitshteyn B, Bloom T, Blye J, Boguslavskiy L, Bonnet C, Boukhgalter B, Brown A, Cahill P, Calixte N, Camarata J, Cheshatsang Y, Chu J, Citroen M, Collymore A, Cooke P, Dawoe T, Daza R, Decktor K, DeGray S, Dhargay N, Dooley K, Dooley K, Dorje P, Dorjee K, Dorris L, Duffey N, Dupes A, Egbiremolen O, Elong R, Falk J, Farina A, Faro S, Ferguson D, Ferreira P, Fisher S, FitzGerald M, Foley K, Foley C, Franke A, Friedrich D, Gage D, Garber M, Gearin G, Giannoukos G, Goode T, Goyette A, Graham J, Grandbois E, Gyaltsen K, Hafez N, Hagopian D, Hagos B, Hall J, Healy C, Hegarty R, Honan T, Horn A, Houde N, Hughes L, Hunnicutt L, Husby M, Jester B, Jones C, Kamat A, Kanga B, Kells C, Khazanovich D, Kieu AC, Kisner P, Kumar M, Lance K, Landers T, Lara M, Lee W, Leger JP, Lennon N, Leuper L, LeVine S, Liu J, Liu X, Lokyitsang Y, Lokyitsang T, Lui A, Macdonald J, Major J, Marabella R, Maru K, Matthews C, McDonough S, Mehta T, Meldrim J, Melnikov A, Meneus L, Mihalev A, Mihova T, Miller K, Mittelman R, Mlenga V, Mulrain L, Munson G, Navidi A, Naylor J, Nguyen T, Nguyen N, Nguyen C, Nguyen T, Nicol R, Norbu N, Norbu C, Novod N, Nyima T, Olandt P, O'Neill B, O'Neill K, Osman S, Oyono L, Patti C, Perrin D, Phunkhang P, Pierre F, Priest M, Rachupka A, Raghuraman S, Rameau R, Ray V, Raymond C, Rege F, Rise C, Rogers J, Rogov P, Sahalie J, Settipalli S, Sharpe T, Shea T, Sheehan M, Sherpa N, Shi J, Shih D, Sloan J, Smith C, Sparrow T, Stalker J, Stange-Thomann N, Stavropoulos S, Stone C, Stone S, Sykes S, Tchuinga P, Tenzing P, Tesfaye S, Thoulutsang D, Thoulutsang Y, Topham K, Topping I, Tsamla T, Vassiliev H, Venkataraman V, Vo A, Wangchuk T, Wangdi T, Weiand M, Wilkinson J, Wilson A, Yadav S, Yang S, Yang X, Young G, Yu Q, Zainoun J, Zembek L, Zimmer A, Lander ES. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 2005; 438:803-19. [PMID: 16341006 DOI: 10.1038/nature04338] [Citation(s) in RCA: 1680] [Impact Index Per Article: 88.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2005] [Accepted: 10/11/2005] [Indexed: 12/12/2022]
Abstract
Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.
Collapse
Affiliation(s)
- Kerstin Lindblad-Toh
- Broad Institute of Harvard and MIT, 320 Charles Street, Cambridge, Massachusetts 02141, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|