1
|
Singh VK, Singh S, Nandhini PB, Bhatia AK, Dixit SP, Ganguly I. Comparative genomic diversity analysis of copy number variations (CNV) in indicine and taurine cattle thriving in Europe and Indian subcontinent. Anim Biotechnol 2023; 34:3483-3494. [PMID: 36592947 DOI: 10.1080/10495398.2022.2162910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Copy number variations (CNVs) include deletions, duplications, and insertions that are larger than 50 bp in size causing structural variation responsible for diversity, adaptation, and breed development. Indian cattle breeds are highly diverse from the taurine breeds. The pattern of CNVRs in 191 animals belonging to 39 cattle breeds (four Indicine and 35 Taurine) was studied based on Illumina 777K BovineHD chip data. The Indicine breeds revealed 2590 CNVs and 335 copy number variation regions (CNVRs) in autosomes. Out of the identified CNVs, 50 were found to be novel. Structure analysis revealed admixed nature of Siri. Neighbor joining tree from CNVR data showed that hot (Kankrej and Hallikar) and cold (Ladakhi and Siri) adapted cattle breeds clustered separately. CNVR of Indian and European breeds revealed that Balkan and Italian breeds of Podolian group are admixed with Indian cattle breeds corroborating indicine introgression (6.1-13.5%). CNVRs spanning the regions of olfactory receptors and immune system genes were identified. AMOVA revealed 9% variation among populations which is 2% greater than SNP based studies showing higher inclusion of variation by CNVR. Detailed analysis of CNVs/CNVRs in Indian cattle adapted to hot and cold climate, and their diversity among worldwide cattle is presented in this study.
Collapse
Affiliation(s)
- V K Singh
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - S Singh
- Animal Genetics Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, India
| | - P B Nandhini
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - A K Bhatia
- Animal Genetic Resources Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, India
| | - S P Dixit
- Animal Genetics Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, India
| | - I Ganguly
- Animal Genetics Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, India
| |
Collapse
|
2
|
Fan S, Kong C, Chen Y, Zheng X, Zhou R, Zhang X, Wu X, Zhang W, Ding Y, Yin Z. Copy Number Variation Analysis Revealed the Evolutionary Difference between Chinese Indigenous Pigs and Asian Wild Boars. Genes (Basel) 2023; 14:472. [PMID: 36833399 PMCID: PMC9957247 DOI: 10.3390/genes14020472] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 02/08/2023] [Accepted: 02/09/2023] [Indexed: 02/15/2023] Open
Abstract
Copy number variation (CNV) has been widely used to study the evolution of different species. We first discovered different CNVs in 24 Anqingliubai pigs and 6 Asian wild boars using next-generation sequencing at the whole-genome level with 10× depth to understand the relationship between genetic evolution and production traits in wild boars and domestic pigs. A total of 97,489 CNVs were identified and divided into 10,429 copy number variation regions (CNVRs), occupying 32.06% of the porcine genome. Chromosome 1 had the most CNVRs, and chromosome 18 had the least. Ninety-six CNVRs were selected using VST 1% based on the signatures of all CNVRs, and sixty-five genes were identified in the selected regions. These genes were strongly correlated with traits distinguishing groups by enrichment in Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathways, such as growth (CD36), reproduction (CIT, RLN), detoxification (CYP3A29), and fatty acid metabolism (ELOVL6). The QTL overlapping regions were associated with meat traits, growth, and immunity, which was consistent with CNV analysis. Our findings increase the understanding of evolved genome structural variations between wild boars and domestic pigs, and provide new molecular biomarkers to guide breeding and the efficient use of available genetic resources.
Collapse
Affiliation(s)
- Shuhao Fan
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China
| | - Chengcheng Kong
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China
- School of Pharmacy, Anhui University of Chinese Medicine, Hefei 230036, China
| | - Yige Chen
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China
| | - Xianrui Zheng
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China
| | - Ren Zhou
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China
| | - Xiaodong Zhang
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China
| | - Xudong Wu
- Key Laboratory of Pig Molecular Quantitative Genetics of Anhui Academy of Agricultural Sciences, Anhui Provincial Key Laboratory of Livestock and Poultry Product Safety Engineering, Institute of Animal Husbandry and Veterinary Medicine, Anhui Academy of Agricultural Sciences, Hefei 230031, China
| | - Wei Zhang
- Key Laboratory of Pig Molecular Quantitative Genetics of Anhui Academy of Agricultural Sciences, Anhui Provincial Key Laboratory of Livestock and Poultry Product Safety Engineering, Institute of Animal Husbandry and Veterinary Medicine, Anhui Academy of Agricultural Sciences, Hefei 230031, China
| | - Yueyun Ding
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China
| | - Zongjun Yin
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China
| |
Collapse
|
3
|
Peripolli E, Stafuzza NB, Machado MA, do Carmo Panetto JC, do Egito AA, Baldi F, da Silva MVGB. Assessment of copy number variants in three Brazilian locally adapted cattle breeds using whole-genome re-sequencing data. Anim Genet 2023; 54:254-270. [PMID: 36740987 DOI: 10.1111/age.13298] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 12/13/2021] [Accepted: 01/13/2023] [Indexed: 02/07/2023]
Abstract
Further characterization of genetic structural variations should strongly focus on small and endangered local breeds given their role in unraveling genes and structural variants underlying selective pressures and phenotype variation. A comprehensive genome-wide assessment of copy number variations (CNVs) based on whole-genome re-sequencing data was performed on three Brazilian locally adapted cattle breeds (Caracu Caldeano, Crioulo Lageano, and Pantaneiro) using the ARS-UCD1.2 genome assembly. Data from 36 individuals with an average coverage depth of 14.07× per individual was used. A total of 24 945 CNVs were identified distributed among the breeds (Caracu Caldeano = 7285, Crioulo Lageano = 7297, and Pantaneiro = 10 363). Deletion events were 1.75-2.07-fold higher than duplications, and the total length of CNVs is composed mostly of a high number of segments between 10 and 30 kb. CNV regions (CNVRs) are not uniformly scattered throughout the genomes (n = 463), and 105 CNVRs were found overlapping among the studied breeds. Functional annotation of the CNVRs revealed variants with high consequence on protein sequence harboring relevant genes, in which we highlighted the BOLA-DQB, BOLA-DQA5, CD1A, β-defensins, PRG3, and ULBP21 genes. Enrichment analysis based on the gene list retrieved from the CNVRs disclosed over-represented terms (p < 0.01) strongly associated with immunity and cattle resilience to harsh environments. Additionally, QTL associated with body conformation and dairy-related traits were also unveiled within the CNVRs. These results provide better understanding of the selective forces shaping the genome of such cattle breeds and identify traces of natural selection pressures by which these populations have been exposed to challenging environmental conditions.
Collapse
Affiliation(s)
- Elisa Peripolli
- School of Agricultural and Veterinarian Sciences, São Paulo State University (Unesp), Jaboticabal, Brazil
| | | | | | | | | | - Fernando Baldi
- School of Agricultural and Veterinarian Sciences, São Paulo State University (Unesp), Jaboticabal, Brazil
| | | |
Collapse
|
4
|
Smith G, Manzano-Marín A, Reyes-Prieto M, Antunes CSR, Ashworth V, Goselle ON, Jan AAA, Moya A, Latorre A, Perotti MA, Braig HR. Human follicular mites: Ectoparasites becoming symbionts. Mol Biol Evol 2022; 39:msac125. [PMID: 35724423 PMCID: PMC9218549 DOI: 10.1093/molbev/msac125] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Revised: 05/23/2022] [Accepted: 05/31/2022] [Indexed: 12/13/2022] Open
Abstract
Most humans carry mites in the hair follicles of their skin for their entire lives. Follicular mites are the only metazoans tha continuously live on humans. We propose that Demodex folliculorum (Acari) represents a transitional stage from a host-injuring obligate parasite to an obligate symbiont. Here, we describe the profound impact of this transition on the genome and physiology of the mite. Genome sequencing revealed that the permanent host association of D. folliculorum led to an extensive genome reduction through relaxed selection and genetic drift, resulting in the smallest number of protein-coding genes yet identified among panarthropods. Confocal microscopy revealed that this gene loss coincided with an extreme reduction in the number of cells. Single uninucleate muscle cells are sufficient to operate each of the three segments that form each walking leg. While it has been assumed that the reduction of the cell number in parasites starts early in development, we identified a greater total number of cells in the last developmental stage (nymph) than in the terminal adult stage, suggesting that reduction starts at the adult or ultimate stage of development. This is the first evolutionary step in an arthropod species adopting a reductive, parasitic or endosymbiotic lifestyle. Somatic nuclei show underreplication at the diploid stage. Novel eye structures or photoreceptors as well as a unique human host melatonin-guided day/night rhythm are proposed for the first time. The loss of DNA repair genes coupled with extreme endogamy might have set this mite species on an evolutionary dead-end trajectory.
Collapse
Affiliation(s)
- Gilbert Smith
- School of Natural Sciences, Bangor University, Bangor, Wales, United Kingdom
| | - Alejandro Manzano-Marín
- Centre for Microbiology and Environmental Systems Science (CMESS), University of Vienna, Vienna, Austria
| | - Mariana Reyes-Prieto
- Institute of Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), València, Spain
- Foundation for the Promotion of Health and Biomedical Research of the Valencian Community (FISABIO), València, Spain
| | | | - Victoria Ashworth
- School of Natural Sciences, Bangor University, Bangor, Wales, United Kingdom
| | - Obed Nanjul Goselle
- School of Natural Sciences, Bangor University, Bangor, Wales, United Kingdom
| | | | - Andrés Moya
- Institute of Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), València, Spain
- Foundation for the Promotion of Health and Biomedical Research of the Valencian Community (FISABIO), València, Spain
- Center for Networked Biomedical Research in Epidemiology and Public Health (CIBEResp), Madrid, Spain
| | - Amparo Latorre
- Institute of Integrative Systems Biology (I2Sysbio), Universitat de València and Spanish Research Council (CSIC), València, Spain
- Foundation for the Promotion of Health and Biomedical Research of the Valencian Community (FISABIO), València, Spain
- Center for Networked Biomedical Research in Epidemiology and Public Health (CIBEResp), Madrid, Spain
| | - M Alejandra Perotti
- School of Biological Sciences, University of Reading, Reading, United Kingdom
| | - Henk R Braig
- School of Natural Sciences, Bangor University, Bangor, Wales, United Kingdom
- Institute and Museum of Natural Sciences, National University of San Juan, San Juan, Argentina
| |
Collapse
|
5
|
Kava R, Peripolli E, Berton MP, Lemos M, Lobo RB, Stafuzza NB, Pereira AS, Baldi F. Genome-wide structural variations in Brazilian Senepol cattle, a tropically adapted taurine breed. Livest Sci 2021. [DOI: 10.1016/j.livsci.2021.104708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
6
|
Karn RC, Yazdanifar G, Pezer Ž, Boursot P, Laukaitis CM. Androgen-Binding Protein (Abp) Evolutionary History: Has Positive Selection Caused Fixation of Different Paralogs in Different Taxa of the Genus Mus? Genome Biol Evol 2021; 13:6377336. [PMID: 34581786 PMCID: PMC8525912 DOI: 10.1093/gbe/evab220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/20/2021] [Indexed: 11/14/2022] Open
Abstract
Comparison of the androgen-binding protein (Abp) gene regions of six Mus genomes provides insights into the evolutionary history of this large murid rodent gene family. We identified 206 unique Abp sequences and mapped their physical relationships. At least 48 are duplicated and thus present in more than two identical copies. All six taxa have substantially elevated LINE1 densities in Abp regions compared with flanking regions, similar to levels in mouse and rat genomes, although nonallelic homologous recombination seems to have only occurred in Mus musculus domesticus. Phylogenetic and structural relationships support the hypothesis that the extensive Abp expansion began in an ancestor of the genus Mus. We also found duplicated Abpa27's in two taxa, suggesting that previously reported selection on a27 alleles may have actually detected selection on haplotypes wherein different paralogs were lost in each. Other studies reported that a27 gene and species trees were incongruent, likely because of homoplasy. However, L1MC3 phylogenies, supposed to be homoplasy-free compared with coding regions, support our paralog hypothesis because the L1MC3 phylogeny was congruent with the a27 topology. This paralog hypothesis provides an alternative explanation for the origin of the a27 gene that is suggested to be fixed in the three different subspecies of Mus musculus and to mediate sexual selection and incipient reinforcement between at least two of them. Finally, we ask why there are so many Abp genes, especially given the high frequency of pseudogenes and suggest that relaxed selection operates over a large part of the gene clusters.
Collapse
Affiliation(s)
- Robert C Karn
- Gene Networks in Neural and Developmental Plasticity, Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA
| | | | - Željka Pezer
- Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Pierre Boursot
- Institut des Sciences de l'Evolution Montpellier, Université de Montpellier, CNRS, IRD, France
| | - Christina M Laukaitis
- Carle Health and Carle Illinois College of Medicine, University of Illinois, Urbana-Champaign, USA
| |
Collapse
|
7
|
Hollox EJ, Zuccherato LW, Tucci S. Genome structural variation in human evolution. Trends Genet 2021; 38:45-58. [PMID: 34284881 DOI: 10.1016/j.tig.2021.06.015] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 06/21/2021] [Accepted: 06/22/2021] [Indexed: 01/01/2023]
Abstract
Structural variation (SV) is a large difference (typically >100 bp) in the genomic structure of two genomes and includes both copy number variation and variation that does not change copy number of a genomic region, such as an inversion. Improved reference genomes, combined with widespread genome sequencing using short-read sequencing technology, and increasingly using long-read sequencing, have reignited interest in SV. Recent large-scale studies and functional focused analyses have highlighted the role of SV in human evolution. In this review, we highlight human-specific SVs involved in changes in the brain, population-specific SVs that affect response to the environment, including adaptation to diet and infectious diseases, and summarise the contribution of archaic hominin admixture to present-day human SV.
Collapse
Affiliation(s)
- Edward J Hollox
- Department of Genetics and Genome Biology, University of Leicester, UK.
| | - Luciana W Zuccherato
- Núcleo de Ensino e Pesquisa, Instituto Mário Penna, Belo Horizonte, Brazil; Departmento de Bioquímica e Imunologia, Universidade de Minas Gerais, Belo Horizonte, Brazil
| | - Serena Tucci
- Department of Anthropology, Yale University, New Haven, CT, USA
| |
Collapse
|
8
|
Abdullaev ET, Umarova IR, Arndt PF. Modelling segmental duplications in the human genome. BMC Genomics 2021; 22:496. [PMID: 34215180 PMCID: PMC8254307 DOI: 10.1186/s12864-021-07789-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Accepted: 06/10/2021] [Indexed: 11/22/2022] Open
Abstract
Background Segmental duplications (SDs) are long DNA sequences that are repeated in a genome and have high sequence identity. In contrast to repetitive elements they are often unique and only sometimes have multiple copies in a genome. There are several well-studied mechanisms responsible for segmental duplications: non-allelic homologous recombination, non-homologous end joining and replication slippage. Such duplications play an important role in evolution, however, we do not have a full understanding of the dynamic properties of the duplication process. Results We study segmental duplications through a graph representation where nodes represent genomic regions and edges represent duplications between them. The resulting network (the SD network) is quite complex and has distinct features which allow us to make inference on the evolution of segmantal duplications. We come up with the network growth model that explains features of the SD network thus giving us insights on dynamics of segmental duplications in the human genome. Based on our analysis of genomes of other species the network growth model seems to be applicable for multiple mammalian genomes. Conclusions Our analysis suggests that duplication rates of genomic loci grow linearly with the number of copies of a duplicated region. Several scenarios explaining such a preferential duplication rates were suggested. Supplementary Information The online version contains supplementary material available at (10.1186/s12864-021-07789-7).
Collapse
Affiliation(s)
- Eldar T Abdullaev
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestraße 63/73, Berlin, 14195, Germany.
| | - Iren R Umarova
- Faculty of Computational Mathematics and Cybernetics, Moscow State University, Leninskiye Gory 1-52, Moscow, 119991, Russia
| | - Peter F Arndt
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestraße 63/73, Berlin, 14195, Germany
| |
Collapse
|
9
|
Balzano E, Pelliccia F, Giunta S. Genome (in)stability at tandem repeats. Semin Cell Dev Biol 2020; 113:97-112. [PMID: 33109442 DOI: 10.1016/j.semcdb.2020.10.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 09/26/2020] [Accepted: 10/10/2020] [Indexed: 12/12/2022]
Abstract
Repeat sequences account for over half of the human genome and represent a significant source of variation that underlies physiological and pathological states. Yet, their study has been hindered due to limitations in short-reads sequencing technology and difficulties in assembly. A important category of repetitive DNA in the human genome is comprised of tandem repeats (TRs), where repetitive units are arranged in a head-to-tail pattern. Compared to other regions of the genome, TRs carry between 10 and 10,000 fold higher mutation rate. There are several mutagenic mechanisms that can give rise to this propensity toward instability, but their precise contribution remains speculative. Given the high degree of homology between these sequences and their arrangement in tandem, once damaged, TRs have an intrinsic propensity to undergo aberrant recombination with non-allelic exchange and generate harmful rearrangements that may undermine the stability of the entire genome. The dynamic mutagenesis at TRs has been found to underlie individual polymorphism associated with neurodegenerative and neuromuscular disorders, as well as complex genetic diseases like cancer and diabetes. Here, we review our current understanding of the surveillance and repair mechanisms operating within these regions, and we describe how alterations in these protective processes can readily trigger mutational signatures found at TRs, ultimately resulting in the pathological correlation between TRs instability and human diseases. Finally, we provide a viewpoint to counter the detrimental effects that TRs pose in light of their selection and conservation, as important drivers of human evolution.
Collapse
Affiliation(s)
- Elisa Balzano
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, 00185 Roma, Italy
| | - Franca Pelliccia
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, 00185 Roma, Italy
| | - Simona Giunta
- The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA; Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, 00185 Roma, Italy.
| |
Collapse
|
10
|
Systematic identification of genetic systems associated with phenotypes in patients with rare genomic copy number variations. Hum Genet 2020; 140:457-475. [PMID: 32778951 DOI: 10.1007/s00439-020-02214-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Accepted: 07/30/2020] [Indexed: 01/02/2023]
Abstract
Copy number variation (CNV) related disorders tend to show complex phenotypic profiles that do not match known diseases. This makes it difficult to ascertain their underlying molecular basis. A potential solution is to compare the affected genomic regions for multiple patients that share a pathological phenotype, looking for commonalities. Here, we present a novel approach to associate phenotypes with functional systems, in terms of GO categories and KEGG and Reactome pathways, based on patient data. The approach uses genomic and phenomic data from the same patients, finding shared genomic regions between patients with similar phenotypes. These regions are mapped to genes to find associated functional systems. We applied the approach to analyse patients in the DECIPHER database with de novo CNVs, finding functional systems associated with most phenotypes, often due to mutations affecting related genes in the same genomic region. Manual inspection of the ten top-scoring phenotypes found multiple FunSys connections supported by the previous studies for seven of them. The workflow also produces reports focussed on the genes and FunSys connected to the different phenotypes, alongside patient-specific reports, which give details of the associated genes and FunSys for each individual in the cohort. These can be run in "confidential" mode, preserving patient confidentiality. The workflow presented here can be used to associate phenotypes with functional systems using data at the level of a whole cohort of patients, identifying important connections that could not be found when considering them individually. The full workflow is available for download, enabling it to be run on any patient cohort for which phenotypic and CNV data are available.
Collapse
|
11
|
James WPT, Johnson RJ, Speakman JR, Wallace DC, Frühbeck G, Iversen PO, Stover PJ. Nutrition and its role in human evolution. J Intern Med 2019; 285:533-549. [PMID: 30772945 DOI: 10.1111/joim.12878] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Our understanding of human evolution has improved rapidly over recent decades, facilitated by large-scale cataloguing of genomic variability amongst both modern and archaic humans. It seems clear that the evolution of the ancestors of chimpanzees and hominins separated 7-9 million years ago with some migration out of Africa by the earlier hominins; Homo sapiens slowly emerged as climate change resulted in drier, less forested African conditions. The African populations expanded and evolved in many different conditions with slow mutation and selection rates in the human genome, but with much more rapid mutation occurring in mitochondrial DNA. We now have evidence stretching back 300 000 years of humans in their current form, but there are clearly four very different large African language groups that correlate with population DNA differences. Then, about 50 000-100 000 years ago a small subset of modern humans also migrated out of Africa resulting in a persistent signature of more limited genetic diversity amongst non-African populations. Hybridization with archaic hominins occurred around this time such that all non-African modern humans possess some Neanderthal ancestry and Melanesian populations additionally possess some Denisovan ancestry. Human populations both within and outside Africa also adapted to diverse aspects of their local environment including altitude, climate, UV exposure, diet and pathogens, in some cases leaving clear signatures of patterns of genetic variation. Notable examples include haemoglobin changes conferring resistance to malaria, other immune changes and the skin adaptations favouring the synthesis of vitamin D. As humans migrated across Eurasia, further major mitochondrial changes occurred with some interbreeding with ancient hominins and the development of alcohol intolerance. More recently, an ability to retain lactase persistence into adulthood has evolved rapidly under the environmental stimulus of pastoralism with the ability to husband lactating ruminants. Increased amylase copy numbers seem to relate to the availability of starchy foods, whereas the capacity to desaturase and elongate monounsaturated fatty acids in different societies seems to be influenced by whether there is a lack of supply of readily available dietary sources of long-chain polyunsaturated fatty acids. The process of human evolution includes genetic drift and adaptation to local environments, in part through changes in mitochondrial and nuclear DNA. These genetic changes may underlie susceptibilities to some modern human pathologies including folate-responsive neural tube defects, diabetes, other age-related pathologies and mental health disorders.
Collapse
Affiliation(s)
- W P T James
- London School of Hygiene and Tropical Medicine, London, UK
| | - R J Johnson
- Division of Renal Diseases and Hypertension, University of Colorado, Denver, CO, USA
| | - J R Speakman
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - D C Wallace
- Mitochondrial and Epigenomic Medicine, Children's Hospital of Philadelphia Research Institute, Philadelphia, PA, USA
| | - G Frühbeck
- Endocrinology and Nutrition, Clinica Universidad de Navarra, Pamplona, Spain
| | - P O Iversen
- Department of Nutrition, University of Oslo, Oslo, Norway
| | - P J Stover
- Vice Chancellor and Dean for Agriculture and Life Sciences, Texas A&M AgriLife, College Station, TX, USA
| |
Collapse
|
12
|
Lin YL, Gokcumen O. Fine-Scale Characterization of Genomic Structural Variation in the Human Genome Reveals Adaptive and Biomedically Relevant Hotspots. Genome Biol Evol 2019; 11:1136-1151. [PMID: 30887040 PMCID: PMC6475128 DOI: 10.1093/gbe/evz058] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/16/2019] [Indexed: 12/25/2022] Open
Abstract
Genomic structural variants (SVs) are distributed nonrandomly across the human genome. The "hotspots" of SVs have been implicated in evolutionary innovations, as well as medical conditions. However, the evolutionary and biomedical features of these hotspots remain incompletely understood. Here, we analyzed data from 2,504 genomes to construct a refined map of 1,148 SV hotspots in human genomes. We confirmed that segmental duplication-related nonallelic homologous recombination is an important mechanistic driver of SV hotspot formation. However, to our surprise, we also found that a majority of SVs in hotspots do not form through such recombination-based mechanisms, suggesting diverse mechanistic and selective forces shaping hotspots. Indeed, our evolutionary analyses showed that the majority of SV hotspots are within gene-poor regions and evolve under relaxed negative selection or neutrality. However, we still found a small subset of SV hotspots harboring genes that are enriched for anthropologically crucial functions and evolve under geography-specific and balancing adaptive forces. These include two independent hotspots on different chromosomes affecting alpha and beta hemoglobin gene clusters. Biomedically, we found that the SV hotspots coincide with breakpoints of clinically relevant, large de novo SVs, significantly more often than genome-wide expectations. For example, we showed that the breakpoints of multiple large SVs, which lead to idiopathic short stature, coincide with SV hotspots. Therefore, the mutational instability in SV hotpots likely enables chromosomal breaks that lead to pathogenic structural variation formations. Overall, our study contributes to a better understanding of the mutational and adaptive landscape of the genome.
Collapse
Affiliation(s)
- Yen-Lung Lin
- Department of Biological Sciences, University at Buffalo
| | - Omer Gokcumen
- Department of Biological Sciences, University at Buffalo
- Corresponding author: E-mail: or
| |
Collapse
|
13
|
Characterization and evolutionary dynamics of complex regions in eukaryotic genomes. SCIENCE CHINA-LIFE SCIENCES 2019; 62:467-488. [PMID: 30810961 DOI: 10.1007/s11427-018-9458-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Accepted: 11/05/2018] [Indexed: 01/07/2023]
Abstract
Complex regions in eukaryotic genomes are typically characterized by duplications of chromosomal stretches that often include one or more genes repeated in a tandem array or in relatively close proximity. Nevertheless, the repetitive nature of these regions, together with the often high sequence identity among repeats, have made complex regions particularly recalcitrant to proper molecular characterization, often being misassembled or completely absent in genome assemblies. This limitation has prevented accurate functional and evolutionary analyses of these regions. This is becoming increasingly relevant as evidence continues to support a central role for complex genomic regions in explaining human disease, developmental innovations, and ecological adaptations across phyla. With the advent of long-read sequencing technologies and suitable assemblers, the development of algorithms that can accommodate sample heterozygosity, and the adoption of a pangenomic-like view of these regions, accurate reconstructions of complex regions are now within reach. These reconstructions will finally allow for accurate functional and evolutionary studies of complex genomic regions, underlying the generation of genotype-phenotype maps of unprecedented resolution.
Collapse
|
14
|
Pezer Ž, Chung AG, Karn RC, Laukaitis CM. Analysis of Copy Number Variation in the Abp Gene Regions of Two House Mouse Subspecies Suggests Divergence during the Gene Family Expansions. Genome Biol Evol 2018; 9:3858091. [PMID: 28575204 PMCID: PMC5513543 DOI: 10.1093/gbe/evx099] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/26/2017] [Indexed: 12/26/2022] Open
Abstract
The Androgen-binding protein (Abp) gene region of the mouse genome contains 64 genes, some encoding pheromones that influence assortative mating between mice from different subspecies. Using CNVnator and quantitative PCR, we explored copy number variation in this gene family in natural populations of Mus musculus domesticus (Mmd) and Mus musculus musculus (Mmm), two subspecies of house mice that form a narrow hybrid zone in Central Europe. We found that copy number variation in the center of the Abp gene region is very common in wild Mmd, primarily representing the presence/absence of the final duplications described for the mouse genome. Clustering of Mmd individuals based on this variation did not reflect their geographical origin, suggesting no population divergence in the Abp gene cluster. However, copy number variation patterns differ substantially between Mmd and other mouse taxa. Large blocks of Abp genes are absent in Mmm, Mus musculus castaneus and an outgroup, Mus spretus, although with differences in variation and breakpoint locations. Our analysis calls into question the reliance on a reference genome for interpreting the detailed organization of genes in taxa more distant from the Mmd reference genome. The polymorphic nature of the gene family expansion in all four taxa suggests that the number of Abp genes, especially in the central gene region, is not critical to the survival and reproduction of the mouse. However, Abp haplotypes of variable length may serve as a source of raw genetic material for new signals influencing reproductive communication and thus speciation of mice.
Collapse
Affiliation(s)
- Željka Pezer
- Max Planck Institute for Evolutionary Biology, Plön, Germany.,Ruđer Bošković Institute, Zagreb, Croatia
| | - Amanda G Chung
- Department of Medicine, College of Medicine, University of Arizona
| | - Robert C Karn
- Department of Medicine, College of Medicine, University of Arizona
| | | |
Collapse
|
15
|
Dosage sensitivity is a major determinant of human copy number variant pathogenicity. Nat Commun 2017; 8:14366. [PMID: 28176757 PMCID: PMC5309798 DOI: 10.1038/ncomms14366] [Citation(s) in RCA: 83] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Accepted: 12/20/2016] [Indexed: 01/22/2023] Open
Abstract
Human copy number variants (CNVs) account for genome variation an order of magnitude larger than single-nucleotide polymorphisms. Although much of this variation has no phenotypic consequences, some variants have been associated with disease, in particular neurodevelopmental disorders. Pathogenic CNVs are typically very large and contain multiple genes, and understanding the cause of the pathogenicity remains a major challenge. Here we show that pathogenic CNVs are significantly enriched for genes involved in development and genes that have greater evolutionary copy number conservation across mammals, indicative of functional constraints. Conversely, genes found in benign CNV regions have more variable copy number. These evolutionary constraints are characteristic of genes in pathogenic CNVs and can only be explained by dosage sensitivity of those genes. These results implicate dosage sensitivity of individual genes as a common cause of CNV pathogenicity. These evolutionary metrics suggest a path to identifying disease genes in pathogenic CNVs. Copy number variants (CNVs) cause significant genomic variation in humans and may be benign or may cause disease. Here, the authors show that pathogenic CNVs are evolutionarily constrained compared with benign, pointing to dosage sensitivity as a potential cause of disease.
Collapse
|
16
|
Keel BN, Keele JW, Snelling WM. Genome-wide copy number variation in the bovine genome detected using low coverage sequence of popular beef breeds,. Anim Genet 2016; 48:141-150. [DOI: 10.1111/age.12519] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/27/2016] [Indexed: 12/19/2022]
Affiliation(s)
- B. N. Keel
- USDA; ARS; U.S. Meat Animal Research Center; Clay Center NE 68933 USA
| | - J. W. Keele
- USDA; ARS; U.S. Meat Animal Research Center; Clay Center NE 68933 USA
| | - W. M. Snelling
- USDA; ARS; U.S. Meat Animal Research Center; Clay Center NE 68933 USA
| |
Collapse
|
17
|
Macé A, Tuke MA, Beckmann JS, Lin L, Jacquemont S, Weedon MN, Reymond A, Kutalik Z. New quality measure for SNP array based CNV detection. Bioinformatics 2016; 32:3298-3305. [PMID: 27402902 DOI: 10.1093/bioinformatics/btw477] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2016] [Accepted: 07/03/2016] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION Only a few large systematic studies have evaluated the impact of copy number variants (CNVs) on common diseases. Several million individuals have been genotyped on single nucleotide variation arrays, which could be used for genome-wide CNVs association studies. However, CNV calls remain prone to false positives and only empirical filtering strategies exist in the literature. To overcome this issue, we defined a new quality score (QS) estimating the probability of a CNV called by PennCNV to be confirmed by other software. RESULTS Out-of-sample comparison showed that the correlation between the consensus CNV status and the QS is twice as high as it is for any previously proposed CNV filters. ROC curves displayed an AUC higher than 0.8 and simulations showed an increase up to 20% in statistical power when using QS in comparison to other filtering strategies. Superior performance was confirmed also for alternative consensus CNV definition and through improving known CNV-trait associations. AVAILABILITY AND IMPLEMENTATION http://goo.gl/T6yuFM CONTACT: zoltan.kutalik@unil.ch or aurelien@mace@unil.chSupplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- A Macé
- Institute of Social and Preventive Medicine, University Hospital of Lausanne, Lausanne, Switzerland Department of Computational Biology, University of Lausanne, Lausanne, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - M A Tuke
- Genetics of Complex Traits, University of Exeter Medical School, University of Exeter, Exeter, UK
| | - J S Beckmann
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - L Lin
- Division of Cardiology, Geneva University Hospital, Geneva, Switzerland
| | - S Jacquemont
- Service de Génétique Médicale, Centre Universitaire Hospitalier Vaudois, Lausanne, Switzerland
| | - M N Weedon
- Genetics of Complex Traits, University of Exeter Medical School, University of Exeter, Exeter, UK
| | - A Reymond
- Center for Integrative Genomics, University for Lausanne, Lausanne, Switzerland
| | - Z Kutalik
- Institute of Social and Preventive Medicine, University Hospital of Lausanne, Lausanne, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
18
|
Jenkins GM, Goddard ME, Black MA, Brauning R, Auvray B, Dodds KG, Kijas JW, Cockett N, McEwan JC. Copy number variants in the sheep genome detected using multiple approaches. BMC Genomics 2016; 17:441. [PMID: 27277319 PMCID: PMC4898393 DOI: 10.1186/s12864-016-2754-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2015] [Accepted: 05/19/2016] [Indexed: 02/07/2023] Open
Abstract
Background Copy number variants (CNVs) are a type of polymorphism found to underlie phenotypic variation, both in humans and livestock. Most surveys of CNV in livestock have been conducted in the cattle genome, and often utilise only a single approach for the detection of copy number differences. Here we performed a study of CNV in sheep, using multiple methods to identify and characterise copy number changes. Comprehensive information from small pedigrees (trios) was collected using multiple platforms (array CGH, SNP chip and whole genome sequence data), with these data then analysed via multiple approaches to identify and verify CNVs. Results In total, 3,488 autosomal CNV regions (CNVRs) were identified in this study, which substantially builds on an initial survey of the sheep genome that identified 135 CNVRs. The average length of the identified CNVRs was 19 kb (range of 1 kb to 3.6 Mb), with shorter CNVRs being more frequent than longer CNVRs. The total length of all CNVRs was 67.6Mbps, which equates to 2.7 % of the sheep autosomes. For individuals this value ranged from 0.24 to 0.55 %, and the majority of CNVRs were identified in single animals. Rather than being uniformly distributed throughout the genome, CNVRs tended to be clustered. Application of three independent approaches for CNVR detection facilitated a comparison of validation rates. CNVs identified on the Roche-NimbleGen 2.1M CGH array generally had low validation rates with lower density arrays, while whole genome sequence data had the highest validation rate (>60 %). Conclusions This study represents the first comprehensive survey of the distribution, prevalence and characteristics of CNVR in sheep. Multiple approaches were used to detect CNV regions and it appears that the best method for verifying CNVR on a large scale involves using a combination of detection methodologies. The characteristics of the 3,488 autosomal CNV regions identified in this study are comparable to other CNV regions reported in the literature and provide a valuable and sizeable addition to the small subset of published sheep CNVs. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2754-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Gemma M Jenkins
- AbacusBio Limited, 442 Moray Place, PO Box 5585, Dunedin, 9058, New Zealand.
| | - Michael E Goddard
- Victorian Department of Economic Development, Jobs, Transport and Resources, Bundoora, VIC, 3083, Australia
| | - Michael A Black
- Department of Biochemistry, University of Otago, 710 Cumberland St, Dunedin, 9054, New Zealand
| | - Rudiger Brauning
- AgResearch, Invermay Agricultural Centre, PB 50034, Mosgiel, 9053, New Zealand
| | - Benoit Auvray
- Department of Biochemistry, University of Otago, 710 Cumberland St, Dunedin, 9054, New Zealand
| | - Ken G Dodds
- AgResearch, Invermay Agricultural Centre, PB 50034, Mosgiel, 9053, New Zealand
| | - James W Kijas
- CSIRO Animal, Food and Health Sciences, Queensland Bioscience Precinct, 306 Carmody Road, St Lucia, QLD 4067, Australia
| | - Noelle Cockett
- Utah State University, 1435 Old Main Hill, Logan, UT, 84322-1435-1435, USA
| | - John C McEwan
- AgResearch, Invermay Agricultural Centre, PB 50034, Mosgiel, 9053, New Zealand
| |
Collapse
|
19
|
Urnikyte A, Domarkiene I, Stoma S, Ambrozaityte L, Uktveryte I, Meskiene R, Kasiulevičius V, Burokiene N, Kučinskas V. CNV analysis in the Lithuanian population. BMC Genet 2016; 17:64. [PMID: 27142071 PMCID: PMC4855864 DOI: 10.1186/s12863-016-0373-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Accepted: 04/22/2016] [Indexed: 12/13/2022] Open
Abstract
Background Although copy number variation (CNV) has received much attention, knowledge about the characteristics of CNVs such as occurrence rate and distribution in the genome between populations and within the same population is still insufficient. In this study, Illumina 770 K HumanOmniExpress-12 v1.0 (and v1.1) arrays were used to examine the diversity and distribution of CNVs in 286 unrelated individuals from the two main ethnolinguistic groups of the Lithuanian population (Aukštaičiai and Žemaičiai) (see Additional file 3). For primary data analysis, the Illumina GenomeStudio™ Genotyping Module v1.9 and two algorithms, cnvPartition 3.2.0 and QuantiSNP 2.0, were used to identify high-confidence CNVs. Results A total of 478 autosomal CNVs were detected by both algorithms, and those were clustered in 87 copy number variation regions (CNVRs), spanning ~12.5 Mb of the genome (see Table 1). At least 8.6 % of the CNVRs were unique and had not been reported in the Database of Genomic Variants. Most CNVRs (57.5 %) were rare, with a frequency of <1 %, whereas common CNVRs with at least 5 % frequency made up only 1.1 % of all CNVRs identified. About 49 % of non-singleton CNVRs were shared between Aukštaičiai and Žemaičiai, and the remaining CNVRs were specific to each group. Many of the CNVs detected (66 %) overlapped with known UCSC gene regions. Conclusions The ethnolinguistic groups of the Lithuanian population could not be differentiated based on CNV profiles, which may reflect their geographical proximity and suggest the homogeneity of the Lithuanian population. In addition, putative novel CNVs unique to the Lithuanian population were identified. The results of our study enhance the CNV map of the Lithuanian population. Electronic supplementary material The online version of this article (doi:10.1186/s12863-016-0373-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- A Urnikyte
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania.
| | - I Domarkiene
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - S Stoma
- Master of Science (MSc), Bioinformatics student, VU University Amsterdam, Amsterdam, Netherlands
| | - L Ambrozaityte
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - I Uktveryte
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - R Meskiene
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - V Kasiulevičius
- Clinics of Internal Diseases, Family Medicine and Oncology, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - N Burokiene
- Clinics of Internal Diseases, Family Medicine and Oncology, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| | - V Kučinskas
- Department of Human and Medical Genetics, Faculty of Medicine, Vilnius University, Santariskiu St. 2, LT-08661, Vilnius, Lithuania
| |
Collapse
|
20
|
Li YR, Li J, Zhao SD, Bradfield JP, Mentch FD, Maggadottir SM, Hou C, Abrams DJ, Chang D, Gao F, Guo Y, Wei Z, Connolly JJ, Cardinale CJ, Bakay M, Glessner JT, Li D, Kao C, Thomas KA, Qiu H, Chiavacci RM, Kim CE, Wang F, Snyder J, Richie MD, Flatø B, Førre Ø, Denson LA, Thompson SD, Becker ML, Guthery SL, Latiano A, Perez E, Resnick E, Russell RK, Wilson DC, Silverberg MS, Annese V, Lie BA, Punaro M, Dubinsky MC, Monos DS, Strisciuglio C, Staiano A, Miele E, Kugathasan S, Ellis JA, Munro JE, Sullivan KE, Wise CA, Chapel H, Cunningham-Rundles C, Grant SFA, Orange JS, Sleiman PMA, Behrens EM, Griffiths AM, Satsangi J, Finkel TH, Keinan A, Prak ETL, Polychronakos C, Baldassano RN, Li H, Keating BJ, Hakonarson H. Meta-analysis of shared genetic architecture across ten pediatric autoimmune diseases. Nat Med 2015; 21:1018-27. [PMID: 26301688 PMCID: PMC4863040 DOI: 10.1038/nm.3933] [Citation(s) in RCA: 166] [Impact Index Per Article: 18.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2015] [Accepted: 07/23/2015] [Indexed: 12/15/2022]
Abstract
Genome-wide association studies (GWASs) have identified hundreds of susceptibility genes, including shared associations across clinically distinct autoimmune diseases. We performed an inverse χ(2) meta-analysis across ten pediatric-age-of-onset autoimmune diseases (pAIDs) in a case-control study including more than 6,035 cases and 10,718 shared population-based controls. We identified 27 genome-wide significant loci associated with one or more pAIDs, mapping to in silico-replicated autoimmune-associated genes (including IL2RA) and new candidate loci with established immunoregulatory functions such as ADGRL2, TENM3, ANKRD30A, ADCY7 and CD40LG. The pAID-associated single-nucleotide polymorphisms (SNPs) were functionally enriched for deoxyribonuclease (DNase)-hypersensitivity sites, expression quantitative trait loci (eQTLs), microRNA (miRNA)-binding sites and coding variants. We also identified biologically correlated, pAID-associated candidate gene sets on the basis of immune cell expression profiling and found evidence of genetic sharing. Network and protein-interaction analyses demonstrated converging roles for the signaling pathways of type 1, 2 and 17 helper T cells (TH1, TH2 and TH17), JAK-STAT, interferon and interleukin in multiple autoimmune diseases.
Collapse
Affiliation(s)
- Yun R Li
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Medical Scientist Training Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Jin Li
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Sihai D Zhao
- Department of Biostatistics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Jonathan P Bradfield
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Frank D Mentch
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - S Melkorka Maggadottir
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Division of Allergy and Immunology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Cuiping Hou
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Debra J Abrams
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Diana Chang
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, USA
- Program in Computational Biology and Medicine, Cornell University, Ithaca, New York, USA
| | - Feng Gao
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, USA
| | - Yiran Guo
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Zhi Wei
- Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, USA
| | - John J Connolly
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Christopher J Cardinale
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Marina Bakay
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Joseph T Glessner
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Dong Li
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Charlly Kao
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Kelly A Thomas
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Haijun Qiu
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Rosetta M Chiavacci
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Cecilia E Kim
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Fengxiang Wang
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - James Snyder
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Marylyn D Richie
- Department of Biochemistry and Molecular Biology, Eberly College of Science, The Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Berit Flatø
- Department of Rheumatology, Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Øystein Førre
- Department of Rheumatology, Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Lee A Denson
- Division of Gastroenterology, The Center for Inflammatory Bowel Disease, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, USA
| | - Susan D Thompson
- Divison of Rheumatology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, USA
| | - Mara L Becker
- Division of Rheumatology, Children's Mercy Hospitals and Clinics, Kansas City, Missouri, USA
| | - Stephen L Guthery
- Department of Pediatrics, University of Utah School of Medicine and Primary Children's Medical Center, Salt Lake City, Utah, USA
| | - Anna Latiano
- Division of Gastroenterology, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo, Italy
| | - Elena Perez
- Division of Pediatric Allergy and Immunology, University of Miami Miller School of Medicine, Miami, Florida, USA
| | - Elena Resnick
- Institute of Immunology and Department of Medicine, Mount Sinai School of Medicine, New York, New York, USA
| | - Richard K Russell
- Department of Paediatric Gastroenterology, Yorkhill Hospital for Sick Children, Glasgow, Scotland, UK
| | - David C Wilson
- Paediatric Gastroenterology and Nutrition, Royal Hospital for Sick Children, University of Edinburgh, Ediburgh, UK
| | - Mark S Silverberg
- Mount Sinai Hospital IBD Centre, University of Toronto, Toronto, Ontario, Canada
| | - Vito Annese
- Unit of Gastroenterology, Department of Medical and Surgical Specialties, Careggi University Hospital, Florence, Italy
| | - Benedicte A Lie
- Department of Immunology, Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Marilynn Punaro
- Department of Rheumatology, Texas Scottish Rite Hospital for Children, Dallas, Texas, USA
| | - Marla C Dubinsky
- Department of Pediatrics, Pediatric IBD Center, Cedars Sinai Medical Center, Los Angeles, California, USA
| | - Dimitri S Monos
- Department of Pathology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Department of Pediatrics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Caterina Strisciuglio
- Department of Translational Medical Science, Section of Pediatrics, University of Naples Federico II, Naples, Italy
| | - Annamaria Staiano
- Department of Translational Medical Science, Section of Pediatrics, University of Naples Federico II, Naples, Italy
| | - Erasmo Miele
- Department of Translational Medical Science, Section of Pediatrics, University of Naples Federico II, Naples, Italy
| | - Subra Kugathasan
- Department of Pediatrics, Emory University School of Medicine and Children's Health Care of Atlanta, Atlanta, Georgia, USA
| | - Justine A Ellis
- Genes, Environment and Complex Disease, Murdoch Children's Research Institute, Parkville, Victoria, Australia
- Department of Pediatrics, University of Melbourne, Parkville, Victoria, Australia
| | - Jane E Munro
- Pediatric Rheumatology Unit, Royal Children's Hospital, Parkville, Victoria, Australia
- Arthritis and Rheumatology Research, Murdoch Children's Research Institute, Parkville, Victoria, Australia
| | - Kathleen E Sullivan
- Division of Allergy and Immunology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Department of Pediatrics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Carol A Wise
- Sarah M. and Charles E. Seay Center for Musculoskeletal Research, Texas Scottish Rite Hospital for Children, Dallas, Texas, USA
| | - Helen Chapel
- Department of Clinical Immunology, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | | | - Struan F A Grant
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Department of Pediatrics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Jordan S Orange
- Section of Immunology, Allergy, and Rheumatology, Department of Pediatric Medicine, Texas Children's Hospital, Houston, Texas, USA
| | - Patrick M A Sleiman
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Department of Pediatrics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Edward M Behrens
- Department of Pediatrics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Division of Rheumatology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Anne M Griffiths
- The Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada
| | - Jack Satsangi
- Gastrointestinal Unit, Division of Medical Sciences, School of Molecular and Clinical Medicine, University of Edinburgh, Edinburgh, UK
| | - Terri H Finkel
- Department of Pediatrics, Nemours Children's Hospital, Orlando, Florida, USA
| | - Alon Keinan
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, USA
- Program in Computational Biology and Medicine, Cornell University, Ithaca, New York, USA
| | - Eline T Luning Prak
- Department of Pathology and Lab Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Constantin Polychronakos
- Departments of Pediatrics and Human Genetics, McGill University Health Centre Research Institute, Montréal, Québec, Canada
| | - Robert N Baldassano
- Department of Pediatrics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Division of Gastroenterology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - Hongzhe Li
- Department of Pathology and Lab Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Brendan J Keating
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Department of Pediatrics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Hakon Hakonarson
- The Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- Department of Pediatrics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Division of Pulmonary Medicine, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| |
Collapse
|
21
|
Pezer Ž, Harr B, Teschke M, Babiker H, Tautz D. Divergence patterns of genic copy number variation in natural populations of the house mouse (Mus musculus domesticus) reveal three conserved genes with major population-specific expansions. Genome Res 2015; 25:1114-24. [PMID: 26149421 PMCID: PMC4509996 DOI: 10.1101/gr.187187.114] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Accepted: 06/05/2015] [Indexed: 11/29/2022]
Abstract
Copy number variation represents a major source of genetic divergence, yet the evolutionary dynamics of genic copy number variation in natural populations during differentiation and adaptation remain unclear. We applied a read depth approach to genome resequencing data to detect copy number variants (CNVs) ≥1 kb in wild-caught mice belonging to four populations of Mus musculus domesticus. We complemented the bioinformatics analyses with experimental validation using droplet digital PCR. The specific focus of our analysis is CNVs that include complete genes, as these CNVs could be expected to contribute most directly to evolutionary divergence. In total, 1863 transcription units appear to be completely encompassed within CNVs in at least one individual when compared to the reference assembly. Further, 179 of these CNVs show population-specific copy number differences, and 325 are subject to complete deletion in multiple individuals. Among the most copy-number variable genes are three highly conserved genes that encode the splicing factor CWC22, the spindle protein SFI1, and the Holliday junction recognition protein HJURP. These genes exhibit population-specific expansion patterns that suggest involvement in local adaptations. We found that genes that overlap with large segmental duplications are generally more copy-number variable. These genes encode proteins that are relevant for environmental and behavioral interactions, such as vomeronasal and olfactory receptors, as well as major urinary proteins and several proteins of unknown function. The overall analysis shows that genic CNVs contribute more to population differentiation in mice than in humans and may promote and speed up population divergence.
Collapse
Affiliation(s)
- Željka Pezer
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Bettina Harr
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Meike Teschke
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Hiba Babiker
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Diethard Tautz
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| |
Collapse
|
22
|
Paudel Y, Madsen O, Megens HJ, Frantz LAF, Bosse M, Crooijmans RPMA, Groenen MAM. Copy number variation in the speciation of pigs: a possible prominent role for olfactory receptors. BMC Genomics 2015; 16:330. [PMID: 25896665 PMCID: PMC4413995 DOI: 10.1186/s12864-015-1449-9] [Citation(s) in RCA: 66] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Accepted: 03/09/2015] [Indexed: 12/02/2022] Open
Abstract
Background Unraveling the genetic mechanisms associated with reduced gene flow between genetically differentiated populations is key to understand speciation. Different types of structural variations (SVs) have been found as a source of genetic diversity in a wide range of species. Previous studies provided detailed knowledge on the potential evolutionary role of SVs, especially copy number variations (CNVs), between well diverged species of e.g. primates. However, our understanding of their significance during ongoing speciation processes is limited due to the lack of CNV data from closely related species. The genus Sus (pig and its close relatives) which started to diverge ~4 Mya presents an excellent model for studying the role of CNVs during ongoing speciation. Results In this study, we identified 1408 CNV regions (CNVRs) across the genus Sus. These CNVRs encompass 624 genes and were found to evolve ~2.5 times faster than single nucleotide polymorphisms (SNPs). The majority of these copy number variable genes are olfactory receptors (ORs) known to play a prominent role in food foraging and mate recognition in Sus. Phylogenetic analyses, including novel Bayesian analysis, based on CNVRs that overlap ORs retain the well-accepted topology of the genus Sus whereas CNVRs overlapping genes other than ORs show evidence for random drift and/or admixture. Conclusion We hypothesize that inter-specific variation in copy number of ORs provided the means for rapid adaptation to different environments during the diversification of the genus Sus in the Pliocene. Furthermore, these regions might have acted as barriers preventing massive gene flow between these species during the multiple hybridization events that took place later in the Pleistocene suggesting a possible prominent role of ORs in the ongoing Sus speciation. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1449-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yogesh Paudel
- Animal Breeding and Genomics Centre, Wageningen University, 6700 AH, Wageningen, The Netherlands. .,Current address: Roche Pharma Research and Early Development, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, 4070, Basel, Switzerland.
| | - Ole Madsen
- Animal Breeding and Genomics Centre, Wageningen University, 6700 AH, Wageningen, The Netherlands.
| | - Hendrik-Jan Megens
- Animal Breeding and Genomics Centre, Wageningen University, 6700 AH, Wageningen, The Netherlands.
| | - Laurent A F Frantz
- Animal Breeding and Genomics Centre, Wageningen University, 6700 AH, Wageningen, The Netherlands.
| | - Mirte Bosse
- Animal Breeding and Genomics Centre, Wageningen University, 6700 AH, Wageningen, The Netherlands.
| | - Richard P M A Crooijmans
- Animal Breeding and Genomics Centre, Wageningen University, 6700 AH, Wageningen, The Netherlands.
| | - Martien A M Groenen
- Animal Breeding and Genomics Centre, Wageningen University, 6700 AH, Wageningen, The Netherlands.
| |
Collapse
|
23
|
Li D, Zhao H, Kranzler HR, Li MD, Jensen KP, Zayats T, Farrer LA, Gelernter J. Genome-wide association study of copy number variations (CNVs) with opioid dependence. Neuropsychopharmacology 2015; 40:1016-26. [PMID: 25345593 PMCID: PMC4330517 DOI: 10.1038/npp.2014.290] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/19/2014] [Revised: 08/18/2014] [Accepted: 08/26/2014] [Indexed: 12/20/2022]
Abstract
Single-nucleotide polymorphisms that have been associated with opioid dependence (OD) altogether account for only a small proportion of the known heritability. Most of the genetic risk factors are unknown. Some of the 'missing heritability' might be explained by copy number variations (CNVs) in the human genome. We used Illumina HumanOmni1 arrays to genotype 5152 African-American and European-American OD cases and screened controls and implemented combined CNV calling methods. After quality control measures were applied, a genome-wide association study (GWAS) of CNVs with OD was performed. For common CNVs, two deletions and one duplication were significantly associated with OD genome-wide (eg, P=2 × 10(-8) and OR (95% CI)=0.64 (0.54-0.74) for a chromosome 18q12.3 deletion). Several rare or unique CNVs showed suggestive or marginal significance with large effect sizes. This study is the first GWAS of OD using CNVs. Some identified CNVs harbor genes newly identified here to be of biological importance in addiction, whereas others affect genes previously known to contribute to substance dependence risk. Our findings augment our specific knowledge of the importance of genomic variation in addictive disorders, and provide an addiction CNV pool for further research. These findings require replication.
Collapse
Affiliation(s)
- Dawei Li
- Department of Psychiatry, School of Medicine, Yale University, New Haven, CT, USA
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, VT, USA
- Department of Computer Science, University of Vermont, Burlington, VT, USA
- Neuroscience, Behavior, and Health Initiative, University of Vermont, Burlington, VT, USA
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
- Department of Genetics, School of Medicine, Yale University, New Haven, CT, USA
| | - Henry R Kranzler
- Department of Psychiatry, University of Pennsylvania School of Medicine and VISN 4 MIRECC, Philadelphia VAMC, Philadelphia, PA, USA
| | - Ming D Li
- Department of Psychiatry and Neurobehavioral Sciences, University of Virginia, Charlottesville, VA, USA
| | - Kevin P Jensen
- Department of Psychiatry, School of Medicine, Yale University, New Haven, CT, USA
| | - Tetyana Zayats
- Department of Psychiatry, School of Medicine, Yale University, New Haven, CT, USA
| | - Lindsay A Farrer
- Departments of Medicine (Biomedical Genetics), Neurology, Ophthalmology, Genetics and Genomics, Biostatistics, and Epidemiology, Boston University Schools of Medicine and Public Health, Boston, MA, USA
| | - Joel Gelernter
- Department of Psychiatry, School of Medicine, Yale University, New Haven, CT, USA
- Department of Genetics, School of Medicine, Yale University, New Haven, CT, USA
- VA Connecticut Healthcare Center, Department of Neurobiology, Yale University School of Medicine, New Haven, CT, USA
| |
Collapse
|
24
|
Zarrei M, MacDonald JR, Merico D, Scherer SW. A copy number variation map of the human genome. Nat Rev Genet 2015; 16:172-83. [DOI: 10.1038/nrg3871] [Citation(s) in RCA: 565] [Impact Index Per Article: 62.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
25
|
Gamazon E, Cox N, Davis L. Structural architecture of SNP effects on complex traits. Am J Hum Genet 2014; 95:477-89. [PMID: 25307299 DOI: 10.1016/j.ajhg.2014.09.009] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2014] [Accepted: 09/16/2014] [Indexed: 12/16/2022] Open
Abstract
Despite the discovery of copy-number variation (CNV) across the genome nearly 10 years ago, current SNP-based analysis methodologies continue to collapse the homozygous (i.e., A/A), hemizygous (i.e., A/0), and duplicative (i.e., A/A/A) genotype states, treating the genotype variable as irreducible or unaltered by other colocalizing forms of genetic (e.g., structural) variation. Our understanding of common, genome-wide CNVs suggests that the canonical genotype construct might belie the enormous complexity of the genome. Here we present multiple analyses of several phenotypes and provide methods supporting a conceptual shift that embraces the structural dimension of genotype. We comprehensively investigate the impact of the structural dimension of genotype on (1) GWAS methods, (2) interpretation of rare LOF variants, (3) characterization of genomic architecture, and (4) implications for mapping loci involved in complex disease. Taken together, these results argue for the inclusion of a structural dimension and suggest that some portion of the "missing" heritability might be recovered through integration of the structural dimension of SNP effects on complex traits.
Collapse
|
26
|
Hughes TE, Langdale JA, Kelly S. The impact of widespread regulatory neofunctionalization on homeolog gene evolution following whole-genome duplication in maize. Genome Res 2014; 24:1348-55. [PMID: 24788921 PMCID: PMC4120087 DOI: 10.1101/gr.172684.114] [Citation(s) in RCA: 73] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Whole-genome duplications are a widespread feature of plant genome evolution, having been detected in all flowering plant lineages. Despite the prevalence of these events, the extent to which duplicated genes (homeolog gene pairs) functionally diverge (neofunctionalization) is unclear. We present a genome-wide analysis of molecular evolution and regulatory neofunctionalization in maize (Zea mays L.). We demonstrate that 13% of all homeolog gene pairs in maize are regulatory neofunctionalized in leaves, and that regulatory neofunctionalized genes experience enhanced purifying selection. We show that significantly more genes have been regulatory neofunctionalized in foliar leaves than in husk leaves and that both leaf types have experienced selection for distinct functional roles. Furthermore, we demonstrate that biased subgenome expression dominance occurs only in the presence of regulatory neofunctionalization and that in nonregulatory neofunctionalized genes subgenome dominance is progressively acquired during development. Taken together, our study reveals several novel insights into the evolution of maize, genes, and gene expression, and provides a general model for gene evolution following whole-genome duplication in plants.
Collapse
Affiliation(s)
- Thomas E Hughes
- Department of Plant Sciences, University of Oxford, Oxford, OX1 3RB, United Kingdom
| | - Jane A Langdale
- Department of Plant Sciences, University of Oxford, Oxford, OX1 3RB, United Kingdom
| | - Steven Kelly
- Department of Plant Sciences, University of Oxford, Oxford, OX1 3RB, United Kingdom
| |
Collapse
|
27
|
Zain SM, Mohamed R, Cooper DN, Razali R, Rampal S, Mahadeva S, Chan WK, Anwar A, Rosli NSM, Mahfudz AS, Cheah PL, Basu RC, Mohamed Z. Genome-wide analysis of copy number variation identifies candidate gene loci associated with the progression of non-alcoholic fatty liver disease. PLoS One 2014; 9:e95604. [PMID: 24743702 PMCID: PMC3990722 DOI: 10.1371/journal.pone.0095604] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Accepted: 03/27/2014] [Indexed: 02/06/2023] Open
Abstract
Between 10 and 25% of individuals with non-alcoholic fatty liver disease (NAFLD) develop hepatic fibrosis leading to cirrhosis and hepatocellular carcinoma (HCC). To investigate the molecular basis of disease progression, we performed a genome-wide analysis of copy number variation (CNV) in a total of 49 patients with NAFLD [10 simple steatosis and 39 non-alcoholic steatohepatitis (NASH)] and 49 matched controls using high-density comparative genomic hybridization (CGH) microarrays. A total of 11 CNVs were found to be unique to individuals with simple steatosis, whilst 22 were common between simple steatosis and NASH, and 224 were unique to NASH. We postulated that these CNVs could be involved in the pathogenesis of NAFLD progression. After stringent filtering, we identified four rare and/or novel CNVs that may influence the pathogenesis of NASH. Two of these CNVs, located at 13q12.11 and 12q13.2 respectively, harbour the exportin 4 (XPO4) and phosphodiesterase 1B (PDE1B) genes which are already known to be involved in the etiology of liver cirrhosis and HCC. Cross-comparison of the genes located at these four CNV loci with genes already known to be associated with NAFLD yielded a set of genes associated with shared biological processes including cell death, the key process involved in 'second hit' hepatic injury. To our knowledge, this pilot study is the first to provide CNV information of potential relevance to the NAFLD spectrum. These data could prove invaluable in predicting patients at risk of developing NAFLD and more importantly, those who will subsequently progress to NASH.
Collapse
Affiliation(s)
- Shamsul Mohd Zain
- The Pharmacogenomics Laboratory, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
- Department of Pharmacology, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
- * E-mail: (SMZ); (RM)
| | - Rosmawati Mohamed
- Department of Medicine, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
- * E-mail: (SMZ); (RM)
| | - David N. Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - Rozaimi Razali
- Sengenics Sdn Bhd, High Impact Reseach Building, University of Malaya, Kuala Lumpur, Malaysia
| | - Sanjay Rampal
- Julius Centre University of Malaya, Department of Social and Preventive Medicine, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Sanjiv Mahadeva
- Department of Medicine, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Wah-Kheong Chan
- Department of Medicine, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Arif Anwar
- Sengenics Sdn Bhd, High Impact Reseach Building, University of Malaya, Kuala Lumpur, Malaysia
| | | | - Anis Shafina Mahfudz
- Medical Imaging Unit, Faculty of Medicine, University of Technology MARA, Sungai Buloh Campus, Selangor, Malaysia
| | - Phaik-Leng Cheah
- Department of Pathology, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| | - Roma Choudhury Basu
- Clinical Investigation Centre, University Malaya Medical Centre, Kuala Lumpur, Malaysia
| | - Zahurin Mohamed
- The Pharmacogenomics Laboratory, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
- Department of Pharmacology, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
| |
Collapse
|
28
|
Abstract
The past fifty years have seen the development and application of numerous statistical methods to identify genomic regions that appear to be shaped by natural selection. These methods have been used to investigate the macro- and microevolution of a broad range of organisms, including humans. Here, we provide a comprehensive outline of these methods, explaining their conceptual motivations and statistical interpretations. We highlight areas of recent and future development in evolutionary genomics methods and discuss ongoing challenges for researchers employing such tests. In particular, we emphasize the importance of functional follow-up studies to characterize putative selected alleles and the use of selection scans as hypothesis-generating tools for investigating evolutionary histories.
Collapse
Affiliation(s)
- Joseph J Vitti
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138; ,
| | | | | |
Collapse
|
29
|
Longpre KM, Kinstlinger NS, Mead EA, Wang Y, Thekkumthala AP, Carreno KA, Hot A, Keefer JM, Tully L, Katz LS, Pietrzykowski AZ. Seasonal variation of urinary microRNA expression in male goats (Capra hircus) as assessed by next generation sequencing. Gen Comp Endocrinol 2014; 199:1-15. [PMID: 24457251 DOI: 10.1016/j.ygcen.2014.01.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2013] [Revised: 12/03/2013] [Accepted: 01/08/2014] [Indexed: 01/09/2023]
Abstract
Testosterone plays a key role in preparation of a male domesticated goat (Capra hircus) to breeding season including changes in the urogenital tract of a male goat (buck). microRNAs are important regulators of cellular metabolism, differentiation and function. They are powerful intermediaries of hormonal activity in the body, including the urogenital tract. We investigated seasonal changes in expression of microRNAs in goat buck urine and their potential consequences using next generation sequencing (microRNA-Seq). We determined the location of each microRNA gene in the goat genome. Testosterone was measured by radioimmunoassay and the androgen receptor binding sites (ARBS) in the promoters of the microRNA genes were determined by MatInspector. The overall impact of regulated microRNAs on cellular physiology was assessed by mirPath. We observed high testosterone levels during the breeding season and changes in the expression of forty microRNAs. Nineteen microRNAs were upregulated, while twenty-one were downregulated. We identified several ARBS in the promoters of regulated microRNAs. Notably, the mostly inhibited microRNA, miR-1246, has a unique set of several gene copy variants associated with a cluster of androgen receptor binding sites. Concomitant changes in regulated microRNA expression could promote transcription, proliferation and differentiation of urogenital tract cells. Together, these findings indicate that in a domesticated goat (Capra hircus), there are specific changes in the microRNA expression profile in buck urine during breeding season, which could be attributable to high testosterone levels during breeding, and could help in preparation of the urogenital tract for high metabolic demands of that season.
Collapse
Affiliation(s)
- Kristy M Longpre
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Noah S Kinstlinger
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Edward A Mead
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Yongping Wang
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Austin P Thekkumthala
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Katherine A Carreno
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Azra Hot
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Jennifer M Keefer
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Luke Tully
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Larry S Katz
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA
| | - Andrzej Z Pietrzykowski
- Rutgers University, Department of Animal Sciences, 67 Poultry Farm Lane, New Brunswick, NJ 08901, USA.
| |
Collapse
|
30
|
Woods S, Coghlan A, Rivers D, Warnecke T, Jeffries SJ, Kwon T, Rogers A, Hurst LD, Ahringer J. Duplication and retention biases of essential and non-essential genes revealed by systematic knockdown analyses. PLoS Genet 2013; 9:e1003330. [PMID: 23675306 PMCID: PMC3649981 DOI: 10.1371/journal.pgen.1003330] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2012] [Accepted: 01/04/2013] [Indexed: 11/19/2022] Open
Abstract
When a duplicate gene has no apparent loss-of-function phenotype, it is commonly considered that the phenotype has been masked as a result of functional redundancy with the remaining paralog. This is supported by indirect evidence showing that multi-copy genes show loss-of-function phenotypes less often than single-copy genes and by direct tests of phenotype masking using select gene sets. Here we take a systematic genome-wide RNA interference approach to assess phenotype masking in paralog pairs in the Caenorhabditis elegans genome. Remarkably, in contrast to expectations, we find that phenotype masking makes only a minor contribution to the low knockdown phenotype rate for duplicate genes. Instead, we find that non-essential genes are highly over-represented among duplicates, leading to a low observed loss-of-function phenotype rate. We further find that duplicate pairs derived from essential and non-essential genes have contrasting evolutionary dynamics: whereas non-essential genes are both more often successfully duplicated (fixed) and lost, essential genes are less often duplicated but upon successful duplication are maintained over longer periods. We expect the fundamental evolutionary duplication dynamics presented here to be broadly applicable. Duplicate genes occur in all organisms. It has been found that mutations in duplicate genes cause defects much less often than when single copy genes are mutated. It is widely believed that this is due to functional redundancy—that is, the two genes can carry out similar functions so that the non-mutated duplicate gene can cover for or “mask” the phenotype of the mutation in the first duplicate. To determine whether this hypothesis is true, it is necessary to test systematically whether defects indeed occur in the organism when both duplicate genes are inhibited. We have for the first time carried out such an analysis in a multicellular organism, the nematode Caenorhabditis elegans. In contrast to expectations, we observed that when both copies of duplicate genes are inhibited deleterious effects are very rare. We show that this is because duplicate genes are much more often non-essential compared to genes where there is only a single copy. Non-essential genes are also lost from the genome much more often than essential genes. However, when essential genes are duplicated, they remain present in the genome over longer periods. Our results give a framework to explain the evolutionary dynamics of duplications in the genome.
Collapse
Affiliation(s)
- Shane Woods
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Avril Coghlan
- Department of Microbiology, University College Cork, Cork, Ireland
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - David Rivers
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Tobias Warnecke
- Department of Biology and Biochemistry, University of Bath, Bath, Somerset, United Kingdom
| | - Sean J. Jeffries
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Taejoon Kwon
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Anthony Rogers
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Laurence D. Hurst
- Department of Biology and Biochemistry, University of Bath, Bath, Somerset, United Kingdom
- * E-mail: (LDH); (JA)
| | - Julie Ahringer
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- * E-mail: (LDH); (JA)
| |
Collapse
|
31
|
Poptsova M, Banerjee S, Gokcumen O, Rubin MA, Demichelis F. Impact of constitutional copy number variants on biological pathway evolution. BMC Evol Biol 2013; 13:19. [PMID: 23342974 PMCID: PMC3563492 DOI: 10.1186/1471-2148-13-19] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2012] [Accepted: 01/18/2013] [Indexed: 11/10/2022] Open
Abstract
Background Inherited Copy Number Variants (CNVs) can modulate the expression levels of individual genes. However, little is known about how CNVs alter biological pathways and how this varies across different populations. To trace potential evolutionary changes of well-described biological pathways, we jointly queried the genomes and the transcriptomes of a collection of individuals with Caucasian, Asian or Yoruban descent combining high-resolution array and sequencing data. Results We implemented an enrichment analysis of pathways accounting for CNVs and genes sizes and detected significant enrichment not only in signal transduction and extracellular biological processes, but also in metabolism pathways. Upon the estimation of CNV population differentiation (CNVs with different polymorphism frequencies across populations), we evaluated that 22% of the pathways contain at least one gene that is proximal to a CNV (CNV-gene pair) that shows significant population differentiation. The majority of these CNV-gene pairs belong to signal transduction pathways and 6% of the CNV-gene pairs show statistical association between the copy number states and the transcript levels. Conclusions The analysis suggested possible examples of positive selection within individual populations including NF-kB, MAPK signaling pathways, and Alu/L1 retrotransposition factors. Altogether, our results suggest that constitutional CNVs may modulate subtle pathway changes through specific pathway enzymes, which may become fixed in some populations.
Collapse
Affiliation(s)
- Maria Poptsova
- Department of Pathology and Laboratory Medicine, Weill Cornell Medical College, New York, NY, USA
| | | | | | | | | |
Collapse
|
32
|
Li Y, Mei S, Zhang X, Peng X, Liu G, Tao H, Wu H, Jiang S, Xiong Y, Li F. Identification of genome-wide copy number variations among diverse pig breeds by array CGH. BMC Genomics 2012; 13:725. [PMID: 23265576 PMCID: PMC3573951 DOI: 10.1186/1471-2164-13-725] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2012] [Accepted: 12/19/2012] [Indexed: 11/15/2022] Open
Abstract
Background Recent studies have shown that copy number variation (CNV) in mammalian genomes contributes to phenotypic diversity, including health and disease status. In domestic pigs, CNV has been catalogued by several reports, but the extent of CNV and the phenotypic effects are far from clear. The goal of this study was to identify CNV regions (CNVRs) in pigs based on array comparative genome hybridization (aCGH). Results Here a custom-made tiling oligo-nucleotide array was used with a median probe spacing of 2506 bp for screening 12 pigs including 3 Chinese native pigs (one Chinese Erhualian, one Tongcheng and one Yangxin pig), 5 European pigs (one Large White, one Pietrain, one White Duroc and two Landrace pigs), 2 synthetic pigs (Chinese new line DIV pigs) and 2 crossbred pigs (Landrace × DIV pigs) with a Duroc pig as the reference. Two hundred and fifty-nine CNVRs across chromosomes 1–18 and X were identified, with an average size of 65.07 kb and a median size of 98.74 kb, covering 16.85 Mb or 0.74% of the whole genome. Concerning copy number status, 93 (35.91%) CNVRs were called as gains, 140 (54.05%) were called as losses and the remaining 26 (10.04%) were called as both gains and losses. Of all detected CNVRs, 171 (66.02%) and 34 (13.13%) CNVRs directly overlapped with Sus scrofa duplicated sequences and pig QTLs, respectively. The CNVRs encompassed 372 full length Ensembl transcripts. Two CNVRs identified by aCGH were validated using real-time quantitative PCR (qPCR). Conclusions Using 720 K array CGH (aCGH) we described a map of porcine CNVs which facilitated the identification of structural variations for important phenotypes and the assessment of the genetic diversity of pigs.
Collapse
Affiliation(s)
- Yan Li
- Key Laboratory of Pig Genetics and Breeding of Ministry of Agriculture & Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan, 430070, PR China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Devalle S, Sartore RC, Paulsen BS, Borges HL, Martins RAP, Rehen SK. Implications of aneuploidy for stem cell biology and brain therapeutics. Front Cell Neurosci 2012; 6:36. [PMID: 22973193 PMCID: PMC3433681 DOI: 10.3389/fncel.2012.00036] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2012] [Accepted: 08/18/2012] [Indexed: 12/29/2022] Open
Abstract
Understanding the cellular basis of neurological disorders have advanced at a slow pace, especially due to the extreme invasiveness of brain biopsying and limitations of cell lines and animal models that have been used. Since the derivation of pluripotent stem cells (PSCs), a novel source of cells for regenerative medicine and disease modeling has become available, holding great potential for the neurology field. However, safety for therapy and accurateness for modeling have been a matter of intense debate, considering that genomic instability, including the gain and loss of chromosomes (aneuploidy), has been repeatedly observed in those cells. Despite the fact that recent reports have described some degree of aneuploidy as being normal during neuronal differentiation and present in healthy human brains, this phenomenon is particularly controversial since it has traditionally been associated with cancer and disabling syndromes. It is therefore necessary to appreciate, to which extent, aneuploid pluripotent stem cells are suitable for regenerative medicine and neurological modeling and also the limits that separate constitutive from disease-related aneuploidy. In this review, recent findings regarding chromosomal instability in PSCs and within the brain will be discussed.
Collapse
Affiliation(s)
- Sylvie Devalle
- National Laboratory for Embryonic Stem Cells, Institute of Biomedical Sciences, Federal University of Rio de Janeiro Rio de Janeiro, RJ, Brazil
| | | | | | | | | | | |
Collapse
|
34
|
Genomic structure and evolution of multigene families: "flowers" on the human genome. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:917678. [PMID: 22779033 PMCID: PMC3388347 DOI: 10.1155/2012/917678] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2011] [Revised: 04/06/2012] [Accepted: 04/09/2012] [Indexed: 11/17/2022]
Abstract
We report the results of an extensive investigation of genomic structures in the human genome, with a particular focus on relatively large repeats (>50 kb) in adjacent chromosomal regions. We named such structures “Flowers” because the pattern observed on dot plots resembles a flower. We detected a total of 291 Flowers in the human genome. They were predominantly located in euchromatic regions. Flowers are gene-rich compared to the average gene density of the genome. Genes involved in systems receiving environmental information, such as immunity and detoxification, were overrepresented in Flowers. Within a Flower, the mean number of duplication units was approximately four. The maximum and minimum identities between homologs in a Flower showed different distributions; the maximum identity was often concentrated to 100% identity, while the minimum identity was evenly distributed in the range of 78% to 100%. Using a gene conversion detection test, we found frequent and/or recent gene conversion events within the tested Flowers. Interestingly, many of those converted regions contained protein-coding genes. Computer simulation studies suggest that one role of such frequent gene conversions is the elongation of the life span of gene families in a Flower by the resurrection of pseudogenes.
Collapse
|
35
|
Boulding H, Webber C. Large-scale objective association of mouse phenotypes with human symptoms through structural variation identified in patients with developmental disorders. Hum Mutat 2012; 33:874-83. [PMID: 22396327 DOI: 10.1002/humu.22069] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2011] [Accepted: 02/21/2012] [Indexed: 11/06/2022]
Abstract
Copy number variants (CNVs) are thought to underlie many human developmental abnormalities. However, it is unclear how many of these CNVs exert their pathogenic effects or, in particular, how distinct CNVs at dispersed loci can give rise to the same abnormality. We hypothesize that the mouse orthologs of genes whose copy number change gives rise to the same human abnormality might also yield a similar phenotype when disrupted in mice. Thus, by bringing together a large number of disparate CNVs, we may be able to identify an unusually overrepresented phenotype among the affected genes' mouse orthologs. We obtained 1,624 de novo CNVs identified in patients with developmental abnormalities from Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources and European Cytogeneticists Association Register of Unbalanced Chromosome Aberrations database. Forming CNV sets for each of 1,088 distinct human abnormalities, we were able to associate a total of 143 (13%) human abnormalities with mouse model phenotypes. Although many mouse phenotypes are readily comparable to their associated human abnormality, others are less so, generating novel biological hypotheses. Of the 2,086 candidate genes that contribute to these associations, 65% have not been previously associated with human disease in Online Mendelian Inheritance in Man, and their distribution suggests both extensive pleiotropy and epistasis while also proposing a small number of simple additive consequences.
Collapse
Affiliation(s)
- Hannah Boulding
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | | |
Collapse
|
36
|
Clop A, Vidal O, Amills M. Copy number variation in the genomes of domestic animals. Anim Genet 2012; 43:503-17. [PMID: 22497594 DOI: 10.1111/j.1365-2052.2012.02317.x] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/28/2011] [Indexed: 12/28/2022]
Abstract
Copy number variation (CNV) might be one of the main contributors to phenotypic diversity and evolutionary adaptation in animals and plants, employing a wide variety of mechanisms, such as gene dosage and transcript structure alterations, to modulate organismal plasticity. In the past 4 years, considerable advances have been made in the characterization of the genomic architecture of CNV in domestic species. First, low-resolution CNV maps were produced for cattle, goat, sheep, pig, dog, chicken, duck and turkey, showing that these structural polymorphisms comprise a significant part of these genomes. Furthermore, CNVs have been associated with several pigmentation (white coat in horse, pig and sheep) and morphological (late feathering and pea comb in chicken) traits, as well as with susceptibility to a wide array of diseases and developmental disorders, for example osteopetrosis, anhidrotic ectodermal dysplasia, copper toxicosis, intersexuality, cone degeneration, periodic fever and dermoid sinus, among others. In the future, development of high-resolution tools for CNV detection and typing combined with the implementation of databases integrating CNV, QTL and gene expression data will be essential to identify and measure the impact of this source of structural variation on the many phenotypes that are relevant to animal breeders and veterinary practitioners.
Collapse
Affiliation(s)
- A Clop
- Department of Medical and Molecular Genetics, King's College London, Great Maze Pond, SE1 9RT, London, UK
| | | | | |
Collapse
|
37
|
Podder S, Ghosh TC. Evolutionary dynamics of human autoimmune disease genes and malfunctioned immunological genes. BMC Evol Biol 2012; 12:10. [PMID: 22276655 PMCID: PMC3347981 DOI: 10.1186/1471-2148-12-10] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2011] [Accepted: 01/25/2012] [Indexed: 02/01/2023] Open
Abstract
Background One of the main issues of molecular evolution is to divulge the principles in dictating the evolutionary rate differences among various gene classes. Immunological genes have received considerable attention in evolutionary biology as candidates for local adaptation and for studying functionally important polymorphisms. The normal structure and function of immunological genes will be distorted when they experience mutations leading to immunological dysfunctions. Results Here, we examined the fundamental differences between the genes which on mutation give rise to autoimmune or other immune system related diseases and the immunological genes that do not cause any disease phenotypes. Although the disease genes examined are analogous to non-disease genes in product, expression, function, and pathway affiliation, a statistically significant decrease in evolutionary rate has been found in autoimmune disease genes relative to all other immune related diseases and non-disease genes. Possible ways of accumulation of mutation in the three steps of the central dogma (DNA-mRNA-Protein) have been studied to trace the mutational effects predisposed to disease consequence and acquiring higher selection pressure. Principal Component Analysis and Multivariate Regression Analysis have established the predominant role of single nucleotide polymorphisms in guiding the evolutionary rate of immunological disease and non-disease genes followed by m-RNA abundance, paralogs number, fraction of phosphorylation residue, alternatively spliced exon, protein residue burial and protein disorder. Conclusions Our study provides an empirical insight into the etiology of autoimmune disease genes and other immunological diseases. The immediate utility of our study is to help in disease gene identification and may also help in medicinal improvement of immune related disease.
Collapse
|
38
|
Stothard P, Choi JW, Basu U, Sumner-Thomson JM, Meng Y, Liao X, Moore SS. Whole genome resequencing of black Angus and Holstein cattle for SNP and CNV discovery. BMC Genomics 2011; 12:559. [PMID: 22085807 PMCID: PMC3229636 DOI: 10.1186/1471-2164-12-559] [Citation(s) in RCA: 123] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2011] [Accepted: 11/15/2011] [Indexed: 01/27/2023] Open
Abstract
Background One of the goals of livestock genomics research is to identify the genetic differences responsible for variation in phenotypic traits, particularly those of economic importance. Characterizing the genetic variation in livestock species is an important step towards linking genes or genomic regions with phenotypes. The completion of the bovine genome sequence and recent advances in DNA sequencing technology allow for in-depth characterization of the genetic variations present in cattle. Here we describe the whole-genome resequencing of two Bos taurus bulls from distinct breeds for the purpose of identifying and annotating novel forms of genetic variation in cattle. Results The genomes of a Black Angus bull and a Holstein bull were sequenced to 22-fold and 19-fold coverage, respectively, using the ABI SOLiD system. Comparisons of the sequences with the Btau4.0 reference assembly yielded 7 million single nucleotide polymorphisms (SNPs), 24% of which were identified in both animals. Of the total SNPs found in Holstein, Black Angus, and in both animals, 81%, 81%, and 75% respectively are novel. In-depth annotations of the data identified more than 16 thousand distinct non-synonymous SNPs (85% novel) between the two datasets. Alignments between the SNP-altered proteins and orthologues from numerous species indicate that many of the SNPs alter well-conserved amino acids. Several SNPs predicted to create or remove stop codons were also found. A comparison between the sequencing SNPs and genotyping results from the BovineHD high-density genotyping chip indicates a detection rate of 91% for homozygous SNPs and 81% for heterozygous SNPs. The false positive rate is estimated to be about 2% for both the Black Angus and Holstein SNP sets, based on follow-up genotyping of 422 and 427 SNPs, respectively. Comparisons of read depth between the two bulls along the reference assembly identified 790 putative copy-number variations (CNVs). Ten randomly selected CNVs, five genic and five non-genic, were successfully validated using quantitative real-time PCR. The CNVs are enriched for immune system genes and include genes that may contribute to lactation capacity. The majority of the CNVs (69%) were detected as regions with higher abundance in the Holstein bull. Conclusions Substantial genetic differences exist between the Black Angus and Holstein animals sequenced in this work and the Hereford reference sequence, and some of this variation is predicted to affect evolutionarily conserved amino acids or gene copy number. The deeply annotated SNPs and CNVs identified in this resequencing study can serve as useful genetic tools, and as candidates in searches for phenotype-altering DNA differences.
Collapse
Affiliation(s)
- Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB T6G 2P5, Canada
| | | | | | | | | | | | | |
Collapse
|
39
|
Cardoso-Moreira M, Emerson JJ, Clark AG, Long M. Drosophila duplication hotspots are associated with late-replicating regions of the genome. PLoS Genet 2011; 7:e1002340. [PMID: 22072977 PMCID: PMC3207856 DOI: 10.1371/journal.pgen.1002340] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2011] [Accepted: 08/26/2011] [Indexed: 12/22/2022] Open
Abstract
Duplications play a significant role in both extremes of the phenotypic spectrum of newly arising mutations: they can have severe deleterious effects (e.g. duplications underlie a variety of diseases) but can also be highly advantageous. The phenotypic potential of newly arisen duplications has stimulated wide interest in both the mutational and selective processes shaping these variants in the genome. Here we take advantage of the Drosophila simulans–Drosophila melanogaster genetic system to further our understanding of both processes. Regarding mutational processes, the study of two closely related species allows investigation of the potential existence of shared duplication hotspots, and the similarities and differences between the two genomes can be used to dissect its underlying causes. Regarding selection, the difference in the effective population size between the two species can be leveraged to ask questions about the strength of selection acting on different classes of duplications. In this study, we conducted a survey of duplication polymorphisms in 14 different lines of D. simulans using tiling microarrays and combined it with an analogous survey for the D. melanogaster genome. By integrating the two datasets, we identified duplication hotspots conserved between the two species. However, unlike the duplication hotspots identified in mammalian genomes, Drosophila duplication hotspots are not associated with sequences of high sequence identity capable of mediating non-allelic homologous recombination. Instead, Drosophila duplication hotspots are associated with late-replicating regions of the genome, suggesting a link between DNA replication and duplication rates. We also found evidence supporting a higher effectiveness of selection on duplications in D. simulans than in D. melanogaster. This is also true for duplications segregating at high frequency, where we find evidence in D. simulans that a sizeable fraction of these mutations is being driven to fixation by positive selection. DNA duplications are important contributors to the phenotypic differences observed between individuals. These mutations can disrupt the normal functioning of genes and so are often associated with disease. But because they can add genetic information they can also lead to evolutionary change. Understanding how selection and non-random mutation processes shape the distribution of duplications throughout the genome is important to elucidate both the medical and evolutionary impacts of these mutations. Here, we examined the roles of selection and mutation in shaping patterns of duplication polymorphisms across the genomes of the fruit fly Drosophila melanogaster and its sister species, D. simulans. We found that selection is pervasive in both genomes but is more efficient in D. simulans than in D. melanogaster. We also found that these two species have shared duplication hotspots, i.e. orthologous regions experiencing high rates of duplication in the two genomes. After excluding the hypothesis that Drosophila duplication hotspots are associated with regions of the genome rich in segmental duplications (as observed for mammalian genomes), we show that they are associated with late-replicating regions of the genome. Our work therefore proposes a link between DNA replication and rates of duplication across the genome.
Collapse
|
40
|
Ponting CP, Nellåker C, Meader S. Rapid turnover of functional sequence in human and other genomes. Annu Rev Genomics Hum Genet 2011; 12:275-99. [PMID: 21721940 DOI: 10.1146/annurev-genom-090810-183115] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The amount of a genome's sequence that is functional has been surprisingly difficult to estimate accurately. This has severely hindered analyses asking whether the amount of functional genomic sequence correlates with organismal complexity. Most studies estimate these amounts by considering nucleotide substitution rates within aligned sequences. These approaches show reduced power to identify sequence that is aligned, functional, and constrained only within narrowly defined phyla. The neutral indel model exploits insertions or deletions (indels) rather than substitutions in predicting functional sequence. Surprisingly, this method indicates that half of all functional sequence is specific to individual eutherian lineages. This review considers the rates at which coding or noncoding and functional or nonfunctional sequence changes among mammalian genomes. In contrast to the slow rate at which protein-coding sequence changes, functional noncoding sequence appears to change or be turned over at rapid rates in mammals.
Collapse
Affiliation(s)
- Chris P Ponting
- Medical Research Council Functional Genomics Unit, Department of Physiology, Anatomy, and Genetics, University of Oxford, Oxford OX1 3QX, United Kingdom.
| | | | | |
Collapse
|
41
|
Webber C. Functional enrichment analysis with structural variants: pitfalls and strategies. Cytogenet Genome Res 2011; 135:277-85. [PMID: 21997137 DOI: 10.1159/000331670] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Interpreting the phenotypic consequences of human structural variation remains challenging. Functional enrichment analysis, which can identify functional enrichments among genes affected by structural variants, is providing significant biological insights into the genotype-phenotype relationship. In this review, we discuss the different approaches and choices in the application of this technique to human structural variation. We consider the importance of choosing the right background distribution for detection, the significance of the gene selection criteria, the effects of tissue-specific gene length biases and discuss sources of functional annotations with a focus on Gene Ontology and mouse phenotypic resources. Throughout this review, we highlight potential sources of significant bias that are of particular concern to the analysis of structural variants, and illustrate the importance of examining the expectations upon which enrichment analysis techniques depend.
Collapse
Affiliation(s)
- C Webber
- Department of Physiology, Anatomy and Genetics, MRC Functional Genomics Unit, University of Oxford, Oxford, UK.
| |
Collapse
|
42
|
Chen W, Hayward C, Wright AF, Hicks AA, Vitart V, Knott S, Wild SH, Pramstaller PP, Wilson JF, Rudan I, Porteous DJ. Copy number variation across European populations. PLoS One 2011; 6:e23087. [PMID: 21829696 PMCID: PMC3150386 DOI: 10.1371/journal.pone.0023087] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2011] [Accepted: 07/12/2011] [Indexed: 12/13/2022] Open
Abstract
Genome analysis provides a powerful approach to test for evidence of genetic variation within and between geographical regions and local populations. Copy number variants which comprise insertions, deletions and duplications of genomic sequence provide one such convenient and informative source. Here, we investigate copy number variants from genome wide scans of single nucleotide polymorphisms in three European population isolates, the island of Vis in Croatia, the islands of Orkney in Scotland and the South Tyrol in Italy. We show that whereas the overall copy number variant frequencies are similar between populations, their distribution is highly specific to the population of origin, a finding which is supported by evidence for increased kinship correlation for specific copy number variants within populations.
Collapse
Affiliation(s)
- Wanting Chen
- Medical Genetics Section, Centre for Molecular Medicine, Institute of Genetics & Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road South, Edinburgh, United Kingdom
| | - Caroline Hayward
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, United Kingdom
| | - Alan F. Wright
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, United Kingdom
| | - Andrew A. Hicks
- Institute of Genetic Medicine, European Academy Bozen/Bolzano (EURAC), Bolzano/Bozen, Italy - Affiliated Institute of the University of Lübeck, Lübeck, Germany
| | - Veronique Vitart
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, United Kingdom
| | - Sara Knott
- Institute of Evolutionary Biology, University of Edinburgh, Ashworth Laboratories, King's Buildings, Edinburgh, United Kingdom
| | - Sarah H. Wild
- Centre for Population Health Sciences, The University of Edinburgh Medical School, Edinburgh, United Kingdom
| | - Peter P. Pramstaller
- Institute of Genetic Medicine, European Academy Bozen/Bolzano (EURAC), Bolzano/Bozen, Italy - Affiliated Institute of the University of Lübeck, Lübeck, Germany
- Department of Neurology, General Central Hospital, Bolzano, Italy
- Department of Neurology, University of Lübeck, Lübeck, Germany
| | - James F. Wilson
- Centre for Population Health Sciences, The University of Edinburgh Medical School, Edinburgh, United Kingdom
| | - Igor Rudan
- Centre for Population Health Sciences, The University of Edinburgh Medical School, Edinburgh, United Kingdom
- Croatian Centre for Global Health, University of Split Medical School, Split, Croatia
| | - David J. Porteous
- Medical Genetics Section, Centre for Molecular Medicine, Institute of Genetics & Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road South, Edinburgh, United Kingdom
- * E-mail:
| |
Collapse
|
43
|
Johansson ACV, Feuk L. Characterization of copy number-stable regions in the human genome. Hum Mutat 2011; 32:947-55. [PMID: 21542059 DOI: 10.1002/humu.21524] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2011] [Accepted: 04/20/2011] [Indexed: 01/25/2023]
Abstract
In the past few years the number of copy number variants (CNVs) identified in the human genome has increased significantly, but our understanding of the functional impact of CNVs is still limited. Clinically significant variations cannot easily be distinguished from benign, complicating interpretation of patient data. Multiple studies have focused on analysis of regions that vary in copy number in specific disorders. Here we use the opposite strategy and focus our analysis on regions that never seem to vary in the general population, hypothesizing that these are copy number stable because variations within them are deleterious. Our results show that copy number stable regions are characterized by correlation with a number of genomic features, allowing us to define a list of genomic regions that are dosage sensitive in humans. We find that these dosage-sensitive regions show significant overlap with de novo CNVs identified in patients with intellectual disability or autism. There is also a significant association between copy number stable regions and rare inherited variants in autism patients, but not in controls. Based on this predictive power, we propose that copy number stable regions can be used to complement maps of known CNVs to facilitate interpretation of patient data.
Collapse
Affiliation(s)
- Anna C V Johansson
- Department of Immunology, Genetics and Pathology, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden
| | | |
Collapse
|
44
|
Cooper DN, Kehrer-Sawatzki H. Exploring the potential relevance of human-specific genes to complex disease. Hum Genomics 2011; 5:99-107. [PMID: 21296743 PMCID: PMC3525227 DOI: 10.1186/1479-7364-5-2-99] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Although human disease genes generally tend to be evolutionarily more ancient than non-disease genes, complex disease genes appear to be represented more frequently than Mendelian disease genes among genes of more recent evolutionary origin. It is therefore proposed that the analysis of human-specific genes might provide new insights into the genetics of complex disease. Cross-comparison with the Human Gene Mutation Database (http://www.hgmd.org) revealed a number of examples of disease-causing and disease-associated mutations in putatively human-specific genes. A sizeable proportion of these were missense polymorphisms associated with complex disease. Since both human-specific genes and genes associated with complex disease have often experienced particularly rapid rates of evolutionary change, either due to weaker purifying selection or positive selection, it is proposed that a significant number of human-specific genes may play a role in complex disease.
Collapse
Affiliation(s)
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK.
| | | |
Collapse
|
45
|
Hardwick RJ, Machado LR, Zuccherato LW, Antolinos S, Xue Y, Shawa N, Gilman RH, Cabrera L, Berg DE, Tyler-Smith C, Kelly P, Tarazona-Santos E, Hollox EJ. A worldwide analysis of beta-defensin copy number variation suggests recent selection of a high-expressing DEFB103 gene copy in East Asia. Hum Mutat 2011; 32:743-50. [PMID: 21387465 PMCID: PMC3263423 DOI: 10.1002/humu.21491] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2010] [Accepted: 02/10/2011] [Indexed: 11/21/2022]
Abstract
Beta-defensins are a family of multifunctional genes with roles in defense against pathogens, reproduction, and pigmentation. In humans, six beta-defensin genes are clustered in a repeated region which is copy-number variable (CNV) as a block, with a diploid copy number between 1 and 12. The role in host defense makes the evolutionary history of this CNV particularly interesting, because morbidity due to infectious disease is likely to have been an important selective force in human evolution, and to have varied between geographical locations. Here, we show CNV of the beta-defensin region in chimpanzees, and identify a beta-defensin block in the human lineage that contains rapidly evolving noncoding regulatory sequences. We also show that variation at one of these rapidly evolving sequences affects expression levels and cytokine responsiveness of DEFB103, a key inhibitor of influenza virus fusion at the cell surface. A worldwide analysis of beta-defensin CNV in 67 populations shows an unusually high frequency of high-DEFB103-expressing copies in East Asia, the geographical origin of historical and modern influenza epidemics, possibly as a result of selection for increased resistance to influenza in this region. Hum Mutat 32:743–750, 2011. © 2011 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Robert J Hardwick
- Department of Genetics, University of Leicester, University Road, Leicester, United Kingdom
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Felekkis K, Voskarides K, Dweep H, Sticht C, Gretz N, Deltas C. Increased Number of MicroRNA Target Sites in Genes Encoded in CNV Regions. Evidence for an Evolutionary Genomic Interaction. Mol Biol Evol 2011; 28:2421-4. [DOI: 10.1093/molbev/msr078] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
|
47
|
Harris EE. Nonadaptive processes in primate and human evolution. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2011; 143 Suppl 51:13-45. [PMID: 21086525 DOI: 10.1002/ajpa.21439] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Evolutionary biology has tended to focus on adaptive evolution by positive selection as the primum mobile of evolutionary trajectories in species while underestimating the importance of nonadaptive evolutionary processes. In this review, I describe evidence that suggests that primate and human evolution has been strongly influenced by nonadaptive processes, particularly random genetic drift and mutation. This is evidenced by three fundamental effects: a relative relaxation of selective constraints (i.e., purifying selection), a relative increase in the fixation of slightly deleterious mutations, and a general reduction in the efficacy of positive selection. These effects are observed in protein-coding, regulatory regions, and in gene expression data, as well as in an augmentation of fixation of large-scale mutations, including duplicated genes, mobile genetic elements, and nuclear mitochondrial DNA. The evidence suggests a general population-level explanation such as a reduction in effective population size (N(e)). This would have tipped the balance between the evolutionary forces of natural selection and random genetic drift toward genetic drift for variants having small selective effects. After describing these proximate effects, I describe the potential consequences of these effects for primate and human evolution. For example, an increase in the fixation of slightly deleterious mutations could potentially have led to an increase in the fixation rate of compensatory mutations that act to suppress the effects of slightly deleterious substitutions. The potential consequences of compensatory evolution for the evolution of novel gene functions and in potentially confounding the detection of positively selected genes are explored. The consequences of the passive accumulation of large-scale genomic mutations by genetic drift are unclear, though evidence suggests that new gene copies as well as insertions of transposable elements into genes can potentially lead to adaptive phenotypes. Finally, because a decrease in selective constraint at the genetic level is expected to have effects at the morphological level, I review studies that compare rates of morphological change in various mammalian and island populations where N(e) is reduced. Furthermore, I discuss evidence that suggests that craniofacial morphology in the Homo lineage has shifted from an evolutionary rate constrained by purifying selection toward a neutral evolutionary rate.
Collapse
Affiliation(s)
- Eugene E Harris
- Department of Biological Sciences and Geology, Queensborough Community College, City University of New York, Bayside, NY 10364, USA.
| |
Collapse
|
48
|
Shaikh TH, Haldeman-Englert C, Geiger EA, Ponting CP, Webber C. Genes and biological processes commonly disrupted in rare and heterogeneous developmental delay syndromes. Hum Mol Genet 2010; 20:880-93. [PMID: 21147756 DOI: 10.1093/hmg/ddq527] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Rare copy number variations (CNVs) are a recognized cause of common human disease. Predicting the genetic element(s) within a small CNV whose copy number loss or gain underlies a specific phenotype might be achieved reasonably rapidly for single patients. Identifying the biological processes that are commonly disrupted within a large patient cohort which possess larger CNVs, however, requires a more objective approach that exploits genomic resources. In this study, we first identified 98 large, rare CNVs within patients exhibiting multiple congenital anomalies. All patients presented with global developmental delay (DD), while other secondary symptoms such as cardiac defects, craniofacial features and seizures were varyingly presented. By applying a robust statistical procedure that matches patients' clinical phenotypes to laboratory mouse gene knockouts, we were able to strongly implicate anomalies in brain morphology and, separately, in long-term potentiation as manifestations of these DD patients' disorders. These and other significantly enriched model phenotypes provide insights into the pathoetiology of human DD and behavioral and anatomical secondary symptoms that are specific to DD patients. These enrichments set apart 103 genes, from among thousands overlapped by these CNVs, as strong candidates whose copy number change causally underlies approximately 46% of the cohort's DD syndromes and between 59 and 80% of the cohort's secondary symptoms. We also identified significantly enriched model phenotypes among genes overlapped by CNVs in both DD and learning disability cohorts, indicating a congruent etiology. These results demonstrate the high predictive potential of model organism phenotypes when implicating candidate genes for rare genomic disorders.
Collapse
Affiliation(s)
- Tamim H Shaikh
- Division of Human Genetics, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA.
| | | | | | | | | |
Collapse
|
49
|
Fontanesi L, Martelli PL, Beretti F, Riggio V, Dall'Olio S, Colombo M, Casadio R, Russo V, Portolano B. An initial comparative map of copy number variations in the goat (Capra hircus) genome. BMC Genomics 2010; 11:639. [PMID: 21083884 PMCID: PMC3011854 DOI: 10.1186/1471-2164-11-639] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2010] [Accepted: 11/17/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The goat (Capra hircus) represents one of the most important farm animal species. It is reared in all continents with an estimated world population of about 800 million of animals. Despite its importance, studies on the goat genome are still in their infancy compared to those in other farm animal species. Comparative mapping between cattle and goat showed only a few rearrangements in agreement with the similarity of chromosome banding. We carried out a cross species cattle-goat array comparative genome hybridization (aCGH) experiment in order to identify copy number variations (CNVs) in the goat genome analysing animals of different breeds (Saanen, Camosciata delle Alpi, Girgentana, and Murciano-Granadina) using a tiling oligonucleotide array with ~385,000 probes designed on the bovine genome. RESULTS We identified a total of 161 CNVs (an average of 17.9 CNVs per goat), with the largest number in the Saanen breed and the lowest in the Camosciata delle Alpi goat. By aggregating overlapping CNVs identified in different animals we determined CNV regions (CNVRs): on the whole, we identified 127 CNVRs covering about 11.47 Mb of the virtual goat genome referred to the bovine genome (0.435% of the latter genome). These 127 CNVRs included 86 loss and 41 gain and ranged from about 24 kb to about 1.07 Mb with a mean and median equal to 90,292 bp and 49,530 bp, respectively. To evaluate whether the identified goat CNVRs overlap with those reported in the cattle genome, we compared our results with those obtained in four independent cattle experiments. Overlapping between goat and cattle CNVRs was highly significant (P < 0.0001) suggesting that several chromosome regions might contain recurrent interspecies CNVRs. Genes with environmental functions were over-represented in goat CNVRs as reported in other mammals. CONCLUSIONS We describe a first map of goat CNVRs. This provides information on a comparative basis with the cattle genome by identifying putative recurrent interspecies CNVs between these two ruminant species. Several goat CNVs affect genes with important biological functions. Further studies are needed to evaluate the functional relevance of these CNVs and their effects on behavior, production, and disease resistance traits in goats.
Collapse
Affiliation(s)
- Luca Fontanesi
- DIPROVAL, Sezione di Allevamenti Zootecnici, University of Bologna, Via F.lli Rosselli 107, 42123 Reggio Emilia, Italy.
| | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Bassett AS, Scherer SW, Brzustowicz LM. Copy number variations in schizophrenia: critical review and new perspectives on concepts of genetics and disease. Am J Psychiatry 2010; 167:899-914. [PMID: 20439386 PMCID: PMC3295834 DOI: 10.1176/appi.ajp.2009.09071016] [Citation(s) in RCA: 144] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
OBJECTIVE Structural variations of DNA, such as copy number variations (CNVs), are recognized to contribute both to normal genomic variability and to risk for human diseases. For example, schizophrenia has an established connection with 22q11.2 deletions. Recent genome-wide studies have provided initial evidence that CNVs at other loci may also be associated with schizophrenia. In this article, the authors provide a brief overview of CNVs, review recent findings related to schizophrenia, outline implications for clinical practice and diagnostic subtyping, and make recommendations for future reports on CNVs to improve interpretation of results. METHOD The review included genome-wide surveys of CNVs in schizophrenia that included one or more comparison groups, were published before 2009, and used newer methods. Six studies were identified. RESULTS Despite some limitations, these initial genome-wide studies of CNVs provide replicated associations of schizophrenia with rare 1q21.1 and 15q13.3 deletions. Collectively, the results point to a more general mutational mechanism involving rare CNVs that elevate risk for schizophrenia, especially more developmental forms of the disease. Including 22q11.2 deletions, rare risk-associated CNVs appear to account for up to 2% of schizophrenia. CONCLUSIONS The more penetrant CNVs have direct implications for clinical practice and diagnostic subtyping. CNVs with lower penetrance promise to contribute to our genetic understanding of pathogenesis. The findings provide insight into a broader neuropsychiatric spectrum for schizophrenia than previously conceived and indicate new directions for genetic studies.
Collapse
|