101
|
Liang KH, Lin CL, Hsu CW, Lai MW, Chien RN, Yeh CT. UGT2B28 genomic variation is associated with hepatitis B e-antigen seroconversion in response to antiviral therapy. Sci Rep 2016; 6:34088. [PMID: 27665939 PMCID: PMC5036189 DOI: 10.1038/srep34088] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Accepted: 09/05/2016] [Indexed: 02/07/2023] Open
Abstract
Seroconversion of hepatitis B virus (HBV) e-antigen (HBeAg) is a critical but often-missed therapeutic goal in standard antiviral treatments. An extreme-phenotype genome-wide association study was performed, comparing untreated spontaneous recoverers (with seroconversion of HBV surface antigen) versus entecavir-treated patients failing to achieve HBeAg seroconversion. A single-nucleotide-polymorphism rs2132039 on the UGT2B28 gene, alongside an adjacent copy number polymorphism (CNP605), manifested the strongest clinical associations (P = 3.4 × 10-8 and 0.001, respectively). Multivariate analysis showed that rs2132039-TT genotypes, but not CNP605 copy numbers, remained associated to spontaneous recoverers (P = 0.009). The clinical association of rs2132039 was validated successfully in an independent cohort (n = 302; P = 0.002). Longitudinal case-only analyses revealed that the rs2132039-TT genotype predicted shorter time-to-HBeAg-seroconversion in all antiviral-treated patients (n = 380, P = 0.012), as well as the peginterferon-treated subgroup (n = 123; P = 0.024, Hazard ratio [HR] = 2.104, Confidence interval [CI] = 1.105-4.007). In the entecavir-treated subgroup, the predictive effect was restricted by pretreatment alanine aminotransferase (ALT) and aspartate aminotransferase (AST) levels, with effective prediction observed in patients with ALT < 200 IU/ml and ALT/AST ratio <2 (n = 132; P = 0.013, HR = 10.538, CI = 1.420-78.196).
Collapse
Affiliation(s)
- Kung-Hao Liang
- Liver Research Center, Chang Gung Memorial Hospital, Taoyuan, Taiwan
- Molecular Medicine Research Center, Chang Gung University, Taoyuan, Taiwan
| | - Chih-Lang Lin
- Liver Research Center, Chang Gung Memorial Hospital, Taoyuan, Taiwan
- Liver Research Unit, Keelung Chang Gung Memorial Hospital, Keelung, Taiwan
- Chang Gung University, College of Medicine, Taiwan
| | - Chao-Wei Hsu
- Liver Research Center, Chang Gung Memorial Hospital, Taoyuan, Taiwan
- Chang Gung University, College of Medicine, Taiwan
| | - Ming-Wei Lai
- Liver Research Center, Chang Gung Memorial Hospital, Taoyuan, Taiwan
- Chang Gung University, College of Medicine, Taiwan
| | - Rong-Nan Chien
- Liver Research Unit, Keelung Chang Gung Memorial Hospital, Keelung, Taiwan
| | - Chau-Ting Yeh
- Liver Research Center, Chang Gung Memorial Hospital, Taoyuan, Taiwan
- Molecular Medicine Research Center, Chang Gung University, Taoyuan, Taiwan
- Chang Gung University, College of Medicine, Taiwan
| |
Collapse
|
102
|
Morgan AP, Holt JM, McMullan RC, Bell TA, Clayshulte AMF, Didion JP, Yadgary L, Thybert D, Odom DT, Flicek P, McMillan L, de Villena FPM. The Evolutionary Fates of a Large Segmental Duplication in Mouse. Genetics 2016; 204:267-85. [PMID: 27371833 PMCID: PMC5012392 DOI: 10.1534/genetics.116.191007] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2016] [Accepted: 06/27/2016] [Indexed: 01/21/2023] Open
Abstract
Gene duplication and loss are major sources of genetic polymorphism in populations, and are important forces shaping the evolution of genome content and organization. We have reconstructed the origin and history of a 127-kbp segmental duplication, R2d, in the house mouse (Mus musculus). R2d contains a single protein-coding gene, Cwc22 De novo assembly of both the ancestral (R2d1) and the derived (R2d2) copies reveals that they have been subject to nonallelic gene conversion events spanning tens of kilobases. R2d2 is also a hotspot for structural variation: its diploid copy number ranges from zero in the mouse reference genome to >80 in wild mice sampled from around the globe. Hemizygosity for high copy-number alleles of R2d2 is associated in cis with meiotic drive; suppression of meiotic crossovers; and copy-number instability, with a mutation rate in excess of 1 per 100 transmissions in some laboratory populations. Our results provide a striking example of allelic diversity generated by duplication and demonstrate the value of de novo assembly in a phylogenetic context for understanding the mutational processes affecting duplicate genes.
Collapse
Affiliation(s)
- Andrew P Morgan
- Department of Genetics and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| | - J Matthew Holt
- Department of Computer Science, University of North Carolina, Chapel Hill, North Carolina 27599
| | - Rachel C McMullan
- Department of Genetics and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| | - Timothy A Bell
- Department of Genetics and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| | - Amelia M-F Clayshulte
- Department of Genetics and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| | - John P Didion
- Department of Genetics and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| | - Liran Yadgary
- Department of Genetics and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| | - David Thybert
- European Bioinformatics Institute, European Molecular Biology Laboratory, Wellcome Genome Campus, Cambridge, CB10 1SD, United Kingdom
| | - Duncan T Odom
- Cancer Research United Kingdom Cambridge Institute, University of Cambridge, CB2 0RE, United Kingdom Wellcome Trust Sanger Institute, Wellcome Genome Campus, Cambridge, CB10 1SA, United Kingdom
| | - Paul Flicek
- European Bioinformatics Institute, European Molecular Biology Laboratory, Wellcome Genome Campus, Cambridge, CB10 1SD, United Kingdom Wellcome Trust Sanger Institute, Wellcome Genome Campus, Cambridge, CB10 1SA, United Kingdom
| | - Leonard McMillan
- Department of Computer Science, University of North Carolina, Chapel Hill, North Carolina 27599
| | - Fernando Pardo-Manuel de Villena
- Department of Genetics and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| |
Collapse
|
103
|
Rebuzzini P, Zuccotti M, Redi CA, Garagna S. Achilles' heel of pluripotent stem cells: genetic, genomic and epigenetic variations during prolonged culture. Cell Mol Life Sci 2016; 73:2453-66. [PMID: 26961132 PMCID: PMC11108315 DOI: 10.1007/s00018-016-2171-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Revised: 01/28/2016] [Accepted: 02/25/2016] [Indexed: 12/12/2022]
Abstract
Pluripotent stem cells differentiate into almost any specialized adult cell type of an organism. PSCs can be derived either from the inner cell mass of a blastocyst-giving rise to embryonic stem cells-or after reprogramming of somatic terminally differentiated cells to obtain ES-like cells, named induced pluripotent stem cells. The potential use of these cells in the clinic, for investigating in vitro early embryonic development or for screening the effects of new drugs or xenobiotics, depends on capability to maintain their genome integrity during prolonged culture and differentiation. Both human and mouse PSCs are prone to genomic and (epi)genetic instability during in vitro culture, a feature that seriously limits their real potential use. Culture-induced variations of specific chromosomes or genes, are almost all unpredictable and, as a whole, differ among independent cell lines. They may arise at different culture passages, suggesting the absence of a safe passage number maintaining genome integrity and rendering the control of genomic stability mandatory since the very early culture passages. The present review highlights the urgency for further studies on the mechanisms involved in determining (epi)genetic and chromosome instability, exploiting the knowledge acquired earlier on other cell types.
Collapse
Affiliation(s)
- Paola Rebuzzini
- Laboratorio di Biologia dello Sviluppo, Dipartimento di Biologia e Biotecnologie 'Lazzaro Spallanzani', Università degli Studi di Pavia, Via Ferrata 9, 27100, Pavia, Italy.
- Center for Health Technologies (C.H.T.), Università degli Studi di Pavia, Via Ferrata 1, Pavia, Italy.
| | - Maurizio Zuccotti
- Unita' di Anatomia, Istologia ed Embriologia, Dipartimento di Scienze Biomediche, Biotecnologiche e Traslazionali (S.BI.BI.T.), Università degli Studi di Parma, Via Volturno 39, 43100, Parma, Italy.
| | - Carlo Alberto Redi
- Laboratorio di Biologia dello Sviluppo, Dipartimento di Biologia e Biotecnologie 'Lazzaro Spallanzani', Università degli Studi di Pavia, Via Ferrata 9, 27100, Pavia, Italy
- Center for Health Technologies (C.H.T.), Università degli Studi di Pavia, Via Ferrata 1, Pavia, Italy
- Fondazione I.R.C.C.S. Policlinico San Matteo, Piazzale Golgi, 19, 27100, Pavia, Italy
| | - Silvia Garagna
- Laboratorio di Biologia dello Sviluppo, Dipartimento di Biologia e Biotecnologie 'Lazzaro Spallanzani', Università degli Studi di Pavia, Via Ferrata 9, 27100, Pavia, Italy.
- Center for Health Technologies (C.H.T.), Università degli Studi di Pavia, Via Ferrata 1, Pavia, Italy.
| |
Collapse
|
104
|
Bailey J. Monkey-based research on human disease: the implications of genetic differences. Altern Lab Anim 2016; 42:287-317. [PMID: 25413291 DOI: 10.1177/026119291404200504] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Assertions that the use of monkeys to investigate human diseases is valid scientifically are frequently based on a reported 90-93% genetic similarity between the species. Critical analyses of the relevance of monkey studies to human biology, however, indicate that this genetic similarity does not result in sufficient physiological similarity for monkeys to constitute good models for research, and that monkey data do not translate well to progress in clinical practice for humans. Salient examples include the failure of new drugs in clinical trials, the highly different infectivity and pathology of SIV/HIV, and poor extrapolation of research on Alzheimer's disease, Parkinson's disease and stroke. The major molecular differences underlying these inter-species phenotypic disparities have been revealed by comparative genomics and molecular biology - there are key differences in all aspects of gene expression and protein function, from chromosome and chromatin structure to post-translational modification. The collective effects of these differences are striking, extensive and widespread, and they show that the superficial similarity between human and monkey genetic sequences is of little benefit for biomedical research. The extrapolation of biomedical data from monkeys to humans is therefore highly unreliable, and the use of monkeys must be considered of questionable value, particularly given the breadth and potential of alternative methods of enquiry that are currently available to scientists.
Collapse
Affiliation(s)
- Jarrod Bailey
- New England Anti-Vivisection Society (NEAVS), Boston, MA, USA
| |
Collapse
|
105
|
Deakin JE, Kruger-Andrzejewska M. Marsupials as models for understanding the role of chromosome rearrangements in evolution and disease. Chromosoma 2016; 125:633-44. [PMID: 27255308 DOI: 10.1007/s00412-016-0603-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2016] [Revised: 05/19/2016] [Accepted: 05/23/2016] [Indexed: 12/28/2022]
Abstract
Chromosome rearrangements have been implicated in diseases, such as cancer, and speciation, but it remains unclear whether rearrangements are causal or merely a consequence of these processes. Two marsupial families with very different rates of karyotype evolution provide excellent models in which to study the role of chromosome rearrangements in a disease and evolutionary context. The speciose family Dasyuridae displays remarkable karyotypic conservation, with all species examined to date possessing nearly identical karyotypes. Despite the seemingly high degree of chromosome stability within this family, they appear prone to developing tumours, including transmissible devil facial tumours. In contrast, chromosome rearrangements have been frequent in the evolution of the species-rich family Macropodidae, which displays a high level of karyotypic diversity. In particular, the genus Petrogale (rock-wallabies) displays an extraordinary level of chromosome rearrangement among species. For six parapatric Petrogale species, it appears that speciation has essentially been caught in the act, providing an opportunity to determine whether chromosomal rearrangements are a cause or consequence of speciation in this system. This review highlights the reasons that these two marsupial families are excellent models for testing hypotheses for hotspots of chromosome rearrangement and deciphering the role of chromosome rearrangements in disease and speciation.
Collapse
Affiliation(s)
- Janine E Deakin
- Institute for Applied Ecology, University of Canberra, Canberra, ACT, 2617, Australia.
| | | |
Collapse
|
106
|
|
107
|
Shao M, Moret BM. A Fast and Exact Algorithm for the Exemplar Breakpoint Distance. J Comput Biol 2016; 23:337-46. [DOI: 10.1089/cmb.2015.0193] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Mingfu Shao
- Laboratory for Computational Biology and Bioinformatics, School of Computer and Communication Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Bernard M.E. Moret
- Laboratory for Computational Biology and Bioinformatics, School of Computer and Communication Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| |
Collapse
|
108
|
Abstract
As a species, we possess unique biological features that distinguish us from other primates. Here, we review recent efforts to identify changes in gene regulation that drove the evolution of novel human phenotypes. We discuss genotype-directed comparisons of human and nonhuman primate genomes to identify human-specific genetic changes that may encode new regulatory functions. We also review phenotype-directed approaches, which use comparisons of gene expression or regulatory function in homologous human and nonhuman primate cells and tissues to identify changes in expression levels or regulatory activity that may be due to genetic changes in humans. Together, these studies are beginning to reveal the landscape of regulatory innovation in human evolution and point to specific regulatory changes for further study. Finally, we highlight two novel strategies to model human-specific regulatory functions in vivo: primate induced pluripotent stem cells and the generation of humanized mice by genome editing.
Collapse
Affiliation(s)
- Steven K Reilly
- Department of Genetics, Yale School of Medicine, New Haven, Connecticut 06510;
| | - James P Noonan
- Department of Genetics, Yale School of Medicine, New Haven, Connecticut 06510; .,Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut 06511.,Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, Connecticut 06510
| |
Collapse
|
109
|
Schmidt K, Noureen A, Kronenberg F, Utermann G. Structure, function, and genetics of lipoprotein (a). J Lipid Res 2016; 57:1339-59. [PMID: 27074913 DOI: 10.1194/jlr.r067314] [Citation(s) in RCA: 360] [Impact Index Per Article: 40.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2016] [Indexed: 12/29/2022] Open
Abstract
Lipoprotein (a) [Lp(a)] has attracted the interest of researchers and physicians due to its intriguing properties, including an intragenic multiallelic copy number variation in the LPA gene and the strong association with coronary heart disease (CHD). This review summarizes present knowledge of the structure, function, and genetics of Lp(a) with emphasis on the molecular and population genetics of the Lp(a)/LPA trait, as well as aspects of genetic epidemiology. It highlights the role of genetics in establishing Lp(a) as a risk factor for CHD, but also discusses uncertainties, controversies, and lack of knowledge on several aspects of the genetic Lp(a) trait, not least its function.
Collapse
Affiliation(s)
- Konrad Schmidt
- Divisions of Human Genetics Medical University of Innsbruck, Innsbruck, Austria Genetic Epidemiology, Medical University of Innsbruck, Innsbruck, Austria
| | - Asma Noureen
- Genetic Epidemiology, Medical University of Innsbruck, Innsbruck, Austria
| | - Florian Kronenberg
- Genetic Epidemiology, Medical University of Innsbruck, Innsbruck, Austria
| | - Gerd Utermann
- Divisions of Human Genetics Medical University of Innsbruck, Innsbruck, Austria
| |
Collapse
|
110
|
Host Mitochondrial Association Evolved in the Human Parasite Toxoplasma gondii via Neofunctionalization of a Gene Duplicate. Genetics 2016; 203:283-98. [PMID: 26920761 PMCID: PMC4858780 DOI: 10.1534/genetics.115.186270] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 02/14/2016] [Indexed: 01/30/2023] Open
Abstract
In Toxoplasma gondii, an intracellular parasite of humans and other animals, host mitochondrial association (HMA) is driven by a gene family that encodes multiple mitochondrial association factor 1 (MAF1) proteins. However, the importance of MAF1 gene duplication in the evolution of HMA is not understood, nor is the impact of HMA on parasite biology. Here we used within- and between-species comparative analysis to determine that the MAF1 locus is duplicated in T. gondii and its nearest extant relative Hammondia hammondi, but not another close relative, Neospora caninum. Using cross-species complementation, we determined that the MAF1 locus harbors multiple distinct paralogs that differ in their ability to mediate HMA, and that only T. gondii and H. hammondi harbor HMA+ paralogs. Additionally, we found that exogenous expression of an HMA+ paralog in T. gondii strains that do not normally exhibit HMA provides a competitive advantage over their wild-type counterparts during a mouse infection. These data indicate that HMA likely evolved by neofunctionalization of a duplicate MAF1 copy in the common ancestor of T. gondii and H. hammondi, and that the neofunctionalized gene duplicate is selectively advantageous.
Collapse
|
111
|
Franke L, el Bannoudi H, Jansen DTSL, Kok K, Trynka G, Diogo D, Swertz M, Fransen K, Knevel R, Gutierrez-Achury J, Ärlestig L, Greenberg JD, Kremer J, Pappas DA, Kanterakis A, Weersma RK, van der Helm-van Mil AHM, Guryev V, Rantapää-Dahlqvist S, Gregersen PK, Plenge RM, Wijmenga C, Huizinga TWJ, Ioan-Facsinay A, Toes REM, Zhernakova A. Association analysis of copy numbers of FC-gamma receptor genes for rheumatoid arthritis and other immune-mediated phenotypes. Eur J Hum Genet 2016; 24:263-70. [PMID: 25966632 PMCID: PMC4717214 DOI: 10.1038/ejhg.2015.95] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2014] [Revised: 03/27/2015] [Accepted: 04/15/2015] [Indexed: 12/20/2022] Open
Abstract
Segmental duplications (SDs) comprise about 5% of the human genome and are enriched for immune genes. SD loci often show copy numbers variations (CNV), which are difficult to tag with genotyping methods. CNV in the Fcγ receptor region (FCGR) has been suggested to be associated with rheumatic diseases. The objective of this study was to delineate association of FCGR-CNV with rheumatoid arthritis (RA), coeliac disease and Inflammatory bowel disease incidence. We developed a method to accurately quantify CNV in SD loci based on the intensity values from the Immunochip platform and applied it to the FCGR locus. We determined the method's validity using three independent assays: segregation analysis in families, arrayCGH, and whole genome sequencing. Our data showed the presence of two separate CNVs in the FCGR locus. The first region encodes FCGR2A, FCGR3A and part of FCGR2C gene, the second encodes another part of FCGR2C, FCGR3B and FCGR2B. Analysis of CNV status in 4578 individuals with RA and 5457 controls indicated association of duplications in the FCGR3B gene in antibody-negative RA (P=0.002, OR=1.43). Deletion in FCGR3B was associated with increased risk of antibody-positive RA, consistently with previous reports (P=0.023, OR=1.23). A clear genotype-phenotype relationship was observed: CNV polymorphisms of the FCGR3A gene correlated to CD16A expression (encoded by FCGR3A) on CD8 T-cells. In conclusion, our method allows determining the CNV status of the FCGR locus, we identified association of CNV in FCGR3B to RA and showed a functional relationship between CNV in the FCGR3A gene and CD16A expression.
Collapse
Affiliation(s)
- Lude Franke
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
| | - Hanane el Bannoudi
- Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Diahann T S L Jansen
- Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Klaas Kok
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
| | - Gosia Trynka
- Division of Rheumatology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Dorothee Diogo
- Division of Rheumatology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Morris Swertz
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
- Genomics Coordination Center, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Karin Fransen
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
| | - Rachel Knevel
- Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Javier Gutierrez-Achury
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
| | - Lisbeth Ärlestig
- Department of Public Health and Clinical Medicine/Rheumatology, Umeå University, Umeå, Sweden
| | - Jeffrey D Greenberg
- Department of Medicine, New York University School of Medicine, New York, New York, USA
| | - Joel Kremer
- Department of Medicine, Albany Medical College, Albany, New York, USA
| | - Dimitrios A Pappas
- Department of Medicine, Columbia University College of Physicians and Surgeons, New York, New York, USA
| | - Alexandros Kanterakis
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
- Genomics Coordination Center, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Rinse K Weersma
- Department of Gastroenterology and Hepatology, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
| | | | - Viktor Guryev
- Laboratory of Genome Structure and Ageing, European Research Institute for the Biology of Ageing, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
| | | | | | - Robert M Plenge
- Division of Rheumatology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Cisca Wijmenga
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
| | - Tom W-J Huizinga
- Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Andreea Ioan-Facsinay
- Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Rene E M Toes
- Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Alexandra Zhernakova
- Department of Genetics, University of Groningen, University Medical Centre Groningen, Groningen, The Netherlands
- Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
| |
Collapse
|
112
|
Formation of Nup98-containing nuclear bodies in HeLa sublines is linked to genomic rearrangements affecting chromosome 11. Chromosoma 2015; 125:789-805. [PMID: 26685999 DOI: 10.1007/s00412-015-0567-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Revised: 12/06/2015] [Accepted: 12/10/2015] [Indexed: 01/23/2023]
Abstract
Nup98 is an important component of the nuclear pore complex (NPC) and also a rare but recurrent target for chromosomal translocation in leukaemogenesis. Nup98 contains multiple cohesive Gly-Leu-Phe-Gly (GLFG) repeats that are critical notably for the formation of intranuclear GLFG bodies. Previous studies have reported the existence of GLFG bodies in cells overexpressing exogenous Nup98 or in a HeLa subline (HeLa-C) expressing an unusual elevated amount of endogenous Nup98. Here, we have analysed the presence of Nup98-containing bodies in several human cell lines. We found that HEp-2, another HeLa subline, contains GLFG bodies that are distinct from those identified in HeLa-C. Rapid amplification of cDNA ends (RACE) revealed that HEp-2 cells express additional truncated forms of Nup98 fused to a non-coding region of chromosome 11q22.1. Cytogenetic analyses using FISH and array-CGH further revealed chromosomal rearrangements that were distinct from those observed in leukaemic cells. Indeed, HEp-2 cells feature a massive amplification of juxtaposed NUP98 and 11q22.1 loci on a chromosome marker derived from chromosome 3. Unexpectedly, minor co-amplifications of NUP98 and 11q22.1 loci were also observed in other HeLa sublines, but on rearranged chromosomes 11. Altogether, this study reveals that distinct genomic rearrangements affecting NUP98 are associated with the formation of GLFG bodies in specific HeLa sublines.
Collapse
|
113
|
Yuan B, Liu P, Gupta A, Beck CR, Tejomurtula A, Campbell IM, Gambin T, Simmons AD, Withers MA, Harris RA, Rogers J, Schwartz DC, Lupski JR. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates. PLoS Genet 2015; 11:e1005686. [PMID: 26641089 PMCID: PMC4671654 DOI: 10.1371/journal.pgen.1005686] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2015] [Accepted: 10/29/2015] [Indexed: 11/30/2022] Open
Abstract
Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. Genomic instability due to the intrinsic sequence architecture of the genome, such as low copy repeats (LCRs), is a major contributor to de novo mutations that can occur in the process of human genome evolution. LCRs can mediate genomic rearrangements associated with genomic disorders by acting as substrates for nonallelic homologous recombination. Juvenile-onset nephronophthisis 1 is the most frequent genetic cause of renal failure in children. An LCR-mediated, homozygous common recurrent deletion encompassing NPHP1 is found in the majority of affected subjects, while heterozygous deletion representing the nephronophthisis 1 recessive carrier state is frequently observed amongst world populations. Interestingly, the human NPHP1 locus is located proximal to the head-to-head fusion site of two ancestral chromosomes that occurred in the great apes, which resulted in a reduction of chromosome number from 48 in nonhuman primates to the current 46 in humans. In this study, we characterized and provided evidence for the diverse genomic architecture at the NPHP1 locus and potential structural variant haplotypes in the human population. Furthermore, our analyses of primate genomes shed light on the massive changes of genomic architecture at the human NPHP1 locus and delineated a model for the emergence of the LCRs during primate evolution.
Collapse
Affiliation(s)
- Bo Yuan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Pengfei Liu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Aditya Gupta
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and The UW-Biotechnology Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Christine R. Beck
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Anusha Tejomurtula
- Graduate Program in Diagnostic Genetics, School of Health Professions, University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America
| | - Ian M. Campbell
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Tomasz Gambin
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Alexandra D. Simmons
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Marjorie A. Withers
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
| | - R. Alan Harris
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
| | - Jeffrey Rogers
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
| | - David C. Schwartz
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and The UW-Biotechnology Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - James R. Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America
- Department of Pediatrics, Baylor College of Medicine, Houston, Texas, United States of America
- Texas Children’s Hospital, Houston, Texas, United States of America
- * E-mail:
| |
Collapse
|
114
|
Ferrier DEK. The origin of the Hox/ParaHox genes, the Ghost Locus hypothesis and the complexity of the first animal. Brief Funct Genomics 2015; 15:333-41. [PMID: 26637506 DOI: 10.1093/bfgp/elv056] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
A key aim in evolutionary biology is to deduce ancestral states to better understand the evolutionary origins of clades of interest and the diversification process(es) that has/have elaborated them. These ancestral deductions can hit difficulties when undetected loss events are misinterpreted as ancestral absences. With the ever-increasing amounts of animal genomic sequence data, we are gaining a much clearer view of the preponderance of differential gene losses across animal lineages. This has become particularly clear with recent progress in our understanding of the origins of the Hox/ParaHox developmental control genes relative to the earliest branching lineages of the animal kingdom: the sponges (Porifera), comb jellies (Ctenophora) and placozoans (Placozoa). These reassessments of the diversity and complexity of developmental control genes in the earliest animal ancestors need to go hand-in-hand with complementary advances in comparative morphology, phylogenetics and palaeontology to clarify our understanding of the complexity of the last common ancestor of all animals. The field is currently undergoing a shift from the traditional consensus of a sponge-like animal ancestor from which morphological and molecular elaboration subsequently evolved, to a scenario of a more complex animal ancestor, with subsequent losses and simplifications in various lineages.
Collapse
|
115
|
Kim YJ, Ahn K, Gim JA, Oh MH, Han K, Kim HS. Gene structure variation in segmental duplication block C of human chromosome 7q 11.23 during primate evolution. Gene 2015. [PMID: 26196062 DOI: 10.1016/j.gene.2015.07.060] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Segmental duplication, or low-copy repeat (LCR) event, occurs during primate evolution and is an important source of genomic diversity, including gain or loss of gene function. The human chromosome 7q 11.23 is related to the William-Beuren syndrome and contains large region-specific LCRs composed of blocks A, B, and C that have different copy numbers in humans and different primates. We analyzed the structure of POM121, NSUN5, FKBP6, and TRIM50 genes in the LCRs of block C. Based on computational analysis, POM121B created by a segmental duplication acquired a new exonic region, whereas NSUN5B (NSUN5C) showed structural variation by integration of HERV-K LTR after duplication from the original NSUN5 gene. The TRIM50 gene originally consists of seven exons, whereas the duplicated TRIM73 and TRIM74 genes present five exons because of homologous recombination-mediated deletion. In addition, independent duplication events of the FKBP6 gene generated two pseudogenes at different genomic locations. In summary, these clustered genes are created by segmental duplication, indicating that they show dynamic evolutionary events, leading to structure variation in the primate genome.
Collapse
Affiliation(s)
- Yun-Ji Kim
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea; DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea
| | - Kung Ahn
- TBI, Theragen BiO Institute, TheragenEtex, Suwon 443-270, Republic of Korea
| | - Jeong-An Gim
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan 609-735, Republic of Korea
| | - Man Hwan Oh
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea
| | - Kyudong Han
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea; DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea
| | - Heui-Soo Kim
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan 609-735, Republic of Korea.
| |
Collapse
|
116
|
Schilling S, Gramzow L, Lobbes D, Kirbis A, Weilandt L, Hoffmeier A, Junker A, Weigelt-Fischer K, Klukas C, Wu F, Meng Z, Altmann T, Theißen G. Non-canonical structure, function and phylogeny of the Bsister MADS-box gene OsMADS30 of rice (Oryza sativa). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2015; 84:1059-1072. [PMID: 26473514 DOI: 10.1111/tpj.13055] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Revised: 10/02/2015] [Accepted: 10/06/2015] [Indexed: 06/05/2023]
Abstract
Bsister MADS-box genes play key roles in female reproductive organ and seed development throughout seed plants. This view is supported by their high conservation in terms of sequence, expression and function. In grasses, there are three subclades of Bsister genes: the OsMADS29-, the OsMADS30- and the OsMADS31-like genes. Here, we report on the evolution of the OsMADS30-like genes. Our analyses indicate that these genes evolved under relaxed purifying selection and are rather weakly expressed. OsMADS30, the representative of the OsMADS30-like genes from rice (Oryza sativa), shows strong sequence deviations in its 3' region when compared to orthologues from other grass species. We show that this is due to a 2.4-kbp insertion, possibly of a hitherto unknown helitron, which confers a heterologous C-terminal domain to OsMADS30. This putative helitron is not present in the OsMADS30 orthologues from closely related wild rice species, pointing to a relatively recent insertion event. Unlike other Bsister mutants O. sativa plants carrying a T-DNA insertion in the OsMADS30 gene do not show aberrant seed phenotypes, indicating that OsMADS30 likely does not have a canonical 'Bsister function'. However, imaging-based phenotyping of the T-DNA carrying plants revealed alterations in shoot size and architecture. We hypothesize that sequence deviations that accumulated during a period of relaxed selection in the gene lineage that led to OsMADS30 and the alteration of the C-terminal domain might have been a precondition for a potential neo-functionalization of OsMADS30 in O. sativa.
Collapse
Affiliation(s)
- Susanne Schilling
- Department of Genetics, Friedrich Schiller University Jena, Jena, D-07743, Germany
| | - Lydia Gramzow
- Department of Genetics, Friedrich Schiller University Jena, Jena, D-07743, Germany
| | - Dajana Lobbes
- Department of Genetics, Friedrich Schiller University Jena, Jena, D-07743, Germany
| | - Alexander Kirbis
- Department of Genetics, Friedrich Schiller University Jena, Jena, D-07743, Germany
| | - Lisa Weilandt
- Department of Genetics, Friedrich Schiller University Jena, Jena, D-07743, Germany
| | - Andrea Hoffmeier
- Department of Genetics, Friedrich Schiller University Jena, Jena, D-07743, Germany
| | - Astrid Junker
- Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, D-06466, Germany
| | - Kathleen Weigelt-Fischer
- Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, D-06466, Germany
| | - Christian Klukas
- Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, D-06466, Germany
| | - Feng Wu
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
| | - Zheng Meng
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
| | - Thomas Altmann
- Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, D-06466, Germany
| | - Günter Theißen
- Department of Genetics, Friedrich Schiller University Jena, Jena, D-07743, Germany
| |
Collapse
|
117
|
Mudge JM, Harrow J. Creating reference gene annotation for the mouse C57BL6/J genome assembly. Mamm Genome 2015; 26:366-78. [PMID: 26187010 PMCID: PMC4602055 DOI: 10.1007/s00335-015-9583-x] [Citation(s) in RCA: 168] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Accepted: 06/18/2015] [Indexed: 12/14/2022]
Abstract
Annotation on the reference genome of the C57BL6/J mouse has been an ongoing project ever since the draft genome was first published. Initially, the principle focus was on the identification of all protein-coding genes, although today the importance of describing long non-coding RNAs, small RNAs, and pseudogenes is recognized. Here, we describe the progress of the GENCODE mouse annotation project, which combines manual annotation from the HAVANA group with Ensembl computational annotation, alongside experimental and in silico validation pipelines from other members of the consortium. We discuss the more recent incorporation of next-generation sequencing datasets into this workflow, including the usage of mass-spectrometry data to potentially identify novel protein-coding genes. Finally, we will outline how the C57BL6/J genebuild can be used to gain insights into the variant sites that distinguish different mouse strains and species.
Collapse
|
118
|
Zeng X, Li B, Welch R, Rojo C, Zheng Y, Dewey CN, Keleş S. Perm-seq: Mapping Protein-DNA Interactions in Segmental Duplication and Highly Repetitive Regions of Genomes with Prior-Enhanced Read Mapping. PLoS Comput Biol 2015; 11:e1004491. [PMID: 26484757 PMCID: PMC4618727 DOI: 10.1371/journal.pcbi.1004491] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2014] [Accepted: 08/06/2015] [Indexed: 11/19/2022] Open
Abstract
Segmental duplications and other highly repetitive regions of genomes contribute significantly to cells' regulatory programs. Advancements in next generation sequencing enabled genome-wide profiling of protein-DNA interactions by chromatin immunoprecipitation followed by high throughput sequencing (ChIP-seq). However, interactions in highly repetitive regions of genomes have proven difficult to map since short reads of 50-100 base pairs (bps) from these regions map to multiple locations in reference genomes. Standard analytical methods discard such multi-mapping reads and the few that can accommodate them are prone to large false positive and negative rates. We developed Perm-seq, a prior-enhanced read allocation method for ChIP-seq experiments, that can allocate multi-mapping reads in highly repetitive regions of the genomes with high accuracy. We comprehensively evaluated Perm-seq, and found that our prior-enhanced approach significantly improves multi-read allocation accuracy over approaches that do not utilize additional data types. The statistical formalism underlying our approach facilitates supervising of multi-read allocation with a variety of data sources including histone ChIP-seq. We applied Perm-seq to 64 ENCODE ChIP-seq datasets from GM12878 and K562 cells and identified many novel protein-DNA interactions in segmental duplication regions. Our analysis reveals that although the protein-DNA interactions sites are evolutionarily less conserved in repetitive regions, they share the overall sequence characteristics of the protein-DNA interactions in non-repetitive regions.
Collapse
Affiliation(s)
- Xin Zeng
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Bo Li
- California Institute for Quantitative Biosciences, University of California, Berkeley, California, United States of America
| | - Rene Welch
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Constanza Rojo
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Ye Zheng
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Colin N. Dewey
- Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Sündüz Keleş
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
- Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin, United States of America
| |
Collapse
|
119
|
Warren WC, Jasinska AJ, García-Pérez R, Svardal H, Tomlinson C, Rocchi M, Archidiacono N, Capozzi O, Minx P, Montague MJ, Kyung K, Hillier LW, Kremitzki M, Graves T, Chiang C, Hughes J, Tran N, Huang Y, Ramensky V, Choi OW, Jung YJ, Schmitt CA, Juretic N, Wasserscheid J, Turner TR, Wiseman RW, Tuscher JJ, Karl JA, Schmitz JE, Zahn R, O'Connor DH, Redmond E, Nisbett A, Jacquelin B, Müller-Trutwin MC, Brenchley JM, Dione M, Antonio M, Schroth GP, Kaplan JR, Jorgensen MJ, Thomas GWC, Hahn MW, Raney BJ, Aken B, Nag R, Schmitz J, Churakov G, Noll A, Stanyon R, Webb D, Thibaud-Nissen F, Nordborg M, Marques-Bonet T, Dewar K, Weinstock GM, Wilson RK, Freimer NB. The genome of the vervet (Chlorocebus aethiops sabaeus). Genome Res 2015; 25:1921-33. [PMID: 26377836 PMCID: PMC4665013 DOI: 10.1101/gr.192922.115] [Citation(s) in RCA: 101] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Accepted: 09/10/2015] [Indexed: 01/20/2023]
Abstract
We describe a genome reference of the African green monkey or vervet (Chlorocebus aethiops). This member of the Old World monkey (OWM) superfamily is uniquely valuable for genetic investigations of simian immunodeficiency virus (SIV), for which it is the most abundant natural host species, and of a wide range of health-related phenotypes assessed in Caribbean vervets (C. a. sabaeus), whose numbers have expanded dramatically since Europeans introduced small numbers of their ancestors from West Africa during the colonial era. We use the reference to characterize the genomic relationship between vervets and other primates, the intra-generic phylogeny of vervet subspecies, and genome-wide structural variations of a pedigreed C. a. sabaeus population. Through comparative analyses with human and rhesus macaque, we characterize at high resolution the unique chromosomal fission events that differentiate the vervets and their close relatives from most other catarrhine primates, in whom karyotype is highly conserved. We also provide a summary of transposable elements and contrast these with the rhesus macaque and human. Analysis of sequenced genomes representing each of the main vervet subspecies supports previously hypothesized relationships between these populations, which range across most of sub-Saharan Africa, while uncovering high levels of genetic diversity within each. Sequence-based analyses of major histocompatibility complex (MHC) polymorphisms reveal extremely low diversity in Caribbean C. a. sabaeus vervets, compared to vervets from putatively ancestral West African regions. In the C. a. sabaeus research population, we discover the first structural variations that are, in some cases, predicted to have a deleterious effect; future studies will determine the phenotypic impact of these variations.
Collapse
Affiliation(s)
- Wesley C Warren
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Anna J Jasinska
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA; Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Raquel García-Pérez
- ICREA at Institut de Biologia Evolutiva, (UPF-CSIC) and Centro Nacional de Analisis Genomico (CNAG), PRBB/PCB, 08003 Barcelona, Spain
| | - Hannes Svardal
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter (VBC), 1030 Vienna, Austria
| | - Chad Tomlinson
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Mariano Rocchi
- Department of Biology, University of Bari, Bari 70126, Italy
| | | | - Oronzo Capozzi
- Department of Biology, University of Bari, Bari 70126, Italy
| | - Patrick Minx
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Michael J Montague
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Kim Kyung
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - LaDeana W Hillier
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Milinn Kremitzki
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Tina Graves
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Colby Chiang
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | | | - Nam Tran
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Yu Huang
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Vasily Ramensky
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Oi-Wa Choi
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Yoon J Jung
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Christopher A Schmitt
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Nikoleta Juretic
- Department of Human Genetics, McGill University, Montreal QC H3A 1B1, Canada
| | | | - Trudy R Turner
- Department of Anthropology, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin 53705, USA; Department of Genetics Faculty of Natural and Agricultural Sciences, University of the Free State, Bloemfontein, 9300 South Africa
| | - Roger W Wiseman
- Department of Laboratory Medicine and Pathology, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Jennifer J Tuscher
- Department of Laboratory Medicine and Pathology, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Julie A Karl
- Department of Laboratory Medicine and Pathology, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Jörn E Schmitz
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Boston, Massachusetts 02115, USA
| | - Roland Zahn
- Crucell Holland B.V., 2333 CN Leiden, The Netherlands
| | - David H O'Connor
- Department of Laboratory Medicine and Pathology, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Eugene Redmond
- St. Kitts Biomedical Research Foundation, St. Kitts, West Indies
| | - Alex Nisbett
- St. Kitts Biomedical Research Foundation, St. Kitts, West Indies
| | - Béatrice Jacquelin
- Institut Pasteur, Unité de Régulation des Infections Rétrovirales, 75015 Paris, France
| | | | - Jason M Brenchley
- National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, Maryland 20892-9821, USA
| | | | | | | | - Jay R Kaplan
- Center for Comparative Medicine Research, Wake Forest School of Medicine, Winston-Salem 27157-1040, USA
| | - Matthew J Jorgensen
- Center for Comparative Medicine Research, Wake Forest School of Medicine, Winston-Salem 27157-1040, USA
| | - Gregg W C Thomas
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Matthew W Hahn
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Brian J Raney
- University of California Santa Cruz, Santa Cruz, California 95060, USA
| | - Bronwen Aken
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Rishi Nag
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Juergen Schmitz
- Institute of Experimental Pathology (ZMBE), University of Münster, 48149 Münster, Germany
| | - Gennady Churakov
- Institute of Experimental Pathology (ZMBE), University of Münster, 48149 Münster, Germany; Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
| | - Angela Noll
- Institute of Experimental Pathology (ZMBE), University of Münster, 48149 Münster, Germany
| | - Roscoe Stanyon
- Department of Biology, University of Florence, 50122 Florence, Italy
| | - David Webb
- National Center for Biotechnology Information, Bethesda, Maryland 20894, USA
| | | | - Magnus Nordborg
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter (VBC), 1030 Vienna, Austria
| | - Tomas Marques-Bonet
- ICREA at Institut de Biologia Evolutiva, (UPF-CSIC) and Centro Nacional de Analisis Genomico (CNAG), PRBB/PCB, 08003 Barcelona, Spain
| | - Ken Dewar
- Department of Human Genetics, McGill University, Montreal QC H3A 1B1, Canada
| | - George M Weinstock
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06001, USA
| | - Richard K Wilson
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Nelson B Freimer
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| |
Collapse
|
120
|
Brkušanin M, Kosać A, Jovanović V, Pešović J, Brajušković G, Dimitrijević N, Todorović S, Romac S, Milić Rašić V, Savić-Pavićević D. Joint effect of the SMN2 and SERF1A genes on childhood-onset types of spinal muscular atrophy in Serbian patients. J Hum Genet 2015; 60:723-8. [PMID: 26311540 DOI: 10.1038/jhg.2015.104] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Revised: 07/08/2015] [Accepted: 07/30/2015] [Indexed: 11/09/2022]
Abstract
Spinal muscular atrophy (SMA) is caused by functional loss of the survival of motor neuron 1 (SMN1) gene. Despite genetic homogeneity, phenotypic variability indicates the involvement of disease modifiers. SMN1 is located in 5q13.2 segmental duplication, enriched in genes and prone to unequal rearrangements, which results in copy number polymorphism (CNP). We examined the influence of CNP of 5q13.2 genes and their joint effect on childhood-onset SMA phenotype. Multiplex ligation-dependent probe amplification (MLPA) was used to construct 5q13.2 alleles and assess copy number of the SMN2, small EDRK-rich factor 1A (SERF1A) and NLR family apoptosis inhibitory protein (NAIP) genes in 99 Serbian patients with SMN1 homozygous absence (23-type I, 37-type II and 39-mild type III) and 122 patients' parents. Spearman rank test was performed to test correlation of individual genes and SMA type. Generalized linear models and backward selection were performed to obtain a model explaining phenotypic variation with the smallest set of variables. 5q13.2 alleles most commonly associated with type I harbored large-scale deletions, while those detected in types II and III originated from conversion of SMN1 to SMN2. Inverse correlation was observed between SMN2, SERF1A and NAIP CNP and SMA type (P=2.2e-16, P=4.264e-10, P=2.722e-8, respectively). The best minimal model describing phenotypic variability included SMN2 (P<2e-16), SERF1A (P<2e-16) and their interaction (P=0.02628). SMN2 and SERF1A have a joint modifying effect on childhood-onset SMA phenotype.
Collapse
Affiliation(s)
- Miloš Brkušanin
- Centre for Human Molecular Genetics, Faculty of Biology, University of Belgrade, Belgrade, Serbia
| | - Ana Kosać
- Clinic for Neurology and Psychiatry for Children and Youth, Belgrade, Serbia
| | - Vladimir Jovanović
- Department of Genetic Research, Institute for Biological Research 'Siniša Stanković', University of Belgrade, Belgrade, Serbia
| | - Jovan Pešović
- Centre for Human Molecular Genetics, Faculty of Biology, University of Belgrade, Belgrade, Serbia
| | - Goran Brajušković
- Centre for Human Molecular Genetics, Faculty of Biology, University of Belgrade, Belgrade, Serbia
| | - Nikola Dimitrijević
- Department of Neurology, University Children's Hospital, Faculty of Medicine, University of Belgrade, Belgrade, Serbia.,Faculty of Medicine, University of Belgrade, Belgrade, Serbia
| | - Slobodanka Todorović
- Clinic for Neurology and Psychiatry for Children and Youth, Belgrade, Serbia.,Faculty of Medicine, University of Belgrade, Belgrade, Serbia
| | - Stanka Romac
- Centre for Human Molecular Genetics, Faculty of Biology, University of Belgrade, Belgrade, Serbia
| | - Vedrana Milić Rašić
- Clinic for Neurology and Psychiatry for Children and Youth, Belgrade, Serbia.,Faculty of Medicine, University of Belgrade, Belgrade, Serbia
| | - Dušanka Savić-Pavićević
- Centre for Human Molecular Genetics, Faculty of Biology, University of Belgrade, Belgrade, Serbia
| |
Collapse
|
121
|
Human GPR42 is a transcribed multisite variant that exhibits copy number polymorphism and is functional when heterologously expressed. Sci Rep 2015; 5:12880. [PMID: 26260360 PMCID: PMC4531286 DOI: 10.1038/srep12880] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2015] [Accepted: 07/13/2015] [Indexed: 01/22/2023] Open
Abstract
FFAR3 (GPR41) is a G-protein coupled receptor for which short-chain fatty acids serve as endogenous ligands. The receptor is found on gut enteroendocrine L-cells, pancreatic β-cells, and sympathetic neurons, and is implicated in obesity, diabetes, allergic airway disease, and altered immune function. In primates, FFAR3 is segmentally duplicated resulting in GPR42, a gene currently classified as a suspected pseudogene. In this study, we sequenced FFAR3 and GPR42 open reading frames from 56 individuals and found an unexpectedly high frequency of polymorphisms contributing to several complex haplotypes. We also identified a frequent (18.8%) structural variation that results in GPR42 copy number polymorphism. Finally, sequencing revealed that 50.6% of GPR42 haplotypes differed from FFAR3 by only a single non-synonymous substitution and that the GPR42 reference sequence matched only 4.4% of the alleles. Sequencing of cDNA from human sympathetic ganglia and colon revealed processed transcripts matching the GPR42 genotype. Expression of several GPR42 haplotypes in rat sympathetic neurons revealed diverse pharmacological phenotypes that differed in potency and efficacy. Our data suggest that GPR42 be reclassified as a functioning gene and that recognition of sequence and copy number polymorphism of the FFAR3/GPR42 complex be considered during genetic and pharmacological investigation of these receptors.
Collapse
|
122
|
Dobigny G, Britton-Davidian J, Robinson TJ. Chromosomal polymorphism in mammals: an evolutionary perspective. Biol Rev Camb Philos Soc 2015; 92:1-21. [PMID: 26234165 DOI: 10.1111/brv.12213] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Revised: 06/23/2015] [Accepted: 07/09/2015] [Indexed: 12/28/2022]
Abstract
Although chromosome rearrangements (CRs) are central to studies of genome evolution, our understanding of the evolutionary consequences of the early stages of karyotypic differentiation (i.e. polymorphism), especially the non-meiotic impacts, is surprisingly limited. We review the available data on chromosomal polymorphisms in mammals so as to identify taxa that hold promise for developing a more comprehensive understanding of chromosomal change. In doing so, we address several key questions: (i) to what extent are mammalian karyotypes polymorphic, and what types of rearrangements are principally involved? (ii) Are some mammalian lineages more prone to chromosomal polymorphism than others? More specifically, do (karyotypically) polymorphic mammalian species belong to lineages that are also characterized by past, extensive karyotype repatterning? (iii) How long can chromosomal polymorphisms persist in mammals? We discuss the evolutionary implications of these questions and propose several research avenues that may shed light on the role of chromosome change in the diversification of mammalian populations and species.
Collapse
Affiliation(s)
- Gauthier Dobigny
- Institut de Recherche pour le Développement, Centre de Biologie pour la Gestion des Populations (UMR IRD-INRA-Cirad-Montpellier SupAgro), Campus International de Baillarguet, CS30016, 34988, Montferrier-sur-Lez, France
| | - Janice Britton-Davidian
- Institut des Sciences de l'Evolution, Université de Montpellier, CNRS, IRD, EPHE, Cc065, Place Eugène Bataillon, 34095, Montpellier Cedex 5, France
| | - Terence J Robinson
- Evolutionary Genomics Group, Department of Botany and Zoology, Stellenbosch University, Private Bag X1, Matieland, Stellenbosch, 7062, South Africa
| |
Collapse
|
123
|
Abstract
The world of primate genomics is expanding rapidly in new and exciting ways owing to lowered costs and new technologies in molecular methods and bioinformatics. The primate order is composed of 78 genera and 478 species, including human. Taxonomic inferences are complex and likely a consequence of ongoing hybridization, introgression, and reticulate evolution among closely related taxa. Recently, we applied large-scale sequencing methods and extensive taxon sampling to generate a highly resolved phylogeny that affirms, reforms, and extends previous depictions of primate speciation. The next stage of research uses this phylogeny as a foundation for investigating genome content, structure, and evolution across primates. Ongoing and future applications of a robust primate phylogeny are discussed, highlighting advancements in adaptive evolution of genes and genomes, taxonomy and conservation management of endangered species, next-generation genomic technologies, and biomedicine.
Collapse
Affiliation(s)
- Jill Pecon-Slattery
- Laboratory of Genomic Diversity, National Cancer Institute, Frederick, Maryland 21702; Current Affiliation: Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, Virginia 22630;
| |
Collapse
|
124
|
Dumont BL. Interlocus gene conversion explains at least 2.7% of single nucleotide variants in human segmental duplications. BMC Genomics 2015; 16:456. [PMID: 26077037 PMCID: PMC4467073 DOI: 10.1186/s12864-015-1681-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Accepted: 06/01/2015] [Indexed: 01/24/2023] Open
Abstract
Background Interlocus gene conversion (IGC) is a recombination-based mechanism that results in the unidirectional transfer of short stretches of sequence between paralogous loci. Although IGC is a well-established mechanism of human disease, the extent to which this mutagenic process has shaped overall patterns of segregating variation in multi-copy regions of the human genome remains unknown. One expected manifestation of IGC in population genomic data is the presence of one-to-one paralogous SNPs that segregate identical alleles. Results Here, I use SNP genotype calls from the low-coverage phase 3 release of the 1000 Genomes Project to identify 15,790 parallel, shared SNPs in duplicated regions of the human genome. My approach for identifying these sites accounts for the potential redundancy of short read mapping in multi-copy genomic regions, thereby effectively eliminating false positive SNP calls arising from paralogous sequence variation. I demonstrate that independent mutation events to identical nucleotides at paralogous sites are not a significant source of shared polymorphisms in the human genome, consistent with the interpretation that these sites are the outcome of historical IGC events. These putative signals of IGC are enriched in genomic contexts previously associated with non-allelic homologous recombination, including clear signals in gene families that form tandem intra-chromosomal clusters. Conclusions Taken together, my analyses implicate IGC, not point mutation, as the mechanism generating at least 2.7 % of single nucleotide variants in duplicated regions of the human genome. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1681-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Beth L Dumont
- Initiative in Biological Complexity, North Carolina State University, 112 Derieux Place, 3510 Thomas Hall, Campus Box 7614, Raleigh, NC, 27695-7614, USA.
| |
Collapse
|
125
|
Melkikh AV. Paradoxes of early stages of evolution of life and biological complexity. ORIGINS LIFE EVOL B 2015; 45:163-71. [PMID: 25754592 DOI: 10.1007/s11084-015-9414-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2014] [Accepted: 12/15/2014] [Indexed: 10/23/2022]
Abstract
Two of the most fundamental questions concerning the origin of life, how biologically important molecules (RNA, proteins) find their unique spatial configuration, and how coding sequences can evolve beyond a certain critical length, are discussed. It is shown that both of these problems have not been solved. Experiments that could clarify the mechanisms of interaction between biologically important molecules in the simplest cells are discussed.
Collapse
|
126
|
Ruchi R, Genovese G, Lee J, Charoonratana VT, Bernhardy AJ, Alper SL, Kopp JB, Thadhani R, Friedman DJ, Pollak MR. Copy Number Variation at the APOL1 Locus. PLoS One 2015; 10:e0125410. [PMID: 25933006 PMCID: PMC4416782 DOI: 10.1371/journal.pone.0125410] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2014] [Accepted: 03/23/2015] [Indexed: 11/26/2022] Open
Abstract
Two coding variants in the APOL1 gene (G1 and G2) explain most of the high rate of kidney disease in African Americans. APOL1-associated kidney disease risk inheritance follows an autosomal recessive pattern: The relative risk of kidney disease associated with inheritance of two high-risk variants is 7–30 fold, depending on the specific kidney phenotype. We wished to determine if the variability in phenotype might in part reflect structural differences in APOL1 gene. We analyzed sequence coverage from 1000 Genomes Project Phase 3 samples as well as exome sequencing data from African American kidney disease cases for copy number variation. 8 samples sequenced in the 1000 Genomes Project showed increased coverage over a ~100kb region that includes APOL2, APOL1 and part of MYH9, suggesting the presence of APOL1 copy number greater than 2. We reasoned that such duplications should be enriched in apparent G1 heterozygotes with kidney disease. Using a PCR-based assay, we observed the presence of this duplication in additional samples from apparent G0G1 or G0G2 individuals. The frequency of this APOL1 duplication was compared among cases (n = 123) and controls (n = 255) with apparent G0G1 heterozygosity. The presence of APOL1 duplication was observed in 4.06% of cases and 0.78% controls, preliminary evidence that this APOL1 duplication may alter susceptibility to kidney disease (p = 0.03). Taqman-based copy number assays confirmed the presence of 3 APOL1 copies in individuals positive for this specific duplication by PCR assay, but also identified a small number of individuals with additional APOL1 copies of presumably different structure. These observations motivate further studies to better assess the contribution of APOL1 copy number on kidney disease risk and on APOL1 function. Investigators and clinicians genotyping APOL1 should also consider whether the particular genotyping platform used is subject to technical errors when more than two copies of APOL1 are present.
Collapse
Affiliation(s)
- Rupam Ruchi
- Division of Nephrology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
- Division of Nephrology, Department of Medicine, University of Florida, Gainesville, Florida, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Giulio Genovese
- Stanley Center, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
| | - Jessica Lee
- Division of Nephrology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| | - Victoria T. Charoonratana
- Division of Nephrology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| | - Andrea J. Bernhardy
- Division of Nephrology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| | - Seth L. Alper
- Division of Nephrology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
- Stanley Center, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
| | - Jeffrey B. Kopp
- Kidney Diseases Branch, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Ravi Thadhani
- Renal Unit, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | - David J. Friedman
- Division of Nephrology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
- Center for Vascular Biology Research, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| | - Martin R. Pollak
- Division of Nephrology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
127
|
Shao M, Lin Y, Moret BM. An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes. J Comput Biol 2015; 22:425-35. [DOI: 10.1089/cmb.2014.0096] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Affiliation(s)
- Mingfu Shao
- Laboratory for Computational Biology and Bioinformatics, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Yu Lin
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, California
| | - Bernard M.E. Moret
- Laboratory for Computational Biology and Bioinformatics, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| |
Collapse
|
128
|
Costantini TW, Dang X, Yurchyshyna MV, Coimbra R, Eliceiri BP, Baird A. A Human-Specific α7-Nicotinic Acetylcholine Receptor Gene in Human Leukocytes: Identification, Regulation and the Consequences of CHRFAM7A Expression. Mol Med 2015; 21:323-36. [PMID: 25860877 DOI: 10.2119/molmed.2015.00018] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2015] [Accepted: 04/02/2015] [Indexed: 12/30/2022] Open
Abstract
The human genome contains a variant form of the α7-nicotinic acetylcholine receptor (α7nAChR) gene that is uniquely human. This CHRFAM7A gene arose during human speciation and recent data suggests that its expression alters ligand tropism of the normally homopentameric human α7-AChR ligand-gated cell surface ion channel that is found on the surface of many different cell types. To understand its possible significance in regulating inflammation in humans, we investigated its expression in normal human leukocytes and leukocyte cell lines, compared CHRFAM7A expression to that of the CHRNA7 gene, mapped its promoter and characterized the effects of stable CHRFAM7A overexpression. We report here that CHRFAM7A is highly expressed in human leukocytes but that the levels of both CHRFAM7A and CHRNA7 mRNAs were independent and varied widely. To this end, mapping of the CHRFAM7A promoter in its 5'-untranslated region (UTR) identified a unique 1-kb sequence that independently regulates CHRFAM7A gene expression. Because overexpression of CHRFAM7A in THP1 cells altered the cell phenotype and modified the expression of genes associated with focal adhesion (for example, FAK, P13K, Akt, rho, GEF, Elk1, CycD), leukocyte transepithelial migration (Nox, ITG, MMPs, PKC) and cancer (kit, kitL, ras, cFos cyclinD1, Frizzled and GPCR), we conclude that CHRFAM7A is biologically active. Most surprisingly however, stable CHRFAM7A overexpression in THP1 cells upregulated CHRNA7, which, in turn, led to increased binding of the specific α7nAChR ligand, bungarotoxin, on the THP1 cell surface. Taken together, these data confirm the close association between CHRFAM7A and CHRNA7 expression, establish a biological consequence to CHRFAM7A expression in human leukocytes and support the possibility that this human-specific gene might contribute to, and/or gauge, a human-specific response to inflammation.
Collapse
Affiliation(s)
- Todd W Costantini
- Department of Surgery, University of California San Diego Health Sciences, San Diego, California, United States of America
| | - Xitong Dang
- Department of Surgery, University of California San Diego Health Sciences, San Diego, California, United States of America.,Cardiovascular Research Center, Luzhou Medical College, Luzhou, Sichuan, China
| | - Maryana V Yurchyshyna
- Department of Surgery, University of California San Diego Health Sciences, San Diego, California, United States of America
| | - Raul Coimbra
- Department of Surgery, University of California San Diego Health Sciences, San Diego, California, United States of America
| | - Brian P Eliceiri
- Department of Surgery, University of California San Diego Health Sciences, San Diego, California, United States of America
| | - Andrew Baird
- Department of Surgery, University of California San Diego Health Sciences, San Diego, California, United States of America
| |
Collapse
|
129
|
Farré M, Robinson TJ, Ruiz-Herrera A. An Integrative Breakage Model of genome architecture, reshuffling and evolution: The Integrative Breakage Model of genome evolution, a novel multidisciplinary hypothesis for the study of genome plasticity. Bioessays 2015; 37:479-88. [PMID: 25739389 DOI: 10.1002/bies.201400174] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2014] [Revised: 02/12/2015] [Accepted: 02/13/2015] [Indexed: 12/23/2022]
Abstract
Our understanding of genomic reorganization, the mechanics of genomic transmission to offspring during germ line formation, and how these structural changes contribute to the speciation process, and genetic disease is far from complete. Earlier attempts to understand the mechanism(s) and constraints that govern genome remodeling suffered from being too narrowly focused, and failed to provide a unified and encompassing view of how genomes are organized and regulated inside cells. Here, we propose a new multidisciplinary Integrative Breakage Model for the study of genome evolution. The analysis of the high-level structural organization of genomes (nucleome), together with the functional constrains that accompany genome reshuffling, provide insights into the origin and plasticity of genome organization that may assist with the detection and isolation of therapeutic targets for the treatment of complex human disorders.
Collapse
Affiliation(s)
- Marta Farré
- Departament de Biologia Cel·lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, Campus UAB, Barcelona, Spain
| | | | | |
Collapse
|
130
|
|
131
|
Watson CT, Steinberg KM, Graves TA, Warren RL, Malig M, Schein J, Wilson RK, Holt RA, Eichler EE, Breden F. Sequencing of the human IG light chain loci from a hydatidiform mole BAC library reveals locus-specific signatures of genetic diversity. Genes Immun 2015; 16:24-34. [PMID: 25338678 PMCID: PMC4304971 DOI: 10.1038/gene.2014.56] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Revised: 09/03/2014] [Accepted: 09/03/2014] [Indexed: 12/24/2022]
Abstract
Germline variation at immunoglobulin (IG) loci is critical for pathogen-mediated immunity, but establishing complete haplotype sequences in these regions has been problematic because of complex sequence architecture and diploid source DNA. We sequenced BAC clones from the effectively haploid human hydatidiform mole cell line, CHM1htert, across the light chain IG loci, kappa (IGK) and lambda (IGL), creating single haplotype representations of these regions. The IGL haplotype generated here is 1.25 Mb of contiguous sequence, including four novel IGLV alleles, one novel IGLC allele, and an 11.9-kb insertion. The CH17 IGK haplotype consists of two 644 kb proximal and 466 kb distal contigs separated by a large gap of unknown size; these assemblies added 49 kb of unique sequence extending into this gap. Our analysis also resulted in the characterization of seven novel IGKV alleles and a 16.7-kb region exhibiting signatures of interlocus sequence exchange between distal and proximal IGKV gene clusters. Genetic diversity in IGK/IGL was compared with that of the IG heavy chain (IGH) locus within the same haploid genome, revealing threefold (IGK) and sixfold (IGL) higher diversity in the IGH locus, potentially associated with increased levels of segmental duplication and the telomeric location of IGH.
Collapse
Affiliation(s)
- C T Watson
- Department of Biological Sciences, Simon Fraser University, Burnaby, British Columbia, Canada
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY USA
| | - K M Steinberg
- Department of Genome Sciences, University of Washington, Seattle, WA USA
- The Genome Institute, Washington University, St Louis, MO USA
| | - T A Graves
- The Genome Institute, Washington University, St Louis, MO USA
| | - R L Warren
- Genome Sciences Centre, BC Cancer Agency, Vancouver, British Columbia Canada
| | - M Malig
- Department of Genome Sciences, University of Washington, Seattle, WA USA
| | - J Schein
- Genome Sciences Centre, BC Cancer Agency, Vancouver, British Columbia Canada
| | - R K Wilson
- The Genome Institute, Washington University, St Louis, MO USA
| | - R A Holt
- Genome Sciences Centre, BC Cancer Agency, Vancouver, British Columbia Canada
| | - E E Eichler
- Department of Genome Sciences, University of Washington, Seattle, WA USA
- Howard Hughes Medical Institute, Seattle, WA USA
| | - F Breden
- Department of Biological Sciences, Simon Fraser University, Burnaby, British Columbia, Canada
| |
Collapse
|
132
|
Chen L, Zhou W, Zhang L, Zhang F. Genome architecture and its roles in human copy number variation. Genomics Inform 2014; 12:136-44. [PMID: 25705150 PMCID: PMC4330246 DOI: 10.5808/gi.2014.12.4.136] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Revised: 11/12/2014] [Accepted: 11/12/2014] [Indexed: 02/06/2023] Open
Abstract
Besides single-nucleotide variants in the human genome, large-scale genomic variants, such as copy number variations (CNVs), are being increasingly discovered as a genetic source of human diversity and the pathogenic factors of diseases. Recent experimental findings have shed light on the links between different genome architectures and CNV mutagenesis. In this review, we summarize various genomic features and discuss their contributions to CNV formation. Genomic repeats, including both low-copy and high-copy repeats, play important roles in CNV instability, which was initially known as DNA recombination events. Furthermore, it has been found that human genomic repeats can also induce DNA replication errors and consequently result in CNV mutations. Some recent studies showed that DNA replication timing, which reflects the high-order information of genomic organization, is involved in human CNV mutations. Our review highlights that genome architecture, from DNA sequence to high-order genomic organization, is an important molecular factor in CNV mutagenesis and human genomic instability.
Collapse
Affiliation(s)
- Lu Chen
- School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Weichen Zhou
- School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Ling Zhang
- School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Feng Zhang
- School of Life Sciences, Fudan University, Shanghai 200438, China. ; Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai 200438, China
| |
Collapse
|
133
|
Jaratlerdsiri W, Deakin J, Godinez RM, Shan X, Peterson DG, Marthey S, Lyons E, McCarthy FM, Isberg SR, Higgins DP, Chong AY, John JS, Glenn TC, Ray DA, Gongora J. Comparative genome analyses reveal distinct structure in the saltwater crocodile MHC. PLoS One 2014; 9:e114631. [PMID: 25503521 PMCID: PMC4263668 DOI: 10.1371/journal.pone.0114631] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 11/11/2014] [Indexed: 12/22/2022] Open
Abstract
The major histocompatibility complex (MHC) is a dynamic genome region with an essential role in the adaptive immunity of vertebrates, especially antigen presentation. The MHC is generally divided into subregions (classes I, II and III) containing genes of similar function across species, but with different gene number and organisation. Crocodylia (crocodilians) are widely distributed and represent an evolutionary distinct group among higher vertebrates, but the genomic organisation of MHC within this lineage has been largely unexplored. Here, we studied the MHC region of the saltwater crocodile (Crocodylus porosus) and compared it with that of other taxa. We characterised genomic clusters encompassing MHC class I and class II genes in the saltwater crocodile based on sequencing of bacterial artificial chromosomes. Six gene clusters spanning ∼452 kb were identified to contain nine MHC class I genes, six MHC class II genes, three TAP genes, and a TRIM gene. These MHC class I and class II genes were in separate scaffold regions and were greater in length (2-6 times longer) than their counterparts in well-studied fowl B loci, suggesting that the compaction of avian MHC occurred after the crocodilian-avian split. Comparative analyses between the saltwater crocodile MHC and that from the alligator and gharial showed large syntenic areas (>80% identity) with similar gene order. Comparisons with other vertebrates showed that the saltwater crocodile had MHC class I genes located along with TAP, consistent with birds studied. Linkage between MHC class I and TRIM39 observed in the saltwater crocodile resembled MHC in eutherians compared, but absent in avian MHC, suggesting that the saltwater crocodile MHC appears to have gene organisation intermediate between these two lineages. These observations suggest that the structure of the saltwater crocodile MHC, and other crocodilians, can help determine the MHC that was present in the ancestors of archosaurs.
Collapse
Affiliation(s)
- Weerachai Jaratlerdsiri
- Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
| | - Janine Deakin
- Evolution Ecology and Genetics, Research School of Biology, Australian National University, Canberra, Australian Capital Territory 2601, Australia
- Institute for Applied Ecology, University of Canberra, Canberra, Australian Capital Territory 2601, Australia
| | - Ricardo M. Godinez
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, United States of America
- Department of Genetics, Harvard Medical School, 77 Louis Pasteur Ave., Boston, Massachusetts 02115, United States of America
| | - Xueyan Shan
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, Mississippi 39762, United States of America
| | - Daniel G. Peterson
- Institute for Genomics, Biocomputing and Biotechnology (IGBB), Mississippi State University, Mississippi State, Mississippi 39762, United States of America
| | - Sylvain Marthey
- Animal Genetics and Integrative Biology, INRA, UMR 1313 Jouy-en-Josas 78352, France
| | - Eric Lyons
- School of Plant Science, University of Arizona, Tucson, Arizona 85721, United States of America
| | - Fiona M. McCarthy
- School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, Arizona 85721, United States of America
| | - Sally R. Isberg
- Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
- Center for Crocodile Research, P.O. Box 329, Noonamah, Northern Territory 0837, Australia
| | - Damien P. Higgins
- Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
| | - Amanda Y. Chong
- Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
| | - John St John
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California 95064, United States of America
| | - Travis C. Glenn
- Department of Environmental Health Science, University of Georgia, Athens, Georgia 30602, United States of America
| | - David A. Ray
- Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, Mississippi 39762, United States of America
- Institute for Genomics, Biocomputing and Biotechnology (IGBB), Mississippi State University, Mississippi State, Mississippi 39762, United States of America
| | - Jaime Gongora
- Faculty of Veterinary Science, University of Sydney, Sydney, New South Wales 2006, Australia
| |
Collapse
|
134
|
Global patterns of apparent copy number variation in birds revealed by cross-species comparative genomic hybridization. Chromosome Res 2014; 22:59-70. [PMID: 24570127 DOI: 10.1007/s10577-014-9405-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
There is a growing interest in copy number variation (CNV) and the recognition of its importance in phenotype, disease, adaptation and speciation. CNV data is usually ascertained by array-CGH within-species, but similar inter-species comparisons have also been made in primates, mice and domestic mammals. Here, we conducted a broad appraisal of putative cross-species CNVs in birds, 16 species in all, using the standard array-CGH approach. Using a chicken oligonucleotide microarray, we detected 790 apparent CNVs within 135 unique regions and developed a bioinformatic tool 'CNV Analyser' for analysing and visualising cross-species data sets. We successfully addressed four hypotheses as follows: (a) Cross-species CNVs (compared to chicken) are, as suggested from preliminary evidence, smaller and fewer in number than in mammals; this 'dogma' was rejected in the light of the new evidence. (b) CNVs in birds are likely to have a functional effect through an association with genes; a large proportion of detected regions (70 %) were indeed associated with genes (suggesting functional significance), however, not necessarily more so than in mammals. (c) There are more CNVs in birds with more rearranged karyotypes; this hypothesis was rejected. Indeed, Falco species contained fewer than most with relatively standard (chicken-like) karyotypes. (d) There are more CNVs per megabase on micro-chromosomes than macrochromosomes; this hypothesis was accepted. Indeed, in species with rearranged karyotypes characterised by chromosomal fusions, the fused former microchromosomes still 'behaved' as though they were their microchromosomal ancestors. Gene ontology analysis of CNVRs revealed enrichment in immune response and antigen presentation genes and five CNVRs were perfectly correlated with the unique loss of sexual dichromatism in one Galliformes species.
Collapse
|
135
|
Altenhoff AM, Škunca N, Glover N, Train CM, Sueki A, Piližota I, Gori K, Tomiczek B, Müller S, Redestig H, Gonnet GH, Dessimoz C. The OMA orthology database in 2015: function predictions, better plant support, synteny view and other improvements. Nucleic Acids Res 2014; 43:D240-9. [PMID: 25399418 PMCID: PMC4383958 DOI: 10.1093/nar/gku1158] [Citation(s) in RCA: 155] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The Orthologous Matrix (OMA) project is a method and associated database inferring evolutionary relationships amongst currently 1706 complete proteomes (i.e. the protein sequence associated for every protein-coding gene in all genomes). In this update article, we present six major new developments in OMA: (i) a new web interface; (ii) Gene Ontology function predictions as part of the OMA pipeline; (iii) better support for plant genomes and in particular homeologs in the wheat genome; (iv) a new synteny viewer providing the genomic context of orthologs; (v) statically computed hierarchical orthologous groups subsets downloadable in OrthoXML format; and (vi) possibility to export parts of the all-against-all computations and to combine them with custom data for 'client-side' orthology prediction. OMA can be accessed through the OMA Browser and various programmatic interfaces at http://omabrowser.org.
Collapse
Affiliation(s)
- Adrian M Altenhoff
- University College London, Gower Street, London WC1E 6BT, UK Swiss Institute of Bioinformatics, Universitätstr. 6, 8092 Zurich, Switzerland ETH Zurich, Computer Science, Universitätstr. 6, 8092 Zurich, Switzerland
| | - Nives Škunca
- University College London, Gower Street, London WC1E 6BT, UK Swiss Institute of Bioinformatics, Universitätstr. 6, 8092 Zurich, Switzerland ETH Zurich, Computer Science, Universitätstr. 6, 8092 Zurich, Switzerland
| | - Natasha Glover
- University College London, Gower Street, London WC1E 6BT, UK Institut National de la Recherche Agronomique (INRA) UMR1095, Genetics, Diversity and Ecophysiology of Cereals, 5 Chemin de Beaulieu, 63039 Clermont-Ferrand, France Bayer CropScience NV, Technologiepark 38, 9052 Gent, Belgium
| | | | - Anna Sueki
- University College London, Gower Street, London WC1E 6BT, UK
| | - Ivana Piližota
- University College London, Gower Street, London WC1E 6BT, UK
| | - Kevin Gori
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Steven Müller
- University College London, Gower Street, London WC1E 6BT, UK
| | | | - Gaston H Gonnet
- Swiss Institute of Bioinformatics, Universitätstr. 6, 8092 Zurich, Switzerland ETH Zurich, Computer Science, Universitätstr. 6, 8092 Zurich, Switzerland
| | - Christophe Dessimoz
- University College London, Gower Street, London WC1E 6BT, UK European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
136
|
Massip F, Sheinman M, Schbath S, Arndt PF. How evolution of genomes is reflected in exact DNA sequence match statistics. Mol Biol Evol 2014; 32:524-35. [PMID: 25398628 PMCID: PMC4298173 DOI: 10.1093/molbev/msu313] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Genome evolution is shaped by a multitude of mutational processes, including point mutations, insertions, and deletions of DNA sequences, as well as segmental duplications. These mutational processes can leave distinctive qualitative marks in the statistical features of genomic DNA sequences. One such feature is the match length distribution (MLD) of exactly matching sequence segments within an individual genome or between the genomes of related species. These have been observed to exhibit characteristic power law decays in many species. Here, we show that simple dynamical models consisting solely of duplication and mutation processes can already explain the characteristic features of MLDs observed in genomic sequences. Surprisingly, we find that these features are largely insensitive to details of the underlying mutational processes and do not necessarily rely on the action of natural selection. Our results demonstrate how analyzing statistical features of DNA sequences can help us reveal and quantify the different mutational processes that underlie genome evolution.
Collapse
Affiliation(s)
- Florian Massip
- Department for Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany UR1077, Unite Mathematiques Informatique et Genome, INRA, domaine de Vilvert, Jouy-en-Josas, France
| | - Michael Sheinman
- Department for Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
| | - Sophie Schbath
- UR1077, Unite Mathematiques Informatique et Genome, INRA, domaine de Vilvert, Jouy-en-Josas, France
| | - Peter F Arndt
- Department for Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
| |
Collapse
|
137
|
Dharia AP, Obla A, Gajdosik MD, Simon A, Nelson CE. Tempo and mode of gene duplication in mammalian ribosomal protein evolution. PLoS One 2014; 9:e111721. [PMID: 25369106 PMCID: PMC4219774 DOI: 10.1371/journal.pone.0111721] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2014] [Accepted: 10/06/2014] [Indexed: 12/17/2022] Open
Abstract
Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism.
Collapse
Affiliation(s)
- Asav P. Dharia
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
| | - Ajay Obla
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
| | - Matthew D. Gajdosik
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
| | - Amanda Simon
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
| | - Craig E. Nelson
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
- * E-mail:
| |
Collapse
|
138
|
Soh YQS, Alföldi J, Pyntikova T, Brown LG, Graves T, Minx PJ, Fulton RS, Kremitzki C, Koutseva N, Mueller JL, Rozen S, Hughes JF, Owens E, Womack JE, Murphy WJ, Cao Q, de Jong P, Warren WC, Wilson RK, Skaletsky H, Page DC. Sequencing the mouse Y chromosome reveals convergent gene acquisition and amplification on both sex chromosomes. Cell 2014; 159:800-13. [PMID: 25417157 DOI: 10.1016/j.cell.2014.09.052] [Citation(s) in RCA: 231] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Revised: 09/04/2014] [Accepted: 09/22/2014] [Indexed: 01/27/2023]
Abstract
We sequenced the MSY (male-specific region of the Y chromosome) of the C57BL/6J strain of the laboratory mouse Mus musculus. In contrast to theories that Y chromosomes are heterochromatic and gene poor, the mouse MSY is 99.9% euchromatic and contains about 700 protein-coding genes. Only 2% of the MSY derives from the ancestral autosomes that gave rise to the mammalian sex chromosomes. Instead, all but 45 of the MSY's genes belong to three acquired, massively amplified gene families that have no homologs on primate MSYs but do have acquired, amplified homologs on the mouse X chromosome. The complete mouse MSY sequence brings to light dramatic forces in sex chromosome evolution: lineage-specific convergent acquisition and amplification of X-Y gene families, possibly fueled by antagonism between acquired X-Y homologs. The mouse MSY sequence presents opportunities for experimental studies of a sex-specific chromosome in its entirety, in a genetically tractable model organism.
Collapse
Affiliation(s)
- Y Q Shirleen Soh
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Jessica Alföldi
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | | | - Laura G Brown
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142, USA
| | - Tina Graves
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Patrick J Minx
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Robert S Fulton
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Colin Kremitzki
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Natalia Koutseva
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA
| | - Jacob L Mueller
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA
| | - Steve Rozen
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA
| | | | - Elaine Owens
- College of Veterinary Medicine and Biomedical Sciences, 4458 Texas A&M University, College Station, TX 77843, USA
| | - James E Womack
- College of Veterinary Medicine and Biomedical Sciences, 4458 Texas A&M University, College Station, TX 77843, USA
| | - William J Murphy
- College of Veterinary Medicine and Biomedical Sciences, 4458 Texas A&M University, College Station, TX 77843, USA
| | - Qing Cao
- BACPAC Resources, Children's Hospital Oakland, 747 52nd Street, Oakland, CA 94609, USA
| | - Pieter de Jong
- BACPAC Resources, Children's Hospital Oakland, 747 52nd Street, Oakland, CA 94609, USA
| | - Wesley C Warren
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Richard K Wilson
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Helen Skaletsky
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142, USA
| | - David C Page
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142, USA.
| |
Collapse
|
139
|
Ottolini B, Hornsby MJ, Abujaber R, MacArthur JAL, Badge RM, Schwarzacher T, Albertson DG, Bevins CL, Solnick JV, Hollox EJ. Evidence of convergent evolution in humans and macaques supports an adaptive role for copy number variation of the β-defensin-2 gene. Genome Biol Evol 2014; 6:3025-38. [PMID: 25349268 PMCID: PMC4255768 DOI: 10.1093/gbe/evu236] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
β-defensins are a family of important peptides of innate immunity, involved in host defense, immunomodulation, reproduction, and pigmentation. Genes encoding β-defensins show evidence of birth-and-death evolution, adaptation by amino acid sequence changes, and extensive copy number variation (CNV) within humans and other species. The role of CNV in the adaptation of β-defensins to new functions remains unclear, as does the adaptive role of CNV in general. Here, we fine-map CNV of a cluster of β-defensins in humans and rhesus macaques. Remarkably, we found that the structure of the CNV is different between primates, with distinct mutational origins and CNV boundaries defined by retroviral long terminal repeat elements. Although the human β-defensin CNV region is 322 kb and encompasses several genes, including β-defensins, a long noncoding RNA gene, and testes-specific zinc-finger transcription factors, the orthologous region in the rhesus macaque shows CNV of a 20-kb region, containing only a single gene, the ortholog of the human β-defensin-2 gene. Despite its independent origins, the range of gene copy numbers in the rhesus macaque is similar to humans. In addition, the rhesus macaque gene has been subject to divergent positive selection at the amino acid level following its initial duplication event between 3 and 9.5 Ma, suggesting adaptation of this gene as the macaque successfully colonized novel environments outside Africa. Therefore, the molecular phenotype of β-defensin-2 CNV has undergone convergent evolution, and this gene shows evidence of adaptation at the amino acid level in rhesus macaques.
Collapse
Affiliation(s)
| | - Michael J Hornsby
- Department of Microbiology and Immunology, University of California Davis School of Medicine
| | - Razan Abujaber
- Department of Genetics, University of Leicester, United Kingdom
| | - Jacqueline A L MacArthur
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco Present address: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Richard M Badge
- Department of Genetics, University of Leicester, United Kingdom
| | | | - Donna G Albertson
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco Present address: Bluestone Center for Clinical Research, New York University College of Dentistry, New York, New York
| | - Charles L Bevins
- Department of Microbiology and Immunology, University of California Davis School of Medicine
| | - Jay V Solnick
- Department of Microbiology and Immunology, University of California Davis School of Medicine Department of Medicine, Center for Comparative Medicine, and the California National Primate Research Center, University of California
| | - Edward J Hollox
- Department of Genetics, University of Leicester, United Kingdom
| |
Collapse
|
140
|
Parrott AM, Mathews MB. The evolution and consequences of snaR family transposition in primates. Mob Genet Elements 2014; 1:291-295. [PMID: 22545241 PMCID: PMC3337139 DOI: 10.4161/mge.18478] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
The small NF90 associated RNA (snaR) family of small noncoding RNAs (ncRNA) appears to have evolved from retrotransposon ancestors at or soon after pivotal stages in primate evolution. snaRs are thought to be derived from a FLAM C-like (free left Alu monomer) element through multiple short insertion/deletion (indel) and nucleotide (nt) substitution events. Tracing snaR’s complex evolutionary history through primate genomes led to the recent discovery of two novel retrotransposons: the Alu/snaR related (ASR) and catarrhine ancestor of snaR (CAS) elements. ASR elements are present in the genomes of Simiiformes, CAS elements are present in Old World Monkeys and apes, and snaRs are restricted to the African Great Apes (Homininae, including human, gorilla, chimpanzee and bonobo). Unlike their ancestors, snaRs have disseminated by multiple rounds of segmental duplication of a larger encompassing element. This process has produced large tandem gene arrays in humans and possibly precipitated the accelerated evolution of snaR. Furthermore, snaR segmental duplication created a new form of chorionic gonadotropin β subunit (CGβ) gene, recently classified as Type II CGβ, which has altered mRNA tissue expression and can generate a novel short peptide.
Collapse
Affiliation(s)
- Andrew M Parrott
- Department of Biochemistry and Molecular Biology; New Jersey Medical School; University of Medicine and Dentistry of New Jersey; Newark, NJ USA
| | | |
Collapse
|
141
|
Zagaria A, Anelli L, Coccaro N, Tota G, Casieri P, Cellamare A, Minervini A, Minervini CF, Brunetti C, Cumbo C, Specchia G, Albano F. 5'RUNX1-3'USP42 chimeric gene in acute myeloid leukemia can occur through an insertion mechanism rather than translocation and may be mediated by genomic segmental duplications. Mol Cytogenet 2014; 7:66. [PMID: 25298786 PMCID: PMC4189616 DOI: 10.1186/s13039-014-0066-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2014] [Accepted: 09/17/2014] [Indexed: 12/17/2022] Open
Abstract
Background The runt-related transcription factor 1 (RUNX1) gene is a transcription factor that acts as a master regulator of hematopoiesis and represents one of the most frequent targets of chromosomal rearrangements in human leukemias. The t(7;21)(p22;q22) rearrangement generating a 5′RUNX1-3′USP42 fusion transcript has been reported in two cases of pediatric acute myeloid leukemia (AML) and further in eight adult cases of myeloid neoplasms. We describe the first case of adult AML with a 5′RUNX1-3′USP42 fusion gene generated by an insertion event instead of chromosomal translocation. Methods Conventional and molecular cytogenetic analyses allowed the precise characterization of the chromosomal rearrangement and breakpoints identification. Gene expression analysis was performed by quantitative real-time PCR experiments, whereas bioinformatic studies were carried out for revealing structural genomic characteristics of breakpoint regions. Results We identified an adult AML case bearing a ins(21;7)(q22;p15p22) generating a 5′RUNX1-3′USP42 fusion gene on der(21) chromosome and causing USP42 gene over-expression. Bioinformatic analysis of the genomic regions involved in ins(21;7)/t(7;21) showed the presence of interchromosomal segmental duplications (SDs) next to the USP42 and RUNX1 genes, that may underlie a non-allelic homologous recombination between chromosome 7 and 21 in AML. Conclusions We report the first case of a 5′RUNX1-3′USP42 chimeric gene generated by a chromosomal cryptic insertion in an adult AML patient. Our data revealed that there may be a pivotal role for SDs in this very rare but recurrent chromosomal rearrangement.
Collapse
Affiliation(s)
- Antonella Zagaria
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Luisa Anelli
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Nicoletta Coccaro
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Giuseppina Tota
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Paola Casieri
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Angelo Cellamare
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Angela Minervini
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Crescenzio Francesco Minervini
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Claudia Brunetti
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Cosimo Cumbo
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Giorgina Specchia
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| | - Francesco Albano
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology Section - University of Bari, P.zza G. Cesare, 11 70124 Bari, Italy
| |
Collapse
|
142
|
Carbone L, Harris RA, Gnerre S, Veeramah KR, Lorente-Galdos B, Huddleston J, Meyer TJ, Herrero J, Roos C, Aken B, Anaclerio F, Archidiacono N, Baker C, Barrell D, Batzer MA, Beal K, Blancher A, Bohrson CL, Brameier M, Campbell MS, Capozzi O, Casola C, Chiatante G, Cree A, Damert A, de Jong PJ, Dumas L, Fernandez-Callejo M, Flicek P, Fuchs NV, Gut I, Gut M, Hahn MW, Hernandez-Rodriguez J, Hillier LW, Hubley R, Ianc B, Izsvák Z, Jablonski NG, Johnstone LM, Karimpour-Fard A, Konkel MK, Kostka D, Lazar NH, Lee SL, Lewis LR, Liu Y, Locke DP, Mallick S, Mendez FL, Muffato M, Nazareth LV, Nevonen KA, O'Bleness M, Ochis C, Odom DT, Pollard KS, Quilez J, Reich D, Rocchi M, Schumann GG, Searle S, Sikela JM, Skollar G, Smit A, Sonmez K, ten Hallers B, Terhune E, Thomas GWC, Ullmer B, Ventura M, Walker JA, Wall JD, Walter L, Ward MC, Wheelan SJ, Whelan CW, White S, Wilhelm LJ, Woerner AE, Yandell M, Zhu B, Hammer MF, Marques-Bonet T, Eichler EE, Fulton L, Fronick C, Muzny DM, Warren WC, Worley KC, Rogers J, Wilson RK, Gibbs RA. Gibbon genome and the fast karyotype evolution of small apes. Nature 2014; 513:195-201. [PMID: 25209798 PMCID: PMC4249732 DOI: 10.1038/nature13679] [Citation(s) in RCA: 221] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2014] [Accepted: 07/14/2014] [Indexed: 12/22/2022]
Abstract
Gibbons are small arboreal apes that display an accelerated rate of evolutionary chromosomal rearrangement and occupy a key node in the primate phylogeny between Old World monkeys and great apes. Here we present the assembly and analysis of a northern white-cheeked gibbon (Nomascus leucogenys) genome. We describe the propensity for a gibbon-specific retrotransposon (LAVA) to insert into chromosome segregation genes and alter transcription by providing a premature termination site, suggesting a possible molecular mechanism for the genome plasticity of the gibbon lineage. We further show that the gibbon genera (Nomascus, Hylobates, Hoolock and Symphalangus) experienced a near-instantaneous radiation ∼5 million years ago, coincident with major geographical changes in southeast Asia that caused cycles of habitat compression and expansion. Finally, we identify signatures of positive selection in genes important for forelimb development (TBX5) and connective tissues (COL1A1) that may have been involved in the adaptation of gibbons to their arboreal habitat. The genome of the gibbon, a tree-dwelling ape from Asia positioned between Old World monkeys and the great apes, is presented, providing insights into the evolutionary history of gibbon species and their accelerated karyotypes, as well as evidence for selection of genes such as those for forelimb development and connective tissue that may be important for locomotion through trees. The many species of gibbons are small, tree-living apes from Southeast Asia, most of them listed as 'endangered' or 'critically endangered' on the IUCN list. In their presentation of the genome of the northern white-cheeked gibbon (Nomascus leucogenys) , Lucia Carbone and colleagues provide intriguing insights into the biology and evolutionary history of a group that straddles the divide between Old World monkeys and the great apes. The authors investigate how a novel gibbon-specific retrotransposon might be the source of gibbons' genome plasticity. Rapid karyotype evolution combined with multiple episodes of climate and environmental change might explain the almost instantaneous divergence of the four gibbon genera. Positive selection on genes involved in forelimb development and connective tissue might have been related to gibbons' unique mode of locomotion in the tropical canopy.
Collapse
Affiliation(s)
- Lucia Carbone
- 1] Oregon Health &Science University, Department of Behavioral Neuroscience, 3181 SW Sam Jackson Park Road Portland, Oregon 97239, USA. [2] Oregon National Primate Research Center, Division of Neuroscience, 505 NW 185th Avenue, Beaverton, Oregon 97006, USA. [3] Oregon Health &Science University, Department of Molecular &Medical Genetics, 3181 SW Sam Jackson Park Road, Portland, Oregon 97239, USA. [4] Oregon Health &Science University, Bioinformatics and Computational Biology Division, Department of Medical Informatics &Clinical Epidemiology, 3181 SW Sam Jackson Park Road, Portland, Oregon 97239, USA
| | - R Alan Harris
- Baylor College of Medicine, Department of Molecular and Human Genetics, One Baylor Plaza, Houston, Texas 77030, USA
| | - Sante Gnerre
- Nabsys, 60 Clifford Street, Providence, Rhode Island 02903, USA
| | - Krishna R Veeramah
- 1] University of Arizona, ARL Division of Biotechnology, Tucson, Arizona 85721, USA. [2] Stony Brook University, Department of Ecology and Evolution, Stony Brook, New York 11790, USA
| | - Belen Lorente-Galdos
- IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, 08003 Barcelona, Spain
| | - John Huddleston
- 1] Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA. [2] Howard Hughes Medical Institute, 1705 NE Pacific Street, Seattle, Washington 98195, USA
| | - Thomas J Meyer
- Oregon Health &Science University, Department of Behavioral Neuroscience, 3181 SW Sam Jackson Park Road Portland, Oregon 97239, USA
| | - Javier Herrero
- 1] European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK. [2] The Genome Analysis Centre, Norwich Research Park, Norwich NR4 7UH, UK. [3] Bill Lyons Informatics Center, UCL Cancer Institute, University College London, London WC1E 6DD, UK (J.He); Seven Bridges Genomics, Cambridge, Massachusetts 02138, USA (D.P.L.); Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA (F.L.M.); BioNano Genomics, San Diego, California 92121, USA (B.t.H.); University of Chicago, Department of Human Genetics, Chicago, Illinois 60637, USA (M.C.W.); Stanley Center for Psychiatric Research, Broad Institute, Cambridge, Massachusetts 02138, USA (C.W.W.); The CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China (B.Z.)
| | - Christian Roos
- Leibniz Institute for Primate Research, Gene Bank of Primates, German Primate Center, Göttingen 37077, Germany
| | - Bronwen Aken
- 1] European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK. [2] European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fabio Anaclerio
- University of Bari, Department of Biology, Via Orabona 4, 70125, Bari, Italy
| | | | - Carl Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Daniel Barrell
- 1] European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK. [2] European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mark A Batzer
- Louisiana State University, Department of Biological Sciences, Baton Rouge, Louisiana 70803, USA
| | - Kathryn Beal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Craig L Bohrson
- The Johns Hopkins University School of Medicine, Department of Oncology, Division of Biostatistics and Bioinformatics, Baltimore, Maryland 21205, USA
| | - Markus Brameier
- Leibniz Institute for Primate Research, Gene Bank of Primates, German Primate Center, Göttingen 37077, Germany
| | | | - Oronzo Capozzi
- University of Bari, Department of Biology, Via Orabona 4, 70125, Bari, Italy
| | - Claudio Casola
- Texas A&M University, Department of Ecosystem Science and Management, College Station, Texas 77843, USA
| | - Giorgia Chiatante
- University of Bari, Department of Biology, Via Orabona 4, 70125, Bari, Italy
| | - Andrew Cree
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Annette Damert
- Babes-Bolyai-University, Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Cluj-Napoca 400084, Romania
| | - Pieter J de Jong
- Children's Hospital Oakland Research Institute, BACPAC Resources, Oakland, California 94609, USA
| | - Laura Dumas
- University of Colorado School of Medicine, Department of Biochemistry and Molecular Genetics, Aurora, Colorado 80045, USA
| | - Marcos Fernandez-Callejo
- IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, 08003 Barcelona, Spain
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nina V Fuchs
- Max Delbrück Center for Molecular Medicine, Berlin 13125, Germany
| | - Ivo Gut
- Centro Nacional de Análisis Genómico (CNAG), Parc Científic de Barcelona, Barcelona 08028, Spain
| | - Marta Gut
- Centro Nacional de Análisis Genómico (CNAG), Parc Científic de Barcelona, Barcelona 08028, Spain
| | - Matthew W Hahn
- Indiana University, School of Informatics and Computing, Bloomington, Indiana 47408, USA
| | - Jessica Hernandez-Rodriguez
- IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, 08003 Barcelona, Spain
| | - LaDeana W Hillier
- The Genome Center at Washington University, Washington University School of Medicine, 4444 Forest Park Avenue, Saint Louis, Missouri 63108, USA
| | - Robert Hubley
- Institute for Systems Biology, Seattle, Washington 98109-5234, USA
| | - Bianca Ianc
- Babes-Bolyai-University, Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Cluj-Napoca 400084, Romania
| | - Zsuzsanna Izsvák
- Max Delbrück Center for Molecular Medicine, Berlin 13125, Germany
| | - Nina G Jablonski
- The Pennsylvania State University, Department of Anthropology, University Park, Pennsylvania 16802, USA
| | - Laurel M Johnstone
- University of Arizona, ARL Division of Biotechnology, Tucson, Arizona 85721, USA
| | - Anis Karimpour-Fard
- University of Colorado School of Medicine, Department of Biochemistry and Molecular Genetics, Aurora, Colorado 80045, USA
| | - Miriam K Konkel
- Louisiana State University, Department of Biological Sciences, Baton Rouge, Louisiana 70803, USA
| | - Dennis Kostka
- University of Pittsburgh School of Medicine, Department of Developmental Biology, Department of Computational and Systems Biology, Pittsburg, Pennsylvania 15261, USA
| | - Nathan H Lazar
- Oregon Health &Science University, Bioinformatics and Computational Biology Division, Department of Medical Informatics &Clinical Epidemiology, 3181 SW Sam Jackson Park Road, Portland, Oregon 97239, USA
| | - Sandra L Lee
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Lora R Lewis
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Yue Liu
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Devin P Locke
- 1] The Genome Center at Washington University, Washington University School of Medicine, 4444 Forest Park Avenue, Saint Louis, Missouri 63108, USA. [2] Bill Lyons Informatics Center, UCL Cancer Institute, University College London, London WC1E 6DD, UK (J.He); Seven Bridges Genomics, Cambridge, Massachusetts 02138, USA (D.P.L.); Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA (F.L.M.); BioNano Genomics, San Diego, California 92121, USA (B.t.H.); University of Chicago, Department of Human Genetics, Chicago, Illinois 60637, USA (M.C.W.); Stanley Center for Psychiatric Research, Broad Institute, Cambridge, Massachusetts 02138, USA (C.W.W.); The CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China (B.Z.)
| | - Swapan Mallick
- Harvard Medical School, Department of Genetics, Boston, Massachusetts 02115, USA
| | - Fernando L Mendez
- 1] University of Arizona, ARL Division of Biotechnology, Tucson, Arizona 85721, USA. [2] Bill Lyons Informatics Center, UCL Cancer Institute, University College London, London WC1E 6DD, UK (J.He); Seven Bridges Genomics, Cambridge, Massachusetts 02138, USA (D.P.L.); Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA (F.L.M.); BioNano Genomics, San Diego, California 92121, USA (B.t.H.); University of Chicago, Department of Human Genetics, Chicago, Illinois 60637, USA (M.C.W.); Stanley Center for Psychiatric Research, Broad Institute, Cambridge, Massachusetts 02138, USA (C.W.W.); The CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China (B.Z.)
| | - Matthieu Muffato
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Lynne V Nazareth
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Kimberly A Nevonen
- Oregon National Primate Research Center, Division of Neuroscience, 505 NW 185th Avenue, Beaverton, Oregon 97006, USA
| | - Majesta O'Bleness
- University of Colorado School of Medicine, Department of Biochemistry and Molecular Genetics, Aurora, Colorado 80045, USA
| | - Cornelia Ochis
- Babes-Bolyai-University, Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Cluj-Napoca 400084, Romania
| | - Duncan T Odom
- 1] European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK. [2] University of Cambridge, Cancer Research UK-Cambridge Institute, Cambridge CB2 0RE, UK
| | - Katherine S Pollard
- 1] University of California, Gladstone Institutes, San Francisco, California 94158-226, USA. [2] Institute for Human Genetics, University of California, San Francisco, California 94143-0794, USA. [3] Division of Biostatistics, University of California, San Francisco, California 94143-0794, USA
| | - Javier Quilez
- IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, 08003 Barcelona, Spain
| | - David Reich
- Harvard Medical School, Department of Genetics, Boston, Massachusetts 02115, USA
| | - Mariano Rocchi
- University of Bari, Department of Biology, Via Orabona 4, 70125, Bari, Italy
| | - Gerald G Schumann
- Paul Ehrlich Institute, Division of Medical Biotechnology, 63225 Langen, Germany
| | - Stephen Searle
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - James M Sikela
- University of Colorado School of Medicine, Department of Biochemistry and Molecular Genetics, Aurora, Colorado 80045, USA
| | - Gabriella Skollar
- Gibbon Conservation Center, 19100 Esguerra Rd, Santa Clarita, California 91350, USA
| | - Arian Smit
- The Genome Center at Washington University, Washington University School of Medicine, 4444 Forest Park Avenue, Saint Louis, Missouri 63108, USA
| | - Kemal Sonmez
- 1] Oregon Health &Science University, Bioinformatics and Computational Biology Division, Department of Medical Informatics &Clinical Epidemiology, 3181 SW Sam Jackson Park Road, Portland, Oregon 97239, USA. [2] Oregon Health &Science University, Center for Spoken Language Understanding, Institute on Development and Disability, Portland, Oregon 97239, USA
| | - Boudewijn ten Hallers
- 1] Children's Hospital Oakland Research Institute, BACPAC Resources, Oakland, California 94609, USA. [2] Bill Lyons Informatics Center, UCL Cancer Institute, University College London, London WC1E 6DD, UK (J.He); Seven Bridges Genomics, Cambridge, Massachusetts 02138, USA (D.P.L.); Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA (F.L.M.); BioNano Genomics, San Diego, California 92121, USA (B.t.H.); University of Chicago, Department of Human Genetics, Chicago, Illinois 60637, USA (M.C.W.); Stanley Center for Psychiatric Research, Broad Institute, Cambridge, Massachusetts 02138, USA (C.W.W.); The CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China (B.Z.)
| | - Elizabeth Terhune
- Oregon National Primate Research Center, Division of Neuroscience, 505 NW 185th Avenue, Beaverton, Oregon 97006, USA
| | - Gregg W C Thomas
- Indiana University, School of Informatics and Computing, Bloomington, Indiana 47408, USA
| | - Brygg Ullmer
- Louisiana State University, School of Electrical Engineering and Computer Science, Baton Rouge, Louisiana 70803, USA
| | - Mario Ventura
- University of Bari, Department of Biology, Via Orabona 4, 70125, Bari, Italy
| | - Jerilyn A Walker
- Louisiana State University, Department of Biological Sciences, Baton Rouge, Louisiana 70803, USA
| | - Jeffrey D Wall
- 1] Institute for Human Genetics, University of California, San Francisco, California 94143-0794, USA. [2] Division of Biostatistics, University of California, San Francisco, California 94143-0794, USA
| | - Lutz Walter
- Leibniz Institute for Primate Research, Gene Bank of Primates, German Primate Center, Göttingen 37077, Germany
| | - Michelle C Ward
- 1] University of Cambridge, Cancer Research UK-Cambridge Institute, Cambridge CB2 0RE, UK. [2] Bill Lyons Informatics Center, UCL Cancer Institute, University College London, London WC1E 6DD, UK (J.He); Seven Bridges Genomics, Cambridge, Massachusetts 02138, USA (D.P.L.); Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA (F.L.M.); BioNano Genomics, San Diego, California 92121, USA (B.t.H.); University of Chicago, Department of Human Genetics, Chicago, Illinois 60637, USA (M.C.W.); Stanley Center for Psychiatric Research, Broad Institute, Cambridge, Massachusetts 02138, USA (C.W.W.); The CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China (B.Z.)
| | - Sarah J Wheelan
- The Johns Hopkins University School of Medicine, Department of Oncology, Division of Biostatistics and Bioinformatics, Baltimore, Maryland 21205, USA
| | - Christopher W Whelan
- 1] Oregon Health &Science University, Center for Spoken Language Understanding, Institute on Development and Disability, Portland, Oregon 97239, USA. [2] Bill Lyons Informatics Center, UCL Cancer Institute, University College London, London WC1E 6DD, UK (J.He); Seven Bridges Genomics, Cambridge, Massachusetts 02138, USA (D.P.L.); Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA (F.L.M.); BioNano Genomics, San Diego, California 92121, USA (B.t.H.); University of Chicago, Department of Human Genetics, Chicago, Illinois 60637, USA (M.C.W.); Stanley Center for Psychiatric Research, Broad Institute, Cambridge, Massachusetts 02138, USA (C.W.W.); The CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China (B.Z.)
| | - Simon White
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Larry J Wilhelm
- Oregon National Primate Research Center, Division of Neuroscience, 505 NW 185th Avenue, Beaverton, Oregon 97006, USA
| | - August E Woerner
- University of Arizona, ARL Division of Biotechnology, Tucson, Arizona 85721, USA
| | - Mark Yandell
- University of Utah, Salt Lake City, Utah 84112, USA
| | - Baoli Zhu
- 1] Children's Hospital Oakland Research Institute, BACPAC Resources, Oakland, California 94609, USA. [2] Bill Lyons Informatics Center, UCL Cancer Institute, University College London, London WC1E 6DD, UK (J.He); Seven Bridges Genomics, Cambridge, Massachusetts 02138, USA (D.P.L.); Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA (F.L.M.); BioNano Genomics, San Diego, California 92121, USA (B.t.H.); University of Chicago, Department of Human Genetics, Chicago, Illinois 60637, USA (M.C.W.); Stanley Center for Psychiatric Research, Broad Institute, Cambridge, Massachusetts 02138, USA (C.W.W.); The CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China (B.Z.)
| | - Michael F Hammer
- University of Arizona, ARL Division of Biotechnology, Tucson, Arizona 85721, USA
| | - Tomas Marques-Bonet
- 1] IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, 08003 Barcelona, Spain. [2] Centro Nacional de Análisis Genómico (CNAG), Parc Científic de Barcelona, Barcelona 08028, Spain
| | - Evan E Eichler
- 1] Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA. [2] Howard Hughes Medical Institute, 1705 NE Pacific Street, Seattle, Washington 98195, USA
| | - Lucinda Fulton
- The Genome Center at Washington University, Washington University School of Medicine, 4444 Forest Park Avenue, Saint Louis, Missouri 63108, USA
| | - Catrina Fronick
- The Genome Center at Washington University, Washington University School of Medicine, 4444 Forest Park Avenue, Saint Louis, Missouri 63108, USA
| | - Donna M Muzny
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Wesley C Warren
- The Genome Center at Washington University, Washington University School of Medicine, 4444 Forest Park Avenue, Saint Louis, Missouri 63108, USA
| | - Kim C Worley
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - Richard K Wilson
- The Genome Center at Washington University, Washington University School of Medicine, 4444 Forest Park Avenue, Saint Louis, Missouri 63108, USA
| | - Richard A Gibbs
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| |
Collapse
|
143
|
Irwin DM. Evolution of the vertebrate goose-type lysozyme gene family. BMC Evol Biol 2014; 14:188. [PMID: 25167808 PMCID: PMC4243810 DOI: 10.1186/s12862-014-0188-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2014] [Accepted: 08/12/2014] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Lysozyme g is an antibacterial enzyme that was first found in the eggs of some birds, but recently has been found in additional species, including non-vertebrates. Some previously characterized lysozyme g sequences are suggested to have altered secretion potential and enzymatic activity, however the distribution of these altered sequences is unknown. Duplicated copies of the lysozyme g gene exist in some species; however, the origins of the duplicates and their roles in altered function are unclear. RESULTS We identified 234 lysozyme g sequences from 118 vertebrate species, including 181 sequences that are full or near full length representing all vertebrate classes except cartilaginous fish. Phylogenetic analysis shows that most lysozyme g gene duplicates are recent or lineage specific events, however three amplification events are more ancient, those in an early amniote, an early mammal, and an early teleost. The older gene duplications are associated with changes in function, including changes in secretion potential and muramidase antibacterial enzymatic activity. CONCLUSIONS Lysozyme g is an essential muramidase enzyme that is widespread in vertebrates. Duplication of the lysozyme g gene, and the retention of non-secreted isozymes that have lost enzymatic activity indicate that lysozyme g has an activity other than the muramidase activity associated with being an antibacterial enzyme.
Collapse
Affiliation(s)
- David M Irwin
- Department of Laboratory Medicine and Pathobiology, Faculty of Medicine, University of Toronto, 1 King's College Circle, Toronto M5S 1A8, Ontario, Canada.
| |
Collapse
|
144
|
Chen X, Wang J, Mitchell E, Guo J, Wang L, Zhang Y, Hodge JC, Shen Y. Recurrent 8q13.2-13.3 microdeletions associated with branchio-oto-renal syndrome are mediated by human endogenous retroviral (HERV) sequence blocks. BMC MEDICAL GENETICS 2014; 15:90. [PMID: 25135225 PMCID: PMC4152767 DOI: 10.1186/s12881-014-0090-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/03/2013] [Accepted: 07/18/2014] [Indexed: 01/18/2023]
Abstract
Background Human endogenous retroviral (HERV) sequences are the remnants of ancient retroviral infection and comprise approximately 8% of the human genome. The high abundance and interspersed nature of homologous HERV sequences make them ideal substrates for genomic rearrangements. A role for HERV sequences in mediating human disease-associated rearrangement has been reported but is likely currently underappreciated. Methods and Results In the present study, two independent de novo 8q13.2-13.3 microdeletion events were identified in patients with clinical features of Branchio-Oto-Renal (BOR) syndrome. Nucleotide-level mapping demonstrated the identical breakpoints, suggesting a recurrent microdeletion including multiple genes such as EYA1, SULF1, and SLCO5A1, which is mediated by HERV1 homologous sequences. Conclusions These findings raise the potential that HERV sequences may more commonly underlie recombination of dosage sensitive regions associated with recurrent syndromes.
Collapse
|
145
|
Sun X, Yang A. Exceptionally large mitochondrial fragments to the nucleus in sequenced mollusk genomes. Mitochondrial DNA A DNA Mapp Seq Anal 2014; 27:1409-10. [PMID: 25103445 DOI: 10.3109/19401736.2014.947604] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The available genome sequences of three mollusks (Biomphalaria glabrata, Aplysia californica and Crassostrea gigas) were first used to investigate the nuclear mitochondrial DNAs (NUMTs) in mollusks. The analysis showed that the NUMT contents were high in B. glabrata (17.738 Kb) and C. gigas (17.192 Kb), of which all or almost all mtDNA sequences were transferred to the nucleus, whereas NUMTs are rare (584 bp) in A. californica. The length of NUMTs was 61 to 5492 bp for B. glabrata, 1711 to 15,481 bp for C. gigas, and 124 to 460 bp for A. californica. The largest C. gigas NUMT covered 84.9% (15,481 bp) of its mitochondrial genome, which is rarely found in invertebrates so far. No correlation was found between NUMT content and genome size in the three sequenced mollusk genomes.
Collapse
Affiliation(s)
- Xiujun Sun
- a Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences , Qingdao , P.R. China
| | - Aiguo Yang
- a Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences , Qingdao , P.R. China
| |
Collapse
|
146
|
A high resolution map of mammalian X chromosome fragile regions assessed by large-scale comparative genomics. Mamm Genome 2014; 25:618-35. [PMID: 25086724 DOI: 10.1007/s00335-014-9537-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Accepted: 07/14/2014] [Indexed: 10/24/2022]
Abstract
Chromosomal evolution involves multiple changes at structural and numerical levels. These changes, which are related to the variation of the gene number and their location, can be tracked by the identification of syntenic blocks (SB). First reports proposed that ~180-280 SB might be shared by mouse and human species. More recently, further studies including additional genomes have identified up to ~1,400 SB during the evolution of eutherian species. A considerable number of studies regarding the X chromosome's structure and evolution have been undertaken because of its extraordinary biological impact on reproductive fitness and speciation. Some have identified evolutionary breakpoint regions and fragile sites at specific locations in the human X chromosome. However, mapping these regions to date has involved using low-to-moderate resolution techniques. Such scenario might be related to underestimating their total number and giving an inaccurate location. The present study included using a combination of bioinformatics methods for identifying, at base-pair level, chromosomal rearrangements occurring during X chromosome evolution in 13 mammalian species. A comparative technique using four different algorithms was used for optimizing the detection of hotspot regions in the human X chromosome. We identified a significant interspecific variation in SB size which was related to genetic information gain regarding the human X chromosome. We found that human hotspot regions were enriched by LINE-1 and Alu transposable elements, which may have led to intraspecific chromosome rearrangement events. New fragile regions located in the human X chromosome have also been postulated. We estimate that the high resolution map of X chromosome fragile sites presented here constitutes useful data concerning future studies on mammalian evolution and human disease.
Collapse
|
147
|
The common marmoset genome provides insight into primate biology and evolution. Nat Genet 2014; 46:850-7. [PMID: 25038751 PMCID: PMC4138798 DOI: 10.1038/ng.3042] [Citation(s) in RCA: 170] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Accepted: 06/27/2014] [Indexed: 02/06/2023]
Abstract
We report the whole-genome sequence of the common marmoset (Callithrix jacchus). The 2.26-Gb genome of a female marmoset was assembled using Sanger read data (6×) and a whole-genome shotgun strategy. A first analysis has permitted comparison with the genomes of apes and Old World monkeys and the identification of specific features that might contribute to the unique biology of this diminutive primate, including genetic changes that may influence body size, frequent twinning and chimerism. We observed positive selection in growth hormone/insulin-like growth factor genes (growth pathways), respiratory complex I genes (metabolic pathways), and genes encoding immunobiological factors and proteases (reproductive and immunity pathways). In addition, both protein-coding and microRNA genes related to reproduction exhibited evidence of rapid sequence evolution. This genome sequence for a New World monkey enables increased power for comparative analyses among available primate genomes and facilitates biomedical research application.
Collapse
|
148
|
Brosius J. The persistent contributions of RNA to eukaryotic gen(om)e architecture and cellular function. Cold Spring Harb Perspect Biol 2014; 6:a016089. [PMID: 25081515 DOI: 10.1101/cshperspect.a016089] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Currently, the best scenario for earliest forms of life is based on RNA molecules as they have the proven ability to catalyze enzymatic reactions and harbor genetic information. Evolutionary principles valid today become apparent in such models already. Furthermore, many features of eukaryotic genome architecture might have their origins in an RNA or RNA/protein (RNP) world, including the onset of a further transition, when DNA replaced RNA as the genetic bookkeeper of the cell. Chromosome maintenance, splicing, and regulatory function via RNA may be deeply rooted in the RNA/RNP worlds. Mostly in eukaryotes, conversion from RNA to DNA is still ongoing, which greatly impacts the plasticity of extant genomes. Raw material for novel genes encoding protein or RNA, or parts of genes including regulatory elements that selection can act on, continues to enter the evolutionary lottery.
Collapse
Affiliation(s)
- Jürgen Brosius
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
| |
Collapse
|
149
|
Jiang J, Wang J, Wang H, Zhang Y, Kang H, Feng X, Wang J, Yin Z, Bao W, Zhang Q, Liu JF. Global copy number analyses by next generation sequencing provide insight into pig genome variation. BMC Genomics 2014; 15:593. [PMID: 25023178 PMCID: PMC4111851 DOI: 10.1186/1471-2164-15-593] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2014] [Accepted: 07/04/2014] [Indexed: 01/10/2023] Open
Abstract
Background Copy number variations (CNVs) confer significant effects on genetic innovation and phenotypic variation. Previous CNV studies in swine seldom focused on in-depth characterization of global CNVs. Results Using whole-genome assembly comparison (WGAC) and whole-genome shotgun sequence detection (WSSD) approaches by next generation sequencing (NGS), we probed formation signatures of both segmental duplications (SDs) and individualized CNVs in an integrated fashion, building the finest resolution CNV and SD maps of pigs so far. We obtained copy number estimates of all protein-coding genes with copy number variation carried by individuals, and further confirmed two genes with high copy numbers in Meishan pigs through an enlarged population. We determined genome-wide CNV hotspots, which were significantly enriched in SD regions, suggesting evolution of CNV hotspots may be affected by ancestral SDs. Through systematically enrichment analyses based on simulations and bioinformatics analyses, we revealed CNV-related genes undergo a different selective constraint from those CNV-unrelated regions, and CNVs may be associated with or affect pig health and production performance under recent selection. Conclusions Our studies lay out one way for characterization of CNVs in the pig genome, provide insight into the pig genome variation and prompt CNV mechanisms studies when using pigs as biomedical models for human diseases. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-593) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Jian-Feng Liu
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China.
| |
Collapse
|
150
|
Zhang ZZ, Pannunzio NR, Han L, Hsieh CL, Yu K, Lieber MR. The strength of an Ig switch region is determined by its ability to drive R loop formation and its number of WGCW sites. Cell Rep 2014; 8:557-69. [PMID: 25017067 DOI: 10.1016/j.celrep.2014.06.021] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2014] [Revised: 05/20/2014] [Accepted: 06/12/2014] [Indexed: 12/22/2022] Open
Abstract
R loops exist at the murine IgH switch regions and possibly other locations, but their functional importance is unclear. In biochemical systems, R loop initiation requires DNA sequence regions containing clusters of G nucleotides, but cellular studies have not been done. Here, we vary the G-clustering, total switch region length, and the number of target sites (WGCW sites for the activation-induced deaminase) at synthetic switch regions in a murine B cell line to determine the effect on class switch recombination (CSR). G-clusters increase CSR regardless of their immediate proximity to the WGCW sites. This increase is accompanied by an increase in R loop formation. CSR efficiency correlates better with the absolute number of WGCW sites in the switch region rather than the total switch region length or density of WGCW sites. Thus, the overall strength of the switch region depends on G-clusters, which initiate R loop formation, and on the number of WGCW sites.
Collapse
Affiliation(s)
- Zheng Z Zhang
- USC Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA; Molecular and Computational Biology Section, Department of Biological Sciences, USC Dornsife, University of Southern California, Los Angeles, CA 90033, USA; Departments of Pathology, Biochemistry & Molecular Biology, Molecular Microbiology & Immunology, and Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Nicholas R Pannunzio
- USC Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA; Molecular and Computational Biology Section, Department of Biological Sciences, USC Dornsife, University of Southern California, Los Angeles, CA 90033, USA; Departments of Pathology, Biochemistry & Molecular Biology, Molecular Microbiology & Immunology, and Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Li Han
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824, USA
| | - Chih-Lin Hsieh
- USC Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA; Molecular and Computational Biology Section, Department of Biological Sciences, USC Dornsife, University of Southern California, Los Angeles, CA 90033, USA; Departments of Pathology, Biochemistry & Molecular Biology, Molecular Microbiology & Immunology, and Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Kefei Yu
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824, USA.
| | - Michael R Lieber
- USC Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA; Molecular and Computational Biology Section, Department of Biological Sciences, USC Dornsife, University of Southern California, Los Angeles, CA 90033, USA; Departments of Pathology, Biochemistry & Molecular Biology, Molecular Microbiology & Immunology, and Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA.
| |
Collapse
|