1
|
Lynn H, Sun X, Ayshiev D, Siegler JH, Rizzo AN, Karnes JH, Gonzales Garay M, Wang T, Casanova N, Camp SM, Ellis NA, Garcia JGN. Single nucleotide polymorphisms in the MYLKP1 pseudogene are associated with increased colon cancer risk in African Americans. PLoS One 2018; 13:e0200916. [PMID: 30161129 DOI: 10.1371/journal.pone.0200916] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2018] [Accepted: 07/04/2018] [Indexed: 12/18/2022] Open
Abstract
INTRODUCTION Pseudogenes are paralogues of functional genes historically viewed as defunct due to either the lack of regulatory elements or the presence of frameshift mutations. Recent evidence, however, suggests that pseudogenes may regulate gene expression, although the functional role of pseudogenes remains largely unknown. We previously reported that MYLKP1, the pseudogene of MYLK that encodes myosin light chain kinase (MLCK), is highly expressed in lung and colon cancer cell lines and tissues but not in normal lung or colon. The MYLKP1 promoter is minimally active in normal bronchial epithelial cells but highly active in lung adenocarcinoma cells. In this study, we further validate MYLKP1 as an oncogene via elucidation of the functional role of MYLKP1 genetic variants in colon cancer risk. METHODS Proliferation and migration assays were performed in MYLKP1-transfected colon and lung cancer cell lines (H441, A549) and commercially-available normal lung and colon cells. Fourteen MYLKP1 SNPs (MAFs >0.01) residing within the 4 kb MYLKP1 promoter region, the core 1.4 kb of MYLKP1 gene, and a 4 kb enhancer region were selected and genotyped in a colorectal cancer cohort. MYLKP1 SNP influences on activity of MYLKP1 promoter (2kb) was assessed by dual luciferase reporter assay. RESULTS Cancer cell lines, H441 and A549, exhibited increased MYLKP1 expression, increased MYLKP1 luciferase promoter activity, increased proliferation and migration. Genotyping studies identified two MYLKP1 SNPs (rs12490683; rs12497343) that significantly increase risk of colon cancer in African Americans compared to African American controls. Rs12490683 and rs12497343 further increase MYLKP1 promoter activity compared to the wild type MYLKP1 promoter. CONCLUSION MYLKP1 is a cancer-promoting pseudogene whose genetic variants differentially enhance cancer risk in African American populations.
Collapse
|
2
|
Yu HJ, Serebryannyy LA, Fry M, Greene M, Chernaya O, Hu WY, Chew TL, Mahmud N, Kadkol SS, Glover S, Prins G, Strakova Z, de Lanerolle P. Tumor stiffness is unrelated to myosin light chain phosphorylation in cancer cells. PLoS One 2013; 8:e79776. [PMID: 24224004 DOI: 10.1371/journal.pone.0079776] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2013] [Accepted: 09/25/2013] [Indexed: 01/15/2023] Open
Abstract
Many tumors are stiffer than their surrounding tissue. This increase in stiffness has been attributed, in part, to a Rho-dependent elevation of myosin II light chain phosphorylation. To characterize this mechanism further, we studied myosin light chain kinase (MLCK), the main enzyme that phosphorylates myosin II light chains. We anticipated that increases in MLCK expression and activity would contribute to the increased stiffness of cancer cells. However, we find that MLCK mRNA and protein levels are substantially less in cancer cells and tissues than in normal cells. Consistent with this observation, cancer cells contract 3D collagen matrices much more slowly than normal cells. Interestingly, inhibiting MLCK or Rho kinase did not affect the 3D gel contractions while blebbistatin partially and cytochalasin D maximally inhibited contractions. Live cell imaging of cells in collagen gels showed that cytochalasin D inhibited filopodia-like projections that formed between cells while a MLCK inhibitor had no effect on these projections. These data suggest that myosin II phosphorylation is dispensable in regulating the mechanical properties of tumors.
Collapse
|
3
|
Abstract
Pseudogenes, the nonfunctional homologs of functional genes and thus exemplified as 'genomic fossils' provide intriguing snapshots of the evolutionary history of human genome. These defunct copies generally arise by retrotransposition or duplication followed by various genetic disablements. In this study, focusing on human pseudogenes and their functional homologues we describe their characteristic features and relevance to protein sequence evolution. We recapitulate that pseudogenes harbor disease-causing degenerative sequence variations in conjunction with the immense disease gene association of their progenitors. Furthermore, we also discuss the issue of functional resurrection and the potentiality observed in some pseudogenes to regulate their functional counterparts.
Collapse
Affiliation(s)
- Kamalika Sen
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, India. Tel.: +91 33 2355 6626; Fax: +91 33 2355 3886;
| | | |
Collapse
|
4
|
Quignon P, Rimbault M, Robin S, Galibert F. Genetics of canine olfaction and receptor diversity. Mamm Genome 2012; 23:132-43. [PMID: 22080304 DOI: 10.1007/s00335-011-9371-1] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2011] [Accepted: 10/23/2011] [Indexed: 01/06/2023]
Abstract
Olfaction is a particularly important sense in the dog. Humans selected for this capacity during the domestication process, and selection has continued to be employed to enhance this ability. In this review we first describe the different olfactory systems that exist and the different odorant receptors that are expressed in those systems. We then focus on the dog olfactory receptors by describing the olfactory receptor gene repertoire and its polymorphisms. Finally, we discuss the different uses of dog olfaction and the questions that still need to be studied.
Collapse
|
5
|
Abstract
Pseudogenes are considered nonfunctional genomic artifacts of catastrophic pathways. Recent evidence, however, indicates novel roles for pseudogenes as regulators of gene expression. We tested the functionality of myosin light chain kinase pseudogene (MYLKP1) in human cells and tissues by RT-PCR, promoter activity, and cell proliferation assays. MYLKP1 is partially duplicated from the original MYLK gene that encodes nonmuscle and smooth muscle myosin light chain kinase (smMLCK) isoforms and regulates cell contractility and cytokinesis. Despite strong homology with the smMLCK promoter (∼ 89.9%), the MYLKP1 promoter is minimally active in normal bronchial epithelial cells but highly active in lung adenocarcinoma cells. Moreover, MYLKP1 and smMLCK exhibit negatively correlated transcriptional patterns in normal and cancer cells with MYLKP1 strongly expressed in cancer cells and smMLCK highly expressed in non-neoplastic cells. For instance, expression of smMLCK decreased (19.5 ± 4.7 fold) in colon carcinoma tissues compared to normal colon tissues. Mechanistically, MYLKP1 overexpression inhibits smMLCK expression in cancer cells by decreasing RNA stability, leading to increased cell proliferation. These studies provide strong evidence for the functional involvement of pseudogenes in carcinogenesis and suggest MYLKP1 as a potential novel diagnostic or therapeutic target in human cancers.
Collapse
Affiliation(s)
- Yoo Jeong Han
- Department of Medicine, University of Illinois at Chicago, Chicago, Illinois 60612-7227, USA
| | | | | | | | | |
Collapse
|
6
|
Abstract
Perception of chemical stimuli from the environment is essential to most animals; accordingly, they are equipped with a complex olfactory system capable of receiving a nearly unlimited number of odorous substances and pheromones. This enormous task is accomplished by olfactory sensory neurons (OSNs) arranged in several chemosensory compartments in the nose. The sensitive and selective responsiveness of OSNs to odorous molecules and pheromones is based on distinct receptors in their chemosensory membrane; consequently, olfactory receptors play a key role for a reliable recognition and an accurate processing of chemosensory information. They are therefore considered as key elements for an understanding of the principles and mechanisms underlying the sense of smell. The repertoire of olfactory receptors in mammals encompasses hundreds of different receptor types which are highly diverse and expressed in distinct subcompartments of the nose. Accordingly, they are categorized into several receptor families, including odorant receptors (ORs), vomeronasal receptors (V1Rs and V2Rs), trace amine-associated receptors (TAARs), formyl peptide receptors (FPRs), and the membrane guanylyl cyclase GC-D. This large and complex receptor repertoire is the basis for the enormous chemosensory capacity of the olfactory system.
Collapse
Affiliation(s)
- Joerg Fleischer
- Institute of Physiology, University of Hohenheim Stuttgart, Germany
| | | | | |
Collapse
|
7
|
Abstract
Compared with other mammals, the genomes of humans and other primates show an enrichment of large, interspersed segmental duplications (SDs) with high levels of sequence identity. Recent evidence has begun to shed light on the origin of primate SDs, pointing to a complex interplay of mechanisms and indicating that distinct waves of duplication took place during primate evolution. There is also evidence for a strong association between duplication, genomic instability and large-scale chromosomal rearrangements. Exciting new findings suggest that SDs have not only created novel primate gene families, but might have also influenced current human genic and phenotypic variation on a previously unappreciated scale. A growing number of examples link natural human genetic variation of these regions to susceptibility to common disease.
Collapse
Affiliation(s)
- Jeffrey A Bailey
- Department of Pathology, Case Western University School of Medicine and University Hospitals of Cleveland, Ohio 44106, USA
| | | |
Collapse
|
8
|
Yin F, Hoggatt AM, Zhou J, Herring BP. 130-kDa smooth muscle myosin light chain kinase is transcribed from a CArG-dependent, internal promoter within the mouse mylk gene. Am J Physiol Cell Physiol 2006; 290:C1599-609. [PMID: 16407417 DOI: 10.1152/ajpcell.00289.2005] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
The 130-kDa smooth muscle myosin light chain kinase (smMLCK) is a Ca2+/CaM-regulated enzyme that plays a pivotal role in the initiation of smooth muscle contraction and regulation of cellular migration and division. Despite the critical importance of smMLCK in these processes, little is known about the mechanisms regulating its expression. In this study, we have identified the proximal promoter of smMLCK within an intron of the mouse mylk gene. The mylk gene encodes at least two isoforms of MLCK (130 and 220 kDa) and telokin. Luciferase reporter gene assays demonstrated that a 282-bp fragment (−167 to +115) of the smMLCK promoter was sufficient for maximum activity in A10 smooth muscle cells and 10T1/2 fibroblasts. Deletion of the 16 bp between −167 and −151, which included a CArG box, resulted in a nearly complete loss of promoter activity. Gel mobility shift assays and chromatin immunoprecipitation assays demonstrated that serum response factor (SRF) binds to this CArG box both in vitro and in vivo. SRF knockdown by short hairpin RNA decreased endogenous smMLCK expression in A10 cells. Although the SRF coactivator myocardin induced smMLCK expression in 10T1/2 cells, myocardin activated the promoter only two- to fourfold in reporter gene assays. Addition of either intron 1 or 6 kb of the 5′ upstream sequence did not lead to any further activation of the promoter by myocardin. The proximal smMLCK promoter also contains a consensus GATA-binding site that bound GATA-6. GATA-6 binding to this site decreased endogenous smMLCK expression, inhibited promoter activity in smooth muscle cells, and blocked the ability of myocardin to induce smMLCK expression. Altogether, these data suggest that SRF and SRF-associated factors play a key role in regulating the expression of smMLCK.
Collapse
Affiliation(s)
- Feng Yin
- Dept. of Cellular and Integrative Physiology, Indiana Univ. School of Medicine, 635 Barnhill Dr., Indianapolis, IN 46202-5120, USA
| | | | | | | |
Collapse
|
9
|
Gao L, Grant A, Halder I, Brower R, Sevransky J, Maloney JP, Moss M, Shanholtz C, Yates CR, Meduri GU, Shriver MD, Ingersoll R, Scott AF, Beaty TH, Moitra J, Ma SF, Ye SQ, Barnes KC, Garcia JGN. Novel polymorphisms in the myosin light chain kinase gene confer risk for acute lung injury. Am J Respir Cell Mol Biol 2006; 34:487-95. [PMID: 16399953 PMCID: PMC2644210 DOI: 10.1165/rcmb.2005-0404oc] [Citation(s) in RCA: 132] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2005] [Accepted: 12/05/2005] [Indexed: 11/24/2022] Open
Abstract
The genetic basis of acute lung injury (ALI) is poorly understood. The myosin light chain kinase (MYLK) gene encodes the nonmuscle myosin light chain kinase isoform, a multifunctional protein involved in the inflammatory response (apoptosis, vascular permeability, leukocyte diapedesis). To examine MYLK as a novel candidate gene in sepsis-associated ALI, we sequenced exons, exon-intron boundaries, and 2 kb of 5' UTR of the MYLK, which revealed 51 single-nucleotide polymorphisms (SNPs). Potential association of 28 MYLK SNPs with sepsis-associated ALI were evaluated in a case-control sample of 288 European American subjects (EAs) with sepsis alone, subjects with sepsis-associated ALI, or healthy control subjects, and a sample population of 158 African American subjects (AAs) with sepsis and ALI. Significant single locus associations in EAs were observed between four MYLK SNPs and the sepsis phenotype (P<0.001), with an additional SNP associated with the ALI phenotype (P=0.03). A significant association of a single SNP (identical to the SNP identified in EAs) was observed in AAs with sepsis (P=0.002) and with ALI (P=0.01). Three sepsis risk-conferring haplotypes in EAs were defined downstream of start codon of smooth muscle MYLK isoform, a region containing putative regulatory elements (P<0.001). In contrast, multiple haplotypic analyses revealed an ALI-specific, risk-conferring haplotype at 5' of the MYLK gene in both European and African Americans and an additional 3' region haplotype only in African Americans. These data strongly implicate MYLK genetic variants to confer increased risk of sepsis and sepsis-associated ALI.
Collapse
Affiliation(s)
- Li Gao
- Department of Medicine, University of Chicago Pritzker School of Medicine, 5841 S. Maryland Avenue, W604, Chicago, IL 60637, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Yue Y, Haaf T. 7E olfactory receptor gene clusters and evolutionary chromosome rearrangements. Cytogenet Genome Res 2006; 112:6-10. [PMID: 16276084 DOI: 10.1159/000087507] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2005] [Accepted: 03/01/2005] [Indexed: 11/19/2022] Open
Abstract
Olfactory receptor (OR) genes of the 7E subfamily have been duplicated to multiple regions throughout the human genome. Segmental duplications containing 7E OR genes have been associated with both pathological and evolutionary chromosome rearrangements. Many of these breakpoint regions coincide with breaks of chromosomal synteny in the mouse, rat and/or chicken genomes. Collectively, these data suggest that 7E OR-containing regions represent hot spots of genomic instability.
Collapse
Affiliation(s)
- Y Yue
- Institute for Human Genetics, Mainz University School of Medicine, Mainz, Germany
| | | |
Collapse
|
11
|
Yue Y, Grossmann B, Tsend-Ayush E, Grützner F, Ferguson-Smith MA, Yang F, Haaf T. Genomic structure and paralogous regions of the inversion breakpoint occurring between human chromosome 3p12.3 and orangutan chromosome 2. Cytogenet Genome Res 2005; 108:98-105. [PMID: 15545721 DOI: 10.1159/000080807] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2003] [Accepted: 12/12/2003] [Indexed: 11/19/2022] Open
Abstract
Intrachromosomal duplications play a significant role in human genome pathology and evolution. To better understand the molecular basis of evolutionary chromosome rearrangements, we performed molecular cytogenetic and sequence analyses of the breakpoint region that distinguishes human chromosome 3p12.3 and orangutan chromosome 2. FISH with region-specific BAC clones demonstrated that the breakpoint-flanking sequences are duplicated intrachromosomally on orangutan 2 and human 3q21 as well as at many pericentromeric and subtelomeric sites throughout the genomes. Breakage and rearrangement of the human 3p12.3-homologous region in the orangutan lineage were associated with a partial loss of duplicated sequences in the breakpoint region. Consistent with our FISH mapping results, computational analysis of the human chromosome 3 genomic sequence revealed three 3p12.3-paralogous sequence blocks on human chromosome 3q21 and smaller blocks on the short arm end 3p26-->p25. This is consistent with the view that sequences from an ancestral site at 3q21 were duplicated at 3p12.3 in a common ancestor of orangutan and humans. Our results show that evolutionary chromosome rearrangements are associated with microduplications and microdeletions, contributing to the DNA differences between closely related species.
Collapse
Affiliation(s)
- Y Yue
- Institute for Human Genetics, Mainz University School of Medicine, Germany
| | | | | | | | | | | | | |
Collapse
|
12
|
Bailey JA, Eichler EE. Genome-wide detection and analysis of recent segmental duplications within mammalian organisms. Cold Spring Harb Symp Quant Biol 2004; 68:115-24. [PMID: 15338609 DOI: 10.1101/sqb.2003.68.115] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Affiliation(s)
- J A Bailey
- Department of Genetics, Center for Computational Genomics, Case Western Reserve University School of Medicine and University Hospitals of Cleveland, Cleveland, Ohio 44106, USA
| | | |
Collapse
|
13
|
Abstract
Comparative mapping of more than 100 region-specific clones from human chromosome 3 in Bornean and Sumatran orangutans, siamang gibbon, and Old and New World monkeys allowed us to reconstruct ancestral simian and hominoid chromosomes. A single paracentric inversion derives chromosome 1 of the Old World monkey Presbytis cristata from the simian ancestor. In the New World monkey Callithrix geoffroyi and siamang, the ancestor diverged on multiple chromosomes, through utilizing different breakpoints. One shared and two independent inversions derive Bornean orangutan 2 and human 3, implying that neither Bornean orangutans nor humans have conserved the ancestral chromosome form. The inversions, fissions, and translocations in the five species analyzed involve at least 14 different evolutionary breakpoints along the entire length of human 3; however, particular regions appear to be more susceptible to chromosome reshuffling. The ancestral pericentromeric region has promoted both large-scale and micro-rearrangements. Small segments homologous to human 3q11.2 and 3q21.2 were repositioned intrachromosomally independent of the surrounding markers in the orangutan lineage. Breakage and rearrangement of the human 3p12.3 region were associated with extensive intragenomic duplications at multiple orangutan and gibbon subtelomeric sites. We propose that new chromosomes and genomes arise through large-scale rearrangements of evolutionarily conserved genomic building blocks and additional duplication, amplification, and/or repositioning of inherently unstable smaller DNA segments contained within them.
Collapse
Affiliation(s)
- Enkhjargal Tsend-Ayush
- Comparative Genomics Group, Research School of Biological Sciences, Australian National University, Canberra ACT 2601, Australia
| | | | | | | | | | | | | |
Collapse
|
14
|
Abstract
Humans perceive an immense variety of chemicals as having distinct odors. Odor perception initiates in the nose, where odorants are detected by a large family of olfactory receptors (ORs). ORs have diverse protein sequences but can be assigned to subfamilies on the basis of sequence relationships. Members of the same subfamily have related sequences and are likely to recognize structurally related odorants. To gain insight into the mechanisms underlying odor perception, we analyzed the human OR gene family. By searching the human genome database, we identified 339 intact OR genes and 297 OR pseudogenes. Determination of their genomic locations showed that OR genes are unevenly distributed among 51 different loci on 21 human chromosomes. Sequence comparisons showed that the human OR family is composed of 172 subfamilies. Types of odorant structures that may be recognized by some subfamilies were predicted by identifying subfamilies that contain ORs with known odor ligands or human homologs of such ORs. Analysis of the chromosomal locations of members of each OR subfamily revealed that most subfamilies are encoded by a single chromosomal locus. Moreover, many loci encode only one or a few subfamilies, suggesting that different parts of the genome may, to some extent, be involved in the detection of different types of odorant structural motifs.
Collapse
Affiliation(s)
- Bettina Malnic
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA
| | | | | |
Collapse
|
15
|
Amadou C, Younger RM, Sims S, Matthews LH, Rogers J, Kumanovics A, Ziegler A, Beck S, Lindahl KF. Co-duplication of olfactory receptor and MHC class I genes in the mouse major histocompatibility complex. Hum Mol Genet 2003; 12:3025-40. [PMID: 14506126 DOI: 10.1093/hmg/ddg317] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We report the 897 kb sequence of a cluster of olfactory receptor (OR) genes located at the distal end of the major histocompatibility complex (MHC) class I region on mouse chromosome 17 of strain 129/SvJ (H2bc). With additional information from the mouse genome draft sequence, we identified 59 OR loci (approximately 20% pseudogenes) in contrast to only 25 OR loci (approximately 50% pseudogenes) in the corresponding centromeric OR cluster that is part of the 'extended MHC class I region' on human chromosome 6. Comparative analysis leads to three major observations: (i) most of the OR subfamilies have evolved independently in the two species, expanding more in the mouse, and resulting in co-orthologs--subfamilies of highly similar paralogs that keep orthologous relationships with their human counterparts; (ii) three of the mouse OR subfamilies have no orthologs in humans; and (iii) MHC class I loci are interspersed in the OR cluster in mouse but not in human, and were subjected to co-duplication with OR genes. Screening of our sequence against the available sequences of other strains/haplotypes revealed that most of the OR loci are polymorphic and that the number of OR loci may vary among strains/haplotypes. Our findings that MHC-linked OR loci share duplication with MHC class I loci, have duplicated extensively and are polymorphic revives questions about potential reciprocal influences acting on the dynamics and evolution of the H2 region and the H2-linked OR loci.
Collapse
MESH Headings
- Alleles
- Amino Acid Sequence
- Animals
- Chromosome Mapping
- Chromosomes, Human, Pair 6
- Chromosomes, Mammalian
- Consensus Sequence
- Evolution, Molecular
- Gene Duplication
- Genes, MHC Class I
- Haplotypes
- Histocompatibility Antigens Class II/genetics
- Humans
- Major Histocompatibility Complex/genetics
- Mice
- Mice, Inbred Strains
- Molecular Sequence Data
- Mutation
- Phylogeny
- Polymorphism, Genetic
- Protein Structure, Tertiary
- Receptors, Odorant/chemistry
- Receptors, Odorant/genetics
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
- Species Specificity
Collapse
Affiliation(s)
- Claire Amadou
- Howard Hughes Medical Institute and Center for Immunology, University of Texas Southwestern Medical Center, Dallas, 75390-9050, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Abstract
We report a comprehensive comparative analysis of human and mouse olfactory receptor (OR) genes encoding OR37 subtypes to determine the repertoire, chromosomal organization, and relatedness of these genes. Two OR37 clusters were found in both mouse (chromosome 4) and human (chromosome 9); with five genes in cluster I and three (mouse) and seven genes (human) in cluster II. The pronounced diversity of noncoding sequence regions in both genomic loci indicates a long-term coexistence of the two clusters and the genes within the clusters. In contrast, the coding regions, particularly of genes in cluster I, showed remarkably high sequence identity, a feature quite unique for OR genes. The conservation of only the coding sequences indicates that OR37 may be under negative selection pressure and suggests that the OR37 receptor family may be tuned to recognize distinct sets of signaling molecules. A comparison of mouse and human OR37 gene clusters revealed that genes in cluster I are highly related within each species whereas genes in cluster II are highly related across species. These data reflect a unique and complex evolutionary history of the OR37 family.
Collapse
Affiliation(s)
- Reiner Hoppe
- Institute of Physiology, University Hohenheim, Garbenstrasse 30, 70593, Stuttgart, Germany
| | | | | |
Collapse
|
17
|
Abstract
Large segmental duplications (SDs) constitute at least 3.6% of the human genome and have increased its size, complexity, and diversity. SDs can mediate ectopic sequence exchange resulting in gross chromosomal rearrangements that could contribute to speciation and disease. We have identified and evaluated a subset of human SDs that harbor an 88-member subfamily of olfactory receptor (OR)-like genes called the 7Es. At least 92% of these genes appear to be pseudogenes when compared to other OR genes. The 7E-containing SDs (7E SDs) have duplicated to at least 35 regions of the genome via intra- and interchromosomal duplication events. In contrast to many human SDs, the 7E SDs are not biased towards pericentromeric or subtelomeric regions. We find evidence for gene conversion among 7E genes and larger sequence exchange between 7E SDs, supporting the hypothesis that long, highly similar stretches of DNA facilitate ectopic interactions. The complex structure and history of the 7E SDs necessitates extension of the current model of large-scale DNA duplication. Despite their appearance as pseudogenes, some 7E genes exhibit a signature of purifying selection, and at least one 7E gene is expressed.
Collapse
Affiliation(s)
- Tera Newman
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | | |
Collapse
|
18
|
Fan Y, Linardopoulou E, Friedman C, Williams E, Trask BJ. Genomic structure and evolution of the ancestral chromosome fusion site in 2q13-2q14.1 and paralogous regions on other human chromosomes. Genome Res 2002; 12:1651-62. [PMID: 12421751 PMCID: PMC187548 DOI: 10.1101/gr.337602] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Human chromosome 2 was formed by the head-to-head fusion of two ancestral chromosomes that remained separate in other primates. Sequences that once resided near the ends of the ancestral chromosomes are now interstitially located in 2q13-2q14.1. Portions of these sequences had duplicated to other locations prior to the fusion. Here we present analyses of the genomic structure and evolutionary history of >600 kb surrounding the fusion site and closely related sequences on other human chromosomes. Sequence blocks that closely flank the inverted arrays of degenerate telomere repeats marking the fusion site are duplicated at many, primarily subtelomeric, locations. In addition, large portions of a 168-kb centromere-proximal block are duplicated at 9pter, 9p11.2, and 9q13, with 98%-99% average sequence identity. A 67-kb block on the distal side of the fusion site is highly homologous to sequences at 22qter. A third ~100-kb segment is 96% identical to a region in 2q11.2. By integrating data on the extent and similarity of these paralogous blocks, including the presence of phylogenetically informative repetitive elements, with observations of their chromosomal distribution in nonhuman primates, we infer the order of the duplications that led to their current arrangement. Several of these duplicated blocks may be associated with breakpoints of inversions that occurred during primate evolution and of recurrent chromosome rearrangements in humans.
Collapse
Affiliation(s)
- Yuxin Fan
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | | | | | | | | |
Collapse
|
19
|
Ballarati L, Piccini I, Carbone L, Archidiacono N, Rollier A, Marozzi A, Meneveri R, Ginelli E. Human genome dispersal and evolution of 4q35 duplications and interspersed LSau repeats. Gene 2002; 296:21-7. [PMID: 12383499 DOI: 10.1016/s0378-1119(02)00858-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
We have investigated the evolutionary history of the 4q35 paralogous region, and of a sub-family of interspersed LSau repeats. In HSA, 4q35 duplications were localized at 1q12, 3p12.3, 4q35, 10q26, 20cen, whereas duplicons and interspersed LSau repeats simultaneously labeled the p arm of acrocentric chromosomes. A multi-site localization of 4q35-like sequences was also observed in PTR, GGO, PPY, HLA (Hominoidea) and PAN (Old World monkey), thus indicating that duplications of this region have occurred extensively in the two clades, which diverged at least 25 million years ago. In HSA, PTR and PAN, 4q35-derived duplicons co-localized with rDNA, whereas in GGO and PPY this association was partially lacking. In PAN, the single- and multi-site distribution of rDNA and paralogous sequences, respectively, indicates a different timing of sequence dispersal. The sub-family of interspersed LSau repeats showed a lesser dispersal than 4q35 duplications both in man and great apes. This finding suggests that duplications and repeated sequences have undergone different expansion/contraction events during evolution. The mechanisms underlying the dispersal of paralogous regions may be further derived through studies comparing the detailed structural organization of these genomic regions in man and primates.
Collapse
MESH Headings
- Animals
- Base Sequence
- Chromosomes, Human, Pair 10/genetics
- Chromosomes, Human, Pair 22/genetics
- Chromosomes, Human, Pair 3/genetics
- Chromosomes, Human, Pair 4/genetics
- DNA/chemistry
- DNA/genetics
- Evolution, Molecular
- Gene Duplication
- Genome, Human
- Humans
- In Situ Hybridization, Fluorescence
- Interspersed Repetitive Sequences/genetics
- Molecular Sequence Data
- Sequence Analysis, DNA
Collapse
Affiliation(s)
- L Ballarati
- Dipartimento di Biologia e Genetica per le Scienze Mediche, Università di Milano, Via Viotti 3/5, Italy
| | | | | | | | | | | | | | | |
Collapse
|
20
|
Angata T, Kerr SC, Greaves DR, Varki NM, Crocker PR, Varki A. Cloning and characterization of human Siglec-11. A recently evolved signaling molecule that can interact with SHP-1 and SHP-2 and is expressed by tissue macrophages, including brain microglia. J Biol Chem 2002; 277:24466-74. [PMID: 11986327 DOI: 10.1074/jbc.m202833200] [Citation(s) in RCA: 143] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Siglecs are sialic acid-recognizing animal lectins of the immunoglobulin superfamily. We have cloned and characterized a novel human molecule, Siglec-11, that belongs to the subgroup of CD33/Siglec-3-related Siglecs. As with others in this subgroup, the cytosolic domain of Siglec-11 is phosphorylated at tyrosine residue(s) upon pervanadate treatment of cells and then recruits the protein-tyrosine phosphatases SHP-1 and SHP-2. However, Siglec-11 has several novel features relative to the other CD33/Siglec-3-related Siglecs. First, it binds specifically to alpha2-8-linked sialic acids. Second, unlike other CD33/Siglec-3-related Siglecs, Siglec-11 was not found on peripheral blood leukocytes. Instead, we observed its expression on macrophages in various tissues, such as liver Kupffer cells. Third, it was also expressed on brain microglia, thus becoming the second Siglec to be found in the nervous system. Fourth, whereas the Siglec-11 gene is on human chromosome 19, it lies outside the previously described CD33/Siglec-3-related Siglec cluster on this chromosome. Fifth, analyses of genome data bases indicate that Siglec-11 has no mouse ortholog and that it is likely to be the last canonical human Siglec to be reported. Finally, although Siglec-11 shows marked sequence similarity to human Siglec-10 in its extracellular domain, the cytosolic tail appears only distantly related. Analysis of genomic regions surrounding the Siglec-11 gene suggests that it is actually a chimeric molecule that arose from relatively recent gene duplication and recombination events, involving the extracellular domain of a closely related ancestral Siglec gene (which subsequently became a pseudogene) and a transmembrane and cytosolic tail derived from another ancestral Siglec.
Collapse
MESH Headings
- Amino Acid Sequence
- Antigens, CD/analysis
- Antigens, CD/chemistry
- Antigens, CD/genetics
- Antigens, CD/metabolism
- Antigens, Differentiation, Myelomonocytic/analysis
- Antigens, Differentiation, Myelomonocytic/chemistry
- Antigens, Differentiation, Myelomonocytic/genetics
- Antigens, Differentiation, Myelomonocytic/metabolism
- Appendix/cytology
- Appendix/metabolism
- Base Sequence
- Brain/physiology
- Cloning, Molecular
- Evolution, Molecular
- Humans
- Intracellular Signaling Peptides and Proteins
- Lectins/chemistry
- Lectins/genetics
- Lectins/metabolism
- Macrophages/physiology
- Membrane Proteins
- Microglia/physiology
- Molecular Sequence Data
- Organ Specificity
- Palatine Tonsil/cytology
- Palatine Tonsil/metabolism
- Protein Tyrosine Phosphatase, Non-Receptor Type 11
- Protein Tyrosine Phosphatase, Non-Receptor Type 6
- Protein Tyrosine Phosphatases/metabolism
- Pseudogenes
- RNA, Messenger/genetics
- Reverse Transcriptase Polymerase Chain Reaction
- Sequence Alignment
- Sequence Homology, Amino Acid
- Sialic Acid Binding Ig-like Lectin 3
- Transcription, Genetic
Collapse
Affiliation(s)
- Takashi Angata
- Glycobiology Research and Training Center, Department of Medicine, University of California, San Diego, La Jolla, California 92093-0687, USA
| | | | | | | | | | | |
Collapse
|
21
|
Abstract
Certain members of the olfactory receptor superfamily appear to be expressed not only in chemosensory neurons of the nasal epithelium. Analyzing the transgenic mouse line MOL2.3-IGITL, the olfactory receptor subtype MOL2.3 was found to be expressed in distinct subpopulations of cells within a cranial, a cervical as well as within a thoracic ganglion. By means of coexpressed markers, the axonal processes of MOL2.3 expressing cells could be visualized and thus the target tissues innervated by these ganglionic neurons identified. Stained fibers, but no stained cell bodies were visible in distinct head regions, notably in the lateral nasal gland and in the so-called Harderian gland; staining was also observed on distinct segments of blood vessels, especially within the tongue. In the thoracic region, the heart and a small segment of the aorta as well as a distinct population of lung alveoli were labeled by incoming blue fibers. Expression of MOL2.3 in cells of the autonomic nervous system supports the idea that at least some of the multiple olfactory receptor types serve functions others than odorant detection.
Collapse
Affiliation(s)
- Martina Weber
- Institute of Physiology, University of Hohenheim, Garbenstrasse 30, 70593 Stuttgart, Germany
| | | | | | | |
Collapse
|
22
|
Abstract
An estimated 5% of the human genome consists of interspersed duplications that have arisen over the past 35 million years of evolution. Two categories of such recently duplicated segments can be distinguished: segmental duplications between nonhomologous chromosomes (transchromosomal duplications) and duplications mainly restricted to a particular chromosome (chromosome-specific duplications). Many of these duplications exhibit an extraordinarily high degree of sequence identity at the nucleotide level (>95%) and span large genomic distances (1-100 kb). Preliminary analyses indicate that these same regions are targets for rapid evolutionary turnover among the genomes of closely related primates. The dynamic nature of these regions because of recurrent chromosomal rearrangement, and their ability to create fusion genes from juxtaposed cassettes suggest that duplicative transposition was an important force in the evolution of our genome.
Collapse
Affiliation(s)
- E E Eichler
- Dept of Genetics and Center for Human Genetics, Case Western Reserve School of Medicine and University Hospitals of Cleveland, Cleveland, OH 44106, USA.
| |
Collapse
|
23
|
Bagheri-Fam S, Ferraz C, Demaille J, Scherer G, Pfeifer D. Comparative genomics of the SOX9 region in human and Fugu rubripes: conservation of short regulatory sequence elements within large intergenic regions. Genomics 2001; 78:73-82. [PMID: 11707075 DOI: 10.1006/geno.2001.6648] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Campomelic dysplasia (CD), a human skeletal malformation syndrome with XY sex reversal, is caused by heterozygous mutations in and around the gene SOX9. SOX9 has an extended 5' control region, as indicated by CD translocation breakpoints scattered over 1 Mb proximal to SOX9 and by expression data from mice transgenic for human SOX9-spanning yeast artificial chromosomes. To identify long-range regulatory elements within the SOX9 5' control region, we compared approximately 3.7 Mb and 195 kb of sequence around human and Fugu rubripes SOX9, respectively. We identified only seven and five protein-coding genes in the human and F. rubripes sequences, respectively. Four of the F. rubripes genes have been mapped in humans; all reside on chromosome 17 but show extensive intrachromosomal gene shuffling compared with the gene order in F. rubripes. In both species, very large intergenic distances separate SOX9 from its directly flanking genes: 2 Mb and 500 kb on either side of SOX9 in humans, and 68 and 97 kb on either side of SOX9 in F. rubripes. Comparative sequence analysis of the intergenic regions revealed five conserved elements, E1-E5, up to 290 kb 5' to human SOX9 and up to 18 kb 5' to F. rubripes SOX9, and three such elements, E6-E8, 3' to SOX9. Where available, mouse sequences confirm conservation of the elements. From the yeast artificial chromosome transgenic data, elements E3-E5 are candidate enhancers for SOX9 expression in limb and vertebral column, and 8 of 10 CD translocation breakpoints separate these elements from SOX9.
Collapse
Affiliation(s)
- S Bagheri-Fam
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacherstr. 33, Freiburg, D-79106, Germany
| | | | | | | | | |
Collapse
|
24
|
Giorgi D, Ferraz C, Mattéi MG, Demaille J, Rouquier S. The myosin light chain kinase gene is not duplicated in mouse: partial structure and chromosomal localization of Mylk. Genomics 2001; 75:49-56. [PMID: 11472067 DOI: 10.1006/geno.2001.6571] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The gene encoding myosin light chain kinase (MYLK) is duplicated on human chromosome 3 (HSA3; 3p13;3q21) and on a chromosome with conserved synteny to HSA3 in most non-human primate species. In human, the functional copy resides on 3q21, whereas the 3p13 site contains a pseudogene. To trace the origin of the duplication, we characterized the mouse gene Mylk. A single sequence corresponding to the functional Mylk was detected. We sequenced a 180-kb bacterial artificial chromosome clone containing the 24 first exons of Mylk; the complete mouse gene is expected to span >200 kb. Comparisons with the draft of the human genome revealed that the sequence and structure of MYLK are conserved in mammals. Fluorescence in situ hybridization (FISH) analysis indicated that the mouse gene localizes to a single site on chromosome 16B4-B5, a region with conserved synteny with HSA3q. Our study provides information on both the structure and the evolution of MYLK in mammals and suggests that it was duplicated after the divergence of rodents and primates.
Collapse
Affiliation(s)
- D Giorgi
- IGH, CNRS UPR 1142, rue de la Cardonille, 34396 Montpellier, Cédex 5, France
| | | | | | | | | |
Collapse
|
25
|
Abstract
Olfactory receptors likely constitute the largest gene superfamily in the vertebrate genome. Here we present the nearly complete human olfactory subgenome elucidated by mining the genome draft with gene discovery algorithms. Over 900 olfactory receptor genes and pseudogenes (ORs) were identified, two-thirds of which were not annotated previously. The number of extrapolated ORs is in good agreement with previous theoretical predictions. The sequence of at least 63% of the ORs is disrupted by what appears to be a random process of pseudogene formation. ORs constitute 17 gene families, 4 of which contain more than 100 members each. "Fish-like" Class I ORs, previously considered a relic in higher tetrapods, constitute as much as 10% of the human repertoire, all in one large cluster on chromosome 11. Their lower pseudogene fraction suggests a functional significance. ORs are disposed on all human chromosomes except 20 and Y, and nearly 80% are found in clusters of 6-138 genes. A novel comparative cluster analysis was used to trace the evolutionary path that may have led to OR proliferation and diversification throughout the genome. The results of this analysis suggest the following genome expansion history: first, the generation of a "tetrapod-specific" Class II OR cluster on chromosome 11 by local duplication, then a single-step duplication of this cluster to chromosome 1, and finally an avalanche of duplication events out of chromosome 1 to most other chromosomes. The results of the data mining and characterization of ORs can be accessed at the Human Olfactory Receptor Data Exploratorium Web site (http://bioinfo.weizmann.ac.il/HORDE).
Collapse
Affiliation(s)
- G Glusman
- Department of Molecular Genetics and the Crown Human Genome Center, The Weizmann Institute of Science, Rehovot 76100, Israel
| | | | | | | |
Collapse
|
26
|
Giglio S, Broman KW, Matsumoto N, Calvari V, Gimelli G, Neumann T, Ohashi H, Voullaire L, Larizza D, Giorda R, Weber JL, Ledbetter DH, Zuffardi O. Olfactory receptor-gene clusters, genomic-inversion polymorphisms, and common chromosome rearrangements. Am J Hum Genet 2001; 68:874-83. [PMID: 11231899 PMCID: PMC1275641 DOI: 10.1086/319506] [Citation(s) in RCA: 281] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2000] [Accepted: 01/22/2001] [Indexed: 11/03/2022] Open
Abstract
The olfactory receptor (OR)-gene superfamily is the largest in the mammalian genome. Several of the human OR genes appear in clusters with > or = 10 members located on almost all human chromosomes, and some chromosomes contain more than one cluster. We demonstrate, by experimental and in silico data, that unequal crossovers between two OR gene clusters in 8p are responsible for the formation of three recurrent chromosome macrorearrangements and a submicroscopic inversion polymorphism. The first two macrorearrangements are the inverted duplication of 8p, inv dup(8p), which is associated with a distinct phenotype, and a supernumerary marker chromosome, +der(8)(8p23.1pter), which is also a recurrent rearrangement and is associated with minor anomalies. We demonstrate that it is the reciprocal of the inv dup(8p). The third macrorearrangment is a recurrent 8p23 interstitial deletion associated with heart defect. Since inv dup(8p)s originate consistently in maternal meiosis, we investigated the maternal chromosomes 8 in eight mothers of subjects with inv dup(8p) and in the mother of one subject with +der(8), by means of probes included between the two 8p-OR gene clusters. All the mothers were heterozygous for an 8p submicroscopic inversion that was delimited by the 8p-OR gene clusters and was present, in heterozygous state, in 26% of a population of European descent. Thus, inversion heterozygosity may cause susceptibility to unequal recombination, leading to the formation of the inv dup(8p) or to its reciprocal product, the +der(8p). After the Yp inversion polymorphism, which is the preferential background for the PRKX/PRKY translocation in XX males and XY females, the OR-8p inversion is the second genomic polymorphism that confers susceptibility to the formation of common chromosome rearrangements. Accordingly, it may be possible to develop a profile of the individual risk of having progeny with chromosome rearrangements.
Collapse
MESH Headings
- Chromosome Breakage/genetics
- Chromosome Deletion
- Chromosome Inversion
- Chromosomes, Artificial, Bacterial/genetics
- Chromosomes, Human, Pair 8/genetics
- Cloning, Molecular
- Contig Mapping
- Crossing Over, Genetic/genetics
- DNA Probes/genetics
- Electrophoresis, Gel, Pulsed-Field
- Female
- Genes, Duplicate/genetics
- Heterozygote
- Humans
- In Situ Hybridization, Fluorescence
- Male
- Microsatellite Repeats/genetics
- Multigene Family/genetics
- Polymorphism, Genetic/genetics
- Receptors, Odorant/genetics
Collapse
Affiliation(s)
- S Giglio
- Department of Human Genetics, The University of Chicago, IL 60637, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blöcker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowki J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, Szustakowki J. Initial sequencing and analysis of the human genome. Nature 2001; 409:860-921. [PMID: 11237011 DOI: 10.1038/35057062] [Citation(s) in RCA: 14509] [Impact Index Per Article: 630.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
Collapse
Affiliation(s)
- E S Lander
- Whitehead Institute for Biomedical Research, Center for Genome Research, Cambridge, MA 02142, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Abstract
A 140.7-kb segment of zebrafish genomic DNA known to contain odorant receptor (OR) genes was fully sequenced to characterize more completely the organization of this gene cluster. A total of 20 OR genes were identified in this region. The most highly related genes are grouped in closest proximity to one another and in the same transcriptional orientation, indicating that a series of tandem duplications was responsible for the expansion of the OR gene family in teleost fish. Our analysis also revealed sequences that may be involved in the transcriptional regulation of OR genes within the cluster.
Collapse
Affiliation(s)
- J C Dugas
- Department of Molecular and Cell Biology, Division of Neurobiology, University of California at Berkeley, 269 Life Sciences Addition, Berkeley, California 94720-3200, USA
| | | |
Collapse
|
29
|
Abstract
The vomeronasal organ (VNO) or Jacobson's organ is responsible in terrestrial vertebrates for the sensory perception of pheromones, chemicals that elicit stereotyped behaviors among individuals of the same species. Pheromone-induced behaviors and a functional VNO have been described in a number of mammals, but the existence of this sensory system in human is still debated. Recently, two nonhomologous gene families, V1R and V2R, encoding pheromone receptors have been identified in rat. These receptors belong to the seven-transmembrane domain G-protein-coupled receptor superfamily. We sought to characterize V1R-like genes in the human genome. We have identified seven different human sequences by PCR and library screening with rodent sequences. These human sequences exhibit characteristic features of V1R receptors and show 52%–59% of amino acid sequence identity with the rat sequences. Using PCR on a monochromosomal somatic cell hybrid panel and/or FISH, we demonstrate that these V1R-like sequences are distributed on chromosomes 7, 16, 20, 13, 14, 15, 21, and 22 and possibly on additional chromosomes. One sequence hybridizes to pericentromeric locations on all the acrocentric chromosomes (13, 14, 15, 21, and 22). All of the seven V1R-like sequences analyzed show interrupted reading frames, indicating that they represent nonfunctional pseudogenes. The preponderence of pseudogenes among human V1R sequences and the striking anatomical differences between rodent and human VNO raise the possibility that humans may have lost the V1R/VNO-mediated sensory functions of rodents.[Sequence data from this article have been deposited with the DDBJ/EMBL/GenBank Data Libraries under accession nos. U73852–73853 andAF253312–253316.]
Collapse
|
30
|
Abstract
The vomeronasal organ (VNO) or Jacobson's organ is responsible in terrestrial vertebrates for the sensory perception of pheromones, chemicals that elicit stereotyped behaviors among individuals of the same species. Pheromone-induced behaviors and a functional VNO have been described in a number of mammals, but the existence of this sensory system in human is still debated. Recently, two nonhomologous gene families, V1R and V2R, encoding pheromone receptors have been identified in rat. These receptors belong to the seven-transmembrane domain G-protein-coupled receptor superfamily. We sought to characterize V1R-like genes in the human genome. We have identified seven different human sequences by PCR and library screening with rodent sequences. These human sequences exhibit characteristic features of V1R receptors and show 52%-59% of amino acid sequence identity with the rat sequences. Using PCR on a monochromosomal somatic cell hybrid panel and/or FISH, we demonstrate that these V1R-like sequences are distributed on chromosomes 7, 16, 20, 13, 14, 15, 21, and 22 and possibly on additional chromosomes. One sequence hybridizes to pericentromeric locations on all the acrocentric chromosomes (13, 14, 15, 21, and 22). All of the seven V1R-like sequences analyzed show interrupted reading frames, indicating that they represent nonfunctional pseudogenes. The preponderence of pseudogenes among human V1R sequences and the striking anatomical differences between rodent and human VNO raise the possibility that humans may have lost the V1R/VNO-mediated sensory functions of rodents.
Collapse
Affiliation(s)
- D Giorgi
- Institut de Génétique Humaine, Centre Nationale de la Recherche Scientifique, Unité Propre de Recherche 1142, 34396 Montpellier, CEDEX 5, France
| | | | | | | |
Collapse
|
31
|
Abstract
The genomic and cDNA structures were studied for eight human olfactory receptor (OR) genes within the chromosome 17p13.3 cluster. A common gene structure was revealed, which included an approximately 1-kb intronless coding region terminated by a signal for polyadenylation and a variable number of upstream noncoding exons. The latter were found to be alternatively spliced, giving rise to different isoforms of OR mRNA. While the initial exons mostly agreed with previous computer predictions and were conserved within OR subfamilies, other upstream exons were novel and idiosyncratic. In some cases, repetitive sequences were involved in the generation of splice sites and putative transcription control elements. Such gene structure is consistent with early repertoire enhancement by retrogene generation, which was likely followed by extensive genomic duplication. Each OR gene had a unique signature of transcription factor elements, consistent with a combinatorial expression control mechanism.
Collapse
Affiliation(s)
- A Sosinsky
- Department of Molecular Genetics, The Crown Human Genome Center, Rehovot, 76100, Israel
| | | | | |
Collapse
|
32
|
Conzelmann S, Levai O, Bode B, Eisel U, Raming K, Breer H, Strotmann J. A novel brain receptor is expressed in a distinct population of olfactory sensory neurons. Eur J Neurosci 2000; 12:3926-34. [PMID: 11069588 DOI: 10.1046/j.1460-9568.2000.00286.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Three novel G-protein-coupled receptor genes related to the previously described RA1c gene have been isolated from the mouse genome. Expression of these genes has been detected in distinct areas of the brain and also in the olfactory epithelium of the nose. Developmental studies revealed a differential onset of expression: in the brain at embryonic stage 17, in the olfactory system at stage E12. In order to determine which cell type in the olfactory epithelium expresses this unique receptor type, a transgenic approach was employed which allowed a coexpression of histological markers together with the receptor and thus visualization of the appropriate cell population. It was found that the receptor-expressing cells were located very close to the basal membrane of the epithelium; however, the cells extended a dendritic process to the epithelial surface and their axons projected into the main olfactory bulb where they converged onto two or three glomeruli in the dorsal and posterior region of the bulb. Thus, these data provide evidence that this unique type of receptor is expressed in mature olfactory neurons and suggests that it may be involved in the detection of special odour molecules.
Collapse
Affiliation(s)
- S Conzelmann
- Institute of Physiology, University of Hohenheim, Garbenstrasse 30, 70593 Stuttgart, Germany
| | | | | | | | | | | | | |
Collapse
|
33
|
Abstract
The olfactory receptor multigene family is organized in clusters spread throughout the genome. In the present study, we have sequenced two subregions of the mOR37 gene cluster on mouse chromosome 4. The resulting 100 kb of sequence revealed seven odorant receptor coding regions and one gene fragment. Sequence analyses reveal that the mOR37 gene cluster may represent a rather ancient cluster. The mOR37 genes exhibit a complex intron/exon structure, and some appear to be differentially spliced. All genes in the cluster share conserved sequence motifs 5' of their putative initial exons, which represent potential binding sites for transcription factors. The clustered organization and conserved sequence motifs suggest common expression control mechanisms for these genes.
Collapse
Affiliation(s)
- R Hoppe
- Institute of Physiology, University of Hohenheim, Garbenstrasse 30, Stuttgart, D-70593, Germany
| | | | | | | | | |
Collapse
|
34
|
Horvath JE, Schwartz S, Eichler EE. The mosaic structure of human pericentromeric DNA: a strategy for characterizing complex regions of the human genome. Genome Res 2000; 10:839-52. [PMID: 10854415 PMCID: PMC310890 DOI: 10.1101/gr.10.6.839] [Citation(s) in RCA: 89] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The pericentromeric regions of human chromosomes pose particular problems for both mapping and sequencing. These difficulties are due, in large part, to the presence of duplicated genomic segments that are distributed among multiple human chromosomes. To ensure contiguity of genomic sequence in these regions, we designed a sequence-based strategy to characterize different pericentromeric regions using a single (162 kb) 2p11 seed sequence as a point of reference. Molecular and cytogenetic techniques were first used to construct a paralogy map that delineated the interchromosomal distribution of duplicated segments throughout the human genome. Monochromosomal hybrid DNAs were PCR amplified by primer pairs designed to the 2p11 reference sequence. The PCR products were directly sequenced and used to develop a catalog of sequence tags for each duplicon for each chromosome. A total of 685 paralogous sequence variants were generated by sequencing 34.7 kb of paralogous pericentromeric sequence. Using PCR products as hybridization probes, we were able to identify 702 human BAC clones, of which a subset, 107 clones, were analyzed at the sequence level. We used diagnostic paralogous sequence variants to assign 65 of these BACs to at least 9 chromosomal pericentromeric regions: 1q12, 2p11, 9p11/q12, 10p11, 14q11, 15q11, 16p11, 17p11, and 22q11. Comparisons with existing sequence and physical maps for the human genome suggest that many of these BACs map to regions of the genome with sequence gaps. Our analysis indicates that large portions of pericentromeric DNA are virtually devoid of unique sequences. Instead, they consist of a mosaic of different genomic segments that have had different propensities for duplication. These biologic properties may be exploited for the rapid characterization of, not only pericentromeric DNA, but also other complex paralogous regions of the human genome.
Collapse
Affiliation(s)
- J E Horvath
- Department of Genetics and Center for Human Genetics, Case Western Reserve School of Medicine and University Hospitals of Cleveland, Cleveland, Ohio 44106 USA
| | | | | |
Collapse
|
35
|
Abstract
Chromosome-specific low-copy repeats, or duplicons, occur in multiple regions of the human genome. Homologous recombination between different duplicon copies leads to chromosomal rearrangements, such as deletions, duplications, inversions, and inverted duplications, depending on the orientation of the recombining duplicons. When such rearrangements cause dosage imbalance of a developmentally important gene(s), genetic diseases now termed genomic disorders result, at a frequency of 0.7-1/1000 births. Duplicons can have simple or very complex structures, with variation in copy number from 2 to >10 repeats, and each varying in size from a few kilobases in length to hundreds of kilobases. Analysis of the different duplicons involved in human genomic disorders identifies features that may predispose to recombination, including large size and high sequence identity between the recombining copies, putative recombination promoting features, and the presence of multiple genes/pseudogenes that may include genes expressed in germ cells. Most of the chromosome rearrangements involve duplicons near pericentromeric regions, which may relate to the propensity of such regions to accumulate duplicons. Detailed analyses of the structure, polymorphic variation, and mechanisms of recombination in genomic disorders, as well as the evolutionary origin of various duplicons will further our understanding of the structure, function, and fluidity of the human genome.
Collapse
Affiliation(s)
- Y Ji
- Department of Genetics, Case Western Reserve University School of Medicine, and Center for Human Genetics, University Hospitals of Cleveland, Cleveland, Ohio 44106 USA
| | | | | | | |
Collapse
|
36
|
Rouquier S, Blancher A, Giorgi D. The olfactory receptor gene repertoire in primates and mouse: evidence for reduction of the functional fraction in primates. Proc Natl Acad Sci U S A 2000; 97:2870-4. [PMID: 10706615 PMCID: PMC16022 DOI: 10.1073/pnas.040580197] [Citation(s) in RCA: 163] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Olfactory receptors (ORs) located in the cell membrane of olfactory sensory neurons of the nasal epithelium are responsible for odor detection by binding specific odorant ligands. Primates are thought to have a reduced sense of smell (microsmatic) with respect to other mammals such as dogs or rodents. We have previously demonstrated that over 70% of the human OR genes have become nonfunctional pseudogenes, leading us to hypothesize that the reduced sense of smell could correlate with the loss of functional genes. To extend these results, we sampled the OR gene repertoire of 10 primate species, from prosimian lemur to human, in addition to mouse. About 221 previously unidentified primate sequences and 33 mouse sequences were analyzed. These sequences encode ORs distributed in seven families and 56 subfamilies. Analysis showed a high fraction ( approximately 50% on average) of pseudogenes in hominoids. In contrast, only approximately 27% of OR genes are pseudogenes in Old World monkeys, and New World monkeys are almost free of pseudogenes. The prosimian branch seems to have evolved differently from the other primates and has approximately 37% pseudogene content. No pseudogenes were found in mouse. With the exception of New World monkeys, we demonstrate that primates have a high fraction of OR pseudogenes compared with mouse. We hypothesize that under relaxed selective constraints, primates would have progressively accumulated pseudogenes with the highest level seen in hominoids. The fraction of pseudogenes in the OR gene repertoire could parallel the evolution of the olfactory sensory function.
Collapse
Affiliation(s)
- S Rouquier
- Institut de Génétique Humaine, Centre National de la Recherche Scientifique Unité Propre de Recherche 1142, 141 rue de la Cardonille, 34396 Montpellier cédex 5, France
| | | | | |
Collapse
|
37
|
Abstract
Pseudogenes are commonly encountered during investigation of the genomes of a wide range of life forms. This review concentrates on vertebrate, and in particular mammalian, pseudogenes and describes their origin and subsequent evolution. Consideration is also given to pseudogenes that are transcribed and to the unusual group of genes that exist at the interface between functional genes and non-functional pseudogenes. As the sequences of different genomes are characterised, the recognition and interpretation of pseudogene sequences will become more important and have a greater impact in the field of molecular genetics.
Collapse
Affiliation(s)
- A J Mighell
- Molecular Medicine Unit, The University of Leeds, Clinical Sciences Building, St. James's University Hospital, Leeds, UK.
| | | | | | | |
Collapse
|
38
|
Glusman G, Sosinsky A, Ben-Asher E, Avidan N, Sonkin D, Bahar A, Rosenthal A, Clifton S, Roe B, Ferraz C, Demaille J, Lancet D. Sequence, structure, and evolution of a complete human olfactory receptor gene cluster. Genomics 2000; 63:227-45. [PMID: 10673334 DOI: 10.1006/geno.1999.6030] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The olfactory receptor (OR) gene cluster on human chromosome 17p13.3 was subjected to mixed shotgun automated DNA sequencing. The resulting 412 kb of genomic sequence include 17 OR coding regions, 6 of which are pseudogenes. Six of the coding regions were discovered only upon genomic sequencing, while the others were previously reported as partial sequences. A comparison of DNA sequences in the vicinity of the OR coding regions revealed a common gene structure with an intronless coding region and at least one upstream noncoding exon. Potential gene control regions including specific pyrimidine:purine tracts and Olf-1 sites have been identified. One of the pseudogenes apparently has evolved into a CpG island. Four extensive CpG islands can be discerned within the cluster, not coupled to specific OR genes. The cluster is flanked at its telomeric end by an unidentified open reading frame (C17orf2) with no significant similarity to any known protein. A high proportion of the cluster sequence (about 60%) belongs to various families of interspersed repetitive elements, with a clear predominance of LINE repeats. The OR genes in the cluster belong to two families and seven subfamilies, which show a relatively high degree of intermixing along the cluster, in seemingly random orientations. This genomic organization may be best accounted for by a complex series of evolutionary events.
Collapse
Affiliation(s)
- G Glusman
- Department of Molecular Genetics and The Crown Human Genome Center, The Weizmann Institute of Science, Rehovot, 76100, Israel
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
O’keefe C, Eichler E. The Pathological Consequences and Evolutionary Implications of Recent Human Genomic Duplications. Comparative Genomics 2000. [DOI: 10.1007/978-94-011-4309-7_5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
40
|
Abstract
Gene duplication is one of the primary forces of evolutionary change. We present data from three different pericentromeric regions of human chromosomes, which indicate that such regions of the genome have been sites of recent genomic duplication. This form of duplication has involved the evolutionary movement of segments of genomic material, including both intronic and exonic sequence, from diverse regions of the genome toward the pericentromeric regions. Sequence analyses of the target sites of duplication have identified a novel class of interspersed GC-rich repeats located precisely at the boundaries of duplication. Estimates of the evolutionary age of these duplications indicate that they have occurred between 10 and 25 mya. In contrast, comparative analyses confirm that the GC-rich pericentromeric repeats have existed within the pericentromeric regions of primate chromosomes before the divergence of the cercopithecoid and hominoid lineages ( approximately 30 mya). These data provide molecular evidence for considerable interchromosomal duplication of genic segments during the evolution of the hominoid genome and strongly implicate GC-rich repeat elements as playing a direct role in the pericentromeric localization of these events
Collapse
MESH Headings
- Animals
- Base Sequence
- Centromere
- Chromosomes, Human, Pair 1
- Chromosomes, Human, Pair 10
- Chromosomes, Human, Pair 14
- Chromosomes, Human, Pair 15
- Chromosomes, Human, Pair 16
- Chromosomes, Human, Pair 2
- Chromosomes, Human, Pair 22
- Chromosomes, Human, Pair 9
- Evolution, Molecular
- Gene Duplication
- Hominidae/genetics
- Humans
- In Situ Hybridization, Fluorescence
- Microsatellite Repeats/genetics
- Molecular Sequence Data
- Sequence Alignment
- X Chromosome
Collapse
Affiliation(s)
- E E Eichler
- Department of Genetics and Center for Human Genetics, Case Western Reserve School of Medicine and University Hospitals of Cleveland, Cleveland, Ohio 44106, USA.
| | | | | |
Collapse
|
41
|
Sharon D, Glusman G, Pilpel Y, Khen M, Gruetzner F, Haaf T, Lancet D. Primate evolution of an olfactory receptor cluster: diversification by gene conversion and recent emergence of pseudogenes. Genomics 1999; 61:24-36. [PMID: 10512677 DOI: 10.1006/geno.1999.5900] [Citation(s) in RCA: 91] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The olfactory receptor (OR) subgenome harbors the largest known gene family in mammals, disposed in clusters on numerous chromosomes. We have carried out a comparative evolutionary analysis of the best characterized genomic OR gene cluster, on human chromosome 17p13. Fifteen orthologs from chimpanzee (localized to chromosome 19p15), as well as key OR counterparts from other primates, have been identified and sequenced. Comparison among orthologs and paralogs revealed a multiplicity of gene conversion events, which occurred exclusively within OR subfamilies. These appear to lead to segment shuffling in the odorant binding site, an evolutionary process reminiscent of somatic combinatorial diversification in the immune system. We also demonstrate that the functional mammalian OR repertoire has undergone a rapid decline in the past 10 million years: while for the common ancestor of all great apes an intact OR cluster is inferred, in present-day humans and great apes the cluster includes nearly 40% pseudogenes.
Collapse
Affiliation(s)
- D Sharon
- Department of Molecular Genetics and the Crown Human Genome Center, The Weizmann Institute of Science, Rehovot, 76100, Israel
| | | | | | | | | | | | | |
Collapse
|
42
|
Abstract
The sense of smell is highly sophisticated in vertebrates but Homo sapiens ranks low in olfactory performance when compared to other species - why? Olfaction initiates with the interaction of odorants with specific receptors on the surface of olfactory sensory neurons in the nose. The genes encoding odorant receptors form the largest family in the vertebrate genome, numbering as many as 1000 in rodents. It has recently come to light that the repertoire of human odorant receptor genes, unlike in other vertebrates, is riddled with pseudogenes.
Collapse
Affiliation(s)
- P Mombaerts
- The Rockefeller University 1230 York Avenue, New York, New York, 10021, USA.
| |
Collapse
|