1
|
|
2
|
Katinka MD, Duprat S, Cornillot E, Méténier G, Thomarat F, Prensier G, Barbe V, Peyretaillade E, Brottier P, Wincker P, Delbac F, El Alaoui H, Peyret P, Saurin W, Gouy M, Weissenbach J, Vivarès CP. Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi. Nature 2001; 414:450-3. [PMID: 11719806 DOI: 10.1038/35106579] [Citation(s) in RCA: 730] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Microsporidia are obligate intracellular parasites infesting many animal groups. Lacking mitochondria and peroxysomes, these unicellular eukaryotes were first considered a deeply branching protist lineage that diverged before the endosymbiotic event that led to mitochondria. The discovery of a gene for a mitochondrial-type chaperone combined with molecular phylogenetic data later implied that microsporidia are atypical fungi that lost mitochondria during evolution. Here we report the DNA sequences of the 11 chromosomes of the approximately 2.9-megabase (Mb) genome of Encephalitozoon cuniculi (1,997 potential protein-coding genes). Genome compaction is reflected by reduced intergenic spacers and by the shortness of most putative proteins relative to their eukaryote orthologues. The strong host dependence is illustrated by the lack of genes for some biosynthetic pathways and for the tricarboxylic acid cycle. Phylogenetic analysis lends substantial credit to the fungal affiliation of microsporidia. Because the E. cuniculi genome contains genes related to some mitochondrial functions (for example, Fe-S cluster assembly), we hypothesize that microsporidia have retained a mitochondrion-derived organelle.
Collapse
Affiliation(s)
- M D Katinka
- Genoscope, UMR CNRS 8030, CP 5706, 91057 Evry cedex, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Lecompte O, Ripp R, Puzos-Barbe V, Duprat S, Heilig R, Dietrich J, Thierry JC, Poch O. Genome evolution at the genus level: comparison of three complete genomes of hyperthermophilic archaea. Genome Res 2001; 11:981-93. [PMID: 11381026 PMCID: PMC311118 DOI: 10.1101/gr.gr1653r] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We have compared three complete genomes of closely related hyperthermophilic species of Archaea belonging to the Pyrococcus genus: Pyrococcus abyssi, Pyrococcus horikoshii, and Pyrococcus furiosus. At the genomic level, the comparison reveals a differential conservation among four regions of the Pyrococcus chromosomes correlated with the location of genetic elements mediating DNA reorganization. This discloses the relative contribution of the major mechanisms that promote genomic plasticity in these Archaea, namely rearrangements linked to the replication terminus, insertion sequence-mediated recombinations, and DNA integration within tRNA genes. The combination of these mechanisms leads to a high level of genomic plasticity in these hyperthermophilic Archaea, at least comparable to the plasticity observed between closely related bacteria. At the proteomic level, the comparison of the three Pyrococcus species sheds light on specific selection pressures acting both on their coding capacities and evolutionary rates. Indeed, thanks to two independent methods, the "reciprocal best hits" approach and a new distance ratio analysis, we detect the false orthology relationships within the Pyrococcus lineage. This reveals a high amount of differential gains and losses of genes since the divergence of the three closely related species. The resulting polymorphism is probably linked to an adaptation of these free-living organisms to differential environmental constraints. As a corollary, we delineate the set of orthologous genes shared by the three species, that is, the genes that may characterize the Pyrococcus genus. In this conserved core, the amino acid substitution rate is equal between P. abyssi and P. horikoshii for most of their shared proteins, even for fast-evolving ones. In contrast, strong discrepancies exist among the substitution rates observed in P. furiosus relative to the two other species, which is in disagreement with the molecular clock hypothesis.
Collapse
Affiliation(s)
- O Lecompte
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, UPR 9004, Illkirch, CU de Strasbourg, France
| | | | | | | | | | | | | | | |
Collapse
|
4
|
Peyret P, Katinka MD, Duprat S, Duffieux F, Barbe V, Barbazanges M, Weissenbach J, Saurin W, Vivarès CP. Sequence and analysis of chromosome I of the amitochondriate intracellular parasite Encephalitozoon cuniculi (Microspora). Genome Res 2001; 11:198-207. [PMID: 11157783 PMCID: PMC311017 DOI: 10.1101/gr.164301] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
A DNA sequencing program was applied to the small (<3 Mb) genome of the microsporidian Encephalitozoon cuniculi, an amitochondriate eukaryotic parasite of mammals, and the sequence of the smallest chromosome was determined. The approximately 224-kb E. cuniculi chromosome I exhibits a dyad symmetry characterized by two identical 37-kb subtelomeric regions which are divergently oriented and extend just downstream of the inverted copies of an 8-kb duplicated cluster of six genes. Each subtelomeric region comprises a single 16S-23S rDNA transcription unit, flanked by various tandemly repeated sequences, and ends with approximately 1 kb of heterogeneous telomeric repeats. The central (or core) region of the chromosome harbors a highly compact arrangement of 132 potential protein-coding genes plus two tRNA genes (one gene per 1.14 kb). Most genes occur as single copies with no identified introns. Of these putative genes, only 53 could be assigned to known functions. A number of genes from the transcription and translation machineries as well as from other cellular processes display characteristic eukaryotic signatures or are clearly eukaryote-specific.
Collapse
Affiliation(s)
- P Peyret
- Equipe de Parasitologie Moléculaire et Cellulaire, LBP, UMR CNRS 6023, Université Blaise Pascal, 63177 Aubière Cedex, France. pierre.peyret@ lbp.univ-bpclermont.fr
| | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Chardon P, Rogel-Gaillard C, Cattolico L, Duprat S, Vaiman M, Renard C. Sequence of the swine major histocompatibility complex region containing all non-classical class I genes. Tissue Antigens 2001; 57:55-65. [PMID: 11169259 DOI: 10.1034/j.1399-0039.2001.057001055.x] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
A segment of 158,063 nucleotides of the pig major histocompatibility complex (SLA) and corresponding to the junction of the class I and class III regions was sequenced entirely. The centromeric part of the segment contained six class III genes including the three tumor necrosis factor genes, while the telomeric part contained three genes belonging to the class I region. The order and the molecular organization of these genes were exactly conserved in the SLA and HLA complexes, except for the SC1 gene which displayed a shift of the reading frame in swine. The cluster of the three SLA class I-related genes (Ib) and the MIC1 and MIC2 genes were located in the middle of the segment, in the following order from the centromeric side onwards, SLA-6, SLA-7, SLA-8, MIC-1 and MIC-2. All three SLA Ib genes displayed an overall molecular structure compatible with the expression of membrane-anchored glycoproteins. The SLA-7 and SLA-8 genes bear greater resemblance than to the SLA-6 gene. Six SLA-6 alleles have been previously defined differing each from the other by unique point mutations. One of them, appeared to have arisen through the occurrence of a gene conversion event in which the SLA-7 gene served as template. Only MIC-2 gene might be functional, the second MIC-1 gene being truncated. In all, the 14 genes characterized spans 37% of the total sequence. The remaining 63% nucleotides comprised a number of repeat DNA motives, including LINE fragments, SINEs, microsatellites, and also numerous nucleotide stretches not yet defined in swine.
Collapse
Affiliation(s)
- P Chardon
- Laboratoire de Radiobiologie et Etude du Génome, INRA CEA, Jouy en Josas, France.
| | | | | | | | | | | |
Collapse
|
6
|
Artiguenave F, Wincker P, Brottier P, Duprat S, Jovelin F, Scarpelli C, Verdier J, Vico V, Weissenbach J, Saurin W. Genomic exploration of the hemiascomycetous yeasts: 2. Data generation and processing. FEBS Lett 2000; 487:13-6. [PMID: 11152877 DOI: 10.1016/s0014-5793(00)02273-0] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
The generation of sequencing data for the hemiascomycetous yeast random sequence tag project was performed using the procedures established at GENOSCOPE. These procedures include a series of protocols for the sequencing reactions, using infra-red labelled primers, performed on both ends of the plasmid inserts in the same reaction tube, and their analysis on automated DNA sequencers. They also include a package of computer programs aimed at detecting potential assignation errors, selecting good quality sequences and estimating their useful length.
Collapse
Affiliation(s)
- F Artiguenave
- GENOSCOPE, Centre National de Séquençage, 2 rue Gaston Crémieux, P.O. Box 91, Cedex, F-Evry, France.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Deloukas P, Schuler GD, Gyapay G, Beasley EM, Soderlund C, Rodriguez-Tomé P, Hui L, Matise TC, McKusick KB, Beckmann JS, Bentolila S, Bihoreau M, Birren BB, Browne J, Butler A, Castle AB, Chiannilkulchai N, Clee C, Day PJ, Dehejia A, Dibling T, Drouot N, Duprat S, Fizames C, Fox S, Gelling S, Green L, Harrison P, Hocking R, Holloway E, Hunt S, Keil S, Lijnzaad P, Louis-Dit-Sully C, Ma J, Mendis A, Miller J, Morissette J, Muselet D, Nusbaum HC, Peck A, Rozen S, Simon D, Slonim DK, Staples R, Stein LD, Stewart EA, Suchard MA, Thangarajah T, Vega-Czarny N, Webber C, Wu X, Hudson J, Auffray C, Nomura N, Sikela JM, Polymeropoulos MH, James MR, Lander ES, Hudson TJ, Myers RM, Cox DR, Weissenbach J, Boguski MS, Bentley DR. A physical map of 30,000 human genes. Science 1998; 282:744-6. [PMID: 9784132 DOI: 10.1126/science.282.5389.744] [Citation(s) in RCA: 434] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
A map of 30,181 human gene-based markers was assembled and integrated with the current genetic map by radiation hybrid mapping. The new gene map contains nearly twice as many genes as the previous release, includes most genes that encode proteins of known function, and is twofold to threefold more accurate than the previous version. A redesigned, more informative and functional World Wide Web site (www.ncbi.nlm.nih.gov/genemap) provides the mapping information and associated data and annotations. This resource constitutes an important infrastructure and tool for the study of complex genetic traits, the positional cloning of disease genes, the cross-referencing of mammalian genomes, and validated human transcribed sequences for large-scale studies of gene expression.
Collapse
Affiliation(s)
- P Deloukas
- Sanger Centre, Hinxton Hall, Hinxton, Cambridge CB10 1SA UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Schuler GD, Boguski MS, Stewart EA, Stein LD, Gyapay G, Rice K, White RE, Rodriguez-Tom P, Aggarwal A, Bajorek E, Bentolila S, Birren BB, Butler A, Castle AB, Chiannilkulchai N, Chu A, Clee C, Cowles S, Day PJR, Dibling T, East C, Drouot N, Dunham I, Duprat S, Edwards C, Fan JB, Fang N, Fizames C, Garrett C, Green L, Hadley D, Harris M, Harrison P, Brady S, Hicks A, Holloway E, Hui L, Hussain S, Louis-Dit-Sully C, Ma J, MacGilvery A, Mader C, Maratukulam A, Matise TC, McKusick KB, Morissette J, Mungall A, Muselet D, Nusbaum HC, Page DC, Peck A, Perkins S, Piercy M, Qin F, Quackenbush J, Ranby S, Reif T, Rozen S, Sanders C, She X, Silva J, Slonim DK, Soderlund C, Sun WL, Tabar P, Thangarajah T, Vega-Czarny N, Vollrath D, Voyticky S, Wilmer T, Wu X, Adams MD, Auffray C, Walter NAR, Brandon R, Dehejia A, Goodfellow PN, Houlgatte R, Hudson JR, Ide SE, Iorio KR, Lee WY, Seki N, Nagase T, Ishikawa K, Nomura N, Phillips C, Polymeropoulos MH, Sandusky M, Schmitt K, Berry R, Swanson K, Torres R, Venter JC, Sikela JM, Beckmann JS, Weissenbach J, Myers RM, Cox DR, James MR, Bentley D, Deloukas P, Lander ES, Hudson TJ. A Gene Map of the Human Genome. Science 1996. [DOI: 10.1126/science.274.5287.540] [Citation(s) in RCA: 717] [Impact Index Per Article: 25.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
9
|
Schuler GD, Boguski MS, Stewart EA, Stein LD, Gyapay G, Rice K, White RE, Rodriguez-Tomé P, Aggarwal A, Bajorek E, Bentolila S, Birren BB, Butler A, Castle AB, Chiannilkulchai N, Chu A, Clee C, Cowles S, Day PJ, Dibling T, Drouot N, Dunham I, Duprat S, East C, Edwards C, Fan JB, Fang N, Fizames C, Garrett C, Green L, Hadley D, Harris M, Harrison P, Brady S, Hicks A, Holloway E, Hui L, Hussain S, Louis-Dit-Sully C, Ma J, MacGilvery A, Mader C, Maratukulam A, Matise TC, McKusick KB, Morissette J, Mungall A, Muselet D, Nusbaum HC, Page DC, Peck A, Perkins S, Piercy M, Qin F, Quackenbush J, Ranby S, Reif T, Rozen S, Sanders C, She X, Silva J, Slonim DK, Soderlund C, Sun WL, Tabar P, Thangarajah T, Vega-Czarny N, Vollrath D, Voyticky S, Wilmer T, Wu X, Adams MD, Auffray C, Walter NA, Brandon R, Dehejia A, Goodfellow PN, Houlgatte R, Hudson JR, Ide SE, Iorio KR, Lee WY, Seki N, Nagase T, Ishikawa K, Nomura N, Phillips C, Polymeropoulos MH, Sandusky M, Schmitt K, Berry R, Swanson K, Torres R, Venter JC, Sikela JM, Beckmann JS, Weissenbach J, Myers RM, Cox DR, James MR, Bentley D, Deloukas P, Lander ES, Hudson TJ. A gene map of the human genome. Science 1996; 274:540-6. [PMID: 8849440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The human genome is thought to harbor 50,000 to 100,000 genes, of which about half have been sampled to date in the form of expressed sequence tags. An international consortium was organized to develop and map gene-based sequence tagged site markers on a set of two radiation hybrid panels and a yeast artificial chromosome library. More than 16,000 human genes have been mapped relative to a framework map that contains about 1000 polymorphic genetic markers. The gene map unifies the existing genetic and physical maps with the nucleotide and protein sequence databases in a fashion that should speed the discovery of genes underlying inherited human disease. The integrated resource is available through a site on the World Wide Web at http://www.ncbi.nlm.nih.gov/SCIENCE96/.
Collapse
Affiliation(s)
- G D Schuler
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Houlgatte R, Mariage-Samson R, Duprat S, Tessier A, Bentolila S, Lamy B, Auffray C. The Genexpress Index: a resource for gene discovery and the genic map of the human genome. Genome Res 1995; 5:272-304. [PMID: 8593614 DOI: 10.1101/gr.5.3.272] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Detailed analysis of a set of 18,698 sequences derived from both ends of 10,979 human skeletal muscle and brain cDNA clones defined 6676 functional families, characterized by their sequence signatures over 5750 distinct human gene transcripts. About half of these genes have been assigned to specific chromosomes utilizing 2733 eSTS markers, the polymerase chain reaction, and DNA from human-rodent somatic cell hybrids. Sequence and clone clustering and a functional classification together with comprehensive data base searches and annotations made it possible to develop extensive sequence and map cross-indexes, define electronic expression profiles, identify a new set of overlapping genes, and provide numerous new candidate genes for human pathologies.
Collapse
Affiliation(s)
- R Houlgatte
- Genexpress, Centre National de la Recherche Scientifique (CNRS) UPR420, Villejuif, France.
| | | | | | | | | | | | | |
Collapse
|
11
|
Auffray C, Behar G, Bois F, Bouchier C, Da Silva C, Devignes MD, Duprat S, Houlgatte R, Jumeau MN, Lamy B. [IMAGE: molecular integration of the analysis of the human genome and its expression]. C R Acad Sci III 1995; 318:263-72. [PMID: 7757816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
We have developed an integrated approach for the analysis of human cDNA libraries from neuromuscular tissues, based on the acquisition of primary structural, expression and mapping data. 26,938 sequence signatures (over 7 million bases) have been derived from both ends of skeletal muscle and brain cDNA clones. Primary redundancy analysis and classification of database similarities made it possible to characterize by structural data about 8,000 human gene transcripts, the majority of which is catalogued for the first time. Collecting hybridization signatures of complex cDNA probes derived from the tissues of origin to cDNA clones arrayed on high density filters provided a global and quantifiable view of the complexity and level of expression of the different transcripts. The development of 2,792 eSTS markers amplifiable by PCR defined the chromosomal localization of some 2,500 genes corresponding to the transcripts sequenced. The data collected are part of the corpus of the human gene transcript catalog and the genic map of the human genome.
Collapse
|