1
|
Heilig R, Eckenberg R, Petit JL, Fonknechten N, Da Silva C, Cattolico L, Levy M, Barbe V, de Berardinis V, Ureta-Vidal A, Pelletier E, Vico V, Anthouard V, Rowen L, Madan A, Qin S, Sun H, Du H, Pepin K, Artiguenave F, Robert C, Cruaud C, Brüls T, Jaillon O, Friedlander L, Samson G, Brottier P, Cure S, Ségurens B, Anière F, Samain S, Crespeau H, Abbasi N, Aiach N, Boscus D, Dickhoff R, Dors M, Dubois I, Friedman C, Gouyvenoux M, James R, Madan A, Mairey-Estrada B, Mangenot S, Martins N, Ménard M, Oztas S, Ratcliffe A, Shaffer T, Trask B, Vacherie B, Bellemere C, Belser C, Besnard-Gonnet M, Bartol-Mavel D, Boutard M, Briez-Silla S, Combette S, Dufossé-Laurent V, Ferron C, Lechaplais C, Louesse C, Muselet D, Magdelenat G, Pateau E, Petit E, Sirvain-Trukniewicz P, Trybou A, Vega-Czarny N, Bataille E, Bluet E, Bordelais I, Dubois M, Dumont C, Guérin T, Haffray S, Hammadi R, Muanga J, Pellouin V, Robert D, Wunderle E, Gauguet G, Roy A, Sainte-Marthe L, Verdier J, Verdier-Discala C, Hillier L, Fulton L, McPherson J, Matsuda F, Wilson R, Scarpelli C, Gyapay G, Wincker P, Saurin W, Quétier F, Waterston R, Hood L, Weissenbach J. Publisher Correction: The DNA sequence and analysis of human chromosome 14. Nature 2023; 620:E17. [PMID: 37491470 DOI: 10.1038/s41586-023-06403-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Affiliation(s)
- Roland Heilig
- Genoscope-Centre National de Séquençage, 91000, Evry, France.
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France.
| | - Ralph Eckenberg
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Jean-Louis Petit
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Núria Fonknechten
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Corinne Da Silva
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | | | - Michaël Levy
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Valérie Barbe
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | - Eric Pelletier
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Virginie Vico
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Lee Rowen
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Anup Madan
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Shizhen Qin
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Hui Sun
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - Hui Du
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - Kymberlie Pepin
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | | | | | - Corinne Cruaud
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Thomas Brüls
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Olivier Jaillon
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | | | - Gaelle Samson
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | | | - Susan Cure
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Franck Anière
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Sylvie Samain
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Hervé Crespeau
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Nissa Abbasi
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Nathalie Aiach
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Didier Boscus
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Rachel Dickhoff
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Monica Dors
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Ivan Dubois
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | - Rose James
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Anuradha Madan
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | | | - Sophie Mangenot
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Manuela Ménard
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Sophie Oztas
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Amber Ratcliffe
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Tristan Shaffer
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Barbara Trask
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Benoit Vacherie
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Caroline Belser
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | - Magali Boutard
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | | | - Carolyne Ferron
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | | | | | - Emilie Pateau
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | - Arnaud Trybou
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Elodie Bataille
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Elodie Bluet
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Maria Dubois
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Corinne Dumont
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Thomas Guérin
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Rachid Hammadi
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | | | | | - Edith Wunderle
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Gilbert Gauguet
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Alice Roy
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - Jean Verdier
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | | | - LaDeana Hillier
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - Lucinda Fulton
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - John McPherson
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | | | - Richard Wilson
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | | | - Gábor Gyapay
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Patrick Wincker
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - William Saurin
- Genoscope-Centre National de Séquençage, 91000, Evry, France
| | - Francis Quétier
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| | - Robert Waterston
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri, 63108, USA
| | - Leroy Hood
- Institute for Systems Biology, Seattle, Washington, 98103, USA
| | - Jean Weissenbach
- Genoscope-Centre National de Séquençage, 91000, Evry, France
- UMR-8030, CNRS et Université d'Evry, 91000, Evry, France
| |
Collapse
|
2
|
Severin J, Beal K, Vilella AJ, Fitzgerald S, Schuster M, Gordon L, Ureta-Vidal A, Flicek P, Herrero J. eHive: an artificial intelligence workflow system for genomic analysis. BMC Bioinformatics 2010; 11:240. [PMID: 20459813 PMCID: PMC2885371 DOI: 10.1186/1471-2105-11-240] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2009] [Accepted: 05/11/2010] [Indexed: 12/03/2022] Open
Abstract
Background The Ensembl project produces updates to its comparative genomics resources with each of its several releases per year. During each release cycle approximately two weeks are allocated to generate all the genomic alignments and the protein homology predictions. The number of calculations required for this task grows approximately quadratically with the number of species. We currently support 50 species in Ensembl and we expect the number to continue to grow in the future. Results We present eHive, a new fault tolerant distributed processing system initially designed to support comparative genomic analysis, based on blackboard systems, network distributed autonomous agents, dataflow graphs and block-branch diagrams. In the eHive system a MySQL database serves as the central blackboard and the autonomous agent, a Perl script, queries the system and runs jobs as required. The system allows us to define dataflow and branching rules to suit all our production pipelines. We describe the implementation of three pipelines: (1) pairwise whole genome alignments, (2) multiple whole genome alignments and (3) gene trees with protein homology inference. Finally, we show the efficiency of the system in real case scenarios. Conclusions eHive allows us to produce computationally demanding results in a reliable and efficient way with minimal supervision and high throughput. Further documentation is available at: http://www.ensembl.org/info/docs/eHive/.
Collapse
Affiliation(s)
- Jessica Severin
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Wilkinson P, Sengerova J, Matteoni R, Chen CK, Soulat G, Ureta-Vidal A, Fessele S, Hagn M, Massimi M, Pickford K, Butler RH, Marschall S, Mallon AM, Pickard A, Raspa M, Scavizzi F, Fray M, Larrigaldie V, Leyritz J, Birney E, Tocchini-Valentini GP, Brown S, Herault Y, Montoliu L, de Angelis MH, Smedley D. EMMA--mouse mutant resources for the international scientific community. Nucleic Acids Res 2009; 38:D570-6. [PMID: 19783817 PMCID: PMC2808872 DOI: 10.1093/nar/gkp799] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The laboratory mouse is the premier animal model for studying human disease and thousands of mutants have been identified or produced, most recently through gene-specific mutagenesis approaches. High throughput strategies by the International Knockout Mouse Consortium (IKMC) are producing mutants for all protein coding genes. Generating a knock-out line involves huge monetary and time costs so capture of both the data describing each mutant alongside archiving of the line for distribution to future researchers is critical. The European Mouse Mutant Archive (EMMA) is a leading international network infrastructure for archiving and worldwide provision of mouse mutant strains. It operates in collaboration with the other members of the Federation of International Mouse Resources (FIMRe), EMMA being the European component. Additionally EMMA is one of four repositories involved in the IKMC, and therefore the current figure of 1700 archived lines will rise markedly. The EMMA database gathers and curates extensive data on each line and presents it through a user-friendly website. A BioMart interface allows advanced searching including integrated querying with other resources e.g. Ensembl. Other resources are able to display EMMA data by accessing our Distributed Annotation System server. EMMA database access is publicly available at http://www.emmanet.org.
Collapse
Affiliation(s)
- Phil Wilkinson
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Vilella AJ, Severin J, Ureta-Vidal A, Heng L, Durbin R, Birney E. EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates. Genome Res 2008; 19:327-35. [PMID: 19029536 DOI: 10.1101/gr.073585.107] [Citation(s) in RCA: 841] [Impact Index Per Article: 52.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
We have developed a comprehensive gene orientated phylogenetic resource, EnsemblCompara GeneTrees, based on a computational pipeline to handle clustering, multiple alignment, and tree generation, including the handling of large gene families. We developed two novel non-sequence-based metrics of gene tree correctness and benchmarked a number of tree methods. The TreeBeST method from TreeFam shows the best performance in our hands. We also compared this phylogenetic approach to clustering approaches for ortholog prediction, showing a large increase in coverage using the phylogenetic approach. All data are made available in a number of formats and will be kept up to date with the Ensembl project.
Collapse
Affiliation(s)
- Albert J Vilella
- EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | | | | | | | | | | |
Collapse
|
5
|
Ruan J, Li H, Chen Z, Coghlan A, Coin LJM, Guo Y, Hériché JK, Hu Y, Kristiansen K, Li R, Liu T, Moses A, Qin J, Vang S, Vilella AJ, Ureta-Vidal A, Bolund L, Wang J, Durbin R. TreeFam: 2008 Update. Nucleic Acids Res 2007; 36:D735-40. [PMID: 18056084 PMCID: PMC2238856 DOI: 10.1093/nar/gkm1005] [Citation(s) in RCA: 243] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
TreeFam (http://www.treefam.org) was developed to provide curated phylogenetic trees for all animal gene families, as well as orthologue and paralogue assignments. Release 4.0 of TreeFam contains curated trees for 1314 families and automatically generated trees for another 14 351 families. We have expanded TreeFam to include 25 fully sequenced animal genomes, as well as four genomes from plant and fungal outgroup species. We have also introduced more accurate approaches for automatically grouping genes into families, for building phylogenetic trees, and for inferring orthologues and paralogues. The user interface for viewing phylogenetic trees and family information has been improved. Furthermore, a new perl API lets users easily extract data from the TreeFam mysql database.
Collapse
Affiliation(s)
- Jue Ruan
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Heng Li
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Zhongzhong Chen
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Avril Coghlan
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Lachlan James M. Coin
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Yiran Guo
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Jean-Karim Hériché
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Yafeng Hu
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Karsten Kristiansen
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Ruiqiang Li
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Tao Liu
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Alan Moses
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Junjie Qin
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Søren Vang
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Albert J. Vilella
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Abel Ureta-Vidal
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Lars Bolund
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Jun Wang
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
| | - Richard Durbin
- Beijing Institute of Genomics of the Chinese Academy of Sciences, Beijing Genomics Institute, Beijing 101300, China, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, Department of Epidemiology & Public Health, Imperial College, St Mary's Campus, Norfolk Place, London W2 1PG, UK, Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Research Unit for Molecular Medicine, Aarhus University Hospital and Faculty of Health Sciences, University of Aarhus, DK-8200 Aarhus N, Denmark, EMBL-European Bioinformatics Institute, Hinxton, Cambridge, UK and Institute of Human Genetics, University of Aarhus, DK-8000 Aarhus C, Denmark
- *To whom correspondence should be addressed.+44 (0) 1223 834244+44 (0) 1223 494919
| |
Collapse
|
6
|
Flicek P, Aken BL, Beal K, Ballester B, Caccamo M, Chen Y, Clarke L, Coates G, Cunningham F, Cutts T, Down T, Dyer SC, Eyre T, Fitzgerald S, Fernandez-Banet J, Gräf S, Haider S, Hammond M, Holland R, Howe KL, Howe K, Johnson N, Jenkinson A, Kähäri A, Keefe D, Kokocinski F, Kulesha E, Lawson D, Longden I, Megy K, Meidl P, Overduin B, Parker A, Pritchard B, Prlic A, Rice S, Rios D, Schuster M, Sealy I, Slater G, Smedley D, Spudich G, Trevanion S, Vilella AJ, Vogel J, White S, Wood M, Birney E, Cox T, Curwen V, Durbin R, Fernandez-Suarez XM, Herrero J, Hubbard TJP, Kasprzyk A, Proctor G, Smith J, Ureta-Vidal A, Searle S. Ensembl 2008. Nucleic Acids Res 2007; 36:D707-14. [PMID: 18000006 PMCID: PMC2238821 DOI: 10.1093/nar/gkm988] [Citation(s) in RCA: 370] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The Ensembl project (http://www.ensembl.org) is a comprehensive genome information system featuring an integrated set of genome annotation, databases and other information for chordate and selected model organism and disease vector genomes. As of release 47 (October 2007), Ensembl fully supports 35 species, with preliminary support for six additional species. New species in the past year include platypus and horse. Major additions and improvements to Ensembl since our previous report include extensive support for functional genomics data in the form of a specialized functional genomics database, genome-wide maps of protein–DNA interactions and the Ensembl regulatory build; support for customization of the Ensembl web interface through the addition of user accounts and user groups; and increased support for genome resequencing. We have also introduced new comparative genomics-based data mining options and report on the continued development of our software infrastructure.
Collapse
Affiliation(s)
- P Flicek
- European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Margulies EH, Cooper GM, Asimenos G, Thomas DJ, Dewey CN, Siepel A, Birney E, Keefe D, Schwartz AS, Hou M, Taylor J, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Brown JB, Bickel P, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Stone EA, Rosenbloom KR, Kent WJ, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VVB, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Hinrichs A, Trumbower H, Clawson H, Zweig A, Kuhn RM, Barber G, Harte R, Karolchik D, Field MA, Moore RA, Matthewson CA, Schein JE, Marra MA, Antonarakis SE, Batzoglou S, Goldman N, Hardison R, Haussler D, Miller W, Pachter L, Green ED, Sidow A. Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome. Genome Res 2007; 17:760-74. [PMID: 17567995 PMCID: PMC1891336 DOI: 10.1101/gr.6034307] [Citation(s) in RCA: 170] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
A key component of the ongoing ENCODE project involves rigorous comparative sequence analyses for the initially targeted 1% of the human genome. Here, we present orthologous sequence generation, alignment, and evolutionary constraint analyses of 23 mammalian species for all ENCODE targets. Alignments were generated using four different methods; comparisons of these methods reveal large-scale consistency but substantial differences in terms of small genomic rearrangements, sensitivity (sequence coverage), and specificity (alignment accuracy). We describe the quantitative and qualitative trade-offs concomitant with alignment method choice and the levels of technical error that need to be accounted for in applications that require multisequence alignments. Using the generated alignments, we identified constrained regions using three different methods. While the different constraint-detecting methods are in general agreement, there are important discrepancies relating to both the underlying alignments and the specific algorithms. However, by integrating the results across the alignments and constraint-detecting methods, we produced constraint annotations that were found to be robust based on multiple independent measures. Analyses of these annotations illustrate that most classes of experimentally annotated functional elements are enriched for constrained sequences; however, large portions of each class (with the exception of protein-coding sequences) do not overlap constrained regions. The latter elements might not be under primary sequence constraint, might not be constrained across all mammals, or might have expendable molecular functions. Conversely, 40% of the constrained sequences do not overlap any of the functional elements that have been experimentally identified. Together, these findings demonstrate and quantify how many genomic functional elements await basic molecular characterization.
Collapse
Affiliation(s)
- Elliott H Margulies
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, Kuehn MS, Taylor CM, Neph S, Koch CM, Asthana S, Malhotra A, Adzhubei I, Greenbaum JA, Andrews RM, Flicek P, Boyle PJ, Cao H, Carter NP, Clelland GK, Davis S, Day N, Dhami P, Dillon SC, Dorschner MO, Fiegler H, Giresi PG, Goldy J, Hawrylycz M, Haydock A, Humbert R, James KD, Johnson BE, Johnson EM, Frum TT, Rosenzweig ER, Karnani N, Lee K, Lefebvre GC, Navas PA, Neri F, Parker SCJ, Sabo PJ, Sandstrom R, Shafer A, Vetrie D, Weaver M, Wilcox S, Yu M, Collins FS, Dekker J, Lieb JD, Tullius TD, Crawford GE, Sunyaev S, Noble WS, Dunham I, Denoeud F, Reymond A, Kapranov P, Rozowsky J, Zheng D, Castelo R, Frankish A, Harrow J, Ghosh S, Sandelin A, Hofacker IL, Baertsch R, Keefe D, Dike S, Cheng J, Hirsch HA, Sekinger EA, Lagarde J, Abril JF, Shahab A, Flamm C, Fried C, Hackermüller J, Hertel J, Lindemeyer M, Missal K, Tanzer A, Washietl S, Korbel J, Emanuelsson O, Pedersen JS, Holroyd N, Taylor R, Swarbreck D, Matthews N, Dickson MC, Thomas DJ, Weirauch MT, Gilbert J, Drenkow J, Bell I, Zhao X, Srinivasan KG, Sung WK, Ooi HS, Chiu KP, Foissac S, Alioto T, Brent M, Pachter L, Tress ML, Valencia A, Choo SW, Choo CY, Ucla C, Manzano C, Wyss C, Cheung E, Clark TG, Brown JB, Ganesh M, Patel S, Tammana H, Chrast J, Henrichsen CN, Kai C, Kawai J, Nagalakshmi U, Wu J, Lian Z, Lian J, Newburger P, Zhang X, Bickel P, Mattick JS, Carninci P, Hayashizaki Y, Weissman S, Hubbard T, Myers RM, Rogers J, Stadler PF, Lowe TM, Wei CL, Ruan Y, Struhl K, Gerstein M, Antonarakis SE, Fu Y, Green ED, Karaöz U, Siepel A, Taylor J, Liefer LA, Wetterstrand KA, Good PJ, Feingold EA, Guyer MS, Cooper GM, Asimenos G, Dewey CN, Hou M, Nikolaev S, Montoya-Burgos JI, Löytynoja A, Whelan S, Pardi F, Massingham T, Huang H, Zhang NR, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Seringhaus M, Church D, Rosenbloom K, Kent WJ, Stone EA, Batzoglou S, Goldman N, Hardison RC, Haussler D, Miller W, Sidow A, Trinklein ND, Zhang ZD, Barrera L, Stuart R, King DC, Ameur A, Enroth S, Bieda MC, Kim J, Bhinge AA, Jiang N, Liu J, Yao F, Vega VB, Lee CWH, Ng P, Shahab A, Yang A, Moqtaderi Z, Zhu Z, Xu X, Squazzo S, Oberley MJ, Inman D, Singer MA, Richmond TA, Munn KJ, Rada-Iglesias A, Wallerman O, Komorowski J, Fowler JC, Couttet P, Bruce AW, Dovey OM, Ellis PD, Langford CF, Nix DA, Euskirchen G, Hartman S, Urban AE, Kraus P, Van Calcar S, Heintzman N, Kim TH, Wang K, Qu C, Hon G, Luna R, Glass CK, Rosenfeld MG, Aldred SF, Cooper SJ, Halees A, Lin JM, Shulha HP, Zhang X, Xu M, Haidar JNS, Yu Y, Ruan Y, Iyer VR, Green RD, Wadelius C, Farnham PJ, Ren B, Harte RA, Hinrichs AS, Trumbower H, Clawson H, Hillman-Jackson J, Zweig AS, Smith K, Thakkapallayil A, Barber G, Kuhn RM, Karolchik D, Armengol L, Bird CP, de Bakker PIW, Kern AD, Lopez-Bigas N, Martin JD, Stranger BE, Woodroffe A, Davydov E, Dimas A, Eyras E, Hallgrímsdóttir IB, Huppert J, Zody MC, Abecasis GR, Estivill X, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VVB, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Koriabine M, Nefedov M, Osoegawa K, Yoshinaga Y, Zhu B, de Jong PJ. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 2007; 447:799-816. [PMID: 17571346 PMCID: PMC2212820 DOI: 10.1038/nature05874] [Citation(s) in RCA: 3782] [Impact Index Per Article: 222.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.
Collapse
|
9
|
Hubbard TJP, Aken BL, Beal K, Ballester B, Caccamo M, Chen Y, Clarke L, Coates G, Cunningham F, Cutts T, Down T, Dyer SC, Fitzgerald S, Fernandez-Banet J, Graf S, Haider S, Hammond M, Herrero J, Holland R, Howe K, Howe K, Johnson N, Kahari A, Keefe D, Kokocinski F, Kulesha E, Lawson D, Longden I, Melsopp C, Megy K, Meidl P, Ouverdin B, Parker A, Prlic A, Rice S, Rios D, Schuster M, Sealy I, Severin J, Slater G, Smedley D, Spudich G, Trevanion S, Vilella A, Vogel J, White S, Wood M, Cox T, Curwen V, Durbin R, Fernandez-Suarez XM, Flicek P, Kasprzyk A, Proctor G, Searle S, Smith J, Ureta-Vidal A, Birney E. Ensembl 2007. Nucleic Acids Res 2006; 35:D610-7. [PMID: 17148474 PMCID: PMC1761443 DOI: 10.1093/nar/gkl996] [Citation(s) in RCA: 657] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The Ensembl (http://www.ensembl.org/) project provides a comprehensive and integrated source of annotation of chordate genome sequences. Over the past year the number of genomes available from Ensembl has increased from 15 to 33, with the addition of sites for the mammalian genomes of elephant, rabbit, armadillo, tenrec, platypus, pig, cat, bush baby, common shrew, microbat and european hedgehog; the fish genomes of stickleback and medaka and the second example of the genomes of the sea squirt (Ciona savignyi) and the mosquito (Aedes aegypti). Some of the major features added during the year include the first complete gene sets for genomes with low-sequence coverage, the introduction of new strain variation data and the introduction of new orthology/paralog annotations based on gene trees.
Collapse
Affiliation(s)
- T J P Hubbard
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Birney E, Andrews D, Caccamo M, Chen Y, Clarke L, Coates G, Cox T, Cunningham F, Curwen V, Cutts T, Down T, Durbin R, Fernandez-Suarez XM, Flicek P, Gräf S, Hammond M, Herrero J, Howe K, Iyer V, Jekosch K, Kähäri A, Kasprzyk A, Keefe D, Kokocinski F, Kulesha E, London D, Longden I, Melsopp C, Meidl P, Overduin B, Parker A, Proctor G, Prlic A, Rae M, Rios D, Redmond S, Schuster M, Sealy I, Searle S, Severin J, Slater G, Smedley D, Smith J, Stabenau A, Stalker J, Trevanion S, Ureta-Vidal A, Vogel J, White S, Woodwark C, Hubbard TJP. Ensembl 2006. Nucleic Acids Res 2006; 34:D556-61. [PMID: 16381931 PMCID: PMC1347495 DOI: 10.1093/nar/gkj133] [Citation(s) in RCA: 323] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
The Ensembl () project provides a comprehensive and integrated source of annotation of large genome sequences. Over the last year the number of genomes available from the Ensembl site has increased from 4 to 19, with the addition of the mammalian genomes of Rhesus macaque and Opossum, the chordate genome of Ciona intestinalis and the import and integration of the yeast genome. The year has also seen extensive improvements to both data analysis and presentation, with the introduction of a redesigned website, the addition of RNA gene and regulatory annotation and substantial improvements to the integration of human genome variation data.
Collapse
Affiliation(s)
- E Birney
- European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Hubbard T, Andrews D, Caccamo M, Cameron G, Chen Y, Clamp M, Clarke L, Coates G, Cox T, Cunningham F, Curwen V, Cutts T, Down T, Durbin R, Fernandez-Suarez XM, Gilbert J, Hammond M, Herrero J, Hotz H, Howe K, Iyer V, Jekosch K, Kahari A, Kasprzyk A, Keefe D, Keenan S, Kokocinsci F, London D, Longden I, McVicker G, Melsopp C, Meidl P, Potter S, Proctor G, Rae M, Rios D, Schuster M, Searle S, Severin J, Slater G, Smedley D, Smith J, Spooner W, Stabenau A, Stalker J, Storey R, Trevanion S, Ureta-Vidal A, Vogel J, White S, Woodwark C, Birney E. Ensembl 2005. Nucleic Acids Res 2005; 33:D447-53. [PMID: 15608235 PMCID: PMC540092 DOI: 10.1093/nar/gki138] [Citation(s) in RCA: 354] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2004] [Revised: 11/01/2004] [Accepted: 11/01/2004] [Indexed: 11/17/2022] Open
Abstract
The Ensembl (http://www.ensembl.org/) project provides a comprehensive and integrated source of annotation of large genome sequences. Over the last year the number of genomes available from the Ensembl site has increased by 7 to 16, with the addition of the six vertebrate genomes of chimpanzee, dog, cow, chicken, tetraodon and frog and the insect genome of honeybee. The majority have been annotated automatically using the Ensembl gene build system, showing its flexibility to reliably annotate a wide variety of genomes. With the increased number of vertebrate genomes, the comparative analysis provided to users has been greatly improved, with new website interfaces allowing annotation of different genomes to be directly compared. The Ensembl software system is being increasingly widely reused in different projects showing the benefits of a completely open approach to software development and distribution.
Collapse
Affiliation(s)
- T Hubbard
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Abstract
OBJECTIVE We report an epidemiological study with an analysis of the risk factors of the HTLV-1 seroprevalence in pregnant women and their children in the town of St Laurent du Maroni, French Guyana. MATERIAL AND METHOD HTLV-1 seroprevalence and risk associated factors were first studied in all the pregnant women having delivered at St. Laurent between July 1991 and June 1993. Then, a retrospective analysis was performed in the children, aged between 18 months and 12 years old, born from HTLV-1 infected mothers, focusing especially on the duration of breast feeding and the level of HTLV-1 anti body titers and proviral load. RESULTS The global HTLV-1 seroprevalence was 4.4% (75/1727) but it was more prevalent among ethnic groups of African origin such as the Noir Marron population (5.5%) and Haitians (6.3%). In the Noir-Marron population, which represents 70% of the studied population, HTLV-1 seropositivity was associated with a maternal age of>35 years, prior miscarriage, prior cesarean section, parity>4, gravidity>6 and negative rhesus factor. After logistic regression, HTLV-1 seropositivity remained associated with gravidity>6 and negative rhesus factor. Out of the 216 children born from 81 HTLV-1 infected mothers, only 21 were found to be HTLV-1 seropositive, giving a crude HTLV-1 transmission rate of 9.7% while among the 180 breast-fed children 10.6% were HTLV-1 seropositive. HTLV-1 seropositivity in children was associated with elevated maternal anti HTLV-1 antibody titer, high maternal HTLV-1 proviral load and child's gender, girls being more frequently HTLV-1 infected than boys. CONCLUSION HTLV-1 infection, which can be responsible for severe pathologies in adults (adult T cell leukemia and tropical spastic paraparesis/HTLV-1 associated myelopathy) should be screened during pregnancy in women originating from high HTLV-1 endemic areas, as for France, mainly the French West Indies, French Guyana and Intertropical Africa. In case of HTLV-1 seropositivity, mothers should be informed on the risk of transmission and promotion of bottle feeding of their children should be strongly proposed.
Collapse
Affiliation(s)
- G Carles
- Service de Gynécologie-Obstétrique, Centre Hospitalier Franck Joly, 97320 Saint-Laurent-du-Maroni, Guyane
| | | | | | | | | | | | | |
Collapse
|
13
|
Birney E, Andrews TD, Bevan P, Caccamo M, Chen Y, Clarke L, Coates G, Cuff J, Curwen V, Cutts T, Down T, Eyras E, Fernandez-Suarez XM, Gane P, Gibbins B, Gilbert J, Hammond M, Hotz HR, Iyer V, Jekosch K, Kahari A, Kasprzyk A, Keefe D, Keenan S, Lehvaslaiho H, McVicker G, Melsopp C, Meidl P, Mongin E, Pettett R, Potter S, Proctor G, Rae M, Searle S, Slater G, Smedley D, Smith J, Spooner W, Stabenau A, Stalker J, Storey R, Ureta-Vidal A, Woodwark KC, Cameron G, Durbin R, Cox A, Hubbard T, Clamp M. An overview of Ensembl. Genome Res 2004; 14:925-8. [PMID: 15078858 PMCID: PMC479121 DOI: 10.1101/gr.1860604] [Citation(s) in RCA: 305] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Ensembl (http://www.ensembl.org/) is a bioinformatics project to organize biological information around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of individual genomes, and of the synteny and orthology relationships between them. It is also a framework for integration of any biological data that can be mapped onto features derived from the genomic sequence. Ensembl is available as an interactive Web site, a set of flat files, and as a complete, portable open source software system for handling genomes. All data are provided without restriction, and code is freely available. Ensembl's aims are to continue to "widen" this biological integration to include other model organisms relevant to understanding human biology as they become available; to "deepen" this integration to provide an ever more seamless linkage between equivalent components in different species; and to provide further classification of functional elements in the genome that have been previously elusive.
Collapse
Affiliation(s)
- Ewan Birney
- EMBL European Bioinformatics Institute, The Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Gibbs RA, Weinstock GM, Metzker ML, Muzny DM, Sodergren EJ, Scherer S, Scott G, Steffen D, Worley KC, Burch PE, Okwuonu G, Hines S, Lewis L, DeRamo C, Delgado O, Dugan-Rocha S, Miner G, Morgan M, Hawes A, Gill R, Celera, Holt RA, Adams MD, Amanatides PG, Baden-Tillson H, Barnstead M, Chin S, Evans CA, Ferriera S, Fosler C, Glodek A, Gu Z, Jennings D, Kraft CL, Nguyen T, Pfannkoch CM, Sitter C, Sutton GG, Venter JC, Woodage T, Smith D, Lee HM, Gustafson E, Cahill P, Kana A, Doucette-Stamm L, Weinstock K, Fechtel K, Weiss RB, Dunn DM, Green ED, Blakesley RW, Bouffard GG, De Jong PJ, Osoegawa K, Zhu B, Marra M, Schein J, Bosdet I, Fjell C, Jones S, Krzywinski M, Mathewson C, Siddiqui A, Wye N, McPherson J, Zhao S, Fraser CM, Shetty J, Shatsman S, Geer K, Chen Y, Abramzon S, Nierman WC, Havlak PH, Chen R, Durbin KJ, Simons R, Ren Y, Song XZ, Li B, Liu Y, Qin X, Cawley S, Worley KC, Cooney AJ, D'Souza LM, Martin K, Wu JQ, Gonzalez-Garay ML, Jackson AR, Kalafus KJ, McLeod MP, Milosavljevic A, Virk D, Volkov A, Wheeler DA, Zhang Z, Bailey JA, Eichler EE, Tuzun E, Birney E, Mongin E, Ureta-Vidal A, Woodwark C, Zdobnov E, Bork P, Suyama M, Torrents D, Alexandersson M, Trask BJ, Young JM, Huang H, Wang H, Xing H, Daniels S, Gietzen D, Schmidt J, Stevens K, Vitt U, Wingrove J, Camara F, Mar Albà M, Abril JF, Guigo R, Smit A, Dubchak I, Rubin EM, Couronne O, Poliakov A, Hübner N, Ganten D, Goesele C, Hummel O, Kreitler T, Lee YA, Monti J, Schulz H, Zimdahl H, Himmelbauer H, Lehrach H, Jacob HJ, Bromberg S, Gullings-Handley J, Jensen-Seaman MI, Kwitek AE, Lazar J, Pasko D, Tonellato PJ, Twigger S, Ponting CP, Duarte JM, Rice S, Goodstadt L, Beatson SA, Emes RD, Winter EE, Webber C, Brandt P, Nyakatura G, Adetobi M, Chiaromonte F, Elnitski L, Eswara P, Hardison RC, Hou M, Kolbe D, Makova K, Miller W, Nekrutenko A, Riemer C, Schwartz S, Taylor J, Yang S, Zhang Y, Lindpaintner K, Andrews TD, Caccamo M, Clamp M, Clarke L, Curwen V, Durbin R, Eyras E, Searle SM, Cooper GM, Batzoglou S, Brudno M, Sidow A, Stone EA, Venter JC, Payseur BA, Bourque G, López-Otín C, Puente XS, Chakrabarti K, Chatterji S, Dewey C, Pachter L, Bray N, Yap VB, Caspi A, Tesler G, Pevzner PA, Haussler D, Roskin KM, Baertsch R, Clawson H, Furey TS, Hinrichs AS, Karolchik D, Kent WJ, Rosenbloom KR, Trumbower H, Weirauch M, Cooper DN, Stenson PD, Ma B, Brent M, Arumugam M, Shteynberg D, Copley RR, Taylor MS, Riethman H, Mudunuri U, Peterson J, Guyer M, Felsenfeld A, Old S, Mockrin S, Collins F. Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature 2004; 428:493-521. [PMID: 15057822 DOI: 10.1038/nature02426] [Citation(s) in RCA: 1512] [Impact Index Per Article: 75.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2003] [Accepted: 02/20/2004] [Indexed: 01/16/2023]
Abstract
The laboratory rat (Rattus norvegicus) is an indispensable tool in experimental medicine and drug development, having made inestimable contributions to human health. We report here the genome sequence of the Brown Norway (BN) rat strain. The sequence represents a high-quality 'draft' covering over 90% of the genome. The BN rat sequence is the third complete mammalian genome to be deciphered, and three-way comparisons with the human and mouse genomes resolve details of mammalian evolution. This first comprehensive analysis includes genes and proteins and their relation to human disease, repeated sequences, comparative genome-wide studies of mammalian orthologous chromosomal regions and rearrangement breakpoints, reconstruction of ancestral karyotypes and the events leading to existing species, rates of variation, and lineage-specific and lineage-independent evolutionary events such as expansion of gene families, orthology relations and protein evolution.
Collapse
Affiliation(s)
- Richard A Gibbs
- Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, MS BCM226, One Baylor Plaza, Houston, Texas 77030, USA. http://www.hgsc.bcm.tmc.edu
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Porcel BM, Delfour O, Castelli V, De Berardinis V, Friedlander L, Cruaud C, Ureta-Vidal A, Scarpelli C, Wincker P, Schächter V, Saurin W, Gyapay G, Salanoubat M, Weissenbach J. Numerous novel annotations of the human genome sequence supported by a 5'-end-enriched cDNA collection. Genome Res 2004; 14:463-71. [PMID: 14962985 PMCID: PMC353234 DOI: 10.1101/gr.1481104] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
A collection of 90,000 human cDNA clones generated to increase the fraction of "full-length" cDNAs available was analyzed by sequence alignment on the human genome assembly. Five hundred fifty-two gene models not found in LocusLink, with coding regions of at least 300 bp, were defined by using this collection. Exon composition proposed for novel genes showed an average of 4.7 exons per gene. In 20% of the cases, at least half of the exons predicted for new genes coincided with evolutionary conserved regions defined by sequence comparisons with the pufferfish Tetraodon nigroviridis. Among this subset, CpG islands were observed at the 5' end of 75%. In-frame stop codons upstream of the initiator ATG were present in 49% of the new genes, and 16% contained a coding region comprising at least 50% of the cDNA sequence. This cDNA resource also provided candidate small protein-coding genes, usually not included in genome annotations. In addition, analysis of a sample from this cDNA collection indicates that approximately 380 gene models described in LocusLink could be extended at their 5' end by at least one new exon. Finally, this cDNA resource provided an experimental support for annotations based exclusively on predictions, thus representing a resource substantially improving the human genome annotation.
Collapse
Affiliation(s)
- Betina M Porcel
- Genoscope-Centre National de Séquençage and CNRS UMR-8030, 91000 Evry, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Birney E, Andrews D, Bevan P, Caccamo M, Cameron G, Chen Y, Clarke L, Coates G, Cox T, Cuff J, Curwen V, Cutts T, Down T, Durbin R, Eyras E, Fernandez-Suarez XM, Gane P, Gibbins B, Gilbert J, Hammond M, Hotz H, Iyer V, Kahari A, Jekosch K, Kasprzyk A, Keefe D, Keenan S, Lehvaslaiho H, McVicker G, Melsopp C, Meidl P, Mongin E, Pettett R, Potter S, Proctor G, Rae M, Searle S, Slater G, Smedley D, Smith J, Spooner W, Stabenau A, Stalker J, Storey R, Ureta-Vidal A, Woodwark C, Clamp M, Hubbard T. Ensembl 2004. Nucleic Acids Res 2004; 32:D468-70. [PMID: 14681459 PMCID: PMC308772 DOI: 10.1093/nar/gkh038] [Citation(s) in RCA: 143] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organize biology around the sequences of large genomes. It is a comprehensive and integrated source of annotation of large genome sequences, available via interactive website, web services or flat files. As well as being one of the leading sources of genome annotation, Ensembl is an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements. The facilities of the system range from sequence analysis to data storage and visualization and installations exist around the world both in companies and at academic sites. With a total of nine genome sequences available from Ensembl and more genomes to follow, recent developments have focused mainly on closer integration between genomes and external data.
Collapse
Affiliation(s)
- E Birney
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Abstract
The increasing number of complete and nearly complete metazoan genome sequences provides a significant amount of material for large-scale comparative genomic analysis. Finding new effective methods to analyse such enormous datasets has been the object of intense research. Three main areas in comparative genomics have recently shown important developments: whole-genome alignment, gene prediction and regulatory-region prediction. Each of these areas improves the methods of deciphering long genomic sequences and uncovering what lies hidden in them.
Collapse
Affiliation(s)
- Abel Ureta-Vidal
- EnsEMBL Project, Room A2-06, EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | | |
Collapse
|
18
|
Heilig R, Eckenberg R, Petit JL, Fonknechten N, Da Silva C, Cattolico L, Levy M, Barbe V, de Berardinis V, Ureta-Vidal A, Pelletier E, Vico V, Anthouard V, Rowen L, Madan A, Qin S, Sun H, Du H, Pepin K, Artiguenave F, Robert C, Cruaud C, Brüls T, Jaillon O, Friedlander L, Samson G, Brottier P, Cure S, Ségurens B, Anière F, Samain S, Crespeau H, Abbasi N, Aiach N, Boscus D, Dickhoff R, Dors M, Dubois I, Friedman C, Gouyvenoux M, James R, Madan A, Mairey-Estrada B, Mangenot S, Martins N, Ménard M, Oztas S, Ratcliffe A, Shaffer T, Trask B, Vacherie B, Bellemere C, Belser C, Besnard-Gonnet M, Bartol-Mavel D, Boutard M, Briez-Silla S, Combette S, Dufossé-Laurent V, Ferron C, Lechaplais C, Louesse C, Muselet D, Magdelenat G, Pateau E, Petit E, Sirvain-Trukniewicz P, Trybou A, Vega-Czarny N, Bataille E, Bluet E, Bordelais I, Dubois M, Dumont C, Guérin T, Haffray S, Hammadi R, Muanga J, Pellouin V, Robert D, Wunderle E, Gauguet G, Roy A, Sainte-Marthe L, Verdier J, Verdier-Discala C, Hillier L, Fulton L, McPherson J, Matsuda F, Wilson R, Scarpelli C, Gyapay G, Wincker P, Saurin W, Quétier F, Waterston R, Hood L, Weissenbach J. The DNA sequence and analysis of human chromosome 14. Nature 2003; 421:601-7. [PMID: 12508121 DOI: 10.1038/nature01348] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2002] [Accepted: 12/03/2002] [Indexed: 11/08/2022]
Abstract
Chromosome 14 is one of five acrocentric chromosomes in the human genome. These chromosomes are characterized by a heterochromatic short arm that contains essentially ribosomal RNA genes, and a euchromatic long arm in which most, if not all, of the protein-coding genes are located. The finished sequence of human chromosome 14 comprises 87,410,661 base pairs, representing 100% of its euchromatic portion, in a single continuous segment covering the entire long arm with no gaps. Two loci of crucial importance for the immune system, as well as more than 60 disease genes, have been localized so far on chromosome 14. We identified 1,050 genes and gene fragments, and 393 pseudogenes. On the basis of comparisons with other vertebrate genomes, we estimate that more than 96% of the chromosome 14 genes have been annotated. From an analysis of the CpG island occurrences, we estimate that 70% of these annotated genes are complete at their 5' end.
Collapse
Affiliation(s)
- Roland Heilig
- Genoscope-Centre National de Séquençage, 91000, Evry, France.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Clamp M, Andrews D, Barker D, Bevan P, Cameron G, Chen Y, Clark L, Cox T, Cuff J, Curwen V, Down T, Durbin R, Eyras E, Gilbert J, Hammond M, Hubbard T, Kasprzyk A, Keefe D, Lehvaslaiho H, Iyer V, Melsopp C, Mongin E, Pettett R, Potter S, Rust A, Schmidt E, Searle S, Slater G, Smith J, Spooner W, Stabenau A, Stalker J, Stupka E, Ureta-Vidal A, Vastrik I, Birney E. Ensembl 2002: accommodating comparative genomics. Nucleic Acids Res 2003; 31:38-42. [PMID: 12519943 PMCID: PMC165530 DOI: 10.1093/nar/gkg083] [Citation(s) in RCA: 180] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of human, mouse and other genome sequences, available as either an interactive web site or as flat files. Ensembl also integrates manually annotated gene structures from external sources where available. As well as being one of the leading sources of genome annotation, Ensembl is an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements. These range from sequence analysis to data storage and visualisation and installations exist around the world in both companies and at academic sites. With both human and mouse genome sequences available and more vertebrate sequences to follow, many of the recent developments in Ensembl have focusing on developing automatic comparative genome analysis and visualisation.
Collapse
Affiliation(s)
- M Clamp
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P, Antonarakis SE, Attwood J, Baertsch R, Bailey J, Barlow K, Beck S, Berry E, Birren B, Bloom T, Bork P, Botcherby M, Bray N, Brent MR, Brown DG, Brown SD, Bult C, Burton J, Butler J, Campbell RD, Carninci P, Cawley S, Chiaromonte F, Chinwalla AT, Church DM, Clamp M, Clee C, Collins FS, Cook LL, Copley RR, Coulson A, Couronne O, Cuff J, Curwen V, Cutts T, Daly M, David R, Davies J, Delehaunty KD, Deri J, Dermitzakis ET, Dewey C, Dickens NJ, Diekhans M, Dodge S, Dubchak I, Dunn DM, Eddy SR, Elnitski L, Emes RD, Eswara P, Eyras E, Felsenfeld A, Fewell GA, Flicek P, Foley K, Frankel WN, Fulton LA, Fulton RS, Furey TS, Gage D, Gibbs RA, Glusman G, Gnerre S, Goldman N, Goodstadt L, Grafham D, Graves TA, Green ED, Gregory S, Guigó R, Guyer M, Hardison RC, Haussler D, Hayashizaki Y, Hillier LW, Hinrichs A, Hlavina W, Holzer T, Hsu F, Hua A, Hubbard T, Hunt A, Jackson I, Jaffe DB, Johnson LS, Jones M, Jones TA, Joy A, Kamal M, Karlsson EK, Karolchik D, Kasprzyk A, Kawai J, Keibler E, Kells C, Kent WJ, Kirby A, Kolbe DL, Korf I, Kucherlapati RS, Kulbokas EJ, Kulp D, Landers T, Leger JP, Leonard S, Letunic I, Levine R, Li J, Li M, Lloyd C, Lucas S, Ma B, Maglott DR, Mardis ER, Matthews L, Mauceli E, Mayer JH, McCarthy M, McCombie WR, McLaren S, McLay K, McPherson JD, Meldrim J, Meredith B, Mesirov JP, Miller W, Miner TL, Mongin E, Montgomery KT, Morgan M, Mott R, Mullikin JC, Muzny DM, Nash WE, Nelson JO, Nhan MN, Nicol R, Ning Z, Nusbaum C, O'Connor MJ, Okazaki Y, Oliver K, Overton-Larty E, Pachter L, Parra G, Pepin KH, Peterson J, Pevzner P, Plumb R, Pohl CS, Poliakov A, Ponce TC, Ponting CP, Potter S, Quail M, Reymond A, Roe BA, Roskin KM, Rubin EM, Rust AG, Santos R, Sapojnikov V, Schultz B, Schultz J, Schwartz MS, Schwartz S, Scott C, Seaman S, Searle S, Sharpe T, Sheridan A, Shownkeen R, Sims S, Singer JB, Slater G, Smit A, Smith DR, Spencer B, Stabenau A, Stange-Thomann N, Sugnet C, Suyama M, Tesler G, Thompson J, Torrents D, Trevaskis E, Tromp J, Ucla C, Ureta-Vidal A, Vinson JP, Von Niederhausern AC, Wade CM, Wall M, Weber RJ, Weiss RB, Wendl MC, West AP, Wetterstrand K, Wheeler R, Whelan S, Wierzbowski J, Willey D, Williams S, Wilson RK, Winter E, Worley KC, Wyman D, Yang S, Yang SP, Zdobnov EM, Zody MC, Lander ES. Initial sequencing and comparative analysis of the mouse genome. Nature 2002; 420:520-62. [PMID: 12466850 DOI: 10.1038/nature01262] [Citation(s) in RCA: 4791] [Impact Index Per Article: 217.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2002] [Accepted: 10/31/2002] [Indexed: 12/18/2022]
Abstract
The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.
Collapse
MESH Headings
- Animals
- Base Composition
- Chromosomes, Mammalian/genetics
- Conserved Sequence/genetics
- CpG Islands/genetics
- Evolution, Molecular
- Gene Expression Regulation
- Genes/genetics
- Genetic Variation/genetics
- Genome
- Genome, Human
- Genomics
- Humans
- Mice/classification
- Mice/genetics
- Mice, Knockout
- Mice, Transgenic
- Models, Animal
- Multigene Family/genetics
- Mutagenesis
- Neoplasms/genetics
- Physical Chromosome Mapping
- Proteome/genetics
- Pseudogenes/genetics
- Quantitative Trait Loci/genetics
- RNA, Untranslated/genetics
- Repetitive Sequences, Nucleic Acid/genetics
- Selection, Genetic
- Sequence Analysis, DNA
- Sex Chromosomes/genetics
- Species Specificity
- Synteny
Collapse
|
21
|
Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J, Curwen V, Down T, Durbin R, Eyras E, Gilbert J, Hammond M, Huminiecki L, Kasprzyk A, Lehvaslaiho H, Lijnzaad P, Melsopp C, Mongin E, Pettett R, Pocock M, Potter S, Rust A, Schmidt E, Searle S, Slater G, Smith J, Spooner W, Stabenau A, Stalker J, Stupka E, Ureta-Vidal A, Vastrik I, Clamp M. The Ensembl genome database project. Nucleic Acids Res 2002; 30:38-41. [PMID: 11752248 PMCID: PMC99161 DOI: 10.1093/nar/30.1.38] [Citation(s) in RCA: 1062] [Impact Index Per Article: 48.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of the human genome sequence, with confirmed gene predictions that have been integrated with external data sources, and is available as either an interactive web site or as flat files. It is also an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements from sequence analysis to data storage and visualisation. The Ensembl site is one of the leading sources of human genome sequence annotation and provided much of the analysis for publication by the international human genome project of the draft genome. The Ensembl system is being installed around the world in both companies and academic sites on machines ranging from supercomputers to laptops.
Collapse
Affiliation(s)
- T Hubbard
- The Wellcome Trust Sanger Institute and European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Firat H, Tourdot S, Ureta-Vidal A, Scardino A, Suhrbier A, Buseyne F, Rivìere Y, Danos O, Michel ML, Kosmatopoulos K, Lemonnier FA. Design of a polyepitope construct for the induction of HLA-A0201-restricted HIV 1-specific CTL responses using HLA-A*0201 transgenic, H-2 class I KO mice. Eur J Immunol 2001; 31:3064-74. [PMID: 11592083 DOI: 10.1002/1521-4141(2001010)31:10<3064::aid-immu3064>3.0.co;2-l] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
HLA-A*0201 transgenic, H-2D(b)/mouse beta2-microglobulin double-knockout mice were used to compare and optimize the immunogenic potential of 17HIV 1-derived,HLA-A0201-restricted epitopic peptides. A tyrosine substitution in position 1 of the epitopic peptides, which increases both their affinity for and their HLA-A0201 molecule stabilizing capacity, was introduced in a significant proportion, having verified that such modifications enhance their immunogenicity in respect of their natural antigenicity. Based on these results, a 13-polyepitope construct was inserted in the pre-S2 segment of the hepatitis B middle glycoprotein and used for DNA immunization. Long-lasting CTL responses against most of the inserted epitopes could be elicited simultaneously in a single animal with cross-recognition in several cases of their most common natural variants.
Collapse
Affiliation(s)
- H Firat
- Unité d'Immunité Cellulaire Antivirale, Département SIDA-Rétrovirus, Institut Pasteur, Paris, France.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Ureta-Vidal A, Pique C, Garcia Z, Dehée A, Tortevoye P, Désiré N, Gessain A, Chancerel B, Gout O, Lemonnier FA, Cochet M. Human T cell leukemia virus Type I (HTLV-I) infection induces greater expansions of CD8 T lymphocytes in persons with HTLV-I-associated myelopathy/tropical spastic paraparesis than in asymptomatic carriers. J Infect Dis 2001; 183:857-64. [PMID: 11237801 DOI: 10.1086/319264] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2000] [Revised: 12/06/2000] [Indexed: 11/04/2022] Open
Abstract
A quantitative study of the T cell receptor repertoire was performed ex vivo on CD4 and CD8 T cell subsets of human T cell leukemia virus type I (HTLV-I)-infected asymptomatic carriers and patients with HTLV-I-associated myelopathy/tropical spastic paraparesis (HAM/TSP). Indexes of oligoclonality that compiled all repertoire modifications were calculated for peripheral blood mononuclear cells and for CD4 and CD8 T cell subsets. Both patients with HAM/TSP and asymptomatic carriers had greater T lymphocyte expansions than did uninfected donors, which was independent of age and at least twice higher in the CD8 than in the CD4 cell compartment. Some expanded CD8 T cells corresponded to cytotoxic T lymphocytes directed against various epitopes of the immunodominant Tax protein. Patients with HAM/TSP had significantly higher CD8 cell expansions than did asymptomatic carriers. These results highlight the prognostic value of measuring CD8 T cell expansions during follow-up of HTLV-I infection.
Collapse
Affiliation(s)
- A Ureta-Vidal
- Unité d'Immunité Cellulaire Antivirale, Paris, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Kazanji M, Ureta-Vidal A, Ozden S, Tangy F, de Thoisy B, Fiette L, Talarmin A, Gessain A, de Thé G. Lymphoid organs as a major reservoir for human T-cell leukemia virus type 1 in experimentally infected squirrel monkeys (Saimiri sciureus): provirus expression, persistence, and humoral and cellular immune responses. J Virol 2000; 74:4860-7. [PMID: 10775625 PMCID: PMC112009 DOI: 10.1128/jvi.74.10.4860-4867.2000] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The aim of this study was to investigate the distribution of human T-cell leukemia virus type 1 (HTLV-1) in various organs of serially sacrificed squirrel monkeys (Saimiri sciureus) in order to localize the reservoir of the virus and to evaluate the relationship between viral expression and the humoral or cellular immune response during infection. Six squirrel monkeys infected with HTLV-1 were sacrificed 6, 12, and 35 days and 3, 6, and 26 months after inoculation, and 20 organs and tissues were collected from each animal. PCR and reverse transcription-PCR (RT-PCR) were performed with gag and tax primers. Proviral DNA was detected by PCR in peripheral blood mononuclear cells (PBMCs) of monkeys sacrificed 6 days after inoculation and in PBMCs, spleens, and lymph nodes of monkeys sacrificed 12 and 35 days and 3, 6, and 26 months after inoculation. Furthermore, tax/rex mRNA was detected by RT-PCR in the PBMCs of two monkeys 8 to 12 days after inoculation and in the spleens and lymph nodes of the monkey sacrificed on day 12. In this animal, scattered HTLV-1 tax/rex mRNA-positive lymphocytes were detected by in situ hybridization in frozen sections of the spleen, around the germinal centers and close to the arterial capillaries. Anti-HTLV-1 cell-mediated immunity was evaluated at various times after inoculation. Anti-p40(Tax) and anti-Env cytolytic T-cell responses were detected 2 months after infection and remained detectable thereafter. When Tax peptides were used, this response appeared to be directed against various Tax epitopes. Our results indicate that squirrel monkeys represent a promising animal model for studying the early events of HTLV-1 infection and for evaluating candidate vaccines against HTLV-1.
Collapse
Affiliation(s)
- M Kazanji
- Unité d'Oncologie Virale, Institut Pasteur, Paris, France.
| | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Pique C, Ureta-Vidal A, Gessain A, Chancerel B, Gout O, Tamouza R, Agis F, Dokhélar MC. Evidence for the chronic in vivo production of human T cell leukemia virus type I Rof and Tof proteins from cytotoxic T lymphocytes directed against viral peptides. J Exp Med 2000; 191:567-72. [PMID: 10662802 PMCID: PMC2195825 DOI: 10.1084/jem.191.3.567] [Citation(s) in RCA: 87] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Human T cell leukemia virus type I (HTLV-I) is a persistent virus that causes adult T cell leukemia and tropical spastic paraparesis/HTLV-I-associated myelopathy. Studies on rabbits have shown that viral proteins encoded by the open reading frames pX-I and pX-II are required for the establishment of the persistent infection. To examine the in vivo production of these proteins in humans, we have investigated whether cytotoxic T lymphocytes isolated from HTLV-I-infected individuals recognized pX-I and pX-II peptides. CD8(+) T lymphocytes to pX-I and pX-II peptides were detected in HTLV-I-infected individuals, whatever their clinical status, and even in the absence of any antigenic restimulation. These findings indicate that the HTLV-I pX-I and pX-II proteins are chronically synthesized in vivo, and are targets of the natural immune response to the virus.
Collapse
Affiliation(s)
- C Pique
- Institut National de la Santé et de la Recherche Médicale, U332, Institut Cochin de Génétique Moléculaire, 75014 Paris, France.
| | | | | | | | | | | | | | | |
Collapse
|
26
|
Talarmin A, Vion B, Ureta-Vidal A, Du Fou G, Marty C, Kazanji M. First seroepidemiological study and phylogenetic characterization of human T-cell lymphotropic virus type I and II infection among Amerindians in French Guiana. J Gen Virol 1999; 80 ( Pt 12):3083-3088. [PMID: 10567638 DOI: 10.1099/0022-1317-80-12-3083] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We investigated the serological, epidemiological and molecular aspects of human T-cell lymphotropic virus type I and II (HTLV-I/II) infection in the Amerindian populations of French Guiana by testing 847 sera. No HTLV-II antibodies were detected, but five individuals (0.59%) were seropositive for HTLV-I. Analysis of the nucleotide sequences of 522 bp of the env gene and the compete LTR showed that all of the strains from French Guiana belonged to the cosmopolitan subtype A. The similarities were greater between Amerindian and Creole strains than between Amerindian and Noir-Marron strains or than between Creole and Noir-Marron strains. Phylogenetic analysis showed two clusters: one of strains from Amerindians and Creoles, which belong to the transcontinental subgroup, and the other of strains from Noirs-Marrons, belonging to the West African subgroup. Our results suggest that the Amerindian HTLV-I strains are of African origin.
Collapse
Affiliation(s)
- Antoine Talarmin
- Laboratoire de Rétrovirologie, Institut Pasteur de la Guyane, BP 6010, 23 Av. Pasteur, 97300 Cayenne, French Guiana1
| | - Bruno Vion
- Direction de l'Action Sanitaire et Sociale de la Guyane, 97300 Cayenne, French Guiana2
| | - Abel Ureta-Vidal
- Unité d'Immunité Cellulaire antivirale, Institut Pasteur, 75015 Paris, France3
| | - Guénola Du Fou
- Laboratoire de Rétrovirologie, Institut Pasteur de la Guyane, BP 6010, 23 Av. Pasteur, 97300 Cayenne, French Guiana1
| | - Christian Marty
- Direction de l'Action Sanitaire et Sociale de la Guyane, 97300 Cayenne, French Guiana2
| | - Mirdad Kazanji
- Laboratoire de Rétrovirologie, Institut Pasteur de la Guyane, BP 6010, 23 Av. Pasteur, 97300 Cayenne, French Guiana1
| |
Collapse
|
27
|
Ureta-Vidal A, Angelin-Duclos C, Tortevoye P, Murphy E, Lepère JF, Buigues RP, Jolly N, Joubert M, Carles G, Pouliquen JF, de Thé G, Moreau JP, Gessain A. Mother-to-child transmission of human T-cell-leukemia/lymphoma virus type I: implication of high antiviral antibody titer and high proviral load in carrier mothers. Int J Cancer 1999. [PMID: 10446450 DOI: 10.1002/(sici)1097-0215(19990909)82:6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
In order to gain new insights into the risk factors influencing human-T-cell-leukemia/lymphoma-virus-type-I (HTLV-I) mother-to-child transmission, a retrospective study of HTLV-I infection among children born to HTLV-I-seropositive women was carried out in a highly HTLV-I-endemic population of African origin living in French Guyana. The study covered 81 HTLV-I-seropositive mothers and their 216 children aged between 18 months old and 12 years old. All plasma samples were tested for the presence of HTLV-I antibodies by ELISA, immunofluorescence assay and Western blot. HTLV-I provirus was detected, in the DNA extracted from peripheral-blood mononuclear cells, by polymerase chain reaction (PCR) using primers specific for 3 different HTLV-I genomic regions (LTR, gag and pX) and quantified by a competitive PCR assay. Out of the 216 children, 21 were found to be HTLV-I-seropositive, giving a crude HTLV-I transmission rate of 9.7%, while among the 180 breast-fed children 10.6% were HTLV-I-seropositive. Perfect concordance between serological and PCR results was observed, and none of the 195 HTLV-I-negative children was found HTLV-I-positive by PCR. In conditional (by family) logistic-regression models, HTLV-I seropositivity in children was associated with an elevated maternal anti-HTLV-I-antibody titer (OR 2.2, p = 0.0013), a high maternal HTLV-I proviral load (OR 2.6, p = 0.033) and child's gender, girls being more frequently HTLV-I-infected than boys: OR 3.6, p = 0.0077 in the model including maternal anti-HTLV-I-antibody titer and OR 4.1, p = 0.002 in the model including the maternal HTLV-I proviral load.
Collapse
Affiliation(s)
- A Ureta-Vidal
- Unité d'Epidémiologie des Virus Oncogènes, Institut Pasteur, Paris, France
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Ureta-Vidal A, Angelin-Duclos C, Tortevoye P, Murphy E, Lepère JF, Buigues RP, Jolly N, Joubert M, Carles G, Pouliquen JF, de Thé G, Moreau JP, Gessain A. Mother-to-child transmission of human T-cell-leukemia/lymphoma virus type I: implication of high antiviral antibody titer and high proviral load in carrier mothers. Int J Cancer 1999. [PMID: 10446450 DOI: 10.1002/(sici)1097-0215(19990909)82] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
In order to gain new insights into the risk factors influencing human-T-cell-leukemia/lymphoma-virus-type-I (HTLV-I) mother-to-child transmission, a retrospective study of HTLV-I infection among children born to HTLV-I-seropositive women was carried out in a highly HTLV-I-endemic population of African origin living in French Guyana. The study covered 81 HTLV-I-seropositive mothers and their 216 children aged between 18 months old and 12 years old. All plasma samples were tested for the presence of HTLV-I antibodies by ELISA, immunofluorescence assay and Western blot. HTLV-I provirus was detected, in the DNA extracted from peripheral-blood mononuclear cells, by polymerase chain reaction (PCR) using primers specific for 3 different HTLV-I genomic regions (LTR, gag and pX) and quantified by a competitive PCR assay. Out of the 216 children, 21 were found to be HTLV-I-seropositive, giving a crude HTLV-I transmission rate of 9.7%, while among the 180 breast-fed children 10.6% were HTLV-I-seropositive. Perfect concordance between serological and PCR results was observed, and none of the 195 HTLV-I-negative children was found HTLV-I-positive by PCR. In conditional (by family) logistic-regression models, HTLV-I seropositivity in children was associated with an elevated maternal anti-HTLV-I-antibody titer (OR 2.2, p = 0.0013), a high maternal HTLV-I proviral load (OR 2.6, p = 0.033) and child's gender, girls being more frequently HTLV-I-infected than boys: OR 3.6, p = 0.0077 in the model including maternal anti-HTLV-I-antibody titer and OR 4.1, p = 0.002 in the model including the maternal HTLV-I proviral load.
Collapse
Affiliation(s)
- A Ureta-Vidal
- Unité d'Epidémiologie des Virus Oncogènes, Institut Pasteur, Paris, France
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Ureta-Vidal A, Angelin-Duclos C, Tortevoye P, Murphy E, Lepère JF, Buigues RP, Jolly N, Joubert M, Carles G, Pouliquen JF, de Thé G, Moreau JP, Gessain A. Mother-to-child transmission of human T-cell-leukemia/lymphoma virus type I: implication of high antiviral antibody titer and high proviral load in carrier mothers. Int J Cancer 1999; 82:832-6. [PMID: 10446450 DOI: 10.1002/(sici)1097-0215(19990909)82:6<832::aid-ijc11>3.0.co;2-p] [Citation(s) in RCA: 96] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
In order to gain new insights into the risk factors influencing human-T-cell-leukemia/lymphoma-virus-type-I (HTLV-I) mother-to-child transmission, a retrospective study of HTLV-I infection among children born to HTLV-I-seropositive women was carried out in a highly HTLV-I-endemic population of African origin living in French Guyana. The study covered 81 HTLV-I-seropositive mothers and their 216 children aged between 18 months old and 12 years old. All plasma samples were tested for the presence of HTLV-I antibodies by ELISA, immunofluorescence assay and Western blot. HTLV-I provirus was detected, in the DNA extracted from peripheral-blood mononuclear cells, by polymerase chain reaction (PCR) using primers specific for 3 different HTLV-I genomic regions (LTR, gag and pX) and quantified by a competitive PCR assay. Out of the 216 children, 21 were found to be HTLV-I-seropositive, giving a crude HTLV-I transmission rate of 9.7%, while among the 180 breast-fed children 10.6% were HTLV-I-seropositive. Perfect concordance between serological and PCR results was observed, and none of the 195 HTLV-I-negative children was found HTLV-I-positive by PCR. In conditional (by family) logistic-regression models, HTLV-I seropositivity in children was associated with an elevated maternal anti-HTLV-I-antibody titer (OR 2.2, p = 0.0013), a high maternal HTLV-I proviral load (OR 2.6, p = 0.033) and child's gender, girls being more frequently HTLV-I-infected than boys: OR 3.6, p = 0.0077 in the model including maternal anti-HTLV-I-antibody titer and OR 4.1, p = 0.002 in the model including the maternal HTLV-I proviral load.
Collapse
Affiliation(s)
- A Ureta-Vidal
- Unité d'Epidémiologie des Virus Oncogènes, Institut Pasteur, Paris, France
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Ureta-Vidal A, Firat H, Pérarnau B, Lemonnier FA. Phenotypical and Functional Characterization of the CD8+ T Cell Repertoire of HLA-A2.1 Transgenic, H-2K
b
°D
b
° Double Knockout Mice. The Journal of Immunology 1999. [DOI: 10.4049/jimmunol.163.5.2555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Abstract
Homozygous HLA-A2.1 transgenic H-2Kb°Db° double knockout (KO) mice were created. Their potential to develop HLA-A2.1-restricted cytolytic responses was compared with that of their classical transgenic counterparts, which still express H-2Kb, Db molecules. On cell surfaces, both strains express similar amounts of chimeric (α1α2 domains of human, α3 cytoplasmic domains of mouse) HLA-A2.1 molecules in noncovalent association with mouse β2-microglobulin. Compared with mice that are totally deprived of histocompatibility class Ia molecules (H-2Kb°Db° double KO), the expression of HLA-A2.1 in transgenic/double KO mice resulted in sizeable increase in the periphery of CD8+ T cells with a normally diversified TCR repertoire. A biased education in favor of HLA-A2.1, ascribable to the absence of H-2 class Ia molecules, was evidenced in these transgenic/double KO mice by their improved capacity to mount HLA-restricted cytolytic responses, regardless of whether they were virally infected or injected with synthetic epitopic peptide. HLA class I transgenic, H-2 class Ia KO mice should represent useful animal models for the preclinical evaluation of vaccine formulations aiming at the induction of HLA class I-restricted CTL responses.
Collapse
Affiliation(s)
- Abel Ureta-Vidal
- Unité d’Immunité Cellulaire Antivirale, Département SIDA-Rétrovirus, Institut Pasteur, Paris, France
| | - Hüseyin Firat
- Unité d’Immunité Cellulaire Antivirale, Département SIDA-Rétrovirus, Institut Pasteur, Paris, France
| | - Béatrice Pérarnau
- Unité d’Immunité Cellulaire Antivirale, Département SIDA-Rétrovirus, Institut Pasteur, Paris, France
| | - François A. Lemonnier
- Unité d’Immunité Cellulaire Antivirale, Département SIDA-Rétrovirus, Institut Pasteur, Paris, France
| |
Collapse
|
31
|
Ureta-Vidal A, Firat H, Pérarnau B, Lemonnier FA. Phenotypical and functional characterization of the CD8+ T cell repertoire of HLA-A2.1 transgenic, H-2KbnullDbnull double knockout mice. J Immunol 1999; 163:2555-60. [PMID: 10452993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
Abstract
Homozygous HLA-A2.1 transgenic H-2KbnullDbnull double knockout (KO) mice were created. Their potential to develop HLA-A2. 1-restricted cytolytic responses was compared with that of their classical transgenic counterparts, which still express H-2Kb, Db molecules. On cell surfaces, both strains express similar amounts of chimeric (alpha 1 alpha 2 domains of human, alpha 3 cytoplasmic domains of mouse) HLA-A2.1 molecules in noncovalent association with mouse beta 2-microglobulin. Compared with mice that are totally deprived of histocompatibility class Ia molecules (H-2KbnullDbnull double KO), the expression of HLA-A2.1 in transgenic/double KO mice resulted in sizeable increase in the periphery of CD8+ T cells with a normally diversified TCR repertoire. A biased education in favor of HLA-A2.1, ascribable to the absence of H-2 class Ia molecules, was evidenced in these transgenic/double KO mice by their improved capacity to mount HLA-restricted cytolytic responses, regardless of whether they were virally infected or injected with synthetic epitopic peptide. HLA class I transgenic, H-2 class Ia KO mice should represent useful animal models for the preclinical evaluation of vaccine formulations aiming at the induction of HLA class I-restricted CTL responses.
Collapse
Affiliation(s)
- A Ureta-Vidal
- Unité d'Immunité Cellulaire Antivirale, Département SIDA-Rétrovirus, Institut Pasteur, Paris, France
| | | | | | | |
Collapse
|
32
|
Ureta-Vidal A, Garcia Z, Lemonnier FA, Kazanji M. Molecular characterization of cDNAs encoding squirrel monkey (Saïmiri sciureus) CD8 alpha and beta chains. Immunogenetics 1999; 49:718-21. [PMID: 10369934 DOI: 10.1007/s002510050672] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Affiliation(s)
- A Ureta-Vidal
- Unité d'Oncologie Virale, Département du SIDA et des Rétrovirus, Institut Pasteur, 28, rue du Dr Roux, 75724 Paris Cedex 15, France
| | | | | | | |
Collapse
|
33
|
Tuppin P, Lepère JF, Carles G, Ureta-Vidal A, Gérard Y, Peneau C, Tortevoye P, de Thé G, Moreau JP, Gessain A. Risk factors for maternal HTLV-I infection in French Guiana: high HTLV-I prevalence in the Noir Marron population. J Acquir Immune Defic Syndr Hum Retrovirol 1995; 8:420-425. [PMID: 7882109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The aim of this study was to compare rates of human T-cell lymphotropic virus type I (HTLV-I) seroprevalence in pregnant women belonging to different ethnic groups in French Guiana and to determine the risk factors associated with HTLV-I seropositivity. All 1,873 deliveries between 1 July 1991 and 30 June 1993 in the only gynecologic and obstetric unit at Saint Laurent du Maroni were enrolled. Serologic status could be established for 1,727 women, with 75 (4.3%) being HTLV-I seropositive. The HTLV-I seroprevalence rate differed significantly between ethnic groups: 5.7% for Noir-Marron (70/1,302), 6.3% for Haitian (3/50), and 0% for Creole (126), Amerindians (166), and Hmong (64). In Noir-Marron pregnant women, HTLV-I seropositivity was associated with a maternal age of > 35 years [odds ratio (OR), 3.3; 95% confidence interval (CI), 1.4-7.6], prior miscarriage (OR, 1.7; CI, 1-2.8), prior cesarean section (OR, 2.1; CI, 1.1-4.0), a parity > 4 (OR, 4.0; CI, 1.8-8.8), a gravidity > 6 (OR, 4.2; CI, 2.0-7.2), and a negative Rhesus factor (OR, 2.2; CI, 1.1-4.5). Two separate stepwise logistic regressions were done because gravidity and parity were highly correlated. HTLV-I seropositivity remained associated with a gravidity > 6 (OR, 3.9; CI, 2.1-7.4) and a negative Rhesus factor (OR, 2.6; CI, 1.2-5.3) for the first model and with a parity > 4 (OR, 4.1; CI, 1.9-9.0) and a negative Rhesus factor (OR, 2.5; CI, 1.2-5.1) for the second model.(ABSTRACT TRUNCATED AT 250 WORDS)
Collapse
Affiliation(s)
- P Tuppin
- Unité d'Epidémiologie des Virus Oncogènes, Institut Pasteur, Paris, France
| | | | | | | | | | | | | | | | | | | |
Collapse
|