1
|
Khayi S, Gaboun F, Pirro S, Tatusova T, El Mousadik A, Ghazal H, Mentag R. Complete Chloroplast Genome of Argania spinosa: Structural Organization and Phylogenetic Relationships in Sapotaceae. Plants (Basel) 2020; 9:E1354. [PMID: 33066261 PMCID: PMC7602116 DOI: 10.3390/plants9101354] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 06/24/2020] [Accepted: 06/25/2020] [Indexed: 01/02/2023]
Abstract
Argania spinosa (Sapotaceae), an important endemic Moroccan oil tree, is a primary source of argan oil, which has numerous dietary and medicinal proprieties. The plant species occupies the mid-western part of Morocco and provides great environmental and socioeconomic benefits. The complete chloroplast (cp) genome of A. spinosa was sequenced, assembled, and analyzed in comparison with those of two Sapotaceae members. The A. spinosa cp genome is 158,848 bp long, with an average GC content of 36.8%. The cp genome exhibits a typical quadripartite and circular structure consisting of a pair of inverted regions (IR) of 25,945 bp in length separating small single-copy (SSC) and large single-copy (LSC) regions of 18,591 and 88,367 bp, respectively. The annotation of A. spinosa cp genome predicted 130 genes, including 85 protein-coding genes (CDS), 8 ribosomal RNA (rRNA) genes, and 37 transfer RNA (tRNA) genes. A total of 44 long repeats and 88 simple sequence repeats (SSR) divided into mononucleotides (76), dinucleotides (7), trinucleotides (3), tetranucleotides (1), and hexanucleotides (1) were identified in the A. spinosa cp genome. Phylogenetic analyses using the maximum likelihood (ML) method were performed based on 69 protein-coding genes from 11 species of Ericales. The results confirmed the close position of A. spinosa to the Sideroxylon genus, supporting the revisiting of its taxonomic status. The complete chloroplast genome sequence will be valuable for further studies on the conservation and breeding of this medicinally and culinary important species and also contribute to clarifying the phylogenetic position of the species within Sapotaceae.
Collapse
Affiliation(s)
- Slimane Khayi
- CRRA-Rabat, National Institute for Agricultural Research (INRA), Rabat 10101, Morocco;
| | - Fatima Gaboun
- CRRA-Rabat, National Institute for Agricultural Research (INRA), Rabat 10101, Morocco;
| | - Stacy Pirro
- Iridian Genomes, Inc., Bethesda, MD 20817, USA;
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20817, USA;
| | - Abdelhamid El Mousadik
- Laboratory of Biotechnology and Valorization of Natural Resources (LBVRN), Faculty of Sciences, University Ibn Zohr, Agadir 80000, Morocco;
| | - Hassan Ghazal
- National Center for Scientific and Technological Research (CNRST), Rabat 10102, Morocco;
| | - Rachid Mentag
- CRRA-Rabat, National Institute for Agricultural Research (INRA), Rabat 10101, Morocco;
| |
Collapse
|
2
|
Khayi S, Azza NE, Gaboun F, Pirro S, Badad O, Claros MG, Lightfoot DA, Unver T, Chaouni B, Merrouch R, Rahim B, Essayeh S, Ganoudi M, Abdelwahd R, Diria G, Mdarhi MA, Labhilili M, Iraqi D, Mouhaddab J, Sedrati H, Memari M, Hamamouch N, Alché JDD, Boukhatem N, Mrabet R, Dahan R, Legssyer A, Khalfaoui M, Badraoui M, Van de Peer Y, Tatusova T, El Mousadik A, Mentag R, Ghazal H. First draft genome assembly of the Argane tree ( Argania spinosa). F1000Res 2018; 7:1310. [PMID: 32509273 PMCID: PMC7238458 DOI: 10.12688/f1000research.15719.2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/22/2020] [Indexed: 11/20/2022] Open
Abstract
Background: The Argane tree ( Argania spinosa L. Skeels) is an endemic tree of mid-western Morocco that plays an important socioeconomic and ecologic role for a dense human population in an arid zone. Several studies confirmed the importance of this species as a food and feed source and as a resource for both pharmaceutical and cosmetic compounds. Unfortunately, the argane tree ecosystem is facing significant threats from environmental changes (global warming, over-population) and over-exploitation. Limited research has been conducted, however, on argane tree genetics and genomics, which hinders its conservation and genetic improvement. Methods: Here, we present a draft genome assembly of A. spinosa. A reliable reference genome of A. spinosa was created using a hybrid de novo assembly approach combining short and long sequencing reads. Results: In total, 144 Gb Illumina HiSeq reads and 7.6 Gb PacBio reads were produced and assembled. The final draft genome comprises 75 327 scaffolds totaling 671 Mb with an N50 of 49 916 kb. The draft assembly is close to the genome size estimated by k-mers distribution and covers 89% of complete and 4.3 % of partial Arabidopsis orthologous groups in BUSCO. Conclusion: The A. spinosa genome will be useful for assessing biodiversity leading to efficient conservation of this endangered endemic tree. Furthermore, the genome may enable genome-assisted cultivar breeding, and provide a better understanding of important metabolic pathways and their underlying genes for both cosmetic and pharmacological.
Collapse
Affiliation(s)
- Slimane Khayi
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Nour Elhouda Azza
- Laboratory of Biotechnology and Valorization of Natural Resources (LBVRN), Faculty of Sciences, University Ibn Zohr, Agadir, Morocco.,Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco
| | - Fatima Gaboun
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Stacy Pirro
- Iridian Genomes, Inc., Bethesda, MD, 20817, USA
| | - Oussama Badad
- Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco.,Department of Plant, Soil and Agricultural Systems, Southern Illinois University, Carbondale, IL, 62901, USA.,Laboratory of Plant Physiology, Faculty of Sciences, University Mohamed V in Rabat, Rabat, 10000, Morocco
| | - M Gonzalo Claros
- Department of Molecular Biology and Biochemistry, and Plataforma Andaluza de Bioinformática, University of Malaga, Malaga, Spain
| | - David A Lightfoot
- Department of Plant, Soil and Agricultural Systems, Southern Illinois University, Carbondale, IL, 62901, USA
| | - Turgay Unver
- International Biomedicine and Genome Institute (iBG-izmir), Dokuz Eylül University, Current address: Egitim Mah. Ekrem Guer Sok. 26/3 Balcova, Izmir, Turkey
| | - Bouchra Chaouni
- Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco.,Laboratory of Plant Physiology, Faculty of Sciences, University Mohamed V in Rabat, Rabat, 10000, Morocco
| | - Redouane Merrouch
- National Center for Scientific and Technological Research (CNRST), Rabat, Morocco
| | - Bouchra Rahim
- National Center for Scientific and Technological Research (CNRST), Rabat, Morocco
| | - Soumaya Essayeh
- Polydisciplinary Faculty of Nador, University Mohamed Premier, Nador, Morocco
| | - Matika Ganoudi
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Rabha Abdelwahd
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Ghizlane Diria
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Meriem Alaoui Mdarhi
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Mustapha Labhilili
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Driss Iraqi
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Jamila Mouhaddab
- Laboratory of Biotechnology and Valorization of Natural Resources (LBVRN), Faculty of Sciences, University Ibn Zohr, Agadir, Morocco
| | - Hayat Sedrati
- National School of Computer Sciences & Systems Analysis, University Mohammed V in Rabat, Rabat, Morocco
| | - Majid Memari
- Research Computing and Cyber infrastructure, Computer Science Department, Southern Illinois University, Carbondale, IL, 62901, USA
| | - Noureddine Hamamouch
- Polydisciplinary Faculty, Sultan Moulay Slimane University, Beni-Mellal, Morocco
| | - Juan de Dios Alché
- Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas (CSIC), Granada, Spain
| | - Noureddine Boukhatem
- Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco
| | - Rachid Mrabet
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Rachid Dahan
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Adelkhaleq Legssyer
- Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco
| | - Mohamed Khalfaoui
- National Center for Scientific and Technological Research (CNRST), Rabat, Morocco
| | - Mohamed Badraoui
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, B-9052 Ghent, Belgium, Belgium.,VIB Center for Plant Systems Biology, Technologiepark 927, Ghent, B-9052, Belgium.,Department of Genetics, Genomics Research Institute, University of Pretoria, Pretoria, 0028, South Africa
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20817, USA
| | - Abdelhamid El Mousadik
- Laboratory of Biotechnology and Valorization of Natural Resources (LBVRN), Faculty of Sciences, University Ibn Zohr, Agadir, Morocco
| | - Rachid Mentag
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Hassan Ghazal
- Laboratory of Biotechnology and Valorization of Natural Resources (LBVRN), Faculty of Sciences, University Ibn Zohr, Agadir, Morocco.,Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco.,Iridian Genomes, Inc., Bethesda, MD, 20817, USA.,National Center for Scientific and Technological Research (CNRST), Rabat, Morocco
| |
Collapse
|
3
|
Khayi S, Azza NE, Gaboun F, Pirro S, Badad O, Claros MG, Lightfoot DA, Unver T, Chaouni B, Merrouch R, Rahim B, Essayeh S, Ganoudi M, Abdelwahd R, Diria G, Mdarhi MA, Labhilili M, Iraqi D, Mouhaddab J, Sedrati H, Memari M, Hamamouch N, Alché JDD, Boukhatem N, Mrabet R, Dahan R, Legssyer A, Khalfaoui M, Badraoui M, Van de Peer Y, Tatusova T, El Mousadik A, Mentag R, Ghazal H. First draft genome assembly of the Argane tree ( Argania spinosa). F1000Res 2018; 7:1310. [PMID: 32509273 DOI: 10.12688/f1000research.15719.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/02/2018] [Indexed: 11/20/2022] Open
Abstract
Background: The Argane tree ( Argania spinosa L. Skeels) is an endemic tree of mid-western Morocco that plays an important socioeconomic and ecologic role for a dense human population in an arid zone. Several studies confirmed the importance of this species as a food and feed source and as a resource for both pharmaceutical and cosmetic compounds. Unfortunately, the argane tree ecosystem is facing significant threats from environmental changes (global warming, over-population) and over-exploitation. Limited research has been conducted, however, on argane tree genetics and genomics, which hinders its conservation and genetic improvement. Methods: Here, we present a draft genome assembly of A. spinosa. A reliable reference genome of A. spinosa was created using a hybrid de novo assembly approach combining short and long sequencing reads. Results: In total, 144 Gb Illumina HiSeq reads and 7.6 Gb PacBio reads were produced and assembled. The final draft genome comprises 75 327 scaffolds totaling 671 Mb with an N50 of 49 916 kb. The draft assembly is close to the genome size estimated by k-mers distribution and covers 89% of complete and 4.3 % of partial Arabidopsis orthologous groups in BUSCO. Conclusion: The A. spinosa genome will be useful for assessing biodiversity leading to efficient conservation of this endangered endemic tree. Furthermore, the genome may enable genome-assisted cultivar breeding, and provide a better understanding of important metabolic pathways and their underlying genes for both cosmetic and pharmacological.
Collapse
Affiliation(s)
- Slimane Khayi
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Nour Elhouda Azza
- Laboratory of Biotechnology and Valorization of Natural Resources (LBVRN), Faculty of Sciences, University Ibn Zohr, Agadir, Morocco.,Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco
| | - Fatima Gaboun
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Stacy Pirro
- Iridian Genomes, Inc., Bethesda, MD, 20817, USA
| | - Oussama Badad
- Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco.,Department of Plant, Soil and Agricultural Systems, Southern Illinois University, Carbondale, IL, 62901, USA.,Laboratory of Plant Physiology, Faculty of Sciences, University Mohamed V in Rabat, Rabat, 10000, Morocco
| | - M Gonzalo Claros
- Department of Molecular Biology and Biochemistry, and Plataforma Andaluza de Bioinformática, University of Malaga, Malaga, Spain
| | - David A Lightfoot
- Department of Plant, Soil and Agricultural Systems, Southern Illinois University, Carbondale, IL, 62901, USA
| | - Turgay Unver
- International Biomedicine and Genome Institute (iBG-izmir), Dokuz Eylül University, Current address: Egitim Mah. Ekrem Guer Sok. 26/3 Balcova, Izmir, Turkey
| | - Bouchra Chaouni
- Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco.,Laboratory of Plant Physiology, Faculty of Sciences, University Mohamed V in Rabat, Rabat, 10000, Morocco
| | - Redouane Merrouch
- National Center for Scientific and Technological Research (CNRST), Rabat, Morocco
| | - Bouchra Rahim
- National Center for Scientific and Technological Research (CNRST), Rabat, Morocco
| | - Soumaya Essayeh
- Polydisciplinary Faculty of Nador, University Mohamed Premier, Nador, Morocco
| | - Matika Ganoudi
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Rabha Abdelwahd
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Ghizlane Diria
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Meriem Alaoui Mdarhi
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Mustapha Labhilili
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Driss Iraqi
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Jamila Mouhaddab
- Laboratory of Biotechnology and Valorization of Natural Resources (LBVRN), Faculty of Sciences, University Ibn Zohr, Agadir, Morocco
| | - Hayat Sedrati
- National School of Computer Sciences & Systems Analysis, University Mohammed V in Rabat, Rabat, Morocco
| | - Majid Memari
- Research Computing and Cyber infrastructure, Computer Science Department, Southern Illinois University, Carbondale, IL, 62901, USA
| | - Noureddine Hamamouch
- Polydisciplinary Faculty, Sultan Moulay Slimane University, Beni-Mellal, Morocco
| | - Juan de Dios Alché
- Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas (CSIC), Granada, Spain
| | - Noureddine Boukhatem
- Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco
| | - Rachid Mrabet
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Rachid Dahan
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Adelkhaleq Legssyer
- Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco
| | - Mohamed Khalfaoui
- National Center for Scientific and Technological Research (CNRST), Rabat, Morocco
| | - Mohamed Badraoui
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, B-9052 Ghent, Belgium, Belgium.,VIB Center for Plant Systems Biology, Technologiepark 927, Ghent, B-9052, Belgium.,Department of Genetics, Genomics Research Institute, University of Pretoria, Pretoria, 0028, South Africa
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20817, USA
| | - Abdelhamid El Mousadik
- Laboratory of Biotechnology and Valorization of Natural Resources (LBVRN), Faculty of Sciences, University Ibn Zohr, Agadir, Morocco
| | - Rachid Mentag
- Biotechnology Unit, National Institute of Agricultural Research (INRA), Rabat, Morocco, Morocco
| | - Hassan Ghazal
- Laboratory of Biotechnology and Valorization of Natural Resources (LBVRN), Faculty of Sciences, University Ibn Zohr, Agadir, Morocco.,Laboratory of Physiology, Genetics & Ethnopharmacology (LPGE), Faculty of Sciences, University Mohamed Premier, Oujda, Morocco.,Iridian Genomes, Inc., Bethesda, MD, 20817, USA.,National Center for Scientific and Technological Research (CNRST), Rabat, Morocco
| |
Collapse
|
4
|
Zaslavsky L, Ciufo S, Fedorov B, Tatusova T. Clustering analysis of proteins from microbial genomes at multiple levels of resolution. BMC Bioinformatics 2016; 17 Suppl 8:276. [PMID: 27586436 PMCID: PMC5009818 DOI: 10.1186/s12859-016-1112-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
Background Microbial genomes at the National Center for Biotechnology Information (NCBI) represent a large collection of more than 35,000 assemblies. There are several complexities associated with the data: a great variation in sampling density since human pathogens are densely sampled while other bacteria are less represented; different protein families occur in annotations with different frequencies; and the quality of genome annotation varies greatly. In order to extract useful information from these sophisticated data, the analysis needs to be performed at multiple levels of phylogenomic resolution and protein similarity, with an adequate sampling strategy. Results Protein clustering is used to construct meaningful and stable groups of similar proteins to be used for analysis and functional annotation. Our approach is to create protein clusters at three levels. First, tight clusters in groups of closely-related genomes (species-level clades) are constructed using a combined approach that takes into account both sequence similarity and genome context. Second, clustroids of conservative in-clade clusters are organized into seed global clusters. Finally, global protein clusters are built around the the seed clusters. We propose filtering strategies that allow limiting the protein set included in global clustering. The in-clade clustering procedure, subsequent selection of clustroids and organization into seed global clusters provides a robust representation and high rate of compression. Seed protein clusters are further extended by adding related proteins. Extended seed clusters include a significant part of the data and represent all major known cell machinery. The remaining part, coming from either non-conservative (unique) or rapidly evolving proteins, from rare genomes, or resulting from low-quality annotation, does not group together well. Processing these proteins requires significant computational resources and results in a large number of questionable clusters. Conclusion The developed filtering strategies allow to identify and exclude such peripheral proteins limiting the protein dataset in global clustering. Overall, the proposed methodology allows the relevant data at different levels of details to be obtained and data redundancy eliminated while keeping biologically interesting variations. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1112-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Leonid Zaslavsky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, 20894, MD, USA.
| | - Stacy Ciufo
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, 20894, MD, USA
| | - Boris Fedorov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, 20894, MD, USA
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, 20894, MD, USA
| |
Collapse
|
5
|
Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 2016; 44:6614-24. [PMID: 27342282 PMCID: PMC5001611 DOI: 10.1093/nar/gkw569] [Citation(s) in RCA: 4087] [Impact Index Per Article: 510.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Revised: 06/08/2016] [Accepted: 06/13/2016] [Indexed: 12/01/2022] Open
Abstract
Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/.
Collapse
Affiliation(s)
- Tatiana Tatusova
- National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, MD 20894, USA
| | - Michael DiCuccio
- National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, MD 20894, USA
| | - Azat Badretdin
- National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, MD 20894, USA
| | - Vyacheslav Chetvernin
- National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, MD 20894, USA
| | - Eric P Nawrocki
- National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, MD 20894, USA
| | - Leonid Zaslavsky
- National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, MD 20894, USA
| | - Alexandre Lomsadze
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Tech, Atlanta, GA 30332, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, MD 20894, USA
| | - Mark Borodovsky
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Tech, Atlanta, GA 30332, USA School of Computational Science and Engineering, Georgia Tech, Atlanta, GA 30332, USA
| | - James Ostell
- National Center for Biotechnology Information, U.S. National Library of Medicine, Bethesda, MD 20894, USA
| |
Collapse
|
6
|
Kılıç S, Sagitova DM, Wolfish S, Bely B, Courtot M, Ciufo S, Tatusova T, O'Donovan C, Chibucos MC, Martin MJ, Erill I. From data repositories to submission portals: rethinking the role of domain-specific databases in CollecTF. Database (Oxford) 2016; 2016:baw055. [PMID: 27114493 PMCID: PMC4843526 DOI: 10.1093/database/baw055] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/06/2015] [Accepted: 03/20/2016] [Indexed: 11/12/2022]
Abstract
Domain-specific databases are essential resources for the biomedical community, leveraging expert knowledge to curate published literature and provide access to referenced data and knowledge. The limited scope of these databases, however, poses important challenges on their infrastructure, visibility, funding and usefulness to the broader scientific community. CollecTF is a community-oriented database documenting experimentally validated transcription factor (TF)-binding sites in the Bacteria domain. In its quest to become a community resource for the annotation of transcriptional regulatory elements in bacterial genomes, CollecTF aims to move away from the conventional data-repository paradigm of domain-specific databases. Through the adoption of well-established ontologies, identifiers and collaborations, CollecTF has progressively become also a portal for the annotation and submission of information on transcriptional regulatory elements to major biological sequence resources (RefSeq, UniProtKB and the Gene Ontology Consortium). This fundamental change in database conception capitalizes on the domain-specific knowledge of contributing communities to provide high-quality annotations, while leveraging the availability of stable information hubs to promote long-term access and provide high-visibility to the data. As a submission portal, CollecTF generates TF-binding site information through direct annotation of RefSeq genome records, definition of TF-based regulatory networks in UniProtKB entries and submission of functional annotations to the Gene Ontology. As a database, CollecTF provides enhanced search and browsing, targeted data exports, binding motif analysis tools and integration with motif discovery and search platforms. This innovative approach will allow CollecTF to focus its limited resources on the generation of high-quality information and the provision of specialized access to the data.Database URL: http://www.collectf.org/.
Collapse
Affiliation(s)
- Sefa Kılıç
- Department of Biological Sciences, University of Maryland Baltimore County (UMBC), 1000 Hilltop Circle, Baltimore, MD, 21250, USA
| | - Dinara M Sagitova
- Department of Biological Sciences, University of Maryland Baltimore County (UMBC), 1000 Hilltop Circle, Baltimore, MD, 21250, USA
| | - Shoshannah Wolfish
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
| | - Benoit Bely
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Mélanie Courtot
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Stacy Ciufo
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, Rockville Pike, Bethesda, MD, 20894, USA
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, Rockville Pike, Bethesda, MD, 20894, USA
| | - Claire O'Donovan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Marcus C Chibucos
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, 21201, USA Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
| | - Maria J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Ivan Erill
- Department of Biological Sciences, University of Maryland Baltimore County (UMBC), 1000 Hilltop Circle, Baltimore, MD, 21250, USA
| |
Collapse
|
7
|
Federhen S, Rossello-Mora R, Klenk HP, Tindall BJ, Konstantinidis KT, Whitman WB, Brown D, Labeda D, Ussery D, Garrity GM, Colwell RR, Hasan N, Graf J, Parte A, Yarza P, Goldberg B, Sichtig H, Karsch-Mizrachi I, Clark K, McVeigh R, Pruitt KD, Tatusova T, Falk R, Turner S, Madden T, Kitts P, Kimchi A, Klimke W, Agarwala R, DiCuccio M, Ostell J. Meeting report: GenBank microbial genomic taxonomy workshop (12–13 May, 2015). Stand Genomic Sci 2016. [PMCID: PMC4748488 DOI: 10.1186/s40793-016-0134-1] [Citation(s) in RCA: 56] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Many genomes are incorrectly identified at GenBank. We developed a plan to find and correct misidentified genomes using genomic comparison statistics together with a scaffold of reliably identified genomes from type. A workshop was organized with broad representation from the bacterial taxonomic community to review the proposal, the GenBank Microbial Genomic Taxonomy Workshop, Bethesda MD, May 12–13, 2015.
Collapse
|
8
|
Abstract
The National Center for Biotechnology Information (NCBI), as a primary public repository of genomic sequence data, collects and maintains enormous amounts of heterogeneous data. Data for genomes, genes, gene expressions, gene variation, gene families, proteins, and protein domains are integrated with the analytical, search, and retrieval resources through the NCBI website, text-based search and retrieval system, provides a fast and easy way to navigate across diverse biological databases.Comparative genome analysis tools lead to further understanding of evolution processes quickening the pace of discovery. Recent technological innovations have ignited an explosion in genome sequencing that has fundamentally changed our understanding of the biology of living organisms. This huge increase in DNA sequence data presents new challenges for the information management system and the visualization tools. New strategies have been designed to bring an order to this genome sequence shockwave and improve the usability of associated data.
Collapse
Affiliation(s)
- Tatiana Tatusova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD, 20894, USA.
| |
Collapse
|
9
|
Kitts PA, Church DM, Thibaud-Nissen F, Choi J, Hem V, Sapojnikov V, Smith RG, Tatusova T, Xiang C, Zherikov A, DiCuccio M, Murphy TD, Pruitt KD, Kimchi A. Assembly: a resource for assembled genomes at NCBI. Nucleic Acids Res 2015; 44:D73-80. [PMID: 26578580 PMCID: PMC4702866 DOI: 10.1093/nar/gkv1226] [Citation(s) in RCA: 203] [Impact Index Per Article: 22.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2015] [Accepted: 10/29/2015] [Indexed: 01/11/2023] Open
Abstract
The NCBI Assembly database (www.ncbi.nlm.nih.gov/assembly/) provides stable accessioning and data tracking for genome assembly data. The model underlying the database can accommodate a range of assembly structures, including sets of unordered contig or scaffold sequences, bacterial genomes consisting of a single complete chromosome, or complex structures such as a human genome with modeled allelic variation. The database provides an assembly accession and version to unambiguously identify the set of sequences that make up a particular version of an assembly, and tracks changes to updated genome assemblies. The Assembly database reports metadata such as assembly names, simple statistical reports of the assembly (number of contigs and scaffolds, contiguity metrics such as contig N50, total sequence length and total gap length) as well as the assembly update history. The Assembly database also tracks the relationship between an assembly submitted to the International Nucleotide Sequence Database Consortium (INSDC) and the assembly represented in the NCBI RefSeq project. Users can find assemblies of interest by querying the Assembly Resource directly or by browsing available assemblies for a particular organism. Links in the Assembly Resource allow users to easily download sequence and annotations for current versions of genome assemblies from the NCBI genomes FTP site.
Collapse
Affiliation(s)
- Paul A Kitts
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Deanna M Church
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Jinna Choi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Vichet Hem
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Victor Sapojnikov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Robert G Smith
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Charlie Xiang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Andrey Zherikov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Michael DiCuccio
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Avi Kimchi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
10
|
O'Leary NA, Wright MW, Brister JR, Ciufo S, Haddad D, McVeigh R, Rajput B, Robbertse B, Smith-White B, Ako-Adjei D, Astashyn A, Badretdin A, Bao Y, Blinkova O, Brover V, Chetvernin V, Choi J, Cox E, Ermolaeva O, Farrell CM, Goldfarb T, Gupta T, Haft D, Hatcher E, Hlavina W, Joardar VS, Kodali VK, Li W, Maglott D, Masterson P, McGarvey KM, Murphy MR, O'Neill K, Pujar S, Rangwala SH, Rausch D, Riddick LD, Schoch C, Shkeda A, Storz SS, Sun H, Thibaud-Nissen F, Tolstoy I, Tully RE, Vatsan AR, Wallin C, Webb D, Wu W, Landrum MJ, Kimchi A, Tatusova T, DiCuccio M, Kitts P, Murphy TD, Pruitt KD. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res 2015; 44:D733-45. [PMID: 26553804 PMCID: PMC4702849 DOI: 10.1093/nar/gkv1189] [Citation(s) in RCA: 3322] [Impact Index Per Article: 369.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2015] [Accepted: 10/24/2015] [Indexed: 12/12/2022] Open
Abstract
The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.
Collapse
Affiliation(s)
- Nuala A O'Leary
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Mathew W Wright
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - J Rodney Brister
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Stacy Ciufo
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Diana Haddad
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Rich McVeigh
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Bhanu Rajput
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Barbara Robbertse
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Brian Smith-White
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Danso Ako-Adjei
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Alexander Astashyn
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Azat Badretdin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Yiming Bao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Olga Blinkova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Vyacheslav Brover
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Vyacheslav Chetvernin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Jinna Choi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Eric Cox
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Olga Ermolaeva
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Catherine M Farrell
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Tamara Goldfarb
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Tripti Gupta
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Daniel Haft
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Eneida Hatcher
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Wratko Hlavina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Vinita S Joardar
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Vamsi K Kodali
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Wenjun Li
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Donna Maglott
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Patrick Masterson
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Kelly M McGarvey
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Michael R Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Kathleen O'Neill
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Shashikant Pujar
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Sanjida H Rangwala
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Daniel Rausch
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Lillian D Riddick
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Conrad Schoch
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Andrei Shkeda
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Susan S Storz
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Hanzhen Sun
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Francoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Igor Tolstoy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Raymond E Tully
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Anjana R Vatsan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Craig Wallin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - David Webb
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Wendy Wu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Melissa J Landrum
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Avi Kimchi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Michael DiCuccio
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Paul Kitts
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| |
Collapse
|
11
|
Tatusova T, Ciufo S, Fedorov B, O'Neill K, Tolstoy I. RefSeq microbial genomes database: new representation and annotation strategy. Nucleic Acids Res 2015; 43:3872. [PMID: 25824943 PMCID: PMC4402550 DOI: 10.1093/nar/gkv278] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
12
|
Tatusova T, Ciufo S, Federhen S, Fedorov B, McVeigh R, O'Neill K, Tolstoy I, Zaslavsky L. Update on RefSeq microbial genomes resources. Nucleic Acids Res 2014; 43:D599-605. [PMID: 25510495 DOI: 10.1093/nar/gku1062] [Citation(s) in RCA: 107] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
NCBI RefSeq genome collection http://www.ncbi.nlm.nih.gov/genome represents all three major domains of life: Eukarya, Bacteria and Archaea as well as Viruses. Prokaryotic genome sequences are the most rapidly growing part of the collection. During the year of 2014 more than 10,000 microbial genome assemblies have been publicly released bringing the total number of prokaryotic genomes close to 30,000. We continue to improve the quality and usability of the microbial genome resources by providing easy access to the data and the results of the pre-computed analysis, and improving analysis and visualization tools. A number of improvements have been incorporated into the Prokaryotic Genome Annotation Pipeline. Several new features have been added to RefSeq prokaryotic genomes data processing pipeline including the calculation of genome groups (clades) and the optimization of protein clusters generation using pan-genome approach.
Collapse
Affiliation(s)
- Tatiana Tatusova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A 8600 Rockville Pike, Bethesda, MD 20894, USA.
| | - Stacy Ciufo
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Scott Federhen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Boris Fedorov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Richard McVeigh
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Kathleen O'Neill
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Igor Tolstoy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Leonid Zaslavsky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A 8600 Rockville Pike, Bethesda, MD 20894, USA
| |
Collapse
|
13
|
Bao Y, Chetvernin V, Tatusova T. Improvements to pairwise sequence comparison (PASC): a genome-based web tool for virus classification. Arch Virol 2014; 159:3293-304. [PMID: 25119676 PMCID: PMC4221606 DOI: 10.1007/s00705-014-2197-x] [Citation(s) in RCA: 126] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2014] [Accepted: 07/29/2014] [Indexed: 11/28/2022]
Abstract
The number of viral genome sequences in the public databases is increasing dramatically, and these sequences are playing an important role in virus classification. Pairwise sequence comparison is a sequence-based virus classification method. A program using this method calculates the pairwise identities of virus sequences within a virus family and displays their distribution, and visual analysis helps to determine demarcations at different taxonomic levels such as strain, species, genus and subfamily. Subsequent comparison of new sequences against existing ones allows viruses from which the new sequences were derived to be classified. Although this method cannot be used as the only criterion for virus classification in some cases, it is a quantitative method and has many advantages over conventional virus classification methods. It has been applied to several virus families, and there is an increasing interest in using this method for other virus families/groups. The Pairwise Sequence Comparison (PASC) classification tool was created at the National Center for Biotechnology Information. The tool's database stores pairwise identities for complete genomes/segments of 56 virus families/groups. Data in the system are updated every day to reflect changes in virus taxonomy and additions of new virus sequences to the public database. The web interface of the tool ( http://www.ncbi.nlm.nih.gov/sutils/pasc/ ) makes it easy to navigate and perform analyses. Multiple new viral genome sequences can be tested simultaneously with this system to suggest the taxonomic position of virus isolates in a specific family. PASC eliminates potential discrepancies in the results caused by different algorithms and/or different data used by researchers.
Collapse
Affiliation(s)
- Yiming Bao
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA,
| | | | | |
Collapse
|
14
|
Brown GR, Hem V, Katz KS, Ovetsky M, Wallin C, Ermolaeva O, Tolstoy I, Tatusova T, Pruitt KD, Maglott DR, Murphy TD. Gene: a gene-centered information resource at NCBI. Nucleic Acids Res 2014; 43:D36-42. [PMID: 25355515 DOI: 10.1093/nar/gku1055] [Citation(s) in RCA: 409] [Impact Index Per Article: 40.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
The National Center for Biotechnology Information's (NCBI) Gene database (www.ncbi.nlm.nih.gov/gene) integrates gene-specific information from multiple data sources. NCBI Reference Sequence (RefSeq) genomes for viruses, prokaryotes and eukaryotes are the primary foundation for Gene records in that they form the critical association between sequence and a tracked gene upon which additional functional and descriptive content is anchored. Additional content is integrated based on the genomic location and RefSeq transcript and protein sequence data. The content of a Gene record represents the integration of curation and automated processing from RefSeq, collaborating model organism databases, consortia such as Gene Ontology, and other databases within NCBI. Records in Gene are assigned unique, tracked integers as identifiers. The content (citations, nomenclature, genomic location, gene products and their attributes, phenotypes, sequences, interactions, variation details, maps, expression, homologs, protein domains and external databases) is available via interactive browsing through NCBI's Entrez system, via NCBI's Entrez programming utilities (E-Utilities and Entrez Direct) and for bulk transfer by FTP.
Collapse
Affiliation(s)
- Garth R Brown
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Vichet Hem
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Kenneth S Katz
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Michael Ovetsky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Craig Wallin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Olga Ermolaeva
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Igor Tolstoy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Donna R Maglott
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| | - Terence D Murphy
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA
| |
Collapse
|
15
|
Field D, Sterk P, Kottmann R, De Smet JW, Amaral-Zettler L, Cochrane G, Cole JR, Davies N, Dawyndt P, Garrity GM, Gilbert JA, Glöckner FO, Hirschman L, Klenk HP, Knight R, Kyrpides N, Meyer F, Karsch-Mizrachi I, Morrison N, Robbins R, San Gil I, Sansone S, Schriml L, Tatusova T, Ussery D, Yilmaz P, White O, Wooley J, Caporaso G. Genomic Standards Consortium Projects. Stand Genomic Sci 2014. [DOI: 10.4056/sigs.5559608] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
16
|
Field D, Sterk P, Kottmann R, De Smet JW, Amaral-Zettler L, Cochrane G, Cole JR, Davies N, Dawyndt P, Garrity GM, Gilbert JA, Glöckner FO, Hirschman L, Klenk HP, Knight R, Kyrpides N, Meyer F, Karsch-Mizrachi I, Morrison N, Robbins R, San Gil I, Sansone S, Schriml L, Tatusova T, Ussery D, Yilmaz P, White O, Wooley J, Caporaso G. Genomic standards consortium projects. Stand Genomic Sci 2014; 9:599-601. [PMID: 25197446 PMCID: PMC4148985 DOI: 10.4056/sigs.5559680] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The Genomic Standards Consortium (GSC) is an open-membership community that was founded in 2005 to work towards the development, implementation and harmonization of standards in the field of genomics. Starting with the defined task of establishing a minimal set of descriptions the GSC has evolved into an active standards-setting body that currently has 18 ongoing projects, with additional projects regularly proposed from within and outside the GSC. Here we describe our recently enacted policy for proposing new activities that are intended to be taken on by the GSC, along with the template for proposing such new activities.
Collapse
Affiliation(s)
- Dawn Field
- Centre for Ecology and Hydrology, Maclean Building, Benson Lane, Crowmarsh Gifford, Wallingford, Oxfordshire, OX10 8BB, United Kingdom ; Oxford e-Research Centre, University of Oxford, Oxford, United Kingdom
| | - Peter Sterk
- Oxford e-Research Centre, University of Oxford, Oxford, United Kingdom
| | - Renzo Kottmann
- Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Bremen & Jacobs University Bremen, Germany
| | - J Wim De Smet
- Department of Applied Mathematics and Computer Science, Ghent University, Ghent, Belgium
| | - Linda Amaral-Zettler
- The Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts, USA
| | - Guy Cochrane
- European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - James R Cole
- Center for Microbial Ecology, Michigan State University, East Lansing, Michigan, USA
| | - Neil Davies
- Gump South Pacific Research Station, University of California Berkeley, Moorea, French Polynesia ; Biodiversity Institute, Department of Zoology, University of Oxford, The Tinbergen Building, South Parks Road, Oxford, United Kingdom
| | - Peter Dawyndt
- Department of Applied Mathematics and Computer Science, Ghent University, Ghent, Belgium
| | - George M Garrity
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, Michigan, USA
| | - Jack A Gilbert
- Institute for Genomic and Systems Biology, Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL, USA ; Department of Ecology and Evolution, University of Chicago, 5640 South Ellis Avenue, Chicago, IL, USA
| | - Frank Oliver Glöckner
- Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Bremen & Jacobs University Bremen, Germany
| | - Lynette Hirschman
- Information Technology Center, The MITRE Corporation, Bedford, Massachusetts, USA
| | - Hans-Peter Klenk
- DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany
| | - Rob Knight
- Howard Hughes Medical Institute, and BioFrontiers Institute, and Department of Chemistry and Biochemistry, University of Colorado, Boulder, Colorado, USA
| | - Nikos Kyrpides
- DOE Joint Genome Institute, Walnut Creek, California, USA
| | - Folker Meyer
- Institute for Genomic and Systems Biology, Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL, USA ; Computation Institute, University of Chicago, Chicago, Illinois, USA
| | - Ilene Karsch-Mizrachi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Norman Morrison
- University of Manchester, Oxford Road, Manchester, United Kingdom
| | - Robert Robbins
- University of California San Diego, La Jolla, California USA
| | - Inigo San Gil
- LTER Network Office, Department of Biology, University of New Mexico, Albuquerque, New Mexico, USA
| | - Susanna Sansone
- Oxford e-Research Centre, University of Oxford, Oxford, United Kingdom
| | - Lynn Schriml
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Dave Ussery
- DOE Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA
| | - Pelin Yilmaz
- Microbial Genomics Group, Max Planck Institute for Marine Microbiology, Bremen & Jacobs University Bremen, Germany
| | - Owen White
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - John Wooley
- University of California San Diego, La Jolla, California USA
| | - Gregory Caporaso
- Center for Microbial Genetics and Genomics, Northern Arizona University, Flagstaff, Arizona, USA
| |
Collapse
|
17
|
Abstract
The source of the microbial genomic sequences in the RefSeq collection is the set of primary sequence records submitted to the International Nucleotide Sequence Database public archives. These can be accessed through the Entrez search and retrieval system at http://www.ncbi.nlm.nih.gov/genome. Next-generation sequencing has enabled researchers to perform genomic sequencing at rates that were unimaginable in the past. Microbial genomes can now be sequenced in a matter of hours, which has led to a significant increase in the number of assembled genomes deposited in the public archives. This huge increase in DNA sequence data presents new challenges for the annotation, analysis and visualization bioinformatics tools. New strategies have been developed for the annotation and representation of reference genomes and sequence variations derived from population studies and clinical outbreaks.
Collapse
Affiliation(s)
- Tatiana Tatusova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bldg. 38A 8600 Rockville Pike, Bethesda, MD 20894, USA
| | | | | | | | | |
Collapse
|
18
|
Bao Y, Chetvernin V, Tatusova T. PAirwise Sequence Comparison (PASC) and its application in the classification of filoviruses. Viruses 2012; 4:1318-27. [PMID: 23012628 PMCID: PMC3446765 DOI: 10.3390/v4081318] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2012] [Revised: 08/16/2012] [Accepted: 08/16/2012] [Indexed: 11/26/2022] Open
Abstract
PAirwise Sequence Comparison (PASC) is a tool that uses genome sequence similarity to help with virus classification. The PASC tool at NCBI uses two methods: local alignment based on BLAST and global alignment based on Needleman-Wunsch algorithm. It works for complete genomes of viruses of several families/groups, and for the family of Filoviridae, it currently includes 52 complete genomes available in GenBank. It has been shown that BLAST-based alignment approach works better for filoviruses, and therefore is recommended for establishing taxon demarcation criteria. When more genome sequences with high divergence become available, these demarcations will most likely become more precise. The tool can compare new genome sequences of filoviruses with the ones already in the database, and propose their taxonomic classification.
Collapse
Affiliation(s)
- Yiming Bao
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20892, USA.
| | | | | |
Collapse
|
19
|
Barrett T, Clark K, Gevorgyan R, Gorelenkov V, Gribov E, Karsch-Mizrachi I, Kimelman M, Pruitt KD, Resenchuk S, Tatusova T, Yaschenko E, Ostell J. BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata. Nucleic Acids Res 2011; 40:D57-63. [PMID: 22139929 PMCID: PMC3245069 DOI: 10.1093/nar/gkr1163] [Citation(s) in RCA: 210] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
As the volume and complexity of data sets archived at NCBI grow rapidly, so does the need to gather and organize the associated metadata. Although metadata has been collected for some archival databases, previously, there was no centralized approach at NCBI for collecting this information and using it across databases. The BioProject database was recently established to facilitate organization and classification of project data submitted to NCBI, EBI and DDBJ databases. It captures descriptive information about research projects that result in high volume submissions to archival databases, ties together related data across multiple archives and serves as a central portal by which to inform users of data availability. Concomitantly, the BioSample database is being developed to capture descriptive information about the biological samples investigated in projects. BioProject and BioSample records link to corresponding data stored in archival repositories. Submissions are supported by a web-based Submission Portal that guides users through a series of forms for input of rich metadata describing their projects and samples. Together, these databases offer improved ways for users to query, locate, integrate and interpret the masses of data held in NCBI's archival repositories. The BioProject and BioSample databases are available at http://www.ncbi.nlm.nih.gov/bioproject and http://www.ncbi.nlm.nih.gov/biosample, respectively.
Collapse
Affiliation(s)
- Tanya Barrett
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 45 Center Drive, Bethesda, MD 20892, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Abstract
The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of genomic, transcript and protein sequence records. These records are selected and curated from public sequence archives and represent a significant reduction in redundancy compared to the volume of data archived by the International Nucleotide Sequence Database Collaboration. The database includes over 16 000 organisms, 2.4 × 106 genomic records, 13 × 106 proteins and 2 × 106 RNA records spanning prokaryotes, eukaryotes and viruses (RefSeq release 49, September 2011). The RefSeq database is maintained by a combined approach of automated analyses, collaboration and manual curation to generate an up-to-date representation of the sequence, its features, names and cross-links to related sources of information. We report here on recent growth, the status of curating the human RefSeq data set, more extensive feature annotation and current policy for eukaryotic genome annotation via the NCBI annotation pipeline. More information about the resource is available online (see http://www.ncbi.nlm.nih.gov/RefSeq/).
Collapse
Affiliation(s)
- Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA.
| | | | | | | |
Collapse
|
21
|
Klimke W, O'Donovan C, White O, Brister JR, Clark K, Fedorov B, Mizrachi I, Pruitt KD, Tatusova T. Solving the Problem: Genome Annotation Standards before the Data Deluge. Stand Genomic Sci 2011; 5:168-93. [PMID: 22180819 PMCID: PMC3236044 DOI: 10.4056/sigs.2084864] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries.
Collapse
|
22
|
Abstract
The National Center for Biotechnology Information (NCBI) is well known for the nucleotide sequence archive, GenBank and sequence analysis tool BLAST. However, NCBI integrates many types of biomolecular data from variety of sources and makes it available to the scientific community as interactive web resources as well as organized releases of bulk data. These tools are available to explore and compare fungal genomes. Searching all databases with Fungi [organism] at http://www.ncbi.nlm.nih.gov/ is the quickest way to find resources of interest with fungal entries. Some tools though are resources specific and can be indirectly accessed from a particular database in the Entrez system. These include graphical viewers and comparative analysis tools such as TaxPlot, TaxMap and UniGene DDD (found via UniGene Homepage). Gene and BioProject pages also serve as portals to external data such as community annotation websites, BioGrid and UniProt. There are many different ways of accessing genomic data at NCBI. Depending on the focus and goal of research projects or the level of interest, a user would select a particular route for accessing genomic databases and resources. This review article describes methods of accessing fungal genome data and provides examples that illustrate the use of analysis tools.
Collapse
Affiliation(s)
- B Robbertse
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, 45 Center Drive, Bethesda, MD 20892
| | | |
Collapse
|
23
|
Field D, Garrity G, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N. eGenomics: Cataloguing our Complete Genome Collection. Comp Funct Genomics 2011; 6:363-8. [PMID: 18629208 PMCID: PMC2447503 DOI: 10.1002/cfg.494] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2005] [Revised: 10/03/2005] [Accepted: 10/04/2005] [Indexed: 11/17/2022] Open
Affiliation(s)
- Dawn Field
- Molecular Evolution and Bioinformatics Section Mansfield Road Oxford Centre for Ecology and Hydrology Oxford OX1 3SR United Kingdom
| | | | | | | | | | | | | |
Collapse
|
24
|
Field D, Amaral-Zettler L, Cochrane G, Cole JR, Dawyndt P, Garrity GM, Gilbert J, Glöckner FO, Hirschman L, Karsch-Mizrachi I, Klenk HP, Knight R, Kottmann R, Kyrpides N, Meyer F, San Gil I, Sansone SA, Schriml LM, Sterk P, Tatusova T, Ussery DW, White O, Wooley J. The Genomic Standards Consortium. PLoS Biol 2011; 9:e1001088. [PMID: 21713030 PMCID: PMC3119656 DOI: 10.1371/journal.pbio.1001088] [Citation(s) in RCA: 135] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
A vast and rich body of information has grown up as a result of the world's enthusiasm for 'omics technologies. Finding ways to describe and make available this information that maximise its usefulness has become a major effort across the 'omics world. At the heart of this effort is the Genomic Standards Consortium (GSC), an open-membership organization that drives community-based standardization activities, Here we provide a short history of the GSC, provide an overview of its range of current activities, and make a call for the scientific community to join forces to improve the quality and quantity of contextual information about our public collections of genomes, metagenomes, and marker gene sequences.
Collapse
Affiliation(s)
- Dawn Field
- Centre for Ecology & Hydrology, Maclean Building, Crowmarsh Gifford, Wallingford, Oxfordshire, United Kingdom.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Abstract
We describe a new program called cryptic splice finder (CSF) that can reliably identify cryptic splice sites (css), so providing a useful tool to help investigate splicing mutations in genetic disease. We report that many css are not entirely dormant and are often already active at low levels in normal genes prior to their enhancement in genetic disease. We also report a fascinating correlation between the positions of css and introns, whereby css within the exons of one species frequently match the exact position of introns in equivalent genes from another species. These results strongly indicate that many introns were inserted into css during evolution and they also imply that the splicing information that lies outside some introns can be independently recognized by the splicing machinery and was in place prior to intron insertion. This indicates that non-intronic splicing information had a key role in shaping the split structure of eukaryote genes.
Collapse
Affiliation(s)
- Yuri Kapustin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20814, USA.
| | | | | | | | | | | | | | | |
Collapse
|
26
|
Gaudet P, Bairoch A, Field D, Sansone SA, Taylor C, Attwood TK, Bateman A, Blake JA, Bult CJ, Cherry JM, Chisholm RL, Cochrane G, Cook CE, Eppig JT, Galperin MY, Gentleman R, Goble CA, Gojobori T, Hancock JM, Howe DG, Imanishi T, Kelso J, Landsman D, Lewis SE, Karsch Mizrachi I, Orchard S, Ouellette BF, Ranganathan S, Richardson L, Rocca-Serra P, Schofield PN, Smedley D, Southan C, Tan TW, Tatusova T, Whetzel PL, White O, Yamasaki C. Towards BioDBcore: a community-defined information specification for biological databases. Database (Oxford) 2011; 2011:baq027. [PMID: 21205783 PMCID: PMC3017395 DOI: 10.1093/database/baq027] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
The present article proposes the adoption of a community-defined, uniform, generic description of the core attributes of biological databases, BioDBCore. The goals of these attributes are to provide a general overview of the database landscape, to encourage consistency and interoperability between resources; and to promote the use of semantic and syntactic standards. BioDBCore will make it easier for users to evaluate the scope and relevance of available resources. This new resource will increase the collective impact of the information present in biological databases.
Collapse
Affiliation(s)
- Pascale Gaudet
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
- *Corresponding author: Tel: +41-22-379-5050; Fax: +41-22-379-5858;
| | - Amos Bairoch
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Dawn Field
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Susanna-Assunta Sansone
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Chris Taylor
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Teresa K. Attwood
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Alex Bateman
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Judith A. Blake
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Carol J. Bult
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - J. Michael Cherry
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Rex L. Chisholm
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Guy Cochrane
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Charles E. Cook
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Janan T. Eppig
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Michael Y. Galperin
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Robert Gentleman
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Carole A. Goble
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Takashi Gojobori
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - John M. Hancock
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Douglas G. Howe
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Tadashi Imanishi
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Janet Kelso
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - David Landsman
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Suzanna E. Lewis
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Ilene Karsch Mizrachi
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Sandra Orchard
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - B.F. Francis Ouellette
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Shoba Ranganathan
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Lorna Richardson
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Philippe Rocca-Serra
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Paul N. Schofield
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Damian Smedley
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Christopher Southan
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Tin W. Tan
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Tatiana Tatusova
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Patricia L. Whetzel
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Owen White
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Chisato Yamasaki
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR UK, University of Oxford, Oxford e-Research Centre, Oxford, OX1 3QG, UK, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK, School of Computer Science, The University of Manchester, Manchester M13 9PT, UK, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609 USA, Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, USA, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Japan, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, OR 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, CA 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, UK, Dept of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA 94305, USA and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| |
Collapse
|
27
|
Davidsen T, Madupu R, Sterk P, Field D, Garrity G, Gilbert J, Glöckner FO, Hirschman L, Kolker E, Kottmann R, Kyrpides N, Meyer F, Morrison N, Schriml L, Tatusova T, Wooley J. Meeting Report from the Genomic Standards Consortium (GSC) Workshop 9. Stand Genomic Sci 2010; 3:216-24. [PMID: 21304722 PMCID: PMC3035308 DOI: 10.4056/sigs.1353455] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
This report summarizes the proceedings of the 9th workshop of the Genomic Standards Consortium (GSC), held at the J. Craig Venter Institute, Rockville, MD, USA. It was the first GSC workshop to have open registration and attracted over 90 participants. This workshop featured sessions that provided overviews of the full range of ongoing GSC projects. It included sessions on Standards in Genomic Sciences, the open access journal of the GSC, building standards for genome annotation, the M5 platform for next-generation collaborative computational infrastructures, building ties with the biodiversity research community and two discussion panels with government and industry participants. Progress was made on all fronts, and major outcomes included the completion of the MIENS specification for publication and the formation of the Biodiversity working group.
Collapse
|
28
|
Abstract
Entrez Gene (http://www.ncbi.nlm.nih.gov/gene) is National Center for Biotechnology Information (NCBI)’s database for gene-specific information. Entrez Gene maintains records from genomes which have been completely sequenced, which have an active research community to submit gene-specific information, or which are scheduled for intense sequence analysis. The content represents the integration of curation and automated processing from NCBI’s Reference Sequence project (RefSeq), collaborating model organism databases, consortia such as Gene Ontology and other databases within NCBI. Records in Entrez Gene are assigned unique, stable and tracked integers as identifiers. The content (nomenclature, genomic location, gene products and their attributes, markers, phenotypes and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases) is available via interactive browsing through NCBI’s Entrez system, via NCBI’s Entrez programming utilities (E-Utilities) and for bulk transfer by FTP.
Collapse
Affiliation(s)
- Donna Maglott
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20852, USA.
| | | | | | | |
Collapse
|
29
|
Gaudet P, Bairoch A, Field D, Sansone SA, Taylor C, Attwood TK, Bateman A, Blake JA, Bult CJ, Cherry JM, Chisholm RL, Cochrane G, Cook CE, Eppig JT, Galperin MY, Gentleman R, Goble CA, Gojobori T, Hancock JM, Howe DG, Imanishi T, Kelso J, Landsman D, Lewis SE, Mizrachi IK, Orchard S, Ouellette BFF, Ranganathan S, Richardson L, Rocca-Serra P, Schofield PN, Smedley D, Southan C, Tan TW, Tatusova T, Whetzel PL, White O, Yamasaki C. Towards BioDBcore: a community-defined information specification for biological databases. Nucleic Acids Res 2010; 39:D7-10. [PMID: 21097465 PMCID: PMC3013734 DOI: 10.1093/nar/gkq1173] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
The present article proposes the adoption of a community-defined, uniform, generic description of the core attributes of biological databases, BioDBCore. The goals of these attributes are to provide a general overview of the database landscape, to encourage consistency and interoperability between resources and to promote the use of semantic and syntactic standards. BioDBCore will make it easier for users to evaluate the scope and relevance of available resources. This new resource will increase the collective impact of the information present in biological databases.
Collapse
Affiliation(s)
- Pascale Gaudet
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
- *To whom correspondence should be addressed. Tel: +41 22 379 5050; Fax: +41 22 379 5858;
| | - Amos Bairoch
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Dawn Field
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Susanna-Assunta Sansone
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Chris Taylor
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Teresa K. Attwood
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Alex Bateman
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Judith A. Blake
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Carol J. Bult
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - J. Michael Cherry
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Rex L. Chisholm
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Guy Cochrane
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Charles E. Cook
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Janan T. Eppig
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Michael Y. Galperin
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Robert Gentleman
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Carole A. Goble
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Takashi Gojobori
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - John M. Hancock
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Douglas G. Howe
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Tadashi Imanishi
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Janet Kelso
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - David Landsman
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Suzanna E. Lewis
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Ilene Karsch Mizrachi
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Sandra Orchard
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - B. F. Francis Ouellette
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Shoba Ranganathan
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Lorna Richardson
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Philippe Rocca-Serra
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Paul N. Schofield
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Damian Smedley
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Christopher Southan
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Tin Wee Tan
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Tatiana Tatusova
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Patricia L. Whetzel
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Owen White
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Chisato Yamasaki
- Swiss Institute of Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA, NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, School of Computer Science, The University of Manchester, Manchester M13 9PT, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK, The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, Department of Genetics, Stanford University, Stanford, California 94305-5120, NCBI, NLM, National Institutes of Health, Bethesda, MD 20894, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA, DATABASE, The Journal of Biological Databases and Curation, Oxford University Press, Oxford OX2 6DP, UK, Biomedicinal Information Research Center, National Institute of Advanced Industrial Science and Technology, 2-42 Aomi Koto-ku, Tokyo 135-0064, Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Oxfordshire, OX11 0RD, UK, The Zebrafish Model Organism Database, 5291 University of Oregon, Eugene, Oregon 97401-5291, USA, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany, Genomics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road 64R0121 Berkeley, California 94720, USA, Ontario Institute for Cancer Research, Suite 800, 101 College Street, Toronto, Ontario, M5G 0A3, Canada, Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney NSW 2109, Australia, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh, EH4 2XU, Department of Physiology, Development and Neuroscience, University of Cambridge, Doning Street, Cambridge CB2 3EG, UK, ChrisDS Consulting, Göteborg, Sweden, Stanford Center for Biomedical Informatics Research, National Center for Biomedical Ontology, Stanford University, Stanford, CA, 94305 and Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| |
Collapse
|
30
|
Hirschman L, Sterk P, Field D, Wooley J, Cochrane G, Gilbert J, Kolker E, Kyrpides N, Meyer F, Mizrachi I, Nakamura Y, Sansone SA, Schriml L, Tatusova T, White O, Yilmaz P. Meeting Report: "Metagenomics, Metadata and Meta-analysis" (M3) Workshop at the Pacific Symposium on Biocomputing 2010. Stand Genomic Sci 2010; 2:357-60. [PMID: 21304719 PMCID: PMC3035291 DOI: 10.4056/sigs.802738] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
This report summarizes the M3 Workshop held at the January 2010 Pacific Symposium on Biocomputing. The workshop, organized by Genomic Standards Consortium members, included five contributed talks, a series of short presentations from stakeholders in the genomics standards community, a poster session, and, in the evening, an open discussion session to review current projects and examine future directions for the GSC and its stakeholders.
Collapse
|
31
|
Abstract
The National Center for Biotechnology Information (NCBI), as a primary public repository of genomic sequence data, collects and maintains enormous amounts of heterogeneous data. Data for genomes, genes, gene expressions, gene variation, gene families, proteins, and protein domains are integrated with the analytical, search, and retrieval resources through the NCBI Web site. Entrez, a text-based search and retrieval system, provides a fast and easy way to navigate across diverse biological databases.Customized genomic BLAST enables sequence similarity searches against a special collection of organism-specific sequence data and viewing the resulting alignments within a genomic context using NCBI's genome browser, Map Viewer.Comparative genome analysis tools lead to further understanding of evolutionary processes, quickening the pace of discovery.
Collapse
|
32
|
Zaslavsky L, Tatusova T. Mining the NCBI Influenza Sequence Database: adaptive grouping of BLAST results using precalculated neighbor indexing. PLoS Curr 2009; 1:RRN1124. [PMID: 20029662 PMCID: PMC2771650 DOI: 10.1371/currents.rrn1124] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 10/30/2009] [Indexed: 11/19/2022]
Abstract
The Influenza Virus Resource and other Virus Variation Resources at NCBI provide enhanced visualization web tools for exploratory analysis for influenza sequence data. Despite the improvements in data analysis, the initial data retrieval remains unsophisticated, frequently producing huge and imbalanced datasets due to the large number of identical and nearly-identical sequences in the database.We propose a data mining algorithm to organize reported sequences into groups based on their relatedness to the query sequence and to each other. The algorithm uses BLAST to find database sequences related to the query. Neighbor lists precalculated from pairwise BLAST alignments between database sequences are used to organize results in groups of nearly-identical and strongly related sequences. We propose to use a non-symmetric dissimilarity measure well crafted for dealing with sequences of different length (fragments).A balanced and representative data set produced by this tool can be used for further analysis, i.e. multiple sequence alignment and phylogenetic trees. The algorithm is implemented for protein coding sequences and is being integrated with the NCBI Influenza Virus Resource.
Collapse
Affiliation(s)
- Leonid Zaslavsky
- National Center for Biotechnology Information, National Institutes of Health
| | | |
Collapse
|
33
|
Klimke W, Agarwala R, Badretdin A, Chetvernin S, Ciufo S, Fedorov B, Kiryutin B, O'Neill K, Resch W, Resenchuk S, Schafer S, Tolstoy I, Tatusova T. The National Center for Biotechnology Information's Protein Clusters Database. Nucleic Acids Res 2008; 37:D216-23. [PMID: 18940865 PMCID: PMC2686591 DOI: 10.1093/nar/gkn734] [Citation(s) in RCA: 219] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Rapid increases in DNA sequencing capabilities have led to a vast increase in the data generated from prokaryotic genomic studies, which has been a boon to scientists studying micro-organism evolution and to those who wish to understand the biological underpinnings of microbial systems. The NCBI Protein Clusters Database (ProtClustDB) has been created to efficiently maintain and keep the deluge of data up to date. ProtClustDB contains both curated and uncurated clusters of proteins grouped by sequence similarity. The May 2008 release contains a total of 285 386 clusters derived from over 1.7 million proteins encoded by 3806 nt sequences from the RefSeq collection of complete chromosomes and plasmids from four major groups: prokaryotes, bacteriophages and the mitochondrial and chloroplast organelles. There are 7180 clusters containing 376 513 proteins with curated gene and protein functional annotation. PubMed identifiers and external cross references are collected for all clusters and provide additional information resources. A suite of web tools is available to explore more detailed information, such as multiple alignments, phylogenetic trees and genomic neighborhoods. ProtClustDB provides an efficient method to aggregate gene and protein annotation for researchers and is available at http://www.ncbi.nlm.nih.gov/sites/entrez?db=proteinclusters.
Collapse
Affiliation(s)
- William Klimke
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
NCBI's Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. RefSeq records integrate information from multiple sources and represent a current description of the sequence, the gene and sequence features. The database includes over 5300 organisms spanning prokaryotes, eukaryotes and viruses, with records for more than 5.5 × 106 proteins (RefSeq release 30). Feature annotation is applied by a combination of curation, collaboration, propagation from other sources and computation. We report here on the recent growth of the database, recent changes to feature annotations and record types for eukaryotic (primarily vertebrate) species and policies regarding species inclusion and genome annotation. In addition, we introduce RefSeqGene, a new initiative to support reporting variation data on a stable genomic coordinate system.
Collapse
Affiliation(s)
- Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Rm 4As.47B, 45 Center Drive, Bethesda, MD, USA.
| | | | | | | |
Collapse
|
35
|
Field D, Garrity GM, Sansone SA, Sterk P, Gray T, Kyrpides N, Hirschman L, Glöckner FO, Kottmann R, Angiuoli S, White O, Dawyndt P, Thomson N, Gil IS, Morrison N, Tatusova T, Mizrachi I, Vaughan R, Cochrane G, Kagan L, Murphy S, Schriml L. Meeting report: the fifth Genomic Standards Consortium (GSC) workshop. OMICS 2008; 12:109-13. [PMID: 18564915 DOI: 10.1089/omi.2008.a3b3] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
This meeting report summarizes the proceedings of the fifth Genomic Standards Consortium (GSC) workshop held December 12-14, 2007, at the European Bioinformatics Institute (EBI), Cambridge, UK. This fifth workshop served as a milestone event in the evolution of the GSC (launched in September 2005); the key outcome of the workshop was the finalization of a stable version of the MIGS specification (v2.0) for publication. This accomplishment enables, and also in some cases necessitates, downstream activities, which are described in the multiauthor, consensus-driven articles in this special issue of OMICS produced as a direct result of the workshop. This report briefly summarizes the workshop and overviews the special issue. In particular, it aims to explain how the various GSC-led projects are working together to help this community achieve its stated mission of further standardizing the descriptions of genomes and metagenomes and implementing improved mechanisms of data exchange and integration to enable more accurate comparative analyses. Further information about the GSC and its range of activities can be found at http://gensc.org.
Collapse
Affiliation(s)
- Dawn Field
- NERC Center for Ecology and Hydrology, Oxford, United Kingdom.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Angiuoli SV, Gussman A, Klimke W, Cochrane G, Field D, Garrity G, Kodira CD, Kyrpides N, Madupu R, Markowitz V, Tatusova T, Thomson N, White O. Toward an online repository of Standard Operating Procedures (SOPs) for (meta)genomic annotation. OMICS 2008; 12:137-41. [PMID: 18416670 PMCID: PMC3196215 DOI: 10.1089/omi.2008.0017] [Citation(s) in RCA: 544] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The methodologies used to generate genome and metagenome annotations are diverse and vary between groups and laboratories. Descriptions of the annotation process are helpful in interpreting genome annotation data. Some groups have produced Standard Operating Procedures (SOPs) that describe the annotation process, but standards are lacking for structure and content of these descriptions. In addition, there is no central repository to store and disseminate procedures and protocols for genome annotation. We highlight the importance of SOPs for genome annotation and endorse an online repository of SOPs.
Collapse
Affiliation(s)
- Samuel V Angiuoli
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Kapustin Y, Souvorov A, Tatusova T, Lipman D. Splign: algorithms for computing spliced alignments with identification of paralogs. Biol Direct 2008; 3:20. [PMID: 18495041 PMCID: PMC2440734 DOI: 10.1186/1745-6150-3-20] [Citation(s) in RCA: 244] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2008] [Accepted: 05/21/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The computation of accurate alignments of cDNA sequences against a genome is at the foundation of modern genome annotation pipelines. Several factors such as presence of paralogs, small exons, non-consensus splice signals, sequencing errors and polymorphic sites pose recognized difficulties to existing spliced alignment algorithms. RESULTS We describe a set of algorithms behind a tool called Splign for computing cDNA-to-Genome alignments. The algorithms include a high-performance preliminary alignment, a compartment identification based on a formally defined model of adjacent duplicated regions, and a refined sequence alignment. In a series of tests, Splign has produced more accurate results than other tools commonly used to compute spliced alignments, in a reasonable amount of time. CONCLUSION Splign's ability to deal with various issues complicating the spliced alignment problem makes it a helpful tool in eukaryotic genome annotation processes and alternative splicing studies. Its performance is enough to align the largest currently available pools of cDNA data such as the human EST set on a moderate-sized computing cluster in a matter of hours. The duplications identification (compartmentization) algorithm can be used independently in other areas such as the study of pseudogenes.
Collapse
Affiliation(s)
- Yuri Kapustin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20814, USA.
| | | | | | | |
Collapse
|
38
|
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV, Ashburner M, Axelrod N, Baldauf S, Ballard S, Boore J, Cochrane G, Cole J, Dawyndt P, De Vos P, DePamphilis C, Edwards R, Faruque N, Feldman R, Gilbert J, Gilna P, Glöckner FO, Goldstein P, Guralnick R, Haft D, Hancock D, Hermjakob H, Hertz-Fowler C, Hugenholtz P, Joint I, Kagan L, Kane M, Kennedy J, Kowalchuk G, Kottmann R, Kolker E, Kravitz S, Kyrpides N, Leebens-Mack J, Lewis SE, Li K, Lister AL, Lord P, Maltsev N, Markowitz V, Martiny J, Methe B, Mizrachi I, Moxon R, Nelson K, Parkhill J, Proctor L, White O, Sansone SA, Spiers A, Stevens R, Swift P, Taylor C, Tateno Y, Tett A, Turner S, Ussery D, Vaughan B, Ward N, Whetzel T, San Gil I, Wilson G, Wipat A. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol 2008; 26:541-7. [PMID: 18464787 PMCID: PMC2409278 DOI: 10.1038/nbt1360] [Citation(s) in RCA: 969] [Impact Index Per Article: 60.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
With the quantity of genomic data increasing at an exponential rate, it is imperative that these data be captured electronically, in a standard format. Standardization activities must proceed within the auspices of open-access and international working bodies. To tackle the issues surrounding the development of better descriptions of genomic investigations, we have formed the Genomic Standards Consortium (GSC). Here, we introduce the minimum information about a genome sequence (MIGS) specification with the intent of promoting participation in its development and discussing the resources that will be required to develop improved mechanisms of metadata capture and exchange. As part of its wider goals, the GSC also supports improving the 'transparency' of the information contained in existing genomic databases.
Collapse
Affiliation(s)
- Dawn Field
- Natural Environmental Research Council Centre for Ecology and Hydrology, Oxford OX1 3SR, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Tanaka T, Antonio BA, Kikuchi S, Matsumoto T, Nagamura Y, Numa H, Sakai H, Wu J, Itoh T, Sasaki T, Aono R, Fujii Y, Habara T, Harada E, Kanno M, Kawahara Y, Kawashima H, Kubooka H, Matsuya A, Nakaoka H, Saichi N, Sanbonmatsu R, Sato Y, Shinso Y, Suzuki M, Takeda JI, Tanino M, Todokoro F, Yamaguchi K, Yamamoto N, Yamasaki C, Imanishi T, Okido T, Tada M, Ikeo K, Tateno Y, Gojobori T, Lin YC, Wei FJ, Hsing YI, Zhao Q, Han B, Kramer MR, McCombie RW, Lonsdale D, O'Donovan CC, Whitfield EJ, Apweiler R, Koyanagi KO, Khurana JP, Raghuvanshi S, Singh NK, Tyagi AK, Haberer G, Fujisawa M, Hosokawa S, Ito Y, Ikawa H, Shibata M, Yamamoto M, Bruskiewich RM, Hoen DR, Bureau TE, Namiki N, Ohyanagi H, Sakai Y, Nobushima S, Sakata K, Barrero RA, Sato Y, Souvorov A, Smith-White B, Tatusova T, An S, An G, OOta S, Fuks G, Fuks G, Messing J, Christie KR, Lieberherr D, Kim H, Zuccolo A, Wing RA, Nobuta K, Green PJ, Lu C, Meyers BC, Chaparro C, Piegu B, Panaud O, Echeverria M. The Rice Annotation Project Database (RAP-DB): 2008 update. Nucleic Acids Res 2007; 36:D1028-33. [PMID: 18089549 PMCID: PMC2238920 DOI: 10.1093/nar/gkm978] [Citation(s) in RCA: 210] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The Rice Annotation Project Database (RAP-DB) was created to provide the genome sequence assembly of the International Rice Genome Sequencing Project (IRGSP), manually curated annotation of the sequence, and other genomics information that could be useful for comprehensive understanding of the rice biology. Since the last publication of the RAP-DB, the IRGSP genome has been revised and reassembled. In addition, a large number of rice-expressed sequence tags have been released, and functional genomics resources have been produced worldwide. Thus, we have thoroughly updated our genome annotation by manual curation of all the functional descriptions of rice genes. The latest version of the RAP-DB contains a variety of annotation data as follows: clone positions, structures and functions of 31 439 genes validated by cDNAs, RNA genes detected by massively parallel signature sequencing (MPSS) technology and sequence similarity, flanking sequences of mutant lines, transposable elements, etc. Other annotation data such as Gnomon can be displayed along with those of RAP for comparison. We have also developed a new keyword search system to allow the user to access useful information. The RAP-DB is available at: http://rapdb.dna.affrc.go.jp/ and http://rapdb.lab.nig.ac.jp/.
Collapse
Affiliation(s)
-
- National Institute of Agrobiological Sciences, Ibaraki 305-8602, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Abstract
Fundamental biological processes can now be studied by applying the full range of OMICS technologies (genomics, transcriptomics, proteomics, metabolomics, and beyond) to the same biological sample. Clearly, it would be desirable if the concept of sample were shared among these technologies, especially as up until the time a biological sample is prepared for use in a specific OMICS assay, its description is inherently technology independent. Sharing a common informatic representation would encourage data sharing (rather than data replication), thereby reducing redundant data capture and the potential for error. This would result in a significant degree of harmonization across different OMICS data standardization activities, a task that is critical if we are to integrate data from these different data sources. Here, we review the current concept of sample in OMICS technologies as it is being dealt with by different OMICS standardization initiatives and discuss the special role that the newly formed Genomic Standards Consortium (GSC) might have to play in this domain.
Collapse
Affiliation(s)
- Norman Morrison
- School of Computer Science, University of Manchester, United Kingdom
| | | | | | | | | | | | | |
Collapse
|
41
|
Hiller NL, Janto B, Hogg JS, Boissy R, Yu S, Powell E, Keefe R, Ehrlich NE, Shen K, Hayes J, Barbadora K, Klimke W, Dernovoy D, Tatusova T, Parkhill J, Bentley SD, Post JC, Ehrlich GD, Hu FZ. Comparative genomic analyses of seventeen Streptococcus pneumoniae strains: insights into the pneumococcal supragenome. J Bacteriol 2007; 189:8186-95. [PMID: 17675389 PMCID: PMC2168654 DOI: 10.1128/jb.00690-07] [Citation(s) in RCA: 203] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The distributed-genome hypothesis (DGH) states that pathogenic bacteria possess a supragenome that is much larger than the genome of any single bacterium and that these pathogens utilize genetic recombination and a large, noncore set of genes as a means of diversity generation. We sequenced the genomes of eight nasopharyngeal strains of Streptococcus pneumoniae isolated from pediatric patients with upper respiratory symptoms and performed quantitative genomic analyses among these and nine publicly available pneumococcal strains. Coding sequences from all strains were grouped into 3,170 orthologous gene clusters, of which 1,454 (46%) were conserved among all 17 strains. The majority of the gene clusters, 1,716 (54%), were not found in all strains. Genic differences per strain pair ranged from 35 to 629 orthologous clusters, with each strain's genome containing between 21 and 32% noncore genes. The distribution of the orthologous clusters per genome for the 17 strains was entered into the finite-supragenome model, which predicted that (i) the S. pneumoniae supragenome contains more than 5,000 orthologous clusters and (ii) 99% of the orthologous clusters ( approximately 3,000) that are represented in the S. pneumoniae population at frequencies of >or=0.1 can be identified if 33 representative genomes are sequenced. These extensive genic diversity data support the DGH and provide a basis for understanding the great differences in clinical phenotype associated with various pneumococcal strains. When these findings are taken together with previous studies that demonstrated the presence of a supragenome for Streptococcus agalactiae and Haemophilus influenzae, it appears that the possession of a distributed genome is a common host interaction strategy.
Collapse
Affiliation(s)
- N Luisa Hiller
- Allegheny General Hospital, Allegheny-Singer Research Institute, Center for Genomic Sciences, Pittsburgh, PA 15212, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Abstract
FLAN (short for FLu ANnotation), the NCBI web server for genome annotation of influenza virus (http://www.ncbi.nlm.nih.gov/genomes/FLU/Database/annotation.cgi) is a tool for user-provided influenza A virus or influenza B virus sequences. It can validate and predict protein sequences encoded by an input flu sequence. The input sequence is BLASTed against a database containing influenza sequences to determine the virus type (A or B), segment (1 through 8) and subtype for the hemagglutinin and neuraminidase segments of influenza A virus. For each segment/subtype of the viruses, a set of sample protein sequences is maintained. The input sequence is then aligned against the corresponding protein set with a 'Protein to nucleotide alignment tool' (ProSplign). The translated product from the best alignment to the sample protein sequence is used as the predicted protein encoded by the input sequence. The output can be a feature table that can be used for sequence submission to GenBank (by Sequin or tbl2asn), a GenBank flat file, or the predicted protein sequences in FASTA format. A message showing the length of the input sequence, the predicted virus type, segment and subtype for the hemagglutinin and neuraminidase segments of Influenza A virus will also be displayed.
Collapse
|
43
|
Itoh T, Tanaka T, Barrero RA, Yamasaki C, Fujii Y, Hilton PB, Antonio BA, Aono H, Apweiler R, Bruskiewich R, Bureau T, Burr F, Costa de Oliveira A, Fuks G, Habara T, Haberer G, Han B, Harada E, Hiraki AT, Hirochika H, Hoen D, Hokari H, Hosokawa S, Hsing Y, Ikawa H, Ikeo K, Imanishi T, Ito Y, Jaiswal P, Kanno M, Kawahara Y, Kawamura T, Kawashima H, Khurana JP, Kikuchi S, Komatsu S, Koyanagi KO, Kubooka H, Lieberherr D, Lin YC, Lonsdale D, Matsumoto T, Matsuya A, McCombie WR, Messing J, Miyao A, Mulder N, Nagamura Y, Nam J, Namiki N, Numa H, Nurimoto S, O’Donovan C, Ohyanagi H, Okido T, OOta S, Osato N, Palmer LE, Quetier F, Raghuvanshi S, Saichi N, Sakai H, Sakai Y, Sakata K, Sakurai T, Sato F, Sato Y, Schoof H, Seki M, Shibata M, Shimizu Y, Shinozaki K, Shinso Y, Singh NK, Smith-White B, Takeda JI, Tanino M, Tatusova T, Thongjuea S, Todokoro F, Tsugane M, Tyagi AK, Vanavichit A, Wang A, Wing RA, Yamaguchi K, Yamamoto M, Yamamoto N, Yu Y, Zhang H, Zhao Q, Higo K, Burr B, Gojobori T, Sasaki T. Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana. Genes Dev 2007; 17:175-83. [PMID: 17210932 PMCID: PMC1781349 DOI: 10.1101/gr.5509507] [Citation(s) in RCA: 203] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2006] [Accepted: 10/31/2006] [Indexed: 11/25/2022]
Abstract
We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is approximately 32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene.
Collapse
Affiliation(s)
- Takeshi Itoh
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
| | - Tsuyoshi Tanaka
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka 411-8540, Japan
| | - Roberto A. Barrero
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka 411-8540, Japan
| | - Chisato Yamasaki
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Yasuyuki Fujii
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Phillip B. Hilton
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Baltazar A. Antonio
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
| | - Hideo Aono
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka 411-8540, Japan
| | - Rolf Apweiler
- EMBL Outstation–European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, United Kingdom
| | - Richard Bruskiewich
- Biometrics and Bioinformatics Unit, International Rice Research Institute, DAPO Box 7777, Metro Manila, Philippines
| | - Thomas Bureau
- Department of Biology, McGill University, Montreal, Quebec H3A 1B1, Canada
| | - Frances Burr
- Biology Department, Brookhaven National Laboratory, Upton, New York 11973, USA
| | | | - Galina Fuks
- Waksman Institute of Microbiology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Takuya Habara
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Georg Haberer
- Institute for Bioinformatics, GSF National Research Center for Environment and Health, D-85764 Neuherberg, Germany
| | - Bin Han
- Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 500 Caobao Road, Shanghai 200233, China
| | - Erimi Harada
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Aiko T. Hiraki
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Hirohiko Hirochika
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
| | - Douglas Hoen
- Department of Biology, McGill University, Montreal, Quebec H3A 1B1, Canada
| | - Hiroki Hokari
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Satomi Hosokawa
- Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries, Tsukuba, Ibaraki 305-0854, Japan
| | - Yue Hsing
- Institute of Botany, Academia Sinica, Nankang, Taipei 11529, Taiwan
| | - Hiroshi Ikawa
- Tsukuba Division, Mitsubishi Space Software Co., Ltd., Tsukuba, Ibaraki 305-0032, Japan
| | - Kazuho Ikeo
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka 411-8540, Japan
| | - Tadashi Imanishi
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Hokkaido 060-0814, Japan
| | - Yukiyo Ito
- Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries, Tsukuba, Ibaraki 305-0854, Japan
| | - Pankaj Jaiswal
- Department of Plant Breeding, Cornell University, Ithaca, New York 14853, USA
| | - Masako Kanno
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Yoshihiro Kawahara
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Department of Biological Sciences, Tokyo Metropolitan University, Hachioji-shi, Tokyo 192-0397, Japan
| | - Toshiyuki Kawamura
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Hiroaki Kawashima
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Jitendra P. Khurana
- Department of Plant Molecular Biology, University of Delhi South Campus, New Delhi 110021, India
| | - Shoshi Kikuchi
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
| | - Setsuko Komatsu
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
- National Institute of Crop Science, National Agriculture and Food Research Organization, Tsukuba, Ibaraki 305-8518, Japan
| | - Kanako O. Koyanagi
- Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Hokkaido 060-0814, Japan
| | - Hiromi Kubooka
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Damien Lieberherr
- SWISS-PROT Group, Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland
| | - Yao-Cheng Lin
- Institute of Botany, Academia Sinica, Nankang, Taipei 11529, Taiwan
| | - David Lonsdale
- EMBL Outstation–European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, United Kingdom
| | - Takashi Matsumoto
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
| | - Akihiro Matsuya
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | | | - Joachim Messing
- Waksman Institute of Microbiology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Akio Miyao
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
| | - Nicola Mulder
- EMBL Outstation–European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, United Kingdom
| | - Yoshiaki Nagamura
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
| | - Jongmin Nam
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
- Institute of Molecular Evolutionary Genetics and Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Nobukazu Namiki
- Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries, Tsukuba, Ibaraki 305-0854, Japan
| | - Hisataka Numa
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
| | - Shin Nurimoto
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Claire O’Donovan
- EMBL Outstation–European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, United Kingdom
| | - Hajime Ohyanagi
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka 411-8540, Japan
- Tsukuba Division, Mitsubishi Space Software Co., Ltd., Tsukuba, Ibaraki 305-0032, Japan
| | - Toshihisa Okido
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka 411-8540, Japan
| | - Satoshi OOta
- RIKEN BioResource Center, RIKEN Tsukuba Institute, Tsukuba, Ibaraki 305-0074, Japan
| | - Naoki Osato
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka 411-8540, Japan
| | - Lance E. Palmer
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11723, USA
- Department of Molecular Genetics and Microbiology, and Center for Infectious Diseases, The State University of New York at Stony Brook, Stony Brook, New York 11794, USA
| | | | - Saurabh Raghuvanshi
- Department of Plant Molecular Biology, University of Delhi South Campus, New Delhi 110021, India
| | - Naomi Saichi
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Hiroaki Sakai
- Division of Genome and Biodiversity Research, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Yasumichi Sakai
- Tsukuba Division, Mitsubishi Space Software Co., Ltd., Tsukuba, Ibaraki 305-0032, Japan
| | - Katsumi Sakata
- Tsukuba Division, Mitsubishi Space Software Co., Ltd., Tsukuba, Ibaraki 305-0032, Japan
| | - Tetsuya Sakurai
- Metabolomics Research Group, RIKEN Plant Science Center, Yokohama, Kanagawa 230-0045, Japan
| | - Fumihiko Sato
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Yoshiharu Sato
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Heiko Schoof
- Institute for Bioinformatics, GSF National Research Center for Environment and Health, D-85764 Neuherberg, Germany
- Technische Universität München, Genome Oriented Bioinformatics, D-85354 Freising-Weihenstephan, Germany
- Plant Computational Biology, Max-Planck-Institute for Plant Breeding Research, D 50829 Cologne, Germany
| | - Motoaki Seki
- Plant Functional Genomics Research Group, RIKEN Plant Science Center, Yokohama, Kanagawa 230-0045, Japan
| | - Michie Shibata
- Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries, Tsukuba, Ibaraki 305-0854, Japan
| | - Yuji Shimizu
- Tsukuba Division, Mitsubishi Space Software Co., Ltd., Tsukuba, Ibaraki 305-0032, Japan
| | - Kazuo Shinozaki
- RIKEN Plant Science Center, Yokohama, Kanagawa 230-0045, Japan
| | - Yuji Shinso
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Nagendra K. Singh
- National Research Centre on Plant Biotechnology, Indian Agricultural Research Institute, New Delhi 110012, India
| | - Brian Smith-White
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Jun-ichi Takeda
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Motohiko Tanino
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Tatiana Tatusova
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Supat Thongjuea
- Rice Gene Discovery Unit, Kasetsart University, Nakorn Pathom 73140, Thailand
| | - Fusano Todokoro
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Mika Tsugane
- Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries, Tsukuba, Ibaraki 305-0854, Japan
| | - Akhilesh K. Tyagi
- Department of Plant Molecular Biology, University of Delhi South Campus, New Delhi 110021, India
| | - Apichart Vanavichit
- Rice Gene Discovery Unit, Kasetsart University, Nakorn Pathom 73140, Thailand
| | - Aihui Wang
- The Institute for Genomic Research, Rockville, Maryland 20850, USA
| | - Rod A. Wing
- Arizona Genomics Institute, The University of Arizona, Tucson, Arizona 85721, USA
| | - Kaori Yamaguchi
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Mayu Yamamoto
- Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries, Tsukuba, Ibaraki 305-0854, Japan
| | - Naoyuki Yamamoto
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Yeisoo Yu
- Arizona Genomics Institute, The University of Arizona, Tucson, Arizona 85721, USA
| | - Hao Zhang
- Japan Biological Information Research Center, Japan Biological Informatics Consortium, Koto-ku, Tokyo 135-0064, Japan
| | - Qiang Zhao
- Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 500 Caobao Road, Shanghai 200233, China
| | - Kenichi Higo
- National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
- Bio-Oriented Technology Research Advancement Institution, Minato-ku, Tokyo 105-0001, Japan
| | - Benjamin Burr
- Biology Department, Brookhaven National Laboratory, Upton, New York 11973, USA
| | - Takashi Gojobori
- Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Koto-ku, Tokyo 135-0064, Japan
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka 411-8540, Japan
| | - Takuji Sasaki
- National Institute of Agrobiological Sciences, Tsukuba, Ibaraki 305-8602, Japan
| | | |
Collapse
|
44
|
Pruitt KD, Tatusova T, Maglott DR. NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res 2007. [PMID: 17130148 DOI: 10.1093/nar/gkl84] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/10/2023] Open
Abstract
NCBI's reference sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. The database includes 3774 organisms spanning prokaryotes, eukaryotes and viruses, and has records for 2,879,860 proteins (RefSeq release 19). RefSeq records integrate information from multiple sources, when additional data are available from those sources and therefore represent a current description of the sequence and its features. Annotations include coding regions, conserved domains, tRNAs, sequence tagged sites (STS), variation, references, gene and protein product names, and database cross-references. Sequence is reviewed and features are added using a combined approach of collaboration and other input from the scientific community, prediction, propagation from GenBank and curation by NCBI staff. The format of all RefSeq records is validated, and an increasing number of tests are being applied to evaluate the quality of sequence and annotation, especially in the context of complete genomic sequence.
Collapse
Affiliation(s)
- Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Rm 6An.12J, 45 Center Drive, Bethesda, MD 20892-6510, USA.
| | | | | |
Collapse
|
45
|
Maglott D, Ostell J, Pruitt KD, Tatusova T. Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res 2007; 35:D26-31. [PMID: 17148475 PMCID: PMC1761442 DOI: 10.1093/nar/gkl993] [Citation(s) in RCA: 410] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2006] [Revised: 10/27/2006] [Accepted: 10/30/2006] [Indexed: 11/28/2022] Open
Abstract
Entrez Gene (www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene) is NCBI's database for gene-specific information. Entrez Gene includes records from genomes that have been completely sequenced, that have an active research community to contribute gene-specific information or that are scheduled for intense sequence analysis. The content of Entrez Gene represents the result of both curation and automated integration of data from NCBI's Reference Sequence project (RefSeq), from collaborating model organism databases and from other databases within NCBI. Records in Entrez Gene are assigned unique, stable and tracked integers as identifiers. The content (nomenclature, map location, gene products and their attributes, markers, phenotypes and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases) is provided via interactive browsing through NCBI's Entrez system, via NCBI's Entrez programing utilities (E-Utilities), and for bulk transfer by ftp.
Collapse
Affiliation(s)
- Donna Maglott
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20892-6510, USA.
| | | | | | | |
Collapse
|
46
|
Pruitt KD, Tatusova T, Maglott DR. NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res 2007; 35:D61-5. [PMID: 17130148 PMCID: PMC1716718 DOI: 10.1093/nar/gkl842] [Citation(s) in RCA: 2075] [Impact Index Per Article: 122.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2006] [Revised: 10/06/2006] [Accepted: 10/06/2006] [Indexed: 11/19/2022] Open
Abstract
NCBI's reference sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. The database includes 3774 organisms spanning prokaryotes, eukaryotes and viruses, and has records for 2,879,860 proteins (RefSeq release 19). RefSeq records integrate information from multiple sources, when additional data are available from those sources and therefore represent a current description of the sequence and its features. Annotations include coding regions, conserved domains, tRNAs, sequence tagged sites (STS), variation, references, gene and protein product names, and database cross-references. Sequence is reviewed and features are added using a combined approach of collaboration and other input from the scientific community, prediction, propagation from GenBank and curation by NCBI staff. The format of all RefSeq records is validated, and an increasing number of tests are being applied to evaluate the quality of sequence and annotation, especially in the context of complete genomic sequence.
Collapse
Affiliation(s)
- Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Rm 6An.12J, 45 Center Drive, Bethesda, MD 20892-6510, USA.
| | | | | |
Collapse
|
47
|
Ghedin E, Sengamalay NA, Shumway M, Zaborsky J, Feldblyum T, Subbu V, Spiro DJ, Sitz J, Koo H, Bolotov P, Dernovoy D, Tatusova T, Bao Y, St George K, Taylor J, Lipman DJ, Fraser CM, Taubenberger JK, Salzberg SL. Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution. Nature 2005; 437:1162-6. [PMID: 16208317 DOI: 10.1038/nature04239] [Citation(s) in RCA: 321] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2005] [Accepted: 09/16/2005] [Indexed: 01/15/2023]
Abstract
Influenza viruses are remarkably adept at surviving in the human population over a long timescale. The human influenza A virus continues to thrive even among populations with widespread access to vaccines, and continues to be a major cause of morbidity and mortality. The virus mutates from year to year, making the existing vaccines ineffective on a regular basis, and requiring that new strains be chosen for a new vaccine. Less-frequent major changes, known as antigenic shift, create new strains against which the human population has little protective immunity, thereby causing worldwide pandemics. The most recent pandemics include the 1918 'Spanish' flu, one of the most deadly outbreaks in recorded history, which killed 30-50 million people worldwide, the 1957 'Asian' flu, and the 1968 'Hong Kong' flu. Motivated by the need for a better understanding of influenza evolution, we have developed flexible protocols that make it possible to apply large-scale sequencing techniques to the highly variable influenza genome. Here we report the results of sequencing 209 complete genomes of the human influenza A virus, encompassing a total of 2,821,103 nucleotides. In addition to increasing markedly the number of publicly available, complete influenza virus genomes, we have discovered several anomalies in these first 209 genomes that demonstrate the dynamic nature of influenza transmission and evolution. This new, large-scale sequencing effort promises to provide a more comprehensive picture of the evolution of influenza viruses and of their pattern of transmission through human and animal populations. All data from this project are being deposited, without delay, in public archives.
Collapse
MESH Headings
- Animals
- Evolution, Molecular
- Genome, Viral
- Hemagglutinin Glycoproteins, Influenza Virus/genetics
- Hemagglutinin Glycoproteins, Influenza Virus/immunology
- History, 20th Century
- History, 21st Century
- Humans
- Influenza A virus/classification
- Influenza A virus/genetics
- Influenza A virus/isolation & purification
- Influenza A virus/physiology
- Influenza Vaccines/history
- Influenza Vaccines/immunology
- Influenza, Human/epidemiology
- Influenza, Human/transmission
- Influenza, Human/veterinary
- Influenza, Human/virology
- Mutagenesis/genetics
- Mutation/genetics
- Neuraminidase/genetics
- Neuraminidase/metabolism
- New York/epidemiology
- Phylogeny
- Public Sector
- Reassortant Viruses/genetics
- Sequence Analysis
- Time Factors
- Virus Replication
Collapse
Affiliation(s)
- Elodie Ghedin
- The Institute for Genomic Research, 9712 Medical Center Dr., Rockville, Maryland 20850, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Wheeler DL, Smith-White B, Chetvernin V, Resenchuk S, Dombrowski SM, Pechous SW, Tatusova T, Ostell J. Plant genome resources at the national center for biotechnology information. Plant Physiol 2005; 138:1280-8. [PMID: 16010002 PMCID: PMC1176401 DOI: 10.1104/pp.104.058842] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The National Center for Biotechnology Information (NCBI) integrates data from more than 20 biological databases through a flexible search and retrieval system called Entrez. A core Entrez database, Entrez Nucleotide, includes GenBank and is tightly linked to the NCBI Taxonomy database, the Entrez Protein database, and the scientific literature in PubMed. A suite of more specialized databases for genomes, genes, gene families, gene expression, gene variation, and protein domains dovetails with the core databases to make Entrez a powerful system for genomic research. Linked to the full range of Entrez databases is the NCBI Map Viewer, which displays aligned genetic, physical, and sequence maps for eukaryotic genomes including those of many plants. A specialized plant query page allow maps from all plant genomes covered by the Map Viewer to be searched in tandem to produce a display of aligned maps from several species. PlantBLAST searches against the sequences shown in the Map Viewer allow BLAST alignments to be viewed within a genomic context. In addition, precomputed sequence similarities, such as those for proteins offered by BLAST Link, enable fluid navigation from unannotated to annotated sequences, quickening the pace of discovery. NCBI Web pages for plants, such as Plant Genome Central, complete the system by providing centralized access to NCBI's genomic resources as well as links to organism-specific Web pages beyond NCBI.
Collapse
Affiliation(s)
- David L Wheeler
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland 20894, USA.
| | | | | | | | | | | | | | | |
Collapse
|
49
|
Pruitt KD, Tatusova T, Maglott DR. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res 2005; 33:D501-4. [PMID: 15608248 PMCID: PMC539979 DOI: 10.1093/nar/gki025] [Citation(s) in RCA: 1183] [Impact Index Per Article: 62.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) provides a non-redundant collection of sequences representing genomic data, transcripts and proteins. Although the goal is to provide a comprehensive dataset representing the complete sequence information for any given species, the database pragmatically includes sequence data that are currently publicly available in the archival databases. The database incorporates data from over 2400 organisms and includes over one million proteins representing significant taxonomic diversity spanning prokaryotes, eukaryotes and viruses. Nucleotide and protein sequences are explicitly linked, and the sequences are linked to other resources including the NCBI Map Viewer and Gene. Sequences are annotated to include coding regions, conserved domains, variation, references, names, database cross-references, and other features using a combined approach of collaboration and other input from the scientific community, automated annotation, propagation from GenBank and curation by NCBI staff.
Collapse
Affiliation(s)
- Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Rm 6An.12J, 45 Center Drive, Bethesda, MD 20892-6510, USA.
| | | | | |
Collapse
|
50
|
Abstract
Entrez Gene (www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene) is NCBI's database for gene-specific information. It does not include all known or predicted genes; instead Entrez Gene focuses on the genomes that have been completely sequenced, that have an active research community to contribute gene-specific information, or that are scheduled for intense sequence analysis. The content of Entrez Gene represents the result of curation and automated integration of data from NCBI's Reference Sequence project (RefSeq), from collaborating model organism databases, and from many other databases available from NCBI. Records are assigned unique, stable and tracked integers as identifiers. The content (nomenclature, map location, gene products and their attributes, markers, phenotypes, and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases) is updated as new information becomes available. Entrez Gene is a step forward from NCBI's LocusLink, with both a major increase in taxonomic scope and improved access through the many tools associated with NCBI Entrez.
Collapse
Affiliation(s)
- Donna Maglott
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Room 5AS.13B, 45 Center Drive, Bethesda, MD 20892-6510, USA.
| | | | | | | |
Collapse
|