1
|
Yang P, Wang D, Guo W, Kang L. FAWMine: An integrated database and analysis platform for fall armyworm genomics. INSECT SCIENCE 2021; 28:590-601. [PMID: 33511767 DOI: 10.1111/1744-7917.12903] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/14/2020] [Accepted: 12/31/2020] [Indexed: 06/12/2023]
Abstract
Fall armyworm (Spodoptera frugiperda), a native insect species in the Americas, is rapidly becoming a major agricultural pest worldwide and is causing great damage to corn, rice, soybeans, and other crops. To control this pest, scientists have accumulated a great deal of high-throughput data of fall armyworm, and nine versions of its genomes and transcriptomes have been published. However, easily accessing and performing integrated analysis of these omics data sets is challenging. Here, we developed the Fall Armyworm Genome Database (FAWMine, http://159.226.67.243:8080/fawmine/) to maintain genome sequences, structural and functional annotations, transcriptomes, co-expression, protein interactions, homologs, pathways, and single-nucleotide variations. FAWMine provides a powerful framework that helps users to perform flexible and customized searching, present integrated data sets using diverse visualization methods, output results tables in a range of file formats, analyze candidate gene lists using multiple widgets, and query data available in other InterMine systems. Additionally, stand-alone JBrowse and BLAST services are also established, allowing the users to visualize RNA-Seq data and search genome and annotated gene sequences. Altogether, FAWMine is a useful tool for querying, visualizing, and analyzing compiled data sets rapidly and efficiently. FAWMine will be continually updated to function as a community resource for fall armyworm genomics and pest control research.
Collapse
Affiliation(s)
- Pengcheng Yang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, 100101, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Depin Wang
- Sino-Danish College, University of Chinese Academy of Sciences, Beijing, 101408, China
| | - Wei Guo
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Le Kang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, 100101, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, 100049, China
| |
Collapse
|
2
|
An Overview of In Vivo and In Vitro Models for Autosomal Dominant Polycystic Kidney Disease: A Journey from 3D-Cysts to Mini-Pigs. Int J Mol Sci 2020; 21:ijms21124537. [PMID: 32630605 PMCID: PMC7352572 DOI: 10.3390/ijms21124537] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 06/17/2020] [Accepted: 06/18/2020] [Indexed: 12/24/2022] Open
Abstract
Autosomal dominant polycystic kidney disease (ADPKD) is the most common inheritable cause of end stage renal disease and, as of today, only a single moderately effective treatment is available for patients. Even though ADPKD research has made huge progress over the last decades, the precise disease mechanisms remain elusive. However, a wide variety of cellular and animal models have been developed to decipher the pathophysiological mechanisms and related pathways underlying the disease. As none of these models perfectly recapitulates the complexity of the human disease, the aim of this review is to give an overview of the main tools currently available to ADPKD researchers, as well as their main advantages and limitations.
Collapse
|
3
|
Lee RYN, Howe KL, Harris TW, Arnaboldi V, Cain S, Chan J, Chen WJ, Davis P, Gao S, Grove C, Kishore R, Muller HM, Nakamura C, Nuin P, Paulini M, Raciti D, Rodgers F, Russell M, Schindelman G, Tuli MA, Van Auken K, Wang Q, Williams G, Wright A, Yook K, Berriman M, Kersey P, Schedl T, Stein L, Sternberg PW. WormBase 2017: molting into a new stage. Nucleic Acids Res 2019; 46:D869-D874. [PMID: 29069413 PMCID: PMC5753391 DOI: 10.1093/nar/gkx998] [Citation(s) in RCA: 134] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 10/11/2017] [Indexed: 11/13/2022] Open
Abstract
WormBase (http://www.wormbase.org) is an important knowledge resource for biomedical researchers worldwide. To accommodate the ever increasing amount and complexity of research data, WormBase continues to advance its practices on data acquisition, curation and retrieval to most effectively deliver comprehensive knowledge about Caenorhabditis elegans, and genomic information about other nematodes and parasitic flatworms. Recent notable enhancements include user-directed submission of data, such as micropublication; genomic data curation and presentation, including additional genomes and JBrowse, respectively; new query tools, such as SimpleMine, Gene Enrichment Analysis; new data displays, such as the Person Lineage browser and the Summary of Ontology-based Annotations. Anticipating more rapid data growth ahead, WormBase continues the process of migrating to a cutting-edge database technology to achieve better stability, scalability, reproducibility and a faster response time. To better serve the broader research community, WormBase, with five other Model Organism Databases and The Gene Ontology project, have begun to collaborate formally as the Alliance of Genome Resources.
Collapse
Affiliation(s)
- Raymond Y N Lee
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Kevin L Howe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Todd W Harris
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Valerio Arnaboldi
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Scott Cain
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Juancarlos Chan
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Wen J Chen
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Paul Davis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sibyl Gao
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Christian Grove
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Ranjana Kishore
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Hans-Michael Muller
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Cecilia Nakamura
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Paulo Nuin
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Michael Paulini
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniela Raciti
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Faye Rodgers
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Matt Russell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gary Schindelman
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Mary Ann Tuli
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Kimberly Van Auken
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Qinghua Wang
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Gary Williams
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Adam Wright
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Karen Yook
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| | - Matthew Berriman
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Paul Kersey
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tim Schedl
- Department of Genetics, Washington University School of Medicine, St Louis, MO 63110, USA
| | - Lincoln Stein
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Paul W Sternberg
- Division of Biology and Biological Engineering 156-29, California Institute of Technology, Pasadena, CA 91125, USA
| |
Collapse
|
4
|
Expression of the Shrimp wap gene in Drosophila elicits defense responses and protease inhibitory activity. Sci Rep 2018; 8:8779. [PMID: 29884877 PMCID: PMC5993750 DOI: 10.1038/s41598-018-26466-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2018] [Accepted: 05/10/2018] [Indexed: 11/13/2022] Open
Abstract
The wap gene encodes a single whey acidic protein (WAP) domain-containing peptide from Chinese white shrimp (Fenneropenaeus chinensis), which shows broad-spectrum antimicrobial activities and proteinase inhibitory activities in vitro. To explore the medical applications of the WAP peptide, a wap gene transgenic Drosophila melanogaster was constructed. In wap-expressing flies, high expression levels of wap gene (>100 times) were achieved, in contrast to those of control flies, by qRT-PCR analysis. The wap gene expression was associated with increased resistance to microbial infection and decreased bacterial numbers in the flies. In addition, the WAP protein extract from wap-expressing flies, compared with control protein extract from control flies, showed improved antimicrobial activities against broad Gram-positive and Gram-negative bacteria, including the clinical drug resistant bacterium of methicillin-resistant S. aureus (MRSA), improved protease inhibitor activities against crude proteinases and commercial proteinases, including elastase, subtilis proteinase A, and proteinase K in vitro, and improved growth rate and microbial resistance, as well as wound-healing in loach and mouse models. These results suggest that wap-expressing flies could be used as a food additive in aquaculture to prevent infections and a potential antibacterial for fighting drug-resistant bacteria.
Collapse
|
5
|
Harper L, Campbell J, Cannon EKS, Jung S, Poelchau M, Walls R, Andorf C, Arnaud E, Berardini TZ, Birkett C, Cannon S, Carson J, Condon B, Cooper L, Dunn N, Elsik CG, Farmer A, Ficklin SP, Grant D, Grau E, Herndon N, Hu ZL, Humann J, Jaiswal P, Jonquet C, Laporte MA, Larmande P, Lazo G, McCarthy F, Menda N, Mungall CJ, Munoz-Torres MC, Naithani S, Nelson R, Nesdill D, Park C, Reecy J, Reiser L, Sanderson LA, Sen TZ, Staton M, Subramaniam S, Tello-Ruiz MK, Unda V, Unni D, Wang L, Ware D, Wegrzyn J, Williams J, Woodhouse M, Yu J, Main D. AgBioData consortium recommendations for sustainable genomics and genetics databases for agriculture. Database (Oxford) 2018; 2018:5096675. [PMID: 30239679 PMCID: PMC6146126 DOI: 10.1093/database/bay088] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2018] [Revised: 07/19/2018] [Accepted: 07/30/2018] [Indexed: 01/07/2023]
Abstract
The future of agricultural research depends on data. The sheer volume of agricultural biological data being produced today makes excellent data management essential. Governmental agencies, publishers and science funders require data management plans for publicly funded research. Furthermore, the value of data increases exponentially when they are properly stored, described, integrated and shared, so that they can be easily utilized in future analyses. AgBioData (https://www.agbiodata.org) is a consortium of people working at agricultural biological databases, data archives and knowledgbases who strive to identify common issues in database development, curation and management, with the goal of creating database products that are more Findable, Accessible, Interoperable and Reusable. We strive to promote authentic, detailed, accurate and explicit communication between all parties involved in scientific data. As a step toward this goal, we present the current state of biocuration, ontologies, metadata and persistence, database platforms, programmatic (machine) access to data, communication and sustainability with regard to data curation. Each section describes challenges and opportunities for these topics, along with recommendations and best practices.
Collapse
Affiliation(s)
- Lisa Harper
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
| | | | - Ethalinda K S Cannon
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
- Computer Science, Iowa State University, Ames, IA, USA
| | - Sook Jung
- Horticulture, Washington State University, Pullman, WA, USA
| | - Monica Poelchau
- National Agricultural Library, USDA Agricultural Research Service, Beltsville, MD, USA
| | | | - Carson Andorf
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
- Computer Science, Iowa State University, Ames, IA, USA
| | - Elizabeth Arnaud
- Bioversity International, Informatics Unit, Conservation and Availability Programme, Parc Scientifique Agropolis II, Montpellier, France
| | - Tanya Z Berardini
- The Arabidopsis Information Resource, Phoenix Bioinformatics, Fremont, CA, USA
| | | | - Steve Cannon
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
| | - James Carson
- Texas Advanced Computing Center, The University of Texas at Austin, Austin, TX, USA
| | - Bradford Condon
- Entomology and Plant Pathology, University of Tennessee Knoxville, Knoxville, TN, USA
| | - Laurel Cooper
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Nathan Dunn
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Christine G Elsik
- Division of Animal Sciences and Division of Plant Sciences, University of Missouri, Columbia, MO, USA
| | - Andrew Farmer
- National Center for Genome Resources, Santa Fe, NM, USA
| | | | - David Grant
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
| | - Emily Grau
- National Center for Genome Resources, Santa Fe, NM, USA
| | - Nic Herndon
- Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Zhi-Liang Hu
- Animal Science, Iowa State University, Ames, USA
| | - Jodi Humann
- Horticulture, Washington State University, Pullman, WA, USA
| | - Pankaj Jaiswal
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Clement Jonquet
- Laboratory of Informatics, Robotics, Microelectronics of Montpellier, University of Montpellier & CNRS, Montpellier, France
| | - Marie-Angélique Laporte
- Bioversity International, Informatics Unit, Conservation and Availability Programme, Parc Scientifique Agropolis II, Montpellier, France
| | | | - Gerard Lazo
- Crop Improvement and Genetics Research Unit, USDA-ARS, Albany, CA, USA
| | - Fiona McCarthy
- School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, AZ, USA
| | | | | | | | - Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Rex Nelson
- Corn Insects and Crop Genetics Research Unit, USDA-ARS, Ames, IA, USA
| | - Daureen Nesdill
- Marriott Library, University of Utah, Salt Lake City, UT, USA
| | - Carissa Park
- Animal Science, Iowa State University, Ames, USA
| | - James Reecy
- Animal Science, Iowa State University, Ames, USA
| | - Leonore Reiser
- The Arabidopsis Information Resource, Phoenix Bioinformatics, Fremont, CA, USA
| | | | - Taner Z Sen
- Crop Improvement and Genetics Research Unit, USDA-ARS, Albany, CA, USA
| | - Margaret Staton
- Entomology and Plant Pathology, University of Tennessee Knoxville, Knoxville, TN, USA
| | | | | | - Victor Unda
- Horticulture, Washington State University, Pullman, WA, USA
| | - Deepak Unni
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Liya Wang
- Plant Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Doreen Ware
- USDA, Plant, Soil and Nutrition Research, Ithaca, NY, USA
- Plant Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jill Wegrzyn
- Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Jason Williams
- Cold Spring Harbor Laboratory, DNA Learning Center, Cold Spring Harbor, NY, USA
| | - Margaret Woodhouse
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - Jing Yu
- Horticulture, Washington State University, Pullman, WA, USA
| | - Doreen Main
- Horticulture, Washington State University, Pullman, WA, USA
| |
Collapse
|
6
|
Van Slyke CE, Bradford YM, Howe DG, Fashena DS, Ramachandran S, Ruzicka L. Using ZFIN: Data Types, Organization, and Retrieval. Methods Mol Biol 2018; 1757:307-347. [PMID: 29761463 PMCID: PMC6319390 DOI: 10.1007/978-1-4939-7737-6_11] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The Zebrafish Model Organism Database (ZFIN; zfin.org) was established in 1994 as the primary genetic and genomic resource for the zebrafish research community. Some of the earliest records in ZFIN were for people and laboratories. Since that time, services and data types provided by ZFIN have grown considerably. Today, ZFIN provides the official nomenclature for zebrafish genes, mutants, and transgenics and curates many data types including gene expression, phenotypes, Gene Ontology, models of human disease, orthology, knockdown reagents, transgenic constructs, and antibodies. Ontologies are used throughout ZFIN to structure these expertly curated data. An integrated genome browser provides genomic context for genes, transgenics, mutants, and knockdown reagents. ZFIN also supports a community wiki where the research community can post new antibody records and research protocols. Data in ZFIN are accessible via web pages, download files, and the ZebrafishMine (zebrafishmine.org), an installation of the InterMine data warehousing software. Searching for data at ZFIN utilizes both parameterized search forms and a single box search for searching or browsing data quickly. This chapter aims to describe the primary ZFIN data and services, and provide insight into how to use and interpret ZFIN searches, data, and web pages.
Collapse
Affiliation(s)
- Ceri E Van Slyke
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA.
| | - Yvonne M Bradford
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA
| | - Douglas G Howe
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA
| | - David S Fashena
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA
| | | | - Leyla Ruzicka
- The Zebrafish Information Network, University of Oregon, Eugene, OR, USA
| |
Collapse
|
7
|
Cheng L, Zhang S, Hu Y. BLAT2DOLite: An Online System for Identifying Significant Relationships between Genetic Sequences and Diseases. PLoS One 2016; 11:e0157274. [PMID: 27315278 PMCID: PMC4912091 DOI: 10.1371/journal.pone.0157274] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2016] [Accepted: 05/26/2016] [Indexed: 11/18/2022] Open
Abstract
The significantly related diseases of sequences could play an important role in understanding the functions of these sequences. In this paper, we introduced BLAT2DOLite, an online system for annotating human genes and diseases and identifying the significant relationships between sequences and diseases. Currently, BLAT2DOLite integrates Entrez Gene database and Disease Ontology Lite (DOLite), which contain loci of gene and relationships between genes and diseases. It utilizes hypergeometric test to calculate P-values between genes and diseases of DOLite. The system can be accessed from: http://123.59.132.21:8080/BLAT2DOLite. The corresponding web service is described in: http://123.59.132.21:8080/BLAT2DOLite/BLAT2DOLiteIDMappingPort?wsdl.
Collapse
Affiliation(s)
- Liang Cheng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, PR China
- * E-mail: (LC); (YH)
| | - Shuo Zhang
- School of Management, Harbin University of Commerce, Harbin 150028, PR China
| | - Yang Hu
- School of Life Science and Technology, Harbin Institute of Technology, Harbin 150001, PR China
- * E-mail: (LC); (YH)
| |
Collapse
|
8
|
Elsik CG, Tayal A, Diesh CM, Unni DR, Emery ML, Nguyen HN, Hagen DE. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine. Nucleic Acids Res 2015; 44:D793-800. [PMID: 26578564 PMCID: PMC4702858 DOI: 10.1093/nar/gkv1208] [Citation(s) in RCA: 91] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2015] [Accepted: 10/27/2015] [Indexed: 11/13/2022] Open
Abstract
We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search.
Collapse
Affiliation(s)
- Christine G Elsik
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA MU Informatics Institute, University of Missouri, Columbia, MO 65211, USA
| | - Aditi Tayal
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Colin M Diesh
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Deepak R Unni
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Marianne L Emery
- Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Hung N Nguyen
- MU Informatics Institute, University of Missouri, Columbia, MO 65211, USA
| | - Darren E Hagen
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
9
|
Elsik CG, Unni DR, Diesh CM, Tayal A, Emery ML, Nguyen HN, Hagen DE. Bovine Genome Database: new tools for gleaning function from the Bos taurus genome. Nucleic Acids Res 2015; 44:D834-9. [PMID: 26481361 PMCID: PMC4702796 DOI: 10.1093/nar/gkv1077] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2015] [Accepted: 10/06/2015] [Indexed: 02/03/2023] Open
Abstract
We report an update of the Bovine Genome Database (BGD) (http://BovineGenome.org). The goal of BGD is to support bovine genomics research by providing genome annotation and data mining tools. We have developed new genome and annotation browsers using JBrowse and WebApollo for two Bos taurus genome assemblies, the reference genome assembly (UMD3.1.1) and the alternate genome assembly (Btau_4.6.1). Annotation tools have been customized to highlight priority genes for annotation, and to aid annotators in selecting gene evidence tracks from 91 tissue specific RNAseq datasets. We have also developed BovineMine, based on the InterMine data warehousing system, to integrate the bovine genome, annotation, QTL, SNP and expression data with external sources of orthology, gene ontology, gene interaction and pathway information. BovineMine provides powerful query building tools, as well as customized query templates, and allows users to analyze and download genome-wide datasets. With BovineMine, bovine researchers can use orthology to leverage the curated gene pathways of model organisms, such as human, mouse and rat. BovineMine will be especially useful for gene ontology and pathway analyses in conjunction with GWAS and QTL studies.
Collapse
Affiliation(s)
- Christine G Elsik
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA MU Informatics Institute, University of Missouri, Columbia, MO 65211, USA
| | - Deepak R Unni
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Colin M Diesh
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Aditi Tayal
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Marianne L Emery
- Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Hung N Nguyen
- MU Informatics Institute, University of Missouri, Columbia, MO 65211, USA
| | - Darren E Hagen
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
10
|
Lyne R, Sullivan J, Butano D, Contrino S, Heimbach J, Hu F, Kalderimis A, Lyne M, Smith RN, Štěpán R, Balakrishnan R, Binkley G, Harris T, Karra K, Moxon SAT, Motenko H, Neuhauser S, Ruzicka L, Cherry M, Richardson J, Stein L, Westerfield M, Worthey E, Micklem G. Cross-organism analysis using InterMine. Genesis 2015; 53:547-60. [PMID: 26097192 PMCID: PMC4545681 DOI: 10.1002/dvg.22869] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Revised: 06/17/2015] [Accepted: 06/17/2015] [Indexed: 01/01/2023]
Abstract
InterMine is a data integration warehouse and analysis software system developed for large and complex biological data sets. Designed for integrative analysis, it can be accessed through a user-friendly web interface. For bioinformaticians, extensive web services as well as programming interfaces for most common scripting languages support access to all features. The web interface includes a useful identifier look-up system, and both simple and sophisticated search options. Interactive results tables enable exploration, and data can be filtered, summarized, and browsed. A set of graphical analysis tools provide a rich environment for data exploration including statistical enrichment of sets of genes or other entities. InterMine databases have been developed for the major model organisms, budding yeast, nematode worm, fruit fly, zebrafish, mouse, and rat together with a newly developed human database. Here, we describe how this has facilitated interoperation and development of cross-organism analysis tools and reports. InterMine as a data exploration and analysis tool is also described. All the InterMine-based systems described in this article are resources freely available to the scientific community.
Collapse
Affiliation(s)
- Rachel Lyne
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Julie Sullivan
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Daniela Butano
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Sergio Contrino
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Josh Heimbach
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Fengyuan Hu
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Alex Kalderimis
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Mike Lyne
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Richard N. Smith
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Radek Štěpán
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| | - Rama Balakrishnan
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Gail Binkley
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | - Todd Harris
- Ontario Institute for Cancer Research, Toronto, ON, M5G0A3, Canada
| | - Kalpana Karra
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | | | - Howie Motenko
- The Jackson Laboratory, Bar Harbor, Maine, 04609, USA
| | | | | | - Mike Cherry
- Department of Genetics, Stanford University, Stanford, CA 94305-5120, USA
| | | | - Lincoln Stein
- Ontario Institute for Cancer Research, Toronto, ON, M5G0A3, Canada
| | - Monte Westerfield
- ZFIN, University of Oregon, Eugene, OR, 97403, USA
- Institute of Neuroscience, University of Oregon, Eugene, OR, 97403, USA
| | - Elizabeth Worthey
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, 53226, USA
| | - Gos Micklem
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, United Kingdom
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
| |
Collapse
|
11
|
Eppig JT, Richardson JE, Kadin JA, Smith CL, Blake JA, Bult CJ. Mouse Genome Database: From sequence to phenotypes and disease models. Genesis 2015; 53:458-73. [PMID: 26150326 PMCID: PMC4545690 DOI: 10.1002/dvg.22874] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2015] [Revised: 06/30/2015] [Accepted: 07/02/2015] [Indexed: 12/19/2022]
Abstract
The Mouse Genome Database (MGD, www.informatics.jax.org) is the international scientific database for genetic, genomic, and biological data on the laboratory mouse to support the research requirements of the biomedical community. To accomplish this goal, MGD provides broad data coverage, serves as the authoritative standard for mouse nomenclature for genes, mutants, and strains, and curates and integrates many types of data from literature and electronic sources. Among the key data sets MGD supports are: the complete catalog of mouse genes and genome features, comparative homology data for mouse and vertebrate genes, the authoritative set of Gene Ontology (GO) annotations for mouse gene functions, a comprehensive catalog of mouse mutations and their phenotypes, and a curated compendium of mouse models of human diseases. Here, we describe the data acquisition process, specifics about MGD's key data areas, methods to access and query MGD data, and outreach and user help facilities. genesis 53:458–473, 2015. © 2015 The Authors. Genesis Published by Wiley Periodicals, Inc.
Collapse
|
12
|
Bello SM, Smith CL, Eppig JT. Allele, phenotype and disease data at Mouse Genome Informatics: improving access and analysis. Mamm Genome 2015; 26:285-94. [PMID: 26162703 PMCID: PMC4534497 DOI: 10.1007/s00335-015-9582-y] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Accepted: 06/23/2015] [Indexed: 11/16/2022]
Abstract
A core part of the Mouse Genome Informatics (MGI) resource is the collection of mouse mutations and the annotation phenotypes and diseases displayed by mice carrying these mutations. These data are integrated with the rest of data in MGI and exported to numerous other resources. The use of mouse phenotype data to drive translational research into human disease has expanded rapidly with the improvements in sequencing technology. MGI has implemented many improvements in allele and phenotype data annotation, search, and display to facilitate access to these data through multiple avenues. For example, the description of alleles has been modified to include more detailed categories of allele attributes. This allows improved discrimination between mutation types. Further, connections have been created between mutations involving multiple genes and each of the genes overlapping the mutation. This allows users to readily find all mutations affecting a gene and see all genes affected by a mutation. In a similar manner, the genes expressed by transgenic or knock-in alleles are now connected to these alleles. The advanced search forms and public reports have been updated to take advantage of these improvements. These search forms and reports are used by an expanding number of researchers to identify novel human disease genes and mouse models of human disease.
Collapse
Affiliation(s)
- Susan M Bello
- Mouse Genome Informatics, The Jackson Laboratory, Bar Harbor, ME, 04609, USA,
| | | | | |
Collapse
|
13
|
Ruzicka L, Bradford YM, Frazer K, Howe DG, Paddock H, Ramachandran S, Singer A, Toro S, Van Slyke CE, Eagle AE, Fashena D, Kalita P, Knight J, Mani P, Martin R, Moxon SAT, Pich C, Schaper K, Shao X, Westerfield M. ZFIN, The zebrafish model organism database: Updates and new directions. Genesis 2015; 53:498-509. [PMID: 26097180 DOI: 10.1002/dvg.22868] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2015] [Revised: 06/16/2015] [Accepted: 06/17/2015] [Indexed: 12/19/2022]
Abstract
The Zebrafish Model Organism Database (ZFIN; http://zfin.org) is the central resource for genetic and genomic data from zebrafish (Danio rerio) research. ZFIN staff curate detailed information about genes, mutants, genotypes, reporter lines, sequences, constructs, antibodies, knockdown reagents, expression patterns, phenotypes, gene product function, and orthology from publications. Researchers can submit mutant, transgenic, expression, and phenotype data directly to ZFIN and use the ZFIN Community Wiki to share antibody and protocol information. Data can be accessed through topic-specific searches, a new site-wide search, and the data-mining resource ZebrafishMine (http://zebrafishmine.org). Data download and web service options are also available. ZFIN collaborates with major bioinformatics organizations to verify and integrate genomic sequence data, provide nomenclature support, establish reciprocal links, and participate in the development of standardized structured vocabularies (ontologies) used for data annotation and searching. ZFIN-curated gene, function, expression, and phenotype data are available for comparative exploration at several multi-species resources. The use of zebrafish as a model for human disease is increasing. ZFIN is supporting this growing area with three major projects: adding easy access to computed orthology data from gene pages, curating details of the gene expression pattern changes in mutant fish, and curating zebrafish models of human diseases.
Collapse
Affiliation(s)
| | | | - Ken Frazer
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | | | | | | | - Amy Singer
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | | | | | | | | | | | | | - Prita Mani
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | - Ryan Martin
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | | | | | | | - Xiang Shao
- ZFIN, 5291 University of Oregon, Eugene, Oregon
| | | |
Collapse
|
14
|
Rhee DB, Croken MM, Shieh KR, Sullivan J, Micklem G, Kim K, Golden A. toxoMine: an integrated omics data warehouse for Toxoplasma gondii systems biology research. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bav066. [PMID: 26130662 PMCID: PMC4485433 DOI: 10.1093/database/bav066] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Accepted: 06/09/2015] [Indexed: 01/09/2023]
Abstract
Toxoplasma gondii (T. gondii) is an obligate intracellular parasite that must monitor for changes in the host environment and respond accordingly; however, it is still not fully known which genetic or epigenetic factors are involved in regulating virulence traits of T. gondii. There are on-going efforts to elucidate the mechanisms regulating the stage transition process via the application of high-throughput epigenomics, genomics and proteomics techniques. Given the range of experimental conditions and the typical yield from such high-throughput techniques, a new challenge arises: how to effectively collect, organize and disseminate the generated data for subsequent data analysis. Here, we describe toxoMine, which provides a powerful interface to support sophisticated integrative exploration of high-throughput experimental data and metadata, providing researchers with a more tractable means toward understanding how genetic and/or epigenetic factors play a coordinated role in determining pathogenicity of T. gondii. As a data warehouse, toxoMine allows integration of high-throughput data sets with public T. gondii data. toxoMine is also able to execute complex queries involving multiple data sets with straightforward user interaction. Furthermore, toxoMine allows users to define their own parameters during the search process that gives users near-limitless search and query capabilities. The interoperability feature also allows users to query and examine data available in other InterMine systems, which would effectively augment the search scope beyond what is available to toxoMine. toxoMine complements the major community database ToxoDB by providing a data warehouse that enables more extensive integrative studies for T. gondii. Given all these factors, we believe it will become an indispensable resource to the greater infectious disease research community. Database URL:http://toxomine.org
Collapse
Affiliation(s)
- David B Rhee
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA,
| | - Matthew McKnight Croken
- Department of Microbiology & Immunology, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Kevin R Shieh
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Julie Sullivan
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK and
| | - Gos Micklem
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK and
| | - Kami Kim
- Department of Medicine, Department of Pathology, Department of Microbiology & Immunology, Albert Einstein College of Medicine, Bronx, NY 10461, USA,
| | - Aaron Golden
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA, Department of Mathematical Sciences, Yeshiva University, New York, NY 10033, USA
| |
Collapse
|
15
|
Motenko H, Neuhauser SB, O'Keefe M, Richardson JE. MouseMine: a new data warehouse for MGI. Mamm Genome 2015; 26:325-30. [PMID: 26092688 PMCID: PMC4534495 DOI: 10.1007/s00335-015-9573-z] [Citation(s) in RCA: 101] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2015] [Accepted: 06/01/2015] [Indexed: 11/25/2022]
Abstract
MouseMine (www.mousemine.org) is a new data warehouse for accessing mouse data from Mouse Genome Informatics (MGI). Based on the InterMine software framework, MouseMine supports powerful query, reporting, and analysis capabilities, the ability to save and combine results from different queries, easy integration into larger workflows, and a comprehensive Web Services layer. Through MouseMine, users can access a significant portion of MGI data in new and useful ways. Importantly, MouseMine is also a member of a growing community of online data resources based on InterMine, including those established by other model organism databases. Adopting common interfaces and collaborating on data representation standards are critical to fostering cross-species data analysis. This paper presents a general introduction to MouseMine, presents examples of its use, and discusses the potential for further integration into the MGI interface.
Collapse
Affiliation(s)
- H Motenko
- The Jackson Laboratory, Bar Harbor, ME, 04609, USA
| | | | | | | |
Collapse
|
16
|
Karikari TK, Aleksic J. Neurogenomics: An opportunity to integrate neuroscience, genomics and bioinformatics research in Africa. Appl Transl Genom 2015; 5:3-10. [PMID: 26937352 PMCID: PMC4745356 DOI: 10.1016/j.atg.2015.06.004] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Revised: 05/22/2015] [Accepted: 06/23/2015] [Indexed: 02/02/2023]
Abstract
Modern genomic approaches have made enormous contributions to improving our understanding of the function, development and evolution of the nervous system, and the diversity within and between species. However, most of these research advances have been recorded in countries with advanced scientific resources and funding support systems. On the contrary, little is known about, for example, the possible interplay between different genes, non-coding elements and environmental factors in modulating neurological diseases among populations in low-income countries, including many African countries. The unique ancestry of African populations suggests that improved inclusion of these populations in neuroscience-related genomic studies would significantly help to identify novel factors that might shape the future of neuroscience research and neurological healthcare. This perspective is strongly supported by the recent identification that diseased individuals and their kindred from specific sub-Saharan African populations lack common neurological disease-associated genetic mutations. This indicates that there may be population-specific causes of neurological diseases, necessitating further investigations into the contribution of additional, presently-unknown genomic factors. Here, we discuss how the development of neurogenomics research in Africa would help to elucidate disease-related genomic variants, and also provide a good basis to develop more effective therapies. Furthermore, neurogenomics would harness African scientists' expertise in neuroscience, genomics and bioinformatics to extend our understanding of the neural basis of behaviour, development and evolution.
Collapse
Affiliation(s)
- Thomas K. Karikari
- Neuroscience, School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
- Midlands Integrative Biosciences Training Partnership, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Jelena Aleksic
- Wellcome Trust — Medical Research Council Cambridge Stem Cell Institute, University of Cambridge, Cambridge CB2 1QR, United Kingdom
| |
Collapse
|
17
|
Eppig JT, Blake JA, Bult CJ, Kadin JA, Richardson JE. The Mouse Genome Database (MGD): facilitating mouse as a model for human biology and disease. Nucleic Acids Res 2014; 43:D726-36. [PMID: 25348401 PMCID: PMC4384027 DOI: 10.1093/nar/gku967] [Citation(s) in RCA: 293] [Impact Index Per Article: 26.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
The Mouse Genome Database (MGD, http://www.informatics.jax.org) serves the international biomedical research community as the central resource for integrated genomic, genetic and biological data on the laboratory mouse. To facilitate use of mouse as a model in translational studies, MGD maintains a core of high-quality curated data and integrates experimentally and computationally generated data sets. MGD maintains a unified catalog of genes and genome features, including functional RNAs, QTL and phenotypic loci. MGD curates and provides functional and phenotype annotations for mouse genes using the Gene Ontology and Mammalian Phenotype Ontology. MGD integrates phenotype data and associates mouse genotypes to human diseases, providing critical mouse–human relationships and access to repositories holding mouse models. MGD is the authoritative source of nomenclature for genes, genome features, alleles and strains following guidelines of the International Committee on Standardized Genetic Nomenclature for Mice. A new addition to MGD, the Human–Mouse: Disease Connection, allows users to explore gene–phenotype–disease relationships between human and mouse. MGD has also updated search paradigms for phenotypic allele attributes, incorporated incidental mutation data, added a module for display and exploration of genes and microRNA interactions and adopted the JBrowse genome browser. MGD resources are freely available to the scientific community.
Collapse
Affiliation(s)
- Janan T Eppig
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | - Judith A Blake
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | - Carol J Bult
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | - James A Kadin
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | | | | |
Collapse
|
18
|
Kalderimis A, Lyne R, Butano D, Contrino S, Lyne M, Heimbach J, Hu F, Smith R, Stěpán R, Sullivan J, Micklem G. InterMine: extensive web services for modern biology. Nucleic Acids Res 2014; 42:W468-72. [PMID: 24753429 PMCID: PMC4086141 DOI: 10.1093/nar/gku301] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
InterMine (www.intermine.org) is a biological data warehousing system providing extensive automatically generated and configurable RESTful web services that underpin the web interface and can be re-used in many other applications: to find and filter data; export it in a flexible and structured way; to upload, use, manipulate and analyze lists; to provide services for flexible retrieval of sequence segments, and for other statistical and analysis tools. Here we describe these features and discuss how they can be used separately or in combinations to support integrative and comparative analysis.
Collapse
Affiliation(s)
- Alex Kalderimis
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Rachel Lyne
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Daniela Butano
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Sergio Contrino
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Mike Lyne
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Joshua Heimbach
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Fengyuan Hu
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Richard Smith
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Radek Stěpán
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Julie Sullivan
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Gos Micklem
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK and Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| |
Collapse
|
19
|
Kalderimis A, Stepan R, Sullivan J, Lyne R, Lyne M, Micklem G. BioJS DAGViewer: A reusable JavaScript component for displaying directed graphs. F1000Res 2014; 3:51. [PMID: 24627804 PMCID: PMC3945768 DOI: 10.12688/f1000research.3-51.v1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/10/2014] [Indexed: 11/20/2022] Open
Abstract
Summary: The DAGViewer BioJS component is a reusable JavaScript component made available as part of the BioJS project and intended to be used to display graphs of structured data, with a particular emphasis on Directed Acyclic Graphs (DAGs). It enables users to embed representations of graphs of data, such as ontologies or phylogenetic trees, in hyper-text documents (HTML). This component is generic, since it is capable (given the appropriate configuration) of displaying any kind of data that is organised as a graph. The features of this component which are useful for examining and filtering large and complex graphs are described. Availability:http://github.com/alexkalderimis/dag-viewer-biojs;
http://github.com/biojs/biojs;
http://dx.doi.org/10.5281/zenodo.8303.
Collapse
Affiliation(s)
- Alexis Kalderimis
- Department of Genetics and Cambridge Systems Biology Centre, Cambridge University, Cambridge, CB2 3EH, UK
| | - Radek Stepan
- Department of Genetics and Cambridge Systems Biology Centre, Cambridge University, Cambridge, CB2 3EH, UK
| | - Julie Sullivan
- Department of Genetics and Cambridge Systems Biology Centre, Cambridge University, Cambridge, CB2 3EH, UK
| | - Rachel Lyne
- Department of Genetics and Cambridge Systems Biology Centre, Cambridge University, Cambridge, CB2 3EH, UK
| | - Michael Lyne
- Department of Genetics and Cambridge Systems Biology Centre, Cambridge University, Cambridge, CB2 3EH, UK
| | - Gos Micklem
- Department of Genetics and Cambridge Systems Biology Centre, Cambridge University, Cambridge, CB2 3EH, UK
| |
Collapse
|
20
|
Lyne M, Smith RN, Lyne R, Aleksic J, Hu F, Kalderimis A, Stepan R, Micklem G. metabolicMine: an integrated genomics, genetics and proteomics data warehouse for common metabolic disease research. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2013; 2013:bat060. [PMID: 23935057 PMCID: PMC4438919 DOI: 10.1093/database/bat060] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Common metabolic and endocrine diseases such as diabetes affect millions of people worldwide and have a major health impact, frequently leading to complications and mortality. In a search for better prevention and treatment, there is ongoing research into the underlying molecular and genetic bases of these complex human diseases, as well as into the links with risk factors such as obesity. Although an increasing number of relevant genomic and proteomic data sets have become available, the quantity and diversity of the data make their efficient exploitation challenging. Here, we present metabolicMine, a data warehouse with a specific focus on the genomics, genetics and proteomics of common metabolic diseases. Developed in collaboration with leading UK metabolic disease groups, metabolicMine integrates data sets from a range of experiments and model organisms alongside tools for exploring them. The current version brings together information covering genes, proteins, orthologues, interactions, gene expression, pathways, ontologies, diseases, genome-wide association studies and single nucleotide polymorphisms. Although the emphasis is on human data, key data sets from mouse and rat are included. These are complemented by interoperation with the RatMine rat genomics database, with a corresponding mouse version under development by the Mouse Genome Informatics (MGI) group. The web interface contains a number of features including keyword search, a library of Search Forms, the QueryBuilder and list analysis tools. This provides researchers with many different ways to analyse, view and flexibly export data. Programming interfaces and automatic code generation in several languages are supported, and many of the features of the web interface are available through web services. The combination of diverse data sets integrated with analysis tools and a powerful query system makes metabolicMine a valuable research resource. The web interface makes it accessible to first-time users, whereas the Application Programming Interface (API) and web services provide convenient data access and tools for bioinformaticians. metabolicMine is freely available online at http://www.metabolicmine.org Database URL: http://www.metabolicmine.org.
Collapse
Affiliation(s)
- Mike Lyne
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, UK
| | | | | | | | | | | | | | | |
Collapse
|