Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Gollub J, Ball CA, Binkley G, Demeter J, Finkelstein DB, Hebert JM, Hernandez-Boussard T, Jin H, Kaloper M, Matese JC, Schroeder M, Brown PO, Botstein D, Sherlock G. The Stanford Microarray Database: data access and quality assessment tools. Nucleic Acids Res 2003;31:94-6. [PMID: 12519956 PMCID: PMC165525 DOI: 10.1093/nar/gkg078] [Citation(s) in RCA: 260] [Impact Index Per Article: 12.4] [Received: 08/30/2002] [Revised: 10/11/2002] [Accepted: 10/11/2002] [Indexed: 11/12/2022]

Cited by Other Article(s)

Tanoli Z, Seemab U, Scherer A, Wennerberg K, Tang J, Vähä-Koskela M. Exploration of databases and methods supporting drug repurposing: a comprehensive survey. Brief Bioinform 2021;22:1656-1678. [PMID: 32055842 PMCID: PMC7986597 DOI: 10.1093/bib/bbaa003] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Received: 09/23/2019] [Revised: 12/09/2019] [Indexed: 02/07/2023]

Dwivedi DK, Sahu A, Dighade SJ, Agrawal RK. Design, synthesis, and antimicrobial evaluation of some nifuroxazide analogs against nosocomial infection. J Heterocycl Chem 2020. [DOI: 10.1002/jhet.3891] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Indexed: 11/08/2022]

Farouk R, SayedElahl M. Microarray spot segmentation algorithm based on integro-differential operator. Egyptian Informatics Journal 2019. [DOI: 10.1016/j.eij.2019.04.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/27/2022]

Chun S, Muthu M, Gopal J, Paul D, Kim DH, Gansukh E, Anthonydhason V. The unequivocal preponderance of biocomputation in clinical virology. RSC Adv 2018;8:17334-17345. [PMID: 35539262 PMCID: PMC9080393 DOI: 10.1039/c8ra00888d] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 01/29/2018] [Accepted: 03/14/2018] [Indexed: 11/22/2022]

Analysis of a single Helicobacter pylori strain over a 10-year period in a primate model. Int J Med Microbiol 2015;305:392-403. [PMID: 25804332 DOI: 10.1016/j.ijmm.2015.03.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Received: 09/25/2014] [Revised: 01/30/2015] [Accepted: 03/01/2015] [Indexed: 12/18/2022]

In silico identification of regulatory motifs in co-expressed genes under osmotic stress representing their co-regulation. Plant Gene 2015. [DOI: 10.1016/j.plgene.2015.01.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 11/18/2022]

Wu H, Fujiwara T, Yamamoto Y, Bolleman J, Yamaguchi A. BioBenchmark Toyama 2012: an evaluation of the performance of triple stores on biological data. J Biomed Semantics 2014;5:32. [PMID: 25089180 PMCID: PMC4118313 DOI: 10.1186/2041-1480-5-32] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Received: 05/13/2013] [Accepted: 04/27/2014] [Indexed: 12/21/2022]

Abstract

Background

Biological databases vary enormously in size and data complexity, from small databases that contain a few million Resource Description Framework (RDF) triples to large databases that contain billions of triples. In this paper, we evaluate whether RDF native stores can be used to meet the needs of a biological database provider. Prior evaluations have used synthetic data with a limited database size. For example, the largest BSBM benchmark uses 1 billion synthetic e-commerce knowledge RDF triples on a single node. However, real-world biological data differs considerably from such simple synthetic data, and it is difficult to determine whether synthetic e-commerce data is sufficient to represent biological databases. Therefore, for this evaluation, we used five real data sets from biological databases.

Results

We evaluated five triple stores, 4store, Bigdata, Mulgara, Virtuoso, and OWLIM-SE, with five biological data sets, Cell Cycle Ontology, Allie, PDBj, UniProt, and DDBJ, ranging in size from approximately 10 million to 8 billion triples.

For each database, we loaded all the data into our single node and prepared the database for use in a classical data warehouse scenario. Then, we ran a series of SPARQL queries against each endpoint and recorded the execution time and the accuracy of the query response.

Conclusions

Our paper shows that, with appropriate configuration, Virtuoso and OWLIM-SE can satisfy the basic requirements to load and query biological data sets of up to roughly 8 billion triples on a single node, with simultaneous access by 64 clients.

OWLIM-SE performs best for databases with approximately 11 million triples. For data sets that contain 94 million and 590 million triples, OWLIM-SE and Virtuoso perform best, and neither shows an overwhelming advantage over the other. For data sets over 4 billion triples, Virtuoso works best.

4store performs well on small data sets with limited features when the number of triples is less than 100 million, but our tests show that its scalability is poor. Bigdata demonstrates average performance and is a good open source triple store for middle-sized data sets (500 million triples or so). Mulgara shows some fragility.

Romeo MJ, Espina V, Lowenthal M, Espina BH, Petricoin EF, Liotta LA. CSF proteome: a protein repository for potential biomarker identification. Expert Rev Proteomics 2014;2:57-70. [PMID: 15966853 DOI: 10.1586/14789450.2.1.57] [Citation(s) in RCA: 103] [Impact Index Per Article: 10.3] [Indexed: 11/08/2022]

Kuczenski RS, Aggarwal K, Lee KH. Improved understanding of gene expression regulation using systems biology. Expert Rev Proteomics 2014;2:915-24. [PMID: 16307520 DOI: 10.1586/14789450.2.6.915] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 11/08/2022]

Yu D, Kim M, Xiao G, Hwang TH. Review of biological network data and its applications. Genomics Inform 2013;11:200-10. [PMID: 24465231 PMCID: PMC3897847 DOI: 10.5808/gi.2013.11.4.200] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.9] [Received: 10/15/2013] [Revised: 11/20/2013] [Accepted: 11/21/2013] [Indexed: 12/16/2022]

Cowles KN, Moser TS, Siryaporn A, Nyakudarika N, Dixon W, Turner JJ, Gitai Z. The putative Poc complex controls two distinct Pseudomonas aeruginosa polar motility mechanisms. Mol Microbiol 2013;90:923-38. [PMID: 24102920 DOI: 10.1111/mmi.12403] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Accepted: 09/13/2013] [Indexed: 11/27/2022]

A predicted functional gene network for the plant pathogen Phytophthora infestans as a framework for genomic biology. BMC Genomics 2013;14:483. [PMID: 23865555 PMCID: PMC3734169 DOI: 10.1186/1471-2164-14-483] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Received: 03/11/2013] [Accepted: 07/15/2013] [Indexed: 11/10/2022]

Giannakeas N, Karvelis PS, Exarchos TP, Kalatzis FG, Fotiadis DI. Segmentation of microarray images using pixel classification: comparison with clustering-based methods. Comput Biol Med 2013;43:705-16. [DOI: 10.1016/j.compbiomed.2013.03.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Received: 01/10/2012] [Revised: 07/26/2012] [Accepted: 03/14/2013] [Indexed: 11/16/2022]

Billiau K, Sprenger H, Schudoma C, Walther D, Köhl KI. Data management pipeline for plant phenotyping in a multisite project. Functional Plant Biology 2012;39:948-957. [PMID: 32480844 DOI: 10.1071/fp12009] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 01/13/2012] [Accepted: 06/22/2012] [Indexed: 05/26/2023]

Doherty KM, Pride LD, Lukose J, Snydsman BE, Charles R, Pramanik A, Muller EG, Botstein D, Moore CW. Loss of a 20S proteasome activator in Saccharomyces cerevisiae downregulates genes important for genomic integrity, increases DNA damage, and selectively sensitizes cells to agents with diverse mechanisms of action. G3 (Bethesda, Md.) 2012;2:943-59. [PMID: 22908043 PMCID: PMC3411250 DOI: 10.1534/g3.112.003376] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Received: 04/09/2012] [Accepted: 06/18/2012] [Indexed: 01/23/2023]

Abstract

Cytoprotective functions of a 20S proteasome activator were investigated. Saccharomyces cerevisiae Blm10 and human 20S proteasome activator 200 (PA200) are homologs. Comparative genome-wide analyses of untreated diploid cells lacking Blm10 and growing at steady state at defined growth rates revealed downregulation of numerous genes required for accurate chromosome structure, assembly and repair, and upregulation of a specific subset of genes encoding protein-folding chaperones. Blm10 loss or truncation of the Ubp3/Blm3 deubiquitinating enzyme caused massive chromosomal damage and cell death in homozygous diploids after phleomycin treatments, indicating that Blm10 and Ubp3/Blm3 function to stabilize the genome and protect against cell death. Diploids lacking Blm10 also were sensitized to doxorubicin, hydroxyurea, 5-fluorouracil, rapamycin, hydrogen peroxide, methyl methanesulfonate, and calcofluor. Fluorescently tagged Blm10 localized in nuclei, with enhanced fluorescence after DNA replication. After DNA damage that caused a classic G2/M arrest, fluorescence remained diffuse, with evidence of nuclear fragmentation in some cells. Protective functions of Blm10 did not require the carboxyl-terminal region that makes close contact with 20S proteasomes, indicating that protection does not require this contact or the truncated Blm10 can interact with the proteasome apart from this region. Without its carboxyl-terminus, Blm10(-339aa) localized to nuclei in untreated, nonproliferating (G0) cells, but not during G1, S, G2, and M. The results indicate Blm10 functions in protective mechanisms that include the machinery that assures proper assembly of chromosomes. These essential guardian functions have implications for ubiquitin-independent targeting in anticancer therapy. Targeting Blm10/PA200 together with one or more of the upregulated chaperones or a conventional treatment could be efficacious.

Giannakeas N, Fotiadis DI. Image Processing and Machine Learning Techniques for the Segmentation of cDNA Microarray Images. Mach Learn 2012. [DOI: 10.4018/978-1-60960-818-7.ch406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/08/2022]

Frequent Pattern Discovery in Multiple Biological Networks: Patterns and Algorithms. Statistics in Biosciences 2011. [DOI: 10.1007/s12561-011-9047-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 10/14/2022]

Sinha AU, Merrill E, Armstrong SA, Clark TW, Das S. eXframe: reusable framework for storage, analysis and visualization of genomics experiments. BMC Bioinformatics 2011;12:452. [PMID: 22103807 PMCID: PMC3235155 DOI: 10.1186/1471-2105-12-452] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 07/16/2011] [Accepted: 11/21/2011] [Indexed: 11/19/2022]

Abstract

Background

Genome-wide experiments are routinely conducted to measure gene expression, DNA-protein interactions and epigenetic status. Structured metadata for these experiments is imperative for a complete understanding of experimental conditions, to enable consistent data processing and to allow retrieval, comparison, and integration of experimental results. Even though several repositories have been developed for genomics data, only a few provide annotation of samples and assays using controlled vocabularies. Moreover, many of them are tailored for a single type of technology or measurement and do not support the integration of multiple data types.

Results

We have developed eXframe, a reusable web-based framework for genomics experiments that provides (1) the ability to publish structured data compliant with accepted standards, (2) support for multiple data types including microarrays and next generation sequencing, and (3) query, analysis and visualization integration tools (enabled by consistent processing of the raw data and annotation of samples). It is available as open-source software. We present two case studies where this software is currently being used to build repositories of genomics experiments: one contains data from hematopoietic stem cells and another from Parkinson's disease patients.

Conclusion

The web-based framework eXframe offers structured annotation of experiments as well as uniform processing and storage of molecular data from microarray and next generation sequencing platforms. The framework allows users to query and integrate information across species, technologies, measurement types and experimental conditions. Our framework is reusable and freely modifiable - other groups or institutions can deploy their own custom web-based repositories based on this software. It is interoperable with the most important data formats in this domain. We hope that other groups will not only use eXframe, but also contribute their own useful modifications.

Deng Ning, Duan Huilong. Automated microarray image gridding using image projection vectors coupled with power spectrum model. Int J Pattern Recogn 2011. [DOI: 10.1142/s021800141000810x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/18/2022]

van Berlo RJP, Wessels LFA, De Ridder D, Reinders MJT. Protein complex prediction using an integrative bioinformatics approach. J Bioinform Comput Biol 2011;5:839-64. [PMID: 17787059 DOI: 10.1142/s0219720007002953] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Received: 09/11/2006] [Revised: 03/22/2007] [Accepted: 03/22/2007] [Indexed: 11/18/2022]

Bianchi FT, Camera P, Ala U, Imperiale D, Migheli A, Boda E, Tempia F, Berto G, Bosio Y, Oddo S, LaFerla FM, Taraglio S, Dotti CG, Di Cunto F. The collagen chaperone HSP47 is a new interactor of APP that affects the levels of extracellular beta-amyloid peptides. PLoS One 2011;6:e22370. [PMID: 21829458 PMCID: PMC3145648 DOI: 10.1371/journal.pone.0022370] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Received: 02/02/2011] [Accepted: 06/27/2011] [Indexed: 01/08/2023]

Rutherford ST, van Kessel JC, Shao Y, Bassler BL. AphA and LuxR/HapR reciprocally control quorum sensing in vibrios. Genes Dev 2011;25:397-408. [PMID: 21325136 DOI: 10.1101/gad.2015011] [Citation(s) in RCA: 198] [Impact Index Per Article: 15.2] [Indexed: 11/25/2022]

Kim SS, Bhak J. Post-GWAS Strategies. Genomics Inform 2011. [DOI: 10.5808/gi.2011.9.1.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 12/31/2022]

Jung Y, Seo HJ, Park YR, Kim JH, Bien SJ, Kim JH. Standard-based Integration of Heterogeneous Large-scale DNA Microarray Data for Improving Reusability. Genomics Inform 2011. [DOI: 10.5808/gi.2011.9.1.019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 01/22/2023]

Chervitz SA, Deutsch EW, Field D, Parkinson H, Quackenbush J, Rocca-Serra P, Sansone SA, Stoeckert CJ, Taylor CF, Taylor R, Ball CA. Data standards for Omics data: the basis of data sharing and reuse. Methods Mol Biol 2011;719:31-69. [PMID: 21370078 PMCID: PMC4152841 DOI: 10.1007/978-1-61779-027-0_2] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Indexed: 12/04/2022]

Brysbaert G, Pellay FX, Noth S, Benecke A. Quality assessment of transcriptome data using intrinsic statistical properties. Genomics Proteomics & Bioinformatics 2010;8:57-71. [PMID: 20451162 PMCID: PMC5054119 DOI: 10.1016/s1672-0229(10)60006-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Indexed: 01/06/2023]

Penkett CJ, Bähler J. Navigating public microarray databases. Comp Funct Genomics 2010;5:471-9. [PMID: 18629145 PMCID: PMC2447434 DOI: 10.1002/cfg.427] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Received: 06/25/2004] [Revised: 08/12/2004] [Accepted: 08/12/2004] [Indexed: 11/17/2022]

ROCK: a breast cancer functional genomics resource. Breast Cancer Res Treat 2010;124:567-72. [PMID: 20563840 DOI: 10.1007/s10549-010-0945-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Received: 04/30/2010] [Accepted: 05/08/2010] [Indexed: 12/20/2022]

Han W, Nicolau M, Noh DY, Jeffrey SS. Characterization of molecular subtypes of Korean breast cancer: an ethnically and clinically distinct population. Int J Oncol 2010;37:51-9. [PMID: 20514396 DOI: 10.3892/ijo_00000652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/05/2022]

Gupta G, Liu A, Ghosh J. Automated hierarchical density shaving: a robust automated clustering and visualization framework for large biological data sets. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2010;7:223-237. [PMID: 20431143 DOI: 10.1109/tcbb.2008.32] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Indexed: 05/29/2023]

Tagmount A, Wang M, Lindquist E, Tanaka Y, Teranishi KS, Sunagawa S, Wong M, Stillman JH. The porcelain crab transcriptome and PCAD, the porcelain crab microarray and sequence database. PLoS One 2010;5:e9327. [PMID: 20174471 PMCID: PMC2824831 DOI: 10.1371/journal.pone.0009327] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Received: 06/15/2009] [Accepted: 01/27/2010] [Indexed: 01/11/2023]

Abstract

Background

With the emergence of a completed genome sequence of the freshwater crustacean Daphnia pulex, construction of genomic-scale sequence databases for additional crustacean sequences is important for comparative genomics and annotation. Porcelain crabs, genus Petrolisthes, have been powerful crustacean models for environmental and evolutionary physiology with respect to thermal adaptation and understanding responses of marine organisms to climate change. Here, we present a large-scale EST sequencing and cDNA microarray database project for the porcelain crab Petrolisthes cinctipes.

Methodology/Principal Findings

A set of ∼30K unique sequences (UniSeqs) representing ∼19K clusters were generated from ∼98K high quality ESTs from a set of tissue specific non-normalized and mixed-tissue normalized cDNA libraries from the porcelain crab Petrolisthes cinctipes. Homology for each UniSeq was assessed using BLAST, InterProScan, GO and KEGG database searches. Approximately 66% of the UniSeqs had homology in at least one of the databases. All EST and UniSeq sequences along with annotation results and coordinated cDNA microarray datasets have been made publicly accessible at the Porcelain Crab Array Database (PCAD), a feature-enriched version of the Stanford and Longhorn Array Databases.

Conclusions/Significance

The EST project presented here represents the third largest sequencing effort for any crustacean, and the largest effort for any crab species. Our assembly and clustering results suggest that our porcelain crab EST data set is equally diverse to the much larger EST set generated in the Daphnia pulex genome sequencing project, and thus will be an important resource to the Daphnia research community. Our homology results support the pancrustacea hypothesis and suggest that Malacostraca may be ancestral to Branchiopoda and Hexapoda. Our results also suggest that our cDNA microarrays cover as much of the transcriptome as can reasonably be captured in EST library sequencing approaches, and thus represent a rich resource for studies of environmental genomics.

Wilflingseder J, Kainz A, Mühlberger I, Perco P, Langer R, Kristo I, Mayer B, Oberbauer R. Impaired metabolism in donor kidney grafts after steroid pretreatment. Transpl Int 2010;23:796-804. [PMID: 20149158 DOI: 10.1111/j.1432-2277.2010.01053.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Indexed: 12/30/2022]

Yang X, Sun X. Meta-analysis of cancer gene-profiling data. Methods Mol Biol 2010;576:409-26. [PMID: 19882274 DOI: 10.1007/978-1-59745-545-9_21] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 02/19/2023]

Celton M, Malpertuy A, Lelandais G, de Brevern AG. Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments. BMC Genomics 2010;11:15. [PMID: 20056002 PMCID: PMC2827407 DOI: 10.1186/1471-2164-11-15] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.7] [Received: 09/02/2009] [Accepted: 01/07/2010] [Indexed: 11/17/2022]

Abstract

Background

Microarray technologies produce large amounts of data. In a previous study, we showed the value of the k-Nearest Neighbour approach for restoring missing gene expression values and its positive impact on gene clustering by hierarchical algorithms. Since then, numerous replacement methods have been proposed to impute missing values (MVs) for microarray data. In this study, we evaluated twelve different usable methods and their influence on the quality of gene clustering. We used several datasets, covering both kinetic and non-kinetic experiments from yeast and human.

Results

We underline the excellent efficiency of the approaches proposed and implemented by Bo and co-workers, especially the one based on expectation maximization (EM_array). These improvements were also observed in the imputation of extreme values, which are the most difficult values to predict. We showed that imputed MVs still have important effects on the stability of gene clusters. The improvement in the clustering obtained by hierarchical clustering remains limited and is not sufficient to completely restore the correct gene associations. However, a common tendency can be found between the quality of the imputation method and gene cluster stability. Although comparing clustering algorithms is a complex task, we observed that the k-means approach conserves gene associations more efficiently.

Conclusions

More than 6,000,000 independent simulations assessed the quality of 12 imputation methods on five very different biological datasets. Important improvements have thus been made since our last study. The EM_array approach is an efficient method for restoring missing gene expression values, with a lower estimation error level. Nonetheless, the presence of MVs, even at a low rate, is a major factor in gene cluster instability. Our study highlights the need for a systematic assessment of imputation methods and therefore for dedicated benchmarks. A notable point is the specific influence of some biological datasets.

Huang CW, Lin CY, Huang HY, Liu HW, Chen YJ, Shih DF, Chen HY, Juan CC, Ker CG, Huang CYF, Li CF, Shiue YL. CKS1B overexpression implicates clinical aggressiveness of hepatocellular carcinomas but not p27(Kip1) protein turnover: an independent prognosticator with potential p27(Kip1)-independent oncogenic attributes? Ann Surg Oncol 2009;17:907-22. [PMID: 19866239 DOI: 10.1245/s10434-009-0779-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Received: 03/31/2009] [Indexed: 12/25/2022]

Abstract

BACKGROUND

Through data mining of the Stanford Microarray Database, the CKS1B transcript was found to be frequently upregulated in hepatocellular carcinomas (HCCs) with low alpha-fetoprotein (AFP) expression. Together with SKP2, CKS1B is known to be implicated in p27(Kip1) protein turnover, promoting cell-cycle progression.

METHODS

CKS1B, p27(Kip1), and SKP2 were immunostained in 75 HCCs and correlated with clinicopathological features, local recurrence-free survival (LRFS), and overall survival (OS). Silencing of CKS1B and SKP2 with interference short-hairpin RNA (shRNA) was performed in SK-Hep1 and Hep-3B cell lines.

RESULTS

Immunohistochemically, increased CKS1B and SKP2, and attenuated p27(Kip1) were all associated with tumor multiplicity (P < 0.05) and increasing American Joint Committee on Cancer (AJCC) stage (P < 0.05). Overexpression of CKS1B significantly correlated with advanced Okuda stages (P = 0.048) and SKP2 overexpression (P = 0.047). Neither CKS1B nor SKP2 was inversely related to p27(Kip1), which was reinforced by no alteration in p27(Kip1) abundance in HCC-derived cells with CKS1B or SKP2 silencing. Both CKS1B overexpression (P = 0.0011 and P = 0.0017) and p27(Kip1) attenuation (P = 0.0079 and P = 0.0085) were predictive of OS and LRFS, respectively, while SKP2 overexpression was associated with worse OS alone (P = 0.0043). Combined assessment of CKS1B and p27(Kip1) was able to robustly distinguish three prognostically different groups (P < 0.0001). In multivariate comparison, CKS1B overexpression represented the strongest independent adverse prognosticator [OS, P = 0.0235, hazard ratio (HR): 4.193; LRFS, P = 0.0204, HR: 4.262], followed by p27(Kip1) attenuation (OS, P = 0.0320, HR: 2.553; LRFS, P = 0.0262, HR: 2.533).

CONCLUSIONS

CKS1B protein overexpression in HCCs is implicated in clinical aggressiveness but not in p27(Kip1) turnover, implying presence of p27(Kip1)-independent oncogenic attributes. The combined assessment of CKS1B and p27(Kip1) immunoexpressions effectively risk-stratifies HCCs with different prognoses, which may aid in the management of this deadly malignancy.

Hulsman M, Reinders MJT, de Ridder D. Evolutionary optimization of kernel weights improves protein complex comembership prediction. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2009;6:427-437. [PMID: 19644171 DOI: 10.1109/tcbb.2008.137] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 05/28/2023]

Carrera J, Rodrigo G, Jaramillo A. Towards the automated engineering of a synthetic genome. Molecular BioSystems 2009;5:733-43. [PMID: 19562112 DOI: 10.1039/b904400k] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Indexed: 01/09/2023]

Li G, Che D, Xu Y. A universal operon predictor for prokaryotic genomes. J Bioinform Comput Biol 2009;7:19-38. [PMID: 19226658 DOI: 10.1142/s0219720009003984] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Received: 10/01/2007] [Revised: 02/21/2008] [Accepted: 04/22/2008] [Indexed: 11/18/2022]

Bhardwaj N, Lu H. Co-expression among constituents of a motif in the protein-protein interaction network. J Bioinform Comput Biol 2009;7:1-17. [PMID: 19226657 DOI: 10.1142/s0219720009003959] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Received: 05/11/2008] [Revised: 09/19/2008] [Accepted: 09/22/2008] [Indexed: 11/18/2022]

Abstract

Almost all cellular functions are the results of well-coordinated interactions between various proteins. A more connected hub or motif in the interaction network is expected to be more important, and any perturbation in this motif would be more damaging to the smooth performance of the related functions. Thus, some coherent robustness of these hubs has to be derived. Here, we provide the global evidence that interaction hubs obtain their robustness against uneven protein concentrations through co-expression of the constituents, and that the degree of co-expression correlates strongly with the complexity of the embedded motif. We calculated the gene expression correlations between the proteins embedded in 3-, 4-, 5-, and 6-node interaction motifs of increasing complexities, and compared them to those between proteins from random motifs of similar complexities. We find that as the connectedness of these motifs increases, there is higher co-expression between the constituent proteins. For example, when the expression correlation is 0.7, the kernel density of the correlation increases from 0.152 for 4-node motifs with three edges to 0.403 for 4-node cliques. This implies that the robustness of the interaction system emerges from a proportionate synchronicity among the constituents of the motif via co-expression. We further show that such biological coherence via co-expression of component proteins can be reinforced by integrating conservation data in the analysis. For example, with addition of evolutionary information from other genomes, the ratio of kernel density for interaction and random data in the case of 5- and 6-node cliques in yeast increases from 37.8 to 123 and 98.4 to 1300, respectively, given that the expression correlation is 0.8. Our results show that genes whose products are involved in motifs have transcription and translation properties that minimize the noise in final protein concentrations, compared to random sets of genes.

Holbein S, Wengi A, Decourty L, Freimoser FM, Jacquier A, Dichtl B. Cordycepin interferes with 3' end formation in yeast independently of its potential to terminate RNA chain elongation. RNA 2009;15:837-49. [PMID: 19324962 PMCID: PMC2673080 DOI: 10.1261/rna.1458909] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Indexed: 05/09/2023]

Heath LS, Sioson AA. Semantics of multimodal network models. IEEE/ACM Trans Comput Biol Bioinform 2009;6:271-280. [PMID: 19407351 DOI: 10.1109/tcbb.2007.70242] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 05/27/2023]

Wang H, Kakaradov B, Collins SR, Karotki L, Fiedler D, Shales M, Shokat KM, Walther TC, Krogan NJ, Koller D. A complex-based reconstruction of the Saccharomyces cerevisiae interactome. Mol Cell Proteomics 2009;8:1361-81. [PMID: 19176519 PMCID: PMC2690481 DOI: 10.1074/mcp.m800490-mcp200] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.0] [Indexed: 11/30/2022] Open

Chervitz SA, Parkinson H, Fostel JM, Causton HC, Sanson SA, Deutsch EW, Field D, Taylor CF, Rocca-Serra P, White J, Stoeckert CJ. Standards for Functional Genomics. Bioinformatics 2009. [DOI: 10.1007/978-0-387-92738-1_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/30/2022] Open

Wilflingseder J, Perco P, Kainz A, Korbély R, Mayer B, Oberbauer R. Biocompatibility of haemodialysis membranes determined by gene expression of human leucocytes: a crossover study. Eur J Clin Invest 2008;38:918-24. [PMID: 19021716 DOI: 10.1111/j.1365-2362.2008.02050.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Indexed: 11/28/2022]

Giannakeas N, Fotiadis DI. An automated method for gridding and clustering-based segmentation of cDNA microarray images. Comput Med Imaging Graph 2008;33:40-9. [PMID: 19046850 DOI: 10.1016/j.compmedimag.2008.10.003] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Received: 03/24/2008] [Revised: 09/18/2008] [Accepted: 10/06/2008] [Indexed: 10/21/2022]

Integrating functional genomics data. Methods Mol Biol 2008;453:267-78. [PMID: 18712309 DOI: 10.1007/978-1-60327-429-6_14] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Indexed: 02/18/2023]

Wilflingseder J, Kainz A, Perco P, Korbely R, Mayer B, Oberbauer R. Molecular predictors for anaemia after kidney transplantation. Nephrol Dial Transplant 2008;24:1015-23. [DOI: 10.1093/ndt/gfn683] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 11/13/2022] Open

Tomlinson C, Thimma M, Alexandrakis S, Castillo T, Dennis JL, Brooks A, Bradley T, Turnbull C, Blaveri E, Barton G, Chiba N, Maratou K, Soutter P, Aitman T, Game L. MiMiR--an integrated platform for microarray data sharing, mining and analysis. BMC Bioinformatics 2008;9:379. [PMID: 18801157 PMCID: PMC2572073 DOI: 10.1186/1471-2105-9-379] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Received: 05/22/2008] [Accepted: 09/18/2008] [Indexed: 11/10/2022] Open

Abstract

Background

Despite considerable efforts within the microarray community for standardising data format, content and description, microarray technologies present major challenges in managing, sharing, analysing and re-using the large amount of data generated locally or internationally. Additionally, it is recognised that inconsistent and low quality experimental annotation in public data repositories significantly compromises the re-use of microarray data for meta-analysis. MiMiR, the Microarray data Mining Resource, was designed to tackle some of these limitations and challenges. Here we present new software components and enhancements to the original infrastructure that increase accessibility, utility and opportunities for large scale mining of experimental and clinical data.

Results

A user friendly Online Annotation Tool allows researchers to submit detailed experimental information via the web at the time of data generation rather than at the time of publication. This ensures the easy access and high accuracy of meta-data collected. Experiments are programmatically built in the MiMiR database from the submitted information and details are systematically curated and further annotated by a team of trained annotators using a new Curation and Annotation Tool. Clinical information can be annotated and coded with a clinical Data Mapping Tool within an appropriate ethical framework. Users can visualise experimental annotation, assess data quality, download and share data via a web-based experiment browser called MiMiR Online. All requests to access data in MiMiR are routed through a sophisticated middleware security layer thereby allowing secure data access and sharing amongst MiMiR registered users prior to publication. Data in MiMiR can be mined and analysed using the integrated EMAAS open source analysis web portal or via export of data and meta-data into Rosetta Resolver data analysis package.

Conclusion

The new MiMiR suite of software enables systematic and effective capture of extensive experimental and clinical information with the highest MIAME score, and secure data sharing prior to publication. MiMiR currently contains more than 150 experiments corresponding to over 3000 hybridisations and supports the Microarray Centre's large microarray user community and two international consortia. The MiMiR flexible and scalable hardware and software architecture enables secure warehousing of thousands of datasets, including clinical studies, from microarray and potentially other -omics technologies.

Kim WK, Krumpelman C, Marcotte EM. Inferring mouse gene functions from genomic-scale data using a combined functional network/classification strategy. Genome Biol 2008;9 Suppl 1:S5. [PMID: 18613949 PMCID: PMC2447539 DOI: 10.1186/gb-2008-9-s1-s5] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.9] [Indexed: 11/10/2022] Open

Abstract

The complete set of mouse genes, as with the set of human genes, is still largely uncharacterized, with many pieces of experimental evidence accumulating regarding the activities and expression of the genes, but the majority of genes as yet still of unknown function. Within the context of the MouseFunc competition, we developed and applied two distinct large-scale data mining approaches to infer the functions (Gene Ontology annotations) of mouse genes from experimental observations from available functional genomics, proteomics, comparative genomics, and phenotypic data. The two strategies — the first using classifiers to map features to annotations, the second propagating annotations from characterized genes to uncharacterized genes along edges in a network constructed from the features — offer alternative and possibly complementary approaches to providing functional annotations. Here, we re-implement and evaluate these approaches and their combination for their ability to predict the proper functional annotations of genes in the MouseFunc data set. We show that, when controlling for the same set of input features, the network approach generally outperformed a naïve Bayesian classifier approach, while their combination offers some improvement over either independently. We make our observations of predictive performance on the MouseFunc competition hold-out set, as well as on a ten-fold cross-validation of the MouseFunc data. Across all 1,339 annotated genes in the MouseFunc test set, the median predictive power was quite strong (median area under a receiver operating characteristic plot of 0.865 and average precision of 0.195), indicating that a mining-based strategy with existing data is a promising path towards discovering mammalian gene functions. 
As one product of this work, a high-confidence subset of the functional mouse gene network was produced — spanning >70% of mouse genes with >1.6 million associations — that is predictive of mouse (and therefore often human) gene function and functional associations. The network should be generally useful for mammalian gene functional analyses, such as for predicting interactions, inferring functional connections between genes and pathways, and prioritizing candidate genes. The network and all predictions are available on the worldwide web.

Tan MP, Smith EN, Broach JR, Floudas CA. Microarray data mining: a novel optimization-based approach to uncover biologically coherent structures. BMC Bioinformatics 2008;9:268. [PMID: 18538024 PMCID: PMC2442101 DOI: 10.1186/1471-2105-9-268] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Received: 11/15/2007] [Accepted: 06/06/2008] [Indexed: 11/16/2022] Open