Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Patel JM. The role of declarative querying in bioinformatics. OMICS 2003;7:89-91. [PMID: 12831563 DOI: 10.1089/153623103322006670] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Number

Cited by Other Article(s)

Jamil HM. A Visual Interface for Querying Heterogeneous Phylogenetic Databases. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017;14:131-144. [PMID: 26812733 DOI: 10.1109/tcbb.2016.2520943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Jamil HM. Improving Integration Effectiveness of ID Mapping Based Biological Record Linkage. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015;12:473-486. [PMID: 26357233 DOI: 10.1109/tcbb.2014.2355213] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Kumar A, Grupcev V, Berrada M, Fogarty JC, Tu YC, Zhu X, Pandit SA, Xia Y. DCMS: A data analytics and management system for molecular simulation. JOURNAL OF BIG DATA 2014;2:9. [PMID: 26069879 PMCID: PMC4456345 DOI: 10.1186/s40537-014-0009-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Accepted: 11/06/2014] [Indexed: 06/04/2023]

Abstract

Molecular Simulation (MS) is a powerful tool for studying physical/chemical features of large systems and has seen applications in many scientific and engineering domains. During the simulation process, the experiments generate a very large number of atoms and intend to observe their spatial and temporal relationships for scientific analysis. The sheer data volumes and their intensive interactions impose significant challenges for data accessing, managing, and analysis. To date, existing MS software systems fall short on storage and handling of MS data, mainly because of the missing of a platform to support applications that involve intensive data access and analytical process. In this paper, we present the database-centric molecular simulation (DCMS) system our team developed in the past few years. The main idea behind DCMS is to store MS data in a relational database management system (DBMS) to take advantage of the declarative query interface (i.e., SQL), data access methods, query processing, and optimization mechanisms of modern DBMSs. A unique challenge is to handle the analytical queries that are often compute-intensive. For that, we developed novel indexing and query processing strategies (including algorithms running on modern co-processors) as integrated components of the DBMS. As a result, researchers can upload and analyze their data using efficient functions implemented inside the DBMS. Index structures are generated to store analysis results that may be interesting to other users, so that the results are readily available without duplicating the analysis. We have developed a prototype of DCMS based on the PostgreSQL system and experiments using real MS data and workload show that DCMS significantly outperforms existing MS software systems. We also used it as a platform to test other data management issues such as security and compression.

Collapse

Jamil HM. Designing integrated computational biology pipelines visually. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2013;10:605-618. [PMID: 24091395 DOI: 10.1109/tcbb.2013.69] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Grupcev V, Yuan Y, Tu YC, Huang J, Chen S, Pandit S, Weng M. Approximate Algorithms for Computing Spatial Distance Histograms with Accuracy Guarantees. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 2012;25:1982-1996. [PMID: 24693210 PMCID: PMC3969837 DOI: 10.1109/tkde.2012.149] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]

Chen S, Tu YC, Xia Y. Performance analysis of a dual-tree algorithm for computing spatial distance histograms. THE VLDB JOURNAL : VERY LARGE DATA BASES : A PUBLICATION OF THE VLDB ENDOWMENT 2011;20:471-494. [PMID: 21804753 PMCID: PMC3145372 DOI: 10.1007/s00778-010-0205-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]