1
|
Liu J, Qu Z, Yang M, Sun J, Su S, Zhang L. Jointly Integrating VCF-Based Variants and OWL-Based Biomedical Ontologies in MongoDB. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:1504-1515. [PMID: 31689201 DOI: 10.1109/tcbb.2019.2951137] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The development of the next-generation sequencing (NGS) technologies has led to massive amounts of VCF (Variant Call Format) files, which have been the standard formats developed with 1000 Genomes Project. At the same time, with the widespread use of biomedical ontologies in the biomedical community, more and more applications have accepted the Web Ontology Language (OWL) as the dominant data format for the specifications of biomedical ontology descriptions, leading to the rapid growth of OWL-based biomedical ontology scale. In this paper, we seek to explore an effective method for the management of VCF-based genetic variants and OWL-based biological ontologies using the MongoDB database. Considering many current applications (such as the short genetic variations database dbSNP, etc.) are transitioning to the new design by using JSON (JavaScript Object Notation) to support future massive data expansion and interchanges. We firstly propose a series of rules for the mapping from VCF and OWL files to JSON files, and then present rule-based algorithms for transforming VCF-based genetic variants and OWL-based biological ontologies into JSON objects. On this basis, we introduce effective approaches of integrating the mapped JSON files in MongoDB. Finally, we complement this work with a set of experiments to show the performance of our proposed approaches. The source code of the proposed approaches could be freely available at https://github.com/lyotvincent/AJIA.
Collapse
|
2
|
Zhou WJ, Qu Z, Song CY, Sun Y, Lai AL, Luo MY, Ying YZ, Meng H, Liang Z, He YJ, Li YH, Liu J. NeoPeptide: an immunoinformatic database of T-cell-defined neoantigens. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2019:5670755. [PMID: 31819989 PMCID: PMC6901387 DOI: 10.1093/database/baz128] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 10/11/2019] [Accepted: 10/15/2019] [Indexed: 12/13/2022]
Abstract
Therapeutic vaccines represent a promising immunotherapeutic modality against cancer. Discovery and validation of antigens is the key to develop effective anti-cancer vaccines. Neoantigens, arising from somatic mutations in individual cancers, are considered as ideal cancer vaccine targets because of their immunogenicity and lack of expression in normal tissues. However, only few databases support convenient access to these neoantigens for use in vaccines. To address this gap, we developed a web-accessible database, called NeoPeptide, which contains most of the important characteristics of neoantigens (such as mutation site, subunit sequence, major histocompatibility complex restriction) derived from published literature and other immunological resources. NeoPeptide also provides links to resources for further characterization of the novel features of these neoantigens. NeoPeptide will be regularly updated with newly identified and published neoantigens. Our work will help researchers in identifying neoantigens in different cancers and hasten the search for appropriate cancer vaccine candidates.
Collapse
Affiliation(s)
- Wei-Jun Zhou
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Zhi Qu
- School of Computer Science and Technology, Harbin Institute of Technology, 92 West Dazhi Street,Nan Gang District, Harbin 150001, China
| | - Chao-Yang Song
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Yang Sun
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - An-Li Lai
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Ma-Yao Luo
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Yu-Zhe Ying
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Hu Meng
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Zhao Liang
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Yan-Jie He
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Yu-Hua Li
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Jian Liu
- College of Computer Science, NanKai University, No.38 Tongyan Road, Jinnan District, Tianjin 300350, China
| |
Collapse
|