Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dai HJ, Singh O, Jonnagaddala J, Su ECY. NTTMUNSW BioC modules for recognizing and normalizing species and gene/protein mentions. Database (Oxford) 2016;2016:baw111. [PMID: 27465130 PMCID: PMC4962763 DOI: 10.1093/database/baw111] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2015] [Accepted: 07/05/2016] [Indexed: 11/13/2022]

For:	Dai HJ, Singh O, Jonnagaddala J, Su ECY. NTTMUNSW BioC modules for recognizing and normalizing species and gene/protein mentions. Database (Oxford) 2016;2016:baw111. [PMID: 27465130 PMCID: PMC4962763 DOI: 10.1093/database/baw111] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2015] [Accepted: 07/05/2016] [Indexed: 11/13/2022]

Number

Cited by Other Article(s)

Dai HJ, Singh O. SPRENO: a BioC module for identifying organism terms in figure captions. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018;2018:5032611. [PMID: 29873706 PMCID: PMC6007219 DOI: 10.1093/database/bay048] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Accepted: 04/23/2018] [Indexed: 11/30/2022]

Abstract

Recent advances in biological research reveal that the majority of the experiments strive for comprehensive exploration of the biological system rather than targeting specific biological entities. The qualitative and quantitative findings of the investigations are often exclusively available in the form of figures in published papers. There is no denying that such findings have been instrumental in intensive understanding of biological processes and pathways. However, data as such is unacknowledged by machines as the descriptions in the figure captions comprise of sumptuous information in an ambiguous manner. The abbreviated term ‘SIN’ exemplifies such issue as it may stand for Sindbis virus or the sex-lethal interactor gene (Drosophila melanogaster). To overcome this ambiguity, entities should be identified by linking them to the respective entries in notable biological databases. Among all entity types, the task of identifying species plays a pivotal role in disambiguating related entities in the text. In this study, we present our species identification tool SPRENO (Species Recognition and Normalization), which is established for recognizing organism terms mentioned in figure captions and linking them to the NCBI taxonomy database by exploiting the contextual information from both the figure caption and the corresponding full text. To determine the ID of ambiguous organism mentions, two disambiguation methods have been developed. One is based on the majority rule to select the ID that has been successfully linked to previously mentioned organism terms. The other is a convolutional neural network (CNN) model trained by learning both the context and the distance information of the target organism mention. As a system based on the majority rule, SPRENO was one of the top-ranked systems in the BioCreative VI BioID track and achieved micro F-scores of 0.776 (entity recognition) and 0.755 (entity normalization) on the official test set, respectively. Additionally, the SPRENO-CNN exhibited better precisions with lower recalls and F-scores (0.720/0.711 for entity recognition/normalization). SPRENO is freely available at https://bigodatamining.github.io/software/201801/.

Database URL: https://bigodatamining.github.io/software/201801/

Collapse

Chang NW, Dai HJ, Shih YY, Wu CY, Dela Rosa MAC, Obena RP, Chen YJ, Hsu WL, Oyang YJ. Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy. Database (Oxford) 2017;2017:bax082. [PMID: 31725857 PMCID: PMC7243925 DOI: 10.1093/database/bax082] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2016] [Revised: 10/11/2017] [Accepted: 10/11/2017] [Indexed: 12/31/2022]

Kim S, Islamaj Doğan R, Chatr-Aryamontri A, Chang CS, Oughtred R, Rust J, Batista-Navarro R, Carter J, Ananiadou S, Matos S, Santos A, Campos D, Oliveira JL, Singh O, Jonnagaddala J, Dai HJ, Su ECY, Chang YC, Su YC, Chu CH, Chen CC, Hsu WL, Peng Y, Arighi C, Wu CH, Vijay-Shanker K, Aydın F, Hüsünbeyi ZM, Özgür A, Shin SY, Kwon D, Dolinski K, Tyers M, Wilbur WJ, Comeau DC. BioCreative V BioC track overview: collaborative biocurator assistant task for BioGRID. Database (Oxford) 2016;2016:baw121. [PMID: 27589962 PMCID: PMC5009341 DOI: 10.1093/database/baw121] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Revised: 07/29/2016] [Accepted: 08/02/2016] [Indexed: 11/14/2022]

Affiliation(s)

Sun Kim National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Rezarta Islamaj Doğan National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Andrew Chatr-Aryamontri Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, QC H3C 3J7, Canada
Christie S Chang Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
Rose Oughtred Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
Jennifer Rust Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
Riza Batista-Navarro National Centre for Text Mining, School of Computer Science, University of Manchester, Manchester, UK
Jacob Carter National Centre for Text Mining, School of Computer Science, University of Manchester, Manchester, UK
Sophia Ananiadou National Centre for Text Mining, School of Computer Science, University of Manchester, Manchester, UK
Sérgio Matos DETI/IEETA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
André Santos DETI/IEETA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
David Campos BMD Software, Lda, Rua Calouste Gulbenkian 1, 3810-074 Aveiro, Portugal
José Luís Oliveira DETI/IEETA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
Onkar Singh Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
Jitendra Jonnagaddala School of Public Health and Community Medicine, University of New South Wales, Kensington NSW 2033, Australia Prince of Wales Clinical School, University of New South Wales, Kensington NSW 2033, Australia
Hong-Jie Dai Department of Computer Science and Information Engineering, National Taitung University, Taitung, Taiwan
Emily Chia-Yu Su Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
Yung-Chun Chang Institute of Information Science, Academia Sinica, Taipei, Taiwan Department of Information Management, National Taiwan University, Taipei, Taiwan
Yu-Chen Su Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan
Chun-Han Chu Institute of Information Science, Academia Sinica, Taipei, Taiwan
Chien Chin Chen Department of Information Management, National Taiwan University, Taipei, Taiwan
Wen-Lian Hsu Institute of Information Science, Academia Sinica, Taipei, Taiwan
Yifan Peng Computer & Information Sciences, University of Delaware, Newark, DE 19716, USA
Cecilia Arighi Computer & Information Sciences, University of Delaware, Newark, DE 19716, USA Center for Bioinformatics & Computational Biology, University of Delaware, Newark, DE 19716, USA
Cathy H Wu Computer & Information Sciences, University of Delaware, Newark, DE 19716, USA Center for Bioinformatics & Computational Biology, University of Delaware, Newark, DE 19716, USA
K Vijay-Shanker Computer & Information Sciences, University of Delaware, Newark, DE 19716, USA
Ferhat Aydın Department of Computer Engineering, Boğaziçi University, Bebek, 34342 Istanbul, Turkey
Zehra Melce Hüsünbeyi Department of Computer Engineering, Boğaziçi University, Bebek, 34342 Istanbul, Turkey
Arzucan Özgür Department of Computer Engineering, Boğaziçi University, Bebek, 34342 Istanbul, Turkey
Soo-Yong Shin Department of Biomedical Informatics, Asan Medical Center, 138-736 Seoul, South Korea
Dongseop Kwon Department of Computer Engineering, Myongji University, 449-728 Yongin, South Korea
Kara Dolinski Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
Mike Tyers Institute for Research in Immunology and Cancer, Université de Montréal, Montréal, QC H3C 3J7, Canada The Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada
W John Wilbur National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Donald C Comeau National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA

Collapse

Jonnagaddala J, Jue TR, Chang NW, Dai HJ. Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion. Database (Oxford) 2016;2016:baw112. [PMID: 27504009 PMCID: PMC4976299 DOI: 10.1093/database/baw112] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2015] [Revised: 07/05/2016] [Accepted: 07/06/2016] [Indexed: 01/01/2023]