1
|
Ma H, Shen L, Wang J, Wang S, Wang M, Wang M, Li Z, Li J. Toward clearer recognition and easier usefulness: development of a cross-lingual atherosclerotic cerebrovascular disease ontology. Database (Oxford) 2024; 2024:baae117. [PMID: 39657146 PMCID: PMC11630243 DOI: 10.1093/database/baae117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 10/18/2024] [Accepted: 11/21/2024] [Indexed: 12/17/2024]
Abstract
Atherosclerotic cerebrovascular disease could result in a great number of deaths and disabilities. However, it did not acquire enough attention. Less information, statistics, or data on the disease has been revealed. Thus, no systematic concept datasets were released to help clinicians clarify the scope, assist research, and offer maximized value. This study aimed to develop a cross-lingual atherosclerotic cerebrovascular disease ontology; describe the workflow, schema, hierarchical structure, and the highlighted content; design a brand-new rehabilitation ontology; implement the ontology evaluation; and illustrate the application scenarios in real-world scenarios. We implemented nine steps based on the Ontology Development 101 methodologies combined with expert opinions. The ontology included collection and specification of clinical requirements, background investigation and knowledge acquisition, ontology selection and reuse, scope identification, schema definition, concept extraction, concept extension, ontology verification, and ontology evaluation. We evaluated the proposed ontology in the literature classification task. The current ontology included 10 top-level classes, respectively, clinical manifestation, comorbidity, complication, diagnosis, model of atherosclerotic cerebrovascular disease, pathogenesis, prevention, rehabilitation, risk factor, and treatment. There are 1715 concepts in the 11-level ontology, covering 4588 Chinese terms, 6617 English terms, and 972 definitions. The ontology could be applied in real-world scenarios such as information retrieval, new expression discovery, named entity recognition, and knowledge fusion, and the use case proved that it could offer satisfying support to related medical scenarios. The ontology was proven to be useful in text classification tasks, and the weight-F1 score could reach >80% combined with the pretrained model. The proposed ontology provided a clear set of cross-lingual concepts and terms with an explicit hierarchical structure, helping scientific researchers to quickly retrieve relevant medical literature, assisting data scientists to efficiently identify relevant contents in electronic health records, and providing a clear domain framework for academic reference. Database URL: https://bioportal.bioontology.org/ontologies/ACVD_ONTOLOGY.
Collapse
Affiliation(s)
- Hetong Ma
- Intelligent Computing Department, Institute of Medical Information & Library, Chinese Academy of Medical Sciences/Peking Union Medical College, No. 3 Yabao Road, Beijing 100020, China
| | - Liu Shen
- Intelligent Computing Department, Institute of Medical Information & Library, Chinese Academy of Medical Sciences/Peking Union Medical College, No. 3 Yabao Road, Beijing 100020, China
| | - Jiayang Wang
- Intelligent Computing Department, Chinese Academy of Medical Sciences/Peking Union Medical College, No. 3 Yabao Road, Beijing 100020, China
| | - Shilong Wang
- Computer Science Department, Harbin Institute of Technology, No. 92, Xidazhi Street, Harbin 150001, China
| | - Min Wang
- Intelligent Computing Department, Institute of Medical Information & Library, Chinese Academy of Medical Sciences/Peking Union Medical College, No. 3 Yabao Road, Beijing 100020, China
| | - Meng Wang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, No. 119 South Fourth Ring West Road, Beijing 100070, China
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No. 119 South Fourth Ring West Road, Beijing 100070, China
- National Center for Healthcare Quality Management in Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No. 119 South Fourth Ring West Road, Beijing 100070, China
| | - Zixiao Li
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, No. 119 South Fourth Ring West Road, Beijing 100070, China
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No. 119 South Fourth Ring West Road, Beijing 100070, China
- National Center for Healthcare Quality Management in Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No. 119 South Fourth Ring West Road, Beijing 100070, China
- Research Unit of Artificial Intelligence in Cerebrovascular Disease, Chinese Academy of Medical Sciences, Beijing Tiantan Hospital, Capital Medical University, No. 119 South Fourth Ring West Road, Beijing 100070, China
- Chinese Institute for Brain Research, No. 9 Yike Road, Beijing 102206, China
| | - Jiao Li
- Intelligent Computing Department, Institute of Medical Information & Library, Chinese Academy of Medical Sciences/Peking Union Medical College, No. 3 Yabao Road, Beijing 100020, China
| |
Collapse
|
2
|
Taheri Moghadam S, Sheikhtaheri A, Hooman N. Patient safety classifications, taxonomies and ontologies, part 2: A systematic review on content coverage. J Biomed Inform 2023; 148:104549. [PMID: 37984548 DOI: 10.1016/j.jbi.2023.104549] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 10/11/2023] [Accepted: 11/16/2023] [Indexed: 11/22/2023]
Abstract
BACKGROUND Content coverage of patient safety ontology and classification systems should be evaluated to provide a guide for users to select appropriate ones for specific applications. In this review, we identified and compare content coverage of patient safety classifications and ontologies. METHODS We searched different databases and ontology/classification repositories to identify these classifications and ontologies. We included patient safety-related taxonomies, ontologies, classifications, and terminologies. We identified and extracted different concepts covered by these systems and mapped these concepts to international classification for patient safety (ICPS) and finally compared the content of these systems. RESULTS Finally, 89 papers (77 classifications or ontologies) were analyzed. Thirteen classifications have been developed to cover all medical domains. Among specific domain systems, most systems cover medication (16), surgery (8), medical devices (3), general practice (3), and primary care (3). The most common patient safety-related concepts covered in these systems include incident types (41), contributing factors/hazards (31), patient outcomes (29), degree of harm (25), and action (18). However, stage/phase (6), incident characteristics (5), detection (5), people involved (5), organizational outcomes (4), error type (4), and care setting (3) are some of the less covered concepts in these classifications/ontologies. CONCLUSION Among general systems, ICPS, World Health Organization's Adverse Reaction Terminology (WHO-ART), and Ontology of Adverse Events (OAE) cover most patient safety concepts and can be used as a gold standard for all medical domains. As a result, reporting systems could make use of these broad classifications, but the majority of their covered concepts are related to patient outcomes, with the exception of ICPS, which covers other patient safety concepts. However, the ICPS does not cover specialized domain concepts. For specific medical domains, MedDRA, NCC MERP, OPAE, ADRO, PPST, OCCME, TRTE, TSAHI, and PSIC-PC provide the broadest coverage of concepts. Many of the patient safety classifications and ontologies are not formally registered or available as formal classification/ontology in ontology repositories such as BioPortal. This study may be used as a guide for choosing appropriate classifications for various applications or expanding less developed patient safety classifications/ontologies. Furthermore, the same concepts are not represented by the same terms; therefore, the current study could be used to guide a harmonization process for existing or future patient safety classifications/ontologies.
Collapse
Affiliation(s)
- Sharare Taheri Moghadam
- Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran
| | - Abbas Sheikhtaheri
- Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran.
| | - Nakysa Hooman
- Aliasghar Clinical Research Development Center (AACRDC), Aliasghar Children Hospital, Department of Pediatrics, School of Medicine, Iran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
3
|
Patient safety classification, taxonomy and ontology systems: A systematic review on development and evaluation methodologies. J Biomed Inform 2022; 133:104150. [PMID: 35878822 DOI: 10.1016/j.jbi.2022.104150] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 06/11/2022] [Accepted: 07/19/2022] [Indexed: 11/24/2022]
Abstract
INTRODUCTION Patient safety classifications/ontologies enable patient safety information systems to receive and analyze patient safety data to improve patient safety. Patient safety classifications/ontologies have been developed and evaluated using a variety of methods. The purpose of this review was to discuss and analyze the methodologies for developing and evaluating patient safety classifications/ontologies. METHODS Studies that developed or evaluated patient safety classifications, terminologies, taxonomies, or ontologies were searched through Google Scholar, Google search engines, National Center for Biomedical Ontology (NCBO) BioPortal, Open Biological and Biomedical Ontology (OBO) Foundry and World Health Organization (WHO) websites and Scopus, Web of Science, PubMed, and Science Direct. We updated our search on 30 February 2021 and included all studies published until the end of 2020. Studies that developed or evaluated classifications only for patient safety and provided information on how they were developed or evaluated were included. Systems with covered patient safety terms (such as ICD-10) but are not specifically developed for patient safety were excluded. The quality and the risk of bias of studies were not assessed because all methodologies and criteria were intended to be covered. In addition, we analyzed the data through descriptive narrative synthesis and compared and classified the development and evaluation methods and evaluation criteria according to available development and evaluation approaches for biomedical ontologies. RESULTS We identified 84 articles that met all of the inclusion criteria, resulting in 70 classifications/ontologies, nine of which were for the general medical domain. The most papers were published in 2010 and 2011, with 8 and 7 papers, respectively. The United States (50) and Australia (23) have the most studies. The most commonly used methods for developing classifications/ontologies included the use of existing systems (for expanding or mapping) (44) and qualitative analysis of event reports (39). The most common evaluation methods were coding or classifying some safety report samples (25), quantitative analysis of incidents based on the developed classification (24), and consensus among physicians (16). The most commonly applied evaluation criteria were reliability (27), content and face validity (9), comprehensiveness (6), usability (5), linguistic clarity (5), and impact (4), respectively. CONCLUSIONS Because of the weaknesses and strengths of the development/evaluation methods, it is advised that more than one method for development or evaluation, as well as evaluation criteria, should be used. To organize the processes of developing classification/ontologies, well-established approaches such as Methontology are recommended. The most prevalent evaluation methods applied in this domain are well fitted to the biomedical ontology evaluation methods, but it is also advised to apply some evaluation approaches such as logic, rules, and Natural language processing (NLP) based in combination with other evaluation approaches. This research can assist domain researchers in developing or evaluating domain ontologies using more complete methodologies. There is also a lack of reporting consistency in the literature and same methods or criteria were reported with different terminologies.
Collapse
|
4
|
Wu H, Ji J, Tian H, Chen Y, Ge W, Zhang H, Yu F, Zou J, Nakamura M, Liao J. Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding-Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model. JMIR Med Inform 2021; 9:e26407. [PMID: 34855616 PMCID: PMC8686410 DOI: 10.2196/26407] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 04/22/2021] [Accepted: 10/05/2021] [Indexed: 12/17/2022] Open
Abstract
Background With the increasing variety of drugs, the incidence of adverse drug events (ADEs) is increasing year by year. Massive numbers of ADEs are recorded in electronic medical records and adverse drug reaction (ADR) reports, which are important sources of potential ADR information. Meanwhile, it is essential to make latent ADR information automatically available for better postmarketing drug safety reevaluation and pharmacovigilance. Objective This study describes how to identify ADR-related information from Chinese ADE reports. Methods Our study established an efficient automated tool, named BBC-Radical. BBC-Radical is a model that consists of 3 components: Bidirectional Encoder Representations from Transformers (BERT), bidirectional long short-term memory (bi-LSTM), and conditional random field (CRF). The model identifies ADR-related information from Chinese ADR reports. Token features and radical features of Chinese characters were used to represent the common meaning of a group of words. BERT and Bi-LSTM-CRF were novel models that combined these features to conduct named entity recognition (NER) tasks in the free-text section of 24,890 ADR reports from the Jiangsu Province Adverse Drug Reaction Monitoring Center from 2010 to 2016. Moreover, the man-machine comparison experiment on the ADE records from Drum Tower Hospital was designed to compare the NER performance between the BBC-Radical model and a manual method. Results The NER model achieved relatively high performance, with a precision of 96.4%, recall of 96.0%, and F1 score of 96.2%. This indicates that the performance of the BBC-Radical model (precision 87.2%, recall 85.7%, and F1 score 86.4%) is much better than that of the manual method (precision 86.1%, recall 73.8%, and F1 score 79.5%) in the recognition task of each kind of entity. Conclusions The proposed model was competitive in extracting ADR-related information from ADE reports, and the results suggest that the application of our method to extract ADR-related information is of great significance in improving the quality of ADR reports and postmarketing drug safety evaluation.
Collapse
Affiliation(s)
- Hong Wu
- School of Science, China Pharmaceutical University, Nanjing, China
| | - Jiatong Ji
- School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China
| | - Haimei Tian
- School of Computer Engineering, Jinling Institute of Technology, Nanjing, China
| | - Yao Chen
- School of Science, China Pharmaceutical University, Nanjing, China
| | - Weihong Ge
- Department of Pharmacy, Nanjing Drum Tower Hospital, Nanjing, China
| | - Haixia Zhang
- Department of Pharmacy, Nanjing Drum Tower Hospital, Nanjing, China
| | - Feng Yu
- School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China
| | - Jianjun Zou
- Department of Clinical Pharmacology, Nanjing First Hospital, Nanjing Medical University, Nanjing, China
| | - Mitsuhiro Nakamura
- Laboratory of Drug Informatics, Gifu Pharmaceutical University, Gifu, Japan
| | - Jun Liao
- School of Science, China Pharmaceutical University, Nanjing, China
| |
Collapse
|
5
|
Grabar N, Grouin C. Year 2020 (with COVID): Observation of Scientific Literature on Clinical Natural Language Processing. Yearb Med Inform 2021; 30:257-263. [PMID: 34479397 PMCID: PMC8416212 DOI: 10.1055/s-0041-1726528] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Objectives:
To analyze the content of publications within the medical NLP domain in 2020.
Methods:
Automatic and manual preselection of publications to be reviewed, and selection of the best NLP papers of the year. Analysis of the important issues.
Results:
Three best papers have been selected in 2020. We also propose an analysis of the content of the NLP publications in 2020, all topics included.
Conclusion:
The two main issues addressed in 2020 are related to the investigation of COVID-related questions and to the further adaptation and use of transformer models. Besides, the trends from the past years continue, such as diversification of languages processed and use of information from social networks
Collapse
Affiliation(s)
- Natalia Grabar
- Université Paris Saclay, CNRS, Laboratoire Interdisciplinaire des Sciences du Numérique, Orsay, France.,STL, CNRS, Université de Lille, Domaine du Pont-de-bois, Villeneuve-d'Ascq cedex, France
| | - Cyril Grouin
- Université Paris Saclay, CNRS, Laboratoire Interdisciplinaire des Sciences du Numérique, Orsay, France
| | | |
Collapse
|