1
|
Hao X, Abeysinghe R, Zheng F, Schulz PE, Cui L. Mapping of Alzheimer's disease related data elements and the NIH Common Data Elements. BMC Med Inform Decis Mak 2024; 24:103. [PMID: 38641585 PMCID: PMC11027215 DOI: 10.1186/s12911-024-02500-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Accepted: 04/04/2024] [Indexed: 04/21/2024] Open
Abstract
BACKGROUND Alzheimer's Disease (AD) is a devastating disease that destroys memory and other cognitive functions. There has been an increasing research effort to prevent and treat AD. In the US, two major data sharing resources for AD research are the National Alzheimer's Coordinating Center (NACC) and the Alzheimer's Disease Neuroimaging Initiative (ADNI); Additionally, the National Institutes of Health (NIH) Common Data Elements (CDE) Repository has been developed to facilitate data sharing and improve the interoperability among data sets in various disease research areas. METHOD To better understand how AD-related data elements in these resources are interoperable with each other, we leverage different representation models to map data elements from different resources: NACC to ADNI, NACC to NIH CDE, and ADNI to NIH CDE. We explore bag-of-words based and word embeddings based models (Word2Vec and BioWordVec) to perform the data element mappings in these resources. RESULTS The data dictionaries downloaded on November 23, 2021 contain 1,195 data elements in NACC, 13,918 in ADNI, and 27,213 in NIH CDE Repository. Data element preprocessing reduced the numbers of NACC and ADNI data elements for mapping to 1,099 and 7,584 respectively. Manual evaluation of the mapping results showed that the bag-of-words based approach achieved the best precision, while the BioWordVec based approach attained the best recall. In total, the three approaches mapped 175 out of 1,099 (15.92%) NACC data elements to ADNI; 107 out of 1,099 (9.74%) NACC data elements to NIH CDE; and 171 out of 7,584 (2.25%) ADNI data elements to NIH CDE. CONCLUSIONS The bag-of-words based and word embeddings based approaches showed promise in mapping AD-related data elements between different resources. Although the mapping approaches need further improvement, our result indicates that there is a critical need to standardize CDEs across these valuable AD research resources in order to maximize the discoveries regarding AD pathophysiology, diagnosis, and treatment that can be gleaned from them.
Collapse
Affiliation(s)
- Xubing Hao
- McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Rashmie Abeysinghe
- Department of Neurology, McGovern School of Medicine, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Fengbo Zheng
- McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Paul E Schulz
- Department of Neurology, McGovern School of Medicine, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Licong Cui
- McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA.
| |
Collapse
|
2
|
Ranallo P, Southwell B, Tignanelli C, Johnson SG, Krueger R, Sevareid-Groth T, Carvel A, Melton GB. Promoting Learning Health System Cycles by Optimizing EHR Data Clinical Concept Encoding Processes. Stud Health Technol Inform 2024; 310:68-73. [PMID: 38269767 DOI: 10.3233/shti230929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2024]
Abstract
Electronic health records (EHRs) and other real-world data (RWD) are critical to accelerating and scaling care improvement and transformation. To efficiently leverage it for secondary uses, EHR/RWD should be optimally managed and mapped to industry standard concepts (ISCs). Inherent challenges in concept encoding usually result in inefficient and costly workflows and resultant metadata representation structures outside the EHR. Using three related projects to map data to ISCs, we describe the development of standard, repeatable processes for precisely and unambiguously representing EHR data using appropriate ISCs within the EHR platform lifecycle and mappings specific to SNOMED-CT for Demographics, Specialty and Services. Mappings in these 3 areas resulted in ISC mappings of 779 data elements requiring 90 new concept requests to SNOMED-CT and 738 new ISCs mapped into the workflow within an accessible, enterprise-wide EHR resource with supporting processes.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Adam Carvel
- Fairview Health Services, Minneapolis, MN USA
| | - Genevieve B Melton
- Fairview Health Services, Minneapolis, MN USA
- University of Minnesota, Minneapolis, MN USA
| |
Collapse
|
3
|
Abad-Navarro F, Martínez-Costa C. A knowledge graph-based data harmonization framework for secondary data reuse. Comput Methods Programs Biomed 2024; 243:107918. [PMID: 37981455 DOI: 10.1016/j.cmpb.2023.107918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 10/02/2023] [Accepted: 11/05/2023] [Indexed: 11/21/2023]
Abstract
BACKGROUND AND OBJECTIVE The adoption of new technologies in clinical care systems has propitiated the availability of a great amount of valuable data. However, this data is usually heterogeneous, requiring its harmonization to be integrated and analysed. We propose a semantic-driven harmonization framework that (1) enables the meaningful sharing and integration of healthcare data across institutions and (2) facilitates the analysis and exploitation of the shared data. METHODS The framework includes an ontology-based common data model (i.e. SCDM), a data transformation pipeline and a semantic query system. Heterogeneous datasets, mapped to different terminologies, are integrated by using an ontology-based infrastructure rooted in a top-level ontology. A graph database is generated by using these mappings, and web-based semantic query system facilitates data exploration. RESULTS Several datasets from different European institutions have been integrated by using the framework in the context of the European H2020 Precise4Q project. Through the query system, data scientists were able to explore data and use it for building machine learning models. CONCLUSIONS The flexible data representation using RDF, together with the formal semantic underpinning provided by the SCDM, have enabled the semantic integration, query and advanced exploitation of heterogeneous data in the context of the Precise4Q project.
Collapse
Affiliation(s)
- Francisco Abad-Navarro
- Departamento de Informática y Sistemas, Universidad de Murcia, CEIR Campus Mare Nostrum, IMIB-Arrixaca, 30100, Murcia, Spain.
| | - Catalina Martínez-Costa
- Departamento de Informática y Sistemas, Universidad de Murcia, CEIR Campus Mare Nostrum, IMIB-Arrixaca, 30100, Murcia, Spain.
| |
Collapse
|
4
|
Krastev E, Tcharaktchiev D, Kovachev P, Abanos S. Occupational health assessment summary designed for semantic interoperability. Int J Med Inform 2023; 178:105207. [PMID: 37688835 DOI: 10.1016/j.ijmedinf.2023.105207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2023] [Revised: 08/21/2023] [Accepted: 08/30/2023] [Indexed: 09/11/2023]
Abstract
BACKGROUND Geopolitical and economic crises force a growing number of people to leave their countries and search better employment opportunities abroad. Meanwhile, the highly competitive labor market provides opportunities for employees to change workplaces and job positions. Health assessment data collected during the occupational history is an essential resource for developing efficient occupational disease prevention strategies as well as for ensuring the physical and psychological well-being of newly appointed workers. The diversity in data representation is source for interoperability problems that are insufficiently explored in the existing literature. OBJECTIVES This research aims to design a worker's occupational health assessment summary (OHAS) dataset that satisfies the requirements of an international standard for semantic interoperability in the use case for exchanging extracts of such data. The focus is on the need for a common OHAS standard at EU level allowing seamless exchange of OHAS at both cross-border and at the worker's country of origin level. RESULTS This paper proposes a novelty systematic approach ensuring semantic interoperability in the exchange of OHAS. Two use cases are explored in terms of UML sequence diagram. The OHAS dataset reflects common data requirements established in the national legislation of EU countries. Finally, an EN 13606 archetype of OHAS is designed by satisfying the requirements for semantic interoperability in the exchange of clinical data. Semantic interoperability of OHAS is demonstrated with realistic use case data. CONCLUSIONS The designed static, non-volatile and reusable information model of OHAS developed in this paper allows to create EN 13606 archetype instances that are valid with respect to the Reference model and the datatypes of this standard. Thus, basic activities in the OHAS use case can be implemented in software, for example, by means of a native XML database as well as integrated into existing information systems.
Collapse
Affiliation(s)
- Evgeniy Krastev
- Sofia University "St. Kliment Ohridski" (EFMI Institutional member), Faculty of Mathematics and Informatics, James Bourchier blvd, No. 5, Sofia 1164, Bulgaria.
| | - Dimitar Tcharaktchiev
- Medical University of Sofia (EFMI Institutional member), University Hospital of Endocrinology, Zdrave street No. 2, Sofia 1431, Bulgaria.
| | - Petko Kovachev
- Sofia University "St. Kliment Ohridski" (EFMI Institutional member), Faculty of Mathematics and Informatics, James Bourchier blvd, No. 5, Sofia 1164, Bulgaria
| | - Simeon Abanos
- Sofia University "St. Kliment Ohridski" (EFMI Institutional member), Faculty of Mathematics and Informatics, James Bourchier blvd, No. 5, Sofia 1164, Bulgaria
| |
Collapse
|
5
|
Pournik O, Ahmad B, Lim Choi Keung SN, Peake A, Rafid S, Tong C, Laleci Erturkmen GB, Gencturk M, Akpinar AE, Arvanitis TN. Interoperable E-Health System Using Structural and Semantic Interoperability Approaches in CAREPATH. Stud Health Technol Inform 2023; 305:608-611. [PMID: 37387105 DOI: 10.3233/shti230571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023]
Abstract
Technical and semantic interoperability are broadly used components of interoperability technology in healthcare. Technical Interoperability provides interoperability interfaces to enable data exchange within different healthcare systems, despite any underlying heterogeneity. Semantic interoperability make different healthcare systems understand and interpret the meaning of the data that is exchanged, by using and mapping standardized terminologies, coding systems, and data models to describe the concept and structure of data. We propose a solution using Semantic and Structural Mapping techniques within CAREPATH; a research project designed to develop ICT solutions for the care management of elderly multimorbid patients with mild cognitive impairment or mild dementia. Our technical interoperability solution supplies a standard-based data exchange protocol to enable information exchange between local care systems and CAREPATH components. Our semantic interoperability solution supplies programmable interfaces, in order to semantically mediate different clinical data representation formats and incorporating data format and terminology mapping features. The solution offers a more reliable, flexible and resource efficient method across EHRs.
Collapse
Affiliation(s)
- Omid Pournik
- Department of Electronic, Electrical and Systems Engineering, School of Engineering, University of Birmingham, Birmingham, UK
| | - Bilal Ahmad
- Department of Electronic, Electrical and Systems Engineering, School of Engineering, University of Birmingham, Birmingham, UK
| | - Sarah N Lim Choi Keung
- Department of Electronic, Electrical and Systems Engineering, School of Engineering, University of Birmingham, Birmingham, UK
| | - Ashley Peake
- Department of Electronic, Electrical and Systems Engineering, School of Engineering, University of Birmingham, Birmingham, UK
| | - Shadman Rafid
- Department of Electronic, Electrical and Systems Engineering, School of Engineering, University of Birmingham, Birmingham, UK
| | - Chao Tong
- Department of Electronic, Electrical and Systems Engineering, School of Engineering, University of Birmingham, Birmingham, UK
| | | | - Mert Gencturk
- SRDC Software Research & Development and Consultancy Corporation, Ankara, Turkey
| | - A Emre Akpinar
- SRDC Software Research & Development and Consultancy Corporation, Ankara, Turkey
| | - Theodoros N Arvanitis
- Department of Electronic, Electrical and Systems Engineering, School of Engineering, University of Birmingham, Birmingham, UK
| |
Collapse
|
6
|
Sung S, Park HA, Park SK, Jung H, Kang H, Lee MS. Mapping Korean National Health Insurance Claim Codes for Laboratory Test to SNOMED CT. Stud Health Technol Inform 2023; 302:78-82. [PMID: 37203613 DOI: 10.3233/shti230068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The aim of this study was to map Korean national health insurance claims codes for laboratory tests to SNOMED CT. The mapping source codes were 4,111 claims codes for laboratory test and mapping target codes were the International Edition of SNOMED CT released on July 31, 2020. We used rule-based automated and manual mapping methods. The mapping results were validated by two experts. Out of 4,111 codes, 90.5% were mapped to the concepts of procedure hierarchy in SNOMED CT. Of them, 51.4% of the codes were exactly mapped to SNOMED CT concepts, and 34.8% of the codes were mapped to SNOMED CT concepts as one-to-one mapping.
Collapse
Affiliation(s)
- Sumi Sung
- Office of Hospital Information, Seoul National University Hospital, Seoul, Republic of Korea
| | - Hyeoun-Ae Park
- College of Nursing, Seoul National University, Seoul, Republic of Korea
| | - Seul Ki Park
- Department of Nursing, Daejeon University, Daejeon, Republic of Korea
| | - Hyesil Jung
- Department of Nursing, Inha University, Incheon, Republic of Korea
| | - Hannah Kang
- College of Nursing, Seoul National University, Seoul, Republic of Korea
- Kakao Healthcare Corp., Republic of Korea
| | - Min Sun Lee
- College of Nursing, Seoul National University, Seoul, Republic of Korea
- Healthcare Information Standardization Department, Korea Healthcare Information Service, Republic of Korea
| |
Collapse
|
7
|
Guo H, Scriney M, Liu K. An Ostensive Information Architecture to Enhance Semantic Interoperability for Healthcare Information Systems. Inf Syst Front 2023:1-24. [PMID: 37361885 PMCID: PMC9974391 DOI: 10.1007/s10796-023-10379-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 02/13/2023] [Indexed: 06/28/2023]
Abstract
Semantic interoperability establishes intercommunications and enables data sharing across disparate systems. In this study, we propose an ostensive information architecture for healthcare information systems to decrease ambiguity caused by using signs in different contexts for different purposes. The ostensive information architecture adopts a consensus-based approach initiated from the perspective of information systems re-design and can be applied to other domains where information exchange is required between heterogeneous systems. Driven by the issues in FHIR (Fast Health Interoperability Resources) implementation, an ostensive approach that supplements the current lexical approach in semantic exchange is proposed. A Semantic Engine with an FHIR knowledge graph as the core is constructed using Neo4j to provide semantic interpretation and examples. The MIMIC III (Medical Information Mart for Intensive Care) datasets and diabetes datasets have been employed to demonstrate the effectiveness of the proposed information architecture. We further discuss the benefits of the separation of semantic interpretation and data storage from the perspective of information system design, and the semantic reasoning towards patient-centric care underpinned by the Semantic Engine.
Collapse
Affiliation(s)
- Hua Guo
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK
- Insight Centre for Data Analytics, School of Computing, Dublin City University, Dublin, Ireland
- Informatics Research Centre, University of Reading, Reading, UK
| | - Michael Scriney
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK
- Insight Centre for Data Analytics, School of Computing, Dublin City University, Dublin, Ireland
- Informatics Research Centre, University of Reading, Reading, UK
| | - Kecheng Liu
- School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK
- Insight Centre for Data Analytics, School of Computing, Dublin City University, Dublin, Ireland
- Informatics Research Centre, University of Reading, Reading, UK
| |
Collapse
|
8
|
Ghorbani A, Davoodi F, Zamanifar K. Using type-2 fuzzy ontology to improve semantic interoperability for healthcare and diagnosis of depression. Artif Intell Med 2023; 135:102452. [PMID: 36628789 DOI: 10.1016/j.artmed.2022.102452] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Revised: 10/08/2022] [Accepted: 11/11/2022] [Indexed: 11/19/2022]
Abstract
Ontology enhances semantic interoperability through integrating health data from heterogeneous sources and sharing information in a meaningful way. In the field of smart health services, semantic interoperability means the exchange and interpretation of data without ambiguity and uncertainty. However, existing classical ontologies are not able to represent vague and uncertain knowledge, especially in contexts of mental health disorders which are associated with varying degrees of uncertainty and inaccuracy of diagnosis, and in this case, the treatment is a complex and common mental process necessitating to share information accurately and unambiguously. Type-2 fuzzy set theory can offer a fruitful solution in order to control uncertainty or express ambiguous concepts in a dynamic and complex environment such as healthcare systems. Herein, a semantic framework for healthcare, and also monitoring mental health disorders using type-2 fuzzy set theory based on the Internet of Thing (IoT) is suggested, in which all depression-related concepts are semantically annotated to share detailed information with the treatment staff. This framework not only paved the way to increasing the accuracy of medical diagnosis and decision-making but also provides the possibility of inference and semantic reasoning using the languages of SPARQL query and DL query.
Collapse
|
9
|
van Damme P, Fernández-Breis JT, Benis N, Miñarro-Gimenez JA, de Keizer NF, Cornet R. Performance assessment of ontology matching systems for FAIR data. J Biomed Semantics 2022; 13:19. [PMID: 35841031 PMCID: PMC9284868 DOI: 10.1186/s13326-022-00273-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 06/15/2022] [Indexed: 11/24/2022] Open
Abstract
Background Ontology matching should contribute to the interoperability aspect of FAIR data (Findable, Accessible, Interoperable, and Reusable). Multiple data sources can use different ontologies for annotating their data and, thus, creating the need for dynamic ontology matching services. In this experimental study, we assessed the performance of ontology matching systems in the context of a real-life application from the rare disease domain. Additionally, we present a method for analyzing top-level classes to improve precision. Results We included three ontologies (NCIt, SNOMED CT, ORDO) and three matching systems (AgreementMakerLight 2.0, FCA-Map, LogMap 2.0). We evaluated the performance of the matching systems against reference alignments from BioPortal and the Unified Medical Language System Metathesaurus (UMLS). Then, we analyzed the top-level ancestors of matched classes, to detect incorrect mappings without consulting a reference alignment. To detect such incorrect mappings, we manually matched semantically equivalent top-level classes of ontology pairs. AgreementMakerLight 2.0, FCA-Map, and LogMap 2.0 had F1-scores of 0.55, 0.46, 0.55 for BioPortal and 0.66, 0.53, 0.58 for the UMLS respectively. Using vote-based consensus alignments increased performance across the board. Evaluation with manually created top-level hierarchy mappings revealed that on average 90% of the mappings’ classes belonged to top-level classes that matched. Conclusions Our findings show that the included ontology matching systems automatically produced mappings that were modestly accurate according to our evaluation. The hierarchical analysis of mappings seems promising when no reference alignments are available. All in all, the systems show potential to be implemented as part of an ontology matching service for querying FAIR data. Future research should focus on developing methods for the evaluation of mappings used in such mapping services, leading to their implementation in a FAIR data ecosystem. Supplementary Information The online version contains supplementary material available at (10.1186/s13326-022-00273-5).
Collapse
Affiliation(s)
- Philip van Damme
- Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Meibergdreef 9, Amsterdam, The Netherlands. .,Amsterdam Public Health, Digital Health & Methodology, Amsterdam, The Netherlands.
| | | | - Nirupama Benis
- Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Meibergdreef 9, Amsterdam, The Netherlands.,Amsterdam Public Health, Digital Health & Methodology, Amsterdam, The Netherlands
| | | | - Nicolette F de Keizer
- Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Meibergdreef 9, Amsterdam, The Netherlands.,Amsterdam Public Health, Methodology & Quality of Care, Amsterdam, The Netherlands
| | - Ronald Cornet
- Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Meibergdreef 9, Amsterdam, The Netherlands.,Amsterdam Public Health, Digital Health & Methodology, Amsterdam, The Netherlands
| |
Collapse
|
10
|
Jung H, Park HA, Kang H, Lee M, Sung S, Park S. Mapping Korean National Health Insurance Pharmaceutical Claim Codes to SNOMED CT. Stud Health Technol Inform 2022; 294:297-301. [PMID: 35612080 DOI: 10.3233/shti220462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The objective of this study was to map pharmaceutical claim codes to SNOMED CT and thereby facilitate multicenter collaborative research and improve semantic interoperability. The claim codes were mapped to SNOMED CT using rule-based automated and manual methods. The maps were internally validated by terminologists and a pharmacist. Finally, 80% of all claim codes were mapped to the concepts of Pharmaceutical/biologic product hierarchy in SNOMED CT. Of them, 50.6% of the codes were exactly mapped to one clinical drug branch concept.
Collapse
Affiliation(s)
- Hyesil Jung
- Office of eHealth Research and Business, Seoul National University Bundang Hospital, South Korea
| | - Hyeoun-Ae Park
- College of Nursing, Seoul National University, South Korea
| | - Hannah Kang
- College of Nursing, Seoul National University, South Korea
| | - MinSun Lee
- College of Nursing, Seoul National University, South Korea
| | - Sumi Sung
- College of Nursing, Seoul National University, South Korea
| | - SeulKi Park
- Department of nursing, Daejeon University, South Korea
| |
Collapse
|
11
|
Van de Vyvere B, Colpaert P. Using ANPR data to create an anonymized linked open dataset on urban bustle. Eur Transp Res Rev 2022; 14:17. [PMID: 38625190 PMCID: PMC9035206 DOI: 10.1186/s12544-022-00538-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Accepted: 04/11/2022] [Indexed: 04/17/2024]
Abstract
ANPR cameras allow the automatic detection of vehicle license plates and are increasingly used for law enforcement. However, also statistical data generated by ANPR cameras are a potential source of urban insights. In order for this data to reach its full potential for policy-making, we research how this data can be shared in digital twins, with researchers, for a diverse set of machine learning models, and even Open Data portals. This article's key objective is to find a way to anonymize and aggregate ANPR data in a way that it still can provide useful visualizations for local decision making. We introduce an approach to aggregate the data with geotemporal binning and publish it by combining nine existing data specifications. We implemented the approach for the city of Kortrijk (Belgium) with 43 ANPR cameras, developed the ANPR Metrics tool to generate the statistical data and dashboards on top of the data, and tested whether mobility experts from the city could deduct valuable insights. We present a couple of insights that were found as a result, as a proof that anonymized ANPR data complements their currently used traffic analysis tools, providing a valuable source for data-driven policy-making.
Collapse
Affiliation(s)
- Brecht Van de Vyvere
- Ghent University, Technologiepark-Zwijnaarde 122, Ghent, 9052 Belgium
- Department of Electronics and Information Systems, IDLab Ghent University - imec, Ghent, Belgium
| | - Pieter Colpaert
- Ghent University, Technologiepark-Zwijnaarde 122, Ghent, 9052 Belgium
- Department of Electronics and Information Systems, IDLab Ghent University - imec, Ghent, Belgium
| |
Collapse
|
12
|
González-García J, Estupiñán-Romero F, Tellería-Orriols C, González-Galindo J, Palmieri L, Fagaralli A, Pristās I, Vuković J, Misinš J, Zile I, Bernal-Delgado E; InfAct Joint Action consortium. Coping with interoperability in the development of a federated research infrastructure: achievements, challenges and recommendations from the JA-InfAct. Arch Public Health 2021; 79:221. [PMID: 34879872 DOI: 10.1186/s13690-021-00731-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 11/05/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Information for Action! is a Joint Action (JA-InfAct) on Health Information promoted by the EU Member States and funded by the European Commission within the Third EU Health Programme (2014-2020) to create and develop solid sustainable infrastructure on EU health information. The main objective of this the JA-InfAct is to build an EU health information system infrastructure and strengthen its core elements by a) establishing a sustainable research infrastructure to support population health and health system performance assessment, b) enhancing the European health information and knowledge bases, as well as health information research capacities to reduce health information inequalities, and c) supporting health information interoperability and innovative health information tools and data sources. METHODS Following a federated analysis approach, JA-InfAct developed an ad hoc federated infrastructure based on distributing a well-defined process-mining analysis methodology to be deployed at each participating partners' systems to reproduce the analysis and pool the aggregated results from the analyses. To overcome the legal interoperability issues on international data sharing, data linkage and management, partners (EU regions) participating in the case studies worked coordinately to query their real-world healthcare data sources complying with a common data model, executed the process-mining analysis pipeline on their premises, and shared the results enabling international comparison and the identification of best practices on stroke care. RESULTS The ad hoc federated infrastructure was designed and built upon open source technologies, providing partners with the capacity to exploit their data and generate dashboards exploring the stroke care pathways. These dashboards can be shared among the participating partners or to a coordination hub without legal issues, enabling the comparative evaluation of the caregiving activities for acute stroke across regions. Nonetheless, the approach is not free of a number of challenges that have been solved, and new challenges that should be addressed in the eventual case of scaling up. For that eventual case, 12 recommendations considering the different layers of interoperability have been provided. CONCLUSION The proposed approach, when successfully deployed as a federated analysis infrastructure, such as the one developed within the JA-InfAct, can concisely tackle all levels of the interoperability requirements from organisational to technical interoperability, supported by the close collaboration of the partners participating in the study. Any proposal for extension, should require further thinking on how to deal with new challenges on interoperability.
Collapse
|
13
|
Gazzarata R, Maggi N, Magnoni LD, Monteverde ME, Ruggiero C, Giacomini M. Semantics Management for a Regional Health Information System in Italy by CTS2 and FHIR. Stud Health Technol Inform 2021; 287:119-123. [PMID: 34795094 DOI: 10.3233/shti210828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
An infrastructure for the management of semantics is being developed to support the regional health information exchange in Veneto - an Italian region which has about 5 million inhabitants. Terminology plays a key role in the management of the information fluxes of the Veneto region, in which the management of electronic health record is given great attention. An architecture for the management of the semantics of laboratory reports has been set up, adopting standards by HL7. The system has been initially developed according to the common terminology service release 2 (CTS2) standard and, in order to overcome complexities of CTS2 is being revised according to the Fast Healthcare Interoperability Resources (FHIR) standard, which has been subsequently introduced. Aspects of CST2 and of FHIR have been considered in order to retain most suitable aspects of both. This integration can be regarded as most worthwhile.
Collapse
Affiliation(s)
| | - Norbert Maggi
- Department of Informatics, Bioengineering, Robotics and Information Systems, University of Genoa, Genoa, Italy.,Laboratory of Human Genetics, IRCCS Giannina Gaslini, Genova, Italy
| | | | | | - Carmelina Ruggiero
- Healthropy S.r.l., Savona, Italy.,Department of Informatics, Bioengineering, Robotics and Information Systems, University of Genoa, Genoa, Italy
| | - Mauro Giacomini
- Healthropy S.r.l., Savona, Italy.,Department of Informatics, Bioengineering, Robotics and Information Systems, University of Genoa, Genoa, Italy
| |
Collapse
|
14
|
Osman I, Pileggi SF, Ben Yahia S, Diallo G. An Alignment-Based Implementation of a Holistic Ontology Integration Method. MethodsX 2021; 8:101460. [PMID: 34434866 PMCID: PMC8374672 DOI: 10.1016/j.mex.2021.101460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Accepted: 07/17/2021] [Indexed: 11/30/2022] Open
Abstract
Despite the intense research activity in the last two decades, ontology integration still presents a number of challenging issues. As ontologies are continuously growing in number, complexity and size and are adopted within open distributed systems such as the Semantic Web, integration becomes a central problem and has to be addressed in a context of increasing scale and heterogeneity. In this paper, we describe a holistic alignment-based method for customized ontology integration. The holistic approach proposes additional challenges as multiple ontologies are jointly integrated at once, in contrast to most common approaches that perform an incremental pairwise ontology integration. By applying consolidated techniques for ontology matching, we investigate the impact on the resulting ontology. The proposed method takes multiple ontologies as well as pairwise alignments and returns a refactored/non-refactored integrated ontology that faithfully preserves the original knowledge of the input ontologies and alignments. We have tested the method on large biomedical ontologies from the LargeBio OAEI track. Results show effectiveness, and overall, a decreased integration cost over multiple ontologies.OIAR and AROM are two implementations of the proposed method. OIAR creates a bridge ontology, and AROM creates a fully merged ontology. The implementation includes the option of ontology refactoring.
Collapse
Affiliation(s)
- Inès Osman
- LIPAH - LR11ES14, Faculty of Sciences of Tunis, University of Tunis El Manar, Tunis 2092, Tunisia
| | | | - Sadok Ben Yahia
- LIPAH - LR11ES14, Faculty of Sciences of Tunis, University of Tunis El Manar, Tunis 2092, Tunisia.,Department of Software Science, Tallinn University of Technology, Estonia
| | - Gayo Diallo
- INRIA SISTM, Team ERIAS - INSERM Bordeaux Population Health Research Center, University of Bordeaux, F-33000 Bordeaux, France
| |
Collapse
|
15
|
Vorisek CN, Klopfenstein SAI, Sass J, Lehne M, Schmidt CO, Thun S. Evaluating Suitability of SNOMED CT in Structured Searches for COVID-19 Studies. Stud Health Technol Inform 2021; 281:88-92. [PMID: 34042711 DOI: 10.3233/SHTI210126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
Studies investigating the suitability of SNOMED CT in COVID-19 datasets are still scarce. The purpose of this study was to evaluate the suitability of SNOMED CT for structured searches of COVID-19 studies, using the German Corona Consensus Dataset (GECCO) as example. Suitability of the international standard SNOMED CT was measured with the scoring system ISO/TS 21564, and intercoder reliability of two independent mapping specialists was evaluated. The resulting analysis showed that the majority of data items had either a complete or partial equivalent in SNOMED CT (complete equivalent: 141 items; partial equivalent: 63 items; no equivalent: 1 item). Intercoder reliability was moderate, possibly due to non-establishment of mapping rules and high percentage (74%) of different but similar concepts among the 86 non-equal chosen concepts. The study shows that SNOMED CT can be utilized for COVID-19 cohort browsing. However, further studies investigating mapping rules and further international terminologies are necessary.
Collapse
|
16
|
Pedrera M, Garcia N, Blanco A, Terriza AI, Cruz J, Lopez EA, Calvo F, Jimenez MJ, Gonzalez P, Quiros V, Cruz JL, Bernal JL, Serrano P. Use of EHRs in a Tertiary Hospital During COVID-19 Pandemic: A Multi-Purpose Approach Based on Standards. Stud Health Technol Inform 2021; 281:28-32. [PMID: 34042699 DOI: 10.3233/shti210114] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
This work aims to describe how EHRs have been used to meet the needs of healthcare providers and researchers in a 1,300-beds tertiary Hospital during COVID-19 pandemic. For this purpose, essential clinical concepts were identified and standardized with LOINC and SNOMED CT. After that, these concepts were implemented in EHR systems and based on them, data tools, such as clinical alerts, dynamic patient lists and a clinical follow-up dashboard, were developed for healthcare support. In addition, these data were incorporated into standardized repositories and COVID-19 databases to improve clinical research on this new disease. In conclusion, standardized EHRs allowed implementation of useful multi- purpose data resources in a major Hospital in the course of the pandemic.
Collapse
Affiliation(s)
- Miguel Pedrera
- Hospital Universitario 12 de Octubre, Madrid, Spain.,ETSI Telecomunicación, Universidad Politécnica de Madrid, Madrid, Spain
| | | | - Alvar Blanco
- Hospital Universitario 12 de Octubre, Madrid, Spain
| | | | - Jaime Cruz
- Hospital Universitario 12 de Octubre, Madrid, Spain
| | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Haffer N, Thun S. Postcoordination of LOINC Codes in SNOMED CT. Stud Health Technol Inform 2021; 278:19-26. [PMID: 34042871 DOI: 10.3233/shti210045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The objectives of this paper are to analyze the terminologies SNOMED CT and Logical Observation Identifiers Names and Codes (LOINC) and to provide a guideline for the translation of LOINC concepts to SNOMED CT. Verified research data sets were used for this study, so this experiment is replicable with other research data. 50 LOINC concepts of frequently performed laboratory services were translated to SNOMED CT. Information would be lost with pre-coordinated mapping but the compositional grammar of SNOMED CT allows for the linking of individual concepts into complicated postcoordinated expressions including all embedded information in LOINC concepts. All information can thus be transferred smoothly to SNOMED CT.
Collapse
Affiliation(s)
- Nina Haffer
- Hochschule für Technik und Wirtschaft (HTW) - University of Applied Sciences, Berlin, Germany
- Berlin Institute of Health (BIH), Germany
| | - Sylvia Thun
- Berlin Institute of Health (BIH), Germany
- Charité - Universitätsmedizin Berlin, Germany
- Hochschule Niederrhein - University of Applied Sciences, Krefeld, Germany
| |
Collapse
|
18
|
Pecoraro F, Luzi D, Pourabbas E, Ricci FL, Rossi Mori A. Extending Contsys Standard with Social Care Concepts: A Methodology Proposed by the UNINFO Working Group in Italy. Stud Health Technol Inform 2020; 270:223-227. [PMID: 32570379 DOI: 10.3233/shti200155] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The increasing demand for territorial services requires the improvement of the coordination and cooperation among stakeholders in planning and delivery of integrated health and social services. In this scenario, to improve the communication among stakeholders there is a need of a formal conceptual model that facilitates the interoperability between organizations and professionals. This paper presents the methodology adopted by a UNINFO working group established in Italy to extend the ContSys standard with social care concepts to integrate health and social care contexts in a continuity of care perspective. An example of this extension is also provided considering the definition of patient's care plans.
Collapse
Affiliation(s)
- Fabrizio Pecoraro
- Institute for Research on Population and Social Policies, National Research Council, Italy
| | - Daniela Luzi
- Institute for Research on Population and Social Policies, National Research Council, Italy
| | - Elaheh Pourabbas
- Institute for System Analysis and Computer Science "Antonio Ruberti", National Research Council, Italy
| | - Fabrizio L Ricci
- Institute for Research on Population and Social Policies, National Research Council, Italy.,Institute for System Analysis and Computer Science "Antonio Ruberti", National Research Council, Italy
| | - Angelo Rossi Mori
- Institute for Research on Population and Social Policies, National Research Council, Italy
| |
Collapse
|
19
|
Kim HH, Park YR, Lee S, Kim JH. Composite CDE: modeling composite relationships between common data elements for representing complex clinical data. BMC Med Inform Decis Mak 2020; 20:147. [PMID: 32620117 PMCID: PMC7333279 DOI: 10.1186/s12911-020-01168-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Accepted: 06/25/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Semantic interoperability is essential for improving data quality and sharing. The ISO/IEC 11179 Metadata Registry (MDR) standard has been highlighted as a solution for standardizing and registering clinical data elements (DEs). However, the standard model has both structural and semantic limitations, and the number of DEs continues to increase due to poor term reusability. Semantic types and constraints are lacking for comprehensively describing and evaluating DEs on real-world clinical documents. METHODS We addressed these limitations by defining three new types of semantic relationship (dependency, composite, and variable) in our previous studies. The present study created new and further extended existing semantic types (hybrid atomic and repeated and dictionary composite common data elements [CDEs]) with four constraints: ordered, operated, required, and dependent. For evaluation, we extracted all atomic and composite CDEs from five major clinical documents from five teaching hospitals in Korea, 14 Fast Healthcare Interoperability Resources (FHIR) resources from FHIR bulk sample data, and MIMIC-III (Medical Information Mart for Intensive Care) demo dataset. Metadata reusability and semantic interoperability in real clinical settings were comprehensively evaluated by applying the CDEs with our extended semantic types and constraints. RESULTS All of the CDEs (n = 1142) extracted from the 25 clinical documents were successfully integrated with a very high CDE reuse ratio (46.9%) into 586 CDEs (259 atomic and 20 unique composite CDEs), and all of CDEs (n = 238) extracted from the 14 FHIR resources of FHIR bulk sample data were successfully integrated with high CDE reuse ration (59.7%) into 96 CDEs (21 atomic and 28 unique composite CDEs), which improved the semantic integrity and interoperability without any semantic loss. Moreover, the most complex data structures from two CDE projects were successfully encoded with rich semantics and semantic integrity. CONCLUSION MDR-based extended semantic types and constraints can facilitate comprehensive representation of clinical documents with rich semantics, and improved semantic interoperability without semantic loss.
Collapse
Affiliation(s)
- Hye Hyeon Kim
- Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Yu Rang Park
- Department of Biomedical Systems Informatics, Yonsei University College of Medicine, Seoul, 03722, Republic of Korea
| | - Suehyun Lee
- Department of Biomedical Informatics, College of Medicine, Konyang University, Daejeon, 35365, Republic of Korea
| | - Ju Han Kim
- Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea.
| |
Collapse
|
20
|
González L, Pérez-Rey D, Alonso E, Hernández G, Serrano P, Pedrera M, Gómez A, De Schepper K, Crepain T, Claerhout B. Building an I2B2-Based Population Repository for Clinical Research. Stud Health Technol Inform 2020; 270:78-82. [PMID: 32570350 DOI: 10.3233/shti200126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The present work provides a real-world case of the connection process of a hospital, 12 de Octubre University Hospital in Spain, to the TriNetX research network, transforming a compilation of disparate sources into a single harmonized repository which is automatically refreshed every day. It describes the different integration phases: terminology core datasets, specialized sources and eventually automatic refreshment. It also explains the work performed on semantic normalization of the involved clinical terminologies; as well as the resulting benefits the InSite platform services have enabled in the form of research opportunities for the hospital.
Collapse
Affiliation(s)
- Lydia González
- Biomedical Informatics Group, Artificial Intelligence Department, E.T.S.I. Informáticos, Universidad Politécnica de Madrid, Spain
| | - David Pérez-Rey
- Biomedical Informatics Group, Artificial Intelligence Department, E.T.S.I. Informáticos, Universidad Politécnica de Madrid, Spain
| | - Enrique Alonso
- Biomedical Informatics Group, Artificial Intelligence Department, E.T.S.I. Informáticos, Universidad Politécnica de Madrid, Spain
| | - Gema Hernández
- Biomedical Informatics Group, Artificial Intelligence Department, E.T.S.I. Informáticos, Universidad Politécnica de Madrid, Spain
| | | | | | | | | | | | | |
Collapse
|
21
|
El-Sappagh S, Ali F, Hendawi A, Jang JH, Kwak KS. A mobile health monitoring-and-treatment system based on integration of the SSN sensor ontology and the HL7 FHIR standard. BMC Med Inform Decis Mak 2019; 19:97. [PMID: 31077222 PMCID: PMC6511155 DOI: 10.1186/s12911-019-0806-z] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Accepted: 03/31/2019] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Mobile health (MH) technologies including clinical decision support systems (CDSS) provide an efficient method for patient monitoring and treatment. A mobile CDSS is based on real-time sensor data and historical electronic health record (EHR) data. Raw sensor data have no semantics of their own; therefore, a computer system cannot interpret these data automatically. In addition, the interoperability of sensor data and EHR medical data is a challenge. EHR data collected from distributed systems have different structures, semantics, and coding mechanisms. As a result, building a transparent CDSS that can work as a portable plug-and-play component in any existing EHR ecosystem requires a careful design process. Ontology and medical standards support the construction of semantically intelligent CDSSs. METHODS This paper proposes a comprehensive MH framework with an integrated CDSS capability. This cloud-based system monitors and manages type 1 diabetes mellitus. The efficiency of any CDSS depends mainly on the quality of its knowledge and its semantic interoperability with different data sources. To this end, this paper concentrates on constructing a semantic CDSS based on proposed FASTO ontology. RESULTS This realistic ontology is able to collect, formalize, integrate, analyze, and manipulate all types of patient data. It provides patients with complete, personalized, and medically intuitive care plans, including insulin regimens, diets, exercises, and education sub-plans. These plans are based on the complete patient profile. In addition, the proposed CDSS provides real-time patient monitoring based on vital signs collected from patients' wireless body area networks. These monitoring include real-time insulin adjustments, mealtime carbohydrate calculations, and exercise recommendations. FASTO integrates the well-known standards of HL7 fast healthcare interoperability resources (FHIR), semantic sensor network (SSN) ontology, basic formal ontology (BFO) 2.0, and clinical practice guidelines. The current version of FASTO includes 9577 classes, 658 object properties, 164 data properties, 460 individuals, and 140 SWRL rules. FASTO is publicly available through the National Center for Biomedical Ontology BioPortal at https://bioportal.bioontology.org/ontologies/FASTO . CONCLUSIONS The resulting CDSS system can help physicians to monitor more patients efficiently and accurately. In addition, patients in rural areas can depend on the system to manage their diabetes and emergencies.
Collapse
Affiliation(s)
- Shaker El-Sappagh
- Department of Information and Communication Engineering, Inha University, Incheon, South Korea
- Information Systems Department, Faculty of Computer and Informatics, Benha University, Banha, Egypt
| | - Farman Ali
- Department of Information and Communication Engineering, Inha University, Incheon, South Korea
| | - Abdeltawab Hendawi
- Computer Science, University of Virginia, Charlottesville, USA
- Faculty of Computers and Information, Cairo University, Giza, Egypt
| | - Jun-Hyeog Jang
- Department of Biochemistry, School of Medicine, Inha University, Incheon, 400-712, South Korea
| | - Kyung-Sup Kwak
- Department of Information and Communication Engineering, Inha University, Incheon, South Korea.
| |
Collapse
|
22
|
Kalogiannis S, Deltouzos K, Zacharaki EI, Vasilakis A, Moustakas K, Ellul J, Megalooikonomou V. Integrating an openEHR-based personalized virtual model for the ageing population within HBase. BMC Med Inform Decis Mak 2019; 19:25. [PMID: 30691467 PMCID: PMC6350370 DOI: 10.1186/s12911-019-0745-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Accepted: 01/14/2019] [Indexed: 11/17/2022] Open
Abstract
Background Frailty is a common clinical syndrome in ageing population that carries an increased risk for adverse health outcomes including falls, hospitalization, disability, and mortality. As these outcomes affect the health and social care planning, during the last years there is a tendency of investing in monitoring and preventing strategies. Although a number of electronic health record (EHR) systems have been developed, including personalized virtual patient models, there are limited ageing population oriented systems. Methods We exploit the openEHR framework for the representation of frailty in ageing population in order to attain semantic interoperability, and we present the methodology for adoption or development of archetypes. We also propose a framework for a one-to-one mapping between openEHR archetypes and a column-family NoSQL database (HBase) aiming at the integration of existing and newly developed archetypes into it. Results The requirement analysis of our study resulted in the definition of 22 coherent and clinically meaningful parameters for the description of frailty in older adults. The implemented openEHR methodology led to the direct use of 22 archetypes, the modification and reuse of two archetypes, and the development of 28 new archetypes. Additionally, the mapping procedure led to two different HBase tables for the storage of the data. Conclusions In this work, an openEHR-based virtual patient model has been designed and integrated into an HBase storage system, exploiting the advantages of the underlying technologies. This framework can serve as a base for the development of a decision support system using the openEHR’s Guideline Definition Language in the future.
Collapse
Affiliation(s)
- Spyridon Kalogiannis
- Computer Engineering and Informatics Department, University of Patras, University Campus, Rio, 26504, Greece
| | - Konstantinos Deltouzos
- Computer Engineering and Informatics Department, University of Patras, University Campus, Rio, 26504, Greece.
| | - Evangelia I Zacharaki
- Computer Engineering and Informatics Department, University of Patras, University Campus, Rio, 26504, Greece
| | - Andreas Vasilakis
- Information Technologies Institute, Centre for Research and Technology Hellas, 6th km Charilaou-Thermi Rd, Thessaloniki, 57001, Greece
| | - Konstantinos Moustakas
- Information Technologies Institute, Centre for Research and Technology Hellas, 6th km Charilaou-Thermi Rd, Thessaloniki, 57001, Greece
| | - John Ellul
- Department of Neurology, School of Medicine, University of Patras, University Campus, Rio, 26504, Greece
| | - Vasileios Megalooikonomou
- Computer Engineering and Informatics Department, University of Patras, University Campus, Rio, 26504, Greece
| |
Collapse
|
23
|
Kopanitsa G. Microservice Architecture to Provide Medical Data Management for Decision Support. Stud Health Technol Inform 2019; 261:230-235. [PMID: 31156121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Efficient Interaction between Hospital information (HIS) and laboratory information systems (LIS) provide a smooth laboratory testing process and data consistency. The current software ecosystem can be characterized by its rapid changes that can lead to breaks in HIS-LIS interaction and problems with semantic interoperability of the systems. To avoid such problems developers can clusterize software applications into small, easily supportable functional units that can be changed on demand without effecting other pieces of software. This approach commonly referred to as microservice architecture. The goal the research is to develop a FHIR based microservice platform that connects HIS, LIS and a Clinical decision support system (CDSS) into unified information space. A microservice platform has been implemented and now is in the production operation processing around 15000 orders a day.
Collapse
|
24
|
Abstract
Background To realize semantic interoperability for Primary Health Information System (PHIS), this study analyzes and applies existing health information data standards in China. This research aims to establish a Primary Health Information Standard System (PHISS), and achieve the semantic level interoperability and application of primary health information. Methods First, the PHISS in accordance with the structural standards of national information standards system in China was constructed. Second, application of semantic interoperability level with reference to the interoperability model was standardized. Thirdly, referring to the data element model, PHIS data element dictionary with good interoperability is developed by standardizing data element attributes of identifiers, names, definitions and permissible value. Fourthly, based on PHIS data element dictionary, PHIS dataset is developed following the relevant rules for health information datasets. Results PHISS is composed of basic class standards, data class standards, technical class standards, security and privacy class standards, management class standards. In this study, we reorganized the data class standards that meet the requirements of PHIS, also develops and adds PHIS data element, PHIS data element dictionary and PHIS dataset. PHIS data element dictionary includes 16 parts and PHIS dataset includes 22 parts, which satisfies the data standardization requirements of PHIS. Conclusions The establishment of the PHISS can meet the needs for the interconnection of the residents’ basic health service information and realize the semantic level interoperability of various information services. The key steps of this method are based on semantic interoperability. Relevant data elements and datasets with semantic interoperability are selected. Moreover, an information standard system is constructed, and the information standardization requirements of the PHIS are met.
Collapse
Affiliation(s)
- Xia Zhao
- Guangdong Provincial Key Laboratory of Medical Image Processing, School of Biomedical Engineering, Sothern Medical University, Guangzhou, China
| | - Xiaohua Li
- General Hospital of Guangzhou Military Command of PLA, Guangzhou, China
| | - Wei Yang
- Guangdong Provincial Key Laboratory of Medical Image Processing, School of Biomedical Engineering, Sothern Medical University, Guangzhou, China.
| | - Qianjin Feng
- Guangdong Provincial Key Laboratory of Medical Image Processing, School of Biomedical Engineering, Sothern Medical University, Guangzhou, China
| | - Yi Zhou
- Sun Yat-sen University, Guangzhou, China
| | - Qiong Wang
- The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| |
Collapse
|
25
|
Jaulent MC, Leprovost D, Charlet J, Choquet R. Semantic interoperability challenges to process large amount of data perspectives in forensic and legal medicine. J Forensic Leg Med 2018; 57:19-23. [PMID: 29801946 DOI: 10.1016/j.jflm.2016.10.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2016] [Revised: 10/06/2016] [Accepted: 10/07/2016] [Indexed: 10/20/2022]
Abstract
This article is a position paper dealing with semantic interoperability challenges. It addresses the Variety and Veracity dimensions when integrating, sharing and reusing large amount of heterogeneous data for data analysis and decision making applications in the healthcare domain. Many issues are raised by the necessity to conform Big Data to interoperability standards. We discuss how semantics can contribute to the improvement of information sharing and address the problem of data mediation with domain ontologies. We then introduce the main steps for building domain ontologies as they could be implemented in the context of Forensic and Legal medicine. We conclude with a particular emphasis on the current limitations in standardisation and the importance of knowledge formalization.
Collapse
Affiliation(s)
- Marie-Christine Jaulent
- INSERM, U1142, LIMICS, F-75006, Paris, France; Sorbonne Universités, UPMC Université Paris 06, UMR_S 1142, LIMICS, F-75006, Paris, France; Université Paris 13, Sorbonne Paris Cité, UMR_S 1142, LIMICS, F-93430, Villetaneuse, France.
| | - Damien Leprovost
- INSERM, U1142, LIMICS, F-75006, Paris, France; Sorbonne Universités, UPMC Université Paris 06, UMR_S 1142, LIMICS, F-75006, Paris, France; Université Paris 13, Sorbonne Paris Cité, UMR_S 1142, LIMICS, F-93430, Villetaneuse, France
| | - Jean Charlet
- INSERM, U1142, LIMICS, F-75006, Paris, France; Sorbonne Universités, UPMC Université Paris 06, UMR_S 1142, LIMICS, F-75006, Paris, France; Université Paris 13, Sorbonne Paris Cité, UMR_S 1142, LIMICS, F-93430, Villetaneuse, France; AP-HP, Assistance Publique Hôpitaux de Paris, Paris, France
| | - Remy Choquet
- INSERM, U1142, LIMICS, F-75006, Paris, France; Sorbonne Universités, UPMC Université Paris 06, UMR_S 1142, LIMICS, F-75006, Paris, France; Université Paris 13, Sorbonne Paris Cité, UMR_S 1142, LIMICS, F-93430, Villetaneuse, France; BNDMR, Assistance Publique Hôpitaux de Paris, Hôpital Necker Enfants Malades, Paris, France
| |
Collapse
|
26
|
Singh G, Kuzniar A, van Mulligen EM, Gavai A, Bachem CW, Visser RGF, Finkers R. QTLTableMiner ++: semantic mining of QTL tables in scientific articles. BMC Bioinformatics 2018; 19:183. [PMID: 29801439 PMCID: PMC5970438 DOI: 10.1186/s12859-018-2165-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Accepted: 04/25/2018] [Indexed: 11/11/2022] Open
Abstract
Background A quantitative trait locus (QTL) is a genomic region that correlates with a phenotype. Most of the experimental information about QTL mapping studies is described in tables of scientific publications. Traditional text mining techniques aim to extract information from unstructured text rather than from tables. We present QTLTableMiner++ (QTM), a table mining tool that extracts and semantically annotates QTL information buried in (heterogeneous) tables of plant science literature. QTM is a command line tool written in the Java programming language. This tool takes scientific articles from the Europe PMC repository as input, extracts QTL tables using keyword matching and ontology-based concept identification. The tables are further normalized using rules derived from table properties such as captions, column headers and table footers. Furthermore, table columns are classified into three categories namely column descriptors, properties and values based on column headers and data types of cell entries. Abbreviations found in the tables are expanded using the Schwartz and Hearst algorithm. Finally, the content of QTL tables is semantically enriched with domain-specific ontologies (e.g. Crop Ontology, Plant Ontology and Trait Ontology) using the Apache Solr search platform and the results are stored in a relational database and a text file. Results The performance of the QTM tool was assessed by precision and recall based on the information retrieved from two manually annotated corpora of open access articles, i.e. QTL mapping studies in tomato (Solanum lycopersicum) and in potato (S. tuberosum). In summary, QTM detected QTL statements in tomato with 74.53% precision and 92.56% recall and in potato with 82.82% precision and 98.94% recall. Conclusion QTM is a unique tool that aids in providing QTL information in machine-readable and semantically interoperable formats. Electronic supplementary material The online version of this article (10.1186/s12859-018-2165-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Gurnoor Singh
- Plant Breeding, Wageningen University and Research, Wageningen, The Netherlands
| | - Arnold Kuzniar
- Netherlands eScience Center (NLeSC), Amsterdam, The Netherlands
| | - Erik M van Mulligen
- Department of Medical Informatics, Erasmus Medical Center, Rotterdam, The Netherlands
| | - Anand Gavai
- Netherlands eScience Center (NLeSC), Amsterdam, The Netherlands
| | - Christian W Bachem
- Plant Breeding, Wageningen University and Research, Wageningen, The Netherlands
| | - Richard G F Visser
- Plant Breeding, Wageningen University and Research, Wageningen, The Netherlands
| | - Richard Finkers
- Plant Breeding, Wageningen University and Research, Wageningen, The Netherlands.
| |
Collapse
|
27
|
Abstract
Background Clinical data registry is designed to collect and manage information about the practices and outcomes of a patient population for improving the quality and safety of care and facilitating novel researches. Semantic interoperability is a challenge when integrating the data from more than one clinical data registry. The openEHR approach can represent the information and knowledge semantics by multi-level modeling, and it advocates the use of collaborative modeling to facilitate reusing existing archetypes with consistent semantics so as to be a potential solution to improve the semantic interoperability. Methods This paper proposed an openEHR based approach to improve the semantic interoperability of clinical data registry. The approach consists of five steps: clinical data registry meta-information collection, data element definition, archetype modeling, template editing, and implementation. Through collaborative modeling and maximum reusing of existing archetype at the archetype modeling step, the approach can improve semantic interoperability. To verify the feasibility of the approach, this paper conducted a case study of building a Coronary Computed Tomography Angiography (CCTA) registry that can interoperate with an existing Electronic Health Record (EHR) system. Results The CCTA registry includes 183 data elements, which involves 20 archetypes. A total number of 45 CCTA data elements and EHR data elements have semantic overlap. Among them, 38 (84%) CCTA data elements can be found in the 10 reused EHR archetypes. These corresponding clinical data can be collected from the EHR system directly without transformation. The other 7 (16%) CCTA data elements correspond to one coarse-grained EHR data elements, and these clinical data can be collected with mapping rules. The results show that the approach can improve semantic interoperability of clinical data registry. Conclusions Using an openEHR based approach to develop clinical data registry can improve the semantic interoperability. Meanwhile, some challenges for broader semantic interoperability are identified, including domain experts’ involvement, archetype sharing and reusing, and archetype semantic mapping. Collaborative modeling, easy-to-use tools, and semantic relationship establishment are potential solutions for these challenges. This study provides some experience and insight about clinical modeling and clinical data registry development.
Collapse
Affiliation(s)
- Lingtong Min
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China
| | - Qi Tian
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China
| | - Xudong Lu
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China.
| | - Jiye An
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China
| | - Huilong Duan
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China
| |
Collapse
|
28
|
Rodrigues JM, Schulz S, Mizen B, Trombert B, Rector A. Scrutinizing SNOMED CT's Ability to Reconcile Clinical Language Ambiguities with an Ontology Representation. Stud Health Technol Inform 2018; 247:910-914. [PMID: 29678093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
An important SNOMED CT use case is to support semantic interoperability between electronic health records and aggregation terminologies such as ICD. From the ongoing alignment exercise between SNOMED CT and the new version of ICD, now in its pre-final form, we studied whether the ambiguity of clinical language as displayed by SNOMED CT synonyms hampers the quality of SNOMED CT axioms following the SNOMED CT "concept model". We measure the rate of synonyms in the semantic misalignment between classes from the chapter on circulatory diseases of the ICD-11 beta version and SNOMED CT concepts with the same description names. Our study confirms that SNOMED CT synonyms are ambiguous and that there is a need to increase the number of SNOMED CT fully defined representations of Fully Specified Names (FSN), and of synonyms independently of their relations.
Collapse
Affiliation(s)
| | - Stefan Schulz
- Institute of Medical Informatics Statistics Medical University of Graz, Austria
| | | | | | | |
Collapse
|
29
|
Kopanitsa G, Ivanov A. Implementation of Fast Healthcare Interoperability Resources for an Integration of Laboratory and Hospital Information Systems. Stud Health Technol Inform 2018; 247:11-15. [PMID: 29677913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Electronic exchange of medical data between clinics and test centers makes the testing process more efficient, enables continuity of care record and reuse of medical data. The presented project employs HL 7 FHIR approach to model clinical concepts for the medical data exchange between a test center and different hospitals. Using a standard FHIR editor we have modeled 1226 observation profiles, 2396 commercial tests profiles that are mapped to 3249 production tests profiles. We have also defined a concept of an order and developed RESTfull API protocol to facilitate the ordering process. Now the data exchange system is in production and processes more than 20 000 test orders with more than 40 000 tests a day.
Collapse
|
30
|
Martínez-Costa C, Schulz S. Validating EHR clinical models using ontology patterns. J Biomed Inform 2017; 76:124-137. [PMID: 29113934 DOI: 10.1016/j.jbi.2017.11.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Revised: 11/02/2017] [Accepted: 11/03/2017] [Indexed: 11/28/2022]
Abstract
Clinical models are artefacts that specify how information is structured in electronic health records (EHRs). However, the makeup of clinical models is not guided by any formal constraint beyond a semantically vague information model. We address this gap by advocating ontology design patterns as a mechanism that makes the semantics of clinical models explicit. This paper demonstrates how ontology design patterns can validate existing clinical models using SHACL. Based on the Clinical Information Modelling Initiative (CIMI), we show how ontology patterns detect both modeling and terminology binding errors in CIMI models. SHACL, a W3C constraint language for the validation of RDF graphs, builds on the concept of "Shape", a description of data in terms of expected cardinalities, datatypes and other restrictions. SHACL, as opposed to OWL, subscribes to the Closed World Assumption (CWA) and is therefore more suitable for the validation of clinical models. We have demonstrated the feasibility of the approach by manually describing the correspondences between six CIMI clinical models represented in RDF and two SHACL ontology design patterns. Using a Java-based SHACL implementation, we found at least eleven modeling and binding errors within these CIMI models. This demonstrates the usefulness of ontology design patterns not only as a modeling tool but also as a tool for validation.
Collapse
Affiliation(s)
- Catalina Martínez-Costa
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Auenbruggerplatz 2, 8036 Graz, Austria.
| | - Stefan Schulz
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Auenbruggerplatz 2, 8036 Graz, Austria.
| |
Collapse
|
31
|
Boussadi A, Zapletal E. A Fast Healthcare Interoperability Resources (FHIR) layer implemented over i2b2. BMC Med Inform Decis Mak 2017; 17:120. [PMID: 28806953 PMCID: PMC5557515 DOI: 10.1186/s12911-017-0513-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Accepted: 07/31/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Standards and technical specifications have been developed to define how the information contained in Electronic Health Records (EHRs) should be structured, semantically described, and communicated. Current trends rely on differentiating the representation of data instances from the definition of clinical information models. The dual model approach, which combines a reference model (RM) and a clinical information model (CIM), sets in practice this software design pattern. The most recent initiative, proposed by HL7, is called Fast Health Interoperability Resources (FHIR). The aim of our study was to investigate the feasibility of applying the FHIR standard to modeling and exposing EHR data of the Georges Pompidou European Hospital (HEGP) integrating biology and the bedside (i2b2) clinical data warehouse (CDW). RESULTS We implemented a FHIR server over i2b2 to expose EHR data in relation with five FHIR resources: DiagnosisReport, MedicationOrder, Patient, Encounter, and Medication. The architecture of the server combines a Data Access Object design pattern and FHIR resource providers, implemented using the Java HAPI FHIR API. Two types of queries were tested: query type #1 requests the server to display DiagnosticReport resources, for which the diagnosis code is equal to a given ICD-10 code. A total of 80 DiagnosticReport resources, corresponding to 36 patients, were displayed. Query type #2, requests the server to display MedicationOrder, for which the FHIR Medication identification code is equal to a given code expressed in a French coding system. A total of 503 MedicationOrder resources, corresponding to 290 patients, were displayed. Results were validated by manually comparing the results of each request to the results displayed by an ad-hoc SQL query. CONCLUSION We showed the feasibility of implementing a Java layer over the i2b2 database model to expose data of the CDW as a set of FHIR resources. An important part of this work was the structural and semantic mapping between the i2b2 model and the FHIR RM. To accomplish this, developers must manually browse the specifications of the FHIR standard. Our source code is freely available and can be adapted for use in other i2b2 sites.
Collapse
Affiliation(s)
- Abdelali Boussadi
- INSERM UMR 1138, Equipe 22, Centre de Recherche des Cordeliers, Universités Paris 5 et 6, Paris, France. .,Département de Santé Publique et Informatique Médicale, Hôpital Européen Georges Pompidou, Assistance Publique - Hôpitaux de Paris, Paris, France.
| | - Eric Zapletal
- Département de Santé Publique et Informatique Médicale, Hôpital Européen Georges Pompidou, Assistance Publique - Hôpitaux de Paris, Paris, France
| |
Collapse
|
32
|
Alonso-Calvo R, Paraiso-Medina S, Perez-Rey D, Alonso-Oset E, van Stiphout R, Yu S, Taylor M, Buffa F, Fernandez-Lozano C, Pazos A, Maojo V. A semantic interoperability approach to support integration of gene expression and clinical data in breast cancer. Comput Biol Med 2017; 87:179-186. [PMID: 28601027 DOI: 10.1016/j.compbiomed.2017.06.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2017] [Revised: 05/30/2017] [Accepted: 06/02/2017] [Indexed: 11/19/2022]
Abstract
INTRODUCTION The introduction of omics data and advances in technologies involved in clinical treatment has led to a broad range of approaches to represent clinical information. Within this context, patient stratification across health institutions due to omic profiling presents a complex scenario to carry out multi-center clinical trials. METHODS This paper presents a standards-based approach to ensure semantic integration required to facilitate the analysis of clinico-genomic clinical trials. To ensure interoperability across different institutions, we have developed a Semantic Interoperability Layer (SIL) to facilitate homogeneous access to clinical and genetic information, based on different well-established biomedical standards and following International Health (IHE) recommendations. RESULTS The SIL has shown suitability for integrating biomedical knowledge and technologies to match the latest clinical advances in healthcare and the use of genomic information. This genomic data integration in the SIL has been tested with a diagnostic classifier tool that takes advantage of harmonized multi-center clinico-genomic data for training statistical predictive models. CONCLUSIONS The SIL has been adopted in national and international research initiatives, such as the EURECA-EU research project and the CIMED collaborative Spanish project, where the proposed solution has been applied and evaluated by clinical experts focused on clinico-genomic studies.
Collapse
Affiliation(s)
- Raul Alonso-Calvo
- Biomedical Informatics Group, DIA & DLSIIS, ETSI Informáticos, Universidad Politécnica de Madrid, Spain.
| | - Sergio Paraiso-Medina
- Biomedical Informatics Group, DIA & DLSIIS, ETSI Informáticos, Universidad Politécnica de Madrid, Spain.
| | - David Perez-Rey
- Biomedical Informatics Group, DIA & DLSIIS, ETSI Informáticos, Universidad Politécnica de Madrid, Spain.
| | - Enrique Alonso-Oset
- Biomedical Informatics Group, DIA & DLSIIS, ETSI Informáticos, Universidad Politécnica de Madrid, Spain.
| | - Ruud van Stiphout
- Department of Oncology, Old Road Campus Research Building, Oxford, OX3 7DQ, United Kingdom.
| | - Sheng Yu
- Department of Oncology, Old Road Campus Research Building, Oxford, OX3 7DQ, United Kingdom.
| | - Marian Taylor
- Department of Oncology, Old Road Campus Research Building, Oxford, OX3 7DQ, United Kingdom.
| | - Francesca Buffa
- Department of Oncology, Old Road Campus Research Building, Oxford, OX3 7DQ, United Kingdom.
| | - Carlos Fernandez-Lozano
- Department of Information and Communication Technologies, Faculty of Computer Science, University of A Coruna, 15071, A Coruña, Spain.
| | - Alejandro Pazos
- Department of Information and Communication Technologies, Faculty of Computer Science, University of A Coruna, 15071, A Coruña, Spain.
| | - Victor Maojo
- Biomedical Informatics Group, DIA & DLSIIS, ETSI Informáticos, Universidad Politécnica de Madrid, Spain.
| |
Collapse
|
33
|
Ellouze AS, Bouaziz R, Ghorbel H. Integrating semantic dimension into openEHR archetypes for the management of cerebral palsy electronic medical records. J Biomed Inform 2016; 63:307-324. [PMID: 27568295 DOI: 10.1016/j.jbi.2016.08.018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Revised: 08/12/2016] [Accepted: 08/23/2016] [Indexed: 10/21/2022]
Abstract
PURPOSE Integrating semantic dimension into clinical archetypes is necessary once modeling medical records. First, it enables semantic interoperability and, it offers applying semantic activities on clinical data and provides a higher design quality of Electronic Medical Record (EMR) systems. However, to obtain these advantages, designers need to use archetypes that cover semantic features of clinical concepts involved in their specific applications. In fact, most of archetypes filed within open repositories are expressed in the Archetype Definition Language (ALD) which allows defining only the syntactic structure of clinical concepts weakening semantic activities on the EMR content in the semantic web environment. This paper focuses on the modeling of an EMR prototype for infants affected by Cerebral Palsy (CP), using the dual model approach and integrating semantic web technologies. Such a modeling provides a better delivery of quality of care and ensures semantic interoperability between all involved therapies' information systems. METHODS First, data to be documented are identified and collected from the involved therapies. Subsequently, data are analyzed and arranged into archetypes expressed in accordance of ADL. During this step, open archetype repositories are explored, in order to find the suitable archetypes. Then, ADL archetypes are transformed into archetypes expressed in OWL-DL (Ontology Web Language - Description Language). Finally, we construct an ontological source related to these archetypes enabling hence their annotation to facilitate data extraction and providing possibility to exercise semantic activities on such archetypes. RESULTS Semantic dimension integration into EMR modeled in accordance to the archetype approach. The feasibility of our solution is shown through the development of a prototype, baptized "CP-SMS", which ensures semantic exploitation of CP EMR. This prototype provides the following features: (i) creation of CP EMR instances and their checking by using a knowledge base which we have constructed by interviews with domain experts, (ii) translation of initially CP ADL archetypes into CP OWL-DL archetypes, (iii) creation of an ontological source which we can use to annotate obtained archetypes and (vi) enrichment and supply of the ontological source and integration of semantic relations by providing hence fueling the ontology with new concepts, ensuring consistency and eliminating ambiguity between concepts. CONCLUSIONS The degree of semantic interoperability that could be reached between EMR systems depends strongly on the quality of the used archetypes. Thus, the integration of semantic dimension in archetypes modeling process is crucial. By creating an ontological source and annotating archetypes, we create a supportive platform ensuring semantic interoperability between archetypes-based EMR-systems.
Collapse
Affiliation(s)
- Afef Samet Ellouze
- Mir@cl Laboratory, Tunisia; Higher Institute of Technological Studies, Sfax, Tunisia.
| | - Rafik Bouaziz
- Mir@cl Laboratory, Tunisia; Faculty of Economic Sciences and Management, Sfax University, Tunisia.
| | - Hanen Ghorbel
- Mir@cl Laboratory, Tunisia; Higher Institute of Management, Sousse, Tunisia.
| |
Collapse
|
34
|
Demski H, Garde S, Hildebrand C. Open data models for smart health interconnected applications: the example of openEHR. BMC Med Inform Decis Mak 2016; 16:137. [PMID: 27770769 PMCID: PMC5075152 DOI: 10.1186/s12911-016-0376-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2016] [Accepted: 10/18/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Smart Health is known as a concept that enhances networking, intelligent data processing and combining patient data with other parameters. Open data models can play an important role in creating a framework for providing interoperable data services that support the development of innovative Smart Health applications profiting from data fusion and sharing. METHODS This article describes a model-driven engineering approach based on standardized clinical information models and explores its application for the development of interoperable electronic health record systems. The following possible model-driven procedures were considered: provision of data schemes for data exchange, automated generation of artefacts for application development and native platforms that directly execute the models. The applicability of the approach in practice was examined using the openEHR framework as an example. RESULTS A comprehensive infrastructure for model-driven engineering of electronic health records is presented using the example of the openEHR framework. It is shown that data schema definitions to be used in common practice software development processes can be derived from domain models. The capabilities for automatic creation of implementation artefacts (e.g., data entry forms) are demonstrated. Complementary programming libraries and frameworks that foster the use of open data models are introduced. Several compatible health data platforms are listed. They provide standard based interfaces for interconnecting with further applications. CONCLUSION Open data models help build a framework for interoperable data services that support the development of innovative Smart Health applications. Related tools for model-driven application development foster semantic interoperability and interconnected innovative applications.
Collapse
Affiliation(s)
- Hans Demski
- Helmholtz Zentrum München, Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany.
| | - Sebastian Garde
- Ocean Informatics, 124 Cromwell Road, Kensington, London, SW7 4ET, United Kingdom
| | - Claudia Hildebrand
- Helmholtz Zentrum München, Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
| |
Collapse
|
35
|
Cardoso de Moraes JL, de Souza WL, Pires LF, do Prado AF. A methodology based on openEHR archetypes and software agents for developing e-health applications reusing legacy systems. Comput Methods Programs Biomed 2016; 134:267-287. [PMID: 27480749 DOI: 10.1016/j.cmpb.2016.07.013] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2016] [Accepted: 07/04/2016] [Indexed: 06/06/2023]
Abstract
BACKGROUND AND OBJECTIVE In Pervasive Healthcare, novel information and communication technologies are applied to support the provision of health services anywhere, at anytime and to anyone. Since health systems may offer their health records in different electronic formats, the openEHR Foundation prescribes the use of archetypes for describing clinical knowledge in order to achieve semantic interoperability between these systems. Software agents have been applied to simulate human skills in some healthcare procedures. This paper presents a methodology, based on the use of openEHR archetypes and agent technology, which aims to overcome the weaknesses typically found in legacy healthcare systems, thereby adding value to the systems. METHODS This methodology was applied in the design of an agent-based system, which was used in a realistic healthcare scenario in which a medical staff meeting to prepare a cardiac surgery has been supported. We conducted experiments with this system in a distributed environment composed by three cardiology clinics and a center of cardiac surgery, all located in the city of Marília (São Paulo, Brazil). We evaluated this system according to the Technology Acceptance Model. RESULTS The case study confirmed the acceptance of our agent-based system by healthcare professionals and patients, who reacted positively with respect to the usefulness of this system in particular, and with respect to task delegation to software agents in general. The case study also showed that a software agent-based interface and a tools-based alternative must be provided to the end users, which should allow them to perform the tasks themselves or to delegate these tasks to other people. CONCLUSIONS A Pervasive Healthcare model requires efficient and secure information exchange between healthcare providers. The proposed methodology allows designers to build communication systems for the message exchange among heterogeneous healthcare systems, and to shift from systems that rely on informal communication of actors to a more automated and less error-prone agent-based system. Our methodology preserves significant investment of many years in the legacy systems and allows developers to extend them adding new features to these systems, by providing proactive assistance to the end-users and increasing the user mobility with an appropriate support.
Collapse
Affiliation(s)
- João Luís Cardoso de Moraes
- Federal University of São Carlos, Computer Department, Rodovia Washington Luís-Km 235, 13565-905 São Carlos-SP, Brazil.
| | - Wanderley Lopes de Souza
- Federal University of São Carlos, Computer Department, Rodovia Washington Luís-Km 235, 13565-905 São Carlos-SP, Brazil
| | - Luís Ferreira Pires
- University of Twente, Centre for Telematics and Information Technology, Drienerlolaan 5, 7522 NB, Enschede, The Netherlands
| | - Antonio Francisco do Prado
- Federal University of São Carlos, Computer Department, Rodovia Washington Luís-Km 235, 13565-905 São Carlos-SP, Brazil
| |
Collapse
|
36
|
Sun H, Depraetere K, De Roo J, Mels G, De Vloed B, Twagirumukiza M, Colaert D. Semantic processing of EHR data for clinical research. J Biomed Inform 2015; 58:247-259. [PMID: 26515501 DOI: 10.1016/j.jbi.2015.10.009] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2015] [Revised: 09/10/2015] [Accepted: 10/17/2015] [Indexed: 11/24/2022]
Abstract
There is a growing need to semantically process and integrate clinical data from different sources for clinical research. This paper presents an approach to integrate EHRs from heterogeneous resources and generate integrated data in different data formats or semantics to support various clinical research applications. The proposed approach builds semantic data virtualization layers on top of data sources, which generate data in the requested semantics or formats on demand. This approach avoids upfront dumping to and synchronizing of the data with various representations. Data from different EHR systems are first mapped to RDF data with source semantics, and then converted to representations with harmonized domain semantics where domain ontologies and terminologies are used to improve reusability. It is also possible to further convert data to application semantics and store the converted results in clinical research databases, e.g. i2b2, OMOP, to support different clinical research settings. Semantic conversions between different representations are explicitly expressed using N3 rules and executed by an N3 Reasoner (EYE), which can also generate proofs of the conversion processes. The solution presented in this paper has been applied to real-world applications that process large scale EHR data.
Collapse
Affiliation(s)
- Hong Sun
- Advanced Clinical Applications Research Group, Agfa HealthCare, Moutstraat 100, 9000 Gent, Belgium.
| | - Kristof Depraetere
- Advanced Clinical Applications Research Group, Agfa HealthCare, Moutstraat 100, 9000 Gent, Belgium
| | - Jos De Roo
- Advanced Clinical Applications Research Group, Agfa HealthCare, Moutstraat 100, 9000 Gent, Belgium
| | - Giovanni Mels
- Advanced Clinical Applications Research Group, Agfa HealthCare, Moutstraat 100, 9000 Gent, Belgium
| | - Boris De Vloed
- Advanced Clinical Applications Research Group, Agfa HealthCare, Moutstraat 100, 9000 Gent, Belgium
| | - Marc Twagirumukiza
- Advanced Clinical Applications Research Group, Agfa HealthCare, Moutstraat 100, 9000 Gent, Belgium
| | - Dirk Colaert
- Advanced Clinical Applications Research Group, Agfa HealthCare, Moutstraat 100, 9000 Gent, Belgium
| |
Collapse
|
37
|
Marco-Ruiz L, Moner D, Maldonado JA, Kolstrup N, Bellika JG. Archetype-based data warehouse environment to enable the reuse of electronic health record data. Int J Med Inform 2015; 84:702-14. [PMID: 26094821 DOI: 10.1016/j.ijmedinf.2015.05.016] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Revised: 05/26/2015] [Accepted: 05/28/2015] [Indexed: 10/23/2022]
Abstract
BACKGROUND The reuse of data captured during health care delivery is essential to satisfy the demands of clinical research and clinical decision support systems. A main barrier for the reuse is the existence of legacy formats of data and the high granularity of it when stored in an electronic health record (EHR) system. Thus, we need mechanisms to standardize, aggregate, and query data concealed in the EHRs, to allow their reuse whenever they are needed. OBJECTIVE To create a data warehouse infrastructure using archetype-based technologies, standards and query languages to enable the interoperability needed for data reuse. MATERIALS AND METHODS The work presented makes use of best of breed archetype-based data transformation and storage technologies to create a workflow for the modeling, extraction, transformation and load of EHR proprietary data into standardized data repositories. We converted legacy data and performed patient-centered aggregations via archetype-based transformations. Later, specific purpose aggregations were performed at a query level for particular use cases. RESULTS Laboratory test results of a population of 230,000 patients belonging to Troms and Finnmark counties in Norway requested between January 2013 and November 2014 have been standardized. Test records normalization has been performed by defining transformation and aggregation functions between the laboratory records and an archetype. These mappings were used to automatically generate open EHR compliant data. These data were loaded into an archetype-based data warehouse. Once loaded, we defined indicators linked to the data in the warehouse to monitor test activity of Salmonella and Pertussis using the archetype query language. DISCUSSION Archetype-based standards and technologies can be used to create a data warehouse environment that enables data from EHR systems to be reused in clinical research and decision support systems. With this approach, existing EHR data becomes available in a standardized and interoperable format, thus opening a world of possibilities toward semantic or concept-based reuse, query and communication of clinical data.
Collapse
Affiliation(s)
- Luis Marco-Ruiz
- Norwegian Centre for Integrated Care and Telemedicine, University Hospital of North Norway, Norway; Department of Clinical Medicine, Faculty of Health Sciences, UIT The Arctic University of Norway, Norway.
| | - David Moner
- Instituto de Aplicaciones de las Tecnologías de la Información y de las Comunicaciones Avanzadas, Universitat Politècnica de València, Valencia, Spain
| | - José A Maldonado
- Instituto de Aplicaciones de las Tecnologías de la Información y de las Comunicaciones Avanzadas, Universitat Politècnica de València, Valencia, Spain; VeraTech for Health SL, Valencia, Spain
| | - Nils Kolstrup
- Norwegian Centre for Integrated Care and Telemedicine, University Hospital of North Norway, Norway; General Practice Research Unit, UIT The Arctic University of Norway, Norway
| | - Johan G Bellika
- Norwegian Centre for Integrated Care and Telemedicine, University Hospital of North Norway, Norway; Department of Clinical Medicine, Faculty of Health Sciences, UIT The Arctic University of Norway, Norway
| |
Collapse
|
38
|
Pahl C, Zare M, Nilashi M, de Faria Borges MA, Weingaertner D, Detschew V, Supriyanto E, Ibrahim O. Role of OpenEHR as an open source solution for the regional modelling of patient data in obstetrics. J Biomed Inform 2015; 55:174-87. [PMID: 25900270 DOI: 10.1016/j.jbi.2015.04.004] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Revised: 03/19/2015] [Accepted: 04/10/2015] [Indexed: 10/23/2022]
Abstract
This work investigates, whether openEHR with its reference model, archetypes and templates is suitable for the digital representation of demographic as well as clinical data. Moreover, it elaborates openEHR as a tool for modelling Hospital Information Systems on a regional level based on a national logical infrastructure. OpenEHR is a dual model approach developed for the modelling of Hospital Information Systems enabling semantic interoperability. A holistic solution to this represents the use of dual model based Electronic Healthcare Record systems. Modelling data in the field of obstetrics is a challenge, since different regions demand locally specific information for the process of treatment. Smaller health units in developing countries like Brazil or Malaysia, which until recently handled automatable processes like the storage of sensitive patient data in paper form, start organizational reconstruction processes. This archetype proof-of-concept investigation has tried out some elements of the openEHR methodology in cooperation with a health unit in Colombo, Brazil. Two legal forms provided by the Brazilian Ministry of Health have been analyzed and classified into demographic and clinical data. LinkEHR-Ed editor was used to read, edit and create archetypes. Results show that 33 clinical and demographic concepts, which are necessary to cover data demanded by the Unified National Health System, were identified. Out of the concepts 61% were reused and 39% modified to cover domain requirements. The detailed process of reuse, modification and creation of archetypes is shown. We conclude that, although a major part of demographic and clinical patient data were already represented by existing archetypes, a significant part required major modifications. In this study openEHR proved to be a highly suitable tool in the modelling of complex health data. In combination with LinkEHR-Ed software it offers user-friendly and highly applicable tools, although the complexity built by the vast specifications requires expert networks to define generally excepted clinical models. Finally, this project has pointed out main benefits enclosing high coverage of obstetrics data on the Clinical Knowledge Manager, simple modelling, and wide network and support using openEHR. Moreover, barriers described are enclosing the allocation of clinical content to respective archetypes, as well as stagnant adaption of changes on the Clinical Knowledge Manager leading to redundant efforts in data contribution that need to be addressed in future works.
Collapse
Affiliation(s)
- Christina Pahl
- Department of Biomechatronics, Ilmenau University of Technology, Ilmenau, Thuringia, Germany; IJN-UTM Cardiovascular Engineering Centre, University of Technology Malaysia, Skudai, Johor, Malaysia.
| | - Mojtaba Zare
- Faculty of Computing, Universiti Technologi Malaysia, Johor, Malaysia
| | | | | | - Daniel Weingaertner
- Department of Informatics, Universidade Federal do Paraná, Curitiba, Paraná, Brazil
| | - Vesselin Detschew
- Institute of Biomedical Engineering and Informatics, Department of Biomedical Engineering, Ilmenau, Thuringia, Germany
| | - Eko Supriyanto
- IJN-UTM Cardiovascular Engineering Centre, University of Technology Malaysia, Skudai, Johor, Malaysia
| | - Othman Ibrahim
- Faculty of Computing, Universiti Technologi Malaysia, Johor, Malaysia
| |
Collapse
|
39
|
He Z, Geller J, Chen Y. A comparative analysis of the density of the SNOMED CT conceptual content for semantic harmonization. Artif Intell Med 2015; 64:29-40. [PMID: 25890688 DOI: 10.1016/j.artmed.2015.03.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2014] [Revised: 03/20/2015] [Accepted: 03/25/2015] [Indexed: 11/17/2022]
Abstract
OBJECTIVES Medical terminologies vary in the amount of concept information (the "density") represented, even in the same sub-domains. This causes problems in terminology mapping, semantic harmonization and terminology integration. Moreover, complex clinical scenarios need to be encoded by a medical terminology with comprehensive content. SNOMED Clinical Terms (SNOMED CT), a leading clinical terminology, was reported to lack concepts and synonyms, problems that cannot be fully alleviated by using post-coordination. Therefore, a scalable solution is needed to enrich the conceptual content of SNOMED CT. We are developing a structure-based, algorithmic method to identify potential concepts for enriching the conceptual content of SNOMED CT and to support semantic harmonization of SNOMED CT with selected other Unified Medical Language System (UMLS) terminologies. METHODS We first identified a subset of English terminologies in the UMLS that have 'PAR' relationship labeled with 'IS_A' and over 10% overlap with one or more of the 19 hierarchies of SNOMED CT. We call these "reference terminologies" and we note that our use of this name is different from the standard use. Next, we defined a set of topological patterns across pairs of terminologies, with SNOMED CT being one terminology in each pair and the other being one of the reference terminologies. We then explored how often these topological patterns appear between SNOMED CT and each reference terminology, and how to interpret them. RESULTS Four viable reference terminologies were identified. Large density differences between terminologies were found. Expected interpretations of these differences were indeed observed, as follows. A random sample of 299 instances of special topological patterns ("2:3 and 3:2 trapezoids") showed that 39.1% and 59.5% of analyzed concepts in SNOMED CT and in a reference terminology, respectively, were deemed to be alternative classifications of the same conceptual content. In 30.5% and 17.6% of the cases, it was found that intermediate concepts could be imported into SNOMED CT or into the reference terminology, respectively, to enhance their conceptual content, if approved by a human curator. Other cases included synonymy and errors in one of the terminologies. CONCLUSION These results show that structure-based algorithmic methods can be used to identify potential concepts to enrich SNOMED CT and the four reference terminologies. The comparative analysis has the future potential of supporting terminology authoring by suggesting new content to improve content coverage and semantic harmonization between terminologies.
Collapse
Affiliation(s)
- Zhe He
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA.
| | - James Geller
- Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA
| | - Yan Chen
- Department of Computer Information Systems, Borough of Manhattan Community College, City University New York, New York, NY 10007, USA
| |
Collapse
|
40
|
Moreno-Conde A, Jódar-Sánchez F, Kalra D. Requirements for clinical information modelling tools. Int J Med Inform 2015; 84:524-36. [PMID: 25868808 DOI: 10.1016/j.ijmedinf.2015.03.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2014] [Revised: 03/15/2015] [Accepted: 03/17/2015] [Indexed: 11/17/2022]
Abstract
OBJECTIVE This study proposes consensus requirements for clinical information modelling tools that can support modelling tasks in medium/large scale institutions. Rather than identify which functionalities are currently available in existing tools, the study has focused on functionalities that should be covered in order to provide guidance about how to evolve the existing tools. METHODOLOGY After identifying a set of 56 requirements for clinical information modelling tools based on a literature review and interviews with experts, a classical Delphi study methodology was applied to conduct a two round survey in order to classify them as essential or recommended. Essential requirements are those that must be met by any tool that claims to be suitable for clinical information modelling, and if we one day have a certified tools list, any tool that does not meet essential criteria would be excluded. Recommended requirements are those more advanced requirements that may be met by tools offering a superior product or only needed in certain modelling situations. RESULTS According to the answers provided by 57 experts from 14 different countries, we found a high level of agreement to enable the study to identify 20 essential and 21 recommended requirements for these tools. CONCLUSIONS It is expected that this list of identified requirements will guide developers on the inclusion of new basic and advanced functionalities that have strong support by end users. This list could also guide regulators in order to identify requirements that could be demanded of tools adopted within their institutions.
Collapse
Affiliation(s)
- Alberto Moreno-Conde
- Centre for Health Informatics and Multiprofessional Education, University College London, London, United Kingdom; Technological Innovation Group, Virgen del Rocío University Hospital, Seville, Spain; Biomedical Informatics Research Area, Digitalica Salud SL, Seville, Spain.
| | | | - Dipak Kalra
- Centre for Health Informatics and Multiprofessional Education, University College London, London, United Kingdom; The European Institute for Health Records (EuroRec), Sint-Martens-Latem, Belgium
| |
Collapse
|
41
|
Alonso-Calvo R, Perez-Rey D, Paraiso-Medina S, Claerhout B, Hennebert P, Bucur A. Enabling semantic interoperability in multi-centric clinical trials on breast cancer. Comput Methods Programs Biomed 2015; 118:322-329. [PMID: 25682737 DOI: 10.1016/j.cmpb.2015.01.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2014] [Revised: 12/10/2014] [Accepted: 01/23/2015] [Indexed: 06/04/2023]
Abstract
BACKGROUND AND OBJECTIVES Post-genomic clinical trials require the participation of multiple institutions, and collecting data from several hospitals, laboratories and research facilities. This paper presents a standard-based solution to provide a uniform access endpoint to patient data involved in current clinical research. METHODS The proposed approach exploits well-established standards such as HL7 v3 or SPARQL and medical vocabularies such as SNOMED CT, LOINC and HGNC. A novel mechanism to exploit semantic normalization among HL7-based data models and biomedical ontologies has been created by using Semantic Web technologies. RESULTS Different types of queries have been used for testing the semantic interoperability solution described in this paper. The execution times obtained in the tests enable the development of end user tools within a framework that requires efficient retrieval of integrated data. CONCLUSIONS The proposed approach has been successfully tested by applications within the INTEGRATE and EURECA EU projects. These applications have been deployed and tested for: (i) patient screening, (ii) trial recruitment, and (iii) retrospective analysis; exploiting semantically interoperable access to clinical patient data from heterogeneous data sources.
Collapse
Affiliation(s)
- Raul Alonso-Calvo
- Biomedical Informatics Group, DLSIIS & DIA, Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo S/N, 28660 Boadilla del Monte, Madrid, Spain.
| | - David Perez-Rey
- Biomedical Informatics Group, DLSIIS & DIA, Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo S/N, 28660 Boadilla del Monte, Madrid, Spain
| | - Sergio Paraiso-Medina
- Biomedical Informatics Group, DLSIIS & DIA, Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo S/N, 28660 Boadilla del Monte, Madrid, Spain
| | - Brecht Claerhout
- Custodix NV, Kortrijksesteenweg 214b3, Sint-Martens-Latem, Belgium
| | | | - Anca Bucur
- PHILIPS Research Europe, High Tech Campus 34, Eindhoven, Netherlands
| |
Collapse
|
42
|
Abstract
Background We are currently facing a proliferation of heterogeneous biomedical data sources accessible through various knowledge-based applications. These data are annotated by increasingly extensive and widely disseminated knowledge organisation systems ranging from simple terminologies and structured vocabularies to formal ontologies. In order to solve the interoperability issue, which arises due to the heterogeneity of these ontologies, an alignment task is usually performed. However, while significant effort has been made to provide tools that automatically align small ontologies containing hundreds or thousands of entities, little attention has been paid to the matching of large sized ontologies in the life sciences domain. Results We have designed and implemented ServOMap, an effective method for large scale ontology matching. It is a fast and efficient high precision system able to perform matching of input ontologies containing hundreds of thousands of entities. The system, which was included in the 2012 and 2013 editions of the Ontology Alignment Evaluation Initiative campaign, performed very well. It was ranked among the top systems for the large ontologies matching. Conclusions We proposed an approach for large scale ontology matching relying on Information Retrieval (IR) techniques and the combination of lexical and machine learning contextual similarity computing for the generation of candidate mappings. It is particularly adapted to the life sciences domain as many of the ontologies in this domain benefit from synonym terms taken from the Unified Medical Language System and that can be used by our IR strategy. The ServOMap system we implemented is able to deal with hundreds of thousands entities with an efficient computation time.
Collapse
Affiliation(s)
- Gayo Diallo
- University Bordeaux, ISPED, Centre INSERM U897, F-33000 Bordeaux, France
| |
Collapse
|
43
|
Breil B, Kenneweg J, Fritz F, Bruland P, Doods D, Trinczek B, Dugas M. Multilingual Medical Data Models in ODM Format: A Novel Form-based Approach to Semantic Interoperability between Routine Healthcare and Clinical Research. Appl Clin Inform 2012; 3:276-89. [PMID: 23620720 DOI: 10.4338/aci-2012-03-ra-0011] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2012] [Accepted: 06/24/2012] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Semantic interoperability between routine healthcare and clinical research is an unsolved issue, as information systems in the healthcare domain still use proprietary and site-specific data models. However, information exchange and data harmonization are essential for physicians and scientists if they want to collect and analyze data from different hospitals in order to build up registries and perform multicenter clinical trials. Consequently, there is a need for a standardized metadata exchange based on common data models. Currently this is mainly done by informatics experts instead of medical experts. OBJECTIVES We propose to enable physicians to exchange, rate, comment and discuss their own medical data models in a collaborative web-based repository of medical forms in a standardized format. METHODS Based on a comprehensive requirement analysis, a web-based portal for medical data models was specified. In this context, a data model is the technical specification (attributes, data types, value lists) of a medical form without any layout information. The CDISC Operational Data Model (ODM) was chosen as the appropriate format for the standardized representation of data models. The system was implemented with Ruby on Rails and applies web 2.0 technologies to provide a community based solution. Forms from different source systems - both routine care and clinical research - were converted into ODM format and uploaded into the portal. RESULTS A portal for medical data models based on ODM-files was implemented (http://www.medical-data-models.org). Physicians are able to upload, comment, rate and download medical data models. More than 250 forms with approximately 8000 items are provided in different views (overview and detailed presentation) and in multiple languages. For instance, the portal contains forms from clinical and research information systems. CONCLUSION The portal provides a system-independent repository for multilingual data models in ODM format which can be used by physicians. It serves as a platform for discussion and enables the exchange of multilingual medical data models in a standardized way.
Collapse
Affiliation(s)
- B Breil
- Institute of Medical Informatics, University of Münster , Germany
| | | | | | | | | | | | | |
Collapse
|
44
|
Limaye N. Data management Redefined. Perspect Clin Res 2010; 1:110-2. [PMID: 21814632 PMCID: PMC3146076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Core perspectives on the traditional approach to CDM are rapidly changing and EDC and new eclincal initiatives are redefining the face of data management. Associated with EDC are not only the higher efficiencies, resulting in lower study costs, but its applications in key areas such as adaptive trials and clinical event adjudication; however the cost and effort involved in deployment and integration remain a deterrent. The role of the data manager may change to that of a data broker who manages the exchange of data from multiple sources, and semantic interoperability, data standards and data privacy will prove to be the defining factors. Simulation modeling, pharmacogenomics, personalized medicine and EHRs will no longer exist as silos and seamless data flows will be the drivers of healthcare solutions.
Collapse
Affiliation(s)
- Nimita Limaye
- VP and Global Head, Strategic Data Services and Medical Writing, SIRO Clinpharm Pvt. Ltd. & Vice-Chair, Society of Clinical Data Management
| |
Collapse
|