51
|
Spitale G. Making sense in the flood. How to cope with the massive flow of digital information in medical ethics. Heliyon 2020; 6:e04426. [PMID: 32743090 PMCID: PMC7385457 DOI: 10.1016/j.heliyon.2020.e04426] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Revised: 01/20/2020] [Accepted: 07/08/2020] [Indexed: 01/08/2023] Open
Abstract
Scientific publications have become the currency of Academia, hence the concept of 'publish or perish'. But there are consequences: the amount of existing literature and its proliferation rate have reached the point where keeping pace is just impossible. If this is true in general, it becomes a huge issue in interdisciplinary fields such as bioethics where knowing the state of the art in more than one single discipline is a concrete necessity. If we accept the idea of building new science on an exhaustive comprehension of existing knowledge, a radical change is needed. Smart iterative search strategies, frequency analysis and text mining, techniques described in this paper, can't be a long run solution. But they might serve as a useful coping strategy.
Collapse
Affiliation(s)
- Giovanni Spitale
- Institute of Biomedical Ethics and History of Medicine, University of Zurich, Winterthurerstrasse 30, 8006, Zurich, Switzerland
| |
Collapse
|
52
|
Burgermaster M, Son JH, Davidson PG, Smaldone AM, Kuperman G, Feller DJ, Burt KG, Levine ME, Albers DJ, Weng C, Mamykina L. A new approach to integrating patient-generated data with expert knowledge for personalized goal setting: A pilot study. Int J Med Inform 2020; 139:104158. [PMID: 32388157 PMCID: PMC7332366 DOI: 10.1016/j.ijmedinf.2020.104158] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 02/19/2020] [Accepted: 04/23/2020] [Indexed: 12/17/2022]
Abstract
INTRODUCTION Self-monitoring technologies produce patient-generated data that could be leveraged to personalize nutritional goal setting to improve population health; however, most computational approaches are limited when applied to individual-level personalization with sparse and irregular self-monitoring data. We applied informatics methods from expert suggestion systems to a challenging clinical problem: generating personalized nutrition goals from patient-generated diet and blood glucose data. MATERIALS AND METHODS We applied qualitative process coding and decision tree modeling to understand how registered dietitians translate patient-generated data into recommendations for dietary self-management of diabetes (i.e., knowledge model). We encoded this process in a set of functions that take diet and blood glucose data as an input and output diet recommendations (i.e., inference engine). Dietitians assessed face validity. Using four patient datasets, we compared our inference engine's output to clinical narratives and gold standards developed by expert clinicians. RESULTS To dietitians, the knowledge model represented how recommendations from patient data are made. Inference engine recommendations were 63 % consistent with the gold standard (range = 42 %-75 %) and 74 % consistent with narrative clinical observations (range = 63 %-83 %). DISCUSSION Qualitative modeling and automating how dietitians reason over patient data resulted in a knowledge model representing clinical knowledge. However, our knowledge model was less consistent with gold standard than narrative clinical recommendations, raising questions about how best to evaluate approaches that integrate patient-generated data with expert knowledge. CONCLUSION New informatics approaches that integrate data-driven methods with expert decision making for personalized goal setting, such as the knowledge base and inference engine presented here, demonstrate the potential to extend the reach of patient-generated data by synthesizing it with clinical knowledge. However, important questions remain about the strengths and weaknesses of computer algorithms developed to discern signal from patient-generated data compared to human experts.
Collapse
Affiliation(s)
- Marissa Burgermaster
- Nutritional Sciences & Population Health, University of Texas at Austin, Austin, TX, USA; Biomedical Informatics, Columbia University, New York, NY, USA.
| | - Jung H Son
- Biomedical Informatics, Columbia University, New York, NY, USA
| | | | - Arlene M Smaldone
- School of Nursing & College of Dental Medicine, Columbia University, New York, NY, USA
| | - Gilad Kuperman
- Biomedical Informatics, Columbia University, New York, NY, USA; Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Daniel J Feller
- Biomedical Informatics, Columbia University, New York, NY, USA
| | | | | | - David J Albers
- Biomedical Informatics, Columbia University, New York, NY, USA; Pediatrics & Informatics, University of Colorado, Aurora, CO, USA
| | - Chunhua Weng
- Biomedical Informatics, Columbia University, New York, NY, USA
| | - Lena Mamykina
- Biomedical Informatics, Columbia University, New York, NY, USA
| |
Collapse
|
53
|
Abstract
Biological databases are growing at an exponential rate, currently being among the major producers of Big Data, almost on par with commercial generators, such as YouTube or Twitter. While traditionally biological databases evolved as independent silos, each purposely built by a different research group in order to answer specific research questions; more recently significant efforts have been made toward integrating these heterogeneous sources into unified data access systems or interoperable systems using the FAIR principles of data sharing. Semantic Web technologies have been key enablers in this process, opening the path for new insights into the unified data, which were not visible at the level of each independent database. In this chapter, we first provide an introduction into two of the most used database models for biological data: relational databases and RDF stores. Next, we discuss ontology-based data integration, which serves to unify and enrich heterogeneous data sources. We present an extensive timeline of milestones in data integration based on Semantic Web technologies in the field of life sciences. Finally, we discuss some of the remaining challenges in making ontology-based data access (OBDA) systems easily accessible to a larger audience. In particular, we introduce natural language search interfaces, which alleviate the need for database users to be familiar with technical query languages. We illustrate the main theoretical concepts of data integration through concrete examples, using two well-known biological databases: a gene expression database, Bgee, and an orthology database, OMA.
Collapse
Affiliation(s)
- Ana Claudia Sima
- ZHAW Zurich University of Applied Sciences, Winterthur, Switzerland. .,University of Lausanne, Lausanne, Switzerland.
| | - Kurt Stockinger
- ZHAW Zurich University of Applied Sciences, Winterthur, Switzerland
| | - Tarcisio Mendes de Farias
- University of Lausanne, Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Manuel Gil
- ZHAW Zurich University of Applied Sciences, Winterthur, Switzerland.,SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
54
|
Abstract
Ontology, as a useful knowledge engineering technique, has been widely used for reducing ambiguity and helping with information sharing. It is considered originally to be clear, comprehensive, and with well-defined format. It characterizes several domains purposes description through structured and formalized languages. In various areas of research, it has become a significant way to realize successful and powerful accomplishments. Actually, medical ontologies were turned into an efficient application in medical domains. They also become a relevant approach to process large medical data volumes. Consequently, they are behaving as a support decision system in some cases. Also, they ensure diagnosis process acceleration and assistance. Additionally, they have been integrated especially to represent human healthcare concepts. For that reason, plenty of research works applied ontologies to design and treat liver diseases. In this article, we present a general overview of medical ontologies to stand for this type of disease. We expose and discuss these works in details by a complete comparison. Also, we show their performance to arrange clinical data and extract results.
Collapse
Affiliation(s)
- Rim Messaoudi
- MIRACL Laboratory, University of Sfax, Sfax, Tunisia.
- Institut Pascal, Université Clermont Auvergne, UMR6602 CNRS/UCA/SIGMA, 63171, Aubière, France.
| | - Achraf Mtibaa
- MIRACL Laboratory, University of Sfax, Sfax, Tunisia
- National School of Electronic and Telecommunications, University of Sfax, Sfax, Tunisia
| | - Antoine Vacavant
- Institut Pascal, Université Clermont Auvergne, UMR6602 CNRS/UCA/SIGMA, 63171, Aubière, France
| | - Faïez Gargouri
- MIRACL Laboratory, University of Sfax, Sfax, Tunisia
- Higher Institute of Computer Science and Multimedia, University of Sfax, Sfax, Tunisia
| | - Faouzi Jaziri
- Institut Pascal, Université Clermont Auvergne, UMR6602 CNRS/UCA/SIGMA, 63171, Aubière, France
| |
Collapse
|
55
|
Kumarasinghe K, Kasabov N, Taylor D. Deep learning and deep knowledge representation in Spiking Neural Networks for Brain-Computer Interfaces. Neural Netw 2019; 121:169-185. [PMID: 31568895 DOI: 10.1016/j.neunet.2019.08.029] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Revised: 08/26/2019] [Accepted: 08/26/2019] [Indexed: 01/21/2023]
Abstract
OBJECTIVE This paper argues that Brain-Inspired Spiking Neural Network (BI-SNN) architectures can learn and reveal deep in time-space functional and structural patterns from spatio-temporal data. These patterns can be represented as deep knowledge, in a partial case in the form of deep spatio-temporal rules. This is a promising direction for building new types of Brain-Computer Interfaces called Brain-Inspired Brain-Computer Interfaces (BI-BCI). A theoretical framework and its experimental validation on deep knowledge extraction and representation using SNN are presented. RESULTS The proposed methodology was applied in a case study to extract deep knowledge of the functional and structural organisation of the brain's neural network during the execution of a Grasp and Lift task. The BI-BCI successfully extracted the neural trajectories that represent the dorsal and ventral visual information processing streams as well as its connection to the motor cortex in the brain. Deep spatiotemporal rules on functional and structural interaction of distinct brain areas were then used for event prediction in BI-BCI. SIGNIFICANCE The computational framework can be used for unveiling the topological patterns of the brain and such knowledge can be effectively used to enhance the state-of-the-art in BCI.
Collapse
Affiliation(s)
- Kaushalya Kumarasinghe
- Knowledge Engineering and Discovery Research Institute, Auckland University of Technology, Auckland, New Zealand; Health and Rehabilitation Research Institute, Auckland University of Technology, Auckland, New Zealand.
| | - Nikola Kasabov
- Knowledge Engineering and Discovery Research Institute, Auckland University of Technology, Auckland, New Zealand.
| | - Denise Taylor
- Health and Rehabilitation Research Institute, Auckland University of Technology, Auckland, New Zealand.
| |
Collapse
|
56
|
Horschler DJ, Santos LR, MacLean EL. Do non-human primates really represent others' ignorance? A test of the awareness relations hypothesis. Cognition 2019; 190:72-80. [PMID: 31026672 PMCID: PMC6570545 DOI: 10.1016/j.cognition.2019.04.012] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 03/10/2019] [Accepted: 04/12/2019] [Indexed: 10/27/2022]
Abstract
Non-human primates can often predict how another agent will behave based on that agent's knowledge about the world. But how do non-human primates represent others' knowledge states? Researchers have recently proposed that non-human primates form "awareness relations" to attribute objectively true information to other minds, as opposed to human-like representations that track others' ignorance or false belief states. We present the first explicit test of the awareness relations hypothesis by examining when rhesus macaques' understanding of other agents' knowledge falters. In Experiment 1, monkeys watched an agent observe a piece of fruit (the target object) being hidden in one of two boxes. While the agent's view was occluded, either the fruit moved out of its box and directly back into it, or the box containing the fruit opened and immediately closed. We found that monkeys looked significantly longer when the agent reached incorrectly rather than correctly after the box's movement, but not after the fruit's movement. This result suggests that monkeys did not expect the agent to know the fruit's location when it briefly and arbitrarily moved while the agent could not see it, but did expect the agent to know the fruit's location when only the box moved while the agent could not see it. In Experiment 2, we replicated and extended both findings with a larger sample, a different target object, and opposite directions of motion in the test trials. These findings suggest that monkeys reason about others' knowledge of objects by forming awareness relations which are disrupted by arbitrary spatial manipulation of the target object while an agent has no perceptual access to it.
Collapse
Affiliation(s)
- Daniel J Horschler
- School of Anthropology, University of Arizona, Tucson, AZ 85719, USA; Cognitive Science Graduate Interdisciplinary Program, University of Arizona, Tucson, AZ 85719, USA.
| | - Laurie R Santos
- Department of Psychology, Yale University, New Haven, CT 06520, USA
| | - Evan L MacLean
- School of Anthropology, University of Arizona, Tucson, AZ 85719, USA; Department of Psychology, University of Arizona, Tucson, AZ 85719, USA; Cognitive Science Program, University of Arizona, Tucson, AZ 85719, USA
| |
Collapse
|
57
|
Rajan NS, Gouripeddi R, Mo P, Madsen RK, Facelli JC. Towards a content agnostic computable knowledge repository for data quality assessment. Comput Methods Programs Biomed 2019; 177:193-201. [PMID: 31319948 DOI: 10.1016/j.cmpb.2019.05.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Revised: 04/16/2019] [Accepted: 05/17/2019] [Indexed: 06/10/2023]
Abstract
BACKGROUND AND OBJECTIVE In recent years, several data quality conceptual frameworks have been proposed across the Data Quality and Information Quality domains towards assessment of quality of data. These frameworks are diverse, varying from simple lists of concepts to complex ontological and taxonomical representations of data quality concepts. The goal of this study is to design, develop and implement a platform agnostic computable data quality knowledge repository for data quality assessments. METHODS We identified computable data quality concepts by performing a comprehensive literature review of articles indexed in three major bibliographic data sources. From this corpus, we extracted data quality concepts, their definitions, applicable measures, their computability and identified conceptual relationships. We used these relationships to design and develop a data quality meta-model and implemented it in a quality knowledge repository. RESULTS We identified three primitives for programmatically performing data quality assessments: data quality concept, its definition, its measure or rule for data quality assessment, and their associations. We modeled a computable data quality meta-data repository and extended this framework to adapt, store, retrieve and automate assessment of other existing data quality assessment models. CONCLUSION We identified research gaps in data quality literature towards automating data quality assessments methods. In this process, we designed, developed and implemented a computable data quality knowledge repository for assessing quality and characterizing data in health data repositories. We leverage this knowledge repository in a service-oriented architecture to perform scalable and reproducible framework for data quality assessments in disparate biomedical data sources.
Collapse
Affiliation(s)
- Naresh Sundar Rajan
- Department of Biomedical Informatics, Center for Clinical and Translational Sciences (CCTS) Biomedical Informatics Core, University of Utah, 421 Wakara Way, Suite 140, Salt Lake City, UT 84108-3514, USA.
| | - Ramkiran Gouripeddi
- Department of Biomedical Informatics, Center for Clinical and Translational Sciences (CCTS) Biomedical Informatics Core, University of Utah, 421 Wakara Way, Suite 140, Salt Lake City, UT 84108-3514, USA.
| | - Peter Mo
- Department of Biomedical Informatics, Center for Clinical and Translational Sciences (CCTS) Biomedical Informatics Core, University of Utah, 421 Wakara Way, Suite 140, Salt Lake City, UT 84108-3514, USA.
| | - Randy K Madsen
- Department of Biomedical Informatics, Center for Clinical and Translational Sciences (CCTS) Biomedical Informatics Core, University of Utah, 421 Wakara Way, Suite 140, Salt Lake City, UT 84108-3514, USA.
| | - Julio C Facelli
- Department of Biomedical Informatics, Center for Clinical and Translational Sciences (CCTS) Biomedical Informatics Core, University of Utah, 421 Wakara Way, Suite 140, Salt Lake City, UT 84108-3514, USA.
| |
Collapse
|
58
|
Bottrighi A, Piovesan L, Terenziani P. Supporting the distributed execution of clinical guidelines by multiple agents. Artif Intell Med 2019; 98:87-108. [PMID: 31204191 DOI: 10.1016/j.artmed.2019.05.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Revised: 04/30/2019] [Accepted: 05/05/2019] [Indexed: 11/21/2022]
Abstract
Clinical guidelines (GLs) are widely adopted in order to improve the quality of patient care, and to optimize it. To achieve such goals, their application on a specific patient usually requires the interventions of different agents, with different roles (e.g., physician, nurse), abilities (e.g., specialist in the treatment of alcohol-related problems) and contexts (e.g., many chronic patients may be treated at home). Additionally, the responsibility of the application of a guideline to a patient is usually retained by a physician, but delegation of responsibility (of the whole guideline, or of a part of it) is often used\required (e.g., delegation to a specialist), as well as the possibility, for a responsible, to select the executor of an action (e.g., a physician may retain the responsibility of an action, but delegate to a nurse its execution). To manage such phenomena, proper support to agent interaction and communication must be provided, providing agents with facilities for (1) treatment continuity (2) contextualization, (3) responsibility assignment and delegation (4) check of agent "appropriateness". In this paper we extend GLARE, a computerized GL management system, to support such needs. We illustrate our approach by means of a practical case study.
Collapse
|
59
|
Shen Y, Yuan K, Yang M, Tang B, Li Y, Du N, Lei K. KMR: knowledge-oriented medicine representation learning for drug-drug interaction and similarity computation. J Cheminform 2019; 11:22. [PMID: 30874969 PMCID: PMC6419809 DOI: 10.1186/s13321-019-0342-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Accepted: 03/01/2019] [Indexed: 02/07/2023] Open
Abstract
Efficient representations of drugs provide important support for healthcare analytics, such as drug-drug interaction (DDI) prediction and drug-drug similarity (DDS) computation. However, incomplete annotated data and drug feature sparseness create substantial barriers for drug representation learning, making it difficult to accurately identify new drug properties prior to public release. To alleviate these deficiencies, we propose KMR, a knowledge-oriented feature-driven method which can learn drug related knowledge with an accurate representation. We conduct series of experiments on real-world medical datasets to demonstrate that KMR is capable of drug representation learning. KMR can support to discover meaningful DDI with an accuracy rate of 92.19%, demonstrating that techniques developed in KMR significantly improve the prediction quality for new drugs not seen at training. Experimental results also indicate that KMR can identify DDS with an accuracy rate of 88.7% by facilitating drug knowledge, outperforming existing state-of-the-art drug similarity measures.
Collapse
Affiliation(s)
- Ying Shen
- The Shenzhen Key Lab for Information Centric Networking and Blockchain Techologies(ICNLab), School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, 518055 Shenzhen, People’s Republic of China
| | - Kaiqi Yuan
- The Shenzhen Key Lab for Information Centric Networking and Blockchain Techologies(ICNLab), School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, 518055 Shenzhen, People’s Republic of China
| | - Min Yang
- SIAT, Chinese Academy of Sciences, 518055 Shenzhen, People’s Republic of China
| | - Buzhou Tang
- School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, 518055 People’s Republic of China
| | | | - Nan Du
- Tencent Medical AI Lab, Palo Alto, USA
| | - Kai Lei
- The Shenzhen Key Lab for Information Centric Networking and Blockchain Techologies(ICNLab), School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, 518055 Shenzhen, People’s Republic of China
- PCL Research Center of Networks and Communications, Peng Cheng Laboratory, Shenzhen, China
| |
Collapse
|
60
|
Rector A, Schulz S, Rodrigues JM, Chute CG, Solbrig H. On beyond Gruber: "Ontologies" in today's biomedical information systems and the limits of OWL. J Biomed Inform 2019; 100S:100002. [PMID: 34384571 DOI: 10.1016/j.yjbinx.2019.100002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The word "ontology" was introduced to information systems when only closed-world reasoning systems were available. It was "borrowed" from philosophy, but literal links to its philosophical meaning were explicitly disavowed. Since then, open-world reasoning systems based on description logics have been developed, OWL has become a standard, and philosophical issues have been raised. The result has too often been confusion. The question "What statements are ontological" receives a variety of answers. A clearer vocabulary that is better suited to today's information systems is needed. The project to base ICD-11 on a "Common Ontology" required addressing this confusion. This paper sets out to systematise the lessons of that experience and subsequent discussions. We explore the semantics of open-world and closed-world systems. For specifying knowledge bases and software, we propose "invariants" or, more fully, "the first order invariant part of the background domain knowledge base" as an alternative to the words "ontology" and "ontological." We discuss the role and limitations of OWL and description logics and how they are complementary to closed world systems such as frames and to less formal "knowledge organisation systems". We illustrate why the conventions of classifications such as ICD cannot be formulated directly in OWL, but can be linked to OWL knowledge bases by queries. We contend that while OWL and description logics are major advances for representing invariants and terminologies, they must be combined with other technologies to represent broader background knowledge faithfully. The ICD-11 architecture is one approach. We argue that such hybrid architectures can and should be developed further.
Collapse
Affiliation(s)
- Alan Rector
- University of Manchester, School of Computer Science, Kilburn Building, Oxford Road, Manchester M13 9PL, UK.
| | - Stefan Schulz
- Medical University of Graz, Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Auenbruggerpltz 2, 8036 Graz, Austria.
| | - Jean Marie Rodrigues
- Université Jean Monnet Saint Etienne/Université de Lyon, CHU de Saint-Etienne, SSPIM - Bâtiment CIM 42, Chemin de la Marandière, 42023 St Etienne cedex 2, France.
| | - Christopher G Chute
- Johns Hopkins University, School of Medicine, Public Health, and Nursing, 2024 E. Monument Street, Suite 1-202, Baltimore, MD 21205, USA.
| | - Harold Solbrig
- Johns Hopkins University, Institute for Clinical and Translational Research, 2024 E. Monument Street, Suite 1-202, Baltimore, MD 21205, USA.
| |
Collapse
|
61
|
Picart-Armada S, Fernández-Albert F, Vinaixa M, Yanes O, Perera-Lluna A. FELLA: an R package to enrich metabolomics data. BMC Bioinformatics 2018; 19:538. [PMID: 30577788 PMCID: PMC6303911 DOI: 10.1186/s12859-018-2487-5] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 11/12/2018] [Indexed: 12/27/2022] Open
Abstract
Background Pathway enrichment techniques are useful for understanding experimental metabolomics data. Their purpose is to give context to the affected metabolites in terms of the prior knowledge contained in metabolic pathways. However, the interpretation of a prioritized pathway list is still challenging, as pathways show overlap and cross talk effects. Results We introduce FELLA, an R package to perform a network-based enrichment of a list of affected metabolites. FELLA builds a hierarchical representation of an organism biochemistry from the Kyoto Encyclopedia of Genes and Genomes (KEGG), containing pathways, modules, enzymes, reactions and metabolites. In addition to providing a list of pathways, FELLA reports intermediate entities (modules, enzymes, reactions) that link the input metabolites to them. This sheds light on pathway cross talk and potential enzymes or metabolites as targets for the condition under study. FELLA has been applied to six public datasets –three from Homo sapiens, two from Danio rerio and one from Mus musculus– and has reproduced findings from the original studies and from independent literature. Conclusions The R package FELLA offers an innovative enrichment concept starting from a list of metabolites, based on a knowledge graph representation of the KEGG database that focuses on interpretability. Besides reporting a list of pathways, FELLA suggests intermediate entities that are of interest per se. Its usefulness has been shown at several molecular levels on six public datasets, including human and animal models. The user can run the enrichment analysis through a simple interactive graphical interface or programmatically. FELLA is publicly available in Bioconductor under the GPL-3 license. Electronic supplementary material The online version of this article (10.1186/s12859-018-2487-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sergio Picart-Armada
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, 08028, Spain. .,Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, 28029, Spain. .,Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, Esplugues de Llobregat, Barcelona, 08950, Spain.
| | - Francesc Fernández-Albert
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, 08028, Spain.,Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, 28029, Spain.,Takeda Cambridge Ltd, Cambridge, CB4 0PZ, UK
| | - Maria Vinaixa
- Metabolomics Platform, IISPV, Department of Electronic Engineering (DEEEA), Universitat Rovira i Virgili, Tarragona, 43003, Spain.,CIBER of Diabetes and Associated Metabolic Diseases (CIBERDEM), Madrid, 28029, Spain
| | - Oscar Yanes
- Metabolomics Platform, IISPV, Department of Electronic Engineering (DEEEA), Universitat Rovira i Virgili, Tarragona, 43003, Spain.,CIBER of Diabetes and Associated Metabolic Diseases (CIBERDEM), Madrid, 28029, Spain
| | - Alexandre Perera-Lluna
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, 08028, Spain.,Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, 28029, Spain.,Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, Esplugues de Llobregat, Barcelona, 08950, Spain
| |
Collapse
|
62
|
Bodenreider O, James J. The New SNOMED CT International Medicinal Product Model. CEUR Workshop Proc 2018; 2285:36. [PMID: 36277122 PMCID: PMC9584358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
OBJECTIVES To present the new SNOMED CT international medicinal product model. METHODS We present the main elements of the model, with focus on types of entities and their interrelations, definitional attributes for clinical drugs, and categories of groupers. RESULTS We present the status of implementation as of July 2018 and illustrate differences between the original and new models through an example. CONCLUSIONS Benefits of the new medicinal product model include comprehensive representation of clinical drugs, logical definitions with necessary and sufficient conditions for all medicinal product entities, better high-level organization through distinct categories of groupers, and compliance with international standards.
Collapse
Affiliation(s)
- Olivier Bodenreider
- U.S. National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Julie James
- Blue Wave Informatics LLP, Exeter, United Kingdom
| |
Collapse
|
63
|
Abstract
BACKGROUND The right dataset is essential to obtain the right insights in data science; therefore, it is important for data scientists to have a good understanding of the availability of relevant datasets as well as the content, structure, and existing analyses of these datasets. While a number of efforts are underway to integrate the large amount and variety of datasets, the lack of an information resource that focuses on specific needs of target users of datasets has existed as a problem for years. To address this gap, we have developed a Dataset Information Resource (DIR), using a user-oriented approach, which gathers relevant dataset knowledge for specific user types. In the present version, we specifically address the challenges of entry-level data scientists in learning to identify, understand, and analyze major datasets in healthcare. We emphasize that the DIR does not contain actual data from the datasets but aims to provide comprehensive knowledge about the datasets and their analyses. METHODS The DIR leverages Semantic Web technologies and the W3C Dataset Description Profile as the standard for knowledge integration and representation. To extract tailored knowledge for target users, we have developed methods for manual extractions from dataset documentations as well as semi-automatic extractions from related publications, using natural language processing (NLP)-based approaches. A semantic query component is available for knowledge retrieval, and a parameterized question-answering functionality is provided to facilitate the ease of search. RESULTS The DIR prototype is composed of four major components-dataset metadata and related knowledge, search modules, question answering for frequently-asked questions, and blogs. The current implementation includes information on 12 commonly used large and complex healthcare datasets. The initial usage evaluation based on health informatics novices indicates that the DIR is helpful and beginner-friendly. CONCLUSIONS We have developed a novel user-oriented DIR that provides dataset knowledge specialized for target user groups. Knowledge about datasets is effectively represented in the Semantic Web. At this initial stage, the DIR has already been able to provide sophisticated and relevant knowledge of 12 datasets to help entry health informacians learn healthcare data analysis using suitable datasets. Further development of both content and function levels is underway.
Collapse
Affiliation(s)
- Jingyi Shi
- Department of Software and Information Systems, University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, 28223 NC USA
| | - Mingna Zheng
- Department of Software and Information Systems, University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, 28223 NC USA
| | - Lixia Yao
- Department of Health Sciences Research, Mayo Clinic, 200 First Street SW, Rochester, 55905 MN USA
| | - Yaorong Ge
- Department of Software and Information Systems, University of North Carolina at Charlotte, 9201 University City Blvd, Charlotte, 28223 NC USA
| |
Collapse
|
64
|
Falcionelli N, Sernani P, Brugués A, Mekuria DN, Calvaresi D, Schumacher M, Dragoni AF, Bromuri S. Indexing the Event Calculus: Towards practical human-readable Personal Health Systems. Artif Intell Med 2019; 96:154-66. [PMID: 30442433 DOI: 10.1016/j.artmed.2018.10.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2017] [Revised: 09/28/2018] [Accepted: 10/16/2018] [Indexed: 11/22/2022]
Abstract
Personal Health Systems (PHS) are mobile solutions tailored to monitoring patients affected by chronic non communicable diseases. In general, a patient affected by a chronic disease can generate large amounts of events: for example, in Type 1 Diabetic patients generate several glucose events per day, ranging from at least 6 events per day (under normal monitoring) to 288 per day when wearing a continuous glucose monitor (CGM) that samples the blood every 5 minutes for several days. Just by itself, without considering other physiological parameters, it would be impossible for medical doctors to individually and accurately follow every patient, highlighting the need of simple approaches towards querying physiological time series. Achieving this with current technology is not an easy task, as on one hand it cannot be expected that medical doctors have the technical knowledge to query databases and on the other hand these time series include thousands of events, which requires to re-think the way data is indexed. Anyhow, handling data streams efficiently is not enough. Domain experts' knowledge must be explicitly included into PHSs in a way that it can be easily readed and modified by medical staffs. Logic programming represents the perfect programming paradygm to accomplish this task. In this work, an Event Calculus-based reasoning framework to standardize and express domain-knowledge in the form of monitoring rules is suggested, and applied to three different use cases. However, if online monitoring has to be achieved, the reasoning performance must improve dramatically. For this reason, three promising mechanisms to index the Event Calculus Knowledge Base are proposed. All of them are based on different types of tree indexing structures: k-d trees, interval trees and red-black trees. The paper then compares and analyzes the performance of the three indexing techniques, by computing the time needed to check different type of rules (and eventually generating alerts), when the number of recorded events (e.g. values of physiological parameters) increases. The results show that customized jREC performs much better when the event average inter-arrival time is little compared to the checked rule time-window. Instead, where the events are more sparse, the use of k-d trees with standard EC is advisable. Finally, the Multi-Agent paradigm helps to wrap the various components of the system: the reasoning engines represent the agent minds, and the sensors are its body. The said agents have been developed in MAGPIE, a mobile event based Java agent platform.
Collapse
|
65
|
Grossman LV, Mitchell EG, Hripcsak G, Weng C, Vawdrey DK. A method for harmonization of clinical abbreviation and acronym sense inventories. J Biomed Inform 2018; 88:62-69. [PMID: 30414475 DOI: 10.1016/j.jbi.2018.11.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 10/24/2018] [Accepted: 11/05/2018] [Indexed: 11/15/2022]
Abstract
BACKGROUND Previous research has developed methods to construct acronym sense inventories from a single institutional corpus. Although beneficial, a sense inventory constructed from a single institutional corpus is not generalizable, because acronyms from different geographic regions and medical specialties vary greatly. OBJECTIVE Develop an automated method to harmonize sense inventories from different regions and specialties towards the development of a comprehensive inventory. METHODS The method involves integrating multiple source sense inventories into one centralized inventory and cross-mapping redundant entries to establish synonymy. To evaluate our method, we integrated 8 well-known source inventories into one comprehensive inventory (or metathesaurus). For both the metathesaurus and its sources, we evaluated the coverage of acronyms and their senses on a corpus of 1 million clinical notes. The corpus came from a different institution, region, and specialty than the source inventories. RESULTS In the evaluation using clinical notes, the metathesaurus demonstrated an acronym (short form) micro-coverage of 94.3%, representing a substantial increase over the two next largest source inventories, the UMLS LRABR (74.8%) and ADAM (68.0%). The metathesaurus demonstrated a sense (long form) micro-coverage of 99.6%, again a substantial increase compared to the UMLS LRABR (82.5%) and ADAM (55.4%). CONCLUSIONS Given the high coverage, harmonizing acronym sense inventories is a promising methodology to improve their comprehensiveness. Our method is automated, leverages the extensive resources already devoted to developing institution-specific inventories in the United States, and may help generalize sense inventories to institutions who lack the resources to develop them. Future work should address quality issues in source inventories and explore additional approaches to establishing synonymy.
Collapse
Affiliation(s)
- Lisa V Grossman
- Department of Biomedical Informatics, Columbia University, New York, NY, USA; College of Physicians and Surgeons, Columbia University, New York, NY, USA.
| | - Elliot G Mitchell
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | - George Hripcsak
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, NY, USA
| | - David K Vawdrey
- Department of Biomedical Informatics, Columbia University, New York, NY, USA; Value Institute, NewYork-Presbyterian Hospital, New York, NY, USA
| |
Collapse
|
66
|
Abstract
BACKGROUND Patients' encounters with healthcare services must undergo clinical coding. These codes are typically derived from free-text notes. Manual clinical coding is expensive, time-consuming and prone to error. Automated clinical coding systems have great potential to save resources, and realtime availability of codes would improve oversight of patient care and accelerate research. Automated coding is made challenging by the idiosyncrasies of clinical text, the large number of disease codes and their unbalanced distribution. METHODS We explore methods for representing clinical text and the labels in hierarchical clinical coding ontologies. Text is represented as term frequency-inverse document frequency counts and then as word embeddings, which we use as input to recurrent neural networks. Labels are represented atomically, and then by learning representations of each node in a coding ontology and composing a representation for each label from its respective node path. We consider different strategies for initialisation of the node representations. We evaluate our methods using the publicly-available Medical Information Mart for Intensive Care III dataset: we extract the history of presenting illness section from each discharge summary in the dataset, then predicting the International Classification of Diseases, ninth revision, Clinical Modification codes associated with these. RESULTS Composing the label representations from the clinical-coding-ontology nodes increased weighted F1 for prediction of the 17,561 disease labels to 0.264-0.281 from 0.232-0.249 for atomic representations. Recurrent neural network text representation improved weighted F1 for prediction of the 19 disease-category labels to 0.682-0.701 from 0.662-0.682 using term frequency-inverse document frequency. However, term frequency-inverse document frequency outperformed recurrent neural networks for prediction of the 17,561 disease labels. CONCLUSIONS This study demonstrates that hierarchically-structured medical knowledge can be incorporated into statistical models, and produces improved performance during automated clinical coding. This performance improvement results primarily from improved representation of rarer diseases. We also show that recurrent neural networks improve representation of medical text in some settings. Learning good representations of the very rare diseases in clinical coding ontologies from data alone remains challenging, and alternative means of representing these diseases will form a major focus of future work on automated clinical coding.
Collapse
Affiliation(s)
- Finneas Catling
- University College London, Gower Street, London WC1E 6BT, UK.
| | | | | |
Collapse
|
67
|
Abstract
Background Entropy has become increasingly popular in computer science and information theory because it can be used to measure the predictability and redundancy of knowledge bases, especially ontologies. However, current entropy applications that evaluate ontologies consider only single-point connectivity rather than path connectivity, and they assign equal weights to each entity and path. Results We propose an Entropy-Aware Path-Based (EAPB) metric for ontology quality by considering the path information between different vertices and textual information included in the path to calculate the connectivity path of the whole network and dynamic weights between different nodes. The information obtained from structure-based embedding and text-based embedding is multiplied by the connectivity matrix of the entropy computation. EAPB is analytically evaluated against the state-of-the-art criteria. We have performed empirical analysis on real-world medical ontologies and a synthetic ontology based on the following three aspects: ontology statistical information (data quantity), entropy evaluation (data quality), and a case study (ontology structure and text visualization). These aspects mutually demonstrate the reliability of the proposed metric. The experimental results show that the proposed EAPB can effectively evaluate ontologies, especially those in the medical informatics field. Conclusions We leverage path information and textual information to enrich the network representational learning and aid in entropy computation. The analytics and assessments of semantic web can benefit from the structure information but also the text information. We believe that EAPB is helpful for managing ontology development and evaluation projects. Our results are reproducible and we will release the source code and ontology of this work after publication. (Source code and ontology: https://github.com/AnonymousResearcher1/ontologyEvaluate).
Collapse
Affiliation(s)
- Ying Shen
- Shenzhen Key Lab for Information Centric Networking & Blockchain Technology (ICNLAB), School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, 518055, Shenzhen, People's Republic of China
| | - Daoyuan Chen
- Shenzhen Key Lab for Information Centric Networking & Blockchain Technology (ICNLAB), School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, 518055, Shenzhen, People's Republic of China
| | - Buzhou Tang
- School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), 518055, Shenzhen, People's Republic of China
| | - Min Yang
- SIAT, Chinese Academy of Sciences, 518055, Shenzhen, People's Republic of China
| | - Kai Lei
- Shenzhen Key Lab for Information Centric Networking & Blockchain Technology (ICNLAB), School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, 518055, Shenzhen, People's Republic of China.
| |
Collapse
|
68
|
Abstract
BACKGROUND Healthcare services, particularly in patient-provider interaction, often involve highly emotional situations, and it is important for physicians to understand and respond to their patients' emotions to best ensure their well-being. METHODS In order to model the emotion domain, we have created the Visualized Emotion Ontology (VEO) to provide a semantic definition of 25 emotions based on established models, as well as visual representations of emotions utilizing shapes, lines, and colors. RESULTS As determined by ontology evaluation metrics, VEO exhibited better machine-readability (z=1.12), linguistic quality (z=0.61), and domain coverage (z=0.39) compared to a sample of cognitive ontologies. Additionally, a survey of 1082 participants through Amazon Mechanical Turk revealed that a significantly higher proportion of people agree than disagree with 17 out of our 25 emotion images, validating the majority of our visualizations. CONCLUSION From the development, evaluation, and serialization of the VEO, we have defined a set of 25 emotions using OWL that linked surveyed visualizations to each emotion. In the future, we plan to use the VEO in patient-facing software tools, such as embodied conversational agents, to enhance interactions between patients and providers in a clinical environment.
Collapse
Affiliation(s)
- Rebecca Lin
- Krieger School of Arts & Sciences, Johns Hopkins University, Baltimore, MD USA
| | - Muhammad “Tuan” Amith
- School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX USA
| | - Chen Liang
- Health Informatics & Information Management, Louisiana Tech University, Ruston, LA USA
| | - Rui Duan
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA USA
| | - Yong Chen
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA USA
| | - Cui Tao
- School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX USA
| |
Collapse
|
69
|
Siregar P, Julen N, Hufnagl P, Mutter G. A general framework dedicated to computational morphogenesis Part II - Knowledge representation and architecture. Biosystems 2018; 173:314-334. [PMID: 30009873 DOI: 10.1016/j.biosystems.2018.07.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Revised: 05/21/2018] [Accepted: 07/05/2018] [Indexed: 11/16/2022]
Abstract
In our previous paper we introduced morphogenesis and post-embryonic life as arising from cells interacting via coupled chemical, electrical and mechanical processes occurring across multiple organization levels. We reviewed these processes from the perspectives of developmental biology and how they relate to physics-based constitutive equations that are well suited to model intercellular interactions' fields. In this paper we will describe a knowledge representation and architectural design strategy that can organize and encode the biochemical, biological and biophysical data necessary to represent and model the highly specialized and diversified cells that constitute living tissues. Since there are about 200 different types of cells in mammalian tissues, a huge amount of molecular, cellular and tissue data must be accounted for. This data cannot be incorporated in an ad hoc manner but, on the contrary, must be organized according to some sound principles. We give an overview of these principles and describe how they can be incorporated as proper features of a Knowledge Base System (KBS) dedicated to computational morphogenesis (CM).
Collapse
Affiliation(s)
| | | | - Peter Hufnagl
- Department of Digital Pathology and IT, Institute of Pathology, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - George Mutter
- Department of Pathology, Harvard Medical School and Brigham and Women's Hospital, Boston, MA, USA
| |
Collapse
|
70
|
Mercier J, Josso A, Médigue C, Vallenet D. GROOLS: reactive graph reasoning for genome annotation through biological processes. BMC Bioinformatics 2018; 19:132. [PMID: 29642842 PMCID: PMC5896057 DOI: 10.1186/s12859-018-2126-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2017] [Accepted: 03/22/2018] [Indexed: 11/22/2022] Open
Abstract
BACKGROUND High quality functional annotation is essential for understanding the phenotypic consequences encoded in a genome. Despite improvements in bioinformatics methods, millions of sequences in databanks are not assigned reliable functions. The curation of protein functions in the context of biological processes is a way to evaluate and improve their annotation. RESULTS We developed an expert system using paraconsistent logic, named GROOLS (Genomic Rule Object-Oriented Logic System), that evaluates the completeness and the consistency of predicted functions through biological processes like metabolic pathways. Using a generic and hierarchical representation of knowledge, biological processes are modeled in a graph from which observations (i.e. predictions and expectations) are propagated by rules. At the end of the reasoning, conclusions are assigned to biological process components and highlight uncertainties and inconsistencies. Results on 14 microbial organisms are presented. CONCLUSIONS GROOLS software is designed to evaluate the overall accuracy of functional unit and pathway predictions according to organism experimental data like growth phenotypes. It assists biocurators in the functional annotation of proteins by focusing on missing or contradictory observations.
Collapse
Affiliation(s)
- Jonathan Mercier
- LABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, Evry, 91057 France
| | - Adrien Josso
- LABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, Evry, 91057 France
| | - Claudine Médigue
- LABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, Evry, 91057 France
| | - David Vallenet
- LABGeM, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Université d’Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, Evry, 91057 France
| |
Collapse
|
71
|
Amith M, He Z, Bian J, Lossio-Ventura JA, Tao C. Assessing the practice of biomedical ontology evaluation: Gaps and opportunities. J Biomed Inform 2018; 80:1-13. [PMID: 29462669 PMCID: PMC5882531 DOI: 10.1016/j.jbi.2018.02.010] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2017] [Revised: 02/12/2018] [Accepted: 02/16/2018] [Indexed: 11/26/2022]
Abstract
With the proliferation of heterogeneous health care data in the last three decades, biomedical ontologies and controlled biomedical terminologies play a more and more important role in knowledge representation and management, data integration, natural language processing, as well as decision support for health information systems and biomedical research. Biomedical ontologies and controlled terminologies are intended to assure interoperability. Nevertheless, the quality of biomedical ontologies has hindered their applicability and subsequent adoption in real-world applications. Ontology evaluation is an integral part of ontology development and maintenance. In the biomedicine domain, ontology evaluation is often conducted by third parties as a quality assurance (or auditing) effort that focuses on identifying modeling errors and inconsistencies. In this work, we first organized four categorical schemes of ontology evaluation methods in the existing literature to create an integrated taxonomy. Further, to understand the ontology evaluation practice in the biomedicine domain, we reviewed a sample of 200 ontologies from the National Center for Biomedical Ontology (NCBO) BioPortal-the largest repository for biomedical ontologies-and observed that only 15 of these ontologies have documented evaluation in their corresponding inception papers. We then surveyed the recent quality assurance approaches for biomedical ontologies and their use. We also mapped these quality assurance approaches to the ontology evaluation criteria. It is our anticipation that ontology evaluation and quality assurance approaches will be more widely adopted in the development life cycle of biomedical ontologies.
Collapse
Affiliation(s)
- Muhammad Amith
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Zhe He
- School of Information, Florida State University, Tallahassee, FL, USA
| | - Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, USA
| | | | - Cui Tao
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA.
| |
Collapse
|
72
|
Ell SW, Smith DB, Peralta G, Hélie S. The impact of category structure and training methodology on learning and generalizing within-category representations. Atten Percept Psychophys 2017; 79:1777-94. [PMID: 28584954 DOI: 10.3758/s13414-017-1345-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
When interacting with categories, representations focused on within-category relationships are often learned, but the conditions promoting within-category representations and their generalizability are unclear. We report the results of three experiments investigating the impact of category structure and training methodology on the learning and generalization of within-category representations (i.e., correlational structure). Participants were trained on either rule-based or information-integration structures using classification (Is the stimulus a member of Category A or Category B?), concept (e.g., Is the stimulus a member of Category A, Yes or No?), or inference (infer the missing component of the stimulus from a given category) and then tested on either an inference task (Experiments 1 and 2) or a classification task (Experiment 3). For the information-integration structure, within-category representations were consistently learned, could be generalized to novel stimuli, and could be generalized to support inference at test. For the rule-based structure, extended inference training resulted in generalization to novel stimuli (Experiment 2) and inference training resulted in generalization to classification (Experiment 3). These data help to clarify the conditions under which within-category representations can be learned. Moreover, these results make an important contribution in highlighting the impact of category structure and training methodology on the generalization of categorical knowledge.
Collapse
|
73
|
Abstract
This article investigates whether, and how, an artificial intelligence (AI) system can be said to use visual, imagery-based representations in a way that is analogous to the use of visual mental imagery by people. In particular, this article aims to answer two fundamental questions about imagery-based AI systems. First, what might visual imagery look like in an AI system, in terms of the internal representations used by the system to store and reason about knowledge? Second, what kinds of intelligent tasks would an imagery-based AI system be able to accomplish? The first question is answered by providing a working definition of what constitutes an imagery-based knowledge representation, and the second question is answered through a literature survey of imagery-based AI systems that have been developed over the past several decades of AI research, spanning task domains of: 1) template-based visual search; 2) spatial and diagrammatic reasoning; 3) geometric analogies and matrix reasoning; 4) naive physics; and 5) commonsense reasoning for question answering. This article concludes by discussing three important open research questions in the study of visual-imagery-based AI systems-on evaluating system performance, learning imagery operators, and representing abstract concepts-and their implications for understanding human visual mental imagery.
Collapse
Affiliation(s)
- Maithilee Kunda
- Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA.
| |
Collapse
|
74
|
Gil de la Fuente A, Godzien J, Fernández López M, Rupérez FJ, Barbas C, Otero A. Knowledge-based metabolite annotation tool: CEU Mass Mediator. J Pharm Biomed Anal 2018; 154:138-149. [PMID: 29547800 DOI: 10.1016/j.jpba.2018.02.046] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Revised: 02/09/2018] [Accepted: 02/20/2018] [Indexed: 11/29/2022]
Abstract
CEU Mass Mediator (CMM) is an on-line tool for aiding researchers when performing metabolite annotation. Its database is comprised of 279,318 real compounds integrated from several metabolomic databases including Human Metabolome Database (HMDB), KEGG and LipidMaps and 672,042 simulated compounds from MINE. In addition, CMM scores the annotations which matched the query parameters using 122 rules based on expert knowledge. This knowledge, obtained from the Centre for Metabolomics and Bioanalysis (CEMBIO) and from a literature review, enables CMM expert system to automatically extract evidence to support or refute the annotations by checking relationships among them. CMM is the first metabolite annotation tool that uses a knowledge-driven approach to provide support to the researcher. This allows to focus on the most plausible annotations, thus saving time and minimizing mistakes.
Collapse
Affiliation(s)
- Alberto Gil de la Fuente
- Department of Information Technology, Escuela Politécnica Superior, Universidad CEU-San Pablo, Campus Montepríncipe, Boadilla del Monte, Madrid 28668, Spain; Centre for Metabolomics and Bioanalysis (CEMBIO), Facultad de Farmacia, Universidad CEU-San Pablo, Campus Montepríncipe, Boadilla del Monte, Madrid 28668, Spain.
| | - Joanna Godzien
- Centre for Metabolomics and Bioanalysis (CEMBIO), Facultad de Farmacia, Universidad CEU-San Pablo, Campus Montepríncipe, Boadilla del Monte, Madrid 28668, Spain
| | - Mariano Fernández López
- Department of Information Technology, Escuela Politécnica Superior, Universidad CEU-San Pablo, Campus Montepríncipe, Boadilla del Monte, Madrid 28668, Spain; Centre for Metabolomics and Bioanalysis (CEMBIO), Facultad de Farmacia, Universidad CEU-San Pablo, Campus Montepríncipe, Boadilla del Monte, Madrid 28668, Spain
| | - Francisco J Rupérez
- Centre for Metabolomics and Bioanalysis (CEMBIO), Facultad de Farmacia, Universidad CEU-San Pablo, Campus Montepríncipe, Boadilla del Monte, Madrid 28668, Spain
| | - Coral Barbas
- Centre for Metabolomics and Bioanalysis (CEMBIO), Facultad de Farmacia, Universidad CEU-San Pablo, Campus Montepríncipe, Boadilla del Monte, Madrid 28668, Spain
| | - Abraham Otero
- Department of Information Technology, Escuela Politécnica Superior, Universidad CEU-San Pablo, Campus Montepríncipe, Boadilla del Monte, Madrid 28668, Spain; Centre for Metabolomics and Bioanalysis (CEMBIO), Facultad de Farmacia, Universidad CEU-San Pablo, Campus Montepríncipe, Boadilla del Monte, Madrid 28668, Spain
| |
Collapse
|
75
|
Milchenko M, Snyder AZ, LaMontagne P, Shimony JS, Benzinger TL, Fouke SJ, Marcus DS. Heterogeneous Optimization Framework: Reproducible Preprocessing of Multi-Spectral Clinical MRI for Neuro-Oncology Imaging Research. Neuroinformatics 2016; 14:305-17. [PMID: 26910516 DOI: 10.1007/s12021-016-9296-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Neuroimaging research often relies on clinically acquired magnetic resonance imaging (MRI) datasets that can originate from multiple institutions. Such datasets are characterized by high heterogeneity of modalities and variability of sequence parameters. This heterogeneity complicates the automation of image processing tasks such as spatial co-registration and physiological or functional image analysis. Given this heterogeneity, conventional processing workflows developed for research purposes are not optimal for clinical data. In this work, we describe an approach called Heterogeneous Optimization Framework (HOF) for developing image analysis pipelines that can handle the high degree of clinical data non-uniformity. HOF provides a set of guidelines for configuration, algorithm development, deployment, interpretation of results and quality control for such pipelines. At each step, we illustrate the HOF approach using the implementation of an automated pipeline for Multimodal Glioma Analysis (MGA) as an example. The MGA pipeline computes tissue diffusion characteristics of diffusion tensor imaging (DTI) acquisitions, hemodynamic characteristics using a perfusion model of susceptibility contrast (DSC) MRI, and spatial cross-modal co-registration of available anatomical, physiological and derived patient images. Developing MGA within HOF enabled the processing of neuro-oncology MR imaging studies to be fully automated. MGA has been successfully used to analyze over 160 clinical tumor studies to date within several research projects. Introduction of the MGA pipeline improved image processing throughput and, most importantly, effectively produced co-registered datasets that were suitable for advanced analysis despite high heterogeneity in acquisition protocols.
Collapse
|
76
|
Garcia-Gathright JI, Matiasz NJ, Adame C, Sarma KV, Sauer L, Smedley NF, Spiegel ML, Strunck J, Garon EB, Taira RK, Aberle DR, Bui AAT. Evaluating Casama: Contextualized semantic maps for summarization of lung cancer studies. Comput Biol Med 2018; 92:55-63. [PMID: 29149658 PMCID: PMC5762403 DOI: 10.1016/j.compbiomed.2017.10.034] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2017] [Revised: 10/28/2017] [Accepted: 10/29/2017] [Indexed: 01/15/2023]
Abstract
OBJECTIVE It is crucial for clinicians to stay up to date on current literature in order to apply recent evidence to clinical decision making. Automatic summarization systems can help clinicians quickly view an aggregated summary of literature on a topic. Casama, a representation and summarization system based on "contextualized semantic maps," captures the findings of biomedical studies as well as the contexts associated with patient population and study design. This paper presents a user-oriented evaluation of Casama in comparison to a context-free representation, SemRep. MATERIALS AND METHODS The effectiveness of the representation was evaluated by presenting users with manually annotated Casama and SemRep summaries of ten articles on driver mutations in cancer. Automatic annotations were evaluated on a collection of articles on EGFR mutation in lung cancer. Seven users completed a questionnaire rating the summarization quality for various topics and applications. RESULTS Casama had higher median scores than SemRep for the majority of the topics (p≤ 0.00032), all of the applications (p≤ 0.00089), and in overall summarization quality (p≤ 1.5e-05). Casama's manual annotations outperformed Casama's automatic annotations (p = 0.00061). DISCUSSION Casama performed particularly well in the representation of strength of evidence, which was highly rated both quantitatively and qualitatively. Users noted that Casama's less granular, more targeted representation improved usability compared to SemRep. CONCLUSION This evaluation demonstrated the benefits of a contextualized representation for summarizing biomedical literature on cancer. Iteration on specific areas of Casama's representation, further development of its algorithms, and a clinically-oriented evaluation are warranted.
Collapse
Affiliation(s)
- Jean I Garcia-Gathright
- University of California, Los Angeles, Department of Bioengineering, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA.
| | - Nicholas J Matiasz
- University of California, Los Angeles, Department of Bioengineering, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA
| | - Carlos Adame
- University of California, Los Angeles, Department of Medicine - Division of Hematology-Oncology, 924 Westwood Boulevard, Suite 200, Los Angeles, CA, 90024, USA
| | - Karthik V Sarma
- University of California, Los Angeles, Department of Bioengineering, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA
| | - Lauren Sauer
- University of California, Los Angeles, Department of Medicine - Division of Hematology-Oncology, 924 Westwood Boulevard, Suite 200, Los Angeles, CA, 90024, USA
| | - Nova F Smedley
- University of California, Los Angeles, Department of Bioengineering, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA
| | - Marshall L Spiegel
- University of California, Los Angeles, Department of Medicine - Division of Hematology-Oncology, 924 Westwood Boulevard, Suite 200, Los Angeles, CA, 90024, USA
| | - Jennifer Strunck
- University of California, Los Angeles, Department of Medicine - Division of Hematology-Oncology, 924 Westwood Boulevard, Suite 200, Los Angeles, CA, 90024, USA
| | - Edward B Garon
- University of California, Los Angeles, Department of Medicine - Division of Hematology-Oncology, 924 Westwood Boulevard, Suite 200, Los Angeles, CA, 90024, USA
| | - Ricky K Taira
- University of California, Los Angeles, Department of Bioengineering, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA; University of California, Los Angeles, Department of Radiological Sciences, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA
| | - Denise R Aberle
- University of California, Los Angeles, Department of Bioengineering, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA; University of California, Los Angeles, Department of Radiological Sciences, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA
| | - Alex A T Bui
- University of California, Los Angeles, Department of Bioengineering, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA; University of California, Los Angeles, Department of Radiological Sciences, 924 Westwood Boulevard, Suite 420, Los Angeles, CA, 90024, USA
| |
Collapse
|
77
|
Kamišalić A, Riaño D, Welzer T. Knowledge Formalization to Support Decision-Making in Heart Failure Treatment. Stud Health Technol Inform 2018; 255:137-141. [PMID: 30306923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
This paper presents medical knowledge representation of data provided within Clinical Practice Guidelines for Heart Failure. The formalization is provided in order to support taking decisions on an appropriate treatment strategy for a specific patient. An intuitive and efficient mechanism of medical knowledge formalization, called extended Timed Transition Diagram (eTTD), is used to represent acquired medical knowledge. The presented models can be used to help students in their training as well as to support physicians with their decision-making tasks.
Collapse
Affiliation(s)
- Aida Kamišalić
- University of Maribor, Faculty of Electrical Engineering and Computer Science, Maribor, Slovenia
| | - David Riaño
- Research Group on Artificial Intelligence, Universitat Rovira i Virgili, Tarragona, Spain
| | - Tatjana Welzer
- University of Maribor, Faculty of Electrical Engineering and Computer Science, Maribor, Slovenia
| |
Collapse
|
78
|
Abstract
Background An experimental protocol is a sequence of tasks and operations executed to perform experimental research in biological and biomedical areas, e.g. biology, genetics, immunology, neurosciences, virology. Protocols often include references to equipment, reagents, descriptions of critical steps, troubleshooting and tips, as well as any other information that researchers deem important for facilitating the reusability of the protocol. Although experimental protocols are central to reproducibility, the descriptions are often cursory. There is the need for a unified framework with respect to the syntactic structure and the semantics for representing experimental protocols. Results In this paper we present “SMART Protocols ontology”, an ontology for representing experimental protocols. Our ontology represents the protocol as a workflow with domain specific knowledge embedded within a document. We also present the Sample Instrument Reagent Objective (SIRO) model, which represents the minimal common information shared across experimental protocols. SIRO was conceived in the same realm as the Patient Intervention Comparison Outcome (PICO) model that supports search, retrieval and classification purposes in evidence based medicine. We evaluate our approach against a set of competency questions modeled as SPARQL queries and processed against a set of published and unpublished protocols modeled with the SP Ontology and the SIRO model. Our approach makes it possible to answer queries such as Which protocols use tumor tissue as a sample. Conclusion Improving reporting structures for experimental protocols requires collective efforts from authors, peer reviewers, editors and funding bodies. The SP Ontology is a contribution towards this goal. We build upon previous experiences and bringing together the view of researchers managing protocols in their laboratory work. Website: https://smartprotocols.github.io/.
Collapse
Affiliation(s)
- Olga Giraldo
- Ontology Engineering Group, Madrid, Universidad Politécnica de Madrid, Madrid, 28660, Spain.
| | - Alexander García
- Ontology Engineering Group, Madrid, Universidad Politécnica de Madrid, Madrid, 28660, Spain
| | | | - Oscar Corcho
- Ontology Engineering Group, Madrid, Universidad Politécnica de Madrid, Madrid, 28660, Spain
| |
Collapse
|
79
|
Pfaff M, Neubig S, Krcmar H. Ontology for Semantic Data Integration in the Domain of IT Benchmarking. J Data Semant 2017; 7:29-46. [PMID: 29497460 PMCID: PMC5816769 DOI: 10.1007/s13740-017-0084-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/01/2017] [Revised: 07/05/2017] [Accepted: 11/02/2017] [Indexed: 10/26/2022]
Abstract
A domain-specific ontology for IT benchmarking has been developed to bridge the gap between a systematic characterization of IT services and their data-based valuation. Since information is generally collected during a benchmark exercise using questionnaires on a broad range of topics, such as employee costs, software licensing costs, and quantities of hardware, it is commonly stored as natural language text; thus, this information is stored in an intrinsically unstructured form. Although these data form the basis for identifying potentials for IT cost reductions, neither a uniform description of any measured parameters nor the relationship between such parameters exists. Hence, this work proposes an ontology for the domain of IT benchmarking, available at https://w3id.org/bmontology. The design of this ontology is based on requirements mainly elicited from a domain analysis, which considers analyzing documents and interviews with representatives from Small- and Medium-Sized Enterprises and Information and Communications Technology companies over the last eight years. The development of the ontology and its main concepts is described in detail (i.e., the conceptualization of benchmarking events, questionnaires, IT services, indicators and their values) together with its alignment with the DOLCE-UltraLite foundational ontology.
Collapse
Affiliation(s)
- Matthias Pfaff
- 1fortiss GmbH, An-Institut Technische Universität München (TUM), Munich, Germany
| | - Stefan Neubig
- 2Technische Universität München (TUM), Munich, Germany
| | - Helmut Krcmar
- 2Technische Universität München (TUM), Munich, Germany
| |
Collapse
|
80
|
Abstract
Recently, comparative psychologists have suggested that primates represent others' knowledge states. Evidence for this claim comes from studies demonstrating that primates expect others to maintain representations of objects when those objects are not currently visible. However, little work has explored whether nonhuman primates expect others to share the more sophisticated kinds of object knowledge that they themselves possess. We therefore investigated whether primates attribute to others knowledge that is acquired through the mental transformation of a static object representation. Specifically, we tested whether rhesus macaques (Macaca mulatta) expected a human demonstrator to solve a difficult rotational displacement task. In Experiment 1, monkeys watched a demonstrator hide a piece of fruit in one of two boxes. The monkey and the demonstrator then watched the boxes rotate 180°. We found that monkeys looked longer when the demonstrator reached into the box that did not contain the fruit, indicating that they expected her to be able to track the fruit to its current location. In Experiment 2, we ruled out the possibility that monkeys simply expected the demonstrator to search for the food in its true location. When the demonstrator did not witness the rotation event, monkeys looked equally long at the two reaching outcomes. These results are consistent with the interpretation that rhesus macaques expect others to dynamically update their representations of unseen objects.
Collapse
|
81
|
Deléger L, Campillos L, Ligozat AL, Névéol A. Design of an extensive information representation scheme for clinical narratives. J Biomed Semantics 2017; 8:37. [PMID: 28893314 PMCID: PMC5594525 DOI: 10.1186/s13326-017-0135-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2016] [Accepted: 07/26/2017] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Knowledge representation frameworks are essential to the understanding of complex biomedical processes, and to the analysis of biomedical texts that describe them. Combined with natural language processing (NLP), they have the potential to contribute to retrospective studies by unlocking important phenotyping information contained in the narrative content of electronic health records (EHRs). This work aims to develop an extensive information representation scheme for clinical information contained in EHR narratives, and to support secondary use of EHR narrative data to answer clinical questions. METHODS We review recent work that proposed information representation schemes and applied them to the analysis of clinical narratives. We then propose a unifying scheme that supports the extraction of information to address a large variety of clinical questions. RESULTS We devised a new information representation scheme for clinical narratives that comprises 13 entities, 11 attributes and 37 relations. The associated annotation guidelines can be used to consistently apply the scheme to clinical narratives and are https://cabernet.limsi.fr/annotation_guide_for_the_merlot_french_clinical_corpus-Sept2016.pdf . CONCLUSION The information scheme includes many elements of the major schemes described in the clinical natural language processing literature, as well as a uniquely detailed set of relations.
Collapse
Affiliation(s)
- Louise Deléger
- French National Institute for Agricultural Research (INRA), Domaine de Vilvert, Jouy en Josas, Paris, 78352, France.,LIMSI, CNRS, Université Paris - Saclay, Rue John von Neumann, Orsay, 91405, France
| | - Leonardo Campillos
- LIMSI, CNRS, Université Paris - Saclay, Rue John von Neumann, Orsay, 91405, France
| | - Anne-Laure Ligozat
- LIMSI, CNRS, Université Paris - Saclay, Rue John von Neumann, Orsay, 91405, France.,ENSIIE, 1 square de la résistance, Évry Cedex, 91025, France
| | - Aurélie Névéol
- LIMSI, CNRS, Université Paris - Saclay, Rue John von Neumann, Orsay, 91405, France.
| |
Collapse
|
82
|
Zhao C, Jiang J, Xu Z, Guan Y. A study of EMR-based medical knowledge network and its applications. Comput Methods Programs Biomed 2017; 143:13-23. [PMID: 28391811 DOI: 10.1016/j.cmpb.2017.02.016] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2016] [Revised: 01/23/2017] [Accepted: 02/09/2017] [Indexed: 06/07/2023]
Abstract
BACKGROUND AND OBJECTIVE Electronic medical records (EMRs) contain an amount of medical knowledge which can be used for clinical decision support. We attempt to integrate this medical knowledge into a complex network, and then implement a diagnosis model based on this network. METHODS The dataset of our study contains 992 records which are uniformly sampled from different departments of the hospital. In order to integrate the knowledge of these records, an EMR-based medical knowledge network (EMKN) is constructed. This network takes medical entities as nodes, and co-occurrence relationships between the two entities as edges. Selected properties of this network are analyzed. To make use of this network, a basic diagnosis model is implemented. Seven hundred records are randomly selected to re-construct the network, and the remaining 292 records are used as test records. The vector space model is applied to illustrate the relationships between diseases and symptoms. Because there may exist more than one actual disease in a record, the recall rate of the first ten results, and the average precision are adopted as evaluation measures. RESULTS Compared with a random network of the same size, this network has a similar average length but a much higher clustering coefficient. Additionally, it can be observed that there are direct correlations between the community structure and the real department classes in the hospital. For the diagnosis model, the vector space model using disease as a base obtains the best result. At least one accurate disease can be obtained in 73.27% of the records in the first ten results. CONCLUSION We constructed an EMR-based medical knowledge network by extracting the medical entities. This network has the small-world and scale-free properties. Moreover, the community structure showed that entities in the same department have a tendency to be self-aggregated. Based on this network, a diagnosis model was proposed. This model uses only the symptoms as inputs and is not restricted to a specific disease. The experiments conducted demonstrated that EMKN is a simple and universal technique to integrate different medical knowledge from EMRs, and can be used for clinical decision support.
Collapse
Affiliation(s)
- Chao Zhao
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Jingchi Jiang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Zhiming Xu
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Yi Guan
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| |
Collapse
|
83
|
Chiu PH, Hripcsak G. EHR-based phenotyping: Bulk learning and evaluation. J Biomed Inform 2017; 70:35-51. [PMID: 28410982 DOI: 10.1016/j.jbi.2017.04.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2016] [Revised: 03/09/2017] [Accepted: 04/10/2017] [Indexed: 01/29/2023]
Abstract
In data-driven phenotyping, a core computational task is to identify medical concepts and their variations from sources of electronic health records (EHR) to stratify phenotypic cohorts. A conventional analytic framework for phenotyping largely uses a manual knowledge engineering approach or a supervised learning approach where clinical cases are represented by variables encompassing diagnoses, medicinal treatments and laboratory tests, among others. In such a framework, tasks associated with feature engineering and data annotation remain a tedious and expensive exercise, resulting in poor scalability. In addition, certain clinical conditions, such as those that are rare and acute in nature, may never accumulate sufficient data over time, which poses a challenge to establishing accurate and informative statistical models. In this paper, we use infectious diseases as the domain of study to demonstrate a hierarchical learning method based on ensemble learning that attempts to address these issues through feature abstraction. We use a sparse annotation set to train and evaluate many phenotypes at once, which we call bulk learning. In this batch-phenotyping framework, disease cohort definitions can be learned from within the abstract feature space established by using multiple diseases as a substrate and diagnostic codes as surrogates. In particular, using surrogate labels for model training renders possible its subsequent evaluation using only a sparse annotated sample. Moreover, statistical models can be trained and evaluated, using the same sparse annotation, from within the abstract feature space of low dimensionality that encapsulates the shared clinical traits of these target diseases, collectively referred to as the bulk learning set.
Collapse
|
84
|
Abstract
As molecular biology has increasingly become a data-intensive discipline, ontologies have emerged as an essential computational tool to assist in the organisation, description and analysis of data. Ontologies describe and classify the entities of interest in a scientific domain in a computationally accessible fashion such that algorithms and tools can be developed around them. The technology that underlies ontologies has its roots in logic-based artificial intelligence, allowing for sophisticated automated inference and error detection. This chapter presents a general introduction to modern computational ontologies as they are used in biology.
Collapse
Affiliation(s)
- Janna Hastings
- Cheminformatics and Metabolism, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK.
| |
Collapse
|
85
|
Tremblay S, Gagnon JF, Lafond D, Hodgetts HM, Doiron M, Jeuniaux PPJMH. A cognitive prosthesis for complex decision-making. Appl Ergon 2017; 58:349-360. [PMID: 27633232 DOI: 10.1016/j.apergo.2016.07.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Revised: 06/08/2016] [Accepted: 07/18/2016] [Indexed: 06/06/2023]
Abstract
While simple heuristics can be ecologically rational and effective in naturalistic decision making contexts, complex situations require analytical decision making strategies, hypothesis-testing and learning. Sub-optimal decision strategies - using simplified as opposed to analytic decision rules - have been reported in domains such as healthcare, military operational planning, and government policy making. We investigate the potential of a computational toolkit called "IMAGE" to improve decision-making by developing structural knowledge and increasing understanding of complex situations. IMAGE is tested within the context of a complex military convoy management task through (a) interactive simulations, and (b) visualization and knowledge representation capabilities. We assess the usefulness of two versions of IMAGE (desktop and immersive) compared to a baseline. Results suggest that the prosthesis helped analysts in making better decisions, but failed to increase their structural knowledge about the situation once the cognitive prosthesis is removed.
Collapse
Affiliation(s)
| | | | - Daniel Lafond
- Thales Research & Technology Canada, Québec City, Québec, Canada
| | - Helen M Hodgetts
- École de psychologie, Université Laval, Québec, Canada; Department of Applied Psychology, Cardiff Metropolitan University, Cardiff, UK
| | - Maxime Doiron
- École de psychologie, Université Laval, Québec, Canada
| | | |
Collapse
|
86
|
Abstract
Introduction: Intelligent Diagnostic Assistant can be used for complicated diagnosis of skin diseases, which are among the most common causes of disability. The aim of this study was to design and implement a computerized intelligent diagnostic assistant for complicated skin diseases through C5’s Algorithm. Method: An applied-developmental study was done in 2015. Knowledge base was developed based on interviews with dermatologists through questionnaires and checklists. Knowledge representation was obtained from the train data in the database using Excel Microsoft Office. Clementine Software and C5’s Algorithms were applied to draw the decision tree. Analysis of test accuracy was performed based on rules extracted using inference chains. The rules extracted from the decision tree were entered into the CLIPS programming environment and the intelligent diagnostic assistant was designed then. Results: The rules were defined using forward chaining inference technique and were entered into Clips programming environment as RULE. The accuracy and error rates obtained in the training phase from the decision tree were 99.56% and 0.44%, respectively. The accuracy of the decision tree was 98% and the error was 2% in the test phase. Conclusion: Intelligent diagnostic assistant can be used as a reliable system with high accuracy, sensitivity, specificity, and agreement.
Collapse
Affiliation(s)
| | - Masoud Arabfard
- Tehran University, Kish International Campus, Islamic Republic of Iran
| | - Zahra Arab Kermany
- Health Information Technology Depertment, Kashan University of Medical Science, Kashan, Iran
| |
Collapse
|
87
|
Berges I, Antón D, Bermúdez J, Goñi A, Illarramendi A. TrhOnt: building an ontology to assist rehabilitation processes. J Biomed Semantics 2016; 7:60. [PMID: 27716359 PMCID: PMC5050577 DOI: 10.1186/s13326-016-0104-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2016] [Accepted: 09/20/2016] [Indexed: 11/21/2022] Open
Abstract
Background One of the current research efforts in the area of biomedicine is the representation of knowledge in a structured way so that reasoning can be performed on it. More precisely, in the field of physiotherapy, information such as the physiotherapy record of a patient or treatment protocols for specific disorders must be adequately modeled, because they play a relevant role in the management of the evolutionary recovery process of a patient. In this scenario, we introduce TrhOnt, an application ontology that can assist physiotherapists in the management of the patients’ evolution via reasoning supported by semantic technology. Methods The ontology was developed following the NeOn Methodology. It integrates knowledge from ontological (e.g. FMA ontology) and non-ontological resources (e.g. a database of movements, exercises and treatment protocols) as well as additional physiotherapy-related knowledge. Results We demonstrate how the ontology fulfills the purpose of providing a reference model for the representation of the physiotherapy-related information that is needed for the whole physiotherapy treatment of patients, since they step for the first time into the physiotherapist’s office, until they are discharged. More specifically, we present the results for each of the intended uses of the ontology listed in the document that specifies its requirements, and show how TrhOnt can answer the competency questions defined within that document. Moreover, we detail the main steps of the process followed to build the TrhOnt ontology in order to facilitate its reproducibility in a similar context. Finally, we show an evaluation of the ontology from different perspectives. Conclusions TrhOnt has achieved the purpose of allowing for a reasoning process that changes over time according to the patient’s state and performance.
Collapse
Affiliation(s)
- Idoia Berges
- University of the Basque Country, UPV/EHU, Paseo Manuel de Lardizabal, 1, Donostia-San Sebastián, 20018, Spain.
| | - David Antón
- University of the Basque Country, UPV/EHU, Paseo Manuel de Lardizabal, 1, Donostia-San Sebastián, 20018, Spain
| | - Jesús Bermúdez
- University of the Basque Country, UPV/EHU, Paseo Manuel de Lardizabal, 1, Donostia-San Sebastián, 20018, Spain
| | - Alfredo Goñi
- University of the Basque Country, UPV/EHU, Paseo Manuel de Lardizabal, 1, Donostia-San Sebastián, 20018, Spain
| | - Arantza Illarramendi
- University of the Basque Country, UPV/EHU, Paseo Manuel de Lardizabal, 1, Donostia-San Sebastián, 20018, Spain
| |
Collapse
|
88
|
Abstract
The plots of stories are known to follow general patterns in terms of their overall structure. This was the basic tenet of structuralist approaches to narratology. Vladimir Propp proposed a procedure for the generation of new tales based on his semi-formal description of the structure of Russian folk tales. This is one of the first existing instances of a creative process described procedurally. The present paper revisits Propp's morphology to build a system that generates instances of Russian folk tales. Propp's view of the folk tale as a rigid sequence of character functions is employed as a plot driver, and some issues that Propp declared relevant but did not explore in detail-such as long-range dependencies between functions or the importance of endings-are given computational shape in the context of a broader architecture that captures all the aspects discussed by Propp. A set of simple evaluation metrics for the resulting outputs is defined inspired on Propp's formalism. The potential of the resulting system for providing a creative story generation system is discussed, and possible lines of future work are discussed.
Collapse
Affiliation(s)
- Pablo Gervás
- Instituto de Tecnología del Conocimiento, Universidad Complutense de Madrid, 28040 Madrid, Spain
| |
Collapse
|
89
|
Daniel C, Ouagne D, Sadou E, Forsberg K, Gilchrist MM, Zapletal E, Paris N, Hussain S, Jaulent MC, Kalra D. Cross border semantic interoperability for clinical research: the EHR4CR semantic resources and services. AMIA Jt Summits Transl Sci Proc 2016; 2016:51-9. [PMID: 27570649 PMCID: PMC5001763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/31/2022]
Abstract
With the development of platforms enabling the use of routinely collected clinical data in the context of international clinical research, scalable solutions for cross border semantic interoperability need to be developed. Within the context of the IMI EHR4CR project, we first defined the requirements and evaluation criteria of the EHR4CR semantic interoperability platform and then developed the semantic resources and supportive services and tooling to assist hospital sites in standardizing their data for allowing the execution of the project use cases. The experience gained from the evaluation of the EHR4CR platform accessing to semantically equivalent data elements across 11 European participating EHR systems from 5 countries demonstrated how far the mediation model and mapping efforts met the expected requirements of the project. Developers of semantic interoperability platforms are beginning to address a core set of requirements in order to reach the goal of developing cross border semantic integration of data.
Collapse
Affiliation(s)
- Christel Daniel
- INSERM, U1142, LIMICS, F-75006, Paris, France; Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, F-75006, Paris, France;; AP-HP, Paris, France
| | - David Ouagne
- INSERM, U1142, LIMICS, F-75006, Paris, France; Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, F-75006, Paris, France
| | - Eric Sadou
- INSERM, U1142, LIMICS, F-75006, Paris, France; Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, F-75006, Paris, France;; AP-HP, Paris, France
| | | | | | | | | | - Sajjad Hussain
- INSERM, U1142, LIMICS, F-75006, Paris, France; Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, F-75006, Paris, France
| | - Marie-Christine Jaulent
- INSERM, U1142, LIMICS, F-75006, Paris, France; Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, F-75006, Paris, France
| | | |
Collapse
|
90
|
Souvignet J, Declerck G, Asfari H, Jaulent MC, Bousquet C. OntoADR a semantic resource describing adverse drug reactions to support searching, coding, and information retrieval. J Biomed Inform 2016; 63:100-107. [PMID: 27369567 DOI: 10.1016/j.jbi.2016.06.010] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2016] [Revised: 06/25/2016] [Accepted: 06/27/2016] [Indexed: 10/21/2022]
Abstract
INTRODUCTION Efficient searching and coding in databases that use terminological resources requires that they support efficient data retrieval. The Medical Dictionary for Regulatory Activities (MedDRA) is a reference terminology for several countries and organizations to code adverse drug reactions (ADRs) for pharmacovigilance. Ontologies that are available in the medical domain provide several advantages such as reasoning to improve data retrieval. The field of pharmacovigilance does not yet benefit from a fully operational ontology to formally represent the MedDRA terms. Our objective was to build a semantic resource based on formal description logic to improve MedDRA term retrieval and aid the generation of on-demand custom groupings by appropriately and efficiently selecting terms: OntoADR. METHODS The method consists of the following steps: (1) mapping between MedDRA terms and SNOMED-CT, (2) generation of semantic definitions using semi-automatic methods, (3) storage of the resource and (4) manual curation by pharmacovigilance experts. RESULTS We built a semantic resource for ADRs enabling a new type of semantics-based term search. OntoADR adds new search capabilities relative to previous approaches, overcoming the usual limitations of computation using lightweight description logic, such as the intractability of unions or negation queries, bringing it closer to user needs. Our automated approach for defining MedDRA terms enabled the association of at least one defining relationship with 67% of preferred terms. The curation work performed on our sample showed an error level of 14% for this automated approach. We tested OntoADR in practice, which allowed us to build custom groupings for several medical topics of interest. DISCUSSION The methods we describe in this article could be adapted and extended to other terminologies which do not benefit from a formal semantic representation, thus enabling better data retrieval performance. Our custom groupings of MedDRA terms were used while performing signal detection, which suggests that the graphical user interface we are currently implementing to process OntoADR could be usefully integrated into specialized pharmacovigilance software that rely on MedDRA.
Collapse
Affiliation(s)
- Julien Souvignet
- INSERM, U1142, LIMICS, F-75006 Paris, France; Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, F-75006 Paris, France; Université Paris 13, Sorbonne Paris Cité, LIMICS, (UMR_S 1142), F-93430 Villetaneuse, France; SSPIM, CHU University Hospital of Saint Etienne, Saint Etienne, France
| | - Gunnar Declerck
- Sorbonne Universités, Université de technologie de Compiègne, EA 2223 Costech (Connaissance, Organisation et Systèmes Techniques), Centre Pierre Guillaumat, CS 60 319, 60 203 Compiègne cedex, France
| | - Hadyl Asfari
- INSERM, U1142, LIMICS, F-75006 Paris, France; Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, F-75006 Paris, France; Université Paris 13, Sorbonne Paris Cité, LIMICS, (UMR_S 1142), F-93430 Villetaneuse, France
| | - Marie-Christine Jaulent
- INSERM, U1142, LIMICS, F-75006 Paris, France; Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, F-75006 Paris, France; Université Paris 13, Sorbonne Paris Cité, LIMICS, (UMR_S 1142), F-93430 Villetaneuse, France
| | - Cédric Bousquet
- INSERM, U1142, LIMICS, F-75006 Paris, France; Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, F-75006 Paris, France; Université Paris 13, Sorbonne Paris Cité, LIMICS, (UMR_S 1142), F-93430 Villetaneuse, France; SSPIM, CHU University Hospital of Saint Etienne, Saint Etienne, France.
| |
Collapse
|
91
|
Scuba W, Tharp M, Mowery D, Tseytlin E, Liu Y, Drews FA, Chapman WW. Knowledge Author: facilitating user-driven, domain content development to support clinical information extraction. J Biomed Semantics 2016; 7:42. [PMID: 27338146 PMCID: PMC4919842 DOI: 10.1186/s13326-016-0086-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2015] [Accepted: 06/01/2016] [Indexed: 11/26/2022] Open
Abstract
BACKGROUND Clinical Natural Language Processing (NLP) systems require a semantic schema comprised of domain-specific concepts, their lexical variants, and associated modifiers to accurately extract information from clinical texts. An NLP system leverages this schema to structure concepts and extract meaning from the free texts. In the clinical domain, creating a semantic schema typically requires input from both a domain expert, such as a clinician, and an NLP expert who will represent clinical concepts created from the clinician's domain expertise into a computable format usable by an NLP system. The goal of this work is to develop a web-based tool, Knowledge Author, that bridges the gap between the clinical domain expert and the NLP system development by facilitating the development of domain content represented in a semantic schema for extracting information from clinical free-text. RESULTS Knowledge Author is a web-based, recommendation system that supports users in developing domain content necessary for clinical NLP applications. Knowledge Author's schematic model leverages a set of semantic types derived from the Secondary Use Clinical Element Models and the Common Type System to allow the user to quickly create and modify domain-related concepts. Features such as collaborative development and providing domain content suggestions through the mapping of concepts to the Unified Medical Language System Metathesaurus database further supports the domain content creation process. Two proof of concept studies were performed to evaluate the system's performance. The first study evaluated Knowledge Author's flexibility to create a broad range of concepts. A dataset of 115 concepts was created of which 87 (76 %) were able to be created using Knowledge Author. The second study evaluated the effectiveness of Knowledge Author's output in an NLP system by extracting concepts and associated modifiers representing a clinical element, carotid stenosis, from 34 clinical free-text radiology reports using Knowledge Author and an NLP system, pyConText. Knowledge Author's domain content produced high recall for concepts (targeted findings: 86 %) and varied recall for modifiers (certainty: 91 % sidedness: 80 %, neurovascular anatomy: 46 %). CONCLUSION Knowledge Author can support clinical domain content development for information extraction by supporting semantic schema creation by domain experts.
Collapse
Affiliation(s)
- William Scuba
- Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, 84108, USA
| | - Melissa Tharp
- Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, 84108, USA
| | - Danielle Mowery
- Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, 84108, USA
| | - Eugene Tseytlin
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, 15206, USA
| | - Yang Liu
- University of California, San Diego, CA, 92093, USA
| | - Frank A Drews
- Department of Psychology, University of Utah, Salt Lake City, UT, 84108, USA
| | - Wendy W Chapman
- Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, 84108, USA.
| |
Collapse
|
92
|
Fernández-Breis JT, Chiba H, Legaz-García MDC, Uchiyama I. The Orthology Ontology: development and applications. J Biomed Semantics 2016; 7:34. [PMID: 27259657 PMCID: PMC4893294 DOI: 10.1186/s13326-016-0077-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Accepted: 05/17/2016] [Indexed: 11/16/2022] Open
Abstract
Background Computational comparative analysis of multiple genomes provides valuable opportunities to biomedical research. In particular, orthology analysis can play a central role in comparative genomics; it guides establishing evolutionary relations among genes of organisms and allows functional inference of gene products. However, the wide variations in current orthology databases necessitate the research toward the shareability of the content that is generated by different tools and stored in different structures. Exchanging the content with other research communities requires making the meaning of the content explicit. Description The need for a common ontology has led to the creation of the Orthology Ontology (ORTH) following the best practices in ontology construction. Here, we describe our model and major entities of the ontology that is implemented in the Web Ontology Language (OWL), followed by the assessment of the quality of the ontology and the application of the ORTH to existing orthology datasets. This shareable ontology enables the possibility to develop Linked Orthology Datasets and a meta-predictor of orthology through standardization for the representation of orthology databases. The ORTH is freely available in OWL format to all users at http://purl.org/net/orth. Conclusions The Orthology Ontology can serve as a framework for the semantic standardization of orthology content and it will contribute to a better exploitation of orthology resources in biomedical research. The results demonstrate the feasibility of developing shareable datasets using this ontology. Further applications will maximize the usefulness of this ontology.
Collapse
Affiliation(s)
| | - Hirokazu Chiba
- National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, 444-8585, Aichi, Japan
| | | | - Ikuo Uchiyama
- National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, 444-8585, Aichi, Japan
| |
Collapse
|
93
|
Robson B, Boray S. Data-mining to build a knowledge representation store for clinical decision support. Studies on curation and validation based on machine performance in multiple choice medical licensing examinations. Comput Biol Med 2016; 73:71-93. [PMID: 27089305 PMCID: PMC7094475 DOI: 10.1016/j.compbiomed.2016.02.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Revised: 02/05/2016] [Accepted: 02/17/2016] [Indexed: 11/23/2022]
Abstract
Extracting medical knowledge by structured data mining of many medical records and from unstructured data mining of natural language source text on the Internet will become increasingly important for clinical decision support. Output from these sources can be transformed into large numbers of elements of knowledge in a Knowledge Representation Store (KRS), here using the notation and to some extent the algebraic principles of the Q-UEL Web-based universal exchange and inference language described previously, rooted in Dirac notation from quantum mechanics and linguistic theory. In a KRS, semantic structures or statements about the world of interest to medicine are analogous to natural language sentences seen as formed from noun phrases separated by verbs, prepositions and other descriptions of relationships. A convenient method of testing and better curating these elements of knowledge is by having the computer use them to take the test of a multiple choice medical licensing examination. It is a venture which perhaps tells us almost as much about the reasoning of students and examiners as it does about the requirements for Artificial Intelligence as employed in clinical decision making. It emphasizes the role of context and of contextual probabilities as opposed to the more familiar intrinsic probabilities, and of a preliminary form of logic that we call presyllogistic reasoning.
Collapse
Affiliation(s)
- Barry Robson
- Ingine Inc., DE, USA; The Dirac Foundation clg, Oxfordshire, UK; St. Matthew's University School of Medicine, Cayman Islands.
| | - Srinidhi Boray
- Ingine Inc., DE, USA; The Dirac Foundation clg, Oxfordshire, UK
| |
Collapse
|
94
|
Heiyanthuduwage SR, Schwitter R, Orgun MA. OWL 2 learn profile: an ontology sublanguage for the learning domain. Springerplus 2016; 5:291. [PMID: 27066328 PMCID: PMC4781826 DOI: 10.1186/s40064-016-1826-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Accepted: 02/15/2016] [Indexed: 11/24/2022]
Abstract
Many experimental ontologies have been developed for the learning domain for use at different institutions. These ontologies include different OWL/OWL 2 (Web Ontology Language) constructors. However, it is not clear which OWL 2 constructors are the most appropriate ones for designing ontologies for the learning domain. It is possible that the constructors used in these learning domain ontologies match one of the three standard OWL 2 profiles (sublanguages). To investigate whether this is the case, we have analysed a corpus of 14 ontologies designed for the learning domain. We have also compared the constructors used in these ontologies with those of the OWL 2 RL profile, one of the OWL 2 standard profiles. The results of our analysis suggest that the OWL 2 constructors used in these ontologies do not exactly match the standard OWL 2 RL profile, but form a subset of that profile which we call OWL 2 Learn.
Collapse
Affiliation(s)
- Sudath R Heiyanthuduwage
- Department of Computing, Faculty of Science and Engineering, Macquarie University, Sydney, NSW 2109 Australia
| | - Rolf Schwitter
- Department of Computing, Faculty of Science and Engineering, Macquarie University, Sydney, NSW 2109 Australia
| | - Mehmet A Orgun
- Department of Computing, Faculty of Science and Engineering, Macquarie University, Sydney, NSW 2109 Australia
| |
Collapse
|
95
|
He Z, Ryan P, Hoxha J, Wang S, Carini S, Sim I, Weng C. Multivariate analysis of the population representativeness of related clinical studies. J Biomed Inform 2016; 60:66-76. [PMID: 26820188 PMCID: PMC4837055 DOI: 10.1016/j.jbi.2016.01.007] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2015] [Revised: 01/15/2016] [Accepted: 01/19/2016] [Indexed: 12/25/2022]
Abstract
OBJECTIVE To develop a multivariate method for quantifying the population representativeness across related clinical studies and a computational method for identifying and characterizing underrepresented subgroups in clinical studies. METHODS We extended a published metric named Generalizability Index for Study Traits (GIST) to include multiple study traits for quantifying the population representativeness of a set of related studies by assuming the independence and equal importance among all study traits. On this basis, we compared the effectiveness of GIST and multivariate GIST (mGIST) qualitatively. We further developed an algorithm called "Multivariate Underrepresented Subgroup Identification" (MAGIC) for constructing optimal combinations of distinct value intervals of multiple traits to define underrepresented subgroups in a set of related studies. Using Type 2 diabetes mellitus (T2DM) as an example, we identified and extracted frequently used quantitative eligibility criteria variables in a set of clinical studies. We profiled the T2DM target population using the National Health and Nutrition Examination Survey (NHANES) data. RESULTS According to the mGIST scores for four example variables, i.e., age, HbA1c, BMI, and gender, the included observational T2DM studies had superior population representativeness than the interventional T2DM studies. For the interventional T2DM studies, Phase I trials had better population representativeness than Phase III trials. People at least 65years old with HbA1c value between 5.7% and 7.2% were particularly underrepresented in the included T2DM trials. These results confirmed well-known knowledge and demonstrated the effectiveness of our methods in population representativeness assessment. CONCLUSIONS mGIST is effective at quantifying population representativeness of related clinical studies using multiple numeric study traits. MAGIC identifies underrepresented subgroups in clinical studies. Both data-driven methods can be used to improve the transparency of design bias in participation selection at the research community level.
Collapse
Affiliation(s)
- Zhe He
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA.
| | - Patrick Ryan
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA; Janssen Research and Development, Titusville, NJ 08560, USA; Observational Health Data Sciences and Informatics, New York, NY 10032, USA
| | - Julia Hoxha
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| | - Shuang Wang
- Department of Biostatistics, Columbia University, New York, NY 10032, USA
| | - Simona Carini
- Department of Medicine, University of California, San Francisco, CA 94143, USA
| | - Ida Sim
- Department of Medicine, University of California, San Francisco, CA 94143, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA; Observational Health Data Sciences and Informatics, New York, NY 10032, USA
| |
Collapse
|
96
|
Kahn CE Jr. Transitive closure of subsumption and causal relations in a large ontology of radiological diagnosis. J Biomed Inform 2016; 61:27-33. [PMID: 27005590 DOI: 10.1016/j.jbi.2016.03.015] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2015] [Revised: 03/12/2016] [Accepted: 03/18/2016] [Indexed: 01/12/2023]
Abstract
The Radiology Gamuts Ontology (RGO)-an ontology of diseases, interventions, and imaging findings-was developed to aid in decision support, education, and translational research in diagnostic radiology. The ontology defines a subsumption (is_a) relation between more general and more specific terms, and a causal relation (may_cause) to express the relationship between disorders and their possible imaging manifestations. RGO incorporated 19,745 terms with their synonyms and abbreviations, 1768 subsumption relations, and 55,558 causal relations. Transitive closure was computed iteratively; it yielded 2154 relations over subsumption and 1,594,896 relations over causality. Five causal cycles were discovered, all with path length of no more than 5. The graph-theoretic metrics of in-degree and out-degree were explored; the most useful metric to prioritize modification of the ontology was found to be the product of the in-degree of transitive closure over subsumption and the out-degree of transitive closure over causality. Two general types of error were identified: (1) causal assertions that used overly general terms because they implicitly assumed an organ-specific context and (2) subsumption relations where a site-specific disorder was asserted to be a subclass of the general disorder. Transitive closure helped identify incorrect assertions, prioritized and guided ontology revision, and aided resources that applied the ontology's knowledge.
Collapse
|
97
|
Johnson SGB, Ahn WK. Causal Networks or Causal Islands? The Representation of Mechanisms and the Transitivity of Causal Judgment. Cogn Sci 2015; 39:1468-503. [PMID: 25556901 PMCID: PMC4490159 DOI: 10.1111/cogs.12213] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2013] [Revised: 06/01/2014] [Accepted: 09/23/2014] [Indexed: 11/30/2022]
Abstract
Knowledge of mechanisms is critical for causal reasoning. We contrasted two possible organizations of causal knowledge—an interconnected causal network, where events are causally connected without any boundaries delineating discrete mechanisms; or a set of disparate mechanisms—causal islands—such that events in different mechanisms are not thought to be related even when they belong to the same causal chain. To distinguish these possibilities, we tested whether people make transitive judgments about causal chains by inferring, given A causes B and B causes C, that A causes C. Specifically, causal chains schematized as one chunk or mechanism in semantic memory (e.g., exercising, becoming thirsty, drinking water) led to transitive causal judgments. On the other hand, chains schematized as multiple chunks (e.g., having sex, becoming pregnant, becoming nauseous) led to intransitive judgments despite strong intermediate links ((Experiments 1-3). Normative accounts of causal intransitivity could not explain these intransitive judgments (Experiments 4 and 5).
Collapse
|
98
|
Zhang Y, Li H, Duan H, Zhao Y. Mobilizing clinical decision support to facilitate knowledge translation: a case study in China. Comput Biol Med 2015; 60:40-50. [PMID: 25754360 DOI: 10.1016/j.compbiomed.2015.02.013] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2014] [Revised: 02/13/2015] [Accepted: 02/14/2015] [Indexed: 11/30/2022]
Abstract
BACKGROUND A wide gulf remains between knowledge and clinical practice. Clinical decision support has been demonstrated to be an effective knowledge tool that healthcare organizations can employ to deliver the "right knowledge to the right people in the right form at the right time". How to adopt various clinical decision support (CDS) systems efficiently to facilitate evidence-based practice is one challenge faced by knowledge translation research. METHOD A computer-aided knowledge translation method that mobilizes evidence-based decision supports is proposed. The foundation of the method is a knowledge representation model that is able to cover, coordinate and synergize various types of medical knowledge to achieve centralized and effective knowledge management. Next, web-based knowledge-authoring and natural language processing based knowledge acquisition tools are designed to accelerate the transformation of the latest clinical evidence into computerized knowledge content. Finally, a batch of fundamental services, such as data acquisition and inference engine, are designed to actuate the acquired knowledge content. These services can be used as building blocks for various evidence-based decision support applications. RESULTS Based on the above method, a computer-aided knowledge translation platform was constructed as a CDS infrastructure. Based on this platform, typical CDS applications were developed. A case study of drug use check demonstrates that the CDS intervention delivered by the platform has produced observable behavior changes (89.7% of alerted medical orders were revised by physicians). DISCUSSION Computer-aided knowledge translation via a CDS infrastructure can be effective in facilitating knowledge translation in clinical settings.
Collapse
Affiliation(s)
- Yinsheng Zhang
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou 310027, China; School of Computer Science and Information Engineering, Zhejiang Gongshang University, Hangzhou 310018, China.
| | - Haomin Li
- Children׳s Hospital, Institute of Translational Medicine, Zhejiang University, Hangzhou 310027, China.
| | - Huilong Duan
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou 310027, China.
| | - Yinhong Zhao
- China National Center for Biotechnology Development, Beijing 100036, China.
| |
Collapse
|
99
|
Mason RA, Just MA. Physics instruction induces changes in neural knowledge representation during successive stages of learning. Neuroimage 2015; 111:36-48. [PMID: 25665967 DOI: 10.1016/j.neuroimage.2014.12.086] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2014] [Revised: 11/19/2014] [Accepted: 12/29/2014] [Indexed: 11/18/2022] Open
Abstract
Incremental instruction on the workings of a set of mechanical systems induced a progression of changes in the neural representations of the systems. The neural representations of four mechanical systems were assessed before, during, and after three phases of incremental instruction (which first provided information about the system components, then provided partial causal information, and finally provided full functional information). In 14 participants, the neural representations of four systems (a bathroom scale, a fire extinguisher, an automobile braking system, and a trumpet) were assessed using three recently developed techniques: (1) machine learning and classification of multi-voxel patterns; (2) localization of consistently responding voxels; and (3) representational similarity analysis (RSA). The neural representations of the systems progressed through four stages, or states, involving spatially and temporally distinct multi-voxel patterns: (1) initially, the representation was primarily visual (occipital cortex); (2) it subsequently included a large parietal component; (3) it eventually became cortically diverse (frontal, parietal, temporal, and medial frontal regions); and (4) at the end, it demonstrated a strong frontal cortex weighting (frontal and motor regions). At each stage of knowledge, it was possible for a classifier to identify which one of four mechanical systems a participant was thinking about, based on their brain activation patterns. The progression of representational states was suggestive of progressive stages of learning: (1) encoding information from the display; (2) mental animation, possibly involving imagining the components moving; (3) generating causal hypotheses associated with mental animation; and finally (4) determining how a person (probably oneself) would interact with the system. This interpretation yields an initial, cortically-grounded, theory of learning of physical systems that potentially can be related to cognitive learning theories by suggesting links between cortical representations, stages of learning, and the understanding of simple systems.
Collapse
Affiliation(s)
- Robert A Mason
- Center for Cognitive Brain Imaging, Psychology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
| | - Marcel Adam Just
- Center for Cognitive Brain Imaging, Psychology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
100
|
Vaz C, Francisco AP, Silva M, Jolley KA, Bray JE, Pouseele H, Rothganger J, Ramirez M, Carriço JA. TypOn: the microbial typing ontology. J Biomed Semantics 2014; 5:43. [PMID: 25584183 PMCID: PMC4290098 DOI: 10.1186/2041-1480-5-43] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2014] [Accepted: 10/06/2014] [Indexed: 08/21/2023] Open
Abstract
Abstract Bacterial identification and characterization at subspecies level is commonly known as Microbial Typing. Currently, these methodologies are fundamental tools in Clinical Microbiology and bacterial population genetics studies to track outbreaks and to study the dissemination and evolution of virulence or pathogenicity factors and antimicrobial resistance. Due to advances in DNA sequencing technology, these methods have evolved to become focused on sequence-based methodologies. The need to have a common understanding of the concepts described and the ability to share results within the community at a global level are increasingly important requisites for the continued development of portable and accurate sequence-based typing methods, especially with the recent introduction of Next Generation Sequencing (NGS) technologies. In this paper, we present an ontology designed for the sequence-based microbial typing field, capable of describing any of the sequence-based typing methodologies currently in use and being developed, including novel NGS based methods. This is a fundamental step to accurately describe, analyze, curate, and manage information for microbial typing based on sequence based typing methods.
Collapse
Affiliation(s)
- Cátia Vaz
- INESC-ID, R. Alves Redol 9, 1000-029 Lisboa, Portugal ; Instituto Superior de Engenharia de Lisboa, Instituto Politécnico de Lisboa, R. Cons. Emídio Navarro 1, 1959-007 Lisboa, Portugal
| | - Alexandre P Francisco
- INESC-ID, R. Alves Redol 9, 1000-029 Lisboa, Portugal ; Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais 1, 1049-001 Lisboa, Portugal
| | - Mickael Silva
- Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Av. Prof. Egas Moniz, 1649-028 Lisboa, Portugal
| | | | - James E Bray
- Department of Zoology, University of Oxford, Oxford, UK
| | - Hannes Pouseele
- Applied Maths NV, Keistraat 120, 98308 Sint-Martens-Latem, Belgium
| | | | - Mário Ramirez
- Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Av. Prof. Egas Moniz, 1649-028 Lisboa, Portugal
| | - João A Carriço
- Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Av. Prof. Egas Moniz, 1649-028 Lisboa, Portugal
| |
Collapse
|