1
|
Ahmad PN, Shah AM, Lee K. A Review on Electronic Health Record Text-Mining for Biomedical Name Entity Recognition in Healthcare Domain. Healthcare (Basel) 2023; 11:1268. [PMID: 37174810 PMCID: PMC10178605 DOI: 10.3390/healthcare11091268] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 04/24/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023] Open
Abstract
Biomedical-named entity recognition (bNER) is critical in biomedical informatics. It identifies biomedical entities with special meanings, such as people, places, and organizations, as predefined semantic types in electronic health records (EHR). bNER is essential for discovering novel knowledge using computational methods and Information Technology. Early bNER systems were configured manually to include domain-specific features and rules. However, these systems were limited in handling the complexity of the biomedical text. Recent advances in deep learning (DL) have led to the development of more powerful bNER systems. DL-based bNER systems can learn the patterns of biomedical text automatically, making them more robust and efficient than traditional rule-based systems. This paper reviews the healthcare domain of bNER, using DL techniques and artificial intelligence in clinical records, for mining treatment prediction. bNER-based tools are categorized systematically and represent the distribution of input, context, and tag (encoder/decoder). Furthermore, to create a labeled dataset for our machine learning sentiment analyzer to analyze the sentiment of a set of tweets, we used a manual coding approach and the multi-task learning method to bias the training signals with domain knowledge inductively. To conclude, we discuss the challenges facing bNER systems and future directions in the healthcare field.
Collapse
Affiliation(s)
- Pir Noman Ahmad
- School of Computer Science, Harbin Institute of Technology, Harbin 150001, China
| | - Adnan Muhammad Shah
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| | - KangYoon Lee
- Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
| |
Collapse
|
2
|
Detection of Benign and Malignant Tumors in Skin Empowered with Transfer Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:4826892. [PMID: 35371238 PMCID: PMC8970955 DOI: 10.1155/2022/4826892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 01/27/2022] [Accepted: 03/06/2022] [Indexed: 12/01/2022]
Abstract
Skin cancer is a major type of cancer with rapidly increasing victims all over the world. It is very much important to detect skin cancer in the early stages. Computer-developed diagnosis systems helped the physicians to diagnose disease, which allows appropriate treatment and increases the survival ratio of patients. In the proposed system, the classification problem of skin disease is tackled. An automated and reliable system for the classification of malignant and benign tumors is developed. In this system, a customized pretrained Deep Convolutional Neural Network (DCNN) is implemented. The pretrained AlexNet model is customized by replacing the last layers according to the proposed system problem. The softmax layer is modified according to binary classification detection. The proposed system model is well trained on malignant and benign tumors skin cancer dataset of 1920 images, where each class contains 960 images. After good training, the proposed system model is validated on 480 images, where the size of images of each class is 240. The proposed system model is analyzed using the following parameters: accuracy, sensitivity, specificity, Positive Predicted Values (PPV), Negative Predicted Value (NPV), False Positive Ratio (FPR), False Negative Ratio (FNR), Likelihood Ratio Positive (LRP), and Likelihood Ratio Negative (LRN). The accuracy achieved through the proposed system model is 87.1%, which is higher than traditional methods of classification.
Collapse
|
3
|
Wang CH, Nguyen PA, Jack Li YC, Islam MM, Poly TN, Tran QV, Huang CW, Yang HC. Improved diagnosis-medication association mining to reduce pseudo-associations. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2021; 207:106181. [PMID: 34052770 DOI: 10.1016/j.cmpb.2021.106181] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 05/10/2021] [Indexed: 06/12/2023]
Abstract
BACKGROUND AND OBJECTIVE Association rule mining has been adopted to medical fields to discover prescribing patterns or relationships among diseases and/or medications; however, it has generated unreasonable associations among these entities. This study aims to identify the real-world profile of disease-medication (DM) associations using the modified mining algorithm and assess its performance in reducing DM pseudo-associations. METHODS We retrieved data from outpatient records between January 2011 and December 2015 in claims databases maintained by the Health and Welfare Data Science Center, Ministry of Health and Welfare, Taiwan. The association rule mining's lift (Q-value) was adopted to quantify DM associations, referred to as Q1 for the original algorithm and as Q2 for the modified algorithm. One thousand DM pairs with positive Q1-values (Q1+) and negative or no Q2-values (Q2- or Q2∅) were selected as the validation dataset, in which two pharmacists assessed the DM associations. RESULTS A total of 3,120,449 unique DM pairs were identified, of which there were 333,347 Q1+Q2- pairs and 429,931 Q1+Q2∅ pairs. Q1+Q2- rates were relatively high in ATC classes C (29.91%) and R (30.24%). Classes L (69.91%) and V (52.52%) demonstrated remarkably high Q1+Q2∅ rates. For the 1000 pairs in the validation, 93.7% of the Q1+Q2- or Q1+Q2∅ DM pairs were assessed as pseudo-associations. However, classes M (5.3%), H (4.5%), and B (4.1%) showed the highest rates of plausible associations falsely given Q2- or Q2∅ by the modified algorithm. CONCLUSIONS The modified algorithm demonstrated high accuracy to identify pseudo-associations regarded as positive associations by the original algorithm and would potentially be applied to improve secondary databases to facilitate research on real-world prescribing patterns and further enhance drug safety.
Collapse
Affiliation(s)
- Ching-Huan Wang
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
| | - Phung Anh Nguyen
- International Center for Health Information Technology (ICHIT), College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Department of Healthcare Information and Management, School of Health Technology, Ming Chuan University, Taipei, Taiwan
| | - Yu Chuan Jack Li
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; International Center for Health Information Technology (ICHIT), College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Research Center of Big Data and Meta-analysis, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan; Department of Dermatology, Wan Fang Hospital, Taipei, Taiwan; TMU Research Center of Cancer Translational Medicine, Taipei Medical University, Taipei, Taiwan
| | - Md Mohaimenul Islam
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; International Center for Health Information Technology (ICHIT), College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Research Center of Big Data and Meta-analysis, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan
| | - Tahmina Nasrin Poly
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; International Center for Health Information Technology (ICHIT), College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Research Center of Big Data and Meta-analysis, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan
| | - Quoc-Viet Tran
- Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan
| | - Chih-Wei Huang
- International Center for Health Information Technology (ICHIT), College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
| | - Hsuan-Chia Yang
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; International Center for Health Information Technology (ICHIT), College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Research Center of Big Data and Meta-analysis, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan.
| |
Collapse
|
4
|
Zhao L, Ciallella HL, Aleksunes LM, Zhu H. Advancing computer-aided drug discovery (CADD) by big data and data-driven machine learning modeling. Drug Discov Today 2020; 25:1624-1638. [PMID: 32663517 PMCID: PMC7572559 DOI: 10.1016/j.drudis.2020.07.005] [Citation(s) in RCA: 66] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Revised: 06/26/2020] [Accepted: 07/06/2020] [Indexed: 02/06/2023]
Abstract
Advancing a new drug to market requires substantial investments in time as well as financial resources. Crucial bioactivities for drug candidates, including their efficacy, pharmacokinetics (PK), and adverse effects, need to be investigated during drug development. With advancements in chemical synthesis and biological screening technologies over the past decade, a large amount of biological data points for millions of small molecules have been generated and are stored in various databases. These accumulated data, combined with new machine learning (ML) approaches, such as deep learning, have shown great potential to provide insights into relevant chemical structures to predict in vitro, in vivo, and clinical outcomes, thereby advancing drug discovery and development in the big data era.
Collapse
Affiliation(s)
- Linlin Zhao
- The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA
| | - Heather L Ciallella
- The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA
| | - Lauren M Aleksunes
- Department of Pharmacology and Toxicology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ 08854, USA
| | - Hao Zhu
- The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA; Department of Chemistry, Rutgers University, Camden, NJ 08102, USA.
| |
Collapse
|
5
|
Galetsi P, Katsaliaki K. Big data analytics in health: an overview and bibliometric study of research activity. Health Info Libr J 2019; 37:5-25. [DOI: 10.1111/hir.12286] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Accepted: 10/23/2019] [Indexed: 12/16/2022]
Affiliation(s)
- Panagiota Galetsi
- School of Economics, Business Administration & Legal Studies International Hellenic University Thessaloniki Greece
| | - Korina Katsaliaki
- School of Economics, Business Administration & Legal Studies International Hellenic University Thessaloniki Greece
| |
Collapse
|
6
|
Zarrinpar A, David Cheng TY, Huo Z. What Can We Learn About Drug Safety and Other Effects in the Era of Electronic Health Records and Big Data That We Would Not Be Able to Learn From Classic Epidemiology? J Surg Res 2019; 246:599-604. [PMID: 31653413 DOI: 10.1016/j.jss.2019.09.053] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 08/16/2019] [Accepted: 09/19/2019] [Indexed: 02/07/2023]
Abstract
As more and more health systems have converted to the use of electronic health records, the amount of searchable and analyzable data is exploding. This includes not just provider or laboratory created data but also data collected by instruments, personal devices, and patients themselves, among others. This has led to more attention being paid to the analysis of these data to answer previously unaddressed questions. This is especially important given the number of therapies previously found to be beneficial in clinical trials that are currently being re-scrutinized. Because there are orders of magnitude more information contained in these data sets, a fundamentally different approach needs to be taken to their processing and analysis and the generation of knowledge. Health care and medicine are drivers of this phenomenon and will ultimately be the main beneficiaries. Concurrently, many different types of questions can now be asked using these data sets. Research groups have become increasingly active in mining large data sets, including nationwide health care databases, to learn about associations of medication use and various unrelated diseases such as cancer. Given the recent increase in research activity in this area, its promise to radically change clinical research, and the relative lack of widespread knowledge about its potential and advances, we surveyed the available literature to understand the strengths and limitations of these new tools. We also outline new databases and techniques that are available to researchers worldwide, with special focus on work pertaining to the broad and rapid monitoring of drug safety and secondary effects.
Collapse
Affiliation(s)
- Ali Zarrinpar
- Department of Surgery, College of Medicine, University of Florida, Gainesville, Florida.
| | - Ting-Yuan David Cheng
- Department of Epidemiology, College of Public Health & Health Professions and College of Medicine, University of Florida, Gainesville, Florida
| | - Zhiguang Huo
- Department of Biostatistics, College of Public Health & Health Professions and College of Medicine, University of Florida, Gainesville, Florida
| |
Collapse
|
7
|
Comment on: “A Bibliometric Analysis and Visualization of Medical Big Data Research” Sustainability 2018, 10, 166. SUSTAINABILITY 2018. [DOI: 10.3390/su10124851] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Liao et al. [...]
Collapse
|
8
|
Islam MS, Hasan MM, Wang X, Germack HD, Noor-E-Alam M. A Systematic Review on Healthcare Analytics: Application and Theoretical Perspective of Data Mining. Healthcare (Basel) 2018; 6:E54. [PMID: 29882866 PMCID: PMC6023432 DOI: 10.3390/healthcare6020054] [Citation(s) in RCA: 66] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2018] [Revised: 05/17/2018] [Accepted: 05/21/2018] [Indexed: 12/17/2022] Open
Abstract
The growing healthcare industry is generating a large volume of useful data on patient demographics, treatment plans, payment, and insurance coverage—attracting the attention of clinicians and scientists alike. In recent years, a number of peer-reviewed articles have addressed different dimensions of data mining application in healthcare. However, the lack of a comprehensive and systematic narrative motivated us to construct a literature review on this topic. In this paper, we present a review of the literature on healthcare analytics using data mining and big data. Following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, we conducted a database search between 2005 and 2016. Critical elements of the selected studies—healthcare sub-areas, data mining techniques, types of analytics, data, and data sources—were extracted to provide a systematic view of development in this field and possible future directions. We found that the existing literature mostly examines analytics in clinical and administrative decision-making. Use of human-generated data is predominant considering the wide adoption of Electronic Medical Record in clinical care. However, analytics based on website and social media data has been increasing in recent years. Lack of prescriptive analytics in practice and integration of domain expert knowledge in the decision-making process emphasizes the necessity of future research.
Collapse
Affiliation(s)
- Md Saiful Islam
- Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115, USA.
| | - Md Mahmudul Hasan
- Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115, USA.
| | - Xiaoyi Wang
- Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115, USA.
| | - Hayley D Germack
- Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115, USA.
- National Clinician Scholars Program, Yale University School of Medicine, New Haven, CT 06511, USA.
- Bouvé College of Health Sciences, Northeastern University, Boston, MA 02115, USA.
| | - Md Noor-E-Alam
- Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115, USA.
| |
Collapse
|
9
|
Pashazadeh A, Navimipour NJ. Big data handling mechanisms in the healthcare applications: A comprehensive and systematic literature review. J Biomed Inform 2018; 82:47-62. [PMID: 29655946 DOI: 10.1016/j.jbi.2018.03.014] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2017] [Revised: 11/19/2017] [Accepted: 03/23/2018] [Indexed: 01/08/2023]
Abstract
Healthcare provides many services such as diagnosing, treatment, prevention of diseases, illnesses, injuries, and other physical and mental disorders. Large-scale distributed data processing applications in healthcare as a basic concept operates on large amounts of data. Therefore, big data application functions are the main part of healthcare operations, but there was not any comprehensive and systematic survey about studying and evaluating the important techniques in this field. Therefore, this paper aims at providing the comprehensive, detailed, and systematic study of the state-of-the-art mechanisms in the big data related to healthcare applications in five categories, including machine learning, cloud-based, heuristic-based, agent-based, and hybrid mechanisms. Also, this paper displayed a systematic literature review (SLR) of the big data applications in the healthcare literature up to the end of 2016. Initially, 205 papers were identified, but a paper selection process reduced the number of papers to 29 important studies.
Collapse
Affiliation(s)
- Asma Pashazadeh
- Department of Computer Engineering, Tabriz Branch, Islamic Azad University, Tabriz, Iran
| | - Nima Jafari Navimipour
- Department of Computer Engineering, Tabriz Branch, Islamic Azad University, Tabriz, Iran.
| |
Collapse
|
10
|
Chen P, Pan C. Diabetes classification model based on boosting algorithms. BMC Bioinformatics 2018; 19:109. [PMID: 29587624 PMCID: PMC5872396 DOI: 10.1186/s12859-018-2090-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Accepted: 02/28/2018] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Diabetes mellitus is a common and complicated chronic lifelong disease. Hence, it is of high clinical significance to find the most relevant clinical indexes and to perform efficient computer-aided pre-diagnoses and diagnoses. RESULTS Non-parametric statistical testing is performed on hundreds of medical measurement index results between diabetic and non-diabetic populations. Two common boosting algorithms, Adaboost.M1 and LogitBoost, are selected to establish a machine model for diabetes diagnosis based on these clinical test data, involving a total of 35,669 individuals. The machine classification models built by these two algorithms have very good classification ability. Here, the LogitBoost classification model is slightly better than the Adaboost.M1 classification model. The overall accuracy of the LogitBoost classification model reached 95.30% when using 10-fold cross validation. The true positive, true negative, false positive, and false negative rates of the binary classification model were 0.921, 0.969, 0.031, and 0.079, respectively, and the area under the receiver operating characteristic curve reached 0.99. CONCLUSIONS The boosting algorithms show excellent performance for the diabetes classification models based on clinical medical data. The coefficient matrix of the original data is a sparse matrix, because some of the test results were missing, including some that were directly related to disease diagnosis. Therefore, the model is robust and has a degree of pre-diagnosis function. In the process of selecting the preferred test items, the most statistically significant discriminating factors between the diabetic and general populations were obtained and can be used as reference risk factors for diabetes mellitus.
Collapse
Affiliation(s)
- Peihua Chen
- Institute of Biopharmaceutical Informatics and Technologies, Wenzhou Medical University, Wenzhou, China
| | - Chuandi Pan
- Department of Computer Technology and Information Management, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou City, China
| |
Collapse
|
11
|
Jiang J, Xie J, Zhao C, Su J, Guan Y, Yu Q. Max-margin weight learning for medical knowledge network. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2018; 156:179-190. [PMID: 29428070 DOI: 10.1016/j.cmpb.2018.01.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Revised: 10/30/2017] [Accepted: 01/10/2018] [Indexed: 06/08/2023]
Abstract
BACKGROUND AND OBJECTIVE The application of medical knowledge strongly affects the performance of intelligent diagnosis, and method of learning the weights of medical knowledge plays a substantial role in probabilistic graphical models (PGMs). The purpose of this study is to investigate a discriminative weight-learning method based on a medical knowledge network (MKN). METHODS We propose a training model called the maximum margin medical knowledge network (M3KN), which is strictly derived for calculating the weight of medical knowledge. Using the definition of a reasonable margin, the weight learning can be transformed into a margin optimization problem. To solve the optimization problem, we adopt a sequential minimal optimization (SMO) algorithm and the clique property of a Markov network. Ultimately, M3KN not only incorporates the inference ability of PGMs but also deals with high-dimensional logic knowledge. RESULTS The experimental results indicate that M3KN obtains a higher F-measure score than the maximum likelihood learning algorithm of MKN for both Chinese Electronic Medical Records (CEMRs) and Blood Examination Records (BERs). Furthermore, the proposed approach is obviously superior to some classical machine learning algorithms for medical diagnosis. To adequately manifest the importance of domain knowledge, we numerically verify that the diagnostic accuracy of M3KN is gradually improved as the number of learned CEMRs increase, which contain important medical knowledge. CONCLUSIONS Our experimental results show that the proposed method performs reliably for learning the weights of medical knowledge. M3KN outperforms other existing methods by achieving an F-measure of 0.731 for CEMRs and 0.4538 for BERs. This further illustrates that M3KN can facilitate the investigations of intelligent healthcare.
Collapse
Affiliation(s)
- Jingchi Jiang
- School of Computer Science and Technology, Harbin Institute of Technology, Comprehensive Building 803 Harbin 150001, China.
| | - Jing Xie
- School of Computer Science and Technology, Harbin Institute of Technology, Comprehensive Building 803 Harbin 150001, China
| | - Chao Zhao
- School of Computer Science and Technology, Harbin Institute of Technology, Comprehensive Building 803 Harbin 150001, China
| | - Jia Su
- School of Computer Science and Technology, Harbin Institute of Technology, Comprehensive Building 803 Harbin 150001, China
| | - Yi Guan
- School of Computer Science and Technology, Harbin Institute of Technology, Comprehensive Building 803 Harbin 150001, China.
| | - Qiubin Yu
- Medical Record Room, The 2nd Affiliated Hospital of Harbin Medical University, Harbin 150086, China
| |
Collapse
|
12
|
Singh G, Schulthess D, Hughes N, Vannieuwenhuyse B, Kalra D. Real world big data for clinical research and drug development. Drug Discov Today 2017; 23:652-660. [PMID: 29294362 DOI: 10.1016/j.drudis.2017.12.002] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2017] [Revised: 12/06/2017] [Accepted: 12/18/2017] [Indexed: 12/27/2022]
Abstract
The objective of this paper is to identify the extent to which real world data (RWD) is being utilized, or could be utilized, at scale in drug development. Through screening peer-reviewed literature, we have cited specific examples where RWD can be used for biomarker discovery or validation, gaining a new understanding of a disease or disease associations, discovering new markers for patient stratification and targeted therapies, new markers for identifying persons with a disease, and pharmacovigilance. None of the papers meeting our criteria was specifically geared toward novel targets or indications in the biopharmaceutical sector; the majority were focused on the area of public health, often sponsored by universities, insurance providers or in combination with public health bodies such as national insurers. The field is still in an early phase of practical application, and is being harnessed broadly where it serves the most direct need in public health applications in early, rare and novel disease incidents. However, these exemplars provide a valuable contribution to insights on the use of RWD to create novel, faster and less invasive approaches to advance disease understanding and biomarker discovery. We believe that pharma needs to invest in making better use of Electronic Health Records and the need for more precompetitive collaboration to grow the scale of this 'big denominator' capability, especially given the needs of precision medicine research.
Collapse
Affiliation(s)
| | | | - Nigel Hughes
- Janssen Research and Development, Beerse, Belgium
| | | | - Dipak Kalra
- Dept. Medical Informatics & Statistics, University of Ghent, De Pintelaan 185, Gent 9000, Belgium.
| |
Collapse
|
13
|
Zhao C, Jiang J, Xu Z, Guan Y. A study of EMR-based medical knowledge network and its applications. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2017; 143:13-23. [PMID: 28391811 DOI: 10.1016/j.cmpb.2017.02.016] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2016] [Revised: 01/23/2017] [Accepted: 02/09/2017] [Indexed: 06/07/2023]
Abstract
BACKGROUND AND OBJECTIVE Electronic medical records (EMRs) contain an amount of medical knowledge which can be used for clinical decision support. We attempt to integrate this medical knowledge into a complex network, and then implement a diagnosis model based on this network. METHODS The dataset of our study contains 992 records which are uniformly sampled from different departments of the hospital. In order to integrate the knowledge of these records, an EMR-based medical knowledge network (EMKN) is constructed. This network takes medical entities as nodes, and co-occurrence relationships between the two entities as edges. Selected properties of this network are analyzed. To make use of this network, a basic diagnosis model is implemented. Seven hundred records are randomly selected to re-construct the network, and the remaining 292 records are used as test records. The vector space model is applied to illustrate the relationships between diseases and symptoms. Because there may exist more than one actual disease in a record, the recall rate of the first ten results, and the average precision are adopted as evaluation measures. RESULTS Compared with a random network of the same size, this network has a similar average length but a much higher clustering coefficient. Additionally, it can be observed that there are direct correlations between the community structure and the real department classes in the hospital. For the diagnosis model, the vector space model using disease as a base obtains the best result. At least one accurate disease can be obtained in 73.27% of the records in the first ten results. CONCLUSION We constructed an EMR-based medical knowledge network by extracting the medical entities. This network has the small-world and scale-free properties. Moreover, the community structure showed that entities in the same department have a tendency to be self-aggregated. Based on this network, a diagnosis model was proposed. This model uses only the symptoms as inputs and is not restricted to a specific disease. The experiments conducted demonstrated that EMKN is a simple and universal technique to integrate different medical knowledge from EMRs, and can be used for clinical decision support.
Collapse
Affiliation(s)
- Chao Zhao
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Jingchi Jiang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Zhiming Xu
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Yi Guan
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| |
Collapse
|
14
|
Graña M, Chyzhyk D, Toro C, Rios S. Innovations in healthcare and medicine editorial. Comput Biol Med 2016; 72:226-8. [PMID: 27000205 DOI: 10.1016/j.compbiomed.2016.03.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2016] [Revised: 03/06/2016] [Accepted: 03/08/2016] [Indexed: 10/22/2022]
Abstract
This special issue editorial begins with a brief discussion on the current trends of innovations in healthcare and medicine driven by the evolution of sensing devices as well as the information processing techniques, and the social media revolution. This discussion aims to set the stage for the actual papers accepted for the special issue which are extensions of the papers presented at the InMed 2014 conference held in San Sebastian, Spain, in July 2014.
Collapse
Affiliation(s)
- Manuel Graña
- Computational Intelligence Group, Dept. CCIA, University of the Basque Country, UPV/EHU, San Sebastian, Spain; ACPySS, San Sebastian, Spain; ENGINE Centre, Wrocław University of Technology, Wrocław, Poland
| | - Darya Chyzhyk
- Computational Intelligence Group, Dept. CCIA, University of the Basque Country, UPV/EHU, San Sebastian, Spain; ACPySS, San Sebastian, Spain; CISE Dept. University of Florida, USA
| | | | - Sebastian Rios
- Industrial Engineering Department, Business Intelligence Research Center, University of Chile, Santiago, Chile
| |
Collapse
|