Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Moen H, Ginter F, Marsi E, Peltonen LM, Salakoski T, Salanterä S. Care episode retrieval: distributional semantic models for information retrieval in the clinical domain. BMC Med Inform Decis Mak 2015;15 Suppl 2:S2. [PMID: 26099735 PMCID: PMC4474584 DOI: 10.1186/1472-6947-15-s2-s2] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Indexed: 11/10/2022] Open

Cited by Other Article(s)

Sivarajkumar S, Mohammad HA, Oniani D, Roberts K, Hersh W, Liu H, He D, Visweswaran S, Wang Y. Clinical Information Retrieval: A Literature Review. Journal of Healthcare Informatics Research 2024;8:313-352. [PMID: 38681755 PMCID: PMC11052968 DOI: 10.1007/s41666-024-00159-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/28/2023] [Revised: 12/07/2023] [Accepted: 01/08/2024] [Indexed: 05/01/2024]

Barrett AK, Ford J, Zhu Y. Sending and Receiving Safety and Risk Messages in Hospitals: An Exploration into Organizational Communication Channels and Providers' Communication Overload. Health Communication 2021;36:1697-1708. [PMID: 32633142 DOI: 10.1080/10410236.2020.1788498] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Indexed: 06/11/2023]

Wu S, Roberts K, Datta S, Du J, Ji Z, Si Y, Soni S, Wang Q, Wei Q, Xiang Y, Zhao B, Xu H. Deep learning in clinical natural language processing: a methodical review. J Am Med Inform Assoc 2021;27:457-470. [PMID: 31794016 DOI: 10.1093/jamia/ocz200] [Citation(s) in RCA: 158] [Impact Index Per Article: 52.7] [Received: 07/03/2019] [Revised: 10/15/2019] [Accepted: 11/09/2019] [Indexed: 02/07/2023] Open

Xiang Y, Xu J, Si Y, Li Z, Rasmy L, Zhou Y, Tiryaki F, Li F, Zhang Y, Wu Y, Jiang X, Zheng WJ, Zhi D, Tao C, Xu H. Time-sensitive clinical concept embeddings learned from large electronic health records. BMC Med Inform Decis Mak 2019;19:58. [PMID: 30961579 PMCID: PMC6454598 DOI: 10.1186/s12911-019-0766-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Indexed: 12/23/2022] Open

Affiliation(s)

Yang Xiang: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Jun Xu: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Yuqi Si: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Zhiheng Li: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA; School of Computer Science and Technology, Dalian University of Technology, Dalian, China
Laila Rasmy: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Yujia Zhou: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Firat Tiryaki: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Fang Li: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Yaoyun Zhang: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Yonghui Wu: Department of Health Outcomes & Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
Xiaoqian Jiang: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Wenjin Jim Zheng: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Degui Zhi: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Cui Tao: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Hua Xu: School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA

Ning W, Chan S, Beam A, Yu M, Geva A, Liao K, Mullen M, Mandl KD, Kohane I, Cai T, Yu S. Feature extraction for phenotyping from semantic and knowledge resources. J Biomed Inform 2019;91:103122. [PMID: 30738949 PMCID: PMC6424621 DOI: 10.1016/j.jbi.2019.103122] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Indexed: 01/10/2023]

Abstract

OBJECTIVE

Phenotyping algorithms can efficiently and accurately identify patients with a specific disease phenotype and construct electronic health records (EHR)-based cohorts for subsequent clinical or genomic studies. Previous studies have introduced unsupervised EHR-based feature selection methods that yielded algorithms with high accuracy. However, those selection methods still require expert intervention to tweak the parameter settings according to the EHR data distribution for each phenotype. To further accelerate the development of phenotyping algorithms, we propose a fully automated and robust unsupervised feature selection method that leverages only publicly available medical knowledge sources, instead of EHR data.

METHODS

SEmantics-Driven Feature Extraction (SEDFE) collects medical concepts from online knowledge sources as candidate features and gives them vector-form distributional semantic representations derived with neural word embedding and the Unified Medical Language System Metathesaurus. A number of features that are semantically closest and that sufficiently characterize the target phenotype are determined by a linear decomposition criterion and are selected for the final classification algorithm.

RESULTS

SEDFE was compared with the EHR-based SAFE algorithm and domain experts on feature selection for the classification of five phenotypes including coronary artery disease, rheumatoid arthritis, Crohn's disease, ulcerative colitis, and pediatric pulmonary arterial hypertension using both supervised and unsupervised approaches. Algorithms yielded by SEDFE achieved comparable accuracy to those yielded by SAFE and expert-curated features. SEDFE is also robust to the input semantic vectors.

CONCLUSION

SEDFE attains satisfying performance in unsupervised feature selection for EHR phenotyping. Both fully automated and EHR-independent, this method promises efficiency and accuracy in developing algorithms for high-throughput phenotyping.
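As a rough illustration of what "semantically closest" can mean in the METHODS paragraph above, the following Python sketch ranks candidate concept vectors by cosine similarity to a target phenotype vector. It is not the SEDFE algorithm: the linear decomposition criterion from the abstract is not reproduced, and the function names, concept names, and toy vectors are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_candidates(target, candidates):
    """Return (name, similarity) pairs sorted from most to least similar to target."""
    scored = [(name, cosine(target, vec)) for name, vec in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical 3-dimensional embeddings, for illustration only.
toy_target = [1.0, 0.0, 1.0]
toy_candidates = {
    "concept_a": [1.0, 0.0, 1.0],
    "concept_b": [1.0, 1.0, 0.0],
    "concept_c": [0.0, 1.0, 0.0],
}

ranking = rank_candidates(toy_target, toy_candidates)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In a real pipeline the vectors would come from a trained embedding model, and a cut-off rule would decide how many of the top-ranked concepts to keep.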

Bai T, Chanda AK, Egleston BL, Vucetic S. EHR phenotyping via jointly embedding medical concepts and words into a unified vector space. BMC Med Inform Decis Mak 2018;18:123. [PMID: 30537974 PMCID: PMC6290514 DOI: 10.1186/s12911-018-0672-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Indexed: 11/17/2022] Open

Jackson R, Kartoglu I, Stringer C, Gorrell G, Roberts A, Song X, Wu H, Agrawal A, Lui K, Groza T, Lewsley D, Northwood D, Folarin A, Stewart R, Dobson R. CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital. BMC Med Inform Decis Mak 2018;18:47. [PMID: 29941004 PMCID: PMC6020175 DOI: 10.1186/s12911-018-0623-9] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Received: 03/20/2017] [Accepted: 06/01/2018] [Indexed: 03/05/2023] Open

Affiliation(s)

Richard Jackson: Institute of Psychiatry, Psychology and Neuroscience, King’s College London, 16 De Crespigny Park, London, SE5 8AF, UK; South London and Maudsley NHS Foundation Trust, Denmark Hill, London, SE5 8AZ, UK
Ismail Kartoglu: InterDigital Communications, 64 Great Eastern Street, 1st Floor, London, EC2A 3QR, UK
Clive Stringer: King’s College Hospital, Denmark Hill, London, SE5 9RS, UK
Genevieve Gorrell: University of Sheffield, Western Bank, Sheffield, S10 2TN, UK
Angus Roberts: University of Sheffield, Western Bank, Sheffield, S10 2TN, UK
Xingyi Song: University of Sheffield, Western Bank, Sheffield, S10 2TN, UK
Honghan Wu: Institute of Psychiatry, Psychology and Neuroscience, King’s College London, 16 De Crespigny Park, London, SE5 8AF, UK; Usher Institute of Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, EH16 4UX, UK
Asha Agrawal: King’s College Hospital, Denmark Hill, London, SE5 9RS, UK
Kenneth Lui: Farr Institute of Health Informatics Research, UCL Institute of Health Informatics, University College London, London, WC1E 6BT, UK
Tudor Groza: Garvan Institute of Medical Research, Sydney, NSW 2010, Australia
Damian Lewsley: King’s College Hospital, Denmark Hill, London, SE5 9RS, UK
Doug Northwood: King’s College Hospital, Denmark Hill, London, SE5 9RS, UK
Amos Folarin: Institute of Psychiatry, Psychology and Neuroscience, King’s College London, 16 De Crespigny Park, London, SE5 8AF, UK; Farr Institute of Health Informatics Research, UCL Institute of Health Informatics, University College London, London, WC1E 6BT, UK
Robert Stewart: Institute of Psychiatry, Psychology and Neuroscience, King’s College London, 16 De Crespigny Park, London, SE5 8AF, UK; South London and Maudsley NHS Foundation Trust, Denmark Hill, London, SE5 8AZ, UK
Richard Dobson: Institute of Psychiatry, Psychology and Neuroscience, King’s College London, 16 De Crespigny Park, London, SE5 8AF, UK; Farr Institute of Health Informatics Research, UCL Institute of Health Informatics, University College London, London, WC1E 6BT, UK

Effective Identification of Similar Patients Through Sequential Matching over ICD Code Embedding. J Med Syst 2018;42:94. [PMID: 29644446 DOI: 10.1007/s10916-018-0951-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 02/14/2018] [Accepted: 03/26/2018] [Indexed: 10/17/2022]

Névéol A, Dalianis H, Velupillai S, Savova G, Zweigenbaum P. Clinical Natural Language Processing in languages other than English: opportunities and challenges. J Biomed Semantics 2018;9:12. [PMID: 29602312 PMCID: PMC5877394 DOI: 10.1186/s13326-018-0179-8] [Citation(s) in RCA: 91] [Impact Index Per Article: 15.2] [Received: 05/22/2017] [Accepted: 02/14/2018] [Indexed: 01/22/2023] Open

Bai T, Chanda AK, Egleston BL, Vucetic S. Joint Learning of Representations of Medical Concepts and Words from EHR Data. Proceedings. IEEE International Conference on Bioinformatics and Biomedicine 2017;2017:764-769. [PMID: 29375929 DOI: 10.1109/bibm.2017.8217752] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Indexed: 11/08/2022]

Assigning clinical codes with data-driven concept representation on Dutch clinical free text. J Biomed Inform 2017;69:118-127. [DOI: 10.1016/j.jbi.2017.04.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 02/26/2016] [Revised: 03/06/2017] [Accepted: 04/07/2017] [Indexed: 11/21/2022]

Wang Y, Wu S, Li D, Mehrabi S, Liu H. A Part-Of-Speech term weighting scheme for biomedical information retrieval. J Biomed Inform 2016;63:379-389. [PMID: 27593166 DOI: 10.1016/j.jbi.2016.08.026] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Received: 04/27/2016] [Revised: 08/30/2016] [Accepted: 08/31/2016] [Indexed: 11/24/2022]

Abstract

In the era of digitalization, information retrieval (IR), which retrieves and ranks documents from large collections according to users' search queries, has been widely applied in the biomedical domain. Building patient cohorts using electronic health records (EHRs) and searching literature for topics of interest are some IR use cases. Meanwhile, natural language processing (NLP), such as tokenization or Part-Of-Speech (POS) tagging, has been developed for processing clinical documents or biomedical literature. We hypothesize that NLP can be incorporated into IR to strengthen the conventional IR models. In this study, we propose two NLP-empowered IR models, POS-BoW and POS-MRF, which incorporate automatic POS-based term weighting schemes into bag-of-words (BoW) and Markov Random Field (MRF) IR models, respectively. In the proposed models, the POS-based term weights are iteratively calculated using a cyclic coordinate method in which a golden section line search is applied along each coordinate to optimize the objective function defined by mean average precision (MAP). In the empirical experiments, we used the data sets from the Medical Records track in the Text REtrieval Conference (TREC) 2011 and 2012 and the Genomics track in TREC 2004. The evaluation on TREC 2011 and 2012 Medical Records tracks shows that, for the POS-BoW models, the mean improvement rates for IR evaluation metrics, MAP, bpref, and P@10, are 10.88%, 4.54%, and 3.82%, compared to the BoW models; and for the POS-MRF models, these rates are 13.59%, 8.20%, and 8.78%, compared to the MRF models. Additionally, we experimentally verify that the proposed weighting approach is superior to the simple heuristic and frequency-based weighting approaches, and validate our POS category selection. Using the optimal weights calculated in this experiment, we tested the proposed models on the TREC 2004 Genomics track and obtained average improvement rates of 8.63% and 10.04% for POS-BoW and POS-MRF, respectively. These significant improvements verify the effectiveness of leveraging POS tagging for biomedical IR tasks.
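The optimization named in the abstract above, a cyclic coordinate method with a golden section line search along each coordinate, can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the objective here is a hypothetical smooth toy function standing in for MAP, which in the paper would require running retrieval over a test collection, and the weight bounds, sweep count, and all function names are assumptions made for the example.

```python
import math

INV_PHI = (math.sqrt(5) - 1) / 2  # 1/phi, about 0.618


def golden_section_max(f, lo, hi, tol=1e-6):
    """Locate a maximizer of a unimodal function f on the interval [lo, hi]."""
    a, b = lo, hi
    c = b - INV_PHI * (b - a)
    d = a + INV_PHI * (b - a)
    fc, fd = f(c), f(d)
    while b - a > tol:
        if fc > fd:
            # Maximum lies in [a, d]; reuse c as the new right probe.
            b, d, fd = d, c, fc
            c = b - INV_PHI * (b - a)
            fc = f(c)
        else:
            # Maximum lies in [c, b]; reuse d as the new left probe.
            a, c, fc = c, d, fd
            d = a + INV_PHI * (b - a)
            fd = f(d)
    return (a + b) / 2


def cyclic_coordinate_max(objective, weights, lo=0.0, hi=1.0, sweeps=20):
    """Optimize one weight at a time, cycling through all coordinates each sweep."""
    w = list(weights)
    for _ in range(sweeps):
        for i in range(len(w)):
            def along_axis(x, i=i):
                # Vary only coordinate i; keep the other weights fixed.
                return objective(w[:i] + [x] + w[i + 1:])
            w[i] = golden_section_max(along_axis, lo, hi)
    return w


def toy_objective(w):
    """Hypothetical stand-in for MAP: a concave function peaking at (0.7, 0.2)."""
    x, y = w[0] - 0.7, w[1] - 0.2
    return -(x * x + y * y + 0.5 * x * y)


best = cyclic_coordinate_max(toy_objective, [0.5, 0.5])
print([round(v, 3) for v in best])
```

Golden section search needs only function evaluations, not gradients, which suits a metric such as MAP that is computed from ranked lists rather than from a differentiable formula.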

Thompson P, Batista-Navarro RT, Kontonatsios G, Carter J, Toon E, McNaught J, Timmermann C, Worboys M, Ananiadou S. Text Mining the History of Medicine. PLoS One 2016;11:e0144717. [PMID: 26734936 PMCID: PMC4703377 DOI: 10.1371/journal.pone.0144717] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Received: 06/10/2015] [Accepted: 11/23/2015] [Indexed: 11/19/2022] Open

Abstract

Historical text archives constitute a rich and diverse source of information, which is becoming increasingly readily accessible, due to large-scale digitisation efforts. However, it can be difficult for researchers to explore and search such large volumes of data in an efficient manner. Text mining (TM) methods can help, through their ability to recognise various types of semantic information automatically, e.g., instances of concepts (places, medical conditions, drugs, etc.), synonyms/variant forms of concepts, and relationships holding between concepts (which drugs are used to treat which medical conditions, etc.). TM analysis allows search systems to incorporate functionality such as automatic suggestions of synonyms of user-entered query terms, exploration of different concepts mentioned within search results or isolation of documents in which concepts are related in specific ways. However, applying TM methods to historical text can be challenging, owing to differences and evolutions in vocabulary, terminology, language structure and style, compared to more modern text. In this article, we present our efforts to overcome the various challenges faced in the semantic analysis of published historical medical text dating back to the mid-19th century. Firstly, we used evidence from diverse historical medical documents from different periods to develop new resources that provide accounts of the multiple, evolving ways in which concepts, their variants and relationships amongst them may be expressed. These resources were employed to support the development of a modular processing pipeline of TM tools for the robust detection of semantic information in historical medical documents with varying characteristics. We applied the pipeline to two large-scale medical document archives covering wide temporal ranges as the basis for the development of a publicly accessible semantically-oriented search system. The novel resources are available for research purposes, while the processing pipeline and its modules may be used and configured within the Argo TM platform.
