1
|
Health care utilization and mortality for people with epilepsy during COVID-19: A population study. Epilepsia 2024; 65:1394-1405. [PMID: 38441332 DOI: 10.1111/epi.17920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 02/05/2024] [Accepted: 02/05/2024] [Indexed: 03/14/2024]
Abstract
OBJECTIVE This study was undertaken to characterize changes in health care utilization and mortality for people with epilepsy (PWE) during the COVID-19 pandemic. METHODS We performed a retrospective study using linked, individual-level, population-scale anonymized health data from the Secure Anonymised Information Linkage databank. We identified PWE living in Wales during the study "pandemic period" (January 1, 2020-June 30, 2021) and during a "prepandemic" period (January 1, 2016-December 31, 2019). We compared prepandemic health care utilization, status epilepticus, and mortality rates with corresponding pandemic rates for PWE and people without epilepsy (PWOE). We performed subgroup analyses on children (<18 years old), older people (>65 years old), those with intellectual disability, and those living in the most deprived areas. We used Poisson models to calculate adjusted rate ratios (RRs). RESULTS We identified 27 279 PWE who had significantly higher rates of hospital (50.3 visits/1000 patient months), emergency department (55.7), and outpatient attendance (172.4) when compared to PWOE (corresponding figures: 25.7, 25.2, and 87.0) in the prepandemic period. Hospital and epilepsy-related hospital admissions, and emergency department and outpatient attendances all reduced significantly for PWE (and all subgroups) during the pandemic period. RRs [95% confidence intervals (CIs)] for pandemic versus prepandemic periods were .70 [.69-.72], .77 [.73-.81], .78 [.77-.79], and .80 [.79-.81]. The corresponding rates also reduced for PWOE. New epilepsy diagnosis rates decreased during the pandemic compared with the prepandemic period (2.3/100 000/month cf. 3.1/100 000/month, RR = .73, 95% CI = .68-.78). Both all-cause deaths and deaths with epilepsy recorded on the death certificate increased for PWE during the pandemic (RR = 1.07, 95% CI = .997-1.145 and RR = 2.44, 95% CI = 2.12-2.81). When removing COVID deaths, RRs were .88 (95% CI = .81-.95) and 1.29 (95% CI = 1.08-1.53). Status epilepticus rates did not change significantly during the pandemic (RR = .95, 95% CI = .78-1.15). SIGNIFICANCE All-cause non-COVID deaths did not increase but non-COVID deaths associated with epilepsy did increase for PWE during the COVID-19 pandemic. The longer term effects of the decrease in new epilepsy diagnoses and health care utilization and increase in deaths associated with epilepsy need further research.
Collapse
|
2
|
Epilepsy and the risk of COVID-19-related hospitalization and death: A population study. Epilepsia 2024; 65:1383-1393. [PMID: 38441374 DOI: 10.1111/epi.17910] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 01/24/2024] [Accepted: 01/24/2024] [Indexed: 03/14/2024]
Abstract
OBJECTIVE People with epilepsy (PWE) may be at an increased risk of severe COVID-19. It is important to characterize this risk to inform PWE and for future health and care planning. We assessed whether PWE were at higher risk of being hospitalized with, or dying from, COVID-19. METHODS We performed a retrospective cohort study using linked, population-scale, anonymized electronic health records from the SAIL (Secure Anonymised Information Linkage) databank. This includes hospital admission and demographic data for the complete Welsh population (3.1 million) and primary care records for 86% of the population. We identified 27 279 PWE living in Wales during the study period (March 1, 2020 to June 30, 2021). Controls were identified using exact 5:1 matching (sex, age, and socioeconomic status). We defined COVID-19 deaths as having International Classification of Diseases, 10th Revision (ICD-10) codes for COVID-19 on death certificates or occurring within 28 days of a positive SARS-CoV-2 polymerase chain reaction (PCR) test. COVID-19 hospitalizations were defined as having a COVID-19 ICD-10 code for the reason for admission or occurring within 28 days of a positive SARS-CoV-2 PCR test. We recorded COVID-19 vaccinations and comorbidities known to increase the risk of COVID-19 hospitalization and death. We used Cox proportional hazard models to calculate hazard ratios. RESULTS There were 158 (.58%) COVID-19 deaths and 933 (3.4%) COVID-19 hospitalizations in PWE, and 370 (.27%) deaths and 1871 (1.4%) hospitalizations in controls. Hazard ratios for COVID-19 death and hospitalization in PWE compared to controls were 2.15 (95% confidence interval [CI] = 1.78-2.59) and 2.15 (95% CI = 1.94-2.37), respectively. Adjusted hazard ratios (adjusted for comorbidities) for death and hospitalization were 1.32 (95% CI = 1.08-1.62) and 1.60 (95% CI = 1.44-1.78). SIGNIFICANCE PWE are at increased risk of being hospitalized with, and dying from, COVID-19 when compared to age-, sex-, and deprivation-matched controls, even when adjusting for comorbidities. This may have implications for prioritizing future COVID-19 treatments and vaccinations for PWE.
Collapse
|
3
|
Natural language processing to automate a web-based model of care and modernize skin cancer multidisciplinary team meetings. Br J Surg 2024; 111:znad347. [PMID: 38198154 PMCID: PMC10782209 DOI: 10.1093/bjs/znad347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 08/23/2023] [Accepted: 10/07/2023] [Indexed: 01/11/2024]
Abstract
BACKGROUND Cancer multidisciplinary team (MDT) meetings are under intense pressure to reform given the rapidly rising incidence of cancer and national mandates for protocolized streaming of cases. The aim of this study was to validate a natural language processing (NLP)-based web platform to automate evidence-based MDT decisions for skin cancer with basal cell carcinoma as a use case. METHODS A novel and validated NLP information extraction model was used to extract perioperative tumour and surgical factors from histopathology reports. A web application with a bespoke application programming interface used data from this model to provide an automated clinical decision support system, mapped to national guidelines and generating a patient letter to communicate ongoing management. Performance was assessed against retrospectively derived recommendations by two independent and blinded expert clinicians. RESULTS There were 893 patients (1045 lesions) used to internally validate the model. High accuracy was observed when compared against human predictions, with an overall value of 0.92. Across all classifiers the virtual skin MDT was highly specific (0.96), while sensitivity was lower (0.72). CONCLUSION This study demonstrates the feasibility of a fully automated, virtual, web-based service model to host the skin MDT with good system performance. This platform could be used to support clinical decision-making during MDTs as 'human in the loop' approach to aid protocolized streaming. Future prospective studies are needed to validate the model in tumour types where guidelines are more complex.
Collapse
|
4
|
Revisiting basal cell carcinoma clinical margins: Leveraging natural language processing and multivariate analysis with updated Royal College of Pathologists histological reporting standards. J Plast Reconstr Aesthet Surg 2024; 88:443-451. [PMID: 38091687 DOI: 10.1016/j.bjps.2023.10.106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 10/02/2023] [Accepted: 10/18/2023] [Indexed: 01/02/2024]
Abstract
INTRODUCTION Data supporting the current British Association of Dermatologists guidelines for the management of basal cell carcinoma (BCC) are based on historic studies and do not consider the updated Royal College of Pathologists (RCPath) histological reporting standards. The aim of this study was to use natural language processing (NLP)-derived data and undertake a multivariate analysis with updated RCPath standards, providing a contemporary update on the excision margins required to achieve histological clearance in BCC. METHODS A validated NLP information extraction model was used to perform a rapid multi-centre, pan-specialty, consecutive retrospective analysis of BCCs, managed with surgical excision using a pre-determined clinical margin, over a 17-year period (2004-2021) at Swansea Bay University Health Board. Logistic regression assessed the relationship between the peripheral and deep margins and histological clearance. RESULTS We ran our NLP algorithm on 34,955 BCCs. Out of the 1447 BCCs that met the inclusion criteria, the peripheral margin clearance was not influenced by the BCC risk level (p = 0.670). A clinical peripheral margin of 6 mm achieved a 95% histological clearance rate (95% confidence interval [CI], 0.93-0.98). Tumour thickness inversely affected deep-margin histological clearance (OR 0.720, 95% CI, 0.525-0.991, p < 0.05). Depth level 2 had a 97% probability of achieving deep-margin histological clearance across all tumour thicknesses. CONCLUSION Updated RCPath reporting standards minimally impact the peripheral margin histological clearance in BCC. Larger clinical peripheral margins than those indicated by current guidelines may be necessary to achieve excision rates of ≥95%. These findings emphasise the need for continuous reassessment of clinical standards to enhance patient care.
Collapse
|
5
|
Genetic influences on epilepsy outcomes: A whole-exome sequencing and health care records data linkage study. Epilepsia 2023; 64:3099-3108. [PMID: 37643892 DOI: 10.1111/epi.17766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 08/25/2023] [Accepted: 08/28/2023] [Indexed: 08/31/2023]
Abstract
OBJECTIVE This study was undertaken to develop a novel pathway linking genetic data with routinely collected data for people with epilepsy, and to analyze the influence of rare, deleterious genetic variants on epilepsy outcomes. METHODS We linked whole-exome sequencing (WES) data with routinely collected primary and secondary care data and natural language processing (NLP)-derived seizure frequency information for people with epilepsy within the Secure Anonymised Information Linkage Databank. The study participants were adults who had consented to participate in the Swansea Neurology Biobank, Wales, between 2016 and 2018. DNA sequencing was carried out as part of the Epi25 collaboration. For each individual, we calculated the total number and cumulative burden of rare and predicted deleterious genetic variants and the total of rare and deleterious variants in epilepsy and drug metabolism genes. We compared these measures with the following outcomes: (1) no unscheduled hospital admissions versus unscheduled admissions for epilepsy, (2) antiseizure medication (ASM) monotherapy versus polytherapy, and (3) at least 1 year of seizure freedom versus <1 year of seizure freedom. RESULTS We linked genetic data for 107 individuals with epilepsy (52% female) to electronic health records. Twenty-six percent had unscheduled hospital admissions, and 70% were prescribed ASM polytherapy. Seizure frequency information was linked for 100 individuals, and 10 were seizure-free. There was no significant difference between the outcome groups in terms of the exome-wide and gene-based burden of rare and deleterious genetic variants. SIGNIFICANCE We successfully uploaded, annotated, and linked genetic sequence data and NLP-derived seizure frequency data to anonymized health care records in this proof-of-concept study. We did not detect a genetic influence on real-world epilepsy outcomes, but our study was limited by a small sample size. Future studies will require larger (WES) data to establish genetic variant contribution to epilepsy outcomes.
Collapse
|
6
|
Validating a novel natural language processing pathway for automated quality assurance in surgical oncology: incomplete excision rates of 34 955 basal cell carcinomas. Br J Surg 2023; 110:1072-1075. [PMID: 36935397 PMCID: PMC10416688 DOI: 10.1093/bjs/znad055] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 02/06/2023] [Indexed: 03/21/2023]
|
7
|
COVID-19 vaccination uptake in people with epilepsy in wales. Seizure 2023; 108:49-52. [PMID: 37080124 PMCID: PMC10076248 DOI: 10.1016/j.seizure.2023.04.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 04/03/2023] [Accepted: 04/05/2023] [Indexed: 04/09/2023] Open
Abstract
PURPOSE People with epilepsy (PWE) are at increased risk of severe COVID-19. Assessing COVID-19 vaccine uptake is therefore important. We compared COVID-19 vaccination uptake for PWE in Wales with a matched control cohort. METHODS We performed a retrospective, population, cohort study using linked, anonymised, Welsh electronic health records within the Secure Anonymised Information Linkage (SAIL) Databank (Welsh population=3.1 million).We identified PWE in Wales between 1st March 2020 and 31st December 2021 and created a control cohort using exact 5:1 matching (sex, age and socioeconomic status). We recorded 1st, 2nd and booster COVID-19 vaccinations. RESULTS There were 25,404 adults with epilepsy (127,020 controls). 23,454 (92.3%) had a first vaccination, 22,826 (89.9%) a second, and 17,797 (70.1%) a booster. Comparative figures for controls were: 112,334 (87.8%), 109,057 (85.2%) and 79,980 (62.4%).PWE had higher vaccination rates in all age, sex and socioeconomic subgroups apart from booster uptake in older subgroups. Vaccination rates were higher in older subgroups, women and less deprived areas for both cohorts. People with intellectual disability and epilepsy had higher vaccination rates when compared with controls with intellectual disability. CONCLUSIONS COVID-19 vaccination uptake for PWE in Wales was higher than that for a matched control group.
Collapse
|
8
|
Development and validation of an automated basal cell carcinoma histopathology information extraction system using natural language processing. Front Surg 2022; 9:870494. [PMID: 36439548 PMCID: PMC9683031 DOI: 10.3389/fsurg.2022.870494] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2022] [Accepted: 07/11/2022] [Indexed: 01/26/2024] Open
Abstract
Introduction Routinely collected healthcare data are a powerful research resource, but often lack detailed disease-specific information that is collected in clinical free text such as histopathology reports. We aim to use natural Language Processing (NLP) techniques to extract detailed clinical and pathological information from histopathology reports to enrich routinely collected data. Methods We used the general architecture for text engineering (GATE) framework to build an NLP information extraction system using rule-based techniques. During validation, we deployed our rule-based NLP pipeline on 200 previously unseen, de-identified and pseudonymised basal cell carcinoma (BCC) histopathological reports from Swansea Bay University Health Board, Wales, UK. The results of our algorithm were compared with gold standard human annotation by two independent and blinded expert clinicians involved in skin cancer care. Results We identified 11,224 items of information with a mean precision, recall, and F1 score of 86.0% (95% CI: 75.1-96.9), 84.2% (95% CI: 72.8-96.1), and 84.5% (95% CI: 73.0-95.1), respectively. The difference between clinician annotator F1 scores was 7.9% in comparison with 15.5% between the NLP pipeline and the gold standard corpus. Cohen's Kappa score on annotated tokens was 0.85. Conclusion Using an NLP rule-based approach for named entity recognition in BCC, we have been able to develop and validate a pipeline with a potential application in improving the quality of cancer registry data, supporting service planning, and enhancing the quality of routinely collected data for research.
Collapse
|
9
|
204 Idiopathic intracranial hypertension in Wales: population characterisation, epidemiological trends and healthcare utilisation. Journal of Neurology, Neurosurgery and Psychiatry 2022. [DOI: 10.1136/jnnp-2022-abn.233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
ObjectiveTo characterise the Welsh idiopathic intracranial hypertension (IIH) population, epidemiologi- cal trends and healthcare outcomes using routinely collected healthcare data.MethodsWe used primary and secondary care healthcare diagnostic codes within the Secure Anonymised Information Linkage databank to ascertain IIH cases and controls in a retrospective cohort study between 2003–2017. We validated IIH diagnosis codes using anonymised secondary care lists of IIH cases.ResultsWe analysed 35 million patient years of data (2003–2017). There were 1765 cases of IIH in 2017 (85% female). The prevalence and incidence of IIH in 2017 was 76/100,000 and 7.8/100,000, significantly increased from 2003 (prevalence=12/100,000, incidence=2.3/100,000). IIH prevalence is associated with socio-economic deprivation and increasing body mass index (BMI). 9% of people with IIH had CSF shunts with less than 0.2% having bariatric surgery. Unscheduled hospital admissions were significantly higher in the IIH cohort compared to controls; and also in IIH patients with CSF shunts compared to those without.ConclusionsIIH incidence and prevalence is increasing significantly, corresponding to population increases in BMI. This has important implications for healthcare professionals and policy makers given the comor- bidities, complications and increased healthcare utilisation and economic burden associated with IIH.lotif_miah@hotmail.com
Collapse
|
10
|
Markup: A Web-Based Annotation Tool Powered by Active Learning. Front Digit Health 2021; 3:598916. [PMID: 34713086 PMCID: PMC8521860 DOI: 10.3389/fdgth.2021.598916] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 06/16/2021] [Indexed: 11/13/2022] Open
Abstract
Across various domains, such as health and social care, law, news, and social media, there are increasing quantities of unstructured texts being produced. These potential data sources often contain rich information that could be used for domain-specific and research purposes. However, the unstructured nature of free-text data poses a significant challenge for its utilisation due to the necessity of substantial manual intervention from domain-experts to label embedded information. Annotation tools can assist with this process by providing functionality that enables the accurate capture and transformation of unstructured texts into structured annotations, which can be used individually, or as part of larger Natural Language Processing (NLP) pipelines. We present Markup (https://www.getmarkup.com/) an open-source, web-based annotation tool that is undergoing continued development for use across all domains. Markup incorporates NLP and Active Learning (AL) technologies to enable rapid and accurate annotation using custom user configurations, predictive annotation suggestions, and automated mapping suggestions to both domain-specific ontologies, such as the Unified Medical Language System (UMLS), and custom, user-defined ontologies. We demonstrate a real-world use case of how Markup has been used in a healthcare setting to annotate structured information from unstructured clinic letters, where captured annotations were used to build and test NLP applications.
Collapse
|
11
|
Incidence, Prevalence, and Health Care Outcomes in Idiopathic Intracranial Hypertension: A Population Study. Neurology 2021; 96:e1251-e1261. [PMID: 33472926 PMCID: PMC8055349 DOI: 10.1212/wnl.0000000000011463] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Accepted: 10/23/2020] [Indexed: 11/15/2022] Open
Abstract
OBJECTIVE To characterize trends in incidence, prevalence, and health care outcomes in the idiopathic intracranial hypertension (IIH) population in Wales using routinely collected health care data. METHODS We used and validated primary and secondary care IIH diagnosis codes within the Secure Anonymised Information Linkage databank to ascertain IIH cases and controls in a retrospective cohort study between 2003 and 2017. We recorded body mass index (BMI), deprivation quintile, CSF diversion surgery, and unscheduled hospital admissions in case and control cohorts. RESULTS We analyzed 35 million patient-years of data. There were 1,765 cases of IIH in 2017 (85% female). The prevalence and incidence of IIH in 2017 was 76/100,000 and 7.8/100,000/y, a significant increase from 2003 (corresponding figures = 12/100,000 and 2.3/100,000/y) (p < 0.001). IIH prevalence is associated with increasing BMI and increasing deprivation. The odds ratio for developing IIH in the least deprived quintile compared to the most deprived quintile, adjusted for sex and BMI, was 0.65 (95% confidence interval 0.55 to 0.76). Nine percent of IIH cases had CSF shunts with less than 0.2% having bariatric surgery. Unscheduled hospital admissions were higher in the IIH cohort compared to controls (rate ratio 5.28, p < 0.001) and in individuals with IIH and CSF shunts compared to those without shunts (rate ratio 2.02, p < 0.01). CONCLUSIONS IIH incidence and prevalence is increasing considerably, corresponding to population increases in BMI, and is associated with increased deprivation. This has important implications for health care professionals and policy makers given the comorbidities, complications, and increased health care utilization associated with IIH.
Collapse
|
12
|
Idiopathic Intracranial Hypertension in Wales. Int J Popul Data Sci 2020. [DOI: 10.23889/ijpds.v5i5.1633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
Abstract
IntroductionIdiopathic Intracranial Hypertension (IIH) is a condition of unknown aetiology that is strongly associated with obesity. IIH predominantly affects women of childbearing age and causes chronic disabling headaches, visual disturbance and, in a minority of patients, permanent visual loss.
Objectives and ApproachWe characterised the IIH population, epidemiological trends and healthcare outcomes in Wales using routinely collected healthcare data. We used primary and secondary care healthcare diagnosis codes within the Secure Anonymised Information Linkage databank to ascertain IIH cases and controls in a retrospective cohort study between 2003 and 2017. We validated IIH diagnosis codes using anonymised secondary care lists of IIH cases.
ResultsWe analysed 35 million patient years of data (2003–2017). There were 1765 cases of IIH in 2017 (85% female). The prevalence and incidence of IIH in 2017 was 76/100,000 and 7.8/100,000/year, a significant increase from 2003 (corresponding figures=12/100,000 and 2.3/100,000/year). The odds ratio for developing IIH in the least deprived quintile compared to the most deprived quintile, adjusted for gender and Body Mass Index (BMI), was 0.65 (95% CI=0.55–0.76). 9% of people with IIH had CSF shunts with less than 0.2% having bariatric surgery. Unscheduled hospital admissions were significantly higher in the IIH cohort compared to controls (rate ratio=5.28, p<0.001) and in the people with IIH with CSF shunts compared to those without (rate ratio=2.02, p<0.01).
Conclusion / ImplicationsIIH incidence and prevalence is increasing significantly, corresponding to population increases in BMI, and is associated with increased deprivation. This has important implications for healthcare professionals and policy makers given the comorbidities, complications and increased healthcare utilisation associated with IIH.
Collapse
|
13
|
Markup: A Web-Based Clinical Annotation Tool with Enhanced Ontology Mapping. Int J Popul Data Sci 2020. [DOI: 10.23889/ijpds.v5i5.1634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
Abstract
IntroductionUnstructured free-text clinical notes often contain valuable information relating to patient symptoms, prescriptions and diagnoses. These can assist with better care for patients and novel healthcare research if transformed into accessible, structured clinical text. In particular, Natural Language Processing (NLP) algorithms can produce such structured outputs, but require gold standard data to train and validate their accuracy. While existing tools such as Brat and Webanno provide interfaces to manually annotate text, there is a lack of capability to efficiently annotate complex clinical information.
Objectives and ApproachWe present Markup, an open-source, web-based annotation tool developed for use within clinical contexts by domain experts to produce gold standard annotations for NLP development. Markup incorporates NLP and Active Learning technologies to enable rapid and accurate annotation of unstructured documents. Markup supports custom user configurations, automated annotation suggestions, and automated mapping to existing clinical ontologies such as the Unified Medical Language System (UMLS), the Systematized Nomenclature of Medicine - Clinical Terms (SNOMED-CT), or custom, user-defined ontologies.
ResultsMarkup has been tested on Epilepsy clinic letters, where captured annotations were used to build and test NLP applications. Markup allowed for inter-annotator statistics to be calculated in the case of multiple annotators. Re-annotation, following iterations of annotation definitions, was incorporated for flexibility. UMLS codes, certainty context, and multiple components from complex phrases were all able to be captured and exported in a structured format.
Conclusions / ImplicationsMarkup allows gold standard annotations to be collected efficiently across unstructured text and is optimized to capture health-specific information. These annotations are important to develop and validate NLP algorithms that automate the capture of important information from clinic letters at scale.
Collapse
|