1
|
Shahbaz M, Harding JE, Milne B, Walters A, Underwood L, von Randow M, Xu L, Gamble GD. Comparison of outcomes of the 50-year follow-up of a randomized trial assessed by study questionnaire and by data linkage: The CONCUR study. Clin Trials 2025; 22:24-35. [PMID: 38907609 PMCID: PMC11809116 DOI: 10.1177/17407745241259088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/24/2024]
Abstract
BACKGROUND/AIMS Self-reported questionnaires on health status after randomized trials can be time-consuming, costly, and potentially unreliable. Administrative data sets may provide cost-effective, less biased information, but it is uncertain how administrative and self-reported data compare to identify chronic conditions in a New Zealand cohort. This study aimed to determine whether record linkage could replace self-reported questionnaires to identify chronic conditions that were the outcomes of interest for trial follow-up. METHODS Participants in 50-year follow-up of a randomized trial were asked to complete a questionnaire and to consent to accessing administrative data. The proportion of participants with diabetes, pre-diabetes, hyperlipidaemia, hypertension, mental health disorders, and asthma was calculated using each data source and agreement between data sources assessed. RESULTS Participants were aged 49 years (SD = 1, n = 424, 50% male). Agreement between questionnaire and administrative data was slight for pre-diabetes (kappa = 0.10), fair for hyperlipidaemia (kappa = 0.27), substantial for diabetes (kappa = 0.65), and moderate for other conditions (all kappa >0.42). Administrative data alone identified two to three times more cases than the questionnaire for all outcomes except hypertension and mental health disorders, where the questionnaire alone identified one to two times more cases than administrative data. Combining all sources increased case detection for all outcomes. CONCLUSIONS A combination of questionnaire, pharmaceutical, and laboratory data with expert panel review were required to identify participants with chronic conditions of interest in this follow-up of a clinical trial.
Collapse
Affiliation(s)
- Mohammad Shahbaz
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Jane E Harding
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Barry Milne
- Centre of Methods and Policy Application in Social Sciences, The University of Auckland, Auckland, New Zealand
| | - Anthony Walters
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Lisa Underwood
- Centre of Methods and Policy Application in Social Sciences, The University of Auckland, Auckland, New Zealand
| | - Martin von Randow
- Centre of Methods and Policy Application in Social Sciences, The University of Auckland, Auckland, New Zealand
| | - Lois Xu
- Centre of Methods and Policy Application in Social Sciences, The University of Auckland, Auckland, New Zealand
| | - Greg D Gamble
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | | |
Collapse
|
2
|
Shahbaz M, Harding JE, Milne B, Walters A, Underwood L, von Randow M, Jacob L, Gamble GD. Time and cost of linking administrative datasets for outcomes assessment in a follow-up study of participants from two randomised trials. BMC Med Res Methodol 2025; 25:21. [PMID: 39871155 PMCID: PMC11771019 DOI: 10.1186/s12874-025-02458-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Accepted: 01/03/2025] [Indexed: 01/29/2025] Open
Abstract
BACKGROUND For the follow-up of participants in randomised trials, data linkage is thought a more cost-efficient method for assessing outcomes. However, researchers often encounter technical and budgetary challenges. Data requests often require a significant amount of information from researchers, and can take several years to process. This study aimed to determine the feasibility, direct costs and the total time required to access administrative datasets for assessment of outcomes in a follow-up study of two randomised trials. METHODS We applied to access administrative datasets from New Zealand government agencies. All actions of study team members, along with their corresponding dates, were recorded prospectively for accessing data from each agency. Team members estimated the average time they spent on each action, and invoices from agencies were recorded. Additionally, we compared the estimated costs and time required for data linkage with those for obtaining self-reported questionnaires and conducting in-person assessments. RESULTS Eight agencies were approached to supply data, of which seven gave approval. The time from first enquiry to receiving an initial dataset ranged from 96 to 854 days. For 859 participants, the estimated time required to obtain outcome data from agencies was 1,530 min; to obtain completed self-reported questionnaires was 11,025 min; and to complete in-person assessments was 77,310 min. The estimated total costs were 20,827 NZD for data linkage, 11,735 NZD for self-reported questionnaires, and 116,085 NZD for in-person assessments. Using this data, we estimate that for a cohort of 100 participants, the costs would be similar for data linkage and in-person assessments. For a cohort of 5,000 participants, we estimate that costs would be similar for data linkage and questionnaires, but ten-fold higher for in-person assessments. CONCLUSIONS Obtaining administrative datasets demands a substantial amount of time and effort. However, data linkage is a feasible method for outcome ascertainment in follow-up studies in New Zealand. For large cohorts, data linkage is likely to be less costly, whereas for small cohorts, in-person assessment has similar costs but is likely to be faster and allows direct assessment of outcomes.
Collapse
Affiliation(s)
- Mohammad Shahbaz
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Jane E Harding
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Barry Milne
- Centre of Methods and Policy Application in Social Sciences, University of Auckland, Auckland, New Zealand
| | - Anthony Walters
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Lisa Underwood
- Centre of Methods and Policy Application in Social Sciences, University of Auckland, Auckland, New Zealand
| | - Martin von Randow
- Centre of Methods and Policy Application in Social Sciences, University of Auckland, Auckland, New Zealand
| | - Lena Jacob
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Greg D Gamble
- Liggins Institute, The University of Auckland, Auckland, New Zealand.
| |
Collapse
|
3
|
Mumford L, Hogg R, Taylor A, Lanyon P, Bythell M, McPhail S, Chilcot J, Powter G, Cooke GS, Ward H, Thomas H, McAdoo SP, Lightstone L, Lim SH, Pettigrew GJ, Pearce FA, Willicombe M. Impact of SARS-CoV-2 spike antibody positivity on infection and hospitalisation rates in immunosuppressed populations during the omicron period: the MELODY study. Lancet 2025; 405:314-328. [PMID: 39863371 DOI: 10.1016/s0140-6736(24)02560-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/11/2024] [Revised: 11/12/2024] [Accepted: 11/21/2024] [Indexed: 01/27/2025]
Abstract
BACKGROUND In the UK, booster COVID-19 vaccinations have been recommended biannually to people considered immune vulnerable. We investigated, at a population level, whether the absence of detectable anti-SARS-CoV-2 spike protein IgG antibody (anti-S Ab) following three or more vaccinations in immunosuppressed individuals was associated with greater risks of infection and severity of infection. METHODS In this prospective cohort study using UK national disease registers, we recruited participants with solid organ transplants (SOTs), rare autoimmune rheumatic diseases (RAIRDs), and lymphoid malignancies. All participants were tested for anti-S Ab using a lateral flow immunoassay, completed a questionnaire on sociodemographic and clinical characteristics, and were followed up for 6 months using linked data from the National Health Service in England. SARS-CoV-2 infection was primarily defined using UK Health Security Agency data and supplemented with hospitalisation and therapeutics data, and hospitalisation due to SARS-CoV-2 was defined as an admission within 14 days of a positive test. FINDINGS Between Dec 7, 2021, and June 26, 2022, we recruited 21 575 participants. Anti-S Ab was detected in 6519 (77·0%) of 8466 participants with SOTs, 5594 (85·9%) of 6516 with RAIRDs, and 5227 (79·3%) of 6593 with lymphoid malignancies. COVID-19 infection was recorded in 3907 (18·5%) participants, with 556 requiring a COVID-19-related hospital admission and 17 dying within 28 days of infection. Rates of infection varied by sociodemographic and clinical characteristics but, in adjusted analysis, having detectable anti-S Ab was independently associated with a reduced incidence of infection, with incident rate ratios (IRRs) of 0·69 (95% CI 0·65-0·73) in the SOT cohort, 0·57 (0·49-0·67) in the RAIRD cohort, and 0·62 (0·54-0·71) in the lymphoid malignancy cohort. In adjusted analysis, having detectable anti-S Ab was also associated with a reduced incidence of hospitalisation, with IRRs of 0·40 (0·35-0·46) in the SOT cohort, 0·32 (0·22-0·46) in the RAIRD cohort, and 0·41 (0·29-0·58) in the lymphoid malignancy cohort. INTERPRETATION All people with immunosuppression require ongoing access to COVID-19 protection strategies. Assessment of anti-S Ab responses, which can be performed at scale, can identify people with immunosuppression who remain most at risk, providing a mechanism to further individualise protection approaches. FUNDING UK Research and Innovation, Kidney Research UK, Blood Cancer UK, Vasculitis UK, and Cystic Fibrosis Trust.
Collapse
Affiliation(s)
- Lisa Mumford
- Statistics and Clinical Research, NHS Blood and Transplant, Bristol, UK
| | - Rachel Hogg
- Statistics and Clinical Research, NHS Blood and Transplant, Bristol, UK
| | - Adam Taylor
- Department of Rheumatology, Nottingham University Hospitals NHS Trust, Nottingham, UK
| | - Peter Lanyon
- Department of Rheumatology, Nottingham University Hospitals NHS Trust, Nottingham, UK; National Disease Registration Service, NHS England, Leeds, UK; Lifespan and Population Health, School of Medicine, University of Nottingham, Nottingham, UK
| | - Mary Bythell
- National Disease Registration Service, NHS England, Leeds, UK
| | - Sean McPhail
- National Disease Registration Service, NHS England, Leeds, UK
| | - Joseph Chilcot
- Institute of Psychiatry, Psychology & Neuroscience, King's College London, UK
| | - Gillian Powter
- NHS Blood and Transplant Clinical Trials Unit, Oxford, UK
| | - Graham S Cooke
- Department of Infectious Disease, Imperial College London, London, UK
| | - Helen Ward
- Department of Infectious Disease, Imperial College London, London, UK; School of Public Health, Imperial College London, London, UK
| | - Helen Thomas
- Statistics and Clinical Research, NHS Blood and Transplant, Bristol, UK
| | - Stephen P McAdoo
- Centre for Inflammatory Disease, Department of Immunology and Inflammation, Imperial College London, London, UK; Imperial College Renal and Transplant Centre, Imperial College Healthcare NHS Trust, Hammersmith Hospital, London, UK
| | - Liz Lightstone
- Centre for Inflammatory Disease, Department of Immunology and Inflammation, Imperial College London, London, UK; Imperial College Renal and Transplant Centre, Imperial College Healthcare NHS Trust, Hammersmith Hospital, London, UK
| | - Sean H Lim
- Centre for Cancer Immunology, University of Southampton, Southampton, UK
| | - Gavin J Pettigrew
- Department of Surgery, University of Cambridge, Addenbrooke's Hospital, Cambridge, UK
| | - Fiona A Pearce
- Department of Rheumatology, Nottingham University Hospitals NHS Trust, Nottingham, UK; National Disease Registration Service, NHS England, Leeds, UK; Lifespan and Population Health, School of Medicine, University of Nottingham, Nottingham, UK
| | - Michelle Willicombe
- Centre for Inflammatory Disease, Department of Immunology and Inflammation, Imperial College London, London, UK; Imperial College Renal and Transplant Centre, Imperial College Healthcare NHS Trust, Hammersmith Hospital, London, UK.
| |
Collapse
|
4
|
Gordon AL, Rand S, Crellin E, Allan S, Tracey F, De Corte K, Lloyd T, Brine R, Carroll RE, Towers AM, Burton JK, Akdur G, Hanratty B, Webster L, Palmer S, Jones L, Meyer J, Spilsbury K, Killett A, Wolters AT, Peryer G, Goodman C. Piloting a minimum data set for older people living in care homes in England: a developmental study. Age Ageing 2025; 54:afaf001. [PMID: 39812411 PMCID: PMC11733825 DOI: 10.1093/ageing/afaf001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Accepted: 01/02/2025] [Indexed: 01/16/2025] Open
Abstract
BACKGROUND We developed a prototype minimum data set (MDS) for English care homes, assessing feasibility of extracting data directly from digital care records (DCRs) with linkage to health and social care data. METHODS Through stakeholder development workshops, literature reviews, surveys and public consultation, we developed an aspirational MDS. We identified ways to extract this from existing sources, including DCRs and routine health and social care datasets. To address gaps, we added validated measures of delirium, cognitive impairment, functional independence and quality of life to DCR software. Following routine health and social care data linkage to DCRs, we compared variables recorded across multiple data sources, using a hierarchical approach to reduce missingness where appropriate. We reported proportions of missingness, mean and standard deviation (SD) or frequencies (%) for all variables. RESULTS We recruited 996 residents from 45 care homes in three English Integrated Care Systems. 727 residents had data included in the MDS. Additional data were well completed (<35% missingness at wave 1). Competition for staff time, staff attrition and software-related implementation issues contributed to missing DCR data. Following data linkage and combining variables where appropriate, missingness was reduced (≤4% where applicable). DISCUSSION Integration of health and social care is predicated on access to data and interoperability. Despite governance challenges we safely linked care home DCRs to statutory health and social care datasets to create a viable prototype MDS for English care homes. We identified issues around data quality, governance, data plurality and data completion essential to MDS implementation going forward.
Collapse
Affiliation(s)
- Adam L Gordon
- Wolfson Institute of Population Health, Queen Mary University of London, London, UK
- Academic Centre for Healthy Ageing, Barts Health NHS Trust, London, UK
| | - Stacey Rand
- Personal Social Services Research Unit (PSSRU), University of Kent, Canterbury, Kent, UK
| | | | - Stephen Allan
- Personal Social Services Research Unit (PSSRU), University of Kent, Canterbury, Kent, UK
| | - Freya Tracey
- Improvement Analytics Unit, the Health Foundation, London, UK
| | - Kaat De Corte
- Improvement Analytics Unit, the Health Foundation, London, UK
| | - Therese Lloyd
- Improvement Analytics Unit, the Health Foundation, London, UK
| | | | - Rachael E Carroll
- Academic Unit of Injury, Recovery and Inflammation Sciences, School of Medicine, University of Nottingham, Nottingham NG7 2UH, UK
- NIHR Applied Research Collaboration-East Midlands (ARC-EM), Nottingham, UK
| | - Ann-Marie Towers
- Health and Social Care Workforce Research Unit, Kings College London, London, UK
| | - Jennifer Kirsty Burton
- School of Cardiovascular and Metabolic Health, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK
| | - Gizdem Akdur
- Centre for Research in Public Health and Community Care (CRIPACC), University of Hertfordshire, College Lane, Hatfield, UK
| | - Barbara Hanratty
- Population Health Sciences Institute, Newcastle University, Newcastle upon Tyne, UK
| | - Lucy Webster
- Academic Unit of Injury, Recovery and Inflammation Sciences, School of Medicine, University of Nottingham, Nottingham NG7 2UH, UK
| | - Sinead Palmer
- Personal Social Services Research Unit (PSSRU), University of Kent, Canterbury, Kent, UK
| | | | - Julienne Meyer
- School of Health and Psychological Sciences, City University of London, London, UK
| | - Karen Spilsbury
- School of Healthcare, Faculty of Medicine and Health, University of Leeds, Leeds, UK
- NIHR Applied Research Collaboration Yorkshire and Humber (YHARC), Bradford, UK
| | - Anne Killett
- School of Health Sciences, University of East Anglia, Norwich, Norfolk,UK
| | - Arne T Wolters
- Cumbria, Northumberland, Tyne and Wear NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Guy Peryer
- School of Health Sciences, University of East Anglia, Norwich, Norfolk,UK
| | - Claire Goodman
- Centre for Research in Public Health and Community Care (CRIPACC), University of Hertfordshire, College Lane, Hatfield, UK
- NIHR Applied Research Collaboration-East of England (ARC-EoE), Cambridge, UK
| |
Collapse
|
5
|
Stubbs E, Exley J, Wittenberg R, Mays N. How to establish and sustain a disease registry: insights from a qualitative study of six disease registries in the UK. BMC Med Inform Decis Mak 2024; 24:361. [PMID: 39604990 PMCID: PMC11603652 DOI: 10.1186/s12911-024-02775-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Accepted: 11/19/2024] [Indexed: 11/29/2024] Open
Abstract
BACKGROUND The advent of new chronic conditions such as long COVID-19 raises the question of whether and, if so, how best to establish new disease registries for such conditions. Prompted by the potential need for a long COVID-19 registry, we examined experiences of existing UK disease registries to understand barriers and enablers to establishing and sustaining a register, and how these have changed over time. METHODS We undertook semi-structured interviews between November 2022 and April 2023 with individuals representing six disease registries that collect individual-level longitudinal data on people diagnosed with a chronic condition. RESULTS Registries examined were developed by a few individuals, usually clinicians, to gain a greater understanding of the disease. Patient voices were largely absent from initial agenda setting processes, but, over time, all registries sought to increase patient involvement. Securing long-term funding was cited as the biggest challenge; due to limited funds, one of the registries examined no longer actively recruits patients. Charities devoted to the diseases in question were key funders, though most registries also sought commercial opportunities. Inclusion on the NIHR Clinical Research Network Portfolio was also considered a vital resource to support recruitment and follow-up of participants. All registries have sought to minimise the primary data collected to reduce the burden on clinicians and patients, increasingly relying on linkage to other data sources. Several registries have developed consent procedures that enable participants to be contacted for additional data collection. In some cases, the initial patient consent and data sharing permissions obtained had limited the flexibility to adapt the registry to changing data needs. Finally, there was a need to foster buy-in from the community of patients and clinicians who provide and/or use the data. CONCLUSION We identified six key considerations when establishing a sustainable disease registry: (1) include a diverse set of stakeholders; (2) involve patients at every stage; (3) collect a core data set for all participants; (4) ensure the data system is flexible and interoperable with the wider data landscape; (5) anticipate changing data needs over time; and (6) identify financial opportunities to sustain the registry's activities for the long term.
Collapse
Affiliation(s)
- Edmund Stubbs
- Policy Innovation and Evaluation Research Unit, Care Policy and Evaluation Centre, London School of Economics and Political Science, Houghton Street, London, WC2A 2AE, UK
| | - Josephine Exley
- Department of Health Services Research and Policy, Policy Innovation and Evaluation Research Unit, London School of Hygiene & Tropical Medicine, 15-17 Tavistock Place, London, WC1H 9SH, UK
| | - Raphael Wittenberg
- Policy Innovation and Evaluation Research Unit, Care Policy and Evaluation Centre, London School of Economics and Political Science, Houghton Street, London, WC2A 2AE, UK
| | - Nicholas Mays
- Department of Health Services Research and Policy, Policy Innovation and Evaluation Research Unit, London School of Hygiene & Tropical Medicine, 15-17 Tavistock Place, London, WC1H 9SH, UK.
| |
Collapse
|
6
|
Shahbaz M, Harding JE, Milne B, Walters A, von Randow M, Gamble GD. Effect of utilizing either a self-reported questionnaire or administrative data alone or in combination on the findings of a randomized controlled trial of the long-term effects of antenatal corticosteroids. PLoS One 2024; 19:e0308414. [PMID: 39110714 PMCID: PMC11305536 DOI: 10.1371/journal.pone.0308414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Accepted: 07/23/2024] [Indexed: 08/10/2024] Open
Abstract
INTRODUCTION A combination of self-reported questionnaire and administrative data could potentially enhance ascertainment of outcomes and alleviate the limitations of both in follow up studies. However, it is uncertain how access to only one of these data sources to assess outcomes impact study findings. Therefore, this study aimed to determine whether the study findings would be altered if the outcomes were assessed by different data sources alone or in combination. METHODS At 50-year follow-up of participants in a randomized trial, we assessed the effect of antenatal betamethasone exposure on the diagnosis of diabetes, pre-diabetes, hyperlipidemia, hypertension, mental health disorders, and asthma using a self-reported questionnaire, administrative data, a combination of both, or any data source, with or without adjudication by an expert panel of five clinicians. Differences between relative risks derived from each data source were calculated using the Bland-Altman approach. RESULTS There were 424 participants (46% of those eligible, aged 49 years, SD 1, 50% male). There were no differences in study outcomes between participants exposed to betamethasone and those exposed to placebo when the outcomes were assessed using different data sources. When compared to the study findings determined using adjudicated outcomes, the mean difference (limits of agreement) in relative risks derived from other data sources were: self-reported questionnaires 0.02 (-0.35 to 0.40), administrative data 0.06 (-0.32 to 0.44), both questionnaire and administrative data 0.01 (-0.41 to 0.43), and any data source, 0.01 (-0.08 to 0.10). CONCLUSION Utilizing a self-reported questionnaire, administrative data, both questionnaire and administrative data, or any of these sources for assessing study outcomes had no impact on the study findings compared with when study outcomes were assessed using adjudicated outcomes.
Collapse
Affiliation(s)
- Mohammad Shahbaz
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Jane E. Harding
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Barry Milne
- Centre of Methods and Policy Application in Social Sciences, University of Auckland, Auckland, New Zealand
| | - Anthony Walters
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | - Martin von Randow
- Centre of Methods and Policy Application in Social Sciences, University of Auckland, Auckland, New Zealand
| | - Greg D. Gamble
- Liggins Institute, The University of Auckland, Auckland, New Zealand
| | | |
Collapse
|
7
|
Morris A. Unlocking the power of NHS research: a priority for the new UK Government. Lancet 2024; 404:317-320. [PMID: 39008995 DOI: 10.1016/s0140-6736(24)01451-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Accepted: 07/10/2024] [Indexed: 07/17/2024]
|
8
|
Cavallaro F, Clery A, Gilbert R, van der Meulen J, Kendall S, Kennedy E, Phillips C, Harron K. Evaluating the real-world implementation of the Family Nurse Partnership in England: a data linkage study. HEALTH AND SOCIAL CARE DELIVERY RESEARCH 2024; 12:1-223. [PMID: 38784984 DOI: 10.3310/bvdw6447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
Background/objectives The Family Nurse Partnership is an intensive home visiting programme for adolescent mothers. We aimed to evaluate the effectiveness of the Family Nurse Partnership on outcomes up to age 7 using national administrative data. Design We created a linked cohort of all mothers aged 13-19 using data from health, educational and children's social care and defined mothers enrolled in the Family Nurse Partnership or not using Family Nurse Partnership system data. Propensity scores were used to create matched groups for analysis. Setting One hundred and thirty-six local authorities in England with active Family Nurse Partnership sites between 2010 and 2017. Participants Mothers aged 13-19 at last menstrual period with live births between April 2010 and March 2019, living in a Family Nurse Partnership catchment area and their firstborn child(ren). Interventions The Family Nurse Partnership includes up to 64 home visits by a family nurse from early pregnancy until the child's second birthday and is combined with usual health and social care. Controls received usual health and social care. Main outcome measures Indicators of child maltreatment (hospital admissions for injury/maltreatment, referral to social care services); child health and development (hospital utilisation and education) outcomes and maternal hospital utilisation and educational outcomes up to 7 years following birth. Data sources Family Nurse Partnership Information System, Hospital Episode Statistics, National Pupil Database. Results Of 110,520 eligible mothers, 25,680 (23.2%) were enrolled in the Family Nurse Partnership. Enrolment rates varied across 122 sites (range: 11-68%). Areas with more eligible mothers had lower enrolment rates. Enrolment was higher among mothers aged 13-15 (52%), than 18-19 year-olds (21%). Indicators of child maltreatment: we found no evidence of an association between the Family Nurse Partnership and indicators of child maltreatment, except for an increased rate of unplanned admissions for maltreatment/injury-related diagnoses up to age 2 for children born to Family Nurse Partnership mothers (6.6% vs. 5.7%, relative risk 1.15; 95% confidence interval 1.07 to 1.24). Child health and developmental outcomes: there was weak evidence that children born to Family Nurse Partnership mothers were more likely to achieve a Good Level of Development at age 5 (57.5% vs. 55.4%, relative risk 1.05; 95% confidence interval 1.00 to 1.09). Maternal outcomes: There was some evidence that Family Nurse Partnership mothers were less likely to have a subsequent delivery within 18 months of the index birth (8.4% vs. 9.3%, relative risk 0.92; 95% confidence interval 0.88 to 0.97). Younger and more vulnerable mothers received higher numbers of visits and were more likely to achieve fidelity targets. Meeting the fidelity targets was associated with some outcomes. Limitations Bias by indication and variation in the intervention and usual care over time and between areas may have limited our ability to detect effects. Multiple testing may have led to spurious, significant results. Conclusions This study supports findings from evaluations of the Family Nurse Partnership showing no evidence of benefit for maltreatment outcomes measured in administrative data. Amongst all the outcomes measured, we found weak evidence that the Family Nurse Partnership was associated with improvements in child development at school entry, a reduction in rapid repeat pregnancies and evidence of increased healthcare-seeking in the mother and child. Future work Future evaluations should capture better measures of Family Nurse Partnership interventions and usual care, more information on maternal risk factors and additional outcomes relating to maternal well-being. Study registration The study is registered as NIHR CRN Portfolio (42900). Funding This award was funded by the National Institute of Health and Care Research (NIHR) Health and Social Care Delivery Research programme (NIHR award ref: 17/99/19) and is published in full in Health and Social Care Delivery Research; Vol. 12, No. 11. See the NIHR Funding and Awards website for further award information.
Collapse
Affiliation(s)
| | - Amanda Clery
- UCL Great Ormond Street Institute of Child Health, London, UK
| | - Ruth Gilbert
- UCL Great Ormond Street Institute of Child Health, London, UK
| | - Jan van der Meulen
- UCL Great Ormond Street Institute of Child Health, London, UK
- London School of Hygiene and Tropical Medicine, London, UK
| | - Sally Kendall
- UCL Great Ormond Street Institute of Child Health, London, UK
- Centre for Health Services Studies, University of Kent, Canterbury, UK
| | - Eilis Kennedy
- UCL Great Ormond Street Institute of Child Health, London, UK
- Eilis Kennedy, Tavistock and Portman NHS Foundation Trust, London, UK
| | - Catherine Phillips
- UCL Great Ormond Street Institute of Child Health, London, UK
- Centre for Health Services Studies, University of Kent, Canterbury, UK
| | - Katie Harron
- UCL Great Ormond Street Institute of Child Health, London, UK
| |
Collapse
|
9
|
Goodacre S, Sutton L, Ennis K, Thomas B, Hawksworth O, Iftikhar K, Croft SJ, Fuller G, Waterhouse S, Hind D, Stevenson M, Bradburn MJ, Smyth M, Perkins GD, Millins M, Rosser A, Dickson J, Wilson M. Prehospital early warning scores for adults with suspected sepsis: the PHEWS observational cohort and decision-analytic modelling study. Health Technol Assess 2024; 28:1-93. [PMID: 38551135 PMCID: PMC11017155 DOI: 10.3310/ndty2403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/02/2024] Open
Abstract
Background Guidelines for sepsis recommend treating those at highest risk within 1 hour. The emergency care system can only achieve this if sepsis is recognised and prioritised. Ambulance services can use prehospital early warning scores alongside paramedic diagnostic impression to prioritise patients for treatment or early assessment in the emergency department. Objectives To determine the accuracy, impact and cost-effectiveness of using early warning scores alongside paramedic diagnostic impression to identify sepsis requiring urgent treatment. Design Retrospective diagnostic cohort study and decision-analytic modelling of operational consequences and cost-effectiveness. Setting Two ambulance services and four acute hospitals in England. Participants Adults transported to hospital by emergency ambulance, excluding episodes with injury, mental health problems, cardiac arrest, direct transfer to specialist services, or no vital signs recorded. Interventions Twenty-one early warning scores used alongside paramedic diagnostic impression, categorised as sepsis, infection, non-specific presentation, or other specific presentation. Main outcome measures Proportion of cases prioritised at the four hospitals; diagnostic accuracy for the sepsis-3 definition of sepsis and receiving urgent treatment (primary reference standard); daily number of cases with and without sepsis prioritised at a large and a small hospital; the minimum treatment effect associated with prioritisation at which each strategy would be cost-effective, compared to no prioritisation, assuming willingness to pay £20,000 per quality-adjusted life-year gained. Results Data from 95,022 episodes involving 71,204 patients across four hospitals showed that most early warning scores operating at their pre-specified thresholds would prioritise more than 10% of cases when applied to non-specific attendances or all attendances. Data from 12,870 episodes at one hospital identified 348 (2.7%) with the primary reference standard. The National Early Warning Score, version 2 (NEWS2), had the highest area under the receiver operating characteristic curve when applied only to patients with a paramedic diagnostic impression of sepsis or infection (0.756, 95% confidence interval 0.729 to 0.783) or sepsis alone (0.655, 95% confidence interval 0.63 to 0.68). None of the strategies provided high sensitivity (> 0.8) with acceptable positive predictive value (> 0.15). NEWS2 provided combinations of sensitivity and specificity that were similar or superior to all other early warning scores. Applying NEWS2 to paramedic diagnostic impression of sepsis or infection with thresholds of > 4, > 6 and > 8 respectively provided sensitivities and positive predictive values (95% confidence interval) of 0.522 (0.469 to 0.574) and 0.216 (0.189 to 0.245), 0.447 (0.395 to 0.499) and 0.274 (0.239 to 0.313), and 0.314 (0.268 to 0.365) and 0.333 (confidence interval 0.284 to 0.386). The mortality relative risk reduction from prioritisation at which each strategy would be cost-effective exceeded 0.975 for all strategies analysed. Limitations We estimated accuracy using a sample of older patients at one hospital. Reliable evidence was not available to estimate the effectiveness of prioritisation in the decision-analytic modelling. Conclusions No strategy is ideal but using NEWS2, in patients with a paramedic diagnostic impression of infection or sepsis could identify one-third to half of sepsis cases without prioritising unmanageable numbers. No other score provided clearly superior accuracy to NEWS2. Research is needed to develop better definition, diagnosis and treatments for sepsis. Study registration This study is registered as Research Registry (reference: researchregistry5268). Funding This award was funded by the National Institute for Health and Care Research (NIHR) Health Technology Assessment programme (NIHR award ref: 17/136/10) and is published in full in Health Technology Assessment; Vol. 28, No. 16. See the NIHR Funding and Awards website for further award information.
Collapse
Affiliation(s)
- Steve Goodacre
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
- Emergency Department, Northern General Hospital, Sheffield, UK
| | - Laura Sutton
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Kate Ennis
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Ben Thomas
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Olivia Hawksworth
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | | | - Susan J Croft
- Emergency Department, Northern General Hospital, Sheffield, UK
| | - Gordon Fuller
- Emergency Department, Northern General Hospital, Sheffield, UK
| | - Simon Waterhouse
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Daniel Hind
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Matt Stevenson
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Mike J Bradburn
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| | - Michael Smyth
- Warwick Clinical Trials Unit, University of Warwick, Coventry, UK
| | - Gavin D Perkins
- Warwick Clinical Trials Unit, University of Warwick, Coventry, UK
| | - Mark Millins
- Yorkshire Ambulance Service NHS Trust, Wakefield, UK
| | - Andy Rosser
- West Midlands Ambulance Service University NHS Foundation Trust, Midlands, UK
| | - Jon Dickson
- Academic Unit of Primary Medical Care, University of Sheffield, Sheffield, UK
| | - Matthew Wilson
- School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, UK
| |
Collapse
|
10
|
Priou S, Lame G, Jankovic M, Kempf E. "In conferences, everyone goes 'health data is the future' ": an interview study on challenges in re-using EHR data for research in Clinical Data Warehouses. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2024; 2023:579-588. [PMID: 38222365 PMCID: PMC10785853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]
Abstract
More and more hospital Clinical Data Warehouses (CDWs) are developed to gain access to EHR data. The rapid growth of investments in CDWs suggest a real potential for innovation in healthcare. However, it is still not confirmed that CDWs will deliver on their promises as researchers working with CDWs face many challenges. To gain a better understanding of these challenges and how to overcome them, we conducted a series of semi-structured interviews with EHR data experts. In this article, we share some initial results from the ongoing interview study. Two main themes emerged from the analysis of the transcripts of the interviews: the importance of infrastructures in terms of data and how it is generated, and the difficulty to make care, clinical research, and data science work together. Finally, based on the experts' experience, several recommendations were identified when using a CDW.
Collapse
Affiliation(s)
- Sonia Priou
- Université Paris-Saclay, CentraleSupélec, Laboratoire Génie Industriel, France
| | - Guillaume Lame
- Université Paris-Saclay, CentraleSupélec, Laboratoire Génie Industriel, France
| | - Marija Jankovic
- Université Paris-Saclay, CentraleSupélec, Laboratoire Génie Industriel, France
| | - Emmanuelle Kempf
- Université Paris Est Créteil, AP-HP, Department of medical oncology, CHU Henri Mondor and Albert Chenevier, Créteil, France
- Sorbonne Université, Inserm, Universit́ Sorbonne Paris Nord, Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances pour la e-Santé, LIMICS, Paris, France
| |
Collapse
|
11
|
Little C, Elliot M, Allmendinger R. Federated learning for generating synthetic data: a scoping review. Int J Popul Data Sci 2023; 8:2158. [PMID: 38414544 PMCID: PMC10898505 DOI: 10.23889/ijpds.v8i1.2158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/29/2024] Open
Abstract
Introduction Federated Learning (FL) is a decentralised approach to training statistical models, where training is performed across multiple clients, producing one global model. Since the training data remains with each local client and is not shared or exchanged with other clients the use of FL may reduce privacy and security risks (compared to methods where multiple data sources are pooled) and can also address data access and heterogeneity problems. Synthetic data is artificially generated data that has the same structure and statistical properties as the original but that does not contain any of the original data records, therefore minimising disclosure risk. Using FL to produce synthetic data (which we refer to as "federated synthesis") has the potential to combine data from multiple clients without compromising privacy, allowing access to data that may otherwise be inaccessible in its raw format. Objectives The objective was to review current research and practices for using FL to generate synthetic data and determine the extent to which research has been undertaken, the methods and evaluation practices used, and any research gaps. Methods A scoping review was conducted to systematically map and describe the published literature on the use of FL to generate synthetic data. Relevant studies were identified through online databases and the findings are described, grouped, and summarised. Information extracted included article characteristics, documenting the type of data that is synthesised, the model architecture and the methods (if any) used to evaluate utility and privacy risk. Results A total of 69 articles were included in the scoping review; all were published between 2018 and 2023 with two thirds (46) in 2022. 30% (21) were focussed on synthetic data generation as the main model output (with 6 of these generating tabular data), whereas 59% (41) focussed on data augmentation. Of the 21 performing federated synthesis, all used deep learning methods (predominantly Generative Adversarial Networks) to generate the synthetic data. Conclusions Federated synthesis is in its early days but shows promise as a method that can construct a global synthetic dataset without sharing any of the local client data. As a field in its infancy there are areas to explore in terms of the privacy risk associated with the various methods proposed, and more generally in how we measure those risks.
Collapse
Affiliation(s)
- Claire Little
- Cathie Marsh Institute for Social Research, School of Social Sciences, University of Manchester, Oxford Road, M13 9PL, Manchester, UK
| | - Mark Elliot
- Department of Social Statistics, School of Social Sciences, University of Manchester, Oxford Road, M13 9PL, Manchester, UK
| | - Richard Allmendinger
- Alliance Manchester Business School, University of Manchester, Oxford Road, M13 9PL, Manchester, UK
| |
Collapse
|
12
|
Brophy R, Bellavia E, Bluemink MG, Evans K, Hashimi M, Macaulay Y, McNamara E, Noble A, Quattroni P, Rudczenko A, Morris AD, Smith C, Boyd A. Towards a standardised cross-sectoral data access agreement template for research: a core set of principles for data access within trusted research environments. Int J Popul Data Sci 2023; 8:2169. [PMID: 38419914 PMCID: PMC10900295 DOI: 10.23889/ijpds.v8i4.2169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2024] Open
Abstract
Introduction Trusted Research Environments (TREs) are secure computing environments that provide access to data for approved researchers to use in studies that can save and improve lives. TREs rely on Data Access Agreements (DAAs) to bind researchers and their organisations to the terms and conditions of accessing the infrastructure and data use. However, DAAs can be overly lengthy, complex, and can contain outdated terms from historical data sharing agreements for physical exchange of data. This is often cited as a cause of significant delays to legal review and research projects starting. Objectives The aim was to develop a standardised DAA optimised for data science in TREs across the UK and framed around the 'Five Safes framework' for trustworthy data use. The DAA is underpinned by principles of data access in TREs, the development of which is described in this paper. Methods The Pan-UK Data Governance Steering Group of the UK Health Data Research Alliance led the development of a core set of data access principles. This was informed by a benchmarking exercise of DAAs used by established TREs and consultation with public members and stakeholders. Results We have defined a core set of principles for TRE data access that can be mapped to a common set of DAA terms for UK-based TREs. Flexibility will be ensured by including terms specific to TREs or specific data/data owners in customisable annexes. Public views obtained through public involvement and engagement (PIE) activities are also reported. Conclusions These principles provide the foundation for a standardised UK TRE DAA template, designed to support the growing ecosystem of TREs. By providing a familiar structure and terms, this template aims to build trust among data owners and the UK public and to provide clarity to researchers on their obligations to protect the data. Widespread adoption is intended to accelerate health data research by enabling faster approval of projects, ultimately enabling more timely and effective research.
Collapse
Affiliation(s)
- Rachel Brophy
- Health Data Research UK, Gibbs Building, 215 Euston Road, London, NW1 2BE
| | - Ester Bellavia
- Health Data Research UK, Gibbs Building, 215 Euston Road, London, NW1 2BE
| | | | - Katharine Evans
- UK Longitudinal Linkage Collaboration, University of Bristol, Canynge Hall, Clifton, Bristol, BS8 2PS
| | | | - Yemi Macaulay
- Health Data Research UK, Gibbs Building, 215 Euston Road, London, NW1 2BE
| | - Edel McNamara
- Health Data Research UK, Gibbs Building, 215 Euston Road, London, NW1 2BE
| | - Allison Noble
- Research Data Scotland, Bayes Centre, 47 Potterrow, Edinburgh, EH8 9BT
| | - Paola Quattroni
- Health Data Research UK, Gibbs Building, 215 Euston Road, London, NW1 2BE
| | | | - Andrew D Morris
- Health Data Research UK, Gibbs Building, 215 Euston Road, London, NW1 2BE
| | - Cassie Smith
- Health Data Research UK, Gibbs Building, 215 Euston Road, London, NW1 2BE
| | - Andy Boyd
- Health Data Research UK, Gibbs Building, 215 Euston Road, London, NW1 2BE
- UK Longitudinal Linkage Collaboration, University of Bristol, Canynge Hall, Clifton, Bristol, BS8 2PS
| |
Collapse
|
13
|
Claridge H, Tan J, Loane M, Garne E, Barisic I, Cavero-Carbonell C, Dias C, Gatt M, Jordan S, Khoshnood B, Kiuru-Kuhlefelt S, Klungsoyr K, Mokoroa Carollo O, Nelen V, Neville AJ, Pierini A, Randrianaivo H, Rissmann A, Tucker D, de Walle H, Wertelecki W, Morris JK. Ethics and legal requirements for data linkage in 14 European countries for children with congenital anomalies. BMJ Open 2023; 13:e071687. [PMID: 37500278 PMCID: PMC10387628 DOI: 10.1136/bmjopen-2023-071687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 07/29/2023] Open
Abstract
INTRODUCTION Linking healthcare data sets can create valuable resources for research, particularly when investigating rare exposures or outcomes. However, across Europe, the permissions processes required to access data can be complex. This paper documents the processes required by the EUROlinkCAT study investigators to research the health and survival of children with congenital anomalies in Europe. METHODS Eighteen congenital anomaly registries in 14 countries provided information on all the permissions required to perform surveillance of congenital anomalies and to link their data on live births with available vital statistics and healthcare databases for research. Small number restrictions imposed by data providers were also documented. RESULTS The permissions requirements varied substantially, with certain registries able to conduct congenital anomaly surveillance as part of national or regional healthcare provision, while others were required to obtain ethics approvals or informed consent. Data linkage and analysis for research purposes added additional layers of complexity for registries, with some required to obtain several permissions, including ethics approvals to link the data. Restrictions relating to small numbers often resulted in a registry's data on specific congenital anomalies being unusable. CONCLUSION The permissions required to obtain and link data on children with congenital anomalies varied greatly across Europe. The variation and complexity present a significant obstacle to the use of such data, especially in large data linkage projects. Furthermore, small number restrictions severely limited the research that could be performed for children with specific rare congenital anomalies.
Collapse
Affiliation(s)
- Hugh Claridge
- School of Health Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford, UK
| | - Joachim Tan
- Population Health Research Institute, St George's, University of London, London, UK
| | - Maria Loane
- Faculty of Life and Health Sciences, Ulster University, Belfast, UK
| | - Ester Garne
- Department of Paediatrics and Adolescent Medicine, Lillebaelt Hospital, University Hospital of Southern Denmark, Kolding, Denmark
| | - Ingeborg Barisic
- Children's Hospital Zagreb, Centre of Excellence for Reproductive and Regenerative Medicine, Medical School University of Zagreb, Zagreb, Croatia
| | - Clara Cavero-Carbonell
- Rare Diseases Research Unit, Foundation for the Promotion of Health and Biomedical Research in the Valencian Region (FISABIO), Valencia, Spain
| | - Carlos Dias
- Epidemiology Department, National Registry of Congenital Anomalies, National Institute of Health Doctor Ricardo Jorge (Instituto Nacional de Saúde Doutor Ricardo Jorge), Lisbon, Portugal
| | - Miriam Gatt
- Malta Congenital Anomalies Registry, Directorate for Health Information and Research, Pieta, Malta
| | - Susan Jordan
- Faculty of Medicine, Health and Life Sciences, Swansea University, Swansea, UK
| | - Babak Khoshnood
- Obstetrical, Perinatal and Pediatric Epidemiology Research Team (EPOPé), Center of Research in Epidemiology and Statistics (CRESS), Institut National de la Santé et de la Recherche Médicale (INSERM), INRA, Université de Paris, Paris, France
| | | | - Kari Klungsoyr
- Department of Global Public Health and Primary Care, University of Bergen, Bergen, Norway
- Divison of Mental and Physical Health, Norwegian Institute of Public Health, Bergen, Norway
| | - Olatz Mokoroa Carollo
- Public Health Division of Gipuzkoa, BioDonostia Health Research Institute, San Sebastian, Spain
| | - Vera Nelen
- Provincial Institute for Hygiene, Antwerp, Belgium
| | - Amanda J Neville
- Registro IMER, University of Ferrara, Ferrara, Emilia-Romagna, Italy
| | - Anna Pierini
- Institute of Clinical Physiology, National Research Council, Pisa, Italy
| | - Hanitra Randrianaivo
- Register of Congenital Malformations, Centre Hospitalier Universitaire de La Réunion, Île de la Réunion, France
| | - Anke Rissmann
- Malformation Monitoring Centre Saxony-Anhalt, Medical Faculty, Otto-von-Guericke University, Magdeburg, Germany
| | - David Tucker
- Public Health Wales National Health Service Trust, Cardiff, UK
| | - Hermien de Walle
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | | | - Joan K Morris
- Population Health Research Institute, St George's, University of London, London, UK
| |
Collapse
|
14
|
Buckel M, Proudfoot AG. Time for a rethink in cardiogenic shock: the shock to survival framework document. Br J Hosp Med (Lond) 2023; 84:1-8. [PMID: 37490447 DOI: 10.12968/hmed.2023.0139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
Cardiogenic shock remains a time-critical, complex syndrome that continues to present challenges to clinicians and healthcare systems. Despite advances in the fields of cardiovascular and critical care medicine, mortality remains high. This article summarises the recent shock to survival document, which outlined the current and ideal future state of cardiogenic shock care nationally to improve patient outcomes. Shock to survival emphasises the need for education and training in the early recognition of the hypoperfusion that is pathognomomic of cardiogenic shock. Improved provision of focused cardiac ultrasound is essential to confirm a cardiac cause. Early identification of the patient with cardiogenic shock should be supported by access to defined pathways of care, including specialist shock centres and multiprofessional teams with domain expertise and the capability to manage the myriad of causative aetiologies. Given the absence of high-quality data to inform practice nationally, robust datasets are an unmet need to inform best practice, guide design of clinical services and pathways and drive innovation through research and clinical trials.
Collapse
Affiliation(s)
- Marie Buckel
- Pan-London Intensive Care Medicine Training Programme, London, UK
| | - Alastair G Proudfoot
- Perioperative Medicine Department, Barts Heart Centre, St Bartholomew's Hospital, London, UK
- Critical Care and Perioperative Medicine Group, Queen Mary University of London, London, UK
| |
Collapse
|
15
|
Cavallaro FL, Cannings-John R, Lugg-Widger F, Gilbert R, Kennedy E, Kendall S, Robling M, Harron KL. Lessons learned from using linked administrative data to evaluate the Family Nurse Partnership in England and Scotland. Int J Popul Data Sci 2023; 8:2113. [PMID: 37670953 PMCID: PMC10476150 DOI: 10.23889/ijpds.v8i1.2113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2023] Open
Abstract
Introduction "Big data" - including linked administrative data - can be exploited to evaluate interventions for maternal and child health, providing time- and cost-effective alternatives to randomised controlled trials. However, using these data to evaluate population-level interventions can be challenging. Objectives We aimed to inform future evaluations of complex interventions by describing sources of bias, lessons learned, and suggestions for improvements, based on two observational studies using linked administrative data from health, education and social care sectors to evaluate the Family Nurse Partnership (FNP) in England and Scotland. Methods We first considered how different sources of potential bias within the administrative data could affect results of the evaluations. We explored how each study design addressed these sources of bias using maternal confounders captured in the data. We then determined what additional information could be captured at each step of the complex intervention to enable analysts to minimise bias and maximise comparability between intervention and usual care groups, so that any observed differences can be attributed to the intervention. Results Lessons learned include the need for i) detailed data on intervention activity (dates/geography) and usual care; ii) improved information on data linkage quality to accurately characterise control groups; iii) more efficient provision of linked data to ensure timeliness of results; iv) better measurement of confounding characteristics affecting who is eligible, approached and enrolled. Conclusions Linked administrative data are a valuable resource for evaluations of the FNP national programme and other complex population-level interventions. However, information on local programme delivery and usual care are required to account for biases that characterise those who receive the intervention, and to inform understanding of mechanisms of effect. National, ongoing, robust evaluations of complex public health evaluations would be more achievable if programme implementation was integrated with improved national and local data collection, and robust quasi-experimental designs.
Collapse
Affiliation(s)
- Francesca L. Cavallaro
- UCL Great Ormond Street Institute of Child Health, London, UK
- The Health Foundation, 8 Salisbury Square, London, UK
| | - Rebecca Cannings-John
- Centre for Trials Research, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - Fiona Lugg-Widger
- Centre for Trials Research, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - Ruth Gilbert
- UCL Great Ormond Street Institute of Child Health, London, UK
| | - Eilis Kennedy
- Children, Young Adults and Families Directorate, Tavistock and Portman NHS Foundation Trust, London, UK
| | - Sally Kendall
- Centre for Health Services Studies, University of Kent, Canterbury, UK
| | - Michael Robling
- Centre for Trials Research, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - Katie L. Harron
- UCL Great Ormond Street Institute of Child Health, London, UK
| |
Collapse
|
16
|
Harron KL. Linking data to build a bigger picture of paediatric referral pathways. Arch Dis Child 2023; 108:245-246. [PMID: 36732036 DOI: 10.1136/archdischild-2022-324788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
- Katie L Harron
- University College London Great Ormond Street Institute of Child Health Library, London, UK
| |
Collapse
|
17
|
O’Sullivan KK, Wilde KJ. A profile of the Grampian Data Safe Haven, a regional Scottish safe haven for health and population data research. Int J Popul Data Sci 2023; 4:1817. [PMID: 37671386 PMCID: PMC10476148 DOI: 10.23889/ijpds.v4i2.1817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023] Open
Abstract
There has been a recent emphasis to establish and codify large-scale or national Trusted Research Environments (TREs) in the United Kingdom, with a view to limit smaller, local TREs. The basis for this argument is that it avoids duplication of infrastructure, information governance, privacy risks, monopolies and will promote innovation, particularly with commercial partners. However, the work around establishing TREs in the UK largely ignores the long-established local TRE landscape in Scotland, and the way in which local TREs can actually improve data quality, solve technical architecture challenges, promote information governance and risk minimisation, and encourage innovation and collaboration (both academic and commercial). This data centre profile focuses on the Grampian Data Safe Haven (DaSH), a secure, virtual healthcare data analysis and storage centre located in Aberdeen, Scotland. DaSH was co-established by the NHS Grampian Health Board and University of Aberdeen to allow for the secure processing and linking of health data for the Grampian and Scottish population when it is not practicable to obtain consent from individual patients. As an established trusted research environment now in its 10th operating year, DaSH technology ensures healthcare, social care data and other types of sensitive data, routinely collected and used without individual patient consent, are made accessible for both academic research and clinical service evaluation and improvements whilst protecting individuals' privacy at the local, national and international levels. DaSH has registered almost 600 projects and facilitated over 200 distinct research projects with data hosting, extraction, and novel linkages to completion. Ongoing innovation and collaboration between DaSH and the NHS Grampian Health Board continues to expand researcher access to new types of data and data linkages, introduce new technologies for advanced statistical research methods, and supports interdisciplinary research using population health and social care data for research, clinical and commercial advancements, and real-world practitioner applications. The purpose of this paper is to present DaSH's data population, operating model, architecture and information technology, governance, legislation and management, privacy-by-design principles and data access, data linkage methods, data sources, noteworthy research outputs, and further developments in order to demonstrate the value of local TREs within the data management and access debate.
Collapse
Affiliation(s)
| | - Katie J. Wilde
- Grampian DaSH, University of Aberdeen, Polwarth Building, Foresterhill, Aberdeen AB25 2ZD
| |
Collapse
|
18
|
Soneson E, Das S, Burn AM, van Melle M, Anderson JK, Fazel M, Fonagy P, Ford T, Gilbert R, Harron K, Howarth E, Humphrey A, Jones PB, Moore A. Leveraging Administrative Data to Better Understand and Address Child Maltreatment: A Scoping Review of Data Linkage Studies. CHILD MALTREATMENT 2023; 28:176-195. [PMID: 35240863 PMCID: PMC9806482 DOI: 10.1177/10775595221079308] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
BACKGROUND This scoping review aimed to overview studies that used administrative data linkage in the context of child maltreatment to improve our understanding of the value that data linkage may confer for policy, practice, and research. METHODS We searched MEDLINE, Embase, PsycINFO, CINAHL, and ERIC electronic databases in June 2019 and May 2020 for studies that linked two or more datasets (at least one of which was administrative in nature) to study child maltreatment. We report findings with numerical and narrative summary. RESULTS We included 121 studies, mainly from the United States or Australia and published in the past decade. Data came primarily from social services and health sectors, and linkage processes and data quality were often not described in sufficient detail to align with current reporting guidelines. Most studies were descriptive in nature and research questions addressed fell under eight themes: descriptive epidemiology, risk factors, outcomes, intergenerational transmission, predictive modelling, intervention/service evaluation, multi-sector involvement, and methodological considerations/advancements. CONCLUSIONS Included studies demonstrated the wide variety of ways in which data linkage can contribute to the public health response to child maltreatment. However, how research using linked data can be translated into effective service development and monitoring, or targeting of interventions, is underexplored in terms of privacy protection, ethics and governance, data quality, and evidence of effectiveness.
Collapse
Affiliation(s)
- Emma Soneson
- Department of Psychiatry, University of Cambridge, Cambridge, UK
| | - Shruti Das
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - Anne-Marie Burn
- Department of Psychiatry, University of Cambridge, Cambridge, UK
| | - Marije van Melle
- Department of Psychiatry, University of Cambridge, Cambridge, UK
| | | | - Mina Fazel
- Department of Psychiatry, Warneford Hospital, University of Oxford, Headington, Oxford, UK
| | - Peter Fonagy
- Research Department of Clinical, Educational and Health Psychology, University College London, London, UK
| | - Tamsin Ford
- Department of Psychiatry, University of Cambridge, Cambridge, UK
| | - Ruth Gilbert
- Great Ormond Street Institute of Child Health, University College London, London, UK
| | - Katie Harron
- Great Ormond Street Institute of Child Health, University College London, London, UK
| | - Emma Howarth
- School of Psychology, University of East London, London, UK
| | - Ayla Humphrey
- Department of Psychiatry, University of Cambridge, Cambridge, UK
| | - Peter B. Jones
- Department of Psychiatry, University of Cambridge, Cambridge, UK
| | - Anna Moore
- Department of Psychiatry, University of Cambridge, Cambridge, UK
| |
Collapse
|
19
|
Jefferson E, Cole C, Mumtaz S, Cox S, Giles TC, Adejumo S, Urwin E, Lea D, Macdonald C, Best J, Masood E, Milligan G, Johnston J, Horban S, Birced I, Hall C, Jackson AS, Collins C, Rising S, Dodsley C, Hampton J, Hadfield A, Santos R, Tarr S, Panagi V, Lavagna J, Jackson T, Chuter A, Beggs J, Martinez-Queipo M, Ward H, von Ziegenweidt J, Burns F, Martin J, Sebire N, Morris C, Bradley D, Baxter R, Ahonen-Bishopp A, Smith P, Shoemark A, Valdes AM, Ollivere B, Manisty C, Eyre D, Gallant S, Joy G, McAuley A, Connell D, Northstone K, Jeffery K, Di Angelantonio E, McMahon A, Walker M, Semple MG, Sims JM, Lawrence E, Davies B, Baillie JK, Tang M, Leeming G, Power L, Breeze T, Murray D, Orton C, Pierce I, Hall I, Ladhani S, Gillson N, Whitaker M, Shallcross L, Seymour D, Varma S, Reilly G, Morris A, Hopkins S, Sheikh A, Quinlan P. A Hybrid Architecture (CO-CONNECT) to Facilitate Rapid Discovery and Access to Data Across the United Kingdom in Response to the COVID-19 Pandemic: Development Study. J Med Internet Res 2022; 24:e40035. [PMID: 36322788 PMCID: PMC9822177 DOI: 10.2196/40035] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 10/12/2022] [Accepted: 11/01/2022] [Indexed: 11/05/2022] Open
Abstract
BACKGROUND COVID-19 data have been generated across the United Kingdom as a by-product of clinical care and public health provision, as well as numerous bespoke and repurposed research endeavors. Analysis of these data has underpinned the United Kingdom's response to the pandemic, and informed public health policies and clinical guidelines. However, these data are held by different organizations, and this fragmented landscape has presented challenges for public health agencies and researchers as they struggle to find relevant data to access and interrogate the data they need to inform the pandemic response at pace. OBJECTIVE We aimed to transform UK COVID-19 diagnostic data sets to be findable, accessible, interoperable, and reusable (FAIR). METHODS A federated infrastructure model (COVID - Curated and Open Analysis and Research Platform [CO-CONNECT]) was rapidly built to enable the automated and reproducible mapping of health data partners' pseudonymized data to the Observational Medical Outcomes Partnership Common Data Model without the need for any data to leave the data controllers' secure environments, and to support federated cohort discovery queries and meta-analysis. RESULTS A total of 56 data sets from 19 organizations are being connected to the federated network. The data include research cohorts and COVID-19 data collected through routine health care provision linked to longitudinal health care records and demographics. The infrastructure is live, supporting aggregate-level querying of data across the United Kingdom. CONCLUSIONS CO-CONNECT was developed by a multidisciplinary team. It enables rapid COVID-19 data discovery and instantaneous meta-analysis across data sources, and it is researching streamlined data extraction for use in a Trusted Research Environment for research and public health analysis. CO-CONNECT has the potential to make UK health data more interconnected and better able to answer national-level research questions while maintaining patient confidentiality and local governance procedures.
Collapse
Affiliation(s)
- Emily Jefferson
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Christian Cole
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Shahzad Mumtaz
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Samuel Cox
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | | | - Sam Adejumo
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Esmond Urwin
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Daniel Lea
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Calum Macdonald
- Usher Institute, University of Edinburgh, Edinburgh, United Kingdom
| | - Joseph Best
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
- Health Data Research UK, London, United Kingdom
| | - Erum Masood
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Gordon Milligan
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Jenny Johnston
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Scott Horban
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Ipek Birced
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Christopher Hall
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Aaron S Jackson
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Clare Collins
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Sam Rising
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Charlotte Dodsley
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Jill Hampton
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Andrew Hadfield
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Roberto Santos
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Simon Tarr
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Vasiliki Panagi
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Joseph Lavagna
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
| | - Tracy Jackson
- Usher Institute, University of Edinburgh, Edinburgh, United Kingdom
| | - Antony Chuter
- Lay Partnership in Healthcare Research, Lindfield, United Kingdom
| | - Jillian Beggs
- Health Informatics Centre, Division of Population and Health Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | | | - Helen Ward
- School of Public Health, Imperial College London, London, United Kingdom
| | - Julie von Ziegenweidt
- Department of Haemotology, University of Cambridge, Cambridge, United Kingdom
- National Institute for Healthcare Research BioResource, Cambridge University Hospitals NHS Foundation, Cambridge Biomedical Campus, Cambridge, United Kingdom
| | - Frances Burns
- Centre for Public Health, Belfast Institute of Clinical Science, Queens University Belfast, Belfast, United Kingdom
| | - Joanne Martin
- Blizard Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
| | - Neil Sebire
- Institute of Child Health, Great Ormond Street Hospital, London, United Kingdom
| | | | - Declan Bradley
- Centre for Public Health, Institute of Clinical Science, Queen's University Belfast, Belfast, United Kingdom
- Public Health Agency, Belfast, United Kingdom
| | - Rob Baxter
- EPCC, University of Edinburgh, Edinburgh, United Kingdom
| | | | | | - Amelia Shoemark
- Molecular and Clinical Medicine, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Ana M Valdes
- School of Medicine, University of Nottingham, Nottingham, United Kingdom
| | - Benjamin Ollivere
- School of Medicine, University of Nottingham, Nottingham, United Kingdom
| | - Charlotte Manisty
- Institute of Cardiovascular Sciences, University of College London, London, United Kingdom
| | - David Eyre
- Big Data Institute, University of Oxford, Oxford, United Kingdom
| | - Stephanie Gallant
- Molecular and Clinical Medicine, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - George Joy
- Barts Heart Centre, London, United Kingdom
| | - Andrew McAuley
- Clinical and Protecting Health Directorate, Public Health Scotland, Glasgow, United Kingdom
| | - David Connell
- School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Kate Northstone
- Population Health Sciences, Avon Longitudinal Study of Parents and Children, Bristol, United Kingdom
| | - Katie Jeffery
- Radcliffe Department of Medicine, Oxford University, Oxford, United Kingdom
- Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, United Kingdom
| | - Emanuele Di Angelantonio
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
- British Heart Foundation Centre of Research Excellence, University of Cambridge, Cambridge, United Kingdom
- National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Behaviour, University of Cambridge, Cambridge, United Kingdom
- Health Data Research UK Cambridge, Wellcome Genome Campus, University of Cambridge, Cambridge, United Kingdom
- Health Data Science Research Centre, Human Technopole, Milan, Italy
| | - Amy McMahon
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
- National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Behaviour, University of Cambridge, Cambridge, United Kingdom
| | - Mat Walker
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
- National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Behaviour, University of Cambridge, Cambridge, United Kingdom
| | - Malcolm Gracie Semple
- Health Protection Research Unit in Emerging and Zoonotic Infections, Institute of Infections, University of Liverpool, Liverpool, United Kingdom
- Respiratory Department, Alder Hey Children's Hospital, Liverpool, United Kingdom
| | | | | | - Bethan Davies
- School of Public Health, Imperial College London, London, United Kingdom
| | - John Kenneth Baillie
- Outbreak Data Analysis Platform, University of Edinburgh, Edinburgh, United Kingdom
| | - Ming Tang
- NHS England, Worcestershire, United Kingdom
| | - Gary Leeming
- Civic Data Cooperative, Digital Innovation Facility, University of Liverpool, Liverpool, United Kingdom
| | - Linda Power
- Public Health England, London, United Kingdom
| | - Thomas Breeze
- Avon Longitudinal Study of Parents and Children, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Duncan Murray
- University of Birmingham, Birmingham, United Kingdom
- University Hospital Coventry & Warwickshire NHS Trust, Coventry, United Kingdom
| | - Chris Orton
- Population Data Science, Swansea University Medical School, Swansea, United Kingdom
| | - Iain Pierce
- Barts Heart Centre, St Bartholomew's Hospital, Barts Health NHS Trust, London, United Kingdom
- Institute of Cardiovascular Science, University College London, London, United Kingdom
| | - Ian Hall
- Nottingham Biomedical Research Centre, School of Medicine, University of Nottingham, Nottingham, United Kingdom
| | - Shamez Ladhani
- Immunisation and Countermeasures Division, Public Health England Colindale, London, United Kingdom
| | | | - Matthew Whitaker
- School of Public Health, Imperial College London, London, United Kingdom
| | | | | | | | | | | | | | - Aziz Sheikh
- Centre for Population Health Sciences, Usher Institute, University of Edinburgh, Edinburgh, United Kingdom
| | - Philip Quinlan
- Digital Research Service, University of Nottingham, Nottingham, United Kingdom
- School of Medicine, University of Nottingham, Nottingham, United Kingdom
| |
Collapse
|
20
|
Moorthie S, Hayat S, Zhang Y, Parkin K, Philips V, Bale A, Duschinsky R, Ford T, Moore A. Rapid systematic review to identify key barriers to access, linkage, and use of local authority administrative data for population health research, practice, and policy in the United Kingdom. BMC Public Health 2022; 22:1263. [PMID: 35764951 PMCID: PMC9241330 DOI: 10.1186/s12889-022-13187-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 03/31/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Improving data access, sharing, and linkage across local authorities and other agencies can contribute to improvements in population health. Whilst progress is being made to achieve linkage and integration of health and social care data, issues still exist in creating such a system. As part of wider work to create the Cambridge Child Health Informatics and Linked Data (Cam-CHILD) database, we wanted to examine barriers to the access, linkage, and use of local authority data. METHODS A systematic literature search was conducted of scientific databases and the grey literature. Any publications reporting original research related to barriers or enablers of data linkage of or with local authority data in the United Kingdom were included. Barriers relating to the following issues were extracted from each paper: funding, fragmentation, legal and ethical frameworks, cultural issues, geographical boundaries, technical capability, capacity, data quality, security, and patient and public trust. RESULTS Twenty eight articles were identified for inclusion in this review. Issues relating to technical capacity and data quality were cited most often. This was followed by those relating to legal and ethical frameworks. Issue relating to public and patient trust were cited the least, however, there is considerable overlap between this topic and issues relating to legal and ethical frameworks. CONCLUSIONS This rapid review is the first step to an in-depth exploration of the barriers to data access, linkage and use; a better understanding of which can aid in creating and implementing effective solutions. These barriers are not novel although they pose specific challenges in the context of local authority data.
Collapse
Affiliation(s)
- Sowmiya Moorthie
- Cambridge Public Health, Interdisciplinary Research Centre, Forvie Site, Cambridge Biomedical Campus, Cambridge, UK.
- PHG Foundation, 2 Worts Causeway, University of Cambridge, Cambridge, UK.
| | - Shabina Hayat
- Cambridge Public Health, Interdisciplinary Research Centre, Forvie Site, Cambridge Biomedical Campus, Cambridge, UK
| | - Yi Zhang
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Katherine Parkin
- Cambridge Public Health, Interdisciplinary Research Centre, Forvie Site, Cambridge Biomedical Campus, Cambridge, UK
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | | | - Amber Bale
- Department of Psychology, University of Northumbria, Newcastle upon Tyne, UK
| | - Robbie Duschinsky
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Tamsin Ford
- Department of Psychiatry, University of Cambridge, Herschel Smith Building, Robinson Way, Cambridge, UK
| | - Anna Moore
- Department of Psychiatry, University of Cambridge, Herschel Smith Building, Robinson Way, Cambridge, UK
- Anna Freud National Centre for Children and Families, London, UK
- Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, Peterborough, UK
| |
Collapse
|
21
|
Kokosi T, De Stavola B, Mitra R, Frayling L, Doherty A, Dove I, Sonnenberg P, Harron K. An overview of synthetic administrative data for research. Int J Popul Data Sci 2022; 7:1727. [PMID: 37650026 PMCID: PMC10464868 DOI: 10.23889/ijpds.v7i1.1727] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
Abstract
Use of administrative data for research and for planning services has increased over recent decades due to the value of the large, rich information available. However, concerns about the release of sensitive or personal data and the associated disclosure risk can lead to lengthy approval processes and restricted data access. This can delay or prevent the production of timely evidence. A promising solution to facilitate more efficient data access is to create synthetic versions of the original datasets which are less likely to hold confidential information and can minimise disclosure risk. Such data may be used as an interim solution, allowing researchers to develop their analysis plans on non-disclosive data, whilst waiting for access to the real data. We aim to provide an overview of the background and uses of synthetic data and describe common methods used to generate synthetic data in the context of UK administrative research. We propose a simplified terminology for categories of synthetic data (univariate, multivariate, and complex modality synthetic data) as well as a more comprehensive description of the terminology used in the existing literature and illustrate challenges and future directions for research.
Collapse
Affiliation(s)
- Theodora Kokosi
- Department of Population, Policy and Practice, UCL Great Ormond Street Institute of Child Health, University College London, London, UK
| | - Bianca De Stavola
- Department of Population, Policy and Practice, UCL Great Ormond Street Institute of Child Health, University College London, London, UK
| | - Robin Mitra
- School of Mathematics, Cardiff University, Cardiff UK
| | | | - Aiden Doherty
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
- Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Iain Dove
- Office for National Statistics, Titchfield, UK
| | - Pam Sonnenberg
- Department of Infection & Population Health, Institute for Global Health, University College London, London, UK
| | - Katie Harron
- Department of Population, Policy and Practice, UCL Great Ormond Street Institute of Child Health, University College London, London, UK
| |
Collapse
|
22
|
Espuny Pujol F, Pagel C, Brown KL, Doidge JC, Feltbower RG, Franklin RC, Gonzalez-Izquierdo A, Gould DW, Norman LJ, Stickley J, Taylor JA, Crowe S. Linkage of National Congenital Heart Disease Audit data to hospital, critical care and mortality national data sets to enable research focused on quality improvement. BMJ Open 2022; 12:e057343. [PMID: 35589356 PMCID: PMC9121475 DOI: 10.1136/bmjopen-2021-057343] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
OBJECTIVES To link five national data sets (three registries, two administrative) and create longitudinal healthcare trajectories for patients with congenital heart disease (CHD), describing the quality and the summary statistics of the linked data set. DESIGN Bespoke linkage of record-level patient identifiers across five national data sets. Generation of spells of care defined as periods of time-overlapping events across the data sets. SETTING National Congenital Heart Disease Audit (NCHDA) procedures in public (National Health Service; NHS) hospitals in England and Wales, paediatric and adult intensive care data sets (Paediatric Intensive Care Audit Network; PICANet and the Case Mix Programme from the Intensive Care National Audit & Research Centre; ICNARC-CMP), administrative hospital episodes (hospital episode statistics; HES inpatient, outpatient, accident and emergency; A&E) and mortality registry data. PARTICIPANTS Patients with any CHD procedure recorded in NCHDA between April 2000 and March 2017 from public hospitals. PRIMARY AND SECONDARY OUTCOME MEASURES Primary: number of linked records, number of unique patients and number of generated spells of care. Secondary: quality and completeness of linkage. RESULTS There were 143 862 records in NCHDA relating to 96 041 unique patients. We identified 65 797 linked PICANet patient admissions, 4664 linked ICNARC-CMP admissions and over 6 million linked HES episodes of care (1.1M inpatient, 4.7M outpatient). The linked data set had 4 908 153 spells of care after quality checks, with a median (IQR) of 3.4 (1.8-6.3) spells per patient-year. Where linkage was feasible (in terms of year and centre), 95.6% surgical procedure records were linked to a corresponding HES record, 93.9% paediatric (cardiac) surgery procedure records to a corresponding PICANet admission and 76.8% adult surgery procedure records to a corresponding ICNARC-CMP record. CONCLUSIONS We successfully linked four national data sets to the core data set of all CHD procedures performed between 2000 and 2017. This will enable a much richer analysis of longitudinal patient journeys and outcomes. We hope that our detailed description of the linkage process will be useful to others looking to link national data sets to address important research priorities.
Collapse
Affiliation(s)
- Ferran Espuny Pujol
- Clinical Operational Research Unit, Department of Mathematics, University College London, London, UK
| | - Christina Pagel
- Clinical Operational Research Unit, Department of Mathematics, University College London, London, UK
| | - Katherine L Brown
- Cardiorespiratory Division, NIHR Great Ormond Street Hospital Biomedical Research Centre, London, UK
| | - James C Doidge
- Intensive Care National Audit and Research Centre, London, UK
| | - Richard G Feltbower
- Leeds Institute for Data Analytics, School of Medicine, University of Leeds, Leeds, UK
| | - Rodney C Franklin
- Department of Paediatric Cardiology, Royal Brompton & Harefield NHS Foundation Trust, London, UK
| | - Arturo Gonzalez-Izquierdo
- Institute of Health Informatics, University College London, London, UK
- Health Data Research UK, London, UK
| | - Doug W Gould
- Intensive Care National Audit and Research Centre, London, UK
| | - Lee J Norman
- Leeds Institute for Data Analytics, School of Medicine, University of Leeds, Leeds, UK
| | - John Stickley
- Department of Paediatric Cardiac Surgery, Birmingham Children's Hospital, Birmingham, UK
| | - Julie A Taylor
- Clinical Operational Research Unit, Department of Mathematics, University College London, London, UK
| | - Sonya Crowe
- Clinical Operational Research Unit, Department of Mathematics, University College London, London, UK
| |
Collapse
|
23
|
Grath-Lone LM, Jay MA, Blackburn R, Gordon E, Zylbersztejn A, Wijlaars L, Gilbert R. What makes administrative data "research-ready"? A systematic review and thematic analysis of published literature. Int J Popul Data Sci 2022; 7:1718. [PMID: 35520099 PMCID: PMC9052961 DOI: 10.23889/ijpds.v6i1.1718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Introduction Administrative data are a valuable research resource, but are under-utilised in the UK due to governance, technical and other barriers (e.g., the time and effort taken to gain secure data access). In recent years, there has been considerable government investment in making administrative data "research-ready", but there is no definition of what this term means. A common understanding of what constitutes research-ready administrative data is needed to establish clear principles and frameworks for their development and the realisation of their full research potential. Objective To define the characteristics of research-ready administrative data based on a systematic review and synthesis of existing literature. Methods On 29th June 2021, we systematically searched seven electronic databases for (1) peer-reviewed literature (2) related to research-ready administrative data (3) written in the English language. Following supplementary searches and snowball screening, we conducted a thematic analysis of the identified relevant literature. Results Overall, we screened 2,375 records and identified 38 relevant studies published between 2012 and 2021. Most related to administrative data from the UK and US and particularly to health data. The term research-ready was used inconsistently in the literature and there was some conflation with the concept of data being ready for statistical analysis. From the thematic analysis, we identified five defining characteristics of research-ready administrative data: (a) accessible, (b) broad, (c) curated, (d) documented and (e) enhanced for research purposes. Conclusions Our proposed characteristics of research-ready administrative data could act as a starting point to help data owners and researchers develop common principles and standards. In the more immediate term, the proposed characteristics are a useful framework for cataloguing existing research-ready administrative databases and relevant resources that can support their development.
Collapse
Affiliation(s)
| | - Matthew A. Jay
- Population, Policy and Practice Research and Teaching Department, Great Ormond Street Institute of Child Health, University College London, UK
| | - Ruth Blackburn
- Institute of Health Informatics, University College London, UK
| | - Emma Gordon
- Administrative Data Research UK, Economic & Social Research Council, UK
| | - Ania Zylbersztejn
- Population, Policy and Practice Research and Teaching Department, Great Ormond Street Institute of Child Health, University College London, UK
| | - Linda Wijlaars
- Population, Policy and Practice Research and Teaching Department, Great Ormond Street Institute of Child Health, University College London, UK
| | - Ruth Gilbert
- Institute of Health Informatics, University College London, UK
- Population, Policy and Practice Research and Teaching Department, Great Ormond Street Institute of Child Health, University College London, UK
| |
Collapse
|
24
|
Wray N, Miller K, Irvine K, Moore E, Crisp A, Bapaume K, Taylor C, Smetak R, Wiggins N, Dombrovskaya M, Flack F. Development and implementation of a national online application system for cross-jurisdictional linked data. Int J Popul Data Sci 2022; 7:1732. [PMID: 35520098 PMCID: PMC9052959 DOI: 10.23889/ijpds.v6i1.1732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023] Open
Abstract
The Population Health Research Network (PHRN) is an Australian national data linkage infrastructure that links a wide range of health and human services data in privacy-preserving ways. The data linkage infrastructure enables researchers to apply for access to routinely collected, linked, administrative data from the six states and two territories which make up the Commonwealth of Australia, as well as data collected by the Australian Government. The PHRN is a distributed network where data is collected and managed at the respective jurisdictional and/or cross-jurisdictional levels. As a result, access to linked data from multiple jurisdictions requires complex approval processes. This paper describes Australia's approach to enabling access to linked data from multiple jurisdictions. It covers the identification of, and agreement to, a minimum set of data items to be included in a unified national application form, the development and implementation of a national online application system and the harmonisation of business processes for cross-jurisdictional research projects. Utilisation of the online application system and the ongoing challenges of data linkage across jurisdictions are discussed. Changes to the data custodian and ethics committee approval criteria were out of scope for this project.
Collapse
Affiliation(s)
- Natalie Wray
- Population Health Research Network, University of Western Australia, Perth 6009, Australia
| | - Kate Miller
- Telethon Kids Institute, Perth 6009, Australia
| | | | | | - Alice Crisp
- Australian Institute of Health and Welfare, Canberra 2601, Australia
| | | | | | - Rob Smetak
- SA NT DataLink, University of South Australia, Adelaide 5000, Australia
| | - Nadine Wiggins
- Menzies Institute for Medical Research, Hobart 7000, Australia
| | - Mikhalina Dombrovskaya
- Population Health Research Network, University of Western Australia, Perth 6009, Australia
| | - Felicity Flack
- Population Health Research Network, University of Western Australia, Perth 6009, Australia
| |
Collapse
|
25
|
Ruiz Nishiki M, Cabecinha M, Knowles R, Peters C, Aitkenhead H, Ifederu A, Schoenmakers N, Sebire NJ, Walker E, Hardelid P. Establishing risk factors and outcomes for congenital hypothyroidism with gland in situ using population-based data linkage methods: study protocol. BMJ Paediatr Open 2022; 6:e001341. [PMID: 36053651 PMCID: PMC8969044 DOI: 10.1136/bmjpo-2021-001341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 03/05/2022] [Indexed: 11/29/2022] Open
Abstract
INTRODUCTION There has been an increase in the birth prevalence of congenital hypothyroidism (CH) since the introduction of newborn screening, both globally and in the UK. This increase can be accounted for by an increase in CH with gland in situ (CH-GIS). It is not known why CH-GIS is becoming more common, nor how it affects the health, development and learning of children over the long term. Our study will use linked administrative health, education and clinical data to determine risk factors for CH-GIS and describe long-term health and education outcomes for affected children. METHODS AND ANALYSIS We will construct a birth cohort study based on linked, administrative data to determine what factors have contributed to the increase in the birth prevalence of CH-GIS in the UK. We will also set up a follow-up study of cases and controls to determine the health and education outcomes of children with and without CH-GIS. We will use logistic/multinomial regression models to establish risk factors for CH-GIS. Changes in the prevalence of risk factors over time will help to explain the increase in birth prevalence of CH-GIS. Multivariable generalised linear models or Cox proportional hazards regression models will be used to assess the association between type of CH and school performance or health outcomes. ETHICS AND DISSEMINATION This study has been approved by the London Queen Square Research Ethics Committee and the Health Research Authority's Confidentiality Advisory Group CAG. Approvals are also being sought from each data provider. Obtaining approvals from CAG, data providers and information governance bodies have caused considerable delays to the project. Our methods and findings will be published in peer-reviewed journals and presented at academic conferences.
Collapse
Affiliation(s)
- Milagros Ruiz Nishiki
- UCL Great Ormond Street Institute of Child Health Population Policy and Practice, London, UK
| | - Melissa Cabecinha
- Institute of Child Health, UCL, London, UK
- Research Department of Primary Care and Population Health, UCL, London, UK
| | - Rachel Knowles
- Life Course Epidemiology and Biostatistics, University College London, London, UK
| | - Catherine Peters
- Endocrinology, Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
| | - Helen Aitkenhead
- Department of Chemical Pathology, Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
| | - Adeboye Ifederu
- Department of Chemical Pathology, Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
| | - Nadia Schoenmakers
- University of Cambridge Metabolic Research Laboratories, Wellcome Trust-Medical Research Council Institute of Metabolic Science, Addenbrooke's Hospital, Cambridge, UK
| | - Neil J Sebire
- Paediatric Pathology, Great Ormond Street Hospital for Children, London, UK
| | | | - Pia Hardelid
- UCL Great Ormond Street Institute of Child Health Population Policy and Practice, London, UK
| |
Collapse
|
26
|
Lugg-Widger F, Munnery K, Townson J, Trubey R, Robling M. Identifying researcher learning needs to develop online training for UK researchers working with administrative data: CENTRIC training. Int J Popul Data Sci 2022; 7:1712. [PMID: 35310556 PMCID: PMC8900594 DOI: 10.23889/ijpds.v6i1.1712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND The use of administrative data in health and social science research continues to expand, with increased availability of data and interest from funders. Researchers, however, continue to experience delays in access, storage and sharing of administrative data. Training opportunities are limited and typically specific to individual data providers or focussed on the analytical aspects of working with administrative data. The CENTRIC study was funded by the Information Commissioners Office, with the aim of developing a broader training curriculum for researchers working with administrative data in the UK. METHODS A mixed-methods design informed curriculum content, including surveys with researchers, focus group discussions with data providers and workshops with members of the public. Researchers were identified from relevant administrative data networks and invited to participate in an online survey identifying training needs. Data providers were approached with a request to input to a face-to-face or online meeting with two members of the research team about their experiences of working with researchers. Data were analysed within the broad framework of the interview schedule, free text responses in the survey were analysed thematically. RESULTS 107 researchers responded to the online survey and four data providers participated in the focus groups. We identified five main themes, relating to research training needs for UK researchers working with administrative data: communication; timelines; changes & amendments; future-proofing applications; and, the availability of training and support. Data providers either provided additional evidence on these learning needs or ways to address identified challenges. Six modules were developed addressing these training needs. Quotes from the survey and focus groups are used anonymously in the online training modules. CONCLUSION The CENTRIC online training curriculum was launched in September 2020 and is available, free of charge for UK researchers. CENTRIC specifically addresses commonly identified training needs of researchers working with administrative data.
Collapse
Affiliation(s)
| | - Kim Munnery
- Centre for Trials Research, Cardiff University, Cardiff, CF14 4YS
| | - Julia Townson
- Centre for Trials Research, Cardiff University, Cardiff, CF14 4YS
| | - Rob Trubey
- Centre for Trials Research, Cardiff University, Cardiff, CF14 4YS
| | - Michael Robling
- Centre for Trials Research, Cardiff University, Cardiff, CF14 4YS,DECIPHer - Centre for Development, Evaluation, Complexity and Implementation in Public Health Improvement, 1-3 Museum Place, Cardiff. CF10 3BD
| |
Collapse
|
27
|
Harron K. Data linkage in medical research. BMJ MEDICINE 2022; 1:e000087. [PMID: 36936588 PMCID: PMC9951373 DOI: 10.1136/bmjmed-2021-000087] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Accepted: 01/13/2022] [Indexed: 11/03/2022]
Affiliation(s)
- Katie Harron
- UCL Great Ormond Street Institute of Child Health Population Policy and Practice, London, UK
| |
Collapse
|