1
|
Syed R, Eden R, Makasi T, Chukwudi I, Mamudu A, Kamalpour M, Kapugama Geeganage D, Sadeghianasl S, Leemans SJJ, Goel K, Andrews R, Wynn MT, Ter Hofstede A, Myers T. Digital Health Data Quality Issues: Systematic Review. J Med Internet Res 2023; 25:e42615. [PMID: 37000497 PMCID: PMC10131725 DOI: 10.2196/42615] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 12/07/2022] [Accepted: 12/31/2022] [Indexed: 04/01/2023] Open
Abstract
BACKGROUND The promise of digital health is principally dependent on the ability to electronically capture data that can be analyzed to improve decision-making. However, the ability to effectively harness data has proven elusive, largely because of the quality of the data captured. Despite the importance of data quality (DQ), an agreed-upon DQ taxonomy evades literature. When consolidated frameworks are developed, the dimensions are often fragmented, without consideration of the interrelationships among the dimensions or their resultant impact. OBJECTIVE The aim of this study was to develop a consolidated digital health DQ dimension and outcome (DQ-DO) framework to provide insights into 3 research questions: What are the dimensions of digital health DQ? How are the dimensions of digital health DQ related? and What are the impacts of digital health DQ? METHODS Following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, a developmental systematic literature review was conducted of peer-reviewed literature focusing on digital health DQ in predominately hospital settings. A total of 227 relevant articles were retrieved and inductively analyzed to identify digital health DQ dimensions and outcomes. The inductive analysis was performed through open coding, constant comparison, and card sorting with subject matter experts to identify digital health DQ dimensions and digital health DQ outcomes. Subsequently, a computer-assisted analysis was performed and verified by DQ experts to identify the interrelationships among the DQ dimensions and relationships between DQ dimensions and outcomes. The analysis resulted in the development of the DQ-DO framework. RESULTS The digital health DQ-DO framework consists of 6 dimensions of DQ, namely accessibility, accuracy, completeness, consistency, contextual validity, and currency; interrelationships among the dimensions of digital health DQ, with consistency being the most influential dimension impacting all other digital health DQ dimensions; 5 digital health DQ outcomes, namely clinical, clinician, research-related, business process, and organizational outcomes; and relationships between the digital health DQ dimensions and DQ outcomes, with the consistency and accessibility dimensions impacting all DQ outcomes. CONCLUSIONS The DQ-DO framework developed in this study demonstrates the complexity of digital health DQ and the necessity for reducing digital health DQ issues. The framework further provides health care executives with holistic insights into DQ issues and resultant outcomes, which can help them prioritize which DQ-related problems to tackle first.
Collapse
Affiliation(s)
- Rehan Syed
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Rebekah Eden
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Tendai Makasi
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Ignatius Chukwudi
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Azumah Mamudu
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Mostafa Kamalpour
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Dakshi Kapugama Geeganage
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Sareh Sadeghianasl
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Sander J J Leemans
- Rheinisch-Westfälische Technische Hochschule, Aachen University, Aachen, Germany
| | - Kanika Goel
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Robert Andrews
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Moe Thandar Wynn
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Arthur Ter Hofstede
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Trina Myers
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| |
Collapse
|
2
|
Petch J, Kempainnen J, Pettengell C, Aviv S, Butler B, Pond G, Saha A, Bogach J, Allard-Coutu A, Sztur P, Ranisau J, Levine M. Developing a Data and Analytics Platform to Enable a Breast Cancer Learning Health System at a Regional Cancer Center. JCO Clin Cancer Inform 2023; 7:e2200182. [PMID: 37001040 PMCID: PMC10281330 DOI: 10.1200/cci.22.00182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 02/10/2023] [Indexed: 04/03/2023] Open
Abstract
PURPOSE This study documents the creation of automated, longitudinal, and prospective data and analytics platform for breast cancer at a regional cancer center. This platform combines principles of data warehousing with natural language processing (NLP) to provide the integrated, timely, meaningful, high-quality, and actionable data required to establish a learning health system. METHODS Data from six hospital information systems and one external data source were integrated on a nightly basis by automated extract/transform/load jobs. Free-text clinical documentation was processed using a commercial NLP engine. RESULTS The platform contains 141 data elements of 7,019 patients with newly diagnosed breast cancer who received care at our regional cancer center from January 1, 2014, to June 3, 2022. Daily updating of the database takes an average of 56 minutes. Evaluation of the tuning of NLP jobs found overall high performance, with an F1 of 1.0 for 19 variables, with a further 16 variables with an F1 of > 0.95. CONCLUSION This study describes how data warehousing combined with NLP can be used to create a prospective data and analytics platform to enable a learning health system. Although upfront time investment required to create the platform was considerable, now that it has been developed, daily data processing is completed automatically in less than an hour.
Collapse
Affiliation(s)
- Jeremy Petch
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, Canada
- Institute for Health Policy Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
- Division of Cardiology, Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada
- Population Health Research Institute, Hamilton Health Sciences, Hamilton, Canada
| | - Joel Kempainnen
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, Canada
| | | | | | | | - Greg Pond
- Escarpment Cancer Research Institute, Hamilton Health Sciences, Hamilton, Canada
| | - Ashirbani Saha
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, Canada
- Escarpment Cancer Research Institute, Hamilton Health Sciences, Hamilton, Canada
- Department of Oncology, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - Jessica Bogach
- Department of Surgery, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | | | - Peter Sztur
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, Canada
| | - Jonathan Ranisau
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, Canada
| | - Mark Levine
- Hamilton Health Sciences, Hamilton, Canada
- Escarpment Cancer Research Institute, Hamilton Health Sciences, Hamilton, Canada
| |
Collapse
|
3
|
Elgarten CW, Thompson JC, Angiolillo A, Chen Z, Conway S, Devidas M, Gupta S, Kairalla JA, McNeer JL, O’Brien MM, Rabin KR, Rau RE, Rheingold SR, Wang C, Wood C, Raetz EA, Loh ML, Alexander S, Miller TP. Improving infectious adverse event reporting for children and adolescents enrolled in clinical trials for acute lymphoblastic leukemia: A report from the Children's Oncology Group. Pediatr Blood Cancer 2022; 69:e29937. [PMID: 36083863 PMCID: PMC9529813 DOI: 10.1002/pbc.29937] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 07/28/2022] [Accepted: 07/30/2022] [Indexed: 11/08/2022]
Abstract
Infections cause substantial morbidity for children with acute lymphoblastic leukemia (ALL). Therefore, accurate characterization of infectious adverse events (AEs) reported on clinical trials is imperative to defining, comparing, and managing safety and toxicity. Here, we describe key processes implemented to improve reporting of infectious AEs on two active phase III Children's Oncology Group (COG) ALL trials. Processes include: (a) identifying infections as a targeted toxicity, (b) incorporation of infection-specific case report form questions, and (c) physician review of AEs with real-time data cleaning. Preliminary assessment of these processes suggests improved reporting, as well as opportunities for further improvement.
Collapse
Affiliation(s)
- Caitlin W. Elgarten
- Children’s Hospital of Philadelphia, Department of Pediatrics, Division of Oncology, Philadelphia, PA
| | - Joel C. Thompson
- Children’s Mercy Hospital, Department of Pediatrics, Division of Hematology/Oncology/Bone Marrow Transplant, University of Missouri-Kansas City, Kansas City, MO
| | - Anne Angiolillo
- Children’s National Medical Center, Center for Cancer and Blood Disorders, Washington DC
| | - Zhiguo Chen
- University of Florida, Department of Biostatistics, Gainesville, FL
| | - Susan Conway
- University of Florida, Department of Biostatistics, Gainesville, FL
| | | | - Sumit Gupta
- Department of Hematology/Oncology, Hospital for Sick Children, Toronto, ON
| | - John A. Kairalla
- University of Florida, Department of Biostatistics, Gainesville, FL
| | | | - Maureen M. O’Brien
- University of Cincinnati College of Medicine, Cincinnati Children’s Hospital Medical Center, Pediatric Hematology/Oncology, Cincinnati, OH
| | - Karen R. Rabin
- Baylor College of Medicine, Pediatric Hematology/Oncology, Houston, TX
| | - Rachel E. Rau
- Baylor College of Medicine, Pediatric Hematology/Oncology, Houston, TX
| | - Susan R. Rheingold
- Children’s Hospital of Philadelphia, Department of Pediatrics, Division of Oncology, Philadelphia, PA
| | - Cindy Wang
- University of Florida, Department of Biostatistics, Gainesville, FL
| | - Charlotte Wood
- University of Florida, Department of Biostatistics, Gainesville, FL
| | | | - Mignon L. Loh
- Division of Hematology, Oncology, Bone Marrow Transplant, and Cellular Therapies, Seattle Children’s Hospital and the Ben Towne Center for Childhood Cancer Research, University of Washington, Seattle, WA
| | - Sarah Alexander
- Department of Hematology/Oncology, Hospital for Sick Children, Toronto, ON
| | - Tamara P. Miller
- Children’s Healthcare of Atlanta – Egleston, Pediatric Hematology/Oncology, Atlanta, GA
| |
Collapse
|
4
|
Miller TP, Li Y, Masino AJ, Vallee E, Burrows E, Ramos M, Alonzo TA, Gerbing R, Castellino SM, Hawkins DS, Lash TL, Aplenc R, Grundmeier RW. Automated Ascertainment of Typhlitis From the Electronic Health Record. JCO Clin Cancer Inform 2022; 6:e2200081. [PMID: 36198128 PMCID: PMC9848554 DOI: 10.1200/cci.22.00081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 07/14/2022] [Accepted: 08/16/2022] [Indexed: 11/05/2022] Open
Abstract
PURPOSE Adverse events (AEs) on Children's Oncology Group (COG) trials are manually ascertained using Common Terminology Criteria for Adverse Events. Despite significant effort, we previously demonstrated that COG typhlitis reporting sensitivity was only 37% when compared with gold standard physician chart abstraction. This study tested an automated typhlitis identification algorithm using electronic health record data. METHODS Electronic health record data from children with leukemia age 0-22 years treated at a single institution from 2006 to 2019 were included. Patients were divided into derivation and validation cohorts. Rigorous chart abstraction of validation cohort patients established a gold standard AE data set. We created an automated algorithm to identify typhlitis matching Common Terminology Criteria for Adverse Events v5 that included antibiotics, neutropenia, and non-negated mention of typhlitis in a note. We iteratively refined the algorithm using the derivation cohort and then applied the algorithm to the validation cohort; performance was compared with the gold standard. For patients on trial AAML1031, COG AE report performance was compared with the gold standard. RESULTS The derivation cohort included 337 patients. The validation cohort included 270 patients (961 courses). Chart abstraction identified 16 courses with typhlitis. The algorithm identified 37 courses with typhlitis; 13 were true positives (sensitivity 81.3%, positive predictive value 35.1%). For patients on AAML1031, chart abstraction identified nine courses with typhlitis, and COG reporting correctly identified 4 (sensitivity 44.4%, positive predictive value 100.0%). CONCLUSION The automated algorithm identified true cases of typhlitis with higher sensitivity than COG reporting. The algorithm identified false positives but reduced the number of courses needing manual review by 96% (961 to 37) by detecting potential typhlitis. This algorithm could provide a useful screening tool to reduce manual effort required for typhlitis AE reporting.
Collapse
Affiliation(s)
- Tamara P. Miller
- Aflac Cancer and Blood Disorders Center, Children's Healthcare of Atlanta, Atlanta, GA
- Department of Pediatrics, Emory University School of Medicine, Atlanta, GA
| | - Yimei Li
- Division of Oncology, The Children's Hospital of Philadelphia, Philadelphia, PA
- Perelman School of Medicine, University of Pennsylvania School of Medicine, Philadelphia, PA
| | - Aaron J. Masino
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Emma Vallee
- Division of Oncology, The Children's Hospital of Philadelphia, Philadelphia, PA
| | - Evanette Burrows
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Mark Ramos
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
| | | | | | - Sharon M. Castellino
- Aflac Cancer and Blood Disorders Center, Children's Healthcare of Atlanta, Atlanta, GA
- Department of Pediatrics, Emory University School of Medicine, Atlanta, GA
| | - Douglas S. Hawkins
- Division of Hematology/Oncology, Seattle Children's Hospital, Seattle, WA
- Department of Pediatrics, University of Washington, Seattle, WA
| | - Timothy L. Lash
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA
| | - Richard Aplenc
- Division of Oncology, The Children's Hospital of Philadelphia, Philadelphia, PA
- Perelman School of Medicine, University of Pennsylvania School of Medicine, Philadelphia, PA
- Center for Pediatric Clinical Effectiveness, The Children's Hospital of Philadelphia, Philadelphia, PA
| | - Robert W. Grundmeier
- Perelman School of Medicine, University of Pennsylvania School of Medicine, Philadelphia, PA
- Center for Pediatric Clinical Effectiveness, The Children's Hospital of Philadelphia, Philadelphia, PA
| |
Collapse
|