2
|
Lim JH, Kim S, Park JH, Kim CH, Choi JS, Chang JW, Kim S, Park IS, Ha B, Jo IY, Byeon HK, Park KN, Kim HS, Jung SY, Heo J. Systematic construction of composite radiation therapy dataset using automated data pipeline for prognosis prediction. Int J Med Inform 2025; 195:105712. [PMID: 39591846 DOI: 10.1016/j.ijmedinf.2024.105712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 11/14/2024] [Accepted: 11/18/2024] [Indexed: 11/28/2024]
Abstract
BACKGROUND Existing research on medical data has primarily focused on single time-points or single-modality data. This study aims to collect all data generated during radiotherapy comprehensively to improve the treatment and prognosis of patients with malignant tumors. METHODS The data collected from each medical institution were transmitted to the lead organization, where they underwent a file integrity check and were processed using a data pipeline. The key metadata of the collected data were compiled into a database, which were examined by data analysts to identify outliers based on theoretical and institution-specific characteristics. Appropriate filters were applied and the filtered data were subsequently reviewed by artificial intelligence (AI)-based models and researchers for radiotherapy organ slides. Finally, they were annotated by specialists. RESULTS The final dataset included 30,136 three-dimensional cone-beam computed tomography scans and 5,019 tabular data entries collected from 5,019 patients. It comprised 2,043,162 Digital Imaging and Communications in Medicine-format files with a total file size of 832 GB. Quality verification of the data using AI models revealed high classification performance for most organs, with relatively poor performance for the rectum. Overall, the macro AUROC value was 0.947. CONCLUSIONS This study implemented an automated data pipeline and AI-based verification to enhance the quality of collected radiotherapy data. The constructed dataset can be utilized for various types of future research and is expected to contribute to the improvement of radiotherapy efficiency.
Collapse
Affiliation(s)
- June Hyuck Lim
- Department of Radiation Oncology, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Seonhwa Kim
- Department of Radiation Oncology, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Jun Hyeong Park
- Department of Radiation Oncology, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Chul-Ho Kim
- Department of Otolaryngology, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Jeong-Seok Choi
- Department of Otorhinolaryngology-Head and Neck Surgery Inha University College of Medicine, Incheon, Republic of Korea
| | - Jae Won Chang
- Department of Otolaryngology-Head and Neck Surgery, Chungnam National University Hospital, Daejeon, Republic of Korea
| | - Sup Kim
- Department of Radiation Oncology, Chungnam National University Hospital, Daejeon, Republic of Korea
| | - Il-Seok Park
- Department of Otorhinolaryngology-Head and Neck Surgery, Hallym University Dontan Sacred Heart Hospital, Hallym University College of Medicine
| | - Boram Ha
- Department of Radiation Oncology, Dongtan Sacred Heart Hospital, Hallym University College of Medicine
| | - In Young Jo
- Department of Radiation Oncology, Soonchunhyang University, Cheonan Hospital
| | - Hyung Kwon Byeon
- Department of Otorhinolaryngology-Head and Neck Surgery, Soonchunhyang University
| | - Ki Nam Park
- Department of Otorhinolaryngology-Head and Neck Surgery, Soonchunhyang University
| | - Han Su Kim
- Otorhinolaryngology-Head and Neck Surgery, Ewha Womans University, College of Medicine
| | - Soo Yeon Jung
- Otorhinolaryngology-Head and Neck Surgery, Ewha Womans University, College of Medicine
| | - Jaesung Heo
- Department of Radiation Oncology, Ajou University School of Medicine, Suwon, Republic of Korea.
| |
Collapse
|
3
|
Doutreligne M, Struja T, Abecassis J, Morgand C, Celi LA, Varoquaux G. Step-by-step causal analysis of EHRs to ground decision-making. PLOS DIGITAL HEALTH 2025; 4:e0000721. [PMID: 39899627 PMCID: PMC11790099 DOI: 10.1371/journal.pdig.0000721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 12/10/2024] [Indexed: 02/05/2025]
Abstract
Causal inference enables machine learning methods to estimate treatment effects of medical interventions from electronic health records (EHRs). The prevalence of such observational data and the difficulty for randomized controlled trials (RCT) to cover all population/treatment relationships make these methods increasingly attractive for studying causal effects. However, researchers should be wary of many pitfalls. We propose and illustrate a framework for causal inference estimating the effect of albumin on mortality in sepsis using an Intensive Care database (MIMIC-IV) and comparing various sensitivity analyses to results from RCTs as gold-standard. The first step is study design, using the target trial concept and the PICOT framework: Population (patients with sepsis), Intervention (combination of crystalloids and albumin for fluid resuscitation), Control (crystalloids only), Outcome (28-day mortality), Time (intervention start within 24h of admission). We show that too large treatment-initiation times induce immortal time bias. The second step is selection of the confounding variables based on expert knowledge. Increasingly adding confounders enables to recover the RCT results from observational data. As the third step, we assess the influence of multiple models with varying assumptions, showing that a doubly robust estimator (AIPW) with random forests proved to be the most reliable estimator. Results show that these steps are all important for valid causal estimates. A valid causal model can then be used to individualize decision making: subgroup analyses showed that treatment efficacy of albumin was better for patients >60 years old, males, and patients with septic shock. Without causal thinking, machine learning is not enough for optimal clinical decision on an individual patient level. Our step-by-step analytic framework helps avoiding many pitfalls of applying machine learning to EHR data, building models that avoid shortcuts and extract the best decision-making evidence.
Collapse
Affiliation(s)
- Matthieu Doutreligne
- Soda Team, Inria Saclay, Palaiseau, France
- Mission Data, Haute Autorité de Santé, Saint-Denis, France
| | - Tristan Struja
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Medical University Clinic, Division of Endocrinology, Diabetes & Metabolism, Kantonsspital Aarau, Aarau, Switzerland
| | | | - Claire Morgand
- Agence Régionale de Santé Ile-de-France, Saint-Denis, France
| | - Leo Anthony Celi
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | | |
Collapse
|
4
|
Zhou S, Chen R, Liu J, Guo Z, Su L, Li Y, Zhang X, Luo F, Gao Q, Lin Y, Pang M, Cao L, Xu X, Nie S. Comparative Effectiveness and Safety of Atorvastatin Versus Rosuvastatin : A Multi-database Cohort Study. Ann Intern Med 2024; 177:1641-1651. [PMID: 39467290 DOI: 10.7326/m24-0178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/30/2024] Open
Abstract
BACKGROUND Rosuvastatin and atorvastatin are the most widely prescribed moderate- to high-intensity statins. However, evidence on their efficacy and safety during actual use is limited. OBJECTIVE To compare the real-world effectiveness and safety of rosuvastatin and atorvastatin. DESIGN Active comparator cohort study using target trial emulation. SETTING The China Renal Data System (CRDS) and UK Biobank (UKB) databases. PARTICIPANTS Adults newly prescribed rosuvastatin or atorvastatin. MEASUREMENTS The primary outcome was all-cause mortality. Cox proportional hazards regressions were used after 1:1 multilevel propensity score matching. RESULTS Among the 285 680 eligible participants in both databases, 6-year all-cause mortality was lower for rosuvastatin than for atorvastatin (2.57 vs. 2.83 per 100 person-years in the CRDS database and 0.66 vs. 0.90 per 100 person-years in the UKB database), with differences in cumulative incidence of -1.03% (95% CI, -1.44% to -0.46%) in the CRDS database and -1.38% (CI, -2.50% to -0.21%) in the UKB database. For secondary outcomes in both databases, rosuvastatin conferred lower risks for major adverse cardiovascular events and major adverse liver outcomes. In the UKB database, the risk for development of type 2 diabetes mellitus was higher with rosuvastatin, and the 2 medications carried similar risks for development of chronic kidney disease and other statin-related adverse effects. LIMITATION Possible residual confounding. CONCLUSION This study found differences in risks for some important outcomes associated with rosuvastatin and atorvastatin. The differences were relatively small, and many did not meet traditional standards for statistical significance. Further research is needed to understand whether these findings can be used with confidence in clinical practice. PRIMARY FUNDING SOURCE National Key R&D Program of China and National Natural Science Foundation of China.
Collapse
Affiliation(s)
- Shiyu Zhou
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Ruixuan Chen
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Jiao Liu
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Zhixin Guo
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Licong Su
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Yanqin Li
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Xiaodong Zhang
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Fan Luo
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Qi Gao
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Yuxin Lin
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Mingzhen Pang
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Lisha Cao
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Xin Xu
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| | - Sheng Nie
- Division of Nephrology, National Clinical Research Center for Kidney Disease, State Key Laboratory of Organ Failure Research, Nanfang Hospital, Southern Medical University, Guangzhou, China (S.Z., R.C., J.L., Z.G., L.S., Y.Li, X.Z., F.L., Q.G., Y.Lin, M.P., L.C., X.X., S.N.)
| |
Collapse
|
6
|
Maas CCHM, van Klaveren D, Durmaz M, Visser O, Issa DE, Posthuma EFM, Zijlstra JM, Chamuleau MED, Lugtenburg PJ, Kersten MJ, Dinmohamed AG. Comparative effectiveness of 6x R-CHOP21 versus 6x R-CHOP21 + 2 R for patients with advanced-stage diffuse large B-cell lymphoma. Blood Cancer J 2024; 14:157. [PMID: 39266543 PMCID: PMC11393348 DOI: 10.1038/s41408-024-01137-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 08/29/2024] [Accepted: 09/02/2024] [Indexed: 09/14/2024] Open
Abstract
First-line treatment for advanced-stage diffuse large B-cell lymphoma (DLBCL) typically involves 6x R-CHOP21 or 6x R-CHOP21 with two additional rituximab administrations (6x R-CHOP21 + 2 R). In contemporary practice, this treatment choice might be guided by interim PET scan results. This nationwide, population-based study investigates the comparative effectiveness of these treatment regimens in an era where interim PET-guided treatment decisions were not standard practice. Utilizing the Netherlands Cancer Registry, we identified 1577 adult patients diagnosed with advanced-stage DLBCL between 2014-2018 who completed either 6x R-CHOP21 (43%) or 6x R-CHOP21 + 2 R (57%). We used propensity scores to assess differences in event-free survival (EFS) and overall survival (OS). At five years, EFS (hazard ratio of 6x R-CHOP21 + 2 R versus 6x R-CHOP21 [HR] = 0.89; 95% confidence interval [CI], 0.72-1.09) and OS (HR = 0.93; 95% CI, 0.73-1.18) were not significantly different between both regimens. In exploratory risk-stratified analysis according to the International Prognostic Index (IPI), high-IPI patients (i.e., scores of 4-5) benefit most from 6x R-CHOP21 + 2 R (5-year absolute risk difference of EFS = 16.8%; 95% CI, -0.4%-34.1% and OS = 12.1%; 95% CI, -5.4-29.6%). Collectively, this analysis reveals no significant differences on average in EFS and OS between the two treatments. However, the potential benefits for high-risk patients treated with 6x R-CHOP21 + 2 R underscore the need for future research.
Collapse
Grants
- PJL reports research funding to institution: Takeda; Consultancy honoraria: Y-mAbs-Therapeutics; Sandoz; Bristol Meyer Squibb; Speaker honoraria: Roche, AbbVie, Lilly; Honoraria for advisory board: Roche, Genmab, AbbVie; Travel support: Sanofi.
- MJK reports honoraria from and consulting/advisory role for BMS/Celgene, Kite, a Gilead Company, Miltenyi Biotec, Adicet Bio, Mustang Bio, Novartis, and Roche; research funding from Kite, a Gilead Company, and travel support from Kite, a Gilead Company, Abbvie and Roche.
Collapse
Affiliation(s)
- Carolien C H M Maas
- Department of Public Health, Erasmus University Medical Centre, Rotterdam, The Netherlands.
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands.
| | - David van Klaveren
- Department of Public Health, Erasmus University Medical Centre, Rotterdam, The Netherlands
| | - Müjde Durmaz
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
- Amsterdam UMC, Department of Hematology, Cancer Center Amsterdam, Amsterdam, The Netherlands
| | - Otto Visser
- Department of Registration, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
| | - Djamila E Issa
- Department of Internal Medicine, Jeroen Bosch Hospital, Den Bosch, The Netherlands
| | - Eduardus F M Posthuma
- Department of Internal Medicine, Reinier de Graaf Gasthuis, Delft, The Netherlands
- Department of Hematology, Leiden University Medical Center, Leiden, The Netherlands
| | - Josée M Zijlstra
- Amsterdam UMC, Department of Hematology, Cancer Center Amsterdam, Amsterdam, The Netherlands
| | - Martine E D Chamuleau
- Amsterdam UMC, Department of Hematology, Cancer Center Amsterdam, Amsterdam, The Netherlands
| | - Pieternella J Lugtenburg
- Erasmus MC Cancer Institute, University Medical Center Rotterdam, Department of Hematology, Rotterdam, The Netherlands
| | - Marie José Kersten
- Amsterdam UMC, Department of Hematology, Cancer Center Amsterdam, Amsterdam, The Netherlands
- LYMMCARE (Lymphoma and Myeloma Center Amsterdam), Amsterdam, The Netherlands
| | - Avinash G Dinmohamed
- Department of Public Health, Erasmus University Medical Centre, Rotterdam, The Netherlands.
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands.
- Amsterdam UMC, Department of Hematology, Cancer Center Amsterdam, Amsterdam, The Netherlands.
- LYMMCARE (Lymphoma and Myeloma Center Amsterdam), Amsterdam, The Netherlands.
| |
Collapse
|