1
|
Lu Z, Ahmadiankalati M, Tan Z. Joint clustering multiple longitudinal features: A comparison of methods and software packages with practical guidance. Stat Med 2023; 42:5513-5540. [PMID: 37789706 DOI: 10.1002/sim.9917] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 06/07/2023] [Accepted: 09/13/2023] [Indexed: 10/05/2023]
Abstract
Clustering longitudinal features is a common goal in medical studies to identify distinct disease developmental trajectories. Compared to clustering a single longitudinal feature, integrating multiple longitudinal features allows additional information to be incorporated into the clustering process, which may reveal co-existing longitudinal patterns and generate deeper biological insight. Despite its increasing importance and popularity, there is limited practical guidance for implementing cluster analysis approaches for multiple longitudinal features and evaluating their comparative performance in medical datasets. In this paper, we provide an overview of several commonly used approaches to clustering multiple longitudinal features, with an emphasis on application and implementation through R software. These methods can be broadly categorized into two categories, namely model-based (including frequentist and Bayesian) approaches and algorithm-based approaches. To evaluate their performance, we compare these approaches using real-life and simulated datasets. These results provide practical guidance to applied researchers who are interested in applying these approaches for clustering multiple longitudinal features. Recommendations for applied researchers and suggestions for future research in this area are also discussed.
Collapse
Affiliation(s)
- Zihang Lu
- Department of Public Health Sciences, Queen's University, Kingston, Ontario, Canada
- Department of Mathematics and Statistics, Queen's University, Kingston, Ontario, Canada
| | | | - Zhiwen Tan
- Department of Public Health Sciences, Queen's University, Kingston, Ontario, Canada
| |
Collapse
|
2
|
Rousseau MC, Conus F, El-Zein M, Benedetti A, Parent ME. Ascertaining asthma status in epidemiologic studies: a comparison between administrative health data and self-report. BMC Med Res Methodol 2023; 23:201. [PMID: 37679673 PMCID: PMC10486089 DOI: 10.1186/s12874-023-02011-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 08/07/2023] [Indexed: 09/09/2023] Open
Abstract
BACKGROUND Studies have suggested that agreement between administrative health data and self-report for asthma status ranges from fair to good, but few studies benefited from administrative health data over a long period. We aimed to (1) evaluate agreement between asthma status ascertained in administrative health data covering a period of 30 years and from self-report, and (2) identify determinants of agreement between the two sources. METHODS We used administrative health data (1983-2012) from the Quebec Birth Cohort on Immunity and Health, which included 81,496 individuals born in the province of Quebec, Canada, in 1974. Additional information, including self-reported asthma, was collected by telephone interview with 1643 participants in 2012. By design, half of them had childhood asthma based on health services utilization. Results were weighted according to the inverse of the sampling probabilities. Five algorithms were applied to administrative health data (having ≥ 2 physician claims over a 1-, 2-, 3-, 5-, or 30-year interval or ≥ 1 hospitalization), to enable comparisons with previous studies. We estimated the proportion of overall agreement and Kappa, between asthma status derived from algorithms and self-reports. We used logistic regression to identify factors associated with agreement. RESULTS Applying the five algorithms, the prevalence of asthma ranged from 49 to 55% among the 1643 participants. At interview (mean age = 37 years), 49% and 47% of participants respectively reported ever having asthma and asthma diagnosed by a physician. Proportions of agreement between administrative health data and self-report ranged from 88 to 91%, with Kappas ranging from 0.57 (95% CI: 0.52-0.63) to 0.67 (95% CI: 0.62-0.72); the highest values were obtained with the [≥ 2 physician claims over a 30-year interval or ≥ 1 hospitalization] algorithm. Having sought health services for allergic diseases other than asthma was related to lower agreement (Odds ratio = 0.41; 95% CI: 0.25-0.65 comparing ≥ 1 health services to none). CONCLUSIONS These findings indicate good agreement between asthma status defined from administrative health data and self-report. Agreement was higher than previously observed, which may be due to the 30-year lookback window in administrative data. Our findings support using both administrative health data and self-report in population-based epidemiological studies.
Collapse
Affiliation(s)
- Marie-Claude Rousseau
- Epidemiology and Biostatistics Unit, Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Laval, QC, Canada.
- School of Public Health, Université de Montréal, Montréal, QC, Canada.
| | - Florence Conus
- Epidemiology and Biostatistics Unit, Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Laval, QC, Canada
- Direction des enquêtes de santé, Direction principale des statistiques sociales et de santé, Institut de la statistique du Québec, Montréal, QC, Canada
| | - Mariam El-Zein
- Epidemiology and Biostatistics Unit, Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Laval, QC, Canada
- Division of Cancer Epidemiology, McGill University, Montréal, QC, Canada
| | - Andrea Benedetti
- Respiratory Epidemiology and Clinical Research Unit, Research Institute of the McGill University Health Centre, Montréal, QC, Canada
- Department of Epidemiology, Biostatistics and Occupational Health, Faculty of Medicine, McGill University, Montréal, QC, Canada
| | - Marie-Elise Parent
- Epidemiology and Biostatistics Unit, Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Laval, QC, Canada
- School of Public Health, Université de Montréal, Montréal, QC, Canada
| |
Collapse
|
3
|
Poulakis K, Pereira JB, Muehlboeck JS, Wahlund LO, Smedby Ö, Volpe G, Masters CL, Ames D, Niimi Y, Iwatsubo T, Ferreira D, Westman E. Multi-cohort and longitudinal Bayesian clustering study of stage and subtype in Alzheimer's disease. Nat Commun 2022; 13:4566. [PMID: 35931678 PMCID: PMC9355993 DOI: 10.1038/s41467-022-32202-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 07/18/2022] [Indexed: 11/08/2022] Open
Abstract
Understanding Alzheimer's disease (AD) heterogeneity is important for understanding the underlying pathophysiological mechanisms of AD. However, AD atrophy subtypes may reflect different disease stages or biologically distinct subtypes. Here we use longitudinal magnetic resonance imaging data (891 participants with AD dementia, 305 healthy control participants) from four international cohorts, and longitudinal clustering to estimate differential atrophy trajectories from the age of clinical disease onset. Our findings (in amyloid-β positive AD patients) show five distinct longitudinal patterns of atrophy with different demographical and cognitive characteristics. Some previously reported atrophy subtypes may reflect disease stages rather than distinct subtypes. The heterogeneity in atrophy rates and cognitive decline within the five longitudinal atrophy patterns, potentially expresses a complex combination of protective/risk factors and concomitant non-AD pathologies. By alternating between the cross-sectional and longitudinal understanding of AD subtypes these analyses may allow better understanding of disease heterogeneity.
Collapse
Affiliation(s)
- Konstantinos Poulakis
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden.
| | - Joana B Pereira
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden
- Clinical Memory Research Unit, Department of Clinical Sciences, Lund University, Malmo, Sweden
| | - J-Sebastian Muehlboeck
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden
| | - Lars-Olof Wahlund
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden
| | - Örjan Smedby
- Department of Biomedical Engineering and Health Systems (MTH), KTH Royal Institute of Technology, Stockholm, Sweden
| | - Giovanni Volpe
- Department of Physics, University of Gothenburg, Gothenburg, Sweden
| | - Colin L Masters
- The Florey Institute of Neuroscience and Mental Health, The University of Melbourne, Victoria, Australia
| | - David Ames
- Academic Unit for Psychiatry of Old Age, St George's Hospital, University of Melbourne, Melbourne, Victoria, Australia
- National Ageing Research Institute, Parkville, Victoria, Australia
| | - Yoshiki Niimi
- Unit for Early and Exploratory Clinical Development, The University of Tokyo Hospital, Tokyo, Japan
| | - Takeshi Iwatsubo
- Unit for Early and Exploratory Clinical Development, The University of Tokyo Hospital, Tokyo, Japan
| | - Daniel Ferreira
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden
- Department of Radiology, Mayo Clinic, Rochester, MN, USA
| | - Eric Westman
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden
- Department of Neuroimaging, Centre for Neuroimaging Sciences, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| |
Collapse
|
4
|
Lu Z, Lou W. Bayesian consensus clustering for multivariate longitudinal data. Stat Med 2021; 41:108-127. [PMID: 34672001 DOI: 10.1002/sim.9225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 09/26/2021] [Accepted: 09/27/2021] [Indexed: 11/06/2022]
Abstract
In clinical and epidemiological studies, there is a growing interest in studying the heterogeneity among patients based on longitudinal characteristics to identify subtypes of the study population. Compared to clustering a single longitudinal marker, simultaneously clustering multiple longitudinal markers allow additional information to be incorporated into the clustering process, which reveals co-existing longitudinal patterns and generates deeper biological insight. In the current study, we propose a Bayesian consensus clustering (BCC) model for multivariate longitudinal data. Instead of arriving at a single overall clustering, the proposed model allows each marker to follow marker-specific local clustering and these local clusterings are aggregated to find a global (consensus) clustering. To estimate the posterior distribution of model parameters, a Gibbs sampling algorithm is proposed. We apply our proposed model to the primary biliary cirrhosis study to identify patient subtypes that may be associated with their prognosis. We also perform simulation studies to compare the clustering performance between the proposed model and existing models under several scenarios. The results demonstrate that the proposed BCC model serves as a useful tool for clustering multivariate longitudinal data.
Collapse
Affiliation(s)
- Zihang Lu
- Department of Public Health Sciences, Queen's University, Kingston, Ontario, Canada
| | - Wendy Lou
- Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
5
|
Campisi ES, Reyna ME, Brydges M, Dubeau A, Moraes TJ, Campisi P, Subbarao P. Adenotonsillectomy, bronchoscopy and bronchoalveolar lavage in the management of preschool children with severe asthma: pilot study. Eur Arch Otorhinolaryngol 2021; 279:319-326. [PMID: 34542655 DOI: 10.1007/s00405-021-07084-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 09/10/2021] [Indexed: 11/28/2022]
Abstract
PURPOSE This is a pilot study that describes the feasibility and clinical course of a cohort of preschool children with severe asthma undergoing a combined adenotonillectomy (TA), bronchoscopy (B), and bronchoalveolar lavage (BAL) procedure. METHODS A retrospective cohort study of preschool patients with severe asthma who underwent a combined TA-B-BAL procedure between 2012 and 2019. Subjects were treated at a tertiary care asthma clinic and had a diagnosis of preschool asthma according to the Canadian Thoracic Society Guidelines. Data on demographics, clinical characteristics, medication use, virology and microbiology from bronchoalveolar lavage, and asthma control questionnaires were collected. Variables were analyzed using paired t test. RESULTS Eighteen preschool subjects (mean age 3.19 ± 1.13 years) with severe asthma were identified through the asthma clinic. Patients treated with standard asthma care and a combined TA-B-BAL procedure experienced a decrease in the number of oral steroid courses (p = 0.017), emergency department visits (p = 0.03) and wheezing exacerbations (p = 0.026) following the procedure. Ten patients experienced clinically meaningful improvements in TRACK scores after the procedure (p < 0.001). CONCLUSION This pilot study provides early evidence that a combined TA-B-BAL procedure is feasible in preschool children with severe asthma and that the procedure may reduce asthma medication use and hospital visits.
Collapse
Affiliation(s)
- Emma S Campisi
- Department of Otolaryngology-Head & Neck Surgery, Hospital for Sick Children and University of Toronto, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
| | - Myrtha E Reyna
- Division of Respiratory Medicine, Department of Paediatrics, and Program in Translational Medicine, SickKids Research Institute, The Hospital for Sick Children, 555 University Avenue, Toronto, Canada
| | - May Brydges
- Division of Respiratory Medicine, Department of Paediatrics, and Program in Translational Medicine, SickKids Research Institute, The Hospital for Sick Children, 555 University Avenue, Toronto, Canada
| | - Aimee Dubeau
- Division of Respiratory Medicine, Department of Paediatrics, and Program in Translational Medicine, SickKids Research Institute, The Hospital for Sick Children, 555 University Avenue, Toronto, Canada
| | - Theo J Moraes
- Division of Respiratory Medicine, Department of Paediatrics, and Program in Translational Medicine, SickKids Research Institute, The Hospital for Sick Children, 555 University Avenue, Toronto, Canada
| | - Paolo Campisi
- Department of Otolaryngology-Head & Neck Surgery, Hospital for Sick Children and University of Toronto, 555 University Avenue, Toronto, ON, M5G 1X8, Canada.
| | - Padmaja Subbarao
- Division of Respiratory Medicine, Department of Paediatrics, and Program in Translational Medicine, SickKids Research Institute, The Hospital for Sick Children, 555 University Avenue, Toronto, Canada.,Dalla Lana School of Public Health, Division of Epidemiology, University of Toronto, 27 King's College Circle, Toronto, Canada
| |
Collapse
|
6
|
Abdullah K, Fell DB, Radhakrishnan D, Hawken S, Johnson DW, Mandhane P, To T, Joubert G, Plint AC. Risk of asthma in children diagnosed with bronchiolitis during infancy: protocol of a longitudinal cohort study linking emergency department-based clinical data to provincial health administrative databases. BMJ Open 2021; 11:e048823. [PMID: 33941638 PMCID: PMC8098926 DOI: 10.1136/bmjopen-2021-048823] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
INTRODUCTION The Canadian Bronchiolitis Epinephrine Steroid Trial (CanBEST) and the Bronchiolitis Severity Cohort (BSC) study enrolled infants with bronchiolitis during the first year of life. The CanBEST trial suggested that treatment of infants with a combined therapy of high-dose corticosteroids and nebulised epinephrine reduced the risk of admission to hospital. Our study aims to-(1) quantify the risk of developing asthma by age 5 and 10 years in children treated with high-dose corticosteroid and epinephrine for bronchiolitis during infancy, (2) identify risk factors associated with development of asthma in children with bronchiolitis during infancy, (3) develop asthma prediction models for children diagnosed with bronchiolitis during infancy. METHODS AND ANALYSIS We propose a longitudinal cohort study in which we will link data from the CanBEST and BSC study with routinely collected data from provincial health administrative databases. Our outcome is asthma incidence measured using a validated health administrative data algorithm. Primary exposure will be treatment with a combined therapy of high-dose corticosteroids and nebulised epinephrine for bronchiolitis. Covariates will include type of viral pathogen, disease severity, medication use, maternal, prenatal, postnatal and demographic factors and variables related to health service utilisation for acute lower respiratory tract infection. The risk associated with development of asthma in children treated with high-dose corticosteroid and epinephrine for bronchiolitis will be assessed using multivariable Cox proportional hazards regression models. Prediction models will be developed using multivariable logistic regression analysis and internally validated using a bootstrap approach. ETHICS AND DISSEMINATION Our study has been approved by the ethics board of all four participating sites of the CanBEST and BSC study. Finding of the study will be disseminated to the academic community and relevant stakeholders through conferences and peer-reviewed publications. TRIAL REGISTRATION NUMBER ISRCTN56745572; Post-results.
Collapse
Affiliation(s)
- Kawsari Abdullah
- Research Institute, Children's Hospital of Eastern Ontario, Ottawa, Ontario, Canada
- ICES, Ottawa, Ontario, Canada
| | - Deshayne B Fell
- Research Institute, Children's Hospital of Eastern Ontario, Ottawa, Ontario, Canada
- ICES, Ottawa, Ontario, Canada
- School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Dhenuka Radhakrishnan
- Research Institute, Children's Hospital of Eastern Ontario, Ottawa, Ontario, Canada
- ICES, Ottawa, Ontario, Canada
- Department of Pediatrics, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Steven Hawken
- ICES, Ottawa, Ontario, Canada
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Ontario, Canada
| | - David W Johnson
- Departments of Pediatrics and Physiology and Pharmacology, University of Calgary, Calgery, Alberta, Canada
- Maternal Newborn Child & Youth SCN, Alberta Health Services, Calgery, Alberta, Canada
| | - Piush Mandhane
- Department of Pediatrics, University of Alberta, Edmonton, Alberta, Canada
| | - Teresa To
- ICES, Ottawa, Ontario, Canada
- Research Institute, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Gary Joubert
- London Health Sciences Centre, London, Ontario, Canada
| | - Amy C Plint
- Research Institute, Children's Hospital of Eastern Ontario, Ottawa, Ontario, Canada
- Department of Pediatrics, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|