1
|
Rose C, Barber R, Preiksaitis C, Kim I, Mishra N, Kayser K, Brown I, Gisondi M. A Conference (Missingness in Action) to Address Missingness in Data and AI in Health Care: Qualitative Thematic Analysis. J Med Internet Res 2023; 25:e49314. [PMID: 37995113 PMCID: PMC10704317 DOI: 10.2196/49314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 07/27/2023] [Accepted: 10/25/2023] [Indexed: 11/24/2023] Open
Abstract
BACKGROUND Missingness in health care data poses significant challenges in the development and implementation of artificial intelligence (AI) and machine learning solutions. Identifying and addressing these challenges is critical to ensuring the continued growth and accuracy of these models as well as their equitable and effective use in health care settings. OBJECTIVE This study aims to explore the challenges, opportunities, and potential solutions related to missingness in health care data for AI applications through the conduct of a digital conference and thematic analysis of conference proceedings. METHODS A digital conference was held in September 2022, attracting 861 registered participants, with 164 (19%) attending the live event. The conference featured presentations and panel discussions by experts in AI, machine learning, and health care. Transcripts of the event were analyzed using the stepwise framework of Braun and Clark to identify key themes related to missingness in health care data. RESULTS Three principal themes-data quality and bias, human input in model development, and trust and privacy-emerged from the analysis. Topics included the accuracy of predictive models, lack of inclusion of underrepresented communities, partnership with physicians and other populations, challenges with sensitive health care data, and fostering trust with patients and the health care community. CONCLUSIONS Addressing the challenges of data quality, human input, and trust is vital when devising and using machine learning algorithms in health care. Recommendations include expanding data collection efforts to reduce gaps and biases, involving medical professionals in the development and implementation of AI models, and developing clear ethical guidelines to safeguard patient privacy. Further research and ongoing discussions are needed to ensure these conclusions remain relevant as health care and AI continue to evolve.
Collapse
Affiliation(s)
- Christian Rose
- Department of Emergency Medicine, Stanford University School of Medicine, Palo Alto, CA, United States
| | | | - Carl Preiksaitis
- Department of Emergency Medicine, Stanford University School of Medicine, Palo Alto, CA, United States
| | - Ireh Kim
- Stanford University, Palo Alto, CA, United States
| | | | - Kristen Kayser
- Department of Emergency Medicine, Stanford University School of Medicine, Palo Alto, CA, United States
| | - Italo Brown
- Department of Emergency Medicine, Stanford University School of Medicine, Palo Alto, CA, United States
| | - Michael Gisondi
- Department of Emergency Medicine, Stanford University School of Medicine, Palo Alto, CA, United States
| |
Collapse
|
2
|
Le JP, Shashikumar SP, Malhotra A, Nemati S, Wardi G. Making the Improbable Possible: Generalizing Models Designed for a Syndrome-Based, Heterogeneous Patient Landscape. Crit Care Clin 2023; 39:751-768. [PMID: 37704338 PMCID: PMC10758922 DOI: 10.1016/j.ccc.2023.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2023]
Abstract
Syndromic conditions, such as sepsis, are commonly encountered in the intensive care unit. Although these conditions are easy for clinicians to grasp, these conditions may limit the performance of machine-learning algorithms. Individual hospital practice patterns may limit external generalizability. Data missingness is another barrier to optimal algorithm performance and various strategies exist to mitigate this. Recent advances in data science, such as transfer learning, conformal prediction, and continual learning, may improve generalizability of machine-learning algorithms in critically ill patients. Randomized trials with these approaches are indicated to demonstrate improvements in patient-centered outcomes at this point.
Collapse
Affiliation(s)
- Joshua Pei Le
- School of Medicine, University of Limerick, Castletroy, Co, Limerick V94 T9PX, Ireland
| | | | - Atul Malhotra
- Division of Pulmonary, Critical Care and Sleep Medicine, University of California San Diego, San Diego, CA, USA
| | - Shamim Nemati
- Division of Biomedical Informatics, University of California San Diego, San Diego, CA, USA
| | - Gabriel Wardi
- Division of Pulmonary, Critical Care and Sleep Medicine, University of California San Diego, San Diego, CA, USA; Department of Emergency Medicine, University of California San Diego, 200 W Arbor Drive, San Diego, CA 92103, USA.
| |
Collapse
|
3
|
Tong G, Li F, Chen X, Hirani SP, Newman SP, Wang W, Harhay MO. A Bayesian Approach for Estimating the Survivor Average Causal Effect When Outcomes Are Truncated by Death in Cluster-Randomized Trials. Am J Epidemiol 2023; 192:1006-1015. [PMID: 36799630 PMCID: PMC10236525 DOI: 10.1093/aje/kwad038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 01/05/2023] [Accepted: 02/18/2023] [Indexed: 02/18/2023] Open
Abstract
Many studies encounter clustering due to multicenter enrollment and nonmortality outcomes, such as quality of life, that are truncated due to death-that is, missing not at random and nonignorable. Traditional missing-data methods and target causal estimands are suboptimal for statistical inference in the presence of these combined issues, which are especially common in multicenter studies and cluster-randomized trials (CRTs) carried out among the elderly or seriously ill. Using principal stratification, we developed a Bayesian estimator that jointly identifies the always-survivor principal stratum in a clustered/hierarchical data setting and estimates the average treatment effect among them (i.e., the survivor average causal effect (SACE)). In simulations, we observed low bias and good coverage with our method. In a motivating CRT, the SACE and the estimate from complete-case analysis differed in magnitude, but both were small, and neither was incompatible with a null effect. However, the SACE estimate has a clear causal interpretation. The option to assess the rigorously defined SACE estimand in studies with informative truncation and clustering can provide additional insight into an important subset of study participants. Based on the simulation study and CRT reanalysis, we provide practical recommendations for using the SACE in CRTs and software code to support future research.
Collapse
Affiliation(s)
- Guangyu Tong
- Correspondence to Dr. Guangyu Tong, Department of Biostatistics, Yale School of Public Health, 135 College Street, New Haven, CT 06510 (e-mail: )
| | | | | | | | | | | | | |
Collapse
|
4
|
Woods AD, Gerasimova D, Van Dusen B, Nissen J, Bainter S, Uzdavines A, Davis‐Kean PE, Halvorson M, King KM, Logan JAR, Xu M, Vasilev MR, Clay JM, Moreau D, Joyal‐Desmarais K, Cruz RA, Brown DMY, Schmidt K, Elsherif MM. Best practices for addressing missing data through multiple imputation. INFANT AND CHILD DEVELOPMENT 2023. [DOI: 10.1002/icd.2407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/26/2023]
Affiliation(s)
- Adrienne D. Woods
- Center for Learning and Development, Education SRI International Arlington Virginia USA
| | - Daria Gerasimova
- Kansas University Center on Developmental Disabilities University of Kansas Lawrence Kansas USA
| | - Ben Van Dusen
- School of Education Iowa State University Ames Iowa USA
| | - Jayson Nissen
- Nissen Education Research and Design Corvallis Oregon USA
| | - Sierra Bainter
- Department of Psychology University of Miami Coral Gables Florida USA
| | - Alex Uzdavines
- South Central Mental Illness Research Education, and Clinical Center, Michael E. DeBakey VA Medical Center Houston Texas USA
- Menninger Department of Psychiatry and Behavioral Sciences Baylor College of Medicine Houston Texas USA
| | | | - Max Halvorson
- Department of Psychology University of Washington Seattle Washington USA
| | - Kevin M. King
- Department of Psychology University of Washington Seattle Washington USA
| | - Jessica A. R. Logan
- Department of Educational Studies The Ohio State University Columbus Ohio USA
| | - Menglin Xu
- Department of Internal Medicine The Ohio State University Columbus Ohio USA
| | | | - James M. Clay
- Department of Psychology University of Portsmouth Portsmouth UK
| | - David Moreau
- School of Psychology University of Auckland Auckland New Zealand
- Centre for Brain Research University of Auckland Auckland New Zealand
| | - Keven Joyal‐Desmarais
- Department of Health, Kinesiology, and Applied Physiology Concordia University Montreal Quebec Canada
- Montreal Behavioral Medicine Centre Centre intégré universitaire de santé et de services sociaux du Nord‐de‐l'Île‐de‐Montréal Montreal Quebec Canada
| | - Rick A. Cruz
- Department of Psychology Arizona State University Tempe Arizona USA
| | - Denver M. Y. Brown
- Department of Psychology University of Texas at San Antonio San Antonio Texas USA
| | - Kathleen Schmidt
- School of Psychological and Behavioral Sciences Southern Illinois University Carbondale Illinois USA
| | | |
Collapse
|
5
|
Tweedie Model for Predicting Factors Associated with Distance Traveled to Access Inpatient Services in Kenya. JOURNAL OF PROBABILITY AND STATISTICS 2022. [DOI: 10.1155/2022/2706504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Aim. This study aims to examine which factors influence the distance traveled by patients for inpatient care in Kenya. Methods. We used data from the fourth round of the Kenya Household Health Expenditure and Utilization survey. Our dependent variable was the self-reported distance traveled by patients to access inpatient care at public health facilities. As the clustered data were correlated, we used the generalized estimating equations approach with an exchangeable correlation under a Tweedie distribution. To select the best-fit covariates for predicting distance, we adopted a variable selection technique using the
and
criteria, wherein the lowest (highest) value for the former (latter) is preferred. Results. Using data for 451 participants from 47 counties, we found that three-fifths were admitted between 1 and 5 days, two-thirds resided in rural areas, and 90% were satisfied with the facilities’ service. Wealth quintiles were evenly distributed across respondents. Most admissions (81%) comprised
15,
65, and 25–54 years. Many households were of medium size (4–6 members) and had low education level (48%), and nine-tenths had no access to insurance. While two-thirds reported employment-based income, the same number reported not having cash to pay for inpatient services; 6 out of 10 paid over 3000 KES. Thus, differences in employment, ability to pay, and household size influence the distance traveled to access government healthcare facilities in Kenya. Interpretation. Low-income individuals more likely have large households and live in rural areas and, thus, are forced to travel farther to access inpatient care. Unlike the unemployed, the employed may have better socioeconomic status and possibly live near inpatient healthcare facilities, thereby explaining their short distances to access these services. Policymakers must support equal access to inpatient services, prioritize rural areas, open job opportunities, and encourage smaller families.
Collapse
|
6
|
Rajan SS, Wang M, Singh N, Jacob AP, Parker SA, Czap AL, Bowry R, Grotta JC, Yamal JM. Retrospectively Collected EQ-5D-5L Data as Valid Proxies for Imputing Missing Information in Longitudinal Studies. VALUE IN HEALTH : THE JOURNAL OF THE INTERNATIONAL SOCIETY FOR PHARMACOECONOMICS AND OUTCOMES RESEARCH 2021; 24:1720-1727. [PMID: 34838269 DOI: 10.1016/j.jval.2021.07.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 06/13/2021] [Accepted: 07/01/2021] [Indexed: 06/13/2023]
Abstract
OBJECTIVES Studies face challenges with missing 5-level EQ-5D (EQ-5D-5L) data, often because of the need for longitudinal EQ-5D-5L data collection. There is a dearth of validated methodologies for dealing with missing EQ-5D-5L data in the literature. This study, for the first time, examined the possibility of using retrospectively collected EQ-5D-5L data as proxies for the missing data. METHODS Participants who had prospectively completed a 3rd month postdischarge EQ-5D-5L instrument (in-the-moment collection) were randomly interviewed to respond to a 2nd "retrospective collection" of their 3rd month EQ-5D-5L at 6th, 9th, or 12th month after hospital discharge. A longitudinal single imputation was also used to assess the relative performance of retrospective collection compared with the longitudinal single imputation. Concordances between the in-the-moment, retrospective, and imputed measures were assessed using intraclass correlation coefficients and weighted kappa statistics. RESULTS Considerable agreement was observed on the basis of weighted kappa (range 0.72-0.95) between the mobility, self-care, and usual activities dimensions of EQ-5D-5L collected in-the-moment and retrospectively. Concordance based on intraclass correlation coefficients was good to excellent (range 0.79-0.81) for utility indices computed, and excellent (range 0.93-0.96) for quality-adjusted life-years computed using in-the-moment compared with retrospective EQ-5D-5L. The longitudinal single imputation did not perform as well as the retrospective collection method. CONCLUSIONS This study demonstrates that retrospective collection of EQ-5D-5L has high concordance with "in-the-moment" EQ-5D-5L and could be a valid and attractive alternative for data imputation when longitudinally collected EQ-5D-5L data are missing. Future studies examining this method for other disease areas and populations are required to provide more generalizable evidence.
Collapse
Affiliation(s)
- Suja S Rajan
- Department of Management, Policy, and Community Health, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Mengxi Wang
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Noopur Singh
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Asha P Jacob
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Stephanie A Parker
- Department of Neurology, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Alexandra L Czap
- Department of Neurology, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Ritvij Bowry
- Department of Neurology, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - James C Grotta
- Mobile Stroke Unit and Stroke Research, Clinical Innovation and Research Institute, Memorial Hermann Hospital, Houston, TX, USA
| | - Jose-Miguel Yamal
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA.
| |
Collapse
|
7
|
Wang S, Celebi ME, Zhang YD, Yu X, Lu S, Yao X, Zhou Q, Miguel MG, Tian Y, Gorriz JM, Tyukin I. Advances in Data Preprocessing for Biomedical Data Fusion: An Overview of the Methods, Challenges, and Prospects. INFORMATION FUSION 2021; 76:376-421. [DOI: 10.1016/j.inffus.2021.07.001] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/30/2023]
|
8
|
Suzuki T, Ueta H, Taito S. Relevant Factors for Physical, Mental, and Cognitive Problems in ICU Survivors. Am J Respir Crit Care Med 2021; 204:1001. [PMID: 34428383 PMCID: PMC8534610 DOI: 10.1164/rccm.202107-1714le] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Affiliation(s)
| | - Hiroshi Ueta
- Kobe City Medical Center General Hospital Kobe, Japan
| | | |
Collapse
|
9
|
Alsaber A, Al-Herz A, Pan J, Al-Sultan AT, Mishra D. Handling missing data in a rheumatoid arthritis registry using random forest approach. Int J Rheum Dis 2021; 24:1282-1293. [PMID: 34382756 DOI: 10.1111/1756-185x.14203] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2021] [Revised: 07/13/2021] [Accepted: 07/23/2021] [Indexed: 12/01/2022]
Abstract
Missing data in clinical epidemiological research violate the intention-to-treat principle, reduce the power of statistical analysis, and can introduce bias if the cause of missing data is related to a patient's response to treatment. Multiple imputation provides a solution to predict the values of missing data. The main objective of this study is to estimate and impute missing values in patient records. The data from the Kuwait Registry for Rheumatic Diseases was used to deal with missing values among patient records. A number of methods were implemented to deal with missing data; however, choosing the best imputation method was judged by the lowest root mean square error (RMSE). Among 1735 rheumatoid arthritis patients, we found missing values vary from 5% to 65.5% of the total observations. The results show that sequential random forest method can estimate these missing values with a high level of accuracy. The RMSE varied between 2.5 and 5.0. missForest had the lowest imputation error for both continuous and categorical variables under each missing data rate (10%, 20%, and 30%) and had the smallest prediction error difference when the models used the imputed laboratory values.
Collapse
Affiliation(s)
- Ahmad Alsaber
- Department of Mathematics and Statistics, University of Strathclyde, Glasgow, UK
| | - Adeeba Al-Herz
- Department of Rheumatology, Al-Amiri Hospital, Kuwait City, Kuwait
| | - Jiazhu Pan
- Department of Mathematics and Statistics, University of Strathclyde, Glasgow, UK
| | - Ahmad T Al-Sultan
- Department of Community Medicine and Behavioral Sciences, Kuwait University, Kuwait City, Kuwait
| | - Divya Mishra
- Department of Plant Pathology, Kansas State University, Kansas, MN, USA
| | -
- Department of Rheumatology, Al-Amiri Hospital, Kuwait City, Kuwait
| |
Collapse
|
10
|
Abstract
In recent years, mass spectrometry (MS)-based metabolomics has been extensively applied to characterize biochemical mechanisms, and study physiological processes and phenotypic changes associated with disease. Metabolomics has also been important for identifying biomarkers of interest suitable for clinical diagnosis. For the purpose of predictive modeling, in this chapter, we will review various supervised learning algorithms such as random forest (RF), support vector machine (SVM), and partial least squares-discriminant analysis (PLS-DA). In addition, we will also review feature selection methods for identifying the best combination of metabolites for an accurate predictive model. We conclude with best practices for reproducibility by including internal and external replication, reporting metrics to assess performance, and providing guidelines to avoid overfitting and to deal with imbalanced classes. An analysis of an example data will illustrate the use of different machine learning methods and performance metrics.
Collapse
Affiliation(s)
- Tusharkanti Ghosh
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Weiming Zhang
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Debashis Ghosh
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Katerina Kechris
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
| |
Collapse
|
11
|
Carreras G, Miccinesi G, Wilcock A, Preston N, Nieboer D, Deliens L, Groenvold M, Lunder U, van der Heide A, Baccini M. Missing not at random in end of life care studies: multiple imputation and sensitivity analysis on data from the ACTION study. BMC Med Res Methodol 2021; 21:13. [PMID: 33422019 PMCID: PMC7796568 DOI: 10.1186/s12874-020-01180-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 11/26/2020] [Indexed: 11/17/2022] Open
Abstract
Background Missing data are common in end-of-life care studies, but there is still relatively little exploration of which is the best method to deal with them, and, in particular, if the missing at random (MAR) assumption is valid or missing not at random (MNAR) mechanisms should be assumed. In this paper we investigated this issue through a sensitivity analysis within the ACTION study, a multicenter cluster randomized controlled trial testing advance care planning in patients with advanced lung or colorectal cancer. Methods Multiple imputation procedures under MAR and MNAR assumptions were implemented. Possible violation of the MAR assumption was addressed with reference to variables measuring quality of life and symptoms. The MNAR model assumed that patients with worse health were more likely to have missing questionnaires, making a distinction between single missing items, which were assumed to satisfy the MAR assumption, and missing values due to completely missing questionnaire for which a MNAR mechanism was hypothesized. We explored the sensitivity to possible departures from MAR on gender differences between key indicators and on simple correlations. Results Up to 39% of follow-up data were missing. Results under MAR reflected that missingness was related to poorer health status. Correlations between variables, although very small, changed according to the imputation method, as well as the differences in scores by gender, indicating a certain sensitivity of the results to the violation of the MAR assumption. Conclusions The findings confirmed the importance of undertaking this kind of analysis in end-of-life care studies. Supplementary Information The online version contains supplementary material available at 10.1186/s12874-020-01180-y.
Collapse
Affiliation(s)
- Giulia Carreras
- Oncological Network, Prevention and Research Institute (ISPRO), Florence, Italy.
| | - Guido Miccinesi
- Oncological Network, Prevention and Research Institute (ISPRO), Florence, Italy
| | - Andrew Wilcock
- Department of Clinical Oncology, University of Nottingham, Nottingham, UK
| | - Nancy Preston
- Lancaster University, International Observatory on end of life care, Lancaster, UK
| | - Daan Nieboer
- Department of Public Health, Erasmus University, Rotterdam, Netherlands
| | - Luc Deliens
- Vrije Universiteit Brussel & Ghent University, Brussels, Belgium
| | - Mogensm Groenvold
- Department of Public Health, Copenhagen University, Copenhagen, Denmark
| | - Urska Lunder
- University Clinic for Respiratory and Allergic Diseases, Golnik, Slovenia
| | | | - Michela Baccini
- Department of Statistics, Computer Science, Applications 'G. Parenti' (DISIA), University of Florence, Florence, Italy
| | | |
Collapse
|
12
|
Nabi R, Bhattacharya R, Shpitser I. Full Law Identification in Graphical Models of Missing Data: Completeness Results. PROCEEDINGS OF MACHINE LEARNING RESEARCH 2020; 119:7153-7163. [PMID: 33283197 PMCID: PMC7716645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Missing data has the potential to affect analyses conducted in all fields of scientific study including healthcare, economics, and the social sciences. Several approaches to unbiased inference in the presence of non-ignorable missingness rely on the specification of the target distribution and its missingness process as a probability distribution that factorizes with respect to a directed acyclic graph. In this paper, we address the longstanding question of the characterization of models that are identifiable within this class of missing data distributions. We provide the first completeness result in this field of study - necessary and sufficient graphical conditions under which, the full data distribution can be recovered from the observed data distribution. We then simultaneously address issues that may arise due to the presence of both missing data and unmeasured confounding, by extending these graphical conditions and proofs of completeness, to settings where some variables are not just missing, but completely unobserved.
Collapse
Affiliation(s)
- Razieh Nabi
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Rohit Bhattacharya
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Ilya Shpitser
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
13
|
Xu X, Xia L, Zhang Q, Wu S, Wu M, Liu H. The ability of different imputation methods for missing values in mental measurement questionnaires. BMC Med Res Methodol 2020; 20:42. [PMID: 32103723 PMCID: PMC7045426 DOI: 10.1186/s12874-020-00932-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2019] [Accepted: 02/19/2020] [Indexed: 01/10/2023] Open
Abstract
Background Incomplete data are of particular important influence in mental measurement questionnaires. Most experts, however, mostly focus on clinical trials and cohort studies and generally pay less attention to this deficiency. We aim is to compare the accuracy of four common methods for handling items missing from different psychology questionnaires according to the items non-response rates. Method All data were drawn from the previous studies including the self-acceptance scale (SAQ), the activities of daily living scale (ADL) and self-esteem scale (RSES). SAQ and ADL dataset, simulation group, were used to compare and assess the ability of four imputation methods which are direct deletion, mode imputation, Hot-deck (HD) imputation and multiple imputation (MI) by absolute deviation, the root mean square error and average relative error in missing proportions of 5, 10, 15 and 20%. RSES dataset, validation group, was used to test the application of imputation methods. All analyses were finished by SAS 9.4. Results The biases obtained by MI are the smallest under various missing proportions. HD imputation approach performed the lowest absolute deviation of standard deviation values. But they got the similar results and the performances of them are obviously better than direct deletion and mode imputation. In a real world situation, the respondents’ average score in complete data set was 28.22 ± 4.63, which are not much different from imputed datasets. The direction of the influence of the five factors on self-esteem was consistent, although there were some differences in the size and range of OR values in logistic regression model. Conclusion MI shows the best performance while it demands slightly more data analytic capacity and skills of programming. And HD could be considered to impute missing values in psychological investigation when MI cannot be performed due to limited circumstances.
Collapse
Affiliation(s)
- Xueying Xu
- Public Health of School, China Medical University, No.77 Puhe Road, 110122, Shenyang, People's Republic of China
| | - Leizhen Xia
- Public Health of School, China Medical University, No.77 Puhe Road, 110122, Shenyang, People's Republic of China
| | - Qimeng Zhang
- Public Health of School, China Medical University, No.77 Puhe Road, 110122, Shenyang, People's Republic of China
| | - Shaoning Wu
- Public Health of School, China Medical University, No.77 Puhe Road, 110122, Shenyang, People's Republic of China
| | - Mingcheng Wu
- Public Health of School, China Medical University, No.77 Puhe Road, 110122, Shenyang, People's Republic of China
| | - Hongbo Liu
- Public Health of School, China Medical University, No.77 Puhe Road, 110122, Shenyang, People's Republic of China.
| |
Collapse
|
14
|
Halme AS, Tannenbaum C. Performance of a Bayesian Approach for Imputing Missing Data on the SF-12 Health-Related Quality-of-Life Measure. VALUE IN HEALTH : THE JOURNAL OF THE INTERNATIONAL SOCIETY FOR PHARMACOECONOMICS AND OUTCOMES RESEARCH 2018; 21:1406-1412. [PMID: 30502784 DOI: 10.1016/j.jval.2018.06.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2017] [Revised: 05/22/2018] [Accepted: 06/18/2018] [Indexed: 06/09/2023]
Abstract
BACKGROUND Missing data in health-related quality-of-life outcomes are an ongoing problem. The 12-item short form health survey (SF-12) scores are no exception. Data imputation is complicated, because missingness may be partially predicted by the missing data themselves. OBJECTIVES To compare the performance of a Bayesian method for imputing SF-12 data with previously described frequentist imputation methods. METHODS SF-12 data were extracted from a trial assessing continence promotion on health-related quality of life in older women (n = 1052); the data set was split into a model development cohort for creating predictive models and a validation cohort to validate these models. Algorithms were constructed using data from the model development cohort to compute SF-12-related scores (physical health composite scale, the mental health composite scale, and the six-dimensional health state short form utilities). The Bayesian models used missing at random and missing not at random algorithms to impute missing SF-12 answers as categorical data. Comparative models replaced missing data with 0, used the mean weight of the sample, and regressed parameters from sociodemographic predictors. Data randomly deleted from the validation cohort were imputed with each algorithm, and the mean absolute error was used to gauge goodness of fit. RESULTS Each cohort included 526 persons; mean age was 78.1 ± 7.8 years. In the model development cohort, 15.6% of the participants had missing data. For the physical health composite scale, the mental health composite scale, and the six-dimensional health state short form utilities, the Bayesian model with missing at random data significantly outperformed all five comparison models, including the Bayesian models with missing not at random data. CONCLUSIONS Bayesian imputation was superior to other previously described methods for computing missing SF-12 data.
Collapse
Affiliation(s)
- Alex S Halme
- Department of Medicine, McGill University, Montréal, Québec, Canada
| | | |
Collapse
|
15
|
Apathy and health-related quality of life in nursing home residents. Qual Life Res 2018; 28:751-759. [PMID: 30406574 DOI: 10.1007/s11136-018-2041-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/31/2018] [Indexed: 10/27/2022]
Abstract
PURPOSE To explore the association between apathy and health-related quality of life (HRQoL) from resident and proxy perspectives and whether cognition and depression moderate this relationship. METHODS Secondary analyses with baseline data from a cluster randomized trial on the effects of a care program for depression in Nursing Homes (NHs) were conducted. For HRQoL, the Visual Analogue Scale (VAS) and the Dutch version of the European Quality of Life (EQ-5D) were administered to 521 NH residents, and to professional caregivers reporting from the perspective of the NH resident (Resident-Proxy) and from their own perspective (Proxy-Proxy). Utility scores (U) were calculated for the three perspectives. Apathy, depression, and cognition were measured using the 10-item Apathy Evaluation Scale, the Cornell Scale for Depression in Dementia, and the standardized Mini-Mental State Examination, respectively. RESULTS Mixed models adjusted for clustering within NH units revealed that apathy was negatively associated with HRQoL both from the Resident-Proxy perspective (EQ-5D VAS: estimated effect, - 0.31, P < 0.001; EQ-5D Utility: - 0.30, P < 0.001) and from the Proxy-Proxy perspective (VAS: - 0.29, P < 0.001; U: - 0.03, P < 0.001), but not from the Resident-Resident perspective (VAS: - 0.05, P = 0.423; Utility: - 0.08, P = 0.161). Controlling for depression and cognition and their interaction terms with apathy did not change the results. CONCLUSION Apathy is negatively associated with NH resident HRQoL as reported by proxies. Depression and cognitive functioning do not moderate this association. NH residents do not self-report a relationship between apathy and HRQoL. More research is needed to understand caregiver and NH resident attitudes and underlying assumptions regarding apathy and HRQoL.
Collapse
|
16
|
Mason AJ, Gomes M, Grieve R, Carpenter JR. A Bayesian framework for health economic evaluation in studies with missing data. HEALTH ECONOMICS 2018; 27:1670-1683. [PMID: 29969834 PMCID: PMC6220766 DOI: 10.1002/hec.3793] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Revised: 04/04/2018] [Accepted: 04/11/2018] [Indexed: 05/02/2023]
Abstract
Health economics studies with missing data are increasingly using approaches such as multiple imputation that assume that the data are "missing at random." This assumption is often questionable, as-even given the observed data-the probability that data are missing may reflect the true, unobserved outcomes, such as the patients' true health status. In these cases, methodological guidelines recommend sensitivity analyses to recognise data may be "missing not at random" (MNAR), and call for the development of practical, accessible approaches for exploring the robustness of conclusions to MNAR assumptions. Little attention has been paid to the problem that data may be MNAR in health economics in general and in cost-effectiveness analyses (CEA) in particular. In this paper, we propose a Bayesian framework for CEA where outcome or cost data are missing. Our framework includes a practical, accessible approach to sensitivity analysis that allows the analyst to draw on expert opinion. We illustrate the framework in a CEA comparing an endovascular strategy with open repair for patients with ruptured abdominal aortic aneurysm, and provide software tools to implement this approach.
Collapse
Affiliation(s)
- Alexina J. Mason
- Department of Health Services Research and PolicyLondon School of Hygiene and Tropical MedicineLondonUK
| | - Manuel Gomes
- Department of Health Services Research and PolicyLondon School of Hygiene and Tropical MedicineLondonUK
| | - Richard Grieve
- Department of Health Services Research and PolicyLondon School of Hygiene and Tropical MedicineLondonUK
| | - James R. Carpenter
- Department of Medical StatisticsLondon School of Hygiene and Tropical MedicineLondonUK
- MRC Clinical Trials UnitUniversity College LondonLondonUK
| |
Collapse
|
17
|
Mason AJ, Gomes M, Grieve R, Ulug P, Powell JT, Carpenter J. Development of a practical approach to expert elicitation for randomised controlled trials with missing health outcomes: Application to the IMPROVE trial. Clin Trials 2017; 14:357-367. [PMID: 28675302 PMCID: PMC5648050 DOI: 10.1177/1740774517711442] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Background/aims: The analyses of randomised controlled trials with missing data typically assume that, after conditioning on the observed data, the probability of missing data does not depend on the patient’s outcome, and so the data are ‘missing at random’ . This assumption is usually implausible, for example, because patients in relatively poor health may be more likely to drop out. Methodological guidelines recommend that trials require sensitivity analysis, which is best informed by elicited expert opinion, to assess whether conclusions are robust to alternative assumptions about the missing data. A major barrier to implementing these methods in practice is the lack of relevant practical tools for eliciting expert opinion. We develop a new practical tool for eliciting expert opinion and demonstrate its use for randomised controlled trials with missing data. Methods: We develop and illustrate our approach for eliciting expert opinion with the IMPROVE trial (ISRCTN 48334791), an ongoing multi-centre randomised controlled trial which compares an emergency endovascular strategy versus open repair for patients with ruptured abdominal aortic aneurysm. In the IMPROVE trial at 3 months post-randomisation, 21% of surviving patients did not complete health-related quality of life questionnaires (assessed by EQ-5D-3L). We address this problem by developing a web-based tool that provides a practical approach for eliciting expert opinion about quality of life differences between patients with missing versus complete data. We show how this expert opinion can define informative priors within a fully Bayesian framework to perform sensitivity analyses that allow the missing data to depend upon unobserved patient characteristics. Results: A total of 26 experts, of 46 asked to participate, completed the elicitation exercise. The elicited quality of life scores were lower on average for the patients with missing versus complete data, but there was considerable uncertainty in these elicited values. The missing at random analysis found that patients randomised to the emergency endovascular strategy versus open repair had higher average (95% credible interval) quality of life scores of 0.062 (−0.005 to 0.130). Our sensitivity analysis that used the elicited expert information as pooled priors found that the gain in average quality of life for the emergency endovascular strategy versus open repair was 0.076 (−0.054 to 0.198). Conclusion: We provide and exemplify a practical tool for eliciting the expert opinion required by recommended approaches to the sensitivity analyses of randomised controlled trials. We show how this approach allows the trial analysis to fully recognise the uncertainty that arises from making alternative, plausible assumptions about the reasons for missing data. This tool can be widely used in the design, analysis and interpretation of future trials, and to facilitate this, materials are available for download.
Collapse
Affiliation(s)
- Alexina J Mason
- 1 Department of Health Services Research and Policy, London School of Hygiene & Tropical Medicine, London, UK
| | - Manuel Gomes
- 1 Department of Health Services Research and Policy, London School of Hygiene & Tropical Medicine, London, UK
| | - Richard Grieve
- 1 Department of Health Services Research and Policy, London School of Hygiene & Tropical Medicine, London, UK
| | - Pinar Ulug
- 2 Vascular Surgery Research Group, Imperial College London, London, UK
| | - Janet T Powell
- 2 Vascular Surgery Research Group, Imperial College London, London, UK
| | - James Carpenter
- 3 Department of Medical Statistics, London School of Hygiene & Tropical Medicine, London, UK
| |
Collapse
|
18
|
Liu Y, Gopalakrishnan V. An Overview and Evaluation of Recent Machine Learning Imputation Methods Using Cardiac Imaging Data. DATA 2017; 2. [PMID: 28243594 PMCID: PMC5325161 DOI: 10.3390/data2010008] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Many clinical research datasets have a large percentage of missing values that directly impacts their usefulness in yielding high accuracy classifiers when used for training in supervised machine learning. While missing value imputation methods have been shown to work well with smaller percentages of missing values, their ability to impute sparse clinical research data can be problem specific. We previously attempted to learn quantitative guidelines for ordering cardiac magnetic resonance imaging during the evaluation for pediatric cardiomyopathy, but missing data significantly reduced our usable sample size. In this work, we sought to determine if increasing the usable sample size through imputation would allow us to learn better guidelines. We first review several machine learning methods for estimating missing data. Then, we apply four popular methods (mean imputation, decision tree, k-nearest neighbors, and self-organizing maps) to a clinical research dataset of pediatric patients undergoing evaluation for cardiomyopathy. Using Bayesian Rule Learning (BRL) to learn ruleset models, we compared the performance of imputation-augmented models versus unaugmented models. We found that all four imputation-augmented models performed similarly to unaugmented models. While imputation did not improve performance, it did provide evidence for the robustness of our learned models.
Collapse
Affiliation(s)
- Yuzhe Liu
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Medical Scientist Training Program, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Correspondence:
| | - Vanathi Gopalakrishnan
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Medical Scientist Training Program, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA 15260, USA
| |
Collapse
|
19
|
Leontjevas R, Teerenstra S, Smalbrugge M, Koopmans RT, Gerritsen DL. Quality of life assessments in nursing homes revealed a tendency of proxies to moderate patients' self-reports. J Clin Epidemiol 2016; 80:123-133. [DOI: 10.1016/j.jclinepi.2016.07.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2015] [Revised: 07/09/2016] [Accepted: 07/18/2016] [Indexed: 10/21/2022]
|
20
|
The Direct and Indirect Relationship between Interpersonal Self-Support Traits and Perceived Social Support: A Longitudinal Study. CURRENT PSYCHOLOGY 2016. [DOI: 10.1007/s12144-016-9491-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
21
|
Rombach I, Rivero-Arias O, Gray AM, Jenkinson C, Burke Ó. The current practice of handling and reporting missing outcome data in eight widely used PROMs in RCT publications: a review of the current literature. Qual Life Res 2016; 25:1613-23. [PMID: 26821918 PMCID: PMC4893363 DOI: 10.1007/s11136-015-1206-1] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/06/2015] [Indexed: 11/28/2022]
Abstract
Purpose Patient-reported outcome measures (PROMs) are designed to assess patients’ perceived health states or health-related quality of life. However, PROMs are susceptible to missing data, which can affect the validity of conclusions from randomised controlled trials (RCTs). This review aims to assess current practice in the handling, analysis and reporting of missing PROMs outcome data in RCTs compared to contemporary methodology and guidance. Methods This structured review of the literature includes RCTs with a minimum of 50 participants per arm. Studies using the EQ-5D-3L, EORTC QLQ-C30, SF-12 and SF-36 were included if published in 2013; those using the less commonly implemented HUI, OHS, OKS and PDQ were included if published between 2009 and 2013. Results The review included 237 records (4–76 per relevant PROM). Complete case analysis and single imputation were commonly used in 33 and 15 % of publications, respectively. Multiple imputation was reported for 9 % of the PROMs reviewed. The majority of publications (93 %) failed to describe the assumed missing data mechanism, while low numbers of papers reported methods to minimise missing data (23 %), performed sensitivity analyses (22 %) or discussed the potential influence of missing data on results (16 %). Conclusions Considerable discrepancy exists between approved methodology and current practice in handling, analysis and reporting of missing PROMs outcome data in RCTs. Greater awareness is needed for the potential biases introduced by inappropriate handling of missing data, as well as the importance of sensitivity analysis and clear reporting to enable appropriate assessments of treatment effects and conclusions from RCTs. Electronic supplementary material The online version of this article (doi:10.1007/s11136-015-1206-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ines Rombach
- Health Economics Research Centre (HERC), Nuffield Department of Population Health, University of Oxford, Oxford, UK. .,RCS Surgical Intervention Trials Unit (SITU), Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK.
| | - Oliver Rivero-Arias
- National Perinatal Epidemiology Unit (NPEU), Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Alastair M Gray
- Health Economics Research Centre (HERC), Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Crispin Jenkinson
- Health Services Research Unit (HSRU), Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Órlaith Burke
- Clinical Trial Service Unit and Epidemiological Studies Unit (CTSU), Nuffield Department of Population Health, University of Oxford, Oxford, UK
| |
Collapse
|
22
|
Ayis S, Wellwood I, Rudd AG, McKevitt C, Parkin D, Wolfe CDA. Variations in Health-Related Quality of Life (HRQoL) and survival 1 year after stroke: five European population-based registers. BMJ Open 2015; 5:e007101. [PMID: 26038354 PMCID: PMC4458636 DOI: 10.1136/bmjopen-2014-007101] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
OBJECTIVE There were two main objectives: to describe and compare clinical outcomes and Patient-Reported Outcome Measures (PROMs) collected using standardised procedures across the European Registers of Stroke (EROS) at 3 and 12 months after stroke; and to examine the relationship between patients' Health-Related Quality of Life (HRQoL) at 3 months after stroke and survival up to 1 year across the 5 populations. DESIGN Analysis of data from population-based stroke registers. SETTING European populations in Dijon (France); Kaunas (Lithuania); London (UK); Warsaw (Poland) and Sesto Fiorentino (Italy). PARTICIPANTS Patients with ischaemic or intracerebral haemorrhage (ICH) stroke, registered between 2004 and 2006. OUTCOME MEASURES (1) HRQoL, assessed by the physical component summary (PCS) and mental component summary (MCS) of the Short-Form Health Survey (SF-12), mapped into the EQ-5D to estimate responses on 5 dimensions (mobility, activity, pain, anxiety and depression, and self-care) and utility scores. (2) Mortality within 3 months and within 1 year of stroke. RESULTS Of 1848 patients, 325 were lost to follow-up and 500 died within a year of stroke. Significant differences in mortality, HRQoL and utility scores were found, and remained after adjustments. Kaunas had an increased risk of death; OR 2.34, 95% CI (1.32 to 4.14) at 3 months after stroke in Kaunas, compared with London. Sesto Fiorentino had the highest adjusted PCS: 43.54 (SD=0.96), and Dijon had the lowest adjusted MCS 38.67 (SD=0.67). There are strong associations between levels of the EQ-5D at 3 months and survival within the year. The trend across levels suggests a dose-response relationship. CONCLUSIONS The study demonstrated significant variations in survival, HRQoL and utilities across populations that could not be explained by stroke severity and sociodemographic factors. Strong associations between HRQoL at 3 months and survival to 1 year after stroke were identified.
Collapse
Affiliation(s)
- Salma Ayis
- Division of Health and Social Care Research, King's College London, London, UK
- NIHR Biomedical Research Centre at Guy's & St Thomas’ NHS Foundation Trust and King's College London, London, UK
| | - Ian Wellwood
- Division of Health and Social Care Research, King's College London, London, UK
- Department of Public Health and Primary Care, Cambridge Institute of Public Health, University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - Anthony G Rudd
- Division of Health and Social Care Research, King's College London, London, UK
- NIHR Biomedical Research Centre at Guy's & St Thomas’ NHS Foundation Trust and King's College London, London, UK
| | - Christopher McKevitt
- Division of Health and Social Care Research, King's College London, London, UK
- NIHR Biomedical Research Centre at Guy's & St Thomas’ NHS Foundation Trust and King's College London, London, UK
| | - David Parkin
- Division of Health and Social Care Research, King's College London, London, UK
- NIHR Biomedical Research Centre at Guy's & St Thomas’ NHS Foundation Trust and King's College London, London, UK
| | - Charles D A Wolfe
- Division of Health and Social Care Research, King's College London, London, UK
- NIHR Biomedical Research Centre at Guy's & St Thomas’ NHS Foundation Trust and King's College London, London, UK
| |
Collapse
|
23
|
Implementing stratified primary care management for low back pain: cost-utility analysis alongside a prospective, population-based, sequential comparison study. Spine (Phila Pa 1976) 2015; 40:405-14. [PMID: 25599287 DOI: 10.1097/brs.0000000000000770] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
STUDY DESIGN Within-study cost-utility analysis. OBJECTIVE To explore the cost-utility of implementing stratified care for low back pain (LBP) in general practice, compared with usual care, within risk-defined patient subgroups (that is, patients at low, medium, and high risk of persistent disabling pain). SUMMARY OF BACKGROUND DATA Individual-level data collected alongside a prospective, sequential comparison of separate patient cohorts with 6-month follow-up. METHODS Adopting a cost-utility framework, the base case analysis estimated the incremental LBP-related health care cost per additional quality-adjusted life year (QALY) by risk subgroup. QALYs were constructed from responses to the 3-level EQ-5D, a preference-based health-related quality of life instrument. Uncertainty was explored with cost-utility planes and acceptability curves. Sensitivity analyses examined alternative methodological approaches, including a complete case analysis, the incorporation of non-back pain-related health care use and estimation of societal costs relating to work absence. RESULTS Stratified care was a dominant treatment strategy compared with usual care for patients at high risk, with mean health care cost savings of £124 and an incremental QALY estimate of 0.023. The likelihood that stratified care provides a cost-effective use of resources for patients at low and medium risk is no greater than 60% irrespective of a decision makers' willingness-to-pay for additional QALYs. Patients at medium and high risk of persistent disability in paid employment at 6-month follow-up reported, on average, 6 fewer days of LBP-related work absence in the stratified care cohort compared with usual care (associated societal cost savings per employed patient of £736 and £652, respectively). CONCLUSION At the observed level of adherence to screening tool recommendations for matched treatments, stratified care for LBP is cost-effective for patients at high risk of persistent disabling LBP only. LEVEL OF EVIDENCE 2.
Collapse
|
24
|
Faria R, Gomes M, Epstein D, White IR. A guide to handling missing data in cost-effectiveness analysis conducted within randomised controlled trials. PHARMACOECONOMICS 2014; 32:1157-70. [PMID: 25069632 PMCID: PMC4244574 DOI: 10.1007/s40273-014-0193-3] [Citation(s) in RCA: 387] [Impact Index Per Article: 38.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Missing data are a frequent problem in cost-effectiveness analysis (CEA) within a randomised controlled trial. Inappropriate methods to handle missing data can lead to misleading results and ultimately can affect the decision of whether an intervention is good value for money. This article provides practical guidance on how to handle missing data in within-trial CEAs following a principled approach: (i) the analysis should be based on a plausible assumption for the missing data mechanism, i.e. whether the probability that data are missing is independent of or dependent on the observed and/or unobserved values; (ii) the method chosen for the base-case should fit with the assumed mechanism; and (iii) sensitivity analysis should be conducted to explore to what extent the results change with the assumption made. This approach is implemented in three stages, which are described in detail: (1) descriptive analysis to inform the assumption on the missing data mechanism; (2) how to choose between alternative methods given their underlying assumptions; and (3) methods for sensitivity analysis. The case study illustrates how to apply this approach in practice, including software code. The article concludes with recommendations for practice and suggestions for future research.
Collapse
Affiliation(s)
- Rita Faria
- Centre for Health Economics, University of York, Heslington, York, YO10 5DD, UK,
| | | | | | | |
Collapse
|
25
|
|
26
|
Vaitsiakhovich T, Drichel D, Angisch M, Becker T, Herold C, Lacour A. Analysis of the progression of systolic blood pressure using imputation of missing phenotype values. BMC Proc 2014; 8:S83. [PMID: 25519344 PMCID: PMC4143701 DOI: 10.1186/1753-6561-8-s1-s83] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
We present a genome-wide association study of a quantitative trait, "progression of systolic blood pressure in time," in which 142 unrelated individuals of the Genetic Analysis Workshop 18 real genotype data were analyzed. Information on systolic blood pressure and other phenotypic covariates was missing at certain time points for a considerable part of the sample. We observed that the dropout process causing missingness is not independent of the initial systolic blood pressure; that is, the data is not missing completely at random. However, after the adjustment for age, the impact of systolic blood pressure on dropouts was no longer significant. Therefore, we decided to impute missing phenotype values by using information from individuals with complete phenotypic data. Progression of systolic blood pressure (∆SBP/∆t) was defined based on the imputed phenotypes and analyzed in a genome-wide fashion. We also conducted an exhaustive genome-wide search for interaction between single-nucleotide polymorphisms (7.14 × 10(10) tests) under an allelic model. The suggested data imputation and the association analysis strategy proved to be valid in the sense that there was no evidence of genome-wide inflation or increased type I error in general. Furthermore, we detected 2 single-nucleotide polymorphisms (SNPs) that met the criterion for genome-wide significance (p≤5 × 10(-8)), which was also confirmed via Monte-Carlo simulation. In view of the rather small sample size, however, the results have to be followed-up in larger studies.
Collapse
Affiliation(s)
- Tatsiana Vaitsiakhovich
- Institute for Medical Biometry, Informatics and Epidemiology (IMBIE), University of Bonn, Sigmund-Freud-Str., D-53105 Bonn, Germany
| | - Dmitriy Drichel
- German Center for Neurodegenerative Diseases (DZNE), Ludwig-Erhard-Allee 2, D-53175 Bonn, Germany
| | - Marina Angisch
- Institute for Medical Biometry, Informatics and Epidemiology (IMBIE), University of Bonn, Sigmund-Freud-Str., D-53105 Bonn, Germany
| | - Tim Becker
- German Center for Neurodegenerative Diseases (DZNE), Ludwig-Erhard-Allee 2, D-53175 Bonn, Germany.,Institute for Medical Biometry, Informatics and Epidemiology (IMBIE), University of Bonn, Sigmund-Freud-Str., D-53105 Bonn, Germany
| | - Christine Herold
- German Center for Neurodegenerative Diseases (DZNE), Ludwig-Erhard-Allee 2, D-53175 Bonn, Germany
| | - André Lacour
- German Center for Neurodegenerative Diseases (DZNE), Ludwig-Erhard-Allee 2, D-53175 Bonn, Germany
| |
Collapse
|
27
|
Dietary, physical activity, and sedentary behaviors associated with percent body fat in rural Hispanic youth. J Pediatr Health Care 2014; 28:63-70. [PMID: 23312368 DOI: 10.1016/j.pedhc.2012.11.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/07/2012] [Revised: 11/03/2012] [Accepted: 11/09/2012] [Indexed: 02/02/2023]
Abstract
INTRODUCTION The objective of the present study was to assess dietary, physical activity, and sedentary behaviors associated with percent body fat in rural Hispanic youth. METHOD A total of 189 Hispanic children and adolescents ages 8 to 19 years completed the School Physical Activity and Nutrition questionnaire. Body composition (percent body fat) was determined by anthropometric skinfold methods. Logistic regression analysis was performed with percent body fat as the primary outcome dichotomized into excess body fat/normal body fat. RESULTS Gender was significantly associated with percent body fat in that girls were more likely to be in the excess percent body fat group. A significant interaction effect was noted between gender and sugar-sweetened beverages in that the effect of consuming sugar-sweetened drinks on excess adiposity was 6.28 times greater for boys than for girls. DISCUSSION Our data suggest that being a girl and consumption of sugar-sweetened beverages for boys may be risk factors for excess adiposity in rural Hispanic youth. Development of tailored, culturally sensitive interventions for this population may benefit from targeting these areas.
Collapse
|
28
|
Garg L, Dauwels J, Earnest A, Leong KP. Tensor-based methods for handling missing data in quality-of-life questionnaires. IEEE J Biomed Health Inform 2013; 18:1571-80. [PMID: 24235317 DOI: 10.1109/jbhi.2013.2288803] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
A common problem with self-report quality-of-life questionnaires is missing data. Despite enormous care and effort to prevent it, some level of missing data is common and unavoidable. Missing data can have a detrimental impact on the data analysis. In this paper, a novel approach to imputing missing data in quality-of-life questionnaires is proposed, based on matrix and tensor decompositions. In order to illustrate and assess those methods, two datasets are considered: The first dataset contains the responses of 100 patients to a systemic lupus erythematosus-specific quality-of-life questionnaire; the other contains the responses of 43 patients to a rhino-conjunctivitis quality-of-life questionnaire. The two datasets contain almost no missing data, and for testing purposes, data entries are removed at random to have missing completely at random data. Several proportions of missing values are considered, and for each, the imputation error is assessed through k-fold cross validation. We also evaluate different imputation methods for missing at random and missing not at randomdata. The numerical results demonstrate that the proposed tensor factorization-based methods outperform standard methods in terms of root mean square error with at least 4% improvement, while the bias and variance are similar.
Collapse
|
29
|
Gomes M, Díaz-Ordaz K, Grieve R, Kenward MG. Multiple imputation methods for handling missing data in cost-effectiveness analyses that use data from hierarchical studies: an application to cluster randomized trials. Med Decis Making 2013; 33:1051-63. [PMID: 23913915 DOI: 10.1177/0272989x13492203] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
PURPOSE Multiple imputation (MI) has been proposed for handling missing data in cost-effectiveness analyses (CEAs). In CEAs that use cluster randomized trials (CRTs), the imputation model, like the analysis model, should recognize the hierarchical structure of the data. This paper contrasts a multilevel MI approach that recognizes clustering, with single-level MI and complete case analysis (CCA) in CEAs that use CRTs. METHODS We consider a multilevel MI approach compatible with multilevel analytical models for CEAs that use CRTs. We took fully observed data from a CEA that evaluated an intervention to improve diagnosis of active labor in primiparous women using a CRT (2078 patients, 14 clusters). We generated scenarios with missing costs and outcomes that differed, for example, according to the proportion with missing data (10%-50%), the covariates that predicted missing data (individual, cluster-level), and the missingness mechanism: missing completely at random (MCAR), missing at random (MAR), or missing not at random (MNAR). We estimated incremental net benefits (INBs) for each approach and compared them with the estimates from the fully observed data, the "true" INBs. RESULTS When costs and outcomes were assumed to be MCAR, the INBs for each approach were similar to the true estimates. When data were MAR, the point estimates from the CCA differed from the true estimates. Multilevel MI provided point estimates and standard errors closer to the true values than did single-level MI across all settings, including those in which a high proportion of observations had cost and outcome data MAR and when data were MNAR. CONCLUSIONS Multilevel MI accommodates the multilevel structure of the data in CEAs that use cluster trials and provides accurate cost-effectiveness estimates across the range of circumstances considered.
Collapse
Affiliation(s)
- Manuel Gomes
- Department of Health Services Research and Policy, London School of Hygiene and Tropical Medicine, London, UK (MG, KD, RG)
| | - Karla Díaz-Ordaz
- Department of Health Services Research and Policy, London School of Hygiene and Tropical Medicine, London, UK (MG, KD, RG)
| | - Richard Grieve
- Department of Health Services Research and Policy, London School of Hygiene and Tropical Medicine, London, UK (MG, KD, RG)
| | - Michael G Kenward
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK (MGK)
| |
Collapse
|
30
|
Methods in public health services and systems research: a systematic review. Am J Prev Med 2012; 42:S42-57. [PMID: 22502925 DOI: 10.1016/j.amepre.2012.01.028] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/06/2011] [Revised: 11/28/2011] [Accepted: 01/18/2012] [Indexed: 11/20/2022]
Abstract
CONTEXT Public Health Services and Systems Research (PHSSR) is concerned with evaluating the organization, financing, and delivery of public health services and their impact on public health. The strength of the current PHSSR evidence is somewhat dependent on the methods used to examine the field. Methods used in PHSSR articles, reports, and other documents were reviewed to assess their methodologic strengths and challenges in light of PHSSR goals. EVIDENCE ACQUISITION A total of 364 documents from the PHSSR library met the inclusion criteria as empirical and based in the U.S. After additional exclusions, 327 of these were analyzed. EVIDENCE SYNTHESIS A detailed codebook was used to classify articles in terms of (1) study design; (2) sampling; (3) instrumentation; (4) data collection; (5) data analysis; and (6) study validity. Inter-coder reliability was assessed for the codebook; once it was found reliable, the available empirical documents were coded. CONCLUSIONS Although there has been a dramatic increase in the amount of published PHSSR recently, methods used remain primarily cross-sectional and descriptive. Moreover, although appropriate for exploratory and foundational work in a new field, these approaches are limiting progress toward some PHSSR goals. Recommendations are given to advance and strengthen the methods used in PHSSR to better meet the goals and challenges facing the field.
Collapse
|
31
|
Hopman WM, Harrison MB, Carley M, Vandenkerkhof EG. Additional support for simple imputation of missing quality of life data in nursing research. ISRN NURSING 2011; 2011:752320. [PMID: 22191054 PMCID: PMC3236396 DOI: 10.5402/2011/752320] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/05/2011] [Accepted: 09/05/2011] [Indexed: 11/23/2022]
Abstract
Background. Missing data are a significant problem in health-related quality of life (HRQOL) research. We evaluated two imputation approaches: missing data estimation (MDE) and assignment of mean score (AMS). Methods. HRQOL data were collected using the Medical Outcomes Trust SF-12. Missing data were estimated using both approaches, summary statistics were produced for both, and results were compared using intraclass correlations (ICC). Results. Missing data were imputed for 21 participants. Mean values were similar, with ICC >.99 within both the Physical Component Summary and the Mental Component Summary when comparing the two methodologies. When imputed data were added into the full study sample, mean scores were identical regardless of methodology. Conclusion. Results support the use of a practical and simple imputation strategy of replacing missing values with the mean of the sample in cross-sectional studies when less than half of the required items of the SF-12 components are missing.
Collapse
Affiliation(s)
- Wilma M Hopman
- Clinical Research Centre, Kingston General Hospital, Kingston, ON, K7L 2V7, Canada
| | | | | | | |
Collapse
|
32
|
TANG KENNETH, ESCORPIZO REUBEN, BEATON DORCASE, BOMBARDIER CLAIRE, LACAILLE DIANE, ZHANG WEI, ANIS ASLAMH, BOONEN ANNELIES, VERSTAPPEN SUZANNEM, BUCHBINDER RACHELLE, OSBORNE RICHARDH, FAUTREL BRUNO, GIGNAC MONIQUEA, TUGWELL PETERS. Measuring the Impact of Arthritis on Worker Productivity: Perspectives, Methodologic Issues, and Contextual Factors. J Rheumatol 2011; 38:1776-90. [DOI: 10.3899/jrheum.110405] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Leading up to the Outcome Measures in Rheumatology (OMERACT) 10 meeting, the goal of the Worker Productivity Special Interest Group (WP-SIG) was to make progress on 3 key issues that relate to the application and interpretation of worker productivity outcomes in arthritis: (1) to review existing conceptual frameworks to help consolidate our intended target and scope of measurement; (2) to examine the methodologic issues associated with our goal of combining multiple indicators of worker productivity loss (e.g., absenteeism <—> presenteeism) into a single comprehensive outcome; and (3) to examine the relevant contextual factors of work and potential implications for the interpretation of scores derived from existing outcome measures. Progress was made on all 3 issues at OMERACT 10. We identified 3 theoretical frameworks that offered unique but converging perspectives on worker productivity loss and/or work disability to provide guidance with classification, selection, and future recommendation of outcomes. Several measurement and analytic approaches to combine absenteeism and presenteeism outcomes were proposed, and the need for further validation of such approaches was also recognized. Finally, participants at the WP-SIG were engaged to brainstorm and provide preliminary endorsements to support key contextual factors of worker productivity through an anonymous “dot voting” exercise. A total of 24 specific factors were identified, with 16 receiving ≥ 1 vote among members, reflecting highly diverse views on specific factors that were considered most important. Moving forward, further progress on these issues remains a priority to help inform the best application of worker productivity outcomes in arthritis research.
Collapse
|
33
|
Hardouin JB, Conroy R, Sébille V. Imputation by the mean score should be avoided when validating a Patient Reported Outcomes questionnaire by a Rasch model in presence of informative missing data. BMC Med Res Methodol 2011; 11:105. [PMID: 21756330 PMCID: PMC3161025 DOI: 10.1186/1471-2288-11-105] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2010] [Accepted: 07/14/2011] [Indexed: 01/07/2023] Open
Abstract
Background Nowadays, more and more clinical scales consisting in responses given by the patients to some items (Patient Reported Outcomes - PRO), are validated with models based on Item Response Theory, and more specifically, with a Rasch model. In the validation sample, presence of missing data is frequent. The aim of this paper is to compare sixteen methods for handling the missing data (mainly based on simple imputation) in the context of psychometric validation of PRO by a Rasch model. The main indexes used for validation by a Rasch model are compared. Methods A simulation study was performed allowing to consider several cases, notably the possibility for the missing values to be informative or not and the rate of missing data. Results Several imputations methods produce bias on psychometrical indexes (generally, the imputation methods artificially improve the psychometric qualities of the scale). In particular, this is the case with the method based on the Personal Mean Score (PMS) which is the most commonly used imputation method in practice. Conclusions Several imputation methods should be avoided, in particular PMS imputation. From a general point of view, it is important to use an imputation method that considers both the ability of the patient (measured for example by his/her score), and the difficulty of the item (measured for example by its rate of favourable responses). Another recommendation is to always consider the addition of a random process in the imputation method, because such a process allows reducing the bias. Last, the analysis realized without imputation of the missing data (available case analyses) is an interesting alternative to the simple imputation in this context.
Collapse
Affiliation(s)
- Jean-Benoit Hardouin
- EA 4275 Biostatistics, Clinical Research and Subjective Measures in Health Sciences, Faculties of Medicine and Pharmaceutical Sciences, University of Nantes, 1 rue Gaston Veil, BP 53508, 44035 Nantes Cedex 1, Nantes, France.
| | | | | |
Collapse
|
34
|
Scott IA. Cautionary tales in the clinical interpretation of trials assessing therapy-induced changes in health status. Int J Clin Pract 2011; 65:536-46. [PMID: 21489078 DOI: 10.1111/j.1742-1241.2011.02654.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Trials assessing the effects of therapies on symptoms, functional capacity, health-related quality of life and other aspects of health status are becoming more common in an era of chronic disease management. Such trials involve instruments for measuring health status whose reliability, validity and responsiveness need to be understood by clinicians and policy-makers in interpreting trial results. Deciding whether a treatment is clinically efficacious requires prior determination, based on empirical evidence, of what constitutes a minimal important difference (MID) between active treatment and control groups in the change in health status between study start and end. This MID should be used to calculate the sample size that will confer adequate power to detect a treatment effect if it truly exists. Many trials assessing health status have major methodological flaws: use of inappropriate or psychometrically unsound measurement instruments, lack of specification of MID, assumption that statistically significant results represent clinically significant treatment effects, and statement of conclusions inconsistent with observed results. This article provides guidance to clinicians in interpreting results of such trials in regard to clinical decision-making.
Collapse
Affiliation(s)
- I A Scott
- Internal Medicine and Clinical Epidemiology, Princess Alexandra Hospital, Brisbane, Queensland, Australia.
| |
Collapse
|
35
|
Fielding S, Fayers P, Ramsay C. Predicting missing quality of life data that were later recovered: an empirical comparison of approaches. Clin Trials 2010; 7:333-42. [DOI: 10.1177/1740774510374626] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Background and Purpose The aim was to compare simple imputation, multiple imputation, and modeling approaches to deal with ‘missing’ quality of life data. Data were obtained from five clinical trials, which employed a reminder system for follow-up questionnaires. Previous studies have compared imputation strategies by artificially removing data according to prespecified mechanisms. Our approach differs from previous study as actual collected data are utilized. Methods Data obtained by reminder were initially treated as missing. These missing values were imputed using a variety of simple and multiple imputation strategies. The trials were analyzed using the imputed datasets, and the resulting treatment effects compared to analyses using the full dataset including responses following reminders. A repeated measures model was also carried out on the available data and the pattern mixture models were employed. The accuracy of the different strategies was assessed by calculating the bias seen in the calculated treatment difference compared to the actual observed treatment difference. Results Baseline carried forward or last value carried forward were shown to be the best simple imputation methods in this setting. Multiple imputation using a regression model or predictive mean match model tended to provide treatment difference estimates with the least bias when compared to the actual observed data. Pattern mixture models did not perform well. Overall, the multiple imputation procedures were generally the least biased approaches. Limitations A number of imputation and modeling procedures have been investigated but this list is not exhaustive. All the example datasets come from the same data source and perhaps studies from additional disease areas would have been useful. However, we feel the results are generalizable to other quality of life outcomes and clinical areas. Conclusions Multiple imputation is recommended for missing quality of life data as it makes the assumption of missing at random which in the quality of life setting is more plausible than the assumption of missing completely at random for which most simple imputation methods are based. Pattern mixture models can be complex and did not perform well in this setting. Clinical Trials 2010; 7: 333—342. http://ctj.sagepub.com
Collapse
Affiliation(s)
- Shona Fielding
- Medical Statistics Team, Section of Population Health, University of Aberdeen, Aberdeen, UK,
| | - Peter Fayers
- Section of Population Health, University of Aberdeen, Aberdeen, UK, Department of Cancer Research and Molecular Medicine, Faculty of Medicine, Norwegian University of Science and Technology, Trondheim, Norway
| | - Craig Ramsay
- Health Services Research Unit, University of Aberdeen, Aberdeen, UK
| |
Collapse
|
36
|
Fielding S, Fayers PM, Ramsay CR. Investigating the missing data mechanism in quality of life outcomes: a comparison of approaches. Health Qual Life Outcomes 2009; 7:57. [PMID: 19545408 PMCID: PMC2711047 DOI: 10.1186/1477-7525-7-57] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2009] [Accepted: 06/22/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Missing data is classified as missing completely at random (MCAR), missing at random (MAR) or missing not at random (MNAR). Knowing the mechanism is useful in identifying the most appropriate analysis. The first aim was to compare different methods for identifying this missing data mechanism to determine if they gave consistent conclusions. Secondly, to investigate whether the reminder-response data can be utilised to help identify the missing data mechanism. METHODS Five clinical trial datasets that employed a reminder system at follow-up were used. Some quality of life questionnaires were initially missing, but later recovered through reminders. Four methods of determining the missing data mechanism were applied. Two response data scenarios were considered. Firstly, immediate data only; secondly, all observed responses (including reminder-response). RESULTS In three of five trials the hypothesis tests found evidence against the MCAR assumption. Logistic regression suggested MAR, but was able to use the reminder-collected data to highlight potential MNAR data in two trials. CONCLUSION The four methods were consistent in determining the missingness mechanism. One hypothesis test was preferred as it is applicable with intermittent missingness. Some inconsistencies between the two data scenarios were found. Ignoring the reminder data could potentially give a distorted view of the missingness mechanism. Utilising reminder data allowed the possibility of MNAR to be considered.
Collapse
Affiliation(s)
- Shona Fielding
- Section of Population Health, University of Aberdeen, UK.
| | | | | |
Collapse
|