1
|
Wang CY, Hwang WH, Song X. Biomarker data with measurement error in medical research: A literature review. WILEY INTERDISCIPLINARY REVIEWS. COMPUTATIONAL STATISTICS 2024; 16:e1641. [PMID: 39113782 PMCID: PMC11305697 DOI: 10.1002/wics.1641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Accepted: 12/27/2023] [Indexed: 08/10/2024]
Abstract
A biomarker is a measurable indicator of the severity or presence of a disease or medical condition in biomedical or epidemiological research. Biomarkers may help in early diagnosis and prevention of diseases. Several biomarkers have been identified for many diseases such as carbohydrate antigen 19-9 for pancreatic cancer. However, biomarkers may be measured with errors due to many reasons such as specimen collection or day-to-day within-subject variability of the biomarker, among others. Measurement error in the biomarker leads to bias in the regression parameter estimation for the association of the biomarker with disease in epidemiological studies. In addition, measurement error in the biomarkers may affect standard diagnostic measures to evaluate the performance of biomarkers such as the receiver operating characteristic (ROC) curve, area under the ROC curve, sensitivity, and specificity. Measurement error may also have an effect on how to combine multiple cancer biomarkers as a composite predictor for disease diagnosis. In follow-up studies, biomarkers are often collected intermittently at examination times, which may be sparse and typically biomarkers are not observed at the event times. Joint modeling of longitudinal and time-to-event data is a valid approach to account for measurement error in the analysis of repeatedly measured biomarkers and time-to-event outcomes. In this article, we provide a literature review on existing methods to correct for estimation in regression analysis, diagnostic measures, and joint modeling of longitudinal biomarkers and survival outcomes when the biomarkers are measured with errors. This article is categorized under: Statistical and Graphical Methods of Data Analysis > Robust MethodsStatistical and Graphical Methods of Data Analysis > EM AlgorithmStatistical Models > Survival Models.
Collapse
Affiliation(s)
- Ching-Yun Wang
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| | - Wen-Han Hwang
- Institute of Statistics, National Tsing-Hua University, Hsinchu, Taiwan
| | - Xiao Song
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, Georgia, USA
| |
Collapse
|
2
|
Predictors with measurement error in mixtures of polynomial regressions. Comput Stat 2022. [DOI: 10.1007/s00180-022-01232-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
3
|
Wang CY, Song X. Semiparametric regression calibration for general hazard models in survival analysis with covariate measurement error; surprising performance under linear hazard. Biometrics 2021; 77:561-572. [PMID: 32557567 PMCID: PMC7746575 DOI: 10.1111/biom.13318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 05/18/2020] [Accepted: 06/01/2020] [Indexed: 11/26/2022]
Abstract
Observational epidemiological studies often confront the problem of estimating exposure-disease relationships when the exposure is not measured exactly. Regression calibration (RC) is a common approach to correct for bias in regression analysis with covariate measurement error. In survival analysis with covariate measurement error, it is well known that the RC estimator may be biased when the hazard is an exponential function of the covariates. In the paper, we investigate the RC estimator with general hazard functions, including exponential and linear functions of the covariates. When the hazard is a linear function of the covariates, we show that a risk set regression calibration (RRC) is consistent and robust to a working model for the calibration function. Under exponential hazard models, there is a trade-off between bias and efficiency when comparing RC and RRC. However, one surprising finding is that the trade-off between bias and efficiency in measurement error research is not seen under linear hazard when the unobserved covariate is from a uniform or normal distribution. Under this situation, the RRC estimator is in general slightly better than the RC estimator in terms of both bias and efficiency. The methods are applied to the Nutritional Biomarkers Study of the Women's Health Initiative.
Collapse
Affiliation(s)
- Ching-Yun Wang
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Xiao Song
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, Georgia
| |
Collapse
|
4
|
Lécuyer L, Dalle C, Micheau P, Pétéra M, Centeno D, Lyan B, Lagree M, Galan P, Hercberg S, Rossary A, Demidem A, Vasson MP, Partula V, Deschasaux M, Srour B, Latino-Martel P, Druesne-Pecollo N, Kesse-Guyot E, Durand S, Pujos-Guillot E, Manach C, Touvier M. Untargeted plasma metabolomic profiles associated with overall diet in women from the SU.VI.MAX cohort. Eur J Nutr 2020; 59:3425-3439. [DOI: 10.1007/s00394-020-02177-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 01/03/2020] [Indexed: 12/22/2022]
|
5
|
Yi GY, Yan Y, Liao X, Spiegelman D. Parametric Regression Analysis with Covariate Misclassification in Main Study/Validation Study Designs. Int J Biostat 2018; 15:/j/ijb.ahead-of-print/ijb-2017-0002/ijb-2017-0002.xml. [PMID: 30864410 DOI: 10.1515/ijb-2017-0002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Accepted: 11/06/2018] [Indexed: 11/15/2022]
Abstract
Measurement error and misclassification have long been a concern in many fields, including medicine, administrative health care data, epidemiology, and survey sampling. It is known that measurement error and misclassification may seriously degrade the quality of estimation and inference, and should be avoided whenever possible. However, in practice, it is inevitable that measurements contain error for a variety of reasons. It is thus necessary to develop statistical strategies to cope with this issue. Although many inference methods have been proposed in the literature to address mis-measurement effects, some important issues remain unexplored. Typically, it is generally unclear how the available methods may perform relative to each other. In this paper, capitalizing on the unique feature of discrete variables, we consider settings with misclassified binary covariates and investigate issues concerning covariate misclassification; our development parallels available strategies for handling measurement error in continuous covariates. Under a unified framework, we examine a number of valid inferential procedures for practical settings where a validation study, either internal or external, is available besides a main study. Furthermore, we compare the relative performance of these methods and make practical recommendations.
Collapse
Affiliation(s)
- Grace Y Yi
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1
| | - Ying Yan
- Department of Statistical Science, School of Mathematics, Sun Yat-sen University, Guangzhou, China
| | - Xiaomei Liao
- Departments of Epidemiology and Biostatistics, Harvard School of Public Health, 677 Huntington Ave, Boston, MA 02115, USA
| | - Donna Spiegelman
- Departments of Epidemiology and Biostatistics, Harvard School of Public Health, 677 Huntington Ave, Boston, MA 02115, USA; Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510
| |
Collapse
|
6
|
Wang CY, Song X. Robust best linear estimator for Cox regression with instrumental variables in whole cohort and surrogates with additive measurement error in calibration sample. Biom J 2016; 58:1465-1484. [PMID: 27546625 DOI: 10.1002/bimj.201500238] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2015] [Revised: 04/13/2016] [Accepted: 06/02/2016] [Indexed: 11/05/2022]
Abstract
Biomedical researchers are often interested in estimating the effect of an environmental exposure in relation to a chronic disease endpoint. However, the exposure variable of interest may be measured with errors. In a subset of the whole cohort, a surrogate variable is available for the true unobserved exposure variable. The surrogate variable satisfies an additive measurement error model, but it may not have repeated measurements. The subset in which the surrogate variables are available is called a calibration sample. In addition to the surrogate variables that are available among the subjects in the calibration sample, we consider the situation when there is an instrumental variable available for all study subjects. An instrumental variable is correlated with the unobserved true exposure variable, and hence can be useful in the estimation of the regression coefficients. In this paper, we propose a nonparametric method for Cox regression using the observed data from the whole cohort. The nonparametric estimator is the best linear combination of a nonparametric correction estimator from the calibration sample and the difference of the naive estimators from the calibration sample and the whole cohort. The asymptotic distribution is derived, and the finite sample performance of the proposed estimator is examined via intensive simulation studies. The methods are applied to the Nutritional Biomarkers Study of the Women's Health Initiative.
Collapse
Affiliation(s)
- Ching-Yun Wang
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, P.O. Box 19024, Seattle, WA, 98109-1024, USA.
| | - Xiao Song
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
7
|
Potgieter CJ, Wei R, Kipnis V, Freedman LS, Carroll RJ. Moment reconstruction and moment-adjusted imputation when exposure is generated by a complex, nonlinear random effects modeling process. Biometrics 2016; 72:1369-1377. [PMID: 27061196 DOI: 10.1111/biom.12524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Revised: 12/01/2015] [Accepted: 02/01/2016] [Indexed: 11/30/2022]
Abstract
For the classical, homoscedastic measurement error model, moment reconstruction (Freedman et al., 2004, 2008) and moment-adjusted imputation (Thomas et al., 2011) are appealing, computationally simple imputation-like methods for general model fitting. Like classical regression calibration, the idea is to replace the unobserved variable subject to measurement error with a proxy that can be used in a variety of analyses. Moment reconstruction and moment-adjusted imputation differ from regression calibration in that they attempt to match multiple features of the latent variable, and also to match some of the latent variable's relationships with the response and additional covariates. In this note, we consider a problem where true exposure is generated by a complex, nonlinear random effects modeling process, and develop analogues of moment reconstruction and moment-adjusted imputation for this case. This general model includes classical measurement errors, Berkson measurement errors, mixtures of Berkson and classical errors and problems that are not measurement error problems, but also cases where the data-generating process for true exposure is a complex, nonlinear random effects modeling process. The methods are illustrated using the National Institutes of Health-AARP Diet and Health Study where the latent variable is a dietary pattern score called the Healthy Eating Index-2005. We also show how our general model includes methods used in radiation epidemiology as a special case. Simulations are used to illustrate the methods.
Collapse
Affiliation(s)
- Cornelis J Potgieter
- Department of Statistical Science, Southern Methodist University, Dallas, Texas 75275, U.S.A
| | - Rubin Wei
- Eli Lilly and Company, Indianapolis, Indiana 46285, U.S.A
| | - Victor Kipnis
- Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, Bethesda, Maryland 20814, U.S.A
| | - Laurence S Freedman
- Gertner Institute for Epidemiology and Health Policy Research, Tel Hashomer, Israel
| | - Raymond J Carroll
- Department of Statistics, Texas A&M University, College Station, Texas 77843, U.S.A.,School of Mathematical and Physical Sciences, University of Technology, Sydney, New South Wales 2007, Australia
| |
Collapse
|
8
|
Midthune D, Carroll RJ, Freedman LS, Kipnis V. Measurement error models with interactions. Biostatistics 2015; 17:277-90. [PMID: 26530858 DOI: 10.1093/biostatistics/kxv043] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2014] [Accepted: 10/07/2015] [Indexed: 11/14/2022] Open
Abstract
An important use of measurement error models is to correct regression models for bias due to covariate measurement error. Most measurement error models assume that the observed error-prone covariate (WW ) is a linear function of the unobserved true covariate (X) plus other covariates (Z) in the regression model. In this paper, we consider models for W that include interactions between X and Z. We derive the conditional distribution of X given W and Z and use it to extend the method of regression calibration to this class of measurement error models. We apply the model to dietary data and test whether self-reported dietary intake includes an interaction between true intake and body mass index. We also perform simulations to compare the model to simpler approximate calibration models.
Collapse
Affiliation(s)
- Douglas Midthune
- Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, 9609 Medical Center Drive, Room 5E122, Bethesda, MD 20892, USA
| | - Raymond J Carroll
- Department of Statistics, Texas A&M University, 3143 TAMU, College Station, TX 77843-3143, USA and School of Mathematical Sciences, University of Technology, Sydney, Broadway, NSW 2007, Australia
| | - Laurence S Freedman
- Gertner Institute for Epidemiology and Health Policy Research, Sheba Medical Center, Tel Hashomer 52161, Israel
| | - Victor Kipnis
- Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, 9609 Medical Center Drive, Room 5E118, Bethesda, MD 20892, USA
| |
Collapse
|
9
|
Yi GY, Ma Y, Spiegelman D, Carroll RJ. Functional and Structural Methods with Mixed Measurement Error and Misclassification in Covariates. J Am Stat Assoc 2015; 110:681-696. [PMID: 26190876 DOI: 10.1080/01621459.2014.922777] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Covariate measurement imprecision or errors arise frequently in many areas. It is well known that ignoring such errors can substantially degrade the quality of inference or even yield erroneous results. Although in practice both covariates subject to measurement error and covariates subject to misclassification can occur, research attention in the literature has mainly focused on addressing either one of these problems separately. To fill this gap, we develop estimation and inference methods that accommodate both characteristics simultaneously. Specifically, we consider measurement error and misclassification in generalized linear models under the scenario that an external validation study is available, and systematically develop a number of effective functional and structural methods. Our methods can be applied to different situations to meet various objectives.
Collapse
Affiliation(s)
- Grace Y Yi
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1
| | - Yanyuan Ma
- Department of Statistics, Texas A&M University, TAMU 3143, College Station, TX 77843-3143,
| | - Donna Spiegelman
- Departments of Epidemiology and Biostatistics, Harvard School of Public Health, 677 Huntington Ave, Boston, MA 02115,
| | - Raymond J Carroll
- Department of Statistics, Texas A&M University, TAMU 3143, College Station, TX 77843-3143, and School of Mathematical Sciences, University of Technology, Sydney, Broadway NSW 2007,
| |
Collapse
|
10
|
Ren J, Ning Z, Kirkness CS, Asche CV, Wang H. Risk of using logistic regression to illustrate exposure-response relationship of infectious diseases. BMC Infect Dis 2014; 14:540. [PMID: 25282153 PMCID: PMC4287313 DOI: 10.1186/1471-2334-14-540] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2014] [Accepted: 09/25/2014] [Indexed: 01/23/2023] Open
Abstract
Background In most biological experiments, especially infectious disease, the exposure-response relationship is interrelated by a multitude of factors rather than many independent factors. Little is known about the suitability of ordinary, categorical exposures, and logarithmic transformation which have been presented in logistic regression models to assess the likelihood of an infectious disease as a function of a risk or exposure. This study aims to examine and compare the current approaches. Methods A simulated human immunodeficiency virus (HIV) population, dynamic infection data for 100,000 individuals with 1% initial prevalence and 2% infectivity, was created. Using the Monte Carlo method (computational algorithm) to repeat random sampling to obtain numerical results, linearity between log odds and exposure, and suitability in practice were examined in the three model approaches. Results Despite diverse population prevalence, the linearity was not satisfied between log odds and raw exposures. Logarithmic transformation of exposures improved the linearity to a certain extent, and categorical exposures satisfied the linear assumption (which was important for modelling). When the population prevalence was low (assumed < 10%), performances of the three models were significantly different. Comparing to ordinary logistic regression, the logarithmic transformation approach demonstrated better accuracy of estimation except that at the two inflection points: likelihood of infection increased from slowly to sharply, then slowly again. The approach using categorical exposures had better estimations around the real values, but the measurement was coarse due to categorization. Conclusions It is not suitable to directly use ordinary logistic regression to explore the exposure-response relationship of HIV as an infectious disease. This study provides some recommendations for practical implementations including: 1) utilize categorical exposure if a large sample size and low population prevalence are provided; 2) utilize a logarithmic transformed exposure if the sample size is insufficient or the population prevalence is too high (such as 30%). Electronic supplementary material The online version of this article (doi:10.1186/1471-2334-14-540) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jinma Ren
- Center for Outcomes Research, University of Illinois College of Medicine at Peoria, One Illini Drive, Box 1649, Peoria, IL 61656, USA.
| | | | | | | | | |
Collapse
|
11
|
Beyler N, James-Burdumy S, Bleeker M, Fortson J, Benjamin M. Estimated Distributions of Usual Physical Activity during Recess. Med Sci Sports Exerc 2014; 47:1197-203. [PMID: 25268539 DOI: 10.1249/mss.0000000000000535] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
PURPOSE This study aimed to estimate distributions of usual physical activity during recess in schools in low-income areas using measurement error models and to compare model-adjusted distributions to unadjusted distributions based on a single day of measurement. METHODS A randomized study of the Playworks program was conducted in 29 schools from six U.S. cities. A sample of 365 fourth- and fifth-grade students in 26 of the study schools wore accelerometers during their recess periods on two school days. Estimates for the percentage of time spent in moderate to vigorous physical activity (MVPA) during recess were constructed from the accelerometer data for each school day. Using measurement error models, distributions for the usual amount of time spent in MVPA during recess were estimated for intervention and control groups of males and females. Unadjusted distributions for these same groups were also constructed using data from a single school day. RESULTS There is considerable intraindividual variability in the students' physical activity, which accounts for 67%-83% of the overall variability, depending on the study group. Unadjusted single-day distributions are much wider and have more weight in the tails than model-adjusted distributions owing to this large intraindividual variability in the data. CONCLUSIONS Using measurement error models to analyze physical activity data collected from recess periods will allow for more accurate and reliable inferences on students' physical activity.
Collapse
Affiliation(s)
- Nicholas Beyler
- 1Mathematica Policy Research, Washington, DC; 2Mathematica Policy Research, Princeton, NJ; 3Mathematica Policy Research, Oakland, CA; and 4Mathematica Policy Research, Princeton, NJ
| | | | | | | | | |
Collapse
|
12
|
Sinha S. A functional method for the conditional logistic regression with errors-in-covariates. J Nonparametr Stat 2012. [DOI: 10.1080/10485252.2012.687735] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
13
|
Abstract
Uncertainty concerning the measurement error properties of self-reported diet has important implications for the reliability of nutritional epidemiology reports. Biomarkers based on the urinary recovery of expended nutrients can provide an objective measure of short-term nutrient consumption for certain nutrients and, when applied to a subset of a study cohort, can be used to calibrate corresponding self-report nutrient consumption assessments. A nonstandard measurement error model that makes provision for systematic error and subject-specific error, along with the usual independent random error, is needed for the self-report data. Three estimation procedures for hazard ratio (Cox model) parameters are extended for application to this more complex measurement error structure. These procedures are risk set regression calibration, conditional score, and nonparametric corrected score. An estimator for the cumulative baseline hazard function is also provided. The performance of each method is assessed in a simulation study. The methods are then applied to an example from the Women's Health Initiative Dietary Modification Trial.
Collapse
Affiliation(s)
- Pamela A Shaw
- Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, Bethesda, Maryland 20892, USA.
| | | |
Collapse
|
14
|
Wang CY. Robust best linear estimation for regression analysis using surrogate and instrumental variables. Biostatistics 2012; 13:326-40. [PMID: 22285992 DOI: 10.1093/biostatistics/kxr051] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We investigate methods for regression analysis when covariates are measured with errors. In a subset of the whole cohort, a surrogate variable is available for the true unobserved exposure variable. The surrogate variable satisfies the classical measurement error model, but it may not have repeated measurements. In addition to the surrogate variables that are available among the subjects in the calibration sample, we assume that there is an instrumental variable (IV) that is available for all study subjects. An IV is correlated with the unobserved true exposure variable and hence can be useful in the estimation of the regression coefficients. We propose a robust best linear estimator that uses all the available data, which is the most efficient among a class of consistent estimators. The proposed estimator is shown to be consistent and asymptotically normal under very weak distributional assumptions. For Poisson or linear regression, the proposed estimator is consistent even if the measurement error from the surrogate or IV is heteroscedastic. Finite-sample performance of the proposed estimator is examined and compared with other estimators via intensive simulation studies. The proposed method and other methods are applied to a bladder cancer case-control study.
Collapse
Affiliation(s)
- C Y Wang
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109-1024, USA.
| |
Collapse
|
15
|
STAUDENMAYER JOHN, ZHU WEIMO, CATELLIER DIANEJ. Statistical Considerations in the Analysis of Accelerometry-Based Activity Monitor Data. Med Sci Sports Exerc 2012; 44:S61-7. [DOI: 10.1249/mss.0b013e3182399e0f] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
16
|
Prentice RL, Mossavar-Rahmani Y, Huang Y, Van Horn L, Beresford SAA, Caan B, Tinker L, Schoeller D, Bingham S, Eaton CB, Thomson C, Johnson KC, Ockene J, Sarto G, Heiss G, Neuhouser ML. Evaluation and comparison of food records, recalls, and frequencies for energy and protein assessment by using recovery biomarkers. Am J Epidemiol 2011; 174:591-603. [PMID: 21765003 DOI: 10.1093/aje/kwr140] [Citation(s) in RCA: 271] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The food frequency questionnaire approach to dietary assessment is ubiquitous in nutritional epidemiology research. Food records and recalls provide approaches that may also be adaptable for use in large epidemiologic cohorts, if warranted by better measurement properties. The authors collected (2007-2009) a 4-day food record, three 24-hour dietary recalls, and a food frequency questionnaire from 450 postmenopausal women in the Women's Health Initiative prospective cohort study (enrollment, 1994-1998), along with biomarkers of energy and protein consumption. Through comparison with biomarkers, the food record is shown to provide a stronger estimate of energy and protein than does the food frequency questionnaire, with 24-hour recalls mostly intermediate. Differences were smaller and nonsignificant for protein density. Food frequencies, records, and recalls were, respectively, able to "explain" 3.8%, 7.8%, and 2.8% of biomarker variation for energy; 8.4%, 22.6%, and 16.2% of biomarker variation for protein; and 6.5%, 11.0%, and 7.0% of biomarker variation for protein density. However, calibration equations that include body mass index, age, and ethnicity substantially improve these numbers to 41.7%, 44.7%, and 42.1% for energy; 20.3%, 32.7%, and 28.4% for protein; and 8.7%, 14.4%, and 10.4% for protein density. Calibration equations using any of the assessment procedures may yield suitable consumption estimates for epidemiologic study purposes.
Collapse
Affiliation(s)
- Ross L Prentice
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, P.O. Box 19024, Seattle, WA 98109-1024, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Prentice RL, Huang Y, Kuller LH, Tinker LF, Horn LV, Stefanick ML, Sarto G, Ockene J, Johnson KC. Biomarker-calibrated energy and protein consumption and cardiovascular disease risk among postmenopausal women. Epidemiology 2011; 22:170-9. [PMID: 21206366 PMCID: PMC3033986 DOI: 10.1097/ede.0b013e31820839bc] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
BACKGROUND Nutritional epidemiology cohort studies primarily use food frequency questionnaires (FFQs). In part because FFQs are more reliable for nutrient densities than for absolute nutrient consumption, reports from association studies typically present only nutrient density measures in relation to disease risk. METHODS We used objective biomarkers to correct FFQ assessments for measurement error, and examined absolute energy and protein consumption in relation to cardiovascular disease incidence. FFQs and subsequent physician-adjudicated cardiovascular disease incidence were assessed for 80,370 postmenopausal women in the age range 50-79 years at enrollment in the comparison group of the Dietary Modification Trial or the prospective Observational Study in the Women's Health Initiative. Urinary recovery biomarkers of energy and protein were obtained from a subsample of 544 women, with concurrent FFQ information. RESULTS After biomarker correction, energy consumption was positively associated with coronary heart disease incidence (hazard ratio = 1.18; 95% confidence interval = 1.04-1.33, for 20% consumption increment) and protein density was inversely associated (0.85 [0.75-0.97]). The positive energy association appeared to be mediated by body fat accumulation. Ischemic stroke incidence was inversely associated with energy and protein consumption, but not with protein density. CONCLUSIONS A positive association between energy and coronary heart disease risk can be attributed to body mass accumulation. Ischemic stroke risk is inversely associated with energy and protein consumption, possibly due to correlations between consumption and physical activity.
Collapse
Affiliation(s)
- Ross L Prentice
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Altmaier E, Kastenmüller G, Römisch-Margl W, Thorand B, Weinberger KM, Illig T, Adamski J, Döring A, Suhre K. Questionnaire-based self-reported nutrition habits associate with serum metabolism as revealed by quantitative targeted metabolomics. Eur J Epidemiol 2010; 26:145-56. [DOI: 10.1007/s10654-010-9524-7] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2010] [Accepted: 11/18/2010] [Indexed: 11/30/2022]
|
19
|
Chen J, Zhao X. A Bayesian measurement error approach to QT interval correction and prolongation. J Biopharm Stat 2010; 20:523-42. [PMID: 20358434 DOI: 10.1080/10543400903581960] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Assessment of QT interval prolongation is an integral part of clinical studies in drug development because a prolonged QT interval can cause sudden cardiac death. Traditionally a linear or non-linear regression method is applied to estimate subject- or group-specific heart rate corrected QT intervals (QTc) on which comparisons are based among treatment groups. These regression models rely on a fundamental assumption that the predictor variable (RR interval) is measured without error. However, the fact is that both QT and RR intervals measured in electrocardiogram (ECG) are subject to not only measurement error, but also fluctuation that is caused by physiological and biological factors. Hence the assumption in the regression models is most likely violated. In this paper we propose a Bayesian hierarchical measurement error model to evaluate QTc interval and prolongation. The proposed approach is illustrated using a real data set. Simulation studies show that our proposed Bayesian measurement error approach outperforms the current most commonly used frequentist methods.
Collapse
Affiliation(s)
- Jie Chen
- Abbott Laboratories, Abbott Park, Illinois, USA.
| | | |
Collapse
|
20
|
Song HR, Lawson AB, Nitcheva D. Bayesian hierarchical models for food frequency assessment. CAN J STAT 2010. [DOI: 10.1002/cjs.10052] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
21
|
Prentice RL, Huang Y, Tinker LF, Beresford SAA, Lampe JW, Neuhouser ML. Statistical Aspects of the Use of Biomarkers in Nutritional Epidemiology Research. STATISTICS IN BIOSCIENCES 2009; 1:112-123. [PMID: 19841649 DOI: 10.1007/s12561-009-9003-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Few strong and consistent associations have arisen from observational studies of dietary consumption in relation to chronic disease risk. Measurement error in self-reported dietary assessment may be obscuring many such associations. Attempts to correct for measurement error have mostly used a second self-report assessment in a subset of a study cohort to calibrate the self-report assessment used throughout the cohort, under the dubious assumption of uncorrelated measurement errors between the two assessments. The use, instead, of objective biomarkers of nutrient consumption to produce calibrated consumption estimates provides a promising approach to enhance study reliability. As summarized here, we have recently applied this nutrient biomarker approach to examine energy, protein, and percent of energy from protein, in relation to disease incidence in Women's Health Initiative cohorts, and find strong associations that are not evident without biomarker calibration. A major bottleneck for the broader use of a biomarker-calibration approach is the rather few nutrients for which a suitable biomarker has been developed. Some methodologic approaches to the development of additional pertinent biomarkers, including the possible use of a respiratory quotient from indirect calorimetry for macronutrient biomarker development, and the potential of human feeding studies for the evaluation of a range of urine- and blood-based potential biomarkers, will briefly be described.
Collapse
|
22
|
Prentice RL, Shaw PA, Bingham SA, Beresford SAA, Caan B, Neuhouser ML, Patterson RE, Stefanick ML, Satterfield S, Thomson CA, Snetselaar L, Thomas A, Tinker LF. Biomarker-calibrated energy and protein consumption and increased cancer risk among postmenopausal women. Am J Epidemiol 2009; 169:977-89. [PMID: 19258487 PMCID: PMC2732977 DOI: 10.1093/aje/kwp008] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2008] [Accepted: 01/06/2009] [Indexed: 12/22/2022] Open
Abstract
The authors previously reported equations, derived from the Nutrient Biomarker Study within the Women's Health Initiative, that produce calibrated estimates of energy, protein, and percentage of energy from protein consumption from corresponding food frequency questionnaire estimates and data on other factors, such as body mass index, age, and ethnicity. Here, these equations were applied to yield calibrated consumption estimates for 21,711 women enrolled in the Women's Health Initiative dietary modification trial comparison group and 59,105 women enrolled in the observational study. These estimates were related prospectively to total and site-specific invasive cancer incidence (1993-2005). In combined cohort analyses that do not control for body mass, uncalibrated energy was not associated with total cancer incidence or site-specific cancer incidence for most sites, whereas biomarker-calibrated energy was positively associated with total cancer (hazard ratio = 1.18, 95% confidence interval: 1.10, 1.27, for 20% consumption increase), as well as with breast, colon, endometrial, and kidney cancer (respective hazard ratios of 1.24, 1.35, 1.83, and 1.47). Calibrated protein was weakly associated, and calibrated percentage of energy from protein was inversely associated, with total cancer. Calibrated energy and body mass index associations were highly interdependent. Implications for the interpretation of nutritional epidemiology studies are described.
Collapse
Affiliation(s)
- Ross L Prentice
- Fred Hutchinson Cancer Research Center, Seattle, Washington 98109-1024, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Biomarkers in nutritional epidemiology: applications, needs and new horizons. Hum Genet 2009; 125:507-25. [DOI: 10.1007/s00439-009-0662-5] [Citation(s) in RCA: 323] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2009] [Accepted: 03/27/2009] [Indexed: 01/13/2023]
|
24
|
WANG CY. Non-parametric Maximum Likelihood Estimation for Cox Regression with Subject-Specific Measurement Error. Scand Stat Theory Appl 2008. [DOI: 10.1111/j.1467-9469.2008.00605.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
25
|
Favorability functions based on kernel density estimation for logistic models: A case study. Comput Stat Data Anal 2008. [DOI: 10.1016/j.csda.2008.03.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
26
|
Prentice RL. Observational studies, clinical trials, and the women's health initiative. LIFETIME DATA ANALYSIS 2007; 13:449-462. [PMID: 17943443 DOI: 10.1007/s10985-007-9047-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2007] [Accepted: 07/16/2007] [Indexed: 05/25/2023]
Abstract
The complementary roles fulfilled by observational studies and randomized controlled trials in the population science research agenda is illustrated using results from the Women's Health Initiative (WHI). Comparative and joint analyses of clinical trial and observational study data can enhance observational study design and analysis choices, and can augment randomized trial implications. These concepts are described in the context of findings from the WHI randomized trials of postmenopausal hormone therapy and of a low-fat dietary pattern, especially in relation to coronary heart disease, stroke, and breast cancer. The role of biomarkers of exposure and outcome, including high-dimensional genomic and proteomic biomarkers, in the elucidation of disease associations, will also be discussed in these same contexts.
Collapse
Affiliation(s)
- Ross L Prentice
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA.
| |
Collapse
|
27
|
Prentice RL. Epidemiologic Methods Developments: A Look Forward to the Year 2032. Ann Epidemiol 2007; 17:906-10. [PMID: 17855116 DOI: 10.1016/j.annepidem.2007.07.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2007] [Revised: 06/29/2007] [Accepted: 07/11/2007] [Indexed: 11/16/2022]
Abstract
This article responds to a request by the Editors for a perspective on potential epidemiologic methods developments between now and the year 2032 when the American College of Epidemiology will have its 50th Annual Meeting. The response begins by describing the need for enhanced methods in epidemiologic research and goes on to suggest some approaches to satisfying such needs. The suggested approaches include the more extensive use of biomarkers for exposure assessment, the greater standardization of data analysis and reporting methods, and enhancement of the interplay between observational studies and randomized controlled trials. It is argued that a phased approach to epidemiologic hypothesis evaluation may often be needed, with hypotheses that are promising in observational studies subjected to controlled trials having well-selected intermediate outcomes. It is also argued that a multidisciplinary, coordinated community of scientists interested in disease risk estimation and disease prevention will be needed for epidemiologic research to fulfill its potential over the next 25 years.
Collapse
Affiliation(s)
- Ross L Prentice
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109-1024, USA.
| |
Collapse
|