51
|
Paul S, Agger JF, Agerholm JS, Markussen B. Prevalence and risk factors of Coxiella burnetii seropositivity in Danish beef and dairy cattle at slaughter adjusted for test uncertainty. Prev Vet Med 2014; 113:504-11. [DOI: 10.1016/j.prevetmed.2014.01.018] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2013] [Revised: 01/08/2014] [Accepted: 01/13/2014] [Indexed: 10/25/2022]
|
52
|
Zhang Y, Berhane K. Bayesian mixed hidden Markov models: a multi-level approach to modeling categorical outcomes with differential misclassification. Stat Med 2013; 33:1395-408. [PMID: 24254432 DOI: 10.1002/sim.6039] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2012] [Revised: 10/15/2013] [Accepted: 10/21/2013] [Indexed: 11/06/2022]
Abstract
Questionnaire-based health status outcomes are often prone to misclassification. When studying the effect of risk factors on such outcomes, ignoring any potential misclassification may lead to biased effect estimates. Analytical challenges posed by these misclassified outcomes are further complicated when simultaneously exploring factors for both the misclassification and health processes in a multi-level setting. To address these challenges, we propose a fully Bayesian mixed hidden Markov model (BMHMM) for handling differential misclassification in categorical outcomes in a multi-level setting. The BMHMM generalizes the traditional hidden Markov model (HMM) by introducing random effects into three sets of HMM parameters for joint estimation of the prevalence, transition, and misclassification probabilities. This formulation not only allows joint estimation of all three sets of parameters but also accounts for cluster-level heterogeneity based on a multi-level model structure. Using this novel approach, both the true health status prevalence and the transition probabilities between the health states during follow-up are modeled as functions of covariates. The observed, possibly misclassified, health states are related to the true, but unobserved, health states and covariates. Results from simulation studies are presented to validate the estimation procedure, to show the computational efficiency due to the Bayesian approach and also to illustrate the gains from the proposed method compared to existing methods that ignore outcome misclassification and cluster-level heterogeneity. We apply the proposed method to examine the risk factors for both asthma transition and misclassification in the Southern California Children's Health Study.
Collapse
Affiliation(s)
- Yue Zhang
- Division of Epidemiology, University of Utah, Salt Lake City, UT 84108, U.S.A
| | | |
Collapse
|
53
|
Dohoo IR. Bias--is it a problem, and what should we do? Prev Vet Med 2013; 113:331-7. [PMID: 24176138 DOI: 10.1016/j.prevetmed.2013.10.008] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2013] [Revised: 09/28/2013] [Accepted: 10/06/2013] [Indexed: 11/29/2022]
Abstract
Observational studies are prone to two types of errors: random and systematic. Random error arises as a result of variation between samples that might be drawn in a study and can be reduced by increasing the sample size. Systematic error arises from problems with the study design or the methods used to obtain the study data and is not influenced by sample size. Over the last 20 years, veterinary epidemiologists have made great progress in dealing more effectively with random error (particularly through the use of multilevel models) but paid relatively little attention to systematic error. Systematic errors can arise from unmeasured confounders, selection bias and information bias. Unmeasured confounders include both factors which are known to be confounders but which were not measured in a study and factors which are not known to be confounders. Confounders can bias results toward or away from the null. The impact of selection bias can also be difficult to predict and can be negligible or large. Although the direction of information bias is generally toward the null, this cannot be guaranteed and its impact might be very large. Methods of dealing with systematic errors include: qualitative assessment, quantitative bias analysis and incorporation of bias parameters into the statistical analyses.
Collapse
Affiliation(s)
- Ian R Dohoo
- Department of Health Management, Atlantic Veterinary College, University of Prince Edward Island, Charlottetown, PEI C1A 4P3, Canada.
| |
Collapse
|
54
|
Luo S, Yi M, Huang X, Hunt KK. A Bayesian model for misclassified binary outcomes and correlated survival data with applications to breast cancer. Stat Med 2013; 32:2320-34. [PMID: 22996169 DOI: 10.1002/sim.5629] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2011] [Accepted: 08/27/2012] [Indexed: 01/14/2023]
Abstract
Breast cancer patients may experience ipsilateral breast tumor relapse (IBTR) after breast conservation therapy. IBTR is classified as either true local recurrence or new ipsilateral primary tumor. The correct classification of IBTR status has significant implications in therapeutic decision-making and patient management. However, the diagnostic tests to classify IBTR are imperfect and prone to misclassification. In addition, some observed survival data (e.g., time to relapse, time from relapse to death) are strongly correlated with IBTR status. We present a Bayesian approach to model the potentially misclassified IBTR status and the correlated survival information. We conduct the inference using a Bayesian framework via Markov chain Monte Carlo simulation implemented in WinBUGS. Extensive simulation shows that the proposed method corrects biases and provides more efficient estimates for the covariate effects on the probability of IBTR and the diagnostic test accuracy. Moreover, our method provides useful subject-specific patient prognostic information. Our method is motivated by, and applied to, a dataset of 397 breast cancer patients.
Collapse
Affiliation(s)
- Sheng Luo
- Division of Biostatistics, University of Texas School of Public Health, 1200 Pressler St, Houston, Texas 77030, USA.
| | | | | | | |
Collapse
|
55
|
Correction for misclassification of caries experience in the absence of internal validation data. Clin Oral Investig 2013; 17:1799-805. [PMID: 23665952 DOI: 10.1007/s00784-013-0993-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2012] [Accepted: 04/23/2013] [Indexed: 10/26/2022]
Abstract
OBJECTIVES To quantify the effects of risk factors and/or determinants on disease occurrence, it is important that the risk factors as well as the variable that measures the disease outcome are recorded with the least error as possible. When investigating the factors that influence a binary outcome, a logistic regression model is often fitted under the assumption that the data are collected without error. However, most categorical outcomes (e.g., caries experience) are accompanied by misclassification and this needs to be accounted for. The aim of this research was to adjust for binary outcome misclassification using an external validation study when investigating factors influencing caries experience in schoolchildren. MATERIALS AND METHODS Data from the Signal Tandmobiel(®) study were used. A total of 500 children from the main and 148 from the validation study were included in the analysis. Regression models (with several covariates) for sensitivity and specificity were used to adjust for misclassification in the main data. RESULTS The use of sensitivity and specificity modeled as functions of several covariates resulted in a better correction compared to using point estimates of sensitivity and specificity. Age, geographical location of the school to which the child belongs, dentition type, tooth type, and surface type were significantly associated with the prevalence of caries experience. CONCLUSIONS Sensitivity and specificity calculated based on an external validation study may resemble those obtained from an internal study if conditioned on a rich set of covariates. CLINICAL RELEVANCE Main data can be corrected for misclassification using information obtained from an external validation study when a rich set of covariates is recorded during calibration.
Collapse
|
56
|
Naranjo L, Martín J, Pérez CJ, Rufo MJ. Addressing misclassification for binary data: probit and t-link regressions. J STAT COMPUT SIM 2013. [DOI: 10.1080/00949655.2013.787424] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
57
|
Koop G, Collar CA, Toft N, Nielen M, van Werven T, Bacon D, Gardner IA. Risk factors for subclinical intramammary infection in dairy goats in two longitudinal field studies evaluated by Bayesian logistic regression. Prev Vet Med 2013. [DOI: 10.1016/j.prevetmed.2012.11.007] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
58
|
Dufour S, Dohoo IR, Barkema HW, Descôteaux L, Devries TJ, Reyher KK, Roy JP, Scholl DT. Epidemiology of coagulase-negative staphylococci intramammary infection in dairy cattle and the effect of bacteriological culture misclassification. J Dairy Sci 2012; 95:3110-24. [PMID: 22612947 DOI: 10.3168/jds.2011-5164] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2011] [Accepted: 02/05/2012] [Indexed: 11/19/2022]
Abstract
Objectives of this study were to identify the manageable risk factors associated with the lactational incidence, elimination, and prevalence of coagulase-negative staphylococci (CNS) intramammary infections (IMI) while taking into account the difficulties inherent to their diagnosis. A second objective was to evaluate the effect of CNS IMI misclassification in mastitis research. A cohort of 90 Canadian dairy herds was followed throughout 2007 to 2008. In each herd, series of quarter milk samples were collected from a subsample of cows and bacteriological culture was performed to identify prevalent, incident, and eliminated CNS IMI. Practices used on farms were captured using direct observations and a validated questionnaire. The relationships between herd CNS IMI prevalence and herd incidence and elimination rates were explored using linear regression. Manageable risk factors associated with the prevalence, incidence, or elimination of CNS IMI were identified via Bayesian analyses using a latent class model approach, allowing adjustment of the estimates for the imperfect sensitivity and specificity of bacteriological culture. After adjustment for the diagnostic test limitations, a mean CNS IMI quarter prevalence of 42.7% [95% confidence interval (CI): 34.7, 50.1] and incidence and elimination rates of 0.29 new IMI/quarter-month (95% CI: 0.21, 0.37) and 0.79 eliminated IMI/quarter-month (95% CI: 0.66, 0.91), respectively, were observed. Considerable biases of the estimates were observed when CNS IMI misclassification was ignored. These biases were important for measures of association with risk factors, were almost always toward the null value, and led to both type I and type II errors. Coagulase-negative staphylococci IMI incidence appeared to be a stronger determinant of herd IMI prevalence than IMI elimination rate. The majority of herds followed were already using blanket dry cow treatment and postmilking teat disinfection. A holistic approach considering associations with all 3 outcomes was used to interpret associations between manageable risk factors and CNS IMI. Sand and wood-based product bedding showed desirable associations with CNS IMI compared with straw bedding. Quarters of cows that had access to pasture during the sampling period had lower odds of acquiring a new CNS IMI and of having a prevalent CNS IMI. Many practices showed an association with only one of the CNS outcomes and should, therefore, be considered with caution.
Collapse
Affiliation(s)
- S Dufour
- Canadian Bovine Mastitis Research Network, C.P. 5000, St-Hyacinthe, Quebec J2S 7C6, Canada.
| | | | | | | | | | | | | | | |
Collapse
|
59
|
Sample size estimation to substantiate freedom from disease for clustered binary data with a specific risk profile. Epidemiol Infect 2012; 141:1318-27. [DOI: 10.1017/s0950268812001938] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
SUMMARYDisease cases are often clustered within herds or generally groups that share common characteristics. Sample size formulae must adjust for the within-cluster correlation of the primary sampling units. Traditionally, the intra-cluster correlation coefficient (ICC), which is an average measure of the data heterogeneity, has been used to modify formulae for individual sample size estimation. However, subgroups of animals sharing common characteristics, may exhibit excessively less or more heterogeneity. Hence, sample size estimates based on the ICC may not achieve the desired precision and power when applied to these groups. We propose the use of the variance partition coefficient (VPC), which measures the clustering of infection/disease for individuals with a common risk profile. Sample size estimates are obtained separately for those groups that exhibit markedly different heterogeneity, thus, optimizing resource allocation. A VPC-based predictive simulation method for sample size estimation to substantiate freedom from disease is presented. To illustrate the benefits of the proposed approach we give two examples with the analysis of data from a risk factor study on Mycobacterium avium subsp. paratuberculosis infection, in Danish dairy cattle and a study on critical control points for Salmonella cross-contamination of pork, in Greek slaughterhouses.
Collapse
|
60
|
|
61
|
Validation data-based adjustments for outcome misclassification in logistic regression: an illustration. Epidemiology 2011; 22:589-97. [PMID: 21487295 DOI: 10.1097/ede.0b013e3182117c85] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Misclassification of binary outcome variables is a known source of potentially serious bias when estimating adjusted odds ratios. Although researchers have described frequentist and Bayesian methods for dealing with the problem, these methods have seldom fully bridged the gap between statistical research and epidemiologic practice. In particular, there have been few real-world applications of readily grasped and computationally accessible methods that make direct use of internal validation data to adjust for differential outcome misclassification in logistic regression. In this paper, we illustrate likelihood-based methods for this purpose that can be implemented using standard statistical software. Using main study and internal validation data from the HIV Epidemiology Research Study, we demonstrate how misclassification rates can depend on the values of subject-specific covariates, and we illustrate the importance of accounting for this dependence. Simulation studies confirm the effectiveness of the maximum likelihood approach. We emphasize clear exposition of the likelihood function itself, to permit the reader to easily assimilate appended computer code that facilitates sensitivity analyses as well as the efficient handling of main/external and main/internal validation-study data. These methods are readily applicable under random cross-sectional sampling, and we discuss the extent to which the main/internal analysis remains appropriate under outcome-dependent (case-control) sampling.
Collapse
|
62
|
Myers SL, Baird DD, Olshan AF, Herring AH, Schroeder JC, Nylander-French LA, Hartmann KE. Self-report versus ultrasound measurement of uterine fibroid status. J Womens Health (Larchmt) 2011; 21:285-93. [PMID: 22044079 DOI: 10.1089/jwh.2011.3008] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
BACKGROUND Much of the epidemiologic research on risk factors for fibroids, the leading indication for hysterectomy, relies on self-reported outcome. Self-report is subject to misclassification because many women with fibroids are undiagnosed. The purpose of this analysis was to quantify the extent of misclassification and identify associated factors. METHODS Self-reported fibroid status was compared to ultrasound screening from 2046 women in Right From The Start (RFTS) and 869 women in the Uterine Fibroid Study (UFS). Log-binomial regression was used to estimate sensitivity (Se) and specificity (Sp) and examine differences by ethnicity, age, education, body mass index, parity, and miscarriage history. RESULTS Overall sensitivity was ≤0.50. Sensitivity was higher in blacks than whites (RFTS: 0.34 vs. 0.23; UFS: 0.58 vs. 0.32) and increased with age. Parous women had higher sensitivity than nulliparae, especially in RFTS whites (Se ratio=2.90; 95% confidence interval [CI]: 1.51, 5.60). Specificity was 0.98 in RFTS and 0.86 in UFS. Modest ethnic differences were seen in UFS (Sp ratio, black vs. white=0.90; 95% CI: 0.81, 0.99). Parity was inversely associated with specificity, especially among UFS black women (Sp ratio=0.84; 95% CI: 0.73, 0.97). Among women who reported a previous diagnosis, a shorter time interval between diagnosis and ultrasound was associated with increased agreement between the two measures. CONCLUSIONS Misclassification of fibroid status can differ by factors of etiologic interest. These findings are useful for assessing (and correcting) bias in studies using self-reported clinical diagnosis as the outcome measure.
Collapse
Affiliation(s)
- Sharon L Myers
- Department of Epidemiology, University of North Carolina, Chapel Hill, North Carolina, USA.
| | | | | | | | | | | | | |
Collapse
|
63
|
Ladouceur M, Rahme E, Bélisle P, Scott AN, Schwartzman K, Joseph L. Modeling continuous diagnostic test data using approximate Dirichlet process distributions. Stat Med 2011; 30:2648-62. [PMID: 21786286 DOI: 10.1002/sim.4320] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2010] [Accepted: 05/17/2011] [Indexed: 11/12/2022]
Abstract
There is now a large literature on the analysis of diagnostic test data. In the absence of a gold standard test, latent class analysis is most often used to estimate the prevalence of the condition of interest and the properties of the diagnostic tests. When test results are measured on a continuous scale, both parametric and nonparametric models have been proposed. Parametric methods such as the commonly used bi-normal model may not fit the data well; nonparametric methods developed to date have been relatively complex to apply in practice, and their properties have not been carefully evaluated in the diagnostic testing context. In this paper, we propose a simple yet flexible Bayesian nonparametric model which approximates a Dirichlet process for continuous data. We compare results from the nonparametric model with those from the bi-normal model via simulations, investigating both how much is lost in using a nonparametric model when the bi-normal model is correct and how much can be gained in using a nonparametric model when normality does not hold. We also carefully investigate the trade-offs that occur between flexibility and identifiability of the model as different Dirichlet process prior distributions are used. Motivated by an application to tuberculosis clustering, we extend our nonparametric model to accommodate two additional dichotomous tests and proceed to analyze these data using both the continuous test alone as well as all three tests together.
Collapse
Affiliation(s)
- Martin Ladouceur
- Department of Epidemiology and Biostatistics, McGill University, 1020 Pine Avenue West, Montreal, Quebec, H3A 1A2, Canada
| | | | | | | | | | | |
Collapse
|
64
|
Classification of ipsilateral breast tumor recurrences after breast conservation therapy can predict patient prognosis and facilitate treatment planning. Ann Surg 2011; 253:572-9. [PMID: 21209588 DOI: 10.1097/sla.0b013e318208fc2a] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
OBJECTIVE To classify ipsilateral breast tumor recurrences (IBTR) as either new primary tumors (NP) or true local recurrence (TR). We utilized 2 different methods and compared sensitivities and specificities between them. Our goal was to determine whether distinguishing NP from TR had prognostic value. BACKGROUND After breast-conservation therapy, IBTR may be classified into 2 distinct types (NP and TR). Studies have attempted to classify IBTR by using tumor location, histologic subtype, DNA flow cytometry data, or gene-expression profiling data. METHODS A total of 447 (7.9%) of 5660 patients undergoing breast-conservation therapy from 1970 to 2005 experienced IBTR. Clinical data from 397 patients were available for review. We classified IBTRs as NP or TR on the basis of either tumor location and histologic subtype (method 1) or tumor location, histologic subtype, estrogen receptor status and human epidermal growth factor receptor 2 status (method 2). Kaplan-Meier curves and log-rank tests were used to evaluate overall and disease-specific survival differences between the 2 groups. Classification methods were validated by calculating sensitivity and specificity values using a Bayesian method. RESULTS Of 397 patients, 196 (49.4%) were classified as NP by method 1 and 212 (53.4%) were classified as NP by method 2. The sensitivity and specificity values were 0.812 and 0.867 for method 1 and 0.870 and 0.800 for method 2, respectively. Regardless of method used, patients classified as NP developed contralateral breast carcinoma more often but had better 10-year overall and disease-specific survival rates than those classified as TR. Patients with TR were more likely to develop metastatic disease after IBTR. CONCLUSION Ipsilateral breast tumor recurrences classified as TR and NP had clinically different features, suggesting that classifying IBTR may provide clinically significant data for the management of IBTR.
Collapse
|
65
|
|
66
|
Hemming K, Hutton JL, Maguire MG, Marson AG. Meta-regression with partial information on summary trial or patient characteristics. Stat Med 2010; 29:1312-24. [PMID: 20087842 DOI: 10.1002/sim.3848] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
We present a model for meta-regression in the presence of missing information on some of the study level covariates, obtaining inferences using Bayesian methods. In practice, when confronted with missing covariate data in a meta-regression, it is common to carry out a complete case or available case analysis. We propose to use the full observed data, modelling the joint density as a factorization of a meta-regression model and a conditional factorization of the density for the covariates. With the inclusion of several covariates, inter-relations between these covariates are modelled. Under this joint likelihood-based approach, it is shown that the lesser assumption of the covariates being Missing At Random is imposed, instead of the more usual Missing Completely At Random (MCAR) assumption. The model is easily programmable in WinBUGS, and we examine, through the analysis of two real data sets, sensitivity and robustness of results to the MCAR assumption.
Collapse
Affiliation(s)
- K Hemming
- Department of Public Health, Epidemiology and Biostatistics, University of Birmingham, UK.
| | | | | | | |
Collapse
|
67
|
Chu R, Gustafson P, Le N. Bayesian adjustment for exposure misclassification in case-control studies. Stat Med 2010; 29:994-1003. [DOI: 10.1002/sim.3829] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2008] [Accepted: 11/23/2009] [Indexed: 11/10/2022]
|
68
|
Jones G, Johnson WO, Vink WD. Evaluating a continuous biomarker for infection by using observed disease status with covariate effects on disease. J R Stat Soc Ser C Appl Stat 2009. [DOI: 10.1111/j.1467-9876.2009.00681.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
69
|
Sweeting MJ, Hope VD, Hickman M, Parry JV, Ncube F, Ramsay ME, De Angelis D. Hepatitis C infection among injecting drug users in England and Wales (1992-2006): there and back again? Am J Epidemiol 2009; 170:352-60. [PMID: 19546152 PMCID: PMC2714950 DOI: 10.1093/aje/kwp141] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2009] [Accepted: 05/01/2009] [Indexed: 12/15/2022] Open
Abstract
Changes in hepatitis C virus (HCV) prevalence from 1992 to 2006 were examined by using 24,311 records from unlinked anonymous surveillance of injecting drug users in England and Wales. Bayesian logistic regression was used to estimate annual prevalence, accounting for changing recruitment patterns (age, gender, injecting duration, geographic region, interactions) and the sensitivity and specificity of different oral fluid testing devices. After controlling for these differences, the authors found that the adjusted HCV prevalence decreased from 70% (95% credible interval: 62, 78) in 1992 to 47% (95% credible interval: 43, 51) in 1998 before rising again to 53% (95% credible interval: 48, 58) in 2006. Women injecting drug users had a higher HCV risk than did men (odds ratio = 1.50, 95% credible interval: 1.31, 1.73). Two regions (London and North West) had a markedly higher HCV prevalence than did the rest of England and Wales. Among individuals who had injected for less than 1 year, the adjusted HCV prevalence in 2006 was higher than that in 1992 (28% vs. 19%, respectively). HCV infection can be prevented. The public health challenge in England and Wales is to increase action in order to regain a downward trend in HCV risk and the benefit that has been lost since 1998.
Collapse
Affiliation(s)
- Michael J Sweeting
- Medical Research Council Biostatistics Unit, Institute of Public Health, Robinson Way, Cambridge, UK.
| | | | | | | | | | | | | |
Collapse
|
70
|
Bayesian estimation of variance partition coefficients adjusted for imperfect test sensitivity and specificity. Prev Vet Med 2009; 89:155-62. [DOI: 10.1016/j.prevetmed.2009.02.008] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2008] [Revised: 11/10/2008] [Accepted: 02/15/2009] [Indexed: 11/15/2022]
|
71
|
Cheng D, Stamey JD, Branscum AJ. Bayesian approach to average power calculations for binary regression models with misclassified outcomes. Stat Med 2009; 28:848-63. [PMID: 19061210 DOI: 10.1002/sim.3505] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
We develop a simulation-based procedure for determining the required sample size in binomial regression risk assessment studies when response data are subject to misclassification. A Bayesian average power criterion is used to determine a sample size that provides high probability, averaged over the distribution of potential future data sets, of correctly establishing the direction of association between predictor variables and the probability of event occurrence. The method is broadly applicable to any parametric binomial regression model including, but not limited to, the popular logistic, probit, and complementary log-log models. We detail a common medical scenario wherein ascertainment of true disease status is impractical or otherwise impeded, and in its place the outcome of a single binary diagnostic test is used as a surrogate. These methods are then extended to the two diagnostic test setting. We illustrate the method with categorical covariates using one example that involves screening for human papillomavirus. This example coupled with results from simulated data highlights the utility of our Bayesian sample size procedure with error prone measurements.
Collapse
Affiliation(s)
- Dunlei Cheng
- Institute for Health Care Research and Improvement, Baylor Health Care System, Dallas, TX 75206, U.S.A.
| | | | | |
Collapse
|
72
|
Stamey JD, Young DM, Seaman JW. A Bayesian approach to adjust for diagnostic misclassification between two mortality causes in Poisson regression. Stat Med 2008; 27:2440-52. [PMID: 17979218 DOI: 10.1002/sim.3134] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Response misclassification of counted data biases and understates the uncertainty of parameter estimators in Poisson regression models. To correct these problems, researchers have devised classical procedures that rely on asymptotic distribution results and supplemental validation data in order to estimate unknown misclassification parameters. We derive a new Bayesian Poisson regression procedure that accounts and corrects for misclassification for a count variable with two categories. Under the Bayesian paradigm, one can use validation data, expert opinion, or a combination of these two approaches to correct for the consequences of misclassification. The Bayesian procedure proposed here yields an operationally effective way to correct and account for misclassification effects in Poisson count regression models. We demonstrate the performance of the model in a simulation study. Additionally, we analyze two real-data examples and compare our new Bayesian inference method that adjusts for misclassification with a similar analysis that ignores misclassification.
Collapse
Affiliation(s)
- James D Stamey
- Department of Statistical Science, Baylor University, Waco, TX 76798-7140, USA.
| | | | | |
Collapse
|
73
|
McGlothlin A, Stamey JD, Seaman JW. Binary regression with misclassified response and covariate subject to measurement error: a bayesian approach. Biom J 2008; 50:123-34. [PMID: 18283683 DOI: 10.1002/bimj.200710402] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
We consider a Bayesian analysis for modeling a binary response that is subject to misclassification. Additionally, an explanatory variable is assumed to be unobservable, but measurements are available on its surrogate. A binary regression model is developed to incorporate the measurement error in the covariate as well as the misclassification in the response. Unlike existing methods, no model parameters need be assumed known. Markov chain Monte Carlo methods are utilized to perform the necessary computations. The methods developed are illustrated using atomic bomb survival data. A simulation experiment explores advantages of the approach.
Collapse
Affiliation(s)
- Anna McGlothlin
- Exploratory Program Medical Statistics, Eli Lilly and Company, Lilly Corporate Center DC 0710, Indianapolis, IN 46285, USA.
| | | | | |
Collapse
|
74
|
Branscum AJ, Johnson WO, Hanson TE, Gardner IA. Bayesian semiparametric ROC curve estimation and disease diagnosis. Stat Med 2008; 27:2474-96. [PMID: 18300333 DOI: 10.1002/sim.3250] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Adam J Branscum
- Department of Biostatistics, University of Kentucky, Lexington, KY 40536, USA.
| | | | | | | |
Collapse
|
75
|
Branscum AJ, Johnson WO, Thurmond MC. BAYESIAN BETA REGRESSION: APPLICATIONS TO HOUSEHOLD EXPENDITURE DATA AND GENETIC DISTANCE BETWEEN FOOT-AND-MOUTH DISEASE VIRUSES. AUST NZ J STAT 2007. [DOI: 10.1111/j.1467-842x.2007.00481.x] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
76
|
Ladouceur M, Rahme E, Pineau CA, Joseph L. Robustness of prevalence estimates derived from misclassified data from administrative databases. Biometrics 2007; 63:272-9. [PMID: 17447953 DOI: 10.1111/j.1541-0420.2006.00665.x] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Because primary data collection can be expensive, researchers are increasingly using information collected in medical administrative databases for scientific purposes. This information, however, is typically collected for reasons other than research, and many such databases have been shown to contain substantial proportions of misclassification errors. For example, many administrative databases contain fields for patient diagnostic codes, but these are often missing or inaccurate, in part because physician reimbursement schemes depend on medical acts performed rather than any diagnosis. Errors in ascertaining which individuals have a given disease bias not only prevalence estimates, but also estimates of associations between the disease and other variables, such as medication use. We attempt to estimate the prevalence of osteoarthritis (OA) among elderly Quebeckers using a government administrative database. We compare a naive estimate relying solely on the physician diagnoses of OA listed in the database to estimates from several different Bayesian latent class models which adjust for misclassified physician diagnostic codes via use of other available diagnostic clues. We find that the prevalence estimates vary widely, depending on the model used and assumptions made. We conclude that any inferences from these databases need to be interpreted with great caution, until further work estimating the reliability of database items is carried out.
Collapse
Affiliation(s)
- Martin Ladouceur
- Division of Clinical Epidemiology, Montreal General Hospital, 687 Pine Avenue West, V-Building, Montreal, Quebec H3A 1A1, Canada
| | | | | | | |
Collapse
|
77
|
Dion N, Cotart JL, Rabilloud M. Correction of nutrition test errors for more accurate quantification of the link between dental health and malnutrition. Nutrition 2007; 23:301-7. [PMID: 17360158 DOI: 10.1016/j.nut.2007.01.009] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2006] [Revised: 01/19/2007] [Accepted: 01/23/2007] [Indexed: 10/23/2022]
Abstract
OBJECTIVE We quantified the link between tooth deterioration and malnutrition in institutionalized elderly subjects, taking into account the major risk factors for malnutrition and adjusting for the measurement error made in using the Mini Nutritional Assessment questionnaire. METHODS Data stem from a survey conducted in 2005 in 1094 subjects >or=60 y of age from a large sample of 100 institutions of the Rhône-Alpes region of France. A Bayesian approach was used to quantify the effect of tooth deterioration on malnutrition through a two-level logistic regression. This approach allowed taking into account the uncertainty on sensitivity and specificity of the Mini Nutritional Assessment questionnaire to adjust for the measurement error of that test. RESULTS After adjustment for other risk factors, the risk of malnutrition increased significantly and continuously 1.15 times (odds ratio 1.15, 95% credibility interval 1.06-1.25) whenever the masticatory percentage decreased by 10 points, which is equivalent to the loss of two molars. The strongest factors that augmented the probability of malnutrition were deglutition disorders, depression, and verbal inconsistency. Dependency was also an important factor; the odds of malnutrition nearly doubled for each additional grade of dependency (graded 6 to 1). Diabetes, central neurodegenerative disease, and carcinoma tended to increase the probability of malnutrition but their effect was not statistically significant. CONCLUSION Dental status should be considered a serious risk factor for malnutrition. Regular dental examination and care should preserve functional dental integrity to prevent malnutrition in institutionalized elderly people.
Collapse
Affiliation(s)
- Nathalie Dion
- Caisse Nationale d'Assurance Maladie des Travailleurs Salariés, Direction Régionale du Service Médical Rhône-Alpes, Lyon, France
| | | | | |
Collapse
|
78
|
Abstract
This review examines the state of Bayesian thinking as Statistics in Medicine was launched in 1982, reflecting particularly on its applicability and uses in medical research. It then looks at each subsequent five-year epoch, with a focus on papers appearing in Statistics in Medicine, putting these in the context of major developments in Bayesian thinking and computation with reference to important books, landmark meetings and seminal papers. It charts the growth of Bayesian statistics as it is applied to medicine and makes predictions for the future. From sparse beginnings, where Bayesian statistics was barely mentioned, Bayesian statistics has now permeated all the major areas of medical statistics, including clinical trials, epidemiology, meta-analyses and evidence synthesis, spatial modelling, longitudinal modelling, survival modelling, molecular genetics and decision-making in respect of new technologies.
Collapse
Affiliation(s)
- Deborah Ashby
- Wolfson Institute of Preventive Medicine, Barts and The London, Queen Mary's School of Medicine & Dentistry, University of London, Charterhouse Square, London EC1M 6BQ, UK.
| |
Collapse
|
79
|
McMahon JM, Pouget ER, Tortu S. A guide for multilevel modeling of dyadic data with binary outcomes using SAS PROC NLMIXED. Comput Stat Data Anal 2006; 50:3663-3680. [PMID: 16926924 PMCID: PMC1550976 DOI: 10.1016/j.csda.2005.08.008] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In the social and health sciences, data are often structured hierarchically, with individuals nested within groups. Dyads constitute a special case of hierarchically structured data with variation at both the individual and dyadic level. Analyses of data from dyads pose several challenges due to the interdependence between members within dyads and issues related to small group sizes. Multilevel analytic techniques have been developed and applied to dyadic data in an attempt to resolve these issues. In this article, we describe a set of analyses for modeling individual- and dyad-level influences on binary outcomes using SAS statistical software; and we discuss the benefits and limitations of such an approach. For illustrative purposes, we apply these techniques to estimate individual-dyad-level predictors of viral hepatitis C infection among heterosexual couples in East Harlem, New York City.
Collapse
|