1
|
Shiomi M, Takada T, Otori K, Shibuya K. Frequency of missed doses and its effects on the regulation of glucose levels in patients with type 2 diabetes: A retrospective analysis. Medicine (Baltimore) 2024; 103:e37711. [PMID: 38608082 PMCID: PMC11018172 DOI: 10.1097/md.0000000000037711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 03/04/2024] [Indexed: 04/14/2024] Open
Abstract
This study aimed to investigate the association between medication adherence to oral hypoglycemic agents (OHAs) and HbA1c levels in patients with type 2 diabetes mellitus (T2DM) for more than 48 weeks, as well as the factors affecting long-term adherence to OHAs. This retrospective study included 83 patients who had been receiving OHAs for T2DM for ≥48 weeks. Medication adherence values (MAVs) were calculated using the following formula: (total prescription days - prescription days of OHAs brought at admission)/(days from the initiation of OHAs to hospitalization). We assessed the association between HbA1c and MAVs using the Jonckheere-Terpstra test. Furthermore, we examined the association between patient- and medication-related factors and MAVs affecting HbA1c levels. Based on the results, MAVs were categorized as MAV ≤0.86 and MAV >0.86, and factors affecting MAVs were analyzed. Logistic regression analysis revealed that the total number of medications, the number of nonhypoglycemic agents, and a family history of diabetes were independent determinants of MAV ≤0.86 (P < .05). Multiple regression analyses indicated that the number of dosages per day and the timing of OHA administration at lunch were independent determinants of lower MAVs (P < .05). Our findings suggest that poor medication adherence is associated with elevated HbA1c levels in T2DM patients. Independent factors contributing to poor adherence include a lower number of prescribed medications, fewer nonhypoglycemic agents, no family history, a higher daily dosage frequency, and the administration of OHAs at lunch.
Collapse
Affiliation(s)
- Megumi Shiomi
- Department of Clinical Pharmacy, School of Pharmacy, Kitasato University, Tokyo, Japan
- Department of Pharmacy, Kitasato University Medical Center, Kitamoto, Japan
| | - Tesshu Takada
- Department of Endocrinology, Diabetes, and Metabolism, School of Medicine, Kitasato University, Sagamihara, Japan
| | - Katsuya Otori
- Department of Clinical Pharmacy, School of Pharmacy, Kitasato University, Tokyo, Japan
- Department of Pharmacy, Kitasato University Medical Center, Kitamoto, Japan
| | - Kiyoshi Shibuya
- Department of Clinical Pharmacy, School of Pharmacy, Kitasato University, Tokyo, Japan
- Department of Pharmacy, Kitasato University Medical Center, Kitamoto, Japan
| |
Collapse
|
2
|
Xue M, Chen Y. A Stan tutorial on Bayesian IRTree models: Conventional models and explanatory extension. Behav Res Methods 2024; 56:1817-1837. [PMID: 37095325 PMCID: PMC10124709 DOI: 10.3758/s13428-023-02121-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/30/2023] [Indexed: 04/26/2023]
Abstract
IRTree models have been receiving increasing attention. However, to date, there are limited sources that provide a systematic introduction to Bayesian modeling techniques using modern probabilistic programming frameworks for the implementation of IRTree models. To facilitate the research and application of IRTree models, this paper introduces how to perform two families of Bayesian IRTree models (i.e., response tree models and latent tree models) in Stan and how to extend them in an explanatory way. Some suggestions on executing Stan codes and checking convergence are also provided. An empirical study based on the Oxford Achieving Resilience during COVID-19 data was conducted as an example to further illustrate how to apply Bayesian IRTree models to address research questions. Finally, strengths and future directions are discussed.
Collapse
Affiliation(s)
- Mingfeng Xue
- Berkeley School of Education, University of California Berkeley, Berkeley, CA, USA.
| | - Yi Chen
- Teachers College, Columbia University, New York, NY, USA
| |
Collapse
|
3
|
König C, Spoden C, Frey A. Robustness of the performance of the optimized hierarchical two-parameter logistic IRT model for small-sample item calibration. Behav Res Methods 2023; 55:3965-3983. [PMID: 36333627 PMCID: PMC10700496 DOI: 10.3758/s13428-022-02000-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/05/2022] [Indexed: 11/06/2022]
Abstract
Hierarchical Bayesian modeling is beneficial when complex models with many parameters of the same type, such as item response theory (IRT) models, are to be estimated with sparse data. Recently, Koenig et al. (Applied Psychological Measurement, 44, 311-326, 2020) illustrated in an optimized hierarchical Bayesian two-parameter logistic model (OH2PL) how to avoid bias due to unintended shrinkage or degeneracies of the posterior, and how to benefit from this approach in small samples. The generalizability of their findings, however, is limited because they investigated only a single specification of the hyperprior structure. Consequently, in a comprehensive simulation study, we investigated the robustness of the performance of the novel OH2PL in several specifications of their hyperpriors under a broad range of data conditions. We show that the novel OH2PL in the half-Cauchy or Exponential configuration yields unbiased (in terms of bias) model parameter estimates in small samples of N = 50. Moreover, it outperforms (especially in terms of the RMSE of the item discrimination parameters) marginal maximum likelihood (MML) estimation and its nonhierarchical counterpart. This further corroborates the possibility that hierarchical Bayesian IRT models behave differently than general hierarchical Bayesian models. We discuss these results regarding the applicability of complex IRT models in small-scale situations typical in psychological research, and illustrate the extended applicability of the 2PL IRT model with an empirical example.
Collapse
Affiliation(s)
| | | | - Andreas Frey
- Goethe University Frankfurt, Frankfurt am Main, Germany
| |
Collapse
|
4
|
Merhof V, Meiser T. Dynamic Response Strategies: Accounting for Response Process Heterogeneity in IRTree Decision Nodes. PSYCHOMETRIKA 2023; 88:1354-1380. [PMID: 36746887 PMCID: PMC10656330 DOI: 10.1007/s11336-023-09901-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Indexed: 06/18/2023]
Abstract
It is essential to control self-reported trait measurements for response style effects to ensure a valid interpretation of estimates. Traditional psychometric models facilitating such control consider item responses as the result of two kinds of response processes-based on the substantive trait, or based on response styles-and they assume that both of these processes have a constant influence across the items of a questionnaire. However, this homogeneity over items is not always given, for instance, if the respondents' motivation declines throughout the questionnaire so that heuristic responding driven by response styles may gradually take over from cognitively effortful trait-based responding. The present study proposes two dynamic IRTree models, which account for systematic continuous changes and additional random fluctuations of response strategies, by defining item position-dependent trait and response style effects. Simulation analyses demonstrate that the proposed models accurately capture dynamic trajectories of response processes, as well as reliably detect the absence of dynamics, that is, identify constant response strategies. The continuous version of the dynamic model formalizes the underlying response strategies in a parsimonious way and is highly suitable as a cognitive model for investigating response strategy changes over items. The extended model with random fluctuations of strategies can adapt more closely to the item-specific effects of different response processes and thus is a well-fitting model with high flexibility. By using an empirical data set, the benefits of the proposed dynamic approaches over traditional IRTree models are illustrated under realistic conditions.
Collapse
Affiliation(s)
- Viola Merhof
- Department of Psychology, University of Mannheim, L 13 15, 68161, Mannheim, Germany.
| | - Thorsten Meiser
- Department of Psychology, University of Mannheim, L 13 15, 68161, Mannheim, Germany
| |
Collapse
|
5
|
Nishio M, Ota E, Matsuo H, Matsunaga T, Miyazaki A, Murakami T. Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance. PeerJ Comput Sci 2023; 9:e1620. [PMID: 37869462 PMCID: PMC10588711 DOI: 10.7717/peerj-cs.1620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 09/06/2023] [Indexed: 10/24/2023]
Abstract
Purpose The purpose of this study is to compare two libraries dedicated to the Markov chain Monte Carlo method: pystan and numpyro. In the comparison, we mainly focused on the agreement of estimated latent parameters and the performance of sampling using the Markov chain Monte Carlo method in Bayesian item response theory (IRT). Materials and methods Bayesian 1PL-IRT and 2PL-IRT were implemented with pystan and numpyro. Then, the Bayesian 1PL-IRT and 2PL-IRT were applied to two types of medical data obtained from a published article. The same prior distributions of latent parameters were used in both pystan and numpyro. Estimation results of latent parameters of 1PL-IRT and 2PL-IRT were compared between pystan and numpyro. Additionally, the computational cost of the Markov chain Monte Carlo method was compared between the two libraries. To evaluate the computational cost of IRT models, simulation data were generated from the medical data and numpyro. Results For all the combinations of IRT types (1PL-IRT or 2PL-IRT) and medical data types, the mean and standard deviation of the estimated latent parameters were in good agreement between pystan and numpyro. In most cases, the sampling time using the Markov chain Monte Carlo method was shorter in numpyro than that in pystan. When the large-sized simulation data were used, numpyro with a graphics processing unit was useful for reducing the sampling time. Conclusion Numpyro and pystan were useful for applying the Bayesian 1PL-IRT and 2PL-IRT. Our results show that the two libraries yielded similar estimation result and that regarding to sampling time, the fastest libraries differed based on the dataset size.
Collapse
Affiliation(s)
- Mizuho Nishio
- Department of Radiology, Kobe University Graduate School of Medicine, Kobe, Japan
| | - Eiji Ota
- Futaba Numerical Technologies, Iruma, Japan
| | - Hidetoshi Matsuo
- Department of Radiology, Kobe University Graduate School of Medicine, Kobe, Japan
| | - Takaaki Matsunaga
- Department of Radiology, Kobe University Graduate School of Medicine, Kobe, Japan
| | - Aki Miyazaki
- Department of Radiology, Kobe University Graduate School of Medicine, Kobe, Japan
| | - Takamichi Murakami
- Department of Radiology, Kobe University Graduate School of Medicine, Kobe, Japan
| |
Collapse
|
6
|
Uto M. A Bayesian many-facet Rasch model with Markov modeling for rater severity drift. Behav Res Methods 2023; 55:3910-3928. [PMID: 36284065 PMCID: PMC10615980 DOI: 10.3758/s13428-022-01997-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/30/2022] [Indexed: 11/08/2022]
Abstract
Fair performance assessment requires consideration of the effects of rater severity on scoring. The many-facet Rasch model (MFRM), an item response theory model that incorporates rater severity parameters, has been widely used for this purpose. Although a typical MFRM assumes that rater severity does not change during the rating process, in actuality rater severity is known to change over time, a phenomenon called rater severity drift. To investigate this drift, several extensions of the MFRM have been proposed that incorporate time-specific rater severity parameters. However, these previous models estimate the severity parameters under the assumption of temporal independence. This introduces inefficiency into the parameter estimation because severities between adjacent time points tend to have temporal dependency in practice. To resolve this problem, we propose a Bayesian extension of the MFRM that incorporates time dependency for the rater severity parameters, based on a Markov modeling approach. The proposed model can improve the estimation accuracy of the time-specific rater severity parameters, resulting in improved estimation accuracy for the other rater parameters and for model fitting. We demonstrate the effectiveness of the proposed model through simulation experiments and application to actual data.
Collapse
Affiliation(s)
- Masaki Uto
- The University of Electro-Communications, Tokyo, Japan.
| |
Collapse
|
7
|
Wang X, Zhang J, Lu J, Cheng G, Shi N. Exploration and analysis of a generalized one-parameter item response model with flexible link functions. Front Psychol 2023; 14:1248454. [PMID: 37711320 PMCID: PMC10498775 DOI: 10.3389/fpsyg.2023.1248454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 08/10/2023] [Indexed: 09/16/2023] Open
Abstract
This paper primarily analyzes the one-parameter generalized logistic (1PGlogit) model, which is a generalized model containing other one-parameter item response theory (IRT) models. The essence of the 1PGlogit model is the introduction of a generalized link function that includes the probit, logit, and complementary log-log functions. By transforming different parameters, the 1PGlogit model can flexibly adjust the speed at which the item characteristic curve (ICC) approaches the upper and lower asymptote, breaking the previous constraints in one-parameter IRT models where the ICC curves were either all symmetric or all asymmetric. This allows for a more flexible way to fit data and achieve better fitting performance. We present three simulation studies, specifically designed to validate the accuracy of parameter estimation for a variety of one-parameter IRT models using the Stan program, illustrate the advantages of the 1PGlogit model over other one-parameter IRT models from a model fitting perspective, and demonstrate the effective fit of the 1PGlogit model with the three-parameter logistic (3PL) and four-parameter logistic (4PL) models. Finally, we demonstrate the good fitting performance of the 1PGlogit model through an analysis of real data.
Collapse
Affiliation(s)
- Xue Wang
- Key Laboratory of Applied Statistics of Ministry of Education (MOE), School of Mathematics and Statistics, Northeast Normal University, Changchun, China
| | - Jiwei Zhang
- Faculty of Education, Northeast Normal University, Changchun, China
| | - Jing Lu
- Key Laboratory of Applied Statistics of Ministry of Education (MOE), School of Mathematics and Statistics, Northeast Normal University, Changchun, China
| | - Guanghui Cheng
- Guangzhou Institute of International Finance, Guangzhou University, Guangzhou, China
| | - Ningzhong Shi
- Key Laboratory of Applied Statistics of Ministry of Education (MOE), School of Mathematics and Statistics, Northeast Normal University, Changchun, China
| |
Collapse
|
8
|
Tu N, Zhang B, Angrave L, Sun T, Neuman M. Estimating the Multidimensional Generalized Graded Unfolding Model with Covariates Using a Bayesian Approach. J Intell 2023; 11:163. [PMID: 37623546 PMCID: PMC10455612 DOI: 10.3390/jintelligence11080163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 08/05/2023] [Accepted: 08/09/2023] [Indexed: 08/26/2023] Open
Abstract
Noncognitive constructs are commonly assessed in educational and organizational research. They are often measured by summing scores across items, which implicitly assumes a dominance item response process. However, research has shown that the unfolding response process may better characterize how people respond to noncognitive items. The Generalized Graded Unfolding Model (GGUM) representing the unfolding response process has therefore become increasingly popular. However, the current implementation of the GGUM is limited to unidimensional cases, while most noncognitive constructs are multidimensional. Fitting a unidimensional GGUM separately for each dimension and ignoring the multidimensional nature of noncognitive data may result in suboptimal parameter estimation. Recently, an R package bmggum was developed that enables the estimation of the Multidimensional Generalized Graded Unfolding Model (MGGUM) with covariates using a Bayesian algorithm. However, no simulation evidence is available to support the accuracy of the Bayesian algorithm implemented in bmggum. In this research, two simulation studies were conducted to examine the performance of bmggum. Results showed that bmggum can estimate MGGUM parameters accurately, and that multidimensional estimation and incorporating relevant covariates into the estimation process improved estimation accuracy. The effectiveness of two Bayesian model selection indices, WAIC and LOO, were also investigated and found to be satisfactory for model selection. Empirical data were used to demonstrate the use of bmggum and its performance was compared with three other GGUM software programs: GGUM2004, GGUM, and mirt.
Collapse
Affiliation(s)
- Naidan Tu
- Department of Psychology, University of South Florida, Tampa, FL 33620, USA
| | - Bo Zhang
- School of Labor and Employment Relations & Department of Psychology, University of Illinois Urbana-Champaign, Champaign, IL 61820, USA
| | - Lawrence Angrave
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Tianjun Sun
- Department of Psychological Sciences, Kansas State University, Manhattan, KS 66506, USA
| | - Mathew Neuman
- Department of Psychological & Brain Sciences, Texas A & M University, College Station, TX 77840, USA
| |
Collapse
|
9
|
Nishio M, Kobayashi D, Matsuo H, Urase Y, Nishioka E, Murakami T. Bayesian multidimensional nominal response model for observer study of radiologists. Jpn J Radiol 2022; 41:449-455. [PMID: 36469224 PMCID: PMC9734816 DOI: 10.1007/s11604-022-01366-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 11/23/2022] [Indexed: 12/08/2022]
Abstract
PURPOSE This study proposes a Bayesian multidimensional nominal response model (MD-NRM) to statistically analyze the nominal response of multiclass classifications. MATERIALS AND METHODS First, for MD-NRM, we extended the conventional nominal response model to achieve stable convergence of the Bayesian nominal response model and utilized multidimensional ability parameters. We then applied MD-NRM to a 3-class classification problem, where radiologists visually evaluated chest X-ray images and selected their diagnosis from one of the three classes. The classification problem consisted of 150 cases, and each of the six radiologists selected their diagnosis based on a visual evaluation of the images. Consequently, 900 (= 150 × 6) nominal responses were obtained. In MD-NRM, we assumed that the responses were determined by the softmax function, the ability of radiologists, and the difficulty of images. In addition, we assumed that the multidimensional ability of one radiologist were represented by a 3 × 3 matrix. The latent parameters of the MD-NRM (ability parameters of radiologists and difficulty parameters of images) were estimated from the 900 responses. To implement Bayesian MD-NRM and estimate the latent parameters, a probabilistic programming language (Stan, version 2.21.0) was used. RESULTS For all parameters, the Rhat values were less than 1.10. This indicates that the latent parameters of the MD-NRM converged successfully. CONCLUSION The results show that it is possible to estimate the latent parameters (ability and difficulty parameters) of the MD-NRM using Stan. Our code for the implementation of the MD-NRM is available as open source.
Collapse
Affiliation(s)
- Mizuho Nishio
- Department of Radiology, Kobe University Graduate School of Medicine, 7-5-2 Kusunoki-Cho, Chuo-Ku, Kobe, 650-0017 Japan
| | - Daigo Kobayashi
- Department of Radiology, Kobe University Graduate School of Medicine, 7-5-2 Kusunoki-Cho, Chuo-Ku, Kobe, 650-0017 Japan
| | - Hidetoshi Matsuo
- Department of Radiology, Kobe University Graduate School of Medicine, 7-5-2 Kusunoki-Cho, Chuo-Ku, Kobe, 650-0017 Japan
| | - Yasuyo Urase
- Department of Radiology, Kobe University Graduate School of Medicine, 7-5-2 Kusunoki-Cho, Chuo-Ku, Kobe, 650-0017 Japan
| | - Eiko Nishioka
- Department of Radiology, Kobe University Graduate School of Medicine, 7-5-2 Kusunoki-Cho, Chuo-Ku, Kobe, 650-0017 Japan
| | - Takamichi Murakami
- Department of Radiology, Kobe University Graduate School of Medicine, 7-5-2 Kusunoki-Cho, Chuo-Ku, Kobe, 650-0017 Japan
| |
Collapse
|
10
|
Man K, Harring JR, Zhan P. Bridging Models of Biometric and Psychometric Assessment: A Three-Way Joint Modeling Approach of Item Responses, Response Times, and Gaze Fixation Counts. APPLIED PSYCHOLOGICAL MEASUREMENT 2022; 46:361-381. [PMID: 35812811 PMCID: PMC9265489 DOI: 10.1177/01466216221089344] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Recently, joint models of item response data and response times have been proposed to better assess and understand test takers' learning processes. This article demonstrates how biometric information such as gaze fixation counts obtained from an eye-tracking machine can be integrated into the measurement model. The proposed joint modeling framework accommodates the relations among a test taker's latent ability, working speed and test engagement level via a person-side variance-covariance structure, while simultaneously permitting the modeling of item difficulty, time-intensity, and the engagement intensity through an item-side variance-covariance structure. A Bayesian estimation scheme is used to fit the proposed model to data. Posterior predictive model checking based on three discrepancy measures corresponding to various model components are introduced to assess model-data fit. Findings from a Monte Carlo simulation and results from analyzing experimental data demonstrate the utility of the model.
Collapse
Affiliation(s)
- Kaiwen Man
- University of Alabama, Tuscaloosa, AL, USA
- Kaiwen Man, Educational Research Program, Educational Studies in Psychology, Research Methodology, and Counseling, 313 Carmichael Box 870231, University of Alabama, Tuscaloosa, AL 35487, USA.
| | | | - Peida Zhan
- Zhejiang Normal University, Jinhua, China
| |
Collapse
|
11
|
Zhang J, Zhang YY, Tao J, Chen MH. Bayesian Item Response Theory Models With Flexible Generalized Logit Links. APPLIED PSYCHOLOGICAL MEASUREMENT 2022; 46:382-405. [PMID: 35812812 PMCID: PMC9265488 DOI: 10.1177/01466216221089343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
In educational and psychological research, the logit and probit links are often used to fit the binary item response data. The appropriateness and importance of the choice of links within the item response theory (IRT) framework has not been investigated yet. In this paper, we present a family of IRT models with generalized logit links, which include the traditional logistic and normal ogive models as special cases. This family of models are flexible enough not only to adjust the item characteristic curve tail probability by two shape parameters but also to allow us to fit the same link or different links to different items within the IRT model framework. In addition, the proposed models are implemented in the Stan software to sample from the posterior distributions. Using readily available Stan outputs, the four Bayesian model selection criteria are computed for guiding the choice of the links within the IRT model framework. Extensive simulation studies are conducted to examine the empirical performance of the proposed models and the model fittings in terms of "in-sample" and "out-of-sample" predictions based on the deviance. Finally, a detailed analysis of the real reading assessment data is carried out to illustrate the proposed methodology.
Collapse
Affiliation(s)
- Jiwei Zhang
- Faculty of Education Northeast Normal University, Changchun, China
| | - Ying-Ying Zhang
- Department of Statistics and Actuarial Science, Chongqing University, Chongqing, China
| | - Jian Tao
- School of Mathematics and Statistics, Northeast Normal University, Changchun, China
| | - Ming-Hui Chen
- Department of Statistics, University of Connecticut, Storrs, CT, USA
| |
Collapse
|
12
|
Chang JC, Porcino J, Rasch EK, Tang L. Regularized Bayesian calibration and scoring of the WD-FAB IRT model improves predictive performance over marginal maximum likelihood. PLoS One 2022; 17:e0266350. [PMID: 35395055 PMCID: PMC8993025 DOI: 10.1371/journal.pone.0266350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 03/20/2022] [Indexed: 11/20/2022] Open
Abstract
Item response theory (IRT) is the statistical paradigm underlying a dominant family of generative probabilistic models for test responses, used to quantify traits in individuals relative to target populations. The graded response model (GRM) is a particular IRT model that is used for ordered polytomous test responses. Both the development and the application of the GRM and other IRT models require statistical decisions. For formulating these models (calibration), one needs to decide on methodologies for item selection, inference, and regularization. For applying these models (test scoring), one needs to make similar decisions, often prioritizing computational tractability and/or interpretability. In many applications, such as in the Work Disability Functional Assessment Battery (WD-FAB), tractability implies approximating an individual's score distribution using estimates of mean and variance, and obtaining that score conditional on only point estimates of the calibrated model. In this manuscript, we evaluate the calibration and scoring of models under this common use-case using Bayesian cross-validation. Applied to the WD-FAB responses collected for the National Institutes of Health, we assess the predictive power of implementations of the GRM based on their ability to yield, on validation sets of respondents, ability estimates that are most predictive of patterns of item responses. Our main finding indicates that regularized Bayesian calibration of the GRM outperforms the regularization-free empirical Bayesian procedure of marginal maximum likelihood. We also motivate the use of compactly supported priors in test scoring.
Collapse
Affiliation(s)
- Joshua C. Chang
- Rehabilitation Medicine Department, NIH Clinical Center, Bethesda, Maryland, United States of America
| | - Julia Porcino
- Rehabilitation Medicine Department, NIH Clinical Center, Bethesda, Maryland, United States of America
| | - Elizabeth K. Rasch
- Rehabilitation Medicine Department, NIH Clinical Center, Bethesda, Maryland, United States of America
| | - Larry Tang
- Rehabilitation Medicine Department, NIH Clinical Center, Bethesda, Maryland, United States of America
- National Center for Forensic Science, University of Central Florida, Orlando, Florida, United States of America
- Department of Statistics and Data Science, University of Central Florida, Orlando, Florida, United States of America
| |
Collapse
|
13
|
Abstract
Bayesian estimation of multidimensional item response theory (IRT) models in large data sets may come with impractical computational burdens when general-purpose Markov chain Monte Carlo (MCMC) samplers are employed. Variational Bayes (VB)—a method for approximating the posterior distribution—poses a potential remedy. Stan’s general-purpose VB algorithms have drastically improved the accessibility of VB methods for a wide psychometric audience. Using marginal maximum likelihood (MML) and MCMC as benchmarks, the present simulation study investigates the utility of Stan’s built-in VB function for estimating multidimensional IRT models with between-item dimensionality. VB yielded a marked speed-up in comparison to MCMC, but did not generally outperform MML in terms of run time. VB estimates were trustworthy only for item difficulties, while bias in item discriminations depended on the model’s dimensionality. Under realistic conditions of non-zero correlations between dimensions, VB correlation estimates were subject to severe bias. The practical relevance of performance differences is illustrated with data from PISA 2018. We conclude that in its current form, Stan’s built-in VB algorithm does not pose a viable alternative for estimating multidimensional IRT models.
Collapse
|
14
|
Santos‐Fernandez E, Mengersen K. Understanding the reliability of citizen science observational data using item response models. Methods Ecol Evol 2021. [DOI: 10.1111/2041-210x.13623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Edgar Santos‐Fernandez
- School of Mathematical Sciences Queensland University of Technology Brisbane Qld Australia
- Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS) Parkville Vic. Australia
| | - Kerrie Mengersen
- School of Mathematical Sciences Queensland University of Technology Brisbane Qld Australia
- Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS) Parkville Vic. Australia
| |
Collapse
|
15
|
Nouri F, Sadeghi M, Mohammadifard N, Roohafza H, Feizi A, Sarrafzadegan N. Longitudinal association between an overall diet quality index and latent profiles of cardiovascular risk factors: results from a population based 13-year follow up cohort study. Nutr Metab (Lond) 2021; 18:28. [PMID: 33691729 PMCID: PMC7948330 DOI: 10.1186/s12986-021-00560-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Accepted: 03/04/2021] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Cardiovascular diseases (CVDs) are associated with an unhealthy lifestyle, including poor diet. Indices reflecting the overall quality of diets are more effective than single food or nutrient-based approaches in clarifying the diet disease relationship. The present study aims to use latent variable modeling to examine the longitudinal joint relationships between the latent profiles of CVDs risk factors and the diet quality index (DQI). METHODS A total of 4390 Iranian adults aged 35 and older within the framework of the Isfahan Cohort Study were included in the current secondary analysis. DQI focused on food groups, including fast foods, sweets, vegetables, fruits, fats, and proteins, based on a validated food frequency questionnaire. The score of DQI has a range between 0 (indicating healthy and high diet quality) and 2 (indicating unhealthy and low diet quality). Blood pressure (BP), anthropometric measurements, blood glucose, serum lipids, and high-sensitivity C-Reactive Protein (hs-CRP) were measured according to standard protocols in 2001, 2007, and 2013 to evaluate the profiles of CVDs risk factors. A Bayesian Multidimensional Graded Responses Linear Mixed Model was used for data analysis. RESULTS At baseline, the participants' mean ± standard deviation age was 50.09 ± 11.21, and 49.5% of them were male. Three latent profiles of CVDs risk factors were derived: (1) Fit Pre-Metabolic Syndrome (FPMS) profile characterized by normal anthropometric indices and some impaired metabolic risk factors; (2) DysLipoproteinemia Central Obese (DLCO) profile with abdominal obesity and impaired low-density lipoprotein cholesterol as well as other normal risk factors; (3) Impaired Laboratory Inflammatory State (ILIS) profile with impaired high-density lipoprotein cholesterol and hs-CRP and other normal risk factors. In general, higher scores of the extracted latent profiles indicated more impaired function in the related risk factors. After controlling for various potential fixed and time-varying confounding variables, a significant positive longitudinal association was found between FPMS, DLCO, and ILIS profiles and DQI (β (95% CrI): 0.26 (0.03,0.51), 0.14 (0.01,0.27), and 0.24 (0.11,0.38), respectively), demonstrating that lower overall diet quality was associated with more impaired function of the related risk factors. CONCLUSIONS More adherence to a healthy quality diet is associated with lower levels of all emerging latent profiles of CVDs risk factors. Increasing the knowledge of the community about the importance of the quality of consumed foods may help to prevent CVDs. It is recommended that further investigations, particularly interventional studies, be conducted to confirm our results.
Collapse
Affiliation(s)
- Fatemeh Nouri
- Student Research Committee, School of Health, Isfahan University of Medical Sciences, Isfahan, Iran
- Department of Biostatistics and Epidemiology, School of Health, Isfahan University of Medical Sciences, Isfahan, Iran
- Isfahan Cardiovascular Research Center, Cardiovascular Research Institute, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Masoumeh Sadeghi
- Cardiac Rehabilitation Research Center, Cardiovascular Research Institute, Isfahan University of Medical Sciences, Hezar Jerib Street, 8174673461, Isfahan, Iran
| | - Noushin Mohammadifard
- Hypertension Research Center, Cardiovascular Research Institute, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Hamidreza Roohafza
- Heart Failure Research Center, Cardiovascular Research Institute, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Awat Feizi
- Department of Biostatistics and Epidemiology, School of Health, Isfahan University of Medical Sciences, Isfahan, Iran.
- Cardiac Rehabilitation Research Center, Cardiovascular Research Institute, Isfahan University of Medical Sciences, Hezar Jerib Street, 8174673461, Isfahan, Iran.
| | - Nizal Sarrafzadegan
- Isfahan Cardiovascular Research Center, Cardiovascular Research Institute, Isfahan University of Medical Sciences, Isfahan, Iran
| |
Collapse
|
16
|
Nishio M, Akasaka T, Sakamoto R, Togashi K. Bayesian Statistical Model of Item Response Theory in Observer Studies of Radiologists. Acad Radiol 2020; 27:e45-e54. [PMID: 31147237 DOI: 10.1016/j.acra.2019.04.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 04/16/2019] [Accepted: 04/16/2019] [Indexed: 11/29/2022]
Abstract
RATIONALE AND OBJECTIVES The purpose of this study was to validate a Bayesian statistical model of item response theory (IRT). IRT was used to evaluate a new modality (temporal subtraction, TS) in observer studies of radiologists, compared with a conventional modality (computed tomography). MATERIALS AND METHODS From previously published papers, we obtained two datasets of clinical observer studies of radiologists. Those studies used a multi-reader and multi-case paradigm to evaluate radiologists' detection abilities, primarily to determine if TS could enhance the detectability of bone metastasis or brain infarctions. We applied IRT to these studies' datasets using Stan software. Before applying IRT, the radiologists' responses were recorded as binaries for each case (1 = correct, 0 = incorrect). Effect of TS on detectability was evaluated by using our IRT model and calculating the 95% credible interval of the effect. RESULTS The mean, median, and 95% credible interval of the effect of TS were 0.913, 0.885, and 0.243-1.745 for the bone metastasis detection, and 2.524, 2.50, and 1.827-3.310, for the brain infarction detection. For both detection studies, the 95% credible intervals of the effect of TS did not include zero, indicating that TS significantly improved diagnostic ability. CONCLUSION Judgments based on the present study results were compatible with the two previous studies. Our study results demonstrated that the Bayesian statistical model of IRT could judge a new modality's usefulness.
Collapse
Affiliation(s)
- Mizuho Nishio
- Department of Diagnostic Imaging and Nuclear Medicine, Kyoto University Graduate School of Medicine, 54 Kawahara-cho, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan; Preemptive Medicine and Lifestyle-related Disease Research Center, Kyoto University Hospital, Kyoto, Japan.
| | - Thai Akasaka
- Department of Radiology, Osaka Red Cross Hospital, Osaka, Japan
| | - Ryo Sakamoto
- Department of Diagnostic Imaging and Nuclear Medicine, Kyoto University Graduate School of Medicine, 54 Kawahara-cho, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan; Preemptive Medicine and Lifestyle-related Disease Research Center, Kyoto University Hospital, Kyoto, Japan
| | - Kaori Togashi
- Department of Diagnostic Imaging and Nuclear Medicine, Kyoto University Graduate School of Medicine, 54 Kawahara-cho, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan
| |
Collapse
|
17
|
Bürkner PC. Analysing Standard Progressive Matrices (SPM-LS) with Bayesian Item Response Models. J Intell 2020; 8:E5. [PMID: 32033073 PMCID: PMC7151098 DOI: 10.3390/jintelligence8010005] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Revised: 01/27/2020] [Accepted: 01/29/2020] [Indexed: 11/16/2022] Open
Abstract
Raven's Standard Progressive Matrices (SPM) test and related matrix-based tests are widely applied measures of cognitive ability. Using Bayesian Item Response Theory (IRT) models, I reanalyzed data of an SPM short form proposed by Myszkowski and Storme (2018) and, at the same time, illustrate the application of these models. Results indicate that a three-parameter logistic (3PL) model is sufficient to describe participants dichotomous responses (correct vs. incorrect) while persons' ability parameters are quite robust across IRT models of varying complexity. These conclusions are in line with the original results of Myszkowski and Storme (2018). Using Bayesian as opposed to frequentist IRT models offered advantages in the estimation of more complex (i.e., 3-4PL) IRT models and provided more sensible and robust uncertainty estimates.
Collapse
|
18
|
Liu Y, Hu G, Cao L, Wang X, Chen MH. A Comparison of Monte Carlo Methods for Computing Marginal Likelihoods of Item Response Theory Models. J Korean Stat Soc 2020; 48:503-512. [PMID: 31929720 DOI: 10.1016/j.jkss.2019.04.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Nowadays, Bayesian methods are routinely used for estimating parameters of item response theory (IRT) models. However, the marginal likelihoods are still rarely used for comparing IRT models due to their complexity and a relatively high dimension of the model parameters. In this paper, we review Monte Carlo (MC) methods developed in the literature in recent years and provide a detailed development of how these methods are applied to the IRT models. In particular, we focus on the "best possible" implementation of these MC methods for the IRT models. These MC methods are used to compute the marginal likelihoods under the one-parameter IRT model with the logistic link (1PL model) and the two-parameter logistic IRT model (2PL model) for a real English Examination dataset. We further use the widely applicable information criterion (WAIC) and deviance information criterion (DIC) to compare the 1PL model and the 2PL model. The 2PL model is favored by all of these three Bayesian model comparison criteria for the English Examination data.
Collapse
Affiliation(s)
- Yang Liu
- Department of Statistics, University of Connecticut, Storrs, Connecticut, U.S.A
| | - Guanyu Hu
- Department of Statistics, University of Connecticut, Storrs, Connecticut, U.S.A
| | - Lei Cao
- Department of Statistics, University of Connecticut, Storrs, Connecticut, U.S.A
- School of Basic Science, Changchun University of Technology, Changchun, China
| | - Xiaojing Wang
- Department of Statistics, University of Connecticut, Storrs, Connecticut, U.S.A
| | - Ming-Hui Chen
- Department of Statistics, University of Connecticut, Storrs, Connecticut, U.S.A
| |
Collapse
|
19
|
da Silva MA, Huggins-Manley AC, Mazzon JA, Bazán JL. Bayesian estimation of a flexible bifactor generalized partial credit model to survey data. J Appl Stat 2019. [DOI: 10.1080/02664763.2019.1592125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Marcelo A. da Silva
- Researcher at Interinstitutional Graduate Program in Statistics, University of São Paulo and Federal University of São Carlos, São Carlos, Brazil
| | | | - José A. Mazzon
- Faculdade de Economia, Administração e Contabilidade, Universidade de São Paulo, São Paulo, Brazil
| | - Jorge L. Bazán
- Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo, São Paulo, Brazil
| |
Collapse
|
20
|
Klotzke K, Fox JP. Modeling Dependence Structures for Response Times in a Bayesian Framework. PSYCHOMETRIKA 2019; 84:649-672. [PMID: 31098935 PMCID: PMC6658586 DOI: 10.1007/s11336-019-09671-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Indexed: 06/09/2023]
Abstract
A multivariate generalization of the log-normal model for response times is proposed within an innovative Bayesian modeling framework. A novel Bayesian Covariance Structure Model (BCSM) is proposed, where the inclusion of random-effect variables is avoided, while their implied dependencies are modeled directly through an additive covariance structure. This makes it possible to jointly model complex dependencies due to for instance the test format (e.g., testlets, complex constructs), time limits, or features of digitally based assessments. A class of conjugate priors is proposed for the random-effect variance parameters in the BCSM framework. They give support to testing the presence of random effects, reduce boundary effects by allowing non-positive (co)variance parameters, and support accurate estimation even for very small true variance parameters. The conjugate priors under the BCSM lead to efficient posterior computation. Bayes factors and the Bayesian Information Criterion are discussed for the purpose of model selection in the new framework. In two simulation studies, a satisfying performance of the MCMC algorithm and of the Bayes factor is shown. In comparison with parameter expansion through a half-Cauchy prior, estimates of variance parameters close to zero show no bias and undercoverage of credible intervals is avoided. An empirical example showcases the utility of the BCSM for response times to test the influence of item presentation formats on the test performance of students in a Latin square experimental design.
Collapse
Affiliation(s)
- Konrad Klotzke
- University of Twente, P.O. Box 217, 7500 AE Enschede, The Netherlands
| | - Jean-Paul Fox
- University of Twente, P.O. Box 217, 7500 AE Enschede, The Netherlands
| |
Collapse
|
21
|
Lee YH, Hao J, Man K, Ou L. How Do Test Takers Interact With Simulation-Based Tasks? A Response-Time Perspective. Front Psychol 2019; 10:906. [PMID: 31068876 PMCID: PMC6491860 DOI: 10.3389/fpsyg.2019.00906] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2018] [Accepted: 04/04/2019] [Indexed: 11/13/2022] Open
Abstract
Many traditional educational assessments use multiple-choice items and constructed-response items to measure fundamental skills. Virtual performance assessments, such as game- or simulation-based assessments, are designed recently in the field of educational measurement to measure more integrated skills through the test takers' interactive behaviors within an assessment in a virtual environment. This paper presents a systematic timing study based on data collected from a simulation-based task designed recently at Educational Testing Service. The study is intended to understand the response times in complex simulation-based tasks so as to shed light on possible ways of leveraging response time information in designing, assembling, and scoring of simulation-based tasks. To achieve this objective, a series of five analyses were conducted to first understand the statistical properties of the timing data, and then investigate the relationship between the timing patterns and the test takers' performance on the items/task, demographics, motivation level, personality, and test-taking behaviors through use of different statistical approaches. We found that the five analyses complemented each other and revealed different useful timing aspects of this test-taker sample's behavioral features in the simulation-based task. The findings were also compared with notable existing results in the literature related to timing data.
Collapse
Affiliation(s)
- Yi-Hsuan Lee
- Educational Testing Service, Princeton, NJ, United States
| | - Jiangang Hao
- Educational Testing Service, Princeton, NJ, United States
| | - Kaiwen Man
- Department of Human Development and Quantitative Methodology, Measurement, Statistics and Evaluation Program, University of Maryland at College Park, College Park, MD, United States
| | - Lu Ou
- ACT Inc., Iowa City, IA, United States
| |
Collapse
|
22
|
Douglass J, Dykes L, Kelly‐Hope L, Gordon S, Leggat P, Aye NN, Win SS, Wai T, Win YY, Nwe TW, Graves P. Preventive chemotherapy reverses covert, lymphatic-associated tissue change in young people with lymphatic filariasis in Myanmar. Trop Med Int Health 2019; 24:463-476. [PMID: 30706585 PMCID: PMC6850631 DOI: 10.1111/tmi.13212] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
OBJECTIVES This longitudinal comparative study investigated the effect of preventive chemotherapy (PC) on covert tissue changes associated with lymphatic filariasis (LF) among young people living in an LF-endemic area in Myanmar. METHODS Tissue compressibility and extracellular free fluid in the lower limbs of people aged 10-21 years were measured using indurometry and bioimpedance spectroscopy (BIS). Baseline measures were taken in October 2014, annual mass drug administration (MDA) of PC was delivered in December, and in March 2015 further PC was offered to LF-positive cases who had missed MDA. Follow-up measures were taken in February and June 2015. RESULTS A total of 50 antigen-positive cases and 46 antigen-negative controls were included. Self-reported PC consumption was 60.1% during 2014 MDA and 66.2% overall. At second follow-up, 24 of 34 cases and 27 of 43 controls had consumed PC. Significant and clinically relevant between-group differences at baseline were not found post-PC. Bayesian linear mixed models showed a significant change in indurometer scores at both calves for antigen-positive cases who consumed any PC (dominant calf: -0.30 [95% CI -0.52, -0.07], P < 0.05 and non-dominant calf: -0.35 [95% CI -0.58, -0.12], P < 0.01). Changes in antigen-negative participants or those not consuming PC were not significant. CONCLUSION This study is the first attempt to use simple field-friendly tools to track fluid and tissue changes after treatment of asymptomatic people infected with LF. Results suggested that PC alone is sufficient to reverse covert lymphatic disturbance. Longer follow-up of larger cohorts is required to confirm these improvements and whether they persist over time. These findings should prompt increased efforts to overcome low PC coverage, which misses many infected young people, particularly males, who are unaware of their infection status, unmotivated to take PC and at risk of developing lymphoedema. Indurometry and BIS should be considered in assessment of lymphatic filariasis-related lymphedema.
Collapse
Affiliation(s)
- Janet Douglass
- Centre for Neglected Tropical DiseasesDepartment of Tropical Diseases BiologyLiverpool School of Tropical MedicineLiverpoolUK
- College of Public Health Medical and Veterinary SciencesDivision of Tropical Health and MedicineJames Cook UniversityTownsvilleQLDAustralia
- James Cook University WHO Collaborating Centre for Vector Borne and Neglected Tropical DiseasesTownsvilleQLDAustralia
| | - Lukah Dykes
- College of Medicine and Public HealthFlinders UniversityBedford ParkSAAustralia
| | - Louise Kelly‐Hope
- Centre for Neglected Tropical DiseasesDepartment of Tropical Diseases BiologyLiverpool School of Tropical MedicineLiverpoolUK
- James Cook University WHO Collaborating Centre for Vector Borne and Neglected Tropical DiseasesTownsvilleQLDAustralia
| | - Susan Gordon
- James Cook University WHO Collaborating Centre for Vector Borne and Neglected Tropical DiseasesTownsvilleQLDAustralia
- College of Nursing & Health SciencesFlinders UniversityBedford ParkSAAustralia
| | - Peter Leggat
- College of Public Health Medical and Veterinary SciencesDivision of Tropical Health and MedicineJames Cook UniversityTownsvilleQLDAustralia
- James Cook University WHO Collaborating Centre for Vector Borne and Neglected Tropical DiseasesTownsvilleQLDAustralia
| | - Ni Ni Aye
- Disease Control UnitDepartment of HealthMinistry of Health and SportsNay Pyi TawMyanmar
| | - San San Win
- Malaria UnitWorld Health Organization Country OfficeYangonMyanmar
| | - Tint Wai
- Regional Vector Borne Diseases Control UnitDepartment of Public HealthMinistry of Health and SportsMandalayMyanmar
| | - Yi Yi Win
- Health Literacy Promotion UnitDepartment of Public HealthMinistry of Health and SportsNay Pyi TawMyanmar
| | - Thet Wai Nwe
- Disease Control UnitDepartment of HealthMinistry of Health and SportsNay Pyi TawMyanmar
| | - Patricia Graves
- Centre for Neglected Tropical DiseasesDepartment of Tropical Diseases BiologyLiverpool School of Tropical MedicineLiverpoolUK
- James Cook University WHO Collaborating Centre for Vector Borne and Neglected Tropical DiseasesTownsvilleQLDAustralia
- College of Public Health, Medical and Veterinary SciencesDivision of Tropical Health and MedicineJames Cook UniversityCairnsQLDAustralia
| |
Collapse
|
23
|
Luo Y, Dimitrov DM. A Short Note on Obtaining Point Estimates of the IRT Ability Parameter With MCMC Estimation in Mplus: How Many Plausible Values Are Needed? EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 2019; 79:272-287. [PMID: 30911193 PMCID: PMC6425093 DOI: 10.1177/0013164418777569] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Plausible values can be used to either estimate population-level statistics or compute point estimates of latent variables. While it is well known that five plausible values are usually sufficient for accurate estimation of population-level statistics in large-scale surveys, the minimum number of plausible values needed to obtain accurate latent variable point estimates is unclear. This is especially relevant when an item response theory (IRT) model is estimated with MCMC (Markov chain Monte Carlo) methods in Mplus and point estimates of the IRT ability parameter are of interest, as Mplus only estimates the posterior distribution of each ability parameter. In order to obtain point estimates of the ability parameter, a number of plausible values can be drawn from the posterior distribution of each individual ability parameter and their mean (the posterior mean ability estimate) can be used as an individual ability point estimate. In this note, we conducted a simulation study to investigate how many plausible values were needed to obtain accurate posterior mean ability estimates. The results indicate that 20 is the minimum number of plausible values required to obtain point estimates of the IRT ability parameter that are comparable to marginal maximum likelihood estimation(MMLE)/expected a posteriori (EAP) estimates. A real dataset was used to demonstrate the comparison between MMLE/EAP point estimates and posterior mean ability estimates based on different number of plausible values.
Collapse
Affiliation(s)
- Yong Luo
- National Center for Assessment, Riyadh, Saudi Arabia
| | - Dimiter M. Dimitrov
- National Center for Assessment, Riyadh, Saudi Arabia
- George Mason University, Fairfax, VA, USA
| |
Collapse
|
24
|
Liu R, Jiang Z. Diagnostic Classification Models for Ordinal Item Responses. Front Psychol 2018; 9:2512. [PMID: 30618941 PMCID: PMC6297886 DOI: 10.3389/fpsyg.2018.02512] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2018] [Accepted: 11/26/2018] [Indexed: 11/13/2022] Open
Abstract
The purpose of this study is to develop and evaluate two diagnostic classification models (DCMs) for scoring ordinal item data. We first applied the proposed models to an operational dataset and compared their performance to an epitome of current polytomous DCMs in which the ordered data structure is ignored. Findings suggest that the much more parsimonious models that we proposed performed similarly to the current polytomous DCMs and offered useful item-level information in addition to option-level information. We then performed a small simulation study using the applied study condition and demonstrated that the proposed models can provide unbiased parameter estimates and correctly classify individuals. In practice, the proposed models can accommodate much smaller sample sizes than current polytomous DCMs and thus prove useful in many small-scale testing scenarios.
Collapse
Affiliation(s)
- Ren Liu
- Psychological Sciences, University of California, Merced, Merced, CA, United States
| | - Zhehan Jiang
- University Libraries, University of Alabama, Tuscaloosa, AL, United States
| |
Collapse
|
25
|
Using Hamiltonian Monte Carlo to estimate the log-linear cognitive diagnosis model via Stan. Behav Res Methods 2018; 51:651-662. [DOI: 10.3758/s13428-018-1069-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|