1
|
Tan Q, Wang D, Luo F, Cai Y, Tu D. Methods for online calibration of Q-matrix and item parameters for polytomous responses in cognitive diagnostic computerized adaptive testing. Behav Res Methods 2024:10.3758/s13428-024-02392-6. [PMID: 38689154 DOI: 10.3758/s13428-024-02392-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/06/2024] [Indexed: 05/02/2024]
Abstract
The ability to rapidly provide examinees with detailed and effective diagnostic information is a critical topic in psychology. Knowing what diagnostic criteria the examinees have met enables the practitioner to seek the solution to help them in a timely manner, and this can be achieved by cognitive diagnostic computerized adaptive testing (CD-CAT). However, the pervasive challenge of replenishing items in the CD-CAT item bank limits its practical application. Online calibration is a means to address item replenishment, but in CD-CAT, most existing online calibration methods that jointly calibrate the Q-matrix and item parameters of the new items are developed only for dichotomous responses and are time-consuming. Notably, previous studies pay no attention to polytomously scored items that are frequently observed in testing, even though they can offer additional evidence for the examinees' diagnosis. To fill this gap, we propose a SCAD-based method (SCAD-EM) to calibrate the Q-matrix and item parameters of the new items with polytomous response data in order to promote the application of CD-CAT in practice. The performance of the SCAD-EM was investigated in two comprehensive simulation studies and compared against the revised single-item estimation method (SIE-BIC). Results indicated that the SCAD-EM produces a higher calibration accuracy for the category-level Q-matrix and is computationally more efficient across all conditions, but it produces a lower calibration accuracy for the item-level Q-matrix. An empirical study further demonstrated the utility of the SCAD-EM and the SIE-BIC methods in calibrating new items with a real dataset. The advantages of the proposed method, its limitations, and possible future research directions are offered at the end.
Collapse
Affiliation(s)
- Qingrong Tan
- Department of Basic Psychology, College of Psychology, Army Medical University, Chongqing, China
| | - Daxun Wang
- School of Psychology, Jiangxi Normal University, Nanchang, China.
| | - Fen Luo
- College of Computer Information Engineering, Jiangxi Normal University, Nanchang, China.
| | - Yan Cai
- School of Psychology, Jiangxi Normal University, Nanchang, China
| | - Dongbo Tu
- School of Psychology, Jiangxi Normal University, Nanchang, China.
| |
Collapse
|
2
|
Tao J, Zhao W, Zhang Y, Guo Q, Min B, Xu X, Liu F. Cognitive diagnostic assessment: A Q-matrix constraint-based neural network method. Behav Res Methods 2024:10.3758/s13428-024-02404-5. [PMID: 38691218 DOI: 10.3758/s13428-024-02404-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/12/2024] [Indexed: 05/03/2024]
Abstract
Cognitive diagnosis is a crucial element of intelligent education that aims to assess the proficiency of specific skills or traits in students at a refined level and provide insights into their strengths and weaknesses for personalized learning. Researchers have developed numerous cognitive diagnostic models. However, previous studies indicate that diagnostic accuracy can be significantly influenced by the appropriateness of the model and the sample size. Thus, designing a general model that can adapt to different assumptions and sample sizes remains a considerable challenge. Artificial neural networks have been proposed as a promising approach in some studies. In this paper, we propose a cognitive diagnosis model of a neural network constrained by a Q-matrix and named QNN. Specifically, we employ the Q-matrix to determine the connections between neurons and the width and depth of the neural network. Moreover, to reduce the human effort in the training algorithm, we designed a self-organizing map-based cognitive diagnosis training framework called SOM-NN, which enables the QNN to be trained unsupervised. Extensive experimental results on simulated and real datasets demonstrate that our approaches are effective in both accuracy and interpretability. Notably, under unsupervised conditions, our approach has significant advantages on small sample datasets with high levels of guessing and slipping, especially on the pattern-wise agreement rates. This work bridges the gap between psychometrics and machine learning and provides a realistic and implementable reference solution for classroom instructional assessment and the cold start of personalized and adaptive assessment systems.
Collapse
Affiliation(s)
- Jinhong Tao
- School of Information Science and Technology, Northeast Normal University, 2555 Jingyue Jilin, 130117, Changchun, China
| | - Wei Zhao
- School of Information Science and Technology, Northeast Normal University, 2555 Jingyue Jilin, 130117, Changchun, China.
| | - Yuliu Zhang
- Center for Studies of Education and Psychology of Ethnic Minorities in Southwest China, Southwest University, 2 Tiansheng, Beibei, 400715, Chongqing, China
| | - Qian Guo
- School of Information Science and Technology, Northeast Normal University, 2555 Jingyue Jilin, 130117, Changchun, China
| | - Baocui Min
- School of Information Science and Technology, Northeast Normal University, 2555 Jingyue Jilin, 130117, Changchun, China
- Instruction Center of Practice Skills for Primary and Secondary Schools in Dalian China, 52 Qingchun, 116300, Dalian, China
| | - Xiaoqing Xu
- School of Information Science and Technology, Northeast Normal University, 2555 Jingyue Jilin, 130117, Changchun, China
| | - Fengjuan Liu
- School of Information Science and Technology, Northeast Normal University, 2555 Jingyue Jilin, 130117, Changchun, China.
- School of Educational Science, Shaanxi University of Technology, 1 Dongyi, Hanzhong, 723001, Shaanxi, China.
| |
Collapse
|
3
|
Xu X, Fang G, Guo J, Ying Z, Zhang S. Diagnostic Classification Models for Testlets: Methods and Theory. Psychometrika 2024:10.1007/s11336-024-09962-9. [PMID: 38528268 DOI: 10.1007/s11336-024-09962-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Indexed: 03/27/2024]
Abstract
Diagnostic classification models (DCMs) have seen wide applications in educational and psychological measurement, especially in formative assessment. DCMs in the presence of testlets have been studied in recent literature. A key ingredient in the statistical modeling and analysis of testlet-based DCMs is the superposition of two latent structures, the attribute profile and the testlet effect. This paper extends the standard testlet DINA (T-DINA) model to accommodate the potential correlation between the two latent structures. Model identifiability is studied and a set of sufficient conditions are proposed. As a byproduct, the identifiability of the standard T-DINA is also established. The proposed model is applied to a dataset from the 2015 Programme for International Student Assessment. Comparisons are made with DINA and T-DINA, showing that there is substantial improvement in terms of the goodness of fit. Simulations are conducted to assess the performance of the new method under various settings.
Collapse
Affiliation(s)
- Xin Xu
- Beijing Normal University, Beijing, China
| | | | - Jinxin Guo
- Minzu University of China, Beijing, China
| | | | - Susu Zhang
- University of Illinois Urbana-Champaign, Champaign, USA.
| |
Collapse
|
4
|
Culpepper SA. A Note on Weaker Conditions for Identifying Restricted Latent Class Models for Binary Responses. Psychometrika 2023; 88:158-174. [PMID: 35896935 DOI: 10.1007/s11336-022-09875-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2021] [Revised: 06/10/2022] [Indexed: 06/15/2023]
Abstract
Restricted latent class models (RLCMs) are an important class of methods that provide researchers and practitioners in the educational, psychological, and behavioral sciences with fine-grained diagnostic information to guide interventions. Recent research established sufficient conditions for identifying RLCM parameters. A current challenge that limits widespread application of RLCMs is that existing identifiability conditions may be too restrictive for some practical settings. In this paper we establish a weaker condition for identifying RLCM parameters for multivariate binary data. Although the new results weaken identifiability conditions for general RLCMs, the new results do not relax existing necessary and sufficient conditions for the simpler DINA/DINO models. Theoretically, we introduce a new form of latent structure completeness, referred to as dyad-completeness, and prove identification by applying Kruskal's Theorem for the uniqueness of three-way arrays. The new condition is more likely satisfied in applied research, and the results provide researchers and test-developers with guidance for designing diagnostic instruments.
Collapse
Affiliation(s)
- Steven Andrew Culpepper
- Department of Statistics, University of Illinois at Urbana-Champaign, 605 E Springfield Ave, Champaign, IL61820, USA.
| |
Collapse
|
5
|
Ma C, Ouyang J, Xu G. Learning Latent and Hierarchical Structures in Cognitive Diagnosis Models. Psychometrika 2023; 87:1343-1360. [PMID: 35596101 DOI: 10.1007/s11336-022-09852-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 10/16/2021] [Accepted: 04/07/2022] [Indexed: 05/28/2023]
Abstract
Cognitive Diagnosis Models (CDMs) are a special family of discrete latent variable models that are widely used in educational and psychological measurement. A key component of CDMs is the Q-matrix characterizing the dependence structure between the items and the latent attributes. Additionally, researchers also assume in many applications certain hierarchical structures among the latent attributes to characterize their dependence. In most CDM applications, the attribute-attribute hierarchical structures, the item-attribute Q-matrix, the item-level diagnostic models, as well as the number of latent attributes, need to be fully or partially pre-specified, which however may be subjective and misspecified as noted by many recent studies. This paper considers the problem of jointly learning these latent and hierarchical structures in CDMs from observed data with minimal model assumptions. Specifically, a penalized likelihood approach is proposed to select the number of attributes and estimate the latent and hierarchical structures simultaneously. An expectation-maximization (EM) algorithm is developed for efficient computation, and statistical consistency theory is also established under mild conditions. The good performance of the proposed method is illustrated by simulation studies and real data applications in educational assessment.
Collapse
Affiliation(s)
- Chenchen Ma
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA
| | - Jing Ouyang
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA
| | - Gongjun Xu
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
6
|
Xiong J, Luo Z, Luo G, Yu X. Data-driven Q-matrix learning based on Boolean matrix factorization in cognitive diagnostic assessment. Br J Math Stat Psychol 2022; 75:638-667. [PMID: 35578396 DOI: 10.1111/bmsp.12271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 03/22/2022] [Indexed: 06/15/2023]
Abstract
Attributes and the Q-matrix are the central components for cognitive diagnostic assessment, and are usually defined by domain experts. However, it is challenging and time consuming for experts to specify the attributes and Q-matrix manually. Thus, there is an urgent need for an automatic and intelligent means to address this concern. This paper presents a new data-driven approach for learning the Q-matrix from response data. By constructing a statistical index and a heuristic algorithm based on Boolean matrix factorization, the response matrix is decomposed into the Boolean product of the Q-matrix and the attribute mastery patterns. The feasibility of the proposed approach is evaluated using simulated data generated under various conditions. A real data example is also presented to demonstrate the usefulness of the proposed approach.
Collapse
Affiliation(s)
- Jianhua Xiong
- School of Psychology, Jiangxi Normal University, Nanchang, China
- School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, China
| | - Zhaosheng Luo
- School of Psychology, Jiangxi Normal University, Nanchang, China
| | - Guanzhong Luo
- School of Psychology, Jiangxi Normal University, Nanchang, China
| | - Xiaofeng Yu
- School of Psychology, Jiangxi Normal University, Nanchang, China
| |
Collapse
|
7
|
Abstract
Estimation of the large Q-matrix in cognitive diagnosis models (CDMs) with many items and latent attributes from observational data has been a huge challenge due to its high computational cost. Borrowing ideas from deep learning literature, we propose to learn the large Q-matrix by restricted Boltzmann machines (RBMs) to overcome the computational difficulties. In this paper, key relationships between RBMs and CDMs are identified. Consistent and robust learning of the Q-matrix in various CDMs is shown to be valid under certain conditions. Our simulation studies under different CDM settings show that RBMs not only outperform the existing methods in terms of learning speed, but also maintain good recovery accuracy of the Q-matrix. In the end, we illustrate the applicability and effectiveness of our method through a TIMSS mathematics data set.
Collapse
Affiliation(s)
- Chengcheng Li
- Department of Statistics, University of Michigan, Ann Arbor, USA
| | - Chenchen Ma
- Department of Statistics, University of Michigan, Ann Arbor, USA
| | - Gongjun Xu
- Department of Statistics, University of Michigan, Ann Arbor, USA.
| |
Collapse
|
8
|
Nájera P, Sorrel MA, de la Torre J, Abad FJ. Balancing fit and parsimony to improve Q-matrix validation. Br J Math Stat Psychol 2021; 74 Suppl 1:110-130. [PMID: 33231301 DOI: 10.1111/bmsp.12228] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Revised: 09/14/2020] [Indexed: 06/11/2023]
Abstract
The Q-matrix identifies the subset of attributes measured by each item in the cognitive diagnosis modelling framework. Usually constructed by domain experts, the Q-matrix might contain some misspecifications, disrupting classification accuracy. Empirical Q-matrix validation methods such as the general discrimination index (GDI) and Wald have shown promising results in addressing this problem. However, a cut-off point is used in both methods, which might be suboptimal. To address this limitation, the Hull method is proposed and evaluated in the present study. This method aims to find the optimal balance between fit and parsimony, and it is flexible enough to be used either with a measure of item discrimination (the proportion of variance accounted for, PVAF) or a coefficient of determination (pseudo-R2 ). Results from a simulation study showed that the Hull method consistently showed the best performance and shortest computation time, especially when used with the PVAF. The Wald method also performed very well overall, while the GDI method obtained poor results when the number of attributes was high. The absence of a cut-off point provides greater flexibility to the Hull method, and it places it as a comprehensive solution to the Q-matrix specification problem in applied settings. This proposal is illustrated using real data.
Collapse
Affiliation(s)
- Pablo Nájera
- Department of Social Psychology and Methodology, Autonomous University of Madrid, Spain
| | - Miguel A Sorrel
- Department of Social Psychology and Methodology, Autonomous University of Madrid, Spain
| | - Jimmy de la Torre
- Faculty of Education, The University of Hong Kong, Pokfulam, Hong Kong
| | - Francisco José Abad
- Department of Social Psychology and Methodology, Autonomous University of Madrid, Spain
| |
Collapse
|
9
|
Tseng MC, Wang WC. The Q-Matrix Anchored Mixture Rasch Model. Front Psychol 2021; 12:564976. [PMID: 33746812 PMCID: PMC7969527 DOI: 10.3389/fpsyg.2021.564976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 02/12/2021] [Indexed: 11/30/2022] Open
Abstract
Mixture item response theory (IRT) models include a mixture of latent subpopulations such that there are qualitative differences between subgroups but within each subpopulation the measure model based on a continuous latent variable holds. Under this modeling framework, students can be characterized by both their location on a continuous latent variable and by their latent class membership according to Students' responses. It is important to identify anchor items for constructing a common scale between latent classes beforehand under the mixture IRT framework. Then, all model parameters across latent classes can be estimated on the common scale. In the study, we proposed Q-matrix anchored mixture Rasch model (QAMRM), including a Q-matrix and the traditional mixture Rasch model. The Q-matrix in QAMRM can use class invariant items to place all model parameter estimates from different latent classes on a common scale regardless of the ability distribution. A simulation study was conducted, and it was found that the estimated parameters of the QAMRM recovered fairly well. A real dataset from the Certificate of Proficiency in English was analyzed with the QAMRM, LCDM. It was found the QAMRM outperformed the LCDM in terms of model fit indices.
Collapse
Affiliation(s)
| | - Wen-Chung Wang
- The Education University of Hong Kong, Tai Po, Hong Kong
| |
Collapse
|
10
|
Nájera P, Sorrel MA, de la Torre J, Abad FJ. Improving Robustness in Q-Matrix Validation Using an Iterative and Dynamic Procedure. Appl Psychol Meas 2020; 44:431-446. [PMID: 32788815 PMCID: PMC7383688 DOI: 10.1177/0146621620909904] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
In the context of cognitive diagnosis models (CDMs), a Q-matrix reflects the correspondence between attributes and items. The Q-matrix construction process is typically subjective in nature, which may lead to misspecifications. All this can negatively affect the attribute classification accuracy. In response, several methods of empirical Q-matrix validation have been developed. The general discrimination index (GDI) method has some relevant advantages such as the possibility of being applied to several CDMs. However, the estimation of the GDI relies on the estimation of the latent group sizes and success probabilities, which is made with the original (possibly misspecified) Q-matrix. This can be a problem, especially in those situations in which there is a great uncertainty about the Q-matrix specification. To address this, the present study investigates the iterative application of the GDI method, where only one item is modified at each step of the iterative procedure, and the required cutoff is updated considering the new parameter estimates. A simulation study was conducted to test the performance of the new procedure. Results showed that the performance of the GDI method improved when the application was iterative at the item level and an appropriate cutoff point was used. This was most notable when the original Q-matrix misspecification rate was high, where the proposed procedure performed better 96.5% of the times. The results are illustrated using Tatsuoka's fraction-subtraction data set.
Collapse
|
11
|
Abstract
Parametric likelihood estimation is the prevailing method for fitting cognitive diagnosis models-also called diagnostic classification models (DCMs). Nonparametric concepts and methods that do not rely on a parametric statistical model have been proposed for cognitive diagnosis. These methods are particularly useful when sample sizes are small. The general nonparametric classification (GNPC) method for assigning examinees to proficiency classes can accommodate assessment data conforming to any diagnostic classification model that describes the probability of a correct item response as an increasing function of the number of required attributes mastered by an examinee (known as the "monotonicity assumption"). Hence, the GNPC method can be used with any model that can be represented as a general DCM. However, the statistical properties of the estimator of examinees' proficiency class are currently unknown. In this article, the consistency theory of the GNPC proficiency-class estimator is developed and its statistical consistency is proven.
Collapse
Affiliation(s)
- Chia-Yi Chiu
- Rutgers, The State University of New Jersey, New Brunswick, NJ, USA.
| | | |
Collapse
|
12
|
da Silva MA, Liu R, Huggins-Manley AC, Bazán JL. Incorporating the Q-Matrix Into Multidimensional Item Response Theory Models. Educ Psychol Meas 2019; 79:665-687. [PMID: 32655178 PMCID: PMC7328237 DOI: 10.1177/0013164418814898] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Multidimensional item response theory (MIRT) models use data from individual item responses to estimate multiple latent traits of interest, making them useful in educational and psychological measurement, among other areas. When MIRT models are applied in practice, it is not uncommon to see that some items are designed to measure all latent traits while other items may only measure one or two traits. In order to facilitate a clear expression of which items measure which traits and formulate such relationships as a math function in MIRT models, we applied the concept of the Q-matrix commonly used in diagnostic classification models to MIRT models. In this study, we introduced how to incorporate a Q-matrix into an existing MIRT model, and demonstrated benefits of the proposed hybrid model through two simulation studies and an applied study. In addition, we showed the relative ease in modeling educational and psychological data through a Bayesian approach via the NUTS algorithm.
Collapse
Affiliation(s)
- Marcelo A. da Silva
- University of São Paulo, São Paulo, Brazil
- Federal University of São Carlos, São Carlos, Brazil
| | - Ren Liu
- University of California, Merced, CA, USA
| | | | | |
Collapse
|
13
|
Abstract
Cognitive diagnosis models (CDMs) are latent class multidimensional statistical models that help classify people accurately by using a set of discrete latent variables, commonly referred to as attributes. These models require a Q-matrix that indicates the attributes involved in each item. A potential problem is that the Q-matrix construction process, typically performed by domain experts, is subjective in nature. This might lead to the existence of Q-matrix misspecifications that can lead to inaccurate classifications. For this reason, several empirical Q-matrix validation methods have been developed in the recent years. de la Torre and Chiu proposed one of the most popular methods, based on a discrimination index. However, some questions related to the usefulness of the method with empirical data remained open due the restricted number of conditions examined, and the use of a unique cutoff point (EPS) regardless of the data conditions. This article includes two simulation studies to test this validation method under a wider range of conditions, with the purpose of providing it with a higher generalization, and to empirically determine the most suitable EPS considering the data conditions. Results show a good overall performance of the method, the relevance of the different studied factors, and that using a single indiscriminate EPS is not acceptable. Specific guidelines for selecting an appropriate EPS are provided in the discussion.
Collapse
|
14
|
Gu Y, Xu G. The Sufficient and Necessary Condition for the Identifiability and Estimability of the DINA Model. Psychometrika 2019; 84:468-483. [PMID: 29728918 DOI: 10.1007/s11336-018-9619-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2017] [Revised: 04/22/2018] [Indexed: 05/28/2023]
Abstract
Cognitive diagnosis models (CDMs) are useful statistical tools in cognitive diagnosis assessment. However, as many other latent variable models, the CDMs often suffer from the non-identifiability issue. This work gives the sufficient and necessary condition for identifiability of the basic DINA model, which not only addresses the open problem in Xu and Zhang (Psychometrika 81:625-649, 2016) on the minimal requirement for identifiability, but also sheds light on the study of more general CDMs, which often cover DINA as a submodel. Moreover, we show the identifiability condition ensures the consistent estimation of the model parameters. From a practical perspective, the identifiability condition only depends on the Q-matrix structure and is easy to verify, which would provide a guideline for designing statistically valid and estimable cognitive diagnosis tests.
Collapse
Affiliation(s)
- Yuqi Gu
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA
| | - Gongjun Xu
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
15
|
Abstract
With the purpose to assist the subject matter experts in specifying their Q-matrices, the authors used expectation-maximization (EM)-based algorithm to investigate three alternative Q-matrix validation methods, namely, the maximum likelihood estimation (MLE), the marginal maximum likelihood estimation (MMLE), and the intersection and difference (ID) method. Their efficiency was compared, respectively, with that of the sequential EM-based δ method and its extension (ς2), the γ method, and the nonparametric method in terms of correct recovery rate, true negative rate, and true positive rate under the deterministic-inputs, noisy "and" gate (DINA) model and the reduced reparameterized unified model (rRUM). Simulation results showed that for the rRUM, the MLE performed better for low-quality tests, whereas the MMLE worked better for high-quality tests. For the DINA model, the ID method tended to produce better quality Q-matrix estimates than other methods for large sample sizes (i.e., 500 or 1,000). In addition, the Q-matrix was more precisely estimated under the discrete uniform distribution than under the multivariate normal threshold model for all the above methods. On average, the ς2 and ID method with higher true negative rates are better for correcting misspecified Q-entries, whereas the MLE with higher true positive rates is better for retaining the correct Q-entries. Experiment results on real data set confirmed the effectiveness of the MLE.
Collapse
Affiliation(s)
- Wenyi Wang
- Jiangxi Normal University, Jiangxi,
China
| | | | | | - Yaru Meng
- Xi’an Jiaotong University, Shaanxi,
China
| | - Canxi Cao
- University of Illinois at
Urbana–Champaign, IL, USA
| | - Yongjing Jie
- University of Illinois at
Urbana–Champaign, IL, USA
| |
Collapse
|
16
|
Abstract
The recent surge of interests in cognitive assessment has led to the development of cognitive diagnosis models. Central to many such models is a specification of the Q-matrix, which relates items to latent attributes that have natural interpretations. In practice, the Q-matrix is usually constructed subjectively by the test designers. This could lead to misspecification, which could result in lack of fit of the underlying statistical model. To test possible misspecification of the Q-matrix, traditional goodness of fit tests, such as the Chi-square test and the likelihood ratio test, may not be applied straightforwardly due to the large number of possible response patterns. To address this problem, this paper proposes a new statistical method to test the goodness fit of the Q-matrix, by constructing test statistics that measure the consistency between a provisional Q-matrix and the observed data for a general family of cognitive diagnosis models. Limiting distributions of the test statistics are derived under the null hypothesis that can be used for obtaining the test p-values. Simulation studies as well as a real data example are presented to demonstrate the usefulness of the proposed method.
Collapse
Affiliation(s)
- Yuqi Gu
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA
| | - Jingchen Liu
- Department of Statistics, Columbia University, New York City, NY, USA
| | - Gongjun Xu
- Department of Statistics, University of Michigan, 456 West Hall, 1085 South University, Ann Arbor, MI, 48109, USA.
| | - Zhiliang Ying
- Department of Statistics, Columbia University, New York City, NY, USA
| |
Collapse
|
17
|
Abstract
This commentary concerns the theoretical properties of the estimation procedure in "A General Method of Empirical Q-matrix Validation" by Jimmy de la Torre and Chia-Yi Chiu. It raises the consistency issue of the estimator, proposes some modifications to it, and also makes some conjectures.
Collapse
Affiliation(s)
- Jingchen Liu
- Department of Statistics, Columbia University, New York, NY, USA.
| |
Collapse
|
18
|
Abstract
This rejoinder responds to the commentary by Liu (Psychometrika, 2015) entitled "On the consistency of Q-matrix estimation: A commentary" on the paper "A general method of empirical Q-matrix validation" by de la Torre and Chiu (Psychometrika, 2015). It discusses and addresses three concerns raised in the commentary, namely the estimation accuracy when a provisional Q-matrix is used, the consistency of the Q-matrix estimator, and the computational efficiency of the proposed method.
Collapse
Affiliation(s)
- Jimmy de la Torre
- Department of Educational Psychology, Rutgers, The State University of New Jersey, 10 Seminary Place, New Brunswick, NJ, 08901 , USA.
| | - Chia-Yi Chiu
- Department of Educational Psychology, Rutgers, The State University of New Jersey, 10 Seminary Place, New Brunswick, NJ, 08901 , USA
| |
Collapse
|
19
|
Abstract
Q-matrix validation is of increasing concern due to the significance and subjective tendency of Q-matrix construction in the modeling process. This research proposes a residual-based approach to empirically validate Q-matrix specification based on a combination of fit measures. The approach separates Q-matrix validation into four logical steps, including the test-level evaluation, possible distinction between attribute-level and item-level misspecifications, identification of the hit item, and fit information to aid in item adjustment. Through simulation studies and real-life examples, it is shown that the misspecified items can be detected as the hit item and adjusted sequentially when the misspecification occurs at the item level or at random. Adjustment can be based on the maximum reduction of the test-level measures. When adjustment of individual items tends to be useless, attribute-level misspecification is of concern. The approach can accommodate a variety of cognitive diagnosis models (CDMs) and be extended to cover other response formats.
Collapse
|
20
|
Liu R, Huggins-Manley AC, Bradshaw L. The Impact of Q-Matrix Designs on Diagnostic Classification Accuracy in the Presence of Attribute Hierarchies. Educ Psychol Meas 2017; 77:220-240. [PMID: 29795911 PMCID: PMC5965543 DOI: 10.1177/0013164416645636] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
There is an increasing demand for assessments that can provide more fine-grained information about examinees. In response to the demand, diagnostic measurement provides students with feedback on their strengths and weaknesses on specific skills by classifying them into mastery or nonmastery attribute categories. These attributes often form a hierarchical structure because student learning and development is a sequential process where many skills build on others. However, it remains to be seen if we can use information from the attribute structure and work that into the design of the diagnostic tests. The purpose of this study is to introduce three approaches of Q-matrix design and investigate their impact on classification results under different attribute structures. Results indicate that the adjacent approach provides higher accuracy in a shorter test length when compared with other Q-matrix design approaches. This study provides researchers and practitioners guidance on how to design the Q-matrix in diagnostic tests, which are in high demand from educators.
Collapse
Affiliation(s)
- Ren Liu
- University of Florida, Gainesville, FL, USA
| | | | | |
Collapse
|
21
|
Köhn HF, Chiu CY. A Procedure for Assessing the Completeness of the Q-Matrices of Cognitively Diagnostic Tests. Psychometrika 2017; 82:112-132. [PMID: 27714544 DOI: 10.1007/s11336-016-9536-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2014] [Revised: 09/03/2016] [Indexed: 06/06/2023]
Abstract
The Q-matrix of a cognitively diagnostic test is said to be complete if it allows for the identification of all possible proficiency classes among examinees. Completeness of the Q-matrix is therefore a key requirement for any cognitively diagnostic test. However, completeness of the Q-matrix is often difficult to establish, especially, for tests with a large number of items involving multiple attributes. As an additional complication, completeness is not an intrinsic property of the Q-matrix, but can only be assessed in reference to a specific cognitive diagnosis model (CDM) supposed to underly the data-that is, the Q-matrix of a given test can be complete for one model but incomplete for another. In this article, a method is presented for assessing whether a given Q-matrix is complete for a given CDM. The proposed procedure relies on the theoretical framework of general CDMs and is therefore legitimate for CDMs that can be reparameterized as a general CDM.
Collapse
Affiliation(s)
| | - Chia-Yi Chiu
- Rutgers, The State University of New Jersey, New Brunswick, NJ, USA.
| |
Collapse
|
22
|
Xu G, Wang C, Shang Z. On initial item selection in cognitive diagnostic computerized adaptive testing. Br J Math Stat Psychol 2016; 69:291-315. [PMID: 27435032 DOI: 10.1111/bmsp.12072] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2015] [Revised: 05/02/2016] [Indexed: 06/06/2023]
Abstract
There has recently been much interest in computerized adaptive testing (CAT) for cognitive diagnosis. While there exist various item selection criteria and different asymptotically optimal designs, these are mostly constructed based on the asymptotic theory assuming the test length goes to infinity. In practice, with limited test lengths, the desired asymptotic optimality may not always apply, and there are few studies in the literature concerning the optimal design of finite items. Related questions, such as how many items we need in order to be able to identify the attribute pattern of an examinee and what types of initial items provide the optimal classification results, are still open. This paper aims to answer these questions by providing non-asymptotic theory of the optimal selection of initial items in cognitive diagnostic CAT. In particular, for the optimal design, we provide necessary and sufficient conditions for the Q-matrix structure of the initial items. The theoretical development is suitable for a general family of cognitive diagnostic models. The results not only provide a guideline for the design of optimal item selection procedures, but also may be applied to guide item bank construction.
Collapse
Affiliation(s)
- Gongjun Xu
- School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA.
| | - Chun Wang
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota, USA
| | - Zhuoran Shang
- School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
23
|
Ma W, de la Torre J. A sequential cognitive diagnosis model for polytomous responses. Br J Math Stat Psychol 2016; 69:253-275. [PMID: 27317397 DOI: 10.1111/bmsp.12070] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Revised: 04/20/2016] [Indexed: 06/06/2023]
Abstract
This paper proposes a general polytomous cognitive diagnosis model for a special type of graded responses, where item categories are attained in a sequential manner, and associated with some attributes explicitly. To relate categories to attributes, a category-level Q-matrix is used. When the attribute and category association is specified a priori, the proposed model has the flexibility to allow different cognitive processes (e.g., conjunctive, disjunctive) to be modelled at different categories within a single item. This model can be extended for items where categories cannot be explicitly linked to attributes, and for items with unordered categories. The feasibility of the proposed model is examined using simulated data. The proposed model is illustrated using the data from the Trends in International Mathematics and Science Study 2007 assessment.
Collapse
Affiliation(s)
- Wenchao Ma
- Rutgers, The State University of New Jersey, New Brunswick, New Jersey, USA.
| | - Jimmy de la Torre
- Rutgers, The State University of New Jersey, New Brunswick, New Jersey, USA
| |
Collapse
|
24
|
Abstract
Diagnostic classification models (DCMs) are important statistical tools in cognitive diagnosis. In this paper, we consider the issue of their identifiability. In particular, we focus on one basic and popular model, the DINA model. We propose sufficient and necessary conditions under which the model parameters are identifiable from the data. The consequences, in terms of the consistency of parameter estimates, of fulfilling or failing to fulfill these conditions are illustrated via simulation. The results can be easily extended to the DINO model through the duality of the DINA and DINO models. Moreover, the proposed theoretical framework could be applied to study the identifiability issue of other DCMs.
Collapse
Affiliation(s)
- Gongjun Xu
- School of Statistics, University of Minnesota, 313 Ford Hall, 224 Church St SE, Minneapolis, MN, 55455 , USA.
| | | |
Collapse
|
25
|
Abstract
In contrast to unidimensional item response models that postulate a single underlying proficiency, cognitive diagnosis models (CDMs) posit multiple, discrete skills or attributes, thus allowing CDMs to provide a finer-grained assessment of examinees' test performance. A common component of CDMs for specifying the attributes required for each item is the Q-matrix. Although construction of Q-matrix is typically performed by domain experts, it nonetheless, to a large extent, remains a subjective process, and misspecifications in the Q-matrix, if left unchecked, can have important practical implications. To address this concern, this paper proposes a discrimination index that can be used with a wide class of CDM subsumed by the generalized deterministic input, noisy "and" gate model to empirically validate the Q-matrix specifications by identifying and replacing misspecified entries in the Q-matrix. The rationale for using the index as the basis for a proposed validation method is provided in the form of mathematical proofs to several relevant lemmas and a theorem. The feasibility of the proposed method was examined using simulated data generated under various conditions. The proposed method is illustrated using fraction subtraction data.
Collapse
Affiliation(s)
- Jimmy de la Torre
- Department of Educational Psychology, Rutgers, The State University of New Jersey, 10 Seminary Place, New Brunswick, NJ, 08901, USA.
| | - Chia-Yi Chiu
- Department of Educational Psychology, Rutgers, The State University of New Jersey, 10 Seminary Place, New Brunswick, NJ, 08901, USA
| |
Collapse
|
26
|
Madison MJ, Bradshaw LP. The Effects of Q-Matrix Design on Classification Accuracy in the Log-Linear Cognitive Diagnosis Model. Educ Psychol Meas 2015; 75:491-511. [PMID: 29795830 PMCID: PMC5965638 DOI: 10.1177/0013164414539162] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Diagnostic classification models are psychometric models that aim to classify examinees according to their mastery or non-mastery of specified latent characteristics. These models are well-suited for providing diagnostic feedback on educational assessments because of their practical efficiency and increased reliability when compared with other multidimensional measurement models. A priori specifications of which latent characteristics or attributes are measured by each item are a core element of the diagnostic assessment design. This item-attribute alignment, expressed in a Q-matrix, precedes and supports any inference resulting from the application of the diagnostic classification model. This study investigates the effects of Q-matrix design on classification accuracy for the log-linear cognitive diagnosis model. Results indicate that classification accuracy, reliability, and convergence rates improve when the Q-matrix contains isolated information from each measured attribute.
Collapse
|
27
|
Popescu AA, Harper AL, Trick M, Bancroft I, Huber KT. A novel and fast approach for population structure inference using kernel-PCA and optimization. Genetics 2014; 198:1421-31. [PMID: 25326237 DOI: 10.1534/genetics.114.171314] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Population structure is a confounding factor in genome-wide association studies, increasing the rate of false positive associations. To correct for it, several model-based algorithms such as ADMIXTURE and STRUCTURE have been proposed. These tend to suffer from the fact that they have a considerable computational burden, limiting their applicability when used with large datasets, such as those produced by next generation sequencing techniques. To address this, nonmodel based approaches such as sparse nonnegative matrix factorization (sNMF) and EIGENSTRAT have been proposed, which scale better with larger data. Here we present a novel nonmodel-based approach, population structure inference using kernel-PCA and optimization (PSIKO), which is based on a unique combination of linear kernel-PCA and least-squares optimization and allows for the inference of admixture coefficients, principal components, and number of founder populations of a dataset. PSIKO has been compared against existing leading methods on a variety of simulation scenarios, as well as on real biological data. We found that in addition to producing results of the same quality as other tested methods, PSIKO scales extremely well with dataset size, being considerably (up to 30 times) faster for longer sequences than even state-of-the-art methods such as sNMF. PSIKO and accompanying manual are freely available at https://www.uea.ac.uk/computing/psiko.
Collapse
|
28
|
Abstract
Cognitive assessment is a growing area in psychological and educational measurement, where tests are given to assess mastery/deficiency of attributes or skills. A key issue is the correct identification of attributes associated with items in a test. In this paper, we set up a mathematical framework under which theoretical properties may be discussed. We establish sufficient conditions to ensure that the attributes required by each item are learnable from the data.
Collapse
Affiliation(s)
- Jingchen Liu
- Department of Statistics, Columbia University, 1255 Amsterdam Avenue New York, NY 10027, USA
| | - Gongjun Xu
- Department of Statistics, Columbia University, 1255 Amsterdam Avenue New York, NY 10027, USA
| | - Zhiliang Ying
- Department of Statistics, Columbia University, 1255 Amsterdam Avenue New York, NY 10027, USA
| |
Collapse
|