1
|
Wang D, Ma W, Cai Y, Tu D. A general nonparametric classification method for multiple strategies in cognitive diagnostic assessment. Behav Res Methods 2024; 56:723-735. [PMID: 36814008 DOI: 10.3758/s13428-023-02075-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/17/2023] [Indexed: 02/24/2023]
Abstract
Cognitive diagnosis models (CDMs) have been used as psychometric tools in educational assessments to estimate students' strengths and weaknesses in terms of cognitive skills learned and skills that need study. In practice, it is not uncommon that questions can often be solved using more than one strategy, which requires CDMs capable of accommodating multiple strategies. However, existing parametric multi-strategy CDMs need a large sample size to produce a reliable estimation of item parameters and examinees' proficiency class memberships, which obstructs their practical applications. This article proposes a general nonparametric multi-strategy classification method with promising classification accuracy in small samples for dichotomous response data. The method can accommodate different strategy selection approaches and different condensation rules. Simulation studies showed that the proposed method outperformed the parametric CDMs when sample sizes were small. A set of real data was analyzed as well to illustrate the application of the proposed method in practice.
Collapse
Affiliation(s)
- Daxun Wang
- School of Psychology, Jiangxi Normal University, 99 Ziyang Ave, Nanchang, Jiangxi, 330022, China
| | - Wenchao Ma
- Department of Educational Studies in Psychology, Research Methodology and Counseling, The University of Alabama, Tuscaloosa, AL, USA
| | - Yan Cai
- School of Psychology, Jiangxi Normal University, 99 Ziyang Ave, Nanchang, Jiangxi, 330022, China.
| | - Dongbo Tu
- School of Psychology, Jiangxi Normal University, 99 Ziyang Ave, Nanchang, Jiangxi, 330022, China.
| |
Collapse
|
2
|
Xiong J, Luo Z, Luo G, Yu X. Data-driven Q-matrix learning based on Boolean matrix factorization in cognitive diagnostic assessment. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2022; 75:638-667. [PMID: 35578396 DOI: 10.1111/bmsp.12271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 03/22/2022] [Indexed: 06/15/2023]
Abstract
Attributes and the Q-matrix are the central components for cognitive diagnostic assessment, and are usually defined by domain experts. However, it is challenging and time consuming for experts to specify the attributes and Q-matrix manually. Thus, there is an urgent need for an automatic and intelligent means to address this concern. This paper presents a new data-driven approach for learning the Q-matrix from response data. By constructing a statistical index and a heuristic algorithm based on Boolean matrix factorization, the response matrix is decomposed into the Boolean product of the Q-matrix and the attribute mastery patterns. The feasibility of the proposed approach is evaluated using simulated data generated under various conditions. A real data example is also presented to demonstrate the usefulness of the proposed approach.
Collapse
Affiliation(s)
- Jianhua Xiong
- School of Psychology, Jiangxi Normal University, Nanchang, China
- School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, China
| | - Zhaosheng Luo
- School of Psychology, Jiangxi Normal University, Nanchang, China
| | - Guanzhong Luo
- School of Psychology, Jiangxi Normal University, Nanchang, China
| | - Xiaofeng Yu
- School of Psychology, Jiangxi Normal University, Nanchang, China
| |
Collapse
|
3
|
A multiple logistic regression-based (MLR-B) Q-matrix validation method for cognitive diagnosis models:A confirmatory approach. Behav Res Methods 2022:10.3758/s13428-022-01880-x. [PMID: 35819718 DOI: 10.3758/s13428-022-01880-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/17/2022] [Indexed: 11/08/2022]
Abstract
Q-matrix is an essential component specifying the relationship between attributes and items, which plays a key role in cognitive diagnosis assessment. The Q-matrix is usually developed by domain experts and its specifications tend to be subjective and might have misspecifications. Many existing pieces of research concentrate on the validation of Q-matrix; however, few of them can be applied to saturated cognitive diagnosis models. This paper proposes a general and effective Q-matrix validation method by employing multiple logistic regression model. Simulation studies are carried out to investigate the performance of the proposed method and compare it with four existing methods. Simulation results indicate the proposed method outperforms the existing methods in terms of validation accuracy. In addition, a set of real data is used as an example to illustrate its application. Finally, we discuss the limitations of the current study and the directions of future studies.
Collapse
|
4
|
Ma W, Jiang Z. Estimating Cognitive Diagnosis Models in Small Samples: Bayes Modal Estimation and Monotonic Constraints. APPLIED PSYCHOLOGICAL MEASUREMENT 2021; 45:95-111. [PMID: 33627916 PMCID: PMC7876636 DOI: 10.1177/0146621620977681] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Despite the increasing popularity, cognitive diagnosis models have been criticized for limited utility for small samples. In this study, the authors proposed to use Bayes modal (BM) estimation and monotonic constraints to stabilize item parameter estimation and facilitate person classification in small samples based on the generalized deterministic input noisy "and" gate (G-DINA) model. Both simulation study and real data analysis were used to assess the utility of the BM estimation and monotonic constraints. Results showed that in small samples, (a) the G-DINA model with BM estimation is more likely to converge successfully, (b) when prior distributions are specified reasonably, and monotonicity is not violated, the BM estimation with monotonicity tends to produce more stable item parameter estimates and more accurate person classification, and (c) the G-DINA model using the BM estimation with monotonicity is less likely to overfit the data and shows higher predictive power.
Collapse
Affiliation(s)
- Wenchao Ma
- The University of Alabama, Tuscaloosa, USA
| | | |
Collapse
|
5
|
Yu X, Cheng Y. Data-driven Q-matrix validation using a residual-based statistic in cognitive diagnostic assessment. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2020; 73 Suppl 1:145-179. [PMID: 31762007 DOI: 10.1111/bmsp.12191] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Revised: 07/15/2019] [Indexed: 06/10/2023]
Abstract
In a cognitive diagnostic assessment (CDA), attributes refer to fine-grained knowledge points or skills. The Q-matrix is a central component of CDA, which specifies the relationship between items and attributes. Oftentimes, attributes and Q-matrix are defined by subject-matter experts, and assumed to be appropriate without any misspecifications. However, this assumption does not always hold in real applications. To address this concern, this paper proposes a residual-based statistic for validating the Q-matrix. Its performance is evaluated in a simulation study and compared against that of an existing method proposed in Liu, Xu and Ying (2012, Applied Psychological Measurement, 36, 548). Simulation results indicate that the proposed method leads to a higher recovery rate of the Q-matrix and is computationally more efficient. The advantage in computational efficiency is particularly pronounced when the number of attributes measured by the test reaches five or more. Results also suggest that the two methods have different tendencies in estimating the attribute vector for each item. In cases where the methods fail to recover the correct Q-matrix, the method in Liu et al. (2012, Applied Psychological Measurement, 36, 548) tends to overestimate the number of attributes measured by the items, whereas our method does not show that bias.
Collapse
Affiliation(s)
- Xiaofeng Yu
- Department of Psychology, University of Notre Dame, Notre Dame, Indiana, USA
- Jiangxi Normal University, Nanchang, Jiangxi, China
| | - Ying Cheng
- Department of Psychology, University of Notre Dame, Notre Dame, Indiana, USA
| |
Collapse
|
6
|
Wang W, Song L, Ding S, Wang T, Gao P, Xiong J. A Semi-supervised Learning Method for Q-Matrix Specification Under the DINA and DINO Model With Independent Structure. Front Psychol 2020; 11:2120. [PMID: 33013538 PMCID: PMC7511573 DOI: 10.3389/fpsyg.2020.02120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Accepted: 07/30/2020] [Indexed: 12/02/2022] Open
Abstract
Cognitive diagnosis assessment (CDA) can be regarded as a kind of formative assessments because it is intended to promote assessment for learning and modify instruction and learning in classrooms by providing the formative diagnostic information about students' cognitive strengths and weaknesses. CDA has two phases, like a statistical pattern recognition. The first phase is feature generation, followed by classification stage. A Q-matrix, which describes the relationship between items and latent skills, corresponds to the feature generation phase in statistical pattern recognition. Feature generation is of paramount importance in any pattern recognition task. In practice, the Q-matrix is difficult to specify correctly in cognitive diagnosis and misspecification of the Q-matrix can seriously affect the accuracy of the classification of examinees. Based on the fact that any columns of a reduced Q-matrix can be expressed by the columns of a reachability R matrix under the logical OR operation, a semi-supervised learning approach and an optimal design for examinee sampling were proposed for Q-matrix specification under the conjunctive and disjunctive model with independent structure. This method only required subject matter experts specifying a R matrix corresponding to a small part of test items for the independent structure in which the R matrix is an identity matrix. Simulation and real data analysis showed that the new method with the optimal design is promising in terms of correct recovery rates of q-entries.
Collapse
Affiliation(s)
- Wenyi Wang
- School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, China
| | - Lihong Song
- Elementary Education College, Jiangxi Normal University, Nanchang, China
- *Correspondence: Lihong Song
| | - Shuliang Ding
- School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, China
| | - Teng Wang
- School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, China
| | - Peng Gao
- School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, China
| | - Jian Xiong
- School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, China
| |
Collapse
|
7
|
Wang D, Cai Y, Tu D. Q-Matrix Estimation Methods for Cognitive Diagnosis Models: Based on Partial Known Q-Matrix. MULTIVARIATE BEHAVIORAL RESEARCH 2020:1-13. [PMID: 32308032 DOI: 10.1080/00273171.2020.1746901] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Different from the item response models that postulate a single underlying proficiency, cognitive diagnostic assessments (CDAs) can provide fine-grained diagnostic information about students' knowledge state to aid classroom instructions. In CDAs, a Q-matrix that associates each item in a test with the cognitive skills is required to infer students' knowledge states. In practice, the Q-matrix is typically performed by domain experts, which is certainly affected by the subjective tendency of experts and, to a large extent, may consist of some misspecifications. In addition, if the number of items increases, the expert-based Q-matrix specification will be time-consuming and costly. To address this concern, this paper proposed several approaches based on the likelihood ratio test to estimate Q-matrix with partial known Q-matrix and the response data, which can be used with a wide class of cognitive diagnosis models (CDMs). The feasibility and effectiveness of the proposed methods were evaluated by simulated data generated under various conditions and an example to real data. Results show that new methods can estimate Q-matrix correctly and outperforms the existing method in most conditions.
Collapse
Affiliation(s)
- Daxun Wang
- School of Psychology, Jiangxi Normal University
| | - Yan Cai
- School of Psychology, Jiangxi Normal University
| | - Dongbo Tu
- School of Psychology, Jiangxi Normal University
| |
Collapse
|
8
|
Ma W, de la Torre J. An empirical Q-matrix validation method for the sequential generalized DINA model. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2020; 73:142-163. [PMID: 30723890 DOI: 10.1111/bmsp.12156] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2017] [Revised: 11/26/2018] [Indexed: 06/09/2023]
Abstract
As a core component of most cognitive diagnosis models, the Q-matrix, or item and attribute association matrix, is typically developed by domain experts, and tends to be subjective. It is critical to validate the Q-matrix empirically because a misspecified Q-matrix could result in erroneous attribute estimation. Most existing Q-matrix validation procedures are developed for dichotomous responses. However, in this paper, we propose a method to empirically detect and correct the misspecifications in the Q-matrix for graded response data based on the sequential generalized deterministic inputs, noisy 'and' gate (G-DINA) model. The proposed Q-matrix validation procedure is implemented in a stepwise manner based on the Wald test and an effect size measure. The feasibility of the proposed method is examined using simulation studies. Also, a set of data from the Trends in International Mathematics and Science Study (TIMSS) 2011 mathematics assessment is analysed for illustration.
Collapse
Affiliation(s)
- Wenchao Ma
- Department of Educational Studies in Psychology, Research Methodology and Counseling, University of Alabama, Tuscaloosa, Alabama, USA
| | | |
Collapse
|
9
|
A method of Q-matrix validation for polytomous response cognitive diagnosis model based on relative fit statistics. ACTA PSYCHOLOGICA SINICA 2020. [DOI: 10.3724/sp.j.1041.2020.00093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
10
|
Nájera P, Sorrel MA, Abad FJ. Reconsidering Cutoff Points in the General Method of Empirical Q-Matrix Validation. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 2019; 79:727-753. [PMID: 32655181 PMCID: PMC7328244 DOI: 10.1177/0013164418822700] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Cognitive diagnosis models (CDMs) are latent class multidimensional statistical models that help classify people accurately by using a set of discrete latent variables, commonly referred to as attributes. These models require a Q-matrix that indicates the attributes involved in each item. A potential problem is that the Q-matrix construction process, typically performed by domain experts, is subjective in nature. This might lead to the existence of Q-matrix misspecifications that can lead to inaccurate classifications. For this reason, several empirical Q-matrix validation methods have been developed in the recent years. de la Torre and Chiu proposed one of the most popular methods, based on a discrimination index. However, some questions related to the usefulness of the method with empirical data remained open due the restricted number of conditions examined, and the use of a unique cutoff point (EPS) regardless of the data conditions. This article includes two simulation studies to test this validation method under a wider range of conditions, with the purpose of providing it with a higher generalization, and to empirically determine the most suitable EPS considering the data conditions. Results show a good overall performance of the method, the relevance of the different studied factors, and that using a single indiscriminate EPS is not acceptable. Specific guidelines for selecting an appropriate EPS are provided in the discussion.
Collapse
|
11
|
Ma W. A diagnostic tree model for polytomous responses with multiple strategies. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2019; 72:61-82. [PMID: 29687453 DOI: 10.1111/bmsp.12137] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Revised: 01/05/2018] [Indexed: 06/08/2023]
Abstract
Constructed-response items have been shown to be appropriate for cognitively diagnostic assessments because students' problem-solving procedures can be observed, providing direct evidence for making inferences about their proficiency. However, multiple strategies used by students make item scoring and psychometric analyses challenging. This study introduces the so-called two-digit scoring scheme into diagnostic assessments to record both students' partial credits and their strategies. This study also proposes a diagnostic tree model (DTM) by integrating the cognitive diagnosis models with the tree model to analyse the items scored using the two-digit rubrics. Both convergent and divergent tree structures are considered to accommodate various scoring rules. The MMLE/EM algorithm is used for item parameter estimation of the DTM, and has been shown to provide good parameter recovery under varied conditions in a simulation study. A set of data from TIMSS 2007 mathematics assessment is analysed to illustrate the use of the two-digit scoring scheme and the DTM.
Collapse
Affiliation(s)
- Wenchao Ma
- The University of Alabama, Tuscaloosa, AL, USA
| |
Collapse
|
12
|
Chen J, de la Torre J. Introducing the General Polytomous Diagnosis Modeling Framework. Front Psychol 2018; 9:1474. [PMID: 30186195 PMCID: PMC6113892 DOI: 10.3389/fpsyg.2018.01474] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2018] [Accepted: 07/26/2018] [Indexed: 12/03/2022] Open
Abstract
Although considerable developments have been added to the cognitive diagnosis modeling literature recently, most have been conducted for dichotomous responses only. This research proposes a general cognitive diagnosis model for polytomous responses—the general polytomous diagnosis model (GPDM), which combines the G-DINA modeling process for dichotomous responses with the item-splitting process for polytomous responses. The polytomous items are specified similar to dichotomous items in the Q-matrix, and the MML estimation is implemented using an EM algorithm. Under the general framework, different saturated forms, and some reduced forms, can be transformed linearly. Model assessment and adjustment under the dichotomous context can be extended to polytomous responses. This simulation study demonstrates the effectiveness of the model when comparing the two response types. The real-data example further illustrates how the proposed model can make a difference in practice.
Collapse
Affiliation(s)
- Jinsong Chen
- Department of Psychology, Sun Yat-sen University, Guangzhou, China
| | | |
Collapse
|
13
|
Yamaguchi K, Okada K. Comparison among cognitive diagnostic models for the TIMSS 2007 fourth grade mathematics assessment. PLoS One 2018; 13:e0188691. [PMID: 29394257 PMCID: PMC5796692 DOI: 10.1371/journal.pone.0188691] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2017] [Accepted: 11/10/2017] [Indexed: 11/19/2022] Open
Abstract
A variety of cognitive diagnostic models (CDMs) have been developed in recent years to help with the diagnostic assessment and evaluation of students. Each model makes different assumptions about the relationship between students' achievement and skills, which makes it important to empirically investigate which CDMs better fit the actual data. In this study, we examined this question by comparatively fitting representative CDMs to the Trends in International Mathematics and Science Study (TIMSS) 2007 assessment data across seven countries. The following two major findings emerged. First, in accordance with former studies, CDMs had a better fit than did the item response theory models. Second, main effects models generally had a better fit than other parsimonious or the saturated models. Related to the second finding, the fit of the traditional parsimonious models such as the DINA and DINO models were not optimal. The empirical educational implications of these findings are discussed.
Collapse
Affiliation(s)
| | - Kensuke Okada
- Department of Psychology, Senshu University, Kanagawa, Japan
| |
Collapse
|
14
|
Chen J, Zhou H. Test designs and modeling under the general nominal diagnosis model framework. PLoS One 2017; 12:e0180016. [PMID: 28644865 PMCID: PMC5482485 DOI: 10.1371/journal.pone.0180016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2016] [Accepted: 05/23/2017] [Indexed: 11/21/2022] Open
Abstract
Most psychological questionnaires face issues of response bias in respondent-reported scales, inadequacy for criterion-reference testing, or difficulty in estimating a large number of latent traits. Situational tests together with the general nominal diagnosis model framework provide a viable alternative to alleviate these concerns. Under this framework, there are different ways to design situationally nominal items, which can offer more flexibility for test development. Any response bias remaining with respondent-reported questionnaires may be addressed with appropriate test designs. The saturated model subsumes different reduced forms that can help inform whether the test is designed as expected. Two simulation studies are presented to demonstrate the effectiveness of the models and designs.
Collapse
Affiliation(s)
- Jinsong Chen
- Department of Psychology, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Hui Zhou
- Department of Psychology, Sun Yat-Sen University, Guangzhou, Guangdong, China
| |
Collapse
|