A Semi-supervised Learning Method for Q-Matrix Specification Under the DINA and DINO Model With Independent Structure

Wang, Wenyi; Song, Lihong; Ding, Shuliang; Wang, Teng; Gao, Peng; Xiong, Jian

doi:10.3389/fpsyg.2020.02120

ORIGINAL RESEARCH article

Front. Psychol. , 10 September 2020

Sec. Quantitative Psychology and Measurement

Volume 11 - 2020 | https://doi.org/10.3389/fpsyg.2020.02120

This article is part of the Research Topic Cognitive Diagnostic Assessment for Learning View all 20 articles

A Semi-supervised Learning Method for Q-Matrix Specification Under the DINA and DINO Model With Independent Structure

$\nWenyi Wang$ Wenyi Wang¹

Lihong Song²^*

Shuliang Ding¹

Teng Wang¹

Peng Gao¹

Jian Xiong¹

¹School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, China
²Elementary Education College, Jiangxi Normal University, Nanchang, China

Cognitive diagnosis assessment (CDA) can be regarded as a kind of formative assessments because it is intended to promote assessment for learning and modify instruction and learning in classrooms by providing the formative diagnostic information about students' cognitive strengths and weaknesses. CDA has two phases, like a statistical pattern recognition. The first phase is feature generation, followed by classification stage. A Q-matrix, which describes the relationship between items and latent skills, corresponds to the feature generation phase in statistical pattern recognition. Feature generation is of paramount importance in any pattern recognition task. In practice, the Q-matrix is difficult to specify correctly in cognitive diagnosis and misspecification of the Q-matrix can seriously affect the accuracy of the classification of examinees. Based on the fact that any columns of a reduced Q-matrix can be expressed by the columns of a reachability R matrix under the logical OR operation, a semi-supervised learning approach and an optimal design for examinee sampling were proposed for Q-matrix specification under the conjunctive and disjunctive model with independent structure. This method only required subject matter experts specifying a R matrix corresponding to a small part of test items for the independent structure in which the R matrix is an identity matrix. Simulation and real data analysis showed that the new method with the optimal design is promising in terms of correct recovery rates of q-entries.

Introduction

In educational assessment, cognitive diagnostic assessment (CDA) that combines psychometrics and cognitive science has received increased attention recently (Leighton and Gierl, 2007; Tatsuoka, 2009; Rupp et al., 2010). This approach potentially provides useful diagnostic information regarding students' strengths and weaknesses, and can facilitate individualized learning (Chang, 2015). Cognitive diagnostic models (CDMs) often utilize a Q-matrix (Embretson, 1984; Tatsuoka, 1990, 1995, 2009). Tatsuoka (2009) pointed out that “Tatsuoka (1990) organized the underlying cognitive processing skills and knowledge that are required in answering test items correctly in a Q-matrix, in which the rows represent attributes and the columns represent items.” The entries of a Q-matrix are 1 or 0, denoted by q_kj. If attribute k is involved in correctly answering item j, then q_kj = 1, and q_kj = 0 otherwise. The definition of Q-matrix in Tatsuoka (1990) is used in our study. Recently, one common representation of a Q-matrix is that in which the rows represent items and the columns represent attributes (Ma and de la Torre, 2020; Zhan et al., 2020). It should be noted that the representation of the Q-matrix that they used in the study differs from the traditional one.

Cognitive diagnostic assessment has two phases, like statistical pattern recognition and classification methodology. The first phase is feature generation, and then classification stage follows. The specification of Q-matrix corresponds to the feature extractor phase in statistical pattern recognition and classification problems. Feature generation is of paramount importance in any pattern recognition task. So, the Q-matrix plays a very important role in establishing the relation between latent attribute patterns and ideal/latent response patterns.

In practice, the Q-matrix is difficult to specify correctly in cognitive diagnostic assessment (Jang, 2009; DeCarlo, 2011) and misspecification of the Q-matrix can seriously affect the accuracy of both item parameter estimates and the classification of examinees (de la Torre, 2008; Rupp and Templin, 2008). Researchers have proposed several quantitative methods for deriving or refining Q-matrix. These methods can be classified into two categories (Xu and Desmarais, 2018): (a) the unsupervised method, including but not limited to the q-matrix method (Barnes, 2003, 2011), the non-negative matrix factorization technique (Desmarais, 2011; Desmarais et al., 2012; Desmarais and Naceur, 2013) or alternate least-square factorization method (Desmarais et al., 2014; Xu and Desmarais, 2016), the data-driven approach (Liu et al., 2012, 2013), and the exploratory factor analysis method (Barnes, 2003; Close, 2012; Wang et al., 2018b, 2020), and (b) the supervised method, including the sequential EM-based δ method (de la Torre, 2008) and its extension ς² method (de la Torre and Chiu, 2016), the Bayesian approach (DeCarlo, 2012), the non-parametric Q-matrix refinement method (Chiu, 2013), the stepwise reduction algorithm (Hartz, 2002), the EM-based methods (Wang et al., 2018a), the residual-based or item fit statistic approach (Chen, 2017; Kang et al., 2018) and so on.

The unsupervised method is deriving a Q-matrix only from test data or item responses. The unsupervised method is very useful because there are many existing tests without specifying the Q-matrix but with test response data. However, it would be difficult to identify the number of latent skills and be slightly more difficult to understand results from real data. A study of Beheshti et al. (2012) found that the number of latent skills estimated from real data is not well-aligned with the assessment of experts.

The supervised method can incorporate the information of experts' Q-matrix and test response data to refine or validate the provisional Q-matrix. If the provisional Q-matrix is unknown for an existing test, the supervised methods cannot be used. Furthermore, this method often needs a high-quality provisional Q-matrix for a whole test. If the provisional Q-matrix is specified by subject matter experts but contains a large amount of misspecification, it will be difficult for the recovery of a high-quality Q-matrix through the supervised method, because the performance of the supervised method relies on the precision of classification of attribute patterns resulting from the provisional Q-matrix (de la Torre, 2008; Rupp and Templin, 2008).

Specifying a Q-matrix for a whole test by experts can be a time-consuming and fatigue process. The purpose of this study is to propose a semi-supervised method for Q-matrix specification in order to check whether only some of items needs to be identified by experts. The semi-supervised method falls between unsupervised and supervised methods.

Model and Method

Model

Let K be the number of attributes. Let X_ij be a binary random variable to denote the response of examinee i to item j, i = 1, 2, …, N, j = 1, 2, …, J. Let α_i be a column vector to denote an attribute mastery pattern or a knowledge state from the universal set of knowledge states. Moreover, Q-matrix that specifies the item-attribute relationship is a K × J matrix, in which entry q_kj = 1 if attribute k is required for answering item j correctly; otherwise, q_kj = 0.

The item response function for the deterministic inputs, noisy “and” gate (DINA) model (Haertel, 1989; Junker and Sijtsma, 2001; Chiu and Douglas, 2013) is as follows:

\begin{array}{l} P_{j} (α_{i}) = P (X_{i j} = 1 | α_{i}) = g_{j}^{1 - η_{i j}} {(1 - s_{j})}^{η_{i j}}, & (1) \end{array}

where a deterministic latent response $η_{i j} = \prod_{k = 1}^{K} α_{k i}^{q_{k j}}$ indicates whether or not examinee i possesses all of the attributes required by item j. A value of η_ij = 1 means that examinee i has mastered all of the attributes required by item j, and η_ij = 0 otherwise. The slip parameter s_j refers to the probability of an incorrect response to the item j when η_ij = 1, and the guessing parameter g_j refers to the probability of a correct response to item j when η_ij = 0. Let B = (η_ij) be a deterministic latent response matrix for the DINA model.

The item response function for the deterministic inputs, noisy “or” gate (DINO) model (Templin and Henson, 2006; Chiu and Douglas, 2013) is as follows:

\begin{array}{l} P_{j} (α_{i}) = P (X_{i j} = 1 | α_{i}) = {(1 - s_{j})}^{w_{i j}} g_{j}^{1 - w_{i j}}, & (2) \end{array}

where $w_{i j} = 1 - \prod_{k = 1}^{K} {(1 - α_{k i})}^{q_{k j}}$ is a deterministic latent response. As in the DINA model, s_j and g_j are the slip and guessing parameters of item j. The DINA and DINO model are conjunctive and disjunctive models (Maris, 1999), respectively. Let W = (w_ij) be a deterministic latent response matrix for the DINO model.

A Semi-supervised Learning Approach for the Conjunctive Model

In the rule space method (Tatsuoka, 2009) or the attribute hierarchy method (Leighton et al., 2004), the adjacency matrix denoted by A represents the direct relationship among attributes. We denote the entry in row k₁ and column k₂ of A by a_k₁k₂. If a direct prerequisite relation exists from attribute k₁ to attribute k₂, then a_k₁k₂ = 1, and a_k₁k₂ = 0 otherwise. Let R denote a reachability matrix of order (K, K) to specify the direct and indirect relationships among attributes. The R matrix is given by R = (A + I)^K with respect to Boolean operations, where I is an identity matrix. The reduced Q matrix denoted by Q_r is obtained by removing the items (columns) that do not satisfy the specified relationships from the incidence Q matrix. The columns of Q_r and the zero vector forms the student matrix denoted by Q_s in which the columns forms the universal set of attribute patterns. If K attributes are independent, A is a zero matrix, R with K columns is an identity matrix, Q_r with 2^K − 1 columns does not include the zero vector, and Q_s with 2^K columns contains all possible combinations of attribute patterns.

We assume that the cognitive requirement for the multiple skills within an item is conjunctive (Maris, 1999), that is, answering an item correctly requires mastery of all the skills required by that item. For the conjunctive model, Example 1 will show the relationship of latent responses on items with q-vectors corresponding to R and Q_r.

Example 1 for an independent structure. Let K = 2, R = [r₁ r₂] = $[\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}]$ , Q_r = [q₁ q₂ q₃] = $[\begin{matrix} 1 & 0 & 1 \\ 0 & 1 & 1 \end{matrix}]$ , and Q_s = [α₁ α₂ α₃ α₄] = $[\begin{matrix} 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 1 \end{matrix}]$ .

Given Q_s and a test Q-matrix of Q_r, a latent response matrix B = [η₁ η₂ η₃] = $[\begin{matrix} 0 & 0 & 0 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \\ 1 & 1 & 1 \end{matrix}]$ can be calculated, in which the entry in row i and column j is the deterministic latent response of η_ij. If 0 corresponds to F (false) and 1 corresponds to T (true), the logical conjunction and disjunction operators, ∨ and ∧, can be applied to two binary vectors of equal length, by taking the bitwise AND or OR of each pair of bits at corresponding positions. It can be observed that η₃ = η₁ ∧ η₂, where η₃ = η₁ ∧ η₂ is the conjunction of η₁ and η₂. This is because the relationship q₃ = q₁ ∨ q₂ is true, where q₁ ∨ q₂ is the disjunction of q₁ and q₂.

Example 1 illustrates the following fact. For the conjunctive model, consider two latent response matrices denoted by B₁ and B₂ from two tests corresponding two Q-matrices Q_r and R, where denoted as a reachability matrix. It means that B₁ and B₂ can be generated, respectively from the reduced Q-matrix and the reachability matrix based on the universal set of attribute patterns. From the example above, then any columns of the B₁ can be expressed by the columns of the B₂ under the logical AND operation. This is because the augmented algorithm proposed by Ding et al. (2008, 2009) in the generalized Q-matrix theory (Ding et al., 2015) provided the useful fact that any columns of the reduced Q-matrix can be expressed by the columns of the reachability matrix under the logical OR operation. The argument in Example 1 can be adapted to prove the following theorem.

Theorem 1. For the conjunctive model, if K attributes are independent, then q_j = ∨_{l∈_S_j}r_l if and only if η_ij = ∧_{l∈_S_j}η_il, where α_i is any column of Q_s and S_j is a subset of {1, 2, …, K}.

Proof : If q_j = ∨_{l∈_S_j}r_l, we need to consider two cases, when η_ij = 1 and η_ij = 0. If η_ij = 1 for α_i as a column of Q_s, we know that α_ki = 1 for all attributes k with q_kj = 1 by the definition of the deterministic latent response. That is, examinee i has mastered all the skills required by item j. Since q_j = ∨_{l∈_S_j}r_l, then by the definition of conjunction, we can conclude that α_ki = 1 for all attributes k with r_kl = 1 for all l ∈ S_j. We now use the definition of the deterministic latent response to conclude that η_il = 1 for all l ∈ S_j, that is, ∧_{l∈_S_j}η_il = 1. This shows that η_ij = ∧_{l∈_S_j}η_il when η_ij = 1. If η_ij = 0 for α_i as a column of Q_s, we know that α_ki = 0 for at least one of attributes with q_kj = 1 by the definition of the deterministic latent response. That is, examinee i has not mastered all the skills required by item j. Since q_kj = 1 and q_j = ∨_{l∈_S_j}r_l, there is an item l in S_j such that r_kl = 1. This means that item l measured attribute k. Since α_ki = 0, then by the definition of the deterministic latent response, it follows that η_il = 0 for at least one of items in S_j, that is, ∧_{l∈_S_j}η_il = 0. This show that η_ij = ∧_{l∈_S_j}η_il when η_ij = 0. Next, we try to prove the converse. First suppose that there exists an attribute k ∈ {1, 2, …, K} such that ∨_{l∈_S_j}r_kl = 1 and q_kj = 0. Since ∨_{l∈_S_j}r_kl = 1, we know that there exists an item l ∈ S_j with r_kl = 1. Due to the arbitrariness of α_i, let α_i = 1 − e_k, where 1 = (1 1 … 1)^T and e_k is the vector with a 1 in the kth entry and 0's elsewhere. This is a contradiction, because we know that η_ij = 1, while ∧_{l∈_S_j}η_il = 0. Similarly, we assume that there exists an attribute k ∈ {1, 2, …, K} such that ∨_{l∈_S_j}r_kl = 0 and q_kj = 1. One can still take α_i = 1 − e_k. This is also a contradiction, because we know that η_ij = 0, while ∧_{l∈_S_j}η_il = 1. The proof is complete.

The important fact about Theorem 1 is that if a latent response matrix is calculated from a Q-matrix, the relationship between the columns in the Q-matrix can be constructed from the relationship between the corresponding columns in the latent response matrix. It should be noted that an observed item response is a function of an underlying latent response and slip and guessing parameters. In other words, the noise introduced in the process is due to slip and guessing parameters.

Next, we will introduce a semi-supervised learning method for Q-matrix specification for the conjunctive model by using the result of Theorem 1 and considering the noise in item responses. Without loss of generality, we begin by arbitrarily assigning q-vector q_j to item j. Given a test Q-matrix, written as Q_t = [R_{K × K} q_j] = [r₁ r₂ … r_K q_j], where R is a reachability matrix specified by subject matter experts and the remaining q_j is unknown. Let U = [X_{N × K}Y_{N × 1}] be an item response matrix on Q_t, where N is the sample size. The estimate of q_j can be written as

\begin{array}{l} {\hat{q}}_{j} = \lor_{r_{k} \in {\hat{S}}_{j}} r_{k}, & (3) \end{array}

where logical OR is applied to the corresponding entries of the columns in the following set of Ŝ_j

\begin{array}{l} {\hat{S}}_{j} = \underset{S \in P ({r_{1}, r_{2}, \dots, r_{K}}) - \emptyset}{arg min} {(Y_{j} - \land_{r_{k} \in S} X_{k})}^{T} (Y_{j} - \land_{r_{k} \in S} X_{k}), & (4) \end{array}

where P({r₁, r₂, …, r_K}) is the power set of the set {r₁, r₂, …, r_K}. The exhaustive method with time complexity O(2^K) provided a simple way to find a global solution of Ŝ_j.

A Semi-supervised Learning Approach for the Disjunctive Model

For the disjunctive model, the deterministic latent response on an item is correct if and only if an examinee has mastered at least one of the skills required by the item. This is illustrated in Example 2. Similar to what we did in Example 1, Example 2 will show the relationship of latent responses on items with q-vectors corresponding to R and Q_r.

Example 2 for an independent structure. Let K = 2, R = [r₁ r₂] = $[\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}]$ , Q_s = [α₁ α₂ α₃ α₄] = $[\begin{matrix} 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 1 \end{matrix}]$ , and Q_r = [q₁ q₂ q₃] = $[\begin{matrix} 1 & 0 & 1 \\ 0 & 1 & 1 \end{matrix}]$ . From Q_s and Q_r, a latent response matrix W₁ = [w₁ w₂ w₃] = $[\begin{matrix} 0 & 0 & 0 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \\ 1 & 1 & 1 \end{matrix}]$ can be calculated, in which the entry in row i and column j is the deterministic latent response of w_ij. It can be observed that w₃ = w₁ ∨ w₂. This is because the relationship q₃ = q₁ ∨ q₂ is true.

Consider a latent response matrix, denoted by W₂ = [w₁ w₂], corresponding to the R matrix. The fact illustrated in Example 2 is that any columns of the W₁ can be expressed by the columns of the W₂ under the logical OR operation for the disjunctive model. This is also because the augmented algorithm proposed by Ding et al. (2008, 2009) in the generalized Q-matrix theory (Ding et al., 2015) provided the useful fact that any columns of the reduced Q-matrix can be expressed by the columns of the reachability matrix under the logical OR operation. The following theorem gives the precise statement.

Theorem 2. For the disjunctive model, if K attributes are independent, then q_j = ∨_{l∈_S_j}r_l if and only if w_ij = ∨_{l∈_S_j}w_il, where α_i is any column of Q_s and S_j is a subset of {1, 2, …, K}.

Proof: If q_j = ∨_{l∈_S_j}r_l, we need to consider two cases, when w_ij = 1 and w_ij = 0. If w_ij = 1 for α_i as a column of Q_s, we know that α_ki = 1 for at least one of attributes k with q_kj = 1 by the definition of the deterministic latent response. That is, examinee i has mastered at least one of the attributes required by item j. Without loss of generality, we assume α_ki = 1 and q_kj = 1. Since q_j = ∨_{l∈_S_j}r_l, then by the definition of disjunction, we can conclude that r_kl = 1 is true for at least one of l ∈ S_j. From the definition of the deterministic latent response, it follows that there is at least one item l ∈ S_j such that w_il = 1, that is, ∨_{l∈_S_j}w_il = 1. This show that w_ij = ∨_{l∈_S_j}w_il when w_ij = 1. If w_ij = 0 for α_i as a column of Q_s, we know that w_ki = 0 for all of attributes with q_kj = 1 by the definition of the deterministic latent response. That is, examinee i has not mastered any skills required by item j. Since q_j = ∨_{l∈_S_j}r_l, examinee i has not mastered any skills required by any item l ∈ S_j. If we suppose that examinee i has mastered at least one of attributes required by an item l ∈ S_j, then w_ij = 1, which is a contradiction. It means that item l measured attribute k. It follows that w_il = 0 for all of items in S_j, that is, ∨_{l∈_S_j}w_il = 0, directly from the definition of the deterministic latent response. This show that w_ij = ∨_{l∈_S_j}w_il when w_ij = 0. Next, we use a proof by contradiction to prove the converse. First assume that there exists an attribute k ∈ {1, 2, …, K} such that ∨_{l∈_S_j}r_kl = 1 and q_kj = 0. Since ∨_{l∈_S_j}r_kl = 1, we know that there exists an item l ∈ S_j with r_kl = 1. Due to the arbitrariness of α_i, let α_i = e_k, where e_k is the vector with a 1 in the kth entry and 0's elsewhere. Then, we havew_il = 1 and w_ij = 0. Sincew_ij = ∨_{l∈_S_j}w_il, we know that w_ij = 1 and arrive at a contradiction. Similarly, we assume that there exists an attribute k ∈ {1, 2, …, K} such that ∨_{l∈_S_j}r_kl = 0 and q_kj = 1. One can still take α_i = e_k. This is also a contradiction, because we know that w_ij = 1, while ∧_{l∈_S_j}w_il = 0. The proof is complete.

The important fact about Theorem 2 is that one can derive the relationship between the columns of a Q-matrix from the relationship between the columns of corresponding latent response matrix. For considering the noise introduced in item responses due to slipping and guessing, we will introduce a semi-supervised learning method for Q-matrix specification for the disjunctive model by using the result of Theorem 2. Without loss of generality, we begin by arbitrarily assigning a q-vector to q_j. Given a test Q-matrix, written as Q_t = [R_{K × K} q_j] = [r₁ r₂ … r_K q_j], where R is a reachability matrix specified by subject matter experts and the remaining q_j is unknown. Let U = [X_{N × K} Y_{N × 1}] be an item response matrix on Q_t. The estimate of q_j can be written as

\begin{array}{l} {\hat{q}}_{j} = \lor_{r_{k} \in {\hat{S}}_{j}} r_{k}, & (5) \end{array}

where logical OR is applied to the corresponding entries of the columns in the following set of Ŝ_j

\begin{array}{l} {\hat{S}}_{j} = \underset{S \in P ({r_{1}, r_{2}, \dots, r_{K}}) - \emptyset}{arg min} {(Y_{j} - \lor_{r_{k} \in S} X_{k})}^{T} (Y_{j} - \lor_{r_{k} \in S} X_{k}), & (6) \end{array}

where P({r₁, r₂, …, r_K}) is the power set of the set {r₁, r₂, …, r_K}. The exhaustive method with time complexity O(2^K) provided a simple way to find a global solution of Ŝ_j.

A Simulation Study

Study Design

A simulation study was conducted to investigate the performance of the new method under five factors, such as sample size, item parameters for items corresponding to a reachability matrix, item parameters for new or raw items with unknown q-vectors, two cognitive diagnostic models (the DINA and DINO model), and two designs. Five attributes were considered in the simulation study. Matlab 2015a and R-3.6.1 were used for estimating unknown Q-matrix and analyzing real data below.

In the simulation study, a test Q-matrix Q_t = [R Q_r] consists of an identity or a reachability matrix and a reduced Q-matrix, where the reduced Q-matrix with 31 items includes all non-zero possible q-vectors. The number of examinees has 10 levels, such as N=30, 60, …, and 300. Item parameters for R and Q_r have 10 levels, such as 0, 0.05, …, and 0.45. In general, for the DINA or DINO model, a high quality or “good” item will have small slip and guessing parameters (Rupp et al., 2010), which means that the noise are small.

Random and optimal designs were considered in the simulation study. For the random design, attribute patterns for examinees were generated by taking each of the 2⁵ possible patterns with equal probability for each sample size. From the proof of Theorem 1 above, we know that the following set of attribute patterns for examinees plays a very important role in discriminating latent response vectors of different q-vectors under the DINA model

\begin{array}{l} S_{D I N A} = {1 - e_{1} 1 - e_{2}, \dots, 1 - e_{K}} {[\begin{matrix} 0 \\ 1 \\ ⋮ \\ 1 \end{matrix}], [\begin{matrix} 1 \\ 0 \\ ⋮ \\ 1 \end{matrix}], \dots, [\begin{matrix} 1 \\ 1 \\ ⋮ \\ 0 \end{matrix}]} & (7) \end{array}

where e_k is the vector with a 1 in the kth entry and 0's otherwise. From the proof of Theorem 2 above, another set of attribute patterns for examinees plays a very important role in discriminating latent response vectors of different q-vectors under the DINO model as follows

\begin{array}{l} S_{D I N O} = {e_{1}, e_{2}, \dots, e_{K}} {[\begin{matrix} 1 \\ 0 \\ ⋮ \\ 0 \end{matrix}], [\begin{matrix} 0 \\ 1 \\ ⋮ \\ 0 \end{matrix}], \dots, [\begin{matrix} 0 \\ 0 \\ ⋮ \\ 1 \end{matrix}]}, & (8) \end{array}

where e_k is the vector with a 1 in the kth entry and 0's otherwise. For the optimal design, attribute patterns for examinees under the DINA or DINA model were randomly drawn with replacement from the set of S_DINA or S_DINO, respectively. Optimal designs for two models are possible to meet the needs of learners at different stages of skills and knowledge acquisition. For example, the attribute patterns in S_DINO containing only one skill. This condition is really improbable for summary assessments in real situations, but is expected to be common for novice learners with respect to the new content to be learned in formative assessments or classroom assessments.

Data Simulation

Simulated data were generated using five attributes. Based on the simulated Q-matrix, item parameters, and attribute patterns, item responses are generated in the following way

\begin{array}{l} X_{i j} = {\begin{matrix} 1, & i f u \leq P_{j} (α_{i}), \\ 0, & o t h e r w i s e, \end{matrix} & (9) \end{array}

where u is a random value from a Uniform (0, 1) distribution and P_j(α_i) is the item response function of the DINA or DINO model. A total of 4,000 conditions were simulated (10 sample sizes × 10 item parameters × 10 item parameters × 2 models × 2 designs). Thirty replication data sets were simulated for each condition.

Evaluation Criterion

The performance of the new method is evaluated in terms of the correct recovery rate (CRR) of q-entries. The correct recovery rate equals the ratio of the number of correct q-entries in the estimated Q-matrix to the total number of q-entries (Chiu, 2013)

\begin{array}{l} CRR = \frac{1}{K M} \sum_{k = 1}^{K} \sum_{j = 1}^{M} I ({\hat{q}}_{k j} = q_{k j}), & (10) \end{array}

where M = 31 is the number of columns of the unknown Q-matrix Q_r, q_kj is an (k, j)th entry of the simulated Q_r, and ${\hat{q}}_{k j}$ is an (k, j) entry of the ${\hat{Q}}_{r}$ estimated from the new method. The mean and standard deviation of the CRR values of the 30 replications were reported for each condition.

Results

Table 1 lists descriptive statistics of correct recovery rate of q-entries for two models and two designs across other conditions. It is clear that the mean of correct recovery rates of q-entries tends to increase as sample size increases, but sample size has slightly affected the standard deviations of correct recovery rates. It should be noted that the mean of correct recovery rates of the optimal design is larger than that of the random design. The semi-supervised learning method for q-matrix specification performed similarly under two cognitive diagnostic models. In addition, since there are 32 possible attribute patterns, no all attribute patterns can be observed in the first sample size condition (N = 30). This might lead to lower rate of correct recovery observed for this condition.

TABLE 1

Table 1. Mean and standard deviation (in brackets) of correct recovery rate of q-entries for two models and two designs.

Table 2 shows the correct recovery rates of q-entries from the new method with sample size of 300 for the DINA model under the random design. From correct recovery rates of q-entries, when item parameters for items with known (i.e., the reachability matrix) and unknown q-vectors are ≤ 0.2, most of the average of correct recovery rates of q-entries for the semi-supervised method are larger than or equal to 0.9. From trends of marginal means of last rows and columns in Table 2, item parameters of the reachability matrix have a relatively larger impact on the performance of the semi-supervised method than item parameters with unknown q-vectors.

TABLE 2

Table 2. The correct recovery rates of q-entries with sample size of 300 for the DINA model and random design.

Table 3 presents the correct recovery rates of q-entries from the new method with sample size of 300 for the DINA model under the optimal design. From correct recovery rates of q-entries, when item parameters for items with known and unknown q-vectors are ≤ 0.25, the average of correct recovery rates of q-entries for the semi-supervised method are larger than or equal to 0.9. However, item parameters for known q-vectors have slightly larger impact on the performance of the semi-supervised method than for unknown q-vectors, because the row means decreased more quickly than the column means. We need to compare the Tables 2, 3 to see which designs are promising. The number of correct recovery rates above 0.9 in Table 3 were found to be larger than that of Table 2. Tables 4, 5 show the correct recovery rates of q-entries from the new method with sample size of 300 for the DINO model under the random and optimal design. It can be observed that results for the DINO model are the same as those for the DINA model described above.

TABLE 3

Table 3. The correct recovery rates of q-entries with sample size of 300 for the DINA model and optimal design.

TABLE 4

Table 4. The correct recovery rates of q-entries with sample size of 300 for the DINO model and random design.

TABLE 5

Table 5. The correct recovery rates of q-entries with sample size of 300 for the DINO model and optimal design.

Real Data Analysis

The purpose of the real data analysis is to examinee whether the proposed method is promising for a non-independent structure under the conjunctive model based on an intuitive fact from the following example.

Example 3 for an unstructured hierarchy under the conjunctive model. Let K = 3, R = [r₁ r₂ r₃] = $[\begin{matrix} 1 & 1 & 1 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}]$ , Q_r = [q₁ q₂ q₃ q₄] = $[\begin{matrix} 1 & 1 & 1 & 1 \\ 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 1 \end{matrix}]$ , and Q_s = [α₀ α₁ α₂ α₃ α₄] = $[\begin{matrix} 0 & 1 & 1 & 1 & 1 \\ 0 & 0 & 1 & 0 & 1 \\ 0 & 0 & 0 & 1 & 1 \end{matrix}]$ . From the ideal response matrix B = [η₁ η₂ η₃ η₄] = $[\begin{matrix} 0 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 1 & 1 & 1 & 1 \end{matrix}]$ , it can be observed that η₄ = η₂ ∧ η₃ or η₄ = η₁ ∧ η₂ ∧ η₃. This is because the relationship q₄ = q₂ ∨ q₃ or q₄ = q₁ ∨ q₂ ∨ q₃ is true.

A common data set pertaining to fraction-subtraction data contains 20 items and 536 examines (de la Torre and Douglas, 2004). In our real data analysis, we focused on the analysis of a subset of test items where the expert Q-matrix comes from Table 7 both in de la Torre (2008) or DeCarlo (2012). The labels given to the five skills are (A1) performing basic fraction-subtraction operation, (A2) simplifying/reducing, (A3) separating whole numbers from fractions, (A4) borrowing one from whole number to fraction, and (A5) converting whole numbers to fractions.

We assumed the corresponding Q-matrix of items 3, 8, 9, 12, and 10 known since these item parameters are relatively small and the q-vectors of other items are combinations of q-vectors for these five items. Then, the semi-supervised method was applied to estimate q-vectors for the other 10 items. Results in Table 6 show that the agreement rate of q-entries between the estimate and expert Q-matrix on the 10 items is 84%. The estimated q-entries suggest that items 4, 7, 13, 14, and 15 do not require attribute 2 (simplifying/reducing). Item 4 (similar to item 14) do not required attribute A2, which is consistent with results from DeCarlo (2012). Items 7, 13, and 15 can be answered correctly by using attributes required by item 12. The estimated q-vector of item 1 has largest discrepancy with the expert q-vector. The reason might be that solving item 1 correctly needs to find a common denominator and then performs basic fraction-subtraction operation. The guessing and slip parameter of item 1 are 0.0001 and 0.2769 under the expert q-vector, respectively. The guessing and slip parameter of item 1 are 0.3408 and 0.0716 under the estimated q-vector, respectively. Since item 1 requires an extra attribute (i.e., find a common denominator), the slip parameter for the expert q-vector is relatively large, while the estimated q-vector contains some unnecessary attributes, the guessing parameter is relatively large. In the estimated Q-matrix, attribute A4 has been added to item 11.The guessing probability of item 11 increased sensibly (from 0.10 to 0.48). It indicated that attribute A4 is not necessary for item 11 because this item is different from items 7, 12, and so on.

TABLE 6

Table 6. The expert and estimated Q-matrix and item parameters estimates of the DINA model for the fractional subtraction data.

The generalized DINA model (GDINA; de la Torre, 2011), the DINA model, the linear logistic model (LLM; Fischer, 1995), and the reduced reparametrized unified model (R-RUM; Hartz, 2002) were applied to fit the fraction-subtraction data with the expert or estimated Q-matrix. Under the DINA model, the means of the estimates of the guessing and slip parameter for the expert Q-matrix are 0.1080 and 0.1381, respectively, while for the revised Q-matrix, they are 0.1440 and 0.1295, respectively. It means that the estimates of the slip parameter become lower, but the guessing parameters tend to be larger. Table 7 presents fit results for the fraction subtraction data using the expert and estimated q-matrix. The LLM with the estimated Q-matrix is the best-fitting CDM and the R-RUM with the estimated Q-matrix is slightly worse, whereas the estimated Q-matrix performed worse than the expert Q-matrix only in the DINA model.

TABLE 7

Table 7. Fit results for the fraction subtraction data using the expert and estimated Q-matrix.

Conclusion and Discussion

The supervised methods rely on a provisional Q-matrix for a whole test, the estimates of examinees' attribute patterns and their accuracy. It is not suitable for the case of a provisional Q-matrix with a large amount of misspecification. The purpose of this study is to propose the semi-supervised method under independent structure based on item responses and a reachability R matrix corresponding to a small part of test item specified by subject matter experts. The new method doesn't need to estimate examinees' attribute patterns. The main conclusion of this study is that the new method will play a very important role in assist subject matter experts for Q-matrix specification because it is hard to correctly specify a Q-matrix with a large number of test items by subject matter experts. It may be useful for cognitive diagnostic assessment to facilitate teaching and learning.

The generalized Q-matrix theory has been shown that each column in the reduced Q-matrix can be expressed as a logical disjunction of some of columns of the reachability matrix. With the aid of this theory, this study takes a look inside a latent response matrix and reveals an interesting and useful relationship hidden in its columns. If a latent response matrix is calculated from a Q-matrix under the conjunctive model, a column in the latent response matrix is the conjunction of some other columns in this matrix if and only if the corresponding column of the Q-matrix can be written as the disjunction of their corresponding columns. While for the disjunctive model, the columns of the latent response matrix have exactly the same disjunction relationships as the columns of the Q-matrix. Because any conjunction or disjunction relationship among the columns of a latent response matrix would imply a disjunction relationship among the columns of a Q-matrix, then we are expected that the relationship between the columns in the Q-matrix can be constructed from the relationship between the corresponding columns in an observed response matrix, resulting from the latent response matrix by adding the noise or random errors. Another reason for this expectation is that each entry in the observed response matrix is modeled as a noisy observation of the corresponding entry in the latent response matrix through slip and guessing parameters (Junker and Sijtsma, 2001) and the discrepancies between the latent and observed response matrices are considered as random errors (Tatsuoka, 1987).

From the key theoretical results above, the semi-supervised method and an optimal design were then proposed for Q-matrix specification based on test response data and a reachability matrix specified by subject matter experts, and the simulation study was conducted to investigate the performance of the new method and the optimal design for examinee sampling in terms of the CRR of q-entries. From the CRR of q-entries, it is clear found that: (a) for the random design, when item parameters for items with known and unknown q-vectors are ≤ 0.20, the average of CRRs of q-entries for the semi-supervised method is larger than or equal to 0.9, (b) for the optimal design, when item parameters for items with known and unknown q-vectors are ≤ 0.25, the average of CRRs of q-entries for the semi-supervised method is larger than or equal to 0.9, and (c) item parameters of the reachability matrix have a larger impact on the performance of the semi-supervised method than item parameters with unknown q-vectors.

Finally, based on the results obtained in this study, some problems worthy of study in the future are put forward. First, how to effectively use the most of data or information on some other items for which experts have also specified q-vectors, because as the increase of the number of item specified q-vectors, the time complexity (more specifically, exponential time) of the exhaustive method grows much faster? If the number of items is increased to double or triple the number of attributes corresponding to the reachability matrix, one should investigate whether choosing a small part of items with high quality will reduce the noise of the responses and improve the estimation of q entries of unknown items. Second, in the simulation study, we know exactly how many attributes all items include. However, in the real situation, some items with unknown Q-matrix may mix additional attributes not specified in the reachability matrix because we haven't reviewed all items. Thus, we should explore a novel or revised method for identifying the possibility of extra attribute(s). Third, if the Q-matrix obtained from the semi-supervised method is taken as an initial matrix or a provisional Q-matrix of the existing supervised methods, is it possible to further improve the recovery of Q-matrix? From the results of the study, it can be seen that item parameters or random errors of item responses have an impact on the recovery of Q-matrix. If there is a method to reduce noise in item responses, the recovery of Q-matrix may be further improved. We only considered the small set of items with known q-vectors and fixed item parameters. Additional work is needed to further examine the impact of not only error patterns for known q-vectors but different item parameters for test items. Fourth, the current study focused on the DINA and DINO model only. In the future, the proposed method should be applied to general families of cognitive diagnostic models such as the generalized DINA model (de la Torre, 2011), the log-linear cognitive diagnostic model (Henson et al., 2009), the general diagnostic model (von Davier, 2008), testlet cognitive diagnosis model (Zhan et al., 2018), or polytomous cognitive diagnosis models (Chen and de la Torre, 2018; Ma, 2019). Lastly, since only the independent attribute structure in the simulation study and hierarchy structures for the conjunctive model in real data analysis were considered, the proposed method for other attribute hierarchies with different cognitive assumptions is worth studying.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: https://www.rdocumentation.org/packages/CDM/versions/7.4-19/topics/fraction.subtraction.data.

Author's Note

Based on the fact that any columns of a reduced Q-matrix can be expressed by the columns of a reachability R matrix under the logical OR operation, a semi-supervised learning approach and an optimal design for examinee sampling were proposed for Q-matrix specification under the conjunctive and disjunctive model. This method only required subject matter experts specifying a R matrix corresponding to a small part of test items. Simulation and real data analysis showed that the new method with the optimal design is promising in terms of correct recovery rates of q-entries.

Author Contributions

WW, LS, and TW conducted a design of the study, data analysis, paper writing, and revision. SD revised the paper. PG and JX give some descriptions of data analysis. All authors contributed to the article and approved the submitted version.

Funding

This research was partially supported by the Key Project of National Education Science Twelfth Five Year Plan of Ministry of Education of China (Grant No. DHA150285).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Barnes, T. (2011). “Novel derivation and application of skill matrices: the q-matrix method,” in Handbook of Educational Data Mining, eds C. Romero, Sebastian Ventura, M. Pechenizkiy, and R. S. J. D. Baker (Boca Raton, FL: CRC Press), 159–172.

A Semi-supervised Learning Method for Q-Matrix Specification Under the DINA and DINO Model With Independent Structure

Introduction

Model and Method

Model

A Semi-supervised Learning Approach for the Conjunctive Model

A Semi-supervised Learning Approach for the Disjunctive Model

A Simulation Study

Study Design

Data Simulation

Evaluation Criterion

Results

Real Data Analysis

Conclusion and Discussion

Data Availability Statement

Author's Note

Author Contributions

Funding

Conflict of Interest

References

95% of researchers rate our articles as excellent or good