| Literature DB >> 33935863 |
Yanyun Dong1, Xiaomei Ma1, Chuang Wang2, Xuliang Gao3.
Abstract
Cognitive diagnostic models (CDMs) show great promise in language assessment for providing rich diagnostic information. The lack of a full understanding of second language (L2) listening subskills made model selection difficult. In search of optimal CDM(s) that could provide a better understanding of L2 listening subskills and facilitate accurate classification, this study carried a two-layer model selection. At the test level, A-CDM, LLM, and R-RUM had an acceptable and comparable model fit, suggesting mixed inter-attribute relationships of L2 listening subskills. At the item level, Mixed-CDMs were selected and confirmed the existence of mixed relationships. Mixed-CDMs had better model and person fit than G-DNIA. In addition to statistical approaches, the content analysis provided theoretical evidence to confirm and amend the item-level CDMs. It was found that semantic completeness pertaining to the attributes and item features may influence the attribute relationships. Inexplicable attribute conflicts could be a signal of suboptimal model choice. Sample size and the number of multi-attribute items should be taken into account in L2 listening cognitive diagnostic modeling studies. This study provides useful insights into the model selection and the underlying cognitive process for L2 listening tests.Entities:
Keywords: L2 listening subskills; cognitive diagnostic model; inter-attribute relationship; mixed-CDMs; model selection
Year: 2021 PMID: 33935863 PMCID: PMC8085248 DOI: 10.3389/fpsyg.2021.608320
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Configuration of Q-matrix.
| Item 1 | 1 | 0 | 0 | 0 | 0 | 0 | Item 11 | 0 | 1 | 0 | 0 | 0 | 1 |
| Item 2 | 1 | 0 | 0 | 0 | 0 | 0 | Item 12 | 0 | 0 | 0 | 0 | 0 | 1 |
| Item 3 | 0 | 1 | 0 | 1 | 0 | 0 | Item 13 | 0 | 0 | 0 | 1 | 1 | 0 |
| Item 4 | 0 | 1 | 0 | 1 | 0 | 0 | Item 14 | 0 | 0 | 0 | 0 | 0 | 1 |
| Item 5 | 0 | 1 | 0 | 0 | 0 | 0 | Item 15 | 0 | 0 | 0 | 0 | 0 | 1 |
| Item 6 | 0 | 0 | 1 | 0 | 0 | 0 | Item 16 | 0 | 0 | 1 | 0 | 1 | 0 |
| Item 7 | 0 | 0 | 0 | 1 | 0 | 0 | Item 17 | 0 | 0 | 0 | 0 | 0 | 1 |
| Item 8 | 0 | 0 | 1 | 0 | 0 | 0 | Item 18 | 0 | 0 | 0 | 0 | 1 | 0 |
| Item 9 | 0 | 0 | 1 | 0 | 0 | 0 | Item 19 | 0 | 0 | 0 | 1 | 1 | 0 |
| Item 10 | 0 | 0 | 0 | 1 | 1 | 0 | |||||||
Listening attributes/subskills and relationship with existing listening skill taxonomies.
| A1: Sound discrimination | Identify/retrieve | Input decoding | Understand prosodic patterns | |
| A2: Less frequent vocabulary and expressions | Identify/retrieve | Lexical search | Understand vocabulary | Understand vocabulary |
| A3: Difficult structures | Identify/retrieve/analyze | Parsing | Understand syntactic patterns | |
| A4: Facts and details | Identify/retrieve/analyze | Meaning construction | Understand important information | |
| A5: Main idea | Analyze/summarize/create | Meaning construction; | Understand overall topic/gist; | |
| A6: Situational context and cultural background inference | Analyze/summarize/ create/evaluate | Meaning construction; | Identify speaker's purpose, attitudes, views, and intentions; | Making inferences |
| Identify rhetorical devices. | Understand the structure (rhetorical, discourse). |
The attributes and definitions stem from the study of Meng (2013, p. 78, p. 95).
Absolute fit.
| G-DINA | 115 | 3.12 | 0.3046 | 2.89 | 0.6521 | 89.0495 | 0.128 | 0.0194 | 0.0431 | 11,259.24 | 11,489.23 | 11,973.91 |
| DINA | 101 | 4.68 | 0.0005 | 4.03 | 0.0097 | 118.336 | 0.0205 | 0.0257 | 0.0452 | 11,308.10 | 11,510.09 | 11,935.77 |
| DINO | 101 | 4.62 | 0.0006 | 4.02 | 0.0100 | 108.348 | 0.0799 | 0.0209 | 0.0209 | 11,300.98 | 11,502.98 | 11,928.65 |
| A-CDM | 108 | 3.34 | 0.1437 | 3.10 | 0.3272 | 75.9016 | 0.6686 | 0 | 0.0437 | 11,273.82 | 11,489.81 | 11,944.99 |
| LLM | 108 | 3.15 | 0.2831 | 2.96 | 0.5182 | 93.2377 | 0.1862 | 0.0166 | 0.0433 | 11,268.92 | 11,484.93 | 11,940.11 |
| R-RUM | 108 | 3.45 | 0.0947 | 3.21 | 0.2237 | 81.1406 | 0.5061 | 0 | 0.0441 | 11,278.34 | 11,494.35 | 11,949.52 |
(a) #par., number of parameters; (b) Max.z(r) & Max.z(l), maximum z-score for transformed correlation and log odds ratio; (c) M.
Wald statistics for multi-attribute items.
| Item 3 | DINO | 5.51 | 0.06 |
| Item 4 | DINA | 3.57 | 0.17 |
| Item 10 | LLM | 0.00 | 0.99 |
| Item 11 | DINO | 1.16 | 0.56 |
| Item 13 | DINA | 2.69 | 0.26 |
| Item 16 | DINA | 0.17 | 0.92 |
| Item 19 | DINO | 2.73 | 0.26 |
Model fit of mixed-CDMs.
| 102 | 2.78 | 0.9120 | 2.72 | 1.0000 | 92.1224 | 0.3609 | 0.0097 | 0.0435 | 11,274.36 | 11,478.36 | 11,908.25 |
Absolute item-level fit.
| G-DINA | 3.12 | 0.03 | 2.89 | 0.07 | 3.12 | 0.03 | 2.89 | 0.07 |
| LLM | 3.14 | 0.03 | 2.92 | 0.06 | 3.14 | 0.03 | 2.92 | 0.06 |
| Mixed-CDMs | 2.79 | 0.10 | 2.58 | 0.18 | 2.79 | 0.10 | 2.58 | 0.18 |
Psychometric characteristics under both models.
| G-DINA | 0.76 | 2.13 | 0.7176 | 0.8731 | 0.9119 | 0.9107 | 0.9253 | 0.9140 | 0.9152 | 0.9084 |
| LLM | 0.74 | 2.09 | 0.7954 | 0.8790 | 0.9931 | 0.9168 | 0.9274 | 0.9956 | 0.9197 | 0.9386 |
| Mixed- CDMs | 0.73 | 2.05 | 0.7157 | 0.8713 | 0.9158 | 0.9096 | 0.9083 | 0.9093 | 0.9138 | 0.9047 |
Item parameters estimates (EST) and standard errors (SE) of multi-attribute items.
| Item 3 | A2 + A4 | EST | 0.48 | 0.82 | 0.57 | 0.89 | 0.47 | 0.88 | 0.88 | 0.88 | 0.45 | 0.61 | 0.82 | 0.89 |
| SE | 0.05 | 0.13 | 0.14 | 0.02 | 0.04 | 0.02 | 0.02 | 0.02 | 0.04 | 0.06 | 0.05 | 0.02 | ||
| Item 4 | A2 + A4 | EST | 0.34 | 0.17 | 0.57 | 0.71 | 0.35 | 0.35 | 0.35 | 0.74 | 0.33 | 0.31 | 0.71 | 0.69 |
| SE | 0.04 | 0.15 | 0.14 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.04 | 0.05 | 0.05 | 0.03 | ||
| Item 10 | A4 + A5 | EST | 0.22 | 1.00 | 0.37 | 0.66 | 0.29 | 0.63 | 0.36 | 0.71 | 0.32 | 0.66 | 0.32 | 0.67 |
| SE | 0.05 | 0.23 | 0.07 | 0.03 | 0.05 | 0.09 | 0.06 | 0.03 | 0.05 | 0.07 | 0.05 | 0.03 | ||
| Item 11 | A2 + A6 | EST | 0.23 | 0.79 | 1.00 | 0.89 | 0.26 | 0.90 | 0.90 | 0.90 | 0.31 | 0.38 | 0.89 | 0.92 |
| SE | 0.04 | 0.10 | 0.26 | 0.03 | 0.04 | 0.02 | 0.02 | 0.02 | 0.04 | 0.06 | 0.04 | 0.02 | ||
| Item 13 | A4 + A5 | EST | 0.43 | 0.44 | 0.26 | 0.87 | 0.42 | 0.42 | 0.42 | 0.89 | 0.40 | 0.87 | 0.35 | 0.85 |
| SE | 0.06 | 0.17 | 0.08 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.05 | 0.04 | 0.05 | 0.03 | ||
| Item 16 | A3 + A5 | EST | 0.34 | 0.29 | 0.35 | 0.82 | 0.34 | 0.34 | 0.34 | 0.84 | 0.28 | 0.73 | 0.37 | 0.80 |
| SE | 0.06 | 0.13 | 0.06 | 0.03 | 0.03 | 0.03 | 0.03 | 0.03 | 0.04 | 0.06 | 0.04 | 0.03 | ||
| Item 19 | A4 + A5 | EST | 0.59 | 0.74 | 0.95 | 0.97 | 0.58 | 0.97 | 0.97 | 0.97 | 0.61 | 0.88 | 0.86 | 0.97 |
| SE | 0.06 | 0.14 | 0.05 | 0.01 | 0.05 | 0.01 | 0.01 | 0.01 | 0.05 | 0.05 | 0.03 | 0.01 | ||
EST, estimates; SE, standard error; P(11) refers to the probability of the right answer (PRA) to the item when two attributes are mastered; P(10) stands for the PRA when the first attribute is mastered; P(01) is the PRA when the second attribute is mastered.
Items tapping into attributes A2 and A4.
| 3. M: Nancy, why are you late today? | The item requires two attributes, A2 (Vocabulary and Expressions: overslept) and A4 (Facts and Details: missed the bus). A test taker would find answer key (c) if he/she only knows A2 (overslept), as it means the same as the answer key. However, if he/she only knows A4 (missed the bus), he/she would find (b) or (c), which means he/she has around 50% probability to find the right answer (c). |
| 4. M: Where is Cindy? | A test taker would find answer key (b) only if he/she knows both the attributes, A2 (Vocabulary and Expressions: ran out of…) and A4 (Facts and Details: milk). |