
Stratified Item Selection Methods in Cognitive Diagnosis Computerized Adaptive Testing
Author(s) -
Jing Yang,
Hua Hua Chang,
Jiangchuan Tao,
Ning Shi
Publication year - 2019
Publication title -
applied psychological measurement
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.083
H-Index - 64
eISSN - 1552-3497
pISSN - 0146-6216
DOI - 10.1177/0146621619893783
Subject(s) - computerized adaptive testing , nonparametric statistics , selection (genetic algorithm) , statistics , parametric statistics , computer science , stratification (seeds) , artificial intelligence , mathematics , psychometrics , seed dormancy , botany , germination , dormancy , biology
Cognitive diagnostic computerized adaptive testing (CD-CAT) aims to obtain more useful diagnostic information by taking advantages of computerized adaptive testing (CAT). Cognitive diagnosis models (CDMs) have been developed to classify examinees into the correct proficiency classes so as to get more efficient remediation, whereas CAT tailors optimal items to the examinee's mastery profile. The item selection method is the key factor of the CD-CAT procedure. In recent years, a large number of parametric/nonparametric item selection methods have been proposed. In this article, the authors proposed a series of stratified item selection methods in CD-CAT, which are combined with posterior-weighted Kullback-Leibler (PWKL), nonparametric item selection (NPS), and weighted nonparametric item selection (WNPS) methods, and named S-PWKL, S-NPS, and S-WNPS, respectively. Two different types of stratification indices were used: original versus novel. The performances of the proposed item selection methods were evaluated via simulation studies and compared with the PWKL, NPS, and WNPS methods without stratification. Manipulated conditions included calibration sample size, item quality, number of attributes, number of strata, and data generation models. Results indicated that the S-WNPS and S-NPS methods performed similarly, and both outperformed the S-PWKL method. And item selection methods with novel stratification indices performed slightly better than the ones with original stratification indices, and those without stratification performed the worst.