z-logo
open-access-imgOpen Access
THE EFFECTS ON OBSERVED‐ AND TRUE‐SCORE EQUATING PROCEDURES OF MATCHING ON A FALLIBLE CRITERION: A SIMULATION WITH TEST VARIATION 1
Author(s) -
Eignor Daniel R.,
Stocking Martha L.,
Cook Linda L.
Publication year - 1990
Publication title -
ets research report series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.235
H-Index - 5
ISSN - 2330-8516
DOI - 10.1002/j.2333-8504.1990.tb01361.x
Subject(s) - equating , statistics , matching (statistics) , mathematics , test score , test (biology) , item response theory , econometrics , variable (mathematics) , psychometrics , standardized test , rasch model , paleontology , mathematical analysis , biology
Two recent simulation studies were conducted to aid in the diagnosis and interpretation of equating differences found between random and matched (nonrandom) samples for four commonly used equating procedures: Tucker, Levine equally reliable, and Chained equipercentile observed‐score procedures and the 3PL IRT true‐score equating procedure. For these simulations, test forms were equated to themselves, a situation that does not pattern reality. In the current simulation, test variation was added as an additional variable for study. The results of the current simulation confirmed the results of the previous two simulations and support the prediction based on theoretical grounds that observed‐score equating methods, such as Tucker and Chained equipercentile, are more affected by sample variation than are a true‐score method (IRT) or an observed‐score method based on true‐score assumptions (Levine equally reliable). The results further suggest that matching equating samples on the basis of a fallible measure of ability, such as anchor test score, is not advisable for any equating method studied except possibly the Tucker method.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here