z-logo
open-access-imgOpen Access
AN EXPLORATORY STUDY OF CHARACTERISTICS RELATED TO IRT PARAMETER INVARIANCE WITH THE TEST OF ENGLISH AS A FOREIGN LANGUAGE
Author(s) -
Way Walter D.,
Carey Patricia A.,
GolubSmith Marna L.
Publication year - 1992
Publication title -
ets research report series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.235
H-Index - 5
ISSN - 2330-8516
DOI - 10.1002/j.2333-8504.1992.tb01474.x
Subject(s) - test of english as a foreign language , equating , test (biology) , psychology , reading comprehension , item response theory , statistics , mathematics education , exploratory research , item analysis , comprehension , reading (process) , rasch model , computer science , language assessment , psychometrics , mathematics , linguistics , developmental psychology , paleontology , philosophy , sociology , anthropology , biology , programming language
IRT equating methods have been used successfully with the TOEFL ® test for many years, and for the most part the observed properties of items have been consistent with model predictions. However, items that do not appear to hold their IRT pretest estimates do exist. If relationships can be found between features of TOEFL items in pretest calibrations and subsequent lack of model‐data fit when these items are used in final forms, steps to eliminate the use of such items in TOEFL final forms can be taken. The purpose of this study was to provide an exploratory investigation of item features that may contribute to a lack of invariance of TOEFL item parameters. The results of the study indicated the following: (1) subjective and quantitative measures developed for the study provided consistent information related to the model‐data fit of TOEFL test items, (2) for Sections 1 and 2, items that were pretested before 1986 exhibited poorer model‐data fit than items that were pretested after 1986, and (3) for Section 3 reading comprehension, model‐data fit appeared to be related to changes in the relative position of items within the sections from the pretest to the final form administrations. Based on the results of the study, it was recommended that (1) the TOEFL program investigate the feasibility of not using pretest IRT statistics for items pretested before 1986 for Sections 1 and 2 and (2) that guidelines be developed for test developers to use with reading comprehension items to limit the change in relative positions of items in the test from pretest to final form administrations.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here