CUSUM-Based Person-Fit Statistics for Adaptive Testing

Item scores that do not fit an assumed item response theory model may cause the latent trait value to be inaccurately estimated. Several person-fit statistics for detecting nonfitting score patterns for paper-and-pencil tests have been proposed. In the context of computerized adaptive tests (CAT), t...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:Journal of Educational and Behavioral Statistics. - SAGE Publishing, 1976. - 26(2001), 2, Seite 199-217
Weitere Verfasser: Meijer, Rob R.
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2001
Zugriff auf das übergeordnete Werk:Journal of Educational and Behavioral Statistics
Schlagworte:Appropriateness Measurement Computer Adaptive Testing Cumulative Sum Item Response Theory Person Fit Mathematics Information science Education Behavioral sciences
Beschreibung
Zusammenfassung:Item scores that do not fit an assumed item response theory model may cause the latent trait value to be inaccurately estimated. Several person-fit statistics for detecting nonfitting score patterns for paper-and-pencil tests have been proposed. In the context of computerized adaptive tests (CAT), the use of person-fit analysis has hardly been explored. Because it has been shown that the distribution of existing person-fit statistics is not applicable in a CAT, in this study new person-fit statistics are proposed and critical values for these statistics are derived from existing statistical theory. Statistics are proposed that are sensitive to runs of correct or incorrect item scores and are based on all items administered in a CAT or based on subsets of items, using observed and expected item scores and using cumulative sum (CUSUM) procedures. The theoretical and empirical distributions of the statistics are compared and detection rates are investigated. Results showed that the nominal and empirical Type I error rates were comparable for CUSUM procedures when the number of items in each subset and the number of measurement points were not too small. Detection rates of CUSUM procedures were superior to other fit statistics. Applications of the statistics are discussed.
ISSN:19351054