Package 'PerFit'

Title: Person Fit
Description: Several person-fit statistics (PFSs; Meijer and Sijtsma, 2001, <doi:10.1177/01466210122031957>) are offered. These statistics allow assessing whether individual response patterns to tests or questionnaires are (im)plausible given the other respondents in the sample or given a specified item response theory model. Some PFSs apply to dichotomous data, such as the likelihood-based PFSs (lz, lz*) and the group-based PFSs (personal biserial correlation, caution index, (normed) number of Guttman errors, agreement/disagreement/dependability statistics, U3, ZU3, NCI, Ht). PFSs suitable to polytomous data include extensions of lz, U3, and (normed) number of Guttman errors.
Authors: Jorge N. Tendeiro
Maintainer: Jorge N. Tendeiro <[email protected]>
License: GPL (>=2)
Version: 1.4.6
Built: 2024-11-16 03:50:38 UTC
Source: https://github.com/jorgetendeiro/perfit

Help Index


Person Fit

Description

Person fit consists of a set of techniques aimed at detecting unusual responses to tests or questionnaires. There are several person-fit statistics available in the literature, see Karabatsos (2003) and Meijer and Sijtsma (2001) for comprehensive reviews. Both dichotomous and polytomous types of items are considered. This R-package outputs the values of the chosen person-fit statistic, the IDs of the respondents that were flagged, and plots the sample distribution of the scores of the person-fit statistic. Nonparametric person response functions (Sijtsma and Meijer, 2001) may also be requested in order to help interpreting individual answering behaviors (dichotomous data only).

Details

Package: PerFit
Type: Package
Title: Person Fit
Version: 1.4.6
Date: 2021-10-14
Author: Jorge N. Tendeiro
Maintainer: Jorge N. Tendeiro <[email protected]>
Description: Several person-fit statistics (PFSs; Meijer and Sijtsma, 2001, <doi:10.1177/01466210122031957>) are offered. These statistics allow assessing whether individual response patterns to tests or questionnaires are (im)plausible given the other respondents in the sample or given a specified item response theory model. Some PFSs apply to dichotomous data, such as the likelihood-based PFSs (lz, lz*) and the group-based PFSs (personal biserial correlation, caution index, (normed) number of Guttman errors, agreement/disagreement/dependability statistics, U3, ZU3, NCI, Ht). PFSs suitable to polytomous data include extensions of lz, U3, and (normed) number of Guttman errors.
Imports: stats, graphics, fda, Hmisc, irtoys, MASS, Matrix
Depends: ltm, mirt
License: GPL (>=2)
RoxygenNote: 7.1.1
Config/pak/sysreqs: chromium make libicu-dev libssl-dev
Repository: https://jorgetendeiro.r-universe.dev
RemoteUrl: https://github.com/jorgetendeiro/perfit
RemoteRef: HEAD
RemoteSha: c741acec820cb05f694b5fa9f57bf16b83a26c51

Index of help topics:

A.KB                    Agreement, disagreement, and dependability
                        statistics
Cstar                   C.Sato, Cstar person-fit statistics
G                       Number of Guttman errors
Gpoly                   Number of Guttman errors for polytomous items
Ht                      Ht person-fit statistic
InadequacyData          The NPV-J inadequacy scale data
IntelligenceData        Intelligence data (number completion)
NCI                     NCI person-fit statistic
PRFplot                 Person response function (PRF)
PerFit-package          Person Fit
PerFit.PFS              Compute several person-fit statistics
PerFit.SE               Compute standard errors for person fit
                        statistics
PhysFuncData            The SF-36 physical functioning data
U3                      U3, ZU3 person-fit statistics
U3poly                  U3poly person-fit statistic
cutoff                  Compute a cutoff value given the scores of a
                        person-fit statistic
flagged.resp            Find (potentially) aberrant response patterns
lz                      lz and lzstar person-fit statistics
lzpoly                  lzpoly person-fit statistic
plot.PerFit             Plot method for objects of class "PerFit"
print.PerFit            Print method for objects of class "PerFit"
r.pbis                  Personal biserial statistic
summary.PerFit          Summary method for objects of class "PerFit"

The PerFit package contains several person-fit functions. The goal is to detect response vectors that seem to be strange in terms of the sample of respondents or in terms to an item response theory (IRT) model.

There are many person-fit statistics available in the literature. Statistics are typically categorized according to the type of items (Dicho = dichotomous, Poly = polytomous) and the type of IRT model (NParam=nonparametric, Param=parametric) that they apply to. The current version of PerFit includes the following statistics:

Person-fit statistic (R function) Reference Type item Type IRT model
r.pbis Donlon and Fisher (1968) Dicho NParam
C.Sato Sato (1975) Dicho NParam
G, Gnormed van der Flier (1977), Meijer (1994) Dicho NParam
A.KB, D.KB, E.KB Kane and Brennan (1980) Dicho NParam
U3, ZU3 van der Flier (1980, 1982) Dicho NParam
Cstar Harnisch and Linn (1981) Dicho NParam
NCI Tatsuoka and Tatsuoaka (1982, 1983) Dicho NParam
lz Drasgow, Levine, and Williams (1985) Dicho Param
lzpoly Drasgow, Levine, and Williams (1985) Poly Param
Ht Sijtsma (1986) Dicho NParam
Gpoly Molenaar (1991) Poly NParam
Gnormed.poly Molenaar (1991), Emons (2008) Poly NParam
lzstar Snijders (2001) Dicho Param
U3poly Emons (2008) Poly NParam

All functions above have an output of class PerFit.

The package provides other functions that help analyzing the data when conducting person-fit analyses:

Function Description
cutoff Estimate cutoff values for the person-fit statistics, to be used as decision rules.
flagged.resp Identify which respondents were flagged according to the chosen cutoff.
plot (class PerFit) Plot the distribution of person-fit scores with the cutoff superimposed.
PRFplot Plot the nonparametric person response function (Sijtsma and Meijer, 2001).

More person-fit statistics will be added to the package in future updates.

Versions

  • Version 1.0 (April 2014)

  • Version 1.1 (May 2014)

    Functions plot.PerFit and PRFplot now allow the user to edit the axes labels and the titles.

  • Version 1.2 (August 2014)

    Some output values of some functions were renamed for the sake of consistency. The package documentation was adapted accordingly.

  • Version 1.3 (March 2015)

    The package underwent a major revision:

    • Class PerFit now consists of a list with 12 objects.

    • New methods for objects of class PerFit were added (summary, print).

    • Routines accomodating for missing values were added.

    • Function cutoff was updated. Now, model-fitting item response patterns are generated in order to find the cutoff value.

    • Function plot.PerFit now allows displaying a bootstrap percentile confidence interval for the cutoff statistic, as well as ticks marking the flagged respondents.

    • Person response functions are now approximated by functional data objects computed by means of the fda package. The functional data objects are returned to the user.

    • Standard errors for the person-fit statistics are now available (see function PerFit.SE).

    • Many control checks were added throughout the entire code.

  • Version 1.4 (July 2015)

    • The default missing values approach is now pairwise elimination. The imputation methods introduced with version 1.3.1 are still available.

    • A bug was removed from function lzpoly() (many thanks to Marco Bressan for spotting it!).

    • The PerFit-package.Rd file was updated (version 1.4.2).

    • A bug in function U3poly was fixed (version 1.4.3, October 2018). All credit goes to Marek Muszynski for spotting the issue.

    • Recent changes to base plot() broke function PRFplot(). This was now fixed (version 1.4.4, January 2021). I thank Julie Webbs for pointing this out.

    • Version 1.4.5, February 2021: Updated affiliation.

    • Version 1.4.6, October 2021: Updated internal function estIP.poly() to properly manage cases where users supply item and/or person parameters.

Author(s)

Jorge N. Tendeiro

Maintainer: Jorge N. Tendeiro <[email protected]>

References

Donlon, T. F., and Fischer, F. E. (1968) An index of an individual's agreement with group-defined item difficulties. Educational and Psychological Measurement, 28(1), 105–113.

Drasgow, F., Levine, M. V., and Williams, E. A. (1985) Appropriateness measurement with polychotomous item response models and standardized indices. British Journal of Mathematical and Statistical Psychology, 38(1), 67–86.

Emons, W. M. (2008) Nonparametric person-fit analysis of polytomous item scores. Applied Psychological Measurement, 32(3), 224–247.

Harnisch, D. L., and Linn, R. L. (1981) Analysis of item response patterns: Questionable test data and dissimilar curriculum practices. Journal of Educational Measurement, 18(3), 133–146.

Kane, M. T., and Brennan, R. L. (1980) Agreement coefficients as indices of dependability for domain-referenced tests. Applied Psychological Measurement, 4(1), 105–126.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R. (1994) The number of Guttman errors as a simple and powerful person-fit statistic. Applied Psychological Measurement, 18(4), 311–314.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

Molenaar, I. W. (1991) A weighted Loevinger H-coefficient extending Mokken scaling to multicategory items. Kwantitatieve Methoden, 12(37), 97–117.

Sato, T. (1975) The construction and interpretation of S-P tables. Tokyo: Meiji Tosho.

Sijtsma, K. (1986) A coefficient of deviance of response patterns. Kwantitatieve Methoden, 7, 131–145.

Sijtsma, K., and Meijer, R. R. (2001) The person response function as a tool in person-fit research. Psychometrika, 66(2), 191–207.

Snijders, T. B. (2001) Asymptotic null distribution of person fit statistics with estimated person parameter. Psychometrika, 66(3), 331–342.

Tatsuoka, K. K., and Tatsuoka, M. M. (1982) Detection of aberrant response patterns and their effect on dimensionality. Journal of Educational Statistics, 7(3), 215–231.

Tatsuoka, K. K., and Tatsuoka, M. M. (1983) Spotting erroneous rules of operation by the individual consistency index. Journal of Educational Measurement, 20(3), 221–230.

Tendeiro, J. N., Meijer, R. R., and Niessen, A. S. M. (2016). PerFit: An R Package for Person-Fit Analysis in IRT. Journal of Statistical Software, 74(5), 1–27.

van der Flier, H. (1977) Environmental factors and deviant response patterns. In Y. H. Poortinga (Ed.), Basic problems in cross-cultural psychology. Amsterdam: The Netherlands.

van der Flier, H. (1980) Vergelijkbaarheid van individuele testprestaties [Comparability of individual test performance]. Lisse: The Netherlands.

van der Flier, H. (1982) Deviant response patterns and comparability of test scores. Journal of Cross-Cultural Psychology, 13(3), 267–298.

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# As an example, compute the Ht person-fit scores:
Ht.out <- Ht(InadequacyData)
# Ht.out$PFscores

# Compute the cutoff value at 1% level:
set.seed(124) # To fix the random seed generator.
Ht.cut <- cutoff(Ht.out, Blvl=.01)

# Plot the sample distribution of the Ht scores with the above cutoff superimposed:
plot(Ht.out, cutoff.obj=Ht.cut)

# Determine which respondents were flagged by Ht at 1% level:
flagged.resp(Ht.out, cutoff.obj=Ht.cut, scores=FALSE)
# Flagged respondents: 30, 37, 46, 49,...

# Plot the person response function of respondent 30 (flagged as aberrant):
Resp30 <- PRFplot(InadequacyData, respID=30)
# Plot the person response function of respondent 35 (not flagged as aberrant):
Resp35 <- PRFplot(InadequacyData, respID=35)

Agreement, disagreement, and dependability statistics

Description

Kane and Brennan's person-fit statistics.

Usage

A.KB(matrix,
     NA.method = "Pairwise", Save.MatImp = FALSE, 
     IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML", 
     mu = 0, sigma = 1)

D.KB(matrix,
     NA.method = "Pairwise", Save.MatImp = FALSE, 
     IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML", 
     mu = 0, sigma = 1)

E.KB(matrix,
     NA.method = "Pairwise", Save.MatImp = FALSE, 
     IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML", 
     mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

Kane and Brennan (1980) discussed the agreement, disagreement, and dependability statistics. Assume that the items are ordered in increasing difficulty order (i.e., according to decreasing proportion-correct score). The agreement statistic for respondent nn is

A.KB=iXipi,A.KB=\sum_i{X_i}p_i,

where XiX_i is the 0-1 score of respondent nn on item ii and pip_i is the proportion-correct score of item ii.

The disagreement statistic is

D.KB=A.KB(max)A.KB,D.KB = A.KB(max) - A.KB,

where A.KB(max)A.KB(max) is the maximum value of A.KB given respondent nn's total score.

The dependability statistic is

E.KB=A.KB/A.KB(max).E.KB = A.KB / A.KB(max).

Small values of A.Kb and E.KB (i.e., in the left tail of the sampling distribution) are (potentially) indicative of aberrant response behavior. Large values of D.Kb (i.e., in the right tail of the sampling distribution) are (potentially) indicative of aberrant response behavior. These statistics are not computed for rows of matrix that consist of only 0s or only 1s (NA values are returned instead).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.

ID.all0s

Row indices of all-0s response vectors removed from the analysis (if applicable).

ID.all1s

Row indices of all-1s response vectors removed from the analysis (if applicable).

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories (2 in this case).

IRT.PModel

The parametric IRT model used in case NA.method="PModel", otherwise NULL.

IP

The IIx3 matrix of estimated item parameters in case NA.method="PModel", otherwise NULL.

Ability.PModel

The method used to estimate abilities in case NA.method="PModel", otherwise NULL.

Ability

The vector of NN estimated ability parameters in case NA.method="PModel", otherwise NULL.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Kane, M. T., and Brennan, R. L. (1980) Agreement coefficients as indices of dependability for domain-referenced tests. Applied Psychological Measurement, 4(1), 105–126.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the A.KB, D.KB, and E.KB scores:
A.out <- A.KB(InadequacyData); A.out
D.out <- D.KB(InadequacyData); D.out
E.out <- E.KB(InadequacyData); E.out

Compute a cutoff value given the scores of a person-fit statistic

Description

Compute a cutoff value given the scores of a person-fit statistic.

Usage

cutoff(x, 
       ModelFit = "NonParametric", Nreps=1000,
       IP=x$IP, IRT.PModel=x$IRT.PModel, Ability=x$Ability,
       Ability.PModel=x$Ability.PModel, mu=0, sigma=1, 
       Blvl = 0.05, Breps = 1000, CIlvl = 0.95, 
       UDlvl = NA)

Arguments

x

Object of class "PerFit".

ModelFit

Method required to compute model-fitting item score patterns. The options available are "NonParametric" (default) and "Parametric".

Nreps

Number of model-fitting item score patterns generated. Default is 1000.

IP

Matrix with previously estimated item parameters. Default is x$IP.

IRT.PModel

Parametric IRT model (required if "ModelFit=Parametric" or if the person fit statistic is parametric). Default is x$IRT.PModel.

Ability

Matrix with previously estimated item parameters. Default is x$Ability.

Ability.PModel

Method to use in order to estimate the latent ability parameters (required if "ModelFit=Parametric" or if the person fit statistic is parametric). Default is x$Ability.PModel.

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Blvl

Significance level for bootstrap distribution (value between 0 and 1). Default is 0.05.

Breps

Number of bootstrap resamples. Default is 1000.

CIlvl

Level of bootstrap percentile confidence interval for the cutoff statistic.

UDlvl

User-defined cutoff level.

Details

This function computes a reference value (referred to as a 'cutoff') associated to the values of a person-fit statistic computed from a sample. The idea is to create a decision rule: Individual person-fit values at or more extreme than the cutoff result in flagging the corresponding respondents as (potentially) displaying aberrant response behavior.

Depending on the person-fit statistic, an "extreme" score might be a very small (e.g., for Ht) or a very large (e.g., for G) value. The cutoff function routinely reports of which type the person-fit statistic being used is (Tail="lower" or Tail="upper", respectively).

The procedure consists of generating Nreps model-fitting item response vectors based on parametric model parameters (when ModelFit="Parametric") or on proportion of respondents per answer category (when ModelFit="NonParametric"). This allows computing a sample of Nreps values of the person fit statistic corresponding to model-fitting item response patterns. A bootstrap procedure is then used to approximate the sampling distribution of the quantile of level Blvl (resp., 1-Blvl) for "lower" (resp. "upper") types of person fit statistics, based on Breps resamples. The cutoff (and its standard error) is given by the median (standard deviation) of this bootstrap distribution. Alternatively, the cutoff can be manually entered by the user (e.g., when it is available from prior data calibration) by means of UDlvl.

Value

An object of class "PerFit.cutoff", which is a list with 5 elements:

Cutoff

Numeric. Value of the computed cutoff.

Cutoff.SE

Numeric. Bootstrap estimated standard error of the cutoff value.

Prop.flagged

Numeric. Proportion of respondents flagged (that is, with person-fit scores at or more extreme than the cutoff).

Tail

String with values "lower" or "upper". It indicates the type of person-fit statistic.

Cutoff.CI

Numeric. Percentile bootstrap (CIlvl)% confidence interval for the cutoff value.

Author(s)

Jorge N. Tendeiro [email protected]

See Also

flagged.resp, plot.PerFit, PRFplot

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# As an example, compute the Ht person-fit scores:
Ht.out <- Ht(InadequacyData)

# Compute the cutoff value at 1% level:
cutoff(Ht.out, Blvl=.01)

Find (potentially) aberrant response patterns

Description

Find which respondents in the sample were flagged by the specified person-fit statistic.

Usage

flagged.resp(x, cutoff.obj=NULL, 
             scores=TRUE, ord=TRUE,
             ModelFit="NonParametric", Nreps=1000,
             IP=x$IP, IRT.PModel=x$IRT.PModel, Ability=x$Ability,
             Ability.PModel=x$Ability.PModel, mu=0, sigma=1, 
             Blvl = 0.05, Breps = 1000, CIlvl = 0.95, 
             UDlvl=NA)

Arguments

x

Object of class "PerFit".

cutoff.obj

Object of class "PerFit.cutoff".

scores

Logical: Should item scores of flagged respondents be shown in the output? Default is TRUE.

ord

Logical: Should items be ordered in increasing order of difficulty (i.e., in decreasing proportion-correct order)? Default is TRUE. Only used if scores=TRUE.

ModelFit

Method required to compute model-fitting item score patterns. The options available are "NonParametric" (default) and "Parametric".

Nreps

Number of model-fitting item score patterns generated. Default is 1000.

IP

Matrix with previously estimated item parameters. Default is x$IP.

IRT.PModel

Parametric IRT model (required if "ModelFit=Parametric" or if the person fit statistic is parametric). Default is x$IRT.PModel.

Ability

Matrix with previously estimated item parameters. Default is x$Ability.

Ability.PModel

Method to use in order to estimate the latent ability parameters (required if "ModelFit=Parametric" or if the person fit statistic is parametric). Default is x$Ability.PModel.

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Blvl

Significance level for bootstrap distribution (value between 0 and 1). Default is 0.05.

Breps

Number of bootstrap resamples. Default is 1000.

CIlvl

Level of bootstrap percentile confidence interval for the cutoff statistic.

UDlvl

User-defined cutoff level.

Details

This function finds the respondents in the dataset that were flagged by the person-fit statistic. This statistic is specified by means of the "PerFit" class object x (x$PFStatistic).

The cutoff score may be provided by means of the cutoff.obj object, otherwise it is internally computed (for which the function parameters ModelFit through CIlvl are required; see cutoff for more details).

If scores=TRUE then the respondents' item scores will be shown in the output, either in the original item order (ord=FALSE) or in increasing difficulty order (ord=TRUE).

Value

If scores=FALSE the output is a list with 3 elements:

PFSscores

A two-column matrix with the row index and the value of the person-fit statistic for the flagged respondents.

Cutoff.lst

The corresponding PerFit.cutoff object.

PFS

The person-fit statistic.

If scores=TRUE the output is a list with four elements:

Scores

Matrix with columns: FlaggedID, item scores (It**), and PFscores.

MeanItemValue

The items mean value (which is nothing more than the proportion-correct for dichotomous items).

Cutoff.lst

The corresponding PerFit.cutoff object.

PFS

The person-fit statistic.

Author(s)

Jorge N. Tendeiro [email protected]

See Also

cutoff, plot.PerFit, PRFplot

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# As an example, compute the Ht person-fit scores:
Ht.out <- Ht(InadequacyData)
Ht.out$PFscores

# Estimate the cutoff value at 1% level:
Ht.cut <- cutoff(Ht.out, Blvl=.01)

# Determine which respondents were flagged by Ht at 1% level:
flagged.resp(Ht.out, Ht.cut, scores=FALSE)$PFSscores

Number of Guttman errors

Description

van der Flier's statistics based on the number of Guttman errors.

Usage

G(matrix,
  NA.method = "Pairwise", Save.MatImp = FALSE, 
  IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
  mu = 0, sigma = 1)

Gnormed(matrix,
        NA.method = "Pairwise", Save.MatImp = FALSE,
        IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
        mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

Consider the items' proportion-correct scores, p1,p2,,pIp_1,p_2,\ldots,p_I (II = number of items). A Guttman error consists of an item score pair (Xi,Xj)=(0,1)(X_i,X_j)=(0,1) with pi>pjp_i>p_j. Hence, there is a Guttman error when an easier item is answered incorrectly and a more difficult item is answered correctly.

G counts the number of (0,1) pairs given that the items are ordered in decreasing proportion-correct scores order. However, G depends on the total number of items for a given number-correct score. In particular, for a number-correct SS, G has maximum equal to S(IS)S(I-S). Gnormed was created to bound G between 0 and 1 by dividing it by its maximum (conditional on the number-correct score). Hence, (potentially) aberrant response behavior is indicated by large values of G/Gnormed (i.e., in the right tail of the sampling distribution).

Gnormed is perfectly linearly related to Tatsuoka and Tatsuoka's (1982, 1983) NCI statistic (NCI=12GnormedNCI = 1-2Gnormed).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.

ID.all0s

Row indices of all-0s response vectors removed from the analysis (if applicable).

ID.all1s

Row indices of all-1s response vectors removed from the analysis (if applicable).

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories (2 in this case).

IRT.PModel

The parametric IRT model used in case NA.method="PModel", otherwise NULL.

IP

The IIx3 matrix of estimated item parameters in case NA.method="PModel", otherwise NULL.

Ability.PModel

The method used to estimate abilities in case NA.method="PModel", otherwise NULL.

Ability

The vector of NN estimated ability parameters in case NA.method="PModel", otherwise NULL.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R. (1994) The number of Guttman errors as a simple and powerful person-fit statistic. Applied Psychological Measurement, 18(4), 311–314.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

Tatsuoka, K. K., and Tatsuoka, M. M. (1982) Detection of aberrant response patterns and their effect on dimensionality. Journal of Educational Statistics, 7(3), 215–231.

Tatsuoka, K. K., and Tatsuoka, M. M. (1983) Spotting erroneous rules of operation by the individual consistency index. Journal of Educational Measurement, 20(3), 221–230.

van der Flier, H. (1977) Environmental factors and deviant response patterns. In Y. H. Poortinga (Ed.), Basic problems in cross-cultural psychology. Amsterdam: The Netherlands.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

NCI

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the G scores:
G.out <- G(InadequacyData)

# Compute the Gnormed scores:
Gnormed.out <- Gnormed(InadequacyData)

Number of Guttman errors for polytomous items

Description

Molenaar (1991) and Emons (2008) statistics, based on the number of Guttman errors, for polytomous items.

Usage

Gpoly(matrix, Ncat, 
     NA.method = "Pairwise", Save.MatImp = FALSE, 
     IP = NULL, IRT.PModel = "GRM", Ability = NULL, Ability.PModel = "EAP")

Gnormed.poly(matrix, Ncat, 
             NA.method = "Pairwise", Save.MatImp = FALSE,
             IP = NULL, IRT.PModel = "GRM", Ability = NULL, Ability.PModel = "EAP")

Arguments

matrix

A data matrix of polytomous item scores: Persons as rows, items as columns, item scores are integers between 0 and (Ncat-1), missing values allowed.

Ncat

Number of answer options for each item.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item. The first (Ncat-1) columns contain the between-categories threshold parameters (for the GRM) or the item step difficulties (for the PCM and the GPCM). The last, Ncat-th, column has the slopes.

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "PCM", "GPCM", and "GRM" (default).

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "EB", "EAP" (default), and "MI".

Details

Molenaar (1991) generalized the G person-fit statistic to polytomous items, Gpoly. The idea is based on the so-called item-step difficulty, which is the probability of moving from answer category (c)(c) to answer category (c+1)(c+1) (c=0,,Ncat2c=0,\ldots,Ncat-2).

Just like G, Gpoly depends on the test length. Emons (2008) developed Gnormed.poly, which is a normalized version of Gpoly.

Aberrant response behavior is (potentially) indicated by large values of Gpoly/Gnormed.poly (i.e., in the right tail of the sampling distribution).

The number of answer options, Ncat, is the same for all items.

Gpoly reduces to G, and Gnormed.poly reduces to Gnormed, when Ncat=2.

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from multinomial distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from multinomial distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "PCM", "GPCM", or "GRM"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

Not applicable.

ID.all0s

Not applicable.

ID.all1s

Not applicable.

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories.

IRT.PModel

The parametric IRT model used.

IP

The IIxNcat matrix of estimated item parameters.

Ability.PModel

The method used to estimate abilities used.

Ability

The vector of NN estimated ability parameters.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Emons, W. M. (2008) Nonparametric person-fit analysis of polytomous item scores. Applied Psychological Measurement, 32(3), 224–247.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R. (1994) The number of Guttman errors as a simple and powerful person-fit statistic. Applied Psychological Measurement, 18(4), 311–314.

Molenaar, I. W. (1991) A weighted Loevinger H-coefficient extending Mokken scaling to multicategory items. Kwantitatieve Methoden, 12(37), 97–117.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

G, Gnormed

Examples

# Load the physical functioning data (polytomous item scores):
data(PhysFuncData)

# Compute the Gpoly scores:
Gpoly.out <- Gpoly(PhysFuncData, Ncat=3)

# Compute the Gnormed.poly scores:
Gnormedpoly.out <- Gnormed.poly(PhysFuncData, Ncat=3)

Ht person-fit statistic

Description

Sijtsma's Ht person-fit statistic.

Usage

Ht(matrix,
   NA.method = "Pairwise", Save.MatImp = FALSE,
   IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
   mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

Sijtsma (1986) adapted a statistic introduced by Mokken (1971) that originally allowed assessing the scalability of an item to the Guttman (1944, 1950) model. The same statistic was applied by Sijtsma to the transposed data in order to detect respondents that would not comply with the Guttman model. Assume, without loss of generality, that the rows of the data matrix are ordered by increasing order of total score SnS_n (n=1,,Nn=1,\ldots,N). The statistic formula is given by the ratio

Ht=cov(xn,r(n))covmax(xn,r(n)),Ht=\frac{cov(x_n,r_{(n)})}{cov_{max}(x_n,r_{(n)})},

where r(n)r_{(n)} is the vector of total item scores computed excluding respondent nn and the denominator is the maximum covariance given the marginal. Hence, Ht is actually similar to Sato's C.Sato.

Ht is maximum 1 for respondent nn when no respondent with a total score smaller/larger than SnS_n can answer an item correctly/incorrectly that respondent nn has answered incorrectly/correctly, respectively. Ht equals zero when the average covariance of the response pattern of respondent nn with the other response patterns equals zero. Hence, (potentially) aberrant response behavior is indicated by small values of Ht (i.e., in the left tail of the sampling distribution). The Ht statistic was shown to perform relatively well in several simulation studies (Karabatsos, 2003; Sijtsma, 1986; Sijtsma and Meijer, 1992, Tendeiro and Meijer, 2014).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.

ID.all0s

Row indices of all-0s response vectors removed from the analysis (if applicable).

ID.all1s

Row indices of all-1s response vectors removed from the analysis (if applicable).

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories (2 in this case).

IRT.PModel

The parametric IRT model used in case NA.method="PModel", otherwise NULL.

IP

The IIx3 matrix of estimated item parameters in case NA.method="PModel", otherwise NULL.

Ability.PModel

The method used to estimate abilities in case NA.method="PModel", otherwise NULL.

Ability

The vector of NN estimated ability parameters in case NA.method="PModel", otherwise NULL.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Guttman, L. (1944) A basis for scaling qualitative data. American Sociological Review, 9, 139-150.

Guttman, L. (1950) The basis for scalogram analysis. In S. A. Stouffer, L. Guttman, E. A. Suchman, P. F. Lazarsfeld, S. A. Star & J. A. Claussen (Eds.), Measurement and precision (pp. 60-90). Princeton NJ: Princeton University Press.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Mokken, R. J. (1971) A theory and procedure of scale analysis. Berlin, Germany: De Gruyter.

Sijtsma, K. (1986) A coefficient of deviance of response patterns. Kwantitatieve Methoden, 7, 131–145.

Sijtsma, K., and Meijer, R. R. (1992) A method for investigating the intersection of item response functions in Mokken's nonparametric IRT model. Applied Psychological Measurement, 16(2), 149-157.

Tendeiro, J. N., and Meijer, R. R. (2014) Detection of Invalid Test Scores: The Usefulness of Simple Nonparametric Statistics. Journal of Educational Measurement, 51(3), 239-259.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

C.Sato

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the Ht scores:
Ht.out <- Ht(InadequacyData)

The NPV-J inadequacy scale data

Description

The NPV-J (Dutch: Junior Nederlandse Persoonlijkheidsvragenlijst, NPV-J; Luteijn, van Dijk, and Barelds, 2005) is a large Dutch personality inventory. The NPV-J consists of 105 mostly positively formulated items and is intended to determine how adolescents between 9 and 15 years of age judge their own behavior. The NPV-J has five subscales; the InadequacyScale data concern scores of 806 adolescents on 28 items measuring inadequacy (one of the subscales). The data are dichotomous (0 = Disagree, 1 = Agree). The original sample consisted of 866 respondents, however 60 all-0s or all-1s response vectors were removed from the data.

Usage

data(InadequacyData)

Format

A 806x28 matrix of dichotomous item scores.

References

Luteijn, F., van Dijk, H., and Barelds, D. P. H. (2005) NPV-J: Junior Nederlandse Persoonlijkheidsvragenlijst. Herziene handleiding 2005. Amsterdam: Harcourt Assessments B.V..

Meijer, R. R., and Tendeiro, J. N. (2012) The use of the lz and lz* person-fit statistics and problems derived from model misspecification. Journal of Educational and Behavioral Statistics, 37(6), 758–766.

Examples

data(InadequacyData)

Intelligence data (number completion)

Description

The data are dichotomous scores of a Dutch intelligence test on number completion (Dutch: "Cijferreeksen", Drenth and Hoolwerf, 1970). The file consists of archival data that were collected in a high-stakes personnel selection context around 1990.

Usage

data(IntelligenceData)

Format

A 1000x26 matrix of dichotomous item scores.

References

Drenth, P. J. D., and Hoolwerf, G. (1970) Numerieke aanleg test - Cijferreeksen (NAT-Cijferreeksen) [Numerical ability test]. Amsterdam: The Netherlands.

Examples

data(IntelligenceData)

lz and lzstar person-fit statistics

Description

Compute the lz (Drasgow, Levine, and Williams, 1985) and the lzstar (Snijders, 2001) person-fit statistics.

Usage

lz(matrix,
   NA.method = "Pairwise", Save.MatImp = FALSE,
   IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
   mu = 0, sigma = 1)

lzstar(matrix,
       NA.method = "Pairwise", Save.MatImp = FALSE,
       IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
       mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

Drasgow et al. (1985) introduced one of the most used person-fit statistics, lz. This statistic is the standardized log-likelihood of the respondent's response vector. lz is (supposed to be) asymptotically standard normally distributed.

The computation of lz requires that both item and ability parameters are available. Function lz allows to user to enter his/her own item and ability parameter estimates (variables IP and Ability, respectively). Alternatively, lz relies on functions available through the irtoys package for estimating the parameters. Specifically, the user can choose one from three possible IRT models to fit the data: IRT.PModel="1PL", IRT.PModel="2PL", or IRT.PModel="3PL". As for estimating the ability parameters there are three possible methods: Ability.PModel="ML" (maximum likelihood), Ability.PModel="BM" (Bayes modal), or Ability.PModel="WL" (weighted likelihood).

It was later observed by several researchers (e.g., Molenaar and Hoijtink, 1990) that the asymptotic approximation only holds when true ability values are used. This limitation was overcome by Snijders (2001), who further developed lz into the lzstar statistic. An accessible paper that thoroughly explains the basic principles behind lzstar is Magis, Raiche, and Beland (2012). It is important to realize that not all item and/or ability estimation procedures can be used when computing lzstar. In particular, the estimation of the ability parameters is constrained (see Snijders, 2001, Equation 5). The lzstar algorithm internally estimates the ability parameters accordingly for one of three possible methods: Ability.PModel="ML" (maximum likelihood), Ability.PModel="BM" (Bayes modal), or Ability.PModel="WL" (weighted likelihood), see Magis et al. (2012). The user may provide his or her own ability estimates in case they are available by means of other software. In this case it is necessary to specify the method that was used for the estimation (ML, BM, or WL) using the argument Ability.PModel.

Aberrant response behavior is (potentially) indicated by small values of lz/lzstar (i.e., in the left tail of the sampling distribution).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.

ID.all0s

Row indices of all-0s response vectors removed from the analysis (if applicable).

ID.all1s

Row indices of all-1s response vectors removed from the analysis (if applicable).

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories (2 in this case).

IRT.PModel

The parametric IRT model used.

IP

The IIx3 matrix of estimated item parameters.

Ability.PModel

The method used to estimate abilities used.

Ability

The vector of NN estimated ability parameters.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Drasgow, F., Levine, M. V., and Williams, E. A. (1985) Appropriateness measurement with polychotomous item response models and standardized indices. British Journal of Mathematical and Statistical Psychology, 38(1), 67–86.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Magis, D., Raiche, G., and Beland, S. (2012) A didactic presentation of Snijders's l[sub]z[/sub] index of person fit with emphasis on response model selection and ability estimation. Journal of Educational and Behavioral Statistics, 37(1), 57–81.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

Molenaar, I. W., and Hoijtink, H. (1990) The many null distributions of person fit indices. Psychometrika, 55(1), 75–106.

Snijders, T. B. (2001) Asymptotic null distribution of person fit statistics with estimated person parameter. Psychometrika, 66(3), 331–342.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

lzpoly

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the lz scores using a subsample of the first 200 response vectors:
lz.out <- lz(InadequacyData[1:200,])
# Use parameters estimated externally (in this case item parameters estimated by mirt):
mod <- mirt(InadequacyData[1:200,], 1)
ip.mirt <- coef(mod, IRTpars = TRUE, simplify = TRUE, digits = Inf)$items[,c('a', 'b', 'g')]
lz.out2 <- lz(InadequacyData[1:200,], IP = ip.mirt)

# Compute the lzstar scores using a subsample of the first 200 response vectors:
lzstar.out <- lzstar(InadequacyData[1:200,])

lzpoly person-fit statistic

Description

Compute the lzpoly (Drasgow, Levine, and Williams, 1985) person-fit statistic.

Usage

lzpoly(matrix, Ncat, 
       NA.method = "Pairwise", Save.MatImp = FALSE,
       IP = NULL, IRT.PModel = "GRM", Ability = NULL, Ability.PModel = "EAP")

Arguments

matrix

A data matrix of polytomous item scores: Persons as rows, items as columns, item scores are integers between 0 and (Ncat-1), missing values allowed.

Ncat

Number of answer options for each item.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item. The first (Ncat-1) columns contain the between-categories threshold parameters (for the GRM) or the item step difficulties (for the PCM and the GPCM). The last, Ncat-th, column has the slopes.

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "PCM", "GPCM", and "GRM" (default).

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "EB", "EAP" (default), and "MI".

Details

Statistic lzpoly is the natural extension of lz to polytomously scores items. In this case the user can choose one from three possible IRT models to fit the data: The partial credit model (IRT.PModel="PCM"), the generalized partial credit model (IRT.PModel="GPCM"), or the graded response model (IRT.PModel="GRM"). Ability parameters can be estimated by means of one of three methods: Empirical Bayes (Ability.PModel="EB"), expected a posteriori (Ability.PModel="EAP"), or multiple imputation (Ability.PModel="MI").

The estimation of the model parameters is based on the ltm package. This function will estimate the item and ability parameters when both sets of parameters are missing. It will also estimate one set of parameters in case only the other set is provided. It is possible that some estimation convergence problems occur that may break the function. In this case it is advisable to estimate the model parameters externally and then to run this function with those estimates provided via the commands IP and Ability.

Aberrant response behavior is (potentially) indicated by small values of lzpoly (i.e., in the left tail of the sampling distribution).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from multinomial distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from multinomial distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "PCM", "GPCM", or "GRM"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

Not applicable.

ID.all0s

Not applicable.

ID.all1s

Not applicable.

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories.

IRT.PModel

The parametric IRT model used.

IP

The IIxNcat matrix of estimated item parameters.

Ability.PModel

The method used to estimate abilities used.

Ability

The vector of NN estimated ability parameters.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Drasgow, F., Levine, M. V., and Williams, E. A. (1985) Appropriateness measurement with polychotomous item response models and standardized indices. British Journal of Mathematical and Statistical Psychology, 38(1), 67–86.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Magis, D., Raiche, G., and Beland, S. (2012) A didactic presentation of Snijders's l[sub]z[/sub] index of person fit with emphasis on response model selection and ability estimation. Journal of Educational and Behavioral Statistics, 37(1), 57–81.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

Molenaar, I. W., and Hoijtink, H. (1990) The many null distributions of person fit indices. Psychometrika, 55(1), 75–106.

Snijders, T. B. (2001) Asymptotic null distribution of person fit statistics with estimated person parameter. Psychometrika, 66(3), 331–342.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

lz,lzstar

Examples

# Load the physical functioning data (polytomous item scores):
data(PhysFuncData)

# Compute the lzpoly scores:
lzpoly.out <- lzpoly(PhysFuncData,Ncat=3)

NCI person-fit statistic

Description

Tatsuoka and Tatsuoka's NCI statistic.

Usage

NCI(matrix,
    NA.method = "Pairwise", Save.MatImp = FALSE, 
    IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
    mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

The NCI person-fit statistic was introduced by Tatsuoka and Tatsuoka (1982, 1983). It is perfectly linearly related to van der Flier's (1977) Gnormed statistic (NCI=12GnormedNCI = 1-2Gnormed), see G for mathematical details.

NCI equals 1 for perfect Guttman vectors (i.e., when only the easiest items are answered correctly, given the total score) and equals -1 for reversed Guttman vectors (i.e., when only the hardest items are answered correctly, given the total score). Hence, (potentially) aberrant response behavior is indicated by small values of NCI (i.e., in the left tail of the sampling distribution).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.

ID.all0s

Row indices of all-0s response vectors removed from the analysis (if applicable).

ID.all1s

Row indices of all-1s response vectors removed from the analysis (if applicable).

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories (2 in this case).

IRT.PModel

The parametric IRT model used in case NA.method="PModel", otherwise NULL.

IP

The IIx3 matrix of estimated item parameters in case NA.method="PModel", otherwise NULL.

Ability.PModel

The method used to estimate abilities in case NA.method="PModel", otherwise NULL.

Ability

The vector of NN estimated ability parameters in case NA.method="PModel", otherwise NULL.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

Tatsuoka, K. K., and Tatsuoka, M. M. (1982) Detection of aberrant response patterns and their effect on dimensionality. Journal of Educational Statistics, 7(3), 215–231.

Tatsuoka, K. K., and Tatsuoka, M. M. (1983) Spotting erroneous rules of operation by the individual consistency index. Journal of Educational Measurement, 20(3), 221–230.

van der Flier, H. (1977) Environmental factors and deviant response patterns. In Y. H. Poortinga (Ed.), Basic problems in cross-cultural psychology. Amsterdam: The Netherlands.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

G

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the NCI scores:
NCI.out <- NCI(InadequacyData)

Compute several person-fit statistics

Description

Compute several person-fit statistics.

Usage

PerFit.PFS(matrix, method=NULL, simplified=TRUE, 
       NA.method = "Pairwise", Save.MatImp = FALSE,
       IP = NULL, IRT.PModel = NULL, Ability = NULL, Ability.PModel = NULL,
       mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

method

Vector of person-fit statistics to be computed.

simplified

Logical. If FALSE, a list of PerFit objects is returned. If TRUE (default), a matrix with person-fit scores per respondent (one column per statistic) is reported.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL).

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL).

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

Function PerFit.PFS is a wrapper allowing to compute more than one person-fit statistic simultaneously.

Value

If simplified=TRUE, a N-by-m data frame is returned, where N is the number of respondents and m is the number of methods.

If simplified=FALSE a list of m PerFit objects is returned.

Author(s)

Jorge N. Tendeiro [email protected]

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the lzstar, U3, and Ht scores:
PerFit.PFS(InadequacyData, method=c("lzstar", "U3", "Ht"))

Compute standard errors for person fit statistics

Description

Compute standard errors for person fit statistics.

Usage

PerFit.SE(x)

Arguments

x

Object of class "PerFit".

Details

Function PerFit.SE computes jackknife standard errors for the scores of the person fit statistic in object x.

Value

A matrix with two columns: PFscores shows the values of the person-fit statistic and PFscores.SE shows the estimated standard errors.

Author(s)

Jorge N. Tendeiro [email protected]

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the Ht scores:
Ht.out <- Ht(InadequacyData)

# Compute the SEs:
Ht.SE <- PerFit.SE(Ht.out)
Ht.SE

Personal biserial statistic

Description

Donlon and Fischer's personal biserial statistic.

Usage

r.pbis(matrix,
       NA.method = "Pairwise", Save.MatImp = FALSE,
       IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
       mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

Donlon and Fischer (1968) suggested to use the correlation between a respondent's score vector and the item proportion-correct scores in the sample as a measure of person fit. Low values should be indicative of misfit of the response vector with respect to the group of respondents.

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.

ID.all0s

Row indices of all-0s response vectors removed from the analysis (if applicable).

ID.all1s

Row indices of all-1s response vectors removed from the analysis (if applicable).

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories (2 in this case).

IRT.PModel

The parametric IRT model used in case NA.method="PModel", otherwise NULL.

IP

The IIx3 matrix of estimated item parameters in case NA.method="PModel", otherwise NULL.

Ability.PModel

The method used to estimate abilities in case NA.method="PModel", otherwise NULL.

Ability

The vector of NN estimated ability parameters in case NA.method="PModel", otherwise NULL.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Donlon, T. F., and Fischer, F. E. (1968) An index of an individual's agreement with group-defined item difficulties. Educational and Psychological Measurement, 28(1), 105–113.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the r.pbis scores:
rpbis.out <- r.pbis(InadequacyData)

The SF-36 physical functioning data

Description

These data are from the Physical Functioning scale of the SF-36 (Ware and Sherbourne, 1992). Data consist of scores of 714 respondents on 10 polytomously scored items (0 = no, not limited at all; 1 = limited a little; 2 = limited a lot).

Usage

data(PhysFuncData)

Format

A 714x10 matrix of polytomous item scores (scores 0, 1, and 2).

References

Ware, J. E., Jr., and Sherbourne, C. D. (1992) The MOS 36-item short-form health survey (SF-36): Conceptual framework and item selection. Medical Care, 30, 473–483.

Examples

data(PhysFuncData)

Plot method for objects of class "PerFit"

Description

Plot method for objects of class "PerFit".

Usage

## S3 method for class 'PerFit'
plot(x, cutoff.obj=NULL, 
       ModelFit="NonParametric", Nreps=1000, 
       IP=x$IP, IRT.PModel=x$IRT.PModel, Ability=x$Ability, 
       Ability.PModel=x$Ability.PModel, mu=0, sigma=1, 
       Blvl = 0.05, Breps = 1000, CIlvl = 0.95, 
       UDlvl = NA, 
       Type="Density", Both.scale=TRUE, Cutoff=TRUE, Cutoff.int=TRUE,
       Flagged.ticks = TRUE, 
       Xlabel=NA, Xcex=1.5, title=NA, Tcex=1.5,
       col.area="lightpink", col.hist="lightblue", col.int="darkgreen",
       col.ticks="red", ...)

Arguments

x

Object of class "PerFit".

cutoff.obj

Object of class "PerFit.cutoff".

ModelFit

Method required to compute model-fitting item score patterns. The options available are "NonParametric" (default) and "Parametric".

Nreps

Number of model-fitting item score patterns generated. Default is 1000.

IP

Matrix with previously estimated item parameters. Default is x$IP.

IRT.PModel

Parametric IRT model (required if "ModelFit=Parametric" or if the person fit statistic is parametric). Default is x$IRT.PModel.

Ability

Matrix with previously estimated item parameters. Default is x$Ability.

Ability.PModel

Method to use in order to estimate the latent ability parameters (required if "ModelFit=Parametric" or if the person fit statistic is parametric). Default is x$Ability.PModel.

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Blvl

Significance level for bootstrap distribution (value between 0 and 1). Default is 0.05.

Breps

Number of bootstrap resamples. Default is 1000.

CIlvl

Level of bootstrap percentile confidence interval for the cutoff statistic.

UDlvl

User-defined cutoff level.

Type

Type of plot: "Histogram", "Density" (default), "Both".

Both.scale

Logical: Should the y-axis be adjusted so that both the histogram and the density graphics are completely visible? Default is TRUE.

Cutoff

Logical: Should the estimated cutoff be added to the plot? Default is TRUE.

Cutoff.int

Logical: Should an approximated (1-Blvl)% bootstrap confidence interval be added to the plot? Default is TRUE.

Flagged.ticks

Logical: Should ticks representing the flagged respondents be added to the plot? Default is TRUE.

Xlabel

Label of x-axis, otherwise a default label is shown.

Xcex

Font size of the label of x-axis.

title

Title of the plot, otherwise a default title is shown.

Tcex

Font size of the title of the plot.

col.area

Color of "flagging region".

col.hist

Color of histogram.

col.int

Color of bootstrap confidence interval.

col.ticks

Color of the ticks marking the flagged respondents.

...

Extra graphical parameters to be passed to plot.

Details

This function plots the empirical distribution of the scores of the person-fit statistic specified by the "PerFit" class object x. A histogram, density, or a combination of both displays is possible.

The cutoff score may be provided by means of the cutoff.obj object, otherwise it is internally computed (for which the function parameters ModelFit through CIlvl are required; see cutoff for more details). The value of the cutoff is superimposed to the plot when Cutoff=TRUE. In this case, the adequate "flagging region" is colored, thus indicating the range of values for which the person-fit statistic flags respondents as potentially displaying aberrant behavior. The option Both.scale was introduced to help to better tune the scale of the y-axis. Furthermore, the percentile confidence interval for the cutoff value (with confidence level defined by the cutoff.obj) is displayed in the x-axis, and ticks marking the flagged respondents are display on the top of the plot.

Author(s)

Jorge N. Tendeiro [email protected]

See Also

cutoff, flagged.resp, PRFplot

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the ZU3 scores:
ZU3.out <- ZU3(InadequacyData)

# Plot the sampling distribution of the ZU3 scores, with cutoff value based on a nominal 5% level,
# and 90% confidence interval:
plot(ZU3.out, Type="Both", Blvl=.05, CIlvl = 0.90)

Person response function (PRF)

Description

Plot the nonparametric person response function with variability bands.

Usage

PRFplot(matrix, respID, h=.09, N.FPts=101, 
        VarBands=FALSE, VarBands.area=FALSE, alpha=.05,
        Xlabel=NA, Xcex=1.5, Ylabel=NA, Ycex=1.5, title=NA, Tcex=1.5,
        NA.method="Pairwise", Save.MatImp=FALSE, 
        IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
        mu = 0, sigma = 1, 
        message = TRUE)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

respID

Vector specifying the respondents for whom PRFs are to be computed.

h

Bandwidth value. Default is 0.09.

N.FPts

Number of (equidistant) focal points in the [0,1] interval. Default is 101.

VarBands

Logical: Draw the (1-alpha) variability bands? Default is FALSE.

VarBands.area

Logical: Draw the area between the (1-alpha) variability bands? Default is FALSE.

alpha

Significance level for the variability bands. Default is 0.05.

Xlabel

Define label of x-axis, otherwise a default label is shown.

Xcex

Font size of the label of x-axis.

Ylabel

Define label of y-axis, otherwise a default label is shown.

Ycex

Font size of the label of y-axis.

title

Define the title of the plot, otherwise a default title is shown.

Tcex

Font size of the title of the plot.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

message

Display prompt message (one per person)? Default is TRUE.

Details

Function PRFplot displays the so-called nonparametric person response functions (PRFs; Emons, Sijtsma, and Meijer, 2004; Sijtsma and Meijer, 2001), for the respondents identified in respID. The PRF relates item difficulty (0-1 range on the x-axis) with the associated probability of correct response (on the y-axis). The PRF is typically nonincreasing, implying that the probability of answering increasingly difficult items should (at least) not increase. The code is based on nonparametric kernel smoothing (Emons et al., 2004). The value of the PRF at each focal point (representing a difficulty parameter between 0 and 1) is estimated as a weighted sum score, where scores pertaining to items with difficulty close to the focal point are given the largest weights. The weights are functions of the Gaussian kernel function. It is necessary to specify a bandwidth value (h) in order to compute the weights. The h value controls the trade-off between bias and sampling variation (Emons et al., 2004). Small h values reduce bias but increase variance, leading to PRFs that capture too much measurement error. Large h values, on the other hand, increase bias which renders PRFs that are often too flat, thus missing potentially relevant misfitting response behavior. Therefore, it is important to carefuly specify the value h. Emons et al. (2004, pp. 10-13), after a simulation study, advised that "h values between 0.07 and 0.11 are reasonable choices".

Moreover, variability bands of level alpha (0.05 by default) can also be added to the plot. These bands are computed following the jackknife procedure explained in Emons et al. (2004).

The PRFs and variability bands for each respondent are approximated by means of functional data objects (e.g., Ramsay, Hooker, and Graves, 2009), with the help of the fda package. This procedure follows two steps:

  • Compute a B-splines basis system. This basis consists of a set of (thirteen) piecewise polinomials, all of degree three/order four (i.e., cubic polinomial segments), with one knot per break point. This allows any two consecutive splines, sp1 and sp2, with common break point BP, verifying sp1(BP) = sp2(BP), sp1'(BP) = sp2'(BP), and sp1”(BP) = sp2”(BP). At 0 and 1 (extremes of the x-range), four (= order) knots are used.

  • Specify coefficients c for the B-splines basis system computed above and then create functional data objects. Based on smoothing using regression analysis (Ramsay et al., 2009, section 4.3).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

PRFplot returns three functional data objects (for the PRFs, lower-bound of the variability bands, and upper-band of the variability bands) for all respondents in the sample.

Value

The output is a list with three functional data objects of class fd (see package fda):

PRF.FDO

Functional data object of the PRFs for the entire sample.

VarBandsLow.FDO

Functional data object of the lower-bound of the variability bands for the entire sample.

VarBandsHigh.FDO

Functional data object of the upper-bound of the variability bands for the entire sample.

Author(s)

Jorge N. Tendeiro [email protected]

References

Emons, W. M., Sijtsma, K., and Meijer, R. R. (2004) Testing hypotheses about the person-response function in person-fit analysis. Multivariate Behavioral Research, 39(1), 1–35.

Ramsay, J. O., Hooker, G., and Graves, S. (2009) Functional data analysis with R and MATLAB. New York: US.

Sijtsma, K., and Meijer, R. R. (2001) The person response function as a tool in person-fit research. Psychometrika, 66(2), 191–207.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

cutoff, plot.PerFit, flagged.resp

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# As an example, compute the Ht person-fit scores:
Ht.out <- Ht(InadequacyData)
Ht.out$PFscores

# Determine which respondents were flagged by Ht at 1% level:
set.seed(124) # To fix the random seed generator.
Ht.flagged <- flagged.resp(Ht.out, Blvl=.01, scores=FALSE)
Ht.flagged <- Ht.flagged$PFSscores[,1]
# Flagged respondents: 30  37  46  49 137 216 531.

# Plot the PRFs of the first three flagged respondents:
Flagged    <- PRFplot(InadequacyData, respID=Ht.flagged[1:3])
# Plot the person response function of respondent 35 (not flagged as aberrant):
PRFplot(InadequacyData, respID=35)
# Plot the PRFs of all respondents:
plot(Flagged$PRF.FDO)

Print method for objects of class "PerFit"

Description

Print method for objects of class "PerFit".

Usage

## S3 method for class 'PerFit'
print(x, ...)

Arguments

x

Object of class "PerFit".

...

Additional arguments to be passed to print.

Details

For a given object of class PerFit, this function displays the scores of the person-fit statistic.

Author(s)

Jorge N. Tendeiro [email protected]

See Also

cutoff, flagged.resp, plot.PerFit, summary.PerFit

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the ZU3 scores:
ZU3.out <- ZU3(InadequacyData)

print(ZU3.out)

Summary method for objects of class "PerFit"

Description

Summary method for objects of class "PerFit".

Usage

## S3 method for class 'PerFit'
summary(object, cutoff.obj=NULL, 
        ModelFit="NonParametric", Nreps=1000, 
        IP=object$IP, IRT.PModel=object$IRT.PModel, Ability=object$Ability,
        Ability.PModel=object$Ability.PModel, mu=0, sigma=1, 
        Blvl = 0.05, Breps = 1000, CIlvl = 0.95, 
        UDlvl = NA, ...)

Arguments

object

Object of class "PerFit".

cutoff.obj

Object of class "PerFit.cutoff".

ModelFit

Method required to compute model-fitting item score patterns. The options available are "NonParametric" (default) and "Parametric".

Nreps

Number of model-fitting item score patterns generated. Default is 1000.

IP

Matrix with previously estimated item parameters. Default is object$IP.

IRT.PModel

Parametric IRT model (required if "ModelFit=Parametric" or if the person fit statistic is parametric). Default is object$IRT.PModel.

Ability

Matrix with previously estimated item parameters. Default is object$Ability.

Ability.PModel

Method to use in order to estimate the latent ability parameters (required if "ModelFit=Parametric" or if the person fit statistic is parametric). Default is object$Ability.PModel.

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Blvl

Significance level for bootstrap distribution (value between 0 and 1). Default is 0.05.

Breps

Number of bootstrap resamples. Default is 1000.

UDlvl

User-defined cutoff level.

CIlvl

Level of bootstrap percentile confidence interval for the cutoff statistic.

...

Additional arguments to be passed to summary.

Details

For a given object of class PerFit, this function prints: The PFS used, the cutoff value, the tail of the distribution of the person-fit statistic associated to misfit, the proportion of flagged respondents in the sample, and their row indices.

Author(s)

Jorge N. Tendeiro [email protected]

See Also

cutoff, flagged.resp, plot.PerFit, summary.PerFit

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the ZU3 scores:
ZU3.out <- ZU3(InadequacyData)

summary(ZU3.out)

C.Sato, Cstar person-fit statistics

Description

Computes the caution statistic C.Sato and the modified caution statistic Cstar.

Usage

C.Sato(matrix,
       NA.method = "Pairwise", Save.MatImp = FALSE, 
       IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
       mu = 0, sigma = 1)

Cstar(matrix,
     NA.method = "Pairwise", Save.MatImp = FALSE, 
     IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
     mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

The C.Sato statistic (also refered to as C in the literature) was proposed by Sato (1975):

C.Sato=1cov(xn,p)cov(xn,p),C.Sato = 1-\frac{cov(x_n,p)}{cov(x_n^*,p)},

where xnx_n is the 0-1 response vector of respondent nn, pp is the vector of item proportions-correct, and xnx_n^* is the so-called Guttman vector containing correct answers for the easiest items (i.e., with the largest proportion-correct values) only. C.Sato is zero for Guttman vectors and its value tends to increase for response vectors that depart from the group's answering pattern, hence warning the researcher to be cautious about interpreting such item scores. Therefore, (potentially) aberrant response behavior is indicated by large values of C.Sato (i.e., in the right tail of the sampling distribution).

Harnisch and Linn (1981) proposed a modified version of the caution statistic which bounds the caution statistic between 0 and 1 (also referred to as C* or MCI in the literature):

Cstar=cov(xn,p)cov(xn,p)cov(xn,p)cov(xn,p),Cstar = \frac{cov(x_n^*,p)-cov(x_n,p)}{cov(x_n^*,p)-cov(x_n',p)},

where xnx_n' is the reversed Guttman vector containing correct answers for the hardest items (i.e., with the smallest proportion-correct values) only. Cstar is sensitive to the so-called Guttman errors. A Guttman error is a pair of scores (0,1), where the 0-score pertains to the easiest item and the 1-score pertains to the hardest item. Cstar ranges between 0 (perfect Guttman vector) and 1 (reversed Guttman error), thus larger values indicate potential aberrant response behavior.

These statistics are not computed for rows of matrix that consist of only 0s or only 1s (NA values are returned instead).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.

ID.all0s

Row indices of all-0s response vectors removed from the analysis (if applicable).

ID.all1s

Row indices of all-1s response vectors removed from the analysis (if applicable).

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories (2 in this case).

IRT.PModel

The parametric IRT model used in case NA.method="PModel", otherwise NULL.

IP

The IIx3 matrix of estimated item parameters in case NA.method="PModel", otherwise NULL.

Ability.PModel

The method used to estimate abilities in case NA.method="PModel", otherwise NULL.

Ability

The vector of NN estimated ability parameters in case NA.method="PModel", otherwise NULL.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Harnisch, D. L., and Linn, R. L. (1981) Analysis of item response patterns: Questionable test data and dissimilar curriculum practices. Journal of Educational Measurement, 18(3), 133–146.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

Sato, T. (1975) The construction and interpretation of S-P tables. Tokyo: Meiji Tosho.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

Ht

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the C.Sato scores:
C.out <- C.Sato(InadequacyData)

# Compute the Cstar scores:
Cstar.out <- Cstar(InadequacyData)

U3, ZU3 person-fit statistics

Description

van der Flier's U3 and ZU3 person-fit statistics.

Usage

U3(matrix,
   NA.method = "Pairwise", Save.MatImp = FALSE, 
   IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
   mu = 0, sigma = 1)

ZU3(matrix,
    NA.method = "Pairwise", Save.MatImp = FALSE, 
    IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
    mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

Suppose the items are ordered in decreasing proportion-correct score, p1>p2>>pIp_1>p_2>\ldots>p_I (II = number of items). Given response vector (X1,X2,,XI)(X_1,X_2,\ldots,X_I) with total score S=iXiS=\sum_iX_i, van der Flier (1980, 1982) defined the U3 statistic as

i=1Spii=1Ipii=1Spii=IS+1Ipi.\frac{\sum_{i=1}^S p_i-\sum_{i=1}^I p_i}{\sum_{i=1}^S p_i-\sum_{i=I-S+1}^I p_i}.

U3 varies from 0 for perfect Guttman response vectors (i.e., with only the easiest items correct) through 1 for reversed Guttman response vectors (i.e., with only the hardest items correct). Hence, increasingly large U3 values provide stronger indications of answering misfit.

U3 scores are dependent on the number-correct score, hence van der Flier proposed ZU3 as a standardization (formulas to compute E(U3) and Var(U3) can be found in van der Flier, 1982). ZU3 is supposed to be asymptotically approximated by the standard normal distribution, but this approximation is not without problems (see Emons, Meijer, and Sijtsma, 2002).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.

ID.all0s

Row indices of all-0s response vectors removed from the analysis (if applicable).

ID.all1s

Row indices of all-1s response vectors removed from the analysis (if applicable).

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories (2 in this case).

IRT.PModel

The parametric IRT model used in case NA.method="PModel", otherwise NULL.

IP

The IIx3 matrix of estimated item parameters in case NA.method="PModel", otherwise NULL.

Ability.PModel

The method used to estimate abilities in case NA.method="PModel", otherwise NULL.

Ability

The vector of NN estimated ability parameters in case NA.method="PModel", otherwise NULL.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Emons, W. M., Meijer, R. R., and Sijtsma, K. (2002). Comparing simulated and theoretical sampling distributions of the U3 person-fit statistic. Applied Psychological Measurement, 26(1), 88–108.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

van der Flier, H. (1980) Vergelijkbaarheid van individuele testprestaties [Comparability of individual test performance]. Lisse: The Netherlands.

van der Flier, H. (1982) Deviant response patterns and comparability of test scores. Journal of Cross-Cultural Psychology, 13(3), 267–298.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

U3poly

Examples

# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the U3 scores:
U3.out <- U3(InadequacyData)

# Compute the ZU3 scores:
ZU3.out <- ZU3(InadequacyData)

U3poly person-fit statistic

Description

Generalization of van der Flier's U3 person-fit statistic to polytomously scored items.

Usage

U3poly(matrix, Ncat, 
       NA.method = "Pairwise", Save.MatImp = FALSE, 
       IP = NULL, IRT.PModel = "GRM", Ability = NULL, Ability.PModel = "EAP")

Arguments

matrix

A data matrix of polytomous item scores: Persons as rows, items as columns, item scores are integers between 0 and (Ncat-1), missing values allowed.

Ncat

Number of answer options for each item.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item. The first (Ncat-1) columns contain the between-categories threshold parameters (for the GRM) or the item step difficulties (for the PCM and the GPCM). The last, Ncat-th, column has the slopes.

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "PCM", "GPCM", and "GRM" (default).

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "EB", "EAP" (default), and "MI".

Details

Emons (2008) generalized the U3 statistic (van der Flier, 1980, 1982) to polytomous items. The idea is based on the so-called item-step difficulty, which is the probability of moving from answer category (c)(c) to answer category (c+1)(c+1) (c=0,,Ncat2c=0,\ldots,Ncat-2).

U3poly varies from 0 (no misfit) through 1 (extreme misfit). Hence, increasingly large U3poly values provide stronger indications of answering misfit.

The number of answer options, Ncat, is the same for all items.

U3poly reduces to U3 when Ncat=2.

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from multinomial distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from multinomial distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "PCM", "GPCM", or "GRM"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length NN (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

Not applicable.

ID.all0s

Not applicable.

ID.all1s

Not applicable.

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories.

IRT.PModel

The parametric IRT model used.

IP

The IIxNcat matrix of estimated item parameters.

Ability.PModel

The method used to estimate abilities used.

Ability

The vector of NN estimated ability parameters.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro [email protected]

References

Emons, W. M. (2008) Nonparametric person-fit analysis of polytomous item scores. Applied Psychological Measurement, 32(3), 224–247.

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

van der Flier, H. (1980) Vergelijkbaarheid van individuele testprestaties [Comparability of individual test performance]. Lisse: The Netherlands.

van der Flier, H. (1982) Deviant response patterns and comparability of test scores. Journal of Cross-Cultural Psychology, 13(3), 267–298.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

U3, ZU3

Examples

# Load the physical functioning data (polytomous item scores):
data(PhysFuncData)

# Compute the U3poly scores:
U3poly.out <- U3poly(PhysFuncData,Ncat=3)