A Nonparametric Bayesian Item Response Modeling Approach for Clustering Items and Individuals Simultaneously [PDF]
Item response theory (IRT) is a popular modeling paradigm for measuring subject latent traits and item properties according to discrete responses in tests or questionnaires. There are very limited discussions on heterogeneity pattern detection for both items and individuals.
arxiv
Item Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection [PDF]
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this paper, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric properties ...
arxiv
Unfolding-Model-Based Visualization: Theory, Method and Applications [PDF]
Multidimensional unfolding methods are widely used for visualizing item response data. Such methods project respondents and items simultaneously onto a low-dimensional Euclidian space, in which respondents and items are represented by ideal points, with person-person, item-item, and person-item similarities being captured by the Euclidian distances ...
arxiv
Flexible Bayesian modelling in dichotomous item response theory using mixtures of skewed item curves [PDF]
Most Item Response Theory (IRT) models for dichotomous responses are based on probit or logit link functions which assume a symmetric relationship between the probability of a correct response and the latent traits of individuals submitted to a test.
arxiv
A Bayesian Nonparametric IRT Model [PDF]
This paper introduces a flexible Bayesian nonparametric Item Response Theory (IRT) model, which applies to dichotomous or polytomous item responses, and which can apply to either unidimensional or multidimensional scaling. This is an infinite-mixture IRT model, with person ability and item difficulty parameters, and with a random intercept parameter ...
arxiv
Random pairing MLE for estimation of item parameters in Rasch model [PDF]
The Rasch model, a classical model in the item response theory, is widely used in psychometrics to model the relationship between individuals' latent traits and their binary responses on assessments or questionnaires. In this paper, we introduce a new likelihood-based estimator -- random pairing maximum likelihood estimator ($\mathsf{RP\text{-}MLE ...
arxiv
A class of Multidimensional Latent Class IRT models for ordinal polytomous item responses [PDF]
We propose a class of Item Response Theory models for items with ordinal polytomous responses, which extends an existing class of multidimensional models for dichotomously-scored items measuring more than one latent trait. In the proposed approach, the random vector used to represent the latent traits is assumed to have a discrete distribution with ...
arxiv
Bayesian information theoretic model-averaging stochastic item selection for computer adaptive testing: compromise-free item exposure [PDF]
The goal of Computer Adaptive Testing (CAT) is to reliably estimate an individual's ability as modeled by an item response theory (IRT) instrument using only a subset of the instrument's items. A secondary goal is to vary the items presented across different testing sessions so that the sequence of items does not become overly stereotypical -- we want ...
arxiv
Building an Evaluation Scale using Item Response Theory [PDF]
Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards to difficulty and discriminating power.
arxiv
The Impact of Item-Writing Flaws on Difficulty and Discrimination in Item Response Theory [PDF]
High-quality test items are essential for educational assessments, particularly within Item Response Theory (IRT). Traditional validation methods rely on resource-intensive pilot testing to estimate item difficulty and discrimination. More recently, Item-Writing Flaw (IWF) rubrics emerged as a domain-general approach for evaluating test items based on ...
arxiv