
Leveraging LLM respondents for item evaluation: A psychometric analysis

open access: yes, British Journal of Educational Technology, Volume 56, Issue 3, Page 1028-1052, May 2025.
Effective educational measurement relies heavily on the curation of well‐designed item pools. However, item calibration is time consuming and costly, requiring a sufficient number of respondents to estimate the psychometric properties of items. In this study, we explore the potential of six different large language models (LLMs; GPT‐3.5, GPT‐4, Llama 2,
Yunting Liu   +2 more
wiley   +1 more source

Estimating the reliability of composite scores [PDF]

open access: yes, 2009
In situations where multiple tests are administered (such as the GCSE subjects), scores from individual tests are frequently combined to produce a composite score.
He, Qingping
core  

A convexity‐constrained parameterization of the random effects generalized partial credit model

open access: yes, British Journal of Mathematical and Statistical Psychology, Volume 78, Issue 2, Page 401-419, May 2025.
An alternative closed‐form expression for the marginal joint probability distribution of item scores under the random effects generalized partial credit model is presented. The closed‐form expression involves a cumulant generating function and is therefore subjected to convexity constraints.
David J. Hessen
wiley   +1 more source

Measurement properties of the Swedish version of the Knee Self‐Efficacy Scale: A Rasch analysis

open access: yes, Journal of Experimental Orthopaedics, Volume 12, Issue 2, April 2025.
Purpose: This study aimed to evaluate measurement properties of the Swedish version of the Knee Self‐Efficacy Scale 18 items version (K‐SES18) in patients after anterior cruciate ligament (ACL) reconstruction using Rasch measurement theory (RMT). Method: Data were extracted from Project ACL, a rehabilitation registry.
Ramana Piussi   +3 more
wiley   +1 more source

Analisis Politomi Rasch Model Skala PTSD [Polytomous Rasch Model Analysis of a PTSD Scale]

open access: yes, Jurnal Konseling dan Pendidikan
Post-Traumatic Stress Disorder (PTSD) is a significant mental health issue, particularly in Indonesia with its complex cultural diversity. This study aims to assess the validity of the Indonesian Version DSM-V PTSD scale using an Item Response Theory ...
I Wayan Indra Praekanata   +3 more
doaj   +1 more source

Investigating invariant item ordering in the Mental Health Inventory : an illustration of the use of different methods [PDF]

open access: yes, 2014
Invariant item ordering is a property of scales whereby the items are scored in the same order across a wide range of the latent trait and across a wide range of respondents.
Meijer, Rob R.   +3 more
core   +1 more source

Refinement and Validation of a New Patient‐Reported Experience Measure for Hearing Loss (My Hearing PREM)

open access: yes, Health Expectations, Volume 28, Issue 2, April 2025.
Context: Patient‐reported experience measures (PREMs) generate insights into daily challenges experienced when living with a chronic condition and experiences of care. There are no validated PREMs to measure the experience of hearing loss. Objective: The aim of this study was to evaluate the psychometric properties of a newly developed tool, ‘My
Sian K. Smith   +4 more
wiley   +1 more source

LOO and WAIC as Model Selection Methods for Polytomous Items [PDF]

open access: yes, arXiv, 2018
Watanabe-Akaike information criterion (WAIC; Watanabe, 2010) and leave-one-out cross validation (LOO) are two fully Bayesian model selection methods that have been shown to perform better than other traditional information-criterion based model selection methods such as AIC, BIC, and DIC in the context of dichotomous IRT model selection. In this paper,
arxiv  

Incorporating Test‐Taking Engagement into Multistage Adaptive Testing Design for Large‐Scale Assessments

open access: yes, Journal of Educational Measurement, Volume 62, Issue 1, Page 57-80, Spring 2025.
The use of multistage adaptive testing (MST) has gradually increased in large‐scale testing programs as MST achieves a balanced compromise between linear test design and item‐level adaptive testing. MST works on the premise that each examinee gives their best effort when attempting the items, and their responses truly reflect what they know or
Okan Bulut, Guher Gorgun, Hacer Karamese
wiley   +1 more source
