A semiannual overview of recent research contributions in cognitive diagnosis, forced-choice formats, and methodological advances in psychology and psychometrics. Each paper is summarized in plain language with key insights highlighted for a broad academic and applied audience.
“Issues and possible solutions in cognitive diagnosis
modeling applications: The case of a large-scale educational assessment
in Mexico”
Annals of Psychology
This study examines the practical challenges that arise when applying cognitive diagnosis models (CDMs) to real data, using a large-scale educational assessment of high-school teachers in Mexico. By identifying common problems encountered in empirical applications, the authors propose methodological and applied solutions that integrate psychometric analyses with expert judgment.
Key points:
“Variable-length cognitive diagnostic computerized adaptive
testing in small-scale assessments”
Journal of Educational and Behavioral Statistics
This study proposes and evaluates Cognitive Diagnostic Computerized Adaptive Testing (CD-CAT) procedures specifically designed for small-sample assessments, where traditional parametric CDM-based methods (e.g., DINA) tend to overfit and overestimate reliability.
Innovations:
“Dimensionality assessment in forced-choice questionnaires:
First steps toward an exploratory framework”
Educational and Psychological Measurement
This paper examines how to accurately assess dimensionality in forced-choice (FC) questionnaires, a key challenge due to ipsativity and the inherently multidimensional nature of FC blocks. Through a large Monte Carlo simulation, the authors evaluate common dimensionality detection methods under realistic FC design conditions.
Contributions:
“Evaluating the performance of R-Squared measures in
multilevel models”
Multivariate Behavioral Research
This paper evaluates how different R² measures for multilevel models (MLMs) perform in finite samples, focusing on the integrative framework proposed by Rights and Sterba. Using extensive Monte Carlo simulations, it examines how sample size, model complexity, ICC, and estimation method (ML vs. REML) affect bias and accuracy.
Contributions:
“Assessing item-level fit for the sequential G-DINA
model”
Behaviormetrika
This paper addresses a gap in diagnostic classification modeling: how to assess whether each item in a test fits the assumed model when responses are graded or sequential (e.g., multi-step open-ended tasks). The authors adapt three fit indices from classical test theory—the chi-squared statistic, likelihood-ratio statistic, and power-divergence index—to work with the Sequential G-DINA model.
Key points:
“A general diagnostic modelling framework for forced-choice
assessments”
British Journal of Mathematical and Statistical Psychology
This paper proposes an extension of cognitive diagnosis models to handle forced-choice (FC) formats, which are used to reduce response biases (e.g., social desirability). It adapts the G-DINA model to handle paired statements, allowing each to measure a different latent trait.
Innovations:
GDINA R package.“Equally vs. unequally keyed blocks in forced-choice
questionnaires: Implications on validity and reliability”
Journal of Personality Assessment
This experimental study compares equally-keyed (homopolar) vs. unequally-keyed (heteropolar) blocks in FC questionnaires measuring the Big Five. Using IRT-based models (specifically MUPP-2PL), they assess how item keying direction impacts reliability, criterion validity, and ipsativity.
Findings:
“Cross-validation and predictive metrics in psychological
research: Do not leave out the leave-one-out”
Behavior Research Methods
This methodological paper critiques standard practices for estimating predictive accuracy in regression models. It proposes a reformulated leave-one-out (LOO) cross-validation approach that computes the out-of-sample R² via a pooled error term (PRESS/MST), solving known biases in conventional CV implementations.
Contributions:
OutR2.These works share a common goal: enhancing the precision, interpretability, and fairness of psychological measurement—whether by improving test models, detecting fit issues, or refining how predictions are evaluated. Each contribution balances rigorous methodology with clear application potential in education, clinical, and organizational contexts.
All articles available upon request or via journal links. Summaries using AI by M. A. Sorrel. For collaboration, contact miguel.sorrel@uam.es.