Paper Summary
Share...

Direct link:

Hybrid Models for DIF Detection: A Simulation Study of Classical and Machine Learning Approaches

Wed, April 8, 1:45 to 3:15pm PDT (1:45 to 3:15pm PDT), Los Angeles Convention Center, Floor: Level Two, Poster Hall - Exhibit Hall A

Abstract

This simulation study evaluates traditional, machine learning, and hybrid methods for detecting Differential Item Functioning (DIF) in dichotomous test items. Using the 3PL IRT model, datasets were generated under varying test lengths, sample sizes, and DIF types. Logistic regression and IRT residuals effectively detected uniform DIF in larger samples but showed limited power for non-uniform DIF. LASSO regression was highly dependent on regularization strength, underperforming in small samples and overflagging items with relaxed penalties. Random Forests consistently outperformed other methods, identifying the highest number of true DIFs across diverse conditions. Ensemble approaches such as majority voting and hierarchical pipelines further enhanced accuracy. Findings support the use of machine learning and hybrid strategies to ensure fairness in educational and psychological testing.

Authors