Search
Program Calendar
Browse By Day
Browse By Time
Browse By Person
Browse By Room
Browse By Unit
Browse By Session Type
Search Tips
Personal Schedule
Sign In
Session Type: Coordinated Paper Session
The use of artificial intelligence (AI) to score constructed responses is an area of educational measurement undergoing rapid development. There are new use contexts introduced each day and new AI technology such as generative AI is being incorporated into scoring systems. New use contexts and AI capabilities challenge the “standard practices” for how to best build and evaluate automated scoring models. One leading concern is the fairness of AI scores. While AI affords more agile applications of testing and learning solutions, its major limitation is measurement and algorithmic bias (Johnson et al., 2022). New use contexts allow for unexpected and unanticipated sources of bias to be introduced into scores. This coordinated session will have five papers summarizing a program of research that has investigated how to reduce unfairness in different ways, or from different angles. Two papers will propose methods for engine development and model building (Liu & Fauss; Flor), one paper will investigate methods for detecting subgroup bias (Casabianca), another paper will report on how to follow-up on traditional subgroup analyses by performing differential feature functioning analysis (Choi), and the last paper explores methods for explainable AI and how they can be used to improve transparency and fairness (Zhang).
A Bayesian Nonparametric Model for Flexible Automated Scoring - Xiang Liu, Educational Testing Service; Michael Fauss, ETS
Fairness Aspects of Similarity-based Automated Short-answer Scoring - Michael Flor, EDUCATIONAL TESTING SERVICE
A Comparison of Fairness Evaluation Methods for AI Scores - Jodi M Casabianca; Daniel McCaffrey, EDUCATIONAL TESTING SERVICE; Matthew S Johnson, ETS; Chen Li, ETS
Examining Partial Derivatives to Identify Causes of Differential Prediction Bias in Automated Scores - Ikkyu Choi; Matthew S Johnson, ETS
Explainable AI: Exploring Subgroup Differences in Short-Response Scoring - Mo Zhang, EDUCATIONAL TESTING SERVICE; Chunyi Ruan, ETS; Matthew S Johnson, ETS