The purpose of this study is to compare human raters with automatic scoring in terms of examinees’ ability estimation under an IRT-based rater model. Each speaking item is analyzed with two kinds of IRT models: one without rater effects and one with rater effects. Differences in rating design may substantially increase the bias in examinees’ ability estimation.
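
The abstract does not name the specific models used. As a rough illustration only, the contrast between a model without and with a rater effect can be sketched with a dichotomous Rasch-type model in which a rater severity term is subtracted from the examinee's ability. The function name `p_positive`, the parameter `rho`, and the dichotomous simplification are assumptions made for this sketch, not the study's specification.

```python
import math


def p_positive(theta, b, rho=0.0):
    """Probability of a positive rating under a Rasch-type model.

    theta: examinee ability
    b:     item difficulty
    rho:   rater severity (hypothetical term; rho = 0 reduces to the
           model without a rater effect)
    """
    return 1.0 / (1.0 + math.exp(-(theta - b - rho)))


# Same examinee and item, scored without and with a severe rater.
without_rater = p_positive(theta=0.5, b=0.0)
with_severe_rater = p_positive(theta=0.5, b=0.0, rho=0.5)
print(without_rater, with_severe_rater)
```

In this sketch, a rater whose severity is ignored shifts the observed ratings without the model accounting for it, which is one way ability estimates could become biased when raters are not balanced across examinees.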