To determine whether ChatGPT could be used to grade ESL essays, we trained ChatGPT to apply the grading rubric of China’s Test for English Majors-Band 4 and had it score 667 essays from 88 Chinese ESL students. We compared ChatGPT's scores with those of human raters, compared ChatGPT's scores from repeated grading of the same essays, and analyzed ChatGPT's scoring patterns over time. ChatGPT's scores failed to reach acceptable agreement with human raters. ChatGPT also failed to apply the same rubric criteria consistently and showed significant problems in generating reliable and valid scores. Overall, despite its strength in providing rapid qualitative and quantitative evaluation, ChatGPT in its current form does not meet the criteria for a reliable and valid grading tool.