Search
On-Site Program Calendar
Browse By Day
Browse By Time
Browse By Person
Browse By Room
Browse By Unit
Browse By Session Type
Search Tips
Change Preferences / Time Zone
Sign In
Bluesky
Threads
X (Twitter)
YouTube
Generative artificial intelligence (GenAI) tools developed to support teaching are widely available. Trustworthiness concerns have prompted calls for researchers to study their effectiveness. We investigated one type of GenAI created to support teachers: multiple-choice question generators (MCQ GenAI). Among the nine MCQ GenAI tools investigated in this study, one indicated teacher involvement, and none mentioned testing experts in development processes. MCQ GenAI-created items (n=90) were coded based on MCQ item-writing guidelines. Results showed 84.44% of items violated at least one guideline with 77.78% likely to produce major measurement error (should not use without revision), 6.67% likely to elicit minor measurement error (consider modifying), and 15.56% acceptable (usable as created). Implications suggest multidisciplinary teams are needed in educational GenAI tool development.
Toni A. May, Binghamton University - SUNY
Gregory E. Stone, University of Toledo
Yiyun Fan, Binghamton University - SUNY
Connor J. Sondergeld, MetriKs
James N. LaPlante, IBM Corporation
Kathleen Provinzano, Binghamton University - SUNY
Kristin L.K. Koskey, Drexel University
Carla C. Johnson, North Carolina State University