Search
On-Site Program Calendar
Browse By Day
Browse By Time
Browse By Person
Browse By Room
Browse By Unit
Browse By Session Type
Search Tips
Change Preferences / Time Zone
Sign In
Bluesky
Threads
X (Twitter)
YouTube
In this paper, we propose a multi-agent Large Language Model system to analyze elementary students’ written responses to science tasks. The system analyzes student performances, identifies sources of uncertainty in the AI analysis process, and provides scores with scoring reasons as feedback for teachers and students. Applying our system to a dataset of elementary student responses, the system achieved a scoring accuracy of 90%, using human scores as the baseline. The strong alignment between AI-generated and human-assigned scores demonstrates the potential of the multi-agent system to enhance both the efficiency and quality of instruction. Furthermore, the model offers clear and consistent rationales that can support student learning and promote transparency in assessment.
Xunlei Qian, Michigan State University
Namsoo Shin, Michigan State University
Harvey Li, Michigan State University
Yucheng Chu, Michigan State University
Cory S. MIller, Michigan State University
Joseph S. Krajcik, Michigan State University
Jiliang Tang, Michigan State University
Yue Xing, Michigan State University