MPSA: Judges Versus Juries: Who Should Provide Gold Standard Labels in the Era of LLMs?





In Event: Classification and LLMs

Thu, April 23, 9:50 to 11:20am CDT, TBA

Brief Overview

We consider the problem of learning a set of ground-truth labels from an LLM (a judge) and a group of human annotators (a jury). We formally show an interpolation-extrapolation trade-off: LLMs are more reliable on problems that are commonly observed in their training data.

Authors
