Individual Submission Summary

Judges Versus Juries: Who Should Provide Gold Standard Labels in the Era of LLMs?

Thu, April 23, 9:50 to 11:20am CDT, TBA

Brief Overview

We consider the problem of learning a set of ground-truth labels using an LLM (a judge) and human annotators (a jury). We formally show an interpolation-extrapolation trade-off: LLMs are more reliable for problems commonly observed in training data.
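The trade-off can be illustrated with a toy calculation. This is a minimal sketch and not the authors' formal model: it assumes jurors vote independently with a fixed per-juror accuracy, and it assumes a hypothetical linear link between how often a problem type appears in training data and the judge's accuracy. The function names, the `floor` and `ceiling` values, and the default juror accuracy are all illustrative assumptions.

```python
from math import comb


def jury_accuracy(n_jurors: int, p_correct: float) -> float:
    """Probability that a strict majority of independent jurors is correct.

    Assumes an odd number of jurors so that ties cannot occur.
    """
    k_min = n_jurors // 2 + 1  # smallest number of correct votes that wins
    return sum(
        comb(n_jurors, k) * p_correct**k * (1 - p_correct) ** (n_jurors - k)
        for k in range(k_min, n_jurors + 1)
    )


def judge_accuracy(train_frequency: float,
                   floor: float = 0.5,
                   ceiling: float = 0.95) -> float:
    """Hypothetical judge accuracy, linear in train_frequency (0 to 1).

    The linear form and both endpoints are assumptions for illustration only.
    """
    return floor + (ceiling - floor) * train_frequency


def preferred_labeler(train_frequency: float,
                      n_jurors: int = 5,
                      p_correct: float = 0.7) -> str:
    """Return whichever labeler has higher accuracy under the toy assumptions."""
    judge = judge_accuracy(train_frequency)
    jury = jury_accuracy(n_jurors, p_correct)
    return "judge" if judge > jury else "jury"


if __name__ == "__main__":
    for freq in (0.1, 0.5, 0.9, 1.0):
        print(freq, preferred_labeler(freq))
```

Under these assumptions, the judge is preferred only for problem types that are frequent in training data (interpolation), while the jury is preferred for rare ones (extrapolation). Changing the assumed accuracy curve or the jury size moves the crossover point.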

Authors