The goal of this competition is to train a model to score student essays.
Submissions are scored based on the quadratic weighted kappa, which measures the agreement between two outcomes. This metric typically varies from 0 (random agreement) to 1 (complete agreement). In the event that there is less agreement than expected by chance, the metric may go below 0.
The competition dataset comprises about 24000 student-written argumentative essays. Each essay was scored on a scale of 1 to 6.