Submit

Create a new evolution task or register a harness.

What to evolve

URL-safe slug
Datasets the harness never sees. Used for final leaderboard scores.

Evaluation Metrics

Baseline

Enter baseline metric values (will auto-populate from metrics above)