The EvalPerf dataset includes 121 performance-exercising tasks:
- task_id (str): The task ID mapped to the original HumanEval(+) and MBPP(+) tasks
- reference (List[str]): A list of representative reference solutions (from slow to fast) with diverse performance patterns
- pe_input (Any): A performance-exercising input
- scores (List[float]): Performance scores (0-100) for the solutions in reference
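
To make the field layout concrete, the following is a minimal sketch of how one record could be typed and sanity-checked in Python. The names `EvalPerfRecord` and `validate_record`, and every value in the sample record, are hypothetical and for illustration only; they are not taken from the dataset. The check that `scores` has one entry per `reference` solution is an assumption inferred from the field descriptions, not something the dataset documentation states.

```python
from typing import Any, List, TypedDict


class EvalPerfRecord(TypedDict):
    """Hypothetical typed view of one EvalPerf record (field names from the list above)."""
    task_id: str
    reference: List[str]
    pe_input: Any
    scores: List[float]


def validate_record(record: EvalPerfRecord) -> None:
    """Raise if a record does not match the documented field types and score range."""
    if not isinstance(record["task_id"], str):
        raise TypeError("task_id must be a str")
    if not all(isinstance(src, str) for src in record["reference"]):
        raise TypeError("reference must be a list of str")
    # Assumption: one score per reference solution, in the same order.
    if len(record["scores"]) != len(record["reference"]):
        raise ValueError("scores and reference differ in length")
    # Scores are documented as lying in the range 0-100.
    for score in record["scores"]:
        if not 0 <= score <= 100:
            raise ValueError(f"score out of range: {score}")


# Made-up record for illustration; not real dataset content.
example: EvalPerfRecord = {
    "task_id": "HumanEval/0",
    "reference": [
        "def smallest(xs):\n    return sorted(xs)[0]",  # slower: sorts everything
        "def smallest(xs):\n    return min(xs)",        # faster: single pass
    ],
    "pe_input": [[3, 1, 2]],
    "scores": [50.0, 100.0],
}

validate_record(example)  # passes silently for a well-formed record
```

Since `reference` is ordered from slow to fast, the last entry of a record is its fastest reference solution.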