Actions: EleutherAI/lm-evaluation-harness
Actions
2,813 workflow runs
2,813 workflow runs
--examples
Argument for Fine-Grained Task Evaluation in lm-evaluation-harness
. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
Tasks Modified
#4023:
Pull request #2520
synchronize
by
mirianfsilva
--examples
Argument for Fine-Grained Task Evaluation in lm-evaluation-harness
. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
Tasks Modified
#4018:
Pull request #2520
synchronize
by
mirianfsilva
--examples
Argument for Fine-Grained Task Evaluation in lm-evaluation-harness
. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
Tasks Modified
#4006:
Pull request #2520
synchronize
by
StellaAthena