Skip to content

Actions: danmcp/eval

E2E test

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
5 workflow runs
5 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Include task scores with mmlu results + adjust default api retries
E2E test #5: Commit 29e1b96 pushed by danmcp
June 28, 2024 17:17 1d 10h 7m 6s main
June 28, 2024 17:17 1d 10h 7m 6s
Include qna_file in mt_bench_branch results
E2E test #4: Commit 5dd43e3 pushed by danmcp
June 27, 2024 18:02 1d 7h 41m 14s main
June 27, 2024 18:02 1d 7h 41m 14s
Include qna_file in mt_bench_branch results
E2E test #3: Commit 82fefc8 pushed by danmcp
June 27, 2024 17:49 1d 7h 54m 25s main
June 27, 2024 17:49 1d 7h 54m 25s
Include qna_file in mt_bench_branch results
E2E test #2: Commit 239bdef pushed by danmcp
June 27, 2024 17:33 1d 8h 10m 41s main
June 27, 2024 17:33 1d 8h 10m 41s
Include qna_file in mt_bench_branch results
E2E test #1: Commit 834514d pushed by danmcp
June 27, 2024 17:31 1d 8h 12m 41s main
June 27, 2024 17:31 1d 8h 12m 41s