Add prefill 8B f16 torch sdpa test, update tests with compile flags and tp flags, with nightly iree #37
Triggered via pull request
November 16, 2024 01:06
Status
Cancelled
Total duration
3m 14s
Artifacts
–
ci-shark-ai.yml
on: pull_request
Matrix: Integration Tests - Shortfin LLM Server
Annotations
2 errors
Integration Tests - Shortfin LLM Server (3.11)
Canceling since a higher priority waiting request for 'CI - shark-ai-456' exists
|
Integration Tests - Shortfin LLM Server (3.11)
The operation was canceled.
|