Skip to content

Commit

Permalink
Check A100 inference time (reviewers ignore this commit)
Browse files Browse the repository at this point in the history
  • Loading branch information
yinggeh committed Aug 14, 2024
1 parent c476337 commit b24fef2
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions ci/L0_backend_vllm/metrics_test/vllm_metrics_test.py
Original file line number Diff line number Diff line change
Expand Up @@ -119,6 +119,8 @@ def test_vllm_metrics(self):
model_name=self.vllm_model_name,
)
metrics_dict = self.get_vllm_metrics()
# Test CI job L0_backend_vllm--A100-base A100 inference time. Reviewers disregard print statement.
print(metrics_dict)

# vllm:prompt_tokens_total
self.assertEqual(metrics_dict["vllm:prompt_tokens_total"], 18)
Expand Down

0 comments on commit b24fef2

Please sign in to comment.