Shortfin LLM Docs #481

stbaione · 2024-11-12T01:48:34Z

Description

The following docs outline how to export, and compile a Llama 8b f16 decomposed model, then run the Shortfin LLM Server with the the compiled model.

It includes docs for both a developer flow and a user flow.

There are a couple TODOs that can be updated/fixed as we make patches in shortfin and/or sharktank.

…rspective

…s fix)

ScottTodd

Great start, thanks!

docs/shortfin/llm/user/e2e_llama8b_mi300x.md

Slight adjustments to the user e2e doc

stbaione · 2024-11-12T21:37:06Z

Removing markdown linter. Pre-existing md files don't pass, and would clutter the PR to change all of them

Remove markdownlint pre-commit step

Remove references to `$PORT` env var, Comment TODO's, so that they only appear in raw view

kumardeepakamd

Great improvement. We can iterate on it after testing. Can go ahead and land it.

stbaione added 5 commits November 11, 2024 23:32

Update docs for e2e shortfin llm workflow, from user and developer pe…

004bd20

…rspective

Add --port as input arg for client.py script

9fefe9f

Temporarily remove instructions for targeting different devices (need…

47e307c

…s fix)

Add echo $PORT to the end of the port command

8b877a3

Setup instructions & a couple small changes in developer flow

43a3d28

stbaione requested a review from renxida November 12, 2024 01:48

stbaione self-assigned this Nov 12, 2024

stbaione requested review from kumardeepakamd and amd-chrissosa November 12, 2024 15:19

Merge branch 'main' into shortfin-llm-docs

372db3d

ScottTodd reviewed Nov 12, 2024

View reviewed changes

Add markdownlit to pre-commit,

4f5d74e

Slight adjustments to the user e2e doc

stbaione added 4 commits November 12, 2024 21:50

Apply lint/fixes to user flow doc,

3cd0255

Remove markdownlint pre-commit step

Edit user flow to include downloading external model from scratch,

08daa7a

Remove references to `$PORT` env var, Comment TODO's, so that they only appear in raw view

Update Model/Tokenizer vars message in user doc

3a34d55

Remove system note in user doc

2c13889

stbaione requested a review from ScottTodd November 13, 2024 14:43

stbaione mentioned this pull request Nov 13, 2024

Sglang User Doc #498

Merged

kumardeepakamd approved these changes Nov 13, 2024

View reviewed changes

stbaione added 2 commits November 13, 2024 16:01

Merge branch 'main' into shortfin-llm-docs

b158705

Merge branch 'main' into shortfin-llm-docs

3544bbc

stbaione merged commit 51cf2f4 into nod-ai:main Nov 13, 2024
12 of 13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shortfin LLM Docs #481

Shortfin LLM Docs #481

stbaione commented Nov 12, 2024

ScottTodd left a comment

stbaione commented Nov 12, 2024

kumardeepakamd left a comment

Shortfin LLM Docs #481

Shortfin LLM Docs #481

Conversation

stbaione commented Nov 12, 2024

Description

ScottTodd left a comment

Choose a reason for hiding this comment

stbaione commented Nov 12, 2024

kumardeepakamd left a comment

Choose a reason for hiding this comment