Sglang User Doc #498

stbaione · 2024-11-13T15:52:00Z

Description

Adds documentation for running SGLang with Shortfin LLM Server.

Currently, only focus on the sglang docs. I created this PR from the same repo as the other shortfin llm server docs. Those diffs should go away once that is merged.

It links to the existing Shortfin LLM Server User Doc to setup and run shortfin. It then shows how to install SGLang inside of the same virtual environment.

From there it has instructions for running a Multi-Turn Q&A Flow, Fork Flow, and how to run the Benchmark script against the shortfin server.

…rspective

…s fix)

Slight adjustments to the user e2e doc

Remove markdownlint pre-commit step

Remove references to `$PORT` env var, Comment TODO's, so that they only appear in raw view

ScottTodd

Nice!

docs/shortfin/llm/user/shortfin_with_sglang_frontend_language.md

…o sglang-user-doc

Update SGLang docs to use the examples from out integration tests, and the `llama-3-instruct` chat template

Add `Software` and `Hardware` prerequisites

stbaione and others added 12 commits November 11, 2024 23:32

Update docs for e2e shortfin llm workflow, from user and developer pe…

004bd20

…rspective

Add --port as input arg for client.py script

9fefe9f

Temporarily remove instructions for targeting different devices (need…

47e307c

…s fix)

Add echo $PORT to the end of the port command

8b877a3

Setup instructions & a couple small changes in developer flow

43a3d28

Merge branch 'main' into shortfin-llm-docs

372db3d

Add markdownlit to pre-commit,

4f5d74e

Slight adjustments to the user e2e doc

Apply lint/fixes to user flow doc,

3cd0255

Remove markdownlint pre-commit step

Edit user flow to include downloading external model from scratch,

08daa7a

Remove references to `$PORT` env var, Comment TODO's, so that they only appear in raw view

Update Model/Tokenizer vars message in user doc

3a34d55

Remove system note in user doc

2c13889

Add user doc for shortfin sglang frontend language

4309103

stbaione added the documentation Improvements or additions to documentation label Nov 13, 2024

stbaione requested review from ScottTodd, renxida, kumardeepakamd and amd-chrissosa November 13, 2024 15:52

stbaione self-assigned this Nov 13, 2024

Update links to sglang to point to nod-ai

f06cbd6

amd-chrissosa approved these changes Nov 13, 2024

View reviewed changes

Fix port in q&a example

e355148

stbaione marked this pull request as ready for review November 13, 2024 22:42

stbaione and others added 2 commits November 13, 2024 16:42

Merge branch 'main' into sglang-user-doc

6568d8e

Update link to LLM server user docs

6495a28

ScottTodd approved these changes Nov 16, 2024

View reviewed changes

stbaione added 3 commits November 22, 2024 01:27

Merge branch 'main' of https://github.com/stbaione/SHARK-Platform int…

36fcdb7

…o sglang-user-doc

Update e2e_llama8b_mi300x.md docs to use llama-3-8b-instruct,

3b11648

Update SGLang docs to use the examples from out integration tests, and the `llama-3-instruct` chat template

Add Current Support Status,

7e12936

Add `Software` and `Hardware` prerequisites

stbaione requested a review from ScottTodd November 22, 2024 15:26

ScottTodd approved these changes Nov 22, 2024

View reviewed changes

Merge branch 'main' into sglang-user-doc

e4f45f7

stbaione merged commit e2c2f01 into nod-ai:main Nov 22, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sglang User Doc #498

Sglang User Doc #498

stbaione commented Nov 13, 2024 •

edited

Loading

ScottTodd left a comment

Sglang User Doc #498

Sglang User Doc #498

Conversation

stbaione commented Nov 13, 2024 • edited Loading

Description

ScottTodd left a comment

Choose a reason for hiding this comment

stbaione commented Nov 13, 2024 •

edited

Loading