Shortfin LLM Docs #481

Merged: 13 commits, Nov 13, 2024

stbaione
Contributor

Description

The following docs outline how to export and compile a Llama 8b f16 decomposed model, then run the Shortfin LLM Server with the compiled model.

It includes docs for both a developer flow and a user flow.

There are a couple of TODOs that can be updated or fixed as we make patches in shortfin and/or sharktank.

Member

@ScottTodd ScottTodd left a comment

Great start, thanks!

Six review threads on docs/shortfin/llm/user/e2e_llama8b_mi300x.md (outdated, resolved)
Slight adjustments to the user e2e doc
@stbaione
Contributor Author

Removing the markdown linter. Pre-existing md files don't pass, and changing all of them would clutter the PR.

@stbaione stbaione mentioned this pull request Nov 13, 2024

@kumardeepakamd kumardeepakamd left a comment

Great improvement. We can iterate on it after testing. Can go ahead and land it.

@stbaione stbaione merged commit 51cf2f4 into nod-ai:main Nov 13, 2024
12 of 13 checks passed
4 participants