[Inference]: RayServe with NVIDIA Triton server pattern #509

Closed
vara-bonthu opened this issue Apr 25, 2024 · 7 comments
Labels
gen-ai pattern Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs) stale

Comments

@vara-bonthu
Collaborator

vara-bonthu commented Apr 25, 2024

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or other comments that do not add relevant new information or questions; they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

What is the outcome that you are trying to reach?

  • Create a new blueprint that shows how to use NVIDIA Triton server with RayServe
  • Deploy any LLM as an inference example

Describe the solution you would like

Describe alternatives you have considered

Additional context

@vara-bonthu added the gen-ai pattern label on Apr 25, 2024
@vara-bonthu changed the title from "[Inference]: vLLM with RayServe pattern" to "[Inference]: RayServe with NVIDIA Triton server pattern" on May 7, 2024
@freschri

freschri commented May 22, 2024

Hi @vara-bonthu, I am working on it.

Contributor

This issue has been automatically marked as stale because it has been open 30 days
with no activity. Remove stale label or comment or this issue will be closed in 10 days

@freschri

> This issue has been automatically marked as stale because it has been open 30 days with no activity. Remove stale label or comment or this issue will be closed in 10 days

working on it

@freschri

added #564

Contributor

This issue has been automatically marked as stale because it has been open 30 days
with no activity. Remove stale label or comment or this issue will be closed in 10 days

@github-actions bot added the stale label on Jul 31, 2024
Contributor

Issue closed due to inactivity.

@github-actions bot closed this as not planned on Aug 10, 2024
@freschri

@vara-bonthu the bot closed this. Is that OK? What should we do with it? Thanks.
