-
Notifications
You must be signed in to change notification settings - Fork 181
Issues: awslabs/data-on-eks
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
NVIDIA NIM LLM Hosting Pattern
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
good first issue
Good for newcomers
#560
opened Jun 20, 2024 by
hustshawn
Enhance pull speed for Large ML container Images with Bottlerocket
documentation
Improvements or additions to documentation
enhancement
New feature or request
#559
opened Jun 19, 2024 by
ratnopamc
Error: failed to create containerd task: failed to create shim task: OCI runtime create failed
#557
opened Jun 13, 2024 by
pythonking6
Ray Logging and Dashboard Metrics Export to S3 with Custom Dashboard for Historical Clusters
enhancement
New feature or request
#552
opened Jun 5, 2024 by
vara-bonthu
Ray Observability with Prometheus and AMP
enhancement
New feature or request
#551
opened Jun 5, 2024 by
vara-bonthu
vLLM with RayServe pattern
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#547
opened Jun 3, 2024 by
shivam-dubey-1
Llama-3 on Inferentia generate infinite and meaningless output
#544
opened May 29, 2024 by
yubingjiaocn
1 task done
Incorrect POD name "aws-cli-cmd-shell" given in the instructions.
stale
#543
opened May 29, 2024 by
AbrahamArellano
1 task done
How to run Data EKS Gen AI models with limited EC2 vCPUs service quota?
stale
#539
opened May 25, 2024 by
Gall-oDrone
JARK Stack - Error while launching training step in the dogbooth Jupyter notebook
#537
opened May 20, 2024 by
rivasdam
1 task done
Incorrect command to provide Linux permission on the AWS Trainium on EKS Blueprint
bug
Something isn't working
documentation
Improvements or additions to documentation
#533
opened May 17, 2024 by
AbrahamArellano
1 task done
Re-introduce plan-examples.yml with a proper fix
bug
Something isn't working
#525
opened May 10, 2024 by
askulkarni2
Update documentation for JupyterHub on EKS solution
bug
Something isn't working
documentation
Improvements or additions to documentation
#515
opened May 2, 2024 by
petrokashlikov
1 task done
[Inference]: RayServe with NVIDIA Triton server pattern
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#509
opened Apr 25, 2024 by
vara-bonthu
[Inference]: Mistral7b on GPUs with JARK stack with Ray Serve
enhancement
New feature or request
gen-ai pattern
Distributed Training and Inference Patterns for Various Generative AI Large Language Models (LLMs)
#497
opened Apr 8, 2024 by
vara-bonthu
deploy gradio app for llama2 on inf2/ray to k8s
enhancement
New feature or request
#495
opened Apr 8, 2024 by
harishvs
The inf2/ray gradio app does not format new lines in the output
enhancement
New feature or request
#494
opened Apr 8, 2024 by
harishvs
1 task done
Add temprature, topk, topk and other input params to UI for llama2 gradio application on inf2/ray cluster
enhancement
New feature or request
#493
opened Apr 8, 2024 by
harishvs
Move Trainium on EKS from under Blueprints to Gen AI -> Training -> BERT-Large on Trainium section
documentation
Improvements or additions to documentation
enhancement
New feature or request
#488
opened Apr 5, 2024 by
sheetaljoshi
Previous Next
ProTip!
Follow long discussions with comments:>50.