Skip to content

Latest commit

 

History

History
19 lines (14 loc) · 437 Bytes

README.md

File metadata and controls

19 lines (14 loc) · 437 Bytes

Ray Serve Demo

It contain code for deploying the Ray Serve application on Kubernetes cluster using KubeRay. It contain below files.

.
├── ray-serve
│   ├── vllm_engine.py
│   └── vllm_engine_test.py
├── LICENSE
├── rayservice
│   └── vllm-service-phi-3-mini-4k.yaml
└── README.md

2 directories, 5 files

Check the blog post to know how to deploy them in Kubernetes cluster.