add ssh-tunnel doc (#81)
* support qlora

* upload dummy conversation data

* delete doc and docker

* update pyproject pip install package

* continue cleaning

* delete more files

* delete a format

* add llm_deploy

* add testing scripts

* update deployment readme

* update readme and fix some bug

* finalize the inference and deployment based on vllm

* Add babel deployment tutorial md

* add minor suggestions

* delete qlora_train.sh

* Delete duplicate data file

* Add tutorial for ssh tunnel

---------

Co-authored-by: Ruiyi Wang <ruiyi.pamela.wang@gmail.com>
lwaekfjlk and ruiyiw committed Mar 14, 2024
1 parent d6b3e28 commit 9ee1122
Showing 1 changed file with 21 additions and 1 deletion.
22 changes: 21 additions & 1 deletion llm_deploy/README.md
@@ -98,7 +98,27 @@ curl http://localhost:8000/v1/completions \
```

### Access deployed babel server on a local machine
TODO
1. Construct an SSH tunnel between the babel login node and the babel compute node that hosts the model. Run this on the login node:
```bash
ssh -N -L 7662:localhost:8000 username@babel-x-xx
```
The above command opens localhost:7662 on the babel login node and forwards it to localhost:8000 on the compute node.

2. Construct an SSH tunnel between your local machine and the babel login node. Run this on your local machine:
```bash
ssh -N -L 8001:localhost:7662 username@<mycluster>
```
The above command opens localhost:8001 on your local machine and forwards it to localhost:7662 on the babel login node.

3. Call the hosted model from your local machine
```bash
curl http://localhost:8001/v1/models
```
If the above command succeeds, you can use the REST API from your local machine.

4. (Optional) If building the SSH tunnel fails, add `-v` to the ssh command to see what went wrong.
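
As an alternative to steps 1 and 2, OpenSSH clients (7.3 or later) can reach the compute node in a single command with the ProxyJump option (`-J`). The sketch below is not part of the original tutorial. It assumes the login node permits TCP forwarding and that your username is the same on both hosts; `username@<mycluster>` and `username@babel-x-xx` are the same placeholders used above. `babel_tunnel_cmd` is a hypothetical helper that only prints the command, so you can inspect it before running it yourself.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not from the original README): print a single ssh
# command that forwards a local port to a port on the compute node,
# jumping through the login node with ProxyJump (-J).
#   $1 = local port, $2 = port of the model server on the compute node,
#   $3 = login node (jump host), $4 = compute node
babel_tunnel_cmd() {
  printf 'ssh -N -L %s:localhost:%s -J %s %s\n' "$1" "$2" "$3" "$4"
}

# Print the command for the ports used in the steps above.
babel_tunnel_cmd 8001 8000 'username@<mycluster>' 'username@babel-x-xx'
# prints: ssh -N -L 8001:localhost:8000 -J username@<mycluster> username@babel-x-xx
```

With this form, `localhost:8000` is resolved on the compute node, so the intermediate port 7662 on the login node is not needed. Once the printed command is running, the `curl http://localhost:8001/v1/models` check from step 3 should work unchanged.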




### Useful resource links for babel