Skip to content

Commit

Permalink
deploy: b3f8b5f
Browse files Browse the repository at this point in the history
  • Loading branch information
NeuralChatBot committed Nov 12, 2024
1 parent b3f8b5f commit fbe53c0
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 4 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -2046,13 +2046,13 @@ <h4>Case 1: Baseline Deployment with Rerank<a class="headerlink" href="#case-1-b
</section>
<section id="case-2-baseline-deployment-without-rerank">
<h4>Case 2: Baseline Deployment without Rerank<a class="headerlink" href="#case-2-baseline-deployment-without-rerank" title="Link to this heading"></a></h4>
<div class="highlight-bash notranslate"><div class="highlight"><pre><span></span>python<span class="w"> </span>deploy.py<span class="w"> </span>--hftoken<span class="w"> </span><span class="nv">$HFTOKEN</span><span class="w"> </span>--modeldir<span class="w"> </span><span class="nv">$MODELDIR</span><span class="w"> </span>--num-nodes<span class="w"> </span><span class="m">2</span>
<div class="highlight-bash notranslate"><div class="highlight"><pre><span></span>python<span class="w"> </span>deploy.py<span class="w"> </span>--hf-token<span class="w"> </span><span class="nv">$HFTOKEN</span><span class="w"> </span>--model-dir<span class="w"> </span><span class="nv">$MODELDIR</span><span class="w"> </span>--num-nodes<span class="w"> </span><span class="m">2</span>
</pre></div>
</div>
</section>
<section id="case-3-tuned-deployment-with-rerank">
<h4>Case 3: Tuned Deployment with Rerank<a class="headerlink" href="#case-3-tuned-deployment-with-rerank" title="Link to this heading"></a></h4>
<div class="highlight-bash notranslate"><div class="highlight"><pre><span></span>python<span class="w"> </span>deploy.py<span class="w"> </span>--hftoken<span class="w"> </span><span class="nv">$HFTOKEN</span><span class="w"> </span>--modeldir<span class="w"> </span><span class="nv">$MODELDIR</span><span class="w"> </span>--num-nodes<span class="w"> </span><span class="m">2</span><span class="w"> </span>--with-rerank<span class="w"> </span>--tuned
<div class="highlight-bash notranslate"><div class="highlight"><pre><span></span>python<span class="w"> </span>deploy.py<span class="w"> </span>--hf-token<span class="w"> </span><span class="nv">$HFTOKEN</span><span class="w"> </span>--model-dir<span class="w"> </span><span class="nv">$MODELDIR</span><span class="w"> </span>--num-nodes<span class="w"> </span><span class="m">2</span><span class="w"> </span>--with-rerank<span class="w"> </span>--tuned
</pre></div>
</div>
</section>
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -100,12 +100,12 @@ python deploy.py --uninstall
#### Case 2: Baseline Deployment without Rerank

```bash
python deploy.py --hftoken $HFTOKEN --modeldir $MODELDIR --num-nodes 2
python deploy.py --hf-token $HFTOKEN --model-dir $MODELDIR --num-nodes 2
```
#### Case 3: Tuned Deployment with Rerank

```bash
python deploy.py --hftoken $HFTOKEN --modeldir $MODELDIR --num-nodes 2 --with-rerank --tuned
python deploy.py --hf-token $HFTOKEN --model-dir $MODELDIR --num-nodes 2 --with-rerank --tuned
```

## Benchmark
Expand Down

0 comments on commit fbe53c0

Please sign in to comment.