Skip to content

Commit

Permalink
gpu_per_node as arg
Browse files Browse the repository at this point in the history
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
  • Loading branch information
jiemingz committed Apr 16, 2024
1 parent cd3c6e5 commit 1790ce9
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions nemo/export/tensorrt_llm.py
Original file line number Diff line number Diff line change
Expand Up @@ -245,6 +245,7 @@ def build(
max_input_len: int = 256,
max_output_len: int = 256,
max_batch_size: int = 8,
gpus_per_node: int = 8,
use_refit: bool = False,
reshard_model: bool = False,
):
Expand Down

0 comments on commit 1790ce9

Please sign in to comment.