Fix `VLLMServeLM` ignoring a hyphenated `tensor-parallel-size` in `model_kwargs` by nayopu · Pull Request #289 · sbintuitions/flexeval

nayopu · 2026-06-15T05:47:51Z

目的

VLLMServeLM に model_kwargs={"tensor-parallel-size": 1} のようにハイフン区切りで tensor_parallel_size を渡すと、指定値が黙って torch.cuda.device_count() に上書きされる不具合を修正する。

tensor_parallel_size が model_kwargs に含まれない場合、 model_kwargs["tensor_parallel_size"] = device_count() が自動注入されるが、ユーザーが -を使う tensor-parallel-size (例えば1)を渡す場合、vllm serve には --tensor-parallel-size 1 --tensor-parallel-size <device_count()> が二重に渡り、argparse の後勝ちでユーザー指定のtensor-parallel-sizeがサイレントに無視される。

実装の詳細

VLLMServeLM.__init__ の自動注入ガードを、tensor_parallel_size と tensor-parallel-size のどちらの表記も「指定済み」とみなすように修正する。

どちらの表記も含まれていなければ、従来どおり tensor_parallel_size に torch.cuda.device_count() を注入する
どちらか一方でも指定されていれば、その値を尊重して自動注入しない（二重フラグを防ぐ）

model_kwargs = model_kwargs or {}
if "tensor_parallel_size" not in model_kwargs and "tensor-parallel-size" not in model_kwargs:
    model_kwargs["tensor_parallel_size"] = torch.cuda.device_count()

動作確認

テストに通ることを確認した
ruff check / ruff format --check 通過
既存挙動への影響なし（underscore 形キーは従来どおり）

追記事項

なし

VLLMServeLM auto-injects tensor_parallel_size = torch.cuda.device_count() when the key is absent, but it checked only the underscore spelling. Since the command builder maps "_" -> "-", "tensor_parallel_size" and "tensor-parallel-size" become the same --tensor-parallel-size flag, so a caller passing the hyphenated key slipped past the guard: the default was injected anyway, emitting a duplicate --tensor-parallel-size flag with device_count() silently winning (and breaking models without TP support on multi-GPU nodes). Skip the default injection when either spelling is present. Co-authored-by: Claude (Managed) <noreply@anthropic.com>

yuma-hirakawa approved these changes Jun 19, 2026

View reviewed changes

nayopu merged commit c56031e into main Jun 19, 2026
18 of 22 checks passed

nayopu deleted the fix/vllm-tp-key-normalization branch June 19, 2026 05:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix `VLLMServeLM` ignoring a hyphenated `tensor-parallel-size` in `model_kwargs`#289

Fix `VLLMServeLM` ignoring a hyphenated `tensor-parallel-size` in `model_kwargs`#289
nayopu merged 1 commit into
mainfrom
fix/vllm-tp-key-normalization

nayopu commented Jun 15, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nayopu commented Jun 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

関連する Issue / PR

目的

実装の詳細

動作確認

追記事項

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nayopu commented Jun 15, 2026 •

edited

Loading