Update kimik2.5-fp4-b200-vllm vLLM image to v0.20.2 by Klaud-Cold · Pull Request #1336 · SemiAnalysisAI/InferenceX

Klaud-Cold · 2026-05-12T21:31:44Z

Summary

Update kimik2.5-fp4-b200-vllm image from vllm/vllm-openai:v0.17.0 to vllm/vllm-openai:v0.20.2

…Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>

github-actions · 2026-05-12T21:31:54Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipe is similar to the official vLLM recipes and/or the SGLang cookbook.

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, PR authors should request a review and get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

github-actions · 2026-05-12T21:32:18Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25763438871
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25763438871

claude · 2026-05-12T21:34:51Z

+    - kimik2.5-fp4-b200-vllm
+  description:
+    - "Update vLLM image from v0.17.0 to v0.20.2"
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX


🟡 The new perf-changelog entry uses the literal placeholder pull/XXX instead of the actual PR number (1336), so the link will 404. Replace /pull/XXX with /pull/1336 on line 2351 to match the convention used by every other entry in the file.

Extended reasoning...

What's wrong

The diff adds a new perf-changelog entry whose pr-link is:

pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX

The trailing XXX is a literal placeholder string that was clearly meant to be substituted with the actual PR number before the PR was opened, but was left unreplaced.

Why this is a real defect

Every other entry in perf-changelog.yaml uses a concrete PR number — for example, the five entries immediately preceding this one use /pull/1303, /pull/1304, /pull/1305, /pull/1308, and /pull/1310. The convention is consistent across the entire file. /pull/XXX is not a valid GitHub path: visiting https://github.com/SemiAnalysisAI/InferenceX/pull/XXX returns a 404, so any reader (human or downstream tooling) following the link from this changelog entry will get a broken page instead of the merge that introduced the change.

Step-by-step proof

Open perf-changelog.yaml at the new entry (lines 2347–2351 after merge).

The last line reads pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX.

Compare with line 2345 immediately above: pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1310 — a numeric PR id.

GitHub PR paths are of the form /pull/<integer>; /pull/XXX does not match an existing PR and resolves to a 404 page.

This PR is Update kimik2.5-fp4-b200-vllm vLLM image to v0.20.2 #1336 (see the PR metadata), so the intended value is 1336.
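The comparison in the steps above could be automated with a small check. The following is a minimal sketch, not part of the repository: the function name `find_bad_pr_links` and the line-by-line text scan are assumptions made for illustration, and the sample text only reuses lines quoted in this review.

```python
import re

# Assumed shape of a valid link, inferred from the entries quoted in this review.
PR_LINK = re.compile(r"https://github\.com/SemiAnalysisAI/InferenceX/pull/\d+")


def find_bad_pr_links(changelog_text):
    """Return (line_number, value) pairs for pr-link values without a numeric PR id."""
    bad = []
    for lineno, line in enumerate(changelog_text.splitlines(), start=1):
        # Tolerate a leading "+" so the check also works on diff fragments.
        stripped = line.strip().lstrip("+").strip()
        if stripped.startswith("pr-link:"):
            value = stripped[len("pr-link:"):].strip()
            if not PR_LINK.fullmatch(value):
                bad.append((lineno, value))
    return bad


# Hypothetical sample built only from lines quoted in this review.
sample = """\
  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1310
    - kimik2.5-fp4-b200-vllm
  description:
    - "Update vLLM image from v0.17.0 to v0.20.2"
  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX
"""

for lineno, value in find_bad_pr_links(sample):
    print(f"line {lineno}: invalid pr-link {value}")
```

A scan like this only checks the form of the link; it does not confirm that the PR number exists or belongs to the right change.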

Fix

Change line 2351 from:

pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX

to:

pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1336

This is a mechanical, single-token edit. The fix has no effect on runtime behavior (the changelog itself doesn't affect benchmarks or images), but the placeholder breaks the file's link convention and any tooling that parses pr-link and follows it, so it's worth correcting before merge.
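For illustration, the substitution could be scripted as below. This is a sketch under stated assumptions: the helper name `fill_pr_placeholder` is hypothetical, and the pattern assumes the placeholder is always the literal XXX at the end of a pr-link URL, as in the entry quoted above.

```python
import re

# Matches only the literal XXX placeholder at the end of a pr-link URL (assumed form).
PLACEHOLDER = re.compile(
    r"(pr-link:\s*https://github\.com/SemiAnalysisAI/InferenceX/pull/)XXX\b"
)


def fill_pr_placeholder(changelog_text, pr_number):
    """Replace the /pull/XXX placeholder with a concrete PR number."""
    if not isinstance(pr_number, int) or pr_number <= 0:
        raise ValueError("pr_number must be a positive integer")
    # A function replacement avoids ambiguity between the group reference and the digits.
    return PLACEHOLDER.sub(lambda m: f"{m.group(1)}{pr_number}", changelog_text)


before = "  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX\n"
print(fill_pr_placeholder(before, 1336), end="")
```

Entries that already carry a numeric PR id are left untouched, since the pattern matches only the placeholder.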

github-actions · 2026-05-13T00:49:51Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25763439479
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25763439479

github-actions · 2026-05-13T18:33:56Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25818238248
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25818238248

github-actions · 2026-05-13T23:46:17Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25818238248
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25818238248

github-actions · 2026-05-14T00:52:58Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25818238248
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25818238248

github-actions · 2026-05-14T02:16:17Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=25818238248
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=25818238248

Update kimik2.5-fp4-b200-vllm vLLM image to v0.20.2

Ref #1154

…

bd3726a

…Co-authored-by: Klaud Cold <Klaud-Cold@users.noreply.github.com>

Klaud-Cold requested a review from a team May 12, 2026 21:31

Klaud-Cold added the full-sweep-enabled label May 12, 2026

Klaud-Cold requested review from jgangani and kedarpotdar-nv as code owners May 12, 2026 21:31

github-project-automation Bot added this to InferenceMAX Board May 12, 2026

Klaud-Cold mentioned this pull request May 12, 2026

[Auto] Docker Image Updates Available - 2026-04-25 #1154

Open

claude Bot reviewed May 12, 2026

Merge branch 'main' into claude/issue-1154-kimik2.5-fp4-b200-vllm

72c0408

Klaud-Cold wants to merge 2 commits into main from claude/issue-1154-kimik2.5-fp4-b200-vllm

2 participants
