Skip to content

Add vLLM Llama-70B all-reduce reproducer#535

Draft
micmelesse wants to merge 1 commit into
ROCm:mainfrom
micmelesse:micmelesse/reproducer/all_reduce
Draft

Add vLLM Llama-70B all-reduce reproducer#535
micmelesse wants to merge 1 commit into
ROCm:mainfrom
micmelesse:micmelesse/reproducer/all_reduce

Conversation

@micmelesse

@micmelesse micmelesse commented Jun 16, 2026

Copy link
Copy Markdown
Contributor

Motivation

This pr adds a benchmark and tests to iterate on vllm iris integration for llama 70b. It has two dockerfiles that bake a baseline and experimental environment. You can then run two scripts one to check correctness and another to check performance. There is a README attached for instructions.

The main difference is that the baseline pins commits from the main of vllm, aiter and iris. The experimental pins the branches with iris integration work which is enabled by setting VLLM_ROCM_USE_AITER_COMMS=1.

Technical Details

Test Plan

Test Result

Submission Checklist

@mawad-amd mawad-amd requested a review from aamarnat June 16, 2026 13:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant