-
Notifications
You must be signed in to change notification settings - Fork 23
Pull requests: AMD-AGI/Primus-Turbo
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix MXFP8 backward crash on non-contiguous grad_out
#388
opened Jun 19, 2026 by
JohnQinAMD
Collaborator
Loading…
12 tasks
feat: forece use nt layout gemm in bwd
ci:gpu
#386
opened Jun 18, 2026 by
RuibinCheung
Collaborator
Loading…
5 of 12 tasks
[feat] Add flydsl based grouped gemm
ci:gpu
#384
opened Jun 16, 2026 by
kyle-256
Collaborator
Loading…
6 of 12 tasks
feat: refine quant config arguments
ci:gpu
#379
opened Jun 12, 2026 by
RuibinCheung
Collaborator
Loading…
3 of 12 tasks
[WIP] feat: support build on gfx1250
ci:gpu
#374
opened Jun 9, 2026 by
RuibinCheung
Collaborator
•
Draft
6 of 12 tasks
feat: triton_grouped_gemm: add work-stealing variant with ws_mode API
#353
opened Jun 2, 2026 by
wenchenvincent
Loading…
4 of 12 tasks
feat: ck_grouped_gemm: add work-stealing variant with ws_mode API
#348
opened May 27, 2026 by
wenchenvincent
Loading…
4 of 12 tasks
[WIP] [Feature] Add Turbo MXFP8 Grouped GEMM (gfx950) for MoE
#330
opened May 7, 2026 by
kyle-256
Collaborator
Loading…
6 of 12 tasks
feat: add more activation func
#329
opened May 7, 2026 by
RuibinCheung
Collaborator
Loading…
8 of 9 tasks
opt(gemm): add hipBLASLt algorithm cache and thread-local workspace
#321
opened Apr 30, 2026 by
jasainio
Contributor
Loading…
6 of 12 tasks
Refactor: moe dispatch combine autotune
ci:gpu
#312
opened Apr 24, 2026 by
zhenhuang12
Collaborator
Loading…
7 of 12 tasks
feat: enable hybrid FP8 dtypes on Triton grouped GEMM backends
#288
opened Apr 15, 2026 by
sarthak-amd
•
Draft
perf: optimize hipBLASLt grouped GEMM with algo tuning, enable grouped_gemm autotune hipblaslt support
#284
opened Apr 14, 2026 by
kyle-256
Collaborator
Loading…
feat(benchmark): per-model/GPU batch sizes and vocab projection for GEMM bench
#265
opened Mar 31, 2026 by
Z-Y00
Loading…
refactor: reorganize moe ops and kernels
#243
opened Mar 5, 2026 by
zhenhuang12
Collaborator
Loading…
ProTip!
Follow long discussions with comments:>50.