Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: preserve q/k/v quantizer mapping in AST attention patching
#1307 opened Apr 21, 2026 by Brumbelow Loading…
4 tasks done
fix: validate modelopt state file structure on load
#1306 opened Apr 21, 2026 by Brumbelow Loading…
3 tasks done
Kinjal/fix vllm moe
#1305 opened Apr 21, 2026 by kinjalpatel27 Contributor Draft
Reorg the sparse/quant/common kernel dir
#1303 opened Apr 20, 2026 by jingyu-ml Contributor Loading…
[CI] Bump test containers to latest
#1299 opened Apr 20, 2026 by kevalmorabia97 Collaborator Loading…
[1/3][Refactor]: File reorg; deprecate ParallelDraft
#1296 opened Apr 19, 2026 by h-guo18 Contributor Loading…
[2/3][Feat]: Offline DFlash training
#1295 opened Apr 19, 2026 by h-guo18 Contributor Draft
1 task done
add gptq fused kernel
#1291 opened Apr 17, 2026 by sychen52 Contributor Loading…
Add FP8 MHA quantization support for HuggingFace ViT
#1289 opened Apr 17, 2026 by ajrasane Contributor Loading…
5 tasks
keep deploy cases and Eagle fixes for merge
#1287 opened Apr 17, 2026 by nvSiruiW Loading…
Update excluded modules for Qwen3.5 dense PTQ
#1284 opened Apr 17, 2026 by amukkara Loading…
Add qwen3 moe experts only test
#1274 opened Apr 16, 2026 by cjluo-nv Collaborator Loading…
SpecDec Bench: April Update
#1272 opened Apr 16, 2026 by IzzyPutterman Contributor Loading…
Skip Softmax diffusion export
#1269 opened Apr 15, 2026 by jingyu-ml Contributor Loading…
Centralize 'trtexec' subprocess runs in ONNX into a single function
#1268 opened Apr 15, 2026 by gcunhase Contributor Loading…
Exclude small-k and small-n Matmul nodes from Int8 quantization
#1256 opened Apr 14, 2026 by nv-samcheng Contributor Loading…
Add EfficientViT support for torch_onnx quantization workflow
#1254 opened Apr 14, 2026 by ajrasane Contributor Loading…
3 tasks done
fix(launcher): use afterany dependency for allow_to_fail pipelines
#1248 opened Apr 13, 2026 by yeyu-nvidia Contributor Loading…
3 tasks
Add LAQ (Learnable Amax Quantization) algorithm
#1247 opened Apr 13, 2026 by realAsma Contributor Loading…
4 tasks
ProTip! no:milestone will show everything without a milestone.