-
Notifications
You must be signed in to change notification settings - Fork 364
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: preserve q/k/v quantizer mapping in AST attention patching
#1307
opened Apr 21, 2026 by
Brumbelow
Loading…
4 tasks done
fix: validate modelopt state file structure on load
#1306
opened Apr 21, 2026 by
Brumbelow
Loading…
3 tasks done
Reorg the sparse/quant/common kernel dir
#1303
opened Apr 20, 2026 by
jingyu-ml
Contributor
Loading…
[CI] Bump test containers to latest
#1299
opened Apr 20, 2026 by
kevalmorabia97
Collaborator
Loading…
[1/3][Refactor]: File reorg; deprecate ParallelDraft
#1296
opened Apr 19, 2026 by
h-guo18
Contributor
Loading…
Add FP8 MHA quantization support for HuggingFace ViT
#1289
opened Apr 17, 2026 by
ajrasane
Contributor
Loading…
5 tasks
[Feat,Refactor]: Offline Dflash; Spec Mixin; Deprecate parallel draft;
#1271
opened Apr 16, 2026 by
h-guo18
Contributor
Loading…
Centralize 'trtexec' subprocess runs in ONNX into a single function
#1268
opened Apr 15, 2026 by
gcunhase
Contributor
Loading…
Handle zero-amax per-channel activation scaling for MoE export
#1265
opened Apr 15, 2026 by
AEON-7
Loading…
Fix non-scalar input amax in preprocess_linear_fusion for MoE export
#1264
opened Apr 15, 2026 by
AEON-7
Loading…
Exclude small-k and small-n Matmul nodes from Int8 quantization
#1256
opened Apr 14, 2026 by
nv-samcheng
Contributor
Loading…
Add EfficientViT support for torch_onnx quantization workflow
#1254
opened Apr 14, 2026 by
ajrasane
Contributor
Loading…
3 tasks done
Add a general composable $import system for YAML configs, and use it to implement composable recipes
#1253
opened Apr 14, 2026 by
shengliangxu
Collaborator
Loading…
fix(launcher): use afterany dependency for allow_to_fail pipelines
#1248
opened Apr 13, 2026 by
yeyu-nvidia
Contributor
Loading…
3 tasks
Add LAQ (Learnable Amax Quantization) algorithm
#1247
opened Apr 13, 2026 by
realAsma
Contributor
Loading…
4 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.