
NVIDIA Model Optimizer 0.41.0
Name                                         Modified     Size
nvidia_modelopt-0.41.0-py3-none-any.whl      2026-01-20   934.6 kB
ModelOpt 0.41.0 Release source code.tar.gz   2026-01-19   11.7 MB
ModelOpt 0.41.0 Release source code.zip      2026-01-19   12.4 MB
README.md                                    2026-01-19   1.5 kB
Totals: 4 items, 25.1 MB

Bug Fixes

  • Fix Megatron KV Cache quantization checkpoint restore for QAT/QAD (device placement, amax sync across DP/TP, flash_decode compatibility).
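For context on the amax sync part of this fix, the idea can be illustrated abstractly: under data or tensor parallelism, each rank holds a locally calibrated amax for the KV cache, and all ranks must agree on the global maximum before quantization scales are derived, or ranks would dequantize with mismatched scales. A minimal sketch in plain Python, simulating an all-reduce(MAX) over hypothetical ranks; this is not the actual Megatron/ModelOpt code, which would use torch.distributed collectives:

```python
def allreduce_max(per_rank_amax):
    """Simulate an all-reduce(MAX) across parallel ranks.

    In a real DP/TP setup this would be torch.distributed.all_reduce
    with op=ReduceOp.MAX; here a plain list stands in for the
    per-rank amax values.
    """
    global_amax = max(per_rank_amax)
    # Every rank adopts the same global amax so quantization scales match.
    return [global_amax] * len(per_rank_amax)

# Each rank calibrated a slightly different amax for the KV cache.
local = [0.91, 1.07, 0.88, 1.02]
synced = allreduce_max(local)
print(synced)  # every rank now holds 1.07
```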

New Features

  • Add support for Transformer Engine quantization for Megatron Core models.
  • Add support for Qwen3-Next model quantization.
  • Add support for dynamically linked TensorRT plugins in the ONNX quantization workflow.
  • Add support for KV Cache Quantization in the vLLM FakeQuant PTQ script. See examples/vllm_serve/README.md for more details.
  • Add support for subgraphs in ONNX autocast.
  • Add support for parallel draft heads in Eagle speculative decoding.
  • Add support for custom emulated quantization backends. See register_quant_backend for details and tests/unit/torch/quantization/test_custom_backend.py for an example.
  • Add examples/llm_qad for QAD training with Megatron-LM.
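To illustrate what an emulated (fake) quantization backend does, here is a minimal per-tensor symmetric INT8 quantize-dequantize sketch in plain Python. The registry and decorator below are purely hypothetical stand-ins for the registration idea; the real register_quant_backend API lives in ModelOpt and its signature is not reflected here:

```python
QUANT_BACKENDS = {}  # hypothetical registry, not the ModelOpt one


def register_backend(name):
    """Decorator that records a fake-quant function under a name."""
    def deco(fn):
        QUANT_BACKENDS[name] = fn
        return fn
    return deco


@register_backend("int8_emulated")
def fake_quant_int8(values, amax):
    """Emulated per-tensor symmetric INT8 quantize-dequantize.

    Values are scaled to [-127, 127], rounded, clamped, then rescaled
    back, so the tensor stays in floating point but carries the
    rounding error a real INT8 kernel would introduce.
    """
    scale = amax / 127.0
    out = []
    for v in values:
        q = round(v / scale)
        q = max(-127, min(127, q))  # clamp to the INT8 range
        out.append(q * scale)
    return out


deq = QUANT_BACKENDS["int8_emulated"]([0.5, -1.0, 2.0], amax=2.0)
```

Emulated backends like this let accuracy be evaluated without deploying real low-precision kernels, which is what makes them useful for PTQ/QAT experiments.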

Deprecations

  • Deprecate the num_query_groups parameter in Minitron pruning (mcore_minitron). Use ModelOpt 0.40.0 or earlier if you still need to prune it.

Backward Breaking Changes

  • Remove torchprofile as a default dependency of ModelOpt, since it is used only for FLOPs-based FastNAS pruning of computer vision models. Install it separately if needed.
Source: README.md, updated 2026-01-19