LMDeploy - Browse /v0.11.1 at SourceForge.net

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
lmdeploy-0.11.1+cu128-cp39-cp39-manylinux2014_x86_64.whl	2025-12-25	96.1 MB	0
lmdeploy-0.11.1+cu128-cp39-cp39-win_amd64.whl	2025-12-25	37.3 MB	0
lmdeploy-0.11.1+cu128-cp313-cp313-win_amd64.whl	2025-12-25	37.3 MB	0
lmdeploy-0.11.1+cu128-cp310-cp310-manylinux2014_x86_64.whl	2025-12-25	96.1 MB	0
lmdeploy-0.11.1+cu128-cp310-cp310-win_amd64.whl	2025-12-25	37.3 MB	0
lmdeploy-0.11.1+cu128-cp311-cp311-manylinux2014_x86_64.whl	2025-12-25	96.1 MB	0
lmdeploy-0.11.1+cu128-cp311-cp311-win_amd64.whl	2025-12-25	37.3 MB	1
lmdeploy-0.11.1+cu128-cp312-cp312-manylinux2014_x86_64.whl	2025-12-25	96.1 MB	0
lmdeploy-0.11.1+cu128-cp312-cp312-win_amd64.whl	2025-12-25	37.3 MB	0
lmdeploy-0.11.1+cu128-cp313-cp313-manylinux2014_x86_64.whl	2025-12-25	96.1 MB	0
README.md	2025-12-24	3.4 kB	0
v0.11.1 source code.tar.gz	2025-12-24	1.4 MB	0
v0.11.1 source code.zip	2025-12-24	2.2 MB	0
Totals: 13 Items		670.3 MB	1

What's Changed

Improve metrics by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4178
reserve blocks for dummy inputs by @grimoire in https://github.com/InternLM/lmdeploy/pull/4157
Add vision id for Qwen3-VL by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4183
[Enhance]: Return routed experts when request canceled by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/4197
Add mm processor args for Qwen3-VL by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4196
support chat_template_kwargs in v1/chat/completions by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4201
Refactor scheduler and engine.py by @grimoire in https://github.com/InternLM/lmdeploy/pull/4163
update dp timeout by @grimoire in https://github.com/InternLM/lmdeploy/pull/4204
Improve Qwen3-VL by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4207

[Fix]: Split routed experts with query lens by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/4180
[Maca] fix ray and memory sync by @wanfengcxz in https://github.com/InternLM/lmdeploy/pull/4164
Build block trie in prefill and add hit rate by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/4184
fix fope by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4191
fix hf modules read/write conflicts by multi processors by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4188
Some Minor fix by @windreamer in https://github.com/InternLM/lmdeploy/pull/4185
fix insecure deserialization when calling torch.load() by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4202
Fix processor args by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4200
remove get_model_config to avoid pickle hf_config error in rpc calling by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4217
Fix quant scale-fmt by @grimoire in https://github.com/InternLM/lmdeploy/pull/4212
Fix requests of mix return_logprobs by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/4222
fix fillkv quant8 by @grimoire in https://github.com/InternLM/lmdeploy/pull/4229
fix scale-fmt by @grimoire in https://github.com/InternLM/lmdeploy/pull/4230

[Docs]: Add guide for VLMEvalKit by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4156

Full Changelog: https://github.com/InternLM/lmdeploy/compare/v0.11.0...v0.11.1

Source: README.md, updated 2025-12-24