| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| lmdeploy-0.11.1+cu128-cp39-cp39-manylinux2014_x86_64.whl | 2025-12-25 | 96.1 MB | |
| lmdeploy-0.11.1+cu128-cp39-cp39-win_amd64.whl | 2025-12-25 | 37.3 MB | |
| lmdeploy-0.11.1+cu128-cp313-cp313-win_amd64.whl | 2025-12-25 | 37.3 MB | |
| lmdeploy-0.11.1+cu128-cp310-cp310-manylinux2014_x86_64.whl | 2025-12-25 | 96.1 MB | |
| lmdeploy-0.11.1+cu128-cp310-cp310-win_amd64.whl | 2025-12-25 | 37.3 MB | |
| lmdeploy-0.11.1+cu128-cp311-cp311-manylinux2014_x86_64.whl | 2025-12-25 | 96.1 MB | |
| lmdeploy-0.11.1+cu128-cp311-cp311-win_amd64.whl | 2025-12-25 | 37.3 MB | |
| lmdeploy-0.11.1+cu128-cp312-cp312-manylinux2014_x86_64.whl | 2025-12-25 | 96.1 MB | |
| lmdeploy-0.11.1+cu128-cp312-cp312-win_amd64.whl | 2025-12-25 | 37.3 MB | |
| lmdeploy-0.11.1+cu128-cp313-cp313-manylinux2014_x86_64.whl | 2025-12-25 | 96.1 MB | |
| README.md | 2025-12-24 | 3.4 kB | |
| v0.11.1 source code.tar.gz | 2025-12-24 | 1.4 MB | |
| v0.11.1 source code.zip | 2025-12-24 | 2.2 MB | |
| Totals: 13 Items | 670.3 MB | 1 | |
What's Changed
🚀 Features
- [ascend] support dptp by @tangzhiyi11 in https://github.com/InternLM/lmdeploy/pull/4218
- Support Deepseek v32 by @grimoire in https://github.com/InternLM/lmdeploy/pull/4026
💥 Improvements
- Improve metrics by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4178
- reserve blocks for dummy inputs by @grimoire in https://github.com/InternLM/lmdeploy/pull/4157
- Add vision id for Qwen3-VL by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4183
- [Enhance]: Return routed experts when request canceled by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/4197
- Add mm processor args for Qwen3-VL by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4196
- support chat_template_kwargs in v1/chat/completions by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4201
- Refactor scheduler and engine.py by @grimoire in https://github.com/InternLM/lmdeploy/pull/4163
- update dp timeout by @grimoire in https://github.com/InternLM/lmdeploy/pull/4204
- Improve Qwen3-VL by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4207
🐞 Bug fixes
- [Fix]: Split routed experts with query lens by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/4180
- [Maca] fix ray and memory sync by @wanfengcxz in https://github.com/InternLM/lmdeploy/pull/4164
- Build block trie in prefill and add hit rate by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/4184
- fix fope by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4191
- fix hf modules read/write conflicts by multi processors by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4188
- Some Minor fix by @windreamer in https://github.com/InternLM/lmdeploy/pull/4185
- fix insecure deserialization when calling torch.load() by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4202
- Fix processor args by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4200
- remove get_model_config to avoid pickle hf_config error in rpc calling by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4217
- Fix quant scale-fmt by @grimoire in https://github.com/InternLM/lmdeploy/pull/4212
- Fix requests of mix return_logprobs by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/4222
- fix fillkv quant8 by @grimoire in https://github.com/InternLM/lmdeploy/pull/4229
- fix scale-fmt by @grimoire in https://github.com/InternLM/lmdeploy/pull/4230
📚 Documentations
- [Docs]: Add guide for VLMEvalKit by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4156
🌐 Other
- Add FA3 by @CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/4166
- Add distributed test cases by @littlegy in https://github.com/InternLM/lmdeploy/pull/4161
- Add generate test by @littlegy in https://github.com/InternLM/lmdeploy/pull/4181
- [ci] add mllm eval by @zhulinJulia24 in https://github.com/InternLM/lmdeploy/pull/4194
- [ascend] refactor code by @yao-fengchen in https://github.com/InternLM/lmdeploy/pull/4176
- install serve.txt when building the docker image by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4219
- bump version to v0.11.1 by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/4221
Full Changelog: https://github.com/InternLM/lmdeploy/compare/v0.11.0...v0.11.1