| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Patch release v5.5.4 source code.tar.gz | 2026-04-13 | 17.5 MB | |
| Patch release v5.5.4 source code.zip | 2026-04-13 | 23.2 MB | |
| README.md | 2026-04-13 | 512 Bytes | |
| Totals: 3 Items | | 40.7 MB | 7 |
Patch release v5.5.4
This release mostly contains fixes that are good to have as soon as possible, mainly for tokenizers:

- Fix Kimi-K2.5 tokenizer regression and `_patch_mistral_regex` Attribute… (#45305) by ArthurZucker

For training:

- Fix [#45305] + add regression test GAS (#45349) by florian6973, SunMarc
- Fix IndexError with DeepSpeed ZeRO-3 when kernels rotary is active (#…) by ArthurZucker

And for Qwen2.5-VL:

- Fix Qwen2.5-VL temporal RoPE scaling applied to still images (#45330) by Kash6, zucchini-nlp