Download Latest Version Patch release v5.5.4 source code.tar.gz (17.5 MB)
Email in envelope

Get an email when there's a new version of Transformers

Home / v5.5.2
Name Modified Size InfoDownloads / Week
Parent folder
Patch release_ v5.5.2 source code.tar.gz 2026-04-09 17.5 MB
Patch release_ v5.5.2 source code.zip 2026-04-09 23.2 MB
README.md 2026-04-09 573 Bytes
Totals: 3 Items   40.7 MB 0

Small patch dedicated to optimizing gemma4, fixing inference with use_cache=False due to k/v states sharing between layers, as well as conversion mappings for some models that would inconsistently serialize their weight names. It contains the following PRs:

  • Add MoE to Gemma4 TP plan (#45219) by @sywangyi and @Cyrilvallez
  • [gemma4] Dissociate kv states sharing from the Cache (#45312) by @Cyrilvallez
  • [gemma4] Remove all shared weights, and silently skip them during loading (#45336) by @Cyrilvallez
  • Fix conversion mappings for vlms (#45340) by @Cyrilvallez
Source: README.md, updated 2026-04-09