Name                                          Modified    Size
llama_for_kobold.exe                          2023-03-25  10.6 MB
llamacpp_for_kobold.exe                       2023-03-24  10.6 MB
llamacpp-for-kobold-1.0.4 source code.tar.gz  2023-03-24  2.4 MB
llamacpp-for-kobold-1.0.4 source code.zip     2023-03-24  2.4 MB
README.md                                     2023-03-24  821 Bytes
Totals: 5 items, 26.1 MB

llamacpp-for-kobold-1.0.4

  • Added a script to build standalone PyInstaller .exe files, which will be used for all future releases. The llamacpp.dll and llama-for-kobold.py files are still available by cloning the repo, where they will continue to be updated.
  • Added token caching for prompts, allowing fast-forwarding through partially duplicated prompts. This makes edits near the end of the previous prompt much faster.
  • Merged improvements from parent repo.
  • Weights not included.
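
The token caching idea above can be illustrated with a minimal sketch. This is not the project's actual code: the function names `common_prefix_length` and `plan_evaluation` are hypothetical, and keeping at least one token to evaluate is an assumption made here so that a fresh prediction is always produced. The sketch only shows why an edit near the end of a prompt is cheap: the shared leading tokens are skipped, and only the differing tail is evaluated.

```python
from typing import List, Sequence, Tuple


def common_prefix_length(cached: Sequence[int], new: Sequence[int]) -> int:
    """Count how many leading tokens the two sequences share."""
    n = 0
    for a, b in zip(cached, new):
        if a != b:
            break
        n += 1
    return n


def plan_evaluation(cached: Sequence[int], new: Sequence[int]) -> Tuple[int, List[int]]:
    """Return (tokens_reused, tokens_still_to_evaluate) for a new prompt.

    `cached` holds the token ids of the previously evaluated prompt and
    `new` holds the token ids of the incoming prompt (both hypothetical).
    """
    keep = common_prefix_length(cached, new)
    # Assumption: always re-evaluate at least one token so the model
    # produces a fresh prediction even if the prompt is unchanged.
    if keep == len(new) and keep > 0:
        keep -= 1
    return keep, list(new[keep:])


if __name__ == "__main__":
    previous = [1, 2, 3, 4, 5]
    edited = [1, 2, 3, 9, 10]  # only the tail was edited
    reused, todo = plan_evaluation(previous, edited)
    print(reused, todo)
```

With these inputs the first three tokens are reused and only the last two need evaluating. The earlier in the prompt an edit occurs, the smaller the reusable prefix becomes.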

To use, download and run llamacpp_for_kobold.exe. Alternatively, drag and drop a compatible quantized llamacpp model onto the .exe, or run it and manually select the model in the popup dialog.

Once the model is loaded, you can connect at the following address (or use the full KoboldAI client): http://localhost:5001
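
To confirm that the server is listening before pointing a browser or client at it, a small reachability check along these lines may help. It is a sketch using only the Python standard library. The helper name `server_is_up` is hypothetical, and the only detail taken from the release notes is the address http://localhost:5001. It makes no assumption about which API routes the server exposes.

```python
import http.client
import urllib.error
import urllib.request


def server_is_up(url: str = "http://localhost:5001", timeout: float = 2.0) -> bool:
    """Return True if anything answers HTTP at `url`, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a success status.
        return True
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timed out, or the reply was not valid HTTP.
        return False


if __name__ == "__main__":
    if server_is_up():
        print("Server is reachable at http://localhost:5001")
    else:
        print("No response yet; the model may still be loading.")
```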

Source: README.md, updated 2023-03-24