Mellum-4b-base is JetBrains’ first open-source large language model designed and optimized for code-related tasks. Built with 4 billion parameters and a LLaMA-style architecture, it was trained on over 4.2 trillion tokens across multiple programming languages, including datasets such as The Stack, StarCoder, and CommitPack. With a context window of 8,192 tokens, it excels at code completion, fill-in-the-middle tasks, and intelligent code suggestions for professional developer tools and IDEs. The model is efficient for both cloud inference with vLLM and local deployment using llama.cpp or Ollama, thanks to its bf16 precision and AMP training. While the base model is not fine-tuned for downstream tasks, it is designed to be easily adapted through supervised fine-tuning (SFT) or reinforcement learning (RL). Benchmarks on RepoBench, SAFIM, and HumanEval demonstrate its competitive performance, with specialized fine-tuned versions for Python already showing strong improvements.

Features

  • 4B parameter LLaMA-style architecture optimized for coding tasks
  • Trained on 4.2T tokens from The Stack, StarCoder, CommitPack, and Wikipedia
  • 8,192-token context window for handling larger codebases
  • Efficient for both cloud inference (vLLM) and local use (llama.cpp, Ollama)
  • Base model with support for SFT and RL fine-tuning for specific applications
  • Strong benchmark results on RepoBench, SAFIM, and HumanEval tasks
  • Includes Python SFT variant with superior performance over the base model
  • Licensed under Apache 2.0 for open and flexible use

Project Samples

Project Activity

See All Activity >

Categories

AI Models

Follow Mellum-4b-base

Mellum-4b-base Web Site

Other Useful Business Software
Office Ally: Healthcare Software for Your Medical Practice Icon
Office Ally: Healthcare Software for Your Medical Practice

We support healthcare organizations of all sizes with easy-to-use, affordable software solutions.

Service Center by Office Ally is a trusted revenue cycle management platform used by over 65,000 healthcare organizations processing more than 350 million claims annually. With it, providers can verify patient eligibility and benefits, upload and submit claims, correct rejected claims, check claim status, and obtain remits. With multiple claim types and submission options, providers can easily submit claims to any payer from any practice management system. Transactions are secure, ensuring the confidentiality of sensitive patient information. With no needed implementation, providers can quickly and effortlessly streamline their billing processes, increase their financial performance, simplify medical billing, and reduce claim rejections for faster reimbursements.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Mellum-4b-base!

Additional Project Details

Programming Language

Python

Related Categories

Python AI Models

Registered

2025-09-11