deepseek free download

Showing 51 open source projects for "deepseek"

View related business solutions

Linux Clear Filters & Widen Search

Field Sales+ for MS Dynamics 365 and Salesforce
Maximize your sales performance on the go.

Bring Dynamics 365 and Salesforce wherever you go with Resco’s solution. With powerful offline features and reliable data syncing, your team can access CRM data on mobile devices anytime, anywhere. This saves time, cuts errors, and speeds up customer visits.

Learn More
Award-Winning Medical Office Software Designed for Your Specialty
Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.

Learn More
1

DeepSeek-V3

Powerful AI language model (MoE) optimized for efficiency/performance

DeepSeek-V3 is a robust Mixture-of-Experts (MoE) language model developed by DeepSeek, featuring a total of 671 billion parameters, with 37 billion activated per token. It employs Multi-head Latent Attention (MLA) and the DeepSeekMoE architecture to enhance computational efficiency. The model introduces an auxiliary-loss-free load balancing strategy and a multi-token prediction training objective to boost performance.

1 Review

Downloads: 117 This Week

Last Update: 2025-07-09
See Project
2

DeepSeek R1

Open-source, high-performance AI model with advanced reasoning

DeepSeek-R1 is an open-source large language model developed by DeepSeek, designed to excel in complex reasoning tasks across domains such as mathematics, coding, and language. DeepSeek R1 offers unrestricted access for both commercial and academic use. The model employs a Mixture of Experts (MoE) architecture, comprising 671 billion total parameters with 37 billion active parameters per token, and supports a context length of up to 128,000 tokens.

1 Review

Downloads: 63 This Week

Last Update: 2025-07-09
See Project
3

DeepSeek-OCR

Contexts Optical Compression

DeepSeek-OCR is an open-source optical character recognition solution built as part of the broader DeepSeek AI vision-language ecosystem. It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition.

Downloads: 9 This Week

Last Update: 2026-01-27
See Project
4

DeepSeek VL2

Mixture-of-Experts Vision-Language Models for Advanced Multimodal

DeepSeek-VL2 is DeepSeek’s vision + language multimodal model—essentially the next-gen successor to their first vision-language models. It combines image and text inputs into a unified embedding / reasoning space so that you can query with text and image jointly (e.g. “What’s going on in this scene?” or “Generate a caption appropriate to context”).

Downloads: 6 This Week

Last Update: 2025-10-03
See Project
Collect! is a highly configurable debt collection software
Everything that matters to debt collection, all in one solution.

The flexible & scalable debt collection software built to automate your workflow. From startup to enterprise, we have the solution for you.

Learn More
5

DeepSeek VL

Towards Real-World Vision-Language Understanding

DeepSeek-VL is DeepSeek’s initial vision-language model that anchors their multimodal stack. It enables understanding and generation across visual and textual modalities—meaning it can process an image + a prompt, answer questions about images, caption, classify, or reason about visuals in context. The model is likely used internally as the visual encoder backbone for agent use cases, to ground perception in downstream tasks (e.g. answering questions about a screenshot).

Downloads: 3 This Week

Last Update: 2025-10-03
See Project
6

DeepSeek V2

Strong, Economical, and Efficient Mixture-of-Experts Language Model

DeepSeek-V2 is the second major iteration of DeepSeek’s foundation language model (LLM) series. This version likely includes architectural improvements, training enhancements, and expanded dataset coverage compared to V1. The repository includes model weight artifacts, evaluation benchmarks across a broad suite (e.g. reasoning, math, multilingual), configuration files, and possibly tokenization / inference scripts.

Downloads: 3 This Week

Last Update: 2025-10-03
See Project
7

DeepSeek Coder

DeepSeek Coder: Let the Code Write Itself

DeepSeek-Coder is a series of code-specialized language models designed to generate, complete, and infill code (and mixed code + natural language) with high fluency in both English and Chinese. The models are trained from scratch on a massive corpus (~2 trillion tokens), of which about 87% is code and 13% is natural language. This dataset covers project-level code structure (not just line-by-line snippets), using a large context window (e.g. 16K) and a secondary fill-in-the-blank objective to encourage better contextual completions and infilling. ...

Downloads: 3 This Week

Last Update: 2025-11-11
See Project
8
$DeepSeek Math$

DeepSeek Math

Pushing the Limits of Mathematical Reasoning in Open Language Models

...The repo may also include modules that integrate external computational tools (e.g. a CAS / computer algebra system) or calculator assistance backends to enhance correctness. Because math reasoning is a high bar for LLMs, DeepSeek-Math aims to showcase their model’s ability not just in natural text but in precise formal reasoning.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
9

DeepSeek Coder V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models

DeepSeek-Coder-V2 is the version-2 iteration of DeepSeek’s code generation models, refining the original DeepSeek-Coder line with improved architecture, training strategies, and benchmark performance. While the V1 models already targeted strong code understanding and generation, V2 appears to push further in both multilingual support and reasoning in code, likely via architectural enhancements or additional training objectives.

Downloads: 35 This Week

Last Update: 2025-11-11
See Project
Failed Payment Recovery for Subscription Businesses
For subscription companies searching for a failed payment recovery solution to grow revenue, and retain customers.

FlexPay’s innovative platform uses multiple technologies to achieve the highest number of retained customers, resulting in reduced involuntary churn, longer life span after recovery, and higher revenue. Leading brands like LegalZoom, Hooked on Phonics, and ClinicSense trust FlexPay to recover failed payments, reduce churn, and increase customer lifetime value.

Learn More
10

DeepSeek-V3.2-Exp

An experimental version of DeepSeek model

DeepSeek-V3.2-Exp is an experimental release of the DeepSeek model family, intended as a stepping stone toward the next generation architecture. The key innovation in this version is DeepSeek Sparse Attention (DSA), a sparse attention mechanism that aims to optimize training and inference efficiency in long-context settings without degrading output quality.

Downloads: 17 This Week

Last Update: 2025-11-18
See Project
11

DeepSeek-OCR 2

Visual Causal Flow

DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents with rich spatial structure. ...

Downloads: 14 This Week

Last Update: 2026-02-03
See Project
12

DeepSeek Prover V2

Advancing Formal Mathematical Reasoning via Reinforcement Learning

DeepSeek-Prover-V2 is DeepSeek’s specialized model for formal theorem proving, particularly targeting proof in Lean 4. The repository describes how they use recursive proof decomposition by prompting DeepSeek-V3 to break complex theorems into subgoals, synthesize proof sketches, and then combine them to bootstrap training data. They then fine-tune via reinforcement learning with binary correct/incorrect feedback to integrate informal reasoning with formal proof behavior. ...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
13

DeepSeek AIO

Access and use all DeepSeek AI models in one program.

DeepSeek AIO is a simple program that allows you to interact with all DeepSeek large language models in one place. It supports text-based chats, data analysis, code generation, language translation, and more. The program is designed to make it easy for users to use DeepSeek's AI tools for different purposes without switching between multiple platforms.

Downloads: 16 This Week

Last Update: 2025-11-26
See Project
14

Ollama

Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.

Ollama is an open-source platform that enables developers to run large language models locally on their own machines. It simplifies working with modern AI models by providing a unified interface to download, manage, and interact with them. Users can run models like Llama, Gemma, Qwen, and others directly from the command line or through APIs. Ollama also integrates with popular developer tools and AI agents, allowing seamless workflows across coding environments and applications. It supports...

Downloads: 407 This Week

Last Update: 5 days ago
See Project
15

GLM-5

From Vibe Coding to Agentic Engineering

...Building on earlier GLM series models, GLM-5 dramatically scales the parameter count (to roughly 744 billion) and expands pre-training data to significantly improve performance on complex tasks such as multi-step reasoning, software engineering workflows, and agent orchestration compared to its predecessors like GLM-4.5. It incorporates innovations like DeepSeek Sparse Attention (DSA) to preserve massive context windows while reducing deployment costs and supporting long context processing, which is crucial for detailed plans and agent tasks.

Downloads: 207 This Week

Last Update: 5 days ago
See Project
16

DeepSeek LLM

DeepSeek LLM: Let there be answers

The DeepSeek-LLM repository hosts the code, model files, evaluations, and documentation for DeepSeek’s LLM series (notably the 67B Chat variant). Its tagline is “Let there be answers.” The repo includes an “evaluation” folder (with results like math benchmark scores) and code artifacts (e.g. pre-commit config) that support model development and deployment.

Downloads: 6 This Week

Last Update: 2025-10-03
See Project
17

bolt.diy

Prompt, run, edit, & deploy full-stack web applications using any LLM

bolt.diy is an open-source platform that allows you to easily create, run, edit, and deploy full-stack web applications using a variety of large language models (LLMs). It supports popular models like OpenAI, Anthropic, Ollama, OpenRouter, Gemini, LMStudio, Mistral, xAI, HuggingFace, DeepSeek, and Groq, and provides the flexibility to integrate additional models through the Vercel AI SDK. Whether you’re experimenting with pre-built models or developing custom AI-driven applications, bolt.diy offers a smooth and intuitive experience for building AI-powered web apps. Its open-source nature invites community contributions, and it serves as an ideal platform for developers looking to leverage the latest AI technologies.

1 Review

Downloads: 20 This Week

Last Update: 2025-05-12
See Project
18

Deep Research Web UI

AI-powered research assistant that performs iterative, deep research

...Built with modern web technologies such as Vue and TypeScript, it provides a responsive interface for managing research sessions, tracking intermediate steps, and reviewing collected data. The system supports integration with advanced models like DeepSeek R1, enabling more sophisticated reasoning and contextual understanding across multiple sources.

Downloads: 1 This Week

Last Update: 2026-03-19
See Project
19

Profile Data

Analyze computation-communication overlap in V3/R1

profile-data is a repository that publishes profiling traces and metrics from DeepSeek’s training and inference infrastructure (especially during DeepSeek-V3 / R1 experiments). The profiling data targets insights into computation-communication overlap, pipeline scheduling (e.g. DualPipe), and how MoE / EP / parallelism strategies interact in real systems. The repository contains JSON trace files like train.json, prefill.json, decode.json, and associated assets. Users can load them into tools like Chrome tracing to inspect GPU idle times, overlapping operations, and scheduling alignment. ...

Downloads: 1 This Week

Last Update: 2025-10-03
See Project
20

3FS

A high-performance distributed file system

...By handling caching and batching at a system level, 3FS helps reduce overhead when many features or modules must be evaluated per input (e.g. in an LLM agent pipeline). The repository includes example integration with models like DeepSeek-V2 / V3, showing how 3FS can be plugged into pipelines for operations like plugin processing.

Downloads: 1 This Week

Last Update: 2026-03-30
See Project
21

DeepClaude

Unleash Next-Level AI

...It is built around the concept of model collaboration, where one model specializes in reasoning while another focuses on output refinement, resulting in more accurate and efficient responses. The system commonly pairs models such as DeepSeek R1 with Claude or Gemini, leveraging their complementary strengths to produce results that outperform individual models in benchmarks and real-world usage scenarios. DeepClaude is designed with compatibility in mind, supporting OpenAI-style APIs and allowing integration with various third-party model providers and routing services. ...

Downloads: 0 This Week

Last Update: 2026-03-23
See Project
22

DualPipe

A bidirectional pipeline parallelism algorithm

DualPipe is a bidirectional pipeline parallelism algorithm open-sourced by DeepSeek, introduced in their DeepSeek-V3 technical framework. The main goal of DualPipe is to maximize overlap between computation and communication phases during distributed training, thus reducing idle GPU time (i.e. “pipeline bubbles”) and improving cluster efficiency. Traditional pipeline parallelism methods (e.g. 1F1B or staggered pipelining) leave gaps because forward and backward phases can’t fully overlap with communication. ...

Downloads: 0 This Week

Last Update: 2025-12-25
See Project
23

RTP-LLM

Alibaba's high-performance LLM inference engine for diverse apps

...The framework is designed for large-scale AI services and is already used internally across several Alibaba platforms such as Taobao, Amap, and other business systems that rely on conversational or search-related AI services. RTP-LLM supports a wide variety of modern model architectures, including Qwen, DeepSeek, and Llama-based models, making it a flexible engine for deploying many different open-source LLMs.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
24

DeepSeek MoE

Towards Ultimate Expert Specialization in Mixture-of-Experts Language

DeepSeek-MoE (“DeepSeek MoE”) is the DeepSeek open implementation of a Mixture-of-Experts (MoE) model architecture meant to increase parameter efficiency by activating only a subset of “expert” submodules per input. The repository introduces fine-grained expert segmentation and shared expert isolation to improve specialization while controlling compute cost.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
25

Anx Reader

Featuring powerful AI capabilities and supporting e-book formats

...It supports major formats (EPUB, MOBI, AZW3, FB2, TXT) and integrates powerful AI tools for summarizing and intelligent navigation via OpenAI, Claude, Gemini, and DeepSeek. Anx also syncs progress, notes, and highlights over WebDAV, and offers rich analytics—including heatmaps and exportable reading summaries. UI customization and text-to-speech enhance the reading experience.

Downloads: 30 This Week

Last Update: 2026-03-19
See Project