full stack framework free download

Llama Stack

Composable building blocks to build Llama Apps

Llama-Stack is an open-source framework designed to facilitate the deployment and fine-tuning of large language models (LLMs) for various natural language processing tasks.

Downloads: 12 This Week

Last Update: 6 days ago

See Project

Agent Stack

Deploy and share agents with open infrastructure

Agent Stack is an open infrastructure platform designed to take AI agents from prototype to production, no matter how they were built. It includes a runtime environment, multi-tenant web UI, catalog of agents, and deployment flow that seeks to remove vendor lock-in and provide greater autonomy. Under the hood it’s built on the “Agent2Agent” (A2A) protocol, enabling interoperability between different agent ecosystems, runtime services, and frameworks. The platform supports agents built in...

Downloads: 10 This Week

Last Update: 2026-03-30

See Project

BeeAI Framework

Build production-ready AI agents in both Python and Typescript

...The framework supports both Python and TypeScript with full feature parity, making it accessible to a wide range of developers and teams. It includes a unified backend layer that connects seamlessly to multiple large language model providers, allowing flexible deployment across different AI infrastructures without vendor lock-in. BeeAI also provides orchestration tools for designing dynamic workflows, enabling multiple agents to coordinate tasks through structured execution flows, retries, and parallel processing.

Downloads: 0 This Week

Last Update: 2026-03-24

See Project

Notte

Opensource browser using agents

Notte is an open-source browser framework that enables the development and deployment of web-based AI agents. It introduces a perception layer that transforms web pages into structured, navigable maps described in natural language, allowing agents to interact with the internet more effectively. Notte is designed for building scalable and efficient browser-based AI applications.

Downloads: 7 This Week

Last Update: 4 days ago

See Project

NVIDIA NeMo Framework

Scalable generative AI framework built for researchers and developers

NVIDIA NeMo is a scalable, cloud-native generative AI framework aimed at researchers and PyTorch developers working on large language models, multimodal models, and speech AI (ASR and TTS), with growing support for computer vision. It provides collections of domain-specific modules and reference implementations that make it easier to pre-train, fine-tune, and deploy very large models on multi-GPU and multi-node infrastructure. NeMo 2.0 introduces a Python-based configuration system,...

Downloads: 0 This Week

Last Update: 2026-03-23

See Project

Instill Core

Instill Core is a full-stack AI infrastructure tool for data

Instill Core is an open-source, full-stack AI infrastructure platform designed to orchestrate data pipelines, machine learning models, and unstructured data processing into a unified, production-ready system. It provides an end-to-end solution that enables developers to build, deploy, and manage AI-powered applications without needing to manually stitch together multiple tools across the data and model lifecycle.

Downloads: 7 This Week

Last Update: 2026-03-19

See Project

caveman

Why use many token when few token do trick

Caveman is a lightweight and experimental project focused on simplifying backend or full-stack development workflows through minimalistic abstractions and rapid prototyping principles. It is designed to reduce the complexity of modern frameworks by offering a stripped-down approach that prioritizes speed, clarity, and ease of use. The project often serves as a foundation for developers who want to build applications quickly without being constrained by heavy conventions or extensive configuration. ...

Downloads: 1 This Week

Last Update: 3 days ago

See Project

Pruna AI

Pruna is a model optimization framework built for developers

...Built with performance and developer ergonomics in mind, Pruna simplifies inference workflows by enabling multi-model orchestration, autoscaling, GPU resource allocation, and compatibility with popular open-source models. It is ideal for companies or teams looking to reduce reliance on external APIs while maintaining speed, cost-efficiency, and full control over their data and AI stack. With a focus on extensibility and observability, Pruna empowers engineers to scale LLM applications from prototype to production securely and reliably.

Downloads: 5 This Week

Last Update: 2026-03-09

See Project

Cactus

Low-latency AI inference engine optimized for mobile devices

Cactus is a low-latency, energy-efficient AI inference framework designed specifically for mobile devices and wearables, enabling advanced machine learning capabilities directly on-device. It provides a full-stack architecture composed of an inference engine, a computation graph system, and highly optimized hardware kernels tailored for ARM-based processors. Cactus emphasizes efficient memory usage through techniques such as zero-copy computation graphs and quantized model formats, allowing large models to run within the constraints of mobile hardware. ...

Downloads: 7 This Week

Last Update: 15 hours ago

See Project

Harbor LLM

Run a full local LLM stack with one command using Docker

Harbor is an open source, containerized toolkit designed to simplify running local large language model (LLM) environments. It combines a CLI and companion app to launch backends, frontends, and supporting services with minimal setup. With a single command, users can start preconfigured tools like Ollama and Open WebUI, enabling chat, workflows, and integrations immediately. Harbor supports multiple inference engines, including llama.cpp and vLLM, and connects them seamlessly to user...

Downloads: 16 This Week

Last Update: 2 days ago

See Project

Nano-vLLM

A lightweight vLLM implementation built from scratch

...Despite its compact design, nano-vllm incorporates advanced optimization techniques such as prefix caching, tensor parallelism, and CUDA graph execution to achieve high performance during model inference. The engine is intended primarily for educational use, experimentation, and lightweight deployments where a full production-grade inference stack may be unnecessary. Its API closely mirrors that of the original vLLM framework, allowing developers familiar with vLLM to adopt the tool with minimal changes.

Downloads: 0 This Week

Last Update: 2026-03-04

See Project

SWIFT LLM

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs

SWIFT LLM is a comprehensive framework developed within the ModelScope ecosystem for training, fine-tuning, evaluating, and deploying large language models and multimodal models. The platform provides a full machine learning pipeline that supports tasks ranging from model pre-training to reinforcement learning alignment techniques. It integrates with popular inference engines such as vLLM and LMDeploy to accelerate deployment and runtime performance.

Downloads: 1 This Week

Last Update: 1 day ago

See Project

firerpa LAMDA

The most powerful Android RPA agent framework

lamda is an Android RPA agent framework that provides visual remote desktop control and automation at scale, geared toward testing, automation validation, and device management. It exposes a clean UI to monitor and interact with connected devices and includes tooling to script actions reliably across apps and OS versions. The project emphasizes low-friction setup and powerful control primitives so teams can move from interactive validation to repeatable automation.

Downloads: 15 This Week

Last Update: 2026-03-22

See Project

Mistral Vibe CLI

Minimal CLI coding agent by Mistral

...Behind the scenes, it leverages Mistral’s coding-optimized LLM stack (including models tuned for code understanding and generation), with project-wide context awareness: it scans your file structure, Git status, and recent history to inform suggestions so that generated code aligns with existing context.

Downloads: 44 This Week

Last Update: 17 hours ago

See Project

ZenML

Build portable, production-ready MLOps pipelines

A simple yet powerful open-source framework that scales your MLOps stack with your needs. Set up ZenML in a matter of minutes, and start with all the tools you already use. Gradually scale up your MLOps stack by switching out components whenever your training or deployment requirements change. Keep up with the latest changes in the MLOps world and easily integrate any new developments.

Downloads: 5 This Week

Last Update: 6 days ago

See Project

LitServe

Minimal Python framework for scalable AI inference servers fast

LitServe is a minimal Python framework designed for building custom AI inference servers with full control over how models are executed and served. It allows developers to define their own inference logic, making it suitable for complex systems such as multi-model pipelines, agents, and retrieval-augmented generation workflows. Unlike traditional serving tools that enforce rigid abstractions, LitServe focuses on flexibility by letting users control request handling, batching strategies, and output processing directly in Python. ...

Downloads: 5 This Week

Last Update: 2026-03-18

See Project

Speech-AI-Forge

Speech-AI-Forge is a project developed around TTS generation model

Speech-AI-Forge is a full-stack project built around modern text-to-speech generation models, providing both an API server and a Gradio-based web UI for interactive use. At its core, it acts as a hub that wires together multiple speech-related capabilities, including TTS, speech-to-text and LLM-based control flows, behind a consistent interface. The system is designed to be deployed in several ways: you can try it online via hosted demos, spin it up in a one-click Colab environment, run it in Docker containers, or set it up locally with its environment preparation scripts. ...

Downloads: 2 This Week

Last Update: 2026-02-02

See Project

bitnet.cpp

Official inference framework for 1-bit LLMs

bitnet.cpp is the official open-source inference framework and ecosystem designed to enable ultra-efficient execution of 1-bit large language models (LLMs), which quantize most model parameters to ternary values (-1, 0, +1) while maintaining competitive performance with full-precision counterparts. At its core is bitnet.cpp, a highly optimized C++ backend that supports fast, low-memory inference on both CPUs and GPUs, enabling models such as BitNet b1.58 to run without requiring enormous compute infrastructure. ...

Downloads: 13 This Week

Last Update: 2026-03-10

See Project

Jina-Serve

Build multimodal AI applications with cloud-native stack

Jina Serve is an open-source framework designed for building, deploying, and scaling AI services and machine learning pipelines in production environments. The framework allows developers to create microservices that expose machine learning models through APIs that communicate using protocols such as HTTP, gRPC, and WebSockets. It is built with a cloud-native architecture that supports deployment on local machines, containerized environments, or large orchestration platforms such as...

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

TTRL

Test-Time Reinforcement Learning

...This makes the framework especially interesting for scenarios where models must keep adapting during evaluation or deployment instead of relying only on fixed pretraining and static fine-tuning. The repository is implemented on top of the verl ecosystem, which allows users to enable TTRL as part of an existing reinforcement learning workflow rather than building a new stack from scratch.

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

Transformer Engine

A library for accelerating Transformer models on NVIDIA GPUs

...Most deep learning frameworks train with FP32 by default. This is not essential, however, to achieve full accuracy for many deep learning models.

Downloads: 31 This Week

Last Update: 2026-03-31

See Project

LightZero

[NeurIPS 2023 Spotlight] LightZero

LightZero is an efficient, scalable, and open-source framework implementing MuZero, a powerful model-based reinforcement learning algorithm that learns to predict rewards and transitions without explicit environment models. Developed by OpenDILab, LightZero focuses on providing a highly optimized and user-friendly platform for both academic research and industrial applications of MuZero and similar algorithms.

Downloads: 33 This Week

Last Update: 2025-04-09

See Project

Ludwig AI

Low-code framework for building custom LLMs, neural networks

Declarative deep learning framework built for scale and efficiency. Ludwig is a low-code framework for building custom AI models like LLMs and other deep neural networks. Declarative YAML configuration file is all you need to train a state-of-the-art LLM on your data. Support for multi-task and multi-modality learning. Comprehensive config validation detects invalid parameter combinations and prevents runtime failures.

Downloads: 4 This Week

Last Update: 3 days ago

See Project

AutoAgent

AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework

AutoAgent is a fully automated, zero-code LLM agent framework that lets users create agents and workflows using natural language instead of manual coding and configuration. It is structured around modes that cover both “use” and “build” scenarios: a user mode for running a ready-made multi-agent research assistant, plus editors for creating individual agents or multi-agent workflows from conversational requirements. The framework emphasizes self-managing workflow generation, where it can...

Downloads: 10 This Week

Last Update: 2026-02-03

See Project

AutoResearchClaw

Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞

AutoResearchClaw is an open-source framework designed to automatically generate full academic research papers from a single idea or topic. Built in Python, it orchestrates a multi-stage research pipeline that gathers literature, formulates hypotheses, runs experiments, analyzes results, and writes the final paper. The system retrieves real academic references from sources such as arXiv and Semantic Scholar to ensure credible citations.

Downloads: 21 This Week

Last Update: 2026-04-01

See Project

Search Results for "full stack framework"

Showing 92 open source projects for "full stack framework"

Llama Stack

Agent Stack

BeeAI Framework

Notte

NVIDIA NeMo Framework

Instill Core

caveman

Pruna AI

Cactus

Harbor LLM

Nano-vLLM

SWIFT LLM

firerpa LAMDA

Mistral Vibe CLI

ZenML

LitServe

Speech-AI-Forge

bitnet.cpp

Jina-Serve

TTRL

Transformer Engine

LightZero

Ludwig AI

AutoAgent

AutoResearchClaw

Search Results for "full stack framework"

Showing 92 open source projects for "full stack framework"

Llama Stack

Agent Stack

BeeAI Framework

Notte

NVIDIA NeMo Framework

Instill Core

caveman

Pruna AI

Cactus

Harbor LLM

Nano-vLLM

SWIFT LLM

firerpa LAMDA

Mistral Vibe CLI

ZenML

LitServe

Speech-AI-Forge

bitnet.cpp

Jina-Serve

TTRL

Transformer Engine

LightZero

Ludwig AI

AutoAgent

AutoResearchClaw

Related Searches

Related Categories