Samurai-inspired multi-agent system for Claude Code
An LLM Compiler for Parallel Function Calling
A Simple and Universal Swarm Intelligence Engine
A high-throughput and memory-efficient inference and serving engine
Moonshot's most powerful AI model
An Easy-to-Use and High-Performance AI Deployment Framework
Run an army of Claude Code, Codex, etc. on your machine
Ongoing research training transformer models at scale
A state-of-the-art open visual language model
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Parallax is a distributed model serving framework
Large-language-model & vision-language-model based on Linear Attention
The official repository for ERNIE 4.5 and ERNIEKit
Chat language model that can use tools and interpret the results
Official Implementation of "Graph of Thoughts
Run 100B+ language models at home, BitTorrent-style
Implementation of model parallel autoregressive transformers on GPUs
An implementation of model parallel GPT-2 and GPT-3-style models