Samurai-inspired multi-agent system for Claude Code
An LLM Compiler for Parallel Function Calling
A Simple and Universal Swarm Intelligence Engine
A high-throughput and memory-efficient inference and serving engine
Moonshot's most powerful AI model
An Easy-to-Use and High-Performance AI Deployment Framework
Run an army of Claude Code, Codex, etc. on your machine
Ongoing research training transformer models at scale
A state-of-the-art open visual language model
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Parallax is a distributed model serving framework
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Large-language-model & vision-language-model based on Linear Attention
The official repository for ERNIE 4.5 and ERNIEKit
Chat language model that can use tools and interpret the results
Run 100B+ language models at home, BitTorrent-style
Official Implementation of "Graph of Thoughts
Implementation of model parallel autoregressive transformers on GPUs
An implementation of model parallel GPT-2 and GPT-3-style models