An LLM Compiler for Parallel Function Calling
A Simple and Universal Swarm Intelligence Engine
A high-throughput and memory-efficient inference and serving engine
A state-of-the-art open visual language model
Ongoing research training transformer models at scale
Parallax is a distributed model serving framework
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Stanford NLP Python library for many human languages
Block Diffusion for Ultra-Fast Speculative Decoding
Language Model Reinforcement Learning Environments frameworks
Seamlessly integrate LLMs as Python functions
Large-language-model & vision-language-model based on Linear Attention
Making large AI models cheaper, faster and more accessible
The official repository for ERNIE 4.5 and ERNIEKit
FAIR Sequence Modeling Toolkit 2
Build production-ready AI agents in both Python and Typescript
Fault-tolerant, highly scalable GPU orchestration
Chat language model that can use tools and interpret the results
Your Automatic Prompt Engineering Assistant for GenAI Applications
Best practice TTS based on BERT and VITS
Run 100B+ language models at home, BitTorrent-style
Official Implementation of "Graph of Thoughts
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Implementation of model parallel autoregressive transformers on GPUs