Moonshot's most powerful AI model
A state-of-the-art open visual language model
Block Diffusion for Ultra-Fast Speculative Decoding
Large language model and vision-language model based on linear attention
Multimodal model achieving SOTA performance
FAIR Sequence Modeling Toolkit 2
Implementation of model parallel autoregressive transformers on GPUs
An implementation of model parallel GPT-2 and GPT-3-style models