A game theoretic approach to explain the output of ml models
BitNet: Scaling 1-bit Transformers for Large Language Models
Hackable and optimized Transformers building blocks
Hunyuan Translation Model Version 1.5
Training Large Language Model to Reason in a Continuous Latent Space
Collaborative & Open-Source Quality Assurance for all AI models
Build AI-powered semantic search applications
A series of math-specific large language models of our Qwen2 series
Implementation for MatMul-free LM
Accelerate local LLM inference and finetuning
GLM-4 series: Open Multilingual Multimodal Chat LMs
Qwen3-omni is a natively end-to-end, omni-modal LLM
Unifying 3D Mesh Generation with Language Models
New family of code large language models (LLMs)
End-to-end speech processing toolkit
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
An MLOps framework to package, deploy, monitor and manage models
Flexible Photo Recrafting While Preserving Your Identity
An easy-to-use LLMs quantization package with user-friendly apis
Implementation of Recurrent Interface Network (RIN)
Transformers4Rec is a flexible and efficient library
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Text-to-Image generation. The repo for NeurIPS 2021 paper
Inference code and configs for the ReplitLM model family
Official PyTorch Implementation of "Scalable Diffusion Models"