Official Python inference and LoRA trainer package
Qwen3 is the large language model series developed by Qwen team
Towards Human-Sounding Speech
Any model. Any hardware. Zero compromise
Personal AI, On Personal Devices
Parallax is a distributed model serving framework
Ling is a MoE LLM provided and open-sourced by InclusionAI
Run Local LLMs on Any Device. Open-source
Bayesian Modeling and Probabilistic Programming in Python
Taming Stable Diffusion for Lip Sync
Operating LLMs in production
Performance-optimized AI inference on your GPUs
Turn WiFi signals into real-time human pose estimation and detection
A library for accelerating Transformer models on NVIDIA GPUs
Minimal Python framework for scalable AI inference servers fast
LightLLM is a Python-based LLM (Large Language Model) inference
Accelerate local LLM inference and finetuning
State-of-the-art Parameter-Efficient Fine-Tuning
Multilingual speech recognition and audio understanding model
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Powering Amazon custom machine learning chips
Libraries for applying sparsification recipes to neural networks
OCR expert VLM powered by Hunyuan's native multimodal architecture
Multilingual Automatic Speech Recognition with word-level timestamps