MTEB: Massive Text Embedding Benchmark
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Multilingual sentence & image embeddings with BERT
Modular Deep Reinforcement Learning framework in PyTorch
Foundation model for image generation
Stable Diffusion web UI
Image inpainting tool powered by SOTA AI Model
MobileLLM Optimizing Sub-billion Parameter Language Models
SWE-agent takes a GitHub issue and tries to automatically fix it
Stable Diffusion built-in to Blender
Ready-to-use OCR with 80+ supported languages
Management of Yandex Station and other smart home devices
A game engine powered by python and panda3d
Agent S: an open agentic framework that uses computers like a human
Faster Whisper transcription with CTranslate2
Automated Music Discovery and Collection Manager
MemU is an open-source memory framework for AI companions
Reference PyTorch implementation and models for DINOv3
A retro game engine for Python
Flower: A Friendly Federated Learning Framework
State-of-the-art (SoTA) text-to-video pre-trained model
OCR expert VLM powered by Hunyuan's native multimodal architecture
Open-sourced unified customization model
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning