ImageBind: One Embedding Space to Bind Them All
Diffusion Transformer with Fine-Grained Chinese Understanding
Windows GUI Automation with Python (based on text properties)
An open source implementation of CLIP
AI video generator optimized for low-VRAM and older GPUs
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Code for running inference and finetuning with the SAM 3 model
Edit PDF files with Nano Banana
High-Resolution Image Synthesis with Latent Diffusion Models
Director, Screenwriter, Producer, and Video Generator All-in-One
Chinese and English multimodal conversational language model
Tensor search for humans
A simple tool for reading in poorly redacted documents
NLP Cloud serves high-performance pre-trained or custom models for NER
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
21 Lessons, Get Started Building with Generative AI
AutoGluon: AutoML for Image, Text, and Tabular Data
An Open Source text-to-speech system built by inverting Whisper
Deep Research framework, combining language models with tools
ComfyUI wrapper nodes for WanVideo and related models
Integrate ChatGPT into your own Discord bot
Accurate × Fast × Comprehensive
Multilingual sentence & image embeddings with BERT
CLI tool to extract (meta)data from PDF and manipulate PDF files
High-Resolution 3D Assets Generation with Large Scale Diffusion Models