Open source plain text editor designed for writing novels
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
OCR software, free and offline
Open source annotation tool for machine learning practitioners
Video-based AI memory library. Store millions of text chunks in MP4
Edit PDF files with Nano Banana
SOTA Open Source TTS
Tokenizer-Free TTS for Multilingual Speech Generation
Official inference repo for FLUX.2 models
Qwen3-TTS is an open-source series of TTS models
Text and image to video generation: CogVideoX and CogVideo
A minimalist command line knowledge base manager
Robust Speech Recognition via Large-Scale Weak Supervision
A simple native web interface that uses ChatTTS to synthesize text
FastAPI framework, high performance, easy to learn, fast to code
Label Studio is a multi-type data labeling and annotation tool
Ready-to-use OCR with 80+ supported languages
A generative speech model for daily dialogue
A Powerful Native Multimodal Model for Image Generation
Implementation of Imagen, Google's Text-to-Image Neural Network
A Family of Open Sourced Music Foundation Models
Vim Win32 Installer
The behavior guidance framework for customer-facing LLM agents
A simple tool for reading in poorly redacted documents
Audiocraft is a library for audio processing and generation