Open source plain text editor designed for writing novels
A text-to-speech, speech-to-text and speech-to-speech library
Contexts Optical Compression
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Tokenizer-Free TTS for Multilingual Speech Generation
Robust Speech Recognition via Large-Scale Weak Supervision
Edit PDF files with Nano Banana
A simple native web interface that uses ChatTTS to synthesize text
SOTA Open Source TTS
Official inference repo for FLUX.2 models
Mozc - a Japanese Input Method Editor designed for multiple platforms
A Powerful Native Multimodal Model for Image Generation
FastAPI framework, high performance, easy to learn, fast to code
Text and image to video generation: CogVideoX and CogVideo
Video-based AI memory library. Store millions of text chunks in MP4
Label Studio is a multi-type data labeling and annotation tool
Cut videos with a text editor
A Family of Open-Source Music Foundation Models
MTEB: Massive Text Embedding Benchmark
Audiocraft is a library for audio processing and generation
Ready-to-use OCR with 80+ supported languages
Offline inference engine for art, real-time voice conversations
Vim Win32 Installer
Official inference repo for FLUX.1 models
A simple, high-quality voice conversion tool focused on ease of use