Large Language Model Text Generation Inference
Oobabooga - The definitive Web UI for local AI, with powerful features
High-performance inference server for text embeddings models API layer
Module for automatic summarization of text documents and HTML pages
Hypernetworks that adapt LLMs for specific benchmark tasks
Document (PDF, Word, PPTX ...) extraction and parse API
Provides line-oriented text file editing capabilities
AI tool that removes hardcoded subtitles and text from videos locally
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCRmyPDF adds an OCR text layer to scanned PDF files
Comprehensive Gradio WebUI for audio processing
Awesome multilingual OCR toolkits based on PaddlePaddle
Focus on prompting and generating
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Generate audiobooks from EPUBs, PDFs and text with captions
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A text-to-speech, speech-to-text and speech-to-speech library
Python tool for converting files and office documents to Markdown
TTS with kokoro and onnx runtime
Open source no-code system for text annotation and building of text
Code for running inference and finetuning with SAM 3 model
Open source annotation tool for machine learning practitioners
Video-based AI memory library. Store millions of text chunks in MP4
The behavior guidance framework for customer-facing LLM agents