Document (PDF, Word, PPTX ...) extraction and parsing API
Generate audiobooks from EPUBs, PDFs and text with captions
OCR model for complex documents with layout-aware structured outputs
A Repo For Document AI
Enhances Tesseract OCR output using LLMs (local or API)
Python ETL framework for stream processing, real-time analytics, LLM
PDF to Markdown with vision models
Video encoding GUI for Windows
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
Open source healthcare AI
New way to create web servers and NoSQL data models
Stable Diffusion web UI
OCR software, free and offline
Visual Causal Flow
The official Go library for the OpenAI API
Open speech-to-speech models and pipelines by Hugging Face
Text mining using tidy tools
Comprehensive Gradio WebUI for audio processing
Stable Diffusion web UI
Stanford CoreNLP, a Java suite of core NLP tools
Parser generator to read, process, or translate structured text
AI tool for automatic batch short video creation and editing
Persian NLP Toolkit
Faster Whisper transcription with CTranslate2
Translate a video from one language to another and embed dubbing