A little word cloud generator in Python
Multilingual Automatic Speech Recognition with word-level timestamps
Automatic Speech Recognition with Word-level Timestamps
Industrial-strength Natural Language Processing (NLP)
A robust, efficient, low-latency speech-to-text library
Han Language Processing
Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Paste Markdown and AI responses into Word and Excel instantly
Underthesea - Vietnamese NLP Toolkit
Library for OCR-related tasks powered by Deep Learning
Turn colors into words
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Standalone, small, language-neutral
A community-supported supercharged version of paperless
Private chat with local GPT with documents, images, video, etc.
Open source libraries and APIs to build custom preprocessing pipelines
Models for the spaCy Natural Language Processing (NLP) library
SOTA Open Source TTS
LLM framework for document understanding and semantic retrieval
Document (PDF, Word, PPTX ...) extraction and parsing API
Rich is a Python library for rich text and beautiful formatting
Open Security Controls Assessment Language (OSCAL)
Build AI-powered semantic search applications
Easy-to-use and powerful NLP library with Awesome model zoo
ktrain is a Python library that makes deep learning AI more accessible