Multilingual Automatic Speech Recognition with word-level timestamps
Automatic Speech Recognition with Word-level Timestamps
Industrial-strength Natural Language Processing (NLP)
A robust, efficient, low-latency speech-to-text library
Han Language Processing
Underthesea - Vietnamese NLP Toolkit
Paste Markdown and AI responses into Word and Excel instantly
Library for OCR-related tasks powered by Deep Learning
A community-supported supercharged version of paperless
Standalone, small, language-neutral
Private chat with local GPT with documents, images, video, etc.
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Models for the spaCy Natural Language Processing (NLP) library
SOTA Open Source TTS
LLM framework for document understanding and semantic retrieval
Open source libraries and APIs to build custom preprocessing pipelines
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Easy-to-use and powerful NLP library with Awesome model zoo
Open-source abilities for OpenHome agents
Build AI-powered semantic search applications
ktrain is a Python library that makes deep learning AI more accessible
Document (PDF, Word, PPTX ...) extraction and parsing API
A very simple framework for state-of-the-art NLP
Stanford NLP Python library for many human languages
Unleashing 10,000+ Word Generation from Long Context LLMs