AI-powered document analysis and tagging for Paperless-ngx
The standard data-centric AI package for data quality and ML
Open source NLP guide with models, methods, and real use cases
AI-powered tool for efficient abstract and PDF screening
Apache OpenNLP
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
A very simple framework for state-of-the-art NLP
Bringing BERT into modernity via both architecture changes and scaling
Scalable data pre processing and curation toolkit for LLMs
Access and use all DeepSeek AI models in one program.
Award-winning modern data processing SDK in C++20
e-Dokyumento is web-based Document Management System (DMS)
CPU/GPU inference server for Hugging Face transformer models
Innovative text document search. http://dynaq.opendfki.de for details.
Library for fast text classification and representation
A machine learning system for supervised document classification
Multimodal Transformer for document image understanding and layout
Small 3B-base multimodal model ideal for custom AI on edge hardware