AI-powered document analysis and tagging for Paperless-ngx
The standard data-centric AI package for data quality and ML
Open source NLP guide with models, methods, and real use cases
AI-powered tool for efficient abstract and PDF screening
Apache OpenNLP
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
A very simple framework for state-of-the-art NLP
Bringing BERT into modernity via both architecture changes and scaling
Scalable data pre processing and curation toolkit for LLMs
Access and use all DeepSeek AI models in one program.
Award-winning modern data processing SDK in C++20
e-Dokyumento is web-based Document Management System (DMS)
CPU/GPU inference server for Hugging Face transformer models
State-of-the-art explainers for text-based machine learning models
Innovative text document search. http://dynaq.opendfki.de for details.
Library for fast text classification and representation
Document/Text Classification using Naive Bayes model.
GPU-based Textual kNN (GT-kNN)
A machine learning system for supervised document classification