AI-powered document analysis and tagging for Paperless-ngx
A high-quality tool for converting PDF to Markdown and JSON
LLM framework for document understanding and semantic retrieval
Open Source Document Management System for Digital Archives
Get your documents ready for gen AI
Document (PDF, Word, PPTX ...) extraction and parse API
A Repo For Document AI
An on-premises, OCR-free unstructured data extraction
A Python Object-Document-Mapper for working with MongoDB
RAG-Anything: All-in-One RAG Framework
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Structured data extraction and instruction calling with ML, LLM
Sync and Async ODM (Object Document Mapper) for MongoDB
Document content and metadata extraction microservice
A high-quality PDF to Markdown tool based on a large language model
Parse files for optimal RAG
A Model Context Protocol (MCP) server implementation
Document-oriented database optimized for you
Multilingual Document Layout Parsing in a Single Vision-Language Model
AI tool converting video/audio into structured documents instantly
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Chat with your documents using local AI
File Parser optimised for LLM Ingestion with no loss
Low-code web framework for real-world applications
A system for agentic LLM-powered data processing and ETL