dude uncomplicated data extraction: A simple framework
CLI tool to extract (meta)data from PDF and manipulate PDF files
ExtractThinker is a Document Intelligence library for LLMs
Turn entire websites into LLM-ready markdown or structured data
MD/.JSON Document OCR and structured data extraction API
Crawl a website starting from a URL, find relevant pages
Structured data extraction and instruction calling with ML, LLM
Fast, local-first web content extraction for LLMs
Unreal Engine Archives Explorer
PDF Parser for AI-ready data. Automate PDF accessibility
AI-first Ruby framework for building fast, flexible web scraping spide
No-code LLM Platform to launch APIs and ETL Pipelines
Model Context Protocol server that integrates AgentQL's data
Fast and efficient unstructured data extraction
Make websites accessible for AI agents
Automatic extraction of relevant features from time series
AI-ready web crawler that extracts and structures website content
Claude Code skill for generating production-quality SVG+PNG technical
Clean network diagrams. One-time setup, zero upkeep
Extract and convert data from any document, images, PDFs, Word doc
Enhance any agent's browser use skill
Flexible Node.js AI-assisted crawler library
Document content and metadata extraction microservice
Open source web scraping system for automated data collection tasks
ContextGem: Effortless LLM extraction from documents