OCRmyPDF adds an OCR text layer to scanned PDF files
Always know what to expect from your data
Tom Preston-Werner's obvious, minimal language
TikZ figures for concepts in physics/chemistry/ML
Situational Awareness Server compatible with TAK clients
The lxml XML toolkit for Python
Video-based AI memory library. Store millions of text chunks in MP4
Create HTML profiling reports from pandas DataFrame objects
Edit PDF files with Nano Banana
CLI tool to extract (meta)data from PDF and manipulate PDF files
Cortex Analyzers Repository
A fast serialization and validation library, with builtin
A simple tool for reading in poorly redacted documents
Open-Source Python3 tool for recognizing layouts, tables, and math
Re-editable LaTeX/ typst graphics for Inkscape
The data structure for multimodal data
Open Security Controls Assessment Language (OSCAL)
Extract one time password (OTP) secrets from QR codes
Manipulate JSON-like data with NumPy-like idioms
openvpn-monitor is a web based OpenVPN monitor
Formula recognition based on LaTeX-OCR and ONNXRuntime
Yet another serialization library on top of dataclasses
Math OCR model that outputs LaTeX and markdown
A Python tool to help extracting information from structured PDFs
The social web translator