OCRmyPDF adds an OCR text layer to scanned PDF files
Video-based AI memory library. Store millions of text chunks in MP4
The lxml XML toolkit for Python
Define and run multi-container applications with Docker
Edit PDF files with Nano Banana
Open Security Controls Assessment Language (OSCAL)
TikZ figures for concepts in physics/chemistry/ML
A simple tool for reading in poorly redacted documents
Situational Awareness Server compatible with TAK clients
Re-editable LaTeX/Typst graphics for Inkscape
Open-source Python 3 tool for recognizing layouts, tables, and math
LaTeX CV generator from a YAML/JSON input file
CLI tool to extract (meta)data from PDF and manipulate PDF files
Cortex Analyzers Repository
Tom Preston-Werner's obvious, minimal language
Package for converting and rendering markdown documents in TeX
Formula recognition based on LaTeX-OCR and ONNXRuntime
Easily serialize Data Classes to and from JSON
Build a GUI for your Python program with JavaScript, HTML, and CSS
CLI tool to filter JSON and JSON Lines data with Python syntax
Modern high-performance serialization utilities for Python
Command-line YAML, XML, TOML processor
simplejson is a simple, fast, extensible JSON encoder/decoder
tmux session manager, built on libtmux
CLI tool and Python library