OCR model for complex documents with layout-aware structured outputs
End-to-end speech processing toolkit
AI tool that removes hardcoded subtitles and text from videos locally
Open Source Differentiable Computer Vision Library
Use LLMs and LLM Vision (OCR) to handle paperless-ngx
LLM.swift is a simple and readable library
Make websites accessible for AI agents
Multi-Agent daTa geneRation Infra and eXperimentation framework
The Open Source Alternative to Cluely
State of the Art Natural Language Processing
Efficient few-shot learning with Sentence Transformers
File Parser optimised for LLM Ingestion with no loss
Modest natural-language processing
Document content and metadata extraction microservice
Open source semantic search and text analytics for large document sets
The most accurate natural language detection library for Python
Run AI models end-to-end encrypted
Zero-code platform for building AI agents from natural language input
Public opinion analysis system
An enterprise-level AI development framework
Turns Data and AI algorithms into production-ready web applications
Haystack is an open source NLP framework to interact with your data
From Vibe Coding to Agentic Engineering
AutoGluon: AutoML for Image, Text, and Tabular Data
An on-premises, OCR-free unstructured data extraction