Open source libraries and APIs to build custom preprocessing pipelines
Instill Core is a full-stack AI infrastructure tool for data
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Extract schema, statistics and entities from datasets
Parse files for optimal RAG
Superlinked is a Python framework for AI Engineers
Central interface to connect your LLMs with external data
Vector database for scalable similarity search and AI applications
A fast, helpful, and open-source document parser
Context database designed specifically for AI Agents
Autonomous LLM agent for end-to-end data science workflows
A system for agentic LLM-powered data processing and ETL
CrateDB is a distributed and scalable SQL database
A modular graph-based Retrieval-Augmented Generation (RAG) system
The open source mesh processing system
Web framework designed for speed, security, and SEO
Lightweight library for scraping websites with LLMs
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Clean network diagrams. One-time setup, zero upkeep
AI-data warehouse to enrich, transform and analyze unstructured data
Claude Code skill for generating production-quality SVG+PNG technical
Python module for parsing semi-structured text into Python tables
Fast and efficient unstructured data extraction
Synthetic data generators for structured and unstructured text
An extensible framework for Personal Data Management