Python Stream Processing
A robust, efficient, low-latency speech-to-text library
TextWorld is a sandbox learning environment for the training
local-first semantic code search engine
GPU accelerated decision optimization
A lightweight vLLM implementation built from scratch
High-performance inference framework for large language models
Local long-term memory engine for AI apps with persistent storage
Cloud-native open source data warehouse for analytics and AI queries
Supercharge Your LLM with the Fastest KV Cache Layer
Benchmark LLMs by fighting in Street Fighter 3
Algorithmic Trading in Python with Machine Learning
Comprehensive Gradio WebUI for audio processing
Containerized automation engine for programmable CI/CD workflows
Pruna is a model optimization framework built for developers
950 line, minimal, extensible LLM inference engine built from scratch
Agent-ready RPA suite with visual workflow automation tools engine
SQL-Driven RAG Engine
A tension reasoning engine over 131 S-class problems
A game theoretic approach to explain the output of ml models
Low-latency AI inference engine optimized for mobile devices
Framework for building AI agents that automate complex web tasks
Core ML tools contain supporting tools for Core ML model conversion
Effortless data labeling with AI support from Segment Anything
Automated translation solution for visual novels