Android Application Identifier for Packers, Protectors and Obfuscators
NVR with realtime local object detection for IP cameras
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
AI-data warehouse to enrich, transform and analyze unstructured data
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Effortless data labeling with AI support from Segment Anything
lightweight package to simplify LLM API calls
Implementation of TurboQuant (ICLR 2026)
A library for easily evaluating machine learning models and datasets
OCR software, free and offline
Foundation Model for Tabular Data
A unified framework for machine learning with time series
Deep universal probabilistic programming with Python and PyTorch
Synthetic data curation for post-training and data extraction
Stanford NLP Python library for many human languages
Datasets, transforms and models specific to Computer Vision
Extract schema, statistics and entities from datasets
Python SDK for agent monitoring, LLM cost tracking, benchmarking, etc.
Core ML tools contain supporting tools for Core ML model conversion
Adversarial Robustness Toolbox (ART) - Python Library for ML security
A command-line productivity tool powered by AI large language models
A modular graph-based Retrieval-Augmented Generation (RAG) system
A high-quality tool for convert PDF to Markdown and JSON
Helps scientists define testable, modular, self-documenting dataflow