Open-Source AI Camera. Empower any camera/CCTV
Repo of Qwen2-Audio chat & pretrained large audio language model
NLP Cloud serves high-performance pre-trained or custom models for NER
The smallest, simplest JavaScript pixel-level image comparison library
The first AI that can earn its own existence, replicate, and evolve
LLM Frontend for Power Users
An on-premises, OCR-free unstructured data extraction
Assist in organizing your piles of documents
Image processing in Python
Matter AI is an open-source AI Code Reviewer Agent
2D and 3D face alignment library built using PyTorch
JavaScript OCR and text extraction for images and PDFs
A ranked list of awesome machine learning Python libraries
An image processing library written entirely in JavaScript for Node
Framework for building AI-powered interactive digital humans and agent
Ready-to-use OCR with 80+ supported languages
Advanced NLP with spaCy: A free online course
iOS application for Lumo
Multilingual Document Layout Parsing in a Single Vision-Language Model
Official inference repo for FLUX.2 models
Code release for Cut and Learn for Unsupervised Object Detection
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
SOTA Open Source TTS
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Pre-trained Deep Learning models and demos