DeepMind model for tracking arbitrary points across videos & robotics
code for Mesh R-CNN, ICCV 2019
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Renderer for the harmony response format to be used with gpt-oss
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Flowly is 100x faster than OpenClaw
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Framework and no-code GUI for fine-tuning LLMs
The Simple Agent Development Kit
LLM-based agent for general purpose software engineering tasks
A minimal yet professional single agent demo project
Concatenate a directory full of files into a single prompt
Python library and CLI tool to interface with Google Translate
Practical productivity tools for Claude Code, Codex-CLI
On-device Speech-to-Intent engine powered by deep learning
MCP integration platforms for AI agents to use tools at any scale
Containerized automation engine for programmable CI/CD workflows
Power CLI and Workflow manager for LLMs (core package)
Probabilistic time series modeling in Python
Python Crypto Bot (PyCryptoBot)
The Pocket Datalab
Open Source Computer Vision Library
Run GGUF models easily with a UI or API. One File. Zero Install.
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS