GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Agent S: an open agentic framework that uses computers like a human
Weaving the Digital Agent Galaxy
A simple screen parsing tool towards pure vision based GUI agent
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Meta Agents Research Environments is a comprehensive platform
StreamSpeech is a seamless model for offline speech recognition
AnyTool: Universal Tool-Use Layer for AI Agents
A graphical manager for ollama that can manage your LLMs
Graphical User Interface Face Anonymization Tool
Real-time behaviour synthesis with MuJoCo, using Predictive Control
RetroScheme is used for molecule sketching and retrosynthesis
Unlimited, private and free Speech-To-Text program
AI Suite for upscaling, interpolating & restoring images/videos
AI-powered quiz solver for Windows. Free to use, easy to set up.
Meta-database application for the audio and TV broadcasts of CC2.TV
Leading free and open-source liveness check & face recognition system
Official Code for DragGAN (SIGGRAPH 2023)
Simple and powerful voice changer for Linux, written with Python & GTK
Automatic video transcription and translated subtitle generator
The development of my AI assistant, Alfred
Img2Txt - Extract Text From Images using AI
Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]