A Web UI for easy subtitle generation using the Whisper model
Multi-modal large language model designed for audio understanding
Framework for building AI-powered interactive digital humans and agents
Build Vision Agents quickly with any model or video provider
Flowly is 100x faster than OpenClaw
Large language model (LLM) for a sales livestream anchor
Virtual AI anchor that combines state-of-the-art technology
Easy-to-use Speech Toolkit including Self-Supervised Learning model
A Conversational Speech Generation Model
Powerful Android AI agent with tools, automation, and Linux shell
Official Python inference and LoRA trainer package
Chat & pretrained large audio language model proposed by Alibaba Cloud
Generate blog articles from video or audio
One-click deployment (including offline integration package)
A very simple framework for state-of-the-art NLP
Pre-trained Deep Learning models and demos
FAIR Sequence Modeling Toolkit 2
Stanford NLP Python library for many human languages
Open source personal AI Assistant for Linux, Windows and Mac
Models for the spaCy Natural Language Processing (NLP) library
Low-latency AI inference engine optimized for mobile devices
Text to Speech Utility
Mice speech to text with MX Cinnamon OS ISO
AI framework for automated short video creation and editing tools
A Python tool that uses GPT-4, FFmpeg, and OpenCV