Open speech-to-speech AI toolkit with models and pipelines by Hugging Face
Speech recognition module for Python
Open-source industrial-grade ASR models
Multilingual speech recognition and audio understanding model
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Replace OpenAI GPT with another LLM in your app
Translate the video from one language to another and embed dubbing
Omnilingual ASR: Open-Source Multilingual Speech Recognition
Open source AI VTuber platform with voice chat and Live2D avatars
AI-powered tool for generating, optimizing, and translating subtitles
Real-time voice interactive digital human
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Framework for building neural networks
Open source AI wearable platform for recording and summarizing speech
A Web UI for easy subtitle generation using the Whisper model
Python Audio Analysis Library: Feature Extraction, Classification
Build voice-based LLM agents. Modular + open source
Models for the spaCy Natural Language Processing (NLP) library
A very simple framework for state-of-the-art NLP
Mice speech to text with MX Cinnamon OS ISO
Stanford NLP Python library for many human languages
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant
A Deep-Learning-Based Chinese Speech Recognition System
Kashgari is a production-level NLP transfer learning framework