Agent skill + prompt templates that generate rich HTML pages
Open-source framework for intelligent speech interaction
Visual Blocks for ML is a Google visual programming framework
Backend and frontend application for tracking differences via images
Audio foundation model excelling in audio understanding
A text-to-speech, speech-to-text and speech-to-speech library
Modern IDE and code editor from Microsoft for Mac, Windows, and Linux
Repo of Qwen2-Audio chat & pretrained large audio language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Emulator for the Game Boy, Game Boy Color, and Game Boy Advance
C# support for Visual Studio Code (powered by OmniSharp)
Generate Canvas, Excalidraw, and Mermaid diagrams from text
This repo contains the code for a 1D tokenizer and generator
A native macOS menu bar app for managing audio device priorities
Module for adding visual regression testing to Cypress
Large Audio Language Model built for natural interactions
LLM-based reinforcement learning audio editing model
Multi-modal large language model designed for audio understanding
Official Python inference and LoRA trainer package
Visual Studio Code extension for Prettier
Repository for the Microsoft C/C++ extension for VS Code
Butterchurn is a WebGL implementation of the Milkdrop Visualizer
Taming Stable Diffusion for Lip Sync
Multimodal Diffusion with Representation Alignment
s&box is a modern game engine, built on Valve's Source 2