LLM-based Reinforcement Learning audio edit model
An opinionated CLI to transcribe audio files with Whisper on-device
Over 425 terminal color schemes/themes for iTerm/iTerm2
Qwen3 is the large language model series developed by the Qwen team
Shared repository for open-sourced projects from the Google AI Lang
21 Lessons, Get Started Building with Generative AI
Visual Causal Flow
Official PyTorch Implementation
ChatGPT extension for scientific research work
A TTS model capable of generating ultra-realistic dialogue
Repo of Qwen2-Audio chat & pretrained large audio language model
Adding guardrails to large language models
Powerful open source team chat application
Autonomous LLM agent for end-to-end data science workflows
Stable Diffusion web UI
LongBench v2 and LongBench (ACL '25 & '24)
A Coverage-Guided, Native Python Fuzzer
Circuit diagrams and firmware source code for Gboard DIY keyboards
Flexible Photo Recrafting While Preserving Your Identity
Large Multimodal Models for Video Understanding and Editing
Concatenate a directory full of files into a single prompt
Biomni: a general-purpose biomedical AI agent
Check links in web documents or full websites
Dealing with all unstructured data, such as reverse image search
Implementation of "MobileCLIP" CVPR 2024