PyTorch extensions for fast R&D prototyping and Kaggle farming
Robust Speech Recognition via Large-Scale Weak Supervision
Provides code for running inference with the Segment Anything Model (SAM)
A Family of Open Foundation Models for Code Intelligence
A SOTA open-source image editing model
Accurate × Fast × Comprehensive
Data manipulation and transformation for audio signal processing
A simple but complete full-attention transformer
Fast inference engine for Transformer models
Industrial-level controllable zero-shot text-to-speech system
LLM training code for MosaicML foundation models
End-to-end speech processing toolkit
Open-source industrial-grade ASR models
Pretrained time-series foundation model developed by Google Research
Multimodal model achieving SOTA performance
Swift async text-to-image for SwiftUI apps using OpenAI
A MATLAB package for modelling multivariate stimulus-response data
An unofficial Python package that returns responses from Google Bard
A Conversational Speech Generation Model
DeepSeek LLM: Let there be answers
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Consistency Distilled Diff VAE
Basaran, an open-source alternative to the OpenAI text completion API
Neural machine translation and sequence learning using TensorFlow
Transformer-related optimization, including BERT and GPT