PyTorch extensions for fast R&D prototyping and Kaggle farming
Robust Speech Recognition via Large-Scale Weak Supervision
Provides code for running inference with the SegmentAnything Model
A Family of Open Foundation Models for Code Intelligence
Accurate × Fast × Comprehensive
Fast inference engine for Transformer models
Data manipulation and transformation for audio signal processing
Industrial-level controllable zero-shot text-to-speech system
A simple but complete full-attention transformer
LLM training code for MosaicML foundation models
Pretrained time-series foundation model developed by Google Research
Multimodal model achieving SOTA performance
End-to-end speech processing toolkit
OpenAI swift async text to image for SwiftUI app using OpenAI
A MATLAB package for modelling multivariate stimulus-response data
A Conversational Speech Generation Model
The unofficial python package that returns response of Google Bard
DeepSeek LLM: Let there be answers
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Consistency Distilled Diff VAE
Basaran, an open-source alternative to the OpenAI text completion API
Neural machine translation and sequence learning using TensorFlow
Transformer related optimization, including BERT, GPT
Implementation of NÜWA, attention network for text to video synthesis
Text-conditional image generation model based on OpenAI's unCLIP