Reference implementation of the Transformer architecture optimized
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer
GLIDE: a diffusion-based text-conditional image synthesis model
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Large-scale autoregressive pixel model for image generation by OpenAI
A mix of GAN implementations, including progressive growing
Code for the paper "Improved Techniques for Training GANs"
Dual LSTM Encoder for Dialog Response Generation