Implementation of NÜWA, attention network for text to video synthesis
Text-conditional image generation model based on OpenAI's unCLIP
CPT: A Pre-Trained Unbalanced Transformer
A High Performance Library for Sequence Processing and Generation
Singing Voice Synthesis via Shallow Diffusion Mechanism
Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
Deep learning PyTorch library for time series forecasting
PyTorch implementation of MAE
Reformer, the efficient Transformer, in Pytorch
Facebook AI Research Sequence-to-Sequence Toolkit
ALIbaba's Collection of Encoder-decoders from MinD
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Adversarial Latent Autoencoders
Facebook AI research's automatic speech recognition toolkit
End-to-end object detection with transformers
Toolkit for Machine Learning, Natural Language Processing
CakeChat: Emotional Generative Dialog System
Toolkit for efficient experimentation with Speech Recognition
Open source speech models for Julius in English and other languages.
Beamforming and Speech Recognition Toolkit