Easy-to-use Speech Toolkit including Self-Supervised Learning model
A state-of-the-art open visual language model
Build AI-powered semantic search applications
Virtual AI anchor that combines state-of-the-art technology
Faster and easier training and deployments
Unified KV Cache Compression Methods for Auto-Regressive Models
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Synthetic data generators for tabular and time-series data
Time series Timeseries Deep Learning Machine Learning Pytorch fastai
Qwen3-omni is a natively end-to-end, omni-modal LLM
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
New family of code large language models (LLMs)
Democratizing Reinforcement Learning for LLMs
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
SOTA discrete acoustic codec models with 40/75 tokens per second
Controllable and fast Text-to-Speech for over 7000 languages
DeepMind model for tracking arbitrary points across videos & robotics
Gemma open-weight LLM library, from Google DeepMind
code for Mesh R-CNN, ICCV 2019
Designed for text embedding and ranking tasks
Best practices on recommendation systems
Capable of understanding text, audio, vision, video
A library for deep learning end-to-end dialog systems and chatbots