Diffusion Transformer with Fine-Grained Chinese Understanding
Foundation model for image generation
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Ready-to-use OCR with 80+ supported languages
TigerBot: A multi-language multi-task LLM
Leading open-source visualization and observability platform
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Synchronized Translation for Videos
A large-scale model of medical consultation in Chinese
LongBench v2 and LongBench (ACL 25'&24')
State-of-the-art (SoTA) text-to-video pre-trained model
Simple PDF generation for Python
Official PyTorch Implementation
GPT4V-level open-source multi-modal model based on Llama3-8B
Lightweight Python tool for downloading videos from many platforms
A Telegram RSS bot that cares about your reading experience
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
A free and reliable P2P BitTorrent client
Spark-TTS Inference Code
Tutorial tailored for Chinese babies on rapid fine-tuning
Generate audiobooks from e-books
Industrial-level controllable zero-shot text-to-speech system
Chat & pretrained large vision language model
Qwen-Image is a powerful image generation foundation model