EPUB to audiobook converter, optimized for Audiobookshelf
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
ComfyUI wrapper nodes for HunyuanVideo
Unifying 3D Mesh Generation with Language Models
State-of-the-art (SoTA) text-to-video pre-trained model
Qwen2.5-VL is the multimodal large language model series
Large-language-model & vision-language-model based on Linear Attention
SOTA discrete acoustic codec models with 40/75 tokens per second
Unified Multimodal Understanding and Generation Models
A Python tool that uses GPT-4, FFmpeg, and OpenCV
Multi-modal large language model designed for audio understanding
Chinese Llama-3 LLMs developed from Meta Llama 3
VITS2 backbone with multilingual-bert
Code for the paper Language Models are Unsupervised Multitask Learners
Ainee - AI Notetaking and Learning Companion
Framework that is dedicated to making neural data processing
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
An Open-Source Framework for Prompt-Learning
Easy-OCR solution and Tesseract trainer for GNU/Linux
Toolkit for Machine Learning, Natural Language Processing