EPUB to audiobook converter, optimized for Audiobookshelf
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Zero-copy PDF text extraction library written in Zig
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
Unifying 3D Mesh Generation with Language Models
State-of-the-art (SoTA) text-to-video pre-trained model
A library for converting HTML into PDFs using ReportLab
Qwen2.5-VL is the multimodal large language model series
Large-language-model & vision-language-model based on Linear Attention
SOTA discrete acoustic codec models with 40/75 tokens per second
Unified Multimodal Understanding and Generation Models
A Python tool that uses GPT-4, FFmpeg, and OpenCV
Multi-modal large language model designed for audio understanding
Chinese Llama-3 LLMs developed from Meta Llama 3
VITS2 backbone with multilingual-bert
PDF Combiner is a user-friendly, GUI-based tool built in
Convert files like docx, xlsx, pptx, html, and more to Markdown
Code for the paper Language Models are Unsupervised Multitask Learners
xSTUDIO is a high-performance playback and review tool.
Ainee - AI Notetaking and Learning Companion
Framework that is dedicated to making neural data processing
GUI for the document converter pandoc
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Dump PSG/YM chiptune files to txt and MIDI format
An Open-Source Framework for Prompt-Learning