Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
OCR expert VLM powered by Hunyuan's native multimodal architecture
Visual Causal Flow
Accurate × Fast × Comprehensive
Multilingual Document Layout Parsing in a Single Vision-Language Model
Implementation of Nougat: Neural Optical Understanding for Academic Documents