Chandra is an advanced OCR model designed to extract and structure information from complex documents such as tables, forms, handwritten notes, and mathematical content. It focuses on preserving full document layout, meaning that extracted text is accompanied by positional metadata like bounding boxes for each element. Chandra supports multiple output formats including Markdown, HTML, and JSON, making it suitable for downstream processing and integration into data pipelines. It is capable of handling over 40 languages and is optimized to read difficult inputs such as messy handwriting and multi-column layouts. Chandra can be run locally using transformer-based inference or deployed with a high-performance server setup for large-scale processing. It also includes command-line tools and optional web-based interfaces to simplify interaction and batch processing workflows.

Features

  • Extracts text with full layout awareness including bounding boxes
  • Supports output formats like Markdown, HTML, and JSON
  • Handles handwriting, tables, forms, and mathematical content
  • Works with over 40 languages for multilingual document processing
  • Offers CLI tools and optional server deployment for scalability
  • Can extract images and structured elements alongside text

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow Chandra

Chandra Web Site

Other Useful Business Software
Agentic AI SRE built for Engineering and DevOps teams. Icon
Agentic AI SRE built for Engineering and DevOps teams.

No More Time Lost to Troubleshooting

NeuBird AI's agentic AI SRE delivers autonomous incident resolution, helping team cut MTTR up to 90% and reclaim engineering hours lost to troubleshooting.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Chandra!

Additional Project Details

Programming Language

Python

Related Categories

Python Artificial Intelligence Software

Registered

2026-03-18