Page 3 | Best Open Source Linux OCR Software 2026

OCR Software for Linux

View 25 business solutions

OCR Linux Clear Filters

Monitor production, track downtime and improve OEE.
For manufacturing companies interested in OEE monitoring solutions

Evocon is a visual and user-friendly OEE software that helps manufacturing companies improve productivity and remove waste as they become better.

Learn More
Complete Data Management for Nonprofits
Designed to fit with multi-level non-profit organization, across any sector

NewOrg is a robust platform built with enhanced features to help non-profit organizations that capture and integrate the information from all of their operational areas to better manage volunteers, clients, programs, outcome reporting, activity sign-ups & scheduling, communications, surveys, fundraising activities and Development campaigns. NewOrg can truly deliver an intuitive product that will help manage your Committees, Donors, Events, and Memberships so that the organization runs efficiently.

Learn More
1

Java OCR

Java OCR is a suite of pure java libraries for image processing and character recognition. Small memory footprint and lack of external dependencies makes it suitable for android development. Provides modular structure for easier deployment

21 Reviews

Downloads: 2 This Week

Last Update: 2016-11-29
See Project
2

OcrGui

A GUI for OCR programs.

1 Review

Downloads: 3 This Week

Last Update: 2013-05-14
See Project
3

DocWire SDK

Award-winning modern data processing SDK in C++20

DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.

Downloads: 5 This Week

Last Update: 2026-03-27
See Project
4

comic-translator

Based on Russian project "Overlay" ,A tool to translate comic books

Based on Russian project "Overlay" ,helping to translate comic books, Added funtion as ZIP RAR support Colorpicker OCR and more coming soon.

Downloads: 5 This Week

Last Update: 2016-11-20
See Project
Find out just how much your login box can do for your customer | Auth0
With over 53 social login options, you can fast-track the signup and login experience for users.

From improving customer experience through seamless sign-on to making MFA as easy as a click of a button – your login box must find the right balance between user convenience, privacy and security.

Sign up
5

DJVU++

The DjVu complete solution,with OCR Technology(Arabic ,English).

DjVu++ is a user-friendly program that used to manipulate DjVu file formats such as eBooks with a penalty of editing features. The program introduce a free replacement for the property PDF format with similar resolution and smaller file size DjVu++ also support OCR to handle text in scanned books and images. The program shows good performance for English. In addition to the Arabic language to lead free and commercial software in this area. The main features of DjVu++ program are: o Manipulate DjVu files. o Support smaller size than PDF with the same performance. o DjVu++ supports two languages in the OCR technique (Arabic and English). o Read multiple documents at the same time with the new tabs feature. o DjVu++ supports multiple formats:  Convert PDF document into DjVu format with smaller file size and the same performance.  Convert DjVu into PDF format.  Combine images to a single DjVu document. Perform OCR operations on multiple image formats.

4 Reviews

Downloads: 4 This Week

Last Update: 2015-08-24
See Project
6

hocr - Hebrew OCR

hocr - Hebrew OCR c/c++ library

Downloads: 4 This Week

Last Update: 2014-06-09
See Project
7

COSI

The Common OCR Service Interface. COSI is an API that allows developpers to easily bring OCR (Optical Character Recognition) capabilities to image processing applications. COSI supports existing OCR tools such as Tesseract, GOCR or GNU Ocrad.

Downloads: 3 This Week

Last Update: 2014-06-14
See Project
8

bnlviewer

METS / ALTO viewer written in Java and Javascript

The National library of Luxembourg's viewer for METS (http://www.loc.gov/standards/mets/) files with OCR files in the ALTO format. The viewer needs a tomcat application server to run in. It can be deployed so that it reads the METS files from a local folder. Its main use is for digitized newspapers and postcards but can be adapted to other METS profiles as well. The viewer can be seen in action at: http://www.eluxemburgensia.lu Other known users include: National library of Latvia (http://www.periodika.lv) University library of Belgrade (http://arhiva.unilib.rs/unilib/istorijskenovine/index.php?lang=en)

Downloads: 3 This Week

Last Update: 2016-02-16
See Project
9

cuneiformplus

Fork of OCR software cuneiform

Fork of OCR software cuneiform Original software see: https://launchpad.net/cuneiform-linux by Cognitive Technologies and Jussi Pakkanen Other Open Source OCR stuff see * Tesseract by Ray Smith (using the Leptonica image library) * GOCR * OCRAD

Downloads: 3 This Week

Last Update: 2020-12-08
See Project
Empowering Companies To Excel In Safety Data Sheet Compliance
For any organization using chemicals that require Safety Data Sheets

Effortless setup and maintenance: Simplified management and seamless online access to safety data sheets for your team

Learn More
10

Sanskrit / Hindi - Tesseract OCR

Devanagari fonts traineddata for Tesseract OCR

Read https://sourceforge.net/projects/tesseracthindi/files/OCRHindi_using_VietOCR_and_Tesseract.pdf/download for how to use vietocr gui for OCR of Hindi and Sanskrit texts using tesseract-ocr ***** Please see https://github.com/Shreeshrii/ imagessan and imageshin for newer box/tiff pairs, traineddata files, ocr evaluation statistics and ground truth files with images for Sanskrit and Hindi. ***** Following is OLD information - saved only for archival purposes. Tesseract OCR 3.02 provides hin.traineddata for recognizing texts in devanagari scripts. However the Hindi training texts, images and box files are not provided, so it is difficult to improve the accuracy by further improving the traineddata. It is noted that recognition is more accurate and faster if the training is done with the same /similar font as used in the text to be OCRed. See https://sourceforge.net/p/tesseracthindi/wiki/OCR%20for%20Devanagari/ for more details.

2 Reviews

Downloads: 1 This Week

Last Update: 2017-02-17
See Project
11

e-Dokyumento

e-Dokyumento is web-based Document Management System (DMS)

e-Dokyumento is opensource web-based Document Management System (DMS) A Document Management which automates the basic office document workflow such as receiving, filing, routing, and approving through capturing (scanning), digitizing (OCR Reading), storing, tagging, and electronically routing and approving (e-signature) of electronic documents. # Demo : https://e-dokyumento.herokuapp.com/ https://edokyu.seillig.com/ (refer to Readme.md for the accounts) #Dockerhub: https://hub.docker.com/r/nelsonmaligro/edokyumento # Install using the ISO: 1. Download: https://sourceforge.net/projects/e-dokyumento/files/Releases/e-DokyuV3.iso/download 2. Boot and login with: "root" and "admin@123" 3. Create 2 partitions: SWAP and / mount 4. Login and move "/opt/drive" folder to root: "mv /opt/drive /" # Install on Ubuntu: https://sourceforge.net/projects/e-dokyumento/files/Install%20e-Dokyumento%20on%20Ubuntu%20Linux.pdf/download

2 Reviews

Downloads: 1 This Week

Last Update: 2022-05-14
See Project
12

JOcrad

JOcrad is a graphical frontend for GNU/Ocrad written in Java. GNU Ocrad is an OCR (Optical Character Recognition) program based on a feature extraction method.JOcrad supports italian and english languages, JPG,PNG and GIF images.

Downloads: 2 This Week

Last Update: 2014-05-10
See Project
13

MyOCR

Start Your Own Captcha Solving Business Portal

Captcha Solutions OCR Captcha Solver Reseller Website to Start Your Own Captcha Solving Business Portal

Downloads: 2 This Week

Last Update: 2016-03-16
See Project
14

YagpoOCRUnicode c++library

OCR c++ library. Include: contour recognition; vectorisation; matrix letter feature recognition; auto page segmentation and detect rotation; SS3 ASM core; XML base; web-based GUI; 99,6% printed Unicode text recognition; letter base up to 1200 letters.

Downloads: 2 This Week

Last Update: 2013-04-08
See Project
15

abtool

Tested for Ubuntu Maverick - Create Audiobooks from eBooks, text or pictures. - Read eBooks or text aloud while scrolling through pages

Downloads: 2 This Week

Last Update: 2013-04-11
See Project
16

DIY Book Scanner Image Postprocessor

An image postprocessor for the DIY Book Scanner described on instructables.com and diybookscanner.org. Gets images ready for OCR or for PDF. Written in Java based on a partial port of the Leptonica image processing library.

1 Review

Downloads: 1 This Week

Last Update: 2013-04-18
See Project
17

Devanagari OCR

Devanagari Optical Character Recognition, Annotation tool

The project has source code and data related to the following tools: 1. Optical Character Recognition. Recognize machine printed Devanagari with or without a dictionary. 2. Document Image Analysis. Automatic page segmentation of document images in multiple Indian languages. Identifies pictures, lines, and words in a document scanned at 300 dpi. 3. Multi-lingual annotation. An interface that has transilteration and a soft-keyboard using which multiple languages can be input. The UI also enables users to view the word and character level ground truth of images. To cite this work, please use: "Devanagari OCR using a recognition driven segmentation framework and stochastic language models", Suryaprakash Kompalli, Srirangaraj Setlur, Venu Govindaraju, IJDAR, 2009, Volume: 12, Pg.: 123–138

1 Review

Downloads: 1 This Week

Last Update: 2019-07-25
See Project
18

PyCodeOCR

Turn your scanner into a free document reader for invoices (e.g. for e-banking) with the help of tesseract-ocr available for many unix (and also windows) platforms.

1 Review

Downloads: 1 This Week

Last Update: 2014-09-05
See Project
19

Eye

Eye is an experimental OCR (image-to-text) application.

2 Reviews

Downloads: 2 This Week

Last Update: 2014-09-27
See Project
20

Image Text Editor

Primary goal of Imated is development of handwritten/machine printed - OCR system. And second goal is development text editor, that will be in a position to import scanned documents OCR them on-the-fly, edit them and print/save as a picture again.

Downloads: 1 This Week

Last Update: 2013-02-27
See Project
21

Kuto

When translating becomes a game ! Text to translate can be graphically selected. Several dictionnaries can be sorted according to the context. A large choice of matching strategies is available. The OCR engine is tunable.

Downloads: 1 This Week

Last Update: 2013-02-22
See Project
22

isri-ocr-evaluation-tools

Alternative download page for Code and Data to evaluate OCR accuracy, originally from UNLV/ISRI

Downloads: 1 This Week

Last Update: 2014-05-25
See Project
23

tifftool - scanned image cleaner

Tifftool is a high-performance tool to clean scanned documents in preparation for onscreen display or for OCR. Features include skew correction, orientation correction, despeckle, page alignment, split pages and batch processing.

Downloads: 1 This Week

Last Update: 2013-03-20
See Project
24

A GPU-based Devanagiri OCR

Our Objective is to create a GPU-based system that can accept scanned inputs of printed Devanagari texts, and produce outputs of the same in Unicode with a very high accuracy (>99.9%).

Downloads: 0 This Week

Last Update: 2013-04-02
See Project
25

ACEM OCR

OCR Software developed by acem students as their minor project

Downloads: 0 This Week

Last Update: 2014-06-27
See Project