Scribe.js is a JavaScript library that provides Optical Character Recognition (OCR) and text extraction capabilities for both images and PDF documents, aimed at developers who want to build OCR features directly into their applications. The library can take image files (such as PNG or JPEG) and recognize the text they contain, and it can also extract text from PDF files that either already contain text or are image-based scans, using modern web standards and WebAssembly under the hood. In addition to simple text extraction, Scribe.js supports writing or injecting a high-quality invisible text layer back into PDFs, effectively making them searchable and improving usability for indexing or accessibility. It is written in modern ECMAScript Modules (ESM), so it can be imported in both browser and Node.js environments without a build step, though browser usage requires same-origin hosting of the files.

Features

  • Recognize text from images and scanned files
  • Extract or pull existing text from PDF documents
  • Insert searchable invisible text layers into PDF files
  • Supports usage in both browser (ESM) and Node.js environments
  • Example templates for popular frameworks
  • Designed for developer integration

Project Samples

Project Activity

See All Activity >

Categories

OCR

License

Affero GNU Public License

Follow Scribe.js

Scribe.js Web Site

Other Useful Business Software
Rezku Point of Sale Icon
Rezku Point of Sale

Designed for Real-World Restaurant Operations

Rezku is an all-inclusive ordering platform and management solution for all types of restaurant and bar concepts. You can now get a fully custom branded downloadable smartphone ordering app for your restaurant exclusively from Rezku.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Scribe.js!

Additional Project Details

Programming Language

JavaScript

Related Categories

JavaScript OCR Software

Registered

2026-02-13