Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "character recognition source code"

x

Sort By:

Relevance

Clear All Filters

OS

ChromeOS 40
BSD 40
Linux 40
More...
Mac 40
Windows 40
Desktop Operating Systems 1

Category

Artificial Intelligence 40
Multimedia 3
Software Development 3
Security 2
Education 1
Games 1
Scientific/Engineering 1
Text Editors 1

License

OSI-Approved Open Source 37

Translations

English 4
Catalan 1
Czech 1
Dutch 1
More...
Hindi 1
Italian 1
Japanese 1
Lithuanian 1
Persian 1
Polish 1
Russian 1
Slovak 1
Tamil 1
Turkish 1
Vietnamese 1

Programming Language

Python 23
C++ 5
JavaScript 5
Java 4
More...
C 2
C# 2
BASIC 1
Lua 1
MATLAB 1
PHP 1
Prolog 1
TypeScript 1
Unix Shell 1

Status

Alpha 3
Planning 2
Beta 2
Production/Stable 2
More...
Pre-Alpha 1

40 projects for "character recognition source code" with 2 filters applied:

Artificial Intelligence ChromeOS Clear Filters & Widen Search

Caller ID Reputation provides the most comprehensive view of your caller ID scores across all carriers
Instantly identify flagged caller IDs and decrease flags by up to 95% your first month.

Keep your agents on the phone with increased connection rates by monitoring your phone number reputation across all major carriers and call blocking apps.

Learn More
Complete Data Management for Nonprofits
Designed to fit with multi-level non-profit organization, across any sector

NewOrg is a robust platform built with enhanced features to help non-profit organizations that capture and integrate the information from all of their operational areas to better manage volunteers, clients, programs, outcome reporting, activity sign-ups & scheduling, communications, surveys, fundraising activities and Development campaigns. NewOrg can truly deliver an intuitive product that will help manage your Committees, Donors, Events, and Memberships so that the organization runs efficiently.

Learn More
1

SimpleHTR

Handwritten Text Recognition (HTR) system implemented with TensorFlow

...It also employs connectionist temporal classification (CTC) to align predicted character sequences with input images without requiring character-level segmentation. The repository provides code for training models, performing inference on handwritten text images, and evaluating recognition accuracy. SimpleHTR is commonly used as an educational example for understanding how modern handwriting recognition systems operate.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
2

DeepSeek-OCR

Contexts Optical Compression

DeepSeek-OCR is an open-source optical character recognition solution built as part of the broader DeepSeek AI vision-language ecosystem. It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body text, interpreting tables, or recognizing handwritten versus printed words. ...

Downloads: 8 This Week

Last Update: 2026-01-27
See Project
3

Open Semantic Search

Open source semantic search and text analytics for large document sets

...Open Semantic Search includes an ETL framework that can ingest documents, process them through analysis steps, and enrich the data with extracted information such as named entities and metadata. It also supports optical character recognition to extract text from images and scanned documents, including images embedded inside PDF files. It integrates text mining and analytics capabilities that allow users to examine relationships, topics, and structured data within document collections.

Downloads: 6 This Week

Last Update: 6 days ago
See Project
4

GLM-OCR

Accurate × Fast × Comprehensive

GLM-OCR is an open-source multimodal optical character recognition (OCR) model built on a GLM-V encoder–decoder foundation that brings robust, accurate document understanding to complex real-world layouts and modalities. Designed to handle text recognition, table parsing, formula extraction, and general information retrieval from documents containing mixed content, GLM-OCR excels across major benchmarks while remaining highly efficient with a relatively compact parameter size (~0.9B), enabling deployment in high-concurrency services and edge environments. ...

Downloads: 16 This Week

Last Update: 2026-04-08
See Project
Secure Online Fax and Business Text Messaging Service
Elevate your business communications with secure SMS and fax solutions.

Send and receive SMS and fax online, from email, app or with our developer friendly SMS & fax API. HIPAA compliant & ISO 27001 certified. Outstanding value and 5-star service.

Learn More
5

DeepSeek-OCR 2

Visual Causal Flow

DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents with rich spatial structure. ...

Downloads: 9 This Week

Last Update: 2026-02-03
See Project
6

Open-LLM-VTuber

Open source AI VTuber platform with voice chat and Live2D avatars

Open-LLM-VTuber is an open source platform designed to create AI-powered VTuber characters that can interact with users through voice and animated avatars. It enables hands-free conversations with large language models by combining speech recognition, language processing, and text-to-speech synthesis into a single system. Users can speak directly to the AI character, and the system can respond with a generated voice while animating a Live2D avatar to simulate a talking virtual personality. ...

Downloads: 21 This Week

Last Update: 2026-03-17
See Project
7

Text2Code for Jupyter notebook

A proof-of-concept jupyter extension which converts english queries

Text2Code for Jupyter notebook project is a proof-of-concept extension for Jupyter Notebook that allows users to generate Python code directly from natural language queries written in English. The tool is designed to simplify data analysis workflows by enabling users to describe their intended operation in plain language instead of manually writing code. When a user enters a textual command, the extension interprets the request and generates a corresponding Python code snippet that can be...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
8

Flock

Flock is a workflow-based low-code platform for building chatbots

Flock is a workflow-based low-code platform designed for building AI applications such as chatbots, retrieval-augmented generation systems, and multi-agent workflows. The platform uses a visual workflow architecture where different nodes represent processing steps such as input processing, model inference, retrieval operations, and tool execution. Developers can connect these nodes to create complex pipelines that orchestrate multiple language models and external services. Built on...

Downloads: 2 This Week

Last Update: 2026-03-09
See Project
9

LLPlayer

The media player for language learning, with dual subtitles

LLPlayer is an open-source media player designed specifically for language learning through video content. Unlike traditional media players, the application focuses on advanced subtitle-related features that help learners understand and interact with foreign language media more effectively. The player supports dual subtitles so users can simultaneously view text in both the original language and their native language while watching videos. It can also automatically generate subtitles in real...

Downloads: 40 This Week

Last Update: 19 hours ago
See Project
Modernize Your Lab with the #1 Rated LIMS
Labs that need a powerful LIMS system

Nothing is more critical to a lab’s success than the quality, security, and traceability of samples. The Lockbox LIMS system provides robust sample management functionality to laboratory professionals, giving them full visibility on every aspect of a sample’s journey, from accessioning to long-term storage.

Learn More
10

docext

An on-premises, OCR-free unstructured data extraction

docext is a document intelligence toolkit that uses vision-language models to extract structured information from documents such as PDFs, forms, and scanned images. The system is designed to operate entirely on-premises, allowing organizations to process sensitive documents without relying on external cloud services. Unlike traditional document processing pipelines that rely heavily on optical character recognition, docext leverages multimodal AI models capable of understanding both visual...

Downloads: 3 This Week

Last Update: 2026-03-12
See Project
11

Scribe.js

JavaScript OCR and text extraction for images and PDFs

Scribe.js is a JavaScript library that provides Optical Character Recognition (OCR) and text extraction capabilities for both images and PDF documents, aimed at developers who want to build OCR features directly into their applications. The library can take image files (such as PNG or JPEG) and recognize the text they contain, and it can also extract text from PDF files that either already contain text or are image-based scans, using modern web standards and WebAssembly under the hood. In...

Downloads: 2 This Week

Last Update: 2026-03-14
See Project
12

Advanced NLP with spaCy

Advanced NLP with spaCy: A free online course

Advanced NLP with spaCy is an open-source educational repository that provides the materials for an interactive course on advanced natural language processing using the spaCy library. The course is designed to teach developers how to build real-world NLP systems by combining rule-based techniques with machine learning models. The repository includes lessons, exercises, and examples that guide learners through tasks such as tokenization, named entity recognition, text classification, and training custom NLP models. ...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
13

dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

dots.ocr is a cutting-edge multilingual document parsing system built on a unified vision-language model that combines layout detection, text recognition, and structural understanding into a single architecture. Unlike traditional OCR pipelines that rely on multiple specialized components, dots.ocr integrates these processes end-to-end, reducing error propagation and improving consistency across tasks. The model is designed to recognize virtually any human script, making it highly effective...

Downloads: 1 This Week

Last Update: 2026-03-24
See Project
14

FAY

Framework for building AI-powered interactive digital humans and agent

Fay is an open source framework designed to build and deploy interactive digital humans powered by large language models. It acts as a middleware layer that connects digital character technologies with conversational AI systems and business applications. Fay supports various types of digital humans, including 2.5D and 3D avatars, and can be integrated with applications running on mobile devices, PCs, web platforms, and embedded systems.

Downloads: 0 This Week

Last Update: 2 days ago
See Project
15

CutLER

Code release for Cut and Learn for Unsupervised Object Detection

CutLER is an approach for unsupervised object detection and instance segmentation that trains detectors without human-annotated labels, and the repo also includes VideoCutLER for unsupervised video instance segmentation. The method follows a “Cut-and-LEaRn” recipe: bootstrap object proposals, refine them iteratively, and train detection/segmentation heads to discover objects across diverse datasets. The codebase provides training and inference scripts, model configs, and references to...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
16

FLUX.2

Official inference repo for FLUX.2 models

FLUX.2 is a state-of-the-art open-weight image generation and editing model released by Black Forest Labs aimed at bridging the gap between research-grade capabilities and production-ready workflows. The model offers both text-to-image generation and powerful image editing, including editing of multiple reference images, with fidelity, consistency, and realism that push the limits of what open-source generative models have achieved. It supports high-resolution output (up to ~4 megapixels),...

Downloads: 51 This Week

Last Update: 2026-03-12
See Project
17

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm

minbpe is a minimal, clean implementation of byte-level Byte Pair Encoding (BPE), the tokenization approach widely used in modern language models. It operates on UTF-8 encoded bytes rather than Unicode characters, which makes it robust to arbitrary text inputs and avoids needing a language-specific character vocabulary. The repository is structured as a teaching-oriented implementation that shows how to train a tokenizer by learning merge rules, then apply those merges to encode text into...

Downloads: 0 This Week

Last Update: 2026-03-02
See Project
18

Open Model Zoo

Pre-trained Deep Learning models and demos

Open Model Zoo is a large repository of high-quality pre-trained deep learning models and demonstration applications designed to work with the OpenVINO™ toolkit, offering a comprehensive starting point for a wide range of AI and computer vision workloads. It includes hundreds of models covering object detection, classification, segmentation, pose estimation, speech recognition, text-to-speech, and more, many of which are already converted into formats optimized for inference on CPUs, GPUs,...

Downloads: 1 This Week

Last Update: 2026-01-10
See Project
19

HunyuanImage-3.0

A Powerful Native Multimodal Model for Image Generation

HunyuanImage-3.0 is a powerful, native multimodal text-to-image generation model released by Tencent’s Hunyuan team. It unifies multimodal understanding and generation in a single autoregressive framework, combining text and image modalities seamlessly rather than relying on separate image-only diffusion components. It uses a Mixture-of-Experts (MoE) architecture with many expert subnetworks to scale efficiently, deploying only a subset of experts per token, which allows large parameter...

1 Review

Downloads: 6 This Week

Last Update: 2026-02-03
See Project
20

Lingvo

Framework for building neural networks

Lingvo is a TensorFlow based framework focused on building and training sequence models, especially for language and speech tasks. It was originally developed for internal research and later open sourced to support reproducible experiments and shared model implementations. The framework provides a structured way to define models, input pipelines, and training configurations using a common interface for layers, which encourages reuse across different tasks. It has been used to implement state...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
21

vJEPA-2

PyTorch code and models for VJEPA2 self-supervised learning from video

VJEPA2 is a next-generation self-supervised learning framework for video that extends the “predict in representation space” idea from i-JEPA to the temporal domain. Instead of reconstructing pixels, it predicts the missing high-level embeddings of masked space-time regions using a context encoder and a slowly updated target encoder. This objective encourages the model to learn semantics, motion, and long-range structure without the shortcuts that pixel-level losses can invite. The...

Downloads: 0 This Week

Last Update: 2026-03-23
See Project
22

Armadillo

fast C++ library for linear algebra & scientific computing

* Fast C++ library for linear algebra (matrix maths) and scientific computing * Easy to use functions and syntax, deliberately similar to Matlab / Octave * Uses template meta-programming techniques to increase efficiency * Provides user-friendly wrappers for OpenBLAS, Intel MKL, LAPACK, ATLAS, ARPACK, SuperLU and FFTW libraries * Useful for machine learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc. * Downloads:...

Downloads: 2,679 This Week

Last Update: 11 hours ago
See Project
23

Hiera

A fast, powerful, and simple hierarchical vision transformer

Hiera is a hierarchical vision transformer designed to be fast, simple, and strong across image and video recognition tasks. The core idea is to use straightforward hierarchical attention with a minimal set of architectural “bells and whistles,” achieving competitive or superior accuracy while being markedly faster at inference and often faster to train. The repository provides installation options (from source or Torch Hub), a model zoo with pre-trained checkpoints, and code for evaluation and fine-tuning on standard benchmarks. ...

Downloads: 4 This Week

Last Update: 2025-10-08
See Project
24

VietOCR

Provides optical character recognition (OCR) solutions for Vietnamese language.

24 Reviews

Downloads: 196 This Week

Last Update: 2026-01-17
See Project
25

Style Aligned

Official code for Style Aligned Image Generation via Shared Attention

StyleAligned is a diffusion-model editing technique and codebase that preserves the visual “style” of an original image while applying new semantic edits driven by text. Instead of fully re-generating an image—and risking changes to lighting, texture, or rendering choices—the method aligns internal features across denoising steps so the target edit inherits the source style. This alignment acts like a constraint on the model’s evolution, steering composition, palette, and brushwork even as...

Downloads: 0 This Week

Last Update: 2025-10-10
See Project

Previous
You're on page 1
2
Next

Related Searches

ocr

jtessboxeditor-2.7.0.zip

repair

armadillo

voltha-cli

tesseract-ocr-w64-setup-v5.x.x.exe

tesseract-ocr

delphiocr

tesseract-ocr-w64-setup.exe

scan

Related Categories

Artificial Intelligence

Multimedia

Software Development

Security

Education

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Privacy Choices Advertise