Showing 20 open source projects for "document"

View related business solutions
  • All Things Performance and Partner Marketing, All in One Place Icon
    All Things Performance and Partner Marketing, All in One Place

    Track calls, leads, and clicks without the manual work

    Automatically tie revenue back to campaigns, channels, publishers, and networks through marketing attribution. Spend less time juggling reports, and more time optimizing for growth by using a single operating solution for partner and performance marketing.
    Learn More
  • Pylon is an All-in-one B2B Support Platform for modern B2B businesses. Icon
    Pylon is an All-in-one B2B Support Platform for modern B2B businesses.

    Pylon is a modern support system that integrates with all B2B channels like Slack and Team.

    We bring together everything a post-sales teams team needs including a ticketing system, B2B omnichannel integrations (Slack Connect, Microsoft Teams), modern chat widget, knowledge base, AI support bot, account management, customer marketing, and more.
    Learn More
  • 1
    SemTools

    SemTools

    Semantic search and document parsing tools for the command line

    SemTools is an open-source command-line toolkit designed for document parsing, semantic indexing, and semantic search workflows. The project focuses on enabling developers and AI agents to process large document collections and extract meaningful semantic representations that can be searched efficiently. Built with Rust for performance and reliability, the toolchain provides fast processing of text and structured documents while maintaining low system overhead.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    PoloDB

    PoloDB

    PoloDB is an embedded document database

    PoloDB is an embedded document-oriented NoSQL database that provides MongoDB-like functionality in a lightweight package, ideal for local storage in applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    monolith

    monolith

    CLI tool for saving complete web pages as a single HTML file

    ...You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive. Unlike the conventional “Save page as”, monolith not only saves the target document, it embeds CSS, image, and JavaScript assets all at once, producing a single HTML5 document that is a joy to store and share. If compared to saving websites with wget -mpk, this tool embeds all assets as data URLs and therefore lets browsers render the saved page exactly the way it was on the Internet, even when no network connection is available.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    graphql_client

    graphql_client

    Typed, correct GraphQL requests and responses in Rust

    ...Copies documentation from the GraphQL schema to the generated Rust code. Arbitrary derives on the generated responses. Arbitrary custom scalars. Supports multiple operations per query document. Supports setting GraphQL fields as deprecated and having the Rust compiler check their use. Optional reqwest-based client for boilerplate-free API calls from browsers. Implicit and explicit null support.
    Downloads: 1 This Week
    Last Update:
    See Project
  • PeerGFS PEER Software - File Sharing and Collaboration Icon
    PeerGFS PEER Software - File Sharing and Collaboration

    One Solution to Simplify File Management and Orchestration Across Edge, Data Center, and Cloud Storage

    PeerGFS is a software-only solution developed to solve file management/file replication challenges in multi-site, multi-platform, and hybrid multi-cloud environments.
    Learn More
  • 5
    Bionic GPT

    Bionic GPT

    Bionic is an on-premise replacement for ChatGPT

    ...Beyond chat, Bionic focuses heavily on enterprise RAG by letting users create AI assistants that work with their own documents, share those assistants across teams, and configure embeddings, chunking, and system prompts through the UI. The platform supports a wide variety of document types, includes data isolation features for teams, and layers in security measures such as RBAC, row-level security in Postgres, strong content security policy settings, and minimal container builds.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Extractous

    Extractous

    Fast and efficient unstructured data extraction

    ...For broader format support, the system combines its Rust core with ahead-of-time compiled Apache Tika shared libraries, which allows it to extend parsing coverage while still avoiding traditional server-based overhead. It also supports OCR for images and scanned documents through Tesseract, making it useful for document ingestion pipelines that include image-based or scanned inputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    SurrealDB

    SurrealDB

    A scalable, distributed, collaborative, document-graph database

    With an SQL-style query language, real-time queries with highly-efficient related data retrieval, advanced security permissions for multi-tenant access, and support for performant analytical workloads, SurrealDB is the next generation serverless database. SurrealDB is the ultimate cloud database for tomorrow's applications. SurrealDB is an innovative NewSQL cloud database, suitable for serverless applications, jamstack applications, single-page applications, and traditional applications. It...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Sonic

    Sonic

    Fast, lightweight & schema-less search backend

    Sonic is a super fast and lightweight, schema-less search backend that can be used in place of super-heavy and full-featured search backends like Elasticsearch. It is able to normalize language search queries, auto-complete search queries and offer the most relevant results. Being an identifier index rather than a document index, when queried it provides IDs that can be used to refer to matched documents in an external database.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Kreuzberg

    Kreuzberg

    Polyglot document intelligence framework

    Kreuzberg is a flexible task orchestration and agent workflow platform designed to help developers build, coordinate, and scale intelligent agents or automation pipelines that interact with external services, runtime environments, and multi-step business workflows. It emphasizes modular design so that developers can define discrete tasks or “actions” and then compose them into complex flows where dependencies, parallel steps, and error handling are declaratively managed. This structure makes...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Point of Sale. Powerful and Simple. Icon
    Point of Sale. Powerful and Simple.

    For retail store owners and multi-location retail operations needing a tool to manage sales, inventory, staff and channels in one place

    Vibe Retail is an all-in-one retail point-of-sale and operations platform built for single-store and multi-location retailers seeking to unify inventory, sales, staff and customer data from one mobile-friendly interface. The system lets you track inventory across locations and warehouses, handle item variations (size, color, material), manage purchase orders and supplier deliveries, print custom barcodes, and transfer stock between stores in real time. On the sales side, Vibe supports multiple payment types (cards, cash, checks, gift cards, EBT), layaway workflows, serial number tracking, delivery management, loyalty programs and branded receipts. Retailers can integrate with online platforms (such as Shopify and WooCommerce), sync in-store and online sales, access 40+ real-time reports on sales, inventory and performance, set up promotions and discounts, and print receipts from mobile devices.
    Learn More
  • 10
    Note67

    Note67

    A private, local meeting notes assistant

    note67 is a private, local meeting notes assistant application that combines audio capture, transcription, and AI-powered summarization to help users document conversations and meetings on their own devices without relying on cloud services. Built with a cross-platform architecture using Rust (via Tauri) for backend logic and a TypeScript/React frontend, it prioritizes privacy by performing audio transcription locally with Whisper models and generating summaries with locally-hosted AI, eliminating the need to send sensitive meeting content to external servers. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    HelixDB

    HelixDB

    Graph-vector database for building unified AI backends fast

    ...HelixDB is built from scratch in Rust and uses LMDB as its storage engine, enabling high performance and low-latency query execution. HelixDB also supports additional data formats such as key-value, document, and relational data, making it flexible for a wide range of backend architectures. A central feature of the project is its custom query language, HelixQL, which is fully type-safe and compiled to ensure reliability and correctness in production environments. HelixDB includes built-in capabilities for embeddings, vector search, keyword search, and graph traversal, which are particularly useful for retrieval-augmented generation and agent-based systems.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Scanopy

    Scanopy

    Clean network diagrams, One-time setup, zero upkeep

    Scanopy is a powerful multi-modal data capture and analysis toolkit that enables users to collect, process, and visualize structured and unstructured information from a variety of sources in a flexible pipeline. It is built to handle complex scanning tasks — such as OCR, document analysis, audio transcription, network data capture, and image extraction — while providing unified APIs and workflows that make managing heterogeneous data sources seamless. Developers can compose custom pipelines that chain together transforms, filters, and exporters, enabling automation of tedious data preparation steps and accelerating insights with minimal code. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Korvus

    Korvus

    Korvus is a search SDK that unifies the entire RAG pipeline

    Korvus is an open-source retrieval-augmented generation (RAG) pipeline designed to run entirely inside PostgreSQL, allowing developers to build AI search and knowledge systems directly within a database environment. The project consolidates the typical steps of a RAG pipeline—including embedding generation, document retrieval, reranking, and text generation—into a single query executed within the Postgres ecosystem. By leveraging PostgresML and vector extensions such as pgvector, Korvus eliminates the need for external microservices typically used for AI search architectures, reducing both system complexity and latency. The architecture enables machine learning operations to occur directly in the database, minimizing data transfer between services and improving overall performance for large datasets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MongoDB Rust Driver

    MongoDB Rust Driver

    The official MongoDB Rust Driver

    ...Because it’s asynchronous by design, it works well with Rust async runtimes like Tokio and async-std, letting developers build highly concurrent networked services that efficiently use modern multicore hardware. The crate also includes BSON encoding and decoding that maps cleanly to Rust types, so developers can work with rich document structures while retaining Rust’s performance guarantees.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    gptcommit

    gptcommit

    A git prepare-commit-msg hook for authoring commit messages with GPT-3

    ...Commit messages are a key channel for developers to communicate their work with others, especially in code reviews. When making complex code changes, it can be tedious to thoroughly document the contents of each change. I often felt the impulse to just title my commit “fix bug” and move on. Surfacing these changes with gptcommit helps the author and reviewer by bringing attention to these additional changes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    sentinel

    sentinel

    Sentinel is a filesystem-backed document DBMS written in Rust.

    Sentinel is a filesystem-backed document DBMS built in Rust that prioritizes compliance, transparency, and auditability over raw performance. Unlike traditional databases, every document is a plain JSON file, making your data immediately forensic-friendly and Git-versionable. Perfect for regulated industries requiring GDPR, SOC 2, HIPAA, or PCI-DSS compliance. Sentinel provides async operations with automatic BLAKE3 hashing and optional Ed25519 signatures for cryptographic integrity. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 17
    AppFlowy

    AppFlowy

    Bring projects, wikis, and teams together with AI.

    ...AppFlowy comes with a beautiful rich-text editor that goes beyond just text and bullet points, offering 20+ content types, easy-to-use customized themes, keyboard shortcuts, and color options. It supports real-time team collaboration, enabling you to work with your friends and teammates on the same document in real time, similar to Google Docs. AppFlowy is powered by AppFlowy AI, which is accessible, collaborative, and contextual. Supercharge any type of work in a collaborative team workspace.
    Downloads: 59 This Week
    Last Update:
    See Project
  • 18
    authoscope

    authoscope

    Scriptable network authentication cracker (formerly `badtouch`)

    authoscope is a scriptable network authentication cracker. While the space for common service bruteforce is already very well saturated, you may still end up writing your own python scripts when testing credentials for web applications. The scope of authoscope is specifically cracking custom services. This is done by writing scripts that are loaded into a lua runtime. Those scripts represent a single service and provide a verify(user, password) function that returns either true or false....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    printpdf

    printpdf

    Rust / WASM library for reading, writing and rendering PDF

    ...It includes advanced typography capabilities such as character spacing, scaling, superscript, and subscript, as well as support for Unicode text. printpdf also offers optimization features like font subsetting to reduce file size, making generated PDFs more efficient. Experimental capabilities include rendering PDF pages to SVG and extracting text content, expanding its use cases beyond simple document generation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Mooneye GB

    Mooneye GB

    A Game Boy research project and emulator written in Rust

    ...Some existing emulators are very accurate (Gambatte, BGB >= 1.5) but are not documented very clearly, so they are not that good references for emulator developers. I want this project to document as clearly as possible why certain behavior is emulated in a certain way. This also means writing a lot of test ROMs to figure out corner cases and precise behavior on real hardware. The emulator is lagging behind hardware research. I don't want to spend time making short-lived and probably incorrect fixes to the emulator if I'm not sure about the hardware behavior. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB