Showing 35 open source projects for "document search engine"

View related business solutions
  • The Most Powerful Software Platform for EHSQ and ESG Management Icon
    The Most Powerful Software Platform for EHSQ and ESG Management

    Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.

    Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
    Learn More
  • Agentic AI SRE built for Engineering and DevOps teams. Icon
    Agentic AI SRE built for Engineering and DevOps teams.

    No More Time Lost to Troubleshooting

    NeuBird AI's agentic AI SRE delivers autonomous incident resolution, helping team cut MTTR up to 90% and reclaim engineering hours lost to troubleshooting.
    Learn More
  • 1
    Text Search Engine

    Text Search Engine

    A text search engine that supports mixed Chinese and English search

    Text-Search-Engine is a JavaScript-based lightweight search engine that enables full-text search functionality. It allows developers to implement fast search indexing and retrieval in web applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Search with Lepton

    Search with Lepton

    Lightweight demo to build a conversational AI search engine quickly

    Search with Lepton is an open source demonstration project that shows how to build a conversational search engine using the Lepton AI framework. It combines traditional web search with large language models to provide natural language answers to user queries. It retrieves information from supported search engines and uses that context to generate responses through a retrieval-augmented generation approach.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    WeKnora

    WeKnora

    LLM framework for document understanding and semantic retrieval

    ...This approach enables the system to provide more reliable answers by grounding model reasoning in the content of uploaded documents. WeKnora is designed with a modular architecture that separates components for document processing, search strategies, and model inference, allowing developers to customize or extend different parts of the pipeline. It supports knowledge base management and conversational question answering built on top of structured and unstructured documents.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    Perplexica

    Perplexica

    Perplexica is an AI-powered answering engine.

    Perplexica is a privacy-focused AI answering engine like Perplexity that you can self-host on your own hardware for private, source-cited web research. It combines live internet search results with AI models, letting you use local LLMs via Ollama or connect to providers like OpenAI, Claude, Gemini, and Groq. Powered by SearxNG, it aggregates results from multiple search engines while keeping your identity and queries private.
    Downloads: 27 This Week
    Last Update:
    See Project
  • Award-Winning Medical Office Software Designed for Your Specialty Icon
    Award-Winning Medical Office Software Designed for Your Specialty

    Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

    RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.
    Learn More
  • 5
    QMD

    QMD

    mini cli search engine for your docs, knowledge bases, etc.

    QMD is a powerful and lightweight command-line tool that acts as an on-device search engine for your personal knowledge base, allowing you to index and search files like Markdown notes, meeting transcripts, technical documentation, and other text collections without depending on cloud services. Designed to keep all search activity local, it combines classic full-text search techniques with modern semantic features such as vector similarity and hybrid ranking so that queries return not just literal matches but conceptually relevant results. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 6
    MiniSearch

    MiniSearch

    Minimalist web-searching platform with an AI assistant

    ...The platform supports browser-level integration so users can set it as their default search engine for quick access. Its architecture emphasizes privacy by avoiding tracking and minimizing data collection while still enabling advanced AI features. Overall, MiniSearch targets users who want a self-hosted, Perplexity-style search experience with strong control over data and models.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Farfalle

    Farfalle

    AI search engine - self-host with local or cloud LLMs

    Farfalle is an open-source AI-powered search engine designed to provide an answer-centric search experience similar to modern conversational search systems. The project integrates large language models with multiple search APIs so that the system can gather information from external sources and synthesize responses into concise answers. It can run either with local language models or with cloud-based providers, allowing developers to deploy it privately or integrate with hosted AI services. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    TagSpaces

    TagSpaces

    TagSpaces is an offline, open source, document manager with tagging

    TagSpaces is a free, no vendor lock-in, open source application for organizing, annotating and managing local files with the help of tags. It features advanced note taking functionalities and some capabilities of to-do apps. The application is available for Windows, Linux, Mac OS and Android. We provide a web clipper extension for Firefox, Edge and Chrome for easy collecting of online content in the form of local files. File and folder management - TagSpaces provides a convenient user...
    Downloads: 34 This Week
    Last Update:
    See Project
  • 9
    MemFree

    MemFree

    Hybrid AI Search Engine & AI Page Generator

    memfree is an open source hybrid AI search engine and page generation platform designed to help users retrieve information from both personal knowledge bases and the public web through a unified interface. The project combines retrieval-augmented search with AI summarization to deliver concise answers instead of forcing users to manually sift through multiple sources. It supports multiple AI models and search providers, enabling flexible configuration depending on cost, performance, or privacy requirements. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Premier Construction Software Icon
    Premier Construction Software

    Premier is a global leader in financial construction ERP software.

    Rated #1 Construction Accounting Software by Forbes Advisor in 2022 & 2023. Our modern SAAS solution is designed to meet the needs of General Contractors, Developers/Owners, Homebuilders & Specialty Contractors.
    Learn More
  • 10
    Mantic.sh

    Mantic.sh

    A structural code search engine for Al agents

    Mantic.sh is a context-aware, structural code search engine designed specifically for use with AI coding agents and developers who need deep, semantically relevant search across large codebases. Unlike traditional text-based search tools that mainly match keywords, Mantic.sh understands code structure and meaning by combining syntactic heuristics with neural semantic reranking to produce results that reflect conceptual relevance, which helps find functions, definitions, and patterns that literal search might miss. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    KnowNote

    KnowNote

    A local-first AI knowledge base & NotebookLM alternative

    KnowNote is a local-first, open-source AI knowledge base and notebook application created as an Electron-based alternative to Google NotebookLM that emphasizes privacy, control, and simplicity. It lets users build an intelligent, searchable knowledge base from uploaded documents such as PDFs, Word files, PowerPoints, and web pages, and then interact with that content using LLM-powered chat, summarization, and reasoning tools. Unlike many NotebookLM alternatives that rely on Docker or cloud...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 12
    PandaWiki

    PandaWiki

    AI-powered open source platform for building intelligent wiki bases

    PandaWiki is an open source knowledge base system designed to help users build intelligent documentation platforms powered by large language models. It combines traditional wiki functionality with modern AI capabilities, allowing teams and individuals to create and manage product documentation, technical manuals, FAQs, and blog-style knowledge resources. PandaWiki provides tools for managing knowledge bases through an administrative interface while also generating public-facing wiki sites...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13
    Papra

    Papra

    The minimalistic document archiving platform

    Papra is a minimalist document management and archiving platform created to help individuals and teams store, organize, and retrieve digital documents with simplicity and accessibility at its core. Papra provides basic yet essential capabilities like uploading files, managing archives, creating organizations for shared access, and performing full-text searches, all within a responsive and user-friendly interface that works across devices. The project’s focus on long-term storage and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Supermemory

    Supermemory

    Memory engine and app that is extremely fast, scalable

    Supermemory is an ambitious and extensible AI-powered personal knowledge management system that aims to help users capture, organize, retrieve, and reason over information in a manner that mimics human memory structures. The platform allows individuals to ingest text, documents, and other content forms, then uses advanced retrieval and embedding techniques to index and relate information intelligently so that users can recall relevant knowledge in context rather than just by keyword match....
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Inkdown

    Inkdown

    A WYSIWYG Markdown editor, improve reading and editing experience

    Inkdown (bluestone) is a Markdown reading, editing, and sharing tool. Almost fully compatible with the GitHub Flavored Markdown standard, while extending the Mermaid graphics and Katex formula, supporting light and dark styles, and somewhat different from other WYSIWYG editors, Inkdown does not pursue complete customization. Its core goal is comfortable reading, smooth editing of Markdown, and document sharing in the simplest way possible. As a document publisher, markdown source code mode...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 16
    Scira

    Scira

    AI-powered search engine that helps you find information

    Scira is an open source AI-powered search and research assistant designed to provide fast, conversational answers grounded in web and knowledge sources. The project combines a modern web interface with retrieval-augmented generation techniques to deliver responses that are both natural language friendly and evidence oriented. It is built for developers who want to deploy their own Perplexity-style or AI search experience without relying on proprietary hosted services. Scira emphasizes speed,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    compodoc

    compodoc

    The missing documentation tool for your Angular, Nest & Stencil apps

    ...No server needed, no sources uploaded online. 7 themes are available from famous documentation tools like Gitbook, Read the Docs or projects like Vagrant, Laravel, Postmark and Stripe. Compodoc includes a powerful search engine (lunr.js) for easily finding your information. The generated pages display very well on mobile, tablet and desktop. Compodoc is a documentation tool for Angular applications. It generates static documentation of your application. Compodoc helps Angular developers provide clear and helpful documentation of their application.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    Desktop Commander MCP

    Desktop Commander MCP

    AI-powered MCP server for desktop file and terminal automation

    ...It integrates with clients like Claude Desktop to enable AI-driven workflows such as editing files, executing commands, and automating development tasks from a single conversational interface. Desktop Commander MCP builds on top of an MCP filesystem server and enhances it with powerful search, replace, and code editing capabilities tailored for real-world development environments. It allows users to run terminal commands with streaming output, manage long-running processes, and even execute code in memory without saving files. It also supports working with structured and document formats such as Excel, PDF, and DOCX, enabling AI to read, modify, and generate these files directly.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    RAG Web UI

    RAG Web UI

    RAG Web UI is an intelligent dialogue system based on RAG

    RAG Web UI is an open-source intelligent dialogue system built on retrieval-augmented generation technology, designed to enable users to create AI-powered question answering systems grounded in their own knowledge bases. It combines document retrieval with large language models to provide accurate, context-aware responses based on indexed data rather than generic model knowledge. The platform supports ingestion of multiple document formats, including PDFs, Word files, Markdown, and plain...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Sync Server

    Sync Server

    Secure, open-source platform for file storage, sharing, collaboration

    ...It provides a sleek web interface where teams or individuals can upload, organize, and share files with fine-grained access permissions, and its security-minded design includes things like multi-factor authentication and role-based controls to help protect sensitive documents. Sync-in supports real-time collaboration through integrations with office editors and activity tracking, and it enhances productivity with deep content search across a variety of file types and comprehensive document management capabilities. The platform is built with TypeScript and Node.js and is suitable for self-hosting on your own infrastructure using Docker or standard Node deployments.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    HeadlessX

    HeadlessX

    The undetected self-hosted browser automation platform

    ...The system provides a centralized service that allows developers to programmatically control browser sessions and extract data from websites through a structured API. It is built using modern technologies including Node.js, Next.js, TypeScript, and Playwright, and uses a specialized browser engine called Camoufox based on Firefox. One of the platform’s goals is to bypass common bot-detection systems by implementing advanced fingerprint spoofing and stealth techniques. The tool can perform tasks such as HTML extraction, screenshot generation, content parsing, and search result scraping while appearing like a normal user browser. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 22
    Project NOMAD

    Project NOMAD

    Self-contained, offline survival computer with tools, knowledge, & AI

    Project N.O.M.A.D. (Node for Offline Media, Archives, and Data) is a self-contained, offline-first knowledge and education server designed to run on local hardware. It combines AI, educational resources, and critical information tools into a single platform accessible through a web browser. Built as a containerized system managed via Docker, it orchestrates multiple applications through a centralized “Command Center” interface. NOMAD enables users to access offline Wikipedia, AI chat, maps,...
    Downloads: 54 This Week
    Last Update:
    See Project
  • 23
    Your Spotify

    Your Spotify

    Self hosted Spotify tracking dashboard

    ...It’s designed to give people control over their own content by eliminating reliance on third-party music services, instead letting the owner manage metadata, cover art, playlists, and rich browsing features. The project supports resuming playback across devices, custom playlists, and a search engine that speeds up locating songs, albums, and artists in large libraries. It also includes integrations with popular tools like Last.fm for scrobbling and uses modern frontend frameworks to deliver a smooth, app-like experience in browsers. For users focused on privacy and control, it offers self-hosting via Docker and clear documentation that helps you get a scalable instance running on a server or home host.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 24
    Basic Memory

    Basic Memory

    Persistent AI memory using local Markdown knowledge graphs

    Basic Memory is an open source knowledge system that turns AI conversations into persistent, structured knowledge you control. Instead of losing context after each chat, it stores information as simple Markdown files on your device, allowing both you and AI to read and write to the same knowledge base. It uses the Model Context Protocol (MCP) so compatible AI tools can access, update, and build on your notes across sessions. Basic Memory creates a semantic knowledge graph by linking related...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 25
    5ire

    5ire

    5ire is a cross-platform desktop AI assistant, MCP client

    5ire is a sleek, cross‑platform desktop AI assistant and MCP client that connects to major service providers, supports a local knowledge base and tool integration via MCP servers, enabling robust RAG and assistant features. These components are required as they constitute the runtime environment for the MCP Server. If you don't anticipate using the tools feature immediately, you may choose to skip this installation step and complete it later when the need arises. MCP is an open protocol that...
    Downloads: 18 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB