Showing 15 open source projects for "web crawler source code"

View related business solutions
  • All-in-One Inspection Software Icon
    All-in-One Inspection Software

    flowdit is a connected worker platform tailored for industry needs in commissioning, quality, maintenance, and EHS management.

    Optimize Frontline Operations: Elevate Equipment Uptime, Operational Excellence, and Safety with Connected Teams and Data, Including Issue Capture and Corrective Action.
    Learn More
  • Effortlessly manage macOS, iOS, iPadOS and tvOS devices across multiple clients and locations. Icon
    Effortlessly manage macOS, iOS, iPadOS and tvOS devices across multiple clients and locations.

    The Most Powerful Apple Device Management Tool for MSPs and IT Teams

    Addigy solutions accelerate Apple adoption in any environment.
    Learn More
  • 1
    Career-Ops

    Career-Ops

    AI-powered job search system built on Claude Code

    Career Ops is an open-source platform designed to help individuals manage their job search process with a structured, operations-style approach that treats career development like a pipeline. It provides a system for organizing job applications, tracking progress across different stages, and maintaining visibility into opportunities, much like a lightweight CRM tailored for job seekers. The project emphasizes clarity and accountability, enabling users to monitor applications, follow-ups, and...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 2
    Zoekt

    Zoekt

    Fast trigram based code search

    zoekt is a fast, scalable code search engine written in Go that builds compressed indexes (shards) for source repositories and serves low-latency regex and substring queries. It optimizes for developer workflows: refresh indexes from Git, search across many repos instantly, and jump into hits with surrounding context and file navigation. The indexer uses efficient n-gram dictionaries and posting lists to keep query latency low, while storing metadata (branch, language, repo) for precise filtering. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Bookmark manager

    Bookmark manager

    Bookmark manager web application

    Web Page Application: https://shemeshg.github.io/desktop-search/ Github: https://github.com/shemeshg/desktop-search-code * Dropbox sync is performed manually (from admin screen) and not on interval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    magnetW

    magnetW

    Magnet link aggregation search

    ...This application is open source and free, and is only used for crawler technology exchange and learning. The search results are all from the source site, and no responsibility is assumed. The project complies with GNU General Public License v3.0. Online playback is performed in conjunction with the webtorrent desktop version. It needs to be downloaded separately.
    Downloads: 1 This Week
    Last Update:
    See Project
  • DataHub is the leading open-source data catalog helping teams discover, understand, and govern their data assets. Icon
    DataHub is the leading open-source data catalog helping teams discover, understand, and govern their data assets.

    Modern Data Catalog and Metadata Platform

    Built on an open source foundation with a thriving community of 13,000+ members, DataHub gives you unmatched flexibility to customize and extend without vendor lock-in. DataHub Cloud is a modern metadata platform with REST and GraphQL APIs that optimize performance for complex queries, essential for AI-ready data management and ML lifecycle support.
    Learn More
  • 5
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 6
    recoll-firefox-extension

    recoll-firefox-extension

    Index the web pages you visit with the Recoll text search tool

    Please note that this extension is superceded by a new one based on the WebExtensions API for Firefox 57 and later: https://addons.mozilla.org/en-US/firefox/addon/recoll-we/ The source for the new version is here: https://opensourceprojects.eu/p/recollwe/code/ This Firefox extension allows you to include the web pages that you visit in Firefox in the index built by the Recoll text search tool. The extension has been reviewed and should now be directly installed from the Mozilla addons catalog: https://addons.mozilla.org/en-US/firefox/addon/recoll-indexer-1/ The package on the Mozilla site is the up-to-date version of this project, and replaces the sourceforge downloads, which are kept only as history. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    MARKOS Project

    MARKOS Project

    The MARKet for Open Source

    MARKOS developed a prototype of a service and an interactive application providing an integrated view on the Open Source projects available the on web, focusing on functional, structural and licenses aspects of software code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    SeerSuite
    SeerSuite is an application toolkit for digital libraries and search engines; i.e., CiteSeerX. CiteSeerX has moved to GitHub, please get the latest code from: https://github.com/SeerLabs/CiteSeerX
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Development and support of OCFA have been discontinued. the code has moved to these github repositories: https://github.com/DNPA/OcfaLib https://github.com/DNPA/OcfaArch https://github.com/DNPA/OcfaJavaLib https://github.com/DNPA/OcfaModules https://github.com/DNPA/OcfaDoc If you are interested in contributing to ongoing work on the creation of a community maintained OCFA inspired computer forensic framework, please join the Mattock/MattockFS community page on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go beyond a virtual data room with Datasite Diligence Icon
    Go beyond a virtual data room with Datasite Diligence

    Datasite Diligence, helps dealmakers in more than 170 countries close more deals, faster.

    The data room with a view. Evolved for next-generation M&A. Built on decades of deal experience. Packed with expert tools, yet intuitive for novices. A fully mobile platform with frictionless processes. Smart AI tools that let you close more deals, faster, plus end-to-end support at all times. Do due diligence with intelligence.
    Learn More
  • 10
    Agent based Regional Crawler strategy implementation - gathers users' common needs and interests in a certain domain. It crawls based on these interests, instead of crawling the web without any predefined order.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    UProm for Agile Development
    UProm Requirement Management System (traceability and requirement coverage). Includes : - An admin module to manage SCRUM Project and tree of requirements - A multi-user Web-based client - Tools for databases management (HSQLDB, PostGreSQL,...) UProm
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Combine is an open system for crawling Internet resources. It can be used both as a general and focused crawler. If you want to download Web-pages pertaining to a particular topic (like 'Carnivorous Plants') Then Combine is the system for you!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Universal information crawler is a fast precise and reliable Internet crawler. Uicrawler is a program/automated script which browses the World Wide Web in a methodical, automated manner and creates the index of documents that it accesses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    PHP Security Scanner is a tool written in PHP intended to search PHP code for vulnarabilities. MySQL DB stores patterns to search for as well as the results from the search. The tool can scan any directory on the file system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A configurable knowledge management framework. It works out of the box, but it's meant mainly as a framework to build complex information retrieval and analysis systems. The 3 major components: Crawler, Analyzer and Indexer can also be used separately.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB