Showing 15 open source projects for "web crawler source code"

  • 1
    wombat

    Lightweight Ruby DSL for scraping structured data from web pages

    Wombat is a lightweight web crawling and scraping library written in Ruby that focuses on extracting structured data from web pages using a concise domain-specific language (DSL). It is designed to simplify the process of defining how information should be collected from HTML documents without requiring large amounts of scraping boilerplate code. Developers can declare the data fields they want and specify selectors or rules for retrieving them, allowing Wombat to parse and return structured...
    Downloads: 5 This Week
  • 2
    ciao

    HTTP checks & tests (private & public) monitoring

    HTTP checks & tests (private & public) monitoring - check the status of your URL. ciao checks HTTP(S) URL endpoints for an HTTP status code (or errors on the lower TCP stack) and sends a notification on status change via email or webhooks. It uses cron syntax to schedule the checks and comes with a web UI and a RESTful JSON API. The goal is an open-source web application for checking URL statuses, with a UI and a REST API, that is easy to install and maintain (no external dependencies such as databases or caches) in public and private environments.
    Downloads: 8 This Week
  • 3
    kimuraframework

    AI-first Ruby framework for building fast, flexible web scraping spiders

    Kimurai is an open source web scraping framework written in Ruby that simplifies the process of building automated data extraction tools. It provides a clean domain-specific language that allows developers to define scraping logic and data schemas with minimal boilerplate code. Kimurai can use AI-assisted extraction to identify where data resides in HTML pages, automatically generating selectors that are cached for future use so subsequent scraping runs operate with pure Ruby performance. ...
    Downloads: 1 This Week
  • 4
    wappskafander_t2

    Wraps an HTTP server that can optionally run PHP via FastCGI.

    wappskafander_t2 wraps an old version of a Hiawatha web server (hiawatha-webserver.org). If FastCGI and PHP are available, then the web server probably can execute PHP. As of 2022_11 this "branch" of the wappskafander_t2 will not be incrementally updated, because it, the wrapping code, NOT the wrapped web server, is an old mess that needs a total rewrite. The current version of the wrapping code is usable as a functional "blob" that serves HTTP and optionally PHP generated content from...
    Downloads: 0 This Week
  • 5
    opsworks-cookbooks

    Chef Cookbooks for the AWS OpsWorks Service

    The AWS OpsWorks Cookbooks repository provides the built-in Chef cookbooks used by the now-deprecated AWS OpsWorks Stacks service, a configuration-management and application-deployment service that used Chef (and optionally Puppet) to manage fleets of EC2 instances. These cookbooks implemented standard tasks such as package installation, web server configuration, database setup, Ruby/PHP/Java application deployment, and integration with AWS stack lifecycle events (setup, configure, deploy,...
    Downloads: 0 This Week
  • 6
    Meta Tag Generator. Lets you research SEO keywords, generate compliant meta tags, and output them to an HTML or text file for insertion into a finished web project.
    Downloads: 0 This Week
  • 7
    rails-ajax

    Add Ajax capabilities to Rails websites with history, bookmarking...

    Add Ajax capabilities to Rails websites, with minimal code changes. Supports history, bookmarking, partial refreshes, Rails flashes, user callbacks, scripts execution, redirections. Built upon Rails-UJS and jQuery.
    Downloads: 0 This Week
  • 8
    Zoozle Search & Download Suchmaschine

    Zoozle 2008 - 2010 Webpage, Tools and SQL Files

    Download search engine and directory with Rapidshare and Torrent (zoozle Download Suchmaschine, German for "download search engine"). All the files that ran the world-leading German download search engine in 2010, with 500,000 unique visitors a day: all the tools you need to set up a clone. Code contains:
    - PHP files for zoozle
    - Perl crawler for gathering new content to the database, and all other cool tools I have...
    Downloads: 0 This Week
  • 9
    dotjs

    dotjs is a Google Chrome extension that executes JavaScript files

    ...It works by mapping JavaScript files to domains, so when a user visits a site, the corresponding script is executed seamlessly in the background. This approach enables users to personalize and enhance website behavior without modifying the original source code or relying on heavy user script managers. The tool is particularly useful for developers and power users who want to tweak interfaces, automate repetitive actions, or inject additional functionality into web pages. dotjs emphasizes simplicity by using a file-based structure where each script corresponds directly to a domain, making it easy to manage and organize customizations. ...
    Downloads: 0 This Week
  • 10
    A system to allow papers, workshops, and other documents to be submitted for review and approval. Authors submit material, reviewers write reviews for it, and all of the reviews are compiled and presented to a review committee for approval.
    Downloads: 0 This Week
  • 11
    A complete article management solution written in Ruby on Rails. Compact code, rich features, and flexibility are the main goals.
    Downloads: 0 This Week
  • 12
    Ozcode is the source code behind Ozmozr.com, a microformat-aware RSS aggregator, social networking, resource sharing, identity aggregation and presentation site.
    Downloads: 0 This Week
  • 13
    FGL is a tightly-integrated self-contained development & execution environment utilizing best-of-breed programming tools and methodologies, optimized web/application server, highly-scalable relational/object database, and robust extension interface.
    Downloads: 0 This Week
  • 14
    The Registry Server (jUDDI) offers a mechanism for humans or software applications to advertise and discover Web services. The Java Web Services Developer Pack (Java WSDP) Registry Server implements Version 2 of the Universal Description, Discovery and I
    Downloads: 0 This Week
  • 15
    Templating ala (mod_)Ruby. ruby-tmpl is a templating system that can be used for online and offline processing of documents and content.
    Downloads: 0 This Week