Showing 110 open source projects for "python web crawler"

  • 1
    WebMagic

    A scalable web crawler framework for Java

    WebMagic is a simple but scalable crawler framework. It covers the whole crawler lifecycle: downloading, URL management, content extraction, and persistence. It simplifies the development of a specific crawler. WebMagic has a simple core with high flexibility and a simple API for HTML extraction. It also provides annotations with POJOs to customize a crawler, with no configuration needed. Some other...
    Downloads: 7 This Week
  • 2
    Heritrix

    Internet Archive's open-source, web-scale, web crawler project

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled or missaid as heratrix/heritix/heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.
    Downloads: 8 This Week
  • 3
    Odigos

    Distributed tracing without code changes

    ...Odigos automatically scales OpenTelemetry collectors based on observability data volume. Manage and configure collectors via a convenient web UI. Installing Odigos takes less than 5 minutes, and requires no code changes.
    Downloads: 36 This Week
  • 4
    RStudio

    RStudio is an integrated development environment (IDE) for R

    RStudio is a powerful, full-featured integrated development environment (IDE) tailored primarily for the R programming language but increasingly supportive of other languages like Python and Julia. It brings together console, editor, plotting, workspace, history, and file-management panes into a unified interface, helping data scientists, statisticians, and analysts to work more productively. The IDE is cross-platform: there are desktop versions for Windows, macOS and Linux, as well as a server version for remote or multi-user deployment via a web browser. ...
    Downloads: 27 This Week
  • 5
    Siddhi Core Libraries

    Stream Processing and Complex Event Processing Engine

    Fully open source, cloud-native, scalable, micro streaming, and complex event processing system capable of building event-driven applications for use cases such as real-time analytics, data integration, notification management, and adaptive decision-making. Event processing logic can be written using Streaming SQL queries via graphical and source editors, to capture events from diverse data sources, process and analyze them, integrate with multiple services and data stores, and publish...
    Downloads: 3 This Week
  • 6
    QR Code generator library

    High-quality QR Code generator library in Java, TypeScript/JavaScript

    This project aims to be the best, clearest library for generating QR Codes. My primary goals are flexible options and absolute correctness. The secondary goals are compact implementation size and good documentation comments. This work is an independent implementation based on reading the official ISO specification documents. I believe that my library has a more intuitive API and shorter code length than competing libraries out there. The library is designed first in Java and then ported to...
    Downloads: 17 This Week
  • 7
    Framework Benchmarks

    Source for the TechEmpower Framework Benchmarks project

    ...Please feel free to ask questions here. We encourage new frameworks and contributors to ask questions. We're here to help! This project provides representative performance measures across a wide field of web application frameworks. With much help from the community, coverage is quite broad and we are happy to broaden it further with contributions. The project presently includes frameworks in many languages including Go, Python, Java, Ruby, PHP, C#, F#, Clojure, Groovy, Dart, JavaScript, Erlang, Haskell, Scala, Perl, Lua, C, and others. ...
    Downloads: 0 This Week
  • 8
    ZK - Simply Ajax and Mobile
    ZK is an open-source Java framework for building modern web and mobile applications. It enables developers to create rich, interactive UIs using only Java — no JavaScript required. With 200+ Ajax-powered components, event-driven architecture, and support for popular technologies like Spring, Java EE, and JSP/JSF, ZK makes it simple to deliver powerful and user-friendly web applications.
    Downloads: 13 This Week
  • 9
    EulerSharp

    Euler Yet another proof Engine

    EYE [1] is a reasoning engine supporting the Semantic Web layers [2]. It performs controlled chaining and it supports Euler paths [3]. Via N3 [4] it is interoperable with Cwm [5]. [1] http://eulersharp.sourceforge.net/README [2] http://www.w3.org/DesignIssues/diagrams/sweb-stack/2006a [3] http://mathworld.wolfram.com/KoenigsbergBridgeProblem.html [4] http://www.w3.org/TeamSubmission/n3/ [5] http://www.w3.org/2000/10/swap/doc/cwm
    Downloads: 0 This Week
  • 10
    lixa

    LIXA, LIbre XA, is a free and open source XA transaction manager

    ...LIXA is a Transaction Manager but it's not a Transaction Monitor: this is the distinguishing feature of the project. LIXA technology enables every application container, like a web server or a shell, to become a two-phase commit application server. The client/server architecture of LIXA allows many application containers to share a single LIXA (state) server: this is ideal when horizontal scalability is a must and many identical application containers must refer to a single transactional environment. LIXA can be used with the C, C++, Java, Python and COBOL programming languages.
    Downloads: 4 This Week
  • 11
    NanoH5 (tsl2nano)

    Java bean / database-driven zero-code application framework

    NanoH5 (or FullRelation) is a full-stack UI implementation framework providing a model-driven design (MDA). Build a complete HTML5 application from a given class or database model without coding (coding APIs are available).
    Downloads: 15 This Week
  • 12
    eCxx

    A C++ library for AVR and NodeMCU

    NOTE: This project is marked with 'Status: Abandoned' on SourceForge because not enough time can be dedicated to this project. However, it may still get sporadic commits to the repository. eCxx is a library for AVR and NodeMCU tailored for micro LED displays and lighting effects. eCxx uses a Makefile build system. Java- and Python-based applications/tools are also included to ease the development and debugging process using the host PC. On one side, eCxx supports the original...
    Downloads: 1 This Week
  • 13
    The OMLX project is a place for preparing many projects to become open source projects.
    Downloads: 227 This Week
  • 14
    Zebrunner Community Edition

    Test Automation Management Tool

    Zebrunner CE (Community Edition) is a Test Automation Management Tool for continuous testing and continuous deployment. It allows you to run various kinds of tests and gain successive levels of confidence in the code quality. Zebrunner CE is integrated by default with Carina open-source TestNG framework and uses Jenkins as a CI Tool. It is built on top of popular docker solutions and includes Postgres database, Zebrunner Reporting, Jenkins Master/Slaves Nodes, Selenium Hub, Mobile...
    Downloads: 0 This Week
  • 15
    GUIDOLib
    The GUIDOLib provides a powerful engine for the graphic rendering of music scores, based on the Guido Music Notation format. It supports Linux, Mac OS X, Windows, Android and iOS operating systems. A Java JNI interface is available as well as a JavaScript version of the library. A Web API has also been designed, allowing the engine to be deployed as a Web service.
    Downloads: 2 This Week
  • 16

    Easy Web automation library

    This library is designed to work with Selenium for web automation. It incorporates functions from Selenium and handles its exceptions. It uses the Selenium library for web interfaces.
    Downloads: 0 This Week
  • 17
    Mapbox Maps SDK for React Native

    A Mapbox GL react native module for creating custom maps

    Mapbox is the location data platform for mobile and web applications. We provide building blocks to add location features like maps, search, and navigation into any experience you create. Use our simple and powerful APIs & SDKs and our open-source libraries for interactivity and control. Once you’re signed in, all you need to start building is a Mapbox access token. Use this same short code with all of our interactive mapping libraries, Python and JavaScript SDKs, and directly against our REST APIs. ...
    Downloads: 0 This Week
  • 18
    phoneutria
    A Java web crawler: multi-threaded, scalable, high-performance, extensible and polite. It can be used to crawl and index any web or enterprise domain and is configurable through an XML configuration file.
    Downloads: 0 This Week
  • 19
    SIGAR (System Information Gatherer and Reporter) is a cross-platform, cross-language library and command-line tool for accessing operating system and hardware level information in Java, Perl and .NET.
    Downloads: 30 This Week
  • 20

    Platform Course

    Open source framework for creating business web applications

    Platform Course 5.0.0 is an open source framework for easy development of original solutions for unique business processes. Main advantages: cross-domain auth, LDAP integration, cross-browser GWT-based UI, big table handling, charts and geo maps, input forms on XForms. Tested with MSSQL, PostgreSQL and Oracle. The current SVN repository for this product is located at https://share.curs.ru/svn/showcase/branches/stable/ Login: reader Password: reader If you wish to know further...
    Downloads: 0 This Week
  • 21
    COAR-DMS

    DMS for Linux: C++ library, server, web UI, SOAP

    COAR-DMS is a document management system for 32/64-bit Linux. It acts as a library, server, and tools. Library features: - storage management, free pages recycling - transaction log - indexing: full text, tags, metadata, document attributes - inverted index - versioning, collaboration - document trees, tree versioning - folders - plugins for auth (PAM, LDAP), db, file type plugins - tags - metadata (key value pairs) - object level security, folders documents ACL, - unix...
    Downloads: 0 This Week
  • 22
    Java-based framework for decoupling back-end services and front-end interfaces. Browse and interact with a database, a class library, a network, a log file, or any live Java object as though it were a filesystem. (It works with filesystems too!)
    Downloads: 0 This Week
  • 23
    OVNI

    Open Virtualization Nodes Infrastructure

    OVNI is, first of all, an AJAX web application to create and manage virtual machines on KVM nodes. It is developed with WaveMaker and relies on Libvirt to be compatible with other tools such as virsh. In the future, the project aims to provide a complete virtualization environment.
    Downloads: 0 This Week
  • 24
    DoCookBook

    Cookbook Style Document for DocBook Customizations

    This project has been moved to GitHub: https://github.com/tomschr/dbcookbook/ The DoCookBook project aims to create an open source book about DocBook and the DocBook XSL stylesheets written as a cookbook and released under a Creative Commons license.
    Downloads: 0 This Week
  • 25
    The goal of this project is to make it possible to access a Progress database from any external program that can use sockets. The server components (broker and agents) are written in Progress 4GL and use the socket capabilities of Progress V9.
    Downloads: 0 This Week