Showing 664 open source projects for "python web crawler"

View related business solutions
  • DataHub is the leading open-source data catalog helping teams discover, understand, and govern their data assets. Icon
    DataHub is the leading open-source data catalog helping teams discover, understand, and govern their data assets.

    Modern Data Catalog and Metadata Platform

    Built on an open source foundation with a thriving community of 13,000+ members, DataHub gives you unmatched flexibility to customize and extend without vendor lock-in. DataHub Cloud is a modern metadata platform with REST and GraphQL APIs that optimize performance for complex queries, essential for AI-ready data management and ML lifecycle support.
    Learn More
  • Teradata VantageCloud Enterprise is a data analytics platform for performing advanced analytics on AWS, Azure, and Google Cloud. Icon
    Teradata VantageCloud Enterprise is a data analytics platform for performing advanced analytics on AWS, Azure, and Google Cloud.

    Power faster innovation with Teradata VantageCloud

    VantageCloud is the complete cloud analytics and data platform, delivering harmonized data and Trusted AI for all. Built for performance, flexibility, and openness, VantageCloud enables organizations to unify diverse data sources, run complex analytics, and deploy AI models—all within a single, scalable platform.
    Learn More
  • 1
    kamiFaka

    kamiFaka

    kamiFaka

    Applicable to all kinds of e-commerce, coupons, forum invitation codes, recharge cards, activation codes, registration codes, Tencent iQiyi points CDK, etc., support manual and automatic delivery, and a tiered wholesale model similar to 1688. Stisla UI: The web interface is beautiful Front-end uses VUE3.0, millisecond-level response. Has integrated Alipay face-to-face payment, WeChat official, Payjs, Hupijiao, YunGouOS, Yipay, Mugglepay, V visa-free and other more than a dozen payment...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    speedtest-cli

    speedtest-cli

    Command line interface for testing internet bandwidth using speedtest

    Command line interface for testing internet bandwidth using speedtest.net. It is not a goal of this application to be a reliable latency reporting tool. Latency reported by this tool should not be relied on as a value indicative of ICMP style latency. It is a relative value used for determining the lowest latency server for performing the actual speed test against. Speedtest CLI brings the trusted technology and global server network behind Speedtest to the command line. Measure internet...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    ZED OS

    ZED OS

    ZED OS the clone of Windows XP

    ZED OS is a portuguese Linux based distribution (remaster of Ubuntu Mate). It was designed for users who want an innovative system. With both 3D and 2D interfaces you'll be able to chosse the better one according to your hardware/needs. It's Web Based and inspired in Windows XP from Microsoft
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Zebrunner Community Edition

    Zebrunner Community Edition

    Test Automation Management Tool

    Zebrunner CE (Community Edition) is a Test Automation Management Tool for continuous testing and continuous deployment. It allows you to run various kinds of tests and gain successive levels of confidence in the code quality. Zebrunner CE is integrated by default with Carina open-source TestNG framework and uses Jenkins as a CI Tool. It is built on top of popular docker solutions and includes Postgres database, Zebrunner Reporting, Jenkins Master/Slaves Nodes, Selenium Hub, Mobile...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Eurekos LMS - Build a Smarter Customer Icon
    Eurekos LMS - Build a Smarter Customer

    The Eurekos customer training LMS makes it easy to deliver product training that retains more customers and transforms partners into advocates.

    Eurekos is a purpose-built LMS that engages customers throughout the entire learning journey from pre-sales, to onboarding, and everything after.
    Learn More
  • 5
    Linkchecker for Markdown

    Linkchecker for Markdown

    Python asyncio + aiohttp Markdown *.md URL link checker

    Blazing-fast (10000 Markdown files per second) Python asyncio / aiohttp based simple check of links in Markdown .md files only. This tool is very helpful for large Markdown-based Jekyll and Hugo sites as well as Markdown-based MkDocs documentation projects. It is very fast and simple.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Flask-MongoEngine

    Flask-MongoEngine

    MongoEngine flask extension with WTF model forms support

    Flask-MongoEngine is a Flask extension that provides integration with MongoEngine, WtfForms and FlaskDebugToolbar. By default, Flask-MongoEngine will install integration only between Flask and MongoEngine. Integration with WTFForms and FlaskDebugToolbar are optional and should be selected as extra option, if required. This is done by users request, to limit amount of external dependencies in different production setup environments. All methods end extras described below are compatible...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Hack-Tools

    Hack-Tools

    Hack tools

    hack-tools is a collection of various hacking tools and utilities. It serves as a comprehensive toolkit for penetration testers and cybersecurity enthusiasts, encompassing a wide range of functionalities.​
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    bioweb

    bioweb

    polyglot language framework to analyze genetic data

    polyglot framework using Python/C++/JavaScript to fast develop applications to analyze biological sequences
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    QPyDesk

    QPyDesk

    Code editor and real-time QR code generator for QPython

    QPyDesk is a code editor, and real-time QR code generator for QPython. It is a Python code editor with syntax highlighting that also generates the QR code that represents said code in real time. This application also allows you to print the generated QR code to distribute the created application. However, because QR codes have a limited storage capacity, if the code is very long, QPyDesk creates a QR code that is only valid while the application is running, that is, the QR code generated...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Shoplogix Smart Factory Platform Icon
    Shoplogix Smart Factory Platform

    For manufacturers looking for a powerful Manufacturing Execution solution

    Real-time Visibility into Your Shop Floor's Performance. The Shoplogix smart factory platform enables manufacturers to increase overall equipment effectiveness, reduce operational costs, sustain growth and improve profitability by allowing them to visualize, integrate and act on production and machine performance in real-time. Manufacturers that trust us to drive efficiency in their factories. Real-time visual data and analytics provide valuable insights to make better informed decisions. Uncover hidden shop floor potential and drive rapid time to value. Develop a continuously improving culture through training, education and data-driven decisions. Compete in the i4.0 world by making the Shoplogix Smart Factory Platform the cornerstone of your digital transformation. Connect to any equipment or device to automate data collection and exchange it with other manufacturing technologies. Automatically monitor, report and analyze machine states to track real-time production.
    Learn More
  • 10
    Beamology

    Beamology

    Beamology: Semicom Project Management

    Beamology is the leading field of project management for SuperWikia Alpha semicom codesmiths. Its residency programs offer vital resources for SuperWikia vertical environment development and utilities needed for SA-VEC certification. Beamology residents are automatically licensed to access the Transcom system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    bluetroller

    A library and interface for controlling bluetooth LE devices

    bluetroller is a library and interface for controlling all kinds of bluetooth LE devices. A vast number of devices can be controlled via Bluetooth LE, including fitness trackers, lighting, camera sliders, gimbals and many more. Right now these devices can only be controlled via phone apps which are frequently buggy, unmaintained and will stop working after some future phone update. This project aims to grow to become an exhaustive library of these devices.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    IntrospectDB

    IntrospectDB

    File System Relational / Object Oriented DBMS/CMS

    Introspect is a File System oriented Database Management System And CMS built on UNIX as a Python and HTML webapp. For the latest version use the Bitbucket repository: https://bitbucket.org/mieszkowski/introspect/src/master/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Ansible Examples

    Ansible Examples

    A few starter examples of ansible playbooks, to show features

    This repository collects practical, real-world examples of using Ansible to automate infrastructure, deployments, and configurations. Each directory demonstrates a specific use case—ranging from setting up web servers, load balancers, and databases to orchestrating multi-tier applications in cloud environments. The examples highlight common Ansible practices such as organizing inventories, writing reusable playbooks, using roles, and handling variables and templates. They’re designed to be...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Unified Sessions Manager

    Unified Sessions Manager

    Pioneering Private and Public Cloud Management since 2008

    The UnifiedSessionsManager supports the integrated management of user sessions within Private-Clouds, comprising heterogeneous IT landscapes of various physical and virtual machines, hypervisor management, and virtual user sessions with remote desktops. Extracted documents see https://sourceforge.net/projects/ctys-doc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    nginx ui

    nginx ui

    Nginx UI allows you to access and modify the nginx configurations

    We use nginx in our company lab environment. It often happens that my colleagues have developed an application that is now deployed in our Stage or Prod environment. To make this application accessible nginx has to be adapted. Most of the time my colleagues don't have permission to access the server and change the configuration files and since I don't feel like doing this for everyone anymore I thought a UI could help us all. If you feel the same way I wish you a lot of fun with the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Alfred-Workflow

    Alfred-Workflow

    Full-featured library for writing Alfred 3 & 4 workflows

    Alfred-Workflow is a Python helper library for Alfred 2, 3 and 4 workflow authors, developed and hosted on GitHub. Alfred workflows typically take user input, fetch data from the Web or elsewhere, filter them and display results to the user. Alfred-Workflow takes care of a lot of the details for you, allowing you to concentrate your efforts on your workflow’s functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    GoodByeCatpcha

    GoodByeCatpcha

    Solver ReCaptcha v2 Free

    An async Python library to automate solving ReCAPTCHA v2 by images/audio using Mozilla's DeepSpeech, PocketSphinx, Microsoft Azure’s, Google Speech and Amazon's Transcribe Speech-to-Text API. Also image recognition to detect the object suggested in the captcha. Built with Pyppeteer for Chrome automation framework and similarities to Puppeteer, PyDub for easily converting MP3 files into WAV, aiohttp for async minimalistic web-server, and Python’s built-in AsyncIO for convenience.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Zappa

    Zappa

    Serverless Python

    Zappa makes it super easy to build and deploy serverless, event-driven Python applications (including, but not limited to, WSGI web apps) on AWS Lambda + API Gateway. Think of it as "serverless" web hosting for your Python apps. That means infinite scaling, zero downtime, zero maintenance - and at a fraction of the cost of your current deployments! With a traditional HTTP server, the server is online 24/7, processing requests one by one as they come in. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    FuzzyWuzzy

    FuzzyWuzzy

    Fuzzy string matching in Python

    ...One of our most consistently frustrating issues is trying to figure out whether two ticket listings are for the same real-life event (that is, without enlisting the help of our army of interns). To pick an example completely at random, Cirque du Soleil has a show running in New York called “Zarkana”. When we scour the web to find tickets for sale, mostly those tickets are identified by a title, date, time, and venue. We’ve built up a library of “fuzzy” string matching routines to help us along. And good news! We’re open sourcing it. The library is called “Fuzzywuzzy”, the code is pure python, and it depends only on the (excellent) difflib python library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    MozOS

    MozOS

    Complete Edition

    Universal ThinKing Machine
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    FirePlus

    FirePlus

    Complete Edition

    Universal ThinKing Machine
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    GUIDOLib
    The GUIDOLib provides a powerful engine for the graphic rendering of music scores, based on the Guido Music Notation format. It supports Linux, Mac OS X, Windows, Android and iOS operating systems. A Java JNI interface is available as well as a Javascript version of the library. A Web API has also been designed, allowing to deploy the engine as a Web service.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Flask-GraphQL

    Flask-GraphQL

    Adds GraphQL support to your Flask application

    Adds GraphQL support to your Flask application. This will add /graphql endpoint to your app and enable the GraphiQL IDE. If you are using the Schema type of Graphene library, be sure to use the graphql_schema attribute to pass as schema on the GraphQLView view. Otherwise, the GraphQLSchema from graphql-core is the way to go. The GraphQLSchema object that you want the view to execute when it gets a valid request. A value to pass as the context_value to graphql execute function. By default is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    TestRest

    TestRest is a fully QA Management Tool

    TestRest is Test Management offers test case authoring, reusable test cases, test execution and reporting. TestRest supports statistic and graph reports with simple modern UI interfaces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    WebExKit

    An HTML/CSS/JavaScript editor with preview window

    The Web Experimentation Kit allows you to enter HTML, CSS and JavaScript and see the results immediately in a browser frame side-by-side with the editor. If you've seen the W3Schools Tryit Editor, JSFiddle or CodePen then this should be familiar to you. The difference between WebExKit and these other applications is that WebExKit is a stand-alone application that runs on your desktop and it allows you to save (and reload) files to your own disk drive. The editor shows a properly formed...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB