Showing 154 open source projects for "python web crawler"

View related business solutions
  • Job Evaluation and Talent Management Software Icon
    Job Evaluation and Talent Management Software

    For human resources departments in search of a tool to manage time, expenses, leave, documents, recruitment, and onboarding

    Encompassing Visions (ENCV), industry-leading job evaluation and pay equity software, is the best choice for organizations requiring transparent, comprehensive, and objective Job Evaluation software designed to help them ensure equal pay for work of equal value.
    Learn More
  • Boon: The Agile Referral Hiring Platform Icon
    Boon: The Agile Referral Hiring Platform

    Tap your entire community to hire better talent, faster

    Boon's agile referral platform expands your recruiting power 
through AI, automation, integrations, and gamification.
    Learn More
  • 1
    pywebview

    pywebview

    Build GUI for your Python program with JavaScript, HTML, and CSS

    ...If you choose to freeze your application, pywebview does not bundle a heavy GUI toolkit or web renderer with it keeping the executable size small. pywebview is compatible with Python 3.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    AWS Lambda Python Runtime Interf Client

    AWS Lambda Python Runtime Interf Client

    Seamlessly extend your preferred base images to be Lambda compatible

    ...The Lambda Runtime Interface Client is a lightweight interface that allows your runtime to receive requests from and send requests to the Lambda service. The Lambda Python Runtime Interface Client is vended through pip. You can include this package in your preferred base image to make that base image Lambda compatible. To make it easy to locally test Lambda functions packaged as container images we open-sourced a lightweight web-server, Lambda Runtime Interface Emulator (RIE), which allows your function packaged as a container image to accept HTTP requests. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Memvid

    Memvid

    Video-based AI memory library. Store millions of text chunks in MP4

    Memvid encodes text chunks as QR codes within MP4 frames to build a portable “video memory” for AI systems. This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.
    Downloads: 34 This Week
    Last Update:
    See Project
  • 4
    FastHTML

    FastHTML

    The fastest way to create an HTML app

    Built on solid web foundations, not the latest fads - with FastHTML you can get started on anything from simple dashboards to scalable web applications in minutes.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Evertune | Improve Your Brand's Visibility in AI Search Icon
    Evertune | Improve Your Brand's Visibility in AI Search

    For enterprise marketing teams looking for a platform to understand and influence how AI models like ChatGPT recommend their products or services.

    Evertune is the Generative Engine Optimization (GEO) platform that helps brands improve visibility in AI search across ChatGPT, AI Overview, Gemini, Claude and more.
    Learn More
  • 5
    miniblink49

    miniblink49

    Lighter, faster browser kernel of blink to integrate HTML UI in apps

    ...After turning off the cross-domain switch, you can use various cross-domain functions (support cross-domain). Headless mode, which greatly saves resources and is suitable for crawlers (headless mode, be suitable for Web Crawler).
    Downloads: 10 This Week
    Last Update:
    See Project
  • 6
    Docker Compose

    Docker Compose

    Define and run multi-container applications with Docker

    Docker Compose is an open source tool for defining and running multi-container applications with Docker. Compose lets you use a YAML file to configure your application’s services, and then create and start all the services from your configuration with just a single command. Compose works great in all environments: production, staging, testing, development, and on CI workflows. Compose has commands for every stage of your application lifecycle, from starting, stopping and rebuilding services,...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 7
    JS Beautifier

    JS Beautifier

    Beautifier for javascript

    ...The beautifier can be added on your page as web library. JS Beautifier is hosted on two CDN services: cdnjs and rawgit. You can beautify javascript using JS Beautifier in your web browser, or on the command-line using node.js or python.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Tarsier

    Tarsier

    Vision utilities for web interaction agents

    At Reworkd, we iterated on all these problems across tens of thousands of real web tasks to build a powerful perception system for web agents... Tarsier! In the video below, we use Tarsier to provide webpage perception for a minimalistic GPT-4 LangChain web agent. Tarsier visually tags interactable elements on a page via brackets + an ID e.g. [23]. In doing this, we provide a mapping between elements and IDs for an LLM to take actions upon (e.g. CLICK [23]). We define interactable elements...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    granary

    granary

    The social web translator

    The social web translator. Fetches and converts data between social networks, HTML and JSON with microformats2, ActivityStreams/ActivityPub, Atom, JSON Feed, and more. Granary is a library and REST API that fetches and converts between a wide variety of social data sources and formats. Free yourself from silo API chaff and expose the sweet social data foodstuff inside in standard formats and protocols.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Pylon is an All-in-one B2B Support Platform for modern B2B businesses. Icon
    Pylon is an All-in-one B2B Support Platform for modern B2B businesses.

    Pylon is a modern support system that integrates with all B2B channels like Slack and Team.

    We bring together everything a post-sales teams team needs including a ticketing system, B2B omnichannel integrations (Slack Connect, Microsoft Teams), modern chat widget, knowledge base, AI support bot, account management, customer marketing, and more.
    Learn More
  • 10
    openvpn-monitor

    openvpn-monitor

    openvpn-monitor is a web based OpenVPN monitor

    openvpn-monitor is a simple Python program to generate HTML that displays the status of an OpenVPN server, including all current connections. It uses the OpenVPN management console. It typically runs on the same host as the OpenVPN server, however, it does not necessarily need to. OpenVPN-monitor is a web-based OpenVPN monitor, that shows current connection information, such as users, location, and data transferred.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    FreeTAKServer

    FreeTAKServer

    Situational Awareness Server compatible with TAK clients

    FTS is a Python3 implementation of a TAK Server for devices like ATAK, WinTAK, and ITAK, it is cross-platform and runs from a multi-node installation on AWS down to the Android edition. It's free and open source (released under the Eclipse Public License. FTS allows you to connect ATAK clients to share geo-information, to chat with all the connected clients, exchange files and more. It intends to support all the major use cases of the original TAK server.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 12
    HTTPie CLI

    HTTPie CLI

    Modern, user-friendly command-line HTTP client for the API era

    HTTPie (pronounced aitch-tee-tee-pie) is a command-line HTTP client. Its goal is to make CLI interaction with web services as human-friendly as possible. HTTPie is designed for testing, debugging, and generally interacting with APIs & HTTP servers. The HTTP & HTTPS commands allow for creating and sending arbitrary HTTP requests. They use simple and natural syntax and provide formatted and colorized output.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Connexion

    Connexion

    Swagger/OpenAPI First framework for Python on top of Flask

    Connexion is a framework on top of Flask that automagically handles HTTP requests defined using OpenAPI (formerly known as Swagger), supporting both v2.0 and v3.0 of the specification. Connexion allows you to write these specifications, then maps the endpoints to your Python functions. This is what makes it unique from other tools that generate the specification based on your Python code. You are free to describe your REST API with as much detail as you want and then Connexion guarantees...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    SAFRS

    SAFRS

    SqlAlchemy Flask-Restful Swagger Json:API OpenAPI

    SAFRS exposes SQLAlchemy database models as a JSON:API web service and generates the corresponding swagger/OpenAPI. Database objects such as the User class from the demo.py example can be extended to include relationships with other objects. The demo_relationship.py contains the following extension of the User class where a relationship with the Book class is implemented.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    CWhy

    CWhy

    Explains and suggests fixes for compile-time errors for C, C++, C#, Go

    Explains and suggests fixes for compiler error messages for a wide range of programming languages, including C, C++, C#, Go, Java, LaTeX, PHP, Python, Ruby, Rust, Swift, and TypeScript. CWhy needs to be connected to an OpenAI account or an Amazon Web Services account. Your account will need to have a positive balance for this to work (check your OpenAI balance). CWhy currently defaults to GPT-4, and falls back to GPT-3.5-turbo if a request error occurs. For the newest and best model (GPT-4) to work, you need to have purchased at least $1 in credits (if your API account was created before August 13, 2023) or $0.50 (if you have a newer API account).
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    CssSelector Component

    CssSelector Component

    Converts CSS selectors to XPath expressions

    XPath expressions are incredibly flexible, so there is almost always an XPath expression that will find the element you need. Unfortunately, they can also become very complicated, and the learning curve is steep. Even common operations (such as finding an element with a particular class) can require long and unwieldy expressions. CSS selectors are less powerful than XPath, but far easier to write, read and understand. Since they are less powerful, almost all CSS selectors can be converted to...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    uvicorn-gunicorn-fastapi

    uvicorn-gunicorn-fastapi

    Docker image with Uvicorn managed by Gunicorn

    Docker image with Uvicorn managed by Gunicorn for high-performance FastAPI web applications in Python with performance auto-tuning. Optionally with Alpine Linux.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Lexbor

    Lexbor

    Lexbor is development of an open source HTML Renderer library

    ...Lexbor’s code is optimized for ease of access in end-user applications and across programming languages. You can effortlessly wrap all the capabilities of the library in, say, Python to power your end-user or backend applications in a new environment. In other words, lexbor offers a feature-rich core that developers can build upon as they see fit.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Framework Benchmarks

    Framework Benchmarks

    Source for the TechEmpower Framework Benchmarks project

    ...Please feel free to ask questions here. We encourage new frameworks and contributors to ask questions. We're here to help! This project provides representative performance measures across a wide field of web application frameworks. With much help from the community, coverage is quite broad and we are happy to broaden it further with contributions. The project presently includes frameworks on many languages including Go, Python, Java, Ruby, PHP, C#, F#,Clojure, Groovy, Dart, JavaScript, Erlang, Haskell, Scala, Perl, Lua, C, and others. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Jupyter Enterprise Gateway

    Jupyter Enterprise Gateway

    Enables Jupyter Notebooks to share resources across clusters

    Jupyter Enterprise Gateway is a headless web server with a pluggable framework for anyone supporting multiple notebook users in a managed-cluster environment. Some of the core functionality it provides is better optimization of compute resources, improved multi-user support, and more granular security for your Jupyter notebook environment - making it suitable for enterprise, scientific, and academic implementations. From a technical perspective, Jupyter Enterprise Gateway is a web server...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    WebPin

    WebPin

    Advanced Modern Web Application Manager for Linux

    Advanced Modern Web Application Manager for Linux Transform any website into a native desktop application
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    ZK - Simply Ajax and Mobile
    ZK is an open-source Java framework for building modern web and mobile applications. It enables developers to create rich, interactive UIs using only Java — no JavaScript required. With 200+ Ajax-powered components, event-driven architecture, and support for popular technologies like Spring, Java EE, and JSP/JSF, ZK makes it simple to deliver powerful and user-friendly web applications.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    Echo HTML Viewer

    Echo HTML Viewer

    Fast offline HTML viewer for opening local HTML files on Windows

    Echo HTML Viewer is a lightweight desktop app for viewing local HTML files without a browser or internet connection. Designed for simplicity and privacy, it lets you open saved web pages, documentation, and archived content in a clean, distraction-free interface. Key features: • Open HTML files instantly • Drag & drop support • Fast startup and low resource usage • Fully offline — no telemetry, no tracking • No background services Use cases: • View saved websites...
    Leader badge
    Downloads: 60 This Week
    Last Update:
    See Project
  • 24
    EpiDoc: Epigraphic Documents in TEI XML

    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
    Leader badge
    Downloads: 11 This Week
    Last Update:
    See Project
  • 25
    Bots-ediint open source edi translator

    Bots-ediint open source edi translator

    open source edi translator

    Bots-ediint is a file translator for EDI (Electronic Data Interchange). Translate files and databases data format to another one: edifact, x12, tradacoms, xml, json, raw, database (postgres, sqlite, mysql, oracle, ...) and more. Bots-ediint is based on Bots https://bots.sourceforge.net Support Python 2 and Python 3 AS2 communications are carried out with my external software that could work in standalone and/or in a Bots channel to send and receive AS2 messages (SYNC and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB