Showing 23 open source projects for "python web crawler"

  • 1
    NGINX Admin’s Handbook

    How to improve NGINX performance, security, and other important things

    nginx-admins-handbook is a practical, in-depth guide for configuring, securing, and operating NGINX across real-world deployments. It distills years of research, notes, and field experience into a single handbook that complements the official docs with concrete rules, explanations, and curated external references. The handbook spans fundamentals and advanced topics alike, from HTTP and SSL/TLS basics to reverse proxy patterns, performance tuning, debugging workflows, and hardening...
    Downloads: 2 This Week
  • 2
    DocsGPT

    Private AI platform for agents, enterprise search and RAG pipelines

    DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models. Works with Qdrant, MongoDB, Elasticsearch, and more. Deploy via Docker or Kubernetes with full data sovereignty. Build...
    Downloads: 2 This Week
  • 3
    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have to...
    Downloads: 0 This Week
  • 4
    yz-next-apyref

    Arno-Can's Python-based API Reference Generator.

    Multi-Syntax-Version and Multi-Implementation API reference documentation generator. Supports local and remote sources.
    Downloads: 0 This Week
  • 5

    etalon

    Standards Document Redaction System

    The best process for creating standards is that of the IETF’s RFCs. Standard documents are written, submitted for publication, reviewed, and finally approved. Then they get an official number and hence can be referenced. Some can even be blessed to be an official standard. Etalon will implement this whole process, web-based and with markdown documents. Create your company’s standards the RFC way!
    Downloads: 0 This Week
  • 6
    Leo is an outlining-oriented project manager and browser. With Leo you can organize programs, web sites, URLs, pictures, or any other kind of data. Leo is fully scriptable in Python. Leo is 100% pure Python. It requires the Qt GUI toolkit.
    Downloads: 4 This Week
  • 7
    DirDiff

    Python 2.7 script that analyzes two directories for differences, with HTML output

    Comparing two different versions of a directory with text files of any kind has never been easier. Detects deleted files, new files, and files with a changed name but the same content, and of course provides a side-by-side, line-by-line comparison of changed files. The number of context lines is configurable. Here's how to call it: >python DirDiff.py dir1 dir2 output.html or >python.exe DirDiff.py -l 3 -v olddir newdir dirdiff.html. Here, -l is the number of context lines in the side-by-side comparison. -v...
    Downloads: 0 This Week
  • 8
    Poor Http

    WSGI Server, WSGI Connector, Python doc generator

    Poor Http Server is a standalone WSGI server designed for running Python web applications. Unlike other projects, this is not a framework but a single server, a light WSGI connector, and a Python doc generator.
    Downloads: 0 This Week
  • 9
    DoCookBook

    Cookbook Style Document for DocBook Customizations

    This project has been moved to GitHub: https://github.com/tomschr/dbcookbook/ The DoCookBook project aims to create an open source book about DocBook and the DocBook XSL stylesheets written as a cookbook and released under a Creative Commons license.
    Downloads: 0 This Week
  • 10
    CodeInvestigator is a tracing tool for Python programs. All the run-time information is recorded. Read your code together with its run-time details. Use it to visualize what happened when your program ran.
    Downloads: 0 This Week
  • 11
    Pynta - Flexible Web Framework in Python
    Pynta is a flexible web framework written in Python. All development is happening at https://github.com/lig/pynta
    Downloads: 0 This Week
  • 12
    ARREA

    Collaborative editor, requirements management tool and much more

    ARREA (A Reasonable Requirements Engineering Application) is a simple yet powerful requirements management tool that uses arbitrary files to store information and Subversion to allow team collaboration. Its simple conventions make it suitable for many other purposes: in fact, for any document-oriented task focused on collaborative work and exhaustive change and traceability management. At its heart, ARREA provides a thin layer on top of SVN that allows it to be used as a powerful...
    Downloads: 0 This Week
  • 13
    Make AsciiDoc part of your literate programming tool set. With eWEB you can weave and tangle literate programs written as AsciiDoc documents, using embedded WEB code snippets.
    Downloads: 0 This Week
  • 14
    Txt2tags converts a text file with minimal markup to HTML, XHTML, SGML, LaTeX, Lout, UNIX Man Page, Wikipedia, Google Code Wiki, DokuWiki, MoinMoin, MagicPoint(mgp), PageMaker. Features: simple, fast, automatic TOC, macros, filters, include, GUI/CLI/
    Downloads: 0 This Week
  • 15
    uListen is a TTS (text-to-speech) application. It can read aloud web pages, CHM files, PDF files, Word files, and plain text files.
    Downloads: 0 This Week
  • 16
    The pydoc module, distributed with Python, has proved extremely helpful in handling documentation strings included in Python scripts. This project proposes a cleanup of the code as well as new capabilities, with Python version 2.6 as a target.
    Downloads: 0 This Week
  • 17
    A small, effective, extensible, and customizable personal wiki developed for Pocket PC systems. Can run on all Python-supported platforms.
    Downloads: 0 This Week
  • 18
    Fudge is a documentation system for software projects inspired by Pudge. It is currently targeted at providing integration with current Python software development tools and internationalization, but support for other programming languages is planned.
    Downloads: 0 This Week
  • 19
    The Django-driven web-based book-authoring content-management-system (ddwbbacms) is a program that allows authors to work on their books, articles, or topics without having to have their hard drives nearby. Access to the web is all that is required.
    Downloads: 0 This Week
  • 20
    System to assist with the management of a local Buddhist study library. Will download and install chosen documentation (sutras, etc), index them, and provide a web interface. Python, HTML. Needs a webserver, namazu. Can be used for other documents, too.
    Downloads: 0 This Week
  • 21
    Wenost simplifies the process of building and maintaining larger web sites by hand. Wenost is used in conjunction with PPWizard, a free HTML pre-processor. For examples of websites built with wenost see www.csem.org.uk and the wenost project website.
    Downloads: 0 This Week
  • 22
    POST (Python Obviously Simple Text) provides support for simple, flexible dynamic document generation in multiple output formats. Supports inputs in text or XML, outputs in HTML, PDF, RTF, LaTeX source, nroff source, postscript, and plain text.
    Downloads: 0 This Week
  • 23
    Gathering requirements should be fun. It's the brainstorming process refined. It shouldn't get in the way of productivity or creativity. Harvest is about gathering requirements in the most intuitive, least intrusive way possible.
    Downloads: 0 This Week