Search Results for "html source extractor" - Page 5

Showing 518 open source projects for "html source extractor"

  • 1
    HunyuanOCR

    OCR expert VLM powered by Hunyuan's native multimodal architecture

    HunyuanOCR is an open-source, end-to-end OCR (optical character recognition) Vision-Language Model (VLM) developed by Tencent‑Hunyuan. It’s designed to unify the entire OCR pipeline, detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation, into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a...
    Downloads: 0 This Week
  • 2
    LangChain Extract

    Did you say you like data?

    LangChain Extract is an open-source reference application designed to demonstrate how large language models can be used to extract structured data from unstructured text and document files. The project implements a lightweight web service that allows developers to define extraction schemas and apply them to various sources such as plain text, HTML, or PDF documents.
    Downloads: 0 This Week
  • 3
    pypandoc

    Thin wrapper for "pandoc" (MIT)

    Pypandoc provides a thin wrapper for pandoc, a universal document converter. Pypandoc uses pandoc, so it needs an available installation of pandoc. Pypandoc provides two packages, "pypandoc" and "pypandoc_binary"; the second includes pandoc out of the box. The two packages are otherwise identical. If pandoc is already installed (i.e. pandoc is in the PATH), pypandoc uses the version with the higher version number,...
    Downloads: 0 This Week
  • 4
    bridgex

    Convert files like docx, xlsx, pptx, html, and more to Markdown

    ...Supported Formats 📂 Bridgex supports conversion of the following file formats:
      - PDF (.pdf)
      - Word (.docx)
      - PowerPoint (.pptx)
      - Excel (.xlsx, .xls, .csv)
      - Outlook Messages (.msg)
      - Text (.txt, .text)
      - Markdown (.md, .markdown)
      - JSON (.json, .jsonl)
      - XML (.xml)
      - RSS/Atom (.rss, .atom)
      - HTML/MHTML (.html, .htm, .mhtml)
      - ePub (.epub)
      - Compressed files (.zip)
      - Jupyter Notebooks (.ipynb)
      - Other formats supported by Markitdown
    Bridgex is not an IDE, text editor, Markdown editor, or document viewer
    Downloads: 6 This Week
  • 5
    Whakerexa

    A minimalist and lightweight web kit for accessible contents

    Whakerexa provides a lightweight, modular set of CSS and JavaScript tools for building accessible, consistent, and customizable web interfaces. It is intended to make creating accessible web content as simple as possible and to minimize the use of CSS classes, which keeps the HTML code readable. It was designed to be easily customizable, allowing users to adjust properties such as fonts, colors, and borders. Most of the properties are stored in variables, so they can be redefined to obtain a custom style. It can be combined with WhakerPy, an open source library for creating dynamic HTML content: <https://whakerpy.sf.net>. ...
    Downloads: 2 This Week
  • 6
    TextTest is an application-independent tool for text-based functional testing. This means running a batch-mode binary in lots of different ways, and using the text output produced as a means of controlling the behaviour of that application.
    Downloads: 85 This Week
  • 7
    WhakerKit

    A seamless toolkit to manage dynamic websites and shared documents

    WhakerKit is a versatile toolkit for building websites with both static and dynamic HTML pages, developed by Brigitte Bigi, CNRS. WhakerKit offers seamless management of public and authenticated access, and simplifies document sharing for collaborative environments. It is based on the following technologies:
      * python >= 3.9
      * (optional) PyJWT and ldap3 for authentication (install with pip)
      * WhakerPy >= 1.3: <https://whakerpy.sourceforge.io> (install with pip)
      * Whakerexa >=...
    Downloads: 0 This Week
  • 8
    Mastering Bitcoin

    Mastering Bitcoin 3rd Edition - Programming the Open Blockchain

    The bitcoinbook repository contains the source code for Mastering Bitcoin, the authoritative open-source book by Andreas M. Antonopoulos on Bitcoin and cryptocurrency technologies. Written in a collaborative and continuously updated format using Markdown and AsciiDoc, the book serves as a comprehensive technical guide for developers, engineers, and system architects who want to understand how Bitcoin works. It covers the protocol, cryptography, peer-to-peer architecture, wallets, mining, and...
    Downloads: 11 This Week
  • 9
    Wapiti

    Wapiti is a web-application vulnerability scanner

    Wapiti is a vulnerability scanner for web applications. It currently searches for vulnerabilities such as XSS, SQL and XPath injections, file inclusions, command execution, XXE injections, CRLF injections, Server Side Request Forgery, Open Redirects... It uses the Python 3 programming language.
    Downloads: 133 This Week
  • 10
    csv2odf

    csv2odf can convert csv data to formatted spreadsheets and documents.

    ...It can be used to create spreadsheets and documents for LibreOffice, OpenOffice, Microsoft Office Excel and Word. It is open source (GPL v3) and cross-platform: it can run on most operating systems that can run Python (Python is required). More details, example files, and an online manual are at http://csv2odf.sf.net.
    Downloads: 10 This Week
  • 11

    cfe

    Framework for producing web-based database applications

    Sophisticated environment for developing web-based applications. It includes lots of handy low-level components including: 1/ MySQL interface, including table creation, tables as class, query as iterator 2/ HTML generation 3/ multi-lingual ability 4/ HTML form field handling 5/ etc. On top of this is a high-level framework that allows rapid development of database apps including search/create/display/update, user navigation between related entities and much more. Included, as a...
    Downloads: 1 This Week
  • 12
    Small Python library with various utilities, such as configuration file parsing (in Python syntax) and HTML and PDF parsing. Used in my other projects.
    Downloads: 0 This Week
  • 13
    LexiFinder

    AI-powered semantic indexing: automating the creation of book indexes

    ...Both interfaces share the same underlying engine and support the same features. Supported input formats are PDF, DOCX, and ODT. The index can be exported as plain text, JSON, CSV, or HTML.
    Downloads: 2 This Week
  • 14
    Forex Assistant

    Manage accounts, analyze trades, track strategies & stats, News & Cal

    Forex Assistant is a comprehensive Windows desktop application designed to support forex traders in every aspect of their trading journey. From essential trading calculators and market insights to powerful account and trade management, it brings everything you need into one intuitive platform. 🔑 Main Features 🧮 Calculator Tools Profit/Loss Calculator Currency Converter Position Size Calculator Pips Calculator Margin Calculator 🌍 Market Insights Live Market News Economic...
    Downloads: 3 This Week
  • 15
    justniffer
    justniffer is a TCP sniffer. It reassembles and reorders packets and displays the TCP flow in a customizable way. It can log network traffic in web server log format. It can also log network service performance (e.g. web server response times) and extract HTTP content (images, HTML, scripts, etc.).
    Downloads: 2 This Week
  • 16
    Buku

    Powerful command-line bookmark manager. Your mini web!

    buku is a powerful bookmark manager written in Python3 and SQLite3. buku fetches the title of a bookmarked web page and stores it along with any additional comments and tags. You can use your favourite editor to compose and update bookmarks. With multiple search options, including regex and a deep scan mode (particularly for URLs), it can find any bookmark instantly. Multiple search results can be opened in the browser at once. Though a terminal utility, it's possible to add bookmarks...
    Downloads: 15 This Week
  • 17
    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables. Also computes readability indexes (Gunning-Fog, Coleman-Liau, Automated Readability Index (ARI), SMOG grade,...
    Downloads: 4 This Week
  • 18
    ufonet

    UFONet - Denial of Service Toolkit

    UFONet is a set of hacktivist tools that allow launching coordinated DDoS and DoS attacks and combine both in a single offensive. It also works as an encrypted DarkNET to publish and receive content by creating a global client/server network based on a direct-connect P2P architecture. + FAQ: https://ufonet.03c8.net/FAQ.html -------------------------------------------- -> UFONet-v1.8 [DPh] "DarK-PhAnT0m!" (.zip) -> md5 = [ c8ab016f6370c8391e2e6f9a7cbe990a ] -> UFONet-v1.8...
    Downloads: 10 This Week
  • 19
    AI Bulk Mailer

    Bulk email sending software with AI content generation

    Bulk email sending software with multithreading support, automatic content generation using multiple AI models, and collection of all email addresses on a web page.
    Downloads: 3 This Week
  • 20
    Hero Planner

    An easy to use GUI for creating team rotas and scheduling absences.

    HERO Planner – Unleash Your Scheduling Superpowers Is rostering your kryptonite? Transform chaos into control with HERO Planner, the command center for retail, hospitality, and service teams. This modern desktop app turns managers into scheduling superheroes—no cape required. Your Utility Belt: ✓ Super Speed: Intuitive drag-and-drop & bulk actions for lightning-fast updates. ✓ Compliance Shield: Real-time alerts guard against labour law violations and overtime risks. ✓ X-Ray Vision:...
    Downloads: 5 This Week
  • 21
    Dominate

    Dominate is a Python library for creating and manipulating HTML docs

    Dominate is a Python library for creating and manipulating HTML documents using an elegant DOM API. It allows you to write HTML pages in pure Python very concisely, which eliminates the need to learn another template language, and lets you take advantage of the more powerful features of Python. Dominate can also use keyword arguments to append attributes onto your tags. Most of the attributes are a direct copy from the HTML spec with a few variations. Through the use of the += operator and...
    Downloads: 0 This Week
  • 22
    3tdb

    A web based MariaDB client.

    LINUX ONLY - requires mod_python. Python MariaDB client. ** 12.2-0 ** 2026-01-10 mod_python was removed from Debian trixie (13). As of this writing, mod_python can be compiled from git. 3tp3 uses minimal JavaScript, which you can examine in the js folder (one file, 36 lines). 3tp3 does not load any external (internet or other) code. No external fonts, helper code or advertisement. It can be installed and run on localhost.
    Downloads: 2 This Week
  • 23
    IPyPlot

    Fast and efficient plotting of images inside Python Notebooks

    IPyPlot is a small Python package offering fast and efficient plotting of images inside Python Notebooks. It uses IPython with HTML for a faster, richer, and more interactive way of displaying large numbers of images.
    Downloads: 0 This Week
  • 24
    myScite

    The allRound pocket sized CodeEditor.

    .... -- Features -- - Full MinGW and GTK SDKs Autocomplete.(190+) - Do system scripting (bash, applescript, cmd, powershell, perl, j/vbscript, awk) - Examine all sorts of data files (sql, regedit, mib, xml, yaml, json, vcard ...) - Review difference and patch files - Create makefiles (gnu make / cmake) - Edit html, css and config files (with calltips) - Describe circuits in vhdl and spice. ... - And finally; read & write source code: - [ Syntax highlighted ] - go, vala, pike, swift, flash, ch, rust - [ Calltip assisted ] - c/cpp11, js&jQuery, python, php, ruby, lua, c#, java, perl --Others-- - Restructured config files with inline docs - Scriptable via lua Extension...
    Downloads: 2 This Week
  • 25
    CAMPARI

    Software for molecular simulations and trajectory analysis

    We are proud to introduce version 5 of CAMPARI. We have added a number of new features, most notably a Python interface for interpreting user-supplied code (with the help of ForPy), a novel trajectory storage standard (with the help of libpqxx/PostgreSQL), and a module for performing transition path theory. Naturally, CAMPARI continues to provide the reference implementation of the ABSINTH force field paradigm and implicit solvation model. CAMPARI is a joint package for performing and...
    Downloads: 4 This Week