Showing 122 open source projects for "html source extractor"

  • 1

    cuneiformplus

    Fork of OCR software cuneiform

    Fork of the OCR software cuneiform. For the original software, see https://launchpad.net/cuneiform-linux by Cognitive Technologies and Jussi Pakkanen. Other open source OCR software: Tesseract by Ray Smith (using the Leptonica image library), GOCR, and OCRAD.
    Downloads: 3 This Week
  • 2

    MetaErg

    Metagenome Annotation Pipeline

    MetaErg is a stand-alone and fully automated metagenome and metaproteome annotation pipeline published at: https://www.frontiersin.org/articles/10.3389/fgene.2019.00999/full. If you are using this pipeline for your work, please cite: Dong X and Strous M (2019) An Integrated Pipeline for Annotation and Visualization of Metagenomic Contigs. Front. Genet. 10:999. doi: 10.3389/fgene.2019.00999 The instructions on configuring and running the MetaErg pipeline are available at GitHub...
    Downloads: 1 This Week
  • 3

    perkun

    two experimental AI languages + zubr

    Two experimental AI languages: Perkun and its successor Wlodkowic. They attempt to maximize the expected value of the payoff function by appropriately choosing the actions (output variable values). The package also contains a tool called zubr, a Java code generator based on Perkun. Also take a look at my blog: http://pawel-biernacki.blogspot.fi/ For Windows users there is an installer: http://www.pawelbiernacki.net/perkun.msi
    Downloads: 4 This Week
  • 4

    anno

    Go package for text annotation

    Go package for text annotation. There are two parts to anno, the first is a series of Finder functions that look for interesting articles (which it calls `Notes`) inside the text, returning a slice of Note structs. The second is the Expander, which replaces the text in each Note with something else, like the HTML for a link or something. It tells you the bytes that it found, the `Start` index and a string describing the kind of `Note`. The kind is useful for when you run pass `Finder`...
    Downloads: 0 This Week
  • 5

    lgo

    Interactive Go programming with Jupyter

    lgo is an open-source programming environment that enables interactive Go programming within Jupyter Notebook environments. The project provides a Jupyter kernel for the Go programming language, allowing developers to write and execute Go code interactively in notebook cells similar to how Python is used in data science workflows. This environment combines the strong performance and concurrency features of the Go language with the exploratory and iterative style of notebook-based...
    Downloads: 0 This Week
  • 6

    Zabbix-in-Telegram

    Zabbix Notifications with graphs in Telegram

    Zabbix Notifications with graphs in Telegram.
    Downloads: 0 This Week
  • 7

    NASH OS

    Nash Operating System for Modern Ecommerce

    The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.
    Downloads: 2 This Week
  • 8

    voice-elements

    Web Component wrapper to the Web Speech API

    voice-elements is a set of Web Components that wraps the Web Speech API to make speech synthesis (text-to-speech) and speech recognition (speech-to-text) easy to use via simple HTML tags. Instead of manually dealing with JavaScript APIs like SpeechSynthesis and SpeechRecognition, developers can drop custom elements such as <voice-player> and <voice-recognition> into their markup and configure them with attributes. Built originally with Polymer, it follows the early Web Components ecosystem...
    Downloads: 0 This Week
  • 9

    WikiSQL

    A large annotated semantic parsing corpus for developing NL interfaces

    A large crowd-sourced dataset for developing natural language interfaces for relational databases. WikiSQL is the dataset released along with our work Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. Regarding tokenization and Stanza: when WikiSQL was written 3 years ago, it relied on Stanza, a CoreNLP Python wrapper that has since been deprecated. If you'd still like to use the tokenizer, please use the Docker image. We do not anticipate switching...
    Downloads: 1 This Week
  • 10

    DeepLearningProject

    An in-depth machine learning tutorial

    This tutorial tries to do what most machine learning tutorials available online do not. It is not a 30-minute tutorial that teaches you how to "Train your own neural network" or "Learn deep learning in under 30 minutes". It's a full pipeline of the kind you would need to build if you actually work with machine learning, introducing you to all the parts and all the implementation decisions and details that need to be made. The dataset is not one of the standard sets like MNIST or CIFAR, you...
    Downloads: 0 This Week
  • 11

    Panzer Combat II

    Computer-assisted miniature tank game.

    Panzer Combat II is a multi-player voice and webcam enabled computer-assisted distributed miniature wargame of World War II tank combat. Firing is done by placing a webcam behind the aiming unit. Distance to target is computed using computer vision. Action inside the tanks is performed on the computer screen while battlefield strategy is played on the miniature terrain. Both camps can use a different laptop or tablet, the game will interconnect. You can try it online :...
    Downloads: 0 This Week
  • 12

    Scattertext 0.2.1

    Beautiful visualizations of how language differs among document types

    A tool for finding distinguishing terms in corpora and displaying them in an interactive HTML scatter plot. Points corresponding to terms are selectively labeled so that they don't overlap with other labels or points.
    Downloads: 0 This Week
  • 13

    smartblob

    tiny code, html webcam game or plug brain in each blob, java server

    You play a 2D blob that reshapes to grab, bend, bounce, and swing on objects floating in midair (like a platformer game), except that, acrobatically, your view spins when your blob does. I'm planning a huge multiplayer world, with some blobs played by people holding a bendable loop game controller (tape a 1 meter cut of thick extension cord into a loop) in front of a webcam and bending it to bend their blob on screen, and other blobs controlled by AI. This is a game for general AI research in a fun way people can...
    Downloads: 0 This Week
  • 14
    We have implemented a core summarizer of scientific articles written in Spanish, with the following components: a tokenizer, a grammar checker, a clarity checker, a cohesion-coherence checker, a common-topic extractor and an output formatter.
    Downloads: 2 This Week
  • 15

    Intelligent Keyword Miner

    Intelligent SEO keyword miner and predicting tool

    THIS IS A NETBEANS 8.02 PROJECT, ENGLISH ONLY. This program was made to help me with patent research. It simply generates search keywords based on your upvotes or downvotes of the input parameters. It can accept text or a URL (text takes precedence over the URL). If you input a URL, it goes to the page and learns its text from the HTML. This program is intelligent in that it predicts what you may want to search next, based on your personal trends. After searching the...
    Downloads: 0 This Week
  • 16
    TexLexAn is an open source text analyser for Linux, able to estimate the readability and reading time, to classify and summarize texts. It has some learning abilities and accepts html, doc, pdf, ppt, odt and txt documents. Written in C and Python.
    Downloads: 0 This Week
  • 17

    phpSANE

    Web-Based Frontend for SANE

    phpSANE is a web-based frontend for SANE written in HTML/PHP so you can scan with your web-browser. It also supports OCR.
    Downloads: 3 This Week
  • 18

    Azul OS

    Azul OS version dev(Linux) IA

    ... # Azul voice version Windows, Azul interface: available # Azul dev rev 0.4.1: available [changelog] Software added: php5-mysql gcc-c++ php5-gd php5-ctype perl-HTML-Tagset php5-zip php5-curl kernel-source mysql-connector-java php5-pear php5-mcrypt php5-ftp devel_C_C++ gimp gedit recode libreoffice MozillaFirefox wireshark audacity nano. This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License. # Blog: http://azul0.wordpress.com/
    Downloads: 0 This Week
  • 19

    English-Khmer S. Machine Translation

    English-Khmer Automatic Statistical Machine Translation (SMT)

    The Automatic Machine Translation from English to Khmer project is the first effort in the Natural Language Processing field to translate English into the Khmer (Cambodian) language. The project uses Domy CE, an open source SMT toolkit, for training on a parallel corpus, and web technologies such as Python, Apache2, HTML, XML, and XSLT for developing the web-based application. The project is developed by Ms. Kim Sokphyrum (DU) and Ms. Suos Samak (Jamia), under the supervision of Mr. Javier Sola, a Program Manager at Open Institute (OI), Cambodia, Dr. Vasudha Bhatnagar, an Assistant Professor and Head of Computer Science at University of Delhi (DU), New Delhi, India, and Dr. ...
    Downloads: 0 This Week
  • 20
    TestEl is a Java-based learning analyzer for HTML (and possibly other) structured documents. It can be trained to detect structures in such documents and renders hits in XML.
    Downloads: 0 This Week
  • 21
    DGiovanni
    A multi-agent architecture for building interactive dramas. It uses Jason's BDI engine; Jason's agent-oriented programming language is used for drama management and for authoring the characters' behaviors.
    Downloads: 0 This Week
  • 22
    The first 3D search engine for text. JavaScript only. Works in all browsers. Ajax downloads new words (and links between them) as you move the mouse to control the AI, which learns what you're looking for (in context) and puts it on screen. Includes Wikipedia data.
    Downloads: 0 This Week
  • 23
    Tested for Ubuntu Maverick - Create Audiobooks from eBooks, text or pictures. - Read eBooks or text aloud while scrolling through pages
    Downloads: 2 This Week
  • 24
    The Crystallizing Substochastic Sequential Machine Extractor, CrySSMEx, is an algorithm for extracting Finite State Machines from Recurrent Neural Networks. Input: sequential data generated from RNNs. Output: (stochastic) FSMs and state space quantizers.
    Downloads: 0 This Week
  • 25
    Surikata
    SURIKATA (Syntactic Universal Reasoning for Inducing Kolmogorov Abstract Theories Automatically) is a system for searching large spaces of artifacts and inducing algorithms for generating similar artifacts.
    Downloads: 0 This Week