Showing 16 open source projects for "html source extractor"

View related business solutions
  • Infor M3 ERP Icon
    Infor M3 ERP

    Enterprise manufacturers and distributors requiring a solution to manage and execute complex processes

    Efficiently executing the complex processes of enterprise manufacturers and distributors. Infor M3 is a cloud-based, manufacturing and distribution ERP system that leverages the latest technologies to provide an exceptional user experience and powerful analytics in a multicompany, multicountry, and multisite platform. Infor M3 and related CloudSuite™ industry solutions include industry-leading functionality for the chemical, distribution, equipment, fashion, food and beverage, and industrial manufacturing industries. Staying ahead of the competition means staying agile. Our new capabilities bring improved data-driven insights and streamlined workflows to help you make informed decisions and take quick action.
    Learn More
  • More Bookings. Better Experience. Icon
    More Bookings. Better Experience.

    For tour and activity providers

    The all-in-one solution built to help you stay organised and get more bookings with thousands of connections to online travel agencies (OTAs), resellers and suppliers.
    Learn More
  • 1
    TXM

    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP...
    Leader badge
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    FastQC

    FastQC

    A quality control analysis tool for high throughput sequencing data

    FastQC is a quality control analysis tool designed to spot potential problems in high throughput sequencing datasets. Its goal is to provide a simple way by which to check the quality of raw sequence data coming from high throughput sequencing pipelines. It does this by running a modular set of analyses on one or more raw sequence files in fastq or bam format. It then produces a report summarizing the results, and highlighting any areas where the library may appear unusual. This should then...
    Downloads: 42 This Week
    Last Update:
    See Project
  • 3

    Ghawwas_V4

    An open source system for Arabic corpora processing

    ...Accept Windows and UTF-8 character encoding g. Accept TXT, DOC, DOCX, RTF and HTML formats h. Export the processing results in CSV file format
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    Universal Tag Finder

    Universal Tag Finder

    Is tool for query HTML content

    Universal Tag Finder is a tool to query against html content in the file system. For developers it allows to find out relevant elements during the troubleshooting and validating periods. Does not need to bother on massive regex to find out elements.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Streamline Hiring with Skill Assessments Icon
    Streamline Hiring with Skill Assessments

    Say goodbye to hiring guesswork. Use Canditech’s job simulation tests to assess real-world skills and make data-driven decisions.

    Canditech offers innovative, cheat-proof skill assessments and job simulations to transform your hiring process. From technical skills to soft skills, we help you assess candidates on actual job performance. With over 500 customizable tests and powerful video interview features, you can evaluate real-world capabilities, streamline your hiring, and reduce biases. Whether you’re hiring for remote roles, mass hiring, or looking to expand your diversity pool, Canditech’s data-driven platform ensures the right candidates are chosen for the job every time.
    Get a Free Demo
  • 5
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    P-VCD

    Content-Based Video Copy Detection

    Software for Video Copy Detection. The software has been described in some scientific papers, e.g. http://dx.doi.org/10.1109/ICME.2011.6012212 and http://dx.doi.org/10.1007/s11042-011-0915-x. This software is the result of my PhD at the University of Chile and the participation at TRECVID Content-Based Copy Detection (CCD) evaluation task 2010 and 2011. More details in http://www-nlpir.nist.gov/projects/tv2011/tv2011.html#ccd and in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    The National Library of New Zealand's Metadata Extraction Tool automatically extracts preservation-related metadata from digital files, then output that metadata in XML formats. It can be used through a graphical user interface or command-line interface. Please take the latest code from 'https://github.com/DIA-NZ/Metadata-Extraction-Tool.git'. The code on source forge will not be updated henceforth as it is moved to github.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8

    HadStat

    HadStat is service on cloud,for data analysis using Hadoop MapReduce.

    HadStat is service on the cloud, allow you to analysis the data on the cloud and return the result in nice graph,this service is free, you can redistribute it and/or modify it under the terms of the GNU General Public License. this service using many technologies , like Hadoop mapreduce, HTML, PHP, Web Service applications, linux server, java, eclipse IDE, with many indicators:Simple moving average (SMA),Exponential moving average (EMA),Smoothed simple moving average (SMMA),Linear...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Optex Analyzer is a software to analyze and compare algorithms to solve approximately optimization problems. It has a GUI that allows select a set of input files containing raw algorithm results. The analysis is shown with tables and charts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • BidJS enables online timed and webcast bidding on your own website. Icon
    BidJS enables online timed and webcast bidding on your own website.

    For companies wishing to sell by auction

    Bidlogix provide auction software to auction houses around the globe. We offer timed auction software, webcast auction software along with an optional invoicing management system. Our software is embedded on your site with fully customisable styling. Based in Brighton, UK Bidlogix started providing auction software in 2013. Our 2 in-house development teams are constantly evolving the product with in excess of 10 auctions per day being run using our software. Our software can cater for the largest of auctions, all in real-time and can support multiple languages.
    Get Started Now
  • 10
    Web-as-corpus tools in Java. * Simple Crawler (and also integration with Nutch and Heritrix) * HTML cleaner to remove boiler plate code * Language recognition * Corpus builder
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Open extensible system analysis report tool for Java, based on numerous open source analysis initiatives. The XML/XSL batch-processing framework produces integrated HTML/SVG reports of the systems current state and the development over time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Docgen plugin for Protege provides a quick export of all the content of an ontology (classes, instances and documentation) in various formats (html, pdf, fo...). Images, Graphs, URL are readyly included in reports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    The aim of MIEX (Metadata and Information Extractor from small XML documents) is to create a wrapper for the Stanford Parser, to extract and store metadata (syntactic structures, relationships among words...) from simple XML documents.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    The aim of this project is to highlight the effect of lexical chain scoring metrics and keyword extraction techniques on summary generation. We present our own chain-based keyword extraction system using WordNet lexical database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    RelEx is a semantic relationship extractor. It gives subject, object, possesive and other relationships between words in a sentence, part-of-speech, noun-number, verb tense, and gender tagging, and Hobbs anaphora (pronoun) resolution.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A knowledgment management system written in Java under JBoss 4.2.3 Server, with richfaces 3.3.0BETA4. Including fileconversion from html to pdf and rich:editor component without special syntaxing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB