Showing 26 open source projects for "data mining"

View related business solutions
  • Create a personalized AI chatbot for each team in minutes Icon
    Create a personalized AI chatbot for each team in minutes

    Get better, faster answers for your whole team with an AI chatbot trained on your company documents.

    QueryPal is the lifeline your team needs. Our AI chatbot integrates seamlessly with your communication channels, using advanced language understanding to identify and auto-answer repetitive questions — in seconds.
    Learn More
  • Eurekos LMS - Build a Smarter Customer Icon
    Eurekos LMS - Build a Smarter Customer

    The Eurekos customer training LMS makes it easy to deliver product training that retains more customers and transforms partners into advocates.

    Eurekos is a purpose-built LMS that engages customers throughout the entire learning journey from pre-sales, to onboarding, and everything after.
    Learn More
  • 1
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    ...It can be used for data mining, monitoring and automated testing.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 2
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Leader badge
    Downloads: 26 This Week
    Last Update:
    See Project
  • 3
    Geziyor

    Geziyor

    Blazing fast Go framework for web crawling and data scraping tasks

    ...It is designed to help developers crawl websites and extract structured information from web pages efficiently. It focuses on speed and scalability, allowing large numbers of requests to be processed concurrently. Geziyor supports use cases such as data mining, monitoring web content, and automated testing workflows. It provides a flexible architecture where developers define parsing functions that process responses and extract the desired data. Geziyor includes features for managing requests, handling cookies, respecting robots rules, and exporting collected data in multiple formats. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 3 This Week
    Last Update:
    See Project
  • World class QA, 100% done-for-you Icon
    World class QA, 100% done-for-you

    For engineering teams in search of a solution to design, manage and maintain E2E tests for their apps

    MuukTest is a test automation service that combines our own proprietary, AI-powered software with expert QA services to help you achieve world class test automation at a fraction of the in-house costs.
    Learn More
  • 5
    crawly

    crawly

    High-level web crawling and scraping framework for Elixir apps

    Crawly is a high-level application framework for crawling websites and extracting structured data using the Elixir programming language. It provides a complete environment for building web crawlers that systematically visit pages, collect information, and transform that data into structured formats for further processing. Crawly is designed for tasks such as data mining, information processing, and building historical archives of web content.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    The Lemur Project

    The Lemur Project

    Search engine and data mining applications and ClueWeb datasets.

    The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 7
    LymPHOS2

    LymPHOS2

    LymPHOS2 Web-App

    ...Proteomics 2009, 9, 3741–3751. DOI: 10.1002/pmic.200800701 - Gallardo, Ó., Ovelleiro, D., Gay, M., Carrascal, M., Abian, J., A collection of open source applications for mass spectrometry data mining. Proteomics 2014, 20, 2275-2279. DOI: 10.1002/pmic.20140012
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Language KISS HTML5

    Browser based flash cards

    ...Main features - learn (cards due to learn), swat (every card) - copy cards - filter - sort - synchronize between devices - share cards with other users (just use all and/or improve and add cards) Programm code and usage follows the KISS principle, see http://en.wikipedia.org/wiki/KISS_principle - basic requirement = browser - OS independent - no installation required - no fancy features - bullet proof synchonization and sharing - deployment is just copying files onto a PHP enabled webserver More - data is on your own server - no central registration or data mining Technical Details - logic in JavaScript (browser) - Share/synch by PHP scripts on webserver (file based)
    Downloads: 0 This Week
    Last Update:
    See Project
  • Arryved POS System Icon
    Arryved POS System

    Drive contagious loyalty with your guests and staff with a POS and Brewery Management system that helps run your craft brewery better.

    Arryved was built to help craft beverage makers thrive.
    Learn More
  • 10

    SquidCube

    Squid log data warehouse

    Feed Squid logs into PostgreSQL database, then use Pentaho BI server for data mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    ebay mine

    OO PHP Libraries for mining data from eBay into mysql database

    I started this project for use in a new business and decided the the development time for the end result was going to be too long. This is basically a OO PHP API to retrieve data from eBay to be stored in a MySQL database for analysis. In a test run I retrieved over 804,000 completed item auction records from the consumer electronics category on eBay.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    TAXOMO
    Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    System for Web usage mining and data warehouse: it allows the discovery of knowledge from data (KDD) regarding users' usage on the Web (such as unique visitors, sessions, transactions) and organise it in a RDBMS (currently PostgreSQL). Written in C++.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ht://Check is more than a link checker. It's particularly suitable for checking broken links, anchors and web accessibility barriers, but retrieved data can also be used for Web structure mining. Uses a MySQL backend. Derived from ht://Dig.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    WSRF-compliant tools and services for data mining in grid computing environments, based on: Globus Toolkit 4, Condor and Triana workflow system. Learn more at: http://www.datamininggrid.org Copyright (c) 2008 DataMiningGrid Consortium.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Wiki .NET
    A set of libraries and programs to access MediaWiki based websites' data through the MediaWiki API. Also includes visual tools to aid editing wiki-markup pages. Created using the .Net framework and C#
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Catalyst based application intended to be a robot fitting tool, character planner, mining yield planner and eventually also a killboard/corporation management tool for Perpetuum Online. This is the base project, without any non-free data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Webstats Solr is an attempt to make Apache Access log easier to Data Mine. By adding a powerful Search Engine (SOLR) as a Backend and using Java Script and HTML and maybe PHP I hope to out date AWStats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    The ProM Import Framework allows to extract process enactment event logs from a set of information systems. These can be exported in the MXML format, which is the standard event log data format for Process Mining analysis techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Crow - Computational Representation Of Whatever. A platform for the integration and mining of complex and distributed data. Represents cross-linked semantic web documents as a network of software objects and offers easy ways to filter, and sort them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    This project intends to create an indexing search engine, for knowledge management. The primary object is to apply an information retrieval core. And implement a knowledge data discovery theory such as data mining algorithm, text mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    webExtractor is a Java application that is used for extracting specific content from web based HTML, XML, CSV, and free form text. The extracted data can be used for data gathering and mining purposes.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 23
    iTIP - intelligent Transaction Identification Processor is a commandline tool for web log mining data preparation and session identification.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Frontier Data System is a collection programs for data acquisition and data mining. It uses MySQL and PHP to monitor process variables and performance of shop floor equipment. Produces information on issues such as cycle time, downtime and SPC.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Harvestman is a context aware metasearch engine which functions as a universal infromation gatherer and data mining system for the internet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB