Showing 28 open source projects for "data mining"

  • 1
    Scrapy

    A fast, high-level web crawling and web scraping framework

    ...It can be used for data mining, monitoring and automated testing.
    Downloads: 28 This Week
  • 2
    ProM is the comprehensive, extensible framework for process mining. Process mining deals with the a posteriori analysis of (business) processes using enactment logs.
    Downloads: 29 This Week
  • 3
    Geziyor

    Blazing fast Go framework for web crawling and data scraping tasks

    ...It is designed to help developers crawl websites and extract structured information from web pages efficiently. It focuses on speed and scalability, allowing large numbers of requests to be processed concurrently. Geziyor supports use cases such as data mining, monitoring web content, and automated testing workflows. It provides a flexible architecture where developers define parsing functions that process responses and extract the desired data. Geziyor includes features for managing requests, handling cookies, respecting robots rules, and exporting collected data in multiple formats. ...
    Downloads: 1 This Week
  • 4
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text processing technologies to easily extract useful data from arbitrary web pages.
    Downloads: 4 This Week
  • 5
    crawly

    High-level web crawling and scraping framework for Elixir apps

    Crawly is a high-level application framework for crawling websites and extracting structured data using the Elixir programming language. It provides a complete environment for building web crawlers that systematically visit pages, collect information, and transform that data into structured formats for further processing. Crawly is designed for tasks such as data mining, information processing, and building historical archives of web content.
    Downloads: 4 This Week
  • 6
    The Lemur Project

    Search engine and data mining applications and ClueWeb datasets.

    The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
    Downloads: 30 This Week
  • 7
    LymPHOS2

    LymPHOS2 Web-App

    ...Proteomics 2009, 9, 3741–3751. DOI: 10.1002/pmic.200800701 - Gallardo, Ó., Ovelleiro, D., Gay, M., Carrascal, M., Abian, J., A collection of open source applications for mass spectrometry data mining. Proteomics 2014, 20, 2275-2279. DOI: 10.1002/pmic.20140012
    Downloads: 0 This Week
  • 8

    python-web_excavator

    General data mining API: only write HTML parsing code.

    A general web scraper that uses the requests library to communicate with the website. Scraper() contains a parser object, which you can add parsing handles to. ParseHandle() is the code that mines your data from an HTML source. Repo: https://github.com/crispycret/web_excavator
    Downloads: 0 This Week
  • 9
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 3 This Week
  • 10

    Language KISS HTML5

    Browser based flash cards

    ...Main features:
    - learn (cards due to learn), swat (every card)
    - copy cards
    - filter
    - sort
    - synchronize between devices
    - share cards with other users (just use all and/or improve and add cards)

    Program code and usage follow the KISS principle (see http://en.wikipedia.org/wiki/KISS_principle):
    - basic requirement = browser
    - OS independent
    - no installation required
    - no fancy features
    - bulletproof synchronization and sharing
    - deployment is just copying files onto a PHP-enabled web server

    More:
    - data is on your own server
    - no central registration or data mining

    Technical details:
    - logic in JavaScript (browser)
    - share/sync by PHP scripts on the web server (file based)
    Downloads: 0 This Week
  • 11

    SquidCube

    Squid log data warehouse

    Feed Squid logs into a PostgreSQL database, then use the Pentaho BI server for data mining.
    Downloads: 0 This Week
  • 12

    ebay mine

    OO PHP libraries for mining data from eBay into a MySQL database

    I started this project for use in a new business and decided that the development time for the end result was going to be too long. This is basically an OO PHP API to retrieve data from eBay to be stored in a MySQL database for analysis. In a test run I retrieved over 804,000 completed item auction records from the consumer electronics category on eBay.
    Downloads: 0 This Week
  • 13
    TAXOMO
    Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).
    Downloads: 0 This Week
  • 14
    System for Web usage mining and data warehousing: it supports knowledge discovery from data (KDD) about users' activity on the Web (such as unique visitors, sessions, transactions) and organises the results in an RDBMS (currently PostgreSQL). Written in C++.
    Downloads: 0 This Week
  • 15
    ht://Check is more than a link checker. It's particularly suitable for checking broken links, anchors and web accessibility barriers, but retrieved data can also be used for Web structure mining. Uses a MySQL backend. Derived from ht://Dig.
    Downloads: 0 This Week
  • 16
    WSRF-compliant tools and services for data mining in grid computing environments, based on: Globus Toolkit 4, Condor and Triana workflow system. Learn more at: http://www.datamininggrid.org Copyright (c) 2008 DataMiningGrid Consortium.
    Downloads: 0 This Week
  • 17
    Wiki .NET
    A set of libraries and programs to access the data of MediaWiki-based websites through the MediaWiki API. Also includes visual tools to aid editing wiki-markup pages. Created using the .NET Framework and C#.
    Downloads: 0 This Week
  • 18
    Catalyst-based application intended to be a robot fitting tool, character planner, mining yield planner, and eventually also a killboard/corporation management tool for Perpetuum Online. This is the base project, without any non-free data.
    Downloads: 0 This Week
  • 19
    Webstats Solr is an attempt to make Apache access logs easier to data mine. By adding a powerful search engine (Solr) as a backend and using JavaScript, HTML, and maybe PHP, I hope to outdate AWStats.
    Downloads: 0 This Week
  • 20
    The ProM Import Framework extracts process enactment event logs from a set of information systems. These can be exported in the MXML format, which is the standard event log data format for Process Mining analysis techniques.
    Downloads: 0 This Week
  • 21
    Crow - Computational Representation Of Whatever. A platform for the integration and mining of complex and distributed data. Represents cross-linked semantic web documents as a network of software objects and offers easy ways to filter and sort them.
    Downloads: 0 This Week
  • 22
    This project intends to create an indexing search engine for knowledge management. The primary objective is to apply an information retrieval core and to implement a knowledge discovery approach such as data mining algorithms or text mining.
    Downloads: 0 This Week
  • 23
    SharpResource is a smart web resource retrieval engine for script-based and automatic Internet data mining, written in C#. It is component-driven and fully customizable. It aims to be a versatile and robust library, not a system.
    Downloads: 0 This Week
  • 24
    webExtractor is a Java application for extracting specific content from web-based HTML, XML, CSV, and free-form text. The extracted data can be used for data gathering and mining purposes.
    Downloads: 1 This Week
  • 25
    iTIP (intelligent Transaction Identification Processor) is a command-line tool for data preparation and session identification in web log mining.
    Downloads: 0 This Week