Showing 92 open source projects for "python data analysis"

View related business solutions
  • Iris Powered By Generali - Iris puts your customer in control of their identity. Icon
    Iris Powered By Generali - Iris puts your customer in control of their identity.

    Increase customer and employee retention by offering Onwatch identity protection today.

    Iris Identity Protection API sends identity monitoring and alerts data into your existing digital environment – an ideal solution for businesses that are looking to offer their customers identity protection services without having to build a new product or app from scratch.
    Learn More
  • Collect! is a highly configurable debt collection software Icon
    Collect! is a highly configurable debt collection software

    Everything that matters to debt collection, all in one solution.

    The flexible & scalable debt collection software built to automate your workflow. From startup to enterprise, we have the solution for you.
    Learn More
  • 1
    Smile

    Smile

    Statistical machine intelligence and learning engine

    Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers the state-of-art performance. Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster?
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    Discourse Network Analyzer (DNA)

    Discourse Network Analyzer (DNA)

    Discourse Network Analyzer (DNA)

    The Java software Discourse Network Analyzer (DNA) is a qualitative content analysis tool with network export facilities. You import text files and annotate statements that persons or organizations make, and the program will return network matrices of actors connected by shared concepts.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 3
    MOA - Massive Online Analysis

    MOA - Massive Online Analysis

    Big Data Stream Analytics Framework.

    A framework for learning from a continuous supply of examples, a data stream. Includes classification, regression, clustering, outlier detection and recommender systems. Related to the WEKA project, also written in Java, while scaling to adaptive large scale machine learning.
    Downloads: 43 This Week
    Last Update:
    See Project
  • 4
    AstronRPA

    AstronRPA

    Agent-ready RPA suite with visual workflow automation tools engine

    Astron RPA is an enterprise-grade robotic process automation platform designed to help organizations and developers build automated workflows for desktop and web applications. It provides a visual workflow designer that supports low-code and no-code development, allowing users to create automation processes through a drag-and-drop interface instead of writing extensive code. It enables automation of common desktop software and browser-based tasks, making it suitable for repetitive business...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Award-Winning Medical Office Software Designed for Your Specialty Icon
    Award-Winning Medical Office Software Designed for Your Specialty

    Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

    RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.
    Learn More
  • 5
    Qodo Cover

    Qodo Cover

    AI tool that generates tests to improve code coverage quickly

    Qodo Cover is an open source developer tool designed to automate the creation of unit tests using generative AI, helping teams improve code coverage with minimal manual effort. It operates as a command-line interface and can also be integrated into continuous integration workflows, making it adaptable to different development environments. It analyzes an existing codebase, identifies gaps in test coverage, and generates new tests that target uncovered or weakly tested areas. It follows an...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    EFAK-AI

    EFAK-AI

    A AI-Driven, Distributed and high-performance monitoring system

    ...The project focuses on simplifying Kafka administration by offering real-time insights into cluster health, performance metrics, and consumer activity, allowing engineers to quickly diagnose issues and optimize system behavior. It integrates advanced features such as intelligent monitoring, alerting, and automated analysis, enabling proactive management rather than reactive troubleshooting. The platform supports multiple Kafka clusters simultaneously, making it suitable for enterprise environments with distributed data pipelines. It also incorporates AI-driven capabilities that assist with diagnostics, performance recommendations, and conversational querying of system data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Tribuo

    Tribuo

    Tribuo - A Java machine learning library

    ...Provenance data allows each model to be rebuilt verbatim from scratch and for evaluations to track the models and datasets used for each experiment.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    PC_Workman_HCK

    PC_Workman_HCK

    AI-powered PC monitoring that explains. Not shows numbers/spikes.

    PC_Workman is what 680 hours of coding after warehouse shifts looks like. Built on a laptop hitting 94°C, this AI-powered monitoring tool does what Task Manager can't: it understands your system, not just measures it. Features: - Time travel monitoring - debug issues from hours ago - AI diagnostics with HCK_GPT - Custom fan curves with profiles - Floating always-on-top widget - 2D system map - Cross-GPU support (NVIDIA/AMD/Intel) Four complete rebuilds. 29 features killed....
    Downloads: 9 This Week
    Last Update:
    See Project
  • 9

    LegacyInsight

    Legacy reverse engineering tool

    LegacyInsight is an AI-powered reverse engineering platform that transforms legacy software systems into comprehensible business logic. Using cutting-edge GenAI, it analyzes legacy and extracts core operations, business rules, and data transformations—all translated into natural language. LegacyInsight supports enterprise-grade systems built on Java, COBOL, NET and other legacy stacks, helping organizations reclaim understanding of business-critical code.
    Downloads: 1 This Week
    Last Update:
    See Project
  • SoftCo: Enterprise Invoice and P2P Automation Software Icon
    SoftCo: Enterprise Invoice and P2P Automation Software

    For companies that process over 20,000 invoices per year

    SoftCo Accounts Payable Automation processes all PO and non-PO supplier invoices electronically from capture and matching through to invoice approval and query management. SoftCoAP delivers unparalleled touchless automation by embedding AI across matching, coding, routing, and exception handling to minimize the number of supplier invoices requiring manual intervention. The result is 89% processing savings, supported by a context-aware AI Assistant that helps users understand exceptions, answer questions, and take the right action faster.
    Learn More
  • 10
    Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TXM

    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP...
    Leader badge
    Downloads: 16 This Week
    Last Update:
    See Project
  • 12
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    aseryla

    aseryla

    Aseryla code repositories

    This project describes a model of how the semantic human memory represents the information relevant to the objects of the world in text format. It provides a system and a GUI application capable of extracting and managing concepts and relations from English texts. https://aseryla2.sourceforge.io/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    DynaQ

    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    VIKAMINE is a flexible environment for visual analytics, data mining and business intelligence - implemented in pure Java. It features several powerful visualization and mining methods, and can utilize background knowledge.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Ceka

    Ceka

    Crowd Environment and its Knowledge Analysis

    A knowledge analysis tool for crowdsourcing based on Weka. We also have a Python version of Crowdsourcing Learning: CrowdwiseKit on GitHub (https://github.com/tssai-lab/CrowdwiseKit).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    SQLFlow

    SQLFlow

    SQL compiler bridging databases and machine learning workflows

    ...By embedding machine learning operations into SQL, it removes the need for users to switch between programming languages such as Python or R, simplifying the overall workflow. SQLFlow also supports model training, prediction, and explanation tasks, allowing data practitioners to work entirely within a familiar query interface.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 18
    ModelDB

    ModelDB

    Open Source ML Model Versioning, Metadata, and Experiment Management

    An open-source system for Machine Learning model versioning, metadata, and experiment management. ModelDB is an open-source system to version machine learning models including their ingredients code, data, config, and environment and to track ML metadata across the model lifecycle.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    DSTK - Data Science TooKit 3

    DSTK - Data Science TooKit 3

    Data and Text Mining Software for Everyone

    DSTK - Data Science Toolkit 3 is a set of data and text mining softwares, following the CRISP DM model. DSTK offers data understanding using statistical and text analysis, data preparation using normalization and text processing, modeling and evaluation for machine learning and algorithms. It is based on the old version DSTK at https://sourceforge.net/projects/dstk2/ DSTK Engine is like R.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Wandora
    Wandora is a general purpose information extraction, management, and publishing environment based on Topic Maps and Java. Wandora has several data storage options, rich data extraction, import and export capabilities and embedded server.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    DSTK - DataScience ToolKit is an opensource free software for statistical analysis, data visualization, text analysis, and predictive analytics. Newer version and smaller file size can be found at: https://sourceforge.net/projects/dstk3/ It is designed to be straight forward and easy to use, and familar to SPSS user. While JASP offers more statistical features, DSTK tends to be a broad solution workbench, including text analysis and predictive analytics features. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    MYRA

    MYRA

    A collection of ACO algorithms for the data mining classification task

    MYRA is a collection of Ant Colony Optimization (ACO) algorithms for the data mining classification task. It includes popular rule induction and decision tree induction algorithms. The algorithms are ready to be used from the command line or can be easily called from your own Java code. They are build using a modular architecture, so they can be easily extended to incorporate different procedures and/or use different parameter values. This project is now hosted at: https://github.com/febo/myra
    Leader badge
    Downloads: 7 This Week
    Last Update:
    See Project
  • 23
    H2O-3

    H2O-3

    H2O is an Open Source, Distributed, Fast & Scalable Machine Learning

    H2O-3 is an open-source machine learning platform designed to build scalable and distributed machine learning models across large datasets. The system operates as an in-memory computing platform that allows data scientists to train models quickly using distributed resources. It supports many machine learning algorithms including generalized linear models, gradient boosting machines, deep learning networks, and ensemble techniques. The platform provides interfaces for multiple programming languages such as Python, R, Java, and Scala, making it accessible to a wide range of developers and data scientists. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    RDRPOSTagger

    A Rule-based Part-of-Speech and Morphological Tagging Toolkit

    RDRPOSTagger is a robust, easy-to-use and language-independent rule-based toolkit for Part-of-Speech (POS) and morphological tagging. RDRPOSTagger obtains fast performance in both learning and tagging process. RDRPOSTagger also achieves a very competitive accuracy in comparison to the state-of-the-art results. RDRPOSTagger now supports pre-trained POS and morphological tagging models for Bulgarian, Czech, Dutch, English, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, Thai...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    JCLTP

    A Java Class Library for Text Processing

    JCLTP is a class library designed for processing text. JCLTP is free, open source and developed with the Java programming language. JCLTP is distributed under the GNU license. It incorporates several technologies that enable process information while applying AI techniques, in order to build predictive models for text classification. Through a flexible structure of interfaces and classes, the opportunity to extend, adapt and add functionality JCLTP is provided. Thus, analysis of new types...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
MongoDB Logo MongoDB