
Search Results for "data analytics"

Showing 37 open source projects for "data analytics"


1

cracking-the-data-science-interview

A Collection of Cheatsheets, Books, Questions, and Portfolio

...In addition to conceptual study materials, the project includes interview question banks and case study prompts that simulate real hiring scenarios. The resource is particularly useful for candidates preparing for technical interviews in data science, machine learning, or analytics roles.

Downloads: 0 This Week

Last Update: 2026-03-11
2

scikit-learn

Machine learning in Python

scikit-learn is an open source Python module for machine learning built on NumPy, SciPy and matplotlib. It offers simple and efficient tools for predictive data analysis and is reusable in various contexts.

Downloads: 15 This Week

Last Update: 2025-12-10
3

.NET for Apache Spark

A free, open-source, and cross-platform big data analytics framework

.NET for Apache Spark provides high-performance APIs for using Apache Spark from C# and F#. With these .NET APIs, you can access the most popular DataFrame and Spark SQL features of Apache Spark for working with structured data, and Spark Structured Streaming for working with streaming data. .NET for Apache Spark is compliant with .NET Standard, a formal specification of .NET APIs that are common across .NET implementations. This means you can use .NET for Apache Spark anywhere you write...

Downloads: 5 This Week

Last Update: 2026-02-13
4

Flyte

Build production-grade data and ML workflows, hassle-free

Flyte is an infinitely scalable and flexible workflow orchestration platform that seamlessly unifies data, ML, and analytics stacks. Don’t let friction between development and production slow down the deployment of new data/ML workflows and cause an increase in production bugs. Flyte enables rapid experimentation with production-grade software.

Downloads: 4 This Week

Last Update: 2026-04-03
5

Machine Learning and Data Science Apps

A curated list of applied machine learning and data science notebooks

...Most examples are written in Python and frequently use Jupyter notebooks to present practical implementations and experiments. The project encourages contributions from data scientists and domain experts who want to share applied analytics projects and techniques that address real business challenges.

Downloads: 0 This Week

Last Update: 2026-03-10
6

Pandas Profiling

Create HTML profiling reports from pandas DataFrame objects

pandas-profiling generates profile reports from a pandas DataFrame. The pandas df.describe() function is handy yet a little basic for exploratory data analysis. pandas-profiling extends the pandas DataFrame with df.profile_report(), which automatically generates a standardized univariate and multivariate report for data understanding. The report includes high-correlation warnings based on different correlation metrics (Spearman, Pearson, Kendall, Cramér’s V, Phik) and the most common categories (uppercase, lowercase,...

Downloads: 3 This Week

Last Update: 2026-01-13
7

dlib

Toolkit for making machine learning and data analysis applications

Dlib is a modern C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real-world problems. It is used in both industry and academia in a wide range of domains, including robotics, embedded devices, mobile phones, and large high-performance computing environments. Dlib's open source licensing allows you to use it in any application, free of charge. It has good unit test coverage: the ratio of unit test lines of code to library lines of code is...

Downloads: 16 This Week

Last Update: 2026-03-29
8

HeavyDB

HeavyDB (formerly MapD/OmniSciDB)

...Its architecture allows users to query datasets containing billions of rows in milliseconds without requiring traditional indexing, pre-aggregation, or sampling techniques. HeavyDB was originally developed as part of the OmniSci platform (formerly MapD) and is commonly used for large-scale analytics and geospatial data processing. The database compiles queries into optimized machine code that executes efficiently on GPU hardware, significantly accelerating analytical workloads. It supports hybrid deployment environments where queries can run on both CPU and GPU architectures depending on the available resources.

Downloads: 0 This Week

Last Update: 2026-03-11
9

Apache Hamilton

Helps data scientists define testable self-documenting dataflows

Apache Hamilton is an open-source Python framework designed to simplify the creation and management of dataflows used in analytics, machine learning pipelines, and data engineering workflows. The framework enables developers to define data transformations as simple Python functions, where each function represents a node in a dataflow graph and its parameters define dependencies on other nodes. Hamilton automatically analyzes these functions and constructs a directed acyclic graph representing the pipeline, allowing the system to execute transformations in the correct order. ...

Downloads: 8 This Week

Last Update: 2026-03-12
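
The Apache Hamilton description above (functions as nodes, parameters as dependencies, automatic ordering) can be illustrated with a short standard-library sketch. This is not Hamilton's API: the function names (`raw_spend`, `total_spend`, `average_spend`), the helpers `build_graph` and `execute`, and the `spend_input` input are all hypothetical, chosen only to show how a dependency graph might be inferred from function signatures and run in topological order.

```python
import inspect
from graphlib import TopologicalSorter

# Hypothetical transformations: each function name is a node, and each
# parameter names another node (or an external input) it depends on.
def raw_spend(spend_input):
    return [float(x) for x in spend_input]

def total_spend(raw_spend):
    return sum(raw_spend)

def average_spend(raw_spend, total_spend):
    return total_spend / len(raw_spend)

def build_graph(functions):
    """Map each node name to the set of parameter names it depends on."""
    return {f.__name__: set(inspect.signature(f).parameters) for f in functions}

def execute(functions, inputs):
    """Run every function once, in dependency order, and return all values."""
    by_name = {f.__name__: f for f in functions}
    graph = build_graph(functions)
    values = dict(inputs)
    # static_order() yields each node only after all of its dependencies.
    for node in TopologicalSorter(graph).static_order():
        if node in values:
            continue  # externally supplied input, nothing to compute
        func = by_name[node]
        kwargs = {p: values[p] for p in inspect.signature(func).parameters}
        values[node] = func(**kwargs)
    return values

# The order in which functions are listed does not matter.
results = execute(
    [average_spend, total_spend, raw_spend],
    {"spend_input": ["10", "20", "30"]},
)
print(results["total_spend"], results["average_spend"])
```

Because the graph is derived from the signatures, each function can be unit-tested in isolation by calling it directly with plain values, which is the testability benefit the description alludes to.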
10

TabPFN

Foundation Model for Tabular Data

...The system supports a variety of tabular machine learning tasks and is designed to handle structured datasets commonly found in spreadsheets, databases, and business analytics systems.

Downloads: 15 This Week

Last Update: 5 days ago
11

OpenMLDB

OpenMLDB is an open-source machine learning database

OpenMLDB is an open-source machine learning database that provides a feature platform for computing consistent features for training and inference, and is committed to solving data and feature challenges. It has been deployed in hundreds of real-world enterprise applications. It prioritizes feature engineering using SQL. Real-time features are essential for many machine learning applications, such as real-time personalized recommendations and risk analytics.

Downloads: 9 This Week

Last Update: 2025-02-21
12

Quantitative Trading System

A comprehensive quantitative trading system with AI-powered analysis

Quantitative Trading System is a comprehensive quantitative trading platform that integrates artificial intelligence, financial data analysis, and automated strategy execution within a unified software system. The project is designed to provide an end-to-end infrastructure for building and operating algorithmic trading strategies in financial markets. It includes tools for collecting and processing market data from multiple sources, performing statistical and machine learning analysis, and...

Downloads: 2 This Week

Last Update: 2026-03-12
13

Netflix Maestro

Netflix’s Workflow Orchestrator

Maestro is a large-scale workflow orchestration platform originally developed by Netflix to coordinate complex data processing and machine learning workflows across distributed systems. The system acts as a general-purpose workflow orchestrator that manages the execution, scheduling, monitoring, and recovery of large pipelines used for analytics and AI operations. It was designed to support the demanding internal infrastructure of Netflix, where thousands of workflows must process massive volumes of data reliably and efficiently every day. ...

Downloads: 2 This Week

Last Update: 5 days ago
14

NVIDIA FLARE

NVIDIA Federated Learning Application Runtime Environment

NVIDIA FLARE (NVIDIA Federated Learning Application Runtime Environment) is a domain-agnostic, open-source, extensible SDK that allows researchers and data scientists to adapt existing ML/DL workflows (PyTorch, TensorFlow, Scikit-learn, XGBoost, etc.) to a federated paradigm. It enables platform developers to build a secure, privacy-preserving offering for distributed multi-party collaboration. NVIDIA FLARE is built on a componentized architecture that allows you to take federated...

Downloads: 8 This Week

Last Update: 2026-03-20
15

Synapse Machine Learning

Simple and distributed Machine Learning

SynapseML (previously MMLSpark) is an open source library to simplify the creation of scalable machine learning pipelines. SynapseML builds on Apache Spark and SparkML to enable new kinds of machine learning, analytics, and model deployment workflows. SynapseML adds many deep learning and data science tools to the Spark ecosystem, including seamless integration of Spark Machine Learning pipelines with the Open Neural Network Exchange (ONNX), LightGBM, Cognitive Services, Vowpal Wabbit, and OpenCV. These tools enable powerful and highly scalable predictive and analytical models for a variety of data sources. ...

Downloads: 0 This Week

Last Update: 2026-04-04
16

IVY

The Unified Machine Learning Framework

...For example, an existing TensorFlow model, and some useful functions from both PyTorch and NumPy libraries. Choose any framework for writing your higher-level pipeline, including data loading, distributed training, analytics, logging, visualization etc. Choose any backend framework which should be used under the hood, for running this entire pipeline. Choose the most appropriate device or combination of devices for your needs. DeepMind releases an awesome model on GitHub, written in JAX. We'll use PerceiverIO as an example. ...

Downloads: 0 This Week

Last Update: 2025-06-16
17

Eventer

Rapid, unbiased, reproducible analysis of synaptic events

Eventer is a programme designed for the detection of spontaneous synaptic events measured by electrophysiology or imaging. The software combines deconvolution for detection, and variable length template matching approaches for screening out false positive events. Eventer also includes a machine learning-based approach allowing users to train a model to implement their ‘expert’ selection criteria across data sets without bias. Sharing models allows users to implement consistent analysis...

1 Review

Downloads: 14 This Week

Last Update: 2024-09-16
18

HPCC Systems

End-to-end big data in a massively scalable supercomputing platform.

HPCC Systems® (www.hpccsystems.com) from LexisNexis® Risk Solutions is a proven, open source solution for Big Data insights that can be implemented by businesses of all sizes. With HPCC Systems, developers can design applications with Big Data at their core, enabling businesses to better analyze and understand data at scale, improving business time to results and decisions. HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single, complete...

2 Reviews

Downloads: 74 This Week

Last Update: 2026-03-31
19

Uranie

Uranie is CEA's uncertainty analysis platform, based on ROOT

Uranie is a sensitivity and uncertainty analysis platform based on the ROOT framework (http://root.cern.ch). It is developed at CEA, the French Atomic Energy Commission (http://www.cea.fr). It provides various tools for data analysis, sampling, statistical modeling, optimisation, sensitivity analysis, uncertainty analysis, running code on high performance computers, etc. Thanks to ROOT, it is easily scriptable in CINT (C++-like syntax) and Python. It is...

Downloads: 4 This Week

Last Update: 2026-02-11
20

Mars Framework

Mars is a tensor-based unified framework for large-scale data

...Its architecture automatically divides large computational tasks into smaller chunks that can be executed across multiple nodes in a cluster, allowing complex analytics, machine learning workflows, and data transformations to run efficiently at scale. Mars is particularly useful for workloads that exceed the memory capacity of a single machine or require high levels of parallel processing.

Downloads: 7 This Week

Last Update: 2026-03-11
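
The chunking idea in the Mars description above can be sketched with the Python standard library. This is not Mars code and does not use its API: a local thread pool stands in for cluster nodes, and the helper names (`split_into_chunks`, `chunk_sum_of_squares`, `parallel_sum_of_squares`) are hypothetical. It only shows the general split, map, and reduce pattern that such frameworks automate.

```python
from concurrent.futures import ThreadPoolExecutor

def split_into_chunks(data, chunk_size):
    """Split a sequence into consecutive chunks of at most chunk_size items."""
    return [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]

def chunk_sum_of_squares(chunk):
    """Partial result for one chunk; this is the unit of work given to a worker."""
    return sum(x * x for x in chunk)

def parallel_sum_of_squares(data, chunk_size=1000, workers=4):
    """Compute sum(x^2) by mapping over chunks and reducing the partial results."""
    chunks = split_into_chunks(data, chunk_size)
    # A thread pool is an assumption used here as a stand-in for cluster nodes.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(chunk_sum_of_squares, chunks))
    return sum(partials)

data = list(range(10_000))
total = parallel_sum_of_squares(data, chunk_size=2_500)
print(total == sum(x * x for x in data))
```

In a real distributed setting each chunk would be sized to fit in one worker's memory, so the full dataset never has to be held by a single machine; only the small partial results are gathered for the final reduction.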
21

Byzer-lang

A low-code open-source programming language for data pipeline

Byzer (formerly MLSQL) is a low-code, open-source, distributed programming language for data pipelines, analytics, and AI in a cloud-native way. Design protocol: everything is a table. Byzer is a SQL-like language that simplifies data pipelines, analytics, and AI, combined with built-in algorithms and extensions. We believe that everything is a table, and that a simple and powerful SQL-like language can significantly reduce the human effort of data development without switching between different tools.

Downloads: 0 This Week

Last Update: 2024-08-13
22

EZStacking

EZStacking is a Jupyter notebook generator for machine learning

EZStacking is a Jupyter notebook generator for supervised learning problems using Scikit-Learn pipelines and stacked generalization. EZStacking handles classification and regression problems for structured data. It can also be viewed as a development tool, because a notebook generated with EZStacking contains: an exploratory data analysis (EDA) used to assess data quality; a modelling step producing a reduced-size stacked estimator; a server returning a prediction, a measure of the quality...

Downloads: 0 This Week

Last Update: 2022-06-30
23

Guia do Cientista de Dados das Galáxias

Repository for gathering information on study materials

Guia do Cientista de Dados das Galáxias is an open-source community repository that aggregates educational resources, tools, and references related to data science, machine learning, and analytics. The project was created by the Pizza de Dados community with the goal of organizing useful materials for people interested in learning or working in the data science ecosystem. The repository collects links to books, podcasts, tutorials, datasets, communities, and study groups that can help learners navigate the field of data science more efficiently. ...

Downloads: 0 This Week

Last Update: 2026-03-12
24

Weld

High-performance runtime for data analytics applications

...Weld is particularly useful for workloads involving large-scale data processing in frameworks such as NumPy, Spark, and TensorFlow. The language includes built-in constructs for expressing data-parallel operations, enabling efficient execution on modern hardware architectures. By combining operations from multiple libraries into a single optimized execution plan, Weld can significantly improve performance in analytics and machine learning pipelines.

Downloads: 0 This Week

Last Update: 2026-03-15
25

NLP Best Practices

Natural Language Processing Best Practices & Examples

In recent years, natural language processing (NLP) has seen quick growth in quality and usability, and this has helped to drive business adoption of artificial intelligence (AI) solutions. In the last few years, researchers have been applying newer deep learning methods to NLP. Data scientists started moving from traditional methods to state-of-the-art (SOTA) deep neural network (DNN) algorithms which use language models pretrained on large text corpora. This repository contains examples and...

Downloads: 0 This Week

Last Update: 2022-08-01

© 2026 Slashdot Media. All Rights Reserved.
