Showing 114 open source projects for "pentaho data integration"

View related business solutions
  • Field Service+ for MS Dynamics 365 & Salesforce Icon
    Field Service+ for MS Dynamics 365 & Salesforce

    Empower your field service with mobility and reliability

    Resco’s mobile solution streamlines your field service operations with offline work, fast data sync, and powerful tools for frontline workers, all natively integrated into Dynamics 365 and Salesforce.
    Learn More
  • Simplify Purchasing For Your Business Icon
    Simplify Purchasing For Your Business

    Manage what you buy and how you buy it with Order.co, so you have control over your time and money spent.

    Simplify every aspect of buying for your business in Order.co. From sourcing products to scaling purchasing across locations to automating your AP and approvals workstreams, Order.co is the platform of choice for growing businesses.
    Learn More
  • 1
    AWS Data Wrangler

    AWS Data Wrangler

    Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.

    An AWS Professional Service open-source python initiative that extends the power of Pandas library to AWS connecting DataFrames and AWS data-related services. Easy integration with Athena, Glue, Redshift, Timestream, OpenSearch, Neptune, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON, and EXCEL). Built on top of other open-source projects like Pandas, Apache Arrow and Boto3, it offers abstracted functions to execute usual ETL tasks like load/unload data from Data Lakes, Data Warehouses, and Databases. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    SymbolicNumericIntegration.jl

    SymbolicNumericIntegration.jl

    SymbolicNumericIntegration.jl: Symbolic-Numerics for Solving Integrals

    SymbolicNumericIntegration.jl is a hybrid symbolic/numerical integration package that works on the Julia Symbolics expressions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Seurat

    Seurat

    R toolkit for single cell genomics

    Seurat is a comprehensive R toolkit for single-cell genomics analysis, introduced by the Satija Lab at NYGC. It supports quality control, normalization, clustering, integration of multimodal data (e.g., scRNA‑seq, spatial, CITE‑seq), and visualization. Seurat v5 introduces scalable workflows and spatial transcriptomics support, commonly used in academic and industry research for single-cell studies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    MyBatis Mapper4

    MyBatis Mapper4

    Mybatis common mapper, easy to use

    This book starts with a simple MyBatis query to build a basic development environment for learning MyBatis. Through a comprehensive sample code and test, the basic usage of adding, deleting, modifying, and checking operations in the MyBatis XML mode and annotation mode is explained, and the application of dynamic SQL in different aspects and the best practice program in the use process are introduced. Provides a wealth of examples for MyBatis advanced mapping, stored procedures, and type...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Collect! is a highly configurable debt collection software Icon
    Collect! is a highly configurable debt collection software

    Everything that matters to debt collection, all in one solution.

    The flexible & scalable debt collection software built to automate your workflow. From startup to enterprise, we have the solution for you.
    Learn More
  • 5
    MuJoCo

    MuJoCo

    Multi-Joint dynamics with Contact. A general purpose physics simulator

    ...The engine provides a robust C API optimized for real-time computation, making it suitable for scientific research and advanced simulation environments. MuJoCo’s core architecture is performance-tuned and utilizes preallocated data structures created through an XML-based compiler. The platform includes built-in interactive visualization using OpenGL and a native graphical interface for analyzing and testing simulations. Additionally, it offers extensive utility functions for physics computation, Python bindings for developers, and a Unity plug-in to enable integration with game engines and visualization tools.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    Open X-Embodiment

    Open X-Embodiment

    Unified open dataset enabling cross-embodiment learning for robotics

    Open X-Embodiment is a large-scale collaborative initiative led by Google DeepMind to unify robotic learning datasets into a consistent and standardized format, simplifying access and usage across the robotics research community. Its primary goal is to make all available open-source robotic data interoperable by representing them using the RLDS (Reinforcement Learning Dataset Structure) episode format. This enables seamless integration for training, evaluation, and model development across diverse robotic tasks and embodiments. The dataset aggregates contributions from multiple open-source robotic projects, all harmonized under a single unified data schema. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    JeeSite

    JeeSite

    A basic enterprise information development platform

    ...JeeSite is also the best assistant for private work. JeeSite is a Java-based development platform built on the Spring Framework, with Spring MVC as the model view controller, MyBatis as the data access layer, Apache Shiro as the authority authorization layer, Ehcahe for caching common data, and Activit as the workflow engine. It is the best integration in the JavaEE world. JeeSite is mainly positioned in the field of enterprise informatization. It has built-in basic functions of enterprise informatization system and efficient code generation tools.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8

    Stanford Data Miner

    Tools for integration and analysis of heterogeneous immunological data

    An extensive description of this system is published in the Journal of Translational Medicine (http://www.translational-medicine.com/). In brief, the system consists of two main web applications, a data integration app and a data exploration app. The data integration app is a fully custom Java "Web 2.0" product called Sherpa. Sherpa uses Seam, a platform integrating Asynchronous JavaScript and XML (AJAX), JavaServer Faces (JSF), the Java Persistence API (JPA), and Enterprise Java Beans (EJB) 3.0. The data exploration app is an open source business intelligence product called JasperServer (version 3.7), customized through supported configuration changes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Shynet

    Shynet

    Modern, privacy-friendly, and detailed web analytics

    Modern, privacy-friendly, and detailed web analytics that works without cookies or JS. There are a lot of web analytics tools. Unfortunately, most of them come with the following caveats. They require handing all of your visitors' info to a third-party company They use cookies to track visitors across sessions, so you need to have those annoying cookie notices. They collect so much personal data that even the NSA is jealous. They are closed source and/or expensive, often with limited data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Field Sales+ for MS Dynamics 365 and Salesforce Icon
    Field Sales+ for MS Dynamics 365 and Salesforce

    Maximize your sales performance on the go.

    Bring Dynamics 365 and Salesforce wherever you go with Resco’s solution. With powerful offline features and reliable data syncing, your team can access CRM data on mobile devices anytime, anywhere. This saves time, cuts errors, and speeds up customer visits.
    Learn More
  • 10
    Mathematics Dataset

    Mathematics Dataset

    This dataset code generates mathematical question and answer pairs

    The Mathematics Dataset, developed by Google DeepMind, is a synthetic dataset designed to evaluate and train machine learning models on mathematical reasoning and symbolic manipulation. It generates question-and-answer pairs across a wide range of mathematical topics typically found in school-level curricula, testing a model’s ability to reason about algebra, arithmetic, calculus, probability, and more. Each question is programmatically generated with structured templates to ensure clear...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    QUCAS-Quantum-Cosmology-Integration

    QUCAS-Quantum-Cosmology-Integration

    NOTICE OF CONSOLIDATION & PARTNERSHIP PENDING As of April 2026, the 20

    ...Academic users at partner institutions are currently performing validation; all other commercial inquiries must contact the author License: Dual 🔬 Overview A complete computational framework for cosmological perturbation theory with first-principles quantum corrections. This package implements: Quantum-corrected Mukhanov-Sasaki equations with backreaction from quantum fields Full Boltzmann integration with quantum scattering terms Tensor perturbations (gravitational waves) with quantum sources Integration with CLASS/CAMB for validation Planck 2018 data validation with Bayesian evidence computation Production-ready pipeline for cosmological parameter constraints 🚀 Fe
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Claude Scientific Skills

    Claude Scientific Skills

    A set of ready to use Agent Skills for research, science, engineering

    Claude Scientific Skills is a large open source collection of ready-to-use scientific capabilities that extend AI coding agents into full research assistants. The project provides more than 170 curated skills covering domains such as genomics, drug discovery, medical imaging, physics, and advanced data analysis. Each skill bundles documentation, examples, and tool integrations so agents can reliably execute complex multi-step scientific workflows. The framework follows the open Agent Skills...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 13
    PyMC

    PyMC

    Bayesian Modeling and Probabilistic Programming in Python

    PyMC is a Python library for probabilistic programming focused on Bayesian statistical modeling and machine learning. Built on top of computational tools like Aesara and NumPy, PyMC allows users to define models using intuitive syntax and perform inference using MCMC, variational inference, and other advanced algorithms. It’s widely used in scientific research, data science, and decision modeling.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Vedo

    Vedo

    A python module for scientific analysis of 3D data

    A lightweight and powerful python module for scientific analysis and visualization of 3d objects. Inspired by the vpython manifesto "3D programming for ordinary mortals", vedo makes it easy to work with 3D pointclouds, meshes and volumes, in just a few lines of code, even for less experienced programmers. vedo is based on VTK and numpy, with no other dependencies. Import meshes from VTK format, STL, Wavefront OBJ, 3DS, Dolfin-XML, Neutral, GMSH, OFF, PCD (PointCloud). Export meshes as ASCII...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    elasticsearc-php

    elasticsearc-php

    PHP low-level client for Elasticsearch

    Introducing Elasticsearch DSL library to provide objective query builder for Elasticsearch bundle and elasticsearch-php client. You can easily build any Elasticsearch query and transform it to an array. This agnostic package is a lightweight wrapper on top of the Elasticsearch PHP client. Its main goal is to allow for easier structuring of queries and indices in your application. It does not want to hide or replace the functionality of the Elasticsearch PHP client. Feature complete, object...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    FairChem

    FairChem

    FAIR Chemistry's library of machine learning methods for chemistry

    FAIRChem is a unified library for machine learning in chemistry and materials, consolidating data, pretrained models, demos, and application code into a single, versioned toolkit. Version 2 modernizes the stack with a cleaner core package and breaking changes relative to V1, focusing on simpler installs and a stable API surface for production and research. The centerpiece models (e.g., UMA variants) plug directly into the ASE ecosystem via a FAIRChem calculator, so users can run relaxations,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    dxf2gcode

    dxf2gcode

    DXF2GCODE: converting 2D dxf drawings to CNC machine compatible G-Code

    DXF2GCODE is a tool for converting 2D (dxf, pdf, ps) drawings to CNC machine compatible GCode. Windows, Linux, and Mac support by using python scripting language.
    Leader badge
    Downloads: 335 This Week
    Last Update:
    See Project
  • 18
    Eleana

    Eleana

    Program for the analysis of spectroscopic data, in particular EPR(ESR)

    Eleana is a program for manipulating, modifying and analyzing various spectroscopic data, in particular electron paramagnetic resonance spectra. It allows you to import data from Bruker spectrometers such as Elexsys or EMX and data from any ASCII file or imported from the clipboard. The program is under constant development, but already has many functions implemented for spectral analysis. Some of the basic functions include: baseline corrections, filtering, spectral clipping, integration / differentiation, pseudomodulation, filtering, etc. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    VirtualSatellite

    VirtualSatellite

    Virtual Satellite

    ...This project contains the deployed artefacts from the Virtual Satellite Eco-System. Donwnloads for Virtual Satellite 4 Core are also provided. The artefacts contain p2 builds for: - Daily development builds - Regular integration builds - Specific release builds
    Leader badge
    Downloads: 139 This Week
    Last Update:
    See Project
  • 20

    Biological Pathway Exchange Language

    A Data Exchange Format for Biological Pathway Information

    BioPAX is a standard language for integration, exchange, visualization and analysis of biological pathway data. BioPAX supports data exchange between pathway data groups and thus reduces the complexity of interchange between data formats by providing an accepted standard format for pathway data. By offering a standard, with well-defined semantics for pathway representation, BioPAX allows pathway databases and software to interact more efficiently. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21

    42

    Simulation for spacecraft attitude control system analysis and design

    42 is a comprehensive general-purpose simulation of spacecraft attitude and orbit dynamics. Its primary purpose is to support design and validation of attitude control systems, from concept studies through integration and test. 42 accurately models multi-body spacecraft attitude dynamics (with rigid and/or flexible bodies), and both two-body and three-body orbital flight regimes, modelling environments from low Earth orbit to throughout the solar system. 42 simulates multiple spacecraft...
    Leader badge
    Downloads: 24 This Week
    Last Update:
    See Project
  • 22

    GOBU

    Gene Browsing Utility

    Gene Ontology Browsing Utility (hereinafter, GOBU, pronounced as "go-boo") is a browsing and manipulating tool for Post-Pipeline genetic data, especially useful for data with GO annotations. With help of GOBU, we expect biologists and their cooperating technicians can pay less effort on common programming stuff. At the same time, GOBU provides an Extendable Architecture which guarantees the possibility of 1. Arbitrary Data Description, 2. Data Integration, and 3. Extendable User Interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    mzMatch is a Java collection of small commandline tools specific for metabolomics MS data analysis. The tools are built on top of the PeakML core library, providing mass spectrometry specific functionality and access to the PeakML file format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    OpenClinic GA

    OpenClinic GA

    Open Source Integrated Hospital Information Management System

    OpenClinic GA is an open source integrated hospital information management system covering management of administrative, financial, clinical, lab, x-ray, pharmacy, meals distribution and other data. Extensive statistical and reporting capabilities. OpenClinic GA and OpenClinic GMAO are owned by Frank Verbeke, MD, PhD at Post-Factum BV (https://www.post-factum.be)
    Leader badge
    Downloads: 125 This Week
    Last Update:
    See Project
  • 25
    AMBIT:Chemical Structure DB&Web Service

    AMBIT:Chemical Structure DB&Web Service

    Chemical structures database & machine learning with web services API

    ...Command line applications also available. Integration with third party tools and databases.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB