Showing 30 open source projects for "pentaho data integration"

View related business solutions
  • The Most Powerful Software Platform for EHSQ and ESG Management Icon
    The Most Powerful Software Platform for EHSQ and ESG Management

    Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.

    Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
    Learn More
  • Skillfully - The future of skills based hiring Icon
    Skillfully - The future of skills based hiring

    Realistic Workplace Simulations that Show Applicant Skills in Action

    Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.
    Learn More
  • 1
    AWS Data Wrangler

    AWS Data Wrangler

    Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.

    An AWS Professional Service open-source python initiative that extends the power of Pandas library to AWS connecting DataFrames and AWS data-related services. Easy integration with Athena, Glue, Redshift, Timestream, OpenSearch, Neptune, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON, and EXCEL). Built on top of other open-source projects like Pandas, Apache Arrow and Boto3, it offers abstracted functions to execute usual ETL tasks like load/unload data from Data Lakes, Data Warehouses, and Databases. ...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 2
    MuJoCo

    MuJoCo

    Multi-Joint dynamics with Contact. A general purpose physics simulator

    ...The engine provides a robust C API optimized for real-time computation, making it suitable for scientific research and advanced simulation environments. MuJoCo’s core architecture is performance-tuned and utilizes preallocated data structures created through an XML-based compiler. The platform includes built-in interactive visualization using OpenGL and a native graphical interface for analyzing and testing simulations. Additionally, it offers extensive utility functions for physics computation, Python bindings for developers, and a Unity plug-in to enable integration with game engines and visualization tools.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 3
    PyMC

    PyMC

    Bayesian Modeling and Probabilistic Programming in Python

    PyMC is a Python library for probabilistic programming focused on Bayesian statistical modeling and machine learning. Built on top of computational tools like Aesara and NumPy, PyMC allows users to define models using intuitive syntax and perform inference using MCMC, variational inference, and other advanced algorithms. It’s widely used in scientific research, data science, and decision modeling.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    Mathematics Dataset

    Mathematics Dataset

    This dataset code generates mathematical question and answer pairs

    The Mathematics Dataset, developed by Google DeepMind, is a synthetic dataset designed to evaluate and train machine learning models on mathematical reasoning and symbolic manipulation. It generates question-and-answer pairs across a wide range of mathematical topics typically found in school-level curricula, testing a model’s ability to reason about algebra, arithmetic, calculus, probability, and more. Each question is programmatically generated with structured templates to ensure clear...
    Downloads: 7 This Week
    Last Update:
    See Project
  • A privacy-first API that predicts global consumer preferences Icon
    A privacy-first API that predicts global consumer preferences

    Qloo AI adds value to a wide range of Fortune 500 companies in the media, technology, CPG, hospitality, and automotive sectors.

    Through our API, we provide contextualized personalization and insights based on a deep understanding of consumer behavior and more than 575 million people, places, and things.
    Learn More
  • 5
    Open X-Embodiment

    Open X-Embodiment

    Unified open dataset enabling cross-embodiment learning for robotics

    Open X-Embodiment is a large-scale collaborative initiative led by Google DeepMind to unify robotic learning datasets into a consistent and standardized format, simplifying access and usage across the robotics research community. Its primary goal is to make all available open-source robotic data interoperable by representing them using the RLDS (Reinforcement Learning Dataset Structure) episode format. This enables seamless integration for training, evaluation, and model development across diverse robotic tasks and embodiments. The dataset aggregates contributions from multiple open-source robotic projects, all harmonized under a single unified data schema. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Shynet

    Shynet

    Modern, privacy-friendly, and detailed web analytics

    Modern, privacy-friendly, and detailed web analytics that works without cookies or JS. There are a lot of web analytics tools. Unfortunately, most of them come with the following caveats. They require handing all of your visitors' info to a third-party company They use cookies to track visitors across sessions, so you need to have those annoying cookie notices. They collect so much personal data that even the NSA is jealous. They are closed source and/or expensive, often with limited data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Claude Scientific Skills

    Claude Scientific Skills

    A set of ready to use Agent Skills for research, science, engineering

    Claude Scientific Skills is a large open source collection of ready-to-use scientific capabilities that extend AI coding agents into full research assistants. The project provides more than 170 curated skills covering domains such as genomics, drug discovery, medical imaging, physics, and advanced data analysis. Each skill bundles documentation, examples, and tool integrations so agents can reliably execute complex multi-step scientific workflows. The framework follows the open Agent Skills...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 8
    Vedo

    Vedo

    A python module for scientific analysis of 3D data

    A lightweight and powerful python module for scientific analysis and visualization of 3d objects. Inspired by the vpython manifesto "3D programming for ordinary mortals", vedo makes it easy to work with 3D pointclouds, meshes and volumes, in just a few lines of code, even for less experienced programmers. vedo is based on VTK and numpy, with no other dependencies. Import meshes from VTK format, STL, Wavefront OBJ, 3DS, Dolfin-XML, Neutral, GMSH, OFF, PCD (PointCloud). Export meshes as ASCII...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    FairChem

    FairChem

    FAIR Chemistry's library of machine learning methods for chemistry

    FAIRChem is a unified library for machine learning in chemistry and materials, consolidating data, pretrained models, demos, and application code into a single, versioned toolkit. Version 2 modernizes the stack with a cleaner core package and breaking changes relative to V1, focusing on simpler installs and a stable API surface for production and research. The centerpiece models (e.g., UMA variants) plug directly into the ASE ecosystem via a FAIRChem calculator, so users can run relaxations,...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Top Corporate LMS for Training | Best Learning Management Software Icon
    Top Corporate LMS for Training | Best Learning Management Software

    Deliver and Track Online Training and Stay Compliant - with Axis LMS!

    Axis LMS enables you to deliver online and virtual learning and training through a scalable, easy-to-use LMS that is designed to enhance your training, automate your workflows, engage your learners and keep you compliant.
    Learn More
  • 10
    OdooPLM

    OdooPLM

    A Full PLM system based on odoo

    OdooPLM is an Open Source Odoo Application that allows you to manage PLM Data inside Odoo. You can upload data from the most common commercial CAD systems directly to Odoo with a unique and easy to use interface. Our approach, very quick, ready to use, is very useful for all companies that wants useful and powerful tools. You can install PLM module and its Client without any cost, touching the quality of our product in your real work environment. Official OdooPLM...
    Leader badge
    Downloads: 21 This Week
    Last Update:
    See Project
  • 11
    dxf2gcode

    dxf2gcode

    DXF2GCODE: converting 2D dxf drawings to CNC machine compatible G-Code

    DXF2GCODE is a tool for converting 2D (dxf, pdf, ps) drawings to CNC machine compatible GCode. Windows, Linux, and Mac support by using python scripting language.
    Leader badge
    Downloads: 358 This Week
    Last Update:
    See Project
  • 12

    rx2fits

    HF/VHF spectrosopy code for the rx888mk2 direct-sampling receiver

    ...Sample rate, Fourier bin count, PFB frame count, integration time and receiver gain parameters can be set. Utilities for device enumeration, firmware upload, display of FITS header, file viewing, and for measuring sample rate and sample statistics are provided. The device interface is Ruslan Migirow's librx888 and dropped samples at 130 MHz are negligible. Runs on linux computers under .Net.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    CAIRO for AERMOD

    CAIRO for AERMOD

    AERMOD, visualisation, input, modelling and compiling tool

    CAIRO for AERMOD v1.1 by MSc Dominik Subotić Simplified training software Avaliable: www.sourceforge.net/projects/cairo-for-aermod/ QGIS plugin: CAIROforAERMOD (Coming 2025.) Tutorial: https://www.youtube.com/watch?v=DZnsJuu1zLc AERMAP, AERMOD and AERPLOT analysis tool and input file compiler. Features: Automatic input by copying coordinates (Google Maps or text) and automatic conversion to UTM. Sources are automatically visualised in Google Earth. Input is done through user...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    The Related Values Processing Framework helps the integration of Process Control Data Historian Systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    pyntcloud

    pyntcloud

    pyntcloud is a Python library for working with 3D point clouds

    This page will introduce the general concept of point clouds and illustrate the capabilities of pyntcloud as a point cloud processing tool. Point clouds are one of the most relevant entities for representing three dimensional data these days, along with polygonal meshes (which are just a special case of point clouds with connectivity graph attached). In its simplest form, a point cloud is a set of points in a cartesian coordinate system. Accurate 3D point clouds can nowadays be (easily and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    MiModD

    MiModD

    Mutation Identification in Model Organism Genomes using Desktop PCs

    MiModD is a software package for genomic variant identification from next-generation sequencing (NGS) data with optimized usage of system resources and a user-friendly interface. For most model organism genomes it lets the user carry out a complete analysis from unaligned genomic NGS read data to an annotated list of variants on a regular Desktop PC within a few hours. Its user-interface is beginner-friendly and designed to encourage geneticists to analyze NGS data themselves without the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Earth Enterprise

    Earth Enterprise

    Google Earth Enterprise - Open Source

    Earth Enterprise is the open source version of Google Earth Enterprise (GEE), a powerful geospatial application suite that enables organizations to build and host custom 3D globes and 2D maps using their own imagery and data. Unlike Google Maps or Google Earth, Earth Enterprise does not include Google’s proprietary imagery but instead provides the tools needed to manage and visualize private geospatial datasets. The system is composed of three main components: Fusion, which processes and...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 18
    QtiPlot
    QtiPlot is a user-friendly, platform independent data analysis and visualization application similar to the non-free Windows program Origin.
    Downloads: 88 This Week
    Last Update:
    See Project
  • 19
    Olex2 is visualisation software for small-molecule crystallography developed at Durham University/EPSRC. It provides comprehensive tools for crystallographic model manipulation for the end user and an extensible development framework for programmers. The project has been supported by Olexsys Ltd since 2010.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    plot.py

    plot.py

    direct data plotting and evaluation

    The data treatment includes non-linear fitting, integration and differentiation, peak-finder and more. User python code can be executed in the integrated IPython console.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    PPLine

    SNP calling, annotation and gene/transcripts expression quantification

    PPLine is a Python-based suite aimed to process raw RNA-seq or Exome-seq data. PPLine provides: - read mapping (STAR/Tophat2/bowtie/bowtie2), including novel splice junsctions discovery - gene and transcript expression estimation (HTSeq-count/Cufflinks) - SNP calling with BQSR and indel realignment (samtools/GATK) - variant annotation (Annovar) - novel transcripts discovery (Cufflinks) - predicting proteotypic peptides and creating ref/alt proteins fasta-database - integration of the results
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Collaborative Computing Project for NMR

    Collaborative Computing Project for NMR (CCPN)

    The Collaborative Computational Project for NMR (CCPN) provides tools and knowledge to maximise the impact of the biological NMR studies. The CCPN software facilitates data analysis and software integration. The project actively promotes the exchange of knowledge and provides training and best practices for the NMR community. CCPN also has a leading role in the development of a NMR data-sharing standard and coordination of NMR instrumentation proposals for RCUK and BIS. The 28 partners of CCPN jointly cover all aspects of biomolecular NMR and together they promote excellence in science in their respective fields.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    MuPIF

    MuPIF

    Multi-Physics Integration Framework (MuPIF)

    Multi-Physics Integration Framework (MuPIF) is an integration framework, that will facilitate the implementation of multi-physic and multi-level simulations, built from independently developed components. The principal role of the framework is to steer individual components (applications) and to provide high-level data-exchange services. Each application should implement an interface that allows to steer application and execute data requests.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    syphon is an extension for mod_openopc, to monitor telnet devices (serial devices) that sit on Serial -to- Ethernet bridges (such as barcode scanners) and export parsed data to an OPC compliant device (such as a PLC).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    GEPETTO - Gene Prioritization in Java

    GEPETTO - Gene Prioritization in Java

    GEPETTO (GEne Prioritization ExTended TOol)

    GEPETTO (GEne PrioriTization ExTended TOol) is an original open-source framework, distributed under the LGPL license, for gene selection and prioritization on a desktop computer that ensures confidentiality of personal data. It takes advantage of the data integration capabilities in the SM2PH-Central Framework(KD4v,MSV3d,BIRD,..), combined with in-house developed gene prioritization methods. It currently incorporates six prioritization modules, based on gene sequence, protein-protein interactions, gene expression, disease-causing probabilities, genomic context). ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB