Showing 537 open source projects for "pentaho data integration"

View related business solutions
  • Streamline Hiring with Skill Assessments Icon
    Streamline Hiring with Skill Assessments

    Say goodbye to hiring guesswork. Use Canditech’s job simulation tests to assess real-world skills and make data-driven decisions.

    Canditech offers innovative, cheat-proof skill assessments and job simulations to transform your hiring process. From technical skills to soft skills, we help you assess candidates on actual job performance. With over 500 customizable tests and powerful video interview features, you can evaluate real-world capabilities, streamline your hiring, and reduce biases. Whether you’re hiring for remote roles, mass hiring, or looking to expand your diversity pool, Canditech’s data-driven platform ensures the right candidates are chosen for the job every time.
    Get a Free Demo
  • DataHub is the leading open-source data catalog helping teams discover, understand, and govern their data assets. Icon
    DataHub is the leading open-source data catalog helping teams discover, understand, and govern their data assets.

    Modern Data Catalog and Metadata Platform

    Built on an open source foundation with a thriving community of 13,000+ members, DataHub gives you unmatched flexibility to customize and extend without vendor lock-in. DataHub Cloud is a modern metadata platform with REST and GraphQL APIs that optimize performance for complex queries, essential for AI-ready data management and ML lifecycle support.
    Learn More
  • 1
    Azure Data Studio

    Azure Data Studio

    A data management tool that enables working with other SQL tools

    Azure Data Studio is a cross-platform database tool for data professionals who use on-premises and cloud data platforms on Windows, macOS, and Linux. Azure Data Studio offers a modern editor experience with IntelliSense, code snippets, source control integration, and an integrated terminal. It's engineered with the data platform user in mind, with the built-in charting of query result sets and customizable dashboards.
    Downloads: 471 This Week
    Last Update:
    See Project
  • 2
    Form-Data

    Form-Data

    A module to create readable `"multipart/form-data"` streams

    A library to create readable "multipart/form-data" streams. Can be used to submit forms and file uploads to other web applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Spring Data MongoDB

    Spring Data MongoDB

    Provide support to increase developer productivity in Java

    ...The Spring Data MongoDB project provides integration with the MongoDB document database. Key functional areas of Spring Data MongoDB are a POJO-centric model for interacting with a MongoDB Document and easily writing a repository-style data access layer. You do not need to build from source to use Spring Data. Binaries are available in repo.spring.io and accessible from Maven using the Maven configuration noted.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    nodejs-integration-tests-best-practices

    nodejs-integration-tests-best-practices

    Beyond the basics of Node.js testing

    Component/integration test is a hybrid between E2E and unit tets. It's gaining a lot of popularity and going by the testing diamond model it is considered as the default technique for the modern backend. Its main idea is testing an entire component (e.g., Microservice) as-is, through the API, with all the layers including the database but fake anything extraneous. This brings both high confidence and great developer experience. However, doing it right, fast, exhaustive and maximizing the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cloud-Based Software Licensing - Zentitle by Nalpeiron Icon
    Cloud-Based Software Licensing - Zentitle by Nalpeiron

    The #1 Software Licensing Solution. Release new Software License Models fast with no engineering. Increase software sales and drive up revenues.

    1000’s software companies have used Zentitle to launch new software products fast and control their entitlements easily - many going from startup to IPO on our platform. Our software monetization infrastructure allows you to easily build or
    Learn More
  • 5
    django-import-export

    django-import-export

    Django application and library for importing and exporting data

    django-import-export is a Django application and library for importing and exporting data with included admin integration. Support multiple formats (Excel, CSV, JSON, and everything else that tablib supports) Admin integration for importing. Preview import changes. Admin integration for exporting. Export data respecting admin filters. By default all records will be imported, even if no changes are detected. This can be changed setting the skip_unchanged option. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    FancyGrid

    FancyGrid

    JavaScript grid library with charts integration & server communication

    JavaScript grid library with charts integration and server communication. Massive number of features, Paging, Sorting, Filtering, Validation, Ajax data, Files data, RESTful, flexible CRUD, Integration with charts, Form module, Theming, Touch Support, and others. FancyGrid is integrated(data binding) with chart libraries, HighCharts, and sparklines. Intelligent modules system auto-detects and loads needed modules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Positron

    Positron

    Positron, a next-generation data science IDE

    ...The IDE supports notebook and script workflows, integration of data-app frameworks (such as Shiny, Streamlit, Dash), database and cloud connections, and built-in AI-assisted capabilities to help write code, explore data, and build models.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    RStudio

    RStudio

    RStudio is an integrated development environment (IDE) for R

    RStudio is a powerful, full-featured integrated development environment (IDE) tailored primarily for the R programming language but increasingly supportive of other languages like Python and Julia. It brings together console, editor, plotting, workspace, history, and file-management panes into a unified interface, helping data scientists, statisticians, and analysts to work more productively. The IDE is cross-platform: there are desktop versions for Windows, macOS and Linux, as well as a server version for remote or multi-user deployment via a web browser. In addition to code editing and execution, RStudio offers extensive support for reproducible research via R Markdown, notebooks, and integration with version control systems like Git and SVN. ...
    Downloads: 54 This Week
    Last Update:
    See Project
  • 9
    Dagster

    Dagster

    An orchestration platform for the development, production

    Dagster is an orchestration platform for the development, production, and observation of data assets. Dagster as a productivity platform: With Dagster, you can focus on running tasks, or you can identify the key assets you need to create using a declarative approach. Embrace CI/CD best practices from the get-go: build reusable components, spot data quality issues, and flag bugs early. Dagster as a robust orchestration engine: Put your pipelines into production with a robust...
    Downloads: 12 This Week
    Last Update:
    See Project
  • Complete Data Management for Nonprofits Icon
    Complete Data Management for Nonprofits

    Designed to fit with multi-level non-profit organization, across any sector

    NewOrg is a robust platform built with enhanced features to help non-profit organizations that capture and integrate the information from all of their operational areas to better manage volunteers, clients, programs, outcome reporting, activity sign-ups & scheduling, communications, surveys, fundraising activities and Development campaigns. NewOrg can truly deliver an intuitive product that will help manage your Committees, Donors, Events, and Memberships so that the organization runs efficiently.
    Learn More
  • 10
    Apache InLong

    Apache InLong

    Apache InLong - a one-stop integration framework for massive data

    Apache InLong is a one-stop integration framework for massive data that provides automatic, secure and reliable data transmission capabilities. InLong supports both batch and stream data processing at the same time, which offers great power to build data analysis, modeling and other real-time applications based on streaming data. InLong (应龙) is a divine beast in Chinese mythology who guides the river into the sea, and it is regarded as a metaphor of the InLong system for reporting data streams. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    APIJSON

    APIJSON

    Real-Time coding-free, powerful and secure ORM

    APIJSON is an open-source framework developed by Tencent that enables zero-code, real-time, and secure API development. It allows developers to perform CRUD operations through JSON-based requests without writing backend code, significantly accelerating development and reducing errors. APIJSON supports fine-grained access control, parameter validation, and seamless integration with various databases, making it a powerful tool for building scalable APIs.
    Downloads: 109 This Week
    Last Update:
    See Project
  • 12
    Nelmio Alice

    Nelmio Alice

    Expressive fixtures generator

    Nelmio Alice is a PHP library designed to generate complex data fixtures for testing and development environments. It uses YAML, XML, or PHP files to define fixture templates, making it easy to create realistic and varied data sets. Alice integrates well with Doctrine ORM, allowing developers to quickly populate databases with test data, making it especially useful for automated testing and staging environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    LaReview

    LaReview

    The code review workbench

    ...Instead of overwhelming developers with raw diffs or automated comment spam, the tool analyzes code changes and generates an intent-driven review plan that groups changes into logical flows such as authentication, API behavior, or data handling, and prioritizes them based on risk. It operates as a desktop application with CLI integration, allowing users to launch reviews directly from their terminal while keeping all processing local to ensure security and prevent data leakage. The system presents reviews as hierarchical task trees, enabling developers to work through changes step by step, attach notes, and track progress across different review concerns.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Kapacitor

    Kapacitor

    Open source framework for processing, monitoring, and alerting

    Open source framework for processing, monitoring, and alerting on time series data. Kapacitor is a real-time data processing engine for monitoring and alerting, specifically designed to work with time-series data from InfluxDB.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Matter AI

    Matter AI

    Matter AI is open-source AI Code Reviewer Agent

    Matter AI is an AI-powered platform designed to enhance productivity through automated content generation, data analysis, and decision support. It leverages machine learning models to process text, analyze patterns, and generate insights, making it suitable for businesses looking to optimize data-driven decision-making. Matter AI integrates with various data sources and provides customizable AI workflows tailored to different industries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Apache Avro

    Apache Avro

    Apache Avro is a data serialization system

    Apache Avro™ is a data serialization system. Simple integration with dynamic languages. Code generation is not required to read or write data files nor to use or implement RPC protocols. Code generation is an optional optimization, is only worth implementing for statically typed languages. Avro relies on schemas. When Avro data is read, the schema used when writing it is always present.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    WebP Codec

    WebP Codec

    Library to encode and decode images in WebP format

    ...The GitHub repository is a mirror; the canonical source of truth lives on Chromium’s git, and developer docs are hosted on WebP’s portal. The project underpins WebP support across browsers, imaging libraries, and many native apps thanks to its stable C API. Additional companion repos host test data and demos, including JavaScript builds and timing tests for various platforms. As a widely deployed codec, libwebp balances compression efficiency, speed, and integration simplicity across platforms.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 18
    tracetest

    tracetest

    Build integration and end-to-end tests in minutes

    Tracetest is a trace-based testing tool for integration and end-to-end testing using OpenTelemetry traces. Verify end-to-end transactions and side effects across microservices & event-driven apps by using trace data as test specs. Cypress and Selenium are constrained by using the browser for testing. Tracetest bypasses this entirely by using your existing OpenTelemetry instrumentation and trace data to run tests and assertions against traces in every step of a request transaction.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    TanStack Router

    TanStack Router

    Fully typesafe Router for React (and friends) w/ built-in caching

    TanStack Router is a modern, fully type-safe routing library designed for React and Solid applications, offering robust routing capabilities with first-class support for data loading, search parameters, nested layouts, and seamless integration with client-side caching libraries like TanStack Query. nd‑to‑end type safety (routes, params, loaders). Schema‑driven search params with validation. Built‑in caching, prefetching & invalidation. Nested layouts, transitions & error boundaries.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Apache Bigtop

    Apache Bigtop

    Bigtop is an Apache Foundation project for Infrastructure Engineers

    ...It also includes a set of integration tests and smoke tests to ensure compatibility and stability between ecosystem components. Developers and operators can use Bigtop to assemble customized Hadoop distributions tailored to their infrastructure and workloads. Its focus on reproducibility and packaging reduces friction in deploying large-scale data processing systems and ensures that different components of the Hadoop ecosystem work well together.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Coverlet

    Coverlet

    Cross platform code coverage for .NET

    Coverlet is a cross-platform code coverage framework for .NET, with support for line, branch and method coverage. It works with .NET Framework on Windows and .NET Core on all supported platforms. Coverlet documentation reflect the current repository state of the features, not the released ones. Check the changelog to understand if the documented feature you want to use has been officially released. Coverlet is integrated into the Visual Studio Test Platform as a data collector. Coverlet also...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    lxml

    lxml

    The lxml XML toolkit for Python

    A Python library for efficient XML and HTML processing, known for speed and compatibility. The lxml XML toolkit is a Pythonic binding for the C libraries libxml2 and libxslt. It is unique in that it combines the speed and XML feature completeness of these libraries with the simplicity of a native Python API, mostly compatible but superior to the well-known ElementTree API. The latest release works with all CPython versions from 3.6 to 3.12. See the introduction for more information about the...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 23
    LlamaIndexTS

    LlamaIndexTS

    Data framework for your LLM applications

    LlamaIndexTS is a data framework designed for Large Language Model (LLM) applications, focusing on server-side solutions to manage and process data efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    deck.gl

    deck.gl

    WebGL2 powered visualization framework

    deck.gl is designed to simplify high-performance, WebGL-based visualization of large data sets. Users can quickly get impressive visual results with minimal effort by composing existing layers, or leveraging deck.gl's extensible architecture to address customer needs. deck.gl maps data (usually an array of JSON objects) into a stack of visual layers - e.g. icons, polygons, texts; and look at them with views: e.g. map, first-person, orthographic. deck.gl handles a number of challenges out of...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 25
    Apollo Client

    Apollo Client

    A fully-featured, production ready caching GraphQL client

    Apollo Client is a comprehensive state management library for JavaScript that enables you to manage both local and remote data with GraphQL. Use it to fetch, cache, and modify application data, all while automatically updating your UI. Apollo Client helps you structure code in an economical, predictable, and declarative way that's consistent with modern development practices. The core Apollo Client library provides built-in integration with React, and the larger Apollo community maintains integrations for other popular view layers. ...
    Downloads: 25 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB