Search Results for "data processing" - Page 2

Showing 397 open source projects for "data processing"

View related business solutions
  • SoftCo: Enterprise Invoice and P2P Automation Software Icon
    SoftCo: Enterprise Invoice and P2P Automation Software

    For companies that process over 20,000 invoices per year

    SoftCo Accounts Payable Automation processes all PO and non-PO supplier invoices electronically from capture and matching through to invoice approval and query management. SoftCoAP delivers unparalleled touchless automation by embedding AI across matching, coding, routing, and exception handling to minimize the number of supplier invoices requiring manual intervention. The result is 89% processing savings, supported by a context-aware AI Assistant that helps users understand exceptions, answer questions, and take the right action faster.
    Learn More
  • The Most Powerful Software Platform for EHSQ and ESG Management Icon
    The Most Powerful Software Platform for EHSQ and ESG Management

    Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.

    Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
    Learn More
  • 1
    Planetiler

    Planetiler

    Flexible tool to build planet-scale vector tilesets

    Planetiler (pla·nuh·tai·lr, formerly named "Flatmap") is a tool that generates Vector Tiles from geographic data sources like OpenStreetMap. Planetiler aims to be fast and memory-efficient so that you can build a map of the world in a few hours on a single machine without any external tools or database. Vector tiles contain raw point, line, and polygon geometries that clients like MapLibre can use to render custom maps in the browser, native apps, or on a server. Planetiler packages tiles...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 2
    Google Cloud Dataflow Template Pipelines

    Google Cloud Dataflow Template Pipelines

    Cloud Dataflow Google-provided templates for solving data tasks

    DataflowTemplates is the source repository for Google-provided Dataflow templates that are intended to solve large-scale in-cloud data processing tasks without requiring users to build everything from scratch in a full development environment. The repository is centered on templated pipelines powered by Google Cloud Dataflow and Apache Beam, making it easier to run common integration and movement jobs such as data import, export, backup, restore, and bulk API operations. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    Gson

    Gson

    A Java serialization/deserialization library to convert Java Objects

    Gson is a Java library developed by Google that allows conversion between Java objects and JSON. It enables serialization and deserialization of Java classes to and from JSON format, handling complex and generic types, nulls, custom naming policies, and more. Gson is lightweight, easy to use, and does not require annotation-based configuration, making it a popular choice for JSON processing in Java applications.
    Downloads: 19 This Week
    Last Update:
    See Project
  • The full-stack observability platform that protects your dataLayer, tags and conversion data Icon
    The full-stack observability platform that protects your dataLayer, tags and conversion data

    Stop losing revenue to bad data today. and protect your marketing data with Code-Cube.io.

    Code-Cube.io detects issues instantly, alerts you in real time and helps you resolve them fast. No manual QA. No unreliable data. Just data you can trust and act on.
    Learn More
  • 5
    PolarDB-X

    PolarDB-X

    PolarDB-X is a cloud native distributed SQL Database

    PolarDB-X is a cloud-native distributed SQL database designed to handle high concurrency, massive storage, and complex querying scenarios. It features a shared-nothing architecture that decouples computing from storage, providing scalability and flexibility for various applications.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    ElasticJob

    ElasticJob

    Distributed scheduled job framework

    ElasticJob is a distributed scheduling solution consisting of two separate projects, ElasticJob-Lite and ElasticJob-Cloud. ElasticJob-Lite is a lightweight, decentralized solution that provides distributed task sharding services. ElasticJob-Cloud uses Mesos to manage and isolate resources. It uses a unified job API for each project. Developers only need code one time and can deploy at will. Support job sharding and high availability in distributed system. Scale out for throughput and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    ANTLR

    ANTLR

    Parser generator to read, process, or translate structured text

    ...The languages for Hive and Pig, the data warehouse and analysis systems for Hadoop, both use ANTLR. Lex Machina uses ANTLR for information extraction from legal texts. Oracle uses ANTLR within SQL Developer IDE and their migration tools. NetBeans IDE parses C++ with ANTLR. The HQL language in the Hibernate object-relational mapping framework is built with ANTLR.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    CDC Connectors for Apache Flink

    CDC Connectors for Apache Flink

    CDC Connectors for Apache Flink

    This project provides a set of source connectors for Apache Flink® directly ingesting changes coming from different databases using Change Data Capture(CDC). CDC Connectors for Apache Flink® is a set of source connectors for Apache Flink®, ingesting changes from different databases using change data capture (CDC). CDC Connectors for Apache Flink® integrates Debezium as the engine to capture data changes. So it can fully leverage the ability of Debezium. Supports reading database snapshot and continues to read transaction logs with exactly-once processing even failures happen. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Netflix Maestro

    Netflix Maestro

    Netflix’s Workflow Orchestrator

    Maestro is a large-scale workflow orchestration platform originally developed by Netflix to coordinate complex data processing and machine learning workflows across distributed systems. The system acts as a general-purpose workflow orchestrator that manages the execution, scheduling, monitoring, and recovery of large pipelines used for analytics and AI operations. It was designed to support the demanding internal infrastructure of Netflix, where thousands of workflows must process massive volumes of data reliably and efficiently every day. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Data management solutions for confident marketing Icon
    Data management solutions for confident marketing

    For companies wanting a complete Data Management solution that is native to Salesforce

    Verify, deduplicate, manipulate, and assign records automatically to keep your CRM data accurate, complete, and ready for business.
    Learn More
  • 10
    The Ballerina programming language

    The Ballerina programming language

    The Ballerina Programming Language

    ...Network primitives in the language make it simpler to write services and run them in the cloud. Structural types with support for openness are used both for static typing within a program and for describing service interfaces. Type-safe, declarative processing of JSON, XML, and tabular data with language-integrated queries. Explicit error handling, static types, and concurrency safety, combined with a familiar, readable syntax make programs reliable and maintainable. Easy and efficient concurrency with sequence diagrams and language-managed threads without the complexity of asynchronous functions. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    AstronRPA

    AstronRPA

    Agent-ready RPA suite with visual workflow automation tools engine

    ...It enables automation of common desktop software and browser-based tasks, making it suitable for repetitive business operations and system integrations. Astron RPA includes a large library of reusable components that handle tasks such as user interface operations, data processing, and system interactions, allowing workflows to be assembled from modular building blocks. Astron RPA also integrates with intelligent agent systems so that automated processes and AI-driven workflows can work together in broader automation scenarios.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    JobRunr

    JobRunr

    An extremely easy way to perform background processing in Java

    Incredibly easy way to perform fire-and-forget, delayed, scheduled and recurring jobs inside Java applications using only Java 8 lambda's. CPU and I/O intensive, long-running and short-running jobs are supported. Persistent storage is done via either RDBMS (e.g. Postgres, MariaDB/MySQL, Oracle, SQL Server, DB2 and SQLite) or NoSQL (ElasticSearch, MongoDB and Redis). JobRunr provides a unified programming model to handle background tasks in a reliable way and runs them on shared hosting,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Erupt Framework

    Erupt Framework

    Pure Java annotation, single class file, rapid development

    Pure Java annotation, single class file, rapid development of Admin management background. Does not generate any code, zero front-end code, zero CURD, automatic table creation, annotated API, custom service logic, supports all mainstream databases, supports custom pages, supports multiple data sources, provides more than 20 types of business components, more than a dozen It supports logical deletion, dynamic timing tasks, front-end and back-end separation, etc. Core technologies: Spring...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 14
    RuoYi AI

    RuoYi AI

    Enterprise AI platform for building, deploying, and managing apps

    RuoYi AI is a full-stack enterprise-oriented AI development platform designed to help developers rapidly build, deploy, and manage intelligent applications using modern large language models and AI ecosystems. It provides a unified framework for integrating multiple AI models from different providers, allowing teams to switch or combine models through a consistent interface without vendor lock-in. RuoYi AI includes built-in support for retrieval-augmented generation, enabling organizations...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    goodsKill

    goodsKill

    Microservices-based flash sale system for high-concurrency testing

    ...It provides a unified API for testing high-concurrency seckill scenarios while demonstrating modern backend architecture patterns. goodsKill integrates key components such as Nacos for service discovery and configuration, Redis and MongoDB for data handling, and Kafka or RabbitMQ for messaging. It supports distributed transactions, database sharding, and multiple concurrency control strategies including distributed locks and message queues. It includes modular services like gateway, authentication, order management, and seckill processing, along with a Vue 3-based admin UI. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Epoxy

    Epoxy

    Android library for building complex screens in a RecyclerView

    Epoxy is an Android library for building complex screens in a RecyclerView. Models are automatically generated from custom views or databinding layouts via annotation processing. These models are then used in an EpoxyController to declare what items to show in the RecyclerView. This abstracts the boilerplate of view holders, diffing items and binding payload changes, item types, item ids, span counts, and more, in order to simplify building screens with multiple view types. Additionally,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    HEALPix

    HEALPix

    Data Analysis, Simulations and Visualization on the Sphere

    Software for pixelization, hierarchical indexation, synthesis, analysis, and visualization of data on the sphere. Please acknowledge HEALPix by quoting the web page http://healpix.sourceforge.net (or https://healpix.sourceforge.io) and publication: K.M. Gorski et al., 2005, Ap.J., 622, p.759 Full software documentation available at https://healpix.sourceforge.io/documentation.php Wiki Pages: https://sourceforge.net/p/healpix/wiki/Home Exchanging Data with HEALPix (in FITS files):...
    Leader badge
    Downloads: 427 This Week
    Last Update:
    See Project
  • 18
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    TURGEN

    TURGEN

    Create your own Atari tapes

    With TURGEN, you can record your own tapes with software for Atari 8-bit computers, and transfer data from your PC or Mac to your Atari 8-bit computer using your data recorder and a cassette adapter. TURGEN converts digital images of Atari software to the original standard Atari tape records and also various "turbo" systems. You can create WAVE files, tape images, or send the signal directly to the sound card. Tools such as Wizard for files, Tape Side Creator, Dual Track Tape creator and Batch processing allow you to create tapes with convenience
    Leader badge
    Downloads: 25 This Week
    Last Update:
    See Project
  • 20
    ant4docbook

    ant4docbook

    ANT4DOCBOOK is an ANT task for DOCBOOK

    ANT4DOCBOOK is an ANT task for DOCBOOK, a semantic markup language for technical documentation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    mendelson OFTP2

    mendelson OFTP2

    Implementation of the OFTP2 protocol (RFC 5024)

    The mendelson OFTP2 is a JAVA-based open source OFTP2 solution. It contains a logging- and configuration UI and supports encryption, digital signatures, message compression, TLS, certificate exchange, message routing, mail notification
    Leader badge
    Downloads: 30 This Week
    Last Update:
    See Project
  • 22

    MTTK Open BI

    A user-friendly lightweight BI tool.

    ...Code repository: https://github.com/jamie-mttk/mttk-lowcode-designer Online demo available at http://139.129.210.30:8825 (First visit may be slow) Username: admin Password: 123456 Lightweight No data processing engine offered, all the data processing depends on the target Databse Server. Easy to use For example ,to build an echart need to provide many many configurtaions/options, such as title location, grid position,etc. To simplified the usage ,only the important options should be configured by user with default setting, other options are set by BI system automatically.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    xmlj

    XMLJ is a Java XML Editor and validator project.

    XMLJ is a Java XML Editor and validator project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    mendelson AS2

    mendelson AS2

    Implementation of the EDIINT AS2 protocol (RFC 4130)

    mendelson AS2 is the JAVA-based open source EDIINT AS2 solution. Monitoring- and configuration GUI, async & sync MDN, signatures, encryption, TLS, web interface, pluggable into any servlet container, partner manager, certificate manager, data compression, optional profiles: MA, FN. Localized to Spanish, Italian, Portuguese, German, English, French, Polish
    Leader badge
    Downloads: 157 This Week
    Last Update:
    See Project
  • 25
    PCSecrets

    PCSecrets

    Encrypt and manage secret text data

    PCSecrets is a PC application that holds secret text data - protected by a master password and strong encryption. Use it as a password manager or just somewhere to hold any text data securely in one place. It can hold a second, hidden set of secrets that is undetectable and plausibly deniable. The program is also a PC counterpart of the Secrets for Android app. It uses the same data structure and provides synchronization that allows easy transfer of secrets between the two. For those who...
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB