29 projects for "data processing" with 2 filters applied:

  • Easy-to-Use Website Accessibility Widget Icon
    Easy-to-Use Website Accessibility Widget

    An accessibility solution for quick website accessibility improvement.

    All in One Accessibility is an AI based accessibility tool that helps organizations to enhance the accessibility and usability of websites quickly.
    Learn More
  • Quality and compliance software for growing life science companies Icon
    Quality and compliance software for growing life science companies

    Unite quality management, product lifecycle, and compliance intelligence to stay continuously audit-ready and accelerate market entry

    Automate gap analysis across FDA, ISO 13485, MDR, and 28+ regulatory standards. Cross-map evidence once, reuse across submissions. Get real-time risk alerts and board-ready dashboards, so you can expand into new markets with confidence
    Learn More
  • 1
    Data Crow

    Data Crow

    The ultimate cataloguer

    Data Crow allows you to use the standard movie & video (divx, xvid, DVD, Blu-ray, etc), book (and eBooks), images, board games, comic books, games & software, music (mp3 and other music files) cataloguing modules. Besides these modules, which you can change to fit your requirements, you can create new modules (want to catalogue your stamps, equipment, or anything else?). The GUI is skinnable. Reporting (using JasperReports and their community edition JasperSoft Developer Studio ), loan...
    Leader badge
    Downloads: 243 This Week
    Last Update:
    See Project
  • 2
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file,...
    Leader badge
    Downloads: 485 This Week
    Last Update:
    See Project
  • 3

    dispy

    Distributed and Parallel Computing with/for Python.

    dispy is a generic and comprehensive, yet easy to use framework for creating and using compute clusters to execute computations in parallel across multiple processors in a single machine (SMP), among many machines in a cluster, grid or cloud. dispy is well suited for data parallel (SIMD) paradigm where a computation (Python function or standalone program) is evaluated with different (large) datasets independently. dispy supports public / private / hybrid cloud computing, fog / edge computing.
    Leader badge
    Downloads: 49 This Week
    Last Update:
    See Project
  • 4
    wasmboy

    wasmboy

    Game Boy / Game Boy Color Emulator Library

    wasmboy is a Game Boy and Game Boy Color emulator built using WebAssembly and JavaScript, designed to run efficiently in both browsers and Node environments. It leverages modern web technologies such as HTML5 canvas and the Web Audio API to deliver graphics and sound directly within a web interface. The project emphasizes portability and integration, allowing it to be embedded into other applications as a reusable dependency. It supports a wide range of emulator features including save...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ditto Edge Server is a lightweight standalone server for resource-constrained edge environments, based on the core Ditto Edge SDK. Icon
    Ditto Edge Server is a lightweight standalone server for resource-constrained edge environments, based on the core Ditto Edge SDK.

    With Ditto Edge Server, you can join devices as small as a Raspberry Pi to a local mesh network and synchronize data across edge environments.

    Ditto's Edge SDK is the only thing your edge devices need to ensure your application is operational in any environment, regardless of network conditions.
    Learn More
  • 5
    A collection of small utilities for: data extraction (text or binary files), data buffering, message queue control, column addition, date/time manipulation, and data recovery testing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Albedo

    Albedo

    A recommender system for discovering GitHub repos

    ...It treats repositories and developers as a graph of interactions and applies large-scale matrix factorization to model affinities, with Apache Spark providing the distributed data processing. The project focuses on implicit feedback—stars, watches, and other engagement metrics—so it can build useful recommendations without explicit ratings. A reproducible setup and Makefile-driven workflow streamline tasks like spinning up services, loading datasets, training models, and generating candidate lists. Because it’s built around Spark’s scalable primitives, Albedo can experiment on substantial snapshots of GitHub metadata rather than toy corpora. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    mod_psldap

    Apache LDAP Directory Manager

    mod_psldap is an Apache module for leveraging LDAP services built on the OpenLDAP library and the Apache APIs, to include web based A&A, web based updates to the LDAP store, server-side XSLT processing, and session management across servers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    VTD-XML is the next generation XML parser/indexer/editor/slicer/assembler/xpath-engine that goes beyond DOM, SAX and PULL in performance, memory usage, and ease of use.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9

    LogDruid

    Generate charts and reports using data gathered in log files

    An application to gather, aggregate, chart and report information originating from any log files. It uses regular expressions that are constructed graphically and can be tested in the application against samples. Once configured for a specific type of log file set, the gathering and display of the chart for a new files set can be done in just one click. Contains a sample template to handle few log types: Java GC log, OpenDS access log, Apache access log
    Downloads: 0 This Week
    Last Update:
    See Project
  • Deliver trusted data with dbt Icon
    Deliver trusted data with dbt

    dbt Labs empowers data teams to build reliable, governed data pipelines—accelerating analytics and AI initiatives with speed and confidence.

    Data teams use dbt to codify business logic and make it accessible to the entire organization—for use in reporting, ML modeling, and operational workflows.
    Learn More
  • 10
    Sesame

    Sesame

    Java RDF Framework

    This project is no longer actively maintained. It is succeeded by the Eclipse RDF4J project, which can be found at GitHub and at http://www.rdf4j.org/. Sesame is a de-facto standard framework for processing RDF data. This includes parsing, scalable storage, reasoning and full SPARQL 1.1 query/update support. Sesame offers a fully modular toolkit and an easy-to-use Java API that can be connected to all leading RDF storage solutions.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 11
    ...This application has been designed to perform simple or timestamped copies of directories and files as well as unidirectional synchronization of file tree. Fisy is in command line, you can so start copying tasks at any time or plug them on an automated processing. Sine version 2.0, Fisy contains cipher features for encrypting data for storage in a remote service like the Cloud.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    clip5 text clipboard manager

    clip5 text clipboard manager

    Simple interface which can be used as a clipboard manager

    It's a Java application which can be used as a clipboard manager. Just click on any of the textfield/textarea to copy the the displyed text into the system clipboard, where it will be ready to be pasted in another application like an e-mail client, a word processor ... Data are saved in text files. Edit the .txt files in order to best suit your needs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    distributedPHP client

    distributedPHP client

    A simple script for distributed computing through PHP:

    distributedPHP client is a simple PHP script that can simultaneously activate/send data to as many web scripts as you want. You must open and configure the distributedPHP .php file prior to running it. ditributedPHP client supports activating scripts without data, sending the same data to all scripts, sending unique data to each script or sending user input to each script. Examples of use include: distributed math computation, encryption breaking, SETI@home/folding@home (well, if they made the projects in php..) distributed bruteforce attacks, ddos attacks, distributed processing, etc.. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Visual Disk Diet

    Helps you visualize the space used on your drives with colorful chart.

    Visual Disk Diet helps you visualize the space used on your drives with a colorful radial tree chart. Now you can see quickly which folders take so much space, and get rid of them (coming soon). Coded with java/processing. Feel free to use, modify or suggest ideas! It's pretty much a copy of Disk Space Fan but Open Source so it can be adapted to extended purposes (as browsing file system...) It's very fast on data drives (your data partition, usb drive...) It's NOT optimized for very large and complex drives such as big C: drive for now so it can take up to 10 minutes to scan it (and there is no feedback during this scan, so it can be frustrating).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    LogsGrep

    A grep-like utility for log files.

    LogsGrep is a unique, grep-like utility designed specifically to target log files containing multi-line entries. The primary target is Java log files (Log4J, common, ...), where it is very common to have multiline log entries (for example log entries with a stacktrace). It follows Unix philosophy, does only its primary job and expects its input to be generated by other more advanced tools (tail, cat, type, find...); There is no goal to be compatible with Unix grep. LogsGrep is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Minimalist rework of gzip/gunzip with minimal memory requirement (16KB zip / 2KB unzip) and addons : on-the-fly processing of partial input data (no need to bufferize all data), input from and/or output to memory... Demo/test program also provided
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    OpenGeoPoint is server for aggregation and processing of geolocation data. It build with parallel processing in mind. It could be used in standalone mode or as part of enterprize application. It catches data grom trackers and navigators(GPS/GLONASS).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Tools for archiving, compressing and processing XML data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Avenzoar Digital Pathology Tool
    Avenzoar is a one-year exploration of renal cell carcinoma morphology and its related single nucleotide polymorphisms (SNPa) as a method of automating diagnosis of cancer by using a computer-aided decision tree controlled by analytical statistics.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    Cairo (Complex Archive Ingest for Repository Objects) is a tool for processing digital archives prior to submitting them to archival storage for long-term preservation; among other features, this includes format identification and metadata extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The Simple, Scalable, Script-based Science Processor for Measurements (S4PM) is a system for highly automated processing of science data, scalable up to large processing systems and scalable down to small, special-purpose processing strings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Open Source Remote Sensing OSSIM project, pronounced "awesome", will leverage existing algorithms/tools/packages from the open source community in construction of the ultimate Remote Sensing/Image Processing/GIS package. osgPlanet extends OSSIM and Op
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    csvtoxml will convert parse csv comma separated value data into xml. a command line console utility that uses stdin and stdout pipe with more cat, pr, wget, zip, find -exec for added functionality. file stream term c c++ small fast parser unix win osx
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Collection of Java tools and packages related to server-side (web application-oriented) XML processing, such as Adaptive XML Inclusions system with demo. Tools for personalization of XML content, Java-XML data binding and others will come later.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    csvplus is a collection of Perl-based utilities for querying csv files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB